CN112861720A - Remote sensing image small sample target detection method based on prototype convolutional neural network - Google Patents

Remote sensing image small sample target detection method based on prototype convolutional neural network Download PDF

Info

Publication number
CN112861720A
CN112861720A CN202110172985.6A CN202110172985A CN112861720A CN 112861720 A CN112861720 A CN 112861720A CN 202110172985 A CN202110172985 A CN 202110172985A CN 112861720 A CN112861720 A CN 112861720A
Authority
CN
China
Prior art keywords
image
prototype
network
target detection
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110172985.6A
Other languages
Chinese (zh)
Other versions
CN112861720B (en
Inventor
程塨
施佩珍
闫博唯
姚西文
韩军伟
郭雷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northwestern Polytechnical University
Original Assignee
Northwestern Polytechnical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northwestern Polytechnical University filed Critical Northwestern Polytechnical University
Priority to CN202110172985.6A priority Critical patent/CN112861720B/en
Publication of CN112861720A publication Critical patent/CN112861720A/en
Application granted granted Critical
Publication of CN112861720B publication Critical patent/CN112861720B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/13Satellite images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/25Determination of region of interest [ROI] or a volume of interest [VOI]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Multimedia (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Astronomy & Astrophysics (AREA)
  • Remote Sensing (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Image Analysis (AREA)

Abstract

The invention provides a remote sensing image small sample target detection method based on a prototype convolutional neural network. A target detection network mainly comprising a feature extraction and class prototype acquisition module, a prototype guide RPN module, a redirection feature graph module and a detector module is constructed, basic learning of the network model is firstly carried out on a base class data set containing a large number of labeled samples, then fine tuning is carried out on the network model on a balanced subdata set, and finally multi-class target detection of the small-sample remote sensing image is realized through post-processing operations such as non-maximum suppression. The method can quickly and accurately detect different types of targets from the optical remote sensing image with the complex background by using a small amount of new-type labeled data, and has higher detection precision and higher detection speed.

Description

Remote sensing image small sample target detection method based on prototype convolutional neural network
Technical Field
The invention belongs to the technical field of remote sensing image processing, and particularly relates to a remote sensing image small sample target detection method based on a prototype convolutional neural network, which can be applied to remote sensing image target detection under the condition of small samples with few labeled sample data.
Background
With the continuous development of satellite remote sensing technology, the acquisition of massive remote sensing images becomes easier. However, the labeling work of the remote sensing image requires a lot of manpower and financial resources. In addition, some categories of targets are rare, and there is a problem that data is difficult to acquire. Therefore, how to use a small amount of labeled samples to realize target detection of remote sensing images becomes one of the problems to be solved urgently at present.
The existing small sample target detection methods can be broadly divided into three categories, namely target detection based on meta learning, target detection based on metric learning and target detection based on fine adjustment. The target detection based on the meta-learning is mainly to construct different tasks through each iteration, so that the network has strong generalization performance, and can adapt quickly when new tasks of new categories are encountered, thereby realizing the target detection task of a small sample. The target detection based on metric learning is mainly realized by learning a metric space, in the space, the closer the targets 'distances' of the same category are, the better, the farther the targets 'distances' of different categories are, the better, the distances are obtained by some metric ways, such as common Euclidean distance, cosine similarity, and the like. The fine-tuning-based target detection mainly comprises the steps of training an initial model in advance, and then constructing a balanced data set to fine tune parameters of the initial model, so that small-sample target detection is realized. However, the remote sensing image has the characteristics of large scale change, dense arrangement and the like, and is influenced by illumination, cloud layers, target forms and complex background environments, so that the remote sensing image and the natural scene image have larger difference. These problems present a significant challenge to the task of target detection of small samples of optically remotely sensed images. The existing small sample target detection method is difficult to be directly applied to remote sensing images.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a remote sensing image small sample target detection method based on a prototype convolutional neural network. Constructing a target detection network mainly comprising a feature extraction and classification prototype acquisition module, a prototype guide RPN module, a redirection feature map module and a detector module, wherein the feature extraction and classification prototype acquisition module respectively extracts the features of a support set image and a query set image by using a parameter-shared deep convolutional neural network, and performs global average pooling operation on the feature map of the support set image to acquire a classification prototype; the prototype-guided RPN module guides the process of the region of interest generated by the region-proposed network by using the acquired type prototype; the redirection feature map module adopts a channel attention mechanism to re-model the channel information of the foreground target candidate frame features; and the detection module detects the characteristics after the re-modeling by using the classification network and the regression network and outputs a detection frame containing the target prediction category and the target prediction position. The method comprises the steps of firstly carrying out network model basic learning on a base class data set, then finely adjusting a network model on a balanced subdata set, wherein the subdata set simultaneously comprises a base class and a new class sample, and finally carrying out post-processing operations such as non-maximum suppression and the like to realize multi-class target detection of the remote sensing image of the small sample. The method can quickly and accurately detect different types of targets from the optical remote sensing image with the complex background by using a small amount of new-type labeled data, and has higher detection precision and higher detection speed.
A remote sensing image small sample target detection method based on a prototype convolutional neural network is characterized by comprising the following steps:
step 1, preparing base class training data: training an image dataset D with a base classbaseThe target instance images of all the categories form a support set image, wherein the target instance image is an imageThe middle target example surrounds the image block in the frame; training an image dataset D from a base classbaseRandomly extracting the whole image containing any type and quantity of target examples as a query set image; then, preprocessing all the support set images and the query set images, wherein the preprocessed images serve as training data; the preprocessing operation comprises the steps of carrying out normalization processing on all images, adjusting the length and width of the images in the support set to be M multiplied by M, and adjusting the length and width of the images in the query set to be M '× M'; the value of M 'is 0.8 to 1.2 times of the size of the base class training image, and the value of M is 0.28 times of M'; the base class training image data set is a remote sensing image target detection DIOR data set;
step 2, constructing a target detection network: the target detection network mainly comprises a feature extraction and classification prototype acquisition module, a prototype guide RPN module, a redirection feature map module and a detector module;
the specific implementation process of the feature extraction and classification prototype acquisition module is as follows: firstly, inputting a support set image and a query set image, respectively extracting the features of the support set image and the query set image by using a feature extraction backbone network B to obtain corresponding feature maps, and recording the feature map of the support set image as { F }s,1,Fs,2,...,Fs,CF, inquiring a characteristic graph of the set imageqWherein, the feature extraction backbone network B adopts a convolution layer of the first four stages of the ResNet-101 network, Fs,iA feature map representing a support image of the ith category, i being 1,2, …, C being the number of categories; then, feature map { F) of support set images,1,Fs,2,...,Fs,CPerforming global average pooling operation to obtain prototype vector p of each category1,p2,...,pC},piA prototype vector representing the ith class, i ═ 1,2, …, C;
the prototype-guided RPN module is used for generating an interested area possibly containing a target, and the specific implementation process comprises the following steps: class prototype vector { p) obtained by feature extraction and class prototype acquisition module1,p2,...,pCThe input is a three-layer full-connection network, and a length and all RPN classifiers are outputRecombining the vectors with the same length after the convolution kernels are unfolded according to the shape which is the same as that of the convolution kernels of the RPN classifier to form a group of new convolution kernel parameters, taking the parameters as the parameters of the auxiliary classifier, respectively carrying out front-background scoring on the anchor points in the feature map of the query set image by using the auxiliary classifier and the RPN classifier, and adding the obtained two scores to serve as the foreground target score of the anchor points; then, determining the label of each anchor point according to the anchor point division rule of the RPN, wherein the label comprises a foreground type, a background type and a neglected sample; ranking the foreground target scores from high to low, and adjusting r anchor points with higher scores by using an RPN regressor to obtain r interested areas;
the redirection feature map module performs feature extraction on the region of interest obtained by the prototype-guided RPN module by using the RoI Align operation of the Mask R-CNN network to obtain a feature map { F) of the region of interest1,F2,...,FrIn which FiA feature map representing the ith region of interest, i being 1,2, …, r, r being the number of regions of interest; then, multiplying the category prototype vector by the feature map of the region of interest channel by channel to obtain a redirected feature map;
the detector module utilizes a second-stage detector of the Faster R-CNN network to detect the redirection characteristic diagram output by the redirection characteristic diagram module and outputs a detection frame containing the target prediction category and position; wherein, the classification loss of the detector module adopts a cross entropy loss function, and the regression loss of the detector module adopts a SmoothL1 loss function;
step 3, training a target detection network: inputting the preprocessed support set images and query set images obtained in the step 1 into the target detection network constructed in the step 2 for training to obtain the target detection network trained by the base class data set;
step 4, fine tuning training data preparation: first, an image dataset D is trained from the base classbaseRandomly extracting 3N labeled sample images of each category, and combining the labeled sample images with a new training image data set DnovelCombined to form a fine-tuning training image data set Dfew(ii) a The new classThe number of the labeled samples contained in the image data set is not more than 30, and N is the number of the labeled samples of each category;
then, the fine-tuning training image data set DfewSubstitute base class training image dataset DbaseExecuting the processing in the step 1 to obtain fine tuning training data;
step 5, fine adjustment of the target detection network is carried out: inputting the support set images and the query set images of the fine tuning training data obtained in the step 4 into the trained target detection network obtained in the step 3, and training the network again to obtain the target detection network trained by the fine tuning data set;
then, the data set D constructed in step 4 is usedfewInputting each type of labeled sample into the trained feature extraction backbone network B, obtaining feature representative vectors thereof through global average pooling operation, and calculating the average vector of all the feature representative vectors as a prototype vector of the type; so processed, each class gets a prototype vector, get CfewPersonal prototype vector, CfewAs a data set DfewThe number of image categories contained in (a);
step 6, target detection: inputting the preprocessed data to be detected as a query set image into the feature extraction backbone network B trained in the step 5 to obtain the image features of the query set, and inputting the C obtained in the step 5fewInputting the individual category prototype vector and the image features of the query set into a trained prototype guide RPN module, then obtaining a detection frame containing a target prediction category and a target prediction position through a trained redirection feature map module and a trained detector module, filtering out redundant detection frames by adopting a non-maximum suppression method, and obtaining a final target detection result of the data to be detected through the residual detection frames.
The invention has the beneficial effects that: due to the adoption of the feature extraction and class prototype acquisition module of the shared backbone network, the overfitting problem can be effectively relieved, the memory is saved, and the calculation speed is increased; due to the adoption of the prototype-guided RPN, the quality of the obtained region of interest is higher, and the detection of a subsequent detector is facilitated; the detection precision can be further improved due to the adoption of a processing mode of feature redirection; the base class training data and the fine tuning training data are respectively adopted to train the target detection network in sequence, so that the target detection network can realize a target detection task of a complex background remote sensing image under the condition that only a small number of new class label samples are contained, and the target detection network has high detection precision and good robustness.
Drawings
FIG. 1 is a flow chart of a method for detecting a small sample target of a remote sensing image based on a prototype convolutional neural network;
FIG. 2 is a schematic diagram of a two-stage training of the method of the present invention;
FIG. 3 is an exemplary graph of the test results of the method of the present invention.
Detailed Description
The present invention will be further described with reference to the following drawings and examples, which include, but are not limited to, the following examples.
The hardware environment for implementation is: intel (R) core (TM) i3-8100 CPU computer, 8.0GB memory, the running software environment is: ubuntu16.04.5lts and Pycharm 2018. The experiment used a large-scale public database of remote sensing images, DIOR Dataset, with 23463 images in total, of which 5862 were partitioned into the training set, 5863 were partitioned into the validation set, and the remaining 11738 were partitioned into the test set. Covering 20 common remote sensing image objects. Each category contains approximately 1200 images with an image size of 800 x 800 pixels and a spatial resolution ranging from 0.5 meters per pixel to 30 meters per pixel. In order to verify the effectiveness of the proposed solution, baseball fields, basketball fields, bridges, chimneys, ships, airplanes, airports, highway toll stations, docks, track and field sites, dams, golf courses, oil storage tanks, tennis courts and vehicles are used as the base classes, and highway service areas, overpasses, stadiums, train stations and windmills are used as the new classes. 11725 images of the training set and the verification set are used for training, images containing the new class in the 11725 images are removed according to the division of the base class and the new class, and the residual 8573 images are used as a base class training data set Dbase. The number of the labeled samples of the new class is 10 per class, and the labeled samples of the new class are connected with each class in the base classRandomly drawn 10 labeled samples jointly construct a small sample dataset Dfew. Finally, the remaining 11738 images were tested as a test set.
As shown in fig. 1, the method for detecting a small sample target of a remote sensing image based on a prototype convolutional neural network of the present invention is implemented as follows:
1. base class training data preparation
First, the image dataset D is trained with the base classbaseAnd on the basis, respectively constructing a support set image and a query set image. Wherein the query set images are directly from DbaseThe method is obtained by random extraction, namely, the whole image containing any type and number of target instances is randomly extracted to be used as a query set image. Support set image training image dataset by base class DbaseThe target instance images are image blocks in a target instance surrounding frame in the image, and the target instance can be obtained by extracting the target instance from the whole remote sensing image by utilizing a real label frame of the image.
Then, respectively preprocessing the support set image and the query set image, including: (1) using the mean value R of three channel components of RGB per imagemean、Gmean、BmeanAnd standard deviation Rstd、Gstd、BstdAnd normalizing the image:
Figure BDA0002939382510000051
wherein, Ip,cRepresents the c-channel component, I 'of the pre-normalized image'p,cRepresenting the c-channel component, Mean, of the normalized imagecMean, Std, representing the c channel of the imagecRepresenting the standard deviation of the c channel of the image.
(2) Adjusting the length and width of the images in the support set to M multiplied by M and the length and width of the images in the query set to M ' × M ', wherein the value of M ' is DbaseThe size of the original image is 0.8 to 1.2 times, and the value of M is 0.28 times of that of M'.
2. Building an object detection network
The target detection network mainly comprises a feature extraction and classification prototype acquisition module, a prototype guide RPN module, a redirection feature diagram module and a detector module.
(1) Feature extraction and category prototype acquisition module
The invention adopts the convolution layers of the first four stages of the ResNet-101 network as the feature extraction backbone network B. The ResNet-101 network structure is described in the documents "K.He, X.Zhang, S.Ren, and J.Sun," Deep residual learning for image recognition, "in Proceedings of the IEEE conference on computer vision and pattern recognition,2016, pp.770-778".
Firstly, inputting a support set image and a query set image, respectively extracting the features of the support set image and the query set image by using a feature extraction backbone network B to obtain corresponding feature maps, and recording the feature map of the support set image as { F }s,1,Fs,2,...,Fs,CF, inquiring a characteristic graph of the set imageqWherein F iss,iA feature map indicating the support image of the ith category, i being 1,2, …, C being the number of categories included in the support set image; then, feature map { F) of support set images,1,Fs,2,...,Fs,CPerforming global average pooling operation to obtain prototype vector p of each category1,p2,...,pC},piA prototype vector representing the ith class, i ═ 1,2, …, C.
(2) Prototype-guided RPN module
The prototype-guided RPN module is used for generating a region of interest possibly containing a target, and the specific implementation process is as follows: class prototype vector { p) obtained by feature extraction and class prototype acquisition module1,p2,...,pCInputting the result into a three-layer fully-connected network, outputting a vector with the same length as the unfolded convolution kernels of the RPN classifier, recombining the vector according to the same shape as the convolution kernels of the RPN classifier to form a group of new convolution kernel parameters, taking the parameters as the parameters of an auxiliary classifier, and respectively performing front-background printing on anchor points in a feature map of an image of the query set by using the auxiliary classifier and the RPN classifierDividing, and adding the obtained two fractions to serve as the foreground target score of the anchor point; then, determining the label of each anchor point according to the anchor point division rule of the RPN, wherein the label comprises a foreground type, a background type and a neglected sample; the foreground target scores are sorted from high to low, and r anchor points with higher scores are adjusted by using an RPN regressor to obtain r regions of interest, where r is 256 in this embodiment.
Among them, the RPN method is described in "S.ren, R.Girshick, R.Girshick, and J.Sun," Faster R-CNN: Towards read-Time Object Detection with Region pro-posal Networks, "IEEE Transactions on Pattern Analysis & Machine Analysis, vol.39, No.6, pp.1137-1149,2017".
(3) Redirection feature map module
The redirection feature map module firstly performs feature extraction on the region of interest obtained by the prototype-guided RPN module by using the RoI Align operation proposed by Kaiming He in the Mask R-CNN network proposed in 2017 to obtain a feature map { F) of the region of interest1,F2,...,FrIn which FiA feature map representing the ith region of interest, i being 1,2, …, r, r being the number of regions of interest; then, multiplying the class prototype vector and the feature map of the region of interest channel by channel to obtain a redirected feature map { F }1c1,F1c2,...,F1c,F2c1,F2c2,...,F2c,...,Frc1,Frc2...,FrcIn which Fi,jAnd the redirection feature map is used for performing channel-by-channel multiplication on the feature map representing the ith interested area and the prototype vector with the category j.
(4) Detector module
Detecting the redirection feature map output by the redirection feature map module by using a second-stage detector of the Faster R-CNN network, and outputting a detection frame containing the target prediction type and position; wherein, the classification loss of the detector module adopts a cross entropy loss function, and the regression loss of the detector module adopts a SmoothL1 loss function.
The Faster R-CNN is described in the documents "S.Ren, R.Girshick, R.Girshick, and J.Sun," Faster R-CNN: Towards read-Time Object Detection with Region pro-posal Networks, "IEEE Transactions on Pattern Analysis & Machine understanding, vol.39, No.6, pp.1137-1149,2017.7".
3. Training target detection network
And (3) inputting the preprocessed support set images and query set images obtained in the step (1) into the target detection network constructed in the step (2) for training to obtain the target detection network trained by the base class data set.
4. Fine-tuning training data preparation
Constructing a data set D simultaneously containing the base class labeled sample and the new class labeled samplefewNamely: training an image dataset D from a base classbaseRandomly extracting 3N labeled sample images of each category, and combining the labeled sample images with a new training image data set DnovelCombined to form a fine-tuning training image data set Dfew(ii) a The new-class image data set contains less labeled samples, the number of labeled samples does not exceed 30, N is the number of labeled samples of each class, and in the embodiment, N is 10.
Then, the fine-tuning training image data set DfewSubstitute base class training image dataset DbaseAnd (4) executing the processing in the step (1) to obtain fine tuning training data.
5. Fine tuning of target detection networks
And (4) the fine tuning training data obtained through the processing in the step (4) also comprises preprocessed support set images and query set images, the support set images and the query set images are respectively input into the target detection network trained on the base class data set obtained in the step (3), the network is trained again to perform network fine tuning, and the target detection network trained through the fine tuning data set is obtained. As shown in fig. 2.
Then, the data set DfewInputting each type of labeled sample into the retrained feature extraction backbone network B, obtaining feature representative vectors thereof through global average pooling operation, and calculating the average vector of all the feature representative vectors as a prototype vector of the type; so processed, each class gets a prototype vector, get CfewPersonal prototype vector, CfewAs a data set DfewNumber of image categories contained in (1), C in the present embodimentfewIs 20.
6. Target detection
And (3) taking the data to be detected preprocessed according to the method in the step (1) as a query set image, sending the query set image into a retrained backbone network B, combining 20 class prototype vectors obtained in the step (5), and guiding an RPN module, a redirection feature map module and a detector module through the retrained prototype to obtain a detection frame for predicting the class and the position of the target in the query set image. And filtering redundant detection frames by adopting a non-maximum value inhibition method, and giving a final detection result of each picture, wherein a score threshold value is set to be 0.3, and an NMS coincidence threshold value is set to be 0.5 to filter the redundant detection frames.
Such NMS methods are described in the literature "A.Neubeck and L.Gool," efficiency Non-Maximum suppression. "18 th International Conference on Pattern Recognition,2006, pp.850-855.
FIG. 3 shows an exemplary graph of the test results obtained by the present invention. Meanwhile, in order to verify the effectiveness of the method, the detection result is measured by selecting the mAP value, the value is between 0 and 1, and the larger the value is, the better the detection effect is. The method of mAP calculation is described in The literature "M.Everingham, SMA.Eslami and L Van Gool," The passive visual object classes challenge: A retroactive, "International journal of computer vision,2015, pp.98-136.
The detection result obtained by the method is compared with a small sample target detection method Meta R-CNN method, and the comparison result is shown in Table 1, so that the method has higher mAP value on both the base class and the new class and has higher detection precision.
TABLE 1
mAP of base class mAP of new class
Meta R-CNN process 52.3% 17.0%
The method of the invention 52.6% 18.6%

Claims (1)

1. A remote sensing image small sample target detection method based on a prototype convolutional neural network is characterized by comprising the following steps:
step 1, preparing base class training data: training an image dataset D with a base classbaseThe target example images of all the categories form a support set image, wherein the target example image is an image block in a target example surrounding frame in the image; training an image dataset D from a base classbaseRandomly extracting the whole image containing any type and quantity of target examples as a query set image; then, preprocessing all the support set images and the query set images, wherein the preprocessed images serve as training data; the preprocessing operation comprises the steps of carrying out normalization processing on all images, adjusting the length and width of the images in the support set to be M multiplied by M, and adjusting the length and width of the images in the query set to be M '× M'; the value of M 'is 0.8 to 1.2 times of the size of the base class training image, and the value of M is 0.28 times of M'; the base class training image data set is a remote sensing image target detection DIOR data set;
step 2, constructing a target detection network: the target detection network mainly comprises a feature extraction and classification prototype acquisition module, a prototype guide RPN module, a redirection feature map module and a detector module;
the specific implementation process of the feature extraction and classification prototype acquisition module is as follows: firstly, inputting a support set image and a query set image, respectively extracting the features of the support set image and the query set image by using a feature extraction backbone network B to obtain corresponding feature maps, and recording the feature map of the support set image as { F }s,1,Fs,2,...,Fs,CF, inquiring a characteristic graph of the set imageqWherein, the feature extraction backbone network B adopts a convolution layer of the first four stages of the ResNet-101 network, Fs,iA feature map representing a support image of the ith category, i being 1,2, …, C being the number of categories; then, feature map { F) of support set images,1,Fs,2,...,Fs,CPerforming global average pooling operation to obtain prototype vector p of each category1,p2,...,pC},piA prototype vector representing the ith class, i ═ 1,2, …, C;
the prototype-guided RPN module is used for generating an interested area possibly containing a target, and the specific implementation process comprises the following steps: class prototype vector { p) obtained by feature extraction and class prototype acquisition module1,p2,...,pCInputting the vectors into a three-layer fully-connected network, outputting a vector with the same length as that of all convolution kernels of an RPN classifier after expansion, recombining the vector according to the shape same as that of the convolution kernels of the RPN classifier to form a group of new convolution kernel parameters, taking the parameters as parameters of an auxiliary classifier, respectively carrying out front-background scoring on anchor points in a feature map of an image of a query set by using the auxiliary classifier and the RPN classifier, and adding the obtained two scores to serve as foreground target scores of the anchor points; then, determining the label of each anchor point according to the anchor point division rule of the RPN, wherein the label comprises a foreground type, a background type and a neglected sample; ranking the foreground target scores from high to low, and adjusting r anchor points with higher scores by using an RPN regressor to obtain r interested areas;
the redirection feature map module performs feature extraction on the region of interest obtained by the prototype-guided RPN module by using the RoI Align operation of the Mask R-CNN network to obtain a feature map { F) of the region of interest1,F2,...,FrIn which FiA feature map representing the ith region of interest, i being 1,2, …, r, r being the number of regions of interest; then, multiplying the category prototype vector by the feature map of the region of interest channel by channel to obtain a redirected feature map;
the detector module utilizes a second-stage detector of the Faster R-CNN network to detect the redirection characteristic diagram output by the redirection characteristic diagram module and outputs a detection frame containing the target prediction category and position; wherein, the classification loss of the detector module adopts a cross entropy loss function, and the regression loss of the detector module adopts a SmoothL1 loss function;
step 3, training a target detection network: inputting the preprocessed support set images and query set images obtained in the step 1 into the target detection network constructed in the step 2 for training to obtain the target detection network trained by the base class data set;
step 4, fine tuning training data preparation: first, an image dataset D is trained from the base classbaseRandomly extracting 3N labeled sample images of each category, and combining the labeled sample images with a new training image data set DnovelCombined to form a fine-tuning training image data set Dfew(ii) a The number of the labeled samples contained in the new image data set is not more than 30, and N is the number of the labeled samples of each category;
then, the fine-tuning training image data set DfewSubstitute base class training image dataset DbaseExecuting the processing in the step 1 to obtain fine tuning training data;
step 5, fine adjustment of the target detection network is carried out: inputting the support set images and the query set images of the fine tuning training data obtained in the step 4 into the trained target detection network obtained in the step 3, and training the network again to obtain the target detection network trained by the fine tuning data set;
then, the data set D constructed in step 4 is usedfewRespectively inputting each type of labeled sample into the trained feature extraction backbone network B, obtaining feature representative vectors thereof through global average pooling operation, and calculating average vectors of all the feature representative vectorsAs prototype vectors for the class; so processed, each class gets a prototype vector, get CfewPersonal prototype vector, CfewAs a data set DfewThe number of image categories contained in (a);
step 6, target detection: inputting the preprocessed data to be detected as a query set image into the feature extraction backbone network B trained in the step 5 to obtain the image features of the query set, and inputting the C obtained in the step 5fewInputting the individual category prototype vector and the image features of the query set into a trained prototype guide RPN module, then obtaining a detection frame containing a target prediction category and a target prediction position through a trained redirection feature map module and a trained detector module, filtering out redundant detection frames by adopting a non-maximum suppression method, and obtaining a final target detection result of the data to be detected through the residual detection frames.
CN202110172985.6A 2021-02-08 2021-02-08 Remote sensing image small sample target detection method based on prototype convolutional neural network Active CN112861720B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110172985.6A CN112861720B (en) 2021-02-08 2021-02-08 Remote sensing image small sample target detection method based on prototype convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110172985.6A CN112861720B (en) 2021-02-08 2021-02-08 Remote sensing image small sample target detection method based on prototype convolutional neural network

Publications (2)

Publication Number Publication Date
CN112861720A true CN112861720A (en) 2021-05-28
CN112861720B CN112861720B (en) 2024-05-14

Family

ID=75989205

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110172985.6A Active CN112861720B (en) 2021-02-08 2021-02-08 Remote sensing image small sample target detection method based on prototype convolutional neural network

Country Status (1)

Country Link
CN (1) CN112861720B (en)

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113283513A (en) * 2021-05-31 2021-08-20 西安电子科技大学 Small sample target detection method and system based on target interchange and metric learning
CN113378936A (en) * 2021-06-11 2021-09-10 长沙军民先进技术研究有限公司 Small sample target detection method based on fast RCNN
CN113408546A (en) * 2021-06-21 2021-09-17 武汉工程大学 Single-sample target detection method based on mutual global context attention mechanism
CN113642574A (en) * 2021-07-30 2021-11-12 中国人民解放军军事科学院国防科技创新研究院 Small sample target detection method based on feature weighting and network fine tuning
CN113705570A (en) * 2021-08-31 2021-11-26 长沙理工大学 Few-sample target detection method based on deep learning
CN113743455A (en) * 2021-07-23 2021-12-03 北京迈格威科技有限公司 Target retrieval method, device, electronic equipment and storage medium
CN113780272A (en) * 2021-07-02 2021-12-10 北京建筑大学 SAR image ship detection method and device, electronic equipment and storage medium
CN113822368A (en) * 2021-09-29 2021-12-21 成都信息工程大学 Anchor-free incremental target detection method
CN114124437A (en) * 2021-09-28 2022-03-01 西安电子科技大学 Encrypted flow identification method based on prototype convolutional network
CN114169442A (en) * 2021-12-08 2022-03-11 中国电子科技集团公司第五十四研究所 Remote sensing image small sample scene classification method based on double prototype network
CN114219804A (en) * 2022-02-22 2022-03-22 汉斯夫(杭州)医学科技有限公司 Small sample tooth detection method based on prototype segmentation network and storage medium
CN114743045A (en) * 2022-03-31 2022-07-12 电子科技大学 Small sample target detection method based on double-branch area suggestion network
CN114861842A (en) * 2022-07-08 2022-08-05 中国科学院自动化研究所 Few-sample target detection method and device and electronic equipment
CN115049870A (en) * 2022-05-07 2022-09-13 电子科技大学 Target detection method based on small sample
CN115049944A (en) * 2022-06-02 2022-09-13 北京航空航天大学 Small sample remote sensing image target detection method based on multitask optimization
CN115100532A (en) * 2022-08-02 2022-09-23 北京卫星信息工程研究所 Small sample remote sensing image target detection method and system
CN115115898A (en) * 2022-08-31 2022-09-27 南京航空航天大学 Small sample target detection method based on unsupervised feature reconstruction
CN115984621A (en) * 2023-01-09 2023-04-18 宁波拾烨智能科技有限公司 Small sample remote sensing image classification method based on restrictive prototype comparison network
CN116129226A (en) * 2023-04-10 2023-05-16 之江实验室 Method and device for detecting few-sample targets based on multi-prototype mixing module
WO2023109208A1 (en) * 2021-12-15 2023-06-22 北京邮电大学 Few-shot object detection method and apparatus
CN116310894A (en) * 2023-02-22 2023-06-23 中交第二公路勘察设计研究院有限公司 Unmanned aerial vehicle remote sensing-based intelligent recognition method for small-sample and small-target Tibetan antelope
CN117409250A (en) * 2023-10-27 2024-01-16 北京信息科技大学 Small sample target detection method, device and medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109063594A (en) * 2018-07-13 2018-12-21 吉林大学 Remote sensing images fast target detection method based on YOLOv2
CN109919108A (en) * 2019-03-11 2019-06-21 西安电子科技大学 Remote sensing images fast target detection method based on depth Hash auxiliary network
CN110189304A (en) * 2019-05-07 2019-08-30 南京理工大学 Remote sensing image target on-line quick detection method based on artificial intelligence
CN111160249A (en) * 2019-12-30 2020-05-15 西北工业大学深圳研究院 Multi-class target detection method of optical remote sensing image based on cross-scale feature fusion
CN111666836A (en) * 2020-05-22 2020-09-15 北京工业大学 High-resolution remote sensing image target detection method of M-F-Y type lightweight convolutional neural network
CN111797676A (en) * 2020-04-30 2020-10-20 南京理工大学 High-resolution remote sensing image target on-orbit lightweight rapid detection method

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109063594A (en) * 2018-07-13 2018-12-21 吉林大学 Remote sensing images fast target detection method based on YOLOv2
CN109919108A (en) * 2019-03-11 2019-06-21 西安电子科技大学 Remote sensing images fast target detection method based on depth Hash auxiliary network
CN110189304A (en) * 2019-05-07 2019-08-30 南京理工大学 Remote sensing image target on-line quick detection method based on artificial intelligence
CN111160249A (en) * 2019-12-30 2020-05-15 西北工业大学深圳研究院 Multi-class target detection method of optical remote sensing image based on cross-scale feature fusion
CN111797676A (en) * 2020-04-30 2020-10-20 南京理工大学 High-resolution remote sensing image target on-orbit lightweight rapid detection method
CN111666836A (en) * 2020-05-22 2020-09-15 北京工业大学 High-resolution remote sensing image target detection method of M-F-Y type lightweight convolutional neural network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
胡秀华;郭雷;李晖晖;: "基于特征融合与分类器在线学习的目标跟踪算法", 控制与决策, no. 09 *

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113283513A (en) * 2021-05-31 2021-08-20 西安电子科技大学 Small sample target detection method and system based on target interchange and metric learning
CN113283513B (en) * 2021-05-31 2022-12-13 西安电子科技大学 Small sample target detection method and system based on target interchange and metric learning
CN113378936A (en) * 2021-06-11 2021-09-10 长沙军民先进技术研究有限公司 Small sample target detection method based on fast RCNN
CN113378936B (en) * 2021-06-11 2024-03-08 长沙军民先进技术研究有限公司 Faster RCNN-based few-sample target detection method
CN113408546A (en) * 2021-06-21 2021-09-17 武汉工程大学 Single-sample target detection method based on mutual global context attention mechanism
CN113408546B (en) * 2021-06-21 2023-03-07 武汉工程大学 Single-sample target detection method based on mutual global context attention mechanism
CN113780272A (en) * 2021-07-02 2021-12-10 北京建筑大学 SAR image ship detection method and device, electronic equipment and storage medium
CN113743455A (en) * 2021-07-23 2021-12-03 北京迈格威科技有限公司 Target retrieval method, device, electronic equipment and storage medium
WO2023000764A1 (en) * 2021-07-23 2023-01-26 北京迈格威科技有限公司 Target retrieval method and apparatus, device and storage medium
CN113642574A (en) * 2021-07-30 2021-11-12 中国人民解放军军事科学院国防科技创新研究院 Small sample target detection method based on feature weighting and network fine tuning
CN113642574B (en) * 2021-07-30 2022-11-29 中国人民解放军军事科学院国防科技创新研究院 Small sample target detection method based on feature weighting and network fine tuning
CN113705570A (en) * 2021-08-31 2021-11-26 长沙理工大学 Few-sample target detection method based on deep learning
CN113705570B (en) * 2021-08-31 2023-12-08 长沙理工大学 Deep learning-based few-sample target detection method
CN114124437A (en) * 2021-09-28 2022-03-01 西安电子科技大学 Encrypted flow identification method based on prototype convolutional network
CN113822368B (en) * 2021-09-29 2023-06-20 成都信息工程大学 Anchor-free incremental target detection method
CN113822368A (en) * 2021-09-29 2021-12-21 成都信息工程大学 Anchor-free incremental target detection method
CN114169442A (en) * 2021-12-08 2022-03-11 中国电子科技集团公司第五十四研究所 Remote sensing image small sample scene classification method based on double prototype network
WO2023109208A1 (en) * 2021-12-15 2023-06-22 北京邮电大学 Few-shot object detection method and apparatus
CN114219804A (en) * 2022-02-22 2022-03-22 汉斯夫(杭州)医学科技有限公司 Small sample tooth detection method based on prototype segmentation network and storage medium
CN114219804B (en) * 2022-02-22 2022-05-24 汉斯夫(杭州)医学科技有限公司 Small sample tooth detection method based on prototype segmentation network and storage medium
CN114743045A (en) * 2022-03-31 2022-07-12 电子科技大学 Small sample target detection method based on double-branch area suggestion network
CN114743045B (en) * 2022-03-31 2023-09-26 电子科技大学 Small sample target detection method based on double-branch area suggestion network
CN115049870A (en) * 2022-05-07 2022-09-13 电子科技大学 Target detection method based on small sample
CN115049944A (en) * 2022-06-02 2022-09-13 北京航空航天大学 Small sample remote sensing image target detection method based on multitask optimization
CN115049944B (en) * 2022-06-02 2024-05-28 北京航空航天大学 Small sample remote sensing image target detection method based on multitasking optimization
CN114861842A (en) * 2022-07-08 2022-08-05 中国科学院自动化研究所 Few-sample target detection method and device and electronic equipment
CN114861842B (en) * 2022-07-08 2022-10-28 中国科学院自动化研究所 Few-sample target detection method and device and electronic equipment
CN115100532A (en) * 2022-08-02 2022-09-23 北京卫星信息工程研究所 Small sample remote sensing image target detection method and system
CN115100532B (en) * 2022-08-02 2023-04-07 北京卫星信息工程研究所 Small sample remote sensing image target detection method and system
CN115115898A (en) * 2022-08-31 2022-09-27 南京航空航天大学 Small sample target detection method based on unsupervised feature reconstruction
CN115984621B (en) * 2023-01-09 2023-07-11 宁波拾烨智能科技有限公司 Small sample remote sensing image classification method based on restrictive prototype comparison network
CN115984621A (en) * 2023-01-09 2023-04-18 宁波拾烨智能科技有限公司 Small sample remote sensing image classification method based on restrictive prototype comparison network
CN116310894A (en) * 2023-02-22 2023-06-23 中交第二公路勘察设计研究院有限公司 Unmanned aerial vehicle remote sensing-based intelligent recognition method for small-sample and small-target Tibetan antelope
CN116310894B (en) * 2023-02-22 2024-04-16 中交第二公路勘察设计研究院有限公司 Unmanned aerial vehicle remote sensing-based intelligent recognition method for small-sample and small-target Tibetan antelope
CN116129226A (en) * 2023-04-10 2023-05-16 之江实验室 Method and device for detecting few-sample targets based on multi-prototype mixing module
CN117409250A (en) * 2023-10-27 2024-01-16 北京信息科技大学 Small sample target detection method, device and medium
CN117409250B (en) * 2023-10-27 2024-04-30 北京信息科技大学 Small sample target detection method, device and medium

Also Published As

Publication number Publication date
CN112861720B (en) 2024-05-14

Similar Documents

Publication Publication Date Title
CN112861720B (en) Remote sensing image small sample target detection method based on prototype convolutional neural network
CN111160249A (en) Multi-class target detection method of optical remote sensing image based on cross-scale feature fusion
CN101976258B (en) Video semantic extraction method by combining object segmentation and feature weighing
CN101236608B (en) Human face detection method based on picture geometry
CN110287826B (en) Video target detection method based on attention mechanism
CN111833273B (en) Semantic boundary enhancement method based on long-distance dependence
CN113111727B (en) Feature alignment-based method for detecting rotating target in remote sensing scene
CN110082821B (en) Label-frame-free microseism signal detection method and device
Lei et al. End-to-end change detection using a symmetric fully convolutional network for landslide mapping
CN104408692A (en) Image fuzzy model parameter analysis method based on depth learning
CN110188828A (en) A kind of image sources discrimination method based on virtual sample integrated study
CN115063786A (en) High-order distant view fuzzy license plate detection method
CN114495010A (en) Cross-modal pedestrian re-identification method and system based on multi-feature learning
Pramestya et al. Road defect classification using gray level co-occurrence matrix (GLCM) and radial basis function (RBF)
Patel et al. A novel approach for semantic segmentation of automatic road network extractions from remote sensing images by modified UNet
Yin et al. Road Damage Detection and Classification based on Multi-level Feature Pyramids.
CN112446305A (en) Pedestrian re-identification method based on classification weight equidistant distribution loss model
CN112613474A (en) Pedestrian re-identification method and device
Khosravi et al. Vehicle speed and dimensions estimation using on-road cameras by identifying popular vehicles
Xia et al. Abnormal event detection method in surveillance video based on temporal CNN and sparse optical flow
CN110866435A (en) Far infrared pedestrian training method with self-similarity gradient oriented histogram
CN107679528A (en) A kind of pedestrian detection method based on AdaBoost SVM Ensemble Learning Algorithms
CN112348011B (en) Vehicle damage assessment method and device and storage medium
CN111046861B (en) Method for identifying infrared image, method for constructing identification model and application
CN114202536A (en) Method for extracting disaster environment features based on AT-EnFCM-SCNN

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant