CN116258944B

CN116258944B - Remote sensing image classification model sample increment learning method based on double networks

Info

Publication number: CN116258944B
Application number: CN202310540710.2A
Authority: CN
Inventors: 姚光乐; 李雪; 王洪辉; 李军; 孙思源; 周皓然; 叶绍泽; 陈才华; 徐晓宇
Original assignee: Chengdu Univeristy of Technology
Current assignee: Chengdu Univeristy of Technology
Priority date: 2023-05-15
Filing date: 2023-05-15
Publication date: 2023-07-21
Anticipated expiration: 2043-05-15
Also published as: CN116258944A

Abstract

The invention discloses a remote sensing image classification model sample increment learning method based on a double network, which belongs to the field of data identification and comprises the following steps of using D ₀ Training the classification model to obtain a classification model M for identifying N remote sensing image categories ₀ The method comprises the steps of carrying out a first treatment on the surface of the Constructing a sample increment learned data stream D ₁ ~D _t ，D _t Class set and D of middle training samples ₀ The same; the incremental learning is sequentially carried out, and the process of the t-th incremental learning is as follows: constructing a dual network model including a first network model M _t1 And a second network model M _t2 ，M _t1 With the model M obtained by last learning _t‑1 Inter-knowledge distillation for maintaining M _t‑1 Old knowledge contained in output, relieving problem of forgetting knowledge, M _t2 Will learn D _t The resulting output is taken as M _t1 And the consistency loss is carried out on the output of the two networks in each training step, so that the model can effectively learn new knowledge, network parameters are updated through a knowledge collaborative algorithm, and the classification performance of the model is improved.

Description

Remote sensing image classification model sample increment learning method based on double networks

Technical Field

The invention relates to an image classification method, in particular to a remote sensing image classification model sample increment learning method based on a double network.

Background

The classification of the remote sensing image is one of the most basic tasks of the interpretation of the remote sensing image. The traditional remote sensing image classification work is mainly based on manual design features such as scale-invariant feature transformation and visual word bag models. However, because the remote sensing image has a complex geometry and a spatial pattern of differences, the manually designed features may not be able to obtain high-level semantic information of the remote sensing image. In recent years, remote sensing image classification has been greatly advanced under the push of Convolutional Neural Networks (CNNs). Wherein Xue et al extract advanced features of the remote sensing image using a pre-trained CNN as a feature extractor. Xu Suhui and the like extract multi-scale features of the image by using CNN, and realize remote sensing image classification by using a multi-core support vector machine. Song Zhongshan and the like propose to fuse features of different scales by utilizing active rotation polymerization, compensate a convolutional neural network (FAC-CNN) by feature fusion, and apply the convolutional neural network to remote sensing image classification to improve the generalization capability of a model. Qu Zhen and the like propose a new remote sensing image classification model based on an effective channel attention (Efficient Channel Attention, ECA) mechanism, and further improve the classification accuracy and generalization capability of the model.

However, as new remote sensing image data is continuously collected, the remote sensing image classification model based on the deep neural network needs to continuously learn new knowledge in the new data. In order to improve the recognition performance of the remote sensing image, it is important that the model learn new knowledge and keep the memory of old knowledge. Assuming that old data is available, one of the simplest methods is to train on new data and old data with Joint training methods. However, this approach requires the storage of historical data, resulting in wasted storage and computing resources. In practical applications, this is obviously inefficient. If the old data is not stored and used, the new data is directly used for Fine adjustment on the model, and the method is English Fine-tuning and FT is abbreviated. This approach can suffer from forgetfulness, resulting in a decrease in the overall classification performance of the model.

Disclosure of Invention

The invention aims to provide a remote sensing image classification model sample increment learning method based on a double network, which solves the problems, can not forget the capability of the data trained before in the process of continuously learning new data, does not need to store historical data, greatly saves storage space and can improve the overall classification performance of remote sensing images.

In order to achieve the above purpose, the technical scheme adopted by the invention is as follows: a remote sensing image classification model sample increment learning method based on a double network comprises the following steps of;

(1) Using the original training dataset D ₀ Training the classification model to obtain a classification model M capable of identifying N remote sensing image categories ₀ The method comprises the steps of carrying out a first treatment on the surface of the The D is ₀ The training sample in the training model is a remote sensing image with a remote sensing image type label, D ₀ Is c= { C ₁ ，C ₂ ，…，C _N N is the total number of remote sensing image categories;

(2) Constructing a data stream d= { D for sample increment learning ₁ ，D ₂ ，…，D _t T=1, 2, …, training set for the t-th incremental learning is D _t The model obtained by learning is M _t ，D _t Class set and D of middle training samples ₀ The same;

(3) The incremental learning is sequentially carried out, and the process of the t-th incremental learning comprises (31) - (33)

(31) Constructing a dual network model including a first network model M _t1 And a second network model M _t2 Model M obtained by t-1 time increment learning _t-1 Initialization results, when t=1, M _t-1 Is M ₀ ；

(32) By D _t Training a first network model M _t1 Calculation of M by knowledge distillation _t1 And M is as follows _t-1 Distillation loss L of (2) _kd The method comprises the steps of carrying out a first treatment on the surface of the By D _t Training a second network model M _t2 Calculate M _t2 And M _t1 Consistency loss L of (2) _con ；

(33) Knowledge collaborative learning;

according to a first network model M _t1 Calculates the classification loss L thereof _ce ；

Calculating a first network model M _t1 Is the complete loss function L of (2) _all Updating M through knowledge collaboration policy _t2 Weights of (2), and then M _t2 Model M obtained as this learning _t ；

In the method, in the process of the invention,λandβrespectively is L _kd And L _ce Is a weight of (2).

As preferable: for training set D _t ，Wherein x is _i For D _t In (i) th training sample, y _i Is x _i Is a remote sensing image category label, n is D _t Is a sample count of the total number of samples in the sample.

As preferable: in step (2), the D ₁ The trained classification model is a ResNet network model.

As preferable: in the step (32), the distillation loss L is calculated by the following formula _kd ；

Wherein n is D _t In the total amount of samples in (a),and->Are respectively M _t-1 Neutralization of M _t1 A softmax function with a temperature parameter T corresponding to the ith training sample;

wherein the method comprises the steps ofAnd->Respectively D _t The ith training sample in (1) at M _t-1 Neutralization of M _t1 K=1 to n, and represents the kth remote sensing image category.

As preferable: calculating a consistency loss L by _con ；

Wherein the method comprises the steps ofAnd->Representing the ith training sample at M _t1 And M _t2 In (2), MSE () is +.>And->A desired distance therebetween.

As preferable: in step (33), M is calculated by the following formula _t1 Classification loss L of (2) _ce ；

Wherein n is D _t Total number of training samples, x _i For D _t In (i) th training sample, y _i Is x _i Is characterized by that the remote sensing image class label,is x _i At M _t1 Output probability of (a) is determined.

As preferable: updating M in step (33) by knowledge of collaborative policies _t Specifically, M is updated by _t Weights of (2);

wherein j represents the j-th training in the t-th incremental learning,αis the super-parameter of the smoothing coefficient,and->Respectively denoted as a first network model M _t1 Second network model M _t2 Is included in the weight information.

Compared with the prior art, the invention has the advantages that: in the prior art, the sample increment is to learn new classification by continuously learning new data sets, so that the recognition performance of the remote sensing image model on old data is reduced due to forgetting knowledge, but the invention is completely different from the prior art, and the invention provides a method for cooperated working of a double-network structure and a model obtained by original sample increment learning on the basis that the sample types in each continuously learned data set are the same, wherein each learning is not to increase the number of the types, but to continuously improve the classification efficiency and the classification precision. The invention can also effectively overcome the problem of forgetting knowledge, and history data is not required to be stored, thereby greatly saving the storage space. Therefore, the incremental learning algorithm for classifying the remote sensing images has wide application prospect.

Wherein, with respect to the first network model: the method aims at preserving old knowledge, if an existing sample increment learning method is adopted, each increment learning is finely tuned on a subsequent training set, the recognition capability of the model on old data is reduced due to forgetting of knowledge, in order to guide a remote sensing image classification model to preserve old knowledge in the process of learning new data, the output probability of each image is expected to be close to the output of an original network for each increment training set, a first network model is introduced, knowledge distillation is adopted, distillation loss between the model and the model obtained by the previous round of learning is calculated, and the method can well encourage the output of the first network model to be close to the output of the previous round of model, so that the method is an improved cross entropy loss and the weight of smaller probability is increased. And in order to avoid the need of large storage requirement, the distillation loss of the invention only depends on the classification model obtained in the previous round, namely, for the t-th incremental learning stage, the distillation loss only depends on M _t-1 Is provided.

Regarding the second network model: the purpose of setting is to learn new knowledge, the second network model inputs the data set Dt of the training round, so the second network model has the probability output corresponding to the second network model, but the second network model takes the output of the second network model as the learning target of the first network, and the classification model of the invention can effectively learn new knowledge in new data by continuously maintaining the consistency of the two networks. In the invention, the consistency is realized by calculating the consistency loss of the first network model and the second network model.

Regarding collaborative learning: the invention actually learns the last increment to obtain the classification model M _t-1 First network model M constructed by incremental learning _t1 Second network model M _t2 In combination, jointly control a first network model andthe internal weights of the second network model. In each training step, the first network model trains each data set in a supervised manner by minimizing standard cross entropy loss, namely, in step (33), the classification loss is calculated according to the output of the first network model, then the internal weight of the first network model is updated according to the classification loss, distillation loss and consistency loss, the weight of the second network model is updated by using a knowledge collaborative strategy, the second network model is used as the classification model obtained by the current learning, the second network model updates the internal weight in an updating mode of the knowledge collaborative algorithm, the second network model is not only related to the first network model, the old knowledge and the ability of learning new knowledge can be reserved when the first network model is obtained, and meanwhile, the parameter updating is more robust, and the prediction effect is better.

Drawings

FIG. 1 is a diagram of the overall framework of the present invention;

FIG. 2 is a diagram illustrating a remote sensing image sample of the AID dataset of example 2;

FIG. 3 is the experimental results of example 2 on AID datasets using five methods;

FIG. 4 is the experimental results of example 2 on the NWPU-RESISC45 dataset using five methods;

FIG. 5 is a diagram showing the forgetting of old knowledge using the classification model of the remote sensing image in example 2;

FIG. 6 is M _t1 And M _t2 And a result graph for classification prediction.

Description of the embodiments

The invention will be further described with reference to the accompanying drawings.

Example 1: referring to fig. 1, a remote sensing image classification model sample increment learning method based on a dual network includes the following steps;

(33) Knowledge collaborative learning;

In this embodiment: for training set D _t ，Wherein x is _i For D _t In (i) th training sample, y _i Is x _i Is a remote sensing image category label, n is D _t Is a sample count of the total number of samples in the sample.

In step (2), the D ₁ The trained classification model is a ResNet network model.

In the step (32), the distillation loss L is calculated by the following formula _kd ；

In the present invention, the consistency loss L is calculated by the following formula _con ；

In step (33), M is calculated by the following formula _t1 Classification loss L of (2) _ce ；

Updating M in step (33) by knowledge of collaborative policies _t Specifically, M is updated by _t Weights of (2);

Example 2: referring to fig. 1 to 4, based on embodiment 1, we perform a remote sensing image sample incremental learning experiment using two challenging remote sensing image data sets AID and NWPU-RESISC 45. Hereinafter, NWPU-RESISC45 is abbreviated NWPU. The AID dataset mainly comprises 30 remote sensing image categories, and totally comprises 10000 marked remote sensing images. The number of images per category is between 220 and 420. The NWPU dataset is 45 classes, each class containing 700 remote sensing images, each image size being 256 x 256 pixels. The two data set images cover a plurality of countries and regions of the world, are photographed under different imaging conditions and seasons, and are suitable for the proposed sample increment learning experimental scene. Fig. 2 is a remote sensing image sample example of an AID dataset.

Construction of data sets for each study: in step (1) of the present invention we need to construct the original training dataset D ₀ In step (2) of the present invention, it is also necessary to construct a data stream d= { D for sample increment learning ₁ ，D ₂ ，…，D _t }. For a specific AID dataset and NWPU dataset we set specifically as follows:

for the AID dataset we split it into 5 training sets: training set 1 as D ₀ The training set comprises 5000 remote sensing images used for basic model learning, and the rest 4 training sets are used as D in the data stream D ₁ To D ₄ Each containing 1000 remote sensing images for incremental learning. The size of each picture is cropped to 224 x 224 pixels.

For the NWPU dataset, the same theory is divided into 5 training sets, the 1 st training set is taken as D ₀ The method comprises 4500 remote sensing images which are used for basic model learning. The remaining 4 training sets are taken as D in the data stream D ₁ To D ₄ Each containing 4500 remote sensing images for incremental learning. Incremental learning settings for both data sets are shown in table 1.

Table 1: incremental learning setting table for two data sets

Experiment setting: all experiments were based on Pytorch implementation, using ResNet18 as the trained deep neural network model, for model training, in the raw training dataset D ₀ Training was performed on 50 epochs, with an initial learning rate set to 0.01 and a batch size set to 128. In the incremental learning process, D _t The upper batch size was set to 64, 20 training rounds were set, the initial learning rate was set to 0.01, and after 15 epochs it was reduced to 0.001. The momentum attenuation coefficient and the weight attenuation coefficient are 0.9 and 0.0001, respectively. In addition, willλAndβthe temperature parameters are respectively set to 0.2 and 1, and the temperature parameter T is set to 2.

Sample increment learning is carried out on two data sets by the method and four methods in the prior art, and the accuracy of the five methods in the increment learning stage is compared.

The four methods in the prior art are respectively as follows:

the method comprises the following steps: the join method, also called a Joint training method, refers to the Joint training of all old data and current new data by a remote sensing image classification model.

The second method is as follows: the FT method is used for Fine tuning new data on a trained remote sensing image classification model, and does not perform any action of retaining old data.

And a third method: lwF, english Learning without Forgetting, the method used for simultaneously optimizing the data contained in the old knowledge and the new knowledge, reduces forgetting by continuously enhancing the stability of the model output on the old knowledge.

The method four: EWC, english Elastic Weight Consolidation, adopts a regularization strategy, and attempts to reduce forgetting by regularizing the weights of the old knowledge network to constrain the changes of important parameters while learning new knowledge.

The results of the experiments based on the AID dataset are shown in table 2 for the examples and the NWPU-RESISC45 dataset are shown in table 3 for the examples.

Table 2: AID dataset experimental result comparison table

Table 3: NWPU-RESISC45 dataset experimental result comparison table

The experimental results on AID and NWPU-RESISC45 datasets using the five methods described above are shown in FIGS. 3 and 4.

As can be seen in combination with tables 2, 3, and fig. 3, 4:

(1) On two remote sensing data sets, the method provided by the invention is superior to the FT method in each incremental learning stage, so that the problem of forgetting knowledge of a remote sensing image classification model is effectively reduced, old knowledge can be well reserved, and the overall classification performance of the model on the remote sensing image can be effectively improved.

(2) For the AID remote sensing dataset, in the second, third and fourth incremental learning processes, the FT method is used, so that the classification performance of the model is not improved, but the accuracy is reduced, and the model is proved to have the effect of forgetting knowledge. In contrast, the methods herein improve classification accuracy during each incremental learning process and outperform other incremental setting methods. After all incremental learning is completed, the proposed method achieves a classification accuracy of 77.40%, which is closest to 79.90% of the join method. The sample increment learning method provided by the invention has the advantages that the classification accuracy of the model to the known remote sensing images can be remarkably improved when new knowledge is learned.

(3) For an NWPU remote sensing dataset, the join method finally achieves 82.17% accuracy after all incremental learning is completed. Compared with other methods, the sample increment learning method provided by the method achieves the optimal result of 80.53 percent, which is closest to the point method. The incremental learning method provided by the method can help the remote sensing image classification model to effectively consolidate old knowledge and learn new knowledge under the condition of not storing old data, and the classification performance of the remote sensing image is continuously improved.

Example 3: referring to fig. 1 to 6, on the basis of embodiment 2, we have performed an ablation experiment to verify the validity of the first network model and the second network model in the collaborative learning knowledge model of the present invention.

Referring to fig. 5, forgetting curves of two remote sensing image classification models for old knowledge are shown, one of which is the FT method and one of which is the ft+kd method, wherein KD represents knowledge distillation, and the method uses knowledge distillation loss while using the FT method. And classifying and identifying the two trained models in all old samples, wherein Total represents the Total number of the old samples in all categories. It can be seen from the figure that the classification accuracy of the model to the old knowledge is remarkably reduced by using the FT method, and the classification performance of the model to the old knowledge can be improved by adopting the FT+KD method under most conditions.

Referring to FIG. 6, a trained network model M is shown _t1 And M _t2 As a result of classification prediction, the network model M can be seen from the incremental ensemble learning process _t2 Ratio model M _t1 The recognition effect is better, the classification accuracy is higher, so the model M is used in the experiment _t2 As a final model of remote sensing image classification.

See table 4: comparing the overall test accuracy of the method in the last increment learning by adopting the FT method, combining the FT method with the distillation loss, combining the FT method with the consistency loss;

table 4: ablation experiment accuracy contrast table

From the results in table 4, the knowledge distillation and knowledge collaborative learning methods proposed herein can improve the recognition performance of the remote sensing image classification model. Based on knowledge distillation, knowledge collaborative learning is adopted, so that the classification accuracy of the model can be further improved by 6.7 percent compared with a fine adjustment method. This illustrates that they can help the remote sensing image classification model retain old knowledge and learn new knowledge.

In conclusion, the knowledge distillation method is adopted by the first network model and the network model obtained by the previous study to help the model to keep the memory of old knowledge, so that the forgetting of knowledge can be effectively relieved. The second network model takes learning knowledge of the new data as a learning target of the first network. The two networks learn new data together, and training is carried out by adopting a knowledge collaborative algorithm to improve the classification accuracy of the model. The effectiveness of the method is verified on two remote sensing data sets, and experimental results show that the remote sensing image classification model based on sample increment learning provided by the method can effectively reduce forgetting of knowledge and continuously improve the overall recognition performance of the known remote sensing images.

The foregoing description of the preferred embodiments of the invention is not intended to be limiting, but rather is intended to cover all modifications, equivalents, and alternatives falling within the spirit and principles of the invention.

Claims

1. A remote sensing image classification model sample increment learning method based on a double network is characterized in that: comprises the following steps of;

(3) Sequentially performing incremental learning, wherein the process of the t-th incremental learning comprises (31) - (33);

(31) Constructing a dual network model including a first network model M _t1 And a second network model M _t2 Model M obtained by t-1 time increment learning _t-1 Initial initiationObtained by conversion, when t=1, M _t-1 Is M ₀ ；

(33) Knowledge collaborative learning;

Calculating a first network model M _t1 Is the complete loss function L of (2) _all Updating M through knowledge collaborative policies _t2 Weights of (2), and then M _t2 Model M obtained as this learning _t ；In the method, in the process of the invention,λandβrespectively is L _kd And L _ce Weights of (2);

Wherein n is D _t Total number of training samples, x _i For D _t In (i) th training sample, y _i Is x _i Is a remote sensing image category label->Is x _i At M _t1 Output probability of (a);

updating M in step (33) by knowledge collaborative policy _t Specifically, M is updated by _t Weights of (2);

wherein j represents the j-th training in the t-th incremental learning,αis a smoothing coefficient superparameter,/">And->Respectively denoted as a first network model M _t1 Second network model M _t2 Is included in the weight information.

2. The method for learning the sample increment of the classification model of the remote sensing image based on the double network according to claim 1, wherein the method comprises the following steps of: for training set D _t ，In which x is _i For D _t In (i) th training sample, y _i Is x _i Is a remote sensing image category label, n is D _t Is a sample count of the total number of samples in the sample.

3. The method for learning the sample increment of the classification model of the remote sensing image based on the double network according to claim 1, wherein the method comprises the following steps of: in step (2), D ₁ The trained classification model is a ResNet network model.

4. The method for learning the sample increment of the classification model of the remote sensing image based on the double network according to claim 1, wherein the method comprises the following steps of: in the step (32), the distillation loss L is calculated by the following formula _kd ；

Wherein n is D _t Total amount of samples in>And->Are respectively M _t-1 Neutralization of M _t1 The ith training sample pair in (a)A softmax function with a temperature parameter T;

wherein->And->Respectively D _t The ith training sample in (1) at M _t-1 Neutralization of M _t1 K=1 to n, and represents the kth remote sensing image category.

5. The method for learning the sample increment of the classification model of the remote sensing image based on the double network according to claim 1, wherein the method comprises the following steps of: calculating a consistency loss L by _con ；

Wherein->And->Representing the ith training sample at M _t1 And M _t2 In (2), MSE () is +.>And->A desired distance therebetween.