CN114037899A

CN114037899A - VIT-based hyperspectral remote sensing image-oriented classification radial accumulation position coding system

Info

Publication number: CN114037899A
Application number: CN202111453939.XA
Authority: CN
Inventors: 翁谦; 吴雨阳; 林嘉雯; 林智明; 陈浩; 黄志铭
Original assignee: Fuzhou University
Current assignee: Fuzhou University
Priority date: 2021-12-01
Filing date: 2021-12-01
Publication date: 2022-02-11

Abstract

The invention relates to a VIT-based hyperspectral remote sensing image-oriented radial accumulation position coding system, which comprises: the data extraction preprocessing module is used for intercepting data from the data set, preprocessing the data through Principal Component Analysis (PCA) and inputting the processed data into the model frame; the position coding module is used for superposing input data from a central point to the periphery so that the data of each point is attached with position information; the data splitting module divides data into a plurality of block-shaped areas, stretches the block-shaped areas into one-dimensional vectors respectively, and inputs the vectors into the ViT model; and the main body model module comprises an ViT model and is used for processing input data and outputting a classification result. The invention adds more effective spatial information to the data input to ViT, thereby improving the classification accuracy of pixel level.

Description

VIT-based hyperspectral remote sensing image-oriented classification radial accumulation position coding system

Technical Field

The invention relates to the field of remote sensing, in particular to a VIT-based hyperspectral remote sensing image-oriented radial accumulation position coding system.

Background

The hyperspectral image classification aims at distributing each pixel to a certain class, the research work of the technology has attracted great attention in academia and industry, and the hyperspectral image classification has significant application value in the aspects of land coverage detection, city planning and traffic monitoring. The hyperspectral image has the characteristics of high spectral feature dimensionality and low spatial resolution, and the great challenge is provided for fully utilizing the spectral information contained in each pixel and the spatial information contained in the pixels around each pixel. The current hyperspectral image classification work mainly adopts a deep learning method, compared with the traditional machine learning method, the precision is obviously improved, but the multilayer convolution structure also causes a great increase of the calculated amount. Vision Transformer (ViT) is a neural network model based on a self-attention mechanism which is created in recent years, and global context information is captured in an attention mode so as to establish long-distance dependence on a target, so that the characteristic capability of the feature is improved. Under the condition that the receptive field is the same, the calculation amount is greatly reduced compared with the deep learning method.

For the ViT model, the position coding method determines the utilization effect of the model on data space information, the commonly used position coding method is trigonometric function position coding, the method is a commonly used position coding method in Natural Language Processing (NLP), the method is well represented in one-dimensional data, but the two-dimensional space relationship between pixel points cannot be well expressed, and the method is not suitable for high-dimensional hyperspectral remote sensing images.

Disclosure of Invention

In view of this, the present invention provides a VIT-based radial accumulation position coding system for hyperspectral remote sensing image classification, so that more effective spatial information can be added to data input to ViT, thereby improving the pixel-level classification accuracy.

In order to achieve the purpose, the invention adopts the following technical scheme:

a VIT-based hyperspectral remote sensing image-oriented radial accumulation position coding system comprises:

the data extraction preprocessing module is used for intercepting data from the data set, preprocessing the data through Principal Component Analysis (PCA) and inputting the processed data into the model frame;

the position coding module is used for superposing input data from a central point to the periphery so that the data of each point is attached with position information;

the data splitting module divides data into a plurality of block-shaped areas, stretches the block-shaped areas into one-dimensional vectors respectively, and inputs the vectors into the ViT model;

and the main body model module comprises an ViT model and is used for processing input data and outputting a classification result.

Furthermore, the position coding module achieves the effect of distinguishing the positions of all the points similar to the traditional position coding by accumulating the data from the central point to the periphery in the radial direction, and enables the model to have absolute rotation invariance when the size of the block-shaped area is divided into 1 x 1.

Furthermore, when the input data is preprocessed by the PCA principal component analysis method, the dimensionality of the data is not changed.

Further, the data splitting module divides the input data into N block-like regions according to the size of 1 × 1 or 3 × 3, stretches the N block-like regions into one-dimensional vectors, and inputs the vectors into the ViT model.

A coding method for a hyperspectral remote sensing image classification-oriented radial accumulation position coding system based on VIT comprises the following steps:

step S1: intercepting data from the data set according to a window size, wherein the category of the central point is set as a classification label; preprocessing by PCA principal component analysis, and inputting the processed data into a model frame;

step S2: processing the input data by using the radial accumulation position coding method provided by the text, and overlapping the input data from a central point to the periphery to enable the data of each point to be accompanied with position information;

step S3: dividing input data into N block-shaped areas according to the size of 1 × 1 or 3 × 3, stretching the N block-shaped areas into one-dimensional vectors respectively, and inputting the vectors into an ViT model;

step S4: the data is input into ViT model to get classification result.

Compared with the prior art, the invention has the following beneficial effects:

the invention adds more effective spatial information to the data input to ViT, thereby improving the classification accuracy of pixel level.

Drawings

FIG. 1 is a schematic diagram of the system architecture of the present invention;

FIG. 2 is a schematic diagram of radial accumulation position encoding according to an embodiment of the present invention;

fig. 3 is a schematic diagram of an overlay method according to an embodiment of the invention.

Detailed Description

The invention is further explained below with reference to the drawings and the embodiments.

Referring to fig. 1, the present invention provides a VIT-based radial accumulation position coding system for hyperspectral remote sensing image classification, which includes:

the position encoding module, instead of the trigonometric function position encoding commonly used in the ViT model, superimposes the input data from the center point to the periphery, as shown in fig. 2, so that the data of each point is accompanied by position information. The superposition method comprises the three conditions as shown in figure 3, which respectively correspond to the current point on the diagonal line relative to the central point, between the diagonal line and the vertical bisector and on the vertical bisector;

Preferably, in this embodiment, the position coding module achieves an effect of distinguishing positions of each point similar to the conventional position coding by accumulating data from a central point radially to the periphery, and makes the model have absolute rotation invariance when dividing the block-shaped region into 1 × 1.

Preferably, in this embodiment, when the input data is preprocessed by the PCA principal component analysis method, the dimensionality of the data is not changed.

Preferably, in this embodiment, the data splitting module divides the input data into N block-like regions according to the size 1 × 1 or 3 × 3, stretches the N block-like regions into one-dimensional vectors, and inputs the vectors into the ViT model.

step S4: the data is input into ViT model to get classification result.

The above description is only a preferred embodiment of the present invention, and all equivalent changes and modifications made in accordance with the claims of the present invention should be covered by the present invention.

Claims

1. A VIT-based hyperspectral remote sensing image-oriented radial accumulation position coding system is characterized by comprising:

2. The VIT-based hyperspectral remote sensing image-oriented radial accumulation position coding system according to claim 1, wherein the position coding module achieves an effect of distinguishing positions of each point similar to a traditional position coding by accumulating data from a central point to the periphery in a radial direction, and enables a model to have absolute rotation invariance when a block area is divided into 1 x 1.

3. The VIT-based hyperspectral remote sensing image-oriented radial accumulation position coding system according to claim 1, wherein the input data is preprocessed by a PCA Principal Component Analysis (PCA) method without changing the dimensionality of the data.

4. The VIT-based hyperspectral remote sensing image-oriented radial accumulation position coding system according to claim 1, wherein the data splitting module divides input data into N block-shaped areas according to the size of 1 x 1 or 3 x 3, stretches the N block-shaped areas into one-dimensional vectors respectively, and inputs the vectors into the ViT model.

5. A coding method for a hyperspectral remote sensing image classification-oriented radial accumulation position coding system based on VIT is characterized by comprising the following steps:

step S4: the data is input into ViT model to get classification result.