CN111582146A - High-resolution remote sensing image city function partitioning method based on multi-feature fusion - Google Patents

High-resolution remote sensing image city function partitioning method based on multi-feature fusion

Info

Publication number
CN111582146A
CN111582146A (application CN202010372618.6A)
Authority
CN
China
Prior art keywords
word
image
feature
remote sensing
dimensional
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010372618.6A
Other languages
Chinese (zh)
Other versions
CN111582146B (en)
Inventor
孙伟伟 (Sun Weiwei)
高子为 (Gao Ziwei)
杨刚 (Yang Gang)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ningbo University
Original Assignee
Ningbo University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ningbo University filed Critical Ningbo University
Priority to CN202010372618.6A
Publication of CN111582146A
Application granted
Publication of CN111582146B
Legal status: Active (current)
Anticipated expiration


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/13Satellite images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • G06F18/24155Bayesian classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/28Determining representative reference patterns, e.g. by averaging or distorting; Generating dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/25Determination of region of interest [ROI] or a volume of interest [VOI]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Astronomy & Astrophysics (AREA)
  • Remote Sensing (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a high-resolution remote sensing image city function partitioning method based on multi-feature fusion, which comprises the following steps: step 1, preprocess the image; step 2, assign the feature values in each image to the most similar visual words and count the word frequency of each visual word to form the visual-word features, thereby constructing a multi-feature BoW visual dictionary; step 3, construct an LDA probabilistic topic model and use it to mine the high-dimensional semantic vectors of the image; step 4, train an SVM classifier with the high-dimensional semantic vectors obtained in step 3; and step 5, perform city function partitioning on the test set with the SVM classifier. The beneficial effects of the invention are: POI data are introduced, which reduces the misclassification of remote sensing data caused by the 'same object, different spectra' and 'different objects, same spectrum' phenomena; and by comprehensively using multiple image features, including local, spectral, texture, surface temperature, three-dimensional spatial and POI features, high classification accuracy can be obtained even when no single image feature is distinctive.

Description

High-resolution remote sensing image city function partitioning method based on multi-feature fusion
Technical Field
The invention relates to the field of remote sensing image classification, in particular to a high-resolution remote sensing image urban function partitioning method based on multi-feature fusion.
Background
With the development of remote sensing technology, the temporal and spatial resolution of remote sensing images keeps improving and the volume of remote sensing data is growing rapidly. Faced with massive remote sensing data, manual visual interpretation requires a great deal of time and labor. How to interpret remote sensing images automatically with computers has therefore become a hot research topic in the remote sensing field. Meanwhile, under the 'new normal' of the Chinese economy, cities have entered a new stage of development, and the traditional urban development model faces many problems, which poses new requirements and challenges for urban planning concepts, strategies and construction. Strengthening the rational planning of urban functional zones and determining their most reasonable spatial layout allows urban industries to agglomerate and operate at maximum efficiency, improves the efficiency of urban land use, and ensures the effective implementation of the new urbanization strategy.
To bridge the gap between the low-level visual features and the high-level semantics of remote sensing images, methods that model and describe land-use scene semantics with mid-level features have gradually attracted wide attention. In particular, the bag-of-words (BoW) model has in recent years achieved great success in image analysis and image classification, has become a new and effective approach to expressing image content, and has produced promising results in land-use scene classification of remote sensing images. It represents an image as a set of local image patches; each patch is quantized into visual words so that the image, like a sentence, is described by words occurring with different frequencies, and the set of all words constitutes the visual dictionary.
The bag-of-words model is simple to compute and robust to noise, illumination changes and partial occlusion. However, remote sensing images contain rich texture information and many local feature points, and the visual words obtained directly by clustering do not necessarily reflect the scene characteristics, so the overall classification accuracy is not high. Moreover, for different building types in high-resolution remote sensing images, the discriminative power of each feature is not consistent: some classes are best separated by spectral features because their spectra are distinctive, while others have rich textures, so their local features are more informative. When city function partitioning relies on the remote sensing image alone, the phenomena of 'same object, different spectra' and 'different objects, same spectrum' occur and the classification accuracy drops. Clearly, classifying different types of ground objects with one and the same feature is not appropriate.
In summary, it is very important to provide a high-resolution remote sensing image city function partitioning method based on multi-feature fusion.
Disclosure of Invention
The invention aims to overcome the defects in the prior art, and provides a high-resolution remote sensing image city function partitioning method based on multi-feature fusion, which specifically comprises the following steps:
step 1, preprocess the image, partition it with a grid of appropriate size, and select a training set and a test set; calculate the proportion of each POI type within every grid cell to obtain the POI features; extract the local features, spectral features, texture features, surface temperature features and three-dimensional spatial features from the training set of the remote sensing image respectively:
step 1.1, extract the local features of the remote sensing image: the SURF algorithm is selected, and an image of approximate values of the Hessian-matrix determinant is used to obtain the Hessian matrix corresponding to each pixel (x, y):
H(f(x, y)) = [ ∂²f/∂x²   ∂²f/∂x∂y ; ∂²f/∂x∂y   ∂²f/∂y² ]   (1)
in the above formula, (x, y) is the pixel coordinate, and f (x, y) is the gray value of the coordinate point;
when the discriminant of the Hessian matrix reaches a local maximum, the current point is judged to be brighter or darker than the other points in its neighborhood, and the position of the key point is thereby located; the discriminant of the Hessian matrix is:
det(H) = (∂²f/∂x²)·(∂²f/∂y²) - (∂²f/∂x∂y)²   (2)
box filtering is performed on the integral image with box-filter templates that incorporate the gradation information, so as to construct the scale space; the det(H) of each pixel is compared with that of all neighboring points in the adjacent scale layers, and when it is greater than (or less than) all of them the point is taken as an extremum and its position is recorded as a feature point;
a 60-degree sector sliding window is constructed in the circular region centered on the feature point with radius 6s, where s is the scale of the feature point; the sector is rotated through the whole circular region in steps of 0.2 rad, and the Haar wavelet responses inside the sector window are computed; the direction with the largest accumulated Haar wavelet response is selected as the main direction of the feature point; the accumulated Haar wavelet response is obtained by summing the Haar wavelet responses dx and dy of the image, giving a vector (m_ω, θ_ω):
m_ω = Σ_ω dx + Σ_ω dy   (3)
θ_ω = arctan( Σ_ω dx / Σ_ω dy )   (4)
The main direction θ is the direction corresponding to the maximum accumulated Haar wavelet response, namely:
θ = θ_ω | max{m_ω}   (5)
a square window oriented along the θ direction is then taken around the feature point, with side length 20s, where s is the scale of the feature point; this square window is divided into 4 × 4 sub-regions, and for each sub-region the sum of the horizontal Haar wavelet responses, the sum of their absolute values, the sum of the vertical responses and the sum of their absolute values are counted; the SURF feature descriptor is thus a 4 × 4 × 4 = 64-dimensional feature vector;
step 1.2, extract the spectral features of the remote sensing image by calculating the mean and standard deviation of each band, as follows:
M = (1/n)·Σ_{i=1..n} v_i   (6)
S = sqrt( (1/n)·Σ_{i=1..n} (v_i - M)² )   (7)
in formulas (6) and (7), n is the total number of pixels in the grid cell, v_i is the gray value of the i-th pixel in the band, M is the band mean and S the band standard deviation;
step 1.3, extract the texture features of the remote sensing image with the LBP operator, which is defined in a window of 3 × 3 pixels:
LBP(x_c, y_c) = Σ_{p=0..7} 2^p · s(i_p - i_c)   (8)
in the above formula, (x_c, y_c) is the center of each 3 × 3 region, p indexes the p-th pixel in the 3 × 3 window other than the center pixel, and i_c and i_p are the gray values of the center pixel and of the p-th neighboring pixel respectively;
taking the window center pixel as a threshold, comparing the center pixel value with the gray values of the adjacent 8 pixels: if the neighboring pixel value is greater than the center pixel value, then the position is marked as 1; otherwise, the flag is 0:
s(i_p - i_c) = 1 if i_p ≥ i_c, and 0 otherwise   (9)
in the above formula, the center pixel value i_c serves as the threshold; the eight comparison results form an 8-bit binary number, which is taken as the LBP value of the window's center pixel and reflects the texture information of the 3 × 3 pixel window;
step 1.4, extract the surface temperature features; the surface emissivity is first calculated from NDVI:
ε = f(NDVI)   (10)
in the above formula, ε is the surface emissivity, NDVI is the normalized difference vegetation index, and f(·) stands for the emissivity-estimation relation (its exact expression is given only as an image in the original); the surface temperature is then calculated with the single-window algorithm, whose formula is as follows:
Ts = {a·(1 - C - D) + [b·(1 - C - D) + C + D]·T - D·Ta}/C   (11)
in the above formula, Ts is the surface temperature; a and b are empirical coefficients, with a = -67.35535 and b = 0.458608; T is the brightness temperature; Ta is the effective mean atmospheric temperature; C and D are calculated as follows:
C = τ·ε   (12)
D = (1 - τ)·[1 + τ·(1 - ε)]   (13)
in formulas (12) and (13), τ is the atmospheric transmittance and ε is the surface emissivity;
step 1.5, extract the three-dimensional spatial features: three-dimensional point cloud data are obtained by LiDAR, and the digital surface model (DSM) feature is then generated by the Kriging spatial interpolation algorithm;
step 2, assign the feature values in each image to the most similar visual words and count the word frequency of each visual word to form the visual-word features; construct the multi-feature BoW visual dictionary:
step 2.1, construct the vocabulary of each feature type: perform K-means clustering separately on the local, spectral, texture, surface temperature and three-dimensional spatial features extracted in step 1; each cluster center is a word, and all cluster centers together form the word list;
step 2.2, compute the distance between the features of each grid cell and every word, and assign each feature to the nearest word in the word list;
step 2.3, count the word frequency corresponding to each word to generate a K-dimensional feature vector; the K-dimensional feature vectors of the different feature types are concatenated (stacked), so that each grid cell is represented by a multi-dimensional vector:
f_i = {sur_i, spe_i, lbp_i, lst_i, dsm_i, poi_i}   (14)
in the above formula, sur_i is the local feature vector, spe_i the spectral feature vector, lbp_i the texture feature vector, lst_i the surface temperature feature vector, dsm_i the three-dimensional spatial feature vector, and poi_i the POI feature vector;
each grid cell is thus described as a document composed of feature words:
Doc_i = {word_sur, word_spe, word_lbp, word_lst, word_dsm, word_poi}_i   (15)
step 3, construct an LDA probabilistic topic model and use it to mine the high-dimensional semantic vectors of the image: the LDA model extracts the latent semantic topics contained in the feature vectors and assigns a probability distribution to each feature vector:
step 3.1, with the LDA probabilistic topic model, each document in the document set is expressed in the form of a probability distribution:
P(w|d)=P(w|t)×P(t|d) (16)
in the above formula, w is a word, d is a document, and t is a topic; with topic t as an intermediate layer, two vectors θ_d and φ_t give P(t|d) and P(w|t) respectively: θ_d is the probability distribution over the individual topics t for each document d in the document set D, and φ_t is the probability vector with which the individual topic t in the topic set T generates the different words;
step 3.2, the learning process of the LDA probabilistic topic model is as follows:
for all d and t, θ_d and φ_t are first assigned at random; for the i-th word w_i in a particular document d_s, if word w_i corresponds to topic t_j, formula (16) is rewritten as:
P_j(w_i|d_s) = P(w_i|t_j) × P(t_j|d_s)   (17)
enumerating the topics t in the topic set T gives all P_j(w_i|d_s); according to these results, a topic is selected for the i-th word w_i in d_s, namely the topic t_j with the highest probability P_j(w_i|d_s);
one iteration is defined as computing P(w|d) once for every word w in every document d of the document set D and reselecting the topics accordingly;
the iteration is repeated until θ_d and φ_t converge, and the final estimates of the parameters θ_d and φ_t are output, giving the topic of each word and the high-dimensional semantic vector of each topic;
step 4, training an SVM classifier according to the high-dimensional semantic vector obtained in the step 3;
and 5, carrying out city function partitioning on the test set by using an SVM classifier.
Preferably, the POI features in step 1 are obtained by reclassifying the POI data according to the place-name information.
Preferably, the SURF algorithm in step 1.1 constructs a scale pyramid by changing the size of the box filter while keeping the image size unchanged.
Preferably, in step 1.4, the earth surface temperature features are extracted in a satellite remote sensing data inversion mode.
Preferably, the SURF descriptors of step 1.1 have scale and rotation invariance, as well as invariance to changes in illumination.
Preferably, the LDA probabilistic topic model in step 3 is an unsupervised bayesian model.
The invention has the beneficial effects that: POI data are introduced, which reduces the misclassification of remote sensing data caused by the 'same object, different spectra' and 'different objects, same spectrum' phenomena; and the method comprehensively uses multiple image features, including local, spectral, texture, surface temperature, three-dimensional spatial and POI features, so that high classification accuracy can be obtained even when no single image feature is distinctive.
Drawings
FIG. 1 is a technical flow chart of the present invention.
Detailed Description
The present invention will be further described with reference to the following examples. The following examples are set forth merely to aid in the understanding of the invention. It should be noted that a person skilled in the art could make several modifications to the invention without departing from its principle, and such modifications and improvements also fall within the protection scope of the claims of the present invention.
A high-resolution remote sensing image city function partitioning method based on multi-feature fusion is disclosed, a technical flow chart of which is shown in figure 1, and the method specifically comprises the following steps:
step 1, preprocess the image, partition it with a grid of appropriate size, and select a training set and a test set; calculate the proportion of each POI type within every grid cell to obtain the POI features; extract the local features, spectral features, texture features, surface temperature features and three-dimensional spatial features from the training set of the remote sensing image respectively:
step 1.1, extract the local features of the remote sensing image: the SURF algorithm is selected, and an image of approximate values of the Hessian-matrix determinant is used to obtain the Hessian matrix corresponding to each pixel (x, y):
H(f(x, y)) = [ ∂²f/∂x²   ∂²f/∂x∂y ; ∂²f/∂x∂y   ∂²f/∂y² ]   (1)
in the above formula, (x, y) is the pixel coordinate, and f (x, y) is the gray value of the coordinate point;
when the discriminant of the Hessian matrix reaches a local maximum, the current point is judged to be brighter or darker than the other points in its neighborhood, and the position of the key point is thereby located; the discriminant of the Hessian matrix is:
det(H) = (∂²f/∂x²)·(∂²f/∂y²) - (∂²f/∂x∂y)²   (2)
box filtering is performed on the integral image with box-filter templates that incorporate the gradation information, so as to construct the scale space; the det(H) of each pixel is compared with that of all neighboring points in the adjacent scale layers, and when it is greater than (or less than) all of them the point is taken as an extremum and its position is recorded as a feature point;
a 60-degree sector sliding window is constructed in the circular region centered on the feature point with radius 6s, where s is the scale of the feature point; the sector is rotated through the whole circular region in steps of 0.2 rad, and the Haar wavelet responses inside the sector window are computed; the direction with the largest accumulated Haar wavelet response is selected as the main direction of the feature point; the accumulated Haar wavelet response is obtained by summing the Haar wavelet responses dx and dy of the image, giving a vector (m_ω, θ_ω):
m_ω = Σ_ω dx + Σ_ω dy   (3)
θ_ω = arctan( Σ_ω dx / Σ_ω dy )   (4)
The main direction θ is the direction corresponding to the maximum accumulated Haar wavelet response, namely:
θ = θ_ω | max{m_ω}   (5)
a square window oriented along the θ direction is then taken around the feature point, with side length 20s, where s is the scale of the feature point; this square window is divided into 4 × 4 sub-regions, and for each sub-region the sum of the horizontal Haar wavelet responses, the sum of their absolute values, the sum of the vertical responses and the sum of their absolute values are counted; the SURF feature descriptor is thus a 4 × 4 × 4 = 64-dimensional feature vector;
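For illustration only, the SURF extraction described above can be sketched with OpenCV; this is an assumption-laden stand-in rather than the patent's reference implementation: it presumes an opencv-contrib-python build with the non-free xfeatures2d module enabled, and the function name, Hessian threshold and file name are placeholders.

```python
# Sketch: SURF local features for one grid cell (assumes opencv-contrib-python
# built with the non-free xfeatures2d module; threshold value is illustrative).
import cv2

def extract_surf_descriptors(gray_cell, hessian_threshold=400):
    """Return SURF keypoints and 64-dimensional descriptors for one grayscale grid cell."""
    surf = cv2.xfeatures2d.SURF_create(hessianThreshold=hessian_threshold,
                                       extended=False)  # extended=False -> 64-dim descriptors
    keypoints, descriptors = surf.detectAndCompute(gray_cell, None)
    return keypoints, descriptors  # descriptors is None when no keypoint is found

# hypothetical usage:
# cell = cv2.imread("grid_cell_0001.tif", cv2.IMREAD_GRAYSCALE)
# kps, desc = extract_surf_descriptors(cell)
```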
step 1.2, extract the spectral features of the remote sensing image by calculating the mean and standard deviation of each band, as follows:
M = (1/n)·Σ_{i=1..n} v_i   (6)
S = sqrt( (1/n)·Σ_{i=1..n} (v_i - M)² )   (7)
in formulas (6) and (7), n is the total number of pixels in the grid cell, v_i is the gray value of the i-th pixel in the band, M is the band mean and S the band standard deviation;
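A minimal sketch of formulas (6) and (7), assuming each grid cell is available as a NumPy array of shape (bands, rows, cols); the array layout and function name are illustrative assumptions.

```python
import numpy as np

def spectral_features(cell):
    """Per-band mean (formula (6)) and standard deviation (formula (7)) of one grid cell.

    cell: ndarray of shape (bands, rows, cols) holding the gray values v_i.
    Returns a 1-D vector [M_1..M_B, S_1..S_B].
    """
    flat = cell.reshape(cell.shape[0], -1)   # (bands, n) with n pixels per band
    means = flat.mean(axis=1)                # M = (1/n) * sum(v_i)
    stds = flat.std(axis=1)                  # S = sqrt((1/n) * sum((v_i - M)^2))
    return np.concatenate([means, stds])
```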
step 1.3, extract the texture features of the remote sensing image with the LBP operator, which is defined in a window of 3 × 3 pixels:
LBP(x_c, y_c) = Σ_{p=0..7} 2^p · s(i_p - i_c)   (8)
in the above formula, (x_c, y_c) is the center of each 3 × 3 region, p indexes the p-th pixel in the 3 × 3 window other than the center pixel, and i_c and i_p are the gray values of the center pixel and of the p-th neighboring pixel respectively;
taking the window center pixel as a threshold, comparing the center pixel value with the gray values of the adjacent 8 pixels: if the neighboring pixel value is greater than the center pixel value, then the position is marked as 1; otherwise, the flag is 0:
s(i_p - i_c) = 1 if i_p ≥ i_c, and 0 otherwise   (9)
in the above formula, the center pixel value i_c serves as the threshold; the eight comparison results form an 8-bit binary number, which is taken as the LBP value of the window's center pixel and reflects the texture information of the 3 × 3 pixel window;
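A sketch of the 3 × 3 LBP operator of formulas (8) and (9); scikit-image's local_binary_pattern with P = 8 and R = 1 is used here as a convenient stand-in for the hand-written neighborhood comparison, and the histogram length is an assumption.

```python
import numpy as np
from skimage.feature import local_binary_pattern

def lbp_histogram(gray_cell, bins=256):
    """8-neighbour LBP codes of a grid cell and their normalized histogram (texture feature)."""
    codes = local_binary_pattern(gray_cell, P=8, R=1, method="default")  # codes in [0, 255]
    hist, _ = np.histogram(codes.ravel(), bins=bins, range=(0, bins))
    total = hist.sum()
    return hist / total if total > 0 else hist
```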
step 1.4, extract the surface temperature features; the surface emissivity is first calculated from NDVI:
ε = f(NDVI)   (10)
in the above formula, ε is the surface emissivity, NDVI is the normalized difference vegetation index, and f(·) stands for the emissivity-estimation relation (its exact expression is given only as an image in the original); the surface temperature is then calculated with the single-window algorithm, whose formula is as follows:
Ts = {a·(1 - C - D) + [b·(1 - C - D) + C + D]·T - D·Ta}/C   (11)
in the above formula, Ts is the surface temperature; a and b are empirical coefficients, with a = -67.35535 and b = 0.458608; T is the brightness temperature; Ta is the effective mean atmospheric temperature; C and D are calculated as follows:
C = τ·ε   (12)
D = (1 - τ)·[1 + τ·(1 - ε)]   (13)
in formulas (12) and (13), τ is the atmospheric transmittance and ε is the surface emissivity;
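A sketch of formulas (11)-(13) in NumPy terms; the brightness temperature T, effective mean atmospheric temperature Ta, transmittance tau and emissivity eps are assumed to be available as arrays or scalars derived beforehand, and the coefficient values simply repeat those quoted in the text.

```python
A_COEF, B_COEF = -67.35535, 0.458608   # empirical coefficients a and b from the text

def single_window_lst(T, Ta, tau, eps):
    """Land surface temperature Ts by the single-window algorithm, formulas (11)-(13).

    T: brightness temperature, Ta: effective mean atmospheric temperature,
    tau: atmospheric transmittance, eps: surface emissivity from NDVI (formula (10)).
    """
    C = tau * eps                                   # formula (12)
    D = (1.0 - tau) * (1.0 + tau * (1.0 - eps))     # formula (13)
    Ts = (A_COEF * (1 - C - D)
          + (B_COEF * (1 - C - D) + C + D) * T
          - D * Ta) / C                             # formula (11)
    return Ts
```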
step 1.5, extract the three-dimensional spatial features: three-dimensional point cloud data are obtained by LiDAR, and the digital surface model (DSM) feature is then generated by the Kriging spatial interpolation algorithm;
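A sketch of the DSM generation in step 1.5, assuming the LiDAR returns are already available as x/y/z coordinate arrays; PyKrige's OrdinaryKriging is used as one possible Kriging implementation, and the variogram model and grid spacing are placeholder choices.

```python
import numpy as np
from pykrige.ok import OrdinaryKriging

def lidar_to_dsm(x, y, z, cell_size=2.0):
    """Interpolate LiDAR point elevations onto a regular grid (DSM) by ordinary Kriging."""
    gridx = np.arange(x.min(), x.max(), cell_size)
    gridy = np.arange(y.min(), y.max(), cell_size)
    ok = OrdinaryKriging(x, y, z, variogram_model="spherical")
    dsm, variance = ok.execute("grid", gridx, gridy)   # dsm is a 2-D elevation array
    return np.asarray(dsm)
```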
step 2, assign the feature values in each image to the most similar visual words and count the word frequency of each visual word to form the visual-word features; construct the multi-feature BoW visual dictionary:
step 2.1, construct the vocabulary of each feature type: perform K-means clustering separately on the local, spectral, texture, surface temperature and three-dimensional spatial features extracted in step 1; each cluster center is a word, and all cluster centers together form the word list;
step 2.2, compute the distance between the features of each grid cell and every word, and assign each feature to the nearest word in the word list;
step 2.3, count the word frequency corresponding to each word to generate a K-dimensional feature vector; the K-dimensional feature vectors of the different feature types are concatenated (stacked), so that each grid cell is represented by a multi-dimensional vector:
f_i = {sur_i, spe_i, lbp_i, lst_i, dsm_i, poi_i}   (14)
in the above formula, sur_i is the local feature vector, spe_i the spectral feature vector, lbp_i the texture feature vector, lst_i the surface temperature feature vector, dsm_i the three-dimensional spatial feature vector, and poi_i the POI feature vector;
each grid cell is thus described as a document composed of feature words:
Doc_i = {word_sur, word_spe, word_lbp, word_lst, word_dsm, word_poi}_i   (15)
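A sketch of steps 2.1-2.3 for a single feature type, assuming the descriptors of all training grid cells have been pooled into one array; the vocabulary size K and variable names are illustrative.

```python
import numpy as np
from sklearn.cluster import KMeans

def build_vocabulary(all_descriptors, k=200):
    """Step 2.1: K-means cluster centres form the word list of one feature type."""
    return KMeans(n_clusters=k, random_state=0, n_init=10).fit(all_descriptors)

def cell_word_histogram(kmeans, cell_descriptors):
    """Steps 2.2-2.3: assign each descriptor to its nearest word and count word frequencies."""
    k = kmeans.n_clusters
    if cell_descriptors is None or len(cell_descriptors) == 0:
        return np.zeros(k)
    words = kmeans.predict(cell_descriptors)             # nearest-word assignment
    hist = np.bincount(words, minlength=k).astype(float)
    return hist / hist.sum()                             # K-dimensional word-frequency vector

# The histograms of the six feature types (sur, spe, lbp, lst, dsm, poi) are then
# concatenated to give the multi-dimensional vector f_i of formula (14).
```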
step 3, construct an LDA probabilistic topic model and use it to mine the high-dimensional semantic vectors of the image: the LDA model extracts the latent semantic topics contained in the feature vectors and assigns a probability distribution to each feature vector. In semantic-model classification, the same word appears with different probabilities in different topic contexts. Extended to the image-classification domain, each feature value appears with a different probability under different class topics; the LDA topic model is therefore introduced to determine the correspondence between the occurrence probability of each feature value and the class topics.
Step 3.1, with the LDA probabilistic topic model, each document in the document set is expressed in the form of a probability distribution:
P(w|d)=P(w|t)×P(t|d) (16)
in the above formula, w is a word, d is a document, and t is a topic; with topic t as an intermediate layer, two vectors θ_d and φ_t give P(t|d) and P(w|t) respectively: θ_d is the probability distribution over the individual topics t for each document d in the document set D, and φ_t is the probability vector with which the individual topic t in the topic set T generates the different words;
step 3.2, the learning process of the LDA probabilistic topic model is as follows:
for all d and t, θ_d and φ_t are first assigned at random; for the i-th word w_i in a particular document d_s, if word w_i corresponds to topic t_j, formula (16) is rewritten as:
P_j(w_i|d_s) = P(w_i|t_j) × P(t_j|d_s)   (17)
enumerating the topics t in the topic set T gives all P_j(w_i|d_s); according to these results, a topic is selected for the i-th word w_i in d_s, namely the topic t_j with the highest probability P_j(w_i|d_s);
one iteration is defined as computing P(w|d) once for every word w in every document d of the document set D and reselecting the topics accordingly;
the iteration is repeated until θ_d and φ_t converge, and the final estimates of the parameters θ_d and φ_t are output, giving the topic of each word and the high-dimensional semantic vector of each topic;
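A sketch of step 3 using scikit-learn's LatentDirichletAllocation as one possible LDA implementation; the document-word count matrix (built from the BoW histograms) and the number of topics are assumptions, and the per-document topic distribution plays the role of θ_d, the high-dimensional semantic vector passed to step 4.

```python
from sklearn.decomposition import LatentDirichletAllocation

def mine_semantic_vectors(doc_word_counts, n_topics=30):
    """doc_word_counts: (n_cells, vocab_size) matrix of word frequencies per grid-cell document.

    Returns theta (n_cells, n_topics), the per-document topic distributions (estimate of theta_d),
    and phi (n_topics, vocab_size), the per-topic word distributions (estimate of phi_t).
    """
    lda = LatentDirichletAllocation(n_components=n_topics, random_state=0)
    theta = lda.fit_transform(doc_word_counts)
    phi = lda.components_ / lda.components_.sum(axis=1, keepdims=True)
    return theta, phi
```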
step 4, train an SVM classifier with the high-dimensional semantic vectors obtained in step 3: each grid cell of the training set is labeled by visual interpretation according to the 'Code for Classification of Urban Land Use and Planning Standards of Development Land' to obtain a label set; the SVM classifier is trained with the latent semantic feature vectors of the training samples and then used to classify the test samples.
And 5, carrying out city function partitioning on the test set by using an SVM classifier.
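A sketch of steps 4 and 5, assuming the topic vectors of the training and test grid cells and the visually interpreted label set are already available; the RBF kernel and its parameters are common defaults, not choices stated by the patent.

```python
from sklearn.svm import SVC

def train_and_partition(theta_train, labels_train, theta_test):
    """Step 4: train an SVM on the high-dimensional semantic vectors.
    Step 5: predict the urban functional class of every test grid cell."""
    clf = SVC(kernel="rbf", C=10.0, gamma="scale")
    clf.fit(theta_train, labels_train)
    return clf.predict(theta_test)
```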
The POI features in step 1 are obtained by reclassifying the POI data according to place-name information. The introduced POI data make it possible to recover building functions that cannot be extracted from remote sensing data alone because of the 'same object, different spectra' and 'different objects, same spectrum' phenomena.
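A sketch of how the POI-type proportions of step 1 might be computed once the POIs have been reclassified by place-name keywords; the category list, record format and function name are illustrative assumptions.

```python
import numpy as np

POI_CATEGORIES = ["residence", "commerce", "industry", "education",
                  "transport", "green_space"]   # illustrative reclassified categories

def poi_feature(poi_records, grid_bounds):
    """Proportion of each POI category inside one grid cell (the poi_i term of formula (14)).

    poi_records: iterable of (x, y, category) tuples after reclassification.
    grid_bounds: (xmin, ymin, xmax, ymax) of the grid cell.
    """
    xmin, ymin, xmax, ymax = grid_bounds
    counts = np.zeros(len(POI_CATEGORIES))
    for x, y, cat in poi_records:
        if xmin <= x < xmax and ymin <= y < ymax and cat in POI_CATEGORIES:
            counts[POI_CATEGORIES.index(cat)] += 1
    total = counts.sum()
    return counts / total if total > 0 else counts
```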
In the step 1.1, the SURF algorithm constructs the scale pyramid by changing the size of the box filter while keeping the size of the image unchanged. The SURF algorithm is an improvement over the SIFT algorithm.
And in the step 1.4, the earth surface temperature characteristics are extracted in a satellite remote sensing data inversion mode.
The SURF feature descriptor described in step 1.1 has scale and rotation invariance, and also has invariance to changes in illumination.
The LDA probability topic model in the step 3 is an unsupervised Bayesian model.
In the experiment of this example, the Vanke City area in Zhenhai District, Ningbo, imaged on 11 September 2019, is selected; the data include GF-2 (Gaofen-2) imagery, TM data, LiDAR data and Baidu POI data.
In accordance with the standard of 11/1/2017, the urban functions are divided into eight ground-object classes: residential land, public administration and public service land, commercial service facility land, industrial land, logistics and warehousing land, road and transportation facility land, public utility land, and green space and square land; the trained classifier is used to classify the test set into these classes.

Claims (4)

1. A high-resolution remote sensing image city function partitioning method based on multi-feature fusion is characterized by comprising the following steps:
step 1, preprocessing an image, segmenting the image by using a proper grid, and selecting a training set and a testing set; calculating the POI type quantity proportion of each grid to obtain POI characteristics; respectively extracting local features, spectral features, textural features, surface temperature features and three-dimensional space features in a training set of the remote sensing image;
step 2, distributing the characteristic value in each image to the visual word which is most similar to the characteristic value, and counting the corresponding word frequency of each visual word to form the visual word characteristic; constructing a multi-feature BoW visual dictionary:
step 2.1, constructing various feature vocabulary lists: respectively carrying out K-means clustering on the local features, the spectral features, the textural features, the earth surface temperature features and the three-dimensional space features extracted in the step 1; each clustering center is a word, and all clustering centers are converged into a word list;
step 2.2, by calculating the distance between the characteristics of each grid and each word, assigning the characteristics to the word with the closest distance in the word list;
step 2.3, counting word frequencies corresponding to the words to generate a K-dimensional feature vector, splicing the K-dimensional feature vectors in a stacking mode, and representing each grid by using a multi-dimensional vector:
f_i = {sur_i, spe_i, lbp_i, lst_i, dsm_i, poi_i}   (1)
in the above formula, sur_i is the local feature vector, spe_i the spectral feature vector, lbp_i the texture feature vector, lst_i the surface temperature feature vector, dsm_i the three-dimensional spatial feature vector, and poi_i the POI feature vector;
each mesh is described as a document with a characteristic vocabulary:
Doc_i = {word_sur, word_spe, word_lbp, word_lst, word_dsm, word_poi}_i   (2)
step 3, constructing an LDA probability topic model, mining high-dimensional semantic vectors of the image by using the LDA probability topic model, and distributing the probability of each feature vector by extracting the high-dimensional semantic vectors contained in each feature vector by using the LDA probability topic model;
step 4, training an SVM classifier according to the high-dimensional semantic vector obtained in the step 3;
and 5, carrying out city function partitioning on the test set by using an SVM classifier.
2. The urban functional partitioning method for high-resolution remote sensing images based on multi-feature fusion as claimed in claim 1, wherein: in step 1, the POI data are combined as a geospatial big-data feature and participate in the city function partitioning.
3. The urban functional partitioning method for high-resolution remote sensing images based on multi-feature fusion as claimed in claim 1, wherein: in step 2, the BoW dictionary is constructed by the cooperation of multiple feature vocabularies.
4. The urban functional partitioning method for the high-resolution remote sensing images based on the multi-feature fusion as claimed in claim 1, wherein: the LDA probability topic model in the step 3 is an unsupervised Bayesian model.
CN202010372618.6A 2020-05-06 2020-05-06 High-resolution remote sensing image city function partitioning method based on multi-feature fusion Active CN111582146B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010372618.6A CN111582146B (en) 2020-05-06 2020-05-06 High-resolution remote sensing image city function partitioning method based on multi-feature fusion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010372618.6A CN111582146B (en) 2020-05-06 2020-05-06 High-resolution remote sensing image city function partitioning method based on multi-feature fusion

Publications (2)

Publication Number Publication Date
CN111582146A true CN111582146A (en) 2020-08-25
CN111582146B CN111582146B (en) 2021-04-13

Family

ID=72111986

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010372618.6A Active CN111582146B (en) 2020-05-06 2020-05-06 High-resolution remote sensing image city function partitioning method based on multi-feature fusion

Country Status (1)

Country Link
CN (1) CN111582146B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112183879A (en) * 2020-10-14 2021-01-05 北京大学 Method and device for classifying urban functional areas, electronic equipment and storage medium
CN112364776A (en) * 2020-11-12 2021-02-12 中国地质大学(武汉) Urban functional area recognition model establishing and classifying method and device
CN112884635A (en) * 2021-01-25 2021-06-01 中交广州航道局有限公司 Submarine environment visualization method and device based on ROV carrying dual-frequency forward-looking sonar
CN113447771A (en) * 2021-06-09 2021-09-28 上海交通大学 Partial discharge pattern recognition method based on SIFT-LDA characteristics
CN113657324A (en) * 2021-08-24 2021-11-16 速度时空信息科技股份有限公司 Urban functional area identification method based on remote sensing image ground object classification
CN113988198A (en) * 2021-11-03 2022-01-28 中国地质大学(武汉) Multi-scale city function classification method based on landmark constraint
CN114399685A (en) * 2022-03-25 2022-04-26 航天宏图信息技术股份有限公司 Remote sensing monitoring and evaluating method and device for forest diseases and insect pests
CN115299937A (en) * 2022-09-16 2022-11-08 东联信息技术有限公司 Intelligence detection platform that tumbles

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622607A (en) * 2012-02-24 2012-08-01 河海大学 Remote sensing image classification method based on multi-feature fusion
US8369593B2 (en) * 2007-12-21 2013-02-05 Siemens Medical Solutions Usa, Inc. Systems and methods for robust learning based annotation of medical radiographs
CN106897417A (en) * 2017-02-22 2017-06-27 东南大学 A kind of construction method of the city space holographic map based on the fusion of multi-source big data
CN107247938A (en) * 2017-06-08 2017-10-13 中国科学院遥感与数字地球研究所 A kind of method of high-resolution remote sensing image City Building function classification
CN108665149A (en) * 2018-04-23 2018-10-16 深圳市城市规划设计研究院有限公司 A kind of measuring method and system based on city function degree of mixing
CN108764193A (en) * 2018-06-04 2018-11-06 北京师范大学 Merge the city function limited region dividing method of POI and remote sensing image
CN109325085A (en) * 2018-08-08 2019-02-12 中南大学 A kind of urban land identification of function and change detecting method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8369593B2 (en) * 2007-12-21 2013-02-05 Siemens Medical Solutions Usa, Inc. Systems and methods for robust learning based annotation of medical radiographs
CN102622607A (en) * 2012-02-24 2012-08-01 河海大学 Remote sensing image classification method based on multi-feature fusion
CN106897417A (en) * 2017-02-22 2017-06-27 东南大学 A kind of construction method of the city space holographic map based on the fusion of multi-source big data
CN107247938A (en) * 2017-06-08 2017-10-13 中国科学院遥感与数字地球研究所 A kind of method of high-resolution remote sensing image City Building function classification
CN108665149A (en) * 2018-04-23 2018-10-16 深圳市城市规划设计研究院有限公司 A kind of measuring method and system based on city function degree of mixing
CN108764193A (en) * 2018-06-04 2018-11-06 北京师范大学 Merge the city function limited region dividing method of POI and remote sensing image
CN109325085A (en) * 2018-08-08 2019-02-12 中南大学 A kind of urban land identification of function and change detecting method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
朱祺琪 (Zhu Qiqi): "面向高分辨率遥感影像场景语义理解的概率主题模型研究" (Research on probabilistic topic models for scene-level semantic understanding of high-resolution remote sensing images), China Doctoral Dissertations Full-text Database, Basic Sciences *
高晨 (Gao Chen): "基于高分一号影像和POI数据的土地功能分类方法研究" (Research on land-function classification methods based on GF-1 imagery and POI data), China Doctoral Dissertations Full-text Database, Basic Sciences *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112183879A (en) * 2020-10-14 2021-01-05 北京大学 Method and device for classifying urban functional areas, electronic equipment and storage medium
CN112364776A (en) * 2020-11-12 2021-02-12 中国地质大学(武汉) Urban functional area recognition model establishing and classifying method and device
CN112884635A (en) * 2021-01-25 2021-06-01 中交广州航道局有限公司 Submarine environment visualization method and device based on ROV carrying dual-frequency forward-looking sonar
CN113447771A (en) * 2021-06-09 2021-09-28 上海交通大学 Partial discharge pattern recognition method based on SIFT-LDA characteristics
CN113657324A (en) * 2021-08-24 2021-11-16 速度时空信息科技股份有限公司 Urban functional area identification method based on remote sensing image ground object classification
CN113988198A (en) * 2021-11-03 2022-01-28 中国地质大学(武汉) Multi-scale city function classification method based on landmark constraint
CN113988198B (en) * 2021-11-03 2024-05-14 中国地质大学(武汉) Multi-scale city function classification method based on landmark constraint
CN114399685A (en) * 2022-03-25 2022-04-26 航天宏图信息技术股份有限公司 Remote sensing monitoring and evaluating method and device for forest diseases and insect pests
CN114399685B (en) * 2022-03-25 2022-06-21 航天宏图信息技术股份有限公司 Remote sensing monitoring and evaluating method and device for forest pest and disease damage
CN115299937A (en) * 2022-09-16 2022-11-08 东联信息技术有限公司 Intelligence detection platform that tumbles
CN115299937B (en) * 2022-09-16 2023-08-11 东联信息技术有限公司 Intelligence detection platform that falls down

Also Published As

Publication number Publication date
CN111582146B (en) 2021-04-13

Similar Documents

Publication Publication Date Title
CN111582146B (en) High-resolution remote sensing image city function partitioning method based on multi-feature fusion
Zhang et al. An object-based convolutional neural network (OCNN) for urban land use classification
Shi et al. Road detection from remote sensing images by generative adversarial networks
Du et al. Semantic classification of urban buildings combining VHR image and GIS data: An improved random forest approach
Zhu et al. Scene classification based on the sparse homogeneous–heterogeneous topic feature model
US11854244B2 (en) Labeling techniques for a modified panoptic labeling neural network
CN104915636B (en) Remote sensing image road recognition methods based on multistage frame significant characteristics
Huang et al. An adaptive mean-shift analysis approach for object extraction and classification from urban hyperspectral imagery
Liu et al. A framework of region-based spatial relations for non-overlapping features and its application in object based image analysis
CN108427919B (en) Unsupervised oil tank target detection method based on shape-guided saliency model
CN110399819A (en) A kind of remote sensing image residential block extraction method based on deep learning
Jiao et al. A survey of road feature extraction methods from raster maps
CN103927511A (en) Image identification method based on difference feature description
Zhang et al. A dual attention neural network for airborne LiDAR point cloud semantic segmentation
CN106846322A (en) Based on the SAR image segmentation method that curve wave filter and convolutional coding structure learn
CN105931241A (en) Automatic marking method for natural scene image
Cai et al. A comparative study of deep learning approaches to rooftop detection in aerial images
Khan et al. A comparative study of graphic symbol recognition methods
Zhou et al. Stratified Object‐Oriented Image Classification Based on Remote Sensing Image Scene Division
CN115019163A (en) City factor identification method based on multi-source big data
Azeez et al. Urban tree classification using discrete-return LiDAR and an object-level local binary pattern algorithm
Liu et al. Second-order texton feature extraction and pattern recognition of building polygon cluster using CNN network
Dhar et al. Interval type-2 fuzzy set and human vision based multi-scale geometric analysis for text-graphics segmentation
Chu et al. Scene classification based on SIFT combined with GIST
Wanayumini et al. A research framework for supervised image classification for tornado chaos phenomena

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant