CN104966085A - Remote sensing image region-of-interest detection method based on multi-significant-feature fusion

Remote sensing image region-of-interest detection method based on multi-significant-feature fusion

Info

Publication number
CN104966085A
Authority
CN
China
Prior art keywords
color
image
remote sensing
color channel
Prior art date
Legal status
Granted
Application number
CN201510331174.0A
Other languages
Chinese (zh)
Other versions
CN104966085B (en)
Inventor
张立保 (Zhang Libao)
吕欣然 (Lü Xinran)
王士一 (Wang Shiyi)
Current Assignee
Beijing Normal University
Original Assignee
Beijing Normal University
Priority date
Filing date
Publication date
Application filed by Beijing Normal University
Priority to CN201510331174.0A
Publication of CN104966085A
Application granted
Publication of CN104966085B
Status: Expired - Fee Related
Anticipated expiration

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00: Arrangements for image or video recognition or understanding
    • G06V10/20: Image preprocessing
    • G06V10/25: Determination of region of interest [ROI] or a volume of interest [VOI]


Abstract

The present invention discloses a remote sensing image region-of-interest detection method based on multi-salient-feature fusion, belonging to the technical fields of remote sensing image processing and image recognition. The method comprises the following steps: 1) obtaining the color channels of a group of input remote sensing images and calculating the color histogram of each color channel; 2) calculating a normalized saliency weight for each color channel from the color histograms; 3) calculating an information-content salient feature map; 4) converting the group of input remote sensing images from the RGB color space to the CIE Lab color space; 5) obtaining clusters with a clustering algorithm; 6) calculating a saliency value for each cluster to obtain a common salient feature map; 7) fusing the information-content salient feature map with the common salient feature map to obtain a final saliency map; and 8) performing threshold segmentation by the Otsu method to extract the region of interest. Compared with traditional methods, the proposed method detects remote sensing image regions of interest accurately without a prior knowledge base, and can therefore be widely applied in fields such as environment monitoring, land use, and agricultural survey.

Description

Remote sensing image region-of-interest detection method based on multi-salient feature fusion
Technical Field
The invention belongs to the technical field of remote sensing image processing and image recognition, and particularly relates to a remote sensing image region-of-interest detection method based on multi-salient feature fusion.
Background
With the rapid development of remote sensing technology, the data volume of remote sensing images has grown rapidly. Extracting regions of interest from remote sensing images reduces the complexity of their analysis and processing, so region-of-interest extraction has recently become a focus of attention, and detecting regions of interest accurately and quickly has become one of the problems to be solved urgently. Solving this problem effectively would ease the contradiction between the high speed at which remote sensing images are acquired and the low speed at which they are interpreted, and has important practical value in fields such as land use, disaster assessment, town planning, and environment monitoring.
Traditional region-of-interest detection for remote sensing images is mostly global and requires prior knowledge. However, building a prior knowledge base is a very complicated problem that must comprehensively consider an expert knowledge base, the characteristics of the target area, the characteristics of the background area, and similar information. Some methods require training on psychophysical data of color appearance and eye movements; others detect and classify regions of interest with the help of a digital map of the same region. All of these algorithms require a prior knowledge base and have high computational complexity.
The visual attention model offers a new perspective for remote sensing image region-of-interest detection. Unlike traditional detection methods, it is entirely data-driven and does not involve external factors such as a knowledge base; it identifies targets quickly and accurately, has therefore attracted growing attention, and is of great significance when introduced into remote sensing image region-of-interest detection.
Among visual attention models based on low-level visual features, Itti et al. proposed the Itti visual attention method in the article "A Model of Saliency-Based Visual Attention for Rapid Scene Analysis", which approximates the human visual system and uses various visual characteristics to generate saliency maps. Among graph-based models, Harel et al. proposed the Graph-Based Visual Saliency algorithm (GBVS) in the article "Graph-Based Visual Saliency", which performs feature extraction with a conventional Itti model simulating the visual attention mechanism, represents the associations between image pixels with a graph structure, and finally introduces Markov chains to compute saliency maps. Among frequency-domain attention models, Achanta et al. proposed the frequency-tuned (FT) salient region detection method in the article "Frequency-tuned Salient Region Detection", which converts an input RGB image into the CIE Lab color space, applies Gaussian smoothing, subtracts the arithmetic mean of the image feature vectors, and takes the magnitude pointwise to obtain a uniform saliency map with clear boundaries.
Visual attention models based on low-level visual features simulate the attention patterns of human vision well, but they do not fully consider the frequency-domain characteristics of images, and their computation is slow and inefficient, making real-time application difficult. Visual attention models based on frequency-domain analysis are concise in form and easy to interpret and implement, but when the salient region occupies too large a proportion of the whole image, or the background is too complex, the resulting saliency map may mistakenly mark part of the background as salient, and their biological plausibility is unclear. In recent years, scholars at home and abroad have also proposed new algorithms that apply visual saliency to remote sensing image region-of-interest detection. For example, Zhang et al., in the article "Fast Detection of Visual Saliency Regions in Remote Sensing Image Based on Region Growing", proposed reducing the image resolution with a wavelet transform and introducing a two-dimensional discrete moment transform into the visual features to generate a saliency map. However, these algorithms share a common shortcoming: they can only extract salient regions and cannot distinguish between them. If the similarity within a group of remote sensing images with similar regions of interest could be exploited, other regions that interfere with region-of-interest detection could be excluded.
For computing the region-of-interest mask, conventional methods usually describe the region of interest with a circle of fixed radius, which introduces much redundant information for irregular regions. Using a single threshold is very fast, but it leaves the region of interest with many small fragments and describes the region inaccurately. The maximum inter-class variance method (Otsu method) is an automatic, non-parametric, unsupervised threshold selection technique; it is a simple and efficient way to compute a single threshold adaptively, with advantages such as low computational cost and strong adaptivity.
Disclosure of Invention
The invention aims to provide a remote sensing image region-of-interest detection method based on multi-salient-feature fusion that detects regions of interest in remote sensing images accurately. Existing region-of-interest detection methods are mainly global and require prior knowledge, but building a prior knowledge base is a very complicated problem that must comprehensively consider an expert knowledge base, the characteristics of the target area, the characteristics of the background area, and similar information. The method of the invention therefore focuses mainly on two aspects:
1) no prior knowledge base based on a global search needs to be established;
2) the detection precision of the region of interest of the remote sensing image is improved, and more accurate region of interest information is obtained.
The technical scheme adopted by the invention comprises five main processes: generation of the information-content salient feature map of the remote sensing images, generation of the common salient feature map, generation of the final saliency map, generation of the region-of-interest template, and generation of the region of interest. The specific steps are as follows:
Step one: calculate color histograms. Input a group of remote sensing images of size M × N and extract each color channel of each image separately. Let f_c(x, y) denote the color intensity at position (x, y) in color channel c, and construct the intensity histogram H_c(i) of each remote sensing image in each color channel, where M denotes the length of the image, N denotes the width of the image, x and y denote the abscissa and ordinate of the image (x = 1, 2, …, M; y = 1, 2, …, N), c denotes the color channel (c = 1, 2, 3), and i denotes the pixel intensity value (i = 0, 1, …, 255);
Step two: calculate the normalized saliency weight of color channel c. From the color histogram H_c(i) of color channel c, calculate the information amount In_c(i) of each pixel intensity value i in that channel and assign it to the pixels having that intensity value; once all calculations and assignments are finished, the information-amount map LOG_c(x, y) of color channel c is obtained. From the information-amount map, compute the saliency h_c of color channel c, and then use the saliencies of all color channels to compute the normalized saliency weight w_c of each color channel of each image;
Step three: calculate the information-content salient feature map. Weight the color channels by their normalized saliency weights w_c to obtain a preliminary information-content salient feature map for each image, then apply Gaussian smoothing to the preliminary map to filter out noise and obtain the final information-content salient feature map of each image;
Step four: convert the group of remote sensing images from the RGB color space to the CIE Lab color space. Extract the R, G, B color channel values of each pixel of each image and convert them into the CIE Lab color space to obtain the three components L, a, and b. In the RGB color space, R denotes red, G denotes green, and B denotes blue; in the CIE Lab color space, L denotes lightness (L = 0 denotes black, L = 100 denotes white), a denotes the position of the color between red and green (negative a denotes green, positive a denotes red), and b denotes the position of the color between blue and yellow (negative b denotes blue, positive b denotes yellow);
Step five: complete pixel clustering in the CIE Lab color space with the k-means algorithm, i.e., cluster the values of all pixels of the group of original remote sensing images mapped into the CIE Lab color space to obtain k clusters;
Step six: calculate the common salient feature map. Divide the number of pixels contained in the j-th cluster by the total number of pixels of the images, and define the result as the weight of the j-th cluster (j = 1, 2, …, k). Calculate the saliency value of each cluster from the cluster weights and the distances between clusters, and assign each cluster's saliency value to every pixel belonging to that cluster, thereby obtaining a group of common salient feature maps;
Step seven: calculate the final saliency map. Multiply the information-content salient feature map obtained from the histogram information of each color channel by the common salient feature map obtained by k-means clustering in the CIE Lab color space, thereby obtaining the final saliency map after the multiple salient features are fused;
Step eight: extract the region of interest. Obtain a segmentation threshold for the final saliency map by the maximum inter-class variance method, segment the final saliency map into a binary image template with this threshold ('1' representing the region of interest, '0' the non-interest region), and finally multiply the binary image template by the original image to obtain the final region-of-interest extraction result.
Drawings
FIG. 1 is a flow chart of the present invention.
Fig. 2 is an exemplary set of four remote sensing images used in the present invention.
Fig. 3 shows the feature maps and the final saliency map of the present invention: (a) the information-content salient feature map of the example pictures, (b) the common salient feature map of the example pictures, and (c) the final saliency map of the example pictures.
Fig. 4 compares the saliency maps generated for an example picture by the method of the present invention and by other methods: (a) the saliency map generated by the Itti method, (b) by the GBVS method, (c) by the FT method, and (d) by the method of the invention.
Fig. 5 compares the regions of interest detected in the example pictures by the method of the present invention and by other methods: (a) the region-of-interest map detected by the Itti method, (b) by the GBVS method, (c) by the FT method, and (d) by the method of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings. The general framework of the invention is shown in fig. 1, and details of each step of the implementation will now be described.
Step one: calculate color histograms;
Input a group of remote sensing images of size M × N, as shown in FIG. 2. For each image I_p, extract each color channel separately, let f_c(x, y) denote the color intensity at position (x, y) of image I_p in color channel c, and construct the intensity histogram H_c(i) of the remote sensing image in each color channel, where M denotes the length of the image, N denotes the width of the image, the group contains Q remote sensing images in total, I_p denotes the p-th image of the group (p = 1, 2, …, Q), x and y denote the abscissa and ordinate (x = 1, 2, …, M; y = 1, 2, …, N), c denotes the color channel (c = 1, 2, 3), and i denotes the pixel intensity value (i = 0, 1, …, 255);
the histogram for each color channel of each image in the set of images may be obtained using the following formula:
$$H_c(i) = \sum_{x=1}^{M} \sum_{y=1}^{N} \delta_c(x, y) \Big/ (M \times N)$$
where δ_c(x, y) represents the binarized image of color channel c, computed as:
$$\delta_c(x, y) = \begin{cases} 1, & f_c(x, y) = i \\ 0, & \text{otherwise} \end{cases}$$
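To make the histogram construction concrete, the following is a minimal sketch in Python, assuming the images are loaded as 8-bit NumPy arrays of shape (M, N, 3); the function name and array layout are illustrative assumptions, not part of the patent:

```python
import numpy as np

def color_histograms(image):
    """Normalized intensity histograms H_c(i): one 256-bin histogram
    per color channel c, each divided by M x N as in the formula."""
    M, N, _ = image.shape
    hists = np.empty((3, 256))
    for c in range(3):
        counts = np.bincount(image[:, :, c].ravel(), minlength=256)
        hists[c] = counts / (M * N)
    return hists
```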
Step two: calculate the normalized saliency weight of color channel c;
From the color histogram H_c(i) of color channel c of image I_p, calculate the information amount In_c(i) of each pixel intensity value i in the channel, then carry out the calculation and assignment with the information amounts to finally obtain the normalized saliency weight w_c of each color channel of image I_p. This is realized in the following four steps;
(1) From the color histogram H_c(i) of color channel c of image I_p, calculate the information amount In_c(i) of each pixel intensity value in the channel with the following formula:

$$In_c(i) = -\ln(H_c(i))$$
(2) Assign the information amount to the pixels in color channel c having the same intensity value to obtain the information-amount map LOG_c(x, y) of color channel c, namely:

$$LOG_c(x, y) = In_c(i), \quad i = f_c(x, y)$$
(3) Use the information-amount map LOG_c(x, y) of color channel c to compute the saliency h_c with the following formula:

$$h_c = \frac{\sum_{x=1}^{M} \sum_{y=1}^{N} LOG_c(x, y)}{\sum_{c=1}^{3} \sum_{x=1}^{M} \sum_{y=1}^{N} LOG_c(x, y)}$$
where, with three color channels, h_1 denotes the saliency of color channel 1, h_2 that of color channel 2, and h_3 that of color channel 3;
(4) Divide the saliency of color channel c by the sum of the saliencies of the three color channels and take the negative logarithm to obtain the normalized saliency weight w_c of the color channel:

$$w_1 = -\log\!\left(\frac{h_1}{h_1 + h_2 + h_3}\right), \quad w_2 = -\log\!\left(\frac{h_2}{h_1 + h_2 + h_3}\right), \quad w_3 = -\log\!\left(\frac{h_3}{h_1 + h_2 + h_3}\right)$$
where, with three color channels, w_1 denotes the normalized saliency weight of color channel 1, w_2 that of color channel 2, and w_3 that of color channel 3;
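A minimal sketch of step two under the same array assumptions follows; the small epsilon that guards against log(0) for intensities absent from a channel is an implementation detail the patent does not specify:

```python
import numpy as np

def channel_weights(image, hists, eps=1e-12):
    """Compute the normalized saliency weights w_c and the
    information-amount maps LOG_c(x, y) of one image."""
    info = -np.log(hists + eps)               # In_c(i) = -ln(H_c(i))
    logs = np.stack([info[c][image[:, :, c]]  # LOG_c(x, y) = In_c(f_c(x, y))
                     for c in range(3)])
    h = logs.sum(axis=(1, 2))
    h = h / h.sum()                           # h_c, normalized over channels
    w = -np.log(h)                            # w_c = -log(h_c / (h_1+h_2+h_3))
    return w, logs
```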
Step three: calculate the information-content salient feature map;
Weight the color channels of image I_p by their normalized saliency weights w_c to obtain the preliminary information-content salient feature map Smap(x, y), then apply Gaussian smoothing to the preliminary map to filter out noise and obtain the final information-content salient feature map SS(x, y):
$$Smap(x, y) = \sum_{c=1}^{3} w_c\, f_c(x, y)$$
$$SS(x, y) = g(x, y) * Smap(x, y)$$
where g(x, y) represents a Gaussian smoothing filter and * denotes convolution;
Through the above steps, the information-content salient feature map of each remote sensing image in the group is obtained.
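A sketch of step three under the same assumptions; SciPy's gaussian_filter stands in for the Gaussian smoothing, with the standard deviation sigma left as a free parameter because the text does not fix one:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def information_saliency(image, w, sigma=3.0):
    """Smap(x, y) = sum_c w_c f_c(x, y), then Gaussian smoothing
    to obtain the final information-content map SS(x, y)."""
    smap = sum(w[c] * image[:, :, c].astype(float) for c in range(3))
    return gaussian_filter(smap, sigma=sigma)
```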
Step four: converting the remote sensing image from the RGB color space to the CIE Lab color space;
Because the color channels of CIE Lab remove luminance information to a certain extent, the content they reflect is closer to the essence of color perception and better embodies color smoothness. Based on this clear advantage of the CIE Lab space in color uniformity, clustering is performed in the CIE Lab color space, after the following color space conversion:
Extract the R, G, B color channel values of each pixel of each image in the remote sensing image group and convert them into the CIE Lab color space, obtaining the three components L, a, and b of each pixel; denote the remote sensing image group in the CIE Lab color space by ILab_p. In the RGB color space, R denotes red, G denotes green, and B denotes blue; the three channels of the CIE Lab color space denote lightness L (L = 0 denotes black, L = 100 denotes white), the position a of the color between red and green (negative a denotes green, positive a denotes red), and the position b of the color between blue and yellow (negative b denotes blue, positive b denotes yellow);
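The conversion itself is standard; as one possibility, scikit-image provides it directly (an assumed dependency, not prescribed by the patent):

```python
from skimage.color import rgb2lab

def to_lab(images):
    """Convert a list of RGB remote sensing images to CIE Lab arrays,
    each with channels (L, a, b)."""
    return [rgb2lab(img) for img in images]
```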
Step five: cluster the color features;
Complete pixel clustering in the CIE Lab color space with the k-means algorithm, i.e., cluster the values of all pixels of the group of images in the CIE Lab color space to obtain k clusters. The specific implementation steps are as follows:
(1) Extract the three channels L, a, and b of the remote sensing image group in the CIE Lab color space, and adjust the value ranges of the pixels in the three channels so that the adjusted ranges are the same;
(2) Cluster the pixel values of the three channels of all images in the group simultaneously so that the sum of squared distances between each pixel value and its nearest cluster center is minimized; all pixels sharing the same nearest cluster center then form one cluster. The sum of squared distances W can be calculated with the following formula:
$$W = \min\left(\sum_{r=1}^{n} \left| pi_r - a_j \right|^2\right)$$
where pi_r denotes a pixel value (r = 1, 2, …, n, with n the number of image pixels) and a_j denotes a cluster center (j = 1, 2, …, k);
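A sketch of step five: the Lab pixels of the whole image group are pooled and clustered jointly. scikit-learn's KMeans and the choice k = 6 are illustrative assumptions:

```python
import numpy as np
from sklearn.cluster import KMeans

def cluster_group(lab_images, k=6, seed=0):
    """Jointly cluster all pixels of the image group in CIE Lab space;
    returns the fitted model and a per-image map of cluster labels."""
    pixels = np.concatenate([img.reshape(-1, 3) for img in lab_images])
    km = KMeans(n_clusters=k, n_init=10, random_state=seed).fit(pixels)
    maps, start = [], 0
    for img in lab_images:
        n = img.shape[0] * img.shape[1]
        maps.append(km.labels_[start:start + n].reshape(img.shape[:2]))
        start += n
    return km, maps
```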
Step six: calculate the common salient feature map;
After the weights of all k clusters are obtained, the saliency value of each cluster can be calculated from the cluster weights and the distances between clusters, and the saliency value of each cluster is assigned to every pixel belonging to that cluster, yielding a group of common salient feature maps. The implementation takes the following three steps:
(1) Divide the number of pixels contained in the j-th cluster l_j by the total number of pixels in the image group, and define the result as the weight ω(l_j) of the j-th cluster, where j = 1, 2, …, k;
(2) definition D (l)t,lj) Is divided into two clusters lt、ljA color distance of eachOne cluster of significant values CL (l)j) Can be calculated by the following formula:
$$CL(l_j) = \sum_{t \neq j} \omega(l_t)\, D(l_t, l_j)$$
where

$$D(l_t, l_j) = -\ln\!\left(1 - \frac{1}{2} \sum_{s=1}^{m} \frac{(q_{ts} - q_{js})^2}{q_{ts} + q_{js}}\right)$$

in which j, t = 1, 2, …, k, and q_{ts} is the probability of the s-th color among the m colors of the t-th cluster, i.e., the t-th cluster contains m pixel values and s = 1, 2, …, m;
(3) Let the saliency value of each pixel equal the saliency value of the cluster it belongs to, thereby obtaining the common salient feature map SM(x, y):
when ILab_p(x, y) ∈ l_j, where j = 1, 2, …, k and p = 1, 2, …, Q,

$$SM(x, y) = CL(l_j)$$
Through the above steps, the common salient feature map of each remote sensing image in the group is obtained.
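A sketch of step six under the formulas above. Building the color distributions q_ts over a coarse uniform quantization of the Lab values is an assumption: the patent only defines q_ts as the probability of the s-th color within the t-th cluster:

```python
import numpy as np

def common_saliency(lab_images, maps, k, bins=8):
    """Cluster weights, inter-cluster color distances D(l_t, l_j),
    cluster saliency CL(l_j), and per-image maps SM(x, y)."""
    pixels = np.concatenate([img.reshape(-1, 3) for img in lab_images])
    labels = np.concatenate([m.ravel() for m in maps])
    # Quantize each Lab pixel to one of bins**3 color codes
    lo, hi = pixels.min(axis=0), pixels.max(axis=0)
    q = np.floor((pixels - lo) / (hi - lo + 1e-9) * bins).astype(int)
    codes = q[:, 0] * bins * bins + q[:, 1] * bins + q[:, 2]
    omega = np.bincount(labels, minlength=k) / labels.size  # omega(l_j)
    dist = np.zeros((k, bins ** 3))                         # q_js per cluster
    for j in range(k):
        dist[j] = np.bincount(codes[labels == j], minlength=bins ** 3)
        dist[j] /= max(dist[j].sum(), 1.0)

    def D(t, j):  # -ln(1 - chi-square-like similarity), as in the formula
        num, den = (dist[t] - dist[j]) ** 2, dist[t] + dist[j]
        chi = 0.5 * np.sum(num[den > 0] / den[den > 0])
        return -np.log(max(1.0 - chi, 1e-9))

    cl = np.array([sum(omega[t] * D(t, j) for t in range(k) if t != j)
                   for j in range(k)])
    return [cl[m] for m in maps]  # SM(x, y) = CL(l_j) of the pixel's cluster
```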
Step seven: calculating a final saliency map;
Multiply the information-content salient feature map obtained from each color channel elementwise by the corresponding common salient feature map obtained by k-means clustering in the CIE Lab color space, thereby obtaining the final saliency map S(x, y) of each image in the group after its multiple salient features are fused:
$$S(x, y) = SS(x, y) \times SM(x, y)$$
Step eight: extract the region of interest;
Obtain the segmentation threshold of the final saliency map by the maximum inter-class variance method, segment the final saliency map into a binary image template with this threshold ('1' representing the region of interest, '0' the non-interest region), and finally multiply the binary image template by the original image to obtain the final region-of-interest extraction result.
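Steps seven and eight then reduce to an elementwise product followed by an Otsu cut; in the sketch below, scikit-image's threshold_otsu plays the role of the maximum inter-class variance method named in the text:

```python
import numpy as np
from skimage.filters import threshold_otsu

def extract_roi(image, ss, sm):
    """Fuse the two feature maps and mask the original image."""
    s = ss * sm                                         # S = SS x SM
    mask = (s > threshold_otsu(s)).astype(image.dtype)  # binary template
    return image * mask[:, :, None]                     # region of interest
```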
The effects of the present invention can be further illustrated by the following experimental results and analyses:
1. Experimental data
The invention selects a group of visible-light remote sensing images of a Beijing suburb from SPOT5 satellite source imagery and crops them to produce a group of 1024 × 1024 images as the source images for the experiment, as shown in FIG. 2.
2. Comparative experiment
To evaluate the performance of the method, the following comparative experiment was designed: the Itti, GBVS, and FT methods were selected for performance comparison with the method of the invention. The saliency maps and region-of-interest maps generated by the different methods were compared subjectively, as shown in fig. 4 and fig. 5, respectively. In fig. 4, (a) is the saliency map generated by the Itti method, (b) by the GBVS method, (c) by the FT method, and (d) by the method of the present invention. In fig. 5, (a) is the region-of-interest map generated by the Itti method, (b) by the GBVS method, (c) by the FT method, and (d) by the method of the present invention.
By comparison, the saliency map obtained with the Itti model has low resolution, only 1/256 of the size of the original image, and must be enlarged when the region of interest is finally extracted. The GBVS model builds on the Itti model and merely introduces Markov chains when computing the saliency map. The regions of interest obtained from both models are larger than the regions that actually need to be extracted, i.e., unnecessary parts are extracted. The FT model yields good extraction results when the background frequency varies little, but is disturbed when it varies greatly, whereas the proposed algorithm obtains better detection results.

Claims (2)

1. A method for detecting a region of interest of a remote sensing image based on multi-salient-feature fusion, which processes a group of remote sensing images: first, the color information of the remote sensing images is used to obtain an information-content salient feature map by constructing color histograms of the different color channels and combining them with weights; second, the group of remote sensing images is clustered in the CIE Lab color space with a k-means algorithm and saliency values are calculated, yielding a group of common salient feature maps in the CIE Lab color space; then the two groups of maps are fused into a final saliency map; and finally threshold segmentation is performed by the maximum inter-class variance method to extract the region of interest; the method is characterized by comprising the following steps:
Step one: calculate color histograms. Input a group of remote sensing images of size M × N and extract each color channel of each image separately. Let f_c(x, y) denote the color intensity at position (x, y) in color channel c, and construct the intensity histogram H_c(i) of each remote sensing image in each color channel, where M denotes the length of the image, N denotes the width of the image, x and y denote the abscissa and ordinate of the image (x = 1, 2, …, M; y = 1, 2, …, N), c denotes the color channel (c = 1, 2, 3), and i denotes the pixel intensity value (i = 0, 1, …, 255);
Step two: calculate the normalized saliency weight of color channel c. From the color histogram H_c(i) of color channel c, calculate the information amount In_c(i) of each pixel intensity value i in that channel and assign it to the pixels having that intensity value; once all calculations and assignments are finished, the information-amount map LOG_c(x, y) of color channel c is obtained. From the information-amount map, compute the saliency h_c of color channel c, and then use the saliencies of all color channels to compute the normalized saliency weight w_c of each color channel of each image;
Step three: calculate the information-content salient feature map. Weight the color channels by their normalized saliency weights w_c to obtain a preliminary information-content salient feature map for each image, then apply Gaussian smoothing to the preliminary map to filter out noise and obtain the final information-content salient feature map of each image;
Step four: convert the group of remote sensing images from the RGB color space to the CIE Lab color space. Extract the R, G, B color channel values of each pixel of each image and convert them into the CIE Lab color space to obtain the three components L, a, and b. In the RGB color space, R denotes red, G denotes green, and B denotes blue; in the CIE Lab color space, L denotes lightness (L = 0 denotes black, L = 100 denotes white), a denotes the position of the color between red and green (negative a denotes green, positive a denotes red), and b denotes the position of the color between blue and yellow (negative b denotes blue, positive b denotes yellow);
Step five: complete pixel clustering in the CIE Lab color space with the k-means algorithm, i.e., cluster the values of all pixels of the group of original remote sensing images mapped into the CIE Lab color space to obtain k clusters;
Step six: calculate the common salient feature map. Divide the number of pixels contained in the j-th cluster by the total number of pixels of the images, and define the result as the weight of the j-th cluster (j = 1, 2, …, k). Calculate the saliency value of each cluster from the cluster weights and the distances between clusters, and assign each cluster's saliency value to every pixel belonging to that cluster, thereby obtaining a group of common salient feature maps;
Step seven: calculate the final saliency map. Multiply the information-content salient feature map obtained from the histogram information of each color channel by the common salient feature map obtained by k-means clustering in the CIE Lab color space, thereby obtaining the final saliency map after the multiple salient features are fused;
Step eight: extract the region of interest. Obtain a segmentation threshold for the final saliency map by the maximum inter-class variance method, segment the final saliency map into a binary image template with this threshold ('1' representing the region of interest, '0' the non-interest region), and finally multiply the binary image template by the original image to obtain the final region-of-interest extraction result.
2. The method for detecting a region of interest of a remote sensing image based on multi-salient-feature fusion according to claim 1, characterized in that the specific process of step two is as follows:
1) From the color histogram H_c(i) of color channel c, calculate the information amount In_c(i) of each pixel intensity value:
$$In_c(i) = -\ln(H_c(i))$$
2) Assign the information amount to the pixels having the same intensity value to obtain the information-amount map LOG_c(x, y) of color channel c, namely:

$$LOG_c(x, y) = In_c(i), \quad i = f_c(x, y)$$
3) Use the information-amount map LOG_c(x, y) of color channel c to compute the saliency h_c; since the image contains three color channels, h_1 denotes the saliency of color channel 1, h_2 that of color channel 2, and h_3 that of color channel 3:

$$h_c = \frac{\sum_{x=1}^{M} \sum_{y=1}^{N} LOG_c(x, y)}{\sum_{c=1}^{3} \sum_{x=1}^{M} \sum_{y=1}^{N} LOG_c(x, y)}$$
4) Divide the saliency of color channel c by the sum of the saliencies of the three color channels and take the negative logarithm to obtain the normalized saliency weight w_c of the color channel:

$$w_1 = -\log\!\left(\frac{h_1}{h_1 + h_2 + h_3}\right), \quad w_2 = -\log\!\left(\frac{h_2}{h_1 + h_2 + h_3}\right), \quad w_3 = -\log\!\left(\frac{h_3}{h_1 + h_2 + h_3}\right)$$
Since the image contains three color channels, w_1 denotes the normalized saliency weight of color channel 1, w_2 that of color channel 2, and w_3 that of color channel 3.
CN201510331174.0A 2015-06-16 2015-06-16 Remote sensing image region-of-interest detection method based on multi-salient-feature fusion Expired - Fee Related CN104966085B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510331174.0A CN104966085B (en) 2015-06-16 2015-06-16 Remote sensing image region-of-interest detection method based on multi-salient-feature fusion


Publications (2)

Publication Number Publication Date
CN104966085A true CN104966085A (en) 2015-10-07
CN104966085B CN104966085B (en) 2018-04-03

Family

ID=54220120

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510331174.0A Expired - Fee Related CN104966085B (en) 2015-06-16 2015-06-16 Remote sensing image region-of-interest detection method based on multi-salient-feature fusion

Country Status (1)

Country Link
CN (1) CN104966085B (en)



Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100239170A1 (en) * 2009-03-18 2010-09-23 Asnis Gary I System and method for target separation of closely spaced targets in automatic target recognition
US20120051606A1 (en) * 2010-08-24 2012-03-01 Siemens Information Systems Ltd. Automated System for Anatomical Vessel Characteristic Determination
CN103810710A (en) * 2014-02-26 2014-05-21 西安电子科技大学 Multispectral image change detection method based on semi-supervised dimensionality reduction and saliency map
CN104463224A (en) * 2014-12-24 2015-03-25 武汉大学 Hyperspectral image demixing method and system based on abundance significance analysis

Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106407978A (en) * 2016-09-24 2017-02-15 上海大学 Unconstrained in-video salient object detection method combined with objectness degree
CN106780422A (en) * 2016-12-28 2017-05-31 深圳市美好幸福生活安全系统有限公司 Saliency map fusion method based on Choquet integrals
CN106951841A (en) * 2017-03-09 2017-07-14 广东顺德中山大学卡内基梅隆大学国际联合研究院 Multi-target tracking method based on color and distance clustering
CN106951841B (en) * 2017-03-09 2020-05-12 广东顺德中山大学卡内基梅隆大学国际联合研究院 Multi-target tracking method based on color and distance clustering
CN107239760A (en) * 2017-06-05 2017-10-10 中国人民解放军军事医学科学院基础医学研究所 Video data processing method and system
CN107239760B (en) * 2017-06-05 2020-07-17 中国人民解放军军事医学科学院基础医学研究所 Video data processing method and system
CN110612534A (en) * 2017-06-07 2019-12-24 赫尔实验室有限公司 System for detecting salient objects in images
CN110612534B (en) * 2017-06-07 2023-02-21 赫尔实验室有限公司 System, computer-readable medium, and method for detecting salient objects in an image
CN108364288B (en) * 2018-03-01 2022-04-05 北京航空航天大学 Segmentation method and device for breast cancer pathological image
CN108364288A (en) * 2018-03-01 2018-08-03 北京航空航天大学 Dividing method and device for breast cancer pathological image
CN108335307A (en) * 2018-04-19 2018-07-27 云南佳叶现代农业发展有限公司 Adaptive tobacco leaf image segmentation method and system based on dark channel prior
CN108596920A (en) * 2018-05-02 2018-09-28 北京环境特性研究所 Target segmentation method and device based on color images
CN108764106A (en) * 2018-05-22 2018-11-06 中国计量大学 Multi-scale color image face comparison method based on cascade structure
CN108764106B (en) * 2018-05-22 2021-12-21 中国计量大学 Multi-scale color image face comparison method based on cascade structure
CN109035254A (en) * 2018-09-11 2018-12-18 中国水产科学研究院渔业机械仪器研究所 Moving fish body shadow removal and image segmentation method based on improved K-means clustering
CN109858394A (en) * 2019-01-11 2019-06-07 西安电子科技大学 Remote sensing image water area extraction method based on saliency detection
CN109949906A (en) * 2019-03-22 2019-06-28 上海鹰瞳医疗科技有限公司 Pathological section image processing and model training method and device
CN110268442B (en) * 2019-05-09 2023-08-29 京东方科技集团股份有限公司 Computer-implemented method of detecting a foreign object on a background object in an image, device for detecting a foreign object on a background object in an image, and computer program product
CN110268442A (en) * 2019-05-09 2019-09-20 京东方科技集团股份有限公司 Computer-implemented method, device, and computer program product for detecting a foreign object on a background object in an image
CN110232378A (en) * 2019-05-30 2019-09-13 苏宁易购集团股份有限公司 Image interest point detection method and system and readable storage medium
CN110232378B (en) * 2019-05-30 2023-01-20 苏宁易购集团股份有限公司 Image interest point detection method and system and readable storage medium
CN111339953A (en) * 2020-02-27 2020-06-26 广西大学 Clustering analysis-based mikania micrantha monitoring method
CN111400557A (en) * 2020-03-06 2020-07-10 北京市环境保护监测中心 Method and device for automatically identifying atmospheric pollution key area
CN111400557B (en) * 2020-03-06 2023-08-08 北京市环境保护监测中心 Method and device for automatically identifying important areas of atmospheric pollution
CN113139934A (en) * 2021-03-26 2021-07-20 上海师范大学 Rice grain counting method
CN113139934B (en) * 2021-03-26 2024-04-30 上海师范大学 Rice grain counting method
CN113469976A (en) * 2021-07-06 2021-10-01 浙江大华技术股份有限公司 Object detection method and device and electronic equipment
CN115131327A (en) * 2022-07-14 2022-09-30 电子科技大学 Color feature fused display screen color line defect detection method
CN115131327B (en) * 2022-07-14 2024-04-30 电子科技大学 Color line defect detection method for display screen with fused color features
CN118570201A (en) * 2024-08-01 2024-08-30 吴江市兰天织造有限公司 Ultra-high density fabric detection method
CN118570201B (en) * 2024-08-01 2024-10-11 吴江市兰天织造有限公司 Ultra-high density fabric detection method

Also Published As

Publication number Publication date
CN104966085B (en) 2018-04-03

Similar Documents

Publication Publication Date Title
CN104966085B (en) 2018-04-03 Remote sensing image region-of-interest detection method based on multi-salient-feature fusion
CN109344701B (en) Kinect-based dynamic gesture recognition method
CN106780485B (en) SAR image change detection method based on super-pixel segmentation and feature learning
CN108288033B (en) 2021-02-05 Safety helmet detection method based on random fern fusion of multiple features
CN110363122A (en) 2019-10-22 Cross-domain object detection method based on multilayer feature alignment
CN104835175B (en) 2019-11-05 Object detection method in a nuclear environment based on a visual attention mechanism
WO2015180527A1 (en) Image saliency detection method
CN105528595A (en) Method for identifying and positioning power transmission line insulators in unmanned aerial vehicle aerial images
CN110400293B (en) No-reference image quality evaluation method based on deep forest classification
CN108960404B (en) Image-based crowd counting method and device
CN105809121A (en) Multi-characteristic synergic traffic sign detection and identification method
CN105528794A (en) Moving object detection method based on Gaussian mixture model and superpixel segmentation
CN106909902A (en) 2017-06-30 Remote sensing target detection method based on an improved hierarchical saliency model
CN112906550B (en) Static gesture recognition method based on watershed transformation
CN103295013A (en) Pared area based single-image shadow detection method
CN109886146B (en) Flood information remote sensing intelligent acquisition method and device based on machine vision detection
CN104392459A (en) 2015-03-04 Infrared image segmentation method based on improved FCM (fuzzy C-means) and mean shift
CN106557740A (en) 2017-04-05 Recognition method of oil depot targets in remote sensing images
CN114387505A (en) Hyperspectral and laser radar multi-modal remote sensing data classification method and system
Wang et al. Haze removal algorithm based on single-images with chromatic properties
WO2020119624A1 (en) Class-sensitive edge detection method based on deep learning
Ju et al. A novel fully convolutional network based on marker-controlled watershed segmentation algorithm for industrial soot robot target segmentation
Li et al. Superpixel-based adaptive salient region analysis for infrared and visible image fusion
Dornaika et al. A comparative study of image segmentation algorithms and descriptors for building detection
CN111222576A (en) High-resolution remote sensing image classification method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180403