CN103336801B - Remote sensing image retrieval method based on multiple features LSH index combination - Google Patents

Remote sensing image retrieval method based on multiple features LSH index combination Download PDF

Info

Publication number
CN103336801B
CN103336801B CN201310245809.6A CN201310245809A CN103336801B CN 103336801 B CN103336801 B CN 103336801B CN 201310245809 A CN201310245809 A CN 201310245809A CN 103336801 B CN103336801 B CN 103336801B
Authority
CN
China
Prior art keywords
index
lsh
remote sensing
feature space
subimage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201310245809.6A
Other languages
Chinese (zh)
Other versions
CN103336801A (en
Inventor
李士进
谢萍
冯钧
於慧
万定生
朱跃龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hohai University HHU
Original Assignee
Hohai University HHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hohai University HHU filed Critical Hohai University HHU
Priority to CN201310245809.6A priority Critical patent/CN103336801B/en
Publication of CN103336801A publication Critical patent/CN103336801A/en
Application granted granted Critical
Publication of CN103336801B publication Critical patent/CN103336801B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a kind of remote sensing image retrieval method based on multiple features LSH index combination, belong to remote Sensing Image Retrieval technical field.The LSH of one of index technology best for high-dimensional feature space is indexed and introduces remote Sensing Image Retrieval field by the present invention, can effectively solve dimension disaster and retrieve time-consuming problem, it is achieved that the quick-searching of remote sensing images in scale;Simultaneously, the present invention is directed to LSH index and propose the index Effective exponent (FDIVI) of a new index Validity Index feature based distinguishing ability, select distinguish the feature of target and background by the LSH on each feature space being indexed assessment, thus be effectively increased the accuracy of retrieval result.Compared to existing technology, the present invention can be realized more rapidly and accurately the retrieval of magnanimity remote sensing image data.

Description

Remote sensing image retrieval method based on multiple features LSH index combination
Technical field
The present invention relates to remote Sensing Image Retrieval technical field, particularly relate to a kind of based on multiple features LSH index combination distant Sense image search method.
Background technology
Over nearly 40 years, earth observation data volume sharply increases along with the development of remote sensing technology, and various information systeies are over the ground The requirement of thing identification technology is more and more higher.Carry out remote sensing images the most quickly and efficiently automatically to classify and retrieve and become urgent need One of problem solved.
In general image retrieval, searched targets is piece image or is included in piece image, and range of search is figure As other independent some images in storehouse.Different with general natural image retrieval, remote Sensing Image Retrieval has a characteristic that to be treated Searched targets is in the remote sensing images that an amplitude ratio is bigger, other similar purpose be dispersed in cover certain territorial scope a width or In several other remote sensing images, scattered target has certain visual similarity.Content-based Remote Sensing Image Retrieval System based on content carries Since going out, researcher has been achieved for many achievements.As Zhu etc. uses Gabor textural characteristics to retrieve aerial image [Zhu Bin, Marshall R, Hsinchun C.Creating a large-scale content-based airphoto image Digital library [J] .IEEE Trans.on Image Processing, 2000,9 (1): 163-167], Lu Lizhen etc. [Lu Lizhen, Liu Renyi, Liu Nan. the remote sensing image retrieval method [J] of a kind of Fusion of Color and textural characteristics. China's image graphics Journal, 2004,9 (3): 328-332] Gabor textural characteristics is merged in proposition and color characteristic carries out remote Sensing Image Retrieval, and use The linear weighted function of texture and color characteristic Euclidean distance carrys out measured similarity.Zeng Zhiming et al. [Zeng Zhiming, Li Feng, Fu Kun. wait one Plant the Texture Segmentation Algorithm [J] of large scale remote sensing images information retrieval based on contents. Wuhan University Journal: information science version, 2005,30 (12): 1080-1083] the co-occurrence matrix textural characteristics improved is utilized to carry out large scale remote Sensing Image Retrieval.Geng Li River etc. [Geng Lichuan, Wu Yundong, Geng Zexun, Wang Yonggang. remote Sensing Image Retrieval [J] based on artificial immune system. China's image figure Shape journal, 2010,01:155-160] propose a kind of remote Sensing Image Retrieval algorithm based on artificial immune system.Zhu Jiali etc. [Zhu Jiali, Li Shijin, Wan Dingsheng, Feng Jun. feature based selects and the remote Sensing Image Retrieval [J] of semi-supervised learning. China's image Figure journal, 2011.16 (8): 1474-1482] selection of a kind of feature based and the remote Sensing Image Retrieval of semi-supervised learning are proposed Method.
The method realizing remote sensing images similarity is a lot, and that most basic is sequential scan algorithm (Sequential Scan Algorithm, SSA).Also having the tree algorithm divided based on space, such as: R-tree, Kd-tree, SR-tree, these are calculated The result that method returns is the most more accurate, but they time efficiencies on High Dimensional Data Set are the highest.Document [Weber R, Schek H,Blott S.A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces[C]//Proc.of the24th Intl.Conf.on Very Large Data Bases (VLDB) .1998:194-205] in point out, in dimension higher than after 10, the calculation divided based on space Method is on the contrary not as linear search.
In remote Sensing Image Retrieval based on content, quick-searching goes out a satisfactory similar subgraph image set, than flower The expense long period retrieves the image set more attractive complying fully with requirement.And to different types of searched targets, feature should This is different, selects to represent that the feature of searched targets content, to carry out image retrieval, can improve accessibility further Energy.
Summary of the invention
The technical problem to be solved is to overcome prior art not enough, it is provided that a kind of based on multiple features LSH rope Draw the remote sensing image retrieval method of combination, different characteristic collection is set up LSH index respectively, and according to indexing Validity Index certainly The employing optimal characteristics adapted to retrieves image, can be effectively improved the accuracy of retrieval result.
The present invention solves above-mentioned technical problem the most by the following technical solutions:
Remote sensing image retrieval method based on multiple features LSH index combination, comprises the following steps:
Step 1, remote sensing images carrying out piecemeal, each image block constitutes subgraph as a subimage, all image blocks Image set;Step 2, in the feature space that at least two is different, respectively with described subgraph image set for data set set up LSH index; Described feature space is color feature space or/and textural feature space;
Step 3, for given query sample, the LSH index being utilized respectively each feature space carries out LSH retrieval, obtains LSH in each feature space retrieves result;
Step 4, retrieve result according to the LSH in each feature space, calculate the index Validity Index of each feature space: face The index Validity Index FDIVI in color characteristic spacecAnd the index Validity Index FDIVI of textural feature spacetRespectively according to Below equation calculates:
FDIVI c = R t ( 1 ) R t ( 0 )
FDIVI t = R t ( 0 ) R t ( 1 )
Wherein, R t ( i ) = 1 | C i | Σ x ∈ C i D ( x , q ) , i = 0,1 ,
In formula, | Ci| represent bunch CiSize;C0Represent the mesh that the subimage in the Hash bucket hit by query sample is constituted Mark bunch;C1Represent by the non-targeted bunch constituted with the subimage in the Hash bucket of query sample arest neighbors;(x q) represents sample x to D And the distance between query sample q, for color characteristic, (x q) uses rectangular histogram to hand over distance to D;For textural characteristics, and D (x, q) Use Euclidean distance;RtI () represents the average distance between query sample and each bunch;
Step 5, the color characteristic that selection index Validity Index value is minimum respectively, textural characteristics are as optimum color characteristic With optimum textural characteristics, and determine optimum color characteristic and the binaryzation weight of optimum textural characteristics in accordance with the following methods: index Validity Index value is more than 1, then weight is set to 0;Otherwise, weight is set to 1;
Step 6, when two kinds of optimal characteristics exist the feature that weight is 0, only use weight be 1 feature space in LSH retrieval result is as final remote Sensing Image Retrieval result;When two kinds of optimal characteristics weights are 1, optimum to both All subimages that LSH retrieval result in feature space is comprised, by being weighted the similarity of two kinds of optimal characteristics Linear operation generates comprehensive similarity, and several have the subimage of maximum comprehensive similarity as finally before therefrom selecting Remote Sensing Image Retrieval result.
Preferably, generate comprehensive by the similarity of two kinds of optimal characteristics is weighted linear operation described in step 5 Close similarity, the most in accordance with the following methods:
Given query sample and described subgraph is calculated respectively in optimum color feature space and optimum textural feature space The variance of the spacing of each subimage in image set;
Then according to below equation determines optimum color characteristic and the comprehensive similarity weight w of optimum textural characteristicsc、wt:
w c = σ c σ c + σ t ,
w t = σ t σ c + σ t ,
Wherein, σcAnd σtRepresent respectively in optimum color feature space and optimum textural feature space given query sample with Described subimage concentrates the variance of the spacing of each subimage;
Given comprehensive distance D between query sample and arbitrary subimage i is obtained finally according to following formulai, comprehensive distance is more Little, show that comprehensive similarity is the highest:
Di=wcDci+wtDti,
Wherein, DciAnd DtiRepresent given query sample in optimum color feature space and optimum textural feature space respectively And the distance between subimage i.
Further, described therefrom select before several have the subimage of maximum comprehensive similarity as final remote sensing Image searching result, the most in accordance with the following methods:
Selected part subimage is concentrated from described subimage, and in optimum color feature space and optimum textural feature space Middle average T calculating this partial subgraph picture distance each other respectivelycAnd Tt
Determine similarity threshold T according to the following formula:
T=wcTc+wtTt,
Wherein, wcAnd wtIt is respectively optimum color characteristic and the comprehensive similarity weight of optimum textural characteristics;
In all subimages that last LSH retrieval result from two kinds of optimal characteristics spaces is comprised, choose with given The comprehensive distance between the query sample subimage less than or equal to similarity threshold T, as final remote Sensing Image Retrieval result.
Compared to existing technology, the method have the advantages that
First, the LSH of one of index technology best for high-dimensional feature space is indexed and introduces remote Sensing Image Retrieval by the present invention Field, can effectively solve dimension disaster in scale and retrieve time-consuming problem, it is achieved that the quick-searching of remote sensing images;
Secondly, the present invention is directed to LSH index and propose a new index Validity Index feature based distinguishing ability Index Effective exponent (feature discriminativeness-based indexing validation index, FDIVI), select distinguish the feature of target and background by the LSH on each feature space being indexed assessment, thus effectively Improve the accuracy of retrieval result.
Accompanying drawing explanation
Fig. 1 is the flow chart of present invention remote sensing image retrieval method based on multiple features LSH index combination;
Fig. 2 (a) Fig. 2 (c) is followed successively by the remote sensing figure using the inventive method, sequential scan algorithm, expert manually to give Settlement place retrieval result in Xiang;
Fig. 3 (a) Fig. 3 (c) is followed successively by the remote sensing figure using the inventive method, sequential scan algorithm, expert manually to give Cultured In The Lake Pen retrieval result in Xiang;
Fig. 4 (a) Fig. 4 (c) is followed successively by the remote sensing figure using the inventive method, sequential scan algorithm, expert manually to give Retrieval result in forest land, the Maoshan Mountain in Xiang;
Fig. 5 (a) Fig. 5 (c) is followed successively by the remote sensing figure using the inventive method, sequential scan algorithm, expert manually to give Soil erosion retrieval result near Xuzhou in Xiang.
Detailed description of the invention
Below in conjunction with the accompanying drawings technical scheme is described in detail:
The thinking of the present invention is LSH index technology to introduce the automatic of magnanimity remote sensing image data inquire about and in retrieval, profit Solve the dimension disaster in remote Sensing Image Retrieval with LSH index and retrieve time-consuming problem, thus realizing the quick of remote sensing images Retrieval;And the index proposing a new index Validity Index feature based distinguishing ability for LSH index is effective Sex index (feature discriminativeness-based indexing validation index, FDIVI), passes through LSH on each feature space is indexed assessment and selects distinguish the feature of target and background, thus be effectively improved retrieval knot The accuracy of fruit.
For the ease of public understanding technical solution of the present invention, the most first LSH index technology is briefly introduced.
1998, Indyk and Motwani [Indyk P, Motawani R.Approximate Nearest Neighbors:Towards Removing the Curse of Dimensionality[C]//Proc.of the30th Annual ACM Symp.on Theory of Computing, 1998:p604-613] propose quick for nearest neighbor search problem The theoretical basis of sense position Hash (Locality Sensitive Hashing, LSH) algorithm.1999, Gionis etc. [Gionis A,Indyk P,Motwani R.Similarity search in high dimensions via hashing [C]//Proc.of the25th International Conference on Very Large Data Bases(VLDB) .1999] it improved and tested, using the way of Hash to solve the quick-searching problem of high dimensional data.2002, Tang Pretty China etc. [Tang Junhua, Yan Baoping. rapid image retrieval [J] based on LSH index. computer engineering and application, 2002,24: 20-21,63] LSH Index Algorithm is applied in Content-Based Image Retrieval system, its performance is better than traditional indexing means. 2007, Lv et al. [Qin Lv, William Josephson, Zhe Wang, Moses Charikar, Kai Li, Multi- Probe LSH:Efficient Indexing for High-Dimensional Similarity Search[C]// Proceeding-VLDB07Proceedings of the33rd international conference on Very Large data bases.2007] on the basis of LSH algorithm based on entropy, it is proposed that investigate LSH (Multi-Probe) more and calculate Method, more effectively utilizes every Hash table to reach to improve the purpose of retrieval quality.2009, Cai Heng etc. [Cai Heng, Li Zhoujun, Sun Jian, Li Yang. Chinese text quick-searching [J] based on LSH. computer science .2009,36 (8): 201-204,230] for The LSH algorithm of binary vector has been done further improvement by text retrieval.
LSH Index Algorithm is by building Hash table and search algorithm two parts form, it is assumed that current subimage feature composition Data set is P.L is Hash table number, and each Hash table comprises the hash function of M stochastic generation, and Hash table width is W.Structure Build hash table algorithm as follows:
1, every kind of feature to subimage carries out the conversion of Hamming (Hamming) space, P is mapped to Hamming spatially.
2, suitable parameter L, M, W are chosen.Initialize hash function race H={h1,...,hL, calculate feature in the index Position, the value of each hash function is determined by following formula:
h a , b = a * v + b W - - - ( 1 )
Wherein, a and b obeys gaussian random distribution, and v is the characteristic vector of subimage, obtains after calculating g ( v ) = ( h a 1 , b 1 ( v ) , . . . , h a M , b M ( v ) ) .
3, utilize these hash functions, feature is stored in corresponding hash table.
Search algorithm is as follows:
1, query sample q is selected, and by the maps feature vectors of q to Hamming space.
2, the structure of similar developing algorithm Hash table, calculates cryptographic Hash g (q) of q, returns in the Hash bucket that g (q) points to Object.
3, Candidate Set is chosen.For LSH multi-dimensional indexing, owing to, in same hash function race, random function is independent With distribution, so regarding any one Hash table as a grader.And importance between Hash table is phase in indexing With, general ballot method just can calculate the synthesis result of many subseries.
The present invention proposes to be applied in remote Sensing Image Retrieval LSH Index Algorithm, and its basic thought is: carry out subimage Specific hash, in every Hash table, two subimages similar on higher dimensional space can fall into same on the biggest probability In individual Hash bucket.Regarding the set that subimage forms as a small-sized image library, system is near with query image by returning One group of image as patibhaga-nimitta, then searches for the target image the most similar to query image in the image set returned, reduces and use The time that family is retrieved every time.
Different types of searched targets in remote sensing images, the description to its content, the effectiveness of different characteristic also differs, If able to select the optimal feature representing searched targets content to carry out image retrieval, then can be greatly enhanced accessibility Energy.According to research LSH directoried data set classifying quality carried out in the past, it was found that such a conclusion: the k of inquiry target is near Neighbour is mainly distributed in target bucket, and only minority can be distributed in the Hash bucket of target bucket.Remote sensing images subimage amount with The quantity of large database is compared less mutually, and has redundancy between subimage, so the present invention considers when index selection With the non-targeted bucket that target bucket distance is 1.
In the multiclass feature describing remote sensing images, targets different in color feature remote sensing images, textural characteristics Can make up the deficiency that spatial information is distributed by color characteristic, it is former that index corresponding to both features meets that grader selects Then, not only there is High Defferential degree but also there is high complementary information.Therefore, the present invention uses this two category feature to combine to carry out remote sensing images Retrieval.
The present invention utilizes the characteristic vector of sample image, according to the representativeness of query sample in LSH index classification more than bunch The collection center representativeness to whole gathering, it is proposed that a kind of index Validity Index feature based mirror for LSH index Index Effective exponent (the feature discriminativeness-based indexing validation of other ability Index, FDIVI), its computing formula is specific as follows:
R t ( i ) = 1 | C i | Σ x ∈ C i D ( x , q ) , i = 0,1 - - - ( 2 )
FDIVI c = R t ( 1 ) R t ( 0 )
FDIVI t = R t ( 0 ) R t ( 1 ) - - - ( 3 )
Wherein, in formula (2), | Ci| represent bunch CiSize, C0Represent the subgraph in the Hash bucket hit by query sample As the target bunch constituted, C1Represent by the non-targeted bunch constituted with the subimage in the Hash bucket of query sample arest neighbors.Q represents Query sample, RtI () represents the average distance between query point and each bunch.For color characteristic, (x q) uses rectangular histogram to D Distance is handed over to calculate;For textural characteristics, (x q) uses Euclidean distance to calculate to D.
In formula (3), FDIVIcRepresent the desired value of color characteristic, FDIVI in feature spacetRepresent the finger of textural characteristics Scale value.When considering two kinds of color characteristics, FDIVIcRefer to that target value is the least, illustrate on the color feature space of its correspondence, Rt And R (0)t(1) value difference is away from the biggest, and LSH index is the best to the classifying quality of subimage, and therefore this feature is elected as optimum color Feature.When considering two kinds of textural characteristics, similar color characteristic, choose FDIVItIt is worth less feature special as optimum texture Levy.
I.e. can get technical scheme according to above-mentioned analysis, come skill of the present invention with a specific embodiment below Art scheme is further described.
The remote sensing image retrieval method based on multiple features LSH index combination of the present invention, as it is shown in figure 1, include following step Rapid:
Step 1, remote sensing images carrying out piecemeal, each image block constitutes subgraph as a subimage, all image blocks Image set.
The present embodiment uses overlap partition strategy conventional in remote Sensing Image Retrieval.When overlapping size takes minima 1*1 Pixel, the most overlapping between subimage after piecemeal, some Small object may be divided into several subimages dissimilar with sample, Retrieval result is worst;When overlapping size takes maximum 63*63, the most intensive between subimage after piecemeal, retrieval result is the most accurate, but time Empty complexity is the highest.Owing to present invention employs index strategy, therefore between subimage block, overlapping degree can be of a relatively high, real Remote sensing images are preferably divided in border the image block of series of partially overlapping 64*64 size, and between block, overlapping size is 60* 60。
Step 2, in the feature space that at least two is different, set up LSH rope with described subgraph image set for data set respectively Draw;Described feature space is color feature space or/and textural feature space.
In this embodiment, HSI color characteristic, Lab color characteristic, gray level co-occurrence matrixes it are extracted respectively (Gray level Co-Occurrence Matrix, GLCM) textural characteristics and Gabor textural characteristics are selected multiple In feature space, respectively each subimage is carried out feature extraction;Then in each feature space, LSH index is set up respectively: by height Dimensional feature is mapped to Hamming space, generates the hash function race of LSH, then every kind of spy of antithetical phrase image set according to default parameter Levy and carry out position sensing Hash hash, set up LSH index.In this specific embodiment, institute in Hash table number L, each Hash table Hash function quantity M comprised and the preferred value of Hash table width W be: L=20, M=5, W=4.
It is relatively time-consuming owing to view picture remote sensing images are extracted various features ratio, so off-line completes this process effectively save use of energy The time of family inquiry.Therefore the present invention is preferably by step 1, step 2 off-line execution.
Step 3, for given query sample, the LSH index being utilized respectively each feature space carries out LSH retrieval, obtains LSH in each feature space retrieves result.
For given query sample, utilizing the data set that LSH indexed search is close is prior art, and here is omitted.
Step 4, retrieve result according to the LSH in each feature space, calculate the index Validity Index of each feature space: face The index Validity Index FDIVI in color characteristic spacecAnd the index Validity Index FDIVI of textural feature spacetRespectively according to Below equation calculates:
FDIVI c = R t ( 1 ) R t ( 0 )
FDIVI t = R t ( 0 ) R t ( 1 )
Wherein, R t ( i ) = 1 | C i | Σ x ∈ C i D ( x , q ) , i = 0,1 ,
In formula, | Ci| represent bunch CiSize;C0Represent the mesh that the subimage in the Hash bucket hit by query sample is constituted Mark bunch;C1Represent by the non-targeted bunch constituted with the subimage in the Hash bucket of query sample arest neighbors;(x q) represents sample x to D And the distance between query sample q, for color characteristic, D (x, q) use rectangular histogram hand over distance (see document [M.J.Swain, D.H.Ballard,Color indexing,International Journal of Computer Vision,7:1,11-32 (1991)]);For textural characteristics, (x q) uses Euclidean distance to D;RtI () represents between query sample and each bunch average Distance.
Step 5, the color characteristic that selection index Validity Index value is minimum respectively, textural characteristics are as optimum color characteristic With optimum textural characteristics, and determine optimum color characteristic and the binaryzation weight of optimum textural characteristics.
When considering two or more color characteristic, FDIVIcRefer to that target value is the least, the color in its correspondence is described On feature space, RtAnd R (0)t(1) value difference is away from the biggest, and LSH index is the best, therefore by this feature to the classifying quality of subimage Elect optimum color characteristic as.When considering two or more textural characteristics, similar color characteristic, choose FDIVItIt is worth less Feature as optimum textural characteristics.
When the FDIVI index of selected optimal characteristics is more than 1, for color characteristic, represent query sample and target bunch it Between average distance less than with arest neighbors non-targeted bunch between average distance, illustrate can not significantly distinguish in color space Target bunch and arest neighbors non-targeted bunch, be now set to 0 by the weight of color characteristic, be otherwise 1;For textural characteristics, then it represents that Average distance between query sample and target bunch is more than the average distance between arest neighbors non-targeted bunch, illustrates at texture empty Can not significantly distinguish target bunch and arest neighbors non-targeted bunch between, now the weight of textural characteristics is set to 0, be otherwise 1.
Step 6, when two kinds of optimal characteristics exist the feature that weight is 0, only use weight be 1 feature space in LSH retrieval result is as final remote Sensing Image Retrieval result;When two kinds of optimal characteristics weights are 1, optimum to both All subimages that LSH retrieval result in feature space is comprised, by being weighted the similarity of two kinds of optimal characteristics Linear operation generates comprehensive similarity, and several have the subimage of maximum comprehensive similarity as finally before therefrom selecting Remote Sensing Image Retrieval result.
This specific embodiment uses following methods determine the comprehensive similarity between subimage and given query sample:
Given query sample and described subgraph is calculated respectively in optimum color feature space and optimum textural feature space The variance of the spacing of each subimage in image set;
Then according to below equation determines optimum color characteristic and the comprehensive similarity weight w of optimum textural characteristicsc、wt:
w c = σ c σ c + σ t ,
w t = σ t σ c + σ t ,
Wherein, σcAnd σtRepresent respectively in optimum color feature space and optimum textural feature space given query sample with Described subimage concentrates the variance of the spacing of each subimage;
Given comprehensive distance D between query sample and arbitrary subimage i is obtained finally according to following formulai, comprehensive distance is more Little, show that comprehensive similarity is the highest:
Di=wcDci+wtDti,
Wherein, DciAnd DtiRepresent given query sample in optimum color feature space and optimum textural feature space respectively And the distance between subimage i.
So can retrieve in all subimages that result is comprised by the LSH from two kinds of optimal characteristics spaces, select The partial subgraph picture that comprehensive similarity is the highest, as final retrieval result.This specific embodiment specifically uses following methods: From described subimage concentrate selected part subimage (the present embodiment is from subimage concentrate randomly select 10% subimage), And calculate this partial subgraph picture distance each other respectively in optimum color feature space and optimum textural feature space Average, be designated as T respectivelycAnd Tt;Then according to the comprehensive similarity power of the optimum color characteristic obtained and optimum textural characteristics Weight wcAnd wt, determine similarity threshold T:T=wcTc+wtTt;Last LSH retrieval result from two kinds of optimal characteristics spaces is wrapped In all subimages contained, choose and give comprehensive distance D between query sampleiThe subimage of≤T, as final remote sensing Image searching result.
Below by the remote Sensing Image Retrieval of different ground mulching being verified the effectiveness of the inventive method, and make It is estimated with precision ratio based on area and recall ratio.
If Area (s) is all similar to the query sample region area retrieved in inquiry, Area (v) is in image But the region area that be not retrieved similar with query sample, Area (u) is that retrieve and that query sample is unrelated area surface Long-pending, then the recall ratio retrieved and precision ratio can be expressed as:
Precision ratio: precision = Area ( s ) Area ( s ) + Area ( u ) - - - ( 4 )
Recall ratio: recall = Area ( s ) Area ( s ) + Area ( v ) - - - ( 5 )
Taking into account retrieval precision and efficiency, use image subblock size 64*64, between sub-block, piecemeal overlap size is 60 pictures Element.To user's index building time, the retrieval time of index of reference and use sequential scan algorithm is made to carry out similarity measurement and need The time three wanted carries out Statistical Comparison, and corresponding time showing is in Table 1.
The remote sensing images LSH indexing means of the different ground mulching of table 1 contrasted with the sequential scanning method time
Picture number LSH builds the time (s) LSH query time (s) Sequential query time (s)
2 110.5 0.031 6.692
3 64.49 0.024 10.37
4 26.38 0.031 8.456
5 12.23 0.02 4.508
Found out by table 1, when single inquiry is carried out for all images, the query time of LSH index and sequential scan time Compare and greatly reduce.
Fig. 2 (a)~Fig. 5 (c) is to Soil erosion region near settlement place, Cultured In The Lake Pen, forest land, the Maoshan Mountain and Xuzhou respectively Retrieval result, original image pixels is respectively 1024*768,1024*768,800*749,812*531, has scaling herein.Its In, Fig. 2 (a), Fig. 3 (a), Fig. 4 (a), Fig. 5 (a) they are the retrieval results using the inventive method to obtain, Fig. 2 (b), Fig. 3 (b), figure 4(b), Fig. 5 (b) be the retrieval result utilizing sequential scan algorithm to obtain, Fig. 2 (c), Fig. 3 (c), Fig. 4 (c), Fig. 5 (c) are experts The similar area that really should retrieve manually be given.
As can be seen from the figure, compared with sequential scan algorithm, the retrieval effectiveness utilizing feature combination method to obtain compares Satisfactory, the region of each target area false retrieval missing inspection is less.For Fig. 2 (a) Fig. 2 (c), it can be seen that the inventive method Retrieve the settlement place region, one piece of the lower right corner that sequential scanning method does not retrieve, and the picture that sequential index retrieves is right The region of side is wrong, utilizes feature combination method then it can be avoided that false retrieval;Equally, Fig. 3 (a) Fig. 3 (c) can be seen that profit By the zone errors that inventive process avoids below the picture retrieved by sequential scanning method;Can in Fig. 4 (a) Fig. 4 (c) Finding out that forest land, the Maoshan Mountain regional extent that the inventive method retrieval obtains is more than sequential index, recall ratio and precision ratio are higher;Fig. 5 A, in () Fig. 5 (c), the inventive method retrieves Soil erosion near the Xuzhou, one piece of the lower right corner that sequential index does not retrieve Region.
The sharpest edges of the inventive method are combination of multiple features, and it is by mesh different in color feature remote sensing images Mark, introduces textural characteristics and makes up the deficiency that spatial information distribution is described by color characteristic.
When two width subimages are dissimilar on color characteristic, further by analyzing textural characteristics between image vector away from From thus become similar, near that block Xuzhou, the lower right corner in that block settlement place region in the lower right corner and Fig. 5 (a) in Fig. 2 (a) Soil erosion region, they are not the most consistent with user's query sample on color characteristic, when using sequential scanning method Time, these regions can be missed, and can be retrieved by analyzing textural characteristics.Similar, when two width subimages are in face On color characteristic similar time, when use sequential scanning method, these regions can be mistakenly detected out, the present invention by analyze texture Feature increases the distance between image vector, thus reduces error detection.As left in the region below picture in Fig. 3 (a) and Fig. 5 (a) Side region part, they compare similar with user's query sample on color characteristic, can draw it by analyzing textural characteristics Be not target subimage, this region can be avoided to be mistakenly detected out.
Therefore, after introduced feature selects, the method using color and textural characteristics to combine can be effective to searched targets region Return, and need not manual intervention, significantly reduce the burden of user.
Table 2 is that the inventive method and sequential scanning method use recall ratio based on area and the knot of precision ratio assessment Really, it can be seen that the present invention based on multiple features LSH index combination remote sensing image retrieval method on recall ratio and precision ratio all Increase.
Table 2 accuracy result
From above-mentioned experimental result it can be seen that the index index utilizing the present invention to propose selects the feature being suitable for, and move State determines that the weight of feature is rational, and remote sensing image retrieval method based on multiple features LSH index combination can obtain preferably Retrieval effectiveness, and need not manual intervention, retrieval performance can be effectively improved.Experimental result is it is also shown that based on interior In the remote Sensing Image Retrieval held, being not that the feature kind used is the most, the performance of retrieval is the best.Therefore by index knot Really, selecting optimal characteristics by feature selection, recycling index index automatically determines the weight of feature, uses suitable combination side Method carries out image retrieval and is possible not only to improve retrieval precision, moreover it is possible to effectively reduces the amount of calculation in retrieving, improves retrieval Speed.

Claims (6)

1. remote sensing image retrieval method based on multiple features LSH index combination, it is characterised in that comprise the following steps:
Step 1, remote sensing images carrying out piecemeal, each image block constitutes subgraph image set as a subimage, all image blocks;
Step 2, in the feature space that at least two is different, respectively with described subgraph image set for data set set up LSH index;Institute Stating feature space is color feature space or/and textural feature space;
Step 3, for given query sample, the LSH index being utilized respectively each feature space carries out LSH retrieval, obtains each spy Levy the LSH in space and retrieve result;
Step 4, retrieve result according to the LSH in each feature space, calculate the index Validity Index of each feature space:
The index Validity Index FDIVI of color feature spacecAnd the index Validity Index of textural feature space
FDIVItCalculate according to below equation respectively:
FDIVI c = R t ( 1 ) R t ( 0 )
FDIVI t = R t ( 0 ) R t ( 1 )
Wherein, R t ( i ) = 1 | C i | Σ x ∈ C i D ( x , q ) , i = 0 , 1 ,
In formula, | Ci| represent bunch CiSize;C0Represent the target that the subimage in the Hash bucket hit by query sample is constituted Bunch;C1Represent by the non-targeted bunch constituted with the subimage in the Hash bucket of query sample arest neighbors;D (x, q) represent sample x with Distance between query sample q, for color characteristic, (x q) uses rectangular histogram to hand over distance to D;For textural characteristics, (x q) adopts D Use Euclidean distance;RtI () represents the average distance between query sample and each bunch;
Step 5, select the minimum color characteristic of index Validity Index value, textural characteristics as optimum color characteristic and the most respectively Excellent textural characteristics, and determine optimum color characteristic and the binaryzation weight of optimum textural characteristics in accordance with the following methods: index is effectively Property desired value more than 1, then weight is set to 0;Otherwise, weight is set to 1;
Step 6, when two kinds of optimal characteristics exist the feature that weight is 0, only use weight be 1 feature space in LSH Retrieval result is as final remote Sensing Image Retrieval result;When two kinds of optimal characteristics weights are 1, to both optimal characteristics All subimages that LSH retrieval result in space is comprised, by being weighted linearly the similarity of two kinds of optimal characteristics Computing generates comprehensive similarity, and several have the subimage of maximum comprehensive similarity as final distant before therefrom selecting Sense image searching result;Wherein,
Described generate comprehensive similarity by the similarity of two kinds of optimal characteristics being weighted linear operation, specifically according to Lower method:
Given query sample and described subgraph image set is calculated respectively in optimum color feature space and optimum textural feature space In the variance of spacing of each subimage;
Then according to below equation determines optimum color characteristic and the comprehensive similarity weight w of optimum textural characteristicsc、wt:
w c = σ c σ c + σ t ,
w t = σ t σ c + σ t ,
Wherein, σcAnd σtRepresent that in optimum color feature space and optimum textural feature space, given query sample is with described respectively Subimage concentrates the variance of the spacing of each subimage;
Given comprehensive distance D between query sample and arbitrary subimage i is obtained finally according to following formulai, comprehensive distance is the least, table Bright comprehensive similarity is the highest:
Di=wcDci+wtDti,
Wherein, DciAnd DtiRepresent given query sample and subgraph in optimum color feature space and optimum textural feature space respectively As the distance between i;
Described therefrom select before several have the subimage of maximum comprehensive similarity as final remote Sensing Image Retrieval result, The most in accordance with the following methods:
Concentrate selected part subimage from described subimage, and divide in optimum color feature space and optimum textural feature space Do not calculate average T of this partial subgraph picture distance each othercAnd Tt
Determine similarity threshold T according to the following formula:
T=wcTc+wtTt,
Wherein, wcAnd wtIt is respectively optimum color characteristic and the comprehensive similarity weight of optimum textural characteristics;
In all subimages that last LSH retrieval result from two kinds of optimal characteristics spaces is comprised, choose and given inquiry The comprehensive distance between the sample subimage less than or equal to similarity threshold T, as final remote Sensing Image Retrieval result.
2. the remote sensing image retrieval method combined based on multiple features LSH index as claimed in claim 1, it is characterised in that described The feature space that at least two is different includes the textural characteristics that color feature space that at least two is different is different with at least two Space.
3. the remote sensing image retrieval method combined based on multiple features LSH index as claimed in claim 1, it is characterised in that from institute State subimage concentration selected part subimage to specifically refer to concentrate the subimage randomly selecting 10% from subimage.
4. remote sensing image retrieval method based on multiple features LSH index combination, its feature as described in any one of claims 1 to 3 It is, described piecemeal that remote sensing images are carried out, uses overlap partition strategy, remote sensing images are divided into series of partially overlapping The image block of 64*64 size, between block, overlapping size is 60*60.
5. remote sensing image retrieval method based on multiple features LSH index combination, its feature as described in any one of claims 1 to 3 Being, described step 1, step 2 off-line execution, described step 3 step 6 performs online.
6. remote sensing image retrieval method based on multiple features LSH index combination, its feature as described in any one of claims 1 to 3 It is, when setting up LSH index, hash function quantity M included in Hash table number L, each Hash table and Hash table width The value of degree W is: L=20, M=5, W=4.
CN201310245809.6A 2013-06-20 2013-06-20 Remote sensing image retrieval method based on multiple features LSH index combination Expired - Fee Related CN103336801B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310245809.6A CN103336801B (en) 2013-06-20 2013-06-20 Remote sensing image retrieval method based on multiple features LSH index combination

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310245809.6A CN103336801B (en) 2013-06-20 2013-06-20 Remote sensing image retrieval method based on multiple features LSH index combination

Publications (2)

Publication Number Publication Date
CN103336801A CN103336801A (en) 2013-10-02
CN103336801B true CN103336801B (en) 2016-08-10

Family

ID=49244966

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310245809.6A Expired - Fee Related CN103336801B (en) 2013-06-20 2013-06-20 Remote sensing image retrieval method based on multiple features LSH index combination

Country Status (1)

Country Link
CN (1) CN103336801B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105184308B (en) * 2015-08-03 2020-09-29 北京航空航天大学 Remote sensing image building detection classification method based on global optimization decision

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239551B (en) * 2014-09-24 2017-04-19 河海大学 Multi-feature VP-tree index-based remote sensing image retrieval method and multi-feature VP-tree index-based remote sensing image retrieval device
CN104462196B (en) * 2014-10-30 2019-01-22 南京信息工程大学 Multiple features combining Hash information search method
CN104615994B (en) * 2015-02-11 2017-08-29 山东正元数字城市建设有限公司 A kind of monitoring image real-time processing method
CN106611016B (en) * 2015-10-27 2019-04-05 北京航空航天大学 A kind of image search method based on decomposable word packet model
CN105512677B (en) * 2015-12-01 2019-02-01 南京信息工程大学 Classifying Method in Remote Sensing Image based on Hash coding
CN105653656B (en) * 2015-12-28 2019-06-07 成都希盟泰克科技发展有限公司 A kind of multiple features document retrieval method based on improvement LSH
CN105760468B (en) * 2016-02-05 2019-05-31 大连大学 The extensive image query system of sensitive hash index is set under mobile environment based on ranking
CN108629233A (en) 2017-03-20 2018-10-09 华为技术有限公司 A kind of pedestrian retrieval method and device
CN108733679B (en) * 2017-04-14 2021-10-26 华为技术有限公司 Pedestrian retrieval method, device and system
CN108399211B (en) * 2018-02-02 2020-11-24 清华大学 Large-scale image retrieval algorithm based on binary characteristics
CN109614510B (en) * 2018-11-23 2021-05-07 腾讯科技(深圳)有限公司 Image retrieval method, image retrieval device, image processor and storage medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101866366A (en) * 2010-07-15 2010-10-20 哈尔滨工业大学 Image formula Chinese document retrieval method based on content

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7353224B2 (en) * 2001-12-04 2008-04-01 Hewlett-Packard Development Company, L.P. System and method for efficiently finding near-similar images in massive databases

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101866366A (en) * 2010-07-15 2010-10-20 哈尔滨工业大学 Image formula Chinese document retrieval method based on content

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105184308B (en) * 2015-08-03 2020-09-29 北京航空航天大学 Remote sensing image building detection classification method based on global optimization decision

Also Published As

Publication number Publication date
CN103336801A (en) 2013-10-02

Similar Documents

Publication Publication Date Title
CN103336801B (en) Remote sensing image retrieval method based on multiple features LSH index combination
US11803591B2 (en) Method and apparatus for multi-dimensional content search and video identification
US11354352B1 (en) Geo-visual search
CN101877007B (en) Remote sensing image retrieval method with integration of spatial direction relation semanteme
CN102254015B (en) Image retrieval method based on visual phrases
CN101692224A (en) High-resolution remote sensing image search method fused with spatial relation semantics
CN104392250A (en) Image classification method based on MapReduce
Gudmundsson et al. Of motifs and goals: mining trajectory data
CN102609441A (en) Local-sensitive hash high-dimensional indexing method based on distribution entropy
Jasiewicz et al. GeoPAT: A toolbox for pattern-based information retrieval from large geospatial databases
CN102364498A (en) Multi-label-based image recognition method
CN102930275B (en) Based on the characteristics of remote sensing image system of selection of Cramer ' s V index
Wang et al. Tea picking point detection and location based on Mask-RCNN
Li et al. A GCN-based method for extracting power lines and pylons from airborne LiDAR data
Li et al. A comparison of deep learning methods for airborne lidar point clouds classification
Nurmasari et al. Oil palm plantation detection in Indonesia using Sentinel-2 and Landsat-8 optical satellite imagery (case study: Rokan Hulu regency, Riau Province)
Bai et al. Multi-scale hierarchical sampling change detection using Random Forest for high-resolution satellite imagery
Dutta et al. Symbol spotting in line drawings through graph paths hashing
Kyriakaki-Grammatikaki et al. Geometric primitive extraction from semantically enriched point clouds
Sjahputera et al. Clustering of detected changes in high-resolution satellite imagery using a stabilized competitive agglomeration algorithm
Alburshaid et al. Palm trees detection using the integration between gis and deep learning
CN114860974A (en) Remote sensing image retrieval positioning method
CN115719453A (en) Rice planting structure remote sensing extraction method based on deep learning
Liu et al. Adaptive algorithm for automated polygonal approximation of high spatial resolution remote sensing imagery segmentation contours
Divecha et al. Large-scale geolocalization of overhead imagery

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160810

Termination date: 20200620

CF01 Termination of patent right due to non-payment of annual fee