CN102789489A - Image retrieval method and system based on hand-held terminal - Google Patents

Image retrieval method and system based on hand-held terminal Download PDF

Info

Publication number
CN102789489A
CN102789489A CN2012102287115A CN201210228711A CN102789489A CN 102789489 A CN102789489 A CN 102789489A CN 2012102287115 A CN2012102287115 A CN 2012102287115A CN 201210228711 A CN201210228711 A CN 201210228711A CN 102789489 A CN102789489 A CN 102789489A
Authority
CN
China
Prior art keywords
paster
picture
space
rightarrow
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012102287115A
Other languages
Chinese (zh)
Other versions
CN102789489B (en
Inventor
杨震群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201210228711.5A priority Critical patent/CN102789489B/en
Publication of CN102789489A publication Critical patent/CN102789489A/en
Application granted granted Critical
Publication of CN102789489B publication Critical patent/CN102789489B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Image Analysis (AREA)

Abstract

The invention realizes an image retrieval system. The system allows a user to photograph an interested object or an interested scene through the hand-held terminal. After a picture is uploaded to a server, the system matches the existing picture in a database with the picture photographed by the user to discover the object or the scene photographed by the user, thereby determining an interesting point or a photographing position of the user; and then the related information is returned to the hand-held terminal of the user. A picture matching algorithm developed by the invention has the characteristics of resisting photographing angles, scales, rotation and light conversion, the matching precision achieves above 95.34 percent, and the application level is achieved.

Description

Image search method and system based on handheld terminal
Technical field
The present invention relates to the picture search technical field, a kind of image search method and system based on handheld terminal is provided.
Background technology
Present main flow search engine all is based on literal, yet the request of oneself inquiring about all can not perhaps inconveniently be described under many circumstances by domestic consumer with literal.The mode that the present invention allows the user to take pictures through handheld terminal is expressed oneself query requests with picture, and is directly perceived and make things convenient for, and satisfied the actual demand of people's lives.At present the similar image retrieval algorithm of success mostly be based on the partial interest point (Local Interesting Points, LIPS).Yet because it is very high to extract the algorithm time complexity of required LIPs, existed algorithms is difficult to real-time network picture retrieval.Algorithm of the present invention roughly levels off on precision based on the traditional algorithm of LIPs, but on time complexity, is far smaller than classic method, has satisfied the picture retrieval needs of real-time network environment.
Summary of the invention
The objective of the invention is to overcome the deficiency of prior art, propose the image matching method of a kind of anti-shooting angle, yardstick, rotation, illumination conversion.
The present invention is for realizing that above-mentioned purpose adopts following technical scheme:
1, a kind of image search method based on handheld terminal is characterized in that may further comprise the steps:
1) the camera routine interface that calls handheld terminal carries out photo acquisition;
2) the networking transmission interface that calls handheld terminal is sent to server with picture;
3) result who at last server is returned shows in user interface.
In the such scheme, server process adopts treatment step to comprise after receiving picture in the said step 2:
21) accept the picture that client-side program is uploaded;
22) picture being carried out paster decomposes;
23) paster is mapped in the paster space;
24) with the storehouse in existing picture mate;
25) according to the point of interest of matching result predictive user or the position of taking pictures;
26) relevant information is sent to client-side program.
In the such scheme, said step 22) the paster decomposition method may further comprise the steps in:
31) make up quadrature paster spatial model:
A, every width of cloth picture is modeled as the combination of one group of paster, the statistical nature that on contained all the member's pasters of a width of cloth picture, draws is used for representing the characteristic of this picture;
B, through collecting large-scale picture; And the paster of all pictures gathered and carry out cluster analysis; Find the paster that compacts a most set
Figure BDA00001848795000021
through the right similarity of all pasters in weighing
Figure BDA00001848795000022
; Construct a similarity matrix R; R is carried out obtaining an eigenvectors after the spectral factorization; The vector of this group paster during vector is
Figure BDA00001848795000023
is expressed, and the space that this eigenvectors is opened is the paster space of asking;
C, use spectral factorization (Spectral Decomposition) are optimized the orthogonality in space, and process is following: at first, calculate
Figure BDA00001848795000024
In all pasters to (u i, u j) the cosine similarity and put into matrix R=[r Ij] M * m, wherein
r ij = Sim ( u → i , u → j ) = cos ( u → i , u → j ) = u → i u → j | u → i | | u → j |
Figure BDA00001848795000026
Be paster u iAnd u jCorresponding minutia vector then, carries out spectral factorization to R, R=V Λ V T=(V Λ 1/2V T) (V Λ 1/2V T)=C TC
Wherein Λ is the diagonal matrix that has comprised the R eigenwert; The corresponding vector of all pasters in V is the characteristic of correspondence vector matrix, and Matrix C has comprised
Figure BDA00001848795000027
.The actual base vector that comprises one group of linear space of C that obtain this moment, the space that all base vectors are opened is the quadrature paster space of being asked.
32) be independent of the feature representation technology of conversion: 31) on the basis in said paster space, arbitrary non-ly in the paster space, be expressed as with reference to paster
Figure BDA00001848795000028
:
C T u → = R u
u → = ( C T ) - 1 R u
R wherein uBe the m dimensional vector, expressed paster u with In all relations with reference to paster.This moment, any image T can be expressed as the vector sum of the vector that all corresponding pasters of this picture form in the paster space in the paster space, promptly
T = Σ k = 1 n u → k
33) Adaptive Matching Algorithm: the picture that will be regarded as the candidate above the picture of certain threshold value with source picture analogies degree; Use hidden Markov model (Hidden Markov Model) that the spatial relation between paster is carried out modeling; Come to recognize automatically the pattern that comprises in the picture; Then, in all candidate's picture set, search for these patterns to find final result.
With respect to the traditional image matching technique, the model that the present invention proposes has following several advantages:
One, the balance of speed and precision: with respect to the method (as: color moment that lattice extracts) of global characteristics, the feature representation that the present invention proposes has certain resistance transducing power, thereby precision is higher; Traditional relatively local feature method (as: SIFT, SURF etc.), the method algorithm time complexity that the present invention proposes is lower, and detection speed is faster, is more suitable for large scale network and calculates.
Two, extensibility: the vector in the paster space that the present invention makes up can calculate mutually, for further mining algorithm has stayed development space.The structure that adds the space has been considered orthogonality (be each base vector not linear dependence), can guarantee the correlation computations mathematical model of convergence linear space more in the space, thereby more accurate.
Three, core of the present invention is the images match model based on the paster space.The characteristics of image expression is carried out in paster space of this model construction, has improved the mode of in the classic method all pasters in two width of cloth Target Photos (perhaps unique point) being compared one by one, thereby has reduced the time complexity of copy picture detection algorithm.Simultaneously, picture is carried out the method for paster decomposition and broken the locus strong constraint between local feature point in the picture, reduced of the influence of various copy picture editor modes, thereby improved the robustness of algorithm feature representation.
Description of drawings
Fig. 1 is a system block diagram of the present invention;
Fig. 2 is an image paster decomposing schematic representation of the present invention;
Fig. 3 is a system flow synoptic diagram of the present invention.
Embodiment
Embodiment of the present invention are made up of client algorithm and server end algorithm two parts.
The objective of the invention is to overcome the deficiency of prior art, propose the image matching method of a kind of anti-shooting angle, yardstick, rotation, illumination conversion.
Technical scheme of the present invention is:
At first make up the paster spatial model that to express picture; Picture can be tried one's best in the expression in this space do not receive the influence of various picture conversion, be created on this basis then and can both satisfy network (being mainly cell phone network) picture Matching Algorithm on speed and the precision.So correlative study is divided into three parts: quadrature paster spatial model, be independent of the feature representation technology of conversion and Adaptive Matching Algorithm.Below do respectively and briefly introduce:
Quadrature paster spatial model: this model is modeled as the combination (as shown in Figure 2) of one group of paster (a little image block) with every width of cloth picture, and the statistical nature that therefore on contained all the member's pasters of a width of cloth picture, draws can be used for representing the characteristic of this picture.
Through collecting large-scale picture; And the paster of all pictures gathered and carry out cluster analysis; Can find the paster that compacts most set
Figure BDA00001848795000041
this set to comprise the most representative one group of paster, react the various minutias that picture possibly carry.Through the right similarity of all pasters in weighing
Figure BDA00001848795000042
, can construct a similarity matrix R.R is carried out can obtaining an eigenvectors after the spectral factorization.The vector of this group paster during vector is
Figure BDA00001848795000043
is expressed.The space that this eigenvectors is opened is the paster space of asking.Because
Figure BDA00001848795000044
is the most representative paster group; Therefore any paster (even not in
Figure BDA00001848795000045
) can be weighed its entrained details classification through the similarity of all pasters in weighing with
Figure BDA00001848795000046
.For example, if a paster X follows
Figure BDA00001848795000047
In two paster u iAnd u jThe most similar, explain that then the entrained details of X is u iAnd u jThe combination of entrained minutia.So the vector x that the similarity of all pasters constitutes among each picture and
Figure BDA00001848795000048
has been expressed the distribution of its entrained details statistics.
But because with reference to also having certain similarity between the paster (i.e. all pasters in
Figure BDA00001848795000049
), direct in theory to make up the result that the method in space obtains with paster and their similarity be nonopiate.This nonorthogonality can influence the accuracy of net result.Therefore can use spectral factorization (Spectral Decomposition) that the orthogonality in space is optimized.Process is following: at first, calculate
Figure BDA000018487950000410
In all pasters to (u i, u j) the cosine similarity and put into matrix R=[r Ij] m * m, wherein
r ij = Sim ( u → i , u → j ) = cos ( u → i , u → j ) = u → i u → j | u → i | | u → j | - - - ( 1 )
Figure BDA000018487950000412
Be paster u iAnd u jCorresponding minutia vector (can use color moment, small echo texture etc.).Then, matrix R is carried out spectral factorization,
R=VΛV T=(VΛ 1/2V T)(VΛ 1/2V T)=C TC (2)
C TBe the transposed matrix of Matrix C, wherein Λ is the diagonal matrix that has comprised the R eigenwert, and V is the characteristic of correspondence vector matrix.The corresponding vector of all pasters can be expressed as the actual base vector that comprises one group of linear space of
Figure BDA000018487950000414
C during Matrix C had comprised
Figure BDA000018487950000413
.These base vectors open space be exactly the quadrature paster space that we require.Its orthogonality can guarantee that the expression of paster in the space has global coherency (Global Consistency).Last arbitrary non-ly can in the paster space, be expressed as with reference to paster
Figure BDA00001848795000051
:
C T u → = R u
u → = ( C T ) - 1 R u - - - ( 3 )
R wherein uBe the m dimensional vector, expressed paster u with In all relations (being cosine similarity shown in the formula 1) with reference to paster.Any image T can be expressed as the vector sum of the vector that all corresponding pasters of this picture form in the paster space in the paster space, promptly
T = Σ k = 1 n u → k - - - ( 4 )
Through above-mentioned model, any image can be mapped in the paster space and be expressed as a vector.Therefore similarity between image can be judged through the similarity (or distance) of calculating their corresponding vectors simply.This similarity account form has been avoided multiple spot characteristic matching mode in twos, and the time complexity of algorithm is from O (N 2) ease down to O (cN) (wherein c is a constant, represents the dimension in paster space).
Be independent of the feature representation technology of conversion: the paster that the paster spatial model carries out image decomposes breaks up the strong position constraint of each local feature on the picture.Because the influence that local detail is brought by various editor's characteristic changes is very little,, can more effectively deal with the influence of the editing operations such as rotation, displacement, montage and small-scale convergent-divergent of image so the vector that obtains in paster space expression robustness is higher.
Adapting to image matching algorithm: judge that whether two width of cloth pictures comprise unified object or scene, can weigh according to their similarity simply.But,, make the pattern that has (Pattern) of some object itself also lose owing to broken up the position constraint of local feature thereupon.Therefore, in the research only similarity as a kind of reference value of selecting candidate's copy, that is: will be regarded as candidate's picture with the picture that source picture analogies degree surpasses certain threshold value.In further matching algorithm, use hidden Markov model that the spatial relation between paster is carried out modeling, can recognize the pattern that comprises in the picture automatically like this.Then, in all candidate's picture set, search for these patterns to find final result.The discovery of these patterns does not need to train in advance, is adaptive.
Touch image matching system operational scheme constructed on the basis of type in the paster space as shown in Figure 3.System is divided into off-line training and two parts of online coupling.In the off-line part, at first collect a large amount of pictures as training set, all pasters to training set carry out carrying out cluster analysis after paster extracts again, use said method structure paster space then.In the online compatible portion, when each tests picture arrival, at first carry out paster and extract, re-use the paster space that trains each test picture is carried out paster expression and other vector quantization expression of picture level.Carry out similarity according to vector expression then and calculate, the foundation that its result judges as the copy picture of same object or scene (promptly comprise to).

Claims (3)

1. image search method based on handheld terminal is characterized in that may further comprise the steps:
1) the camera routine interface that calls handheld terminal carries out photo acquisition;
2) the networking transmission interface that calls handheld terminal is sent to server with picture;
3) result who at last server is returned shows in user interface.
2. a kind of image search method based on handheld terminal according to claim 1 is characterized in that: server process adopts treatment step to comprise after receiving picture in the said step 2:
21) accept the picture that client-side program is uploaded;
22) picture being carried out paster decomposes;
23) paster is mapped in the paster space;
24) with the storehouse in existing picture mate;
25) according to the point of interest of matching result predictive user or the position of taking pictures;
26) relevant information is sent to client-side program.
3. a kind of image search method based on handheld terminal according to claim 2 is characterized in that: the paster decomposition method may further comprise the steps said step 22):
31) make up quadrature paster spatial model:
A, every width of cloth picture is modeled as the combination of one group of paster, the statistical nature that on contained all the member's pasters of a width of cloth picture, draws is used for representing the characteristic of this picture;
B, through collecting large-scale picture; And the paster of all pictures gathered and carry out cluster analysis; Find the paster that compacts a most set
Figure FDA00001848794900011
through the right similarity of all pasters in weighing
Figure FDA00001848794900012
; Construct a similarity matrix R; R is carried out obtaining an eigenvectors after the spectral factorization; The vector of this group paster during vector is
Figure FDA00001848794900013
is expressed, and the space that this eigenvectors is opened is the paster space of asking;
C, use spectral factorization (Spectral Decomposition) are optimized the orthogonality in space, and process is following: at first, calculate
Figure FDA00001848794900014
In all pasters to (u i, u j) the cosine similarity and put into matrix R=[r Ij] M * m, wherein
r ij = Sim ( u → i , u → j ) = cos ( u → i , u → j ) = u → i u → j | u → i | | u → j |
Figure FDA00001848794900016
Be paster u iAnd u jCorresponding minutia vector then, carries out spectral factorization to R,
R=VΛV T=(VΛ 1/2V T)(VΛ 1/2V T)=C TC
Wherein Λ is the diagonal matrix that has comprised the R eigenwert; The corresponding vector of all pasters in V is the characteristic of correspondence vector matrix, and Matrix C has comprised
Figure FDA00001848794900021
.The actual base vector that comprises one group of linear space of C that obtain this moment, the space that all base vectors are opened is the quadrature paster space of being asked.
32) be independent of the feature representation technology of conversion: 31) on the basis in said paster space, arbitrary non-ly in the paster space, be expressed as with reference to paster
Figure FDA00001848794900022
:
C T u → = R u
u → = ( C T ) - 1 R u
R wherein uBe the m dimensional vector, expressed paster u with
Figure FDA00001848794900025
In all relations with reference to paster.This moment, any image T can be expressed as the vector sum of the vector that all corresponding pasters of this picture form in the paster space in the paster space, promptly
T = Σ k = 1 n u → k
33) Adaptive Matching Algorithm: the picture that will be regarded as the candidate above the picture of certain threshold value with source picture analogies degree; Use hidden Markov model (Hidden Markov Model) that the spatial relation between paster is carried out modeling; Come to recognize automatically the pattern that comprises in the picture; Then, in all candidate's picture set, search for these patterns to find final result.
CN201210228711.5A 2012-07-04 2012-07-04 Image retrieval method and system based on hand-held terminal Active CN102789489B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210228711.5A CN102789489B (en) 2012-07-04 2012-07-04 Image retrieval method and system based on hand-held terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210228711.5A CN102789489B (en) 2012-07-04 2012-07-04 Image retrieval method and system based on hand-held terminal

Publications (2)

Publication Number Publication Date
CN102789489A true CN102789489A (en) 2012-11-21
CN102789489B CN102789489B (en) 2014-08-27

Family

ID=47154892

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210228711.5A Active CN102789489B (en) 2012-07-04 2012-07-04 Image retrieval method and system based on hand-held terminal

Country Status (1)

Country Link
CN (1) CN102789489B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103347032A (en) * 2013-08-01 2013-10-09 赵频 Method and system for making friends
WO2014114144A1 (en) * 2013-01-25 2014-07-31 Tencent Technology (Shenzhen) Company Limited Method, server and terminal for information interaction
CN104125487A (en) * 2013-04-28 2014-10-29 腾讯科技(深圳)有限公司 Method and device for uploading image data
CN104933602A (en) * 2015-06-26 2015-09-23 南京鹰眼传媒科技有限公司 Mobile outdoor advertisement monitoring service system and method
CN106156252A (en) * 2015-04-28 2016-11-23 联想(北京)有限公司 A kind of information processing method and electronic equipment
CN110262503A (en) * 2019-07-01 2019-09-20 百度在线网络技术(北京)有限公司 Unmanned sales cart dispatching method, device, equipment and readable storage medium storing program for executing

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101847154A (en) * 2010-02-26 2010-09-29 宇龙计算机通信科技(深圳)有限公司 Method and system for inquiring information and method for mobile terminal to inquire information
CN101924992A (en) * 2010-07-30 2010-12-22 中国电信股份有限公司 Method, system and equipment for acquiring scene information through mobile terminal
CN101945327A (en) * 2010-09-02 2011-01-12 郑茂 Wireless positioning method and system based on digital image identification and retrieve

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101847154A (en) * 2010-02-26 2010-09-29 宇龙计算机通信科技(深圳)有限公司 Method and system for inquiring information and method for mobile terminal to inquire information
CN101924992A (en) * 2010-07-30 2010-12-22 中国电信股份有限公司 Method, system and equipment for acquiring scene information through mobile terminal
CN101945327A (en) * 2010-09-02 2011-01-12 郑茂 Wireless positioning method and system based on digital image identification and retrieve

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014114144A1 (en) * 2013-01-25 2014-07-31 Tencent Technology (Shenzhen) Company Limited Method, server and terminal for information interaction
US9832614B2 (en) 2013-01-25 2017-11-28 Tencent Technology (Shenzhen) Company Limited Method, server and terminal for information interaction
US10863310B2 (en) 2013-01-25 2020-12-08 Tencent Technology (Shenzhen) Company Limited Method, server and terminal for information interaction
CN104125487A (en) * 2013-04-28 2014-10-29 腾讯科技(深圳)有限公司 Method and device for uploading image data
CN104125487B (en) * 2013-04-28 2016-05-11 腾讯科技(深圳)有限公司 A kind of method and apparatus of uploading view data
CN103347032A (en) * 2013-08-01 2013-10-09 赵频 Method and system for making friends
CN106156252A (en) * 2015-04-28 2016-11-23 联想(北京)有限公司 A kind of information processing method and electronic equipment
CN104933602A (en) * 2015-06-26 2015-09-23 南京鹰眼传媒科技有限公司 Mobile outdoor advertisement monitoring service system and method
CN110262503A (en) * 2019-07-01 2019-09-20 百度在线网络技术(北京)有限公司 Unmanned sales cart dispatching method, device, equipment and readable storage medium storing program for executing
CN110262503B (en) * 2019-07-01 2022-06-03 百度在线网络技术(北京)有限公司 Unmanned vending vehicle scheduling method, device, equipment and readable storage medium

Also Published As

Publication number Publication date
CN102789489B (en) 2014-08-27

Similar Documents

Publication Publication Date Title
CN102789489B (en) Image retrieval method and system based on hand-held terminal
CN110245709B (en) 3D point cloud data semantic segmentation method based on deep learning and self-attention
CN107133325B (en) Internet photo geographic space positioning method based on street view map
Li et al. Estimating building age from Google street view images using deep learning (short paper)
Cheng et al. A data-driven point cloud simplification framework for city-scale image-based localization
CN110969648B (en) 3D target tracking method and system based on point cloud sequence data
CN105678590B (en) Cloud model-based topN recommendation method for social network
CN103530649A (en) Visual searching method applicable mobile terminal
CN109711443A (en) Floor plan recognition methods, device, equipment and storage medium neural network based
CN114241464A (en) Cross-view image real-time matching geographic positioning method and system based on deep learning
CN108537887A (en) Sketch based on 3D printing and model library 3-D view matching process
CN110070573A (en) Joint figure determines method, apparatus, equipment and storage medium
CN110659369A (en) On-orbit high-precision lightweight global image control point database construction method and system
CN111566700A (en) Method and control device for determining the position of an object in the surroundings of the object by means of multi-task learning
CN112668662B (en) Outdoor mountain forest environment target detection method based on improved YOLOv3 network
CN114494881A (en) Method, device and terminal for detecting remote sensing image change based on subdivision grid
US11138785B2 (en) Method and system for generating 3D image of character
Liu et al. Robust and accurate mobile visual localization and its applications
Zhu et al. Potentials of RGB-D cameras in as-built indoor environment modeling
CN114239250A (en) System and method for territorial space planning design
CN107133284A (en) A kind of view method for searching three-dimension model based on prevalence study
Yi et al. Cloud-Based Positioning Method with Visualized Signal Images
Bing Wireless network for computer puzzle online software cloud platform based on CBIR and sustainable computing
CN111553371A (en) Image semantic description method and system based on multi-feature extraction
Harshit et al. Advancements in open-source photogrammetry with a point cloud standpoint

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant