JP2018022390A

JP2018022390A - Verification device, method, and program

Info

Publication number: JP2018022390A
Application number: JP2016153959A
Authority: JP
Inventors: 豪入江; Takeshi Irie; 之人渡邉; Yukito Watanabe; 勇五十嵐; Isamu Igarashi; 黒住隆行; Takayuki Kurozumi; 隆行黒住; 杵渕　哲也; Tetsuya Kinebuchi; 哲也杵渕
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2016-08-04
Filing date: 2016-08-04
Publication date: 2018-02-08
Anticipated expiration: 2036-08-04
Also published as: JP6482505B2

Abstract

PROBLEM TO BE SOLVED: To enable collation between images with high accuracy and with high efficiency.SOLUTION: A feature quantity coding part 11 calculates a feature quantity code obtained by quantizing a feature quantity vector associated with each partial area specified in a reference image to one or more discrete values. A corresponding candidate calculation part 12 calculates distance between a certain calculated feature quantity code and a feature quantity vector associated with each partial area specified in a query image, and calculates a corresponding candidate between the partial area specified in the reference image corresponding to the feature quantity code and the partial area specified in the query image on the basis of the distance. A determination part 13 determines whether the corresponding candidate is appropriate on the basis of a geometric relation between geometric information associated with the partial area specified in the reference image being a corresponding candidate and geometric information associated with the partial area specified in the query image, and determines the corresponding candidate to be corresponding when the corresponding candidate is appropriate.SELECTED DRAWING: Figure 1

Description

本発明は、検証装置、方法、及びプログラムに係り、特に、２枚の画像の各々に規定された部分領域間の対応の適否を検証する検証装置、方法、及びプログラムに関する。 The present invention relates to a verification apparatus, method, and program, and more particularly, to a verification apparatus, method, and program for verifying the suitability of correspondence between partial areas defined in each of two images.

近年、画像認識技術の進展が目覚ましい。これまでは、顔・指紋認証やファクトリーオートメーション等、認識する対象や環境が限定されている利用領域が中心的であった。最近では、スマートフォン等の小型な撮像デバイスの普及に伴い、一般利用者が自由な場所や環境で、任意の対象を撮影したような自由撮影画像の認識に対する産業上の要請も増えてきている。実世界とウェブ世界の商品を相互につなぐO2Oサービスや、実環境に存在する様々なランドマークを認識して情報を提供する情報案内／ナビゲーションサービスなどへの期待は特に高い。 In recent years, progress in image recognition technology has been remarkable. Up to now, the use areas where the recognition target and environment are limited such as face / fingerprint authentication and factory automation have been the focus. Recently, with the spread of small imaging devices such as smartphones, there is an increasing industrial demand for recognition of free shot images in which a general user has shot an arbitrary object in a free place or environment. Expectations are particularly high for O2O services that connect products in the real world and the web world, and information guidance / navigation services that recognize various landmarks in the real environment and provide information.

このような新たな用途に供される画像認識技術にはいくつかの形態がありうるが、代表的なものの一つが物体検索に基づく画像認識である。その典型的な手続きを、非特許文献１に記載の技術に従って概説する。まず、各々の画像の輝度値を解析することで、特徴的な部分領域を多数抽出し、各部分領域の特徴を実数値や整数値からなる特徴量ベクトルとして表現する（一般に局所特徴量などと呼ばれる）。次に、互いに異なる二つの画像に含まれる部分領域について特徴量ベクトル同士のユークリッド距離を測ることで、異なる画像間の部分領域同士の対応を取り、多数の対応が存在する画像ペアほど同一の物体が写っていると見做す。 There can be several forms of image recognition technology for such new applications, but one of the typical ones is image recognition based on object retrieval. The typical procedure is outlined according to the technique described in Non-Patent Document 1. First, by analyzing the luminance values of each image, a large number of characteristic partial areas are extracted, and the characteristics of each partial area are expressed as feature quantity vectors consisting of real values and integer values (generally, local feature quantities etc. be called). Next, by measuring the Euclidean distance between feature quantity vectors for partial areas included in two different images, the correspondence between the partial areas between different images is taken, and image pairs having a large number of correspondences have the same object. It is assumed that is reflected.

物体検索に基づく画像認識では、事前に認識したい物体を撮影した画像（参照画像）のデータベースを構築する。物体検索によって、データベース内の参照画像のうち、撮影したクエリ画像と同一の物体が写っているものを検索することにより、クエリ画像中に存在する物体を特定することができるのである。 In image recognition based on object retrieval, a database of images (reference images) obtained by photographing an object to be recognized in advance is constructed. By searching for a reference image in the database that contains the same object as the captured query image, the object existing in the query image can be specified by object search.

物体検索の最大の特徴の一つは、一枚の画像を一つ以上の部分領域（およびそれを記述する特徴量ベクトル）の集合として表現することである。一言で同一の物体が写った画像といっても、どの画像にも同じ位置や姿勢（部分領域の角度）、大きさで写っているわけではなく、画像によってさまざまな写り方で撮影されているのが普通である。まして、一般利用者が自由撮影したような画像においては、事前に物体の写り方を知ることは多くの場合ほぼ不可能である。然るに、画像を記述する特徴量ベクトルは、位置・姿勢・大きさに依らない不変性を持つことが望ましい。 One of the greatest features of object search is to express one image as a set of one or more partial regions (and feature vector describing them). Even if it is an image that shows the same object in a single word, it does not appear in the same position, orientation (partial area angle), and size in every image, but it is taken in various ways depending on the image. It is normal. In addition, it is almost impossible to know in advance how to capture an object in an image that a general user freely shoots. However, it is desirable that the feature vector describing the image has invariance independent of the position, orientation, and size.

画像一枚全体を一つのベクトルで表現するような大域的な特徴量では、望ましい不変性を得ることは難しい。例えば各ピクセルの色（RGB値）をベクトルに並べたものは、位置・姿勢・大きさいずれに対しても不変ではない。一方、一部の情報を抽象化したもの、例えば、色ヒストグラム等は、位置や姿勢に対する不変性は持ちうるが、大きさに対しては不変的ではない。また、物体の一部が欠けていたりする場合に対しても脆弱であるなど、精度が容易に低下しやすい。 It is difficult to obtain the desired invariance with a global feature amount in which an entire image is represented by a single vector. For example, the color (RGB value) of each pixel arranged in a vector is not invariant with respect to position, orientation, and size. On the other hand, an abstraction of some information, such as a color histogram, can have invariance to position and orientation, but is not invariant to size. In addition, the accuracy is easily lowered, for example, the object is weak even when a part of the object is missing.

一方、物体検索では、部分領域の集合によって画像を表現するが、この場合、各部分領域は画像中のどこに存在したとしても、部分領域としては同一となるから、位置に対して不変性を持っている。また、部分領域を記述する特徴量ベクトルとして、姿勢や大きさに対して不変性を持つものが発明されている。例えば非特許文献１に記載のScale Invariant Feature Transform (SIFT)が代表例である。 On the other hand, in object search, an image is expressed by a set of partial areas. In this case, the partial areas are the same as the partial areas wherever they exist in the image. ing. In addition, a feature vector that describes a partial region has been invented that has invariance with respect to posture and size. For example, Scale Invariant Feature Transform (SIFT) described in Non-Patent Document 1 is a representative example.

以上の通り、物体検索の典型的な手続きによれば、画像を一つ以上の部分領域の集合によって表現することで、位置・姿勢・大きさによらず、頑健に同一の物体を含む画像を検索することができるのである。 As described above, according to a typical procedure for object search, an image is represented by a set of one or more partial regions, so that an image including the same object can be robustly used regardless of position, posture, and size. You can search.

しかしながら、上記物体検索の手続きにも重大な問題がある。部分領域同士の対応は、異なる２枚の画像それぞれから抽出された部分領域の特徴量ベクトル間の距離に基づいて判定するが、現実的には、たとえ異なる物体同士であっても非常に近しい特徴量ベクトルが得られてしまうような場合が多い。この結果として、本来対応すべきではないような部分領域間での誤対応が発生し、異なる物体が検索されてしまうことがあるのである。 However, the object search procedure has a serious problem. The correspondence between the partial areas is determined based on the distance between the feature vectors of the partial areas extracted from two different images, but in reality, even if they are different objects, they are very close features. In many cases, a quantity vector is obtained. As a result, an erroneous correspondence between partial areas that should not be handled originally occurs, and a different object may be searched.

このような問題を鑑み、異なる物体が検索されてしまうことを防止することを目的として、部分領域同士の対応の適否を検証する検証技術が発明されてきている。 In view of such a problem, a verification technique for verifying the suitability of correspondence between partial areas has been invented for the purpose of preventing different objects from being searched.

非特許文献１には、一般化ハフ変換に基づく検証方法が開示されている。同一の物体から得られた部分領域であれば、物体上の対応する部分領域間の位置・姿勢・大きさの変化は、撮影視点に依存して一貫性を持つという仮定に基づいて検証を実行する方法である。まず、異なる２枚の画像間で、特徴量ベクトル同士の実数値距離を計算して部分領域間の対応候補を得たのち、対応候補となった部分領域間の位置・姿勢・大きさの「ずれ」を求める。これらのずれは４次元の実数値、すなわち、部分領域の位置（画像上のx, y座標）、角度、および、部分領域の大きさ（スケール）の４つの量のずれとして求められる。この４次元のずれに基づいて、４次元ヒストグラムを構成したとすると、同一の物体から得られた対応する部分領域の集合は、部分領域の位置・姿勢・大きさについて一貫性を持ってずれているはずであるから、これらはヒストグラムのごく少数のビンに集中して分布することが想定される。したがって、頻度の高いビンに分布している対応候補のみを真に有効な対応であると見做し、それら以外を有効な対応ではないとして削除する。 Non-Patent Document 1 discloses a verification method based on a generalized Hough transform. For partial regions obtained from the same object, verification is performed based on the assumption that changes in position, posture, and size between corresponding partial regions on the object are consistent depending on the shooting viewpoint. It is a method to do. First, between two different images, a real value distance between feature vectors is calculated to obtain a correspondence candidate between the partial areas, and then the position, posture, and size of the partial areas that have become correspondence candidates are expressed as “ Find the "shift". These deviations are obtained as four-dimensional real values, that is, deviations of four amounts of the position of the partial area (x and y coordinates on the image), the angle, and the size (scale) of the partial area. If a four-dimensional histogram is constructed based on this four-dimensional displacement, the set of corresponding partial areas obtained from the same object will be consistently displaced with respect to the position, orientation, and size of the partial areas. As these should be, they are assumed to be concentrated in a very small number of bins in the histogram. Accordingly, only the correspondence candidates distributed in the bins with high frequency are regarded as truly effective correspondences, and the other candidates are deleted as not being effective correspondences.

特許文献１には、非特許文献１を改良した技術が開示されている。対応候補である部分領域の位置・姿勢・大きさのずれに基づいて対応の適否を判定することは同様であるが、特徴量ベクトルとしてアフィン不変局所特徴量を用いることによって、姿勢として３次元回転角度を考えている。結果として、非特許文献１の技術よりもさらに精細な検証を可能にしている。 Patent Document 1 discloses a technique obtained by improving Non-Patent Document 1. Although it is the same to determine the suitability of the correspondence based on the position, orientation, and size deviation of the partial area that is a candidate for correspondence, the affine-invariant local feature quantity is used as the feature quantity vector, so that the orientation is three-dimensionally rotated. Thinking about the angle. As a result, a finer verification than that of Non-Patent Document 1 is possible.

非特許文献２には、異なるアプローチに基づく検証方法が開示されている。やはり、異なる画像間で部分領域同士の対応候補を得るまでは先の先行技術と同様であるが、この方法では、複数の対応候補を集合として見たときの部分領域の位置のずれが、特定の線形変換に拘束されているような対応候補のみを有効な対応であるとみなすことにより、有効ではない対応候補を削除する。 Non-Patent Document 2 discloses a verification method based on a different approach. After all, it is the same as the prior art until the corresponding candidates for the partial areas between different images are obtained, but with this method, the position shift of the partial areas when a plurality of corresponding candidates are viewed as a set is specified. By considering only correspondence candidates that are constrained by the linear transformation to be valid correspondences, correspondence candidates that are not valid are deleted.

特開２０１５−９５１５６号公報JP-A-2015-95156

D.G.Lowe,“Distinctive Image Features from Scale-Invariant Keypoints ”, International Journal of Computer Vision, pp.91-110, 2004D.G.Lowe, “Distinctive Image Features from Scale-Invariant Keypoints”, International Journal of Computer Vision, pp.91-110, 2004 J. Philbin, O. Chum, M. Isard, Josef Sivic and Andrew Zisserman. Object retrieval with large vocabularies and fast spatial matching 1470-1477, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2007.J. Philbin, O. Chum, M. Isard, Josef Sivic and Andrew Zisserman.Object retrieval with large vocabularies and fast spatial matching 1470-1477, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2007.

大局的にみれば、既存の発明は、まず特徴量ベクトル同士の高次元な実数値／整数値距離の計算に基づいて対応候補を得たのち、部分領域の位置のずれを精緻に解析することによって、対応候補の適否を検証している。このような方式による検証は、高精度ではあるものの、膨大な時間・空間計算量がかかるという問題があった。 From a global perspective, the existing invention first obtains correspondence candidates based on the calculation of high-dimensional real / integer value distances between feature vectors, and then analyzes the position shift of the partial region precisely. Thus, the suitability of the response candidate is verified. Although verification by such a method is highly accurate, there is a problem that it takes a huge amount of time and space.

すなわち、対応候補を得るには、ある画像に含まれる全ての特徴量ベクトルと、別の画像に含まれるそれらとの全ての高次元な実数値／整数値の距離を求める必要あるが、これには非常に多くの計算が必要となり、それだけ処理時間を要してしまう。また、参照画像データベースに存在する全ての画像について、各々から抽出された全ての実数値／整数値の高次元な実数値／整数値の特徴量ベクトルを記憶しておく必要があるため、HDD容量／メモリ量も膨大なものとなってしまう。 That is, in order to obtain a correspondence candidate, it is necessary to obtain all high-dimensional real / integer distances between all feature vectors contained in one image and those contained in another image. Requires a lot of calculation and requires much processing time. In addition, for all images existing in the reference image database, it is necessary to store a high-dimensional real value / integer value feature vector extracted from each real value / integer value. / The amount of memory will be enormous.

さらに、部分領域の位置（あるいは姿勢・大きさ）についても、これらは通常実数値で表現されるため、これらの精緻なずれを求める場合にもそれぞれ多数の実数値演算が必要となるし、また記憶にも相当の容量が必要となってしまう。 In addition, since the position (or posture / size) of the partial area is usually expressed by a real value, a large number of real value operations are required to obtain these precise deviations. A considerable amount of memory is also required for storage.

以上のように、現在に至るまで、高精度でありながら、時間・空間計算量的に効率的に対応の適否を検証できる検証技術は発明されていなかった。 As described above, up to the present, no verification technique has been invented that can verify the suitability of the correspondence efficiently in terms of time and space calculations while being highly accurate.

本発明は、上記問題点を解決するために成されたものであり、実数値／整数値の特徴量ベクトルを低容量な符号に圧縮することにより、高精度かつ高効率に画像間の照合を可能にする検証装置、方法、及びプログラムを提供することを目的とする。 The present invention has been made to solve the above-described problems. By compressing a real-value / integer-value feature vector into a low-capacity code, collation between images can be performed with high accuracy and high efficiency. An object of the present invention is to provide a verification apparatus, a method, and a program that enable the verification.

上記目的を達成するために、第１の発明に係る検証装置は、第一の画像に規定された一つ以上の部分領域の各々に関連づけられた、当該部分領域の少なくとも大きさを含む幾何情報及び特徴量ベクトルと、第二の画像に規定された一つ以上の部分領域の各々に関連づけられた前記幾何情報及び前記特徴量ベクトルとが入力された時に、前記第一の画像に規定された前記部分領域と、前記第二の画像に規定された前記部分領域との対応を求める検証装置であって、前記第一の画像に規定された各々の前記部分領域に関連づけられた前記特徴量ベクトルを、一つ以上の離散値に量子化した特徴量符号を求める特徴量符号化部と、前記第一の画像について求めたある前記特徴量符号と、前記第二の画像に規定された各々の前記部分領域に関連づけられた前記特徴量ベクトルとの距離を求め、当該距離に基づいて、前記特徴量符号に対応する前記第一の画像に規定された前記部分領域と、前記第二の画像に規定された前記部分領域との対応候補を求める対応候補計算部と、前記対応候補である前記第一の画像に規定された前記部分領域に関連づけられた前記幾何情報と、前記第二の画像に規定された前記部分領域に関連づけられた前記幾何情報との幾何的関係に基づいて、当該対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とする判定部と、を含んで構成されている。 In order to achieve the above object, the verification device according to the first aspect of the invention relates to geometric information including at least the size of the partial area associated with each of the one or more partial areas defined in the first image. And the feature quantity vector and the geometric information and the feature quantity vector associated with each of the one or more partial areas defined in the second image are specified in the first image. A verification apparatus for obtaining a correspondence between the partial area and the partial area defined in the second image, wherein the feature vector associated with each partial area defined in the first image A feature amount encoding unit for obtaining a feature amount code quantized into one or more discrete values, the feature amount code obtained for the first image, and each of the feature amount codes defined for the second image Associated with the subregion Further, a distance from the feature quantity vector is obtained, and based on the distance, the partial area defined in the first image corresponding to the feature quantity code and the partial area defined in the second image A correspondence candidate calculation unit that obtains a correspondence candidate with: the geometric information associated with the partial region defined in the first image that is the correspondence candidate; and the partial region defined in the second image A determination unit that determines whether the corresponding candidate is appropriate based on a geometric relationship with the geometric information associated with the information, and if it is appropriate, a determination unit that corresponds to the determination. ing.

また、第２の発明に係る検証装置において、前記幾何情報を一つ以上の離散値に量子化し、幾何情報符号を求める幾何情報符号化部を更に含み、前記判定部は、前記対応候補である前記第一の画像に規定された前記部分領域に関連づけられた前記幾何情報から求めた前記幾何情報符号と、前記第二の画像に規定された前記部分領域に関連づけられた前記幾何情報から求めた前記幾何情報符号との幾何的関係に基づいて、当該対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とするようにしてもよい。 In the verification device according to the second aspect of the present invention, the verification device further includes a geometric information encoding unit that quantizes the geometric information into one or more discrete values and obtains a geometric information code, and the determination unit is the correspondence candidate Obtained from the geometric information code obtained from the geometric information associated with the partial area defined in the first image and the geometric information associated with the partial area defined in the second image Based on the geometric relationship with the geometric information code, it may be determined whether or not the corresponding candidate is appropriate, and if it is appropriate, it may be made to correspond.

また、第３の発明に係る検証装置において、前記特徴量符号化部が、前記第一の画像に規定された各々の前記部分領域に関連づけられた前記特徴量ベクトルの各要素についてその正負に基づいて二値量子化することで求めた二値ベクトルを特徴量符号として求めるようにしてもよい。 Further, in the verification device according to the third invention, the feature amount encoding unit is based on the sign of each element of the feature amount vector associated with each of the partial areas defined in the first image. Alternatively, a binary vector obtained by binary quantization may be obtained as a feature amount code.

また、第４の発明に係る検証装置において、前記判定部は、前記対応候補毎の前記幾何的関係に基づくヒストグラムを求め、前記ヒストグラムの頻度が高いビンに対応する前記対応候補が適当であると判定するようにしてもよい。 In the verification device according to the fourth invention, the determination unit obtains a histogram based on the geometric relationship for each correspondence candidate, and the correspondence candidate corresponding to a bin having a high frequency of the histogram is appropriate. You may make it determine.

第５の発明に係る検証方法は、第一の画像に規定された一つ以上の部分領域の各々に関連づけられた、当該部分領域の少なくとも大きさを含む幾何情報及び特徴量ベクトルと、第二の画像に規定された一つ以上の部分領域の各々に関連づけられた前記幾何情報及び前記特徴量ベクトルとが入力された時に、前記第一の画像に規定された前記部分領域と、前記第二の画像に規定された前記部分領域との対応を求める検証装置における検証方法であって、特徴量符号化部が、前記第一の画像に規定された各々の前記部分領域に関連づけられた前記特徴量ベクトルを、一つ以上の離散値に量子化した特徴量符号を求めるステップと、対応候補計算部が、前記第一の画像について求めたある前記特徴量符号と、前記第二の画像に規定された各々の前記部分領域に関連づけられた前記特徴量ベクトルとの距離を求め、当該距離に基づいて、前記特徴量符号に対応する前記第一の画像に規定された前記部分領域と、前記第二の画像に規定された前記部分領域との対応候補を求めるステップと、判定部が、前記対応候補である前記第一の画像に規定された前記部分領域に関連づけられた前記幾何情報と、前記第二の画像に規定された前記部分領域に関連づけられた前記幾何情報との幾何的関係に基づいて、当該対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とするステップと、を含んで実行することを特徴とする。 A verification method according to a fifth aspect of the present invention includes a geometric information and a feature quantity vector including at least the size of the partial area associated with each of the one or more partial areas defined in the first image; When the geometric information and the feature vector associated with each of the one or more partial regions defined in the image of the first image are input, the partial region defined in the first image, and the second A verification method in a verification device for obtaining a correspondence with the partial area defined in the image of the feature, wherein a feature amount encoding unit is associated with each partial area defined in the first image. A step of obtaining a feature amount code obtained by quantizing a quantity vector into one or more discrete values, a certain feature amount code obtained by the correspondence candidate calculation unit for the first image, and the second image Each said A distance from the feature vector associated with the partial area is obtained, and based on the distance, the partial area defined in the first image corresponding to the feature code and the second image are defined. A step of obtaining a correspondence candidate with the partial area, and a determination unit that adds the geometric information associated with the partial area defined in the first image that is the correspondence candidate to the second image. Determining whether or not the corresponding candidate is appropriate based on the geometric relationship with the geometric information associated with the defined partial region, and if appropriate, corresponding to the step; It is characterized by including.

また、第６の発明に係る検証装置において、幾何情報符号化部が、前記幾何情報を一つ以上の離散値に量子化し、幾何情報符号を求めるステップを更に含み、前記判定部が判定するステップは、前記対応候補である前記第一の画像に規定された前記部分領域に関連づけられた前記幾何情報から求めた前記幾何情報符号と、前記第二の画像に規定された前記部分領域に関連づけられた前記幾何情報から求めた前記幾何情報符号との幾何的関係に基づいて、当該対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とするようにしてもよい。 Further, in the verification device according to the sixth invention, the geometric information encoding unit further includes a step of quantizing the geometric information into one or more discrete values to obtain a geometric information code, wherein the determination unit determines Is associated with the geometric information code obtained from the geometric information associated with the partial area defined in the first image as the correspondence candidate and the partial area defined in the second image. Further, based on the geometric relationship with the geometric information code obtained from the geometric information, it is determined whether or not the corresponding candidate is appropriate. .

また、第７の発明に係る検証装置において、前記特徴量符号化部が符号化するステップが、前記第一の画像に規定された各々の前記部分領域に関連づけられた前記特徴量ベクトルの各要素についてその正負に基づいて二値量子化することで求めた二値ベクトルを特徴量符号として求めるようにしてもよい。 Further, in the verification device according to the seventh invention, the step of encoding by the feature amount encoding unit includes each element of the feature amount vector associated with each of the partial regions defined in the first image. A binary vector obtained by performing binary quantization on the basis of the sign may be obtained as a feature amount code.

第８の発明に係るプログラムは、コンピュータを、第１〜第４の発明に係る検証装置の各部として機能させるためのプログラムである。 A program according to an eighth invention is a program for causing a computer to function as each part of the verification device according to the first to fourth inventions.

本発明の検証装置、方法、及びプログラムによれば、第一の画像に規定された各々の部分領域に関連づけられた特徴量ベクトルを、一つ以上の離散値に量子化した特徴量符号を求め、求めたある特徴量符号と、第二の画像に規定された各々の部分領域に関連づけられた特徴量ベクトルとの距離を求め、当該距離に基づいて、特徴量符号に対応する第一の画像に規定された部分領域と、第二の画像に規定された部分領域との対応候補を求め、対応候補である第一の画像に規定された部分領域に関連づけられた幾何情報と、第二の画像に規定された部分領域に関連づけられた幾何情報との幾何的関係に基づいて、当該対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とすることにより、高精度かつ高効率に画像間の照合を可能にする、という効果が得られる。 According to the verification apparatus, method, and program of the present invention, a feature amount code obtained by quantizing a feature amount vector associated with each partial region defined in the first image into one or more discrete values is obtained. The distance between the obtained feature quantity code and the feature quantity vector associated with each partial area defined in the second image is obtained, and the first image corresponding to the feature quantity code is obtained based on the distance. A candidate for correspondence between the partial region defined in the second image and the partial region defined in the second image, the geometric information associated with the partial region defined in the first image as the correspondence candidate, and the second Based on the geometric relationship with the geometric information associated with the partial area defined in the image, it is determined whether or not the corresponding candidate is appropriate. Collation between images is possible with high accuracy and high efficiency To, the effect is obtained that.

本発明の第１の実施の形態に係る検証装置の構成を示すブロック図である。It is a block diagram which shows the structure of the verification apparatus which concerns on the 1st Embodiment of this invention. 本発明の第１の実施の形態に係る検証装置における検証処理ルーチンを示すフローチャートである。It is a flowchart which shows the verification process routine in the verification apparatus which concerns on the 1st Embodiment of this invention. 二枚の画像の部分領域の幾何情報の幾何的関係を示す図。The figure which shows the geometric relationship of the geometric information of the partial area | region of two images. 二枚の画像の部分領域の幾何情報の幾何的関係を対比したテーブルを示す図。The figure which shows the table which contrasted the geometric relationship of the geometric information of the partial area | region of two images. 本発明の第２の実施の形態に係る検証装置の構成を示すブロック図である。It is a block diagram which shows the structure of the verification apparatus which concerns on the 2nd Embodiment of this invention. 本発明の第２の実施の形態に係る検証装置における検証処理ルーチンを示すフローチャートである。It is a flowchart which shows the verification process routine in the verification apparatus which concerns on the 2nd Embodiment of this invention.

＜本発明の実施の形態に係る概要＞ <Outline according to Embodiment of the Present Invention>

まず、本発明の実施の形態における概要を説明する。 First, an outline of the embodiment of the present invention will be described.

本実施の形態に係る検証装置は、参照画像に規定された一つ以上の部分領域の各々に関連づけられた、当該部分領域の少なくとも大きさを含む幾何情報及び特徴量ベクトルと、クエリ画像に規定された一つ以上の部分領域の各々に関連づけられた幾何情報及び特徴量ベクトルとが入力された時に、参照画像に規定された部分領域と、クエリ画像に規定された部分領域との対応を求める検証装置である。なお、第１の画像が参照画像に対応し、第２の画像がクエリ画像に対応する。 The verification apparatus according to the present embodiment is defined in the query image and the geometric information and feature quantity vector including at least the size of the partial area, which are associated with each of the one or more partial areas defined in the reference image. When the geometric information and the feature vector associated with each of the one or more partial regions are input, the correspondence between the partial region defined in the reference image and the partial region defined in the query image is obtained. It is a verification device. Note that the first image corresponds to the reference image, and the second image corresponds to the query image.

本発明の実施の形態では、２枚の画像のそれぞれから得られた部分領域間の対応候補を求める際に、特徴量ベクトルを量子化することによって、高次元の実数値／整数値距離を計算することなく、対応を検証することができるようになる。また、幾何情報を量子化することにより、より高効率な検証が可能となる。また、参照画像については、元の高次元な実数値／整数値特徴量ベクトルではなく、量子化した低容量な特徴量ベクトルのみを記憶しておけば検証を実行できるので、容量を低減することができる。加えて、本発明の実施の形態において用いる部分領域の幾何情報は、少なくとも部分領域の大きさのみを保持していればよいのであり、位置・姿勢・大きさなどの多次元の幾何情報を統合的かつ精緻に分析する必要は必ずしもない。結果的に、時間・空間双方の観点で、高効率に幾何検証を実行することが可能になる。 In the embodiment of the present invention, a high-dimensional real value / integer value distance is calculated by quantizing a feature vector when obtaining correspondence candidates between partial areas obtained from two images. Without having to do so. In addition, more efficient verification can be performed by quantizing the geometric information. In addition, with respect to the reference image, verification can be executed if only the quantized low-capacity feature vector is stored instead of the original high-dimensional real-value / integer-value feature vector, thus reducing the capacity. Can do. In addition, the geometric information of the partial area used in the embodiment of the present invention only needs to hold at least the size of the partial area, and multidimensional geometric information such as position, posture, and size is integrated. It is not always necessary to analyze accurately and precisely. As a result, it is possible to perform geometric verification with high efficiency in terms of both time and space.

さらに、特徴量ベクトルの各要素が正負いずれであるかに基づいて量子化した場合、結果的に生成される符号であるところの二値ベクトルは、特徴量ベクトルの距離を測る上で重要な元の特徴量ベクトルの情報を精度よく保存することができる。また、時間・空間計算効率に影響の少ないクエリ画像については元の特徴量ベクトルをそのまま利用することで、量子化する情報を最小限に抑えている。結果として、精度を落とさず、高精度な検証が可能である。 Furthermore, when quantizing based on whether each element of the feature vector is positive or negative, the binary vector, which is the resulting code, is an important element for measuring the distance of the feature vector. The feature vector information can be stored with high accuracy. For query images that have little effect on the time / space calculation efficiency, the original feature vector is used as it is to minimize the information to be quantized. As a result, high-precision verification is possible without reducing accuracy.

＜本発明の第１の実施の形態に係る検証装置＞ <Verification apparatus according to the first embodiment of the present invention>

以下、図面を参照して本発明の第１の実施の形態を詳細に説明する。 Hereinafter, a first embodiment of the present invention will be described in detail with reference to the drawings.

＜＜第１の実施の形態の全体構成＞＞ << Overall Configuration of First Embodiment >>

図１は、本発明の第１の実施形態における検証装置１の構成の一例を示すブロック図である。図１に示す検証装置１は、特徴量符号化部１１と、対応候補計算部１２と、判定部１３とを備える。 FIG. 1 is a block diagram showing an example of the configuration of the verification apparatus 1 according to the first embodiment of the present invention. The verification device 1 illustrated in FIG. 1 includes a feature amount encoding unit 11, a correspondence candidate calculation unit 12, and a determination unit 13.

検証装置１は、データベース２と通信手段を介して接続されて相互に情報通信し、任意の画像の情報をデータベース２に登録したり、読み出したりすることができる構成を採る。 The verification device 1 is connected to the database 2 via a communication unit and communicates information with each other, so that information on an arbitrary image can be registered in the database 2 or read out.

ここでいう画像の情報には、少なくとも各画像の部分領域の幾何情報および特徴量ベクトルを含むものとし、これらは相互に関連づけられているものとする。画像の部分領域は、画像の一部領域であればどのように定められても構わない。好ましくは非特許文献１に記載のSIFTなど、公知の部分領域抽出方法によって規定する。 The image information here includes at least the geometric information and the feature amount vector of the partial area of each image, and these are associated with each other. The partial area of the image may be determined in any way as long as it is a partial area of the image. Preferably, it is defined by a known partial region extraction method such as SIFT described in Non-Patent Document 1.

また、部分領域の幾何情報としては、部分領域の位置・姿勢・大きさなどの情報を含むものとするが、本発明の第１の実施形態の一例にあたっては少なくとも大きさが含まれていればよく、好ましくはさらに姿勢、位置も含んでいるものとする。部分領域の大きさについてはどのように定められていても構わないが、例えば部分領域を円形と見做した時の半径や、方形であると見做した時の長辺などに基づいて規定すればよい。大きさに限らず、姿勢や位置などについても、例えば非特許文献１に記載の方法等、適する公知の方法を選択して求めたものを利用することができる。 Further, the geometric information of the partial area includes information such as the position, posture, and size of the partial area, but it is sufficient that at least the size is included in the example of the first embodiment of the present invention. Preferably, it further includes posture and position. The size of the partial area may be determined in any way, but for example, it is defined based on the radius when the partial area is considered to be a circle, the long side when the partial area is assumed to be a square, etc. That's fine. Not only the size but also the posture and position can be obtained by selecting a suitable known method such as the method described in Non-Patent Document 1, for example.

データベース２は、例えば、一般的な汎用コンピュータに実装されているファイルシステムによって構成できる。各画像それぞれを一意に識別可能な識別子（例えば、通し番号によるIDやユニークな画像ファイル名等）を与えるものとし、さらに、当該画像に規定された部分領域、並びに、特徴量ベクトル、あるいは特徴量符号ファイルを記述したファイルも、当該画像の識別子と関連づけて記憶しているものとする。あるいは、同様にRDBMS(Relational Database Management System）などで実装・構成されていても構わない。その他、メタデータとして、例えば画像の内容を表現するもの（画像のタイトル、概要文、又はキーワード等）、画像のフォーマットに関するもの（画像のデータ量、サムネイル等のサイズ）などを含んでいても構わないが、本発明の実施においては必須ではない。 The database 2 can be configured by, for example, a file system implemented on a general general-purpose computer. Each image is given an identifier (for example, an ID by a serial number or a unique image file name) that can uniquely identify each image, and further, a partial region defined in the image, a feature vector, or a feature code It is assumed that the file describing the file is also stored in association with the identifier of the image. Alternatively, it may be similarly implemented and configured by RDBMS (Relational Database Management System) or the like. In addition, the metadata may include, for example, data representing the content of the image (image title, summary text, or keyword), data related to the image format (image data amount, thumbnail size, etc.), and the like. Although not required, it is not essential in the practice of the present invention.

データベース２は、検証装置１の内部にあっても外部にあっても構わず、通信手段は任意の公知ものを用いることができるが、本実施の形態においては、外部にあるものとして、通信手段は、インターネット、TCP/IP等のネットワーク（図示省略）により通信するよう接続されているものとする。 The database 2 may be inside or outside the verification apparatus 1, and any known communication means can be used. In the present embodiment, the communication means is assumed to be outside. Are connected to communicate via a network (not shown) such as the Internet or TCP / IP.

また、検証装置１が備える各部及びデータベース２は、演算処理装置、記憶装置等を備えたコンピュータやサーバ等により構成して、各部の処理がプログラムによって実行されるものとしてもよい。このプログラムは検証装置１が備える記憶装置に記憶されており、磁気ディスク、光ディスク、半導体メモリ等の記録媒体に記録することも、ネットワークを通して提供することも可能である。もちろん、その他いかなる構成要素についても、単一のコンピュータやサーバによって実現しなければならないものではなく、ネットワークによって接続された複数のコンピュータに分散して実現しても構わない。 Further, each unit and database 2 provided in the verification device 1 may be configured by a computer or a server provided with an arithmetic processing device, a storage device, or the like, and the processing of each unit may be executed by a program. This program is stored in a storage device included in the verification apparatus 1, and can be recorded on a recording medium such as a magnetic disk, an optical disk, or a semiconductor memory, or provided through a network. Of course, any other components need not be realized by a single computer or server, but may be realized by being distributed to a plurality of computers connected by a network.

なお、画像の情報自体は必ずしもデータベース２に格納されている必要はなく、たとえば適宜外部から直接入力されるような構成をとっても構わない。このような構成は、例えば物体検索のために本発明の実施の形態を利用するような場合、参照画像については事前に参照画像情報について必要な処理を実施の上データベース２に格納し、クエリ画像については適宜問い合わせのタイミングで外部からクエリ画像情報の入力を受け付けて処理するような用途に向く。具体例を挙げると、図１に記載の検証装置構成の一例においては、データベース２に予め一枚以上の参照画像に関する画像の情報である参照画像情報３が格納されており、これらは上記説明した通り検証装置１と相互に読み出し／登録可能な形態で接続されている。このほか、問い合わせとして入力されるクエリ画像情報４を外部から受け付けられるような構成を取っている。 The image information itself does not necessarily have to be stored in the database 2, and may be configured to be directly input from the outside as appropriate, for example. In such a configuration, for example, when the embodiment of the present invention is used for object search, a reference image is subjected to necessary processing for reference image information in advance and stored in the database 2 to obtain a query image. Is suitable for the purpose of receiving and processing input of query image information from the outside at the timing of inquiry as appropriate. As a specific example, in the example of the configuration of the verification apparatus illustrated in FIG. 1, the reference image information 3 that is image information related to one or more reference images is stored in the database 2 in advance. It is connected to the verification apparatus 1 in a form that can be read / registered mutually. In addition, the configuration is such that the query image information 4 input as an inquiry can be received from the outside.

以降、本発明の第１の実施形態の一例においては、２枚の画像の対応を検証する場合、特に物体検索における用途を見据えて、参照画像情報３として記憶されている参照画像のうちの１枚と、クエリ画像情報４として入力されたクエリ画像のうちの１枚との対応を検証する場合を例に説明していく。複数組の画像の対応を検証するような場合には、以降説明する処理を検証したい組の数分だけ繰り返せばよい。 Thereafter, in the example of the first embodiment of the present invention, when verifying the correspondence between two images, one of the reference images stored as the reference image information 3 especially for the purpose of object search. An example will be described in which the correspondence between the image and one of the query images input as the query image information 4 is verified. In the case of verifying the correspondence between a plurality of sets of images, the processing described below may be repeated as many times as the number of sets to be verified.

＜＜第１の実施の形態の処理部＞＞ << Processing Unit of First Embodiment >>

本実施の形態における検証装置１の各処理部について説明する。 Each processing unit of the verification apparatus 1 in the present embodiment will be described.

特徴量符号化部１１は、参照画像に規定された各々の部分領域に関連づけられた特徴量ベクトルを、一つ以上の離散値に量子化した特徴量符号を求める。ここでは、データベース２から、参照画像情報３を受け取った下で、参照画像情報３の各部分領域に関連づけられた特徴量ベクトルを一つ以上の離散値に量子化した特徴量符号を求める。 The feature amount encoding unit 11 obtains a feature amount code obtained by quantizing a feature amount vector associated with each partial region defined in the reference image into one or more discrete values. Here, after receiving the reference image information 3 from the database 2, a feature amount code obtained by quantizing the feature amount vector associated with each partial region of the reference image information 3 into one or more discrete values is obtained.

求めた特徴量符号は、必要に応じて参照画像情報３と対応づける形でデータベース２に格納する場合もあるし、対応候補計算部１２に出力する場合もある。本発明の第１の実施形態の一例においては、参照画像側に対してのみ本処理を適用することを考え、この場合について説明する。クエリ画像側に本処理を適用せず、参照画像側に対してのみ本処理を適用する効果として、次のことが挙げられる。 The obtained feature amount code may be stored in the database 2 in association with the reference image information 3 as necessary, or may be output to the correspondence candidate calculation unit 12. In the example of the first embodiment of the present invention, the case where the present process is applied only to the reference image side will be described. As an effect of applying this processing only to the reference image side without applying this processing to the query image side, the following can be mentioned.

（１）通常、必ず記憶される参照画像の場合とは異なり、必ずしもクエリ画像を記憶する必要はないため、クエリ画像を低容量に符号化したとしても時間・空間的な計算量への影響はほとんどない。
（２）符号化を行うことで、元の特徴量ベクトルの情報を幾分か落としてしまうことになるため、必要以上の符号化を避けることで精度を保持することができる。 (1) Normally, unlike the case of a reference image that is always stored, it is not always necessary to store a query image. Therefore, even if the query image is encoded at a low capacity, the influence on the temporal and spatial calculation amount is not affected. rare.
(2) By performing the encoding, the information of the original feature vector is somewhat lost, so that it is possible to maintain the accuracy by avoiding unnecessary encoding.

もちろん、クエリ画像側にも同様の処理を適用することもできるし、クエリ画像側のみに対して本処理を適用することもできることはいうまでもない。 Of course, the same processing can be applied to the query image side, and it is needless to say that this processing can be applied only to the query image side.

対応候補計算部１２は、特徴量符号化部１１で参照画像について求めたある特徴量符号と、外部から受け取ったクエリ画像情報４におけるクエリ画像に規定された各々の部分領域に関連づけられた特徴量ベクトルとの距離を求め、当該距離に基づいて、特徴量符号に対応する参照画像に規定された部分領域と、クエリ画像に規定された部分領域との対応候補を求める。クエリ画像情報４には、クエリ画像、並びに当該クエリ画像の部分領域及び特徴量ベクトルの集合の情報を含んでいる。なお、参照画像の全ての部分領域に対して、必ずしも対応候補となるクエリ画像の部分領域が求められる必要はない。 The correspondence candidate calculation unit 12 includes a certain feature amount code obtained for the reference image by the feature amount encoding unit 11 and a feature amount associated with each partial region defined in the query image in the query image information 4 received from the outside. A distance to the vector is obtained, and based on the distance, a correspondence candidate between the partial area defined in the reference image corresponding to the feature amount code and the partial area defined in the query image is obtained. The query image information 4 includes information on a query image and a set of partial regions and feature quantity vectors of the query image. Note that it is not always necessary to obtain a partial area of a query image that is a candidate for correspondence with respect to all partial areas of the reference image.

判定部１３は、対応候補である参照画像に規定された部分領域に関連づけられた幾何情報と、クエリ画像に規定された部分領域に関連づけられた幾何情報との組の各々について、当該組の幾何的関係を求め、求められた幾何的関係に基づいて、当該組の対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とすると判定して判定結果５を出力する。ここで、判定部１３は、対応候補毎の幾何的関係に基づくヒストグラムを求め、ヒストグラムの頻度が高いビンに対応する対応候補が適当であると判定する。 The determination unit 13 performs, for each set of geometric information associated with the partial area defined in the reference image that is a correspondence candidate, and geometric information associated with the partial area defined in the query image, the geometric information of the set. And determines whether the corresponding candidate for the set is appropriate based on the determined geometric relationship, and if it is appropriate, determines that this is a corresponding candidate and outputs determination result 5 To do. Here, the determination unit 13 obtains a histogram based on the geometric relationship for each corresponding candidate, and determines that a corresponding candidate corresponding to a bin having a high histogram frequency is appropriate.

＜＜処理概要＞＞ << Process overview >>

次に、本実施の形態における検証装置１の処理について説明する。図２は、処理の流れを示すフローチャートである。 Next, the process of the verification apparatus 1 in this Embodiment is demonstrated. FIG. 2 is a flowchart showing the flow of processing.

まず、ステップＳ３０１では、特徴量符号化部１１が、参照画像に規定された各々の部分領域に関連づけられた特徴量ベクトルを、一つ以上の離散値に量子化した特徴量符号を求める。求めた特徴量符号は、データベース２に格納する場合もあるし、対応候補計算部１２に伝達する場合もある。また、当該処理は、クエリ画像を受け付ける前に事前に実施しておいても構わない。 First, in step S301, the feature amount encoding unit 11 obtains a feature amount code obtained by quantizing a feature amount vector associated with each partial region defined in the reference image into one or more discrete values. The obtained feature amount code may be stored in the database 2 or may be transmitted to the correspondence candidate calculation unit 12. In addition, this process may be performed in advance before receiving a query image.

続いて、ステップＳ３０２では、対応候補計算部１２が、クエリ画像情報４にある部分領域および特徴量ベクトルの集合を入力として受け付けると、クエリ画像情報４に規定された各々の部分領域に関連づけられた特徴量ベクトルと、参照画像の各部分領域の特徴量符号との距離を求め、この距離に基づいて、当該特徴量符号に対応する参照画像に規定された部分領域と、クエリ画像に規定された部分領域との対応候補として求め、判定部１３に伝達する。 Subsequently, in step S302, when the correspondence candidate calculation unit 12 accepts a set of partial areas and feature amount vectors in the query image information 4 as input, the correspondence candidate calculation unit 12 associates the partial areas defined in the query image information 4 with each partial area. The distance between the feature vector and the feature code of each partial area of the reference image is obtained, and based on this distance, the partial area defined in the reference image corresponding to the feature code and the query image It is obtained as a candidate for correspondence with the partial area, and is transmitted to the determination unit 13.

続いて、ステップＳ３０３では、判定部１３が、対応候補である参照画像に規定された部分領域に関連づけられた幾何情報と、クエリ画像に規定された部分領域に関連づけられた幾何情報との組の各々について、当該組の幾何的関係を求め、求められた幾何的関係に基づいて、当該組の対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とすると判定して判定結果５を出力する。ここで、判定部１３は、対応候補毎の幾何的関係に基づくヒストグラムを求め、ヒストグラムの頻度が高いビンに対応する対応候補が適当であると判定する。 Subsequently, in step S303, the determination unit 13 sets a set of geometric information associated with the partial area defined in the reference image that is a correspondence candidate and geometric information associated with the partial area defined in the query image. For each, the geometric relationship of the set is obtained, and based on the obtained geometric relationship, it is determined whether or not the corresponding candidate for the set is appropriate. Then, the determination result 5 is output. Here, the determination unit 13 obtains a histogram based on the geometric relationship for each corresponding candidate, and determines that a corresponding candidate corresponding to a bin having a high histogram frequency is appropriate.

＜＜第１の実施の形態の各処理の処理詳細＞＞ << Processing Details of Each Process in First Embodiment >>

以降、各処理の詳細処理について、本実施形態における一例を説明する。 Hereinafter, an example of the detailed processing of each processing will be described in the present embodiment.

［特徴量符号化処理］ [Feature encoding]

まず、特徴量符号化部１１における、参照画像のある部分領域の特徴量ベクトルを量子化して特徴量符号を求める処理について説明する。 First, a description will be given of processing in the feature amount encoding unit 11 to obtain a feature amount code by quantizing a feature amount vector of a partial region having a reference image.

高次元の特徴量ベクトルを一つ以上の離散値に量子化する問題は、ベクトル量子化の問題として捉えることができ、任意のベクトル量子化法を適用することができる。例えば、ベクトル量子化の代表的な方法の一つであるＫ平均法（Ｋは自然数）に基づく量子化方法が存在する。 The problem of quantizing a high-dimensional feature quantity vector into one or more discrete values can be regarded as a problem of vector quantization, and any vector quantization method can be applied. For example, there is a quantization method based on the K-average method (K is a natural number), which is one of the typical vector quantization methods.

Ｋ平均法（あるいは、これに限らず実用に供される数多くの量子化法）は、事前に量子化器を学習しておく必要がある。例えばＫ平均法の場合は、予め一定数の特徴量ベクトルの集合を学習データとしてＫ平均法を適用し、Ｋ個のクラスタ中心を求めておく。実際にある特徴量ベクトルを量子化する際には、その特徴量ベクトルに最もクラスタ中心を求め、そのクラスタのクラスタ中心（ベクトル）を特徴量符号として用いればよい。 The K-average method (or many other quantization methods not limited to this) needs to learn a quantizer in advance. For example, in the case of the K-average method, the K-average method is applied in advance using a set of a certain number of feature quantity vectors as learning data to obtain K cluster centers. When a certain feature quantity vector is actually quantized, the cluster center is most determined from the feature quantity vector, and the cluster center (vector) of the cluster may be used as the feature quantity code.

符号化には好ましくは、二値量子化を用いる。二値量子化では、元の実数値または整数値の特徴量ベクトルを、二値ベクトルへと量子化する。二値ベクトルとは、ベクトルの各要素が二値のみを取るようなベクトルである。要素の値は特に問わないが、例えば｛０，１｝、｛１,−１｝などとすることができるであろう。好ましくは、のちの距離計算を簡便にするため、｛１,−１｝とするのがよく、以降これを前提に説明する。仮に二値ベクトルの次元がＢ（Ｂは自然数）であるとすると、あり得る二値ベクトルの場合の数は２Ｂ通りであるから、すなわち２Ｂ個の離散値に量子化していることに相当する。 For encoding, binary quantization is preferably used. In binary quantization, an original real value or integer value feature vector is quantized into a binary vector. A binary vector is a vector in which each element of the vector takes only a binary value. The value of the element is not particularly limited, but may be {0, 1}, {1, -1}, for example. Preferably, in order to simplify the distance calculation later, it is better to set {1, -1}, which will be described below. If the dimension of the binary vector is B (B is a natural number), the number of possible binary vectors is 2B, that is, it is equivalent to quantization to 2B discrete values.

実数値／整数値特徴量ベクトルを二値ベクトルへと量子化する処理の一例について説明する。便宜上、元の実数値／整数値特徴量ベクトルをｘ、二値ベクトルをｂと表すことにすると、例えば以下（１）式により二値ベクトルを得ることができる。 An example of a process of quantizing a real value / integer value feature quantity vector into a binary vector will be described. For convenience, if the original real value / integer value feature quantity vector is represented by x and the binary vector is represented by b, a binary vector can be obtained by the following equation (1), for example.

・・・（１） ... (1)

ここで、ｓｉｇｎは要素ごとの特徴量符号を取る関数であり、より厳密には、特徴量ベクトルｘの要素のうち、０以上を取る要素には１、０未満の要素には−１を返す関数であるとする。上記（１）式を用いることで、元の特徴量ベクトルがどのようなものであっても、それと同一次元の二値ベクトルを得ることができる。 Here, sign is a function that takes a feature quantity code for each element. More strictly, among elements of the feature quantity vector x, 1 is returned for an element that takes 0 or more, and -1 is returned for an element that is less than 0. Suppose that it is a function. By using the above equation (1), it is possible to obtain a binary vector having the same dimension as that of the original feature vector.

一方、物体検索に用いられる特徴量の中には、元来非負値の実数値／整数値によって構成されるものも多く、その場合、上記（１）式による二値ベクトルは常に全要素が１となってしまうという問題がある。この問題を避けるため、好ましくはバイアス成分μを導入する。 On the other hand, many of the feature quantities used for object search are originally composed of non-negative real / integer values. In this case, the binary vector according to the above equation (1) always has 1 for all elements. There is a problem of becoming. In order to avoid this problem, a bias component μ is preferably introduced.

・・・（２） ... (2)

μはｘと同一次元を持つベクトルである。その要素の値は自由に定めてよいが、好ましくは特徴量ベクトル群の平均値、中央値、平均ベクトル、中央値ベクトルなどを用いるとよい。 μ is a vector having the same dimension as x. Although the value of the element may be freely determined, it is preferable to use an average value, median value, average vector, median value vector, or the like of the feature vector group.

さらに、上記（１）式、及び（２）式のように単に要素の符号を取るだけでは情報の欠損が大きく、対応候補を求める際の距離計算の精度が大きく劣化するという問題がある。可能な限り情報を損なわないような二値量子化を実行するため、好ましくは以下（３）式による二値量子化を用いる。 Furthermore, there is a problem that the information loss is large simply by taking the element code as in the above formulas (1) and (2), and the accuracy of the distance calculation when determining the correspondence candidates is greatly deteriorated. In order to execute binary quantization that does not impair information as much as possible, preferably binary quantization according to the following equation (3) is used.

・・・（３） ... (3)

ただし、Ａはｘの次元数と同じサイズの行列であり、Ａ’はＡの転置を表す。この行列Ａは任意の公知の行列構成法で構成することができるが、主成分分析により求めた場合、特徴ベクトル群の情報量を最適に保存するような行列を求めることができるため好適である。また、主成分分析は次元削減法としての側面も持っていることから、二値ベクトルの次元を落としてより軽量な特徴量符号を得ることもできる点で利点がある。 Here, A is a matrix having the same size as the number of dimensions of x, and A ′ represents the transpose of A. This matrix A can be formed by any known matrix construction method, but is preferred because it can be obtained by a principal component analysis because a matrix that optimally stores the information amount of the feature vector group can be obtained. . Since principal component analysis also has an aspect as a dimension reduction method, there is an advantage in that it is possible to obtain a lighter feature quantity code by reducing the dimension of a binary vector.

これらの方法により求めた二値ベクトルは、元の特徴量ベクトルがなす角度成分をよく保存する。対応候補は特徴量ベクトル間の距離に基づいて計算されるが、距離を求める上で角度成分の影響は大きいことが知られている。したがって、角度成分をよく保存することによって、距離計算の精度を劣化させることなく情報を圧縮することが可能になるという利点がある。 The binary vector obtained by these methods well preserves the angle component formed by the original feature vector. The correspondence candidate is calculated based on the distance between the feature quantity vectors, but it is known that the influence of the angle component is large in obtaining the distance. Therefore, by preserving the angle component well, there is an advantage that information can be compressed without degrading the accuracy of distance calculation.

［対応候補計算処理］ [Correspondence candidate calculation processing]

次に、対応候補計算部１２における、異なる２枚の画像間に規定された部分領域同士の対応候補を求める処理について説明する。本発明の第１の実施形態の一例においては、クエリ画像とそれぞれの参照画像の間で、対応する部分領域を決定するために用いる処理である。 Next, a process for obtaining correspondence candidates between partial areas defined between two different images in the correspondence candidate calculation unit 12 will be described. In an example of the first embodiment of the present invention, the process is used to determine a corresponding partial region between a query image and each reference image.

クエリ画像から抽出されたある部分領域をＱ_ｉ、参照画像から抽出されたある部分領域をＲ_ｊと表すものとする。以下では、部分領域Ｑ_ｉとＲ_ｊを例にとり、これらが対応候補であるか否かを判断する処理を説明する。 A partial area extracted from the query image is represented as Q _i , and a partial area extracted from the reference image is represented as R _j . Hereinafter, taking the partial areas Q _i and R _j as an example, a process for determining whether or not these are correspondence candidates will be described.

各部分領域はその部分領域を表現する特徴量ベクトル、あるいは、これを量子化した特徴量符号が関連づけられている。部分領域Ｑ_ｉを記述する特徴量符号をq、Ｒ_ｊを記述する特徴量符号をｒと表すとする。このとき、部分領域同士の距離ｄｉｓｔ（Ｑ_ｉ，Ｒ_ｊ）を次式により求める。 Each partial region is associated with a feature vector expressing the partial region or a feature code obtained by quantizing the feature vector. It is assumed that the feature code describing the partial region Q _i is represented as q and the feature code describing R _j is represented as r. At this time, a distance dist (Q _i , R _j ) between the partial areas is obtained by the following equation.

・・・（４） ... (4)

ここで、本発明の第１の実施形態の一例によれば、ｒは特徴量符号であり、特にこれが｛１, −１｝の二値ベクトルで表現されている場合、上記（４）式は以下のように変換できる。 Here, according to an example of the first embodiment of the present invention, r is a feature value code, and particularly when this is expressed by a binary vector of {1, -1}, the above equation (4) is It can be converted as follows.

・・・（５）
... (5)

さらに、ｑ_ｍ、ｒ_ｍはそれぞれベクトルｑおよびｒのm番目の要素である。ここで、ｒ_ｍは｛１,−１｝のいずれかの値しかとらないから、結局のところ上記（５）式はｑの要素の加減算のみで評価可能であり、距離計算を大きく効率化することができる。 Further, q _m and r _m are the m-th elements of vectors q and r, respectively. Here, since r _m is {1, -1} only take one of the values, after all the (5) is evaluable only addition and subtraction of the elements of q, to increase efficiency of the distance calculation be able to.

実際には、この距離は全ての部分領域の組み合わせに対して計算する。 In practice, this distance is calculated for all partial region combinations.

続いて、求めた部分領域間の距離に基づいて、各部分領域の組が対応候補であるか否かを判定する。参照画像の部分領域Ｒ_ｊに着目したとき、これに最も近いクエリ画像の部分領域がＱ_ｉ、その次に近い部分領域がＱ_ｋであったとする。このとき、下記（６）式の条件を満たすとき、Ｒ_ｊとＱ_ｉが対応候補であると判定する。 Subsequently, based on the obtained distance between the partial areas, it is determined whether or not each set of partial areas is a correspondence candidate. When focusing on the partial region R _j of the reference image, it is assumed that the partial region of the query image closest to this is Q _i and the next partial region thereof is Q _k . At this time, when the condition of the following expression (6) is satisfied, it is determined that R _j and Q _i are correspondence candidates.

・・・（６） ... (6)

ここで、Ｔは事前に決めておくパラメータであり、０＜Ｔ≦１の任意の値を取ってよい。例えばＴ＝０．８などとすればよい。 Here, T is a parameter determined in advance, and may take an arbitrary value of 0 <T ≦ 1. For example, T = 0.8 may be set.

以上の計算を全てのクエリ画像に対する参照画像の各々における、全ての部分領域の組に対して実施することで、対応候補となる部分領域を求めることが可能である。 By performing the above calculation for all sets of partial areas in each of the reference images for all query images, it is possible to obtain partial areas that are candidates for correspondence.

なお、このように求めた対応候補は重複を許す。つまり、ある参照画像の部分領域Ｒ_ｊに対して、クエリ画像の複数の部分領域が対応候補となってしまう可能性がある。通常、物体が同一であるにも関わらず、物体のある一部分領域に対して、複数の部分領域が対応することはあまりない。そこで、対応候補の重複を許さないよう、後処理を導入しても構わない。例えば、一度上記方法によって全ての対応を求めた後、複数の候補領域と対応しているクエリ画像側の候補領域を列挙する。続いて、そのクエリ画像側の部分領域と対応している参照画像の部分領域のうち、最も距離の近いものだけを有効な対応候補であると判断し、それ以外の組については対応候補を棄却する。以上のような処理を導入することにより、全ての部分領域は必ず一対一対応するように制約することができる。 Note that the correspondence candidates obtained in this way allow duplication. That is, for the partial area of a reference image R _j, a plurality of partial regions of the query image is likely to become the corresponding candidates. Usually, even though the objects are the same, a plurality of partial areas do not often correspond to a certain partial area of the object. Therefore, post-processing may be introduced so as not to allow duplication of correspondence candidates. For example, after all the correspondences are obtained by the above method, candidate regions on the query image side corresponding to a plurality of candidate regions are listed. Subsequently, among the partial regions of the reference image corresponding to the partial region on the query image side, only the closest region is determined to be an effective corresponding candidate, and the corresponding candidate is rejected for the other pairs To do. By introducing the processing as described above, it is possible to constrain all the partial regions to always correspond one-to-one.

［判定処理］ [Determination process]

続いて、判定部１３における、部分領域の幾何情報に基づいて、求めた部分領域の対応候補の対応の適否を判定する処理について説明する。ここでは、対応候補計算部１２で求めた対応候補のうち、有効な対応ではない（つまり、物体から抽出された部分領域同士の対応ではない）と考えられる組を削除する。 Next, a process of determining whether or not the corresponding partial area correspondence candidate is appropriate based on the geometric information of the partial area in the determination unit 13 will be described. Here, among the correspondence candidates obtained by the correspondence candidate calculation unit 12, a pair that is considered not to be an effective correspondence (that is, not a correspondence between partial areas extracted from an object) is deleted.

仮に、クエリ画像と参照画像が同一の物体を含んでいるとする。物体が剛体であるならば、クエリ画像中の物体と参照画像中の物体は異なる視点から撮影されているにすぎず、現実的な仮定の下、この視点変動は部分領域の見え方に一貫性を与える。言い換えれば、仮に対応候補となっている部分領域同士が、正しく同一物体上に存在する部分領域の組である場合には、クエリ側の部分領域の幾何情報と、対応候補となっている参照画像側の領域の幾何情報の幾何的関係（ずれ方）には一貫性があることになる。したがって、このずれ方に一貫性がある対応候補の組のみを有効な対応であると見做し、そうでない組を棄却すればよい。 Suppose that the query image and the reference image include the same object. If the object is a rigid body, the object in the query image and the object in the reference image are only taken from different viewpoints. Under realistic assumptions, this viewpoint variation is consistent with the appearance of the subregion. give. In other words, if the partial areas that are candidates for correspondence are pairs of partial areas that are correctly present on the same object, the geometric information of the partial area on the query side and the reference image that is a candidate for correspondence The geometric relationship (displacement) of the geometric information of the side area is consistent. Therefore, it is only necessary to consider a pair of correspondence candidates that are consistent in this way of deviation as an effective correspondence and reject those that are not.

図３、図４を用いてわかりやすく説明する。図３に、同一の物体を含む２枚の画像Ａおよび画像Ｂを示す。それぞれ、破線で囲った３種類の模様５１Ａ、５１Ｂ、５２Ａ、５２Ｂ、５３Ａ、５３Ｂが部分領域として規定されており、また互いに同一の番号により表される部分領域の組（例えば５１Ａと５１Ｂ）が、対応候補として判定されているとする。目的は、同一の物体上に存在する部分領域の組（この場合は５１Ａと５１Ｂおよび５２Ａと５２Ｂ）だけを有効な対応と判定することである。 This will be described in an easy-to-understand manner with reference to FIGS. FIG. 3 shows two images A and B including the same object. Each of the three types of patterns 51A, 51B, 52A, 52B, 53A, and 53B surrounded by a broken line is defined as a partial area, and a set of partial areas (for example, 51A and 51B) represented by the same numbers. Suppose that it is determined as a corresponding candidate. The purpose is to determine only a pair of partial areas (in this case, 51A and 51B and 52A and 52B) existing on the same object as an effective correspondence.

図３を見ればわかるように、同一の物体上にある５１Ａと５１Ｂおよび５２Ａと５２Ｂは、カメラの撮影視点のみに依存して同じようにその位置・姿勢・大きさが変化していることがわかる。位置を画像上での部分領域の存在するｘ座標、ｙ座標、姿勢を角度で定量化し、具体的にその幾何的関係を求めた結果を図６に示す。５１Ａと５１Ｂおよび５２Ａと５２Ｂは、ずれ量が近い値になるため、このずれ量についてヒストグラムを構成した場合、５１Ａと５１Ｂおよび５２Ａと５２Ｂの対応候補はヒストグラム上の同じビンに集中して分布する。したがって、結果として頻度が高いヒストグラムのビンにある部分領域の対応候補のみを有効な対応であると判定すれば、有効な対応ではない組を削除することができる。 As can be seen from FIG. 3, the positions, postures, and sizes of 51A and 51B and 52A and 52B on the same object change in the same manner depending only on the shooting viewpoint of the camera. Recognize. FIG. 6 shows the result of quantifying the x-coordinate, y-coordinate, and posture where the position of the partial region on the image is quantified, and specifically obtaining the geometric relationship. Since 51A and 51B and 52A and 52B are close to each other in the amount of deviation, if a histogram is constructed for this amount of deviation, the corresponding candidates of 51A and 51B and 52A and 52B are concentrated and distributed in the same bin on the histogram. . Therefore, as a result, if it is determined that only the correspondence candidates of the partial areas in the bins of the histogram with high frequency are effective correspondences, it is possible to delete a pair that is not an effective correspondence.

図４の例では、幾何情報として位置・姿勢・大きさを用いたが、先に述べた通り、本発明の実施の形態においては、幾何情報としては少なくとも大きさが規定されていればよい。この場合、ヒストグラムが１次元に簡易化されるため、効率的な処理が可能となる。 In the example of FIG. 4, the position / posture / size is used as the geometric information. However, as described above, in the embodiment of the present invention, it is sufficient that at least the size is defined as the geometric information. In this case, since the histogram is simplified to one dimension, efficient processing is possible.

また、このようなヒストグラムに基づく判定に限らず、その他公知の幾何的関係性の検証法を用いても構わない。例えば、同一物体上の部分領域の幾何的関係は、適当な条件の下で線形変換に拘束されることが知られている。このような線形変換と、これに従う幾何的関係を持つ対応候補の組を求める手法として参考文献１に記載のRANSACアルゴリズムや参考文献２に記載のLO-RANSACアルゴリズムなど、公知の有効な方法が存在するため、これらを用いても構わない。 Further, the determination is not limited to such a histogram, and other known geometric relationship verification methods may be used. For example, it is known that the geometric relationship between partial areas on the same object is constrained by linear transformation under appropriate conditions. There are known effective methods such as the RANSAC algorithm described in Reference 1 and the LO-RANSAC algorithm described in Reference 2 as a method for obtaining such a linear transformation and a pair of correspondence candidates having a geometric relationship according to the linear transformation. Therefore, these may be used.

［参考文献１］M. A. Fischler and R. C. Bolles,“Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography,” Comm. ACM, vol. 24, no. 6, pp. 381-395, 1981. [Reference 1] MA Fischler and RC Bolles, “Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography,” Comm. ACM, vol. 24, no. 6, pp. 381-395, 1981 .

［参考文献２］O. Chum, J. Matas, and S. Obdrzalek,“Enhancing RANSAC by generalized model optimization,” Proceedings of Asian Conference on Computer Vision, pp. 812-817, 2004. [Reference 2] O. Chum, J. Matas, and S. Obdrzalek, “Enhancing RANSAC by generalized model optimization,” Proceedings of Asian Conference on Computer Vision, pp. 812-817, 2004.

以上の手続きにより、クエリ画像と参照画像に同一の物体を含むか否かを判定することができる。 With the above procedure, it can be determined whether or not the query image and the reference image include the same object.

以上説明したように、本発明の第１の実施の形態に係る検証装置によれば、参照画像に規定された各々の部分領域に関連づけられた特徴量ベクトルを、一つ以上の離散値に量子化した特徴量符号を求め、求めたある特徴量符号と、クエリ画像に規定された各々の部分領域に関連づけられた特徴量ベクトルとの距離を求め、当該距離に基づいて、特徴量符号に対応する参照画像に規定された部分領域と、クエリ画像に規定された部分領域との対応候補を求め、対応候補である参照画像に規定された部分領域に関連づけられた幾何情報と、クエリ画像に規定された部分領域に関連づけられた幾何情報との幾何的関係に基づいて、当該対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とすることにより、高精度かつ高効率に画像間の照合を可能にする。 As described above, according to the verification apparatus according to the first embodiment of the present invention, the feature vector associated with each partial region defined in the reference image is quantized into one or more discrete values. The feature value code is obtained, the distance between the obtained feature value code and the feature value vector associated with each partial area specified in the query image is obtained, and the feature value code is supported based on the distance. To obtain a candidate for correspondence between the partial area specified in the reference image and the partial area specified in the query image, the geometric information associated with the partial area specified in the reference image that is a correspondence candidate, and the query image It is possible to determine whether the corresponding candidate is appropriate based on the geometric relationship with the geometric information associated with the specified partial area, and to determine whether the corresponding candidate is appropriate. High efficiency To enable the collation between.

＜本発明の第２の実施の形態に係る検証装置＞ <Verification Device According to Second Embodiment of the Present Invention>

以下、図面を参照して本発明の第２の実施の形態を詳細に説明する。なお、第１の実施の形態と同様となる箇所については同一符号を付して説明を省略する。 Hereinafter, a second embodiment of the present invention will be described in detail with reference to the drawings. In addition, the same code | symbol is attached | subjected about the location similar to 1st Embodiment, and description is abbreviate | omitted.

第２の実施の形態では、部分領域の幾何情報を符号化する点が第１の実施の形態と異なっている。 The second embodiment is different from the first embodiment in that the geometric information of the partial region is encoded.

＜＜第２の実施の形態の全体構成＞＞ << Overall Configuration of Second Embodiment >>

図５は、本発明の第２の実施形態における検証装置１０１の構成の一例を示すブロック図である。図１に示す検証装置１０１は、特徴量符号化部１１と、対応候補計算部１２と、幾何情報符号化部１４と、判定部１３とを備える。 FIG. 5 is a block diagram showing an example of the configuration of the verification apparatus 101 according to the second embodiment of the present invention. The verification apparatus 101 illustrated in FIG. 1 includes a feature amount encoding unit 11, a correspondence candidate calculation unit 12, a geometric information encoding unit 14, and a determination unit 13.

＜＜第２の実施の形態の処理部＞＞ << Processing Unit of Second Embodiment >>

幾何情報符号化部１４は、対応候補である参照画像に規定された部分領域に関連づけられた幾何情報と、クエリ画像に規定された部分領域に関連づけられた幾何情報のそれぞれについて、一つ以上の離散値に量子化し、幾何情報符号を求める。なお、参照画像又はクエリ画像のいずれか一方のみについて幾何情報符号を求めるようにしてもよい。 The geometric information encoding unit 14 performs one or more of each of the geometric information associated with the partial area defined in the reference image that is a correspondence candidate and the geometric information associated with the partial area defined in the query image. Quantize to discrete values to obtain geometric information code. Note that the geometric information code may be obtained for only one of the reference image and the query image.

判定部１３は、対応候補である参照画像に規定された部分領域に関連づけられた幾何情報から求めた幾何情報符号と、クエリ画像に規定された部分領域に関連づけられた幾何情報から求めた幾何情報符号との組の各々について、当該組の幾何的関係を求め、求められた幾何的関係に基づいて、当該組の対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とすると判定して判定結果５を出力する。 The determination unit 13 uses the geometric information code obtained from the geometric information associated with the partial area defined in the reference image that is a correspondence candidate and the geometric information obtained from the geometric information associated with the partial area defined in the query image. For each pair of codes, the geometric relationship of the pair is obtained, and based on the obtained geometric relationship, it is determined whether or not the corresponding candidate for the pair is appropriate. Is determined to correspond, and a determination result 5 is output.

＜＜処理概要＞＞ << Process overview >>

次に、本実施の形態における検証装置１０１の処理について説明する。図６は、処理の流れを示すフローチャートである。 Next, the process of the verification apparatus 101 in this Embodiment is demonstrated. FIG. 6 is a flowchart showing the flow of processing.

ステップＳ３０３Ｂでは、幾何情報符号化部１４が、対応候補である参照画像に規定された部分領域に関連づけられた幾何情報と、クエリ画像に規定された部分領域に関連づけられた幾何情報のそれぞれについて、一つ以上の離散値に量子化し、幾何情報符号を求める。 In step S303B, the geometric information encoding unit 14 performs each of the geometric information associated with the partial region defined in the reference image that is a correspondence candidate and the geometric information associated with the partial region defined in the query image. Quantize to one or more discrete values to obtain a geometric information code.

ステップＳ３０４Ｂでは、判定部１３が、対応候補となっている参照画像側の部分領域の幾何情報符号と、クエリ画像側の部分領域の幾何情報符号との組の各々について、その幾何的関係を求め、これに基づいてこの対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とする判定して判定結果５を出力する。 In step S304B, the determination unit 13 obtains the geometric relationship for each set of the geometric information code of the partial area on the reference image side and the geometric information code of the partial area on the query image side, which is a correspondence candidate. Based on this, it is determined whether or not the corresponding candidate is appropriate. If it is appropriate, it is determined that the corresponding candidate is appropriate, and a determination result 5 is output.

なお、第２の実施の形態の他の構成及び作用については第１の実施の形態と同様であるため説明を省略する。 Since other configurations and operations of the second embodiment are the same as those of the first embodiment, description thereof is omitted.

＜＜第２の実施の形態の各処理の処理詳細＞＞ << Processing Details of Each Process in Second Embodiment >>

幾何情報符号化部１４における、幾何情報の符号化処理について説明する。なお、第１の実施の形態で説明した判定処理では、幾何情報はいずれも実数値であるという前提で判定処理を行ったが、幾何情報は符号化されていても構わない。 The geometric information encoding process in the geometric information encoding unit 14 will be described. In the determination process described in the first embodiment, the determination process is performed on the assumption that the geometric information is a real value, but the geometric information may be encoded.

幾何情報の符号化も、特徴量ベクトルの符号化と同様、量子化に基づいて実行することができる。例えば大きさを例にとって説明すると、大きさの範囲をいくつかの区分に分割しておき、ある部分領域の大きさがどの区分に属するかに基づいて符号化する。より具体的には、仮に大きさを０以上２未満、２以上４未満、４以上６未満、６以上の４区分に分割するとし、値の小さい区分から順に「１」、「２」、「３」、「４」と幾何情報符号を定めておく。このとき、仮にある部分領域の大きさが２．５であったとすると、当該部分領域の大きさは２の区分に属するので、大きさの幾何情報符号は２となる。 The geometric information can be encoded based on the quantization as in the case of the feature vector encoding. For example, taking the size as an example, the size range is divided into several sections, and encoding is performed based on which section the size of a partial area belongs to. More specifically, suppose that the size is divided into four categories of 0 or more, less than 2, 2 or more, less than 4, 4 or more, less than 6, or 6 or more, and “1”, “2”, “ The geometric information code is defined as “3” and “4”. At this time, if the size of a partial area is 2.5, the size of the partial area belongs to 2 sections, so the geometric information code of the size is 2.

以上のように符号化することにより幾何的関係の演算も単純化できる。例えばある対応候補である部分領域の組について、一方の大きさの幾何情報符号が２、もう一方の大きさの幾何情報符号が３であるとすると、そのずれは３−２＝１として簡単に評価することができる。 By performing the encoding as described above, the calculation of the geometric relationship can be simplified. For example, if a geometric information code of one size is 2 and a geometric information code of the other size is 3 for a set of partial areas that are certain correspondence candidates, the deviation is simply 3-2 = 1. Can be evaluated.

ここでは大きさの場合を例に説明したが、もちろんその他の種類の幾何情報（位置・姿勢等）を用いる場合についても同様に処理可能であることは言うまでもない。 Although the case of size has been described as an example here, it is needless to say that processing can be similarly performed when other types of geometric information (position, posture, etc.) are used.

このように、部分領域の幾何情報を低容量な符号に圧縮することにより、より高効率な検証を可能にする。 Thus, by compressing the geometric information of the partial region into a low-capacity code, it is possible to perform more efficient verification.

以上説明したように、本発明の第２の実施の形態に係る検証装置によれば、参照画像に規定された各々の部分領域に関連づけられた特徴量ベクトルを、一つ以上の離散値に量子化した特徴量符号を求め、求めたある特徴量符号と、クエリ画像に規定された各々の部分領域に関連づけられた特徴量ベクトルとの距離を求め、当該距離に基づいて、特徴量符号に対応する参照画像に規定された部分領域と、クエリ画像に規定された部分領域との対応候補を求め、幾何情報を一つ以上の離散値に量子化し、幾何情報符号を求め、対応候補である参照画像に規定された部分領域に関連づけられた幾何情報から求めた幾何情報符号と、クエリ画像に規定された部分領域に関連づけられた幾何情報から求めた幾何情報符号との幾何的関係に基づいて、当該対応候補が適当であるか否かを判定し、適当である場合にはこれを対応とすることにより、高精度かつ高効率に画像間の照合を可能にする。 As described above, according to the verification apparatus according to the second embodiment of the present invention, the feature vector associated with each partial region defined in the reference image is quantized into one or more discrete values. The feature value code is obtained, the distance between the obtained feature value code and the feature value vector associated with each partial area specified in the query image is obtained, and the feature value code is supported based on the distance. The candidate corresponding to the partial area specified in the reference image and the partial area specified in the query image is obtained, the geometric information is quantized into one or more discrete values, the geometric information code is obtained, and the reference that is the corresponding candidate Based on the geometric relationship between the geometric information code obtained from the geometric information associated with the partial area defined in the image and the geometric information code obtained from the geometric information associated with the partial area defined in the query image, Concerned Response candidate is equal to or appropriate, by the corresponding this where appropriate, to allow matching between images with high accuracy and high efficiency.

なお、本発明は、上述した実施の形態に限定されるものではなく、この発明の要旨を逸脱しない範囲内で様々な変形や応用が可能である。 The present invention is not limited to the above-described embodiment, and various modifications and applications can be made without departing from the gist of the present invention.

１、１０１検証装置
２データベース
３参照画像情報
４クエリ画像情報
５判定結果
１１特徴量符号化部
１２対応候補計算部
１３判定部
１４幾何情報符号化部 DESCRIPTION OF SYMBOLS 1,101 Verification apparatus 2 Database 3 Reference image information 4 Query image information 5 Determination result 11 Feature-value encoding part 12 Corresponding candidate calculation part 13 Determination part 14 Geometric information encoding part

Claims

Geometric information and feature quantity vector including at least the size of the partial area associated with each of the one or more partial areas defined in the first image, and one or more defined in the second image When the geometric information and the feature vector associated with each of the partial areas are input, the partial area defined in the first image, and the partial area defined in the second image A verification device that requests
A feature amount encoding unit for obtaining a feature amount code obtained by quantizing the feature amount vector associated with each of the partial regions defined in the first image into one or more discrete values;
A distance between the certain feature amount code obtained for the first image and the feature amount vector associated with each of the partial regions defined in the second image is obtained, and based on the distance, A correspondence candidate calculation unit for obtaining a correspondence candidate between the partial region defined in the first image corresponding to the feature amount code and the partial region defined in the second image;
Geometric relationship between the geometric information associated with the partial area defined in the first image as the correspondence candidate and the geometric information associated with the partial area defined in the second image On the basis of whether or not the corresponding candidate is appropriate, and if it is appropriate, a determination unit corresponding to this,
Verification device including

A geometric information encoding unit that quantizes the geometric information into one or more discrete values and obtains a geometric information code;
The determination unit includes the geometric information code obtained from the geometric information associated with the partial area defined in the first image that is the correspondence candidate, and the partial area defined in the second image. A determination is made as to whether or not the corresponding candidate is appropriate based on a geometric relationship with the geometric information code obtained from the geometric information associated with the item, and if it is appropriate, it is determined as a corresponding item. The verification apparatus according to 1.

The feature amount encoding unit obtains a binary value obtained by performing binary quantization on each element of the feature amount vector associated with each of the partial areas defined in the first image based on the positive / negative. 3. The verification apparatus according to claim 1, wherein the vector is obtained as a feature amount code.

The determination unit obtains a histogram based on the geometric relationship for each correspondence candidate, and determines that the correspondence candidate corresponding to a bin having a high frequency in the histogram is appropriate. The verification apparatus according to item 1.

Geometric information and feature quantity vector including at least the size of the partial area associated with each of the one or more partial areas defined in the first image, and one or more defined in the second image When the geometric information and the feature vector associated with each of the partial areas are input, the partial area defined in the first image, and the partial area defined in the second image A verification method in a verification device that seeks the correspondence of
A feature amount encoding unit obtaining a feature amount code obtained by quantizing the feature amount vector associated with each of the partial areas defined in the first image into one or more discrete values;
The correspondence candidate calculation unit obtains a distance between the certain feature amount code obtained for the first image and the feature amount vector associated with each partial region defined in the second image, and Obtaining a candidate for correspondence between the partial area defined in the first image corresponding to the feature code and the partial area defined in the second image based on a distance;
The determination unit includes the geometric information associated with the partial area defined in the first image as the correspondence candidate, and the geometric information associated with the partial area defined in the second image. Determining whether or not the corresponding candidate is appropriate based on the geometric relationship of
Verification method including

A geometric information encoding unit further comprising: quantizing the geometric information into one or more discrete values to obtain a geometric information code;
The step of determining by the determining unit is defined in the geometric information code obtained from the geometric information associated with the partial area defined in the first image that is the correspondence candidate, and in the second image. Based on the geometric relationship with the geometric information code obtained from the geometric information associated with the partial area, it is determined whether or not the corresponding candidate is appropriate. The verification method according to claim 5.

The step of encoding by the feature amount encoding unit performs binary quantization on each element of the feature amount vector associated with each of the partial areas defined in the first image based on the positive / negative. The verification method according to claim 5 or 6, wherein the binary vector obtained in step (5) is obtained as a feature amount code.

The program for functioning a computer as each part of the verification apparatus of any one of Claims 1-4.