JP6218237B2

JP6218237B2 - Image conversion program, apparatus and method for parallelizing photographed image

Info

Publication number: JP6218237B2
Application number: JP2014119126A
Authority: JP
Inventors: 有哉巻渕; 加藤　晴久; 晴久加藤
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2014-06-09
Filing date: 2014-06-09
Publication date: 2017-10-25
Anticipated expiration: 2034-06-09
Also published as: JP2015233212A

Description

本発明は、撮影画像を平行化する技術に関する。 The present invention relates to a technique for parallelizing a captured image.

「撮影画像の平行化」とは、平面に対して斜めの角度から撮影された「撮影画像（カメラ画像）」を、垂直上方から見た「俯瞰画像」へ変換する技術をいう。これは、３次元復元技術の一種であって、撮影画像から被写体の２次元平面の幾何情報（平行線や直交線）を復元するものである。この技術によれば、俯瞰画像についてのみ保持される特定の幾何情報も簡易に検出することができる。また、俯瞰画像の生成と同時に、被写体に対する撮影姿勢も取得することができ、ＣＧ(Computer Graphics)を現実空間に重ね合わせて表示するＡＲ(Augmented Reality)システムに応用することもできる。 “Parallelization of photographed image” refers to a technique for converting a “captured image (camera image)” photographed from an oblique angle with respect to a plane into an “overhead image” viewed from vertically above. This is a kind of three-dimensional restoration technique and restores geometric information (parallel lines and orthogonal lines) of a two-dimensional plane of a subject from a captured image. According to this technique, it is possible to easily detect specific geometric information held only for the overhead image. Simultaneously with the generation of the overhead view image, the photographing posture with respect to the subject can be acquired, and the present invention can be applied to an AR (Augmented Reality) system that displays CG (Computer Graphics) superimposed on the real space.

従来、カメラ画像と俯瞰画像との点対応を用いて、撮影画像を平行化する技術がある（例えば特許文献１参照）。この技術によれば、具体的には、被写体上に存在する既知の幾何形状とカメラ画像内形状との対応を用いて、最少４組の点対応から俯瞰画像へ変換する平面射影変換行列を計算する。例えば、正方形のマーカを被写体上に事前に設置することによって、正方形の４頂点に関して、俯瞰画像上の座標とカメラ画像上のピクセル座標との間で対応付けて簡易に平行化を実現する。 2. Description of the Related Art Conventionally, there is a technique for parallelizing a captured image using point correspondence between a camera image and a bird's-eye view image (for example, see Patent Document 1). Specifically, according to this technology, a plane projective transformation matrix for converting from a minimum of four sets of point correspondences to a bird's-eye view image is calculated using correspondence between known geometric shapes existing on the subject and shapes in the camera image. To do. For example, by arranging square markers in advance on the subject, parallelization is easily realized by associating the coordinates of the four vertexes of the square with the coordinates on the overhead image and the pixel coordinates on the camera image.

また、被写体上から検出された平行線を用いて、撮影画像を平行化する技術もある（例えば特許文献２参照）。この技術によれば、具体的には、平面上に存在する最少２組の平行線を検出し、この平行線を用いて、平面図上の無限遠直線に相当するカメラ画像中の消失線の位置を計算する。カメラ画像中の消失線とカメラの内部パラメータとから、被写体の法線方向を復元し、俯瞰画像を作成する。 There is also a technique for parallelizing a captured image using parallel lines detected from the subject (see, for example, Patent Document 2). Specifically, according to this technique, a minimum of two sets of parallel lines existing on a plane are detected, and using these parallel lines, the disappearance line in the camera image corresponding to the infinity line on the plan view is detected. Calculate the position. The normal direction of the subject is restored from the disappearance line in the camera image and the internal parameters of the camera, and an overhead image is created.

前述した特許文献１及び２に記載の技術によれば、撮影画像（被写体）上に、特定の幾何形状や平行線が存在するものに限定されるという課題があった。 According to the techniques described in Patent Documents 1 and 2 described above, there is a problem in that the technique is limited to those in which a specific geometric shape or parallel line exists on a captured image (subject).

これに対し、多視点で撮影された各カメラ画像間の平面射影変換行列を用いて、撮影画像を平行化する技術もある（例えば非特許文献１、２、３参照）。この技術によれば、カメラ画像を複数枚用いるという点で、前述した特許文献１及び２に記載の技術とは大きく異なる。 On the other hand, there is also a technique for parallelizing a captured image using a planar projection transformation matrix between camera images captured from multiple viewpoints (see, for example, Non-Patent Documents 1, 2, and 3). This technique is significantly different from the techniques described in Patent Documents 1 and 2 described above in that a plurality of camera images are used.

図１は、従来技術における画像変換プログラムの機能構成図である。 FIG. 1 is a functional configuration diagram of a conventional image conversion program.

非特許文献１に記載の技術によれば、具体的には、撮影画像の平行化パラメータと、各カメラ画像間の平面射影変換行列とに関して、コスト関数を予め定義している。その上で、カメラから入力された複数枚の撮影画像から、局所特徴量に基づく点対応を計算する（図１の点対応計算部に対応）。次に、その点対応から、平面射影変換行列を計算する（図１の平面射影変換行列計算部に対応）。そして、各平面射影変換行列を入力としてそれを最小化することにより、平行化パラメータを推定する。 Specifically, according to the technique described in Non-Patent Document 1, a cost function is defined in advance with respect to a parallelization parameter of a captured image and a planar projective transformation matrix between the camera images. Then, the point correspondence based on the local feature amount is calculated from a plurality of captured images input from the camera (corresponding to the point correspondence calculation unit in FIG. 1). Next, a plane projection transformation matrix is calculated from the point correspondence (corresponding to the plane projection transformation matrix calculation unit in FIG. 1). Then, a parallelization parameter is estimated by minimizing each planar projective transformation matrix as an input.

この技術によれば、平面に関する事前情報を一切利用しないために、被写体上の幾何情報や平行線が存在しない場合であっても適用できる。例えば、被写体を斜め方向から撮影して得られるカメラ画像については、平面上で平行の関係にある２本の直線が平行ではなくなるため、平行線の検出が困難となるが、この技術によれば、このような場合であっても、撮影画像を平行化することができる。 This technique can be applied even when there is no geometric information or parallel lines on the subject because no prior information about the plane is used. For example, for a camera image obtained by photographing a subject from an oblique direction, it is difficult to detect a parallel line because two straight lines that are parallel to each other on a plane are not parallel. Even in such a case, the captured image can be parallelized.

平行化計算では、最適解を含む複数の平行化パラメータの解が得られる。平行化パラメータは、入力画像の平面射影変換に対応しており、最適解に基づく変換により正面方向から撮影された俯瞰画像が得られる。 In the parallelization calculation, a plurality of parallelization parameter solutions including an optimal solution are obtained. The parallelization parameter corresponds to the planar projective transformation of the input image, and an overhead image taken from the front direction is obtained by the transformation based on the optimal solution.

また、非特許文献２に記載の技術によれば、各俯瞰画像候補について計算されるコスト値を比較し、最小のコスト値を有するものを最終的に得られる俯瞰画像として選択する（図１の俯瞰画像計算部に対応）。更に、非特許文献３に記載の技術によれば、最小のコスト値を有する俯瞰画像を出力する際の判定基準として、最小のコスト値に対する２番目に小さなコスト値の比率を使用する。 Further, according to the technique described in Non-Patent Document 2, the cost values calculated for each overhead image candidate are compared, and the one having the minimum cost value is selected as the finally obtained overhead image (FIG. 1). Corresponding to the overhead image calculation unit). Furthermore, according to the technique described in Non-Patent Document 3, the ratio of the second smallest cost value to the minimum cost value is used as a criterion for outputting an overhead image having the minimum cost value.

特開２００６−１４６７６０号公報JP 2006-146760 A 特許第４６１７９９３号公報Japanese Patent No. 4617993

A. Ruiz, P. E. L. de Teruel, and L. Fernandez、Practical planar metric rectification. In Proc. BMVC 2006, 2006、[online]、［平成２６年６月３日検索］、インターネット＜URL: http://www.bmva.org/bmvc/2006/papers/188.pdf＞A. Ruiz, PEL de Teruel, and L. Fernandez, Practical planar metric rectification. In Proc. BMVC 2006, 2006, [online], [Search June 3, 2014], Internet <URL: http: // www .bmva.org / bmvc / 2006 / papers / 188.pdf> C. Pirchheim and G. Reitmayr, “Homography-based planar mapping and tracking for mobile phones,” in Proceedings of the 10th IEEE and ACM International Symposium on Mixed and Augmented Reality (ISMAR ’11), 2011, pp. 27.36、[online]、［平成２６年６月３日検索］、インターネット＜URL:http://www.researchgate.net/publication/221221458_Homography-based_planar_mapping_and_tracking_for_mobile_phones＞C. Pirchheim and G. Reitmayr, “Homography-based planar mapping and tracking for mobile phones,” in Proceedings of the 10th IEEE and ACM International Symposium on Mixed and Augmented Reality (ISMAR '11), 2011, pp. 27.36, [online ], [Search June 3, 2014], Internet <URL: http://www.researchgate.net/publication/221221458_Homography-based_planar_mapping_and_tracking_for_mobile_phones> 巻渕有哉，加藤晴久，柳原広昌，“マーカーレスARのための複数平面を利用したカメラ姿勢推定手法の検討，” 情報処理学会研究報告. [オーディオビジュアル複合情報処理] 2014-AVM-84(4), 1-6, 2014-02-14、[online]、［平成２６年６月３日検索］、インターネット＜URL:https://ipsj.ixsq.nii.ac.jp/ej/?action=pages_view_main&active_action=repository_view_main_item_detail&item_id=98561&item_no=1&page_id=13&block_id=8＞Yuya Makibuchi, Haruhisa Kato, Hiromasa Yanagihara, “Examination of Camera Posture Estimation Method Using Multiple Planes for Markerless AR,” IPSJ SIG. 2014-AVM-84 (4) , 1-6, 2014-02-14, [online], [Search June 3, 2014], Internet <URL: https://ipsj.ixsq.nii.ac.jp/ej/?action=pages_view_main&active_action = repository_view_main_item_detail & item_id = 98561 & item_no = 1 & page_id = 13 & block_id = 8> 「ホモグラフィの計算」、[online]、［平成２６年６月３日検索］、インターネット＜URL:http://tessy.org/wiki/index.php?%A5%DB%A5%E2%A5%B0%A5%E9%A5%D5%A5%A3%A4%CE%B7%D7%BB%BB＞“Calculation of homography”, [online], [Search June 3, 2014], Internet <URL: http://tessy.org/wiki/index.php?%A5%DB%A5%E2%A5 % B0% A5% E9% A5% D5% A5% A3% A4% CE% B7% D7% BB% BB> 「活字文書ＯＣＲライブラリ」、[online]、［平成２６年６月３日検索］、インターネット＜URL:http://mediadrive.jp/products/library/katsuji_library/windows/index.html＞“Type Document OCR Library”, [online], [Search June 3, 2014], Internet <URL: http: //mediadrive.jp/products/library/katsuji_library/windows/index.html> 「FAX OCR/スキャナー OCR「TeleForm」を構成する、高性能な4つのアプリケーション」、[online]、［平成２６年６月３日検索］、インターネット＜URL:http://www.hammock.jp/ocr/kinou/verifier.html＞"Four high-performance applications that make up FAX OCR / Scanner OCR" TeleForm "", [online], [Search June 3, 2014], Internet <URL: http://www.hammock.jp/ ocr / kinou / verifier.html>

非特許文献１、２及び３に記載の技術によれば、平行化の精度がカメラ画像の枚数やカメラ画像間の点対応計算の精度に大きく依存し、誤った俯瞰画像候補が得られる場合がある。特に、非特許文献２及び３に記載の技術は、正確な俯瞰画像を判定するためにコスト値の比較では不十分であり、高精度の俯瞰画像が得られないとい課題がある。このような従来技術によれば、平行化パラメータ間の比較により最適解の判定をしており、誤った平行化パラメータが最適解として判定される可能性があった。 According to the techniques described in Non-Patent Documents 1, 2, and 3, the accuracy of parallelization largely depends on the number of camera images and the accuracy of point correspondence calculation between camera images, and an erroneous overhead image candidate may be obtained. is there. In particular, the techniques described in Non-Patent Documents 2 and 3 are insufficient to compare cost values in order to determine an accurate overhead image, and there is a problem that a high-accuracy overhead image cannot be obtained. According to such a conventional technique, the optimum solution is determined by comparison between the parallelization parameters, and an incorrect parallelization parameter may be determined as the optimum solution.

そこで、本発明は、カメラによる撮影画像を高精度に平行化することができる画像変換プログラム、装置及び方法を提供することを目的とする。 SUMMARY OF THE INVENTION An object of the present invention is to provide an image conversion program, apparatus, and method that can parallelize an image captured by a camera with high accuracy.

本発明によれば、撮影画像を平行化するようにコンピュータを機能させる画像変換プログラムにおいて、
撮影画像には、被写体に付与された識別コードが写っており、
複数の撮影画像について任意の代表画像と各従属画像との間の特徴点の点対応を計算する点対応計算手段と、
点対応毎に、代表画像と各従属画像との間の平面射影変換行列を計算する平面射影変換行列計算手段と、
各平面射影変換行列を用いて平行化パラメータを推定し、各平行化パラメータを用いて代表画像を俯瞰画像へ変換した複数の俯瞰画像候補を出力する俯瞰画像候補計算手段と、
複数の俯瞰画像候補の中から、識別コードの画像的な読み取りに成功した俯瞰画像を、平行化された俯瞰画像として出力するコード読取判定手段と
してコンピュータを機能させることを特徴とする。 According to the present invention, in an image conversion program for causing a computer to function to parallelize a captured image,
The captured image shows the identification code given to the subject.
Point correspondence calculation means for calculating point correspondence of feature points between an arbitrary representative image and each subordinate image for a plurality of captured images;
A plane projection transformation matrix calculating means for calculating a plane projection transformation matrix between the representative image and each subordinate image for each point correspondence;
An overhead image candidate calculation means for estimating a parallelization parameter using each planar projection transformation matrix and outputting a plurality of overhead image candidates obtained by converting a representative image into an overhead image using each parallelization parameter;
The computer is caused to function as a code reading determination unit that outputs a bird's-eye view image that has been successfully image-reading an identification code from among a plurality of bird's-eye view image candidates.

本発明の画像変換プログラムにおける他の実施形態によれば、
俯瞰画像候補計算手段は、平行化パラメータ空間について局所最小解と判定された複数の平行化パラメータを推定する
ようにコンピュータを機能させることも好ましい。 According to another embodiment of the image conversion program of the present invention,
The overhead image candidate calculation means preferably causes the computer to function so as to estimate a plurality of parallelization parameters determined as the local minimum solution for the parallelization parameter space.

本発明の画像変換プログラムにおける他の実施形態によれば、
俯瞰画像候補計算手段は、平行化パラメータ空間の中で一定距離以下に存在する複数の平行化パラメータが推定された場合、いずれか１つの平行パラメータのみを選択する
ようにコンピュータを機能させることも好ましい。 According to another embodiment of the image conversion program of the present invention,
It is also preferable that the overhead image candidate calculation means allows the computer to function so as to select only one of the parallel parameters when a plurality of parallel parameters existing within a certain distance in the parallel parameter space are estimated. .

本発明の画像変換プログラムにおける他の実施形態によれば、
俯瞰画像候補計算手段は、俯瞰画像候補同士の間で類似度を算出し、当該類似度が所定閾値以上となる俯瞰画像候補のみを選択する
ようにコンピュータを機能させることも好ましい。 According to another embodiment of the image conversion program of the present invention,
It is also preferable that the overhead image candidate calculation means calculates the similarity between the overhead image candidates and causes the computer to select only the overhead image candidates whose similarity is equal to or greater than a predetermined threshold.

本発明の画像変換プログラムにおける他の実施形態によれば、
識別コードは、バーコード、ＱＲ(Quick Response)（登録商標）コード、ＯＣＲ(Optical Character Recognition)用文字又は線分のような幾何特徴画像である
ようにコンピュータを機能させることも好ましい。 According to another embodiment of the image conversion program of the present invention,
It is also preferable to make the computer function so that the identification code is a bar code, a QR (Quick Response) (registered trademark) code, an OCR (Optical Character Recognition) character, or a geometric feature image such as a line segment.

本発明の画像変換プログラムにおける他の実施形態によれば、
コード読取判定手段は、識別コードの画像的な読み取りが成功した俯瞰画像候補の中で、その信頼値が最も高い俯瞰画像を出力する
ようにコンピュータを機能させることも好ましい。尚、信頼値は、コード読取判定手段が出力する読取り結果の確からしさを示す数値である。 According to another embodiment of the image conversion program of the present invention,
The code reading determination means preferably causes the computer to function so as to output the bird's-eye view image having the highest reliability value among the bird's-eye view image images that have been successfully read in the image form of the identification code. The confidence value is a numerical value indicating the certainty of the reading result output from the code reading determining means.

本発明の画像変換プログラムにおける他の実施形態によれば、
コード読取判定手段は、識別コードの画像的な読み取りが成功したその識別データをアプリケーションへ出力する
ようにコンピュータを機能させることも好ましい。 According to another embodiment of the image conversion program of the present invention,
The code reading determination means preferably causes the computer to function so as to output the identification data, which has been successfully read in the image form of the identification code, to the application.

本発明によれば、撮影画像を平行化する画像変換装置において、
撮影画像には、被写体に付与された識別コードが写っており、
複数の撮影画像について任意の代表画像と各従属画像との間の特徴点の点対応を計算する点対応計算手段と、
点対応毎に、代表画像と各従属画像との間の平面射影変換行列を計算する平面射影変換行列計算手段と、
各平面射影変換行列を用いて平行化パラメータを推定し、各平行化パラメータを用いて代表画像を俯瞰画像へ変換した複数の俯瞰画像候補を出力する俯瞰画像候補計算手段と、
複数の俯瞰画像候補の中から、識別コードの画像的な読み取りに成功した俯瞰画像を、平行化された俯瞰画像として出力するコード読取判定手段と
を有することを特徴とする。 According to the present invention, in an image conversion apparatus for parallelizing a captured image,
The captured image shows the identification code given to the subject.
Point correspondence calculation means for calculating point correspondence of feature points between an arbitrary representative image and each subordinate image for a plurality of captured images;
A plane projection transformation matrix calculating means for calculating a plane projection transformation matrix between the representative image and each subordinate image for each point correspondence;
An overhead image candidate calculation means for estimating a parallelization parameter using each planar projection transformation matrix and outputting a plurality of overhead image candidates obtained by converting a representative image into an overhead image using each parallelization parameter;
Code reading determination means for outputting, as a paralleled bird's-eye view image, a bird's-eye view image that has succeeded in image-wise reading of an identification code from among a plurality of bird's-eye view image candidates.

本発明によれば、ユーザ操作の情報機器を用いて、撮影画像を平行化する画像変換方法において、
撮影画像には、被写体に付与された識別コードが写っており、
複数の撮影画像について任意の代表画像と各従属画像との間の特徴点の点対応を計算する第１のステップと、
点対応毎に、代表画像と各従属画像との間の平面射影変換行列を計算する第２のステップと、
各平面射影変換行列を用いて平行化パラメータを推定し、各平行化パラメータを用いて代表画像を俯瞰画像へ変換した複数の俯瞰画像候補を出力する第３のステップと、
複数の俯瞰画像候補の中から、識別コードの画像的な読み取りに成功した俯瞰画像を、平行化された俯瞰画像として出力する第４のステップと
を有することを特徴とする。 According to the present invention, in an image conversion method for parallelizing a captured image using a user-operated information device,
The captured image shows the identification code given to the subject.
A first step of calculating a point correspondence of feature points between an arbitrary representative image and each subordinate image for a plurality of captured images;
A second step of calculating a planar projective transformation matrix between the representative image and each dependent image for each point correspondence;
A third step of estimating a parallelization parameter using each planar projective transformation matrix and outputting a plurality of overhead image candidates obtained by converting a representative image into an overhead image using each parallelization parameter;
And a fourth step of outputting a bird's-eye view image, which has been successfully read as an identification code, from among a plurality of bird's-eye view image candidates, as a parallel bird's-eye view image.

本発明の画像変換プログラム、装置及び方法によれば、カメラによる撮影画像を高精度に平行化することができる。 According to the image conversion program, apparatus, and method of the present invention, it is possible to parallelize an image captured by a camera with high accuracy.

従来技術における画像変換プログラムの機能構成図である。It is a functional block diagram of the image conversion program in a prior art. 本発明における画像変換プログラムの機能構成図である。It is a functional block diagram of the image conversion program in this invention. バーコードが付与された被写体に対する斜め方向からの撮影画像である。It is the picked-up image from the diagonal direction with respect to the to-be-photographed object. カメラを移動させながら撮影した複数枚の撮影画像である。It is the several picked-up image image | photographed while moving the camera. Homography行列に基づくinlier及びoutlierを表す画像対応図である。It is an image correspondence figure showing inlier and outlier based on a Homography matrix. 離散化されたコスト値を白黒の濃淡で表すコスト値行列である。It is a cost value matrix which expresses the discretized cost value with black and white shading. 各平行化パラメータに対応して表された３つの俯瞰画像候補である。Three bird's-eye view image candidates represented in correspondence with each parallelization parameter. 識別コードを読み取った識別データに対するアプリケーションの画面表示図である。It is the screen display figure of the application with respect to the identification data which read the identification code.

以下、本発明の実施の形態について、図面を用いて詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図２は、本発明における画像変換プログラムの機能構成図である。 FIG. 2 is a functional configuration diagram of an image conversion program according to the present invention.

本発明における画像変換プログラムは、撮影画像を平行化するようにコンピュータを機能させるものであって、例えばスマートフォンやタブレットのような端末にインストールされて実行される。このような端末１は一般的に、周辺状況を撮影するカメラモジュール（Ｗｅｂカメラ）１０１と、ディスプレイ１０２とを有する。カメラモジュール１０１は、例えば連写や動画によって被写体を撮影した複数枚の画像を、画像変換プログラムへ出力する。尚、カメラモジュール１０１について、焦点距離や光軸のずれ、歪みパラメータ等の内部パラメータは、事前のキャリブレーションによって既知であるとする。 The image conversion program according to the present invention causes a computer to function in parallel with a captured image, and is installed and executed in a terminal such as a smartphone or a tablet. Such a terminal 1 generally includes a camera module (Web camera) 101 that captures a surrounding situation and a display 102. For example, the camera module 101 outputs a plurality of images obtained by photographing a subject by continuous shooting or a moving image to an image conversion program. For the camera module 101, it is assumed that internal parameters such as a focal length, an optical axis shift, and a distortion parameter are known by prior calibration.

複数枚の画像は、１台のカメラで撮影されたものに限られず、複数の異なるカメラで異なる視点から撮影されたものであってもよい。また、それら複数枚の画像は、ユーザの所望の指示タイミングで撮影されたものであってもよい。これら撮影画像は、逐次的にバッファリングされるが、直前に入力された数枚（例えば５枚程度）のみに限定することによって使用メモリ量を節約することができる。 The plurality of images are not limited to those taken with one camera, and may be taken from different viewpoints with a plurality of different cameras. Further, the plurality of images may be taken at a user's desired instruction timing. These photographed images are sequentially buffered, but the memory usage can be saved by limiting to only a few (for example, about 5) inputted immediately before.

ディスプレイ１０２は、情報機器に組み込まれたものに限られず、外部に接続されたテレビや、パーソナルコンピュータ用ディプレイ、又はヘッドマウントディスプレイであってもよい。そのようなディスプレイには、平行化された俯瞰画像や、識別コードの識別データを表示することができる。 The display 102 is not limited to a display incorporated in an information device, and may be a television connected to the outside, a display for a personal computer, or a head mounted display. Such a display can display a parallel bird's-eye view image and identification code identification data.

図３は、バーコードが付与された被写体に対する斜め方向からの撮影画像である。
図４は、カメラを移動させながら撮影した複数枚の撮影画像である。 FIG. 3 is a photographed image from an oblique direction with respect to a subject to which a barcode is assigned.
FIG. 4 shows a plurality of captured images taken while moving the camera.

本発明によれば、被写体に、視覚認識可能な「識別コード」が付加されていることを要する。そのため、撮影画像には、被写体に付与された識別コードが写り込むこととなる。識別コードとしては、例えばバーコード（１次元コード）、ＱＲコード（２次元コード）、ＯＣＲ用文字又は線分のような幾何特徴画像である。本発明によれば、このような識別コードは、その識別データの読み取りを主目的としたものではなく、撮影画像の平行化を主目的としたものである。勿論、識別コードの識別データの読み取りの用途に適用するものであってもよい。 According to the present invention, it is necessary that an “identification code” that can be visually recognized is added to the subject. Therefore, the identification code assigned to the subject is reflected in the photographed image. The identification code is a geometric feature image such as a barcode (one-dimensional code), a QR code (two-dimensional code), an OCR character or a line segment. According to the present invention, such an identification code is not mainly intended for reading the identification data, but mainly intended for parallelization of the photographed image. Of course, it may be applied to the use of reading the identification data of the identification code.

画像変換プログラムは、点対応計算部１１と、平面射影変換行列計算部１２と、俯瞰画像候補計算部１３と、コード読取判定部１４とを有する。これら機能構成部は、装置に搭載されたコンピュータを機能させるプログラムを実行することによって実現される。尚、これら機能構成部は、画像変換装置の内部構造として理解できると共に、その処理の流れは、情報機器の画像変換方法としても理解できる。 The image conversion program includes a point correspondence calculation unit 11, a plane projection conversion matrix calculation unit 12, an overhead image candidate calculation unit 13, and a code reading determination unit 14. These functional components are realized by executing a program that causes a computer installed in the apparatus to function. These functional components can be understood as the internal structure of the image conversion apparatus, and the processing flow can also be understood as an image conversion method for information equipment.

［点対応計算部１１］
点対応計算部１１は、被写体（識別コードを含む）を撮影した複数の撮影画像それぞれから局所特徴量を抽出し、任意の代表画像の特徴点ｐと各従属画像の特徴点ｐ'との間の点対応Ｐ＝{(p, p')}を計算する。 [Point correspondence calculation unit 11]
The point correspondence calculation unit 11 extracts a local feature amount from each of a plurality of captured images obtained by capturing a subject (including an identification code), and between a feature point p of an arbitrary representative image and a feature point p ′ of each subordinate image. The point correspondence P = {(p, p ′)} is calculated.

点対応計算部１１は、ここでは最初に、複数枚の撮影画像の中から、１枚の代表画像を選択し、その他を「従属画像」とする。例えば図４によれば、最後に入力された撮影画像を「代表画像」として選択しているが、これに限定されるものではない。代表画像の選択は、様々な基準の下で決定されるものであってもよい。 Here, the point correspondence calculation unit 11 first selects one representative image from a plurality of photographed images, and designates the others as “subordinate images”. For example, according to FIG. 4, the last input photographed image is selected as the “representative image”, but the present invention is not limited to this. The selection of the representative image may be determined under various criteria.

局所特徴点の抽出アルゴリズムとしては、例えばＳＩＦＴ(Scale-Invariant Feature Transform)やＳＵＲＦ(Speeded Up Robust Features)、ＯＲＢ(Oriented FAST and Rotated BRIEF)が用いられる。これらの局所特徴点は、以下の要素によって記述される。
座標ｐ＝(x,y)、方向θ、局所特徴ベクトルf
尚、点対応を検出する撮影画像間では、その特徴ベクトルについて同じ次元数である。 For example, SIFT (Scale-Invariant Feature Transform), SURF (Speeded Up Robust Features), or ORB (Oriented FAST and Rotated BRIEF) is used as an extraction algorithm for local feature points. These local feature points are described by the following elements.
Coordinate p = (x, y), direction θ, local feature vector f
It should be noted that the feature vectors have the same number of dimensions between captured images for which point correspondence is detected.

例えば、ＳＩＦＴの場合、１枚の画像からは１２８次元の特徴点集合が抽出される。ＳＩＦＴとは、スケールスペースを用いて特徴的な局所領域を解析し、そのスケール変化及び回転に不変となる特徴ベクトルを記述する技術である。一方で、ＳＵＲＦの場合、ＳＩＦＴよりも高速処理が可能であって、１枚の画像から６４次元の特徴点集合が抽出される。また、ＯＲＢは、バイナリコードによる特徴記述としてＢＲＩＥＦ(Binary Robust Independent Elementary Features)を用いて、１つのコンテンツから２５６ビットのバイナリ特徴ベクトルの集合を抽出する。特に、ＯＲＢによれば、ＳＩＦＴやＳＵＲＦと比較して、同等以上の精度を保持すると共に、数百倍の高速化を実現することができる。 For example, in the case of SIFT, a 128-dimensional feature point set is extracted from one image. SIFT is a technique for analyzing a characteristic local region using a scale space and describing a feature vector that is invariant to scale change and rotation. On the other hand, in the case of SURF, higher-speed processing is possible than in SIFT, and a 64-dimensional feature point set is extracted from one image. In addition, the ORB extracts a set of 256-bit binary feature vectors from one content by using BRIEF (Binary Robust Independent Elementary Features) as a feature description by a binary code. In particular, according to the ORB, it is possible to maintain an accuracy equal to or higher than that of SIFT or SURF and realize a speed increase of several hundred times.

次に、点対応計算部１１は、代表画像の特徴点集合と従属画像の特徴点集合とをマッチングし、局所特徴点を点対応させる。代表画像の特徴点ｐに対して、各従属画像の特徴点ｐ'を対応付ける。２つの特徴点ｐ'とｐとの間の距離が短いほど、類似度が高い。 Next, the point correspondence calculation unit 11 matches the feature point set of the representative image with the feature point set of the subordinate image, and associates the local feature points with the points. The feature point p ′ of each subordinate image is associated with the feature point p of the representative image. The shorter the distance between the two feature points p ′ and p, the higher the similarity.

また、点対応計算部１１は、点対応計算の精度や処理速度を向上させるために、以下の方法を用いることも好ましい。
（１）局所特徴点間の距離に基づいて点対応をソートし、距離が所定閾値以下の点対応のみを用いる。
（２）局所特徴点間の距離に基づいて点対応をソートし、距離が１番目に近いものと２番目に近いものを探索し、それらの距離の比（２番目との距離に対する１番目との距離）が所定閾値以下のものを用いる。
（３）その他、計算コストに優れるバイナリ特徴量や、ＳＳＤ(Sum of Squared Difference)、正規化相互相関（ＮＣＣ）等も用いることもできる。 The point correspondence calculation unit 11 preferably uses the following method in order to improve the accuracy and processing speed of the point correspondence calculation.
(1) Sort point correspondences based on the distance between local feature points, and use only point correspondences whose distance is equal to or less than a predetermined threshold.
(2) Sort the point correspondences based on the distance between the local feature points, search for the closest distance and the second closest distance, and the ratio of those distances (the first and the second distance to the second distance) ) Is less than a predetermined threshold.
(3) In addition, binary feature quantities that are excellent in calculation cost, SSD (Sum of Squared Difference), normalized cross correlation (NCC), and the like can also be used.

そして、点対応計算部１１は、代表画像と従属画像各々との間における特徴点の点対応を、平面射影変換行列計算部１２へ出力する。 Then, the point correspondence calculation unit 11 outputs the point correspondence of the feature points between the representative image and each of the subordinate images to the planar projective transformation matrix calculation unit 12.

［平面射影変換行列計算部１２］
平面射影変換行列計算部１２は、点対応毎に、代表画像と各従属画像との間の平面射影変換行列を計算する。 [Plane Projection Transformation Matrix Calculation Unit 12]
The plane projection transformation matrix calculation unit 12 calculates a plane projection transformation matrix between the representative image and each subordinate image for each point correspondence.

平面射影変換行列計算部１２は、具体的は、点対応Ｐ＝{(p, p')}集合から、例えばＲＡＮＳＡＣ(RAndom SAmple Consensus)のようなロバスト推定アルゴリズムを用いて、従属画像を代表画像へ変換する平面射影変換行列を算出する。これによって、誤った点対応を除去することができる。変換行列は、好ましくは「Homography行列」であって、最低４組の点対応が必要である。４組全ての点対応が、inlierでなければ正解のHomography行列が得られない。 Specifically, the plane projective transformation matrix calculation unit 12 uses the robust correspondence algorithm such as RANSAC (RAndom SAmple Consensus) from the point correspondence P = {(p, p ′)} set to represent the subordinate image as the representative image. A plane projective transformation matrix to be converted to is calculated. As a result, erroneous point correspondences can be removed. The transformation matrix is preferably a “Homography matrix” and requires at least four pairs of points. If all four pairs of points are not inlier, a correct Homography matrix cannot be obtained.

図５は、Homography行列に基づくinlier及びoutlierを表す画像対応図である。 FIG. 5 is an image correspondence diagram showing inliers and outliers based on the Homography matrix.

算出されたHomography行列を用いて従属画像の特徴点集合を射影し、代表画像の特徴点とのユークリッド距離が所定閾値以下の対応組を正(inlier)として判定し、それ以外を否(outlier)として判定する。正(inlier)と判定された対応組数が所定閾値以上である場合、当該代表画像が検出されたと判定し、そのHomography行列を採用する。逆に、正(inlier)と判定された対応組数が所定閾値よりも少ない場合、未検出と判定し、そのHomography行列を除去する。 The feature point set of the subordinate image is projected using the calculated Homography matrix, and the corresponding pair whose Euclidean distance from the feature point of the representative image is equal to or less than a predetermined threshold is determined as positive (inlier), and the other is out (outlier) Judge as. If the number of corresponding pairs determined to be inlier is equal to or greater than a predetermined threshold, it is determined that the representative image has been detected, and the Homography matrix is adopted. On the contrary, when the number of corresponding pairs determined to be inlier is smaller than a predetermined threshold, it is determined as not detected, and the homography matrix is removed.

平面射影変換行列であるHomography行列Ｈは、以下のように表される（例えば非特許文献４参照）。これは、従属画像の特徴点ｐ'＝(ｘ₁,ｙ₁)と、代表画像の特徴点ｐ＝(ｘ₂,ｙ₂)との関係を表す。
A Homography matrix H that is a planar projective transformation matrix is expressed as follows (for example, see Non-Patent Document 4). This represents the relationship between the feature point p ′ = (x ₁ , y ₁ ) of the subordinate image and the feature point p = (x ₂ , y ₂ ) of the representative image.

Homography行列の算出には、代表画像の特徴点集合と従属画像の特徴点集合とが用いられる。Homography行列Ｈの未知パラメータ数は、８個（h0〜h7、h8=1）であり、一組の対応点は２個の制約式を与える。従って、この行列Ｈは、４組以上の対応点があれば、最小二乗法によって算出することができる。このようなカメラ姿勢を推定する技術は、ＡＲシステムでは一般的なものである。 For the calculation of the Homography matrix, the feature point set of the representative image and the feature point set of the subordinate image are used. The number of unknown parameters in the Homography matrix H is 8 (h0 to h7, h8 = 1), and one set of corresponding points gives two constraint equations. Therefore, this matrix H can be calculated by the least square method if there are four or more pairs of corresponding points. Such a technique for estimating the camera posture is common in the AR system.

そして、Homography行列Ｈを用いて、各撮影画像特徴点を射影した際に、以下のように判定する。
（１）目標画像の特徴点に対して所定閾値以下の近くに射影されれば、inlierと判定する。
（２）逆に、所定閾値よりも遠くに射影されれば、outlierと判定する。図５によれば、outlierは、破線で表されている。
この処理を複数回実行した後、inlierの数が所定閾値以上となったHomography行列Ｈのみを採用する。 Then, when each captured image feature point is projected using the Homography matrix H, the determination is made as follows.
(1) If it is projected close to a predetermined threshold or less with respect to the feature point of the target image, it is determined as inlier.
(2) Conversely, if it is projected farther than the predetermined threshold, it is determined as outlier. According to FIG. 5, outlier is represented by a broken line.
After this process is executed a plurality of times, only the Homography matrix H in which the number of inliers is equal to or greater than a predetermined threshold is employed.

そして、以下のような関係式で、平面射影変換行列Ｈ_jが表される。
Ｉ_i (i=1,2,3…,N)：N枚の撮影座標
ｍ_i (i=1,2,3…,N)：撮影座標Ｉ_iのピクセル座標系
Ｉ₁ ：代表画像
Ｋ：カメラの内部パラメータ
ｘ_i ：Ｋを用いて正規化されたピクセル座標系
関係式：ｘ_i〜Ｋ^-1ｍ_i
（ｍ_i及びｘ_iは、それぞれの同次座標表現）
〜：スケール不定
そして、関係式ｘ_j〜Ｈ_jｘ₁（j＝2,3…,N)を満たす平面射影変換行列Ｈ_jを算出する。 The planar projective transformation matrix H _j is represented by the following relational expression.
I _i (i = 1, 2, 3..., N): N shooting coordinates m _i (i = 1, 2, 3..., N): Pixel coordinate system of shooting coordinates I _i I ₁ : Representative image K: Camera internal parameters x _i : Pixel coordinate system normalized using K Relational expressions: x _{i to} K ⁻¹ m _i
(M _i and x _i are each homogeneous coordinate representation)
~: Scale indefinite And a plane projection transformation matrix H _j that satisfies the relational expressions x _{j to} H _j x ₁ (j = 2, 3,..., N) is calculated.

このような処理を、代表画像と各従属画像とについて繰り返す。従属画像に映る代表画像として検出される毎に、その平面射影変換行列Ｈが出力される。 Such processing is repeated for the representative image and each subordinate image. Each time it is detected as a representative image shown in the subordinate image, its planar projective transformation matrix H is output.

［俯瞰画像候補計算部１３］
俯瞰画像候補計算部１３は、各平面射影変換行列を用いて平行化パラメータを推定し、各平行化パラメータを用いて代表画像を俯瞰画像へ変換した複数の俯瞰画像候補を出力する。 [Overhead Image Candidate Calculation Unit 13]
The overhead image candidate calculation unit 13 estimates a parallelization parameter using each planar projection transformation matrix, and outputs a plurality of overhead image candidates obtained by converting a representative image into an overhead image using each parallelization parameter.

Ｉ₀：被写体の俯瞰画像
ｍ₀：俯瞰画像Ｉ₀のピクセル座標系
ｘ₀：俯瞰画像Ｉ₀の正規化ピクセル座標系
そして、関係式ｘ₁〜Ｃｘ₀を満たす平行化行列Ｃを定義する。
また、平行化行列Ｃの逆行列Ｃ^-1によって、代表画像Ｉ₁を俯瞰画像Ｉ₀に変換する。 I _0: overhead image m of the object _0: overhead image I pixel coordinate system x ₀ _0: normalized pixel coordinate system of the bird's-eye view image I ₀ and defines a collimating matrix C satisfying the relation x ₁ ~Cx _0.
Further, the representative image I ₁ is converted into the overhead image I ₀ by the inverse matrix C ⁻¹ of the parallelization matrix C.

平行化行列Ｃは、２軸の回転行列によって構成され、以下のように表される。
Ｃ（φ₁,φ₂）＝Ｒ_Z（φ₁）Ｒ_Z（φ₂）
（φ₁,φ₂）：各軸の回転量を表す平行化パラメータ
Ｈ_jについて、コスト関数を以下のように定義する。
ｅ_j（φ₁,φ₂）＝（ｈ₁₂／ｈ₁₁）²＋（ｈ₂₂／ｈ₁₁−１）² where (Ｃ^TＨ_j ^T)(Ｈ_j ^TＣ^T)
＝｛ｈ_uv｝ The parallelization matrix C is constituted by a biaxial rotation matrix and is expressed as follows.
C (φ ₁ , φ ₂ ) = R _Z (φ ₁ ) R _Z (φ ₂ )
(Φ ₁ , φ ₂ ): A cost function is defined as follows for the parallelization parameter H _j representing the rotation amount of each axis.
e _j (φ ₁ , φ ₂ ) = (h ₁₂ / h ₁₁ ) ² + (h ₂₂ / h ₁₁ −1) ² where (C ^T H _j ^T ) (H _j ^T C ^T )
= {H _uv }

平行化パラメータ（φ₁,φ₂）は、入力された全ての平面射影変換行列について計算されたコスト値の総和を最小にするものとして、探索される。
argmin_φ1,φ2（Ｅ（φ₁,φ₂））＝Σ_j=2 ^Nｅ_j（φ₁,φ₂）
平行化パラメータΦ＝（φ₁,φ₂）の解の範囲：［(-π/2,π/2)、(-π/2,π/2)］ The parallelization parameters (φ ₁ , φ ₂ ) are searched for as minimizing the sum of the cost values calculated for all input planar projective transformation matrices.
_{argmin φ1, φ2 (E (φ} 1, φ 2)) = Σ j = 2 N e j (φ 1, φ 2)
Range of solution of parallelization parameter Φ = (φ ₁ , φ ₂ ): [(-π / 2, π / 2), (-π / 2, π / 2)]

このパラメータ空間について、大域的な最適解を探索する。 A global optimal solution is searched for this parameter space.

（Ｓ１３１）最初に、一定間隔（例えば10°）毎に、上式に従ってコスト値をサンプリングする。その結果、各要素にコスト値を持つコスト値行列が得られる。 (S131) First, the cost value is sampled according to the above equation at regular intervals (for example, 10 °). As a result, a cost value matrix having cost values for each element is obtained.

図６は、離散化されたコスト値を白黒の濃淡で表すコスト値行列である。黒は、Ｅ＝０値を表す。 FIG. 6 is a cost value matrix representing the discretized cost values with black and white shading. Black represents E = 0 value.

（Ｓ１３２）次に、俯瞰画像候補計算部１３は、平行化パラメータ空間について局所最小解と判定された複数の平行化パラメータを推定する。各要素の値を、周囲の８つの隣接要素と比較することによって「局所最小解」を検出する。計算コストを削減するため、上下左右の４つの隣接要素と比較してもよい。局所最小解の各パラメータについて、それを初期値として、最適化手法によってコスト値をリファインする。例えば、滑降シンプレックス法を用いることができる。図６によれば、３つのリファインされた局所最小解Φ_１、Φ_２及びΦ_３が表されている。 (S132) Next, the overhead image candidate calculation unit 13 estimates a plurality of parallelization parameters determined to be the local minimum solution for the parallelization parameter space. The “local minimum solution” is detected by comparing the value of each element with the surrounding eight neighboring elements. In order to reduce the calculation cost, it may be compared with four adjacent elements on the top, bottom, left and right. For each parameter of the local minimum solution, the initial value is used to refine the cost value by an optimization method. For example, the downhill simplex method can be used. According to FIG. 6, three refined local minimum solutions Φ ₁ , Φ ₂ and Φ ₃ are represented.

（Ｓ１３３）そして、俯瞰画像候補計算部１３は、複数の局所最小解それぞれについて、各平行化パラメータを用いて次式によって、代表画像を俯瞰画像に変換し、複数の俯瞰画像候補を出力する。
ｍ₀〜（ＫＣ（φ₁,φ₂）Ｋ^-1）^-1ｍ₁
本発明の特徴は、俯瞰画像候補計算部１３が、複数の俯瞰画像候補を出力することにある。 (S133) Then, for each of the plurality of local minimum solutions, the bird's-eye view image candidate calculation unit 13 converts the representative image into a bird's-eye view image using each parallelization parameter and outputs a plurality of bird's-eye view image candidates.
m ₀ to (KC (φ ₁ , φ ₂ ) K ⁻¹ ) ⁻¹ m ₁
A feature of the present invention is that the overhead image candidate calculation unit 13 outputs a plurality of overhead image candidates.

例えば非特許文献１〜３に記載の技術（図１参照）によれば、俯瞰画像候補計算部は、１つの局所最小解を導出しようとする。しかしながら、平行化の精度が、撮影画像の枚数や撮影画像間の点対応計算の精度に大きく依存し、誤った俯瞰画像候補が得られる場合がある。これは、誤った平行化パラメータが最適解として判定される可能性があったことによる。そこで、本発明の俯瞰画像候補計算部１３は、算出された複数の局所最小解それぞれについて、俯瞰画像を候補として出力する。 For example, according to the techniques described in Non-Patent Documents 1 to 3 (see FIG. 1), the overhead image candidate calculation unit attempts to derive one local minimum solution. However, the accuracy of parallelization largely depends on the number of captured images and the accuracy of point correspondence calculation between captured images, and an erroneous overhead image candidate may be obtained. This is because an incorrect parallelization parameter may be determined as an optimal solution. Therefore, the overhead image candidate calculation unit 13 of the present invention outputs an overhead image as a candidate for each of the plurality of calculated local minimum solutions.

尚、本発明の俯瞰画像候補計算部１３は、必ずしも全ての平行化パラメータに対応する俯瞰画像候補を出力する必要もない。例えば、以下のように選択するものであってもよい。
（選択１）俯瞰画像候補計算部１３は、平行化パラメータ空間の中で一定距離以下に存在する複数の平行化パラメータが推定された場合、いずれか１つの平行パラメータのみを選択する。平行化パラメータは２次元であるため、平行化パラメータ間の距離をユークリッド距離によって定めてもよい。
（選択２）俯瞰画像候補計算部１３は、俯瞰画像候補同士の間で類似度を算出し、当該類似度が所定閾値以上となる俯瞰画像候補のみを選択する。尚、類似度としては、例えばＳＳＤ(Sum of Squared Difference)や正規化相互相関（ＮＣＣ）も用いてもよい。 Note that the overhead image candidate calculation unit 13 of the present invention does not necessarily need to output the overhead image candidates corresponding to all the parallelization parameters. For example, you may select as follows.
(Selection 1) The bird's-eye view image candidate calculation unit 13 selects only one parallel parameter when a plurality of parallelization parameters existing within a certain distance in the parallelization parameter space are estimated. Since the parallelization parameter is two-dimensional, the distance between the parallelization parameters may be determined by the Euclidean distance.
(Selection 2) The overhead image candidate calculation unit 13 calculates the similarity between the overhead image candidates, and selects only the overhead image candidates whose similarity is equal to or greater than a predetermined threshold. As the similarity, for example, SSD (Sum of Squared Difference) or normalized cross correlation (NCC) may be used.

［コード読取判定部１４］
コード読取判定部１４は、複数の俯瞰画像候補の中から、識別コードの画像的な読み取りに成功した俯瞰画像を、平行化された俯瞰画像として出力する。即ち、コード読取判定部１４は、コードリーダ（バーコードリーダ、ＱＲコードリーダ等）の機能を備えている。 [Code reading determination unit 14]
The code reading determination unit 14 outputs a bird's-eye view image that has succeeded in image-wise reading of the identification code from among a plurality of bird's-eye view image candidates as a parallelized bird's-eye view image. That is, the code reading determination unit 14 has a function of a code reader (bar code reader, QR code reader, etc.).

図７は、各平行化パラメータに対応して表された３つの俯瞰画像候補である。 FIG. 7 shows three bird's-eye view image candidates corresponding to each parallelization parameter.

コード読取判定部１４は、複数の俯瞰画像候補それぞれについて、識別コードに対する「読み取りの成功／失敗」（及び信頼値）を判定する。コード読取判定部１４は、複数の俯瞰画像候補の中で、「読み取り成功」となった俯瞰画像を出力する。また、「読み取り成功」であって、且つ、最も信頼値が高い俯瞰画像を出力することも好ましい。「信頼値」とは、コードリーダから出力される、認識結果（読み取ったコードデータ）の確からしさをいう（例えば非特許文献５又は６参照）。例えば、図７の例によれば、Φ_１が俯瞰画像として出力される。 The code reading determining unit 14 determines “reading success / failure” (and reliability value) for the identification code for each of the plurality of overhead image candidates. The code reading determination unit 14 outputs an overhead image that has been “read successfully” among a plurality of overhead image candidates. It is also preferable to output a bird's-eye view image that is “reading success” and has the highest reliability value. “Reliability value” refers to the certainty of the recognition result (read code data) output from the code reader (see, for example, Non-Patent Document 5 or 6). For example, according to the example of FIG. 7, Φ ₁ is output as an overhead image.

図８は、識別コードを読み取った識別データに対するアプリケーションの画面表示図である。 FIG. 8 is a screen display of an application for the identification data obtained by reading the identification code.

図８によれば、コード読取判定部１４によって出力された俯瞰画像が、端末１のディスプレイ１０２に表示されている。また、コード読取判定部１４によって、識別コードの画像的な読み取りが成功したその識別データが、アプリケーションによって表示されている。その識別データが、例えばＵＲＬ(Uniform Resource Locator)である場合、ユーザのタップによってそのＵＲＬへアクセスすることもできる。 According to FIG. 8, the overhead image output by the code reading determination unit 14 is displayed on the display 102 of the terminal 1. In addition, the identification data on which the identification code has been successfully read by the code reading determination unit 14 is displayed by the application. When the identification data is, for example, a URL (Uniform Resource Locator), the URL can be accessed by a user's tap.

以上、詳細に説明したように、本発明の画像変換プログラム、装置及び方法によれば、カメラによる撮影画像を高精度に平行化することができる。 As described above in detail, according to the image conversion program, apparatus, and method of the present invention, it is possible to parallelize a captured image by a camera with high accuracy.

前述した本発明の種々の実施形態について、本発明の技術思想及び見地の範囲の種々の変更、修正及び省略は、当業者によれば容易に行うことができる。前述の説明はあくまで例であって、何ら制約しようとするものではない。本発明は、特許請求の範囲及びその均等物として限定するものにのみ制約される。 Various changes, modifications, and omissions of the above-described various embodiments of the present invention can be easily made by those skilled in the art. The above description is merely an example, and is not intended to be restrictive. The invention is limited only as defined in the following claims and the equivalents thereto.

１端末
１０１カメラ
１０２ディスプレイ
１１点対応計算部
１２平面射影変換行列計算部
１３俯瞰画像候補計算部
１４コード読取判定部 DESCRIPTION OF SYMBOLS 1 Terminal 101 Camera 102 Display 11 Point correspondence calculation part 12 Planar projection transformation matrix calculation part 13 Overhead image candidate calculation part 14 Code reading determination part

Claims

In an image conversion program that causes a computer to function to parallelize a captured image,
The photographed image includes an identification code given to the subject,
Point correspondence calculation means for calculating point correspondence of feature points between an arbitrary representative image and each subordinate image for a plurality of captured images;
A plane projection transformation matrix calculating means for calculating a plane projection transformation matrix between the representative image and each subordinate image for each point correspondence;
An overhead image candidate calculation means for estimating a parallelization parameter using each planar projection transformation matrix and outputting a plurality of overhead image candidates obtained by converting a representative image into an overhead image using each parallelization parameter;
An image conversion program that causes a computer to function as a code reading determination unit that outputs a bird's-eye view image that has been successfully image-reading an identification code from among a plurality of bird's-eye view image candidates.

The image conversion program according to claim 1, wherein the overhead image candidate calculation unit causes the computer to function to estimate a plurality of parallelization parameters determined to be local minimum solutions for the parallelization parameter space.

The overhead image candidate calculation means causes the computer to function so as to select only one parallel parameter when a plurality of parallel parameters existing within a certain distance in the parallel parameter space are estimated. The image conversion program according to claim 1 or 2, characterized in that

The bird's-eye view image candidate calculation means calculates a similarity between the bird's-eye view image candidates, and causes the computer to function so as to select only the bird's-eye view image candidates whose similarity is equal to or greater than a predetermined threshold. The image conversion program according to any one of 1 to 3.

5. The computer according to claim 1, wherein the identification code is a bar code, a QR (Quick Response) code, an OCR (Optical Character Recognition) character, or a geometric feature image such as a line segment. The image conversion program according to any one of the above.

2. The code reading determination unit causes the computer to function so as to output an overhead image having the highest reliability value among the overhead image candidates for which the identification code has been successfully image-read. 6. The image conversion program according to any one of items 5.

The image according to any one of claims 1 to 6, wherein the code reading determination unit causes the computer to function so as to output the identification data of which the identification code has been successfully read as an image to an application. Conversion program.

In an image conversion device that parallelizes a captured image,
The photographed image includes an identification code given to the subject,
Point correspondence calculation means for calculating point correspondence of feature points between an arbitrary representative image and each subordinate image for a plurality of captured images;
A plane projection transformation matrix calculating means for calculating a plane projection transformation matrix between the representative image and each subordinate image for each point correspondence;
An overhead image candidate calculation means for estimating a parallelization parameter using each planar projection transformation matrix and outputting a plurality of overhead image candidates obtained by converting a representative image into an overhead image using each parallelization parameter;
An image conversion apparatus comprising: a code reading determination unit that outputs, from a plurality of bird's-eye view image candidates, a bird's-eye view image that has been successfully read as an identification code as a parallel bird's-eye view image.

In an image conversion method for parallelizing a captured image using a user-operated information device,
The photographed image includes an identification code given to the subject,
A first step of calculating a point correspondence of feature points between an arbitrary representative image and each subordinate image for a plurality of captured images;
A second step of calculating a planar projective transformation matrix between the representative image and each dependent image for each point correspondence;
A third step of estimating a parallelization parameter using each planar projective transformation matrix and outputting a plurality of overhead image candidates obtained by converting a representative image into an overhead image using each parallelization parameter;
And a fourth step of outputting, from the plurality of bird's-eye view image candidates, the bird's-eye view image that has succeeded in image-wise reading of the identification code as a parallel bird's-eye view image, .