JPH1139506A

JPH1139506A - Optional view point image generator

Info

Publication number: JPH1139506A
Application number: JP9191399A
Authority: JP
Inventors: Senguputa Kuntaru; セングプタクンタル; Tatsuki Sakaguchi; 竜己坂口; Atsushi Otani; 淳大谷
Original assignee: ATR CHINO EIZO TSUSHIN KENKYUS; ATR CHINO EIZO TSUSHIN KENKYUSHO KK
Current assignee: ATR CHINO EIZO TSUSHIN KENKYUS; ATR CHINO EIZO TSUSHIN KENKYUSHO KK
Priority date: 1997-07-16
Filing date: 1997-07-16
Publication date: 1999-02-12
Anticipated expiration: 2017-07-16
Also published as: JP3122629B2

Abstract

PROBLEM TO BE SOLVED: To reduce coordinate errors and to generate images stably regardless of where in space the viewpoint is located, by transforming coordinates in an affine coordinate system into affine coordinates at a virtual viewpoint position using pinhole projection and generating the image at the virtual viewpoint position. SOLUTION: An image set A is captured (50) and the reference vectors of the affine coordinates are created (58). An image set B is captured and used for weak calibration of the cameras (60). The scene from which the arbitrary viewpoint image is to be generated is then captured to obtain an image set C; correspondences between the images are found by stereo matching using the result of process 60 (62); the points are transformed into affine coordinates (64); and the affine coordinates at the virtual camera position are calculated (66) from virtual camera parameters 56 stored in advance. These are transformed into a camera view (68) to form the image whose viewpoint is the virtual camera position. Changing the virtual camera parameters 56 then yields an image from any desired position.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to an apparatus for generating an image of a real space viewed from an arbitrary viewpoint, and in particular to an arbitrary viewpoint image generating apparatus that can easily generate images of a real space in real time from stereo images.

[0002]

2. Description of the Related Art

In recent years, video communication between remote locations has become increasingly important. However, it is difficult for existing video conferencing systems to provide images that keep users from being conscious of the distance between them.

[0003] This problem can be solved by creating an environment in which users at remote locations feel as if they coexist in a single real world. This requires generating a view image of the real space as seen from an arbitrary viewpoint, and doing so in real time.

[0004] One such technique generates arbitrary-view images from stereo images. Conventional methods of this kind generally reconstruct a three-dimensional image from the stereo images and then generate an image for an arbitrary viewpoint from that three-dimensional image. To do so, some methods solve the three-dimensional reconstruction problem using strong camera calibration. Others solve it by assuming that the correspondence between images is a three-dimensional affine transformation or a perspective projection transformation. The object expressed in the three-dimensional coordinate system is then projected onto a new image to generate the image seen from the arbitrary viewpoint.

[0005] A more direct approach to scene reprojection and new view image synthesis is the epipolar intersection method. More recently, a method has also been proposed that, given two images, uses so-called morphing to synthesize an image from an arbitrary position on the line connecting the optical centers of the two cameras.

[0006]

[Problems to Be Solved by the Invention] However, methods that project a new image after reconstructing the three-dimensional structure are sensitive to image measurement errors and to any other disturbance arising during the reconstruction, so the results they produce are also unstable. The conventional method based on epipolar intersection requires strong calibration of the cameras, which is difficult to perform accurately. It also requires eight corresponding points among at least three images, and because the measured coordinates of each of these points contain errors, it incurs larger coordinate errors at reprojection than a method that uses fewer corresponding points. The method based on morphing avoids the troublesome problem of stereo matching, but it cannot be extended to image synthesis from a virtual camera at an arbitrary position. That is, it can be used only when the virtual camera lies on the straight line connecting the camera centers of the two real cameras.

SUMMARY OF THE INVENTION

[0007] It is therefore an object of the present invention to provide an arbitrary viewpoint image generating apparatus that has small coordinate errors and can generate images stably wherever in space the viewpoint is located.

[0008]

[Means for Solving the Problems] The arbitrary viewpoint image generating apparatus according to claim 1 of the present application includes: image acquisition means for acquiring stereo images; means for creating reference vectors of a predetermined affine coordinate system based on the stereo images acquired by the image acquisition means; means for weakly calibrating the relative position of the viewpoints at the time of image acquisition, based on the stereo images acquired by the image acquisition means; means for establishing point correspondences between the stereo images acquired by the image acquisition means, based on the result of the weak calibration; first conversion means for converting the coordinates of each point of the stereo images into coordinates in the affine coordinate system, based on the reference vectors and the output of the correspondence means; second conversion means for converting, based on virtual viewpoint position data, the coordinates in the affine coordinate system into affine coordinates at the virtual viewpoint position using pinhole projection; and means for generating an image at the virtual viewpoint position from the output of the second conversion means.

[0009] After the reference vectors for the affine transformation are created from stereo images, the relative position of the viewpoints at which the stereo images to be processed were acquired is weakly calibrated. Point correspondences between the stereo images are then established, the coordinates of each point are affine-transformed using the reference vectors already obtained, and the result is converted into affine coordinates at the virtual viewpoint position using pinhole projection. An image at the virtual viewpoint position is generated from the affine coordinates thus obtained.

[0010] Because weak calibration is used, images can be reprojected more directly than with three-dimensional reconstruction, fewer reference points are needed, and measurement error is reduced, so more stable results are obtained. Moreover, since this processing places no restriction on the viewpoint position, an image can be generated wherever the viewpoint is located in space.

[0011]

BEST MODE FOR CARRYING OUT THE INVENTION

[Principle of the Present Invention] First, the principle of the arbitrary viewpoint image generating apparatus of the present invention is described. The present invention is based on a very simple principle proposed as an object recognition technique in D. W. Jacobs, "Space Efficient 3D Model Indexing" (Proceedings of CVPR, pp. 439-444, 1992). That paper states that a pair of two-dimensional images obtained by imaging a three-dimensional rigid model is optimally represented as two straight lines in a high-dimensional α-β affine space. When this approach is adopted, an orthographic projection could be assumed as the camera projection model, but, as described later, the resulting images are then degraded. The present invention therefore adopts the pinhole camera model for projection in order to maintain the quality of the resynthesized images.

[0012] The inventors adopted the pinhole camera model for projection and examined the properties of the affine space at corresponding points between two images captured by a stereo camera. The properties thus obtained are used as a scene reprojection technique for applications such as the synthesis of new view images.

[0013] [Camera Calibration Models] This section briefly explains strong camera calibration and the weak camera calibration used in the present invention.

[0014] (1) Strong camera calibration

In FIG. 1, projecting a point P in space onto a point p in the image plane on the basis of strong camera calibration requires computing a 3×4 transformation matrix C. If the homogeneous coordinates of P are [X Y Z 1] and the coordinates of p are (u, v), the following equation holds:

[tu tv t]' = C [X Y Z 1]' … (1)

where t is a parameter.
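The projection in equation (1) can be illustrated with a minimal sketch. It assumes the usual homogeneous convention in which the image coordinates are obtained by dividing the first two components of C [X Y Z 1]' by the third; the function name `project` and the example matrix are hypothetical and are not taken from the patent.

```python
def project(C, P):
    """Project the 3-D point P = (X, Y, Z) with a 3x4 matrix C and return (u, v)."""
    X, Y, Z = P
    h = (X, Y, Z, 1.0)  # homogeneous coordinates of P
    tu, tv, t = (sum(c * x for c, x in zip(row, h)) for row in C)
    if t == 0:
        raise ValueError("point projects to infinity")
    return tu / t, tv / t  # divide out the parameter t


# Hypothetical camera at the origin looking along +Z with focal length 2.
C = [
    [2.0, 0.0, 0.0, 0.0],
    [0.0, 2.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
]
print(project(C, (1.0, 2.0, 4.0)))  # (0.5, 1.0)

# Multiplying C by any nonzero factor leaves (u, v) unchanged, which is
# why only eleven of the twelve entries need to be estimated.
C_scaled = [[3.0 * c for c in row] for row in C]
print(project(C_scaled, (1.0, 2.0, 4.0)))  # (0.5, 1.0)
```

The second call shows the scale ambiguity that the following paragraph relies on.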

[0015] Because the overall scale factor of the transformation matrix C is immaterial, calibration amounts to estimating eleven elements of C. Each known point supplies two equations, so the strong camera calibration parameters can be estimated from the mappings of at least six points whose three-dimensional coordinates are known.

[0016] Given the stereo pair shown in FIG. 2, and assuming that the strong camera calibration parameters of both cameras are known, if a point (u_1, v_1) in the first image corresponds to a point (u_2, v_2) in the second image, the three-dimensional coordinates of the corresponding point P can be estimated from the calibration parameters by standard triangulation. The point P can then be reprojected into an arbitrary image using the projection matrix of a virtual camera. However, if the cameras are not properly calibrated, measurement errors arise in the calibration process, and erroneous optical centers C1', C2' are set in place of the true optical centers C1, C2. During three-dimensional reconstruction, the coordinates of point P are then wrongly estimated as those of point P', as shown by the dotted lines in FIG. 2. This in turn causes errors in the reprojection performed when generating an image from a new viewpoint.

[0017] (2) Weak camera calibration

In contrast, the weak camera calibration used in the present invention is a technique for determining the epipolar geometry between two stereo images. That is, weak calibration means calibrating the relative position of the viewpoints at the time of image acquisition. As shown in FIG. 3, if (u_1, v_1) are the coordinates of a point p in the first image, the epipolar line L in the second image can be calculated using a 3×3 matrix F. The point in the second image that corresponds to p lies on this epipolar line. If the epipolar line is expressed as l_1 u_2 + l_2 v_2 + l_3 = 0, the relationship is given by

[l_1 l_2 l_3]' = F [u_1 v_1 1]' … (2)

Solving for F up to a scale factor requires eight point correspondences between the two images. Using this matrix F, the stereo matching problem of finding point correspondences between images can be solved. In the present method, the correspondences between the given images are obtained by correlation matching with the search range restricted to the epipolar line. The result of this weak camera calibration is also used when generating an image from a new viewpoint.
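As a small sketch of how equation (2) restricts the stereo search, the following computes the epipolar line for a point in the first image and measures how far a candidate match in the second image lies from that line. The helper names and the example matrix F (which describes an idealized pair whose epipolar lines are horizontal rows) are assumptions made for illustration, not values from the patent.

```python
def epipolar_line(F, p):
    """Return (l1, l2, l3) = F [u1 v1 1]' for the point p = (u1, v1)."""
    u, v = p
    h = (u, v, 1.0)
    return tuple(sum(f * x for f, x in zip(row, h)) for row in F)


def distance_to_line(line, q):
    """Perpendicular distance from q = (u2, v2) to l1*u2 + l2*v2 + l3 = 0."""
    l1, l2, l3 = line
    u, v = q
    return abs(l1 * u + l2 * v + l3) / (l1 * l1 + l2 * l2) ** 0.5


# Hypothetical F for a pair related by a purely horizontal shift:
# every epipolar line is the image row with the same v coordinate.
F = [
    [0.0, 0.0, 0.0],
    [0.0, 0.0, -1.0],
    [0.0, 1.0, 0.0],
]
line = epipolar_line(F, (3.0, 5.0))
print(line)                                  # (0.0, -1.0, 5.0), i.e. v2 = 5
print(distance_to_line(line, (10.0, 5.0)))   # 0.0, on the line
print(distance_to_line(line, (10.0, 7.0)))   # 2.0, two pixels off the line
```

A correlation matcher would evaluate only candidates whose distance to the line is close to zero.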

[0018] As described later, the reprojected coordinates of a point calculated with this technique are more stable against errors in measuring the coordinates of the reference points than those obtained by reprojection based on strong camera calibration.

[0019] [Properties of Affine Coordinates] In the present invention, each point is reprojected using an affine transformation. Here the idea of reprojection based on affine coordinates is examined for two cases, parallel projection and pinhole projection. The present invention uses the pinhole projection approach. In the parallel projection case, points may lie anywhere in the scene and the camera projection is assumed to be orthographic. In the pinhole case, points may lie anywhere in the scene and the camera projection is assumed to be a pinhole projection.

[0020] (1) Parallel projection

A method has already been proposed that simplifies the camera projection model into a parallel projection onto a plane followed by an affine transformation. Referring to FIG. 4, given a set of points (P_1, P_2, ..., P_n), consider a virtual plane (reference plane) 20 passing through the three points P_1, P_2, P_3. Let p_4' be the foot of the perpendicular dropped from point P_4 to the reference plane 20. The affine coordinates of p_4' with respect to (P_1, P_2, P_3) are (a_4, b_4). Similarly, let p_i' be the foot of the perpendicular dropped from the i-th point to the reference plane 20, with affine coordinates (a_i, b_i). Further, let d_4 and d_i be the distances from points P_4 and P_i, respectively, to the reference plane 20.

[0021] The affine coordinates (α_4, β_4) are those of the point onto which P_4 is projected on the reference plane 20 when viewed from the viewpoint. Let p_b4 be the point having affine coordinates (α_4, β_4) with respect to (P_1, P_2, P_3). The line 24 connecting p_b4 and P_4 is the viewing direction. This line intersects the image plane 22 (whose normal is parallel to the line of sight) at point q_4, so q_4 is the image of P_4. Similarly, (P_1, P_2, P_3) are projected to (q_1, q_2, q_3) on the image plane 22. When (q_1, q_2, q_3) is chosen as the basis, the point q_4 has affine coordinates (α_4, β_4), and this remains true even if an affine transformation (translation, rotation, scaling, and so on) is applied to the image plane 22.
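The invariance described here can be checked with a short sketch. It assumes the conventional definition of affine coordinates, in which a point q is written as q_1 + α (q_2 − q_1) + β (q_3 − q_1) for basis points (q_1, q_2, q_3); the function names and the sample affine map are hypothetical.

```python
def affine_coords(q, q1, q2, q3):
    """Solve q = q1 + alpha*(q2 - q1) + beta*(q3 - q1) for (alpha, beta)."""
    e1 = (q2[0] - q1[0], q2[1] - q1[1])
    e2 = (q3[0] - q1[0], q3[1] - q1[1])
    r = (q[0] - q1[0], q[1] - q1[1])
    det = e1[0] * e2[1] - e1[1] * e2[0]
    if det == 0:
        raise ValueError("basis points are collinear")
    # Cramer's rule on the 2x2 system r = alpha*e1 + beta*e2.
    alpha = (r[0] * e2[1] - r[1] * e2[0]) / det
    beta = (e1[0] * r[1] - e1[1] * r[0]) / det
    return alpha, beta


def apply_affine(p):
    """A sample affine map of the image plane (shear, scale and shift)."""
    x, y = p
    return (2.0 * x + 1.0 * y + 5.0, 3.0 * y - 1.0)


q1, q2, q3, q4 = (0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (0.25, 0.5)
before = affine_coords(q4, q1, q2, q3)
after = affine_coords(*(apply_affine(p) for p in (q4, q1, q2, q3)))
print(before, after)  # (0.25, 0.5) (0.25, 0.5)
```

Both calls return the same pair, which is the property that lets the coordinates be carried from one image to another.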

[0022] Next, the affine coordinates (α_i, β_i) of the remaining projected points are calculated from the given viewing direction 24 in the same way as (α_4, β_4). If p_bi is the intersection of the reference plane 20 with the ray through P_i parallel to the viewing direction, then q_i is its image on the image plane 22. Likewise, the points p_bi and q_i have affine coordinates (α_i, β_i) with respect to (P_1, P_2, P_3) and (q_1, q_2, q_3), respectively.

[0023] Using the similarity of triangle P_4 p_b4 p_4' and triangle P_i p_bi p_i', the following relation is obtained.

[0024]

(Equation 1)

[0025] Expressed in affine coordinates, this becomes the following.

[0026]

(Equation 2)

[0027] Considering only α, this can be further written as follows.

[0028]

(Equation 3)
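The three relations referred to above are not reproduced in this text. One form consistent with the surrounding definitions, offered only as a reconstruction under the assumption that d_4 and d_i are signed distances measured on the same side of the reference plane, follows from the similar right triangles with heights d_4 and d_i: the vector relation, the same relation in affine coordinates, and its α component solved for α_i.

```latex
\begin{align}
  \frac{\mathbf{p}_{b4} - \mathbf{p}_4'}{d_4}
    &= \frac{\mathbf{p}_{bi} - \mathbf{p}_i'}{d_i} \\
  \frac{(\alpha_4 - a_4,\; \beta_4 - b_4)}{d_4}
    &= \frac{(\alpha_i - a_i,\; \beta_i - b_i)}{d_i} \\
  \alpha_i
    &= \frac{d_i}{d_4}\,\alpha_4 + \left(a_i - \frac{d_i}{d_4}\,a_4\right)
\end{align}
```

The last line is a straight line in the (α_4, α_i) plane with slope d_i/d_4, which agrees with the slope stated in the text; the equation numbering of the original is not assumed here.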

[0029] In this equation, a_i, a_4, d_i, and d_4 are constants that are the same in every image, since they are determined by the given three-dimensional points. That is, over all possible images generated from (P_1, ..., P_n), the pairs (α_4, α_i) plot on a straight line whose slope is d_i/d_4, as shown in FIG. 5. This slope indicates how far the point P_i is from the reference plane. Similarly, β_4 and β_i lie on a straight line with the same slope as for α.
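The straight-line property can be checked numerically with a short sketch. It assumes, for illustration only, that the reference plane is z = 0 with P_1 = (0,0,0), P_2 = (1,0,0), P_3 = (0,1,0), so that affine coordinates on the plane are simply (x, y) and the distance to the plane is z; the points, viewing directions, and function name are hypothetical.

```python
def shadow_alpha(P, view):
    """Alpha coordinate where the ray through P along `view` meets the plane z = 0."""
    x, _, z = P
    vx, _, vz = view
    if vz == 0:
        raise ValueError("viewing direction is parallel to the reference plane")
    return x - (z / vz) * vx


P4 = (0.0, 0.0, 1.0)    # a_4 = 0,   d_4 = 1
Pi = (0.5, 0.25, 2.0)   # a_i = 0.5, d_i = 2
views = [(0.5, 0.0, -1.0), (-0.25, 0.5, -1.0), (2.0, 1.0, -2.0)]

# One (alpha_4, alpha_i) pair per parallel-projection view.
pairs = [(shadow_alpha(P4, v), shadow_alpha(Pi, v)) for v in views]

# Fit the line through the first two pairs, then check the third.
(x1, y1), (x2, y2) = pairs[0], pairs[1]
slope = (y2 - y1) / (x2 - x1)
intercept = y1 - slope * x1
print(slope, intercept)                    # 2.0 0.5
print(pairs[2], slope * pairs[2][0] + intercept)
```

The fitted slope equals d_i/d_4 = 2, and the third view falls on the same line.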

[0030] (2) Pinhole projection

Here, a standard pinhole camera coordinate system is assumed for projecting three-dimensional points onto a plane, after which an affine transformation is applied to the projected points. Referring to FIG. 6, let (P_1, P_2, P_3, ..., P_n) be points in three dimensions, and define the virtual plane (reference plane) 20 passing through P_1, P_2, P_3 as shown in FIG. 6. Let p_4' be the foot of the perpendicular dropped from P_4 to the reference plane, with affine coordinates (a_4, b_4) with respect to (P_1, P_2, P_3). The point p_i' is constructed in the same way. (α_4, β_4) are the affine coordinates, with respect to (P_1, P_2, P_3), of the point p_b4 obtained by projecting P_4 from the viewpoint position onto the reference plane 20. The optical center C of the camera lies on the extension of the straight line through p_b4 and P_4. This position is chosen arbitrarily, as is the line of sight 24. The straight line P_4 p_b4 intersects the image plane 22 at point q_4, which is the image of P_4. Similarly, the points P_1, P_2, P_3 map to points q_1, q_2, q_3 on the image plane 22. With respect to (q_1, q_2, q_3), the point q_4 has affine coordinates (α_4, β_4).

[0031] Let q_i be the projection of point P_i onto the image plane 22. As before, the points p_bi and q_i both have affine coordinates (α_i, β_i), with respect to (P_1, P_2, P_3) and (q_1, q_2, q_3), respectively. Let c' be the foot of the perpendicular dropped from the camera center C to the reference plane 20, and let d_c be the distance from c' to the camera center. Using the similarity of the two triangles C p_bi c' and P_i p_bi p_i', the following equation holds.

[0032]

(Equation 4)

[0033] Similarly, using the similarity of triangle C p_b4 c' and triangle P_4 p_b4 p_4', the following equation holds.

[0034]

(Equation 5)

[0035] From equations (6) and (7), the following equation (8) is obtained.

[0036]

(Equation 6)

[0037] Rewriting the above equation in affine coordinates gives the following.

[0038]

(Equation 7)

[0039] Taking only α gives the following.

[0040]

(Equation 8)

[0041] Here, the following equation holds.

[0042]

(Equation 9)

[0043] In equation (11), a_i, a_4, d_i, and d_4 have the same values for all images that can be generated, whereas a_c depends on the camera parameters. Thus, if a_c and a_4 are known, α_4' can easily be calculated for a given image. That is, over all images that can be generated, the pairs (α_4', α_i) plot on a straight line with slope d_i/d_4. This property is illustrated in FIG. 7. The straight line in β space has the same slope as the one in α space.

[0044] The slope of the straight line corresponding to the i-th point is directly proportional to the distance from that point to the reference plane.

[0045] [Synthesis of New View Images and Other Applications]

Generation of new view images

A new view image must finally be generated from the affine coordinates obtained in this way. Here too there are two approaches, parallel projection and pinhole projection, and the present invention adopts pinhole projection. The method using parallel projection and the method using pinhole projection, as in the present invention, are described in turn below.
[0046] (1) Parallel projection

Suppose two images I_1 and I_2 are given. Generating a new view image requires point correspondences between the two images. Accordingly, let p_i^2 denote the point in image I_2 that corresponds to the point p_i^1 in image I_1. Four reference points are needed to generate the straight lines in α and β space, so points p_1', p_2', p_3', p_4' are defined in the images. For simplicity, these were chosen so that the segments P_1P_2, P_1P_3, and P_1P_4 are at right angles to one another and |P_1P_2| = |P_1P_3| = |P_1P_4|. This structure is photographed by the two cameras before the experiment, and the coordinates of the projected points p_1', p_2', p_3', p_4' are recorded. The affine coordinates of points p_4' and p_i' in image I_1 are (α_4', β_4') and (α_i', β_i'), respectively. With superscripts denoting the image index, the straight line in α space passes through the two points obtained from the two images, (α_4^1, α_i^1) and (α_4^2, α_i^2).

[0047] Here the world coordinates of points P_1, P_2, P_3, P_4 are assumed to be (0,0,0), (1,0,0), (0,1,0), (0,0,1), respectively. These points are projected to the points p_1^v, p_2^v, p_3^v, p_4^v in the virtual image using the 3×3 perspective transformation matrix of the virtual camera. (α_4^v, β_4^v) are the affine coordinates of p_4^v with respect to (p_1^v, p_2^v, p_3^v). To reproject the point p_i^1 of image I_1 into the virtual image, its affine coordinates in the virtual image are calculated from the straight line in α space corresponding to p_i^1. If the equation of the line is α_i = κ_0 α_4 + κ_1, then α_i^v = κ_0 α_4^v + κ_1. β_i^v is calculated in the same way. The coordinates of the point p_i^v in the image are then obtained using equation (12).
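The reprojection step of the parallel-projection case can be sketched as follows. The line through the two observed pairs gives κ_0 and κ_1, which are then evaluated at the virtual view. The final conversion from affine to pixel coordinates uses the standard affine-combination formula, which is assumed here to be what equation (12) expresses, since that equation is not reproduced in this text. All function names and numbers are hypothetical.

```python
def line_through(p, q):
    """Return (k0, k1) of the line y = k0*x + k1 through points p and q."""
    (x1, y1), (x2, y2) = p, q
    if x1 == x2:
        raise ValueError("the two views give the same alpha_4; line is undefined")
    k0 = (y2 - y1) / (x2 - x1)
    return k0, y1 - k0 * x1


def reproject_coord(obs1, obs2, ref_v):
    """Predict alpha_i^v (or beta_i^v) from two observed (ref, value) pairs."""
    k0, k1 = line_through(obs1, obs2)
    return k0 * ref_v + k1


def affine_to_pixel(alpha, beta, p1, p2, p3):
    """Assumed form of equation (12): p = p1 + alpha*(p2 - p1) + beta*(p3 - p1)."""
    return (
        p1[0] + alpha * (p2[0] - p1[0]) + beta * (p3[0] - p1[0]),
        p1[1] + alpha * (p2[1] - p1[1]) + beta * (p3[1] - p1[1]),
    )


# (alpha_4, alpha_i) observed in images I_1 and I_2, and alpha_4^v in the virtual view.
alpha_v = reproject_coord((0.5, 1.5), (-0.25, 0.0), 1.0)
print(alpha_v)  # 2.5

# Place the point in the virtual image, given beta_i^v = 1.0 and the
# projected reference points p_1^v, p_2^v, p_3^v.
print(affine_to_pixel(alpha_v, 1.0, (10.0, 10.0), (20.0, 10.0), (10.0, 30.0)))  # (35.0, 30.0)
```

The same `reproject_coord` call is used for β with the β observations.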

[0048]

(Equation 10)

[0049] Note that several points in image I_1 may be mapped to the same point on the virtual screen. This can be resolved by using the slope of the straight line in affine space as the criterion for selecting which point to map. A point on a line with a large slope is far from the reference plane; that is, assuming the virtual camera is looking down onto the reference plane, such a point is closer to the virtual screen.

[0050] An image selected and reprojected in this way may contain missing pixels, because the reprojection does not guarantee that a point of image I_1 is mapped to every pixel. These problems were solved here using the computer graphics techniques of Z-buffering and texture mapping.

[0051] First, as shown in FIG. 8, image I_1 is divided into squares one pixel on a side. The x and y coordinates of P_i^1 in the figure are projected to the x and y coordinates of P_i^v. The z coordinate is set to the reciprocal of the slope of the straight line in affine space, and the texture is taken from image I_1. This process is repeated for the remaining three vertices of the square, P_{i+1}^1, P_{i+2}^1, P_{i+3}^1, and for all points on the grid. The texture is then mapped onto each polygon. This allows the newly generated image to be interpolated smoothly.
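The visibility rule described above can be sketched with a point-based Z-buffer. This is a simplification: the text maps textured one-pixel quadrilaterals, which also fills holes, whereas this sketch only resolves collisions between individual reprojected points. It assumes non-negative slopes, uses the reciprocal of the slope as depth as the text describes, and treats a slope of zero (a point on the reference plane) as infinitely deep. The function name and data are hypothetical.

```python
def splat(points, width, height):
    """Z-buffered point splat.

    points: iterable of (x, y, slope, color) in virtual-image coordinates,
            where color is any value other than None.
    Returns a height x width grid; None marks pixels that no point reached.
    """
    depth = [[float("inf")] * width for _ in range(height)]
    image = [[None] * width for _ in range(height)]
    for x, y, slope, color in points:
        col, row = int(round(x)), int(round(y))
        if not (0 <= col < width and 0 <= row < height) or slope < 0:
            continue  # outside the image, or outside the assumed slope range
        z = 1.0 / slope if slope > 0 else float("inf")
        # Keep the point with the larger slope, i.e. the smaller depth.
        if image[row][col] is None or z < depth[row][col]:
            depth[row][col] = z
            image[row][col] = color
    return image


points = [
    (1.0, 1.0, 0.5, "far"),    # small slope: close to the reference plane
    (1.2, 0.9, 2.0, "near"),   # large slope: wins the shared pixel (1, 1)
    (0.0, 0.0, 1.0, "a"),
]
image = splat(points, width=3, height=2)
print(image[1][1], image[0][0], image[0][1])  # near a None
```

The remaining `None` entries are the missing pixels that the texture-mapping step is meant to fill.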

[0052] (2) Pinhole projection

Suppose two images I_1 and I_2 are given. Generating a new view image requires point correspondences between the two images. Accordingly, let p_i^2 denote the point in image I_2 that corresponds to the point p_i^1 in image I_1. Four reference points are needed to generate the straight lines in α and β space. Here, points p_1^j, p_2^j, p_3^j, p_4^j are defined in image I_j (j = 1, 2). For simplicity, as shown in FIG. 9, they were chosen so that the segments P_1P_2, P_1P_3, and P_1P_4 are at right angles to one another and |P_1P_2| = |P_1P_3| = |P_1P_4|. This structure is photographed by the two cameras before the experiment, and the coordinates of the projected points p_1^j, p_2^j, p_3^j, p_4^j are recorded. The affine coordinates of points p_4^j and p_i^j in image I_j are (α_4^j, β_4^j) and (α_i^j, β_i^j), respectively. The straight line in α space passes through the points (α_4^1', α_i^1) and (α_4^2', α_i^2) corresponding to the two images.

【００５３】[0053] To compute α₄^j′ in the j-th image using equation (11), a_C^j and a₄ must be known. For the chosen points P₁, ..., P₄, the value of a₄ is 0. Computing a_C^j requires a fifth control point P₅. For convenience, P₅ is chosen on the extension of line segment P₁P₄ such that |P₅P₁| = k|P₄P₁| (FIG. 9). The image of point P₅ in the j-th camera is the point p₅^j, whose affine coordinates are (α₅^j, β₅^j). Since a₅ = 0, the following equation is obtained from equations (10) and (11).

【００５４】[0054]

【数１１】 [Equation 11]

【００５５】[0055] To generate a virtual image, the world coordinates of points P₁, P₂, P₃, P₄, and P₅ are assumed to be (0,0,0), (1,0,0), (0,1,0), (0,0,1), and (0,0,k), respectively. These points are projected onto the virtual image as p₁^v, ..., p₅^v using the 3×4 perspective transformation matrix of the virtual camera. The i-th point p_i^1 is reprojected by computing its affine coordinates (α_i^V, β_i^V) with respect to (p₁^v, p₂^v, p₃^v). If the equation of the straight line in the α space is α_i = κ₀α₄′ + κ₁, then equations (10) and (11) give the following.
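The two operations named in this paragraph, projecting the five control points with a 3×4 matrix and describing the α-space line by its coefficients κ₀ and κ₁, can be sketched as below. The function names, the example matrix, and the value of k are hypothetical. Equations (10) to (12) are not reproduced in this text, so the sketch covers only the projection and the line through two points, not the final expression for α_i^V.

```python
def control_points(k):
    """World coordinates assumed in the text for P1..P5."""
    return [(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1), (0, 0, k)]


def project(P, X, eps=1e-12):
    """Project world point X = (x, y, z) with a 3x4 matrix P (three rows of four)."""
    xh = (X[0], X[1], X[2], 1.0)                      # homogeneous coordinates
    u, v, w = (sum(row[i] * xh[i] for i in range(4)) for row in P)
    if abs(w) < eps:
        raise ValueError("point projects to infinity")
    return (u / w, v / w)


def line_through(q1, q2, eps=1e-12):
    """Return (kappa0, kappa1) of alpha_i = kappa0 * alpha4' + kappa1.

    q1 and q2 are the points (alpha4', alpha_i) obtained from the two images.
    """
    (x1, y1), (x2, y2) = q1, q2
    if abs(x2 - x1) < eps:
        raise ValueError("the two abscissas coincide; the line is undefined")
    kappa0 = (y2 - y1) / (x2 - x1)
    kappa1 = y1 - kappa0 * x1
    return kappa0, kappa1


if __name__ == "__main__":
    # Hypothetical virtual camera matrix, chosen only for illustration.
    P_virtual = [[500.0, 0.0, 100.0, 0.0],
                 [0.0, 500.0, 50.0, 0.0],
                 [0.0, 0.0, 1.0, 5.0]]
    for X in control_points(k=2.0):
        print(X, "->", project(P_virtual, X))
```

The projected points p₁^v, p₂^v, and p₃^v then serve as the reference points for the affine coordinates in the virtual image.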

【００５６】[0056]

【数１２】 [Equation 12]

【００５７】[0057] Here, α₄^v and a_C^v are, respectively, the affine coordinate of point p₄^v and the image of the center of the virtual camera. β_i^V is computed in the same way. Every point in the image can be projected onto the virtual screen by this method. The problems of missing pixels and of Z values in the new image are resolved by the method described for the parallel projection case.

【００５８】[0058] [Configuration of the apparatus] FIG. 10 shows the configuration of an apparatus for carrying out the present invention. Referring to FIG. 10, the apparatus includes a computer 30 having RS232C terminals 40 and 42 and a video output terminal 44; digital cameras 32 and 34, connected to computer 30 at RS232C terminals 40 and 42, respectively, for photographing a scene or object 38; and a monitor 36 connected to computer 30 at video output terminal 44. Computer 30 has a well-known configuration and functions as an arbitrary viewpoint image generation device by executing a program for arbitrary viewpoint image generation. Although two digital cameras are used in this example, any camera capable of capturing stereo images may be used, and a single camera may instead capture stereo images by moving its viewpoint. Likewise, although digital cameras are used in this example, ordinary video cameras may be used. In that case, computer 30 must have a video input terminal and be able to digitize video images.

【００５９】[0059] FIG. 11 shows, in block diagram form, the flow of processing of the arbitrary viewpoint image generation program executed by computer 30. Referring to FIG. 11, image set A is first photographed to capture the control points (50). The reference vector of the affine coordinates is created from image set A (58). Next, image set B is photographed as a shot of the scene. Weak calibration of the cameras is performed using image set B (60).

【００６０】[0060] The scene is then photographed for arbitrary viewpoint image generation. The resulting image set is referred to as image set C. Image set B may be identical to image set C. For image set C, correspondences between the images are obtained by stereo matching using the result of process 60 (62). The images for which correspondences have been obtained are then converted to affine coordinates using the affine coordinate reference vector obtained in process 58 (64).

【００６１】[0061] Virtual camera parameters 56, which define the optical center position of the virtual camera and the like, are stored in advance in a memory or similar storage, and the affine coordinates at the virtual camera position are computed from the affine coordinates obtained in process 64 (66). The affine coordinates at the virtual camera position obtained in this way are converted to a camera view (68), generating an image whose viewpoint is the virtual camera position. By changing virtual camera parameters 56 arbitrarily, an image from an arbitrary position can be generated in real time.

【００６２】[0062] [Experimental comparison of performance with respect to error] The effect of the arbitrary viewpoint image generation device according to the present invention is shown below. First, it is verified experimentally that the reprojection algorithm based on affine coordinates is more stable than the three-dimensional reconstruction method. To this end, a scene (stereo images) containing a box with a checkered pattern, as shown in FIG. 12, was considered, and six points on the box were selected for camera calibration. Point correspondences were established manually, the three-dimensional coordinates of the points were computed, and the points were projected onto a new image. The camera matrix of the new image was chosen arbitrarily. Assuming that the measurement error in the x coordinate of each of the six reference points is uniformly distributed between −1 and 1, the distribution of the coordinates of the reprojected points was simulated.
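A sensitivity simulation of this kind can be sketched as follows. This is a simplified stand-in, not the experiment of the source: it perturbs the x coordinates of three reference points with noise drawn uniformly from [−1, 1] and transfers a query point to a second image through its planar affine coordinates, which is a swapped-in technique that ignores depth. All names and point coordinates are hypothetical.

```python
import random
import statistics


def affine_coords(p, p1, p2, p3):
    """Return (alpha, beta) with p = p1 + alpha*(p2 - p1) + beta*(p3 - p1)."""
    ux, uy = p2[0] - p1[0], p2[1] - p1[1]
    vx, vy = p3[0] - p1[0], p3[1] - p1[1]
    dx, dy = p[0] - p1[0], p[1] - p1[1]
    det = ux * vy - uy * vx
    return (dx * vy - dy * vx) / det, (ux * dy - uy * dx) / det


def transfer(p, src, dst):
    """Re-express p, measured against src references, against dst references."""
    a, b = affine_coords(p, *src)
    t1, t2, t3 = dst
    return (t1[0] + a * (t2[0] - t1[0]) + b * (t3[0] - t1[0]),
            t1[1] + a * (t2[1] - t1[1]) + b * (t3[1] - t1[1]))


def simulate(p, src, dst, trials=2000, seed=0):
    """Return ((mean_x, std_x), (mean_y, std_y)) of the transferred point."""
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(trials):
        # Uniform error in [-1, 1] on the x coordinate of each reference point.
        noisy = [(x + rng.uniform(-1.0, 1.0), y) for (x, y) in src]
        qx, qy = transfer(p, noisy, dst)
        xs.append(qx)
        ys.append(qy)
    return ((statistics.mean(xs), statistics.pstdev(xs)),
            (statistics.mean(ys), statistics.pstdev(ys)))


if __name__ == "__main__":
    src = [(100.0, 100.0), (300.0, 100.0), (100.0, 300.0)]
    dst = [(120.0, 90.0), (310.0, 110.0), (130.0, 280.0)]
    print(simulate((200.0, 200.0), src, dst))
```

The collected samples could be binned into histograms of the x and y coordinates, which is the form in which the source reports its results.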

【００６３】[0063] Five reference points were selected for the reprojection algorithm based on affine coordinates, the same properties were assumed for the measurement error, and the effect on the coordinates of the reprojected points was examined. Three different points were chosen from an arbitrary image pair, and the results of the three-dimensional reconstruction method and the method based on affine coordinates are compared. For these three points (i), (ii), and (iii), FIGS. 13 to 18 show histograms of the x and y coordinates plotted for the three-dimensional reconstruction method (a) and for affine reprojection (b). FIGS. 13, 15, and 17 are for the x coordinate, and FIGS. 14, 16, and 18 are for the y coordinate. FIGS. 13 and 14 are for point (i), FIGS. 15 and 16 for point (ii), and FIGS. 17 and 18 for point (iii). In the figures, results from the three-dimensional reconstruction method are labeled "a", and results from the method of the present invention based on affine coordinate transformation are labeled "b".

【００６４】[0064] The histograms of the x coordinate for the reprojection method based on affine coordinates have a clearly sharp peak for all three points, and the variance is much smaller than for the three-dimensional reconstruction method. For the y coordinate, on the other hand, the three-dimensional reconstruction method shows a sharper peak for points (i) and (ii). Even in this case, however, inspection of the histograms shows that the variance is larger for the three-dimensional reconstruction method than for the method based on affine coordinates. Because there is no ground truth for the exact x and y coordinates of the projected points, how to evaluate the accuracy of the two methods is an open question. If accuracy is judged by the height of the peak and the spread of the corresponding histogram, however, the direct reprojection method of the present invention gives more stable results than the method based on three-dimensional reconstruction and can be said to be more accurate.

【００６５】[0065] [Application] To verify the scene reprojection method using affine coordinates, three images of a scene were taken with digital cameras connected to a computer configured as shown in FIG. 10. The images are shown in FIGS. 19 to 22 (images (c) and (d) are the same). In the description so far the third image has always been a virtual image, but the method of the present invention can also be used when the third image is a real image.

【００６６】[0066] Referring to FIG. 19, five corresponding points in the three images were selected from the checkered block in the images and marked with "x". In addition, seven corresponding points were selected from the first two images and are shown as black dots in FIGS. 19 and 20. FIGS. 21 and 22 show how these points are projected onto the third image by the pinhole model and by the parallel projection model, respectively. The pinhole model of FIG. 21 shows a smaller deviation of the "x" points than the parallel projection model of FIG. 22, indicating a better reprojection result.

【００６７】[0067] Next, a new view image is synthesized from two stereo images such as those shown in FIG. 23. The generated new view images are shown in FIGS. 24 and 25. FIG. 24 shows an image generated from a virtual camera located further to the left than the left real camera. FIG. 25 shows an image generated from a virtual camera located far to the upper left. In examples like these in particular, point correspondence errors are reflected in the reprojection result more visibly than in the earlier example. This is mainly due to stereo matching errors caused by the poor quality of the input images, and it can be improved by raising the quality of the input images. This new view image generation algorithm can be executed on a computer in nearly real time.

【００６８】[0068] [Summary] As described above, the present invention realizes a method for new view image generation that uses a unified framework based on the properties of affine coordinates in stereo images. A characteristic of the method is that the computation for scene reprojection is very simple, as shown in equation (14). New view images can therefore be synthesized in nearly real time on a graphics computer. In addition, compared with standard approaches, the method can be carried out with as few as five corresponding points, and it can minimize the effect of stereo matching errors on the reprojected image.

【００６９】[0069] It should also be possible to apply this method to dynamic scenes. The main difficulty in that case is stereo matching at frame rate. Applying the present invention to dynamic scenes requires combining it with a method more efficient than standard stereo matching algorithms.

【００７０】[0070] [Comparison of this method with other direct methods] As already described, the epipolar line intersection method, one of the reprojection methods, requires either strong camera calibration or eight corresponding points in three images (two real images and one new image). Strong camera calibration parameters are difficult to estimate accurately, which leads to errors in reprojection. Moreover, measurement errors in the coordinates of eight reference points in three images lead to larger coordinate errors in reprojection than the method based on affine coordinates, which needs five reference points. Furthermore, its scene reprojection produces only a "sparse" image. In a method based on affine coordinates such as the present invention, the slope of the straight line in the affine space can be used as an indicator of the distance from the corresponding plane, which allows "hole filling". In the epipolar line intersection algorithm, the depth corresponding to a reprojected point (its z value with respect to the image plane) is not computed explicitly, so the algorithm is difficult to extend to new view generation.

【００７１】[0071] As already described, morphing methods avoid the troublesome stereo matching problem but can be used only when the virtual camera lies on the straight line connecting the camera centers of the two real cameras. In contrast, the method of the present invention must first solve the point correspondence problem, but it has the advantage that the virtual camera may be located anywhere in space.

[Brief description of the drawings]

【図１】FIG. 1 is a diagram for explaining strong camera calibration.

【図２】FIG. 2 is a diagram for explaining the principle of the three-dimensional reconstruction method.

【図３】FIG. 3 is a diagram for explaining an epipolar line.

【図４】FIG. 4 is a diagram for explaining the principle of affine transformation in the case of parallel projection.

【図５】FIG. 5 is a diagram showing the relationship between α₄ and α_i in the case of parallel projection.

【図６】FIG. 6 is a diagram for explaining affine coordinates in a pinhole projection system.

【図７】FIG. 7 is a diagram showing the relationship between α₄′ and α_i in the case of pinhole projection.

【図８】FIG. 8 is a diagram for explaining the principle of the processing for preventing missing pixels.

【図９】FIG. 9 is a diagram showing the arrangement of the five reference points in the present invention.

【図１０】FIG. 10 is a diagram showing the configuration of an arbitrary viewpoint image generation device according to an embodiment of the present invention.

【図１１】FIG. 11 is a diagram showing, in block diagram form, the flow of processing performed in the arbitrary viewpoint image generation device according to the embodiment of the present invention.

【図１２】FIG. 12 is a diagram showing the stereo images used in an experiment for verifying the effect of the device according to the embodiment of the present invention.

【図１３】FIG. 13 is a diagram showing, as histograms of the x coordinate, the difference in effect between the three-dimensional reconstruction method and the method of the present invention for the first point.

【図１４】FIG. 14 is a diagram showing, as histograms of the y coordinate, the difference in effect between the three-dimensional reconstruction method and the method of the present invention for the first point.

【図１５】FIG. 15 is a diagram showing, as histograms of the x coordinate, the difference in effect between the three-dimensional reconstruction method and the method of the present invention for the second point.

【図１６】FIG. 16 is a diagram showing, as histograms of the y coordinate, the difference in effect between the three-dimensional reconstruction method and the method of the present invention for the second point.

【図１７】FIG. 17 is a diagram showing, as histograms of the x coordinate, the difference in effect between the three-dimensional reconstruction method and the method of the present invention for the third point.

【図１８】FIG. 18 is a diagram showing, as histograms of the y coordinate, the difference in effect between the three-dimensional reconstruction method and the method of the present invention for the third point.

【図１９】FIG. 19 is a diagram showing an image used in an experiment for verifying the effect of scene reprojection according to the present invention.

【図２０】FIG. 20 is a diagram showing another image used in the experiment for verifying the effect of scene reprojection according to the present invention.

【図２１】FIG. 21 is a diagram showing yet another image used in the experiment for verifying the effect of scene reprojection according to the present invention, together with the projection result of the pinhole model.

【図２２】FIG. 22 is a diagram showing yet another image used in the experiment for verifying the effect of scene reprojection according to the present invention, together with the projection result of the parallel projection model.

【図２３】FIG. 23 is a diagram showing images used in another experiment for verifying the effect of scene reprojection according to the present invention.

【図２４】FIG. 24 is a diagram showing the projection result of the pinhole model in the experiment for verifying the effect of scene reprojection according to the present invention.

【図２５】FIG. 25 is a diagram showing the projection result of the pinhole model from a camera position far to the upper left in the experiment for verifying the effect of scene reprojection according to the present invention.

[Explanation of symbols]

20: reference plane; 22: image plane; 24: viewing direction; 30: computer; 32, 34: digital cameras; 36: monitor

Continuation of front page. (72) Inventor: Tatsuki Sakaguchi, c/o ATR Chino Eizo Tsushin Kenkyusho KK, 5 Koaza Sanpeidani, Oaza Inuidani, Seika-cho, Soraku-gun, Kyoto. (72) Inventor: Atsushi Otani, c/o ATR Chino Eizo Tsushin Kenkyusho KK, 5 Koaza Sanpeidani, Oaza Inuidani, Seika-cho, Soraku-gun, Kyoto.

Claims

[Claims]

An arbitrary viewpoint image generation device comprising: image acquisition means for acquiring stereo images; means for creating a reference vector in a predetermined affine coordinate system based on stereo images acquired by the image acquisition means; means for weakly calibrating the relative positions of the viewpoints at the time of image acquisition based on stereo images acquired by the image acquisition means; means for establishing correspondences between points in stereo images acquired by the image acquisition means, based on the result of the weak calibration; first conversion means for converting the coordinates of each point of the stereo images into coordinates in the predetermined affine coordinate system, based on the output of the means for establishing correspondences and on the reference vector; second conversion means for converting, using a pinhole projection method, the coordinates in the affine coordinate system obtained by the first conversion means into affine coordinates at a virtual viewpoint position, based on virtual viewpoint position data; and means for generating an image at the virtual viewpoint position from the output of the second conversion means.