JP2005078554A

JP2005078554A - Restoration method and device for fish-eye camera motion and three-dimensional information, and recording medium with program for implementing it recorded

Info

Publication number: JP2005078554A
Application number: JP2003311013A
Authority: JP
Inventors: Isao Miyagawa; 勲宮川; Yuji Ishikawa; 裕治石川; Yoshiori Wakabayashi; 佳織若林; Kenichi Arakawa; 賢一荒川
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2003-09-03
Filing date: 2003-09-03
Publication date: 2005-03-24

Abstract

<P>PROBLEM TO BE SOLVED: To simultaneously restore motion related to a camera view point and three-dimensional information of the outside world in a manner robust against noise; and to reduce a calculation cost. <P>SOLUTION: This restoration device comprises: a time-series database 1 with omnidirectional images stored; a characteristic point measurement part 2 for measuring a coordinate value of a characteristic point in each image; a measurement matrix conversion part 3 for converting the image coordinate value obtained by the measurement part to a uv image; a measurement matrix input part 4 for providing, as an initial value, a matrix including a uv coordinate value as a matrix element; and a factorization method processing part 5 for restoring motion of a camera and three-dimensional information of the outside world by a factorization method from the input data. The processing part 5 comprises: a three-dimensional information restoration part 5A by means of planar motion; a Z-axis motion restoration part 5B; and a rotary motion restoration part 5C. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、魚眼レンズを装着したカメラ（魚眼カメラ）を使って取得した車載画像または室内画像、船上からの海上画像、空撮などの時系列画像全般、さらに、魚眼カメラまたは魚眼レンズを装着したカメラを用いて手動撮影して取得した時系列画像から、魚眼カメラの視点に関するロール、ピッチ、ヨー回転から構成される三軸（ＸＹＺ）周りの回転運動、三軸（ＸＹＺ）方向の並進運動、並びに、時系列画像に写っている外界の３次元形状、すなわち、被写体（物体）の外観形状を構成する３次元情報を復元する方法および装置に関する。 The present invention is a vehicle-mounted image or indoor image acquired using a camera equipped with a fisheye lens (fisheye camera), a marine image from a ship, general time-series images such as aerial photography, and a fisheye camera or a fisheye lens. From the time-series images acquired by manual shooting with the camera, the rotational movement around the three axes (XYZ) composed of the roll, pitch, and yaw rotation related to the viewpoint of the fisheye camera, the translational movement in the three axes (XYZ) direction In addition, the present invention relates to a method and apparatus for restoring the three-dimensional shape of the external world reflected in a time-series image, that is, the three-dimensional information constituting the appearance shape of a subject (object).

コンピュータビジョン分野では、時系列画像データから、対象物の形状を計測または獲得する手法には、ステレオ計測やエピポーラ面解析を用いた３次元解析手法がある。この手法によれば、物体が撮影されている複数の時系列画像から、空間形状または空間構造に関する３次元位置情報、並びに、カメラ視点に関する運動を復元することができる。しかし、移動手段などを利用して撮影カメラを動かしながら撮影した時系列映像においては、撮影時の環境、撮影カメラの微小な動きによりシームレスに映像取得が困難であり、時系列映像中にランダム性の雑音が混入し、カメラ運動や物体形状を正確に復元することが困難な場合がある。 In the computer vision field, methods for measuring or acquiring the shape of an object from time-series image data include three-dimensional analysis methods using stereo measurement and epipolar plane analysis. According to this method, it is possible to restore the three-dimensional position information related to the spatial shape or the spatial structure and the motion related to the camera viewpoint from a plurality of time-series images in which the object is photographed. However, it is difficult to seamlessly acquire images in time-series images taken while moving the camera using moving means, etc., due to the shooting environment and the minute movement of the camera. In some cases, it is difficult to accurately restore camera motion and object shape.

このような問題に対して、画像面を平面とした直交座標系で表現できる画像座標値から、ユークリッド空間でのカメラ運動と物体形状を同時に、かつ、ロバストに復元する手法がある。例えば、取得する画像において特徴点をつけ、この特徴点を時系列に追跡して得た画像座標値から、カメラ運動と３次元情報を復元する手法がある（例えば、非特許文献１、２参照）。 In order to solve such a problem, there is a method of simultaneously and robustly restoring the camera motion and the object shape in the Euclidean space from image coordinate values that can be expressed by an orthogonal coordinate system with the image plane as a plane. For example, there is a method of restoring camera motion and three-dimensional information from image coordinate values obtained by attaching feature points to an acquired image and tracking the feature points in time series (for example, see Non-Patent Documents 1 and 2). ).

これら文献（以下、文献［１］、［２］と呼ぶ）の方法では、式（Ａ１）に示すように、画像上において取得した時系列のｘｙ画像座標値（式（Ａ１）左辺）から、カメラ運動に関する行列（右辺の左側の行列、時系列に変換する行列要素）と３次元情報（Ｘ，Ｙ，Ｚ）に関する行列（右辺の右側の行列、時系列に不変的な行列要素）に分解する手法であり、この分解には特異値分解なる数学的手法が使われている。式（Ａ１）左辺は、文献［１］、［２］では、計測行列と呼ばれており、行方向は特徴点の数を、列方向にフレーム数を表す。（ｘ_ij，ｙ_ij）は第ｉフレームの第ｊ番目の特徴点の画像座標値となっている。 In the methods of these documents (hereinafter referred to as documents [1] and [2]), as shown in equation (A1), from time-series xy image coordinate values (left side of equation (A1)) acquired on the image, Decomposed into a matrix related to camera motion (matrix on the left side of the right side, matrix element to be converted to time series) and a matrix related to three-dimensional information (X, Y, Z) (right side matrix on the right side, matrix element invariant to time series) A mathematical method called singular value decomposition is used for this decomposition. The left side of the formula (A1) is called a measurement matrix in documents [1] and [2], the row direction represents the number of feature points, and the column direction represents the number of frames. (X _ij , y _ij ) is the image coordinate value of the j-th feature point in the i-th frame.

上記の文献［１］、［２］による方法では、式（Ａ１）左辺は画像から得られる２次元情報（あらかじめ各フレームの重心座標値を引いた２次元座標値としている）だけになっており、式（Ａ１）右辺の行列により行列演算する（線形演算する）ことで左辺の画像座標値が得られるという投影モデルに基づいている。 In the method according to the above documents [1] and [2], the left side of the formula (A1) is only two-dimensional information obtained from the image (a two-dimensional coordinate value obtained by subtracting the barycentric coordinate value of each frame in advance). Formula (A1) is based on a projection model in which a matrix calculation (linear calculation) is performed using a matrix on the right side to obtain an image coordinate value on the left side.

上記の方法には欠点もある。つまり、式（Ａ１）のように左辺で得られる画像座標値が右辺で表されるという形式、すなわち、投影モデルとはならない場合は、文献［１］、［２］を応用してカメラ運動と３次元情報を同時に復元することは難しい。さらに、有限画角のカメラを使用する場合、カメラの微小な回転運動を復元することが困難であり、文献［１］、［２］を応用しても並進と回転運動の判別が暖味となり、魚眼カメラの正確なカメラ運動と正確な３次元情報の復元ができない場合もある。 The above method also has drawbacks. In other words, when the image coordinate value obtained on the left side is represented by the right side as in the formula (A1), that is, if it is not a projection model, the camera motion can be applied by applying the literatures [1] and [2]. It is difficult to restore 3D information at the same time. Furthermore, when a camera with a finite field angle is used, it is difficult to restore the minute rotational motion of the camera, and the translation and rotational motion discrimination becomes warm even when the literatures [1] and [2] are applied. In some cases, accurate camera movement of the fisheye camera and accurate three-dimensional information cannot be restored.

一方、有限画角の問題を回避するため、一度に３６０度の景観を取得する手段として、全方位カメラによる撮像が考えられる。全方位カメラは、そのカメラ光軸を鉛直上向きに設置して利用することが多く、全方位カメラ視点の姿勢推定や外界に関する空間計測には適する。また、全方位カメラの利用により、その微小な回転運動の復元も可能であり、並進運動と回転運動を双方とも正確に復元する点で優れている（例えば、非特許文献３、特許文献４を参照）。 On the other hand, in order to avoid the problem of a finite field angle, imaging with an omnidirectional camera is conceivable as a means for acquiring a landscape of 360 degrees at a time. An omnidirectional camera is often used with its camera optical axis set vertically upward, and is suitable for posture estimation of an omnidirectional camera viewpoint and spatial measurement related to the outside world. Further, by using an omnidirectional camera, it is possible to restore the minute rotational motion, which is excellent in that both the translational motion and the rotational motion are accurately restored (for example, Non-Patent Document 3 and Patent Document 4). reference).

全方位カメラにはミラーを双曲面に変形した反射光学系、または、錘体状に設計した反射光学系を利用したカメラが多い。これに対して、魚眼カメラは魚眼レンズ単体で全方位の画像を取得することができ、光学投影の関係も線形的であるという特長がある。このため、上記の非特許文献３、特許文献４（以下、文献［３］、［４］と呼ぶ）による復元方法ではカメラ運動と３次元情報の復元において、魚眼カメラの利用は計算コストを低減させる点で優れている。
C.Tomasi&T.Kanade;“Shapeand Motion from Image Streams Under Orthography:A Factorization Method"、International Journal of Computer Vision,Vol.9,No.2.1992 C.J.Poelman & T.Kanade;“A Paraperspective Factorization Method for Shape and Motion Recovery"、IEEE Trans.Pattern Anal.& Mach.Intell,Vol.19,No.3,pp.206-218.1997 宮川、若林、有川；“全方位投影型因子分解法による全方位画像からの運動と形状復元”、パターン認識・メディア理解（ＰＲＭＵ）研究会、June,2002 宮川、小澤、若林、有川：“全方位カメラ視点運動並びに物体形状復元方法、装置、全方位カメラ視点運動並びに物体形状復元方法プログラム、及び、該プログラムを記録した記録媒体”、特願2002-71623 Many omnidirectional cameras use a reflection optical system in which a mirror is deformed into a hyperboloid, or a reflection optical system designed in the shape of a spindle. On the other hand, the fish-eye camera has a feature that the fish-eye lens can acquire an omnidirectional image and the optical projection relationship is linear. For this reason, in the restoration method according to Non-Patent Document 3 and Patent Document 4 (hereinafter referred to as documents [3] and [4]), the use of a fisheye camera reduces the computational cost in restoring the camera motion and three-dimensional information. Excellent in terms of reduction.
C. Tomasi & T. Kanade; “Shapeand Motion from Image Streams Under Orthography: A Factorization Method”, International Journal of Computer Vision, Vol. 9, No. 2.1992 CJPoelman & T. Kanade; “A Paraperspective Factorization Method for Shape and Motion Recovery”, IEEE Trans.Pattern Anal. & Mach.Intell, Vol.19, No.3, pp.206-218.1997 Miyakawa, Wakabayashi, Arikawa; “Motion and shape restoration from omnidirectional images by omnidirectional projection factorization”, Pattern Recognition and Media Understanding (PRMU) Study Group, June, 2002 Miyagawa, Ozawa, Wakabayashi, Arikawa: “Omnidirectional camera viewpoint movement and object shape restoration method and apparatus, omnidirectional camera viewpoint movement and object shape restoration method program, and recording medium on which the program is recorded”, Japanese Patent Application 2002-71623

移動手段を使って、全方位カメラのような一度に３６０度の景観を映像化する撮影環境で取得した時系列画像において、ステレオ視の原理を応用した計測方法により、外界の物体の空間情報を獲得、復元できる。しかし、様々な撮影環境の中での移動撮影においては、カメラが微小に動くため、容易にシームレスな時系列画像を取得できず、ランダム性雑音の影響も大きく、常に、安定的に、カメラの動きと物体の形状を、同時に、かつ、高精度に復元することは困難である。 In a time-series image acquired in a shooting environment that visualizes a 360-degree landscape at a time, such as an omnidirectional camera, using moving means, the spatial information of objects in the outside world can be obtained by a measurement method that applies the principle of stereo vision. Can be acquired and restored. However, in moving shooting in various shooting environments, the camera moves minutely, so seamless time-series images cannot be acquired easily, and the influence of random noise is large. It is difficult to restore the movement and the shape of the object at the same time and with high accuracy.

例えば、全方位カメラ（魚眼カメラ）で取得した時系列画像において、文献［３］［４］の手法を適用した場合、図１０に示すように、カメラの光軸回転（ヨー回転）以外にロール回転、ピッチ回転、さらには、Ｚ軸方向の並進運動が加わることで、正確な運動と３次元形状の復元は困難となる。 For example, in the case of applying the technique of [3] and [4] to a time-series image acquired by an omnidirectional camera (fisheye camera), as shown in FIG. 10, other than the optical axis rotation (yaw rotation) of the camera. By adding roll rotation, pitch rotation, and translational motion in the Z-axis direction, accurate motion and three-dimensional shape recovery become difficult.

つまり、図１０に示すように、カメラは光軸を鉛直上向きに設置した姿勢で用いることが多く、このため、ヨー回転とＸＹ並進運動からなる自由度３の平面運動を復元することに主眼が置かれていた。 In other words, as shown in FIG. 10, the camera is often used in a posture in which the optical axis is set vertically upward. Therefore, the main purpose is to restore the planar motion with 3 degrees of freedom composed of yaw rotation and XY translational motion. It was placed.

しかし、実環境においては、このような平面運動以外に、ロール回転、ピッチ回転、Ｚ軸並進運動を含む場合、カメラによる簡便な計算が使えず、カメラ運動と３次元形状を復元するための計算が複雑になり、上記のような全方位カメラに関する微小なカメラ姿勢変化を雑音と混在してしまい、正確に、かつ、ロバストに復元できない。これを魚眼カメラの投影モデルを例にして説明する。 However, in a real environment, when roll rotation, pitch rotation, and Z-axis translational motion are included in addition to such planar motion, simple calculation by the camera cannot be used, and calculation for restoring the camera motion and the three-dimensional shape. Becomes complicated, and minute camera posture changes related to the omnidirectional camera as described above are mixed with noise, and cannot be accurately and robustly restored. This will be described using a projection model of a fisheye camera as an example.

魚眼レンズを装着したカメラを使って外界を撮像し、その得られた画像系列からカメラ視点の運動と３次元情報を復元しようとするとき、魚眼カメラの視点に関する運動と対象物の３次元座標値は、図１０の状況において、式（Ａ２）で関係付けられる。 When taking a picture of the outside world using a camera equipped with a fisheye lens and trying to restore the camera viewpoint movement and 3D information from the obtained image sequence, the fisheye camera movement and 3D coordinate values of the object Are related by equation (A2) in the situation of FIG.

式（Ａ２）の（ｘ'_ij，ｙ'_ij，ｚ'_ij）は視点を原点とした単位球面座標値である。魚眼カメラの画像上へ投影されて得られる対象物（特徴点）の画像座標値として、式（Ｂ１）に示す（ｘ_ij，ｙ_ij）が観測できる。ｆは魚眼レンズの焦点距離であり、ＤはＤ＝πｆ／２Ｒで決まる定数（Ｒはイメージサークルの半径）、（Ｃｘ，Ｃｙ）は投影中心Ｃの画像座標値である。 (X ′ _ij , y ′ _ij , z ′ _ij ) in Expression (A2) is a unit spherical coordinate value with the viewpoint as the origin. (X _ij , y _ij ) shown in Expression (B1) can be observed as the image coordinate values of the object (feature point) obtained by being projected onto the image of the fisheye camera. f is the focal length of the fisheye lens, D is a constant determined by D = πf / 2R (R is the radius of the image circle), and (Cx, Cy) is the image coordinate value of the projection center C.

式（Ａ２）の左辺には、画像座標値から直接得られないλ_ijがあるが、時系列に依存する成分（サフィックスがｉのもの）と時系列に依存しない成分（サフィックスがｊのもの）に分離できないため、左辺においてある。右辺は４つの行列からなっているが、３つの回転行列を展開して１つの行列にして、式（Ａ１）右辺のカメラ運動に対応する行列とすることもできるが、式（Ａ２）そのものから分かるように、式（Ａ１）のように左辺だけに画像座標値から得た情報、右辺にはカメラ運動に対応する情報（サフィックスがｉのもの）と３次元情報に対応する情報（サフィックスがｊのもの）から、式（Ａ１）に示すような線形行列計算を行い、左辺の画像座標値を構成することは、投影モデルが複雑すぎて容易ではない。言い換えれば、式（Ａ２）に前記の文献［１］、［２］、［３］、［４］を応用してカメラ運動と３次元情報を復元することはできない。 On the left side of the formula (A2), there is λ _ij that cannot be obtained directly from the image coordinate values, but a component that depends on the time series (suffix is i) and a component that does not depend on the time series (suffix is j) Since it cannot be separated, it is on the left side. Although the right side is composed of four matrices, the three rotation matrices can be expanded into one matrix, which can be a matrix corresponding to the camera movement on the right side of Equation (A1), but from Equation (A2) itself As can be seen, the information obtained from the image coordinate values only on the left side as in equation (A1), the information on the right side corresponding to camera motion (suffix i), and the information corresponding to three-dimensional information (suffix j It is not easy to perform the linear matrix calculation as shown in the equation (A1) to construct the image coordinate value on the left side because the projection model is too complex. In other words, it is impossible to restore the camera motion and the three-dimensional information by applying the above-mentioned documents [1], [2], [3], and [4] to the formula (A2).

さらに、これまでの監視カメラや画像計測などのアプリケーションでは、全方位カメラの利用形態が限られていた。通常、全方位カメラ（魚眼カメラ）は、図１０のように光軸を鉛直上向きの姿勢で使われることが多い。この図では、光軸とほぼ平行となる線分として、屋外では建物の高さ方向の垂直線分（室内では床から天井までの垂直線分）を画像中から抽出する（図１１参照）。この線分は、ロール回転、ピッチ回転がある程度の角度範囲の限定の下で、ある１点（交点Ａ）で交わるという性質を利用して、ロール回転・ピッチ回転に比例した座標値（Ｄｘ，Ｄｙ）を得ることができる。 Furthermore, the application form of an omnidirectional camera has been limited in conventional applications such as monitoring cameras and image measurement. In general, an omnidirectional camera (fisheye camera) is often used in a posture in which the optical axis is vertically upward as shown in FIG. In this figure, as a line segment that is substantially parallel to the optical axis, a vertical line segment in the height direction of the building outdoors (a vertical line segment from the floor to the ceiling in the room) is extracted from the image (see FIG. 11). This line segment has a coordinate value (Dx, Rx) proportional to roll rotation / pitch rotation by utilizing the property that roll rotation and pitch rotation intersect at a certain point (intersection point A) with a certain degree of angular range limitation. Dy) can be obtained.

しかし、屋外において魚眼カメラを使う場合、図１２に示すように、街路樹などの遮蔽物が映り込む場合がある。この図では、交点を求めるための線分を全画像において検出することは難しく、また、線分抽出において雑音が付加されると、交点の検出にも影響する。例えば、屋外では街路樹や信号機などがカメラの視点を遮るため、求める交点に寄与する直線だけを自動的に抽出することは難しい。特に市街地での撮影では、このような遮蔽物を無視することはできない。 However, when a fish-eye camera is used outdoors, as shown in FIG. In this figure, it is difficult to detect the line segment for obtaining the intersection point in the entire image, and if noise is added in the line segment extraction, the detection of the intersection point is also affected. For example, since roadside trees, traffic lights, etc. block the camera's viewpoint outdoors, it is difficult to automatically extract only the straight line that contributes to the desired intersection. In particular, when shooting in urban areas, such a shield cannot be ignored.

このように、本発明が解決しようとする課題は、式（Ａ２）に示す投影モデルにおいて文献［３］、［４］を利用して、カメラ運動と３次元情報を同時に、かつ、ロバストに復元することと、図１１のようにロール回転とピッチ回転を反映した交点Ａを直線検出などの画像処理手法で求めず、画像中に設定した特徴点（遮蔽物についた特徴点も含む）の時間的動きだけから、ロール回転、ピッチ回転を求めるとともに、カメラ視点に関する運動と外界の３次元情報を同時に、かつ、雑音にロバストに復元することである。 As described above, the problem to be solved by the present invention is to restore the camera motion and the three-dimensional information at the same time and robustly using the documents [3] and [4] in the projection model shown in the formula (A2). 11 and the intersection point A reflecting the roll rotation and the pitch rotation as shown in FIG. 11 is not obtained by an image processing method such as straight line detection, and the time of the feature points set in the image (including the feature points on the shield) It is to obtain a roll rotation and a pitch rotation from only the target movement, and to simultaneously restore the motion related to the camera viewpoint and the three-dimensional information of the outside world and robust to noise.

また、計算アルゴリズムの点からも、時系列画像から運動と３次元情報を復元するには計算コストが高いという問題を本発明は解決することにある。 Also, from the viewpoint of calculation algorithm, the present invention is to solve the problem of high calculation cost for restoring motion and three-dimensional information from a time-series image.

本発明は、文献［１］、［２］、［３］、［４］を応用することを前提に、式（Ａ２）においてカメラ運動に対して制限をつけ、その条件付きのカメラ運動で撮像した投影モデルを前提とする。すなわち、魚眼レンズを装着したカメラにおいて、式（Ａ２）においてｉ＝１，２，…，Ｆ；ｊ＝１，２，…，Ｐに行列展開した投影モデルにおいて、ロール回転、ピッチ回転が微小なとき、すなわち、ｃｏｓψ≒１、ｓｉｎψ≒ψ、ｃｏｓω≒１、ｓｉｎω≒ωとして、式（Ａ３）に変形すると、 In the present invention, on the premise that the documents [1], [2], [3], and [4] are applied, the camera motion is limited in the equation (A2), and imaging is performed with the conditional camera motion. The projected model is assumed. That is, in a camera equipped with a fisheye lens, when roll rotation and pitch rotation are very small in a projection model in which matrix expansion is performed in formula (A2) with i = 1, 2,... That is, when cos ψ≈1, sin ψ≈ψ, cos ω≈1, sin ω≈ω,

となる。式（Ａ３）での（ｕ_ij，ｖ_ij）は本発明で定義しているｕｖ座標値であり、ｕ_ij＝ｘ_ij／ｚ_ij、ｖ_ij＝ｙ_ij／ｚ_ijである。この投影モデルでは、式（Ａ３）右辺を見るとわかるように、右辺左側の行列は時系列に依存する情報（カメラの運動に関する情報）であり、右辺右側の行列は時系列に依存しない不変な情報（外界の３次元情報）となっている。つまり、式（Ａ３）左辺は画像から得られる情報以外に、カメラ運動と３次元情報に関する情報がかかわっており、上記のカメラ運動を制限した場合においても、投影モデルは煩雑すぎて文献［１］、［２］、［３］、［４］を安易に応用し、カメラ運動と３次元情報を同時に復元することはできない。 It becomes. (U _ij , v _ij ) in the equation (A3) are uv coordinate values defined in the present invention, and u _ij = x _ij / z _ij and v _ij = y _ij / z _ij . In this projection model, as can be seen from the right side of Expression (A3), the matrix on the left side of the right side is information that depends on the time series (information on camera motion), and the matrix on the right side of the right side is invariant that does not depend on the time series. It is information (three-dimensional information of the outside world). That is, the left side of equation (A3) is related to information about camera motion and three-dimensional information in addition to the information obtained from the image. Even when the above camera motion is limited, the projection model is too complicated and the document [1]. , [2], [3], [4] cannot be easily applied to restore camera motion and three-dimensional information at the same time.

本発明では、式（Ａ３）の投影モデルに基づき、ロール回転ω_i、ピッチ回転ψ_i、ヨー回転θ_i、ＸＹＺ並進運動（Ｔｘ_i，Ｔｙ_i，Ｔｚ_i）；ｉ＝１，２，…，Ｆ（本発明が復元するカメラパラメータであり、単にカメラ運動と称する場合もある）、並びに外界の３次元情報（Ｘ_j，Ｙ_j，Ｚ_j）；ｊ＝１，２，…，Ｐ（以下、３次元情報とは、Ｐ個の特徴点に関する３次元座標値を指す）を復元するための復元処理方式とするものである。ただし、文献［１］、［２］、［３］、［４］を強制的に、式（Ａ３）の投影モデルに応用しない。 In the present invention, roll rotation ω _i , pitch rotation ψ _i , yaw rotation θ _i , XYZ translational motion (Tx _i , Ty _i , Tz _i ); i = 1, 2,. , F (camera parameters restored by the present invention, which may be simply referred to as camera motion), and external three-dimensional information (X _j , Y _j , Z _j ); j = 1, 2,. Hereinafter, the three-dimensional information is a restoration processing method for restoring (three-dimensional coordinate values related to P feature points). However, the documents [1], [2], [3], and [4] are not forcibly applied to the projection model of Expression (A3).

本発明では、式（Ａ３）を解くために、段階的な復元を繰り返し行うことで、近似的に、式（Ａ３）の投影モデルに基づいたカメラ運動と３次元情報を復元する。ここで、式（Ａ３）の投影モデルでは簡単にカメラ運動と３次元情報を復元することは困難なので、式（Ａ３）において初期化を行い、簡略化したモデルを使ってカメラ運動と３次元情報を復元する。まず、ロール回転、ピッチ回転を無視して、ε_ij＝０，ｉ＝１,２,…,Ｆ、ｊ＝１,２,…,Ｐ、並びに（ζ_i,η_i）＝（０,０），ｉ＝１,２,…,Ｆとする。すると、 In the present invention, in order to solve the equation (A3), the stepwise restoration is repeatedly performed to approximately restore the camera motion and the three-dimensional information based on the projection model of the equation (A3). Here, since it is difficult to easily restore camera motion and three-dimensional information with the projection model of equation (A3), initialization is performed in equation (A3), and camera motion and three-dimensional information are used using the simplified model. To restore. First, ignoring roll rotation and pitch rotation, ε _ij = 0, i = 1, 2,..., F, j = 1, 2,..., P, and (ζ _i , η _i ) = (0, 0 ), I = 1, 2,... Then

と簡略化できる。ところが、式（Ａ４）左辺にはＴｚ_i；ｉ＝１，２，…，ＦというＺ軸並進運動の成分とＺ_j；ｊ＝１，２，…，Ｐという３次元情報の高さ情報が含まれている。したがって、ｕｖ座標値からなる計測行列だけから式（Ａ４）右辺のようにカメラ運動と３次元情報に相当する情報を得ることができない。 And can be simplified. However, in the left side of the formula (A4), there are Z-axis translational components Tz _i ; i = 1, 2,..., F and three-dimensional information height information Z _j ; include. Accordingly, information corresponding to camera motion and three-dimensional information cannot be obtained from only the measurement matrix composed of the uv coordinate values as in the right side of Expression (A4).

本発明の平面運動・３次元情報復元手法では、式（Ａ４）において、まず、Ｚ軸並進運動が無いと見なした投影モデルを使って運動と３次元情報を復元する処理をする。つまり、文献［３］、［４］を応用するため、投影モデルを、 In the planar motion / three-dimensional information restoration method of the present invention, in Expression (A4), first, a process of restoring motion and three-dimensional information is performed using a projection model that is regarded as having no Z-axis translational motion. In other words, in order to apply documents [3] and [4], the projection model is

とする。式（Ａ５）は後に示す式（５ｂ）において後に示す式（６）の初期化を行うと、平面運動と３次元情報を復元するために処理する行列分解データとなる。この行列データ（計測行列）は計測行列測定ステップにて得られるものである。式（Ａ５）の左辺のｕｖ座標値（画像から求められる情報）からなる計測行列から、式（Ａ５）の投影モデルに基づき、式（Ａ５）右辺を文献［３］、［４］を応用して、カメラ運動（式（Ａ５）右辺の左側の行列は各フレームにおけるヨー回転、すなわち，図１０のＺ軸（光軸）周りの回転θと，図１０でのＸＹ並進運動（Ｔｘ,Ｔｙ）の成分を含んでおり、この自由度３の運動を平面運動と称する）と３次元情報を復元することができる。 And Expression (A5) becomes matrix decomposition data to be processed to restore plane motion and three-dimensional information when initialization of Expression (6) described later is performed in Expression (5b) described later. This matrix data (measurement matrix) is obtained in the measurement matrix measurement step. From the measurement matrix consisting of the uv coordinate value (information obtained from the image) on the left side of equation (A5), based on the projection model of equation (A5), the right side of equation (A5) is applied to documents [3] and [4]. The matrix on the left side of the camera motion (formula (A5)) is the yaw rotation in each frame, that is, the rotation θ around the Z axis (optical axis) in FIG. 10, and the XY translational motion (Tx, Ty) in FIG. This motion with 3 degrees of freedom is referred to as plane motion), and three-dimensional information can be restored.

式（Ａ５）は、文献［３］、［４］が利用している投影モデルであり、この場合、カメラは平面運動をすることを前提としている。つまり、ある面に対してカメラがロール回転、ピッチ回転、Ｚ軸並進の運動をしないことを前提とした投影モデルである。 Expression (A5) is a projection model used in documents [3] and [4]. In this case, it is assumed that the camera performs a planar motion. That is, the projection model is based on the assumption that the camera does not perform roll rotation, pitch rotation, or Z-axis translation with respect to a certain surface.

本発明は、文献［３］、［４］を利用して、平面運動と３次元情報を復元し、式（Ａ５）と式（Ａ４）の間の誤差からＺ軸並進運動を復元し、さらに、これまでに復元したカメラ運動、３次元情報と、式（Ａ３）の投影モデルとの誤差からロール回転、ピッチ回転を復元する。このような処理を繰り返す中で，線形演算主体の文献［３］、［４］を利用して、式（Ａ３）の投影モデルに従って魚眼カメラのカメラ運動（図１０のロール回転、ピッチ回転、ヨー回転，ＸＹＺ並進運動）と外界の３次元情報（ＸＹＺ）を同時に、かつ、ロバストに復元する。 The present invention uses the documents [3] and [4] to restore the plane motion and the three-dimensional information, restore the Z-axis translational motion from the error between the equations (A5) and (A4), The roll rotation and the pitch rotation are restored from the error between the camera motion and the three-dimensional information restored so far and the projection model of the formula (A3). In repeating such processing, the camera motion of the fisheye camera (roll rotation, pitch rotation, FIG. 10) according to the projection model of the formula (A3) using the documents [3] and [4] mainly based on linear operations. Yaw rotation, XYZ translation) and the external three-dimensional information (XYZ) are restored simultaneously and robustly.

以上のことから、本発明は以下の方法、装置、および記録媒体を特徴とする。 As described above, the present invention is characterized by the following method, apparatus, and recording medium.

（方法の発明）
（１）魚眼カメラなどの画像入力装置で取得した時系列の全方位画像または広視野画像中において、対象とする画像に配置した特徴点に関する画像座標値の時間的変化量から、時系列におけるカメラ視点の運動、並びに、外界の物体形状を構成する３次元情報を復元する方法であって、
外界の３次元座標値を、視点を中心とした球面に投影した座標値において、カメラの進行方向の直交方向成分を高さ方向成分で割った値、及びカメラの進行方向成分を高さ方向成分で割った値を、それぞれをｕ座標及びｖ座標とし、時系列の全方位画像または広視野画像に設定した特徴点座標系において、各画像における特徴点の画像座標値を得て、各画像座標値から算出される位相角と仰角により、その特徴点のｕｖ座標値を求める計測行列測定ステップと、
前記計測行列測定ステップで得た全ｕｖ座標値を用いて、復元処理するための行列分解データを生成し、この行列分解データを特異値分解し、雑音除去を行って運動情報を表す行列データと３次元情報を表す行列データを得て、その運動情報において、運動を規定するために設定した条件を満足する変換行列を求め、運動情報を表す行列データにこの変換行列を作用させてカメラ視点に関する光軸になるＺ軸周りの回転運動と二軸のＸＹ軸方向の並進運動を復元し、並びに、３次元情報を表す行列データにこの変換行列の逆行列を作用させて物体形状を構成する３次元情報を復元する平面運動・３次元情報復元ステップと、
前記平面運動・３次元情報復元ステップで得た平面運動と３次元情報から各特徴点に関するｕｖ座標系での再投影座標値を求め、この再投影座標値と前記計測行列測定ステップで得た各特徴点のｕｖ座標値との間の誤差と、前記平面運動・３次元情報復元ステップで得た各特徴点のＺ座標値を使って、カメラ視点のＺ軸方向の並進運動を復元するＺ軸運動復元ステップと、
前記行列分解データの中から、光軸以外の他の軸周りの回転運動に寄与する成分を取り出し、それらの成分と前記平面運動・３次元情報復元ステップとＺ軸運動復元ステップで復元した運動、並びに３次元情報を使って、光軸以外の他軸周りの回転運動を復元する回転運動復元ステップと、
前記Ｚ軸運動復元ステップで求めた誤差がある一定値以下に収束しているかどうかを判断し、もし、収束していないならば、平面運動と３次元情報の復元ステップで得たカメラ視点のＸＹ並進成分と３次元情報、並びに、光軸以外のＸＹ軸周りの回転角を使って、ｕｖ座標値を変形させる係数εを求め、前記平面運動と３次元情報の復元ステップで得た各特徴点の高さ情報と前記Ｚ軸並進運動復元ステップで得た各視点のＺ軸上の位置を使って、ｕｖ座標値をスケール変換する係数δを求め、係数εを前記計測行列測定ステップで得た計測行列の各行列要素にそれぞれ掛け合わせて得た値と、前記計測行列測定ステップで得た計測行列の各行列要素から交点座標値を差し引いた各座標値に係数δをそれぞれ掛け合わせて得た値を合わせた新たな行列要素とする行列分解データを生成し、前記情報復元ステップに戻り、誤差が一定値以下に収束するまで反復して、カメラ運動と３次元情報を復元することを特徴とする。 (Invention of method)
(1) In a time-series omnidirectional image or a wide-field image acquired by an image input device such as a fish-eye camera, a temporal change amount of an image coordinate value related to a feature point arranged in a target image. A method for restoring the three-dimensional information constituting the movement of the camera viewpoint and the object shape of the outside world,
The value obtained by dividing the orthogonal direction component of the moving direction of the camera by the height direction component in the coordinate value obtained by projecting the three-dimensional coordinate value of the outside world onto the spherical surface with the viewpoint as the center, and the moving direction component of the camera as the height direction component In the feature point coordinate system set as a time-series omnidirectional image or wide-field image, with the values divided by u as the u coordinate and v coordinate, the image coordinate values of the feature points in each image are obtained, and the image coordinates A measurement matrix measurement step for obtaining a uv coordinate value of the feature point from the phase angle and the elevation angle calculated from the values;
Using all the uv coordinate values obtained in the measurement matrix measurement step, matrix decomposition data for restoration processing is generated, the matrix decomposition data is subjected to singular value decomposition, noise is removed, and matrix data representing motion information is obtained. Matrix data representing three-dimensional information is obtained, and in the motion information, a transformation matrix that satisfies the conditions set for defining the motion is obtained, and this transformation matrix is applied to the matrix data representing the motion information to relate to the camera viewpoint. The rotational motion around the Z-axis that becomes the optical axis and the translational motion in the XY-axis directions of the two axes are restored, and the inverse matrix of this transformation matrix is applied to matrix data representing three-dimensional information to form the object shape 3 Planar motion / three-dimensional information restoration step for restoring dimensional information;
The reprojection coordinate value in the uv coordinate system for each feature point is obtained from the plane motion and the three-dimensional information obtained in the plane motion / three-dimensional information restoration step, and each reprojection coordinate value and each of the measurement matrix measurement steps obtained in the measurement matrix measurement step are obtained. The Z axis that restores the translational motion of the camera viewpoint in the Z axis direction using the error between the uv coordinate value of the feature point and the Z coordinate value of each feature point obtained in the plane motion / three-dimensional information restoration step A motion recovery step;
From the matrix decomposition data, components that contribute to rotational motion around axes other than the optical axis are extracted, and those components and motion restored in the plane motion / three-dimensional information restoration step and Z-axis motion restoration step, In addition, using the three-dimensional information, a rotational motion restoring step for restoring rotational motion around another axis other than the optical axis;
It is determined whether or not the error obtained in the Z-axis motion restoration step has converged to a certain value or less. If not, the camera viewpoint XY obtained in the planar motion and three-dimensional information restoration step is determined. Using the translation component, the three-dimensional information, and the rotation angle around the XY axis other than the optical axis, a coefficient ε for deforming the uv coordinate value is obtained, and each feature point obtained in the plane motion and the three-dimensional information restoration step The coefficient δ for scaling the uv coordinate value is obtained using the height information of the image and the position on the Z axis of each viewpoint obtained in the Z axis translational motion restoration step, and the coefficient ε is obtained in the measurement matrix measurement step. Obtained by multiplying the coefficient δ by the value obtained by multiplying each matrix element of the measurement matrix and each coordinate value obtained by subtracting the intersection coordinate value from each matrix element of the measurement matrix obtained in the measurement matrix measurement step. New value combined Generates matrix decomposition data to column element, returns to the information restoration step, iteratively until the error converges below a predetermined value, characterized in that to restore the camera motion and three-dimensional information.

（２）魚眼カメラなどの画像入力装置で取得した時系列の全方位画像または広視野画像中において、対象とする画像に配置した特徴点に関する画像座標値の時間的変化量から、時系列におけるカメラ視点の運動、並びに、外界の物体形状を構成する３次元情報を復元する方法であって、
画像入力装置が取得した全方位画像または広視野画像に対して、各画像座標値から得られる位相角と仰角により、各画像座標値をｕｖ座標値に変換して各画素値をそのｕｖ座標値に対応付けるように時系列ｕｖ画像を生成し、このｕｖ画像に設定した特徴点座標系において各画像における特徴点のｕｖ座標値を測定するｕｖ計測行列測定ステップと、
前記ｕｖ計測行列測定ステップで得た各ｕｖ座標値に対して、復元処理するための行列分解データを生成し、この行列分解データを特異値分解し、雑音除去を行って運動情報を表す行列データと３次元情報を表す行列データを得て、その運動情報において、運動を規定するために設定した条件を満足する変換行列を求め、運動情報を表す行列データにこの変換行列を作用させてカメラ視点に関する光軸となるＺ軸周りの回転運動と二軸のＸＹ軸方向の並進運動、並びに、３次元情報を表す行列データにこの変換行列の逆行列を作用させて物体形状を構成する３次元情報を復元する平面運動・３次元情報復元ステップと、
前記平面運動・３次元情報復元ステップで得た平面運動と３次元情報から各特徴点に関するｕｖ座標系での再投影値を求め、この再投影値と前記ｕｖ計測行列測定ステップで得た各特徴点のｕｖ座標値との間の誤差と、前記平面運動・３次元情報復元ステップで得た各特徴点のＺ座標値を使って、カメラ視点のＺ軸方向の並進運動を復元するＺ軸運動復元ステップと、
前記行列分解データの中から、光軸以外の他の軸周りの回転運動に寄与する成分を取り出し、それらの成分と、前記平面運動・３次元情報復元ステップとＺ軸運動復元ステップで復元した運動、並びに３次元情報を使って、光軸以外の他軸周りの回転運動を復元する回転運動復元ステップと、
前記Ｚ軸運動復元ステップで求めた誤差がある一定値以下に収束しているかどうかを判断し、もし、収束していないならば、平面運動と前記３次元情報の復元ステップで得たカメラ視点のＸＹ並進成分と３次元情報、並びに、光軸以外のＸＹ軸周りの回転角を使って、ｕｖ座標値を変形させる係数εを求め、前記平面運動と３次元情報の復元ステップで得た各特徴点の高さ情報と前記Ｚ軸並進運動復元ステップで得た各視点のＺ軸上の位置を使って、ｕｖ座標値をスケール変換する係数δを求め、前記係数εを計測行列測定ステップで得た計測行列の各行列要素にそれぞれ掛け合わせて得た値と、前記計測行列測定ステップで得た計測行列の各行列要素から交点座標値を差し引いた各座標値に前記係数δをそれぞれ掛け合わせて得た値を合わせた新たな行列要素とする行列分解データを生成し、前記情報復元ステップに戻り、誤差が一定値以下に収束するまで反復して、カメラ運動と３次元情報を復元することを特徴とする。 (2) From a temporal change amount of image coordinate values regarding feature points arranged in a target image in a time-series omnidirectional image or a wide-field image acquired by an image input device such as a fisheye camera, A method for restoring the three-dimensional information constituting the movement of the camera viewpoint and the object shape of the outside world,
With respect to the omnidirectional image or wide-field image acquired by the image input device, each image coordinate value is converted into a uv coordinate value by using a phase angle and an elevation angle obtained from each image coordinate value, and each pixel value is converted into its uv coordinate value. A uv measurement matrix measurement step of generating a time series uv image so as to be associated with each other, and measuring a uv coordinate value of a feature point in each image in a feature point coordinate system set in the uv image;
Matrix decomposition data for restoration processing is generated for each uv coordinate value obtained in the uv measurement matrix measurement step, the matrix decomposition data is subjected to singular value decomposition, noise is removed, and matrix data representing motion information is generated. And obtaining matrix data representing 3D information, obtaining a transformation matrix satisfying the conditions set for defining the motion in the motion information, and applying this transformation matrix to the matrix data representing the motion information, 3D information that constitutes the object shape by applying the inverse matrix of this transformation matrix to the matrix data representing the 3D information and the rotational motion around the Z axis that is the optical axis and the translational motion in the X and Y directions of the 2 axes Plane motion and 3D information restoration step to restore
A reprojection value in the uv coordinate system for each feature point is obtained from the plane motion and the three-dimensional information obtained in the plane motion / three-dimensional information restoration step, and each feature obtained in the reprojection value and the uv measurement matrix measurement step is obtained. A Z-axis motion that restores the translational motion of the camera viewpoint in the Z-axis direction using the error between the uv coordinate value of the point and the Z-coordinate value of each feature point obtained in the plane motion / three-dimensional information restoration step. A restore step,
From the matrix decomposition data, components that contribute to rotational motion around other axes other than the optical axis are extracted, and those components and motion restored in the plane motion / three-dimensional information restoration step and Z-axis motion restoration step. And a rotational motion restoring step for restoring rotational motion around another axis other than the optical axis using the three-dimensional information,
It is determined whether or not the error obtained in the Z-axis motion restoration step has converged to a certain value or less. If not, the camera viewpoint obtained in the planar motion and the three-dimensional information restoration step is determined. Using the XY translation component, the three-dimensional information, and the rotation angle around the XY axis other than the optical axis, a coefficient ε for deforming the uv coordinate value is obtained, and each feature obtained in the plane motion and the three-dimensional information restoration step Using the point height information and the position of each viewpoint on the Z-axis obtained in the Z-axis translational motion restoration step, a coefficient δ for scaling the uv coordinate value is obtained, and the coefficient ε is obtained in the measurement matrix measurement step. The value obtained by multiplying each matrix element of the measured matrix and the coordinate value obtained by subtracting the intersection coordinate value from each matrix element of the measurement matrix obtained in the measurement matrix measurement step are multiplied by the coefficient δ, respectively. Combine the obtained values Generates matrix decomposition data as a new matrix elements back to the information restoration step, iteratively until the error converges below a predetermined value, characterized in that to restore the camera motion and three-dimensional information.

（３）前記計測行列測定ステップにおいて、計測行列を特異値分解して得る特異値成分において、その成分から運動を表す判定値を算出し、その判定値がある一定値未満の場合は、平面運動と見なして光軸周りの回転とその光軸に垂直な平面上の運動からなる自由度３の平面運動と３次元情報を復元する処理を行い、判定値がある一定値以上の場合は、カメラに関する回転と並進運動からなる自由度６の運動と外界の３次元情報を復元することを特徴とする。 (3) In the measurement matrix measurement step, in a singular value component obtained by singular value decomposition of the measurement matrix, a determination value representing motion is calculated from the component, and if the determination value is less than a certain value, planar motion If the judgment value is greater than a certain value, a process of restoring the three-dimensional information with three degrees of freedom consisting of rotation around the optical axis and movement on a plane perpendicular to the optical axis is performed. It is characterized by restoring the motion of the degree of freedom 6 consisting of rotation and translational motion and the three-dimensional information of the outside world.

（４）前記ｕｖ計測行列測定ステップにおいて、計測行列を特異値分解して得る特異値成分において、その成分から運動を表す判定値を算出し、その判定値がある一定値未満の場合は、平面運動と見なして光軸周りの回転とその光軸に垂直な平面上の運動からなる自由度３の平面運動と３次元情報を復元する処理を行い、判定値がある一定値以上の場合は、カメラに関する回転と並進運動からなる自由度６の運動と外界の３次元情報を復元することを特徴とする。 (4) In the uv measurement matrix measurement step, in a singular value component obtained by singular value decomposition of the measurement matrix, a determination value representing motion is calculated from the component, and if the determination value is less than a certain value, a plane If the judgment value is more than a certain value, the processing is to restore the 3D freedom of motion and 3D freedom, consisting of rotation around the optical axis and motion on a plane perpendicular to the optical axis. It is characterized by restoring motion with a degree of freedom of 6 consisting of rotation and translational motion and three-dimensional information of the outside world.

（装置の発明）
（５）魚眼カメラなどの画像入力装置で取得した時系列の全方位画像または広視野画像中において、対象とする画像に配置した特徴点に関する画像座標値の時間的変化量から、時系列におけるカメラ視点の運動、並びに、外界の物体形状を構成する３次元情報を復元する装置であって、
外界の３次元座標値を、視点を中心とした球面に投影した座標値において、カメラの進行方向の直交方向成分を高さ方向成分で割った値、及びカメラの進行方向成分を高さ方向成分で割った値を、それぞれをｕ座標及びｖ座標とし、時系列の全方位画像または広視野画像に設定した特徴点座標系において、各画像における特徴点の画像座標値を得て、各画像座標値から算出される位相角と仰角により、その特徴点のｕｖ座標値を求める計測行列測定手段と、
前記計測行列測定手段で得た全ｕｖ座標値を用いて、復元処理するための行列分解データを生成し、この行列分解データを特異値分解し、雑音除去を行って運動情報を表す行列データと３次元情報を表す行列データを得て、その運動情報において、運動を規定するために設定した条件を満足する変換行列を求め、運動情報を表す行列データにこの変換行列を作用させてカメラ視点に関する光軸になるＺ軸周りの回転運動と二軸のＸＹ軸方向の並進運動を復元し、並びに、３次元情報を表す行列データにこの変換行列の逆行列を作用させて物体形状を構成する３次元情報を復元する平面運動・３次元情報復元手段と、
前記平面運動・３次元情報復元手段で得た平面運動と３次元情報から各特徴点に関するｕｖ座標系での再投影座標値を求め、この再投影座標値と前記計測行列測定手段で得た各特徴点のｕｖ座標値との間の誤差と、前記平面運動・３次元情報復元手段で得た各特徴点のＺ座標値を使って、カメラ視点のＺ軸方向の並進運動を復元するＺ軸運動復元手段と、
前記行列分解データの中から、光軸以外の他の軸周りの回転運動に寄与する成分を取り出し、それらの成分と前記平面運動・３次元情報復元手段とＺ軸運動復元手段で復元した運動、並びに３次元情報を使って、光軸以外の他軸周りの回転運動を復元する回転運動復元手段と、
前記Ｚ軸運動復元手段で求めた誤差がある一定値以下に収束しているかどうかを判断し、もし、収束していないならば、平面運動と３次元情報の復元手段で得たカメラ視点のＸＹ並進成分と３次元情報、並びに、光軸以外のＸＹ軸周りの回転角を使って、ｕｖ座標値を変形させる係数εを求め、前記平面運動と３次元情報の復元手段で得た各特徴点の高さ情報と前記Ｚ軸並進運動復元手段で得た各視点のＺ軸上の位置を使って、ｕｖ座標値をスケール変換する係数δを求め、係数εを前記計測行列測定手段で得た計測行列の各行列要素にそれぞれ掛け合わせて得た値と、前記計測行列測定手段で得た計測行列の各行列要素から交点座標値を差し引いた各座標値に係数δをそれぞれ掛け合わせて得た値を合わせた新たな行列要素とする行列分解データを生成し、前記情報復元手段に戻り、誤差が一定値以下に収束するまで反復して、カメラ運動と３次元情報を復元することを特徴とする。 (Invention of the device)
(5) From a temporal change amount of image coordinate values regarding feature points arranged in a target image in a time-series omnidirectional image or a wide-field image acquired by an image input device such as a fisheye camera, A device that restores the movement of the camera viewpoint and the three-dimensional information constituting the object shape of the outside world,
The value obtained by dividing the orthogonal direction component of the moving direction of the camera by the height direction component in the coordinate value obtained by projecting the three-dimensional coordinate value of the outside world onto the spherical surface with the viewpoint as the center, and the moving direction component of the camera as the height direction component In the feature point coordinate system set as a time-series omnidirectional image or wide-field image, with the values divided by u as the u coordinate and v coordinate, the image coordinate values of the feature points in each image are obtained, and the image coordinates A measurement matrix measuring means for obtaining a uv coordinate value of the feature point from a phase angle and an elevation angle calculated from the values;
Using all uv coordinate values obtained by the measurement matrix measurement means, matrix decomposition data for restoration processing is generated, and this matrix decomposition data is subjected to singular value decomposition, noise is removed, and matrix data representing motion information is obtained. Matrix data representing three-dimensional information is obtained, and in the motion information, a transformation matrix that satisfies the conditions set for defining the motion is obtained, and this transformation matrix is applied to the matrix data representing the motion information to relate to the camera viewpoint. The rotational motion around the Z-axis that becomes the optical axis and the translational motion in the XY-axis directions of the two axes are restored, and the inverse matrix of this transformation matrix is applied to matrix data representing three-dimensional information to form the object shape 3 Plane motion / three-dimensional information restoring means for restoring dimensional information;
The reprojection coordinate value in the uv coordinate system for each feature point is obtained from the plane motion obtained by the plane motion / three-dimensional information restoration means and the three-dimensional information, and the reprojection coordinate value and each measurement matrix measurement means obtained by the measurement matrix measurement means are obtained. The Z axis that restores the translational motion of the camera viewpoint in the Z axis direction using the error between the uv coordinate value of the feature point and the Z coordinate value of each feature point obtained by the plane motion / three-dimensional information restoration means Movement recovery means,
From the matrix decomposition data, components that contribute to rotational motion around axes other than the optical axis are extracted, and those components and motion restored by the planar motion / three-dimensional information restoring means and Z-axis motion restoring means, And rotational motion restoring means for restoring rotational motion around another axis other than the optical axis using three-dimensional information,
It is determined whether or not the error obtained by the Z-axis motion restoring means has converged to a certain value or less. If not converged, XY of the camera viewpoint obtained by the planar motion and three-dimensional information restoring means is determined. Using the translation component, the three-dimensional information, and the rotation angle around the XY axis other than the optical axis, a coefficient ε for deforming the uv coordinate value is obtained, and each feature point obtained by the plane motion and the three-dimensional information restoration means The coefficient δ for scaling the uv coordinate value is obtained using the height information of the image and the position on the Z-axis of each viewpoint obtained by the Z-axis translational movement restoring means, and the coefficient ε is obtained by the measurement matrix measuring means. Obtained by multiplying the value obtained by multiplying each matrix element of the measurement matrix by the coefficient δ by each coordinate value obtained by subtracting the intersection coordinate value from each matrix element of the measurement matrix obtained by the measurement matrix measurement means Matrix decomposition data with new matrix elements combined with values It generates data, return to the information restoring means, iteratively until the error converges below a predetermined value, characterized in that to restore the camera motion and three-dimensional information.

（６）魚眼カメラなどの画像入力装置で取得した時系列の全方位画像または広視野画像中において、対象とする画像に配置した特徴点に関する画像座標値の時間的変化量から、時系列におけるカメラ視点の運動、並びに、外界の物体形状を構成する３次元情報を復元する装置であって、
画像入力装置が取得した全方位画像または広視野画像に対して、各画像座標値から得られる位相角と仰角により、各画像座標値をｕｖ座標値に変換して各画素値をそのｕｖ座標値に対応付けるように時系列ｕｖ画像を生成し、このｕｖ画像に設定した特徴点座標系において各画像における特徴点のｕｖ座標値を測定するｕｖ計測行列測定手段と、
前記ｕｖ計測行列測定手段で得た各ｕｖ座標値に対して、復元処理するための行列分解データを生成し、この行列分解データを特異値分解し、雑音除去を行って運動情報を表す行列データと３次元情報を表す行列データを得て、その運動情報において、運動を規定するために設定した条件を満足する変換行列を求め、運動情報を表す行列データにこの変換行列を作用させてカメラ視点に関する光軸となるＺ軸周りの回転運動と二軸のＸＹ軸方向の並進運動、並びに、３次元情報を表す行列データにこの変換行列の逆行列を作用させて物体形状を構成する３次元情報を復元する平面運動・３次元情報復元手段と、
前記平面運動・３次元情報復元手段で得た平面運動と３次元情報から各特徴点に関するｕｖ座標系での再投影値を求め、この再投影値と前記ｕｖ計測行列測定手段で得た各特徴点のｕｖ座標値との間の誤差と、前記平面運動・３次元情報復元手段で得た各特徴点のＺ座標値を使って、カメラ視点のＺ軸方向の並進運動を復元するＺ軸運動復元手段と、
前記行列分解データの中から、光軸以外の他の軸周りの回転運動に寄与する成分を取り出し、それらの成分と、前記平面運動・３次元情報復元手段とＺ軸運動復元手段で復元した運動、並びに３次元情報を使って、光軸以外の他軸周りの回転運動を復元する回転運動復元手段と、
前記Ｚ軸運動復元手段で求めた誤差がある一定値以下に収束しているかどうかを判断し、もし、収束していないならば、平面運動と前記３次元情報の復元手段で得たカメラ視点のＸＹ並進成分と３次元情報、並びに、光軸以外のＸＹ軸周りの回転角を使って、ｕｖ座標値を変形させる係数εを求め、前記平面運動と３次元情報の復元手段で得た各特徴点の高さ情報と前記Ｚ軸並進運動復元手段で得た各視点のＺ軸上の位置を使って、ｕｖ座標値をスケール変換する係数δを求め、前記係数εを計測行列測定手段で得た計測行列の各行列要素にそれぞれ掛け合わせて得た値と、前記計測行列測定手段で得た計測行列の各行列要素から交点座標値を差し引いた各座標値に前記係数δをそれぞれ掛け合わせて得た値を合わせた新たな行列要素とする行列分解データを生成し、前記情報復元手段に戻り、誤差が一定値以下に収束するまで反復して、カメラ運動と３次元情報を復元することを特徴とする。 (6) From a temporal change amount of image coordinate values regarding feature points arranged in a target image in a time-series omnidirectional image or a wide-field image acquired by an image input device such as a fish-eye camera, A device that restores the movement of the camera viewpoint and the three-dimensional information constituting the object shape of the outside world,
With respect to the omnidirectional image or wide-field image acquired by the image input device, each image coordinate value is converted into a uv coordinate value by using a phase angle and an elevation angle obtained from each image coordinate value, and each pixel value is converted into its uv coordinate value. A uv measurement matrix measuring means for generating a time series uv image so as to be associated with each other and measuring the uv coordinate value of the feature point in each image in the feature point coordinate system set in the uv image;
For each uv coordinate value obtained by the uv measurement matrix measurement means, matrix decomposition data for restoration processing is generated, the matrix decomposition data is subjected to singular value decomposition, noise is removed, and matrix data representing motion information is generated. And obtaining matrix data representing 3D information, obtaining a transformation matrix satisfying the conditions set for defining the motion in the motion information, and applying this transformation matrix to the matrix data representing the motion information, 3D information that constitutes the object shape by applying the inverse matrix of this transformation matrix to the matrix data representing the 3D information and the rotational motion around the Z axis that is the optical axis and the translational motion in the X and Y directions of the 2 axes Plane motion / three-dimensional information restoration means for restoring
A reprojection value in the uv coordinate system for each feature point is obtained from the plane motion obtained by the plane motion / three-dimensional information restoration means and the three-dimensional information, and the reprojection value and each feature obtained by the uv measurement matrix measurement means. Z-axis motion that restores the translational motion of the camera viewpoint in the Z-axis direction using the error between the uv coordinate value of the point and the Z-coordinate value of each feature point obtained by the plane motion / three-dimensional information restoration means Recovery means,
From the matrix decomposition data, components that contribute to rotational motion around axes other than the optical axis are extracted, and those components and motion restored by the planar motion / three-dimensional information restoring means and the Z-axis motion restoring means. , As well as rotational motion restoring means for restoring rotational motion around another axis other than the optical axis using three-dimensional information;
It is determined whether or not the error obtained by the Z-axis motion restoration means has converged to a certain value or less. If not, the camera viewpoint obtained by the planar motion and the three-dimensional information restoration means is determined. Using the XY translation component, the three-dimensional information, and the rotation angle around the XY axis other than the optical axis, a coefficient ε for deforming the uv coordinate value is obtained, and each feature obtained by the plane motion and the three-dimensional information restoration means Using the point height information and the position of each viewpoint on the Z-axis obtained by the Z-axis translational motion restoration means, a coefficient δ for scaling the uv coordinate value is obtained, and the coefficient ε is obtained by the measurement matrix measurement means. The value obtained by multiplying each matrix element of the measured matrix and the coordinate value obtained by subtracting the intersection coordinate value from each matrix element of the measurement matrix obtained by the measurement matrix measuring means are multiplied by the coefficient δ, respectively. A new matrix element combining the obtained values Generate separation data, return to the information restoring means, iteratively until the error converges below a predetermined value, characterized in that to restore the camera motion and three-dimensional information.

（７）前記計測行列測定手段において、計測行列を特異値分解して得る特異値成分において、その成分から運動を表す判定値を算出し、その判定値がある一定値未満の場合は、平面運動と見なして光軸周りの回転とその光軸に垂直な平面上の運動からなる自由度３の平面運動と３次元情報を復元する処理を行い、判定値がある一定値以上の場合は、カメラに関する回転と並進運動からなる自由度６の運動と外界の３次元情報を復元することを特徴とする。 (7) In the measurement matrix measurement means, in a singular value component obtained by singular value decomposition of the measurement matrix, a determination value representing motion is calculated from the component, and if the determination value is less than a certain value, plane motion If the judgment value is greater than a certain value, a process of restoring the three-dimensional information with three degrees of freedom consisting of rotation around the optical axis and movement on a plane perpendicular to the optical axis is performed. It is characterized by restoring the motion of the degree of freedom 6 consisting of rotation and translational motion and the three-dimensional information of the outside world.

（８）前記ｕｖ計測行列測定手段において、計測行列を特異値分解して得る特異値成分において、その成分から運動を表す判定値を算出し、その判定値がある一定値未満の場合は、平面運動と見なして光軸周りの回転とその光軸に垂直な平面上の運動からなる自由度３の平面運動と３次元情報を復元する処理を行い、判定値がある一定値以上の場合は、カメラに関する回転と並進運動からなる自由度６の運動と外界の３次元情報を復元することを特徴とする。 (8) In the uv measurement matrix measurement means, in a singular value component obtained by singular value decomposition of the measurement matrix, a determination value representing motion is calculated from the component, and if the determination value is less than a certain value, a plane If the judgment value is more than a certain value, the processing is to restore the 3D freedom of motion and 3D freedom, consisting of rotation around the optical axis and motion on a plane perpendicular to the optical axis. It is characterized by restoring motion with a degree of freedom of 6 consisting of rotation and translational motion and three-dimensional information of the outside world.

（記録媒体の発明）
（９）上記の（１）〜（４）のいずれか１項に記載の魚眼カメラ運動と３次元情報の復元方法を、コンピュータで実行可能に記載したプログラムを記録したことを特徴とする。 (Invention of recording medium)
(9) A program in which the fisheye camera motion and the three-dimensional information restoration method according to any one of (1) to (4) above are recorded so as to be executable by a computer is recorded.

本発明によれば、魚眼カメラを使って取得した時系列画像全般（移動手段を利用して撮影した車載画像、海上画像、空撮画像、屋内画像、さらには、魚眼カメラや魚眼レンズを装着したカメラを手動撮影により取得した時系列画像）から、対象物に関する物体形状を高精度に獲得、復元することが可能となる。また、これまでの測量技術並の高精度な３次元計測が可能である。 According to the present invention, all time-series images acquired using a fish-eye camera (vehicle-mounted images, marine images, aerial images, indoor images taken using moving means, and further equipped with a fish-eye camera or fish-eye lens) It is possible to acquire and restore the object shape related to the object with high accuracy from the time-series images acquired by manual shooting of the camera. In addition, high-accuracy three-dimensional measurement equivalent to conventional surveying techniques is possible.

また、本発明は、車載カメラなどは走行中の振動がある状態においても雑音にロバストに、カメラ姿勢の微小な変動（三軸周り回転運動）、並びに、ＧＰＳなどのリモートセンサを補間する精度の三軸方向の並進運動を正確に計測することが可能となる。 In addition, the present invention is such that in-vehicle cameras and the like are robust to noise even in the presence of vibration while traveling, with minute fluctuations in camera posture (rotation around three axes), and accuracy of interpolation of remote sensors such as GPS. It becomes possible to accurately measure the translational motion in three axes.

また、本発明では、利用する計算処理が大半が線形演算で構成されるため、コンピュータ言語での実装が容易である。 Further, in the present invention, most of the calculation processing to be used is composed of linear operations, so that implementation in a computer language is easy.

また、本発明では、全方位カメラを使った画像計測などのアプリケーションにおいて、魚眼カメラの光軸を鉛直上向きにした姿勢を前提としてない。つまり、魚眼カメラを鉛直上向きの姿勢に向けずに用いる場合（図１３）で取得した時系列画像にも適用でき、カメラの運動（三軸周りの回転と三軸方向の並進運動）と３次元情報を同時に、かつ、ロバストに復元することができる。 In the present invention, in an application such as image measurement using an omnidirectional camera, the posture with the optical axis of the fisheye camera vertically upward is not assumed. That is, it can also be applied to a time-series image acquired when the fisheye camera is used without being directed vertically upward (FIG. 13). The camera motion (rotation around three axes and translational motion in three axes) and 3 Dimensional information can be restored simultaneously and robustly.

（実施形態１）
図１は請求項１等の発明に関する基本構成図である。本実施形態では、等距離投影（光軸からの入射角に比例した投影）で光学的に設計された魚眼レンズを使用した魚眼カメラ（以降、カメラ）の場合で説明するが、他の投影（立体角投影、等立体角投影）で設計された魚眼レンズを使用した魚眼カメラ撮像装置で取得した時系列画像に対しても適用可能である。 (Embodiment 1)
FIG. 1 is a basic configuration diagram relating to the first aspect of the invention. In the present embodiment, a fisheye camera (hereinafter referred to as a camera) using a fisheye lens optically designed by equidistant projection (projection proportional to the incident angle from the optical axis) will be described. The present invention can also be applied to a time-series image acquired by a fish-eye camera imaging device using a fish-eye lens designed with a three-dimensional angle projection or an equal solid angle projection.

図１０において、本発明で復元する対象の空間中の点Ｐ（Ｘ_j，Ｙ_j，Ｚ_j）と、カメラの運動、すなわち、ロール回転（ω_i）、ピッチ回転（ψ_i）、ヨー回転（θ_i）、並びに、並進運動Ｔ_i（Ｔｘ_i，Ｔｙ_i，Ｔｚ_i）を説明する。図１０は、カメラと対象物（被写体）との位置関係を表しており、説明の都合上、カメラが進行する軸をＹ軸とする。カメラは、並進運動（Ｔｘ_i，Ｔｙ_i，Ｔｚ_i）で移動しながら、ロール回転（ω_i）、ピッチ回転（ψ_i）、ヨー回転（θ_i）の回転をする。このとき、カメラ光軸をＺ軸方向とする。 In FIG. 10, the point P (X _j , Y _j , Z _j ) in the space to be restored in the present invention and the movement of the camera, that is, roll rotation (ω _i ), pitch rotation (ψ _i ), yaw rotation (Θ _i ) and translational movement T _i (Tx _i , Ty _i , Tz _i ) will be described. FIG. 10 shows the positional relationship between the camera and the object (subject). For convenience of explanation, the axis on which the camera travels is the Y axis. The camera rotates in roll rotation (ω _i ), pitch rotation (ψ _i ), and yaw rotation (θ _i ) while moving in translational motion (Tx _i , Ty _i , Tz _i ). At this time, the camera optical axis is the Z-axis direction.

また、投影中心Ｃの位置（視点位置Ｔ_i）は運動の中心であり、第ｉフレームでの並進運動をＴ_i（Ｔｘ_i，Ｔｙ_i，Ｔｚ_i）の位置とする。さらに、進行軸周りの回転をロール回転（Ｙ軸周りの回転）、カメラ光軸周りの回転をヨー回転（Ｚ軸周りの回転）、カメラ光軸と進行軸で張られる面の法線周りの回転をピッチ回転（Ｘ軸周りの回転）とする。 The position of the projection center C (viewpoint position T _i ) is the center of motion, and the translational motion in the i-th frame is the position of T _i (Tx _i , Ty _i , Tz _i ). Furthermore, the rotation around the traveling axis is roll rotation (rotating around the Y axis), the rotation around the camera optical axis is the yaw rotation (rotating around the Z axis), and around the normal of the surface stretched between the camera optical axis and the traveling axis. The rotation is referred to as pitch rotation (rotation around the X axis).

一方、空間中の点Ｐ（Ｘ_j，Ｙ_j，Ｚ_j）は、イメージサークル内の画像面において画像座標値（ｘ_ij，ｙ_ij）へ投影されるとする。式（１）は等距離投影と呼ばれる光学的投影であり、図１０の位相角ρ_ijは、画像座標値（ｘ_ij，ｙ_ij）から、式（２）を使って得ることができる。式（１）において、ｆは魚眼レンズの焦点距離であり、Ｄは、Ｄ＝πｆ／（２Ｒ）で決まる定数である。ただし、Ｒは図１０に示すイメージサークルの半径である。また、図１０では画像面での投影中心Ｃの画像座標値を（Ｃｘ、Ｃｙ）としている。 On the other hand, a point P (X _j , Y _j , Z _j ) in space is projected onto an image coordinate value (x _ij , y _ij ) on the image plane in the image circle. Equation (1) is an optical projection called equidistant projection, and the phase angle ρ _{ij in} FIG. 10 can be obtained from the image coordinate values (x _ij , y _ij ) using Equation (2). In Equation (1), f is the focal length of the fisheye lens, and D is a constant determined by D = πf / (2R). Here, R is the radius of the image circle shown in FIG. In FIG. 10, the image coordinate value of the projection center C on the image plane is (Cx, Cy).

次に、図１の基本構成について説明する。本発明は、全方位画像を蓄積した時系列データベース１、各画像における特徴点の座標値を測定する特徴点計測部２、この計測部で得た画像座標値をｕｖ画像に変換する計測行列変換部３、ｕｖ座標値を行列要素とする行列を初期値として与える計測行列入力部４、これらの入力データから因子分解法によりカメラの運動と外界の３次元情報を復元する因子分解法処理部５から構成される。因子分解法処理部５は、平面運動・３次元情報復元部５Ａ、Ｚ軸運動復元部５Ｂ、回転運動復元部５Ｃから構成される。 Next, the basic configuration of FIG. 1 will be described. The present invention includes a time-series database 1 that stores omnidirectional images, a feature point measurement unit 2 that measures the coordinate values of feature points in each image, and a measurement matrix conversion that converts image coordinate values obtained by the measurement unit into uv images. Unit 3, a measurement matrix input unit 4 that gives a matrix having uv coordinate values as matrix elements as initial values, and a factorization method processing unit 5 that restores camera motion and three-dimensional information of the external world from these input data by a factorization method Consists of The factorization processing unit 5 includes a plane motion / three-dimensional information restoration unit 5A, a Z-axis motion restoration unit 5B, and a rotational motion restoration unit 5C.

この構成において、時系列画像データベース１には、ハードディスク、ＲＡＩＤ装置、ＣＤ−ＲＯＭなどの記録媒体を利用する、または、ネットワークを介したリモートなデータ資源を利用する形態でもどちらでも構わない。さらに、また、図１ａは画像入力部１Ａを有してリアルタイムで処理する場合の処理構成図であり、本発明は必ずしも各データベース部などの記憶装置を必要としない。 In this configuration, the time-series image database 1 may be in the form of using a recording medium such as a hard disk, a RAID device, a CD-ROM, or a remote data resource via a network. Furthermore, FIG. 1a is a processing block diagram in the case where the image input unit 1A is provided for processing in real time, and the present invention does not necessarily require a storage device such as each database unit.

まず、特徴点計測部２において、景観を撮影した全方位画像を時系列画像データベース１から取り出し、全方位画像上に特徴点を設定する。この特徴点設定には、濃淡の１次元的、または、２次元的変化による勾配値が大きくなる画素の検出（エッジ検出など）、Ｃａｎｎｙオペレータ、ハフ変換などの画像処理手法により特徴点を自動的に発生させる。または、作業者が介在して適当に特徴点をマークするなどして初期画像上に特徴点を配置しても構わない。 First, the feature point measurement unit 2 takes out an omnidirectional image obtained by photographing a landscape from the time-series image database 1 and sets a feature point on the omnidirectional image. For this feature point setting, feature points are automatically detected by image processing techniques such as pixel detection (edge detection, etc.), Canny operator, Hough transform, etc., where the gradient value increases due to one-dimensional or two-dimensional changes in shading. To generate. Alternatively, the feature points may be arranged on the initial image by appropriately marking the feature points with an operator intervening.

このとき配置する特徴点の数をＰ個（ｊ＝１，２，…，Ｐ）とし、配置したときの特徴点の２次元座標値として、図１０での画像座標系での原点からの座標値（ｘ_1j，ｙ_1j）；ｊ＝１，２，…，Ｐを記録しておく。次に、初期画像に続く時系列画像をデータベースから１枚ずつ読み込み、初期画像に配置した特徴点を、時系列画像間の濃淡の変化に着目した手法、特徴点を中心とした画素パターンを各フレーム間で追跡する方法（例えば、ＫＬＴ法：Ｋａｎａｄｅ−Ｌｕｃａｓ−Ｔｏｍａｓｉ法）、または、全方位画像特有の投影歪みを考慮した手法を駆使して追跡し、各時系列画像（初期画像から第ｉ番目の画像）の特徴点の画像座標値として、図１０での画像座標系での原点からの２次元座標値（ｘ_1j，ｙ_1j）を記録する。時系列画像を読み出し続けた場合、特徴点が画像間で消失やオクルージョンにより消失してしまったときは追跡を停止し、特徴点追跡を終了する。特徴点追跡が終了した時点で、読み出した時系列画像の数は、初期画像を含めてＦ枚とする（ｉ＝１，２，…，Ｆ）。特徴点計測部２では、各時系列画像における特徴点の時間的な画像座標的配置の変化量は式（３）のデータ形式に配列して、計測行列［Ａ］を保持しておく。 At this time, the number of feature points to be arranged is P (j = 1, 2,..., P), and the coordinates from the origin in the image coordinate system in FIG. The values (x _1j , y _1j ); j = 1, 2,..., P are recorded. Next, the time-series images following the initial image are read from the database one by one, the feature points arranged in the initial image are changed to a method that focuses on the change in shading between the time-series images, and the pixel patterns centered on the feature points are Tracking between frames (for example, KLT method: Kanade-Lucas-Tomasi method) or a method that takes into account projection distortion peculiar to omnidirectional images, and tracking each time-series image (i. The two-dimensional coordinate values (x _1j , y _1j ) from the origin in the image coordinate system in FIG. 10 are recorded as the image coordinate values of the feature points of the (th image). When the time-series image is continuously read out, when the feature point disappears between images due to disappearance or occlusion, the tracking is stopped and the feature point tracking is ended. When the feature point tracking is completed, the number of time-series images read out is F including the initial image (i = 1, 2,..., F). The feature point measuring unit 2 arranges the amount of change in temporal image coordinate arrangement of feature points in each time-series image in the data format of Expression (3), and holds the measurement matrix [A].

次に、計測行列変換部３では、特徴点計測部２で取得した画像座標値を要素とする計測行列［Ａ］から、第ｉフレームでの第ｊ番目の特徴点の画像座標値（ｘ_ij，ｙ_ij）を取り出する。さらに、投影中心（Ｃｘ，Ｃｙ）を原点とした相対的な画像座標値に一時変換し、式（１）（２）を使って位相角ρ_ijと仰角φ_ijを求める。この位相角と仰角から式（４）の変換により座標変換し、ｕｖ座標値（ｕ_ij，ｖ_ij）を得る。 Next, in the measurement matrix conversion unit 3, the image coordinate value (x _ij) of the j-th feature point in the i-th frame is obtained from the measurement matrix [A] having the image coordinate value acquired by the feature point measurement unit 2 as an element. , Y _ij ). Further, the image is temporarily converted into relative image coordinate values with the projection center (Cx, Cy) as the origin, and the phase angle ρ _ij and the elevation angle φ _ij are obtained using equations (1) and (2). From this phase angle and elevation angle, coordinate transformation is performed by transformation of equation (4) to obtain uv coordinate values (u _ij , v _ij ).

ここで、本発明におけるｕｖ変換とｕｖ座標値の物理的意味を説明しておく。外界の３次元座標値（Ｘ，Ｙ，Ｚ）を視点Ｔｉを中心とした半径１の半球面に投影すると（Ｘ'，Ｙ'，Ｚ'）＝（ｃｏｓ（Φ）ｃｏｓ（ρ）、ｃｏｓ（Φ）ｓｉｎ（ρ），ｓｉｎ（Φ））と表される。ただし、球面座標系では図１０においてＺ軸からの角をΦとして（ｓｉｎ（Φ）ｃｏｓ（ρ）、ｓｉｎ（Φ）ｓｉｎ（ρ），ｃｏｓ（Φ））と表すが、本実施形態では、図１０での視点から３次元座標値（Ｘ，Ｙ，Ｚ）を見上げたときの仰角をΦとして、（ｓｉｎ（π／２−Φ）ｃｏｓ（ρ），ｓｉｎ（π／２−Φ）ｓｉｎ（ρ），ｃｏｓ（π／２−Φ））＝（ｃｏｓ（Φ）ｃｏｓ（ρ），ｃｏｓ（Φ）ｓｉｎ（ρ），ｓｉｎ（Φ））と表している。 Here, the physical meaning of the uv transformation and the uv coordinate value in the present invention will be described. When the three-dimensional coordinate values (X, Y, Z) of the outside world are projected onto a hemisphere having a radius 1 centered on the viewpoint Ti, (X ′, Y ′, Z ′) = (cos (Φ) cos (ρ), cos (Φ) sin (ρ), sin (Φ)). However, in the spherical coordinate system, the angle from the Z-axis in FIG. 10 is expressed as (sin (Φ) cos (ρ), sin (Φ) sin (ρ), cos (Φ)) with Φ, but in this embodiment, Assuming that the elevation angle when looking up the three-dimensional coordinate value (X, Y, Z) from the viewpoint in FIG. 10 is Φ, (sin (π / 2−Φ) cos (ρ), sin (π / 2−Φ) sin (Ρ), cos (π / 2−Φ)) = (cos (Φ) cos (ρ), cos (Φ) sin (ρ), sin (Φ)).

ｕｖ座標値はこの単位球面座標値（Ｘ'，Ｙ'，Ｚ'）から、（ｕ,ｖ）＝（Ｘ’／Ｚ’，Ｙ’／Ｚ’）で得られる。また、図１４に、魚眼カメラが撮像した画像の例を挙げる。これを図４に従ってｕｖ座標値に変換して、図５に示したｕｖ画像を生成した例を図１５に示す。図１５に示すように、本座標変換では、仰角Φが小さい特徴点（建物の底辺に近い点）ほどｖ軸から遠ざかる点に、測定誤差が強調されて変換される。すなわち、同じ操作により特徴点を測定した場合、ｘｙ画像平面上では分かりづらい誤差などがｕｖ平面ではより正確に把握できるようになる。したがって、この強調された誤差が最小となるようにカメラ視点の運動および３次元座標値を求めれば、やはり高精度に計測ができる。さらに、ｕｖ平面において、ロール回転運動はｕ方向の座標移動、ピッチ回転運動はｖ方向の座標移動、Ｚ軸並進運動はｕｖ座標系におけるスケール変換に簡略的に換算できるため、ｘｙ平面よりもロール回転、ピッチ回転、Ｚ軸並進運動の判別が比較的容易となる。この判別の容易性を利用して、本発明では反復的に変形係数を使ってｕｖ座標値を変形させて、繰り返しカメラ運動と３次元座標値の復元を続けることで、より高精度なカメラ運動と３次元座標値を復元することが可能となっている。 The uv coordinate value is obtained from this unit spherical coordinate value (X ′, Y ′, Z ′) by (u, v) = (X ′ / Z ′, Y ′ / Z ′). FIG. 14 shows an example of an image captured by a fisheye camera. FIG. 15 shows an example in which this is converted into uv coordinate values according to FIG. 4 and the uv image shown in FIG. 5 is generated. As shown in FIG. 15, in this coordinate conversion, the measurement error is emphasized and converted to a point that moves away from the v-axis as the feature point (point closer to the bottom of the building) with a smaller elevation angle Φ. That is, when feature points are measured by the same operation, errors that are difficult to understand on the xy image plane can be grasped more accurately on the uv plane. Therefore, if the motion of the camera viewpoint and the three-dimensional coordinate value are obtained so that the emphasized error is minimized, the measurement can be performed with high accuracy. Further, in the uv plane, the roll rotation movement can be converted into the u-direction coordinate movement, the pitch rotation movement can be converted into the v-direction coordinate movement, and the Z-axis translation movement can be simply converted into the scale conversion in the uv coordinate system. Determination of rotation, pitch rotation, and Z-axis translational movement is relatively easy. By utilizing this ease of discrimination, the present invention repeatedly deforms the uv coordinate value using the deformation coefficient, and continues to reconstruct the camera motion and the three-dimensional coordinate value repeatedly, so that the camera motion with higher accuracy can be obtained. It is possible to restore the three-dimensional coordinate values.

このように、全方位画像において雑音の影響などにより正確に運動と３次元形状を復元できなかったという問題に対して、全方位画像をｕｖ座標値で表される画像に変換することで、画像雑音の影響をより鮮明に浮き立たせる効果があり、雑音に埋もれることなく、この変換を行うことで、微小なカメラの回転運動を含むカメラ運動、並びに、３次元情報の復元がし易くなるように好適に作用する。 As described above, the omnidirectional image is converted into an image represented by the uv coordinate value in response to the problem that the motion and the three-dimensional shape cannot be accurately restored due to the influence of noise in the omnidirectional image. It has the effect of raising the effect of noise more clearly, and by performing this conversion without being buried in the noise, it is easy to restore the camera movement including the rotational movement of the minute camera and the three-dimensional information. It works suitably.

上記のｕｖ変換を全フレーム、全特徴点に対して行う。変換後のデータ形式として、式（５ａ）に示す要素形式の行列［Ｂ］を、計測行列入力部において行列［Ｂ］の初期値として保持しておく。行列［Ｂ］の行列形式（行列要素の配置）は、文献［３］、［４］に用いられる計測行列と呼ばれる行列である。 The above uv conversion is performed on all frames and all feature points. As a data format after conversion, a matrix [B] in an element format shown in Expression (5a) is held as an initial value of the matrix [B] in the measurement matrix input unit. The matrix format (arrangement of matrix elements) of the matrix [B] is a matrix called a measurement matrix used in documents [3] and [4].

次に、因子分解法処理部５における処理概要を説明する。この処理部は３つの処理部から構成（平面運動・３次元情報復元部５Ａ、Ｚ軸運動復元部５Ｂ、回転運動復元部５Ｃ）されている。図２はこの処理部における処理フローとなっている。まず、係数ε_ij、δ_ijを式（６）に従って初期化し、並びに、（ζ_i，η_i）＝（０，０），ｉ＝１，２，…，Ｆと初期化する。次に、初期値の行列［Ｂ］から、式（５ｂ）に示す行列分解データを生成しておき、これを改めて行列［Ｂ］として保持しておく（計測行列入力部４に保持してある初期値の行列［Ｂ］は上書きせず、図２の処理が終了するまで保持しておく）。これが図２での行列分解データである。 Next, an outline of processing in the factorization method processing unit 5 will be described. This processing unit includes three processing units (planar motion / three-dimensional information restoring unit 5A, Z-axis motion restoring unit 5B, and rotational motion restoring unit 5C). FIG. 2 shows a processing flow in this processing unit. First, the coefficients ε _ij and δ _ij are initialized according to the equation (6), and are initialized as (ζ _i , η _i ) = (0, 0), i = 1, 2,. Next, matrix decomposition data shown in Expression (5b) is generated from the matrix [B] of the initial value, and this is again held as the matrix [B] (held in the measurement matrix input unit 4). The initial value matrix [B] is not overwritten and held until the processing in FIG. 2 is completed). This is the matrix decomposition data in FIG.

平面運動・３次元情報復元ステップでは、特異値分解では式（７）に示す３つの行列［Ｕ］、［Ｗ］、［Ｖ］に行列分解する。ここで、［Ｕ］は２Ｆ×Ｐサイズの行列、［Ｗ］はＰ×Ｐサイズの対角行列、［Ｖ］はＰ×Ｐサイズの行列である。 In the plane motion / three-dimensional information restoration step, singular value decomposition performs matrix decomposition into three matrices [U], [W], and [V] shown in Expression (7). Here, [U] is a 2F × P size matrix, [W] is a P × P size diagonal matrix, and [V] is a P × P size matrix.

さらに、図２の雑音除去において、式（８）の第二項に示すように、ランク４以上の各行列の成分を削除する。 Further, in the noise removal of FIG. 2, as shown in the second term of Expression (8), the components of each matrix of rank 4 or higher are deleted.

この削除のときは、行列［Ｕ］を取り出し、この行列の要素において第４から第Ｐ列目までを削除し、残りの成分からなる行列を保持し、行列［Ｗ］を取り出し、この行列の要素において第４から第Ｐ行目並びに第４から第Ｐ列目までを削除し、残りの成分からなる行列を保持し、行列［Ｖ］を取り出し、この行列の要素において第４から第Ｐ行目までを削除し、残りの成分からなる行列をそれぞれ保持する（式（９））。 At the time of this deletion, the matrix [U] is taken out, the fourth to Pth columns are deleted from the elements of this matrix, the matrix consisting of the remaining components is held, the matrix [W] is taken out, The fourth to Pth rows and the fourth to Pth columns are deleted in the element, the matrix composed of the remaining components is retained, the matrix [V] is extracted, and the fourth to Pth rows are extracted from the elements of this matrix. Delete up to the eye and hold the matrix composed of the remaining components (Equation (9)).

次に、第４から第Ｐ行目並びに第４から第Ｐ列目までを削除した行列［Ｗ］の対角要素の平方をとった行列から、式（１０）（１１）に示す行列［Ｕ'］と行列［Ｖ'］を得る。 Next, from the matrix obtained by taking the square of the diagonal elements of the matrix [W] from which the 4th to Pth rows and the 4th to Pth columns are deleted, the matrix [U shown in Equations (10) and (11) is used. '] And matrix [V'] are obtained.

図２の次処理の変換行列算出では、保持してある行列［Ｕ'］を取り出し、式（１２ａ）〜（１２ｃ）で得られる値を行列要素にもつ式（１３）で示す行列［Ｄ］を準備し、この行列［Ｄ］と式（１４）に示す計算を行い値ａ，ｂ，ｃ，ｄ，ｅ，ｆを求める。なお、式（１４）の右辺の最後の行列は１の値を上から２Ｆ個、続けて０の値をＦ個並べた３Ｆ×１の列ベクトルである。 In calculation of the transformation matrix in the next process of FIG. 2, the held matrix [U ′] is taken out, and the matrix [D] represented by the equation (13) having values obtained by the equations (12a) to (12c) as matrix elements. And the calculation shown in the matrix [D] and the equation (14) is performed to obtain the values a, b, c, d, e, and f. Note that the last matrix on the right side of Expression (14) is a 3F × 1 column vector in which 2 values from the top are 1F, and F values from 0 are subsequently arranged.

上記の値ａ，ｂ，ｃ，ｄ，ｅ，ｆを式（１５）に示す要素に入れた行列［Ｃ］を用意し、この行列［Ｃ］を式（１６）に示すように固有値分解する。ここで、固有値行列の平方と固有値行列から、式（１７）の行列［Ｃ'］を生成し、この行列要素を成分にもつ行列［Ｑ］を式（１８）に従って算出する。 A matrix [C] in which the above values a, b, c, d, e, and f are put in the elements shown in Expression (15) is prepared, and this matrix [C] is subjected to eigenvalue decomposition as shown in Expression (16). . Here, a matrix [C ′] of Expression (17) is generated from the square of the eigenvalue matrix and the eigenvalue matrix, and a matrix [Q] having this matrix element as a component is calculated according to Expression (18).

次に、式（Ａ５）の投影モデル式に基づいて、式（Ａ５）右辺左側の行列を式（１９）で得られる行列［Ｍ'］に，式（Ａ５）右辺右側の行列を式（２２）で得られる行列［Ｓ'］とする。この対応付けにより、式（１９）の行列成分から，式（２０）の計算によりヨー回転角θを、式（２１）からＸＹ並進運動を復元することができる。 Next, based on the projection model formula of Formula (A5), the matrix on the right side of Formula (A5) is changed to the matrix [M ′] obtained by Formula (19), and the matrix on the right side of Formula (A5) is Formula (22) ) To obtain the matrix [S ′] obtained. By this association, the yaw rotation angle θ can be restored from the matrix component of Equation (19) by the calculation of Equation (20), and the XY translational motion can be restored from Equation (21).

一方，式（Ａ５）右辺右側の行列と行列［Ｓ'］の対応付けにより、式（２２）にしたがってユークリッド空間での３次元情報（Ｘ，Ｙ，Ｚ）を復元することができる。すなわち、求めた行列［Ｑ］と保持しておいた行列［Ｕ'］から、式（１９）の行列演算により行列［Ｍ'］を計算する．行列［Ｍ'］から各フレーム（第ｉフレーム）の行列要素（ｍ_ix,ｎ_ix）または（ｍ_iy,ｎ_iy）を取り出し，式（２０）を使って、ヨー回転θ_i，ｉ＝１，２，…，Ｆを復元する。また、行列［Ｍ'］から各フレーム（第ｉフレーム）の行列要素（Ｔ_iu,Ｔ_iv）を取り出す。この（Ｔ_iu，Ｔ_iv）から、式（２１）を使って第ｉフレームにおけるユークリッド空間でのＸＹ並進運動（Ｔｘ_i,Ｔｙ_i），ｉ＝１，２，…，Ｆを計算する。 On the other hand, the three-dimensional information (X, Y, Z) in the Euclidean space can be restored according to the equation (22) by associating the matrix on the right side of the equation (A5) with the matrix [S ′]. That is, the matrix [M ′] is calculated from the obtained matrix [Q] and the stored matrix [U ′] by the matrix operation of Expression (19). The matrix element (m _ix , n _ix ) or (m _iy , n _iy ) of each frame (i-th frame) is extracted from the matrix [M ′], and the yaw rotation θ _i , i = 1 is obtained using equation (20). , 2, ..., F are restored. Further, the matrix element (T _iu , T _iv ) of each frame (i-th frame) is extracted from the matrix [M ′]. From this (T _iu , T _iv ), XY translational motion (Tx _i , Ty _i ), i = 1, 2,..., F in the Euclidean space in the i-th frame is calculated using equation (21).

一方、３次元情報復元では，先に保持しておいた行列［Ｖ'］と、変換行列算出で得られた行列［Ｑ］から、式（２２）に示す行列演算を行い，行列［Ｓ'］を求める。次に、行列［Ｓ'］の要素に対して、式（２３）に示す計算を行い，これを要素とする行列を［Ｐ］とする。行列を［Ｐ］の列ベクトルは、それぞれ第ｊ番目の特徴点のユークリッド空間での３次元座標直（Ｘ_j，Ｙ_j，Ｚ_j）になっている。 On the other hand, in the three-dimensional information restoration, the matrix operation shown in Expression (22) is performed from the matrix [V ′] held in advance and the matrix [Q] obtained by the transformation matrix calculation, and the matrix [S ′ ]. Next, the calculation shown in Expression (23) is performed on the elements of the matrix [S ′], and a matrix having these elements as [P]. The column vector of the matrix [P] is a three-dimensional coordinate straight (X _j , Y _j , Z _j ) in the Euclidean space of the jth feature point.

ここまでの処理では、式（Ａ５）の投影モデルを使って平面運動と３次元情報を復元しているため、式（Ａ４）の投影モデルで処理する場合と比べて、Ｚ軸並進運動成分（Ｔｚ_i）が式（９）の処理の段階で雑音成分と見なされ削除されている、つまり、ロール回転、ピッチ回転、Ｚ軸並進運動は“無いと見なして”、平面運動と３次元情報だけを復元している。 In the processing so far, since the plane motion and the three-dimensional information are restored using the projection model of the formula (A5), compared with the case of processing with the projection model of the formula (A4), the Z-axis translational motion component ( Tz _i ) is regarded as a noise component at the stage of the processing of equation (9) and deleted, that is, roll rotation, pitch rotation, and Z-axis translation are “not considered to exist”, only plane motion and three-dimensional information. Is restoring.

次のＺ軸運動復元ステップでは、ε_ij，ｉ＝１，２，…，Ｆ；ｊ＝１，２，…，Ｐ並びに、（ζ_i，η_i）＝（０，０），ｉ＝１，２，…，Ｆとして、ロール回転とピッチ回転を無視して、式（Ａ４）の投影モデルを使って、まだ復元しないＺ軸並進運動（Ｔｚ_i）を復元する。そこで、式（９）で削除したランク４以降の成分、すなわち、 In the next Z-axis motion restoration step, ε _ij , i = 1, 2,..., F; j = 1, 2,..., P and (ζ _i , η _i ) = (0, 0), i = 1 , 2,..., F, ignoring roll rotation and pitch rotation, and using the projection model of equation (A4), restore the Z-axis translation (Tz _i ) that has not yet been restored. Therefore, the components after rank 4 deleted in equation (9), that is,

を含めて，このステップでは、式（Ａ４）の投影モデルに戻ってＺ軸並進運動を復元する。これが実現できる原理について，以下に数式を交えて説明する。 In this step, the Z axis translational motion is restored by returning to the projection model of the formula (A4). The principle that this can be achieved will be described below with mathematical formulas.

式（Ａ５）だけを使って（Ｚ軸並進運動を無視して）平面運動と３次元情報を復元すると、式（２４）、（２５）に示す成分の誤差が発生する。この誤差の中から式（Ａ４）でのＺ軸の並進成分を抽出する。 If the plane motion and the three-dimensional information are restored using only the equation (A5) (ignoring the Z-axis translational motion), errors of the components shown in the equations (24) and (25) occur. From this error, the translation component of the Z axis in the formula (A4) is extracted.

式（Ａ４）において、ｉフレームのｊ番目の特徴点に関する投影モデルだけを抜き出すと、 In formula (A4), if only the projection model related to the jth feature point of the i frame is extracted,

と表される。つまり、Ｚ軸並進運動Ｔｚ_i，ｉ＝１，２，…，Ｆに関する成分は式（Ａ７）の右辺の誤差として現れる。これを、さらに以下の式に変形することができる。 It is expressed. That is, the component relating to the Z-axis translational motion Tz _i , i = 1, 2,..., F appears as an error on the right side of the equation (A7). This can be further transformed into the following equation.

式（Ａ８）をｉ＝１，２，…，Ｆ；ｊ＝１，２，…，Ｐに対して行列展開すると， When formula (A8) is expanded into a matrix for i = 1, 2,..., F;

となる。したがって、式（Ａ５）だけを使って平面運動と３次元情報を復元したとき、式（２４）に示す誤差を使って、式（Ａ９）に示す連立方程式を、式（２６）により計算し、Ｚ軸並進運動を復元することができる。 It becomes. Therefore, when the plane motion and the three-dimensional information are restored using only the formula (A5), the simultaneous equation shown in the formula (A9) is calculated by the formula (26) using the error shown in the formula (24). Z-axis translational motion can be restored.

したがって、Ｚ軸運動復元ステップの処理フローでは、式（２４）（２５）に示す誤差を、全フレーム、並びに、全特徴点について計算する（ｕｖ座標系での再投影誤差）。これらの再投影誤差と先に求めた各特徴点のＺ座標値（Ｚ_j,ｊ＝１，２，…，Ｐ）を使って、式（２６）の計算を行い，各フレーム（第ｉフレーム）でのＺ軸の並進運動Ｔｚ_i，ｉ＝１，２，…，Ｆを復元することができる。 Therefore, in the processing flow of the Z-axis motion restoration step, the errors shown in equations (24) and (25) are calculated for all frames and all feature points (reprojection error in the uv coordinate system). Using these reprojection errors and the previously obtained Z coordinate values (Z _j , j = 1, 2,..., P) of each feature point, the calculation of Expression (26) is performed, and each frame (i-th frame) is calculated. ), The Z axis translational motion Tz _i , i = 1, 2,..., F can be restored.

Ｚ軸運動復元ステップの段階で、まだロール回転とピッチ回転を復元していない。つまり、式（Ａ３）の投影モデルに基づいて運動と３次元情報を復元しておらず、式（Ａ４）の投影モデルに従って運動と３次元情報を復元した。 At the stage of the Z-axis motion restoration step, roll rotation and pitch rotation have not yet been restored. That is, the motion and the three-dimensional information are not restored based on the projection model of the formula (A3), and the motion and the three-dimensional information are restored according to the projection model of the formula (A4).

次の回転運動復元ステップでは、式（Ａ３）において、これまでに復元したカメラ運動、３次元情報からなる行列成分と、まだ復元していないロール回転とピッチ回転に付随する行列成分に分離し、ロール回転とピッチ回転に関連する成分だけを抽出し、この成分からロール回転とピッチ回転を復元する。これを実現する原理を数式を交えて以下に説明する。 In the next rotational motion restoration step, in the formula (A3), the camera motion restored so far, the matrix component consisting of three-dimensional information, and the matrix component associated with the roll rotation and pitch rotation that have not been restored yet are separated. Only components related to roll rotation and pitch rotation are extracted, and roll rotation and pitch rotation are restored from these components. The principle for realizing this will be described below with mathematical expressions.

説明の簡単のために、式（Ａ３）からｉフレームのｊ番目の特徴点に関する投影モデルだけを取り出すと、 For simplicity of explanation, if only the projection model related to the jth feature point of the i frame is extracted from the equation (A3),

となる。ここで、未だ復元していないロール回転，ピッチ回転を未知数とする方程式に変形すると、 It becomes. Here, when transforming into an equation with unknown roll rotation and pitch rotation that have not yet been restored,

が得られる。ここで、 Is obtained. here,

と置き、全特徴点（ｊ＝１，２，…，Ｐ）に展開すると、 And expand to all feature points (j = 1, 2,..., P),

となる。式（Ａ１３）に示す誤差ΔＵ_ij、ΔＶ_ijは式（Ａ４）の投影モデルで復元したときに発生する誤差であり、この誤差を式（Ａ１３）の計算で抽出し全特徴点に対して連立した方程式（式（Ａ１４））を式（２７）〜（３２）に従って解くことで、ロール回転角、ピッチ回転を得ることができる。式（３２）の処理は各フレームに対して行い、全時系列のロール回転角ω_i（ｉ＝１，２，…，Ｆ）、ピッチ回転ψ_i（ｉ＝１，２，…，Ｆ）を復元するできる。 It becomes. The errors ΔU _ij and ΔV _ij shown in the equation (A13) are errors generated when the projection model of the equation (A4) is restored, and these errors are extracted by the calculation of the equation (A13) and are simultaneous for all feature points. The roll rotation angle and the pitch rotation can be obtained by solving the equation (formula (A14)) according to formulas (27) to (32). The processing of Expression (32) is performed for each frame, and the roll rotation angles ω _i (i = 1, 2,..., F) and pitch rotations ψ _i (i = 1, 2,..., F) in all time series. Can be restored.

したがって、回転運動復元ステップでは、式（５ａ）に示すデータ形式の行列分解データと、係数δ_ij、並びに、上記で復元した平面運動と３次元情報から、式（２７）に示す誤差を計算する。さらに、式（２８）〜（３１）の値を計算し、これらの各値と式（２７）に示す誤差から、式（３２）に示す計算式によりロール回転（ω_i）、ピッチ回転（ψ_i）を復元する。これを全フレーム（ｊ＝１，２，…，Ｆ）にわたり求める。このとき、同時に、式（３３）の計算により各フレームでの（ζ_i、η_i）、ｉ＝１，２，…，Ｆを求め、以前の（ζ_i、η_i）、ｉ＝１，２，…，Ｆを更新する。 Therefore, in the rotational motion restoration step, the error shown in Equation (27) is calculated from the matrix decomposition data in the data format shown in Equation (5a), the coefficient δ _ij , the planar motion restored as described above, and the three-dimensional information. . Further, the values of the equations (28) to (31) are calculated, and from these values and the error shown in the equation (27), the roll rotation (ω _i ) and pitch rotation (ψ _i ) restore. This is obtained over all frames (j = 1, 2,..., F). At this time, at the same time, (ζ _i , η _i ), i = 1, 2,..., F are obtained for each frame by the calculation of equation (33), and the previous (ζ _i , η _i ), i = 1, 2, ..., F are updated.

以上の処理フローにしたがって、ロール回転、ピッチ回転、ヨー回転、ＸＹＺ並進、並びに、３次元情報を復元できるが、式（Ａ３）の投影モデルに対して段階的な処理（平面運動・３次元情報復元ステップ、Ｚ軸運動復元ステップ、回転運動復元ステップ）を行ってこれらの情報を復元することができる。すなわち、式（Ａ５）に基づく平面運動と３次元情報の復元、式（Ａ４）に基づいたＺ軸並進の復元、式（Ａ３）に基づくロール回転、ピッチ回転の復元を順番に実行している。 According to the above processing flow, roll rotation, pitch rotation, yaw rotation, XYZ translation, and 3D information can be restored. However, stepwise processing (planar motion / 3D information) is performed on the projection model of equation (A3). These information can be restored by performing a restoration step, a Z-axis motion restoration step, and a rotational motion restoration step. That is, plane motion and three-dimensional information restoration based on equation (A5), Z-axis translation restoration based on equation (A4), roll rotation based on equation (A3), and pitch rotation restoration are sequentially performed. .

ここで、画像から観測したｕｖ座標値、これまでの処理で復元したカメラ運動と３次元情報、並びに、この段階での変形係数（ε_ij，δ_ij）を用いて、モデルの適合度を表す式（３４）を計算して式（３５）に示す誤差を求めると誤差は大きい。なぜなら、初期段階（これまでの段階）では、式（Ａ３）において、ε_ij＝０、並びに、（ζ_i，η_i）＝（０，０）；ｉ＝１，２，…，Ｆとして、これまでの処理が実行されてきたからである。つまり、平面運動・３次元情報復元部では、ロール回転、ピッチ回転、ヨー回転、ＸＹＺ並進運動からなるカメラ運動により投影された（実際の全方位画像における）ｕｖ座標値に対して、近似的に平面運動のカメラ運動により投影されたｕｖ座標値と見なして、平面運動と３次元情報を復元している。 Here, the uv coordinate value observed from the image, the camera motion and the three-dimensional information restored by the processing so far, and the deformation coefficients (ε _ij , δ _ij ) at this stage are used to express the model adaptability. When the equation (34) is calculated to obtain the error shown in the equation (35), the error is large. Because, in the initial stage (the previous stage), in equation (A3), ε _ij = 0 and (ζ _i , η _i ) = (0, 0); i = 1, 2,. This is because the processing so far has been executed. That is, the plane motion / three-dimensional information restoration unit approximately approximates the uv coordinate value (in the actual omnidirectional image) projected by the camera motion including roll rotation, pitch rotation, yaw rotation, and XYZ translational motion. The plane motion and the three-dimensional information are restored by regarding the uv coordinate value projected by the camera motion of the plane motion.

そこで，回転運動復元ステップに続く処理フローでは、これまでの処理において、ε_ij＝０、並びに、（ζ_i，η_i）＝（０，０）；ｉ＝１，２，…，Ｆと置いた値を、これまでの処理で得た情報を使って更新して，これまでの同様の処理を繰り返す。つまり、ε_ijを、これまでに復元したロール回転角ω_i、ピッチ回転ψ_i，３次元情報（Ｘ_j，Ｙ_j，Ｚ_j）、並びにＸＹ並進運動（Ｔｘ_i，Ｔｙ_i）を式（３６）の最初の式に代入して更新する。同様に、Ｚ軸並進運動Ｔｚ_iと３次元情報の高さＺ_jを式（３６）の二番目の式に代入してδ_ijを更新する。さらに、更新した変形係数（ε_ij，δ_ij）を使って、式（５ｂ）に従って、行列分解データとなる新たな行列［Ｂ］を準備する。これ以降は、これまでと同様に、新たに得た計測行列［Ｂ］から、平面運動・３次元情報復元ステップにおいてヨー回転、ＸＹ並進、並びに、３次元情報を復元し、次に、Ｚ軸運動復元ステップにおいてＺ軸並進運動を復元し、さらに、回転運動復元ステップにおいて、ロール回転、ピッチ回転を復元する。 Therefore, in the processing flow following the rotational motion restoration step, ε _ij = 0 and (ζ _i , η _i ) = (0, 0); i = 1, 2,. The same value is updated using the information obtained in the previous processing, and the same processing is repeated. That is, ε _ij is _expressed by the roll rotation angle ω _i , pitch rotation ψ _i , three-dimensional information (X _j , Y _j , Z _j ), and XY translational motion (Tx _i , Ty _i ) restored so far. Substitute and update in the first expression of 36). Similarly, δ _ij is updated by substituting the Z-axis translational movement Tz _i and the height Z _j of the three-dimensional information into the second expression of Expression (36). Further, a new matrix [B] to be matrix decomposition data is prepared according to the equation (5b) using the updated deformation coefficient (ε _ij , δ _ij ). Thereafter, as in the past, from the newly obtained measurement matrix [B], yaw rotation, XY translation, and 3D information are restored in the plane motion / 3D information restoration step, and then the Z axis In the motion restoring step, the Z-axis translational motion is restored, and in the rotational motion restoring step, roll rotation and pitch rotation are restored.

すべてのカメラ運動と３次元情報が復元できると、この段階での式（３４）、式（３５）で得られる誤差を求めて、この誤差が１つ前の段階での誤差と比較して変化がない（誤差が一定）、または、許容値以下と判定したとき、処理を停止し、この段階でのカメラ運動と３次元情報を出力する。そうでないときいは、これまでの処理で得たカメラ運動と３次元情報から変形係数（ε_ij，δ_ij）を更新し、平面運動・３次元情報復元ステップ、Ｚ軸運動復元ステップ、回転運動復元ステップでの処理を繰り返す。このような反復処理を行うことで、式（Ａ３）に示す投影モデルを平面運動・３次元情報復元ステップ、Ｚ軸運動復元ステップ、回転運動復元ステップと段階的に処理を行っても、誤差が均等に配分されて、カメラ運動と３次元情報の精度に偏りがなく、雑音にロバストに復元することができる。言い換えれば、魚眼カメラの投影モデル式（Ａ３）に対して本発明は、雑音にロバストに、かつ、高精度にカメラ運動と３次元情報を同時に復元することができる。 When all camera movements and 3D information can be restored, the error obtained in Equation (34) and Equation (35) at this stage is obtained, and this error changes compared to the error in the previous stage. When it is determined that there is no error (the error is constant) or less than the allowable value, the processing is stopped, and the camera motion and three-dimensional information at this stage are output. Otherwise, the deformation coefficient (ε _ij , δ _ij ) is updated from the camera motion and 3D information obtained in the previous processing, and the plane motion / 3D information restoration step, Z-axis motion restoration step, rotational motion Repeat the process in the restore step. By performing such an iterative process, even if the projection model shown in the formula (A3) is processed stepwise in the plane motion / three-dimensional information restoration step, the Z-axis motion restoration step, and the rotational motion restoration step, an error is generated. Evenly distributed, there is no bias in the accuracy of camera motion and 3D information, and noise can be restored robustly. In other words, with respect to the projection model formula (A3) of the fisheye camera, the present invention can simultaneously restore camera motion and three-dimensional information with high accuracy and robustness against noise.

以上、本発明の実施形態により、画像系列における特徴点の時間的動きから、カメラ視点の運動、すなわち、三軸周りの回転運動と三軸方向の並進運動、並びに、物体形状を構成する３次元情報を復元することができる。 As described above, according to the embodiment of the present invention, from the temporal movement of the feature points in the image series, the movement of the camera viewpoint, that is, the rotational movement around the three axes and the translational movement in the three axial directions, and the three-dimensional that constitutes the object shape Information can be restored.

（実施形態２）
図３は請求項２等に関する基本構成図である。本実施形態では、実施形態１に対して、時系列画像データベースから取り出した画像に対して、画像変換処理部６で変形画像（ｕｖ画像）を生成する点が異なり、この処理部について説明する。 (Embodiment 2)
FIG. 3 is a basic configuration diagram relating to claim 2 and the like. This embodiment is different from the first embodiment in that a deformed image (uv image) is generated by the image conversion processing unit 6 with respect to an image extracted from the time-series image database, and this processing unit will be described.

本実施形態は、全方位画像を蓄積した時系列データベース１、各画像をｕｖ画像に変換する画像変換処理部６、各画像における特徴点の座標値を測定する特徴点計測部２、ｕｖ座標値を行列要素とする行列を初期値として与える計測行列入力部４、これらの入力データから因子分解法によりカメラの運動と外界の３次元情報を復元する因子分解法処理部５から構成される。 The present embodiment includes a time-series database 1 in which omnidirectional images are stored, an image conversion processing unit 6 that converts each image into a uv image, a feature point measurement unit 2 that measures coordinate values of feature points in each image, and uv coordinate values. A measurement matrix input unit 4 that gives a matrix having a matrix element as an initial value, and a factorization processing unit 5 that restores the three-dimensional information of the camera motion and the external world from these input data by a factorization method.

この構成において、時系列画像データベース１には、ハードディスク、ＲＡＩＤ装置、ＣＤ−ＲＯＭなどの記録媒体を利用する、または、ネットワークを介したリモートなデータ資源を利用する形態でもどちらでも構わない。さらに、また、図３ａはリアルタイムで処理する場合の処理構成図であり、本発明は必ずしも各データベース部などの記憶装置を必要としない。 In this configuration, the time-series image database 1 may be in the form of using a recording medium such as a hard disk, a RAID device, a CD-ROM, or a remote data resource via a network. Furthermore, FIG. 3A is a processing configuration diagram in the case of processing in real time, and the present invention does not necessarily require a storage device such as each database unit.

本実施形態では、時系列画像データベースから取り出した画像に対して、図３の画像変換処理部においてｕｖ画像を生成する。ｕｖ画像を生成するのに、ｕｖ座標値を指定して、これに該当するｘｙ座標値を算出し、その座標値の画素値を抽出してｕｖ画素とする（図５参照）。 In the present embodiment, an uv image is generated in the image conversion processing unit in FIG. 3 for an image extracted from the time-series image database. In order to generate a uv image, an uv coordinate value is designated, an xy coordinate value corresponding to this is calculated, and a pixel value of the coordinate value is extracted to be a uv pixel (see FIG. 5).

この処理フローを図４に示す。まず、ｕｖ画像におけるｕ方向、ｖ方向の単位画素あたりの重み付けを行う。ｕｖ画像の画像サイズをＵ×Ｖ画素とし、並びに、仰角φについてφ_min≦φ≦９０度の範囲と決めたとする。これにより、ｕ方向、ｖ方向の単位画素あたりの重み付けを式（３７）に従い決定する。また、ｕｖ画像のｕｖ座標値範囲は画像の中心を原点（０，０）として、ｕ方向に−Ｕ／２から＋Ｕ／２，ｖ方向に−Ｖ／２から＋Ｖ／２とする。ｕ方向、ｖ方向の各座標値を表す整数ｍ，ｎ（−Ｍ≦ｍ≦Ｍ、−Ｎ≦ｎ≦Ｎ，Ｍ：整数、Ｎ：整数、ＭとＮは同じ、すなわちｕｖ画像が正方形でもよい）を使って、ｕｖ座標値を式（３７）に従って求める。整数ｍ，ｎを−Ｍ≦ｍ≦Ｍ、−Ｎ≦ｎ≦Ｎの範囲で変化させながら、式（３８）に従ってその（ｕ，ｖ）座標値に対応する位相角ρと仰角φを求める。次に、式（３９）により画像座標値を算出する。この画像座標値を四捨五入で整数化したものを（ｘ，ｙ）とする（式４０）。 This processing flow is shown in FIG. First, weighting per unit pixel in the u direction and the v direction in the uv image is performed. Assume that the image size of the uv image is U × V pixels, and the elevation angle φ is determined to be in a range of φ _min ≦ φ ≦ 90 degrees. Thereby, the weighting per unit pixel in the u direction and the v direction is determined according to the equation (37). Further, the uv coordinate value range of the uv image is set from the center of the image to the origin (0, 0), from −U / 2 to + U / 2 in the u direction, and from −V / 2 to + V / 2 in the v direction. Integer m, n representing each coordinate value in u direction and v direction (-M ≦ m ≦ M, −N ≦ n ≦ N, M: integer, N: integer, M and N are the same, that is, even if the uv image is square. The uv coordinate value is obtained according to the equation (37). While changing the integers m and n in the range of −M ≦ m ≦ M and −N ≦ n ≦ N, the phase angle ρ and the elevation angle φ corresponding to the (u, v) coordinate value are obtained according to the equation (38). Next, an image coordinate value is calculated by Expression (39). A value obtained by rounding the image coordinate value to an integer is (x, y) (Formula 40).

次に、画像座標値（ｘ，ｙ）に対応する画素値を取り出し、ｕｖ画像上でのｕｖ座標値に該当する座標値にその画素値を埋め込む。これを全ｕｖ座標値に対して行うことで、ｕｖ画像を生成する。さらに、対象とする全時系列画像を逐次、この変換を行って時系列のｕｖ画像を生成する。これ以降は実施形態１と同様の処理手順により、画像系列における特徴点の時間的動きから、カメラ視点の運動、すなわち、三軸周りの回転運動と三軸方向の並進運動、並びに、物体形状を構成する３次元情報を復元することができる。 Next, a pixel value corresponding to the image coordinate value (x, y) is extracted, and the pixel value is embedded in a coordinate value corresponding to the uv coordinate value on the uv image. By performing this operation for all uv coordinate values, a uv image is generated. Further, all the time series images to be processed are sequentially converted to generate a time series uv image. From this point onward, according to the same processing procedure as in the first embodiment, from the temporal movement of the feature points in the image sequence, the movement of the camera viewpoint, that is, the rotational movement around the three axes and the translational movement in the three axial directions, and the object shape are obtained. The three-dimensional information to be configured can be restored.

（実施形態３）
図６は請求項３の発明に関する基本構成図である。本実施形態では、実施形態１に対して、時系列画像データベースから取り出した画像に対して、特徴点計測部２により得た特徴点の画像座標値を計測行列変換部３においてｕｖ座標値に変換したとき、復元処理判定部７でカメラ運動と３次元情報を復元するための処理を判定する点が異なるため、この点だけについて説明する。 (Embodiment 3)
FIG. 6 is a basic configuration diagram relating to the third aspect of the present invention. In the present embodiment, in contrast to the first embodiment, the image coordinate value of the feature point obtained by the feature point measurement unit 2 is converted into the uv coordinate value in the measurement matrix conversion unit 3 with respect to the image extracted from the time series image database. In this case, since the restoration process determination unit 7 determines the process for restoring the camera motion and the three-dimensional information, only this point will be described.

本実施形態は、時系列画像データベース１には、ハードディスク、ＲＡＩＤ装置、ＣＤ−ＲＯＭなどの記録媒体を利用する、または、ネットワークを介したリモートなデータ資源を利用する形態でもどちらでも構わない。さらに、また、図６ａはリアルタイムで処理する場合の処理構成図であり、本発明は必ずしも各データベース部などの記憶装置を必要としない。 In this embodiment, the time-series image database 1 may use either a hard disk, a RAID device, a CD-ROM or other recording medium, or a remote data resource via a network. Furthermore, FIG. 6A is a processing configuration diagram in the case of processing in real time, and the present invention does not necessarily require a storage device such as each database unit.

図６の特徴点計測部２により得た特徴点の画像座標値から、図６の計測行列変換部３においてｕｖ座標値に変換して計測行列［Ｂ］を保持する。この計測行列［Ｂ］は式（５ａ）の形式となっている。 The image coordinate value of the feature point obtained by the feature point measurement unit 2 in FIG. 6 is converted into the uv coordinate value in the measurement matrix conversion unit 3 in FIG. 6 to hold the measurement matrix [B]. This measurement matrix [B] is in the form of equation (5a).

次に、図６の復元処理判定部７での処理を行う。この処理部での処理フローは図７である。この行列［Ｂ］に対して、式（８）に示すように特異値分解を行い、特異値行列［Ｗ］を得る。特異値行列［Ｗ］は対角要素であり、その各要素である特異値は昇降順の並びで、かつ、全て正の実数となっている。この特異値の中からの行列要素Ｗ₃₃と行列要素Ｗ₄₄の特異値を取り出す。ランク検出では、式（４１ａ）または式（４１ｂ）に示す計算を行い判定量Ｅｗを得る。 Next, processing in the restoration processing determination unit 7 in FIG. 6 is performed. The processing flow in this processing unit is shown in FIG. Singular value decomposition is performed on this matrix [B] as shown in Equation (8) to obtain a singular value matrix [W]. The singular value matrix [W] is a diagonal element, and the singular values that are the respective elements are arranged in ascending / descending order and are all positive real numbers. The singular values of the matrix element W ₃₃ and the matrix element W ₄₄ are extracted from the singular values. In rank detection, the calculation shown in Formula (41a) or Formula (41b) is performed to obtain the determination amount Ew.

判定量Ｅｗがある許容値δ_W未満であるか、または、判定量Ｅｗが許容値δ_W以上かを判定する。この許容値δ_Wは特定の一定値であり、作業者が逐次、値を設定することもできる。もし、判定量Ｅｗがある許容値δ_W未満の場合は、カメラ運動が平面運動をしたと判断して処理Ａに進み、判定量Ｅｗがある許容値δ_W以上の場合は、カメラ運動が一般運動（平面運動以外）をしたと判断して処理Ｂに進む。処理Ｂとは実施形態１の処理を行って、画像系列における特徴点の時間的動きから、カメラ視点の運動、すなわち、三軸周りの回転運動と三軸方向の並進運動、並びに、物体形状を構成する３次元情報を復元する処理である。そのため、以降では、処理Ａについて説明する。 Determining the amount or Ew is less than a certain tolerance value [delta] _W is or determines whether determination amount Ew allowable value [delta] _W or more. This allowable value δ _W is a specific constant value, and the operator can also set the value sequentially. If less than the allowable value [delta] _W there is determination amount Ew is camera motion is determined to have a planar motion proceeds to the process A, the case of more than the allowable value [delta] _W there is determination amount Ew, the camera motion is generally It is determined that the movement (other than the planar movement) has been performed, and the process proceeds to process B. The process B performs the process of the first embodiment, and from the temporal movement of the feature points in the image sequence, the motion of the camera viewpoint, that is, the rotational motion around the three axes, the translational motion in the three axial directions, and the object shape. This is a process for restoring the three-dimensional information to be configured. Therefore, hereinafter, the process A will be described.

図６における因子分解法処理部（実質、実施形態１の平面運動・３次元情報復元部の処理内容と同じ）での処理を処理Ａとし、この処理フローを図８に示す。図６の復元処理判定部において行った特異値分解により式（７）の３つの行列［Ｕ］、［Ｗ］、［Ｖ］に行列分解した各行列データを入力する。ここで、［Ｕ］は２Ｆ×Ｐサイズの行列、［Ｗ］はＰ×Ｐサイズの対角行列、［Ｖ］はＰ×Ｐサイズの行列である。 The process in the factorization method processing unit in FIG. 6 (substantially the same as the processing content of the plane motion / three-dimensional information restoration unit in the first embodiment) is referred to as processing A, and this processing flow is shown in FIG. Each matrix data obtained by matrix decomposition into the three matrices [U], [W], and [V] of Expression (7) by singular value decomposition performed in the restoration processing determination unit in FIG. 6 is input. Here, [U] is a 2F × P size matrix, [W] is a P × P size diagonal matrix, and [V] is a P × P size matrix.

まず、図８の雑音除去において、式（８）の第二項に示すように、ランク４以上の各行列の成分を削除する。この削除のときは、行列［Ｕ］を取り出し、この行列の要素において第４から第Ｐ列目までを削除し、残りの成分からなる行列を保持し、行列［Ｗ］を取り出し、この行列の要素において第４から第Ｐ行目並びに第４から第Ｐ列目までを削除し、残りの成分からなる行列を保持し、行列［Ｖ］を取り出し、この行列の要素において第４から第Ｐ行目までを削除し、残りの成分からなる行列をそれぞれ保持する（式（９））。 First, in the noise removal of FIG. 8, the components of each matrix of rank 4 or higher are deleted as shown in the second term of equation (8). At the time of this deletion, the matrix [U] is taken out, the fourth to Pth columns are deleted from the elements of this matrix, the matrix consisting of the remaining components is held, the matrix [W] is taken out, The fourth to Pth rows and the fourth to Pth columns are deleted in the element, the matrix composed of the remaining components is retained, the matrix [V] is extracted, and the fourth to Pth rows are extracted from the elements of this matrix. Delete up to the eye and hold the matrix composed of the remaining components (Equation (9)).

図２の次処理の変換行列算出では、保持してある行列［Ｕ'］を取り出し、式（１２ａ）〜（１２ｃ）で得られる値を行列要素にもつ式（１３）で示す行列［Ｄ］を準備し、この行列［Ｄ］と式（１４）に示す計算を行い、値ａ，ｂ，ｃ，ｄ，ｅ，ｆを求める。なお、式（１４）の右辺の最後の行列は１の値を上から２Ｆ個、続けて０の値をＦ個並べた３Ｆ×１の列ベクトルである。値ａ，ｂ，ｃ，ｄ，ｅ，ｆを式（１５）に示す要素に入れた行列［Ｃ］を用意し、この行列［Ｃ］を式（１６）に示すように固有値分解する。ここで、固有値行列の平方と固有値行列から、式（１７）の行列［Ｃ'］を生成し、この行列要素を成分にもつ行列［Ｑ］を式（１８）に従って算出する。 In calculation of the transformation matrix in the next process of FIG. 2, the held matrix [U ′] is taken out, and the matrix [D] represented by the equation (13) having values obtained by the equations (12a) to (12c) as matrix elements. And the calculation shown in the matrix [D] and the equation (14) is performed to obtain the values a, b, c, d, e, and f. Note that the last matrix on the right side of Expression (14) is a 3F × 1 column vector in which 2 values from the top are 1F, and F values from 0 are subsequently arranged. A matrix [C] in which the values a, b, c, d, e, and f are put in the elements shown in Expression (15) is prepared, and this matrix [C] is subjected to eigenvalue decomposition as shown in Expression (16). Here, a matrix [C ′] of Expression (17) is generated from the square of the eigenvalue matrix and the eigenvalue matrix, and a matrix [Q] having this matrix element as a component is calculated according to Expression (18).

さらに、式（Ａ５）右辺左側の行列を式（１９）で得られる行列［Ｍ］に、式（Ａ５）右辺右側の行列を式（２２）で得られる行列［Ｓ'］とする。この対応付けにより、式（１９）の行列成分から、式（２０）の計算によりヨー回転角θを、式（２１）からＸＹ並進運動を復元することができる。 Furthermore, let the matrix on the left side of the right side of Expression (A5) be the matrix [M] obtained by Expression (19), and the matrix on the right side of Expression (A5) be the matrix [S ′] obtained by Expression (22). By this association, the yaw rotation angle θ can be restored from the matrix component of Equation (19) by the calculation of Equation (20), and the XY translational motion can be restored from Equation (21).

一方、式（Ａ５）右辺右側の行列と行列［Ｓ'］の対応付けにより、式（２２）にしたがってユークリッド空間での３次元情報（ｘ，ｙ，ｚ）を復元することができる。すなわち、求めた行列［Ｑ］と、保持しておいた行列［Ｕ'］から、式（１９）の行列演算により行列［Ｍ'］を計算する。行列［Ｍ'］から各フレーム（第ｉフレーム）の行列要素（ｍ_ix，ｎ_ix）または（ｍ_iy，ｎ_iy）を取り出し、式（２０）を使って、ヨー回転θ_i，ｉ＝１，２，…，Ｆを復元する。また、行列［Ｍ'］から各フレーム（第ｉフレーム）の行列要素（Ｔ_iu，Ｔ_iv）を取り出す。この（Ｔ_iu，Ｔ_iv）から、式（２１）を使って第ｉフレームにおけるユークリッド空間での、ＸＹ並進運動（Ｔｘ_i，Ｔｙ_i），ｉ＝１，２，…，Ｆを計算する。 On the other hand, the three-dimensional information (x, y, z) in the Euclidean space can be restored according to the equation (22) by associating the matrix on the right side of the equation (A5) with the matrix [S ′]. That is, the matrix [M ′] is calculated from the obtained matrix [Q] and the stored matrix [U ′] by the matrix operation of Expression (19). The matrix element (m _ix , n _ix ) or (m _iy , n _iy ) of each frame (i-th frame) is extracted from the matrix [M ′], and the yaw rotation θ _i , i = 1 is obtained using equation (20). , 2, ..., F are restored. Further, the matrix element (T _iu , T _iv ) of each frame (i-th frame) is extracted from the matrix [M ′]. From this (T _iu , T _iv ), XY translational motion (Tx _i , Ty _i ), i = 1, 2,..., F in the Euclidean space in the i-th frame is calculated using equation (21).

一方、３次元情報復元では、先に保持しておいた行列［Ｖ'］と、変換行列算出で得られた行列［Ｑ］から、式（２２）に示す行列演算を行い、行列［Ｓ'］を求める。次に、行列［Ｓ'］の要素に対して、式（２３）に示す計算を行い、これを要素とする行列を［Ｐ］とする。行列を［Ｐ］の列ベクトルは、それぞれ第ｊ番目の特徴点のユークリッド空間での３次元座標値（Ｘ_j，Ｙ_j，Ｚ_j）になっている。 On the other hand, in the three-dimensional information restoration, the matrix operation shown in Expression (22) is performed from the previously held matrix [V ′] and the matrix [Q] obtained by the transformation matrix calculation, and the matrix [S ′ ]. Next, the calculation shown in Expression (23) is performed on the elements of the matrix [S ′], and the matrix having these elements as [P]. The column vector of the matrix [P] is a three-dimensional coordinate value (X _j , Y _j , Z _j ) in the Euclidean space of the jth feature point.

処理Ａは図６の復元処理制定部において平面運動と判定して処理が行われるため、画像系列における特徴点の時間的動きから、カメラ視点の運動、すなわち、光軸周りの回転と二軸方向（ＸＹ）の並進運動、並びに、物体形状を構成する３次元情報を復元する。 Since the process A is performed by determining the planar motion in the restoration processing establishment unit in FIG. 6, from the temporal movement of the feature points in the image sequence, the camera viewpoint motion, that is, rotation around the optical axis and biaxial direction The translational motion of (XY) and the three-dimensional information constituting the object shape are restored.

なお、図６の復元処理判定部において一般運動と判定した場合は、実施形態１と同様に、画像系列における特徴点の時間的動きから、カメラ視点の運動、すなわち、三軸周りの回転運動と三軸方向の並進運動、並びに、物体形状を構成する３次元情報を復元することができる。 When the restoration processing determination unit in FIG. 6 determines that the motion is a general motion, the camera viewpoint motion, that is, the rotational motion around the three axes, from the temporal motion of the feature points in the image series, as in the first embodiment. It is possible to restore three-dimensional translational motion and three-dimensional information constituting the object shape.

（実施形態４）
図９は請求項４の発明に関する基本構成図である。本実施形態では、実施形態２に対して、画像変換処理部６から入力されるｕｖ画像において、特徴点計測部２により得た特徴点のｕｖ座標値からなる行列分解データを用いて、復元処理判定部７でカメラ運動と３次元情報を復元する処理を判定する点が異なるため、この点だけについて説明する。 (Embodiment 4)
FIG. 9 is a basic structural view relating to the invention of claim 4. In the present embodiment, a restoration process is performed on the uv image input from the image conversion processing unit 6 by using matrix decomposition data composed of uv coordinate values of the feature points obtained by the feature point measurement unit 2 in the uv image input from the image conversion processing unit 6. Since the determination unit 7 differs in the point of determining the camera motion and the process of restoring the three-dimensional information, only this point will be described.

本実施形態は、時系列画像データベース１には、ハードディスク、ＲＡＩＤ装置、ＣＤ−ＲＯＭなどの記録媒体を利用する、または、ネットワークを介したリモートなデータ資源を利用する形態でもどちらでも構わない。さらに、また、図９ａはリアルタイムで処理する場合の処理構成図であり、本発明は必ずしも各データベース部などの記憶装置を必要としない。 In this embodiment, the time-series image database 1 may use either a hard disk, a RAID device, a CD-ROM or other recording medium, or a remote data resource via a network. Further, FIG. 9A is a processing configuration diagram in the case of processing in real time, and the present invention does not necessarily require a storage device such as each database unit.

図９の特徴点計測部により得た特徴点のｕｖ座標値を行列要素とする計測行列［Ｂ］を保持する。この計測行列［Ｂ］は式（５ａ）の形式となっている。次に、図６の復元処理判定部での処理を行う。この処理部での処理フローは図７である。この行列［Ｂ］に対して、式（８）に示すように特異値分解を行い、特異値行列［Ｗ］を得る。特異値行列［Ｗ］は対角行列であり、その各要素である特異値は昇降順の並びで、かつ、全て正の実数となっている。この特異値の中から行列要素Ｗ₃₃と行列要素Ｗ₄₄の特異値を取り出す。ランク検出では、式（４１ａ）または式（４１ｂ）に示す計算を行い判定量Ｅ_Wを得る。判定量Ｅ_Wがある許容値δ_W未満であるか、または、判定量Ｅ_Wが許容値δ_W以上かを判定する。この許容値δ_Wは特定の一定値であり、作業者が逐次、値を設定することもできる。もし、判定量Ｅ_Wがある許容値δ_W未満の場合は、カメラ運動が平面運動をしたと判断して処理Ａに進み、判定量Ｅ_Wがある許容値δ_W以上の場合は、カメラ運動が一般運動（平面運動以外）をしたと判断して処理Ｂに進む。 A measurement matrix [B] having the uv coordinate values of the feature points obtained by the feature point measurement unit in FIG. 9 as matrix elements is held. This measurement matrix [B] is in the form of equation (5a). Next, processing in the restoration processing determination unit in FIG. 6 is performed. The processing flow in this processing unit is shown in FIG. Singular value decomposition is performed on this matrix [B] as shown in Equation (8) to obtain a singular value matrix [W]. The singular value matrix [W] is a diagonal matrix, and the singular values that are each element thereof are arranged in ascending / descending order and are all positive real numbers. Singular values of the matrix element W ₃₃ and the matrix element W ₄₄ are extracted from the singular values. In rank detection, the calculation shown in Formula (41a) or Formula (41b) is performed to obtain the determination amount E _W. Determining the amount of E or _W is less than a certain tolerance value [delta] _W is or determines whether determination amount E _W allowable value [delta] _W or more. This allowable value δ _W is a specific constant value, and the operator can also set the value sequentially. If the determination amount E _W is less than a certain allowable value δ _W , it is determined that the camera motion has moved in a plane, and the process proceeds to processing A. If the determination amount E _W is greater than the certain allowable value δ _W , the camera motion Is determined to have performed general motion (other than planar motion), and the process proceeds to process B.

処理Ｂとは実施形態１の処理を行って、画像系列における特徴点の時間的動きから、カメラ視、点の運動、すなわち、三軸周りの回転と三軸方向の並進運動、並びに、物体形状を構成する３次元情報を復元する処理である。なお、処理Ａについては実施形態３の処理Ａと同じであるため説明を省く。 The process B is the process of the first embodiment. From the temporal movement of the feature points in the image series, the camera view, the movement of the points, that is, the rotation around the three axes and the translational movement in the three axes, and the object shape Is a process for restoring the three-dimensional information constituting the. Note that the processing A is the same as the processing A of the third embodiment, and thus description thereof is omitted.

請求項１等の画像蓄積型の基本構成図。A basic configuration diagram of an image storage type according to claim 1 or the like. 請求項１等のリアルタイム処理型の基本構成図。A basic configuration diagram of a real-time processing type according to claim 1 or the like. 請求項１等の処理フロー図。The processing flowchart of Claim 1 grade | etc.,. 請求項２等の画像蓄積型の某本構成図。The image storage type copy block diagram of Claim 2 grade | etc.,. 請求項２等のリアルタイム処理型の基本構成図。A basic configuration diagram of a real-time processing type according to claim 2 or the like. ｕｖ画像変換の処理フロー。The process flow of uv image conversion. ｕｖ画像生成を説明する図。The figure explaining uv image generation. 請求項３等の画像蓄積型の基本構成図。4. A basic configuration diagram of an image storage type according to claim 3. 請求項３等のリアルタイム処理型の基本構成図。A basic configuration diagram of a real-time processing type according to claim 3 or the like. 復元処理判定部での処理フロー。The processing flow in a restoration process determination part. 処理Ａの処理フロー。Process flow of process A. 請求項４等の画像蓄積型の基本構成図。5. A basic configuration diagram of an image storage type according to claim 4. 請求項４等のリアルタイム処理型の基本構成図。A basic configuration diagram of a real-time processing type according to claim 4 or the like. 全方位カメラと復元する空間情報とカメラ座標系を示す図。The figure which shows the omnidirectional camera, the spatial information to restore | restore, and a camera coordinate system. 画像から垂直線分を検出し、交点Ａを求めることを説明する図。The figure explaining detecting a perpendicular line segment from an image and calculating intersection A. 街路樹などの遮蔽物により、画像から検出した垂直線分で交点Ａが求められないことを示す図。The figure which shows that the intersection A is not calculated | required by the vertical line segment detected from the image by obstructions, such as a roadside tree. カメラ運動と３次元情報を復元するとき、カメラ光軸を鉛直向きに向けない利用形態。Usage mode in which the camera optical axis is not oriented vertically when restoring camera motion and 3D information. 魚眼カメラが撮像した屋外映像における建物に付随するエッジラインを示す図。The figure which shows the edge line accompanying the building in the outdoor image | video which the fisheye camera imaged. 図１４のエッジをｕｖ画像として変換した結果を示す図。The figure which shows the result of having converted the edge of FIG. 14 as a uv image.

Explanation of symbols

１時系列画像データベース
１Ａ画像入力部
２特徴点計測部
３計測行列変換部
４計測行列入力部
５因子分解法処理部
５Ａ平面運動・３次元情報復元部
５ＢＺ軸運動復元部
５Ｃ回転運動復元部
６画像変換処理部
７復元処理判定部 DESCRIPTION OF SYMBOLS 1 Time-sequential image database 1A Image input part 2 Feature point measurement part 3 Measurement matrix conversion part 4 Measurement matrix input part 5 Factor decomposition method processing part 5A Plane motion / three-dimensional information restoration part 5B Z-axis movement restoration part 5C Rotation movement restoration part 6 Image conversion processing unit 7 Restoration processing determination unit

Claims

From the temporal change in image coordinate values for feature points placed in the target image in a time-series omnidirectional image or wide-field image acquired by an image input device such as a fisheye camera, the time-series camera viewpoint A method for restoring motion and 3D information constituting the object shape of the outside world,
The value obtained by dividing the orthogonal direction component of the moving direction of the camera by the height direction component in the coordinate value obtained by projecting the three-dimensional coordinate value of the outside world onto the spherical surface with the viewpoint as the center, and the moving direction component of the camera as the height direction component In the feature point coordinate system set as a time-series omnidirectional image or wide-field image, with the values divided by u as the u coordinate and v coordinate, the image coordinate values of the feature points in each image are obtained, and the image coordinates A measurement matrix measurement step for obtaining a uv coordinate value of the feature point from the phase angle and the elevation angle calculated from the values;
Using all the uv coordinate values obtained in the measurement matrix measurement step, matrix decomposition data for restoration processing is generated, the matrix decomposition data is subjected to singular value decomposition, noise is removed, and matrix data representing motion information is obtained. Matrix data representing three-dimensional information is obtained, and in the motion information, a transformation matrix that satisfies the conditions set for defining the motion is obtained, and this transformation matrix is applied to the matrix data representing the motion information to relate to the camera viewpoint. The rotational motion around the Z-axis that becomes the optical axis and the translational motion in the XY-axis directions of the two axes are restored, and the inverse matrix of this transformation matrix is applied to matrix data representing three-dimensional information to form the object shape 3 Planar motion / three-dimensional information restoration step for restoring dimensional information;
The reprojection coordinate value in the uv coordinate system for each feature point is obtained from the plane motion and the three-dimensional information obtained in the plane motion / three-dimensional information restoration step, and each reprojection coordinate value and each of the measurement matrix measurement steps obtained in the measurement matrix measurement step are obtained. The Z axis that restores the translational motion of the camera viewpoint in the Z axis direction using the error between the uv coordinate value of the feature point and the Z coordinate value of each feature point obtained in the plane motion / three-dimensional information restoration step A motion recovery step;
From the matrix decomposition data, components that contribute to rotational motion around axes other than the optical axis are extracted, and those components and motion restored in the plane motion / three-dimensional information restoration step and Z-axis motion restoration step, In addition, using the three-dimensional information, a rotational motion restoring step for restoring rotational motion around another axis other than the optical axis;
It is determined whether or not the error obtained in the Z-axis motion restoration step has converged to a certain value or less. If not, the camera viewpoint XY obtained in the planar motion and three-dimensional information restoration step is determined. Using the translation component, the three-dimensional information, and the rotation angle around the XY axis other than the optical axis, a coefficient ε for deforming the uv coordinate value is obtained, and each feature point obtained in the plane motion and the three-dimensional information restoration step The coefficient δ for scaling the uv coordinate value is obtained using the height information of the image and the position on the Z axis of each viewpoint obtained in the Z axis translational motion restoration step, and the coefficient ε is obtained in the measurement matrix measurement step. Obtained by multiplying the coefficient δ by the value obtained by multiplying each matrix element of the measurement matrix and each coordinate value obtained by subtracting the intersection coordinate value from each matrix element of the measurement matrix obtained in the measurement matrix measurement step. New value combined Generating a matrix decomposition data as column elements, returning to the information restoration step, and repeating until the error converges to a certain value or less to restore the camera movement and the three-dimensional information; A method for restoring three-dimensional information.

From the temporal change in image coordinate values for feature points placed in the target image in a time-series omnidirectional image or wide-field image acquired by an image input device such as a fisheye camera, the time-series camera viewpoint A method for restoring motion and 3D information constituting the object shape of the outside world,
With respect to the omnidirectional image or wide-field image acquired by the image input device, each image coordinate value is converted into a uv coordinate value by using a phase angle and an elevation angle obtained from each image coordinate value, and each pixel value is converted into its uv coordinate value. A uv measurement matrix measurement step of generating a time series uv image so as to be associated with each other, and measuring a uv coordinate value of a feature point in each image in a feature point coordinate system set in the uv image;
Matrix decomposition data for restoration processing is generated for each uv coordinate value obtained in the uv measurement matrix measurement step, the matrix decomposition data is subjected to singular value decomposition, noise is removed, and matrix data representing motion information is generated. And obtaining matrix data representing 3D information, obtaining a transformation matrix satisfying the conditions set for defining the motion in the motion information, and applying this transformation matrix to the matrix data representing the motion information, 3D information that constitutes the object shape by applying the inverse matrix of this transformation matrix to the matrix data representing the 3D information and the rotational motion around the Z axis that is the optical axis and the translational motion in the X and Y directions of the 2 axes Plane motion and 3D information restoration step to restore
A reprojection value in the uv coordinate system for each feature point is obtained from the plane motion and the three-dimensional information obtained in the plane motion / three-dimensional information restoration step, and each feature obtained in the reprojection value and the uv measurement matrix measurement step is obtained. A Z-axis motion that restores the translational motion of the camera viewpoint in the Z-axis direction using the error between the uv coordinate value of the point and the Z-coordinate value of each feature point obtained in the plane motion / three-dimensional information restoration step. A restore step,
From the matrix decomposition data, components that contribute to rotational motion around other axes other than the optical axis are extracted, and those components and motion restored in the plane motion / three-dimensional information restoration step and Z-axis motion restoration step. And a rotational motion restoring step for restoring rotational motion around another axis other than the optical axis using the three-dimensional information,
It is determined whether or not the error obtained in the Z-axis motion restoration step has converged to a certain value or less. If not, the camera viewpoint obtained in the planar motion and the three-dimensional information restoration step is determined. Using the XY translation component, the three-dimensional information, and the rotation angle around the XY axis other than the optical axis, a coefficient ε for deforming the uv coordinate value is obtained, and each feature obtained in the plane motion and the three-dimensional information restoration step Using the point height information and the position of each viewpoint on the Z-axis obtained in the Z-axis translational motion restoration step, a coefficient δ for scaling the uv coordinate value is obtained, and the coefficient ε is obtained in the measurement matrix measurement step. The value obtained by multiplying each matrix element of the measured matrix and the coordinate value obtained by subtracting the intersection coordinate value from each matrix element of the measurement matrix obtained in the measurement matrix measurement step are multiplied by the coefficient δ, respectively. Combine the obtained values A fish-eye camera which generates matrix decomposition data as new matrix elements, returns to the information restoration step, and repeats until the error converges to a certain value or less to restore camera motion and three-dimensional information How to restore motion and 3D information.

In the measurement matrix measurement step, in the singular value component obtained by singular value decomposition of the measurement matrix, a determination value representing motion is calculated from the component, and if the determination value is less than a certain value, it is regarded as planar motion. Perform 3 degrees of freedom of plane motion consisting of rotation around the optical axis and motion on a plane perpendicular to the optical axis, and processing to restore 3D information. If the judgment value is above a certain value, 3. The method of restoring fisheye camera motion and 3D information according to claim 1 or 2, wherein the motion of 6 degrees of freedom consisting of translational motion and the external 3D information are restored.

In the uv measurement matrix measurement step, in a singular value component obtained by singular value decomposition of the measurement matrix, a determination value representing motion is calculated from the component, and if the determination value is less than a certain value, it is regarded as plane motion. If the judgment value is greater than a certain value, it performs a rotation about the optical axis and a plane motion with 3 degrees of freedom consisting of a rotation around the optical axis and a motion on a plane perpendicular to the optical axis. 3. The method of restoring fisheye camera motion and three-dimensional information according to claim 1 or 2, wherein the motion with a degree of freedom of 6 comprising translational motion and the three-dimensional information of the outside world are restored.

From the temporal change in image coordinate values for feature points placed in the target image in a time-series omnidirectional image or wide-field image acquired by an image input device such as a fisheye camera, the time-series camera viewpoint A device for restoring motion and three-dimensional information constituting the object shape of the outside world,
The value obtained by dividing the orthogonal direction component of the moving direction of the camera by the height direction component in the coordinate value obtained by projecting the three-dimensional coordinate value of the outside world onto the spherical surface with the viewpoint as the center, and the moving direction component of the camera as the height direction component In the feature point coordinate system set as a time-series omnidirectional image or wide-field image, with the values divided by u as the u coordinate and v coordinate, the image coordinate values of the feature points in each image are obtained, and the image coordinates A measurement matrix measuring means for obtaining a uv coordinate value of the feature point from a phase angle and an elevation angle calculated from the values;
Using all uv coordinate values obtained by the measurement matrix measurement means, matrix decomposition data for restoration processing is generated, and this matrix decomposition data is subjected to singular value decomposition, noise is removed, and matrix data representing motion information is obtained. Matrix data representing three-dimensional information is obtained, and in the motion information, a transformation matrix that satisfies the conditions set for defining the motion is obtained, and this transformation matrix is applied to the matrix data representing the motion information to relate to the camera viewpoint. The rotational motion around the Z-axis that becomes the optical axis and the translational motion in the XY-axis directions of the two axes are restored, and the inverse matrix of this transformation matrix is applied to matrix data representing three-dimensional information to form the object shape 3 Plane motion / three-dimensional information restoring means for restoring dimensional information;
The reprojection coordinate value in the uv coordinate system for each feature point is obtained from the plane motion obtained by the plane motion / three-dimensional information restoration means and the three-dimensional information, and the reprojection coordinate value and each measurement matrix measurement means obtained by the measurement matrix measurement means are obtained. The Z axis that restores the translational motion of the camera viewpoint in the Z axis direction using the error between the uv coordinate value of the feature point and the Z coordinate value of each feature point obtained by the plane motion / three-dimensional information restoration means Motion recovery means,
From the matrix decomposition data, components that contribute to rotational motion around axes other than the optical axis are extracted, and those components and motion restored by the planar motion / three-dimensional information restoring means and Z-axis motion restoring means, And rotational motion restoring means for restoring rotational motion around another axis other than the optical axis using three-dimensional information,
It is determined whether or not the error obtained by the Z-axis motion restoration means has converged to a certain value or less. If not, the camera viewpoint XY obtained by the planar motion and the three-dimensional information restoration means is determined. Using the translation component, the three-dimensional information, and the rotation angle around the XY axis other than the optical axis, a coefficient ε for deforming the uv coordinate value is obtained, and each feature point obtained by the plane motion and the three-dimensional information restoration means The coefficient δ for scaling the uv coordinate value is obtained using the height information of the image and the position on the Z-axis of each viewpoint obtained by the Z-axis translational movement restoring means, and the coefficient ε is obtained by the measurement matrix measuring means. Obtained by multiplying the value obtained by multiplying each matrix element of the measurement matrix by the coefficient δ by each coordinate value obtained by subtracting the intersection coordinate value from each matrix element of the measurement matrix obtained by the measurement matrix measurement means Matrix decomposition data with new matrix elements combined with values A reconstructing device for fisheye camera motion and three-dimensional information, wherein the camera motion and three-dimensional information are reconstructed by returning to the information restoring means and repeating until the error converges to a certain value or less. .

From the temporal change in image coordinate values for feature points placed in the target image in a time-series omnidirectional image or wide-field image acquired by an image input device such as a fisheye camera, the time-series camera viewpoint A device for restoring motion and three-dimensional information constituting the object shape of the outside world,
With respect to the omnidirectional image or wide-field image acquired by the image input device, each image coordinate value is converted into a uv coordinate value by using a phase angle and an elevation angle obtained from each image coordinate value, and each pixel value is converted into its uv coordinate value. A uv measurement matrix measuring means for generating a time series uv image so as to be associated with each other and measuring the uv coordinate value of the feature point in each image in the feature point coordinate system set in the uv image;
For each uv coordinate value obtained by the uv measurement matrix measurement means, matrix decomposition data for restoration processing is generated, the matrix decomposition data is subjected to singular value decomposition, noise is removed, and matrix data representing motion information is generated. And obtaining matrix data representing 3D information, obtaining a transformation matrix satisfying the conditions set for defining the motion in the motion information, and applying this transformation matrix to the matrix data representing the motion information, 3D information that constitutes the object shape by applying the inverse matrix of this transformation matrix to the matrix data representing the 3D information and the rotational motion around the Z axis that is the optical axis and the translational motion in the X and Y directions of the 2 axes Plane motion / three-dimensional information restoration means for restoring
A reprojection value in the uv coordinate system for each feature point is obtained from the plane motion obtained by the plane motion / three-dimensional information restoration means and the three-dimensional information, and the reprojection value and each feature obtained by the uv measurement matrix measurement means. Z-axis motion that restores the translational motion of the camera viewpoint in the Z-axis direction using the error between the uv coordinate value of the point and the Z-coordinate value of each feature point obtained by the plane motion / three-dimensional information restoration means Recovery means,
From the matrix decomposition data, components that contribute to rotational motion around axes other than the optical axis are extracted, and those components and motion restored by the planar motion / three-dimensional information restoring means and the Z-axis motion restoring means. , As well as rotational motion restoring means for restoring rotational motion around another axis other than the optical axis using three-dimensional information;
It is determined whether or not the error obtained by the Z-axis motion restoration means has converged to a certain value or less. If not, the camera viewpoint obtained by the planar motion and the three-dimensional information restoration means is determined. Using the XY translation component, the three-dimensional information, and the rotation angle around the XY axis other than the optical axis, a coefficient ε for deforming the uv coordinate value is obtained, and each feature obtained by the plane motion and the three-dimensional information restoration means Using the point height information and the position of each viewpoint on the Z-axis obtained by the Z-axis translational motion restoration means, a coefficient δ for scaling the uv coordinate value is obtained, and the coefficient ε is obtained by the measurement matrix measurement means. The value obtained by multiplying each matrix element of the measured matrix and the coordinate value obtained by subtracting the intersection coordinate value from each matrix element of the measurement matrix obtained by the measurement matrix measuring means are multiplied by the coefficient δ, respectively. A new matrix element combining the obtained values Reconstructing the fisheye camera motion and 3D information, generating decomposed data, returning to the information restoring means, and repeating until the error converges below a certain value to restore the camera motion and 3D information apparatus.

In the measurement matrix measurement means, in the singular value component obtained by singular value decomposition of the measurement matrix, a determination value representing motion is calculated from the component, and if the determination value is less than a certain value, it is regarded as planar motion. Perform 3 degrees of freedom of plane motion consisting of rotation around the optical axis and motion on a plane perpendicular to the optical axis, and processing to restore 3D information. If the judgment value is above a certain value, The apparatus for restoring fisheye camera motion and three-dimensional information according to claim 5 or 6, wherein the motion with six degrees of freedom consisting of translational motion and the three-dimensional information of the outside world are restored.

In the uv measurement matrix measurement means, in a singular value component obtained by singular value decomposition of the measurement matrix, a determination value representing motion is calculated from the component, and if the determination value is less than a certain value, it is regarded as plane motion. If the judgment value is greater than a certain value, it performs a rotation about the optical axis and a plane motion with 3 degrees of freedom consisting of a rotation around the optical axis and a motion on a plane perpendicular to the optical axis. 7. The apparatus for restoring fisheye camera motion and three-dimensional information according to claim 5 or 6, wherein the motion with a degree of freedom of 6 consisting of translational motion and the three-dimensional information of the outside world are restored.

5. A recording medium having recorded thereon a program that can execute the fisheye camera motion and the three-dimensional information restoration method according to any one of claims 1 to 4 by a computer.