JP2006172026A

JP2006172026A - Camera motion, and device and method for restoring three-dimensional information, and program

Info

Publication number: JP2006172026A
Application number: JP2004362152A
Authority: JP
Inventors: Isao Miyagawa; 勲宮川; Yoshiori Wakabayashi; 佳織若林; Kenichi Arakawa; 賢一荒川
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2004-12-15
Filing date: 2004-12-15
Publication date: 2006-06-29

Abstract

<P>PROBLEM TO BE SOLVED: To stably and robustly restor camera motion and three-dimensional information at the same time. <P>SOLUTION: This restoring device is composed of a time-series image database 1 for storing time-series images, an asymptotic matrix generating part 2 for extracting an image series from the database 1, observing image coordinate values of a characteristic point between full frames and generating asymptotic matrix data to be a source for restoring camera motion and three-dimensional information, a plane motion/three-dimensional information restoring part 3 for restoring the plane motion and the three-dimensional information from the asymptotic matrix data, and a stabilization processing part 4 for restoring rotational motion other than optical axial rotation and optical axial translational motion, determining whether to require the next iteration to asymptotically make (stabilize) camera motion to plane motion and transferring information required to generate asymptotic matrix data to the asymptotic matrix generating part 2 when the iteration is necessary. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、カメラを使って取得した車載画像または室内画像、船上からの海上画像、空撮画像、全方位カメラで撮影した全方位画像、歩行しながら撮影した歩行撮影画像などの時系列画像全般に対してカメラ運動と３次元情報を復元する装置、方法に係り、時系列画像から、カメラの視点を原点として設定したカメラ座標系におけるロール、ピッチ、ヨー回転から構成される三軸（ＸＹＺ）周りの回転運動、三軸（ＸＹＺ）方向の並進運動、並びに、時系列映像に写っている外界の３次元形状、すなわち、被写体（物体）の外観形状を構成する３次元情報を復元する装置、方法に関する。 The present invention generally relates to time-series images such as an in-vehicle image or indoor image acquired using a camera, a marine image from a ship, an aerial image, an omnidirectional image captured by an omnidirectional camera, and a walking image captured while walking. Three-dimensional (XYZ) composed of roll, pitch, and yaw rotation in a camera coordinate system set with the camera viewpoint as the origin from a time-series image A device that restores the three-dimensional information that constitutes the rotational shape of the surroundings, the translational motion in the three-axis (XYZ) direction, and the three-dimensional shape of the external environment reflected in the time-series image, that is, the appearance shape of the subject (object), Regarding the method.

コンピュータビジョン分野では、時系列画像から、対象物の形状を計測または獲得する手法には、ステレオ計測やエピポーラ面解析を用いた３次元解析手法がある。この手法によれば、物体が撮影されている複数の時系列画像から、空間形状または空間構造に関する３次元位置情報、並びに、カメラ視点に関する運動を復元することができる。しかし、移動手段などを利用して撮影カメラを動かしながら撮影した時系列映像においては、撮影時の環境、撮影カメラの微小な動きによりシームレスに映像取得が困難であり、時系列映像中にランダム性の雑音が混入し、カメラ運動や物体形状を正確に復元することが困難な場合がある。 In the computer vision field, methods for measuring or acquiring the shape of an object from time-series images include a three-dimensional analysis method using stereo measurement and epipolar surface analysis. According to this method, it is possible to restore the three-dimensional position information related to the spatial shape or the spatial structure and the motion related to the camera viewpoint from a plurality of time-series images in which the object is photographed. However, it is difficult to seamlessly acquire images in time-series images taken while moving the camera using moving means, etc., due to the shooting environment and the minute movement of the camera. In some cases, it is difficult to accurately restore camera motion and object shape.

このような問題に対して、カメラで撮影した映像シーンから、ユークリッド空間でのカメラ運動と物体形状を同時に、かつ、ロバストに復元する手法が存在する。例えば、取得する画像において特徴点をつけ、この特徴点を時系列に追跡して得た画像座標値から、カメラ運動と３次元情報を復元する代表的な手法として因子分解法がある（例えば、被特許文献１参照）。 In order to solve such a problem, there is a technique for simultaneously and robustly restoring the camera motion and the object shape in the Euclidean space from the video scene photographed by the camera. For example, there is a factorization method as a typical method for restoring camera motion and three-dimensional information from image coordinate values obtained by attaching feature points to an acquired image and tracking the feature points in time series (for example, (See Patent Document 1).

この因子分解法では、式（Ａ１）に示すように、画像上において取得した時系列のｘｙ画像座標値（式（Ａ１）左辺）から、カメラ運動に関する行列（右辺の左側の行列（（ｍ_ix，ｍ_iy，ｍ_iz）と（ｎ_ix，ｎ_iy，ｎ_iz）は第ｉフレームでの投影モデルに従ったカメラ運動の成分を表す）と３次元情報（Ｘ，Ｙ，Ｚ）に関する行列（右辺の右側の行列、（Ｘ_j，Ｙ_j，Ｚ_j）は第ｊ番目の３次元座標値）に分解する手法である。この行列分解には特異値分解なる数学的手法が使われている。式（Ａ１）左辺は計測行列と呼ばれており、行方向は特徴点の数を、列方向はフレーム順を表す。（ｘ_ij，ｙ_ij）は第ｉフレームの第ｊ番目の特徴点の画像座標値となっている。 In this factorization method, as shown in equation (A1), a camera motion matrix (matrix on the left side of the right side ((m _ix )) is obtained from time-series xy image coordinate values (left side of equation (A1)) acquired on the image. , _{M iy} , m _iz ) and (n _ix , n _iy , n _iz ) represent the components of the camera motion according to the projection model in the i-th frame) and a matrix (X, Y, Z) relating to the three-dimensional information (X, Y, Z) This is a method of decomposing the matrix on the right side of the right side, (X _j , Y _j , Z _j ) is the j-th three-dimensional coordinate value. The left side of equation (A1) is called a measurement matrix, the row direction represents the number of feature points, the column direction represents the frame order, and (x _ij , y _ij ) represents the jth feature point of the i-th frame. Image coordinate values.

因子分解法では、式（Ａ１）左辺は画像から得られる２次元情報（あらかじめ各フレームの重心座標値を引いた２次元座標値としている）だけになっており、式（Ａ１）右辺の行列により行列演算する（線形演算する）ことで左辺の画像座標値が得られるという投影モデルに基づいている。しかし、式（Ａ１）に分解できる投影モデルは正射影、弱透視投影、平行透視投影モデルに限る。これらの投影モデルは現実の透視投影モデルの近似形式であり、被写体とカメラの関係によってはそれぞれの条件が近似的に成り立つ場合もあるが、一般的には透視投影モデルとのギャップのために式（Ａ１）のように因子分解した場合はカメラ運動、３次元形状の精度は悪い。 In the factorization method, the left side of the equation (A1) is only two-dimensional information obtained from the image (a two-dimensional coordinate value obtained by subtracting the barycentric coordinate value of each frame in advance). This is based on a projection model in which an image coordinate value on the left side is obtained by performing a matrix operation (linear operation). However, the projection models that can be decomposed into the formula (A1) are limited to orthographic projection, weak perspective projection, and parallel perspective projection models. These projection models are approximate forms of actual perspective projection models, and depending on the relationship between the subject and the camera, the respective conditions may approximately hold. When factorization is performed as in (A1), the accuracy of camera motion and three-dimensional shape is poor.

これに対して、透視投影型因子分解法を使うと、式（Ａ１）の因子分解法を反復的に利用して透視投影モデルに漸近させて、透視投影像の条件でカメラ運動と３次元形状を同時に復元することができる。この方法により正射影、弱透視、平行透視投影モデルより高精度にカメラ運動と３次元形状が得られるが、各反復において分解した際の符号を考慮する必要があり、場合によっては透視投影を満たす正確なカメラ運動と３次元形状を求めることが困難であった。
C.Tomasi and T.Kanade;"Shape and Motion from Image Streams UnderOrthography:A Factorization Method",International Journal of Computer Vision,Vol.9,No.2,1992。 On the other hand, when the perspective projection type factorization method is used, the factorization method of the formula (A1) is repeatedly used to asymptotically approach the perspective projection model, and the camera motion and the three-dimensional shape are obtained under the conditions of the perspective projection image. Can be restored at the same time. This method can obtain camera motion and three-dimensional shape with higher accuracy than orthographic, weak perspective, and parallel perspective projection models. However, it is necessary to consider the sign at the time of decomposition in each iteration. It was difficult to obtain accurate camera motion and 3D shape.
C. Tomasi and T. Kanade; "Shape and Motion from Image Streams UnderOrthography: A Factorization Method", International Journal of Computer Vision, Vol. 9, No. 2,1992.

カメラを車両などの移動手段に搭載し、市街地を移動観測する場合、路面と車両の関係でカメラ振動（カメラの揺れ）が発生する。また、カメラ付き携帯電話、ディジタルカメラなどの市販カメラで被写体を撮影したとき、手振れが発生し、撮影中に微小に振動することで映像シーンが揺れるという問題がある。 When a camera is mounted on a moving means such as a vehicle and moving and observed in an urban area, camera vibration (camera shake) occurs due to the relationship between the road surface and the vehicle. In addition, when a subject is photographed with a commercially available camera such as a mobile phone with a camera or a digital camera, there is a problem that a camera shake occurs and the video scene is shaken by a slight vibration during the photographing.

このようなカメラ振動や手振れのある映像シーンにおいてカメラ運動と３次元形状を復元しようとするとき、因子分解法が有効な手法であるが、透視投影型因子分解法を使う場合、カメラ振動や手振れによるカメラ振動を正確に求めることができず、そのため３次元形状が歪んでしまうという問題があった。この問題は並進運動として復元されなければならないカメラ運動が回転運動に転換されてしまうことが原因の一つとなっている。 The factorization method is effective when trying to restore the camera motion and 3D shape in a video scene with camera vibration and camera shake. However, when using the perspective projection type factorization method, the camera vibration and camera shake are effective. Therefore, there is a problem in that the camera vibration cannot be accurately obtained, and the three-dimensional shape is distorted. This problem is caused by the fact that the camera motion that must be restored as translational motion is converted into rotational motion.

一方、市販カメラにはカメラ振動や手振れ補正機能を有するカメラもあり、撮影中のカメラ振動や手振れを低減したシームレスな映像シーンを撮影することができる。一般的に、手振れ補正は角度センサやジャイロセンサによりカメラ振動や手振れの振動を検出し、それに合わせて集光レンズを移動させるという光学式手振れ補正タイプと、画面中のオプティカルフローを検出してＣＣＤ面において画素をシフトして揺れを低減させる電子式手振れ補正タイプに大別できる。しかし、どちらの補正方式においてもカメラ運動と３次元形状を復元するための投影モデルを定式化することが困難であり、正確な投影モデルを得るには多くのパラメータが介在するため、因子分解法の適用が困難である。 On the other hand, some commercially available cameras have camera vibration and camera shake correction functions, and can shoot seamless video scenes with reduced camera vibration and camera shake during shooting. In general, camera shake correction is an optical camera shake correction type that detects camera vibration or camera shake vibration using an angle sensor or gyro sensor, and moves the condensing lens accordingly, and CCD that detects optical flow in the screen. It can be roughly classified into an electronic image stabilization type that shifts pixels on the surface to reduce shaking. However, in both correction methods, it is difficult to formulate a projection model for restoring the camera motion and the three-dimensional shape, and many parameters are involved in obtaining an accurate projection model. Is difficult to apply.

そこで、移動観測でのカメラ振動や手振れがある映像シーンにおいてもロバスト、かつ、高精度にカメラ運動と３次元形状を同時に復元することが重要であるが、透視投影型因子分解法のように解を得るために符号を考慮せず、かつ、安定してカメラ運動と３次元形状を求める必要がある。 Therefore, it is important to restore the camera motion and 3D shape at the same time with high accuracy and robustness even in video scenes with camera vibrations and camera shakes during movement observation. Therefore, it is necessary to obtain the camera motion and the three-dimensional shape stably without considering the sign.

カメラを使って外界を撮像し、その得られた画像系列からカメラ視点の運動と３次元情報を復元しようとするとき、カメラ視点に関する運動と対象物の３次元座標値は、図１４の状況において、式（Ａ２）の式で関係付けられる。 When the camera is used to image the outside world and the motion of the camera viewpoint and the three-dimensional information are to be restored from the obtained image sequence, the motion related to the camera viewpoint and the three-dimensional coordinate value of the object are as shown in FIG. , (A2).

式（Ａ２）の（ｘ'_ij，ｙ'_ij，ｚ'_ij）は視点を原点とした単位半球面座標値であり、画像上へ投影されて得られる対象物（特徴点）の画像座標値は（ｘ_ij，ｙ_ij）＝（ｘ'_ij／ｚ’_ij，ｙ'_ij／ｚ'_ij）として観測できる。ただし、λ_ijは画像座標値から直接得られず、しかも、時系列に依存する成分（サフィックスがｉのもの）と時系列に依存しない成分（サフィックスがｊのもの）に分離できないため、式（Ａ２）左辺においてある。式（Ａ２）右辺は４つの行列からなっているが、３つの回転行列を展開して１つの行列にして、式（Ａ１）右辺のカメラ運動に対応する行列とすることもできるが、式（Ａ２）そのものから分かるように、式（Ａ１）のように左辺には画像座標値から得た情報、右辺にはカメラ運動に対応する情報（サフィックスがｉのもの）と３次元情報に対応する情報（サフィックスがｊのもの）に変形することは困難である。言い換えれば、式（Ａ２）に基づいて因子分解法を応用してカメラ運動と３次元情報を復元することはできない。 (X ′ _ij , y ′ _ij , z ′ _ij ) in Expression (A2) is a unit hemispherical coordinate value with the viewpoint as the origin, and the image coordinate value of the object (feature point) obtained by being projected onto the image. Can be observed as (x _ij , y _ij ) = (x ′ _ij / z ′ _ij , y ′ _ij / z ′ _ij ). However, since λ _ij cannot be obtained directly from the image coordinate values and cannot be separated into a component that depends on time series (suffix i) and a component that does not depend on time series (suffix j), A2) On the left side. Although the right side of Expression (A2) is composed of four matrices, the three rotation matrices can be expanded into one matrix, which can be a matrix corresponding to the camera movement on the right side of Expression (A1). As can be seen from A2) itself, information obtained from image coordinate values on the left side, information corresponding to camera motion (suffix i) and information corresponding to three-dimensional information are shown on the right side as in equation (A1). It is difficult to transform into (suffix j). In other words, the camera motion and the three-dimensional information cannot be restored by applying the factorization method based on the formula (A2).

本発明が解決しようとする課題は、式（Ａ２）に示す投影モデルにおいて、下記の特許文献の方法を利用して、解の符号などのあいまいさを考慮せず、安定してカメラ運動と３次元情報を同時に、かつ、ロバストに復元することを課題とする。 The problem to be solved by the present invention is that the projection model shown in the formula (A2) uses the method of the following patent document, and does not take into account the ambiguity such as the sign of the solution and stably The problem is to restore the dimension information simultaneously and robustly.

特許文献「特開２００３−２７１９２５、宮川，小澤，若林，有川：“全方位カメラ視点運動並びに物体形状復元方法、装置、全方位カメラ視点運動並びに物体形状復元方法プログラム、及び、該プログラムを記録した記録媒体”」 Patent document “Japanese Patent Laid-Open No. 2003-271925, Miyagawa, Ozawa, Wakabayashi, Arikawa:“ Omnidirectional camera viewpoint movement and object shape restoration method and apparatus, omnidirectional camera viewpoint movement and object shape restoration method program, and recording the program recoding media""

（原理的な説明）
本発明は、カメラ振動を回転運動の小さな変化であるとして、式（Ａ２）においてカメラ運動に対して制限をつけ、その制限された回転運動のカメラ運動で撮像した投影モデルを前提とする。すなわち、式（Ａ２）においてｉ＝１，２，…，Ｆ；ｊ＝１，２，…，Ｐに行列展開した投影モデルにおいて、ロール回転、ピッチ回転が微小なとき、すなわち、ｃｏｓψ≒１，ｓｉｎψ≒ψ，ｃｏｓω≒１，ｓｉｎω≒ωとして、変形し整理すると、 (Principle explanation)
The present invention presupposes a projection model in which camera vibration is a small change in rotational motion, the camera motion is limited in equation (A2), and an image is captured with the camera motion of the limited rotational motion. That is, in the projection model in which the matrix expansion is performed with i = 1, 2,..., F; j = 1, 2,..., P in equation (A2), when the roll rotation and pitch rotation are very small, that is, cos ψ≈1, When sin ψ≈ψ, cos ω≈1, sin ω≈ω,

となる。この投影モデルでは、式（Ａ３）右辺を見るとわかるように、右辺左側の行列は時系列に依存する情報（カメラの運動に関する情報）であり、右辺右側の行列は時系列に依存しない不変な情報（外界の３次元情報）となっている。これは前記の特許文献で因子分解される平面運動と３次元情報の形態となっている。一方、式（Ａ３）左辺は画像から得られる情報以外に、カメラ運動と３次元情報に関する情報がかかわっており、上記のカメラ運動を制限した場合においても、投影モデルは煩雑すぎて前記の特許文献を安易に応用して、カメラ運動と３次元情報を同時に復元することはできない。しかし、画像で観測した画像座標値（ｘ_ij，ｙ_ij）から式（Ａ３）左辺のように変換ができれば、前記の特許文献の手法によりカメラ運動と３次元形状を復元することができる。 It becomes. In this projection model, as can be seen from the right side of Expression (A3), the matrix on the left side of the right side is information that depends on the time series (information on camera motion), and the matrix on the right side of the right side is invariant that does not depend on the time series. It is information (three-dimensional information of the outside world). This is in the form of plane motion and three-dimensional information factorized in the above-mentioned patent document. On the other hand, the left side of equation (A3) is related to information about camera motion and three-dimensional information in addition to the information obtained from the image. Even when the above-mentioned camera motion is limited, the projection model is too complicated and the above-mentioned patent document. Cannot be restored easily to restore camera motion and 3D information at the same time. However, if the image coordinate values (x _ij , y _ij ) observed in the image can be converted as in the left side of equation (A3), the camera motion and the three-dimensional shape can be restored by the method of the above-mentioned patent document.

本発明では、式（Ａ３）の投影モデルに基づき、ロール回転ω_i、ピッチ回転ψ_i、ヨー回転θ_i、ＸＹＺ並進運動（Ｔｘ_i，Ｔｙ_i，Ｔｚ_i）；ｉ＝１，２，…，Ｆ、並びに、外界の３次元情報（Ｘ_j，Ｙ_j，Ｚ_j）；ｊ＝１，２，…，Ｐ（以降、３次元情報とは、Ｐ個の特徴点に関する３次元座標値を指す）を復元する。式（Ａ３）を解くために、本発明における漸近行列入力ステップにおいて式（Ａ３）左辺の行列要素を反復的に生成し、本発明における平面運動・３次元情報復元ステップにおいて前記の特許文献の手法を利用して平面運動と３次元情報を復元する。 In the present invention, roll rotation ω _i , pitch rotation ψ _i , yaw rotation θ _i , XYZ translational motion (Tx _i , Ty _i , Tz _i ); i = 1, 2,. , F, and the external three-dimensional information (X _j , Y _j , Z _j ); j = 1, 2,..., P (hereinafter, the three-dimensional information is a three-dimensional coordinate value for P feature points. Restore). In order to solve the equation (A3), the matrix element on the left side of the equation (A3) is repeatedly generated in the asymptotic matrix input step in the present invention, and the method of the above-mentioned patent document is performed in the plane motion / three-dimensional information restoration step in the present invention. To restore plane motion and 3D information.

本発明はカメラ運動を平面運動へ漸近させることで、カメラ振動により揺れるカメラ運動を平面運動へ安定化させる。つまり、次の反復での式（Ａ３）左辺の行列要素を本発明における漸近行列入力ステップにおいて、平面運動以外のピッチ、ロール回転（光軸回転以外の回転）と光軸並進運動を使って、観測した画像座標値から平面運動で投影した画像座標値へ変換する。この変換はカメラ運動を平面運動に安定化させることと等価である。本発明における安定化処理ステップでは安定化のための必要な変換係数、並びに、平面運動以外のピッチ、ロール回転（光軸回転以外の回転）と光軸並進運動を求めており、安定化が進むことで、本発明の平面運動・３次元情報復元ステップにおいて平面運動と３次元情報を高精度に求められる。 The present invention stabilizes the camera motion that is shaken by the camera vibration to the planar motion by making the camera motion asymptotic to the planar motion. That is, in the asymptotic matrix input step in the present invention, the matrix element on the left side of the formula (A3) in the next iteration uses a pitch other than plane motion, roll rotation (rotation other than optical axis rotation) and optical axis translational motion, The observed image coordinate values are converted into image coordinate values projected by plane motion. This transformation is equivalent to stabilizing the camera motion to a planar motion. In the stabilization processing step according to the present invention, necessary conversion coefficients for stabilization, pitch other than plane motion, roll rotation (rotation other than optical axis rotation) and optical axis translational motion are obtained, and stabilization proceeds. Thus, the plane motion and the three-dimensional information can be obtained with high accuracy in the plane motion / three-dimensional information restoration step of the present invention.

このように、本発明では、前記の特許文献の手法を反復的に利用して段階的に復元処理を繰り返し行うことで、近似的に、式（Ａ３）の投影モデルに基づいたカメラ運動と３次元情報を復元することを可能としている。 As described above, according to the present invention, the reconstruction process is repeatedly performed stepwise using the method of the above-described patent document, so that the camera motion based on the projection model of Expression (A3) is approximately 3 It is possible to restore the dimension information.

以上のことから、本発明は、以下の装置、方法およびプログラムを特徴とする。 As described above, the present invention is characterized by the following apparatuses, methods, and programs.

（装置の発明）
（１）時系列画像中において、対象とする画像に配置した特徴点に関する画像座標値の時間的変化量から、時系列におけるカメラ視点の運動、並びに、外界の物体形状を構成する３次元情報を復元する装置であって、
時系列画像に設定した特徴点座標系において、各フレーム画像における特徴点の画像座標値（観測座標値）を入力し、その観測座標値にカメラの回転運動、光軸座標値、並進運動、並びに、３次元情報からなる係数をかけた座標値を要素とする漸近行列データを生成する漸近行列生成手段と、
前記漸近行列データを特異値分解し、雑音除去を行って運動情報を表す行列データと３次元情報を表す行列データを得て、その運動情報において、運動を規定するために設定した条件を満足する変換行列を求め、運動情報を表す行列データに前記変換行列を作用させてカメラ視点に関する光軸周りの回転運動と光軸と垂直な平面上の並進運動（これらの成分からなる自由度３の平面運動）を復元し、並びに、３次元情報を表す行列データに前記変換行列の逆行列を作用させて物体形状を構成する３次元情報を復元する平面運動・３次元情報復元手段と、
前記平面運動・３次元情報復元手段で得た平面運動と３次元情報から算出する再投影誤差と、前記漸近行列生成手段で得た観測座標値に、係数εと光軸座標値で変換した座標値を行列要素とする行列データを求め、その行列データを特異値分解して雑音を除去した行列データと、前記平面運動・３次元情報復元手段で得た各特徴点のＺ座標値（Ｚ方向を鉛直方向にしたときの特徴点位置の高さ）を要素とする行列から、カメラ視点の光軸方向の並進運動を復元し、その復元した光軸並進運動により係数δを更新する光軸運動復元手段と、
前記漸近行列生成手段で得た観測座標値と、前記平面運動・３次元情報復元手段で得た平面運動と３次元情報から得る再投影誤差に係数δで変換した座標値の間の誤差を求め、その誤差を行列要素とする行列データと、前記平面運動・３次元情報復元手段で得た平面運動、３次元情報、並びに、前記光軸運動復元手段で得た光軸並進運動から、光軸以外の互いに直交する軸周りの回転運動を求め、その復元した回転運動により係数εと光軸座標値を更新する回転運動復元手段と、
前記漸近行列生成手段で得た観測座標値を前記回転運動復元手段で更新された変換係数（係数ε）と光軸座標値、並びに、前記光軸運動復元手段で更新された変換係数（係数δ）を使って変換した座標値と、前記平面運動・３次元情報復元手段で得た平面運動と３次元情報から各フレーム画像に対する特徴点の再投影座標値との間で平面運動への漸近値を表す漸近誤差を求め、この漸近誤差の増減により前記光軸並進運動復元手段による処理と前記回転運動復元手段における処理を切り替えて該漸近誤差を減少（カメラ運動を平面運動へ漸近）させる処理を繰り返す安定化処理手段と、
を有することを特徴とする。 (Invention of the device)
(1) In a time-series image, the movement of the camera viewpoint in the time series and the three-dimensional information constituting the object shape of the outside world are obtained from the temporal change amount of the image coordinate values regarding the feature points arranged in the target image. A device to restore,
In the feature point coordinate system set for the time series image, the image coordinate value (observation coordinate value) of the feature point in each frame image is input, and the rotation coordinate of the camera, the optical axis coordinate value, the translational motion, and Asymptotic matrix generation means for generating asymptotic matrix data whose elements are coordinate values multiplied by a coefficient consisting of three-dimensional information;
The asymptotic matrix data is subjected to singular value decomposition, noise removal is performed to obtain matrix data representing motion information and matrix data representing three-dimensional information, and the motion information satisfies the conditions set for defining the motion. A transformation matrix is obtained, and the transformation matrix is applied to the matrix data representing the motion information, so that the rotational motion around the optical axis with respect to the camera viewpoint and the translational motion on a plane perpendicular to the optical axis (a plane with three degrees of freedom consisting of these components). A plane motion / three-dimensional information restoring means for restoring the three-dimensional information constituting the object shape by applying an inverse matrix of the transformation matrix to the matrix data representing the three-dimensional information.
The plane motion obtained by the plane motion / three-dimensional information restoration means, the reprojection error calculated from the three-dimensional information, and the coordinates converted by the coefficient ε and the optical axis coordinate values into the observed coordinate values obtained by the asymptotic matrix generation means Matrix data having values as matrix elements is obtained, matrix data obtained by singular value decomposition of the matrix data to remove noise, and the Z coordinate value (Z direction) of each feature point obtained by the plane motion / three-dimensional information restoration means Optical axis motion that restores the translational motion in the optical axis direction of the camera viewpoint from the matrix whose element is the height of the feature point position when the is vertically oriented), and updates the coefficient δ by the restored optical axis translational motion Recovery means,
An error is obtained between the observed coordinate value obtained by the asymptotic matrix generation means, the plane motion obtained by the plane motion / three-dimensional information restoration means, and the re-projection error obtained from the three-dimensional information by the coefficient δ. From the matrix data having the error as a matrix element, the plane motion obtained by the plane motion / three-dimensional information restoring means, the three-dimensional information, and the optical axis translation obtained by the optical axis motion restoring means, the optical axis Rotational motion restoration means for obtaining rotational motion around mutually orthogonal axes other than and updating the coefficient ε and the optical axis coordinate value by the restored rotational motion,
The observed coordinate values obtained by the asymptotic matrix generating means are converted into the conversion coefficient (coefficient ε) and the optical axis coordinate value updated by the rotational motion restoring means, and the conversion coefficient (coefficient δ) updated by the optical axis motion restoring means. ), The asymptotic value to the plane motion between the plane motion obtained by the plane motion / three-dimensional information restoration means and the reprojected coordinate value of the feature point for each frame image from the three-dimensional information. An asymptotic error is calculated, and processing for reducing the asymptotic error (camera motion asymptotically approaches planar motion) by switching between processing by the optical axis translational motion restoring means and processing in the rotational motion restoring means by increasing or decreasing the asymptotic error. Repetitive stabilization means,
It is characterized by having.

（２）上記（１）において、全方位画像に設定した特徴点座標系において、各フレーム画像における特徴点の画像座標値を入力する手段と、その画像座標値からある基準軸からの方位角（位相角）と、全方位カメラに使用されている投影方式に従って求められる光軸方向からの角（仰角）を求める手段と、前記位相角と仰角を使って座標変換した座標値を前記観測座標値として求める手段とを有することを特徴とする。 (2) In the feature point coordinate system set for the omnidirectional image in the above (1), means for inputting the image coordinate value of the feature point in each frame image, and an azimuth angle from a reference axis based on the image coordinate value ( (Phase angle), means for obtaining an angle (elevation angle) from the optical axis direction obtained according to the projection method used in the omnidirectional camera, and coordinate values obtained by coordinate conversion using the phase angle and elevation angle It has the means to obtain as, It is characterized by the above-mentioned.

（３）上記（１）において、時系列画像に設定した特徴点座標系において、各フレーム画像における特徴点の画像座標値を入力する手段と、その観測座標値を行列要素とする行列を特異値分解する手段と、この特異値の成分から運動の自由度を表す判定値を算出する手段と、この判定値がある一定値未満の場合は、平面運動と見なして光軸周りの回転とその光軸に垂直な平面上の運動からなる自由度３の平面運動と３次元情報を復元する手段と、前記判定値がある一定値以上の場合は、カメラの回転運動と並進運動からなる自由度６の運動と３次元情報を復元する手段とを有することを特徴とする。 (3) In the feature point coordinate system set in the time-series image in (1) above, means for inputting image coordinate values of feature points in each frame image, and a matrix having the observed coordinate values as matrix elements are singular values. A means for decomposing, a means for calculating a determination value representing the degree of freedom of movement from the component of this singular value, and if this determination value is less than a certain value, the rotation around the optical axis and its light Means for restoring three-dimensional information with three degrees of freedom consisting of movement on a plane perpendicular to the axis, and a degree of freedom of six consisting of rotational movement and translational movement of the camera if the determination value exceeds a certain value. And a means for restoring three-dimensional information.

（４）上記（１）において、全方位画像に設定した特徴点座標系において、各フレーム画像における特徴点の画像座標値を入力する手段と、その画像座標値からある基準軸からの方位角（位相角）と、全方位カメラに使用されている投影方式に従って求められる光軸方向からの角（仰角）を求める手段と、前記位相角と仰角を使って座標変換した座標値を前記観測座標値として求める手段と、
前記観測座標値を行列要素とする行列を特異値分解する手段と、この特異値の成分から運動の自由度を表す判定値を算出する手段と、この判定値がある一定値未満の場合は、平面運動と見なして光軸周りの回転とその光軸に垂直な平面上の運動からなる自由度３の平面運動と３次元情報を復元する手段と、前記判定値がある一定値以上の場合は、カメラの回転運動と並進運動からなる自由度６の運動と３次元情報を復元する手段と、
を有することを特徴とする。 (4) In the feature point coordinate system set in the omnidirectional image in (1) above, means for inputting the image coordinate value of the feature point in each frame image, and an azimuth angle from a reference axis (from the image coordinate value) (Phase angle), means for obtaining an angle (elevation angle) from the optical axis direction obtained according to the projection method used in the omnidirectional camera, and coordinate values obtained by coordinate conversion using the phase angle and elevation angle As a means to seek
Means for singular value decomposition of a matrix having the observed coordinate values as matrix elements, means for calculating a determination value representing the degree of freedom of movement from the component of this singular value, and when the determination value is less than a certain value, If the judgment value is greater than a certain value, it is considered to be a planar motion, means for restoring three-dimensional information with three degrees of freedom consisting of rotation around the optical axis and motion on a plane perpendicular to the optical axis. A means for restoring the three-dimensional information and the six-degree-of-freedom motion comprising the rotational motion and translational motion of the camera;
It is characterized by having.

（方法の発明）
（５）時系列画像中において、対象とする画像に配置した特徴点に関する画像座標値の時間的変化量から、時系列におけるカメラ視点の運動、並びに、外界の物体形状を構成する３次元情報を復元する方法であって、
時系列画像に設定した特徴点座標系において、各フレーム画像における特徴点の画像座標値（観測座標値）を入力し、その観測座標値にカメラの回転運動、光軸座標値、並進運動、並びに、３次元情報からなる係数をかけた座標値を要素とする漸近行列データを生成する漸近行列生成ステップと、
前記漸近行列データを特異値分解し、雑音除去を行って運動情報を表す行列データと３次元情報を表す行列データを得て、その運動情報において、運動を規定するために設定した条件を満足する変換行列を求め、運動情報を表す行列データに前記変換行列を作用させてカメラ視点に関する光軸周りの回転運動と光軸と垂直な平面上の並進運動（これらの成分からなる自由度３の平面運動）を復元し、並びに、３次元情報を表す行列データに前記変換行列の逆行列を作用させて物体形状を構成する３次元情報を復元する平面運動・３次元情報復元ステップと、
前記平面運動・３次元情報復元ステップで得た平面運動と３次元情報から算出する再投影誤差と、前記漸近行列生成ステップで得た観測座標値に、係数εと光軸座標値で変換した座標値を行列要素とする行列データを求め、その行列データを特異値分解して雑音を除去した行列データと、前記平面運動・３次元情報復元ステップで得た各特徴点のＺ座標値（Ｚ方向を鉛直方向にしたときの特徴点位置の高さ）を要素とする行列から、カメラ視点の光軸方向の並進運動を復元し、その復元した光軸並進運動により係数δを更新する光軸運動復元ステップと、
前記漸近行列生成ステップで得た観測座標値と、前記平面運動・３次元情報復元ステップで得た平面運動と３次元情報から得る再投影誤差に係数δで変換した座標値の間の誤差を求め、その誤差を行列要素とする行列データと、前記平面運動・３次元情報復元ステップで得た平面運動、３次元情報、並びに、前記光軸運動復元ステップで得た光軸並進運動から、光軸以外の互いに直交する軸周りの回転運動を求め、その復元した回転運動により係数εと光軸座標値を更新する回転運動復元ステップと、
前記漸近行列生成ステップで得た観測座標値を前記回転運動復元ステップで更新された変換係数（係数ε）と光軸座標値、並びに、前記光軸運動復元ステップで更新された変換係数（係数δ）を使って変換した座標値と、前記平面運動・３次元情報復元ステップで得た平面運動と３次元情報から各フレーム画像に対する特徴点の再投影座標値との間で平面運動への漸近値を表す漸近誤差を求め、この漸近誤差の増減により前記光軸並進運動復元ステップによる処理と前記回転運動復元ステップにおける処理を切り替えて該漸近誤差を減少（カメラ運動を平面運動へ漸近）させる処理を繰り返す安定化処理ステップと、
を有することを特徴とする。 (Invention of method)
(5) In the time-series image, the movement of the camera viewpoint in the time series and the three-dimensional information constituting the object shape of the outside world are obtained from the temporal change amount of the image coordinate value regarding the feature point arranged in the target image. A method of restoring,
In the feature point coordinate system set for the time series image, the image coordinate value (observation coordinate value) of the feature point in each frame image is input, and the rotation coordinate of the camera, the optical axis coordinate value, the translational motion, and An asymptotic matrix generating step for generating asymptotic matrix data having a coordinate value multiplied by a coefficient consisting of three-dimensional information as an element;
The asymptotic matrix data is subjected to singular value decomposition, noise removal is performed to obtain matrix data representing motion information and matrix data representing three-dimensional information, and the motion information satisfies the conditions set for defining the motion. A transformation matrix is obtained, and the transformation matrix is applied to the matrix data representing the motion information, so that the rotational motion around the optical axis with respect to the camera viewpoint and the translational motion on a plane perpendicular to the optical axis (a plane with three degrees of freedom consisting of these components). A plane motion / three-dimensional information restoring step for restoring the three-dimensional information constituting the object shape by applying an inverse matrix of the transformation matrix to the matrix data representing the three-dimensional information.
Coordinates obtained by converting the plane motion obtained in the plane motion / three-dimensional information restoration step, the reprojection error calculated from the three-dimensional information, and the observed coordinate value obtained in the asymptotic matrix generation step using the coefficient ε and the optical axis coordinate value. Matrix data having values as matrix elements is obtained, the matrix data obtained by singular value decomposition of the matrix data to remove noise, and the Z coordinate value (Z direction) of each feature point obtained in the plane motion / three-dimensional information restoration step. Optical axis motion that restores the translational motion in the optical axis direction of the camera viewpoint from the matrix whose element is the height of the feature point position when the is vertically oriented), and updates the coefficient δ by the restored optical axis translational motion A restore step,
An error between the observed coordinate value obtained in the asymptotic matrix generation step, the plane value obtained in the plane motion / three-dimensional information restoration step, and the re-projection error obtained from the three-dimensional information is obtained by a coefficient δ. From the matrix data having the error as a matrix element, the plane motion obtained in the plane motion / three-dimensional information restoration step, the three-dimensional information, and the optical axis translation obtained in the optical axis motion restoration step, the optical axis A rotational motion restoring step for obtaining rotational motion around mutually orthogonal axes other than and updating the coefficient ε and the optical axis coordinate value by the restored rotational motion,
The observed coordinate values obtained in the asymptotic matrix generation step are converted into the conversion coefficient (coefficient ε) and the optical axis coordinate value updated in the rotational motion restoration step, and the conversion coefficient (coefficient δ) updated in the optical axis motion restoration step. ), The asymptotic value to the plane motion between the plane motion obtained in the plane motion / three-dimensional information restoration step and the reprojected coordinate value of the feature point for each frame image from the three-dimensional information. An asymptotic error representing the following is obtained, and processing to reduce the asymptotic error (camera motion is asymptotic to planar motion) by switching the processing in the optical axis translational motion restoration step and the processing in the rotational motion restoration step by increasing or decreasing the asymptotic error. Repeated stabilization steps;
It is characterized by having.

（６）上記（５）において、全方位画像に設定した特徴点座標系において、各フレーム画像における特徴点の画像座標値を入力するステップと、その画像座標値からある基準軸からの方位角（位相角）と、全方位カメラに使用されている投影方式に従って求められる光軸方向からの角（仰角）を求めるステップと、前記位相角と仰角を使って座標変換した座標値を前記観測座標値として求めるステップとを有することを特徴とする。 (6) In the feature point coordinate system set in the omnidirectional image in (5) above, the step of inputting the image coordinate value of the feature point in each frame image, and the azimuth angle from a reference axis based on the image coordinate value ( Phase angle), a step of obtaining an angle (elevation angle) from the optical axis direction obtained according to the projection method used for the omnidirectional camera, and a coordinate value obtained by performing coordinate transformation using the phase angle and the elevation angle. And a step of obtaining as follows.

（７）上記（５）において、時系列画像に設定した特徴点座標系において、各フレーム画像における特徴点の画像座標値を入力するステップと、その観測座標値を行列要素とする行列を特異値分解するステップと、この特異値の成分から運動の自由度を表す判定値を算出するステップと、この判定値がある一定値未満の場合は、平面運動と見なして光軸周りの回転とその光軸に垂直な平面上の運動からなる自由度３の平面運動と３次元情報を復元するステップと、前記判定値がある一定値以上の場合は、カメラの回転運動と並進運動からなる自由度６の運動と３次元情報を復元するステップとを有することを特徴とする。 (7) In the feature point coordinate system set in the time series image in (5) above, a step of inputting image coordinate values of feature points in each frame image, and a matrix having the observed coordinate values as matrix elements are singular values. A step of decomposing, a step of calculating a determination value representing the degree of freedom of movement from the component of this singular value, and if this determination value is less than a certain value, the rotation around the optical axis and the light A step of restoring three-dimensional information with three degrees of freedom consisting of movement on a plane perpendicular to the axis, and a degree of freedom of six consisting of rotational movement and translational movement of the camera if the determination value is greater than a certain value. And a step of restoring three-dimensional information.

（８）上記（５）において、全方位画像に設定した特徴点座標系において、各フレーム画像における特徴点の画像座標値を入力するステップと、その画像座標値からある基準軸からの方位角（位相角）と、全方位カメラに使用されている投影方式に従って求められる光軸方向からの角（仰角）を求めるステップと、前記位相角と仰角を使って座標変換した座標値を前記観測座標値として求めるステップと、
前記観測座標値を行列要素とする行列を特異値分解するステップと、この特異値の成分から運動の自由度を表す判定値を算出するステップと、この判定値がある一定値未満の場合は、平面運動と見なして光軸周りの回転とその光軸に垂直な平面上の運動からなる自由度３の平面運動と３次元情報を復元するステップと、前記判定値がある一定値以上の場合は、カメラの回転運動と並進運動からなる自由度６の運動と３次元情報を復元するステップと、
を有することを特徴とする。 (8) In the feature point coordinate system set in the omnidirectional image in (5) above, the step of inputting the image coordinate value of the feature point in each frame image, and the azimuth angle from a certain reference axis from the image coordinate value ( Phase angle), a step of obtaining an angle (elevation angle) from the optical axis direction obtained according to the projection method used for the omnidirectional camera, and a coordinate value obtained by performing coordinate transformation using the phase angle and the elevation angle. And asking for steps
A step of singular value decomposition of a matrix having the observed coordinate values as matrix elements, a step of calculating a determination value representing the degree of freedom of movement from the component of the singular value, and the determination value is less than a certain value, A step of restoring three-dimensional information with three degrees of freedom consisting of rotation around the optical axis and movement on a plane perpendicular to the optical axis as if it were a plane movement, Reconstructing the three-dimensional information and the six-degree-of-freedom motion comprising the rotational motion and translational motion of the camera;
It is characterized by having.

（プログラムの発明）
上記（５）〜（８）のいずれか１項に記載のカメラ運動と３次元情報の復元法における処理手順をコンピュータで実行可能に構成したことを特徴とする。 (Invention of the program)
The processing procedure in the camera motion and three-dimensional information restoration method described in any one of (5) to (8) above is configured to be executable by a computer.

以上のとおり、本発明によれば、カメラを使って取得した時系列画像全般（移動手段を利用して撮影した車載画像、海上画像、空撮画像、屋内画像、魚眼カメラや全方位カメラ、または手動撮影した画像）から、カメラの運動（回転運動と並進運動）と対象物に関する物体形状を高精度に獲得、復元することが可能となる。 As described above, according to the present invention, all time-series images acquired using a camera (vehicle-mounted images, sea images, aerial images, indoor images, fish-eye cameras, omnidirectional cameras, Alternatively, it is possible to acquire and restore the motion of the camera (rotational motion and translational motion) and the object shape related to the object with high accuracy from the manually captured image.

手動撮影には手振れがあり、車載カメラには走行中の振動があるため、本発明は、このようなカメラ振動においても雑音にロバストに、微小なカメラ姿勢の変動（三軸周り回転運動）を復元することができる。特に、図１５でのカメラを車載した移動観測に本発明を応用した場合、ＧＰＳなどのリモートセンサを補間する精度の三軸方向の並進運動を正確に計測することが可能となる。 Since manual shooting has camera shake and the on-vehicle camera has vibration during traveling, the present invention is robust to noise even in such camera vibration, and can perform minute camera posture fluctuations (rotation around three axes). Can be restored. In particular, when the present invention is applied to the mobile observation with the camera shown in FIG. 15, it is possible to accurately measure the translational motion in the three-axis directions with the accuracy of interpolating a remote sensor such as GPS.

本発明で使用する計算は、大半が線形演算で構成されるため、コンピュータ言語での実装が容易である。 Since most of the calculations used in the present invention are composed of linear operations, implementation in a computer language is easy.

（実施形態１）
図１は請求項１等に関する基本構成図であり、図２は時系列画像データベース部などの記憶装置を必要としない、リアルタイムで処理する場合の処理構成図であり、図３は図１または図２における波線ブロック部分の処理フローである。 (Embodiment 1)
FIG. 1 is a basic configuration diagram relating to claim 1 and the like, FIG. 2 is a processing configuration diagram in the case of processing in real time that does not require a storage device such as a time-series image database unit, and FIG. 2 is a processing flow of a wavy line block portion in FIG.

本実施形態を図１〜図３により説明する。本実施形態では、時系列画像を格納する時系列画像データベース部１、そのデータベース部１から画像系列を取り出し、全フレーム間の特徴点の画像座標値を観測し、カメラ運動と３次元情報を復元するための元になる行列データ（以下、漸近行列データ）を生成する漸近行列生成部２、漸近行列データから前記の特許文献の手法を利用して平面運動と３次元情報を復元する平面運動・３次元情報復元部３、光軸回転以外の回転運動と光軸並進運動を復元し、カメラ運動を平面運動へ漸近（安定化）させるために次の反復が必要かどうかの判定を行い、反復が必要な場合は漸近行列生成部２へ漸近行列データを生成するために必要な情報を渡す安定化処理部４から構成される。この構成において、時系列画像データベース部１には、ハードディスク、ＲＡＩＤ装置、ＣＤ−ＲＯＭなどの記録媒体を利用する形態、または、ネットワークを介したリモートなデータ資源を利用する形態でもどちらでも構わない。また、図２の画像入力部１Ａは時系列画像データベース部１に代えて、データ資源をリアルタイムで得る。 This embodiment will be described with reference to FIGS. In this embodiment, a time-series image database unit 1 for storing time-series images, an image sequence is extracted from the database unit 1, image coordinate values of feature points between all frames are observed, and camera motion and three-dimensional information are restored. An asymptotic matrix generator 2 for generating matrix data (hereinafter referred to as asymptotic matrix data) to be a base for performing plane motion and three-dimensional information to restore plane motion and three-dimensional information from the asymptotic matrix data using the method of the above-mentioned patent document 3D information restoration unit 3, restores rotational motion other than optical axis rotation and optical axis translational motion, and determines whether the next iteration is necessary to make the camera motion asymptotic (stabilized) to planar motion. Is required, the stabilization processing unit 4 passes information necessary for generating asymptotic matrix data to the asymptotic matrix generation unit 2. In this configuration, the time series image database unit 1 may be in a form using a recording medium such as a hard disk, a RAID device, a CD-ROM, or a form using remote data resources via a network. Also, the image input unit 1A in FIG. 2 obtains data resources in real time instead of the time-series image database unit 1.

図１４において、本発明で復元する対象の空間中の点Ｐ_j（Ｘ_j，Ｙ_j，Ｚ_j）と、カメラの運動、すなわち、ロール回転（ω_i）、ピッチ回転（ψ_i）、ヨー回転（θ_i）、並びに、並進運動Ｔ_i（Ｔｘ_i，Ｔｙ_i，Ｔｚ_i）を説明する。図１４ではカメラと対象物（被写体）との位置関係を表しており、運動の中心は視点としており、視点を原点としたカメラ座標系ＸＹＺ、原点Ｏとした世界座標系ＸｗＹｗＺｗを設定する。説明の都合上、カメラ光軸をＺ軸方向とし、光軸に垂直な平面をＸＹ平面とする。この座標系において、カメラは、並進運動（Ｔｘ_i，Ｔｙ_i，Ｔｚ_i）で移動しながら、ロール回転（ω_i）、ピッチ回転（ψ_i）、ヨー回転（θ_i）の回転をして点Ｐ_jを観測する。像が投影される投影中心（主点）の位置（視点位置Ｔ_i）はカメラ運動の中心であり、第ｉフレームでの並進運動Ｔ_i（Ｔｘ_i，Ｔｙ_i，Ｔｚ_i）の位置とする。対象物の点Ｐ_j（Ｘ_j，Ｙ_j，Ｚ_j）はカメラにより画像面において投影中心を原点とした画像座標値（ｘ_ij，ｙ_ij）へ投影されるとする。なお、初期フレームでの視点とＯは一致しているとし、光軸はＺｗ軸と平行関係にあり、θ_iはＸとＸｗ軸との成す角とするが、一般性を損なわない。 In FIG. 14, the point P _j (X _j , Y _j , Z _j ) in the space to be restored in the present invention and the camera motion, that is, roll rotation (ω _i ), pitch rotation (ψ _i ), yaw The rotation (θ _i ) and the translational movement T _i (Tx _i , Ty _i , Tz _i ) will be described. FIG. 14 shows the positional relationship between the camera and the object (subject), the center of motion is the viewpoint, and the camera coordinate system XYZ with the viewpoint as the origin and the world coordinate system XwYwZw with the origin O are set. For convenience of explanation, the camera optical axis is the Z-axis direction, and the plane perpendicular to the optical axis is the XY plane. In this coordinate system, the camera rotates by roll rotation (ω _i ), pitch rotation (ψ _i ), and yaw rotation (θ _i ) while moving in translational motion (Tx _i , Ty _i , Tz _i ). Observe the point P _j . The position (viewpoint position T _i ) of the projection center (principal point) on which the image is projected is the center of the camera movement, and is the position of the translational movement T _i (Tx _i , Ty _i , Tz _i ) in the i-th frame. . It is assumed that the point P _j (X _j , Y _j , Z _j ) of the object is projected by the camera onto the image coordinate value (x _ij , y _ij ) with the projection center as the origin on the image plane. Note that the viewpoint in the initial frame and O coincide with each other, the optical axis is parallel to the Zw axis, and θ _i is the angle formed by the X and Xw axes, but the generality is not impaired.

まず、図１又は図２の漸近行列生成部２において、対象物を撮影した時系列画像として時系列画像データベース部１からフレーム数Ｆの画像系列を取り出す。この取り出した画像系列において特徴点追跡を行う。特徴点は従来から用いられているような以下の手順により抽出する。初期画像（画像１）の領域において（１）各画素に対する２×２のヘッセ行列を求める。次に、（２）各点の３×３近傍領域において極大点かどうか判定し、極大点以外の点を削除する（non-maxima suppression）。さらに、（３）得られた各点のヘッセ行列の固有値σ_l，σ_s（σ_s≦σ_l）を求め、σ_sが所定の許容値σ_p以上となる点を抽出する。最後に、（４）抽出した点のσ_sの大きさの順にソートし、上位の点から順番にその点（ｐ_l）より上位の点（ｐ_h）が所定の画素数σ_d以内の距離に存在するかどうかを判定し、もし、存在する場合は下位の点ｐ_lを削除する。さらに、抽出した特徴点（ｊ＝１，２，…，Ｐ）をＫＬＴ法（Kanade-Lucas-Tomasi）により画像ｉ（ｉ＝２，…，Ｆ）にわたって追跡し、画像座標値（ｘ_ij，ｙ_ij）を観測する。このようにして得られた特徴の画像座標値を式（１）に示す配列に並べた２Ｆ×Ｐの行列データ（行列データ[Ａ]）を用意する。 First, in the asymptotic matrix generation unit 2 of FIG. 1 or FIG. 2, an image sequence of the number of frames F is extracted from the time-series image database unit 1 as a time-series image obtained by photographing the object. Feature point tracking is performed on the extracted image series. The feature points are extracted by the following procedure as used conventionally. In the region of the initial image (image 1), (1) a 2 × 2 Hessian matrix for each pixel is obtained. Next, (2) it is determined whether or not the point is a local maximum in the 3 × 3 neighborhood of each point, and points other than the local maximum are deleted (non-maxima suppression). Further, (3) eigenvalues σ _l and σ _s (σ _s ≦ σ _l ) of the obtained Hessian matrix are obtained, and points where σ _s is equal to or greater than a predetermined allowable value σ _p are extracted. Finally, (4) sorting is performed in the order of the size of σ _s of the extracted points, and the point (p _h ) higher than the point (p _l ) in order from the upper point is the distance within the predetermined number of pixels σ _d If it exists, the lower point p ₁ is deleted. Further, the extracted feature points (j = 1, 2,..., P) are tracked over the image i (i = 2,..., F) by the KLT method (Kanade-Lucas-Tomasi), and the image coordinate values (x _ij , y _ij ) is observed. 2F × P matrix data (matrix data [A]) in which the image coordinate values of the features obtained in this way are arranged in the array shown in Expression (1) is prepared.

図３の観測座標値の入力（Ｓ１）から行列データ［Ａ］を漸近行列データの生成（Ｓ２）へ渡す。漸近行列データの生成（Ｓ２）では、係数ε_ij＝１，δ_ij＝１，（ζ_i，η_i）＝（０，０），Ｔｚ_i＝０と初期化し、式（２）に従って変換座標値（ｘ'_ij，ｙ'_ij）を得る。変換座標値（ｘ'_ij，ｙ'_ij）を行列要素とする２Ｆ×Ｐの式（１ａ）の漸近行列データ［Ｂ］を生成する。このとき、初期設定として、安定化モードを“回転モード”にしておく。 The matrix data [A] is transferred to the generation of asymptotic matrix data (S2) from the input of observed coordinate values (S1) in FIG. In the generation of asymptotic matrix data (S2), coefficients ε _ij = 1, δ _ij = 1, (ζ _i , η _i ) = (0, 0), Tz _i = 0 are initialized, and converted coordinates according to equation (2) A value (x ′ _ij , y ′ _ij ) is obtained. Asymptotic matrix data [B] of 2F × P expression (1a) having the converted coordinate values (x ′ _ij , y ′ _ij ) as matrix elements is generated. At this time, as an initial setting, the stabilization mode is set to the “rotation mode”.

次に、図１または図２の平面運動・３次元情報復元部３に漸近行列データが与えられると、図３の特異値分解（Ｓ３）において式（３）に示す３つの行列［Ｕ］，［Ｗ］，［Ｖ］に行列分解する。 Next, when asymptotic matrix data is given to the planar motion / three-dimensional information restoration unit 3 of FIG. 1 or FIG. 2, three matrices [U], shown in the equation (3) in the singular value decomposition (S3) of FIG. Matrix decomposition into [W] and [V].

ここで、［Ｕ］は２Ｆ×Ｐサイズの行列、［Ｗ］はＰ×Ｐサイズの対角行列、［Ｖ］はＰ×Ｐサイズの行列である。さらに、図３の雑音除去（Ｓ４）では、式（４）の第二項に示すように、ランク４以上の各行列の成分を削除する。 Here, [U] is a 2F × P size matrix, [W] is a P × P size diagonal matrix, and [V] is a P × P size matrix. Further, in the noise removal (S4) of FIG. 3, as shown in the second term of the equation (4), the components of each matrix of rank 4 or higher are deleted.

この削除のときは、行列［Ｕ］を取り出し、この行列の要素において第４から第Ｐ列目までを削除し、残りの成分からなる行列を保持し、行列［Ｗ］を取り出し、この行列の要素において第４から第Ｐ行目並びに第４から第Ｐ列目までを削除し、残りの成分からなる行列を保持し、行列［Ｖ］を取り出し、この行列の要素において第４から第Ｐ行目までを削除し、残りの成分からなる行列をそれぞれ保持する。この雑音除去は、式（５）に示すようになる。 At the time of this deletion, the matrix [U] is taken out, the fourth to Pth columns are deleted from the elements of this matrix, the matrix consisting of the remaining components is held, the matrix [W] is taken out, The fourth to Pth rows and the fourth to Pth columns are deleted in the element, the matrix composed of the remaining components is retained, the matrix [V] is extracted, and the fourth to Pth rows are extracted from the elements of this matrix. Delete up to the eyes and keep the matrix of the remaining components. This noise removal is as shown in equation (5).

次に、第４から第Ｐ行目並びに第４から第Ｐ列目までを削除した行列［Ｗ］の対角要素の平方をとった行列から、式（６）、（７）に示す行列［Ｕ’］と行列［Ｖ’］を得る。 Next, from the matrix obtained by taking the square of the diagonal elements of the matrix [W] from which the 4th to Pth rows and the 4th to Pth columns have been deleted, the matrix shown in Equations (6) and (7) [ U ′] and matrix [V ′] are obtained.

図３の変換行列算出（Ｓ５）では、保持してある行列［Ｕ’］を取り出し、式（８）〜（１０）で得られる値を行列要素にもつ式（１１）の行列［Ｄ］を準備し、この行列［Ｄ］と式（１２）に示す計算を行い、値ａ，ｂ，ｃ，ｄ，ｅ，ｆを求める。なお、式（１２）の右辺の最後の行列は１の値を上から２Ｆ個、続けて０の値をＦ個並べた３Ｆ×１の列ベクトルである。値ａ，ｂ，ｃ，ｄ，ｅ，ｆを式（１３）に示す要素に入れた行列［Ｃ］を用意し、この行列［Ｃ］を式（１４）に示すように固有値分解する。ここで、固有値行列の平方と固有値行列から、式（１５）の行列［Ｃ’］を生成し、この行列要素を成分にもつ行列［Ｑ］を式（１６）に従って算出する。 In the transformation matrix calculation (S5) of FIG. 3, the held matrix [U ′] is taken out, and the matrix [D] of Expression (11) having values obtained by Expressions (8) to (10) as matrix elements is obtained. Prepare and perform the calculation shown in the matrix [D] and the equation (12) to obtain the values a, b, c, d, e, and f. Note that the last matrix on the right side of Expression (12) is a 3F × 1 column vector in which 2 values from the top are 1F, and F values from 0 are subsequently arranged. A matrix [C] in which the values a, b, c, d, e, and f are put in the elements shown in the equation (13) is prepared, and the matrix [C] is subjected to eigenvalue decomposition as shown in the equation (14). Here, a matrix [C ′] of Expression (15) is generated from the square of the eigenvalue matrix and the eigenvalue matrix, and a matrix [Q] having these matrix elements as components is calculated according to Expression (16).

次に、図３の平面運動復元（Ｓ６）では、前記の式（Ａ３）の投影モデル式に基づいて、式（Ａ３）右辺左側の行列を式（１７）で得られる行列［Ｍ’］に、式（Ａ３）右辺右側の行列を式（２０）で得られる行列［Ｓ’］とする。この対応付けにより、式（１７）の行列成分から光軸周りの回転（ヨー回転角）θ_iとＸＹ並進運動を復元し、式（２０）の行列要素からユークリッド空間での３次元情報（Ｘ，Ｙ，Ｚ）を復元する。求めた行列［Ｑ］と、保持しておいた行列［Ｕ’］から、式（１７）の行列演算により行列［Ｍ’］を計算する。行列［Ｍ’］から各フレーム（第ｉフレーム）の行列要素（ｍ_ix，ｎ_ix）または（ｍ_iy，ｎ_iy）を取り出し、式（１８）を使って、ヨー回転θ_i，ｉ＝１，２，…，Ｆを復元する。また、行列［Ｍ’］から各フレーム（第ｉフレーム）の行列要素（Ｔ_iu，Ｔ_iv）を取り出す。この（Ｔ_iu，Ｔ_iv）から、式（１９）を使って第ｉフレームにおけるユークリッド空間でのＸＹ並進運動（Ｔ_xi，Ｔ_yi），ｉ＝１，２，…，Ｆを計算する。 Next, in the plane motion restoration (S6) of FIG. 3, based on the projection model formula of the formula (A3), the matrix on the left side of the formula (A3) is changed to the matrix [M ′] obtained by the formula (17). The matrix on the right side of the right side of equation (A3) is defined as the matrix [S ′] obtained by equation (20). By this association, the rotation (yaw rotation angle) θ _i around the optical axis and the XY translational motion are restored from the matrix component of Expression (17), and the three-dimensional information (X in the Euclidean space (X , Y, Z). From the obtained matrix [Q] and the retained matrix [U ′], the matrix [M ′] is calculated by the matrix operation of Expression (17). The matrix element (m _ix , n _ix ) or (m _iy , n _iy ) of each frame (i-th frame) is extracted from the matrix [M ′], and the yaw rotation θ _i , i = 1 is obtained using equation (18). , 2, ..., F are restored. Further, the matrix element (T _iu , T _iv ) of each frame (i-th frame) is extracted from the matrix [M ′]. From this (T _iu , T _iv ), XY translational motion (T _xi , T _yi ), i = 1, 2,..., F in the Euclidean space in the i-th frame is calculated using equation (19).

一方、図３の３次元情報復元（Ｓ７）では、先に保持しておいた行列［Ｖ’］と、変換行列算出（Ｓ５）で得られた行列［Ｑ］から、式（２０）に示す行列演算を行い、行列［Ｓ’］を求める。次に、行列［Ｓ’］の要素に対して、式（２１）に示す計算を行い、これを要素とする行列を［Ｐ］とする。行列を［Ｐ］の列ベクトルは、それぞれ第ｊ番目の特徴点のユークリッド空間での３次元座標値（Ｘ_j，Ｙ_j，Ｚ_j）になっている。 On the other hand, in the three-dimensional information restoration (S7) in FIG. 3, the matrix [V ′] previously held and the matrix [Q] obtained by the transformation matrix calculation (S5) are shown in Expression (20). Matrix operation is performed to obtain a matrix [S ′]. Next, the calculation shown in Expression (21) is performed on the elements of the matrix [S ′], and the matrix having these elements as [P]. The column vector of the matrix [P] is a three-dimensional coordinate value (X _j , Y _j , Z _j ) in the Euclidean space of the jth feature point.

次に、図１又は図２の安定化処理部４では、図３のチェック（Ｓ８）により、安定化モードが回転モードか光軸並進モードのどちらであるかをチェックし、異なる処理を行う。もし、安定化モードが回転モードのときは図３の回転運動安定化（Ｓ９）に、安定化モードが光軸並進モードのときは図３の光軸並進運動安定化（Ｓ１０）に処理を進める。 Next, the stabilization processing unit 4 shown in FIG. 1 or 2 checks whether the stabilization mode is the rotation mode or the optical axis translation mode by performing the check (S8) in FIG. 3, and performs different processing. If the stabilization mode is the rotation mode, the process proceeds to rotational motion stabilization (S9) in FIG. 3, and if the stabilization mode is the optical axis translation mode, the process proceeds to optical axis translational motion stabilization (S10) in FIG. .

回転運動安定化（Ｓ９）では、係数δ_ijが既知としてピッチ回転とロール回転を求める。この処理では、画像系列から観測した画像座標値（ｘ_ij，ｙ_ij）と、平面運動・３次元情報復元部３で得られる再投影座標値（Ｕ_ij，Ｖ_ij）の間の誤差が、ピッチ回転とロール回転から発生する誤差であるとしている。これを数式で表現すると、 In the rotational motion stabilization (S9), pitch rotation and roll rotation are obtained assuming that the coefficient δ _ij is known. In this processing, an error between the image coordinate values (x _ij , y _ij ) observed from the image series and the reprojection coordinate values (U _ij , V _ij ) obtained by the plane motion / three-dimensional information restoration unit 3 is The error is caused by pitch rotation and roll rotation. If this is expressed in mathematical formulas,

となる。これは式（Ａ３）左辺の行列要素で表現すると、 It becomes. This can be expressed by the matrix element on the left side of equation (A3).

となる。式（Ａ８）左辺を式（Ａ４），（Ａ６）を使って展開すると、 It becomes. When the left side of Expression (A8) is expanded using Expressions (A4) and (A6),

となる。行列［Ｒ_i］，［Ａ_i］は式（２２），（２２ａ）〜（２２ｄ），（２３）である。したがって、式（Ａ９）の関係から、ピッチ回転ψ_i、ロール回転ω_iは、式（２４）の計算で求められる。 It becomes. The matrices [R _i ] and [A _i ] are the expressions (22), (22a) to (22d), and (23). Therefore, the pitch rotation ψ _i and the roll rotation ω _i are obtained by the calculation of the equation (24) from the relationship of the equation (A9).

以上の計算についての安定化処理（Ｓ９）、（Ｓ１０）の詳細な処理フローが図４および図５である。図４に示す回転運動安定化（Ｓ９）では、式（１）に示すデータ形式の行列データ［Ａ］の各行列要素と、係数δ_ij、並びに、上記で復元した平面運動・３次元情報復元部３による再投影座標値の算出（Ｓ２２）と３次元情報復元（Ｓ２３）から、前記の式（Ａ７）に示す誤差（Δｘ_ij，Δｙ_ij）を計算する（Ｓ２１）。さらに、式（２２ａ）〜（２２ｄ）の値を計算し（Ｓ２４）、これを要素とする式（２２）の行列［Ｒ_i］を準備する（Ｓ２５）。一方、誤差（Δｘ_ij，Δｙ_ij）を行列要素とする式（２３）の行列［Ａ_i］を準備し（Ｓ２６）、式（２４）の演算を行って、ロール回転ω_i、ピッチ回転ψ_i、を復元する（Ｓ２７）。これを全フレーム（ｉ＝１，２，…，Ｆ）にわたり求める。このとき、同時に、式（２５）の計算により各フレームでの（ζ_i，η_i），ｉ＝１，２，…，Ｆを求め、以前の（ζ_i，η_i），ｉ＝１，２，…，Ｆを更新すると共に、式（Ａ６）により求めたピッチ回転ψ_i、ロール回転ω_iを代入して係数ε_ijを更新する（Ｓ２８）。 FIG. 4 and FIG. 5 show detailed processing flows of the stabilization processing (S9) and (S10) for the above calculation. In the rotational motion stabilization (S9) shown in FIG. 4, each matrix element of the matrix data [A] in the data format shown in Expression (1), the coefficient δ _ij , and the planar motion / three-dimensional information restoration restored above. The error (Δx _ij , Δy _ij ) shown in the above equation (A7) is calculated from the reprojection coordinate value calculation (S22) and the three-dimensional information restoration (S23) by the unit 3 (S21). Further, the values of the equations (22a) to (22d) are calculated (S24), and the matrix [R _i ] of the equation (22) having these as elements is prepared (S25). On the other hand, a matrix [A _i ] of Expression (23) having errors (Δx _ij , Δy _ij ) as matrix elements is prepared (S26), and the calculation of Expression (24) is performed to perform roll rotation ω _i and pitch rotation ψ. _i is restored (S27). This is obtained over all frames (i = 1, 2,..., F). At the same time, (ζ _i , η _i ), i = 1, 2,..., F are obtained for each frame by the calculation of equation (25), and the previous (ζ _i , η _i ), i = 1, 2,..., F are updated, and the coefficient ε _ij is updated by substituting the pitch rotation ψ _i and roll rotation ω _i obtained by the equation (A6) (S28).

図３に示す光軸並進運動安定化Ｓ１０では、係数ε_ij、並びに、光軸座標値（ζ_i，η_i）が既知として光軸並進運動Ｔｚ_iを求める。このとき、式（Ａ３）左辺の行列要素は、 In the optical axis translational motion stabilization S10 shown in FIG. 3, the optical axis translational motion Tz _i is obtained on the assumption that the coefficient ε _ij and the optical axis coordinate values (ζ _i , η _i ) are known. At this time, the matrix element on the left side of Expression (A3) is

となる。式（Ａ１０）は座標値（ｕ'_ij，ｖ'_ij）の（１−Ｔｚ_i／Ｚ_j）倍が再投影座標値（ｕ_ij，ｖ_ij）となる形式である。この関係を利用して光軸並進運動Ｔｚ_iを求める。そこで、式（Ａ１０）を、 It becomes. Expression (A10) is a format in which (1-Tz _i / Z _j ) times the coordinate value (u ′ _ij , v ′ _ij ) is the reprojected coordinate value (u _ij , v _ij ). Using this relationship, the optical axis translational motion Tz _i is obtained. Therefore, the equation (A10) is

と整理し、全フレームｉ＝１，２，…，Ｆ、並びに、全特徴点ｊ＝１，２，…，Ｐに対して行列展開すると、 And matrix expansion for all frames i = 1, 2,..., F and all feature points j = 1, 2,.

となる（ただし、Δｗ_ijは式（２６）である）。したがって、式（Ａ１３）に示す連立方程式を求めると光軸並進運動を復元することができる。 (Where Δw _ij is Equation (26)). Therefore, the optical axis translational motion can be restored by obtaining simultaneous equations shown in Formula (A13).

図５に示す光軸並進運動安定化（Ｓ１０）の詳細な処理フローでは、まず、観測座標値の入力から座標値（ｘ_ij，ｙ_ij）が与えられ、係数ε_ijと光軸座標値（ζ_i，η_i）を使って式（Ａ１１）の座標値（ｕ'_ij，ｖ'_ij）を生成する。この座標値（ｕ'_ij，ｖ'_ij）と平面運動・３次元情報復元部３からの平面運動と３次元情報から求めた再投影座標値（ｕ_ij，ｖ_ij）から式（２６）のΔｗ_ijを計算し（Ｓ３１，Ｓ３２）、これを行列要素とするＦ×Ｐの式（２７）の行列［ΔＷ］を準備する（Ｓ３３）。このとき、式（Ａ１３）左辺を見ると分かるようにランクは１である。それに対して、式（Ａ１３）右辺の行列［ΔＷ］はＦ×Ｐとなっている。そこで、式（２８）に示すように特異値分解を行い、３つの行列、すなわち、Ｆ×Ｐの［Ｕ_w］，Ｐ×Ｐの［Ｗ_w］，Ｐ×Ｐの［Ｖ_w］に分解する（Ｓ３４）。 In the detailed processing flow of optical axis translational stabilization (S10) shown in FIG. 5, first, coordinate values (x _ij , y _ij ) are given from the input of observed coordinate values, and coefficient ε _ij and optical axis coordinate values ( The coordinate values (u ′ _ij , v ′ _ij ) of the equation (A11) are generated using ζ _i , η _i ). From the coordinate values (u ′ _ij , v ′ _ij ) and the re-projection coordinate values (u _ij , v _ij ) obtained from the plane motion and the three-dimensional information from the plane motion / three-dimensional information restoration unit 3, Δw _ij is calculated (S31, S32), and a matrix [ΔW] of Formula (27) of F × P using this as a matrix element is prepared (S33). At this time, the rank is 1 as can be seen from the left side of the formula (A13). On the other hand, the matrix [ΔW] on the right side of Expression (A13) is F × P. Therefore, singular value decomposition is performed as shown in Expression (28), and decomposition into three matrices, that is, F × P [U _w ], P × P [W _w ], and P × P [V _w ]. (S34).

次いで、雑音除去（Ｓ３５）では、ランク１以上の各行列の成分を削除する。この削除のときは、行列［Ｕ_w］を取り出し、この行列の要素において第２から第Ｐ列目までを削除し、残りの成分からなる行列を保持し（行列［Ｕ'_w］）、行列［Ｗ_w］を取り出し、この行列の要素において第２から第Ｐ行目並びに第２から第Ｐ列目までを削除し、残りの成分からなる行列を保持し（行列［Ｗ'_w］）、行列［Ｖ_w］を取り出し、この行列の要素において第２から第Ｐ行目までを削除し、残りの成分からなる行列をそれぞれ保持（行列［Ｖ'_w］）し、式（２８ａ）に示す行列演算を行い、これを行列［ΔＷ］として保持する（Ｓ３６）。さらに、３次元情報復元から特徴点のＺ値の逆数からなる式（３０）の行列［１／Ｚ］を準備し（Ｓ３７）、式（２９）の計算を行い、光軸並進運動Ｔｚ_iを求める（Ｓ３８）。ここで、式（Ａ６）により求めた光軸並進運動を使って係数δ_ijを更新しておく（Ｓ３９）。 Next, in noise removal (S35), the components of each matrix of rank 1 or higher are deleted. At the time of this deletion, the matrix [U _w ] is taken out, the second to Pth columns are deleted from the elements of this matrix, the matrix consisting of the remaining components is held (matrix [U ′ _w ]), and the matrix [W _w ] is taken out, the 2nd to Pth rows and the 2nd to Pth columns are deleted from the elements of this matrix, and a matrix consisting of the remaining components is held (matrix [W ′ _w ]). The matrix [V _w ] is taken out, the second to P-th rows are deleted from the elements of this matrix, and the matrices composed of the remaining components are held (matrix [V ′ _w ]), respectively, as shown in equation (28a) Matrix operation is performed and held as a matrix [ΔW] (S36). Further, a matrix [1 / Z] of Expression (30) consisting of the reciprocal of the Z value of the feature point is prepared from the three-dimensional information restoration (S37), and the calculation of Expression (29) is performed to calculate the optical axis translational motion Tz _i . Obtain (S38). Here, the coefficient δ _ij is updated using the optical axis translation obtained by the equation (A6) (S39).

以上、安定化モードが回転モードか光軸並進モードにより、上記の処理に振り分けて、ピッチ回転とロール回転、並びに、光軸並進運動を求める。 As described above, depending on whether the stabilization mode is the rotation mode or the optical axis translation mode, the pitch rotation, the roll rotation, and the optical axis translational motion are obtained by distributing to the above processing.

図３に戻って、この時点での係数ε_ij、光軸座標値（ζ_i，η_i）、係数δ_ijを使って、式（２）の計算により座標値（ｘ'_ij，ｙ'_ij）を得て、式（３１）に示す漸近誤差ΔＥを求める（Ｓ１１）。この漸近誤差ΔＥが前のΔＥより減少している場合は、漸近行列データの生成に戻り、更新した係数ε_ij、光軸座標値（ζ_i，η_i）、係数δ_ijを使って得た式（２）の座標値（ｘ’_ij，ｙ’_ij）を新たな行列要素とする漸近行列［Ｂ］を得て、平面運動・３次元情報復元の処理を続ける（Ｓ１２）。 Returning to FIG. 3, using the coefficient ε _ij , the optical axis coordinate values (ζ _i , η _i ), and the coefficient δ _ij at this time, the coordinate values (x ′ _ij , y ′ _ij ) are calculated by the equation (2). ) To obtain an asymptotic error ΔE shown in equation (31) (S11). When this asymptotic error ΔE is smaller than the previous ΔE, the process returns to the generation of asymptotic matrix data, and is obtained using the updated coefficient ε _ij , optical axis coordinate values (ζ _i , η _i ), and coefficient δ _ij . An asymptotic matrix [B] having the coordinate values (x ′ _ij , y ′ _ij ) of Expression (2) as a new matrix element is obtained, and the process of plane motion / three-dimensional information restoration is continued (S12).

一方、この漸近誤差ΔＥが前のΔＥより増加した場合は、安定化モードを切り替える（Ｓ１３，Ｓ１４）。すなわち、回転モードから光軸並進モードへ、または、光軸並進モードから回転モードへ切り替えて、漸近行列データの生成へ処理を進める。なお、安定化モードの切替回数が最初から数えてＮ回を超えた時点で漸近誤差ΔＥが収束したと判断して反復処理を停止し、その時点でのカメラ運動と３次元情報を出力して、処理を終了する。 On the other hand, when this asymptotic error ΔE increases from the previous ΔE, the stabilization mode is switched (S13, S14). That is, the process proceeds to generation of asymptotic matrix data by switching from the rotation mode to the optical axis translation mode or from the optical axis translation mode to the rotation mode. When the number of times of switching the stabilization mode exceeds N times from the beginning, it is determined that the asymptotic error ΔE has converged, and the iterative process is stopped, and the camera motion and three-dimensional information at that time are output. The process is terminated.

以上、本実施形態により、画像系列における特徴点の時間的動きから、カメラ視点の運動、すなわち、三軸周りの回転運動と三軸方向の並進運動、並びに、物体形状を構成する３次元情報を復元することができる。 As described above, according to the present embodiment, from the temporal movement of the feature points in the image series, the movement of the camera viewpoint, that is, the rotational movement around the three axes and the translational movement in the three axial directions, and the three-dimensional information constituting the object shape are obtained. Can be restored.

（実施形態２）
全方位カメラはロボットビジョンや移動観測において利用されており、図１５に示すように車両の屋根に搭載し、光軸を天空方向に向けて設置される。図１５は車両による移動観測のときの模式図であり、全方位カメラにより建物壁面などの市街地景観を撮像する。移動観測では車両振動により全方位カメラが変動しており、本実施形態は、このような車両振動において揺れた画像系列からカメラ運動と３次元情報を復元する。 (Embodiment 2)
The omnidirectional camera is used in robot vision and mobile observation, and is installed on the roof of a vehicle as shown in FIG. 15 and installed with the optical axis facing the sky. FIG. 15 is a schematic diagram at the time of moving observation by a vehicle, and an urban landscape such as a building wall surface is imaged by an omnidirectional camera. In the moving observation, the omnidirectional camera is fluctuated due to the vehicle vibration, and the present embodiment restores the camera motion and the three-dimensional information from the image sequence shaken by the vehicle vibration.

図６は、請求項２等に関する基本構成図である。本実施形態ではカメラを全方位カメラとした場合の実施形態であり、実施形態１に対して、時系列画像データベース部１から取り出した画像において、特徴点観測部５により得た特徴点の画像座標値を座標変換部６で変換した座標値を実施形態１での特徴点の画像座標値とする点だけが異なるため、この部分だけについて説明する。 FIG. 6 is a basic configuration diagram relating to claim 2 and the like. In the present embodiment, the camera is an omnidirectional camera. Compared with the first embodiment, the image coordinates of the feature points obtained by the feature point observation unit 5 in the image extracted from the time-series image database unit 1 are described. Since only the point where the coordinate value obtained by converting the value by the coordinate conversion unit 6 is the image coordinate value of the feature point in the first embodiment is different, only this portion will be described.

本実施形態は、時系列画像データベース部には、ハードディスク、ＲＡＩＤ装置、ＣＤ−ＲＯＭなどの記録媒体を利用する、または、ネットワークを介したリモートなデータ資源を利用する形態でもどちらでも構わない。さらに、図７は、リアルタイムで処理する場合の処理構成図であり、本実施形態では必ずしも各データベース部１などの記憶装置を必要としない。 In this embodiment, the time-series image database unit may use a recording medium such as a hard disk, a RAID device, a CD-ROM, or a remote data resource via a network. Furthermore, FIG. 7 is a processing configuration diagram in the case of processing in real time, and in this embodiment, a storage device such as each database unit 1 is not necessarily required.

全方位カメラは市販のカメラと異なり広視野を撮像できるように設計されているため、全方位画像中の特徴点の画像座標値のままでは本実施形態を応用することができない。そこで、図１６に示すように、特徴点を示す画像座標値を位相角ρ_ijと仰角φ_ijを利用する。空間中の点Ｐ_j（Ｘ_j，Ｙ_j，Ｚ_j）は、画像面において画像座標値（ｘ_ij，ｙ_ij）へ投影されるとする。式（３２ａ）は焦点距離をｆとした等距離投影と呼ばれる魚眼レンズでの光学的投影であり、図１６の位相角ρ_ijは、画像座標値（ｘ_ij，ｙ_ij）から、式（３３）を使って得ることができる。ただし、図１６は、魚眼レンズを取り付けたカメラで画像を撮像する例であり、Ｒはイメージサークルの半径であり、画像面での投影中心の画像座標値を原点としている。これ以外の全方位カメラとして、放物面ミラーで反射する投影の場合は式（３２ｂ）により仰角φ_ijが得られ（ｈは放物面のｘｙ平面での半径）、双曲線ミラーで反射する全方位カメラの場合では、式（３２ｃ）により仰角φ_ijが得られる（ｂ，ｃは双曲線パラメータ、ｆは焦点距離）。なお、これらの全方位カメラでは画像面とＸＹ面が平行と考えることができるので、全般的な全方位カメラに対して式（３３）により位相角ρ_ijを得ることができる。 Since the omnidirectional camera is designed to capture a wide field of view unlike a commercially available camera, the present embodiment cannot be applied with the image coordinate values of the feature points in the omnidirectional image. Therefore, as shown in FIG. 16, the phase angle ρ _ij and the elevation angle φ _ij are used as the image coordinate values indicating the feature points. It is assumed that a point P _j (X _j , Y _j , Z _j ) in space is projected onto an image coordinate value (x _ij , y _ij ) on the image plane. Expression (32a) is optical projection with a fish-eye lens called equidistance projection with the focal length f, and the phase angle ρ _{ij in} FIG. 16 is obtained from the image coordinate values (x _ij , y _ij ) from Expression (33). Can be obtained using However, FIG. 16 is an example in which an image is captured by a camera equipped with a fisheye lens, R is the radius of the image circle, and the origin is the image coordinate value of the projection center on the image plane. As an omnidirectional camera other than this, in the case of projection reflected by a parabolic mirror, the elevation angle φ _ij is obtained by the equation (32b) (h is the radius of the paraboloid in the xy plane), and all reflected by the hyperbolic mirror. In the case of the azimuth camera, the elevation angle φ _ij is obtained by the equation (32c) (b and c are hyperbolic parameters, and f is a focal length). In these omnidirectional cameras, since the image plane and the XY plane can be considered parallel, the phase angle ρ _ij can be obtained from the general omnidirectional camera by Expression (33).

図６の特徴点観測部５において、特徴点の画像座標値から位相角と仰角を得ると、次に、座標変換部６において、 When the phase angle and the elevation angle are obtained from the image coordinate value of the feature point in the feature point observation unit 5 in FIG.

なる座標変換を行い、座標値（ｐ_ij，ｑ_ij）を求める。 To obtain coordinate values (p _ij , q _ij ).

この座標変換の効果を図１７、図１８で説明する。図１７は全方位カメラ（魚眼など）で移動観測したときに取得した画像の例である。この図のように全方位画像では建物の壁に位置する水平、垂直のエッジが、本来、水平と垂直のエッジが直線として投影されるべきところを、曲線として投影される。これに対して、式（Ａ１４）で座標変換すると、図１８のように、水平と垂直のエッジを線分として得ることができる。この座標変換は全方位投影から透視投影への座標変換である。 The effect of this coordinate transformation will be described with reference to FIGS. FIG. 17 is an example of an image acquired when moving and observing with an omnidirectional camera (fisheye or the like). As shown in this figure, in the omnidirectional image, the horizontal and vertical edges located on the wall of the building are projected as curves where the horizontal and vertical edges should be projected as straight lines. On the other hand, if the coordinates are converted by the equation (A14), horizontal and vertical edges can be obtained as line segments as shown in FIG. This coordinate transformation is a coordinate transformation from omnidirectional projection to perspective projection.

座標変換部６では、式（Ａ１４）で得た（ｐ_ij，ｑ_ij）を（ｘ_ij，ｙ_ij）と見なし、この（ｘ_ij，ｙ_ij）を特徴点の画像座標値（観測座標値）として、図６の漸近行列生成部２にて扱う。 The coordinate conversion unit 6 regards (p _ij , q _ij ) obtained by the equation (A14) as (x _ij , y _ij ), and uses this (x _ij , y _ij ) as the image coordinate value (observed coordinate value) of the feature point. ) In the asymptotic matrix generation unit 2 in FIG.

以上、本発明の実施形態により、全方位画像系列における特徴点の時間的動きから、カメラ視点の運動、すなわち、三軸周りの回転運動と三軸方向の並進運動、並びに、物体形状を構成する３次元情報を復元することができる。 As described above, according to the embodiment of the present invention, the motion of the camera viewpoint, that is, the rotational motion around the three axes and the translational motion in the three axial directions, and the object shape are configured from the temporal motion of the feature points in the omnidirectional image sequence. Three-dimensional information can be restored.

（実施形態３）
図８は本実施形態の処理構成図であり、実施形態１に対して復元処理判定部７の処理が異なるため、以降ではこの部分の説明だけにする。この構成において、時系列画像データベース部１には、ハードディスク、ＲＡＩＤ装置、ＣＤ−ＲＯＭなどの記録媒体を利用する形態、または、ネットワークを介したリモートなデータ資源を利用する形態でもどちらでも構わない。さらに、図９はリアルタイムで処理する場合の処理構成図であり、本実施形態では必ずしも時系列画像データベース部１などの記憶装置を必要としない。 (Embodiment 3)
FIG. 8 is a process configuration diagram of the present embodiment. Since the process of the restoration process determination unit 7 is different from that of the first embodiment, only this part will be described below. In this configuration, the time series image database unit 1 may be in a form using a recording medium such as a hard disk, a RAID device, a CD-ROM, or a form using remote data resources via a network. Furthermore, FIG. 9 is a processing configuration diagram in the case of processing in real time, and in this embodiment, a storage device such as the time-series image database unit 1 is not necessarily required.

図８の復元処理判定部では、時系列画像データベース部１から取り出した画像系列に対して特徴点の画像座標値を観測し、式（１）の形式の行列［Ａ］として保持する。図１０はその後の処理フローであり、この処理フローに従って説明する。行列［Ａ］に対して、式（３）に示す３つの行列［Ｕ］，［Ｗ］，［Ｖ］に行列分解する。特異値行列［Ｗ］は対角要素であり、その各要素である特異値は昇降順の並びで、かつ、全て正の実数となっている。この特異値行列の中から式（４）に示す（３，３）の行列要素Ｗ₃₃と（４，４）の行列要素Ｗ₄₄の特異値を取り出す（Ｓ４１，Ｓ４２）。ランク検出では、式（３４ａ）または式（３４ｂ）に示す計算を行い、判定量Ｅ_wを得る（Ｓ４３）。この判定量Ｅ_wが許容値未満であるか、または、判定量Ｅ_wが許容値以上かを判定する（Ｓ４４）。この許容値は特定の一定値であり、ユーザ（または作業者）が逐次、値を設定することもできる。もし、判定量Ｅ_wがある許容値未満の場合は、カメラ運動が平面運動であると判断して処理Ａに進み（Ｓ４５，Ｓ４６）、判定量Ｅ_wがある許容値以上の場合は、カメラ運動が一般運動（平面運動以外）であると判断して処理Ｂに進む（Ｓ４７，Ｓ４８）。 In the restoration processing determination unit in FIG. 8, the image coordinate values of the feature points are observed for the image series extracted from the time-series image database unit 1, and held as a matrix [A] in the form of equation (1). FIG. 10 shows the subsequent processing flow, which will be described according to this processing flow. The matrix [A] is subjected to matrix decomposition into three matrices [U], [W], and [V] shown in Expression (3). The singular value matrix [W] is a diagonal element, and the singular values that are the respective elements are arranged in ascending / descending order and are all positive real numbers. From this singular value matrix, the singular values of the matrix element W ₃₃ of (3, 3) and the matrix element W ₄₄ of (4, 4) shown in Expression (4) are extracted (S41, S42). In rank detection, the calculation shown in Formula (34a) or Formula (34b) is performed to obtain the determination amount E _w (S43). It is determined whether this determination amount E _w is less than the allowable value or whether the determination amount E _w is greater than or equal to the allowable value (S44). This allowable value is a specific constant value, and the user (or worker) can also set the value sequentially. If less than the allowable value is determined amount E _w is camera motion proceeds to the process it is determined that the planar motion A (S45, S46), when exceeding the allowable value is determined amount E _w is the camera It is determined that the motion is a general motion (other than planar motion), and the process proceeds to process B (S47, S48).

処理Ｂとは実施形態１の処理であり、処理Ｂにより画像系列における特徴点の時間的動きから、カメラ視点の運動、すなわち、三軸周りの回転運動と三軸方向の並進運動（自由度６の運動）、並びに、物体形状を構成する３次元情報を復元する。一方、処理Ａと判定した場合、図１１にしたがって平面運動と３次元情報を復元する。この処理は、実施形態１の平面運動・３次元情報復元部の処理内容と同等であるため説明を省く。 The process B is the process of the first embodiment, and from the temporal movement of the feature points in the image series by the process B, the movement of the camera viewpoint, that is, the rotational movement around the three axes and the translational movement in the three axial directions (6 degrees of freedom). And the three-dimensional information constituting the object shape are restored. On the other hand, when it is determined as the process A, the plane motion and the three-dimensional information are restored according to FIG. Since this processing is equivalent to the processing content of the planar motion / three-dimensional information restoration unit of the first embodiment, a description thereof will be omitted.

以上の処理により、カメラ運動が平面運動の場合は、実施形態１の処理に回さずに、平面運動だけを計算するようにして、計算コストを大幅に低減させて、カメラ運動と３次元情報を復元することができる。 With the above processing, when the camera motion is a plane motion, the calculation cost is greatly reduced by calculating only the plane motion without going to the processing of the first embodiment, and the camera motion and the three-dimensional information are calculated. Can be restored.

（実施形態４）
図１２は請求項４等に関する基本構成図である。本実施形態では、実施形態２に対して、復元処理判定部７の処理のみが異なる。本実施形態は、時系列画像データベース部１には、ハードディスク、ＲＡＩＤ装置、ＣＤ−ＲＯＭなどの記録媒体を利用する、または、ネットワークを介したリモートなデータ資源を利用する形態でもどちらでも構わない。さらに、図１３はリアルタイムで処理する場合の処理構成図であり、本実施形態は必ずしも各データベース部１などの記憶装置を必要としない。 (Embodiment 4)
FIG. 12 is a basic configuration diagram relating to claim 4 and the like. In the present embodiment, only the process of the restoration process determination unit 7 is different from the second embodiment. In the present embodiment, the time-series image database unit 1 may use a recording medium such as a hard disk, a RAID device, a CD-ROM, or a remote data resource via a network. Furthermore, FIG. 13 is a processing configuration diagram in the case of processing in real time, and this embodiment does not necessarily require a storage device such as each database unit 1.

図１２の復元処理判定部７では、時系列画像データベース部１から取り出した画像系列に対して特徴点の画像座標値を観測し、式（１）の形式の行列[Ａ]として保持する。次に、各特徴点の画像座標値から位相角と仰角を全方位カメラの投影に応じて求め、式（Ａ１４）の座標変換を行い、座標値（ｐ_ij，ｑ_ij）を求める。この（ｐ_ij，ｑ_ij）を（ｘ_ij，ｙ_ij）と見なし、この（ｘ_ij，ｙ_ij）を特徴点の画像座標値（観測座標値）として、図１２の復元処理判定部７にて扱う。復元処理判定部７の処理は実施形態３の処理と同じであるため説明を省く。 In the restoration processing determination unit 7 of FIG. 12, the image coordinate values of the feature points are observed for the image series extracted from the time-series image database unit 1 and held as a matrix [A] in the form of equation (1). Next, the phase angle and the elevation angle are obtained from the image coordinate values of each feature point according to the projection of the omnidirectional camera, the coordinate transformation of equation (A14) is performed, and the coordinate values (p _ij , q _ij ) are obtained. This (p _ij , q _ij ) is regarded as (x _ij , y _ij ), and this (x _ij , y _ij ) is used as the image coordinate value (observation coordinate value) of the feature point to the restoration processing determination unit 7 in FIG. Handle. Since the process of the restoration process determination unit 7 is the same as the process of the third embodiment, a description thereof will be omitted.

以上の処理により、全方位カメラ運動が平面運動の場合は、実施形態１の処理に回さずに、平面運動だけを計算するようにして、計算コストを大幅に低減させて、カメラ運動と３次元情報を復元することができる。 By the above processing, when the omnidirectional camera motion is a plane motion, the calculation cost is greatly reduced by calculating only the plane motion without going to the processing of the first embodiment, and the camera motion and 3 Dimensional information can be restored.

なお、本発明は、図３〜図５等に示した方法の一部又は全部の処理機能をプログラムとして構成してコンピュータを用いて実行可能にすることができる。また、このプログラムを記録した記録媒体を、ネットワークを通して提供することも可能である。 In the present invention, some or all of the processing functions of the methods shown in FIGS. 3 to 5 and the like can be configured as a program and can be executed using a computer. It is also possible to provide a recording medium recording this program through a network.

請求項１等に係る画像蓄積型の復元装置の基本構成図。The basic block diagram of the image storage type decompression | restoration apparatus which concerns on Claim 1 grade | etc.,. 請求項１等に係るリアルタイム処理型の復元装置の基本構成図。A basic configuration diagram of a real-time processing type restoration apparatus according to claim 1 and the like. 請求項１等に係る復元方法の処理フロー。A processing flow of the restoration method according to claim 1 and the like. 回転安定化の処理フロー。Processing flow for rotation stabilization. 光軸並進安定化の処理フロー。Processing flow for optical axis translation stabilization. 請求項２等に係る画像蓄積型の復元装置の基本構成図。The basic block diagram of the image storage type decompression | restoration apparatus which concerns on Claim 2 grade | etc.,. 請求項２等に係るリアルタイム処理型の復元装置の基本構成図。A basic configuration diagram of a real-time processing type restoration apparatus according to claim 2 or the like. 請求項３等に係る画像蓄積型の復元装置の基本構成図。The basic block diagram of the image storage type decompression | restoration apparatus which concerns on Claim 3 grade | etc.,. 請求項３等に係るリアルタイム処理型の復元装置の基本構成図。A basic configuration diagram of a real-time processing type restoration apparatus according to claim 3 and the like. 復元処理判定部での処理フロー。The processing flow in a restoration process determination part. 処理Ａの処理フロー。Process flow of process A. 請求項４等に係る画像蓄積型の復元装置の基本構成図。The basic block diagram of the image storage type decompression | restoration apparatus based on Claim 4 grade | etc.,. 請求項４等に係るリアルタイム処理型の復元装置の基本構成図。A basic configuration diagram of a real-time processing type restoration apparatus according to claim 4 and the like. カメラと被写体の位置関係とカメラ座標系を示す図。The figure which shows the positional relationship of a camera, a to-be-photographed object, and a camera coordinate system. 全方位カメラを車両の天井に固定して移動観測する場合の図。The figure in the case of carrying out mobile observation, fixing an omnidirectional camera to the ceiling of a vehicle. 図１５における全方位カメラと被写体の位置関係とカメラ座標系を示す図。The figure which shows the positional relationship of an omnidirectional camera and a to-be-photographed object in FIG. 15, and a camera coordinate system. 全方位カメラが撮像した建物壁面のエッジラインを示す図。The figure which shows the edge line of the building wall surface which the omnidirectional camera imaged. 図１７のエッジを座標変換した結果を示す図。The figure which shows the result of having coordinate-transformed the edge of FIG.

Explanation of symbols

１時系列画像データベース部
１Ａ画像入力部
２漸近行列生成部
３平面運動・３次元情報復元部
４安定化処理部
５特徴点観測部
６座標変換部
７復元処理判定部
DESCRIPTION OF SYMBOLS 1 Time series image database part 1A Image input part 2 Asymptotic matrix production | generation part 3 Plane motion and three-dimensional information restoration part 4 Stabilization process part 5 Feature point observation part 6 Coordinate conversion part 7 Restoration process determination part

Claims

An apparatus for restoring the movement of the camera viewpoint in the time series and the three-dimensional information constituting the object shape of the outside world from the temporal change amount of the image coordinate value regarding the feature point arranged in the target image in the time series image Because
In the feature point coordinate system set for the time series image, the image coordinate value (observation coordinate value) of the feature point in each frame image is input, and the rotation coordinate of the camera, the optical axis coordinate value, the translational motion, and Asymptotic matrix generation means for generating asymptotic matrix data whose elements are coordinate values multiplied by a coefficient consisting of three-dimensional information;
The asymptotic matrix data is subjected to singular value decomposition, noise removal is performed to obtain matrix data representing motion information and matrix data representing three-dimensional information, and the motion information satisfies the conditions set for defining the motion. A transformation matrix is obtained, and the transformation matrix is applied to the matrix data representing the motion information, so that the rotational motion around the optical axis with respect to the camera viewpoint and the translational motion on a plane perpendicular to the optical axis (a plane with three degrees of freedom consisting of these components). A plane motion / three-dimensional information restoring means for restoring the three-dimensional information constituting the object shape by applying an inverse matrix of the transformation matrix to the matrix data representing the three-dimensional information.
The plane motion obtained by the plane motion / three-dimensional information restoration means, the reprojection error calculated from the three-dimensional information, and the coordinates converted by the coefficient ε and the optical axis coordinate values into the observed coordinate values obtained by the asymptotic matrix generation means Matrix data having values as matrix elements is obtained, matrix data obtained by singular value decomposition of the matrix data to remove noise, and the Z coordinate value (Z direction) of each feature point obtained by the plane motion / three-dimensional information restoration means Optical axis motion that restores the translational motion in the optical axis direction of the camera viewpoint from the matrix whose element is the height of the feature point position when the is vertically oriented), and updates the coefficient δ by the restored optical axis translational motion Recovery means,
An error is obtained between the observed coordinate value obtained by the asymptotic matrix generation means, the plane motion obtained by the plane motion / three-dimensional information restoration means, and the re-projection error obtained from the three-dimensional information by the coefficient δ. From the matrix data having the error as a matrix element, the plane motion obtained by the plane motion / three-dimensional information restoring means, the three-dimensional information, and the optical axis translation obtained by the optical axis motion restoring means, the optical axis Rotational motion restoration means for obtaining rotational motion around mutually orthogonal axes other than and updating the coefficient ε and the optical axis coordinate value by the restored rotational motion,
The observed coordinate values obtained by the asymptotic matrix generating means are converted into the conversion coefficient (coefficient ε) and the optical axis coordinate value updated by the rotational motion restoring means, and the conversion coefficient (coefficient δ) updated by the optical axis motion restoring means. ), The asymptotic value to the plane motion between the plane motion obtained by the plane motion / three-dimensional information restoration means and the reprojected coordinate value of the feature point for each frame image from the three-dimensional information. An asymptotic error is calculated, and processing for reducing the asymptotic error (camera motion asymptotically approaches planar motion) by switching between processing by the optical axis translational motion restoring means and processing in the rotational motion restoring means by increasing or decreasing the asymptotic error. Repetitive stabilization means,
A device for restoring camera motion and three-dimensional information, comprising:

In claim 1,
In the feature point coordinate system set for the omnidirectional image, the means for inputting the image coordinate value of the feature point in each frame image, the azimuth angle (phase angle) from a reference axis from the image coordinate value, and the omnidirectional camera Means for obtaining an angle (elevation angle) from the optical axis direction obtained according to the projection method used; means for obtaining a coordinate value obtained by coordinate conversion using the phase angle and elevation angle as the observation coordinate value;
A device for restoring camera motion and three-dimensional information, comprising:

In claim 1,
In the feature point coordinate system set for the time series image, means for inputting the image coordinate value of the feature point in each frame image, means for decomposing a matrix having the observed coordinate value as a matrix element, and the singular value A means for calculating a judgment value representing the degree of freedom of movement from the component, and if this judgment value is less than a certain value, it is regarded as a plane motion, and from rotation around the optical axis and movement on a plane perpendicular to the optical axis. Means for restoring three-dimensional information with three degrees of freedom, and if the determination value is greater than a certain value, restores six-degree-of-freedom movement and three-dimensional information consisting of rotational and translational motions of the camera. Means,
A device for restoring camera motion and three-dimensional information, comprising:

In claim 1,
In the feature point coordinate system set for the omnidirectional image, the means for inputting the image coordinate value of the feature point in each frame image, the azimuth angle (phase angle) from a reference axis from the image coordinate value, and the omnidirectional camera Means for obtaining an angle (elevation angle) from the optical axis direction obtained according to the projection method used; means for obtaining a coordinate value obtained by coordinate conversion using the phase angle and elevation angle as the observation coordinate value;
Means for singular value decomposition of a matrix having the observed coordinate values as matrix elements, means for calculating a determination value representing the degree of freedom of movement from the component of this singular value, and when the determination value is less than a certain value, If the judgment value is greater than a certain value, it is considered to be a planar motion, means for restoring three-dimensional information with three degrees of freedom consisting of rotation around the optical axis and motion on a plane perpendicular to the optical axis. A means for restoring the three-dimensional information and the six-degree-of-freedom motion comprising the rotational motion and translational motion of the camera;
A device for restoring camera motion and three-dimensional information, comprising:

A method for reconstructing three-dimensional information constituting the motion of the camera viewpoint in the time series and the object shape of the outside world from the temporal change amount of the image coordinate value regarding the feature point arranged in the target image in the time series image Because
In the feature point coordinate system set for the time series image, the image coordinate value (observation coordinate value) of the feature point in each frame image is input, and the rotation coordinate of the camera, the optical axis coordinate value, the translational motion, and An asymptotic matrix generating step for generating asymptotic matrix data having a coordinate value multiplied by a coefficient consisting of three-dimensional information as an element;
The asymptotic matrix data is subjected to singular value decomposition, noise removal is performed to obtain matrix data representing motion information and matrix data representing three-dimensional information, and the motion information satisfies the conditions set for defining the motion. A transformation matrix is obtained, and the transformation matrix is applied to the matrix data representing the motion information, so that the rotational motion around the optical axis with respect to the camera viewpoint and the translational motion on a plane perpendicular to the optical axis (a plane with three degrees of freedom consisting of these components). A plane motion / three-dimensional information restoring step for restoring the three-dimensional information constituting the object shape by applying an inverse matrix of the transformation matrix to the matrix data representing the three-dimensional information.
Coordinates obtained by converting the plane motion obtained in the plane motion / three-dimensional information restoration step, the reprojection error calculated from the three-dimensional information, and the observed coordinate value obtained in the asymptotic matrix generation step using the coefficient ε and the optical axis coordinate value. Matrix data having values as matrix elements is obtained, the matrix data obtained by singular value decomposition of the matrix data to remove noise, and the Z coordinate value (Z direction) of each feature point obtained in the plane motion / three-dimensional information restoration step. Optical axis motion that restores the translational motion in the optical axis direction of the camera viewpoint from the matrix whose element is the height of the feature point position when the is vertically oriented), and updates the coefficient δ by the restored optical axis translational motion A restore step,
An error between the observed coordinate value obtained in the asymptotic matrix generation step, the plane value obtained in the plane motion / three-dimensional information restoration step, and the re-projection error obtained from the three-dimensional information is obtained by a coefficient δ. From the matrix data having the error as a matrix element, the plane motion obtained in the plane motion / three-dimensional information restoration step, the three-dimensional information, and the optical axis translation obtained in the optical axis motion restoration step, the optical axis A rotational motion restoring step for obtaining rotational motion around mutually orthogonal axes other than and updating the coefficient ε and the optical axis coordinate value by the restored rotational motion,
The observed coordinate values obtained in the asymptotic matrix generation step are converted into the conversion coefficient (coefficient ε) and optical axis coordinate value updated in the rotational motion restoration step, and the conversion coefficient (coefficient δ) updated in the optical axis motion restoration step. ), The asymptotic value to the plane motion between the plane motion obtained in the plane motion / three-dimensional information restoration step and the reprojected coordinate value of the feature point for each frame image from the three-dimensional information. An asymptotic error representing the following is obtained, and processing for reducing the asymptotic error (camera motion asymptotically approaches planar motion) by switching the processing in the optical axis translational motion restoration step and the processing in the rotational motion restoration step by increasing or decreasing the asymptotic error. Repeated stabilization steps;
A method for restoring camera motion and three-dimensional information, comprising:

In claim 5,
In the feature point coordinate system set for the omnidirectional image, the step of inputting the image coordinate value of the feature point in each frame image, the azimuth angle (phase angle) from a certain reference axis from the image coordinate value, and the omnidirectional camera Obtaining an angle (elevation angle) from the optical axis direction obtained according to the projection method used, obtaining a coordinate value obtained by coordinate conversion using the phase angle and the elevation angle, as the observed coordinate value;
A method for restoring camera motion and three-dimensional information, comprising:

In claim 5,
In the feature point coordinate system set for the time series image, a step of inputting image coordinate values of feature points in each frame image, a step of singular value decomposition of a matrix having the observed coordinate values as matrix elements, and A step of calculating a judgment value representing the degree of freedom of movement from the component, and if this judgment value is less than a certain value, it is regarded as a plane motion, and is determined from a rotation around the optical axis and a movement on a plane perpendicular to the optical axis. If the determination value is greater than or equal to a certain value, the motion with the degree of freedom 6 consisting of the rotational motion and the translational motion and the 3D information are restored. Steps,
A method for restoring camera motion and three-dimensional information, comprising:

In claim 5,
In the feature point coordinate system set for the omnidirectional image, the step of inputting the image coordinate value of the feature point in each frame image, the azimuth angle (phase angle) from a certain reference axis from the image coordinate value, and the omnidirectional camera Obtaining an angle (elevation angle) from the optical axis direction obtained according to the projection method used, obtaining a coordinate value obtained by coordinate conversion using the phase angle and the elevation angle, as the observed coordinate value;
A step of singular value decomposition of a matrix having the observed coordinate values as matrix elements, a step of calculating a determination value representing the degree of freedom of movement from the component of the singular value, and the determination value is less than a certain value, A step of restoring three-dimensional information with three degrees of freedom consisting of rotation around the optical axis and movement on a plane perpendicular to the optical axis as if it were a plane movement, Reconstructing the three-dimensional information and the six-degree-of-freedom motion comprising the rotational motion and translational motion of the camera;
A method for restoring camera motion and three-dimensional information, comprising:

9. A program configured to execute a processing procedure in the camera motion and three-dimensional information restoration method according to claim 5 by a computer.