JP2016119580A

JP2016119580A - Image processing apparatus, imaging apparatus, image processing method, and program

Info

Publication number: JP2016119580A
Application number: JP2014258378A
Authority: JP
Inventors: 正雄松原; Masao Matsubara; 達也八木; Tatsuya Yagi
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2014-12-22
Filing date: 2014-12-22
Publication date: 2016-06-30
Anticipated expiration: 2034-12-22
Also published as: JP6421587B2

Abstract

PROBLEM TO BE SOLVED: To execute camera shake correction in consideration of an influence of parallax with simple calculation.SOLUTION: A specification part 101 specifies one of a plurality of images constituting moving images. A fundamental matrix acquisition part 102 acquires a fundamental matrix representing a relationship with the specified image for each of the predetermined number of images of the plurality of images other than the image specified by the specification part 101. A selection part 103 selects, from the predetermined number of images, an image for performing epipolar projection to the specified image on the basis of the fundamental matrix acquired for each of the predetermined number of images by the fundamental matrix acquisition part 102. A virtual feature point generation part 104 performs epipolar projection to feature points in the selected image corresponding to each other or an image to which a virtual feature point is specified, on the basis of the fundamental matrix acquired for the image selected by the selection part 103, out of the fundamental matrices acquired by the fundamental matrix acquisition part 102 to generate a virtual feature point in the specified image.SELECTED DRAWING: Figure 2

Description

本発明は、画像処理装置、撮像装置、画像処理方法、及びプログラムに関する。 The present invention relates to an image processing device, an imaging device, an image processing method, and a program.

動画に対する手ぶれを補正する方法として、特徴点追跡方式と呼ばれる技術が知られている。特徴点追跡方式においては、動画に含まれる各フレームの画像から特徴点を抽出し、抽出した特徴点の移動ベクトルから構成したオプティカルフローを時間的に平滑化することにより、動画に対して手ぶれを補正する。 A technique called a feature point tracking method is known as a method for correcting camera shake for a moving image. In the feature point tracking method, a feature point is extracted from an image of each frame included in the moving image, and an optical flow composed of the extracted feature point movement vector is temporally smoothed, thereby causing a camera shake on the moving image. to correct.

しかし、特徴点追跡方式は三次元物体の視差の影響を考慮していないため、手ぶれ補正後の動画に不自然な歪みが生じることがあるという欠点があった。 However, since the feature point tracking method does not consider the influence of parallax of a three-dimensional object, there is a drawback in that an unnatural distortion may occur in a moving image after camera shake correction.

特徴点追跡方式が有する上述の欠点を克服する方法として、エピポーラ幾何を応用した手ぶれ補正の方法（以下、エピポーラ転送方式と呼ぶ）が知られている。例えば、非特許文献１は、エピポーラ転送方式で手ぶれを補正することにより、視差の影響を抑制し、手ぶれ補正後の動画に生じる不自然な歪みを軽減する技術を開示している。 As a method of overcoming the above-described drawbacks of the feature point tracking method, a method for correcting camera shake using epipolar geometry (hereinafter referred to as an epipolar transfer method) is known. For example, Non-Patent Document 1 discloses a technique that suppresses the influence of parallax by correcting camera shake using an epipolar transfer method, and reduces unnatural distortion that occurs in a moving image after camera shake correction.

Amit Goldstein and. Raanan Fattal, "Video Stabilization using Epipolar Geometry", ACM Transactions on Graphics (TOG), Volume 31, Issue 5, August 2012, Article No.126Amit Goldstein and. Raanan Fattal, "Video Stabilization using Epipolar Geometry", ACM Transactions on Graphics (TOG), Volume 31, Issue 5, August 2012, Article No.126

しかしながら、エピポーラ転送方式の手ぶれ補正は、手ぶれ補正の対象となる各フレームの画像について、例えば過去１０フレームの画像との間でのオプティカルフローの計算、及び１フレーム当たり数千本から数万本に及ぶエピポーラ線の投影等、多くの座標計算を必要とする。このため、演算負荷が非常に大きかった。 However, the image stabilization of the epipolar transfer method is based on the calculation of the optical flow with respect to the image of the past 10 frames, for example, the image of each frame to be subjected to the image stabilization, and from thousands to tens of thousands per frame. Many coordinate calculations are required, such as projection of epipolar lines. For this reason, the calculation load was very large.

本発明は、上記の課題に鑑みてなされたものであり、視差の影響を考慮した手ぶれ補正を簡易な演算により実行可能な画像処理装置、撮像装置、画像処理方法、及びプログラムを提供することを目的とする。 The present invention has been made in view of the above problems, and provides an image processing apparatus, an imaging apparatus, an image processing method, and a program capable of performing camera shake correction in consideration of the influence of parallax by a simple calculation. Objective.

上記目的を達成するため、本発明に係る画像処理装置は、
動画を構成する複数の画像の中から１つを指定する指定部と、
前記複数の画像のうちの前記指定部によって指定された画像以外の所定数の画像のそれぞれについて、該指定された画像との関係を表す基礎行列を取得する基礎行列取得部と、
前記基礎行列取得部によって前記所定数の画像のそれぞれについて取得された基礎行列に基づいて、前記所定数の画像の中から、前記指定された画像にエピポーラ投影するための画像を選別する選別部と、
前記基礎行列取得部によって取得された基礎行列のうち、前記選別部によって選別された画像について取得された基礎行列に基づいて、該選別された画像内の互いに対応する特徴点又は仮想特徴点を前記指定された画像にエピポーラ投影することにより、前記指定された画像内における仮想特徴点を生成する仮想特徴点生成部と、
前記複数の画像の中から前記指定部が指定する画像を変えて、前記指定部、前記基礎行列取得部、前記選別部、及び前記仮想特徴点生成部の処理を繰り返すことにより、仮想特徴点軌道を構築する仮想特徴点軌道構築部と、
前記仮想特徴点軌道構築部によって構築された仮想特徴点軌道を時間方向に平滑化する平滑化部と、
前記仮想特徴点軌道構築部によって構築された仮想特徴点軌道と、前記平滑化部によって平滑化された仮想特徴点軌道と、の間の関係に基づいて、前記複数の画像のそれぞれを補正する補正部と、
を備えることを特徴とする。 In order to achieve the above object, an image processing apparatus according to the present invention provides:
A designating unit for designating one of a plurality of images constituting the video,
A basic matrix acquisition unit that acquires a basic matrix representing a relationship with the designated image for each of a predetermined number of images other than the image designated by the designation unit of the plurality of images;
A selection unit that selects an image for epipolar projection on the designated image from the predetermined number of images based on the basic matrix acquired for each of the predetermined number of images by the basic matrix acquisition unit; ,
Among the basic matrices acquired by the basic matrix acquisition unit, based on the basic matrix acquired for the image selected by the selection unit, the feature points or virtual feature points corresponding to each other in the selected image are A virtual feature point generation unit that generates a virtual feature point in the specified image by performing an epipolar projection on the specified image;
By changing the image designated by the designation unit from the plurality of images and repeating the processing of the designation unit, the basic matrix acquisition unit, the selection unit, and the virtual feature point generation unit, a virtual feature point trajectory A virtual feature point trajectory construction unit for constructing
A smoothing unit that smoothes the virtual feature point trajectory constructed by the virtual feature point trajectory construction unit in the time direction;
Correction that corrects each of the plurality of images based on the relationship between the virtual feature point trajectory constructed by the virtual feature point trajectory construction unit and the virtual feature point trajectory smoothed by the smoothing unit. And
It is characterized by providing.

本発明によれば、視差の影響を考慮した手ぶれ補正を簡易な演算により実行できる。 According to the present invention, camera shake correction in consideration of the influence of parallax can be executed by a simple calculation.

本発明の実施形態に係る撮像装置の構成を例示するブロック図である。It is a block diagram which illustrates the composition of the imaging device concerning the embodiment of the present invention. 本発明の実施形態に係る撮像装置及び画像処理装置の機能構成を例示するブロック図である。1 is a block diagram illustrating a functional configuration of an imaging apparatus and an image processing apparatus according to an embodiment of the present invention. エピポーラ幾何の概念を説明するための図である。It is a figure for demonstrating the concept of epipolar geometry. エピポーラ転送の概念を説明するための図である。It is a figure for demonstrating the concept of epipolar transfer. エピポーラ転送を利用した仮想特徴点軌道の構築を説明するための図である。It is a figure for demonstrating construction | assembly of the virtual feature point locus | trajectory using epipolar transfer. エピポーラ転送方式の手ぶれ補正の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the camera-shake correction of an epipolar transfer system. エピポーラ転送を利用した手ぶれ補正後の特徴点軌道の構築を説明するための図である。It is a figure for demonstrating construction | assembly of the feature point locus | trajectory after the camera-shake correction | amendment using epipolar transfer. 本発明の実施形態に係る画像処理装置が実行する手ぶれ補正処理を説明するためのフローチャートである。5 is a flowchart for explaining camera shake correction processing executed by the image processing apparatus according to the embodiment of the present invention. 本発明の実施形態に係る画像処理装置が実行するオプティカルフロー取得処理を説明するためのフローチャートである。It is a flowchart for demonstrating the optical flow acquisition process which the image processing apparatus which concerns on embodiment of this invention performs. 本発明の実施形態に係る画像処理装置が実行するオプティカルフロー取得処理を説明するための図である。It is a figure for demonstrating the optical flow acquisition process which the image processing apparatus which concerns on embodiment of this invention performs. 本発明の実施形態に係る画像処理装置が実行する基礎行列取得処理を説明するためのフローチャートである。It is a flowchart for demonstrating the basic matrix acquisition process which the image processing apparatus which concerns on embodiment of this invention performs. 本発明の実施形態に係る画像処理装置が実行するフレーム選別処理を説明するためのフローチャートである。It is a flowchart for demonstrating the frame selection process which the image processing apparatus which concerns on embodiment of this invention performs. 本発明の実施形態に係る画像処理装置が実行するフレーム選別処理を説明するための図である。It is a figure for demonstrating the frame selection process which the image processing apparatus which concerns on embodiment of this invention performs. 本発明の実施形態に係る画像処理装置が実行する仮想特徴点生成処理を説明するためのフローチャートである。It is a flowchart for demonstrating the virtual feature point production | generation process which the image processing apparatus which concerns on embodiment of this invention performs. 本発明の実施形態に係る画像処理装置が実行する仮想特徴点軌道準備処理を説明するためのフローチャートである。It is a flowchart for demonstrating the virtual feature point orbit preparation process which the image processing apparatus which concerns on embodiment of this invention performs.

以下、本発明の実施形態に係る画像処理装置、撮像装置、及び画像処理方法を、図面を参照しながら詳細に説明する。尚、図中同一又は同等の部分には同じ符号を付す。 Hereinafter, an image processing apparatus, an imaging apparatus, and an image processing method according to embodiments of the present invention will be described in detail with reference to the drawings. In the drawings, the same or equivalent parts are denoted by the same reference numerals.

撮像装置１は、図１に示すように、撮像部１０と、データ処理部２０と、ユーザインタフェース部３０と、を備える。 As illustrated in FIG. 1, the imaging apparatus 1 includes an imaging unit 10, a data processing unit 20, and a user interface unit 30.

撮像部１０は、光学レンズ１１とイメージセンサ１２とを含む。撮像部１０は、後述する操作部３２が受け付けたユーザの操作に従って被写体を撮像することにより、手ぶれ補正の対象となる動画を生成する。生成された動画は、時間的に連続して撮像された（連続する複数のフレームにわたって撮像された）複数の画像を含んでいる。 The imaging unit 10 includes an optical lens 11 and an image sensor 12. The imaging unit 10 generates a moving image to be subjected to camera shake correction by imaging a subject in accordance with a user operation received by the operation unit 32 described later. The generated moving image includes a plurality of images that are continuously captured in time (captured over a plurality of consecutive frames).

光学レンズ１１は、被写体から射出された光を集光するレンズと、焦点、露出、ホワイトバランス等の撮像設定パラメータを調整するための周辺回路と、を備える。 The optical lens 11 includes a lens that collects light emitted from a subject, and a peripheral circuit for adjusting imaging setting parameters such as focus, exposure, and white balance.

イメージセンサ１２は、例えば、ＣＣＤ（ＣｈａｒｇｅＣｏｕｐｌｅｄＤｅｖｉｃｅ）やＣＭＯＳ（ＣｏｍｐｌｅｍｅｎｔａｒｙＭｅｔａｌＯｘｉｄｅＳｅｍｉｃｏｎｄｕｃｔｏｒ）等を備える。イメージセンサ１２は、光学レンズ１１が光を集光することによって結像した被写体の光学像を取得して、取得した光学像の電圧情報をアナログ／デジタル変換器（図示せず）によりデジタル画像データに変換する。そして、得られたデジタル画像データを後述する外部記憶部２３に保存する。 The image sensor 12 includes, for example, a CCD (Charge Coupled Device), a CMOS (Complementary Metal Oxide Semiconductor), and the like. The image sensor 12 acquires an optical image of a subject formed by the optical lens 11 condensing light, and the voltage information of the acquired optical image is converted into digital image data by an analog / digital converter (not shown). Convert to Then, the obtained digital image data is stored in the external storage unit 23 described later.

データ処理部２０は、主記憶部２１と、出力部２２と、外部記憶部２３と、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）２４と、画像処理装置１００と、を含む。 The data processing unit 20 includes a main storage unit 21, an output unit 22, an external storage unit 23, a CPU (Central Processing Unit) 24, and an image processing apparatus 100.

主記憶部２１は、例えばＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等を備える。主記憶部２１は、ＣＰＵ２４のワークメモリとして機能し、画像データやプログラムを一時的に記憶する。 The main storage unit 21 includes, for example, a RAM (Random Access Memory). The main storage unit 21 functions as a work memory for the CPU 24 and temporarily stores image data and programs.

出力部２２は、主記憶部２１や外部記憶部２３に記憶された画像データを読み出し、この画像データに対応するＲＧＢ（Ｒ（Ｒｅｄ、赤）、Ｇ（Ｇｒｅｅｎ、緑）、Ｂ（Ｂｌｕｅ、青））信号を生成して、後述する表示部３１に出力する。また、出力部２２は、生成したＲＧＢ信号をＣＰＵ２４や画像処理装置１００へ供給する。 The output unit 22 reads out image data stored in the main storage unit 21 or the external storage unit 23, and RGB (R (Red, red), G (Green, green), B (Blue, blue) corresponding to the image data. )) A signal is generated and output to the display unit 31 described later. The output unit 22 supplies the generated RGB signal to the CPU 24 and the image processing apparatus 100.

外部記憶部２３は、不揮発性メモリ（例えば、フラッシュメモリやハードディスク）を備え、撮像装置１全体の制御に必要な制御プログラムを含む種々のプログラム、種々の固定データ等を固定的に記憶する。外部記憶部２３は、記憶しているプログラムやデータをＣＰＵ２４や後述する画像処理装置１００へ供給し、撮像部１０が生成した動画を含む種々のデータを固定的に記憶する。 The external storage unit 23 includes a non-volatile memory (for example, a flash memory or a hard disk), and stores various programs including a control program necessary for controlling the entire imaging apparatus 1 and various fixed data. The external storage unit 23 supplies the stored program and data to the CPU 24 and the image processing apparatus 100 described later, and fixedly stores various data including moving images generated by the imaging unit 10.

ＣＰＵ２４は、外部記憶部２３に記憶された制御プログラムを実行することにより撮像装置１全体を制御するとともに、外部記憶部２３に記憶された種々のプログラムを実行する。 The CPU 24 controls the entire imaging apparatus 1 by executing a control program stored in the external storage unit 23 and executes various programs stored in the external storage unit 23.

画像処理装置１００は、動画に対してエピポーラ転送方式の手ぶれ補正を施す。画像処理装置１００は、図２に示すように、機能的に、指定部１０１、基礎行列取得部１０２、選別部１０３、仮想特徴点生成部１０４、仮想特徴点軌道構築部１０５、平滑化部１０６、補正部１０７、及び評価部１０８を備える。これら各部は、ＣＰＵ２４の機能によって実現される。 The image processing apparatus 100 performs camera shake correction using an epipolar transfer method on a moving image. As illustrated in FIG. 2, the image processing apparatus 100 functionally includes a designation unit 101, a basic matrix acquisition unit 102, a selection unit 103, a virtual feature point generation unit 104, a virtual feature point trajectory construction unit 105, and a smoothing unit 106. A correction unit 107 and an evaluation unit 108. These units are realized by the functions of the CPU 24.

尚、画像処理装置１００は、通常の画像処理装置と同様に、トリミング機能や画像拡大・縮小機能等を有するが、以下では、本実施形態に特徴的な、エピポーラ転送方式の手ぶれ補正を動画に施す機能を中心に説明する。 The image processing apparatus 100 has a trimming function, an image enlargement / reduction function, and the like, as in a normal image processing apparatus. In the following, the epipolar transfer type camera shake correction characteristic of the present embodiment is converted into a moving image. The function to be performed will be mainly described.

指定部１０１は、手ぶれ補正対象の動画を構成する複数の画像の中から１つを指定する。基礎行列取得部１０２は、手ぶれ補正対象の動画を構成する複数の画像のうちの指定部１０１によって指定された画像以外の所定数（本実施形態では、１０フレーム分）の画像のそれぞれについて、指定部１０１によって指定された画像との関係を表す基礎行列Ｆを取得する。 The designation unit 101 designates one of a plurality of images constituting a moving image to be corrected for camera shake. The basic matrix acquisition unit 102 designates each of a predetermined number of images (in this embodiment, 10 frames) other than the image designated by the designation unit 101 among the plurality of images constituting the image to be corrected for camera shake. The basic matrix F representing the relationship with the image specified by the unit 101 is acquired.

具体的には、基礎行列取得部１０２は、オプティカルフロー取得部１０２ａを含み、オプティカルフロー取得部１０２ａによって取得されたオプティカルフローに基づいて、基礎行列Ｆを取得する。 Specifically, the basic matrix acquisition unit 102 includes an optical flow acquisition unit 102a, and acquires the basic matrix F based on the optical flow acquired by the optical flow acquisition unit 102a.

オプティカルフロー取得部１０２ａは、オプティカルフロー取得処理を実行することにより、画像間の特徴点ｐ^ｉの移動ベクトルによって形成される画像間のオプティカルフローと、特徴点ｐ^ｉの複数のフレームに亘る移動の軌跡である特徴点軌道ｐと、を取得する。 The optical flow acquisition unit 102a executes an optical flow acquisition process, thereby performing an optical flow between images formed by a movement vector of the feature point p ⁱ between images and a movement of the feature point p ⁱ over a plurality of frames. A feature point trajectory p that is a trajectory is acquired.

具体的には、オプティカルフロー取得部１０２ａは、手ぶれ補正対象の動画に含まれる連続する複数のフレームの画像それぞれから互いに対応する特徴点ｐ^ｉを抽出する（特徴点ｐ^ｉの座標（例えば、同次座標）を取得する）。 Specifically, the optical flow acquisition unit 102a extracts feature points p ⁱ corresponding to each other from images of a plurality of consecutive frames included in the moving image to be corrected for camera shake (the coordinates of the feature points p ⁱ (for example, the same Next coordinate)).

以下、特徴点をｐ^ｉ _ｋと表記する。ｉは特徴点を識別するためのインデックスであり、ｋは特徴点が含まれている画像がどのフレームの画像かを示すインデックスである。なお、どのフレームの画像であるか特定する必要がない場合は、インデックスｋを省略してｐ^ｉと表記する。 Hereinafter, the feature point is expressed as p ⁱ _k . i is an index for identifying a feature point, and k is an index indicating which frame an image including the feature point is. If it is not necessary to identify whether an image of which frame is denoted as p ⁱ to omit the index k.

三次元空間内の（被写体上の）単一の点が、第Ｅフレームの画像では特徴点ｐ^ｉ _Ｅに投影され、第Ｆフレームの画像では特徴点ｐ^ｉ _Ｆに投影されている場合、２つの特徴点ｐ^ｉ _Ｅ、ｐ^ｉ _Ｆは「互いに対応している」と表現する。 When a single point (on the subject) in the three-dimensional space is projected on the feature point p ⁱ _E in the image of the E-th frame and is projected on the feature point p ⁱ _F in the image of the F-th frame, 2 The two feature points p ⁱ _E and p ⁱ _F are expressed as “corresponding to each other”.

異なる１対のフレームの画像がそれぞれ含む互いに対応する１対の特徴点ｐ^ｉの一方を始点として、他方を終点として有するベクトルを、これらの画像間の特徴点ｐ^ｉの移動ベクトルという。特徴点ｐ^ｉの移動ベクトルは、一方の画像における特徴点ｐ^ｉが他方の画像においてどこへ移動しているかを示す。異なるフレームの画像間のオプティカルフローは、少なくとも１つ以上の特徴点ｐ^ｉの、これらの画像間における移動ベクトルによって形成され、特徴点ｐ^ｉの移動ベクトルの画像内における分布を示す。 A vector having one of a pair of feature points p ⁱ corresponding to each other included in images of different pairs of frames as a start point and the other as an end point is referred to as a movement vector of the feature points p ⁱ between these images. Moving vector of the feature point p ⁱ indicates which feature point p ⁱ in one image is moving anywhere in the other image. The optical flow between images of different frames is formed by movement vectors of at least one or more feature points p ⁱ between these images, and indicates the distribution of the movement vectors of the feature points p ⁱ in the image.

オプティカルフロー取得部１０２ａは、手ぶれ補正対象の動画に含まれる各フレーム中の互いに対応する特徴点ｐ^ｉ、すなわち特徴点ｐ^ｉの移動ベクトルの始点または終点、の座標を取得することにより、特徴点ｐ^ｉの移動ベクトルと、これらの特徴点ｐ^ｉの移動ベクトルが形成するオプティカルフローと、を取得する。 The optical flow acquisition unit 102a acquires the coordinates of feature points p ⁱ corresponding to each other in each frame included in the moving image subject to camera shake correction, that is, the start point or end point of the movement vector of the feature point p ^i. obtaining a movement vector of p ^i, and optical flow motion vectors of these characteristic points p ⁱ forms, the.

特徴点軌道ｐは、特徴点ｐ^ｉの複数のフレームに亘る軌跡を示す。オプティカルフロー取得部１０２ａは、オプティカルフロー取得処理を実行することにより、手ぶれ補正対象の動画が含む各フレームの画像から互いに対応する特徴点ｐ^ｉを抽出し、抽出した特徴点ｐ^ｉを追跡することにより特徴点軌道ｐを取得する。 Feature point trajectory p indicates a track over a plurality of frames of the feature point p ^i. The optical flow acquisition unit 102a performs an optical flow acquisition process to extract feature points p ⁱ corresponding to each other from images of each frame included in a moving image to be corrected for camera shake, and to track the extracted feature points p ⁱ To obtain the feature point trajectory p.

なお、オプティカルフロー取得部１０２ａは、任意の公知技術を用いて複数のフレームの画像から互いに対応する特徴点ｐ^ｉを抽出する。例えば、ＫＬＴ法（ＫＬＴトラッキング）を用いることができる。ＫＬＴ法を用いて特徴点ｐ^ｉを抽出する技術は、非特許文献１に開示されているとおり、当該技術分野において周知であるため詳細な説明は省略する。 Note that the optical flow acquisition unit 102a extracts feature points p ⁱ corresponding to each other from images of a plurality of frames using any known technique. For example, the KLT method (KLT tracking) can be used. Since the technique for extracting the feature points p ⁱ using the KLT method is well known in the technical field as disclosed in Non-Patent Document 1, detailed description thereof is omitted.

オプティカルフロー取得部１０２ａが実行するオプティカルフロー取得処理については、後に図９のフローチャートを参照しながら詳細に説明する。 The optical flow acquisition process executed by the optical flow acquisition unit 102a will be described in detail later with reference to the flowchart of FIG.

オプティカルフロー取得部１０２ａによって取得されたオプティカルフローに基づいて、基礎行列取得部１０２は、基礎行列Ｆを取得する。基礎行列Ｆは、手ぶれ補正前の２つの画像間の幾何的関係（エピポーラ関係）を表す３行３列の行列である。 Based on the optical flow acquired by the optical flow acquisition unit 102a, the basic matrix acquisition unit 102 acquires the basic matrix F. The basic matrix F is a 3 × 3 matrix representing a geometric relationship (epipolar relationship) between two images before camera shake correction.

図３を参照して、エピポーラ幾何の概念を説明する。異なる時刻に撮像された第Ｑフレームの画像と第Ｒフレームの画像は、それぞれ、互いに異なる点Ｃ１と点Ｃ２とに配置された撮像装置１によって撮像された画像と見なすことができる。すなわち、第Ｑフレームの画像と第Ｒフレームの画像は、それぞれ、点Ｃ１と点Ｃ２とを投影中心とする投影面を表す画像と見なすことができる。点Ｃ１と点Ｃ２との位置の相違は、手ぶれ等の影響により撮像装置１の位置が変動したことに由来する。三次元空間内の単一の点Ｐは、第Ｑフレームの画像中では点ｍ１に投射される一方、第Ｒフレームの画像中では点ｍ２に投影される。すなわち、点ｍ１と点ｍ２は互いに対応する点である。 The concept of epipolar geometry will be described with reference to FIG. The Q-th frame image and the R-th frame image captured at different times can be regarded as images captured by the imaging devices 1 arranged at different points C1 and C2, respectively. That is, the image of the Qth frame and the image of the Rth frame can be regarded as images representing the projection planes having the points C1 and C2 as the projection centers, respectively. The difference in position between the point C1 and the point C2 is derived from the change in the position of the imaging device 1 due to the influence of camera shake or the like. A single point P in the three-dimensional space is projected to the point m1 in the image of the Qth frame, and is projected to the point m2 in the image of the Rth frame. That is, the points m1 and m2 correspond to each other.

三次元空間内の点Ｐと点Ｃ１とを結ぶ直線は、第Ｑフレームの画像上では点ｍ１に投影される一方、第Ｒフレームの画像上では、点Ｐ、点Ｃ１、及び点Ｃ２を通る平面（エピポーラ面）と第Ｒフレームの画像との交線である線Ｌ１として投影される。線Ｌ１は、第Ｑフレームの画像中の点ｍ１が第Ｒフレームの画像に投影するエピポーラ線である。同様に、線Ｌ２は、第Ｒフレームの画像中の点ｍ２が第Ｑフレームの画像に投影するエピポーラ線である。 A straight line connecting the point P and the point C1 in the three-dimensional space is projected to the point m1 on the Qth frame image, and passes through the point P, the point C1, and the point C2 on the Rth frame image. Projected as a line L1 that is an intersection line between the plane (epipolar plane) and the image of the R-th frame. The line L1 is an epipolar line projected from the point m1 in the Qth frame image onto the Rth frame image. Similarly, the line L2 is an epipolar line projected from the point m2 in the R-th frame image onto the Q-th frame image.

第Ｑフレームの画像中の任意の点ｑが第Ｒフレームの画像に投影するエピポーラ線Ｌ_Ｑ、Ｒは、次の式（１）により表される。ここで、Ｆ_Ｑ，Ｒは、第Ｑフレームの画像と第Ｒフレームの画像との間のエピポーラ関係を表す基礎行列である。Ｆ_Ｑ，Ｒは、第Ｑフレームの画像中の任意の点ｍ１と、点ｍ１に対応する第Ｒフレームの画像中の点ｍ２と、を用いて次の式（２）で表される。

Epipolar lines LQ _{and R} projected from an arbitrary point q in the image of the Qth frame to the image of the Rth frame are expressed by the following equation (1). Here, F _{Q, R} is a basic matrix representing an epipolar relationship between the image of the Qth frame and the image of the Rth frame. F _{Q, R} is expressed by the following equation (2) using an arbitrary point m1 in the image of the Qth frame and a point m2 in the image of the Rth frame corresponding to the point m1.

式（２）から明らかなように、基礎行列Ｆは、ｆ_１１からｆ_３２までの８個の未知数を有するため、少なくとも８組の対応する点ｍ１、ｍ２に基づいて求めることができる。すなわち、８箇所の対応点が与えられれば基礎行列Ｆを算出することができる。 As is apparent from the equation (2), the basic matrix F has eight unknowns from f ₁₁ to f _32, and therefore can be obtained based on at least eight sets of corresponding points m 1 and m 2. That is, if eight corresponding points are given, the basic matrix F can be calculated.

基礎行列取得部１０２は、指定部１０１によって指定された画像と指定部１０１によって指定された画像の直前１０フレーム分の画像とからそれぞれオプティカルフロー取得部１０２ａが抽出した８組の互いに対応する特徴点ｐ^ｉの座標に基づき、標準的な８ポイントアルゴリズムと誤差を最小化するためのＲＡＮＳＡＣ（ＲＡＮｄｏｍＳＡｍｐｌｅＣｏｎｓｅｎｓｕｓ）法とを用いて基礎行列Ｆを取得する。８ポイントアルゴリズムとＲＡＮＳＡＣ法を用いて基礎行列Ｆを取得する方法は、非特許文献１に開示されているとおり、当該技術分野において周知であるため詳細な説明は省略する。 The basic matrix acquisition unit 102 includes eight sets of feature points corresponding to each other extracted by the optical flow acquisition unit 102a from the image specified by the specification unit 101 and the image for the last 10 frames of the image specified by the specification unit 101. Based on the coordinates of p ⁱ , the basic matrix F is obtained using a standard 8-point algorithm and a RANSAC (RANdom Sample Consensus) method for minimizing errors. As disclosed in Non-Patent Document 1, the method for obtaining the basic matrix F using the 8-point algorithm and the RANSAC method is well known in the technical field, and thus detailed description thereof is omitted.

基礎行列取得部１０２が実行する基礎行列取得処理については、後に図１１のフローチャートを参照しながら詳細に説明する。 The basic matrix acquisition process executed by the basic matrix acquisition unit 102 will be described in detail later with reference to the flowchart of FIG.

選別部１０３は、基礎行列取得部１０２によって取得された基礎行列に基づいて、指定部１０１によって指定された画像より前の所定数（本実施形態では、１０フレーム分）の画像の中から、指定された画像にエピポーラ投影するための画像を選別する。具体的には、選別部１０３は、画像の選別のために、評価部１０８による基礎行列の類似度の評価結果を参酌する。 Based on the basic matrix acquired by the basic matrix acquisition unit 102, the selection unit 103 specifies a predetermined number of images (10 frames in the present embodiment) before the image specified by the specifying unit 101. An image for epipolar projection is selected on the obtained image. Specifically, the selection unit 103 refers to the evaluation result of the similarity of the basic matrix by the evaluation unit 108 for image selection.

基礎行列の類似度とは、異なる２つの基礎行列が類似しているか否かの度合いを示す指標である。例えば、第Ｑフレームの画像と第Ｒフレームの画像との間のエピポーラ関係を表す基礎行列Ｆ_Ｑ，Ｒと、第Ｑフレームの画像と第Ｓフレームの画像との間のエピポーラ関係を表す基礎行列Ｆ_Ｑ，Ｓと、が類似している場合、第Ｒフレームの画像と第Ｓフレームの画像とが類似していることを意味する。ここで、異なる２フレームの画像が互いに類似している場合、これら２つの画像中の仮想特徴点ｖ^ｉの座標はほぼ同じである。そのため、これら２つの画像中の仮想特徴点ｖ^ｉがそれぞれ投影する２本のエピポーラ線はほぼ平行であり、これらのエピポーラ線が与える交点（エピポーラ交点）は信頼性に欠ける。 The similarity between basic matrices is an index indicating the degree of whether two different basic matrices are similar. For example, a basic matrix F _{Q, R} representing the epipolar relationship between the Q-th frame image and the R-th frame image, and a basic matrix representing the epipolar relationship between the Q-th frame image and the S-th frame image. When _{FQ and S} are similar, it means that the image of the Rth frame and the image of the Sth frame are similar. Here, if the image of two different frames are similar to each other, the coordinates of the virtual feature point v ⁱ in the two images are similar. Therefore, the two epipolar lines projected by the virtual feature points v ⁱ in these two images are almost parallel, and the intersection (epipolar intersection) given by these epipolar lines is not reliable.

そこで、選別部１０３は、類似している複数の基礎行列を発見した場合、そのうち何れか１つの基礎行列のみを、エピポーラ投影に用いる基礎行列として選別する。すなわち、選別部１０３は、評価部による評価の結果、所定数の画像の中に互いに類似している２以上の画像がある場合、該２以上の画像のうちのいずれか１つ以外の画像を、指定部１０１によって指定された画像にエピポーラ投影するための画像から除外する。これにより、類似度が高い複数の画像が含む仮想特徴点ｖ^ｉからそれぞれエピポーラ線が投影され、信頼性に欠けるエピポーラ交点が生成されることを防止することができる。 Therefore, when a plurality of similar basic matrices are found, the selection unit 103 selects only one of them as a basic matrix used for epipolar projection. That is, as a result of the evaluation by the evaluation unit, when there are two or more images that are similar to each other in the predetermined number of images, the selection unit 103 selects an image other than any one of the two or more images. And excluded from the image for epipolar projection on the image specified by the specifying unit 101. Thereby, it is possible to prevent epipolar lines from being projected from virtual feature points v ⁱ included in a plurality of images having a high degree of similarity, and to prevent generation of epipolar intersections lacking in reliability.

選別部１０３が実行する選別処理については、後に図１２のフローチャートを参照しながら詳細に説明する。 The sorting process executed by the sorting unit 103 will be described in detail later with reference to the flowchart of FIG.

評価部１０８は、指定部１０１によって指定された画像より前の所定数の画像のそれぞれについて基礎行列取得部１０２によって取得された基礎行列の類似度を評価する。具体的には、評価部１０８は、異なる２つの基礎行列の間において、一方の基礎行列に含まれる各要素と、他方の基礎行列に含まれる各要素と、を比較することにより、この２つの基礎行列の類似度を評価する。類似度は、２つの基礎行列に含まれる要素同士を比較した結果を様々に用いて、取得することができる。 The evaluation unit 108 evaluates the similarity of the basic matrix acquired by the basic matrix acquisition unit 102 for each of a predetermined number of images prior to the image specified by the specifying unit 101. Specifically, the evaluation unit 108 compares the two elements in one basic matrix with each element included in the other basic matrix between two different basic matrices, thereby comparing the two basic matrices. Evaluate the similarity of the base matrix. The similarity can be obtained by using various results obtained by comparing elements included in the two basic matrices.

例えば、評価部１０８は、異なる２つの基礎行列の間において、一方の基礎行列を正規化した行列に含まれる各要素と、他方の基礎行列を正規化した行列における対応する要素と、の差分をとった値がいずれも閾値以下である場合に、この２つの基礎行列が類似していると評価する。 For example, the evaluation unit 108 calculates a difference between each element included in a matrix obtained by normalizing one basic matrix and a corresponding element in a matrix obtained by normalizing the other basic matrix between two different basic matrices. When both of the values taken are equal to or less than the threshold value, it is evaluated that the two basic matrices are similar.

すなわち、基礎行列は３行３列の行列であって９つの要素を有するため、評価部１０８は、比較対象の一方の基礎行列に含まれる９つの要素と他方の基礎行列に含まれる９つの要素との同じ位置にある要素同士での差分をとる。そして、得られた９つの差分値を、それぞれ予め定められた９つの閾値と比較する。これは、比較対象の一方の基礎行列と他方の基礎行列との差分をとった３行３列の差分行列を計算し、得られた差分行列を、予め定められた９つの閾値を要素として有する３行３列の閾行列と比較することと同じである。９つの差分値と比較する９つの閾値は、全て同じ値であってもよいし、互いに異なってもよい。比較の結果、９つの差分値全てが閾値以下である場合に、評価部１０８は、この２つの基礎行列が類似していると評価する。 That is, since the base matrix is a matrix of 3 rows and 3 columns and has nine elements, the evaluation unit 108 includes nine elements included in one base matrix to be compared and nine elements included in the other base matrix. The difference between elements at the same position as is taken. Then, the obtained nine difference values are compared with nine predetermined threshold values. This calculates a difference matrix of 3 rows and 3 columns taking the difference between one base matrix to be compared and the other base matrix, and the obtained difference matrix has nine predetermined threshold values as elements. This is the same as comparing with a 3 × 3 threshold matrix. The nine threshold values to be compared with the nine difference values may all be the same value or may be different from each other. As a result of the comparison, when all nine difference values are equal to or less than the threshold value, the evaluation unit 108 evaluates that the two basic matrices are similar.

或いは、評価部１０８は、異なる２つの基礎行列の間において、一方の基礎行列を正規化した行列に含まれる各要素と、他方の基礎行列を正規化した行列における対応する要素と、の差分の２乗和をとった値が閾値以下である場合に、この２つの基礎行列が類似していると評価してもよい。 Alternatively, the evaluation unit 108 calculates a difference between each element included in a matrix obtained by normalizing one basic matrix and a corresponding element in a matrix obtained by normalizing the other basic matrix between two different basic matrices. When the value obtained by taking the square sum is equal to or less than the threshold value, it may be evaluated that the two basic matrices are similar.

この場合、評価部１０８は、比較対象の一方の基礎行列に含まれる９つの要素と他方の基礎行列に含まれる９つの要素との同じ位置にある要素同士での差分をとり、得られた９つの差分値のそれぞれを２乗して加算する。そして、得られた加算値を予め定められた１つの閾値と比較して、加算値が閾値以下である場合に、評価部１０８は、この２つの基礎行列が類似していると評価する。 In this case, the evaluation unit 108 obtains the difference between the elements at the same position of the nine elements included in one base matrix to be compared and the nine elements included in the other base matrix and obtains the obtained 9 Each of the two difference values is squared and added. Then, the obtained addition value is compared with a predetermined threshold value, and when the addition value is equal to or less than the threshold value, the evaluation unit 108 evaluates that the two basic matrices are similar.

このように、評価部１０８は、指定部１０１によって指定された画像より前の所定数の画像のそれぞれについて取得された所定数の基礎行列の中に、互いに類似している複数の基礎行列があるか否かを評価する。そして、選別部１０３は、評価部１０８による評価結果に基づいて、所定数の画像の中から指定された画像にエピポーラ投影するための画像を選別する。 As described above, the evaluation unit 108 includes a plurality of basic matrices similar to each other among the predetermined number of basic matrices acquired for each of the predetermined number of images prior to the image specified by the specifying unit 101. Evaluate whether or not. Based on the evaluation result by the evaluation unit 108, the selection unit 103 selects an image for epipolar projection from a predetermined number of images to a designated image.

仮想特徴点生成部１０４は、仮想特徴点生成処理を実行し、基礎行列取得部１０２によって取得された基礎行列のうち、選別部１０３によって選別された画像について取得された基礎行列に基づいて、選別部１０３によって選別された画像内の互いに対応する特徴点ｐ^ｉ又は仮想特徴点ｖ^ｉを指定部１０１により指定された画像にエピポーラ投影することにより、指定された画像内における仮想特徴点ｖ^ｉを生成する。 The virtual feature point generation unit 104 executes virtual feature point generation processing, and selects based on the basic matrix acquired for the images selected by the selection unit 103 among the basic matrices acquired by the basic matrix acquisition unit 102 by epipolar projected to the designated image by specifying unit 101 the feature point p ⁱ or virtual feature points v ⁱ corresponding to each other in the image that has been selected by section 103, a virtual feature point v ⁱ within the specified image Generate.

具体的には、仮想特徴点生成部１０４は、画像間のエピポーラ関係を表す基礎行列Ｆに基づいて、対応点を生成するエピポーラ転送という方法により仮想特徴点ｖ^ｉを生成する。 Specifically, the virtual feature point generation unit 104 generates virtual feature points v ⁱ by a method called epipolar transfer that generates corresponding points based on the basic matrix F that represents the epipolar relationship between images.

図４を参照して、エピポーラ転送の概念を説明する。第Ｘフレームの画像と第Ｙフレームの画像とにおける互いに対応する点ｍ３、ｍ４がそれぞれ第Ｚフレームの画像に投影するエピポーラ線Ｌ３、Ｌ４は、第Ｚフレームの画像中の、点ｍ３及び点ｍ４に対応する点ｍ５で交わる。エピポーラ線Ｌ３、Ｌ４及びその交点である点ｍ５は、画像間のエピポーラ関係を表す基礎行列Ｆ_Ｘ、Ｚ、Ｆ_Ｙ、Ｚに基づいて算出できる。すなわち、第Ｚフレーム中の対応点である点ｍ５を、画像間のエピポーラ関係を表す基礎行列Ｆに基づいて生成できる。 The concept of epipolar transfer will be described with reference to FIG. The epipolar lines L3 and L4 that the corresponding points m3 and m4 in the image of the Xth frame and the image of the Yth frame respectively project on the image of the Zth frame are the points m3 and m4 in the image of the Zth frame. At the point m5 corresponding to. The epipolar lines L3 and L4 and the intersection m5 can be calculated based on the basic matrices F _{X, Z} , F _{Y and Z} representing the epipolar relationship between images. That is, the point m5 which is a corresponding point in the Zth frame can be generated based on the basic matrix F representing the epipolar relationship between images.

仮想特徴点生成部１０４は、このエピポーラ転送を用いて、指定された画像内における仮想特徴点ｖ^ｉを生成する。 Virtual feature point generation unit 104 uses the epipolar transfer, it generates the virtual feature point v ⁱ at the designated within an image.

図４に示すように、少なくとも２本のエピポーラ線があればその交点を仮想特徴点ｖ^ｉとして取得できるものの、実際にはノイズやトラッキングエラー、モデリングエラー等の影響により交点の信頼性は損なわれている。そこで、仮想特徴点生成部１０４は、最大１０本のエピポーラ線のうち組み合わせ可能な異なる２本のエピポーラ線の交点（最大で_１０Ｃ_２＝４５個の交点）を求め、求めた交点の座標の平均を求めることにより、より正確な特徴点ｖ^ｉを取得する。 As shown in FIG. 4, although the ability to retrieve the intersection if at least two epipolar lines as a virtual feature point v ^i, actually noise and tracking error, the intersection of the reliability due to the influence of such modeling error impaired ing. Therefore, the virtual feature point generation unit 104 obtains intersections of two different epipolar lines that can be combined among a maximum of _ten epipolar lines (up to ₁₀ C ₂ = 45 intersections), and calculates the coordinates of the obtained intersection points. By obtaining the average, a more accurate feature point v ⁱ is obtained.

具体的には、図５に示すように、仮想特徴点生成部１０４は、第ｔフレームの画像に、この画像の直前に撮像された連続する１０フレーム分の画像（第（ｔ−１０）フレームから第（ｔ−１）フレームまでの過去１０フレーム）に含まれる仮想特徴点ｖ^ｉ _ｔ−１〜ｖ^ｉ _ｔ−１０がそれぞれ投影する１０本のエピポーラ線を、基礎行列取得部１０２によって取得された基礎行列Ｆ_{ｔ、ｔ−１}〜Ｆ_{ｔ、ｔ−１０}に基づいて求める。そして、求めた１０本のエピポーラ線の交点の平均を、第ｔフレームの画像中の仮想特徴点ｖ^ｉ _ｔとして取得する。 Specifically, as illustrated in FIG. 5, the virtual feature point generation unit 104 adds an image of ten consecutive frames ((t−10) frames captured immediately before this image to the image of the t th frame. the first (t-1) virtual feature points included in the past 10 frames) to the frame _{^v _i} ^t-1 ^~v _i 10 pieces of epipolar _{lines t-10} is projected respectively, it is acquired by the fundamental matrix acquisition unit 102 from Obtained based on the basic matrix F _{t, t-1 to} F _{t, t-10} . Then, the average of the obtained intersections of the ten epipolar lines is acquired as a virtual feature point v ⁱ _{t in} the image of the t-th frame.

仮想特徴点生成部１０４が実行する仮想特徴点生成処理については、後に図１４のフローチャートを参照しながら詳細に説明する。 The virtual feature point generation processing executed by the virtual feature point generation unit 104 will be described in detail later with reference to the flowchart of FIG.

仮想特徴点軌道構築部１０５は、手ぶれ補正対象の動画を構成する複数の画像の中から指定部１０１が指定する画像を変えて、指定部１０１、基礎行列取得部１０２、選別部１０３、及び仮想特徴点生成部１０４の処理を繰り返すことにより、仮想特徴点軌道ｖを構築する。 The virtual feature point trajectory construction unit 105 changes the image designated by the designation unit 101 from among a plurality of images constituting the image to be corrected for camera shake, and designates the designation unit 101, the basic matrix acquisition unit 102, the selection unit 103, and the virtual By repeating the process of the feature point generation unit 104, a virtual feature point trajectory v is constructed.

仮想特徴点軌道ｖは、仮想特徴点ｖ^ｉの複数のフレームに亘る軌跡である。仮想特徴点軌道構築部１０５は、仮想特徴点生成部１０４に、手ぶれ補正対象の動画を構成する各フレームの画像において仮想特徴点ｖ^ｉを生成させることにより仮想特徴点ｖ^ｉを時間的に追跡し（複数のフレームに亘って追跡し）、仮想特徴点軌道ｖを構築する。 Virtual feature point trajectory v is a trajectory over a plurality of frames of the virtual feature point v ^i. Virtual feature point trajectory construction unit 105, a virtual feature point generation unit 104, time tracking virtual feature point v ⁱ by generating a virtual feature point v ⁱ in the image of each frame constituting the video image stabilization target (Tracking over a plurality of frames) to construct a virtual feature point trajectory v.

仮想特徴点軌道ｖは、図６に示すように、手ぶれ補正前の（手ぶれの影響を受けた）撮像装置１の動きを表している。 As shown in FIG. 6, the virtual feature point trajectory v represents the movement of the imaging device 1 before the camera shake correction (affected by the camera shake).

特徴点軌道ｐが画像からＫＬＴ法を用いて抽出された特徴点ｐ^ｉによって構成されるのに対し、仮想特徴点軌道ｖはエピポーラ転送により生成された仮想特徴点ｖ^ｉによって構成される。 While the feature point trajectory p is constituted by the feature point p ⁱ extracted with KLT method from the image, the virtual feature point trajectory v is constituted by the virtual feature point v ⁱ generated by the epipolar transfer.

三次元空間内の被写体の一部分が、あるフレームの画像において物体の影に隠れてしまったり、撮像装置１の視野から外れてしまったりした場合、この部分に含まれる点の、当該フレームの画像における投影点である特徴点ｐ^ｉをＫＬＴ法により抽出することはできない。しかし、このフレームの画像内の仮想特徴点ｖ^ｉは、過去のフレームの画像内の対応する仮想特徴点ｖ^ｉをエピポーラ転送することにより生成できる。また、ＫＬＴ法ではトラッキングエラーとなり特徴点ｐ^ｉが抽出できない場合でも、エピポーラ転送によれば仮想特徴点ｖ^ｉを生成することができる。 When a part of the subject in the three-dimensional space is hidden in the shadow of the object in the image of a certain frame or deviated from the field of view of the imaging device 1, the point included in this part of the image of the frame The feature point p ⁱ that is a projection point cannot be extracted by the KLT method. However, the virtual feature point v ⁱ in the image of this frame can be generated by epipolar transfer of the corresponding virtual feature point v ⁱ in the image of the past frame. Further, even when the KLT method results in a tracking error and the feature point p ⁱ cannot be extracted, the virtual feature point v ⁱ can be generated by epipolar transfer.

このため、仮想特徴点軌道ｖは、特徴点軌道ｐよりも長く（多くのフレームにわたって）連続している。 For this reason, the virtual feature point trajectory v is longer (over many frames) than the feature point trajectory p.

仮想特徴点軌道構築部１０５が実行する仮想特徴点軌道ｖの構築処理については、後に図８のフローチャートを参照しながら詳細に説明する。 The construction process of the virtual feature point trajectory v executed by the virtual feature point trajectory construction unit 105 will be described in detail later with reference to the flowchart of FIG.

平滑化部１０６は、仮想特徴点軌道構築部１０５によって構築された仮想特徴点軌道ｖを時間方向に平滑化することにより、平滑化された仮想特徴点軌道〜ｖを取得する。 The smoothing unit 106 acquires the smoothed virtual feature point trajectory to v by smoothing the virtual feature point trajectory v constructed by the virtual feature point trajectory construction unit 105 in the time direction.

具体的には、平滑化部１０６は、下記の式（３）を用い、仮想特徴点軌道ｖを構成する仮想特徴点ｖ^ｉそれぞれの座標とガウシアンカーネルとの畳み込みを作ることにより、平滑化された仮想特徴点軌道〜ｖを構成する手ぶれ補正後の仮想特徴点〜ｖ^ｉ _ｔの座標を取得する。ここで、ｇは下記の式（４）で表されるガウシアンカーネルである。

Specifically, the smoothing unit 106, using Equation (3) below, by making the convolution between the virtual feature point v ⁱ respective coordinates and Gaussian kernel to configure the virtual feature point trajectory v, is smoothed to obtain the coordinates of the virtual feature point to v ⁱ _t after image stabilization composing the virtual feature point trajectory to v were. Here, g is a Gaussian kernel represented by the following formula (4).

本実施形態では、σ＝５０のガウシアンカーネルを用いて仮想特徴点軌道ｖを平滑化する。 In this embodiment, the virtual feature point trajectory v is smoothed using a Gaussian kernel with σ = 50.

仮想特徴点軌道ｖが手ぶれ補正前の（手ぶれの影響を受けた）撮像装置１の動きを表していたのに対し、平滑化された仮想特徴点軌道〜ｖは、図６に示すように、手ぶれ補正後の（手ぶれの影響を除去した）撮像装置１の動きを表す。 While the virtual feature point trajectory v represents the movement of the imaging apparatus 1 before the camera shake correction (affected by the camera shake), the smoothed virtual feature point trajectory ~ v is as shown in FIG. It represents the movement of the imaging apparatus 1 after camera shake correction (the effect of camera shake has been removed).

具体的には図６に示すように、手ぶれ補正後の動画を構成する各フレームの画像は、平滑化された仮想特徴点軌道〜ｖによって定義される。すなわち、手ぶれ補正前の動画を構成する各フレームの画像と手ぶれ補正後の動画を構成する各フレームの画像との間の関係は、仮想特徴点軌道ｖと平滑化された仮想特徴点軌道〜ｖとの間の関係によって定義される。 Specifically, as shown in FIG. 6, the image of each frame constituting the moving image after camera shake correction is defined by the smoothed virtual feature point trajectory ~ v. That is, the relationship between the image of each frame composing the moving image before camera shake correction and the image of each frame composing the moving image after camera shake correction is as follows: virtual feature point trajectory v and smoothed virtual feature point trajectory ~ v Defined by the relationship between

補正部１０７は、仮想特徴点軌道構築部１０５によって構築された仮想特徴点軌道ｖと、平滑化部１０６によって平滑化された仮想特徴点軌道〜ｖと、の間の関係に基づいて、手ぶれ補正対象の動画を構成する複数の画像それぞれを補正する。 The correcting unit 107 corrects camera shake based on the relationship between the virtual feature point trajectory v constructed by the virtual feature point trajectory constructing unit 105 and the virtual feature point trajectory to v smoothed by the smoothing unit 106. Each of a plurality of images constituting the target moving image is corrected.

具体的には、補正部１０７は、手ぶれ補正前の動画が含む各フレームの画像に対し、手ぶれ補正前の画像が含む特徴点ｐ^ｉを手ぶれ補正後の画像が含む手ぶれ補正後の特徴点〜ｐ^ｉへ移すような射影変換を施すことにより、手ぶれ補正を実行する。 Specifically, the correction unit 107, image stabilization on the image of each frame before the video contains feature points after image stabilization comprising the image after image stabilization feature point p ⁱ where camera shake correction previous image contains ~ by performing projective transformation such as move to p ^i, it executes image stabilization.

例えば、補正部１０７は、手ぶれ補正前の動画を構成する各フレームの画像に対し、各フレームの画像における任意の点（ｘ、ｙ）を次の式（５）を満たす点（ｘ’、ｙ’）へ移す射影変換を施すことにより、手ぶれ補正後の画像を取得する。ここで、式（５）中の射影変換パラメータａ〜ｈは、下記の式（６）を用いて求めることができる。

For example, the correction unit 107 replaces an arbitrary point (x, y) in the image of each frame with a point (x ′, y) that satisfies the following expression (5) with respect to the image of each frame constituting the moving image before camera shake correction. The image after camera shake correction is acquired by performing the projective transformation to '). Here, the projective transformation parameters a to h in the equation (5) can be obtained using the following equation (6).

式（６）から明らかなように、射影変換パラメータａ〜ｈは、少なくとも４組の対応点（ｘ１，ｙ１）と（ｘ１’、ｙ１’）〜（ｘ４，ｙ４）と（ｘ４’とｙ４’）が与えられれば求めることができる。補正部１０７は、手ぶれ補正前の各フレームの画像に含まれる特徴点ｐ^ｉと、これに対応する手ぶれ補正後のフレームの画像に含まれる手ぶれ補正後の特徴点〜ｐ^ｉと、を式（６）に代入することにより、射影変換パラメータａ〜ｈを取得し、射影変換を行う。 As is clear from the equation (6), the projective transformation parameters a to h include at least four pairs of corresponding points (x1, y1), (x1 ′, y1 ′) to (x4, y4), (x4 ′ and y4 ′). ) Can be obtained. Correcting unit 107, the feature point p ⁱ included in the image of each frame before camera shake compensation, a feature point ~p ⁱ after image stabilization included in the image frame after image stabilization corresponding thereto, of formula ( By substituting in 6), the projective transformation parameters a to h are acquired and the projective transformation is performed.

手ぶれ補正後の画像が手ぶれ補正後の撮像装置１の動きを表す平滑化された仮想特徴点軌道〜ｖによって定義されるため、手ぶれ補正後の特徴点〜ｐ^ｉは、仮想特徴点軌道ｖと平滑化された仮想特徴点軌道〜ｖとの間の関係に基づいて取得することができる。 Since the image after camera shake correction is defined by the virtual feature point trajectory ~v smoothed representing the motion of the imaging apparatus 1 after image stabilization feature point ~p ⁱ after camera shake correction, a virtual feature point trajectory v Based on the relationship between the smoothed virtual feature point trajectory ~ v.

具体的には、図６に示すように、手ぶれ補正前の画像と手ぶれ補正後の画像との間のエピポーラ関係を表す基礎行列〜Ｆが、仮想特徴点軌道ｖと平滑化された仮想特徴点軌道〜ｖとの間の関係に基づいて取得される。そして、図７に示すように、手ぶれ補正後の各フレームの画像に、前後５フレーム分の手ぶれ補正前の画像に含まれる特徴点ｐ^ｉを、取得された基礎行列〜Ｆに基づいてエピポーラ転送することにより、手ぶれ補正後の特徴点〜ｐ^ｉが生成される。 Specifically, as shown in FIG. 6, the basic matrix to F representing the epipolar relationship between the image before camera shake correction and the image after camera shake correction are virtual feature points smoothed with the virtual feature point trajectory v. Acquired based on the relationship between the trajectories ~ v. Then, as shown in FIG. 7, the image of each frame after camera shake compensation, a feature point p ⁱ included before and after 5 frames of image stabilization previous image, on the basis of the obtained fundamental matrix ~F epipolar transfer by feature points ~p ⁱ after camera shake correction is generated.

このように、補正部１０７は、仮想特徴点軌道ｖと平滑化された仮想特徴点軌道〜ｖとの間の関係に基づいて求めた射影変換パラメータを用いて射影変換を行うことにより、手ぶれ補正対象の動画を構成する各フレームの画像に手ぶれ補正を施す。上述したとおり、エピポーラ転送を用いて構築される仮想特徴点軌道ｖは長く（多くのフレームにわたって）連続しているため、大規模なフィルタを適用し（σ＝５０の大きなガウシアンカーネルを用いて時間方向に平滑化し）、手ぶれ補正後の画像を定義することができる。 As described above, the correction unit 107 performs the camera shake correction by performing the projective transformation using the projective transformation parameter obtained based on the relationship between the virtual feature point trajectory v and the smoothed virtual feature point trajectory to v. Camera shake correction is applied to each frame image constituting the target moving image. As described above, since the virtual feature point trajectory v constructed using epipolar transfer is long and continuous (over many frames), a large-scale filter is applied (time using a large Gaussian kernel with σ = 50). Smoothed in the direction), and an image after camera shake correction can be defined.

図１に戻って、ユーザインタフェース部３０は、表示部３１と、操作部３２と、外部インタフェース３３と、を含む。 Returning to FIG. 1, the user interface unit 30 includes a display unit 31, an operation unit 32, and an external interface 33.

表示部３１は、例えばＬＣＤ（ＬｉｑｕｉｄＣｒｙｓｔａｌＤｉｓｐｌｅｙ）やＣＲＴ（ＣａｔｈｏｄｅＲａｙＴｕｂｅ）、有機ＥＬ（ＥｌｅｃｔｒｏＬｕｍｉｎｅｓｃｅｎｃｅ）ディスプレイ等を備え、出力部２２から供給されたＲＧＢ信号に基づいて、撮像部１０により生成され、外部記憶部２３により記憶されている手ぶれ補正対象の動画、画像処理装置１００が手ぶれ補正対象の動画に手ぶれ補正処理を施すことにより生成した動画等を含む種々の動画像を表示する。 The display unit 31 includes, for example, an LCD (Liquid Crystal Display), a CRT (Cathode Ray Tube), an organic EL (Electro Luminescence) display, and the like. The display unit 31 is generated by the imaging unit 10 based on the RGB signal supplied from the output unit 22. Various moving images including a moving image to be subjected to camera shake correction stored in the external storage unit 23 and a moving image generated by the image processing apparatus 100 performing camera shake correction processing on the moving image to be subjected to camera shake correction are displayed.

操作部３２は、ユーザからの操作指示を受け付ける。操作部３２は、撮像装置１の電源スイッチ、シャッタボタン、撮像装置１の種々の機能を選択するためのボタン等、各種の操作ボタンを備える。操作部３２は、ユーザから操作指示を受け付けると、受け付けた指示情報を撮像部１０やデータ処理部２０のＣＰＵ２４等に供給する。 The operation unit 32 receives an operation instruction from the user. The operation unit 32 includes various operation buttons such as a power switch of the image pickup apparatus 1, a shutter button, and buttons for selecting various functions of the image pickup apparatus 1. When receiving an operation instruction from the user, the operation unit 32 supplies the received instruction information to the imaging unit 10, the CPU 24 of the data processing unit 20, and the like.

なお、表示部３１と操作部３２とは、互いに重畳して配置されたいわゆるタッチパネルによって構成されるものであってもよい。 In addition, the display part 31 and the operation part 32 may be comprised by what is called a touch panel arrange | positioned mutually superimposed.

外部インタフェース３３は、撮像装置１の外部の機器とデータをやり取りするためのインタフェースである。例えば、外部インタフェース３３は、画像処理装置１００が手ぶれ補正対象の動画に手ぶれ補正処理を施すことによって生成した動画を、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）規格のデータに変換して、ＵＳＢケーブルを介して外部の機器との間でデータを送受信する。 The external interface 33 is an interface for exchanging data with an external device of the imaging apparatus 1. For example, the external interface 33 converts a moving image generated by the image processing apparatus 100 performing a camera shake correction process on a moving image to be subjected to image stabilization to USB (Universal Serial Bus) standard data, and externally transmits the data via a USB cable. Send data to and receive data from other devices.

以下、撮像装置１及び画像処理装置１００が動画に手ぶれ補正を施す動作の詳細を、図８〜図１５を参照しながら説明する。 Hereinafter, details of the operation of the image capturing apparatus 1 and the image processing apparatus 100 for performing camera shake correction on a moving image will be described with reference to FIGS. 8 to 15.

撮像装置１が備える撮像部１０は、予め、被写体を撮像することにより、手ぶれ補正対象の動画を生成している。当該動画は、時間的に連続して撮像された複数フレームの画像を含んでいる。生成された動画は、外部記憶部２３によって記憶される。 The imaging unit 10 included in the imaging apparatus 1 generates a moving image to be subjected to camera shake correction by imaging a subject in advance. The moving image includes a plurality of frames of images captured continuously in time. The generated moving image is stored in the external storage unit 23.

ユーザは、動画の手ぶれを補正することを所望する場合、操作部３２を操作することにより、手ぶれ補正対象の動画のデータを主記憶部２１に展開する。そして、撮像装置１が備える複数の動作モードの１つである「手ぶれ補正モード」を選択する。 When the user desires to correct the camera shake of the moving image, the user operates the operation unit 32 to develop the moving image data to be corrected in the main storage unit 21. Then, the “camera shake correction mode” which is one of a plurality of operation modes provided in the imaging apparatus 1 is selected.

操作部３２が、ユーザによる「手ぶれ補正モード」を選択する操作を受け付けると、ＣＰＵ２４は、特徴点抽出プログラムや画像処理プログラムを含む、図８のフローチャートに示す手ぶれ補正処理を実行するためのプログラムを外部記憶部２３から読み出し、主記憶部２１に展開する。 When the operation unit 32 receives an operation for selecting the “camera shake correction mode” by the user, the CPU 24 executes a program for executing the camera shake correction process shown in the flowchart of FIG. 8, including a feature point extraction program and an image processing program. Read from the external storage unit 23 and expand in the main storage unit 21.

このような状態において、ユーザが、操作部３２を操作して手ぶれ補正の開始を指示すると、画像処理装置１００が、図８のフローチャートに示す手ぶれ補正処理を開始する。 In such a state, when the user operates the operation unit 32 to instruct the start of camera shake correction, the image processing apparatus 100 starts the camera shake correction process shown in the flowchart of FIG.

手ぶれ補正処理を開始すると、まず、指定部１０１が、手ぶれ補正開始フレームを指定する（ステップＳ１）。手ぶれ補正開始フレームは、手ぶれ補正対象の動画に含まれる複数のフレームのうち最初の（撮像時刻が最も古い）フレームである。以下、指定部１０１により指定されたフレームを、第ｔフレームと表記する。 When the camera shake correction process is started, the designation unit 101 first designates a camera shake correction start frame (step S1). The camera shake correction start frame is the first frame (having the oldest imaging time) among a plurality of frames included in a moving image to be subjected to camera shake correction. Hereinafter, the frame designated by the designation unit 101 is referred to as a t-th frame.

オプティカルフロー取得部１０２ａは、オプティカルフロー取得処理を実行する（ステップＳ２）。これにより、オプティカルフロー取得部１０２ａは、第ｔフレームより以前の１０フレーム（第（ｔ−１）フレーム〜第（ｔ−１０）フレーム）分の画像それぞれから互いに対応する特徴点ｐ^ｉを抽出する。そして、画像間の特徴点ｐ^ｉの移動ベクトルによって形成される画像間のオプティカルフローを取得すると共に、特徴点軌道ｐを取得する。 The optical flow acquisition unit 102a executes an optical flow acquisition process (step S2). As a result, the optical flow acquisition unit 102a extracts feature points p ⁱ corresponding to each other from images for 10 frames (the (t−1) th frame to the (t−10) th frame) before the tth frame. . Then, obtains the optical flow between images formed by the movement vector of the feature point p ⁱ between images, it obtains the feature point trajectory p.

以下、ステップＳ２のオプティカルフロー取得処理の詳細を、図９のフローチャートを参照しながら説明する。 Details of the optical flow acquisition process in step S2 will be described below with reference to the flowchart of FIG.

ステップＳ２のオプティカルフロー取得処理を開始すると、オプティカルフロー取得部１０２ａは、まず、図８のフローチャートのステップＳ１において指定部１０１により指定されたフレーム（第ｔフレーム）を第Ａフレームとして設定し（ステップＳ２０１）、第（ｔ−１０）フレーム（第ｔフレームの１０フレーム前のフレーム）を第Ｂフレームとして設定する（ステップＳ２０２）。 When the optical flow acquisition process of step S2 is started, the optical flow acquisition unit 102a first sets the frame (tth frame) specified by the specification unit 101 in step S1 of the flowchart of FIG. S201), the (t-10) th frame (the frame 10 frames before the tth frame) is set as the Bth frame (step S202).

次に、オプティカルフロー取得部１０２ａは、第Ａフレームの画像と第Ｂフレームの画像との間のオプティカルフローが、以前に実行されたオプティカルフロー取得処理によって取得済みであるか否かを判別する（ステップＳ２０３）。 Next, the optical flow acquisition unit 102a determines whether or not the optical flow between the image of the Ath frame and the image of the Bth frame has been acquired by the previously executed optical flow acquisition process ( Step S203).

取得済みであると判別すると（ステップＳ２０３；ＹＥＳ）、処理はステップＳ２０８へ移る。これにより、既に取得したオプティカルフローを重複して取得することを防止できる。 If it is determined that it has been acquired (step S203; YES), the process proceeds to step S208. Thereby, it is possible to prevent the already acquired optical flows from being acquired in duplicate.

オプティカルフローが未取得であると判別すると（ステップＳ２０３；ＮＯ）、オプティカルフロー取得部１０２ａは、第Ａフレームの画像から、ＫＬＴ法を用いて特徴点ｐ^ｉを抽出する（ステップＳ２０４）。 When optical flows is determined to be in non-acquired (step S203; NO), the optical flow obtaining unit 102a, the image of the A-frame, extracts feature points ^{p i} using KLT method (step S204).

そして、オプティカルフロー取得部１０２ａは、第Ｂフレームの画像から、ステップＳ２０４で第Ａフレームの画像から抽出した各特徴点ｐ^ｉに対応する特徴点ｐ^ｉを、ＫＬＴ法を用いて抽出する（ステップＳ２０５）。 Then, the optical flow acquisition unit 102a extracts feature points p ⁱ corresponding to the feature points p ⁱ extracted from the image of the A frame in step S204 from the image of the B frame using the KLT method (step S204). S205).

ステップＳ２０４及びステップＳ２０５で互いに対応する特徴点ｐ^ｉを抽出すると、オプティカルフロー取得部１０２ａは、抽出した特徴点ｐ^ｉ間のベクトル（特徴点ｐ^ｉの移動ベクトル）を取得することにより、これらの移動ベクトルによって形成される、第Ａフレームの画像と第Ｂフレームの画像との間のオプティカルフローを取得する（ステップＳ２０６）。また、オプティカルフロー取得部１０２ａは、ステップＳ２０４及びステップＳ２０５で抽出した特徴点ｐ^ｉによって構成される特徴点軌道ｐを作成する（ステップＳ２０７）。 When the feature points p ⁱ corresponding to each other are extracted in step S204 and step S205, the optical flow acquisition unit 102a acquires these vectors between the extracted feature points p ⁱ (movement vectors of the feature points p ⁱ ), thereby An optical flow between the image of the Ath frame and the image of the Bth frame formed by the movement vector is acquired (step S206). Also, the optical flow obtaining unit 102a creates a feature point trajectory p constituted by the feature point ^{p i} extracted in step S204 and step S205 (step S207).

オプティカルフローと特徴点軌道ｐとを取得した後、オプティカルフロー取得部１０２ａは、値Ｂを１だけインクリメントする（ステップＳ２０８）。すなわち、オプティカルフロー取得部１０２ａは、ステップＳ２０３〜Ｓ２０７の処理対象となる第Ｂフレームを、次の（次に撮像時刻が新しい）フレームに移す。例えば、直前に実行したステップＳ２０３〜Ｓ２０７の処理において、第（ｔ−１０）フレームが第Ｂフレームとして設定されていた場合、ステップＳ２０８において、第（ｔ−１０）フレームの次のフレームである第（ｔ−９）フレームが、新たに第Ｂフレームとして設定される。 After acquiring the optical flow and the feature point trajectory p, the optical flow acquisition unit 102a increments the value B by 1 (step S208). That is, the optical flow acquisition unit 102a moves the B-th frame to be processed in steps S203 to S207 to the next (next imaging time is the newest) frame. For example, if the (t-10) th frame is set as the Bth frame in the processing of steps S203 to S207 executed immediately before, the next frame after the (t-10) frame is set in step S208. (T-9) A frame is newly set as the Bth frame.

値Ｂをインクリメントした後、オプティカルフロー取得部１０２ａは、インクリメントされた後の値Ｂが値Ａに一致するか否か（第Ｂフレームと第Ａフレームとが同一であるか否か）を判別する（ステップＳ２０９）。 After incrementing the value B, the optical flow acquisition unit 102a determines whether the incremented value B matches the value A (whether the Bth frame and the Ath frame are the same). (Step S209).

インクリメント後の値Ｂが値Ａに一致しない（第Ｂフレームと第Ａフレームとが同一ではない）と判別すると（ステップＳ２０９；ＮＯ）、処理はステップＳ２０３へ戻る。 If it is determined that the incremented value B does not match the value A (the Bth frame and the Ath frame are not the same) (step S209; NO), the process returns to step S203.

すなわち、オプティカルフロー取得部１０２ａは、値Ｂを１ずつインクリメントしながら、インクリメント後の値Ｂが値Ａに一致する（第Ｂフレームと第Ａフレームとが同一である）と判別するまで（ステップＳ２０９においてＹＥＳと判別されるまで）ステップＳ２０３〜Ｓ２０８の処理を繰り返す。これにより、特徴点軌道ｐを取得すると共に、図１０に示すように、第Ａフレームの画像と、第ｔフレーム直前の１０フレームのうち第Ａフレームより以前のフレームの画像と、の間のオプティカルフローを取得する。 That is, the optical flow acquisition unit 102a increments the value B by 1 and determines that the incremented value B matches the value A (the Bth frame and the Ath frame are the same) (step S209). Steps S203 to S208 are repeated. As a result, the feature point trajectory p is obtained and, as shown in FIG. 10, an optical between the image of the Ath frame and the image of the frame before the Ath frame among the 10 frames immediately before the tth frame. Get the flow.

最終的に、インクリメント後の値Ｂが値Ａに一致する（第Ｂフレームと第Ａフレームとが同一である）と判別すると（ステップＳ２０９；ＹＥＳ）、オプティカルフロー取得部１０２ａは、値Ａを１だけディクリメントする（ステップＳ２１０）。すなわち、ステップＳ２０２〜Ｓ２０９の処理対象となる第Ａフレームを、直前の（次に撮像時刻が古い）フレームに移す。例えば、直前に実行したステップＳ２０２〜Ｓ２０９の処理において、第ｔフレームが第Ａフレームとして設定されていた場合、ステップＳ２１０において、第ｔフレームの直前のフレームである第（ｔ−１）フレームが、新たに第Ａフレームとして設定される。 When it is finally determined that the incremented value B matches the value A (the B-th frame and the A-th frame are the same) (step S209; YES), the optical flow acquisition unit 102a sets the value A to 1. Only decrement (step S210). That is, the Ath frame to be processed in steps S202 to S209 is moved to the immediately preceding frame (the next imaging time is the oldest). For example, if the t-th frame is set as the A-th frame in the processing of steps S202 to S209 executed immediately before, the (t−1) -th frame that is the frame immediately before the t-th frame is determined in step S210. A new frame A is set.

ステップＳ２１０で値Ａを１だけディクリメントした後、オプティカルフロー取得部１０２ａは、ディクリメント後の値Ａが値（ｔ−１０）に一致するか否かを判別する（ステップＳ２１１）。 After decrementing the value A by 1 in step S210, the optical flow acquisition unit 102a determines whether or not the decremented value A matches the value (t-10) (step S211).

ディクリメント後の値Ａが値（ｔ−１０）に一致しないと判別すると（ステップＳ２１１；ＮＯ）、処理はステップＳ２０２へ戻る。 If it is determined that the decremented value A does not match the value (t−10) (step S211; NO), the process returns to step S202.

すなわち、オプティカルフロー取得部１０２ａは、値Ａを１ずつディクリメントしながら、ディクリメント後の値Ａが値（ｔ−１０）に一致すると判別するまで（ステップＳ２１１においてＹＥＳと判別するまで）、ステップＳ２０２〜Ｓ２１０の処理を繰り返す。これにより、特徴点軌道ｐを取得すると共に、図１０に示すように、第ｔフレーム及び第ｔフレームの直前の１０フレームそれぞれについて、各フレームの画像と、各フレームより以前のフレームの画像と、の間のオプティカルフローを取得する。 That is, the optical flow acquisition unit 102a decrements the value A by 1 until it is determined that the decremented value A matches the value (t-10) (until YES in step S211). The processes of S202 to S210 are repeated. As a result, the feature point trajectory p is acquired, and as shown in FIG. 10, for each of the 10 frames immediately before the t-th frame and the t-th frame, an image of each frame, an image of a frame before each frame, Get the optical flow between.

ディクリメント後の値Ａが値（ｔ−１０）に一致すると判別すると（ステップＳ２１１；ＹＥＳ）、オプティカルフロー取得部１０２ａは、オプティカルフロー取得処理を終了する。 When it is determined that the decremented value A matches the value (t-10) (step S211; YES), the optical flow acquisition unit 102a ends the optical flow acquisition process.

図８のフローチャートに戻って、ステップＳ２のオプティカルフロー取得処理終了後、基礎行列取得部１０２は、基礎行列取得処理を実行する（ステップＳ３）。 Returning to the flowchart of FIG. 8, after the optical flow acquisition process of step S <b> 2 ends, the basic matrix acquisition unit 102 executes the basic matrix acquisition process (step S <b> 3).

以下、ステップＳ３の基礎行列取得処理の詳細を、図１１のフローチャートを参照しながら説明する。 Hereinafter, the details of the basic matrix acquisition process of step S3 will be described with reference to the flowchart of FIG.

ステップＳ３の基礎行列取得処理を開始すると、基礎行列取得部１０２は、まず、第（ｔ−１０）フレーム（第ｔフレームより１０フレーム前のフレーム）を第Ｇフレームとして設定する（ステップＳ３０１）。 When the basic matrix acquisition process in step S3 is started, the basic matrix acquisition unit 102 first sets the (t-10) th frame (a frame 10 frames before the tth frame) as the Gth frame (step S301).

次に、基礎行列取得部１０２は、オプティカルフロー取得処理により取得された、第ｔフレームの画像と第Ｇフレームの画像との間のオプティカルフローに基づいて、第Ｇフレームの画像と第ｔフレームの画像との間のエピポーラ関係を表す基礎行列Ｆ_G,tを取得する（ステップＳ３０２）。 Next, based on the optical flow between the t-th frame image and the G-th frame image acquired by the optical flow acquisition process, the basic matrix acquisition unit 102 performs the G-th frame image and the t-th frame. A basic matrix F _{G, t} representing an epipolar relationship with an image is acquired (step S302).

具体的には、基礎行列取得部１０２は、第Ｇフレームの画像と第ｔフレームの画像との間のオプティカルフローを形成する特徴点ｐ^ｉの移動ベクトルのうち、複数の特徴点ｐ^ｉの移動ベクトルの始点と終点の座標（例えば、同次座標）を対応点として用い、８ポイントアルゴリズムとＲＡＮＳＡＣ法により上述の式（２）で表される基礎行列Ｆの各パラメータを求める。 Specifically, the basic matrix acquisition unit 102 moves a plurality of feature points p ⁱ out of the movement vectors of the feature points p ⁱ that form an optical flow between the G-th frame image and the t-th frame image. Using the coordinates of the start point and end point of the vector (for example, homogeneous coordinates) as corresponding points, each parameter of the basic matrix F expressed by the above equation (2) is obtained by the 8-point algorithm and the RANSAC method.

ステップＳ３０２において基礎行列Ｆ_G,tを取得した後、基礎行列取得部１０２は、値Ｇを１だけインクリメントし（ステップＳ３０３）、インクリメント後の値Ｇが値ｔに一致するか否かを判別する（ステップＳ３０４）。一致しないと判別された場合（ステップＳ３０４；ＮＯ）、処理はステップＳ３０２へ戻る。 After acquiring the basic matrix F _{G, t} in step S302, the basic matrix acquisition unit 102 increments the value G by 1 (step S303), and determines whether or not the incremented value G matches the value t. (Step S304). If it is determined that they do not match (step S304; NO), the process returns to step S302.

すなわち、基礎行列取得部１０２は、値Ｇを１ずつインクリメントしながら、インクリメント後の値Ｇが値ｔに一致すると判別されるまで（ステップＳ３０４においてＹＥＳと判別されるまで）、ステップＳ３０２〜Ｓ３０３の処理を繰り返す。これにより、指定部１０１によって指定された第ｔフレームの画像と、指定されたフレームの画像の直前１０フレーム分の画像それぞれと、の間のエピポーラ関係を表す基礎行列Ｆ_t-10,t〜Ｆ_t-1,tを取得する。 That is, the basic matrix acquisition unit 102 increments the value G by 1 until it is determined that the incremented value G matches the value t (until determined as YES in step S304), steps S302 to S303. Repeat the process. As a result, the basic matrix F _{t-10, t} to F representing the epipolar relationship between the image of the t-th frame specified by the specifying unit 101 and each of the 10 frames immediately before the image of the specified frame. _{Get t-1, t} .

インクリメント後の値Ｇが値ｔに一致すると判別した場合（ステップＳ３０４；ＹＥＳ）、基礎行列取得部１０２は、基礎行列取得処理を終了する。 When it is determined that the incremented value G matches the value t (step S304; YES), the basic matrix acquisition unit 102 ends the basic matrix acquisition process.

図８のフローチャートに戻って、ステップＳ３の基礎行列取得処理が終了した後、選別部１０３は、フレーム選別処理を実行する（ステップＳ４）。 Returning to the flowchart of FIG. 8, after the basic matrix acquisition process in step S3 is completed, the selection unit 103 executes a frame selection process (step S4).

以下、ステップＳ４のフレーム選別処理の詳細について、図１２のフローチャートを参照しながら説明する。 Details of the frame selection process in step S4 will be described below with reference to the flowchart of FIG.

ステップＳ４のフレーム選別処理を開始すると、選別部１０３は、指定部１０１が指定したフレーム（第ｔフレーム）の１０フレーム前のフレーム（第（ｔ−１０）フレーム）を第Ｃフレームとして設定する（ステップＳ４０１）。 When the frame selection process in step S4 is started, the selection unit 103 sets the frame (the (t-10) frame) 10 frames before the frame (the t-th frame) specified by the specification unit 101 as the C-th frame ( Step S401).

次に、評価部１０８が、基礎行列取得処理によって取得された基礎行列のうち、第Ｃフレームの画像と第ｔフレームの画像と間の関係を表す基礎行列Ｆ_C,tと、第ｔフレームの画像との関係を表す他の基礎行列Ｆ_C+1,t〜Ｆ_t-1,tとの類似性を評価する（ステップＳ４０２）。 Next, the evaluation unit 108 selects the basic matrix F _{C, t} representing the relationship between the C-th frame image and the t-th frame image among the basic matrices acquired by the basic matrix acquisition process _, and the t-th frame. The similarity with other basic matrices F _{C + 1, t to} F _{t−1, t} representing the relationship with the image is evaluated (step S402).

具体的には図１３に示すように、評価部１０８は、第Ｃフレームの画像と第ｔフレームの画像との間の関係を表す基礎行列Ｆ_C,tと、第ｔフレームより以前の１０フレームのうち第Ｃフレームより後に撮像された（ｔ−Ｃ−１）個の各フレームの画像と第ｔフレームの画像との間の関係を表す基礎行列Ｆ_C+1,t〜Ｆ_t-1,tと、が類似しているか否かを順次判別していく。そして、基礎行列Ｆ_C+1,t〜Ｆ_t-1,tの中に、基礎行列Ｆ_C,tと類似しているものがあるか否かを評価する。評価部１０８による基礎行列の類似度の評価は、具体的には上述したように、異なる２つの基礎行列の間において、一方の基礎行列に含まれる各要素と、他方の基礎行列に含まれる各要素と、を比較することによって得られる。 Specifically, as illustrated in FIG. 13, the evaluation unit 108 calculates a basic matrix F _{C, t} representing the relationship between the C-th frame image and the t-th frame image, and 10 frames before the t-th frame. Among the basic matrices F _{C + 1, t to} F _t−1 representing the relationship between (t−C−1) frames of images taken after the C frame and images of the t frame _. It is sequentially determined whether or not _t is similar. Then, it is evaluated whether or not any of the basic matrices F _{C + 1, t to} F _t−1 _{, t} is similar to the basic matrix F _{C, t} . Specifically, as described above, the evaluation of the similarity of the base matrix by the evaluation unit 108 is performed between each element included in one base matrix and each base matrix included between two different base matrices. Obtained by comparing the elements.

類似性の評価の結果、選別部１０３は、基礎行列Ｆ_C,tと類似している基礎行列があるか否かを判別する（ステップＳ４０３）。判別の結果、基礎行列Ｆ_C,tと類似している基礎行列がある場合（ステップＳ４０３；ＹＥＳ）、選別部１０３は、第Ｃフレームを、第ｔフレームの画像へのエピポーラ転送をスキップするフレームとして、ＲＡＭ等のメモリに一時的に保存する（ステップＳ４０４）。 As a result of the similarity evaluation, the selection unit 103 determines whether there is a basic matrix similar to the basic matrix F _{C, t} (step S403). As a result of the determination, if there is a basic matrix similar to the basic matrix F _{C, t} (step S403; YES), the selection unit 103 skips the epipolar transfer of the Cth frame to the image of the tth frame. Is temporarily stored in a memory such as a RAM (step S404).

一方、基礎行列Ｆ_C,tと類似している基礎行列がない場合（ステップＳ４０３；ＮＯ）、選別部１０３は、第Ｃフレームはスキップするフレームではないと判別する。そのため、ステップＳ４０４の処理を実行せず、第Ｃフレームをスキップするフレームとしては保存しない。 On the other hand, when there is no basic matrix similar to the basic matrix F _{C, t} (step S403; NO), the selection unit 103 determines that the C-th frame is not a skipped frame. For this reason, the process of step S404 is not executed, and the C-th frame is not saved as a skipped frame.

このように第Ｃフレームをスキップするか否かを決定すると、選別部１０３は、値Ｃを１だけインリメントする（ステップＳ４０５）。そして、インクリメント後の値Ｃが値（ｔ−１）に一致するか否かを判別する（ステップＳ４０６）。一致しないと判別した場合（ステップＳ４０６；ＮＯ）、処理はステップＳ４０２へ戻る。 When determining whether or not to skip the C-th frame in this way, the selecting unit 103 increments the value C by 1 (step S405). Then, it is determined whether or not the incremented value C matches the value (t−1) (step S406). If it is determined that they do not match (step S406; NO), the process returns to step S402.

すなわち、選別部１０３は、値Ｃを１ずつインクリメントしながら、インクリメント後の値Ｃが値（ｔ−１）に達するまで、ステップＳ４０２〜Ｓ４０５の処理を繰り返す。これにより、図１３に示すように、第ｔフレームの直前の１０フレーム分それぞれについて、第Ｃフレームより後に撮像された各フレームの画像と第ｔフレームの画像との間の関係を表す基礎行列Ｆ_C+1,t〜Ｆ_t-1,tと、が類似しているか否かを順次判別していく。そして、基礎行列Ｆ_C+1,t〜Ｆ_t-1,tの中に基礎行列Ｆ_C,tと類似しているものがある場合に、第Ｃフレームを、第ｔフレームの画像へのエピポーラ転送をスキップするフレームとして保存していく。 That is, the selection unit 103 increments the value C by 1 and repeats the processes of steps S402 to S405 until the incremented value C reaches the value (t−1). As a result, as shown in FIG. 13, for each of the 10 frames immediately before the t-th frame, a basic matrix F representing the relationship between the image of each frame captured after the C-th frame and the image of the t-th frame. It is sequentially determined whether or not _{C + 1, t to} F _{t-1, t} are similar. Then, if there is something similar to the basic matrix F _{C, t} among the basic matrices F _{C + 1, t to} F _{t-1, t} , the C-th frame is converted to an epipolar to the image of the t-th frame. Save as a frame to skip the transfer.

最終的に、値Ｃが値（ｔ−１）に一致すると（ステップＳ４０６；ＹＥＳ）、第Ｃフレームが第（ｔ−１）フレームにまで達したため、基礎行列Ｆ_C,tとの類似度を評価する基礎行列がなくなる。そのため、選別部１０３は、フレーム選別処理を終了する。 Finally, when the value C matches the value (t−1) (step S406; YES), since the Cth frame has reached the (t−1) th frame, the similarity with the basic matrix F _{C, t} is determined. There is no basis matrix to evaluate. Therefore, the selection unit 103 ends the frame selection process.

図８のフローチャートに戻って、ステップＳ４のフレーム選別処理が終了した後、仮想特徴点生成部１０４は、仮想特徴点生成処理を実行する（ステップＳ５）。これにより、基礎行列取得処理によって取得された基礎行列Ｆに基づいて、ステップＳ１で指定部１０１によって指定されたフレーム（第ｔフレーム）より前の１０フレーム分の画像内の互いに対応する仮想特徴点ｖ^ｉを第ｔフレームの画像にエピポーラ転送することにより、第ｔフレームの画像内における仮想特徴点ｖ^ｉを生成する。 Returning to the flowchart of FIG. 8, after the frame selection process in step S4 is completed, the virtual feature point generation unit 104 executes the virtual feature point generation process (step S5). Thereby, based on the basic matrix F acquired by the basic matrix acquisition process, the virtual feature points corresponding to each other in the image for 10 frames before the frame (t-th frame) specified by the specifying unit 101 in step S1. The virtual feature points v ⁱ in the t-th frame image are generated by epipolar transfer of v ⁱ to the t-th frame image.

以下、ステップＳ５の仮想特徴点生成処理の詳細について、図１４のフローチャートを参照しながら説明する。 Hereinafter, the details of the virtual feature point generation processing in step S5 will be described with reference to the flowchart of FIG.

仮想特徴点生成処理においては、第ｔフレームより前の１０フレーム分の画像が含む仮想特徴点ｖ^ｉが第ｔフレームへ投影するエピポーラ線を求める必要がある。しかし、手ぶれ補正処理の初期段階においては、エピポーラ線の投影元となるべき過去のフレームの画像に含まれる仮想特徴点ｖ^ｉが未だ生成されていないことがある。そこで、仮想特徴点生成処理を開始すると、仮想特徴点生成部１０４は、まず、仮想特徴点軌道準備処理を実行する（ステップＳ５０１）。 In the virtual feature point generation process is a virtual feature point v ⁱ which image contains 10 frames before the t-th frame is necessary to obtain the epipolar line to be projected to the t frame. However, in the initial stage of the image stabilization process, it is the virtual feature point v ⁱ included in the image of the past frame to be the projection source epipolar line is not generated yet. Thus, when the virtual feature point generation process is started, the virtual feature point generation unit 104 first executes a virtual feature point trajectory preparation process (step S501).

以下、ステップＳ５０１の仮想特徴点軌道準備処理の詳細について、図１５のフローチャートを参照しながら説明する。 Details of the virtual feature point trajectory preparation process in step S501 will be described below with reference to the flowchart of FIG.

仮想特徴点軌道準備処理を開始すると、仮想特徴点生成部１０４は、まず、第ｔフレームより以前のフレームにおいて構築された仮想特徴点軌道ｖのうち何れか１つを選択する（ステップＳ５０１ａ）。 When the virtual feature point trajectory preparation process is started, the virtual feature point generation unit 104 first selects one of the virtual feature point trajectories v constructed in the frame before the t-th frame (step S501a).

そして、仮想特徴点生成部１０４は、選択した仮想特徴点軌道ｖが第（ｔ−１）フレームまで伸びているか否かを判別する（ステップＳ５０１ｂ）。すなわち、仮想特徴点生成部１０４は、選択した仮想特徴点軌道ｖが第（ｔ−１）フレームより前のフレームから、第（ｔ−１）フレームまで連続しているか否かを判別する。 Then, the virtual feature point generation unit 104 determines whether or not the selected virtual feature point trajectory v extends to the (t−1) th frame (step S501b). In other words, the virtual feature point generation unit 104 determines whether or not the selected virtual feature point trajectory v is continuous from the frame before the (t−1) frame to the (t−1) frame.

仮想特徴点軌道ｖが第（ｔ−１）フレームまで伸びていると判別した場合（ステップＳ５０１ｂ；ＹＥＳ）、仮想特徴点生成部１０４は、選択した仮想特徴点軌道ｖを仮想特徴点生成のための初期軌道として用いることができるので、処理はステップＳ５０１ｄへ移る。 If it is determined that the virtual feature point trajectory v extends to the (t−1) th frame (step S501b; YES), the virtual feature point generation unit 104 uses the selected virtual feature point trajectory v to generate a virtual feature point. Therefore, the process moves to step S501d.

一方、仮想特徴点軌道ｖが第（ｔ−１）フレームまで伸びていないと判別された場合（ステップＳ５０１ｂ；ＮＯ）、仮想特徴点生成部１０４は、第ｔフレームの直前に撮像された５フレーム分の画像から取得された特徴点軌道ｐを仮想特徴点軌道ｖにコピーする（ステップＳ５０１ｃ）。すなわち、仮想特徴点生成部１０４は、第（ｔ−１）〜第（ｔ−５）フレームの画像から抽出済みの特徴点ｐ^ｉを、これらの画像中の仮想特徴点ｖ^ｉと見なし、仮想特徴点生成のための初期軌道を取得する。 On the other hand, when it is determined that the virtual feature point trajectory v does not extend to the (t−1) th frame (step S501b; NO), the virtual feature point generation unit 104 captures the five frames captured immediately before the tth frame. The feature point trajectory p acquired from the minute image is copied to the virtual feature point trajectory v (step S501c). That is, the virtual feature point generation unit 104 regards the feature points p ⁱ already extracted from the images of the (t−1) th to (t-5) th frames as the virtual feature points v ⁱ in these images, Get initial trajectory for feature point generation.

本実施形態では、１００本の仮想特徴点軌道ｖにより、手ぶれ補正前の撮像装置１の動きを定義する（手ぶれ補正前の各フレームの画像を定義する）。そこで、ステップＳ５０１ｄにおいて、仮想特徴点生成部１０４は、１００個の仮想特徴点軌道ｖに対してステップＳ５０１ｂ〜Ｓ５０１ｃの処理を施したか否か判別する（ステップＳ５０１ｄ）。１００個の仮想特徴点軌道ｖに処理を施していないと判別すると（ステップＳ５０１ｄ；ＮＯ）、処理はステップＳ５０１ａへ戻り、仮想特徴点生成部１０４は、別の仮想特徴点軌道ｖを選択してステップＳ５０１ｂ〜Ｓ５０１ｃの処理を実行する。 In this embodiment, the motion of the imaging device 1 before camera shake correction is defined by 100 virtual feature point trajectories v (an image of each frame before camera shake correction is defined). Therefore, in step S501d, the virtual feature point generation unit 104 determines whether or not the processing of steps S501b to S501c has been performed on 100 virtual feature point trajectories v (step S501d). If it is determined that the process has not been performed on 100 virtual feature point trajectories v (step S501d; NO), the process returns to step S501a, and the virtual feature point generation unit 104 selects another virtual feature point trajectory v. Steps S501b to S501c are executed.

最終的に、１００個の仮想特徴点軌道ｖに処理を施したと判別すると（ステップＳ５０１ｄ；ＹＥＳ）、仮想特徴点生成部１０４は、仮想特徴点軌道準備処理を終了する。 When it is finally determined that 100 virtual feature point trajectories v have been processed (step S501d; YES), the virtual feature point generation unit 104 ends the virtual feature point trajectory preparation process.

図１４のフローチャートに戻って、ステップＳ５０１の仮想特徴点軌道準備処理を終了した後、仮想特徴点生成部１０４は、仮想特徴点軌道準備処理を施された１００本の仮想特徴点軌道ｖのうち何れか１つを選択する（ステップＳ５０２）。 Returning to the flowchart of FIG. 14, after the virtual feature point trajectory preparation process in step S <b> 501 is completed, the virtual feature point generation unit 104 out of the 100 virtual feature point trajectories v subjected to the virtual feature point trajectory preparation process. Either one is selected (step S502).

次に、仮想特徴点生成部１０４は、第（ｔ−１０）フレーム（第ｔフレームより１０フレーム前のフレーム）を第Ｈフレームとして設定し（ステップＳ５０３）、第Ｈフレームが、ステップＳ４におけるフレーム選別処理において、非スキップ対象と評価されたか否かを判別する（ステップＳ５０４）。 Next, the virtual feature point generation unit 104 sets the (t-10) th frame (a frame 10 frames before the tth frame) as the Hth frame (step S503), and the Hth frame is the frame in step S4. In the sorting process, it is determined whether or not it is evaluated as a non-skip target (step S504).

第Ｈフレームの画像と第ｔフレームの画像との間の関係を表す基礎行列が、第Ｈフレームから第ｔフレームまでの他のフレームの画像と第ｔフレームの画像との関係を表す基礎行列の何れかに類似している場合、第Ｈフレームは、フレーム選別処理により、スキップ対象と評価されている。この場合、仮想特徴点生成部１０４は、第Ｈフレームを非スキップ対象と評価されていないと判別し（ステップＳ５０４；ＮＯ）、処理はステップＳ５０６へ移る。 The basic matrix representing the relationship between the image of the Hth frame and the image of the tth frame is a basic matrix representing the relationship between the image of the other frame from the Hth frame to the tth frame and the image of the tth frame. If they are similar to any of them, the H-th frame is evaluated as a skip target by the frame selection process. In this case, the virtual feature point generation unit 104 determines that the H-th frame is not evaluated as a non-skip target (step S504; NO), and the process proceeds to step S506.

第Ｈフレームの画像と第ｔフレームの画像との間の関係を表す基礎行列が、第Ｈフレームから第ｔフレームまでの他のフレームの画像と第ｔフレームの画像との関係を表す基礎行列の何れにも類似していなかった場合、第Ｈフレームは、フレーム選別処理により、非スキップ対象と評価されている。この場合、仮想特徴点生成部１０４は、非スキップ対象と評価されていると判別する（ステップＳ５０４；ＹＥＳ）。そして、仮想特徴点生成部１０４は、ステップＳ３における基礎行列取得処理によって取得された、第ｔフレームの画像と第Ｈフレームの画像との間のエピポーラ関係を表す基礎行列Ｆ_ｔ、Ｈに基づいて、選択した仮想特徴点軌道ｖに含まれる第Ｈフレームの画像の仮想特徴点ｖ^ｉから、第ｔフレームの画像へ、エピポーラ線を投影する（ステップＳ５０５）。 The basic matrix representing the relationship between the image of the Hth frame and the image of the tth frame is a basic matrix representing the relationship between the image of the other frame from the Hth frame to the tth frame and the image of the tth frame. If they are not similar to each other, the H-th frame is evaluated as a non-skip target by the frame selection process. In this case, the virtual feature point generation unit 104 determines that it is evaluated as a non-skip target (step S504; YES). Then, the virtual feature point generation unit 104 is based on the basic matrices F _{t and H} representing the epipolar relationship between the image of the t th frame and the image of the H frame acquired by the basic matrix acquisition process in step S3. , from the virtual feature point v ⁱ of the image of the H frame included in the virtual feature point trajectory v selected, the image of the t frame, projecting the epipolar line (step S505).

エピポーラ線を取得した後、仮想特徴点生成部１０４は、値Ｈを１だけインクリメントし（ステップＳ５０６）、インクリメント後の値Ｈが値ｔに一致するか否かを判別する（ステップＳ５０７）。一致しないと判別すると（ステップＳ５０７；ＮＯ）、処理はステップＳ５０４へ戻る。 After acquiring the epipolar line, the virtual feature point generation unit 104 increments the value H by 1 (step S506), and determines whether the incremented value H matches the value t (step S507). If it is determined that they do not match (step S507; NO), the process returns to step S504.

すなわち、仮想特徴点生成部１０４は、値Ｈを１ずつインクリメントしながら、インクリメント後の値Ｈが値ｔに一致すると判別されるまで（ステップＳ５０７においてＹＥＳと判別されるまで）、ステップＳ５０４〜Ｓ５０６の処理を繰り返す。これにより、第ｔフレームより前の１０フレームのうち非スキップ対象と判別された各フレームの画像に含まれる仮想特徴点ｖ^ｉから第ｔフレームの画像へ、順次エピポーラ線を投影する。 That is, the virtual feature point generation unit 104 increments the value H by 1 until it is determined that the incremented value H matches the value t (until determined as YES in step S507), steps S504 to S506. Repeat the process. Thus, from the virtual feature point v ⁱ included in the image of each frame is determined as a non-skipped among the 10 frames before the t-th frame to the image of the t frame, projecting the sequential epipolar line.

最終的に、インクリメント後の値Ｈが値ｔに一致すると判別した場合（ステップＳ５０７；ＹＥＳ）、仮想特徴点生成部１０４は、第ｔフレームの画像に投影した複数のエピポーラ線から任意の１組を選択する（ステップＳ５０８）。そして、選択した１組のエピポーラ線が互いに成す角が１．５°以上であるか否かを判別する（ステップＳ５０９）。 Finally, when it is determined that the incremented value H matches the value t (step S507; YES), the virtual feature point generation unit 104 selects an arbitrary set from a plurality of epipolar lines projected on the image of the t-th frame. Is selected (step S508). Then, it is determined whether or not the angle formed by the selected pair of epipolar lines is 1.5 ° or more (step S509).

１組のエピポーラ線が平行に近いほど、これらのエピポーラ線が与える交点の信頼度は低い。そこで、仮想特徴点生成部１０４は、一定以上に大きな角を成すエピポーラ線の組が与える交点の座標のみを取得することにより、信頼性の低い交点を排除する。 The closer a pair of epipolar lines are to parallel, the lower the reliability of the intersection given by these epipolar lines. Therefore, the virtual feature point generation unit 104 eliminates intersections with low reliability by acquiring only the coordinates of the intersections given by a pair of epipolar lines that form a corner larger than a certain angle.

具体的には、成す角が１．５°を下回ると判別すると（ステップＳ５０９；ＮＯ）、処理はステップＳ５１１へ移る。一方、成す角が１．５°以上であると判別すると（ステップＳ５０９；ＹＥＳ）、仮想特徴点生成部１０４は、これらのエピポーラ線の交点の座標を取得し（ステップＳ５１０）、処理はステップＳ５１１へ移る。 Specifically, if it is determined that the formed angle is less than 1.5 ° (step S509; NO), the process proceeds to step S511. On the other hand, if it is determined that the formed angle is 1.5 ° or more (step S509; YES), the virtual feature point generation unit 104 acquires the coordinates of the intersections of these epipolar lines (step S510), and the process proceeds to step S511. Move on.

ステップＳ５１１において、仮想特徴点生成部１０４は、第ｔフレームの画像に投影した複数のエピポーラ線から選択可能な全ての組み合わせを選択したか否かを判別する（ステップＳ５１１）。選択されていない組み合わせがあると判別すると（ステップＳ５１１；ＮＯ）、処理はステップＳ５０８へ戻り、仮想特徴点生成部１０４は、未だ選択されていない組み合わせを選択して、同様の処理を実行する。 In step S511, the virtual feature point generation unit 104 determines whether all selectable combinations have been selected from the plurality of epipolar lines projected on the image of the t-th frame (step S511). If it is determined that there is a combination that has not been selected (step S511; NO), the process returns to step S508, and the virtual feature point generation unit 104 selects a combination that has not yet been selected and executes the same process.

選択可能な全ての組み合わせが選択されたと判別すると（ステップＳ５１１；ＹＥＳ）、仮想特徴点生成部１０４は、取得された全ての交点座標の平均値を求め、この平均値と、取得された全ての交点座標の中央値と、の間の距離が５画素より小さいか否か判別する（ステップＳ５１２）。 When it is determined that all selectable combinations have been selected (step S511; YES), the virtual feature point generation unit 104 obtains an average value of all the acquired intersection coordinates, and this average value and all the acquired It is determined whether or not the distance between the median value of the intersection coordinates is smaller than 5 pixels (step S512).

具体的には、平均値と中央値との間の距離が５画素より小さいと判別すると（ステップＳ５１２；ＹＥＳ）、仮想特徴点生成部１０４は、平均値を第ｔフレーム中の仮想特徴点ｖ^ｉの座標として取得し（ステップＳ５１３）、処理はステップＳ５１４へ移る。 Specifically, when it is determined that the distance between the average value and the median value is smaller than 5 pixels (step S512; YES), the virtual feature point generation unit 104 determines the average value as the virtual feature point v in the t-th frame. Obtained as the coordinates of ⁱ (step S513), and the process proceeds to step S514.

平均値が中央値から５画素以上離れていると判別すると（ステップＳ５１２；ＮＯ）、処理はステップＳ５１４へ移る。すなわち、交点座標の平均値が中央値から５画素以上離れている場合、トラッキングエラーである可能性が高い。そこで、中央値から５画素以上離れている平均値の取得をスキップすることにより、信頼性に欠ける交点を排除する。 If it is determined that the average value is 5 pixels or more away from the median value (step S512; NO), the process proceeds to step S514. That is, when the average value of the intersection coordinates is 5 pixels or more away from the median value, there is a high possibility of a tracking error. Therefore, by skipping the acquisition of an average value that is 5 pixels or more away from the median value, intersections lacking in reliability are eliminated.

ステップＳ５１４において、仮想特徴点生成部１０４は、ステップＳ５０１の仮想特徴点軌道準備処理により処理された１００本の仮想特徴点軌道ｖ全てが選択されたか否かを判別する（ステップＳ５１４）。未だ選択されていない仮想特徴点軌道ｖがあると判別すると（ステップＳ５１４；ＮＯ）、処理はステップＳ５０２へ戻り、仮想特徴点生成部１０４は、未選択の仮想特徴点軌道ｖのうち何れか１本を選択して、同様の仮想特徴点生成処理を実行する。 In step S514, the virtual feature point generation unit 104 determines whether all 100 virtual feature point trajectories v processed by the virtual feature point trajectory preparation process in step S501 have been selected (step S514). If it is determined that there is an unselected virtual feature point trajectory v (step S514; NO), the process returns to step S502, and the virtual feature point generation unit 104 selects any one of the unselected virtual feature point trajectories v. A book is selected and a similar virtual feature point generation process is executed.

最終的に、全ての仮想特徴点軌道ｖが選択済みであると判別すると（ステップＳ５１４；ＹＥＳ）、仮想特徴点生成部１０４は、仮想特徴点生成処理を終了する。 Finally, when it is determined that all the virtual feature point trajectories v have been selected (step S514; YES), the virtual feature point generation unit 104 ends the virtual feature point generation process.

図８のフローチャートに戻って、ステップＳ５の仮想特徴点生成処理が終了した後、仮想特徴点軌道構築部１０５は、仮想特徴点生成処理によって生成された仮想特徴点ｖ^ｉを、仮想特徴点軌道ｖを構成する点に追加することにより、仮想特徴点軌道ｖを延長する（ステップＳ６）。 Returning to the flowchart of FIG. 8, after the virtual feature point generation process in step S5 is completed, the virtual feature point trajectory construction unit 105, a virtual feature point v ⁱ generated by the virtual feature point generation process, the virtual feature point trajectory By adding to the points constituting v, the virtual feature point trajectory v is extended (step S6).

そして、仮想特徴点軌道構築部１０５は、指定部１０１が指定したフレーム（第ｔフレーム）が手ぶれ補正終了フレームであるか否かを判別する（ステップＳ７）。手ぶれ補正終了フレームは、手ぶれ補正対象の動画が含む複数のフレームのうち最後の（撮像時刻が最も新しい）フレームである。 Then, the virtual feature point trajectory construction unit 105 determines whether or not the frame (tth frame) designated by the designation unit 101 is a camera shake correction end frame (step S7). The camera shake correction end frame is the last frame (the latest imaging time) among a plurality of frames included in the moving image to be corrected for camera shake.

第ｔフレームが手ぶれ補正終了フレームではないと判別すると（ステップＳ７；ＮＯ）、仮想特徴点軌道構築部１０５は、指定部１０１に、直前に実行されたステップＳ２〜Ｓ６の処理において第ｔフレームとして設定されていたフレームの次の（次に撮像時刻が新しい）フレームを、新たな第ｔフレームとして設定させ（ステップＳ１３）、処理はステップＳ２へ戻る。 When it is determined that the t-th frame is not the camera shake correction end frame (step S7; NO), the virtual feature point trajectory construction unit 105 causes the designation unit 101 to execute the t-th frame as the t-th frame in the processing of steps S2 to S6 executed immediately before. The frame next to the set frame (the next imaging time is the newest) is set as a new t-th frame (step S13), and the process returns to step S2.

すなわち、仮想特徴点軌道構築部１０５は、手ぶれ補正対象の動画が含む複数フレームの画像の中から指定部１０１が指定する画像を変えて、ステップＳ２〜Ｓ６の処理を繰り返す。これにより、仮想特徴点軌道構築部１０５は、仮想特徴点軌道ｖを構築する。 That is, the virtual feature point trajectory constructing unit 105 changes the image designated by the designation unit 101 from among a plurality of frames included in the moving image subject to camera shake correction, and repeats the processes of steps S2 to S6. Thereby, the virtual feature point trajectory construction unit 105 constructs a virtual feature point trajectory v.

第ｔフレームが手ぶれ補正終了フレームであると判別されると（ステップＳ７；ＹＥＳ）、平滑化部１０６が、構築された仮想特徴点軌道ｖをσ＝５０のガウシアンカーネルにより時間方向に平滑化し、平滑化された仮想特徴点軌道〜ｖを取得する（ステップＳ８）。 If it is determined that the t-th frame is a camera shake correction end frame (step S7; YES), the smoothing unit 106 smoothes the constructed virtual feature point trajectory v in the time direction by a Gaussian kernel with σ = 50, The smoothed virtual feature point trajectory ~ v is acquired (step S8).

そして、補正部１０７は、仮想特徴点軌道ｖと平滑化された仮想特徴点軌道〜ｖとの間の関係に基づいて、手ぶれ補正前の各フレームの画像と手ぶれ補正後の各フレームの画像との間のエピポーラ関係を表す基礎行列〜Ｆを取得する（ステップＳ９）。 Then, the correcting unit 107, based on the relationship between the virtual feature point trajectory v and the smoothed virtual feature point trajectory ~ v, the image of each frame before camera shake correction and the image of each frame after camera shake correction, Basic matrix ~ F representing the epipolar relationship between the two is acquired (step S9).

具体的には、補正部１０７は、仮想特徴点軌道ｖを形成する仮想特徴点ｖ^ｉと、平滑化された仮想特徴点軌道〜ｖを形成する手ぶれ補正後の仮想特徴点〜ｖ^ｉと、を対応点として用い、８ポイントアルゴリズムとＲＡＮＳＡＣ法により基礎行列〜Ｆを求める。 Specifically, the correction unit 107, a virtual feature point v ⁱ to form a virtual feature point trajectory v, and a virtual feature point to v ⁱ after image stabilization to form a virtual feature point trajectory to v smoothed, Are used as corresponding points, and the basic matrix ~ F is obtained by the 8-point algorithm and the RANSAC method.

そして、補正部１０７は、ステップＳ９で取得した基礎行列〜Ｆに基づいてエピポーラ転送を行うことにより、手ぶれ補正後の特徴点軌道〜ｐを取得する（ステップＳ１０）。 And the correction | amendment part 107 acquires the feature point trajectory -p after camera-shake correction | amendment by performing epipolar transfer based on the basic matrix -F acquired at step S9 (step S10).

具体的には、補正部１０７は、手ぶれ補正後の各フレームの画像に、手ぶれ補正前の前後５フレームの画像に含まれる特徴点ｐ^ｉが投影するエピポーラ線を、基礎行列〜Ｆに基づいて取得する。そして、図６に示すように、エピポーラ線の交点により定まる点を、手ぶれ補正後の各フレームの画像が含む手ぶれ補正後の特徴点〜ｐ^ｉとして取得する。手ぶれ補正後の特徴点軌道〜ｐは、手ぶれ補正後の特徴点〜ｐ^ｉによって構成される軌道として取得される。 Specifically, the correction unit 107 calculates epipolar lines projected by the feature points p ⁱ included in the images of the five frames before and after the camera shake correction on the image of each frame after the camera shake correction based on the basic matrixes ~ F. get. Then, as shown in FIG. 6, the point determined by the intersection of the epipolar line, obtained as feature points ~p ⁱ after anti-shake image of each frame after image stabilization can contain. Feature point trajectory ~p after image stabilization is obtained as a track constituted by the feature point ~p ⁱ after camera shake compensation.

補正部１０７は、取得された手ぶれ補正後の特徴点軌道〜ｐをσ＝６のガウシアンカーネルを用いて時間方向に平滑化することにより、高周波ジッタを除去する（ステップＳ１１）。 The correction unit 107 removes high-frequency jitter by smoothing the acquired feature point trajectory to p after camera shake correction using the Gaussian kernel with σ = 6 in the time direction (step S11).

補正部１０７は、手ぶれ補正対象の動画が含む各フレームの画像に対し、特徴点軌道ｐと手ぶれ補正後の特徴点軌道〜ｐとの間の関係に基づいて射影変換を実行することにより手ぶれを補正する（ステップＳ１２）。以上により、図８のフローチャートに示した手ぶれ補正処理を終了する。 The correction unit 107 performs the camera shake by performing projective transformation on the image of each frame included in the image to be corrected for camera shake based on the relationship between the feature point trajectory p and the feature point trajectory to p after the camera shake correction. Correction is performed (step S12). Thus, the camera shake correction process shown in the flowchart of FIG. 8 ends.

具体的には、補正部１０７は、特徴点軌道ｐを構成する特徴点ｐ^ｉの座標と、手ぶれ補正後の特徴点軌道〜ｐを構成する手ぶれ補正後の特徴点〜ｐ^ｉの座標と、を対応点として式（６）に代入することにより射影変換パラメータを求める。そして、手ぶれ補正対象の動画を構成する複数の画像それぞれを射影変換し、射影変換された複数の画像によって構成される動画を、手ぶれを補正した動画として生成する。 Specifically, the correction unit 107, the coordinates of the feature point p ⁱ that make up the feature point trajectory p, and the feature point ~p ⁱ of coordinates after camera shake compensation constituting the feature point trajectory ~p after camera shake compensation, Is assigned to the equation (6) as a corresponding point to obtain a projective transformation parameter. Then, each of a plurality of images constituting the moving image subject to camera shake correction is subjected to projective transformation, and a moving image composed of the plurality of images subjected to the projective transformation is generated as a moving image in which camera shake is corrected.

以上説明したように、本実施形態に係る撮像装置１及び画像処理装置１００は、エピポーラ幾何を利用することにより、視差の影響を考慮した手ぶれ補正を動画に対して施すことができる。 As described above, the imaging apparatus 1 and the image processing apparatus 100 according to the present embodiment can perform camera shake correction in consideration of the influence of parallax on a moving image by using epipolar geometry.

さらに、本実施形態に係る撮像装置１及び画像処理装置１００は、手ぶれ補正対象の動画を構成する各画像間の関係を表す基礎行列の類似度を評価して、評価結果に基づいてエピポーラ投影の計算を実行する画像を選別している。これにより、精度の良くないエピポーラ転送を行うフレームを事前に選別して、エピポーラ転送の計算をスキップできるため、計算量を削減できる。そのため、手ぶれ補正処理の時間短縮につながる。 Furthermore, the imaging apparatus 1 and the image processing apparatus 100 according to the present embodiment evaluate the similarity of the basic matrix that represents the relationship between the images constituting the image to be corrected for camera shake, and perform epipolar projection based on the evaluation result. The images to be calculated are selected. As a result, it is possible to select in advance a frame for performing epipolar transfer with low accuracy and skip the calculation of epipolar transfer, thereby reducing the amount of calculation. As a result, the camera shake correction processing time is shortened.

以上に本発明の実施形態について説明したが、これらの実施形態は一例であり、本発明の適用範囲はこれに限られない。すなわち、本発明の実施形態は種々の応用が可能であり、あらゆる実施の形態が本発明の範囲に含まれる。 Although the embodiments of the present invention have been described above, these embodiments are merely examples, and the scope of application of the present invention is not limited thereto. That is, the embodiments of the present invention can be applied in various ways, and all the embodiments are included in the scope of the present invention.

上記実施形態において、画像処理装置１００は、撮像装置１の内部に具備されていた。しかし、本発明に係る画像処理装置は、撮像装置から独立した装置であってもよい。例えば、コンピュータ等の情報処理装置が、本発明に係る画像処理装置として機能することができる。この場合、画像処理装置は、外部の撮像装置が撮像した動画を手ぶれ補正対象の動画として取得し、上述の手ぶれ補正を施せばよい。 In the above embodiment, the image processing apparatus 100 is provided inside the imaging apparatus 1. However, the image processing apparatus according to the present invention may be an apparatus independent of the imaging apparatus. For example, an information processing apparatus such as a computer can function as the image processing apparatus according to the present invention. In this case, the image processing apparatus may acquire a moving image captured by an external imaging device as a moving image to be subjected to camera shake correction, and perform the above-described camera shake correction.

上記実施形態において、画像処理装置１００は、撮像部１０が被写体を撮像することにより生成し外部記憶部２３に記憶されていた動画を、外部記憶部２３から手ぶれ補正の対象として取得した。撮像装置１は、外部記憶部２３からではなく、外部の画像入力装置（例えば、デジタルカメラやメモリカード、ネットワーク）から手ぶれ補正対象の動画を予め取得し、記憶しておいてもよい。 In the above-described embodiment, the image processing apparatus 100 acquires, from the external storage unit 23, a camera shake correction target as a moving image generated by the imaging unit 10 capturing an image of a subject and stored in the external storage unit 23. The imaging device 1 may acquire and store in advance a camera shake correction target moving image from an external image input device (for example, a digital camera, a memory card, or a network) instead of from the external storage unit 23.

上記実施形態では、ＫＬＴ法を用いて各フレームの画像から互いに対応する特徴点ｐ^ｉを抽出した。しかし、本発明では、ＫＬＴ法以外の方法を用いて特徴点ｐ^ｉを抽出してもよい。例えば、Ｈａｒｒｉｓオペレータ、ＳＵＳＡＮオペレータ、Ｆｏｅｒｓｔｎｅｒオペレータ、Ｓｏｊａｋオペレータ、ＳＩＦＴ等を用いることができる。 In the above embodiment, feature points p ⁱ corresponding to each other are extracted from the image of each frame using the KLT method. However, in the present invention, the feature points p ⁱ may be extracted using a method other than the KLT method. For example, a Harris operator, a SUSAN operator, a Forersner operator, a Sojak operator, SIFT, or the like can be used.

上記実施形態では、仮想特徴点軌道ｖをガウシアンカーネルで平滑化することにより、平滑化された仮想特徴点軌道〜ｖを取得した。しかし、本発明では、平滑化は、ガウシアンカーネル以外の関数（例えば、ラプラシアンフィルタ）を用いて行ってもよい。 In the above-described embodiment, the virtual feature point trajectory v is smoothed by the Gaussian kernel to obtain the smoothed virtual feature point trajectory to v. However, in the present invention, smoothing may be performed using a function other than the Gaussian kernel (for example, a Laplacian filter).

上記実施形態では、手ぶれ補正対象の画像全体に対して一括して射影変換を施すことにより手ぶれ補正を実行した。しかし、本発明では、画像が含む複数の画像領域それぞれに射影変換を施し、射影変換後の画像領域を再結合することにより手ぶれ補正を施すこともできる。 In the embodiment described above, camera shake correction is performed by performing projective transformation on the entire image to be corrected for camera shake in a lump. However, in the present invention, it is also possible to perform camera shake correction by performing projective transformation on each of a plurality of image regions included in an image and recombining the image regions after the projective transformation.

上記実施形態では、本発明に係る画像処理装置、撮像装置及び画像処理方法を、撮像装置１及び画像処理装置１００を例に用いて説明した。本発明に係る画像処理装置、撮像装置、及び画像処理方法は、コンピュータ、携帯電話機、デジタルカメラ、ＰＤＡ（ＰｅｒｓｏｎａｌＤｉｇｉｔａｌＡｓｓｉｓｔａｎｃｅ）等の任意の電子機器によって実現することができる。 In the above embodiment, the image processing apparatus, the imaging apparatus, and the image processing method according to the present invention have been described using the imaging apparatus 1 and the image processing apparatus 100 as examples. The image processing apparatus, the imaging apparatus, and the image processing method according to the present invention can be realized by an arbitrary electronic device such as a computer, a mobile phone, a digital camera, or a PDA (Personal Digital Assistance).

具体的には、コンピュータ、携帯電話機、デジタルカメラ、ＰＤＡ等を本発明に係る撮像装置及び画像処理装置として動作させるためのプログラムを、これらの電子機器が読み取り可能な記録媒体（例えば、メモリカードやＣＤ−ＲＯＭ（ＣｏｍｐａｃｔＤｉｓｃＲｅａｄ−ＯｎｌｙＭｅｍｏｒｙ）、ＤＶＤ−ＲＯＭ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃＲｅａｄ−ＯｎｌｙＭｅｍｏｒｙ）等）に格納して配布し、インストールすることにより本発明に係る撮像装置及び画像処理装置を実現することができる。 Specifically, a program for causing a computer, a mobile phone, a digital camera, a PDA, etc. to operate as an imaging apparatus and an image processing apparatus according to the present invention is recorded on a recording medium (for example, a memory card or An image pickup apparatus and an image processing apparatus according to the present invention are realized by storing, distributing, and installing on a CD-ROM (Compact Disc Read-Only Memory), DVD-ROM (Digital Versatile Disc Read-Only Memory), and the like. be able to.

あるいは、上記プログラムを、インターネット等の通信ネットワーク上のサーバ装置が有する記憶装置（例えば、ディスク装置等）に格納しておき、コンピュータ、携帯電話機、デジタルカメラ、ＰＤＡ等がこのプログラムをダウンロードすることによって本発明に係る撮像装置及び画像処理装置を実現してもよい。 Alternatively, the above program is stored in a storage device (for example, a disk device) included in a server device on a communication network such as the Internet, and the computer, mobile phone, digital camera, PDA, or the like downloads this program. You may implement | achieve the imaging device and image processing apparatus which concern on this invention.

また、本発明に係る撮像装置及び画像処理装置の機能を、オペレーティングシステム（ＯＳ：ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）とアプリケーションプログラムとの協働又は分担により実現する場合には、アプリケーションプログラム部分のみを記録媒体や記憶装置に格納してもよい。 Further, when the functions of the imaging apparatus and the image processing apparatus according to the present invention are realized by cooperation or sharing of an operating system (OS) and an application program, only the application program portion is recorded on a recording medium or a storage device. May be stored.

また、アプリケーションプログラムを搬送波に重畳し、通信ネットワークを介して配信してもよい。例えば、通信ネットワーク上の掲示板（ＢＢＳ：ＢｕｌｌｅｔｉｎＢｏａｒｄＳｙｓｔｅｍ）にアプリケーションプログラムを掲示し、ネットワークを介してアプリケーションプログラムを配信してもよい。そして、このアプリケーションプログラムをコンピュータにインストールして起動し、ＯＳの制御下で、他のアプリケーションプログラムと同様に実行することにより、本発明に係る撮像装置及び画像処理装置を実現してもよい。 Further, the application program may be superimposed on a carrier wave and distributed via a communication network. For example, an application program may be posted on a bulletin board (BBS: Bulletin Board System) on a communication network, and the application program may be distributed via the network. Then, the imaging apparatus and the image processing apparatus according to the present invention may be realized by installing and starting up the application program in a computer and executing the application program in the same manner as other application programs under the control of the OS.

以上、本発明の好ましい実施形態について説明したが、本発明は係る特定の実施形態に限定されるものではなく、本発明には、特許請求の範囲に記載された発明とその均等の範囲が含まれる。以下に、本願出願当初の特許請求の範囲に記載された発明を付記する。 As mentioned above, although preferable embodiment of this invention was described, this invention is not limited to the specific embodiment which concerns, This invention includes the invention described in the claim, and its equivalent range It is. The invention described in the scope of claims at the beginning of the present application will be appended.

（付記１）
動画を構成する複数の画像の中から１つを指定する指定部と、
前記複数の画像のうちの前記指定部によって指定された画像以外の所定数の画像のそれぞれについて、該指定された画像との関係を表す基礎行列を取得する基礎行列取得部と、
前記基礎行列取得部によって前記所定数の画像のそれぞれについて取得された基礎行列に基づいて、前記所定数の画像の中から、前記指定された画像にエピポーラ投影するための画像を選別する選別部と、
前記基礎行列取得部によって取得された基礎行列のうち、前記選別部によって選別された画像について取得された基礎行列に基づいて、該選別された画像内の互いに対応する特徴点又は仮想特徴点を前記指定された画像にエピポーラ投影することにより、前記指定された画像内における仮想特徴点を生成する仮想特徴点生成部と、
前記複数の画像の中から前記指定部が指定する画像を変えて、前記指定部、前記基礎行列取得部、前記選別部、及び前記仮想特徴点生成部の処理を繰り返すことにより、仮想特徴点軌道を構築する仮想特徴点軌道構築部と、
前記仮想特徴点軌道構築部によって構築された仮想特徴点軌道を時間方向に平滑化する平滑化部と、
前記仮想特徴点軌道構築部によって構築された仮想特徴点軌道と、前記平滑化部によって平滑化された仮想特徴点軌道と、の間の関係に基づいて、前記複数の画像のそれぞれを補正する補正部と、
を備えることを特徴とする画像処理装置。 (Appendix 1)
A designating unit for designating one of a plurality of images constituting the video,
A basic matrix acquisition unit that acquires a basic matrix representing a relationship with the designated image for each of a predetermined number of images other than the image designated by the designation unit of the plurality of images;
A selection unit that selects an image for epipolar projection on the designated image from the predetermined number of images based on the basic matrix acquired for each of the predetermined number of images by the basic matrix acquisition unit; ,
Among the basic matrices acquired by the basic matrix acquisition unit, based on the basic matrix acquired for the image selected by the selection unit, the feature points or virtual feature points corresponding to each other in the selected image are A virtual feature point generation unit that generates a virtual feature point in the specified image by performing an epipolar projection on the specified image;
By changing the image designated by the designation unit from the plurality of images and repeating the processing of the designation unit, the basic matrix acquisition unit, the selection unit, and the virtual feature point generation unit, a virtual feature point trajectory A virtual feature point trajectory construction unit for constructing
A smoothing unit that smoothes the virtual feature point trajectory constructed by the virtual feature point trajectory construction unit in the time direction;
Correction that corrects each of the plurality of images based on the relationship between the virtual feature point trajectory constructed by the virtual feature point trajectory construction unit and the virtual feature point trajectory smoothed by the smoothing unit. And
An image processing apparatus comprising:

（付記２）
前記基礎行列取得部によって前記所定数の画像のそれぞれについて取得された基礎行列の類似度を評価する評価部をさらに備え、
前記選別部は、前記評価部による評価結果に基づいて、前記所定数の画像の中から前記指定された画像にエピポーラ投影するための画像を選別する、
ことを特徴とする付記１に記載の画像処理装置。 (Appendix 2)
An evaluation unit that evaluates the similarity of the basic matrix acquired for each of the predetermined number of images by the basic matrix acquisition unit;
The selecting unit selects an image for epipolar projection on the designated image from the predetermined number of images based on the evaluation result by the evaluating unit.
The image processing apparatus according to appendix 1, wherein:

（付記３）
前記選別部は、前記評価部による評価の結果、前記所定数の画像の中に互いに類似している２以上の画像がある場合、該２以上の画像のうちのいずれか１つ以外の画像を、前記指定された画像にエピポーラ投影するための画像から除外する、
ことを特徴とする付記２に記載の画像処理装置。 (Appendix 3)
When there are two or more images that are similar to each other in the predetermined number of images as a result of the evaluation by the evaluation unit, the selecting unit selects an image other than any one of the two or more images. Exclude from the image for epipolar projection to the specified image,
The image processing apparatus according to Supplementary Note 2, wherein

（付記４）
前記評価部は、異なる２つの基礎行列の間において、一方の基礎行列を正規化した行列に含まれる各要素と、他方の基礎行列を正規化した行列における対応する要素と、の差分をとった値がいずれも閾値以下である場合に、該２つの基礎行列が類似していると評価する、
ことを特徴とする付記２又は３に記載の画像処理装置。 (Appendix 4)
The evaluation unit takes a difference between each element included in a matrix obtained by normalizing one basic matrix and a corresponding element in a matrix obtained by normalizing the other basic matrix between two different basic matrices. Evaluates that the two base matrices are similar if both values are below a threshold,
The image processing apparatus according to appendix 2 or 3, characterized by the above.

（付記５）
前記評価部は、異なる２つの基礎行列の間において、一方の基礎行列を正規化した行列に含まれる各要素と、他方の基礎行列を正規化した行列における対応する要素と、の差分の２乗和をとった値が閾値以下である場合に、該２つの基礎行列が類似していると評価する、
ことを特徴とする付記２又は３に記載の画像処理装置。 (Appendix 5)
The evaluation unit squares a difference between each element included in a matrix obtained by normalizing one basic matrix and a corresponding element in a matrix obtained by normalizing the other basic matrix between two different basic matrices. If the sum is less than or equal to the threshold, the two base matrices are evaluated as being similar,
The image processing apparatus according to appendix 2 or 3, characterized by the above.

（付記６）
付記１乃至５の何れか１つに記載の画像処理装置と、
被写体を撮像することにより、前記動画を構成する画像を生成する撮像部と、
を備えることを特徴とする撮像装置。 (Appendix 6)
The image processing apparatus according to any one of appendices 1 to 5,
An imaging unit that generates an image constituting the moving image by imaging a subject;
An imaging apparatus comprising:

（付記７）
動画を構成する複数の画像の中から１つを指定する指定処理と、
前記複数の画像のうちの前記指定処理によって指定された画像以外の所定数の画像のそれぞれについて、該指定された画像との関係を表す基礎行列を取得する基礎行列取得処理と、
前記基礎行列取得処理によって前記所定数の画像のそれぞれについて取得された基礎行列に基づいて、前記所定数の画像の中から、前記指定された画像にエピポーラ投影するための画像を選別する選別処理と、
前記基礎行列取得処理によって取得された基礎行列のうち、前記選別処理によって選別された画像について取得された基礎行列に基づいて、該選別された画像内の互いに対応する特徴点又は仮想特徴点を前記指定された画像にエピポーラ投影することにより、前記指定された画像内における仮想特徴点を生成する仮想特徴点生成処理と、
前記複数の画像の中から前記指定処理が指定する画像を変えて、前記指定処理、前記基礎行列取得処理、前記選別処理、及び前記仮想特徴点生成処理を繰り返すことにより、仮想特徴点軌道を構築する仮想特徴点軌道構築処理と、
前記仮想特徴点軌道構築処理によって構築された仮想特徴点軌道を時間方向に平滑化する平滑化処理と、
前記仮想特徴点軌道構築処理によって構築された仮想特徴点軌道と、前記平滑化部によって平滑化された仮想特徴点軌道と、の間の関係に基づいて、前記複数の画像のそれぞれを補正する補正処理と、
を含むことを特徴とする画像処理方法。 (Appendix 7)
A designation process for designating one of a plurality of images constituting a movie;
A basic matrix acquisition process for acquiring a basic matrix representing a relationship with the specified image for each of a predetermined number of images other than the image specified by the specifying process among the plurality of images;
A selection process for selecting an image for epipolar projection on the designated image from the predetermined number of images based on the basic matrix acquired for each of the predetermined number of images by the basic matrix acquisition process; ,
Among the basic matrices acquired by the basic matrix acquisition process, based on the basic matrix acquired for the image selected by the selection process, the feature points or virtual feature points corresponding to each other in the selected image are A virtual feature point generation process for generating a virtual feature point in the specified image by performing an epipolar projection on the specified image;
A virtual feature point trajectory is constructed by changing the image designated by the designation processing from the plurality of images and repeating the designation processing, the basic matrix acquisition processing, the selection processing, and the virtual feature point generation processing. Virtual feature point trajectory construction processing,
Smoothing processing for smoothing the virtual feature point trajectory constructed by the virtual feature point trajectory construction processing in the time direction;
Correction that corrects each of the plurality of images based on the relationship between the virtual feature point trajectory constructed by the virtual feature point trajectory construction process and the virtual feature point trajectory smoothed by the smoothing unit. Processing,
An image processing method comprising:

（付記８）
コンピュータを、
動画を構成する複数の画像の中から１つを指定する指定部、
前記複数の画像のうちの前記指定部によって指定された画像以外の所定数の画像のそれぞれについて、該指定された画像との関係を表す基礎行列を取得する基礎行列取得部、
前記基礎行列取得部によって前記所定数の画像のそれぞれについて取得された基礎行列に基づいて、前記所定数の画像の中から、前記指定された画像にエピポーラ投影するための画像を選別する選別部、
前記基礎行列取得部によって取得された基礎行列のうち、前記選別部によって選別された画像について取得された基礎行列に基づいて、該選別された画像内の互いに対応する特徴点又は仮想特徴点を前記指定された画像にエピポーラ投影することにより、前記指定された画像内における仮想特徴点を生成する仮想特徴点生成部、
前記複数の画像の中から前記指定部が指定する画像を変えて、前記指定部、前記基礎行列取得部、前記選別部、及び前記仮想特徴点生成部の処理を繰り返すことにより、仮想特徴点軌道を構築する仮想特徴点軌道構築部、
前記仮想特徴点軌道構築部によって構築された仮想特徴点軌道を時間方向に平滑化する平滑化部、
前記仮想特徴点軌道構築部によって構築された仮想特徴点軌道と、前記平滑化部によって平滑化された仮想特徴点軌道と、の間の関係に基づいて、前記複数の画像のそれぞれを補正する補正部、
として機能させることを特徴とするプログラム。 (Appendix 8)
Computer
A designating part for designating one of a plurality of images constituting the video,
A basic matrix acquisition unit that acquires a basic matrix representing a relationship with the designated image for each of a predetermined number of images other than the image designated by the designation unit among the plurality of images;
Based on the basic matrix acquired for each of the predetermined number of images by the basic matrix acquisition unit, a selection unit that selects an image for epipolar projection on the specified image from the predetermined number of images,
Among the basic matrices acquired by the basic matrix acquisition unit, based on the basic matrix acquired for the image selected by the selection unit, the feature points or virtual feature points corresponding to each other in the selected image are A virtual feature point generation unit that generates a virtual feature point in the specified image by performing an epipolar projection on the specified image;
By changing the image designated by the designation unit from the plurality of images and repeating the processing of the designation unit, the basic matrix acquisition unit, the selection unit, and the virtual feature point generation unit, a virtual feature point trajectory A virtual feature point trajectory construction unit,
A smoothing unit that smoothes the virtual feature point trajectory constructed by the virtual feature point trajectory construction unit in the time direction;
Correction that corrects each of the plurality of images based on the relationship between the virtual feature point trajectory constructed by the virtual feature point trajectory construction unit and the virtual feature point trajectory smoothed by the smoothing unit. Part,
A program characterized by functioning as

１…撮像装置、１０…撮像部、１１…光学レンズ、１２…イメージセンサ、２０…データ処理部、２１…主記憶部、２２…出力部、２３…外部記憶部、２４…ＣＰＵ、３０…ユーザインタフェース部、３１…表示部、３２…操作部、３３…外部インタフェース、１００…画像処理装置、１０１…指定部、１０２…基礎行列取得部、１０２ａ…オプティカルフロー取得部、１０３…選別部、１０４…仮想特徴点生成部、１０５…仮想特徴点軌道構築部、１０６…平滑化部、１０７…補正部、１０８…評価部 DESCRIPTION OF SYMBOLS 1 ... Imaging device, 10 ... Imaging part, 11 ... Optical lens, 12 ... Image sensor, 20 ... Data processing part, 21 ... Main memory part, 22 ... Output part, 23 ... External storage part, 24 ... CPU, 30 ... User Interface unit 31 ... Display unit 32 ... Operation unit 33 ... External interface 100 ... Image processing apparatus 101 ... Designation unit 102 ... Basic matrix acquisition unit 102a ... Optical flow acquisition unit 103 ... Selection unit 104 ... Virtual feature point generation unit 105 ... Virtual feature point trajectory construction unit 106 106 Smoothing unit 107 107 Correction unit 108 Evaluation unit

Claims

A designating unit for designating one of a plurality of images constituting the video,
A basic matrix acquisition unit that acquires a basic matrix representing a relationship with the designated image for each of a predetermined number of images other than the image designated by the designation unit of the plurality of images;
A selection unit that selects an image for epipolar projection on the designated image from the predetermined number of images based on the basic matrix acquired for each of the predetermined number of images by the basic matrix acquisition unit; ,
Among the basic matrices acquired by the basic matrix acquisition unit, based on the basic matrix acquired for the image selected by the selection unit, the feature points or virtual feature points corresponding to each other in the selected image are A virtual feature point generation unit that generates a virtual feature point in the specified image by performing an epipolar projection on the specified image;
By changing the image designated by the designation unit from the plurality of images and repeating the processing of the designation unit, the basic matrix acquisition unit, the selection unit, and the virtual feature point generation unit, a virtual feature point trajectory A virtual feature point trajectory construction unit for constructing
A smoothing unit that smoothes the virtual feature point trajectory constructed by the virtual feature point trajectory construction unit in the time direction;
Correction that corrects each of the plurality of images based on the relationship between the virtual feature point trajectory constructed by the virtual feature point trajectory construction unit and the virtual feature point trajectory smoothed by the smoothing unit. And
An image processing apparatus comprising:

An evaluation unit that evaluates the similarity of the basic matrix acquired for each of the predetermined number of images by the basic matrix acquisition unit;
The sorting unit sorts an image for epipolar projection on the designated image from the predetermined number of images based on the evaluation result by the evaluation unit.
The image processing apparatus according to claim 1.

When there are two or more images that are similar to each other in the predetermined number of images as a result of the evaluation by the evaluation unit, the selecting unit selects an image other than any one of the two or more images. Exclude from the image for epipolar projection to the specified image,
The image processing apparatus according to claim 2.

The evaluation unit takes a difference between each element included in a matrix obtained by normalizing one basic matrix and a corresponding element in a matrix obtained by normalizing the other basic matrix between two different basic matrices. Evaluates that the two base matrices are similar if both values are below a threshold,
The image processing apparatus according to claim 2, wherein the image processing apparatus is an image processing apparatus.

The evaluation unit squares a difference between each element included in a matrix obtained by normalizing one basic matrix and a corresponding element in a matrix obtained by normalizing the other basic matrix between two different basic matrices. If the sum is less than or equal to the threshold, the two base matrices are evaluated as being similar,
The image processing apparatus according to claim 2, wherein the image processing apparatus is an image processing apparatus.

An image processing apparatus according to any one of claims 1 to 5,
An imaging unit that generates an image constituting the moving image by imaging a subject;
An imaging apparatus comprising:

A designation process for designating one of a plurality of images constituting a movie;
A basic matrix acquisition process for acquiring a basic matrix representing a relationship with the specified image for each of a predetermined number of images other than the image specified by the specifying process among the plurality of images;
A selection process for selecting an image for epipolar projection on the designated image from the predetermined number of images based on the basic matrix acquired for each of the predetermined number of images by the basic matrix acquisition process; ,
Among the basic matrices acquired by the basic matrix acquisition process, based on the basic matrix acquired for the image selected by the selection process, the feature points or virtual feature points corresponding to each other in the selected image are A virtual feature point generation process for generating a virtual feature point in the specified image by performing an epipolar projection on the specified image;
A virtual feature point trajectory is constructed by changing the image designated by the designation processing from the plurality of images and repeating the designation processing, the basic matrix acquisition processing, the selection processing, and the virtual feature point generation processing. Virtual feature point trajectory construction processing,
Smoothing processing for smoothing the virtual feature point trajectory constructed by the virtual feature point trajectory construction processing in the time direction;
Correction that corrects each of the plurality of images based on the relationship between the virtual feature point trajectory constructed by the virtual feature point trajectory construction process and the virtual feature point trajectory smoothed by the smoothing unit. Processing,
An image processing method comprising:

Computer
A designating part for designating one of a plurality of images constituting the video,
A basic matrix acquisition unit that acquires a basic matrix representing a relationship with the designated image for each of a predetermined number of images other than the image designated by the designation unit among the plurality of images;
Based on the basic matrix acquired for each of the predetermined number of images by the basic matrix acquisition unit, a selection unit that selects an image for epipolar projection on the specified image from the predetermined number of images,
Among the basic matrices acquired by the basic matrix acquisition unit, based on the basic matrix acquired for the image selected by the selection unit, the feature points or virtual feature points corresponding to each other in the selected image are A virtual feature point generation unit that generates a virtual feature point in the specified image by performing an epipolar projection on the specified image;
By changing the image designated by the designation unit from the plurality of images and repeating the processing of the designation unit, the basic matrix acquisition unit, the selection unit, and the virtual feature point generation unit, a virtual feature point trajectory A virtual feature point trajectory construction unit,
A smoothing unit that smoothes the virtual feature point trajectory constructed by the virtual feature point trajectory construction unit in the time direction;
Correction that corrects each of the plurality of images based on the relationship between the virtual feature point trajectory constructed by the virtual feature point trajectory construction unit and the virtual feature point trajectory smoothed by the smoothing unit. Part,
A program characterized by functioning as