JP6476414B2

JP6476414B2 - Still image extraction method and image processing apparatus for realizing the method

Info

Publication number: JP6476414B2
Application number: JP2015082965A
Authority: JP
Inventors: 康伸小倉
Original assignee: Panasonic Intellectual Property Management Co Ltd
Current assignee: Panasonic Intellectual Property Management Co Ltd
Priority date: 2014-09-08
Filing date: 2015-04-15
Publication date: 2019-03-06
Anticipated expiration: 2035-04-15
Also published as: JP2016059029A

Description

本技術は、動画像ストリームから複数のフレームを静止画像として抽出する静止画像抽出方法及びその方法を実現する画像処理装置に関する。 The present technology relates to a still image extraction method for extracting a plurality of frames as a still image from a moving image stream, and an image processing apparatus that implements the method.

物体の全周囲のデータを得るために、その物体の周辺の各位置から、カメラでその物体を被写体として撮影することが行われている。ここで、物体の全周囲の静止画像をインタラクティブに表示・閲覧するために撮影が行われるカメラの位置（以下、撮影位置という）のそれぞれは、距離が等間隔に離れていることが望ましい。 In order to obtain data around the entire object, the object is photographed as a subject from each position around the object. Here, it is desirable that the positions of the cameras (hereinafter referred to as shooting positions) where shooting is performed in order to interactively display and browse still images around the entire object are equally spaced.

従来、上述の各撮影位置における静止画像を得るために、各撮影位置での静止画像の撮影を順に行う（特許文献１参照）。つまり、カメラを撮影位置に固定して、その撮影位置でカメラが撮影する処理動作を、カメラを移動させながらそれぞれの撮影位置で繰り返し行う。 Conventionally, in order to obtain a still image at each shooting position described above, a still image is shot at each shooting position in order (see Patent Document 1). That is, the processing operation in which the camera is fixed at the shooting position and the camera shoots at the shooting position is repeatedly performed at each shooting position while moving the camera.

特開２００７−７２５３７号公報JP 2007-72537 A

本技術は、距離が等間隔に離れた適切な撮影位置における静止画像を短時間で抽出することができる静止画像抽出方法及び画像処理装置を提供する。 The present technology provides a still image extraction method and an image processing apparatus that can extract a still image at an appropriate shooting position at an equal distance from each other in a short time.

本技術の静止画像抽出方法は、カメラを、被写体を中心とした円周または直線上一方向に相対的に移動させ、時間的に連続して撮影した動画像ストリームから、撮影時刻が等間隔になるように複数の第１のフレームを抽出する動作と、複数の第１のフレームそれぞれの撮影時刻における撮影位置を複数の第１のフレームから特定する動作と、複数の第１のフレームそれぞれの撮影時刻および撮影位置に基づき、距離が等間隔の複数の撮影位置で撮影することによって得られる複数のフレームそれぞれの撮影時刻を推定する動作と、推定された複数の撮影時刻それぞれに対応する複数の第２のフレームそれぞれを、静止画像として動画像ストリームから抽出する動作とを行うことを特徴とする静止画像抽出方法。 The still image extraction method of the present technology moves the camera relatively in one direction on the circumference or straight line centered on the subject, and captures the shooting time at equal intervals from a moving image stream that is continuously captured in time. An operation for extracting a plurality of first frames, an operation for specifying a shooting position at each shooting time of each of the plurality of first frames, and a shooting for each of the plurality of first frames. Based on the time and the shooting position, an operation for estimating the shooting time of each of a plurality of frames obtained by shooting at a plurality of shooting positions at equal distances, and a plurality of first times corresponding to each of the estimated plurality of shooting times A still image extraction method comprising: extracting each of the two frames as a still image from a moving image stream.

本技術によれば、距離が等間隔に離れた適切な撮影位置における静止画像を短時間で抽出することができる。 According to the present technology, it is possible to extract a still image at an appropriate shooting position at an equal interval in a short time.

本技術の実施の形態における画像処理システムを示す構成図である。It is a lineblock diagram showing an image processing system in an embodiment of this art. 実施の形態による撮影部において、被写体の撮影方法を説明するための説明図である。FIG. 5 is an explanatory diagram for describing a method for photographing a subject in a photographing unit according to an embodiment. 実施の形態における画像処理システム１の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the image processing system 1 in embodiment. 実施の形態における静止画像抽出方法の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the still image extraction method in embodiment. 実施の形態におけるカメラ２１の撮影位置の特定方法を示すフローチャートである。It is a flowchart which shows the identification method of the imaging position of the camera 21 in embodiment. 実施の形態における初期サンプリング点を示す説明図である。It is explanatory drawing which shows the initial sampling point in embodiment. 実施の形態における再サンプリング点のタイムスタンプを推定する方法の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the method of estimating the time stamp of the resampling point in embodiment. 実施の形態における初期サンプリング点のタイムスタンプと基準点からの角度との関係を示す説明図ある。It is explanatory drawing which shows the relationship between the time stamp of the initial sampling point and the angle from a reference point in embodiment. 実施の形態における再サンプリング点を示す説明図である。It is explanatory drawing which shows the resampling point in embodiment. 再サンプリング点での撮影によって得られる複数のフレームのそれぞれのタイムスタンプを推定する方法を示す説明図である。It is explanatory drawing which shows the method of estimating each time stamp of the some flame | frame obtained by imaging | photography at a resampling point. 本技術による他の実施の形態による静止画像抽出方法の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the still image extraction method by other embodiment by this technique. 本技術の他の実施の形態による被写体の撮影方法を示す説明図である。It is explanatory drawing which shows the imaging | photography method of the to-be-photographed object by other embodiment of this technique.

以下、本技術の実施の形態による画像処理装置について、図面を参照しながら説明する。但し、必要以上に詳細な説明は省略する場合がある。例えば、既によく知られた事項の詳細説明や実質的に同一の構成に対する重複説明を省略する場合がある。これは、以下の説明が不必要に冗長になるのを避け、当業者の理解を容易にするためである。 Hereinafter, an image processing apparatus according to an embodiment of the present technology will be described with reference to the drawings. However, more detailed description than necessary may be omitted. For example, detailed descriptions of already well-known matters and repeated descriptions for substantially the same configuration may be omitted. This is to avoid the following description from becoming unnecessarily redundant and to facilitate understanding by those skilled in the art.

なお、発明者は、当業者が本技術を十分に理解するために添付図面及び以下の説明を提供するのであって、これらによって特許請求の範囲に記載の主題を限定することを意図するものではない。 The inventor provides the accompanying drawings and the following description in order for those skilled in the art to fully understand the present technology, and is not intended to limit the subject matter described in the claims. Absent.

［１−１．構成］
図１は、本技術の実施の形態における画像処理システムを示す構成図である。 [1-1. Constitution]
FIG. 1 is a configuration diagram illustrating an image processing system according to an embodiment of the present technology.

本実施の形態における画像処理システム１は、カメラを被写体に対して相対的に移動させながら、被写体を連続して撮影することにより動画像ストリームを取得し、前記動画像ストリームから複数のフレームをそれぞれ静止画像として抽出するシステムである。画像処理システム１は、画像処理装置１０、撮影部２０、制御部３０、撮影画像記憶部４０及び多視点画像データベース（ＤＢ）５０を備える。 The image processing system 1 according to the present embodiment acquires a moving image stream by continuously shooting a subject while moving the camera relative to the subject, and each of a plurality of frames from the moving image stream. It is a system that extracts as a still image. The image processing system 1 includes an image processing device 10, a photographing unit 20, a control unit 30, a photographed image storage unit 40, and a multi-viewpoint image database (DB) 50.

本実施の形態における画像処理システム１では、動画像ストリームは、例えば６０ｆｐｓ（ｆｒａｍｅｐｅｒｓｅｃｏｎｄ）のフレームレートの撮影によって生成されたもので、時間的に等間隔の撮影、つまり１／６０秒ごとの撮影によって得られた複数のフレームを含んでいる。また、フレームそれぞれには、撮影により生成されるごとに、「１」が加算されるタイムスタンプが付されている。タイムスタンプは、タイムスタンプが付されたフレームの撮影時刻を示すこととなる。例えば、動画像ストリームのフレームレートが６０ｆｐｓの場合、動画撮影の開始から１秒後のタイムスタンプは、動画撮影開始時のタイムスタンプに、「６０」を加算した値を示す。動画撮影開始時のタイムスタンプが「１０」だった場合に、動画撮影の開始から１秒後のタイムスタンプは「７０」となる。 In the image processing system 1 according to the present embodiment, the moving image stream is generated by shooting at a frame rate of, for example, 60 fps (frame per second), and is shot at regular intervals, that is, every 1/60 seconds. It includes a plurality of frames obtained by shooting. Each frame has a time stamp to which “1” is added every time it is generated by photographing. The time stamp indicates the shooting time of the frame to which the time stamp is attached. For example, when the frame rate of the moving image stream is 60 fps, the time stamp one second after the start of moving image shooting indicates a value obtained by adding “60” to the time stamp at the start of moving image shooting. When the time stamp at the start of moving image shooting is “10”, the time stamp one second after the start of moving image shooting is “70”.

なお、本実施の形態では、動画像ストリームは、等時間間隔で撮影されたフレームから生成される動画であるが、これに限らない。例えば、等時間間隔で連写された静止画像の集まりであってもよい。等時間間隔で連写された静止画像には、例えば画像ＩＤが対応付けられ、画像ＩＤは、タイムスタンプと同じように扱うことができる。従って、本実施の形態では、タイムスタンプが静止画像抽出方法に用いられるが、動画像ストリームが等時間間隔で連写された静止画像の集まりの場合は、画像ＩＤが静止画像抽出方法に用いられる。 In the present embodiment, the moving image stream is a moving image generated from frames shot at equal time intervals, but is not limited thereto. For example, it may be a collection of still images taken continuously at equal time intervals. For example, an image ID is associated with still images continuously shot at equal time intervals, and the image ID can be handled in the same manner as a time stamp. Therefore, in this embodiment, the time stamp is used in the still image extraction method, but in the case of a collection of still images in which the moving image stream is continuously shot at equal time intervals, the image ID is used in the still image extraction method. .

画像処理装置１０は、撮影画像記憶部４０に記憶された動画像ストリームから、複数のフレームをそれぞれ静止画像として抽出する動作を行う。複数のフレームの静止画像は、等間隔に離れた距離の複数の撮影位置で撮影された画像となる。動画像ストリームは、カメラを被写体に対して相対的に移動させながら、前記被写体を時間的に連続して撮影することにより得られる。例えば、画像処理装置１０は、被写体を中心とした円周上を移動しているカメラで撮影することによって得られた動画像ストリームから、１０度毎の等間隔な角度を形成する３６個の撮影位置で撮影された静止画像を抽出する。 The image processing apparatus 10 performs an operation of extracting a plurality of frames as still images from the moving image stream stored in the captured image storage unit 40. The still images of a plurality of frames are images shot at a plurality of shooting positions that are separated at equal intervals. The moving image stream is obtained by shooting the subject continuously in time while moving the camera relative to the subject. For example, the image processing apparatus 10 captures 36 images that form angles at equal intervals of 10 degrees from a moving image stream obtained by shooting with a camera moving on a circle around a subject. Extract still images taken at the location.

画像処理装置１０は、画像抽出部１１及び撮影位置算出部１２を備える。 The image processing apparatus 10 includes an image extraction unit 11 and a shooting position calculation unit 12.

画像抽出部１１は、動画像ストリームに含まれる複数の第１のフレームを抽出し、抽出した複数の第１のフレームそれぞれの撮影時刻を取得する。また、画像抽出部１１は、初期サンプリング動作と再サンプリング動作とを行う。 The image extraction unit 11 extracts a plurality of first frames included in the moving image stream, and acquires shooting times of the extracted plurality of first frames. In addition, the image extraction unit 11 performs an initial sampling operation and a resampling operation.

撮影位置算出部１２は、複数の第１のフレームそれぞれで取得された撮影時刻におけるカメラの撮影位置に関する情報を複数の第１のフレームから特定する。撮影位置算出部１２は、第１のフレームそれぞれで取得された撮影時刻と、第１のフレームそれぞれで特定された撮影位置に関する情報とに基づき、距離が等間隔に離れた複数の撮影位置におけるカメラによる撮影によって得られる複数のフレームそれぞれの撮影時刻を推定する動作を行う。 The shooting position calculation unit 12 specifies information regarding the shooting position of the camera at the shooting time acquired in each of the plurality of first frames from the plurality of first frames. The shooting position calculation unit 12 is a camera at a plurality of shooting positions at equal intervals based on the shooting time acquired in each of the first frames and information on the shooting position specified in each of the first frames. The operation of estimating the photographing time of each of a plurality of frames obtained by photographing with the above is performed.

前記画像抽出部１１は、初期サンプリングにおいて、動画像ストリームに含まれる複数の第１のフレームを抽出し、抽出した複数の第１のフレームそれぞれの撮影時刻を示すタイムスタンプを取得する。初期サンプリングによって抽出される第１のフレームを初期サンプリングフレームという。このとき、画像抽出部１１は、取得するタイムスタンプが等間隔になるように複数の初期サンプリングフレームを抽出する。 In the initial sampling, the image extraction unit 11 extracts a plurality of first frames included in the moving image stream, and obtains time stamps indicating the photographing times of the extracted plurality of first frames. The first frame extracted by the initial sampling is called an initial sampling frame. At this time, the image extraction unit 11 extracts a plurality of initial sampling frames so that the acquired time stamps are equally spaced.

前記画像抽出部１１は、再サンプリングにおいて、撮影位置算出部１２により推定されたそれぞれの撮影時刻（タイムスタンプ）に対応する第２のフレームを動画像ストリームから抽出する。動画像ストリームから抽出する第２のフレームは、距離が等間隔に離れた複数の撮影位置で撮影することによって得られる複数のフレームとして抽出される。以下、画像抽出部１１で再サンプリングによって抽出される第２のフレームを再サンプリングフレームという。例えば、画像抽出部１１は、１０度毎の等間隔な角度にある撮影位置でのカメラによる撮影によって得られるフレームとして、３６個の再サンプリングフレームを抽出する。 In the re-sampling, the image extraction unit 11 extracts a second frame corresponding to each shooting time (time stamp) estimated by the shooting position calculation unit 12 from the moving image stream. The second frame extracted from the moving image stream is extracted as a plurality of frames obtained by shooting at a plurality of shooting positions that are spaced at equal intervals. Hereinafter, the second frame extracted by resampling by the image extraction unit 11 is referred to as a resampling frame. For example, the image extraction unit 11 extracts 36 resampling frames as frames obtained by shooting with a camera at shooting positions at equal intervals of 10 degrees.

撮影位置算出部１２は、カメラの撮影位置を、初期サンプリングフレームそれぞれのタイムスタンプから特定する。つまり、撮影位置算出部１２は、被写体が撮影された時の被写体に対する相対的なカメラの位置を特定することができる。以下、初期サンプリングフレームそれぞれが特定した撮影位置を初期サンプリング点という。 The shooting position calculation unit 12 specifies the shooting position of the camera from the time stamp of each initial sampling frame. That is, the shooting position calculation unit 12 can specify the position of the camera relative to the subject when the subject is shot. Hereinafter, the shooting position specified by each initial sampling frame is referred to as an initial sampling point.

撮影位置算出部１２は、さらに、初期サンプリング点と、初期サンプリングによって取得されたタイムスタンプとに基づいて、複数の撮影位置で撮影することによって得られる複数のフレームそれぞれのタイムスタンプを推定する。以下、再サンプリングフレームそれぞれが特定した撮影位置を再サンプリング点という。 The shooting position calculation unit 12 further estimates time stamps of a plurality of frames obtained by shooting at a plurality of shooting positions based on the initial sampling points and the time stamps acquired by the initial sampling. Hereinafter, the imaging position specified by each resampling frame is referred to as a resampling point.

図２は、実施の形態による撮影部において、被写体の撮影方法を説明するための説明図である。 FIG. 2 is an explanatory diagram for explaining a method of photographing a subject in the photographing unit according to the embodiment.

図２に示すように、撮影部２０は、カメラ２１、ロボットアーム２２及びターンテーブル２３を備える。ターンテーブル２３は、被写体２４が載せられて回転するテーブルであり、ターンテーブル２３の回転軸（図２のＺ軸）を回転中心として回転する。なお、ターンテーブル２３の回転速度は、ターンテーブル２３に載せられる被写体２４の重心位置、又はターンテーブル２３の機械的な構造の精度等の影響により、一定の速度になるとは限らない。カメラ２１は、ターンテーブル２３上に載っている被写体２４を撮影することにより、回転している被写体２４の動画像ストリームを生成する。つまり、カメラ２１は、被写体２４を中心とした円周上を被写体２４に対して相対的に移動しながら、被写体２４を撮影することにより動画像ストリームを生成する。ロボットアーム２２は、カメラ２１が取り付けられ、被写体２４を中心とした図２のＸＺ平面上の円周方向にカメラ２１を移動可能にする。 As shown in FIG. 2, the photographing unit 20 includes a camera 21, a robot arm 22, and a turntable 23. The turntable 23 is a table on which the subject 24 is placed and rotates, and rotates about the rotation axis of the turntable 23 (Z axis in FIG. 2) as the rotation center. Note that the rotation speed of the turntable 23 is not necessarily a constant speed due to the influence of the position of the center of gravity of the subject 24 placed on the turntable 23 or the accuracy of the mechanical structure of the turntable 23. The camera 21 generates a moving image stream of the rotating subject 24 by photographing the subject 24 placed on the turntable 23. That is, the camera 21 generates a moving image stream by shooting the subject 24 while moving relative to the subject 24 on the circumference around the subject 24. The robot arm 22 is attached with a camera 21 and enables the camera 21 to move in the circumferential direction on the XZ plane of FIG.

被写体２４は、ターンテーブル２３の回転中心上に載せられて、カメラ２１で撮影される物体である。被写体２４は、画像処理装置１０で抽出されるとともに、距離が等間隔に離れた（つまり、等間隔の角度にある）複数の撮影位置でのカメラ２１による撮影によって得られる被写体２４の静止画像により、例えば、３次元のコンピュータグラフィック（ＣＧ）で表現される。 The subject 24 is an object that is placed on the rotation center of the turntable 23 and photographed by the camera 21. The subject 24 is extracted by the image processing apparatus 10 and is a still image of the subject 24 obtained by photographing with the camera 21 at a plurality of photographing positions whose distances are equally spaced (that is, at equally spaced angles). For example, it is expressed by three-dimensional computer graphics (CG).

なお、本実施の形態では、ターンテーブル２３が回転することにより、カメラ２１は被写体２４に対して相対的に移動しながら被写体２４を撮影しているが、これに限定されない。例えば、カメラ２１が被写体２４を中心とした円周上を実際に移動しながら、被写体２４を撮影してもよい。 In this embodiment, the turntable 23 rotates to cause the camera 21 to photograph the subject 24 while moving relative to the subject 24. However, the present invention is not limited to this. For example, the subject 24 may be photographed while the camera 21 actually moves on the circumference around the subject 24.

また、本実施の形態では、カメラ２１は被写体２４を中心とした円周上を被写体２４に対して相対的に移動しながら被写体２４を撮影したが、これに限定されない。例えば、カメラ２１が直線上に沿って移動しながら被写体２４を撮影してもよい。これにより、等間隔の距離にある複数の撮影位置でのカメラ２１による撮影によって得られる複数のフレームとして、再サンプリングフレームのそれぞれが抽出されてもよい。 In the present embodiment, the camera 21 captures the subject 24 while moving relative to the subject 24 on the circumference around the subject 24. However, the present invention is not limited to this. For example, the subject 24 may be photographed while the camera 21 moves along a straight line. Thereby, each of the re-sampling frames may be extracted as a plurality of frames obtained by photographing with the camera 21 at a plurality of photographing positions at equal intervals.

制御部３０は、撮影部２０による撮影の制御を行う。具体的には、制御部３０は、カメラ２１による動画撮影の開始及び終了、ターンテーブル２３の回転、及び、ロボットアーム２２によるカメラ２１の移動の制御を行う。これにより、制御部３０は、ターンテーブル２３を回転させることで、ターンテーブル２３のテーブル面と平行な面にあり被写体２４を中心とした円周上の位置からカメラ２１に被写体２４を撮影させる。さらに、制御部３０はロボットアーム２２の制御を行い、被写体２４を中心とした図２のＸＺ平面上の円周方向にカメラ２１を移動させる。つまり、制御部３０は、ターンテーブル２３の回転、及び、ロボットアーム２２によるカメラ２１の移動の制御を繰り返し行うことで、被写体２４を中心とした球面上の位置からカメラ２１に被写体２４を撮影させる。 The control unit 30 controls photographing by the photographing unit 20. Specifically, the control unit 30 controls the start and end of moving image shooting by the camera 21, the rotation of the turntable 23, and the movement of the camera 21 by the robot arm 22. As a result, the control unit 30 rotates the turntable 23 to cause the camera 21 to photograph the subject 24 from a position on the circumference centered on the subject 24 that is parallel to the table surface of the turntable 23. Further, the control unit 30 controls the robot arm 22 to move the camera 21 in the circumferential direction on the XZ plane of FIG. That is, the control unit 30 repeatedly controls the rotation of the turntable 23 and the movement of the camera 21 by the robot arm 22 to cause the camera 21 to photograph the subject 24 from a position on the spherical surface centering on the subject 24. .

撮影画像記憶部４０は、撮影部２０により生成された動画像ストリームを記憶する、例えば不揮発性メモリである。また、撮影画像記憶部４０は、動画像ストリームに含まれるフレームのそれぞれの撮影時刻を示すタイムスタンプを記憶する。 The captured image storage unit 40 is, for example, a nonvolatile memory that stores the moving image stream generated by the imaging unit 20. In addition, the captured image storage unit 40 stores a time stamp indicating the shooting time of each frame included in the moving image stream.

多視点画像ＤＢ５０は、例えば不揮発性メモリなどにより構成され、画像処理装置１０で抽出されるとともに、距離が等間隔と推定された複数の撮影位置において、カメラ２１で撮影することによって得られる、複数の再サンプリングフレームを静止画像として保存する。 The multi-viewpoint image DB 50 is configured by, for example, a non-volatile memory, and is extracted by the image processing apparatus 10 and obtained by shooting with the camera 21 at a plurality of shooting positions whose distances are estimated at equal intervals. Are stored as still images.

［１−２．動作］
以上のように構成された本実施の形態における画像処理システム１について、動作を説明する。 [1-2. Operation]
An operation of the image processing system 1 in the present embodiment configured as described above will be described.

図３は、実施の形態における画像処理システム１の動作を示すフローチャートである。 FIG. 3 is a flowchart showing the operation of the image processing system 1 in the embodiment.

まず、図３に示すように、撮影部２０は、制御部３０の制御に応じて被写体２４を撮影する（ステップＳ２０１）。具体的には、制御部３０がターンテーブル２３の回転の制御を行うことで、カメラ２１はターンテーブル２３に載せられた被写体２４を、被写体２４の３６０度周囲から撮影する。さらに制御部３０がロボットアーム２２によるカメラ２１の移動の制御を行うことで、カメラ２１は、ターンテーブル２３のテーブル面と垂直な面において、被写体２４を中心とした円周方向に移動する。カメラ２１は、移動した位置から再度、被写体２４を被写体２４の３６０度周囲から撮影する。制御部３０がターンテーブル２３の回転及びロボットアーム２２によるカメラ２１の移動の制御を繰り返し行うことで、カメラ２１は被写体２４を中心とした球面上の位置から被写体２４を撮影した動画像ストリームを生成する。生成された動画像ストリームは撮影画像記憶部４０に記憶される。この時、動画像ストリームに含まれるフレームのそれぞれの撮影時刻を示すタイムスタンプも記憶される。つまり、動画像ストリームに含まれる複数のフレームと当該複数のフレームのそれぞれに対応するタイムスタンプとが撮影画像記憶部４０に記憶される。 First, as illustrated in FIG. 3, the photographing unit 20 photographs the subject 24 under the control of the control unit 30 (step S201). Specifically, the control unit 30 controls the rotation of the turntable 23, so that the camera 21 captures the subject 24 placed on the turntable 23 from around 360 degrees of the subject 24. Further, when the control unit 30 controls the movement of the camera 21 by the robot arm 22, the camera 21 moves in a circumferential direction around the subject 24 on a plane perpendicular to the table surface of the turntable 23. The camera 21 again captures the subject 24 from around 360 degrees around the subject 24 from the moved position. The control unit 30 repeatedly controls the rotation of the turntable 23 and the movement of the camera 21 by the robot arm 22, so that the camera 21 generates a moving image stream in which the subject 24 is captured from a position on the spherical surface centered on the subject 24. To do. The generated moving image stream is stored in the captured image storage unit 40. At this time, a time stamp indicating the shooting time of each frame included in the moving image stream is also stored. That is, a plurality of frames included in the moving image stream and time stamps corresponding to the plurality of frames are stored in the captured image storage unit 40.

次に、画像処理装置１０は、距離が等間隔と推定された複数の撮影位置（再サンプリング点）において、カメラ２１で撮影することによって得られる複数のフレーム（再サンプリングフレーム）を、撮影画像記憶部４０に記憶された動画像ストリームから抽出する（ステップＳ２０２）。 Next, the image processing apparatus 10 stores a plurality of frames (resampling frames) obtained by photographing with the camera 21 at a plurality of photographing positions (resampling points) whose distances are estimated to be equal intervals. Extracted from the moving image stream stored in the unit 40 (step S202).

最後に、画像処理装置１０は、抽出された複数の再サンプリングフレームを静止画像として多視点画像ＤＢ５０に保存する（ステップＳ２０３）。 Finally, the image processing apparatus 10 stores the extracted plurality of resampled frames as still images in the multi-viewpoint image DB 50 (step S203).

図４は、実施の形態における静止画像抽出方法の動作を示すフローチャートである。図４は、図３のステップステップＳ２０２の処理を詳細に示すフローチャートである。 FIG. 4 is a flowchart illustrating the operation of the still image extraction method according to the embodiment. FIG. 4 is a flowchart showing in detail the process of step S202 of FIG.

まず、画像抽出部１１は、撮影画像記憶部４０に記憶された動画像ストリームから、複数の初期サンプリングフレームを抽出し、抽出した複数の初期サンプリングフレームのそれぞれに対応するタイムスタンプを取得する初期サンプリングを行う（ステップＳ３０１）。例えば、画像抽出部１１は、８個の初期サンプリングフレームを抽出し、抽出した８個の初期サンプリングフレームのそれぞれに対応する８個のタイムスタンプを取得する。ここで、抽出された８個の初期サンプリングフレームは、初期サンプリングフレームそれぞれに対応する８個のタイムスタンプが等間隔なフレームとする。つまり、画像抽出部１１は、一定の時間間隔で初期サンプリングフレームを抽出する。なお、初期サンプリングフレームは８個に限らず、２〜７個でもよく、また、９個以上でもよい。初期サンプリングフレームの数が増えることにより、後述する再サンプリング点でのカメラ２１による撮影によって得られる複数のフレームのそれぞれのタイムスタンプの推定の精度が高くなる。 First, the image extraction unit 11 extracts a plurality of initial sampling frames from the moving image stream stored in the captured image storage unit 40, and obtains a time stamp corresponding to each of the extracted plurality of initial sampling frames. Is performed (step S301). For example, the image extraction unit 11 extracts eight initial sampling frames, and acquires eight time stamps corresponding to the extracted eight initial sampling frames. Here, the extracted eight initial sampling frames are frames in which eight time stamps corresponding to the initial sampling frames are equally spaced. That is, the image extraction unit 11 extracts initial sampling frames at regular time intervals. Note that the number of initial sampling frames is not limited to eight, but may be two to seven, or nine or more. By increasing the number of initial sampling frames, the accuracy of estimation of time stamps of a plurality of frames obtained by photographing by the camera 21 at a re-sampling point described later is increased.

次に、撮影位置算出部１２は、画像抽出部１１で８個の初期サンプリングフレームそれぞれのタイムスタンプにおけるカメラ２１の撮影位置（初期サンプリング点）を特定する（ステップＳ３０２）。 Next, the shooting position calculation unit 12 specifies the shooting position (initial sampling point) of the camera 21 at the time stamp of each of the eight initial sampling frames by the image extraction unit 11 (step S302).

さらに、撮影位置算出部１２は、特定した８個の初期サンプリング点と、画像抽出部１１で取得した８個の初期サンプリングフレームのそれぞれのタイムスタンプとに基づいて、再サンプリング点でのカメラ２１による撮影によって得られる複数のフレームのそれぞれのタイムスタンプを推定する（ステップＳ３０３）。 Further, the photographing position calculation unit 12 uses the camera 21 at the re-sampling point based on the identified eight initial sampling points and the time stamps of the eight initial sampling frames acquired by the image extraction unit 11. Time stamps of a plurality of frames obtained by photographing are estimated (step S303).

最後に、画像抽出部１１は、撮影画像記憶部４０に記憶された動画像ストリームから、再サンプリングフレームを、再サンプリング点でのカメラ２１による撮影によって得られる複数のフレームのそれぞれとして抽出する再サンプリングを行う（ステップＳ３０４）。 Finally, the image extracting unit 11 extracts a resampled frame from the moving image stream stored in the photographed image storage unit 40 as each of a plurality of frames obtained by photographing by the camera 21 at the resampling point. Is performed (step S304).

図５は、実施の形態におけるカメラ２１の撮影位置の特定方法を示すフローチャートである。図５は、図４のステップＳ３０２の処理を詳細に示すフローチャートである。 FIG. 5 is a flowchart showing a method for specifying the shooting position of the camera 21 in the embodiment. FIG. 5 is a flowchart showing in detail the process in step S302 of FIG.

まず、撮影位置算出部１２は、８個の初期サンプリングフレームそれぞれの特徴点（特徴量）を抽出する（ステップＳ４０１）。特徴点は、ＳＩＦＴ又はＨａｒｒｉｓのコーナー検出等で抽出される。なお、特徴点の抽出方法はこれらの方法に限定されない。 First, the shooting position calculation unit 12 extracts feature points (feature amounts) of the eight initial sampling frames (step S401). The feature points are extracted by SIFT or Harris corner detection or the like. Note that the feature point extraction method is not limited to these methods.

次に、撮影位置算出部１２は、抽出した８個の初期サンプリングフレームそれぞれの特徴点のマッチングにより、フレーム間の対応する点を算出する（ステップＳ４０２）。 Next, the shooting position calculation unit 12 calculates corresponding points between the frames by matching the feature points of the extracted eight initial sampling frames (step S402).

最後に、撮影位置算出部１２は、算出したフレーム間の対応する点のそれぞれからＳＦＭ（ＳｔｒｕｃｔｕｒｅＦｒｏｍＭｏｔｉｏｎ）アルゴリズムを用いて撮影位置を特定する（ステップＳ４０３）。 Finally, the shooting position calculation unit 12 specifies a shooting position from each of the corresponding points between the calculated frames by using an SFM (Structure From Motion) algorithm (step S403).

このように、８個の初期サンプリングフレームそれぞれのタイムスタンプにおけるカメラ２１の撮影位置である初期サンプリング点が特定される。初期サンプリング点は、例えば原点を任意とした座標（Ｘ_ｉ，Ｙ_ｉ，Ｚ_ｉ）で表される。本実施の形態では、８個の初期サンプリングフレームが初期サンプリングによって抽出されたので、ｉ＝０〜７となる。 In this way, the initial sampling point that is the shooting position of the camera 21 at the time stamp of each of the eight initial sampling frames is specified. The initial sampling point is represented by coordinates (X _i , Y _i , Z _i ) where the origin is arbitrary, for example. In the present embodiment, since eight initial sampling frames are extracted by the initial sampling, i = 0 to 7.

図６は、実施の形態における初期サンプリング点を示す説明図である。図６に示されるＳ_０（Ｘ_０，Ｙ_０，Ｚ_０）〜Ｓ_７（Ｘ_７，Ｙ_７，Ｚ_７）は、各初期サンプリング点及びその座標である。本実施の形態では、画像抽出部１１は、一定の時間間隔で８個の初期サンプリングフレームを抽出する。図６に示すように、各初期サンプリング点は距離が等間隔な位置に存在しない。つまり、画像抽出部１１は、初期サンプリングによって取得するタイムスタンプが等間隔になるように８個の初期サンプリングフレームを抽出するが、抽出した８個の初期サンプリングフレームの初期サンプリング点は、距離が等間隔な位置とはならない。これは、ターンテーブル２３が一定の速度で回転していないためである。従って、画像抽出部１１が、初期サンプリングによって取得するタイムスタンプが等間隔になるように複数の初期サンプリングフレームを抽出する場合でも、初期サンプリング点は、位置決めが不完全な撮影位置となる。 FIG. 6 is an explanatory diagram showing initial sampling points in the embodiment. S ₀ (X ₀ , Y ₀ , Z ₀ ) to S ₇ (X ₇ , Y ₇ , Z ₇ ) shown in FIG. 6 are each initial sampling point and its coordinates. In the present embodiment, the image extraction unit 11 extracts eight initial sampling frames at regular time intervals. As shown in FIG. 6, the initial sampling points do not exist at positions at equal distances. That is, the image extraction unit 11 extracts eight initial sampling frames so that the time stamps acquired by the initial sampling are equally spaced, but the initial sampling points of the extracted eight initial sampling frames have the same distance. There will be no gaps. This is because the turntable 23 does not rotate at a constant speed. Therefore, even when the image extraction unit 11 extracts a plurality of initial sampling frames so that the time stamps acquired by the initial sampling are equally spaced, the initial sampling point is a photographing position with incomplete positioning.

図７は、実施の形態における再サンプリング点のタイムスタンプを推定する方法の動作を示すフローチャートである。図７は、図４のステップＳ３０３の処理を詳細に示すフローチャートである。 FIG. 7 is a flowchart showing the operation of the method for estimating the time stamp of the re-sampling point in the embodiment. FIG. 7 is a flowchart showing in detail the processing in step S303 in FIG.

まず、撮影位置算出部１２は、被写体２４が載っているターンテーブル２３の回転中心の位置を算出する（ステップＳ５０１）。具体的には、ターンテーブル２３の回転中心の位置は、次式１〜７を用いた最小二乗法により算出される。 First, the photographing position calculation unit 12 calculates the position of the rotation center of the turntable 23 on which the subject 24 is placed (step S501). Specifically, the position of the rotation center of the turntable 23 is calculated by the least square method using the following equations 1-7.

次式１は球の中心点と半径との関係を示す式である。 The following expression 1 is an expression showing the relationship between the center point of the sphere and the radius.

式１に示されるＸ_ｉ，Ｙ_ｉ，Ｚ_ｉは初期サンプリング点の座標のそれぞれｘ成分、ｙ成分、ｚ成分である。Ｘ，Ｙ，Ｚはターンテーブル２３の回転中心の座標のそれぞれｘ成分、ｙ成分、ｚ成分である。また、ｒはターンテーブル２３の回転中心と初期サンプリング点との距離である。つまり、ｒはターンテーブル２３の回転中心を中心とした球の半径である。 X _i , Y _i , and Z _i shown in Equation 1 are the x component, y component, and z component of the coordinates of the initial sampling point, respectively. X, Y, and Z are the x component, y component, and z component of the coordinates of the rotation center of the turntable 23, respectively. R is the distance between the rotation center of the turntable 23 and the initial sampling point. That is, r is a radius of a sphere with the rotation center of the turntable 23 as the center.

次式２及び３は、式１の左辺と右辺との差の累積二乗誤差を示す式である。 The following formulas 2 and 3 are formulas showing the accumulated square error of the difference between the left side and the right side of formula 1.

ここでＡ＝−２Ｘ、Ｂ＝−２Ｙ、Ｃ＝−２Ｚ、Ｄ＝Ｘ^２＋Ｙ^２＋Ｚ^２−ｒ^２とすると、式２は次式３のようにまとめられる。 Here, assuming that A = −2X, B = −2Y, C = −2Z, and D = X ² + Y ² + Z ² −r ² , Expression 2 can be summarized as the following Expression 3.

次式４〜７は、式３を変数Ａ〜Ｄで偏微分した値が０になるときの方程式である。式３を変数Ａ〜Ｄで偏微分した値が０になるとき、式３で示される累積二乗誤差が最小となる球の中心点及び半径が算出される。具体的には、初期サンプリング点の座標（Ｘ_０，Ｙ_０，Ｚ_０）〜（Ｘ_７，Ｙ_７，Ｚ_７）を次式４〜７の（Ｘ_ｉ，Ｙ_ｉ，Ｚ_ｉ）に代入して解くことで、ターンテーブル２３の回転中心の座標は算出される。 The following equations 4 to 7 are equations when the value obtained by partial differentiation of equation 3 with variables A to D becomes zero. When the value obtained by partial differentiation of Equation 3 with variables A to D is 0, the center point and radius of the sphere that minimizes the cumulative square error shown in Equation 3 are calculated. Specifically, the coordinates (X ₀ , Y ₀ , Z ₀ ) to (X ₇ , Y ₇ , Z ₇ ) of the initial sampling point are substituted into (X _i , Y _i , Z _i ) in the following expressions 4 to 7: Thus, the coordinates of the rotation center of the turntable 23 are calculated.

このようにして、ターンテーブル２３の回転中心の位置は算出される。 In this way, the position of the rotation center of the turntable 23 is calculated.

なお、ターンテーブル２３の回転中心の座標は式１〜７に示されるように最小二乗法を用いて算出されたが、これに限らない。例えば、ＲＡＮＳＡＣ（ＲＡＮｄｏｍＳＡｍｐｌｅＣｏｎｓｅｎｓｕｓ）アルゴリズムを適応したロバスト推定法が用いられてもよい。 In addition, although the coordinate of the rotation center of the turntable 23 was calculated using the least square method as shown in Formulas 1-7, it is not restricted to this. For example, a robust estimation method adapted to a RANSAC (RANdom Sample Consensus) algorithm may be used.

次に、撮影位置算出部１２は、初期サンプリング点のうちの１つを基準点として、ターンテーブル２３の回転中心から基準点へのベクトルと、ターンテーブル２３の回転中心から基準点を除く初期サンプリング点へのベクトルとの間のなす角を算出する（ステップＳ５０２）。例えば、図６に示される点Ｓ_０を基準点として点Ｓ_ｂとする。また、ターンテーブル２３の回転中心を原点Ｏとした場合に、点Ｓ_ｂの座標を（ｘ_ｂ，ｙ_ｂ，ｚ_ｂ）とし、他の７個の初期サンプリング点の座標を（ｘ_ｉ，ｙ_ｉ，ｚ_ｉ）とする（ｉ＝１〜７）。ターンテーブル２３の回転中心Ｏから基準点Ｓ_ｂへのベクトルと、ターンテーブル２３の回転中心Ｏから他の７個の初期サンプリング点Ｓ_ｉそれぞれへのベクトルとの間のなす角θ_ｉを算出する式を次式８及び９に示す。 Next, the imaging position calculation unit 12 uses one of the initial sampling points as a reference point, a vector from the rotation center of the turntable 23 to the reference point, and initial sampling that excludes the reference point from the rotation center of the turntable 23. The angle between the point and the vector is calculated (step S502). For example, the point S _b a S ₀ points indicated in Figure 6 as a reference point. Further, when the rotation center of the turntable 23 is the origin O, the coordinates of the point S _b are (x _b , y _b , z _b ), and the coordinates of the other seven initial sampling points are (x _i , y _i , z _i ) (i = 1 to 7). It calculates a vector from the rotation center O of the turn table 23 to the reference point S _b, the angle theta _i between the vector from the rotation center O of the turntable 23 to the other seven initial sampling points S _i, respectively The equations are shown in the following equations 8 and 9.

このようにして、初期サンプリング点の位置は、基準点からの角度として算出される。また、画像抽出部１１は８個の初期サンプリングフレームそれぞれのタイムスタンプを取得している（ステップＳ３０１）。従って、８個の初期サンプリング点の基準点からの角度は、８個の初期サンプリングフレームそれぞれのタイムスタンプと対応している。 In this way, the position of the initial sampling point is calculated as an angle from the reference point. In addition, the image extraction unit 11 acquires time stamps for each of the eight initial sampling frames (step S301). Therefore, the angle of the eight initial sampling points from the reference point corresponds to the time stamp of each of the eight initial sampling frames.

次に、撮影位置算出部１２は、再サンプリング点でのカメラ２１による撮影によって得られる複数のフレームのそれぞれのタイムスタンプを推定する。（ステップＳ５０３）。 Next, the shooting position calculation unit 12 estimates time stamps of a plurality of frames obtained by shooting with the camera 21 at the re-sampling point. (Step S503).

図８は、実施の形態における初期サンプリング点のタイムスタンプと基準点からの角度との関係を示す説明図ある。各初期サンプリング点のタイムスタンプｓ_ｉと基準点からの角度θ_ｉとが対応している。 FIG. 8 is an explanatory diagram showing the relationship between the time stamp of the initial sampling point and the angle from the reference point in the embodiment. The time stamp s _i of each initial sampling point corresponds to the angle θ _i from the reference point.

図８において、基準点からの角度θ_ｉは、ステップＳ５０２で撮影位置算出部１２により算出された角度の例である。本実施の形態では、画像抽出部１１は、１秒間隔で初期サンプリングフレームを抽出しており、動画像ストリームのフレームレートを６０ｆｐｓとしているので、タイムスタンプｓ_ｉは、６０（１秒）毎の等間隔なタイムスタンプである。一方、基準点からの角度θ_ｉは等間隔な角度となっていない。これは、前述したようにターンテーブル２３が一定の速度で回転していないためである。従って、画像抽出部１１が、タイムスタンプが等間隔になるように複数の初期サンプリングフレームを抽出する場合でも、初期サンプリング点は、位置決めが不完全な撮影位置となる。 In FIG. 8, the angle θ _i from the reference point is an example of the angle calculated by the imaging position calculation unit 12 in step S502. In the present embodiment, the image extraction unit 11 extracts initial sampling frames at intervals of 1 second, and the frame rate of the moving image stream is set to 60 fps. Therefore, the time stamp s _i is set every 60 (1 second). The time stamps are equally spaced. On the other hand, the angles θ _i from the reference point are not equally spaced. This is because the turntable 23 does not rotate at a constant speed as described above. Therefore, even when the image extraction unit 11 extracts a plurality of initial sampling frames so that the time stamps are equally spaced, the initial sampling point is a photographing position with incomplete positioning.

なお、動画像ストリームのフレームレートは６０ｆｐｓとしたが、これに限らない。動画像ストリームのフレームレートは、６０ｆｐｓよりも小さい、例えば３０ｆｐｓでもよく、６０ｆｐｓよりも大きい、例えば１２０ｆｐｓでもよい。 Although the frame rate of the moving image stream is 60 fps, the present invention is not limited to this. The frame rate of the moving image stream may be smaller than 60 fps, for example, 30 fps, or larger than 60 fps, for example, 120 fps.

図９は、実施の形態における再サンプリング点を示す説明図である。図９に示すように、撮影位置算出部１２は、例えば、２２．５度毎の等間隔な角度にある再サンプリング点でのカメラ２１による撮影によって得られる１６個のフレームのそれぞれのタイムスタンプを推定する。図９において、点Ｔ_ｘは（ｘ＝０〜１５）、再サンプリング点である。 FIG. 9 is an explanatory diagram showing re-sampling points in the embodiment. As shown in FIG. 9, the shooting position calculation unit 12 obtains time stamps of 16 frames obtained by shooting with the camera 21 at re-sampling points at equal intervals of 22.5 degrees, for example. presume. 9, the point _{T x} is (x = 0 to 15), a re-sampling point.

なお、本実施の形態では、再サンプリング点は１６個であるが、これに限らない。再サンプリング点は１６個よりも少なくてもよく、例えば４５度毎の等間隔な角度にある再サンプリング点は８個になる。また、再サンプリング点は１６個よりも多くてもよく、例えば１０度毎の等間隔な角度にある再サンプリング点は３６個になる。 In the present embodiment, there are 16 re-sampling points, but the present invention is not limited to this. There may be fewer than 16 re-sampling points, for example, there will be 8 re-sampling points at equally spaced angles every 45 degrees. Further, the number of re-sampling points may be larger than 16, and for example, there are 36 re-sampling points at equal intervals of every 10 degrees.

撮影位置算出部１２は、再サンプリング点Ｔ_ｘでのカメラ２１による撮影によって得られるフレームのタイムスタンプｔ_ｘを推定する。具体的には、再サンプリング点Ｔ_ｘの近傍の初期サンプリング点Ｓ_ｍ及びＳ_ｍ−１の基準点からの角度θ_ｍ及びθ_ｍ−１と、対応するタイムスタンプｓ_ｍ及びｓ_ｍ−１とから、再サンプリング点Ｔ_ｘでのカメラ２１による撮影によって得られるフレームのタイムスタンプｔ_ｘは推定される。つまり、初期サンプリング点Ｓ_ｍ及びＳ_ｍ−１の基準点からの角度θ_ｍ及びθ_ｍ−１と、タイムスタンプｓ_ｍ及びｓ_ｍ−１とに基づく線形補完により、再サンプリング点Ｔ_ｘでのカメラ２１による撮影によって得られるフレームのタイムスタンプｔ_ｘは推定される。本実施の形態では、再サンプリング点は１６個としているので、ｘ＝０〜１５となる。また、初期サンプリング点は８個としているので、ｍ＝１〜７となる。再サンプリング点Ｔ_ｘでのカメラ２１による撮影によって得られるフレームのタイムスタンプｔ_ｘの算出式は次式１０となる。 The shooting position calculation unit 12 estimates a time stamp t _{x of a} frame obtained by shooting with the camera 21 at the re-sampling point T _x . Specifically, the angle theta _m and theta _m-1 from the initial sampling point reference point _{S m} and _{S m-1} in the vicinity of the re-sampling point _{T x,} associated with the time stamp _{s m} and _{s m-1} from the time stamp t _x of frames obtained by photographing by the camera 21 at the resampling point T _x it is estimated. That is, the linear sampling based on the angles θ _m and θ _m−1 from the reference points of the initial sampling points S _m and S _m−1 and the time stamps s _m and s _m−1 can be performed at the re-sampling point T _x . The time stamp t _{x of the} frame obtained by photographing with the camera 21 is estimated. In the present embodiment, since there are 16 re-sampling points, x = 0 to 15. Since the initial sampling points are 8, m = 1 to 7. The following expression 10 is used to calculate the frame time stamp t _x obtained by photographing with the camera 21 at the re-sampling point T _x .

式１０において、θは距離が等間隔に離れた再サンプリング点の間の角度を示す。つまり、図９に示すように、本実施の形態では、２２．５度毎の等間隔な角度にある再サンプリング点でのカメラ２１による撮影によって得られる複数のフレームのそれぞれのタイムスタンプを推定するので、θ＝２２．５となる。 In Equation 10, θ represents the angle between re-sampling points that are equally spaced apart. That is, as shown in FIG. 9, in this embodiment, the time stamps of a plurality of frames obtained by photographing with the camera 21 at re-sampling points at equal intervals of 22.5 degrees are estimated. Therefore, θ = 22.5.

図１０は、再サンプリング点での撮影によって得られる複数のフレームのそれぞれのタイムスタンプを推定する方法を示す説明図である。図１０に示すように、再サンプリング点Ｔ_２（ｘ＝２）の近傍の初期サンプリング点Ｓ_２及びＳ_１（ｍ＝２）の基準点からの角度θ_２及びθ_１と、タイムスタンプｓ_２及びｓ_１とに基づく線形補完により、再サンプリング点Ｔ_２でのカメラ２１による撮影によって得られるフレームのタイムスタンプｔ_２が推定される。図８に示す初期サンプリング点Ｓ_２及びＳ_１に対応するタイムスタンプと基準点からの角度とを式１０に代入することで、タイムスタンプｔ_２を推定できる。本実施の形態では、θ_２＝７５、θ_１＝３０、ｓ_２＝１８０及びｓ_１＝１２０からタイムスタンプｔ_２は１４０と推定される。 FIG. 10 is an explanatory diagram showing a method of estimating time stamps of a plurality of frames obtained by photographing at a re-sampling point. As shown in FIG. 10, the angles θ ₂ and θ ₁ from the reference points of the initial sampling points S ₂ and S ₁ (m = 2) in the vicinity of the re-sampling point T ₂ (x = 2) and the time stamp s ₂ and by linear interpolation based on the s _1, timestamp t ₂ of the frame obtained by the photographing by the camera 21 at the resampling point T ₂ it is estimated. The time stamp t ₂ can be estimated by substituting the time stamp corresponding to the initial sampling points S ₂ and S ₁ and the angle from the reference point shown in FIG. In the present embodiment, the time stamp t ₂ is estimated to be 140 from θ ₂ = 75, θ ₁ = 30, s ₂ = 180, and s ₁ = 120.

図８に示すように、８個の初期サンプリング点でのカメラ２１による撮影によって得られた８個の初期サンプリングフレームは、６０（１秒）毎のような等間隔なタイムスタンプのフレームである。こうすることで、図６に示されるように、初期サンプリング点は円周上に広く分散して位置することができる。従って、再サンプリング点Ｔ_ｘでのカメラ２１による撮影によって得られる複数のフレームのそれぞれのタイムスタンプｔ_ｘの推定の精度が高くなる。一方、例えば、初期サンプリング点が円周上の１部に偏った場合、再サンプリング点でのカメラ２１による撮影によって得られるフレームのタイムスタンプの推定の精度は、円周上の１部に偏った初期サンプリング点群から再サンプリング点が離れるほど低くなる。 As shown in FIG. 8, the eight initial sampling frames obtained by photographing by the camera 21 at the eight initial sampling points are frames of time stamps at equal intervals such as every 60 (1 second). By doing so, as shown in FIG. 6, the initial sampling points can be widely dispersed on the circumference. Therefore, the accuracy of estimation of each time stamp t _x of a plurality of frames obtained by photographing by the camera 21 at the re-sampling point T _x is increased. On the other hand, for example, when the initial sampling point is biased to a part on the circumference, the accuracy of the estimation of the time stamp of the frame obtained by photographing by the camera 21 at the resampling point is biased to a part on the circumference. The lower the re-sampling point from the initial sampling point group, the lower the value.

なお、本実施の形態では、複数の初期サンプリングフレームは、それぞれ時間的に等間隔なタイムスタンプのフレームであるが、これに限らない。例えば、複数の初期サンプリングフレームは、それぞれ等間隔なタイムスタンプのフレームでなくてもよい。 In the present embodiment, the plurality of initial sampling frames are time-stamp frames that are equally spaced in time. However, the present invention is not limited to this. For example, the plurality of initial sampling frames do not have to be time-stamped frames at equal intervals.

次に、１４０と推定されたタイムスタンプｔ_２と同様に、他の再サンプリング点でのカメラ２１による撮影によって得られるフレームのタイムスタンプは推定される。これにより、複数の再サンプリング点でのカメラ２１による撮影によって得られる複数のフレームのそれぞれのタイムスタンプが推定される。 Next, as with the time stamp t ₂ estimated to be 140, the time stamp of the frame obtained by imaging by the camera 21 at another re-sampling point is estimated. Thereby, time stamps of a plurality of frames obtained by photographing by the camera 21 at a plurality of re-sampling points are estimated.

最後に、画像抽出部１１は、撮影画像記憶部４０に記憶された動画像ストリームから、撮影位置算出部１２で推定されたそれぞれのタイムスタンプの再サンプリングフレームを、再サンプリング点でのカメラ２１による撮影によって得られる複数のフレームのそれぞれとして、再サンプリングによって抽出する。そして、画像抽出部１１は、抽出した再サンプリングフレームを静止画像として多視点画像ＤＢ５０に保存する。 Finally, the image extraction unit 11 uses the camera 21 at the re-sampling point to resample the time-stamp resampling frames estimated by the shooting position calculation unit 12 from the moving image stream stored in the shot image storage unit 40. Each of a plurality of frames obtained by photographing is extracted by resampling. Then, the image extraction unit 11 stores the extracted resampling frame in the multi-viewpoint image DB 50 as a still image.

［１−３．効果等］
以上のように、本技術は、カメラを被写体に対して相対的に移動させながら時間的に連続して撮影した動画像ストリームから複数のフレームをそれぞれ静止画像として抽出する静止画像抽出方法である。前記動画像ストリームに含まれる複数の第１のフレームを抽出するとともに、抽出した複数の第１のフレームそれぞれの撮影時刻を取得する動作と、複数の第１のフレームそれぞれの前記撮影時刻におけるカメラの撮影位置を複数の第１のフレームから特定する動作と、複数の第１のフレームそれぞれで特定された前記カメラの撮影位置と、取得された前記撮影時刻とに基づいて、距離が等間隔の複数の撮影位置で撮影することによって得られる複数のフレームそれぞれの撮影時刻を推定する動作と、推定されたそれぞれの前記撮影時刻の第２のフレームを、距離が等間隔の複数の撮影位置で撮影することによって得られる複数のフレームそれぞれとして前記動画像ストリームから抽出する動作とを行う。 [1-3. Effect]
As described above, the present technology is a still image extraction method in which a plurality of frames are extracted as still images from a moving image stream captured continuously in time while moving the camera relative to a subject. An operation of extracting a plurality of first frames included in the moving image stream, obtaining an imaging time of each of the extracted plurality of first frames, and a camera at the imaging time of each of the plurality of first frames Based on the operation of specifying the shooting position from the plurality of first frames, the shooting position of the camera specified by each of the plurality of first frames, and the acquired shooting time, a plurality of distances are equally spaced. The operation of estimating the shooting time of each of the plurality of frames obtained by shooting at the shooting position and the second frame at each of the estimated shooting times are shot at a plurality of shooting positions at equal intervals. The operation of extracting from the moving image stream as each of a plurality of frames obtained by this is performed.

これによれば、距離が等間隔に離れた適切な撮影位置における静止画像を短時間の撮影で抽出することができる。 According to this, it is possible to extract a still image at an appropriate shooting position whose distance is equally spaced by shooting in a short time.

すなわち、本実施の形態において、初期サンプリングによって動画像ストリームに含まれる複数の初期サンプリングフレームを抽出し、当該複数の初期サンプリングフレームに対応するタイムスタンプを取得する。そして、抽出した複数の初期サンプリングフレームのそれぞれの初期サンプリング点を特定し、特定した初期サンプリング点と、当該複数の初期サンプリングフレームに対応するタイムスタンプとに基づいて、再サンプリング点でのカメラによる撮影によって得られる複数のフレームのそれぞれのタイムスタンプを推定する。最後に、推定したそれぞれのタイムスタンプに対応する再サンプリングフレームを、再サンプリング点でのカメラ２１による撮影によって得られる複数のフレームのそれぞれとして、前記動画像ストリームから抽出する。 That is, in the present embodiment, a plurality of initial sampling frames included in the moving image stream are extracted by initial sampling, and time stamps corresponding to the plurality of initial sampling frames are acquired. Then, the initial sampling points of each of the plurality of extracted initial sampling frames are identified, and the image is taken by the camera at the re-sampling points based on the identified initial sampling points and the time stamps corresponding to the plurality of initial sampling frames. The time stamp of each of the plurality of frames obtained by is estimated. Finally, a re-sampling frame corresponding to each estimated time stamp is extracted from the moving image stream as each of a plurality of frames obtained by photographing by the camera 21 at the re-sampling point.

これにより、カメラを被写体に対して移動させながら被写体を撮影した場合、時間的に等間隔であっても距離は等間隔でない撮影位置での撮影によって得られる画像群から、距離が等間隔に離れた複数の撮影位置での撮影によって得られる静止画像を抽出することができる。そのため、距離が等間隔に離れた適切な撮影位置における静止画像を短時間の撮影で抽出することができる。 As a result, when the subject is photographed while moving the camera relative to the subject, the distance is evenly spaced from the image group obtained by photographing at the photographing position where the distance is not evenly spaced even if the distance is regular. It is possible to extract still images obtained by shooting at a plurality of shooting positions. Therefore, it is possible to extract a still image at an appropriate shooting position that is spaced at equal intervals by shooting in a short time.

また、前記動画像ストリームから抽出された第２のフレームの撮影位置を当該第２のフレームに基づいて特定し、前記第２のフレームの撮影位置と、前記距離が等間隔に離れた複数の撮影位置のうちの１つの等間隔位置との間の位置ずれが所定の範囲内であるか否かを判定する。前記位置ずれが所定の範囲内でない場合には、抽出された前記第２のフレームの撮影位置及び前記撮影時刻を用いて、前記等間隔位置でのカメラによる撮影によって得られるフレームの撮影時刻を再度推定し、再度推定された前記撮影時刻の第２のフレームを、距離が等間隔に離れた複数の撮影位置での前記カメラによる撮影によって得られるフレームとして、前記動画像ストリームから抽出する動作を行う。 In addition, the shooting position of the second frame extracted from the moving image stream is specified based on the second frame, and the shooting position of the second frame and a plurality of shootings at the same distance are separated. It is determined whether or not the positional deviation between one of the positions and the equally spaced position is within a predetermined range. If the positional deviation is not within a predetermined range, using the extracted shooting position and the shooting time of the second frame, the shooting time of the frame obtained by shooting with the camera at the equally spaced position is again set. The second frame at the shooting time estimated and re-estimated is extracted from the moving image stream as frames obtained by shooting with the camera at a plurality of shooting positions that are spaced at equal intervals. .

これによれば、距離が等間隔に離れた撮影位置における静止画像を高精度に抽出することができる。 According to this, it is possible to extract with high accuracy still images at shooting positions that are spaced at equal intervals.

また、前記撮影時刻を推定する際は、複数の第１のフレームそれぞれの特定された撮影位置及び取得された撮影時刻に基づく線形補完により、距離が等間隔の複数の撮影位置で撮影することによって得られる複数のフレームのそれぞれの撮影時刻を推定する。 In addition, when estimating the shooting time, by performing linear interpolation based on the specified shooting position of each of the plurality of first frames and the acquired shooting time, by shooting at a plurality of shooting positions at equal intervals. The imaging time of each of a plurality of obtained frames is estimated.

これによれば、距離が等間隔に離れた複数の撮影位置でのカメラによる撮影によって得られる複数のフレームについて、それぞれの撮影時刻を推定することができる。 According to this, it is possible to estimate the respective shooting times for a plurality of frames obtained by shooting with a camera at a plurality of shooting positions that are spaced at equal intervals.

また、前記複数の第１のフレームのそれぞれの撮影位置を特定する際は、前記複数の第１のフレームのそれぞれから抽出された複数の特徴点のマッチングにより、フレーム間の対応する点を算出し、算出したフレーム間の対応する点のそれぞれからＳＦＭ（ＳｔｒｕｃｔｕｒｅＦｒｏｍＭｏｔｉｏｎ）アルゴリズムを用いて前記撮影位置を特定する。 Further, when specifying the shooting positions of each of the plurality of first frames, the corresponding points between the frames are calculated by matching a plurality of feature points extracted from each of the plurality of first frames. Then, the photographing position is specified from each of the corresponding points between the calculated frames by using an SFM (Structure From Motion) algorithm.

これによれば、動画像ストリームに含まれる複数のフレームのそれぞれの取得された撮影時刻におけるカメラの撮影位置を特定することができる。 According to this, it is possible to specify the shooting position of the camera at the acquired shooting time of each of the plurality of frames included in the moving image stream.

また、推定されたそれぞれの前記撮影時刻の第２のフレームを抽出する際は、被写体を中心とした円周上でカメラを移動させながら撮影することにより得られる動画像ストリームから、等間隔の角度にある複数の撮影位置でのカメラによる撮影によって得られる複数のフレームとして、第２のフレームそれぞれを抽出する。 In addition, when extracting the second frames at the respective estimated shooting times, angles of equal intervals are obtained from a moving image stream obtained by shooting while moving the camera on the circumference around the subject. Each of the second frames is extracted as a plurality of frames obtained by photographing with a camera at a plurality of photographing positions.

これによれば、被写体を中心とした円周上の位置における撮影によって得られる動画像ストリームから等間隔に配置された撮影位置からの全周囲画像を抽出することができる。 According to this, it is possible to extract all-around images from shooting positions arranged at equal intervals from a moving image stream obtained by shooting at a position on the circumference around the subject.

さらに、推定されたそれぞれの前記撮影時刻の第２のフレームを抽出する際は、被写体を中心とした円周の接線方向にカメラを移動させて撮影することにより得られる動画像ストリームから、視差のある３次元画像を生成するための複数の第２のフレームを抽出する。 Further, when extracting the estimated second frames at the respective shooting times, the parallax of the parallax is obtained from the moving image stream obtained by moving the camera in the tangential direction of the circumference around the subject. A plurality of second frames for generating a certain three-dimensional image are extracted.

これによれば、奥行き方向からの撮影によって得られる動画像ストリームから視差のある３次元画像を生成することができる。 According to this, it is possible to generate a three-dimensional image with parallax from a moving image stream obtained by photographing from the depth direction.

また、推定されたそれぞれの前記撮影時刻の第２のフレームを抽出する際は、直線上にカメラを移動させながら撮影することにより得られる動画像ストリームから、等間隔の距離にある複数の撮影位置でのカメラによる撮影によって得られる複数のフレームとして、第２のフレームそれぞれを抽出する。 Further, when extracting the second frames at the respective estimated shooting times, a plurality of shooting positions that are equidistant from a moving image stream obtained by shooting while moving the camera on a straight line Each of the second frames is extracted as a plurality of frames obtained by shooting with the camera.

また、本技術の画像処理装置は、カメラを被写体に対して相対的に移動させながら時間的に連続して撮影した動画像ストリームから複数のフレームをそれぞれ静止画像として抽出する。前記画像処理装置は、画像抽出部と撮影位置算出部とを備える。前記画像抽出部は、動画像ストリームに含まれる複数の第１のフレームを抽出し、抽出した複数の第１のフレームそれぞれの撮影時刻を取得する動作を行う。前記撮影位置算出部は、複数の第１のフレームそれぞれで取得された撮影時刻におけるカメラの撮影位置に関する情報を複数の第１のフレームから特定し、第１のフレームそれぞれで取得された撮影時刻と、第１のフレームそれぞれで特定された撮影位置に関する情報とに基づき、距離が等間隔に離れた複数の撮影位置で撮影することによって得られる複数のフレームそれぞれの撮影時刻を推定する動作を行う。さらに、前記画像抽出部は、撮影位置算出部により推定されたそれぞれの撮影時刻に対応する第２のフレームを、距離が等間隔に離れた複数の撮影位置で撮影することによって得られる複数のフレームそれぞれとして動画像ストリームから抽出する動作を行うものである。 In addition, the image processing apparatus of the present technology extracts a plurality of frames as still images from a moving image stream continuously captured in time while moving the camera relative to the subject. The image processing apparatus includes an image extraction unit and a shooting position calculation unit. The image extraction unit performs an operation of extracting a plurality of first frames included in the moving image stream and acquiring a shooting time of each of the extracted first frames. The shooting position calculation unit identifies information on the shooting position of the camera at the shooting time acquired in each of the plurality of first frames from the plurality of first frames, and the shooting time acquired in each of the first frames; Then, based on the information regarding the shooting position specified in each of the first frames, an operation of estimating the shooting time of each of a plurality of frames obtained by shooting at a plurality of shooting positions that are spaced at equal intervals is performed. Further, the image extracting unit captures a plurality of frames obtained by capturing the second frame corresponding to each capturing time estimated by the capturing position calculating unit at a plurality of capturing positions that are equally spaced from each other. Each of them extracts the moving image stream.

図１１は、本技術による他の実施の形態による静止画像抽出方法の動作を示すフローチャートである。 FIG. 11 is a flowchart illustrating an operation of a still image extraction method according to another embodiment of the present technology.

本実施の形態の画像処理システムの画像処理装置１０は、再サンプリングによって抽出された再サンプリングフレームの撮影位置を検証し、その撮影位置が再サンプリング点から外れていれば、再サンプリングをもう一度行う。 The image processing apparatus 10 of the image processing system according to the present embodiment verifies the imaging position of the resampling frame extracted by resampling, and if the imaging position is out of the resampling point, performs resampling again.

図１１に示すように、図４に示す処理と同様、ステップＳ３０１〜Ｓ３０４の処理を行う。ステップＳ３０４において、画像抽出部１１は、再サンプリングフレームを、再サンプリング点でのカメラ２１による撮影によって得られる複数のフレームのそれぞれとして、動画像ストリームから再サンプリングによって抽出する。ステップＳ３０３で推定されたそれぞれのタイムスタンプに対応する再サンプリングフレームは、再サンプリング点でのカメラ２１による撮影によって得られる複数のフレームとは限らない。具体的には、再サンプリング点近傍の２つの初期サンプリング点の間で、タイムスタンプと基準点からの角度との関係が比例関係になっていないと、ステップＳ３０３での推定は間違ってしまう。 As shown in FIG. 11, the processes in steps S301 to S304 are performed in the same manner as the process shown in FIG. In step S304, the image extraction unit 11 extracts the re-sampling frame from the moving image stream by re-sampling as each of a plurality of frames obtained by photographing by the camera 21 at the re-sampling point. The re-sampling frame corresponding to each time stamp estimated in step S303 is not necessarily a plurality of frames obtained by photographing with the camera 21 at the re-sampling point. Specifically, if the relationship between the time stamp and the angle from the reference point is not proportional between the two initial sampling points in the vicinity of the re-sampling point, the estimation in step S303 is incorrect.

ターンテーブル２３の回転速度が一定でない場合、初期サンプリング点の間のタイムスタンプと基準点からの角度との関係が比例関係にならない。これにより、比例関係でないタイムスタンプと基準点からの角度とに基づく線形補完により推定されたタイムスタンプは、再サンプリング点に対応するタイムスタンプとならない。 When the rotation speed of the turntable 23 is not constant, the relationship between the time stamp between the initial sampling points and the angle from the reference point is not proportional. Thereby, the time stamp estimated by the linear interpolation based on the time stamp which is not proportional and the angle from the reference point does not become the time stamp corresponding to the re-sampling point.

本実施の形態では、ステップＳ３０４において再サンプリングによって抽出した複数の再サンプリングフレームを静止画像として多視点画像ＤＢ５０に直接保存せず、画像抽出部１１は、ステップＳ３０４において動画像ストリームから再サンプリングによって抽出した複数の再サンプリングフレームの撮影位置を、当該フレーム及び初期サンプリングフレームから特定する（ステップＳ３０５）。つまり、再サンプリングによって抽出された複数の再サンプリングフレームに対して、ステップＳ３０２での動作が行われる。これにより、ステップＳ３０３での推定が正しいか否かがわかる。 In the present embodiment, the plurality of re-sampling frames extracted by re-sampling in step S304 are not directly stored in the multi-viewpoint image DB 50 as still images, and the image extraction unit 11 extracts by re-sampling from the moving image stream in step S304. The shooting positions of the plurality of re-sampling frames are specified from the frame and the initial sampling frame (step S305). That is, the operation in step S302 is performed on a plurality of resampling frames extracted by resampling. Thereby, it can be determined whether or not the estimation in step S303 is correct.

次に、画像抽出部１１は、再サンプリングによって抽出した再サンプリングフレームのステップＳ３０５で特定した撮影位置と、距離が等間隔に離れた複数の撮影位置のうちの１つの撮影位置である等間隔位置、つまり再サンプリング点との間の位置ずれが所定の範囲内であるか否かを判定する（ステップＳ３０６）。例えば、画像抽出部１１は、再サンプリングによって抽出した再サンプリングフレームのステップＳ３０５で特定した撮影位置と、２２．５度毎の等間隔な角度にある複数の撮影位置のうちの１つの撮影位置である４５度の再サンプリング点との間の位置ずれが、所定の範囲内であるか否かを判定する。 Next, the image extraction unit 11 is an equidistant position that is one of the photographing positions identified in step S305 of the resampled frame extracted by resampling and a plurality of photographing positions whose distances are equally spaced. That is, it is determined whether or not the positional deviation from the re-sampling point is within a predetermined range (step S306). For example, the image extraction unit 11 uses one of the shooting positions specified in step S305 of the resampled frame extracted by resampling and a plurality of shooting positions at equal intervals of 22.5 degrees. It is determined whether or not a positional deviation from a certain 45-degree re-sampling point is within a predetermined range.

画像抽出部１１は、再サンプリングによって抽出した複数の再サンプリングフレームすべての位置ずれが所定の範囲内である場合には（ステップＳ３０６でＹｅｓ）、再サンプリングによって抽出した複数の再サンプリングフレームを静止画像として多視点画像ＤＢ５０に保存する。 When the positional deviations of all the plurality of resampling frames extracted by resampling are within a predetermined range (Yes in step S306), the image extraction unit 11 selects the plurality of resampling frames extracted by resampling as a still image. And stored in the multi-viewpoint image DB 50.

画像抽出部１１は、再サンプリングによって抽出した複数の再サンプリングフレームの位置ずれが所定の範囲内でない場合には（ステップＳ３０６でＮｏ）、再サンプリングによって抽出した再サンプリングフレームの撮影位置及びタイムスタンプを用いて、再度、再サンプリング点でのカメラ２１による撮影によって得られるフレームのタイムスタンプを推定する。つまり、再サンプリングによって抽出された複数の再サンプリングフレームの撮影位置及びタイムスタンプが、複数の初期サンプリングフレームの撮影位置に加えられて、推定の精度が高くなったステップＳ３０３での動作が行われる。そして、画像抽出部１１は、推定されたタイムスタンプのフレームを、再サンプリング点でのカメラ２１による撮影によって得られるフレームとして、動画像ストリームから抽出する。つまり、画像抽出部１１は、抽出した複数の再サンプリングフレームの位置ずれが所定の範囲内になるまで、直前の再サンプリングによって抽出された再サンプリングフレームを用いて再サンプリングを繰り返し実行する。これにより、確実に所定の範囲内に位置ずれが収まるフレームを抽出することができる。言い換えれば、再サンプリング点でのフレームを高い精度で抽出することができる。 When the positional deviation of the plurality of resampling frames extracted by resampling is not within the predetermined range (No in step S306), the image extraction unit 11 uses the imaging position and time stamp of the resampling frame extracted by resampling. Then, again, the time stamp of the frame obtained by photographing with the camera 21 at the re-sampling point is estimated. In other words, the shooting positions and time stamps of the plurality of re-sampling frames extracted by resampling are added to the shooting positions of the plurality of initial sampling frames, and the operation in step S303 in which the estimation accuracy is increased is performed. Then, the image extracting unit 11 extracts the estimated time stamp frame from the moving image stream as a frame obtained by photographing by the camera 21 at the re-sampling point. That is, the image extraction unit 11 repeatedly performs resampling using the resampling frame extracted by the immediately preceding resampling until the positional deviation of the extracted resampling frames falls within a predetermined range. As a result, it is possible to reliably extract a frame in which the positional deviation falls within a predetermined range. In other words, the frame at the resampling point can be extracted with high accuracy.

以上のように、動画像ストリームから再サンプリングによって抽出した複数の再サンプリングフレームの撮影位置を、当該再サンプリングフレームに基づいて特定する。そして、再サンプリングによって抽出した再サンプリングフレームの撮影位置と、再サンプリング点との間の位置ずれが所定の範囲内であるか否かを判定する。再サンプリングによって抽出した複数の再サンプリングフレームの位置ずれが所定の範囲内でない場合には、再サンプリングによって抽出した再サンプリングフレームの撮影位置及びタイムスタンプを用いて、再サンプリング点でのカメラ２１による撮影によって得られるフレームのタイムスタンプを推定する。そして、推定したタイムスタンプにより抽出した複数の再サンプリングフレームの位置ずれが所定の範囲内になるまで、直前の再サンプリングによって抽出した再サンプリングフレームを用いて再サンプリングを繰り返し実行する。 As described above, the shooting positions of a plurality of resampling frames extracted from the moving image stream by resampling are specified based on the resampling frames. Then, it is determined whether or not the positional deviation between the imaging position of the re-sampling frame extracted by re-sampling and the re-sampling point is within a predetermined range. When the positional deviation of a plurality of re-sampling frames extracted by resampling is not within a predetermined range, shooting by the camera 21 at the re-sampling point is performed using the shooting position and time stamp of the re-sampling frame extracted by re-sampling. Estimate the time stamp of the frame obtained by Then, re-sampling is repeatedly performed using the re-sampling frame extracted by the previous re-sampling until the positional deviation of the plurality of re-sampling frames extracted by the estimated time stamp falls within a predetermined range.

これにより、再サンプリングによって抽出した複数の再サンプリングフレームの位置ずれが所定の範囲内に収まるまで再サンプリングが行われるので、距離が等間隔に離れた適切な撮影位置における静止画像を抽出することができる。 As a result, re-sampling is performed until the positional deviation of the plurality of re-sampling frames extracted by re-sampling falls within a predetermined range, so that it is possible to extract a still image at an appropriate shooting position that is spaced at regular intervals. it can.

図１２は、本技術の他の実施の形態による被写体の撮影方法を示す説明図である。図１２に示すように、撮影部２０は、ロボットアーム２２とは別に、スライダー２５を備え、スライダー２５にカメラ２１が取り付けられる。 FIG. 12 is an explanatory diagram illustrating a method for photographing a subject according to another embodiment of the present technology. As illustrated in FIG. 12, the photographing unit 20 includes a slider 25 separately from the robot arm 22, and the camera 21 is attached to the slider 25.

図１２の一点鎖線で囲んだように（矢印Ａで示す部分）、ロボットアーム２２は、被写体２４を中心とした図１２のＸＺ平面上の円周方向にスライダー２５及びスライダー２５に取り付けられたカメラ２１を移動可能に保持している。スライダー２５は、図１２の矢印Ａに示すように、スライダー２５に取り付けられたカメラ２１を図１２のｙ軸方向に移動可能に保持している。図１２の一点鎖線で囲んだ部分は、カメラ２１を正面から見たときのスライダー２５によりカメラ２１がｙ軸方向へ移動する様子を示した図である。 As surrounded by a one-dot chain line in FIG. 12 (part indicated by an arrow A), the robot arm 22 includes a slider 25 and a camera attached to the slider 25 in the circumferential direction on the XZ plane of FIG. 21 is held movably. As indicated by an arrow A in FIG. 12, the slider 25 holds the camera 21 attached to the slider 25 so as to be movable in the y-axis direction in FIG. A portion surrounded by a one-dot chain line in FIG. 12 is a diagram illustrating a state in which the camera 21 is moved in the y-axis direction by the slider 25 when the camera 21 is viewed from the front.

上記実施の形態では、カメラ２１は、被写体２４を中心とした円周上を被写体２４に対して相対的に移動しながら、被写体２４を撮影することにより動画像ストリームを生成した。一方、本実施の形態では、カメラ２１はさらに、被写体２４を中心とした円周の接線方向（ｙ軸方向）にも移動し、移動後に再度、被写体２４を中心とした円周上を被写体２４に対して相対的に移動しながら、被写体２４を撮影する。具体的には制御部３０は以下の制御を行う。 In the embodiment described above, the camera 21 generates a moving image stream by shooting the subject 24 while moving relative to the subject 24 on the circumference around the subject 24. On the other hand, in the present embodiment, the camera 21 further moves in the tangential direction (y-axis direction) of the circumference around the subject 24, and after the movement, the subject 24 moves on the circumference around the subject 24 again. The subject 24 is photographed while moving relative to the image. Specifically, the control unit 30 performs the following control.

制御部３０は、上記実施の形態と同様に、ターンテーブル２３の回転、及び、ロボットアーム２２によるカメラ２１の移動の制御を繰り返し行うことで、被写体２４を中心とした球面上の位置からカメラ２１に被写体２４を撮影させる。そして、制御部３０は、スライダー２５によるカメラ２１の移動の制御を行い、被写体２４を図１２のｙ軸方向にカメラ２１を移動させる。制御部３０は、カメラ２１をｙ軸方向に移動させた後に、再度、カメラ２１に被写体２４を中心とした球面上の位置からカメラ２１に被写体２４を撮影させる。これにより、被写体２４の奥行き方向についても撮影されるので、被写体２４を中心とした球面上の位置から被写体２４を撮影した視差のある３次元画像を生成するための動画像ストリームが得られる。 The control unit 30 repeatedly controls the rotation of the turntable 23 and the movement of the camera 21 by the robot arm 22 in the same manner as in the above embodiment, so that the camera 21 is positioned from the position on the spherical surface with the subject 24 as the center. The subject 24 is photographed. Then, the control unit 30 controls the movement of the camera 21 by the slider 25 and moves the camera 21 in the y-axis direction of FIG. After moving the camera 21 in the y-axis direction, the control unit 30 causes the camera 21 to image the subject 24 again from a position on the spherical surface centered on the subject 24. Thereby, since the subject 24 is also photographed in the depth direction, a moving image stream for generating a parallax three-dimensional image obtained by photographing the subject 24 from a position on the spherical surface centering on the subject 24 is obtained.

以上のように、本変形例において、被写体２４を中心とした円周の接線方向にカメラ２１を移動させて撮影する。 As described above, in this modified example, the camera 21 is moved in the tangential direction of the circumference with the subject 24 as the center for shooting.

これにより、視差のある３次元画像を生成するためのフレームを含む動画像ストリームが得られる。そのため、視差のある３次元画像を生成するための複数のフレームを抽出することができる。 Thereby, a moving image stream including a frame for generating a three-dimensional image with parallax is obtained. Therefore, it is possible to extract a plurality of frames for generating a three-dimensional image with parallax.

以上のように、本技術における技術の例示として、実施の形態を説明した。そのために、添付図面及び詳細な説明を提供した。 As described above, the embodiments have been described as examples of the technology in the present technology. For this purpose, the accompanying drawings and detailed description are provided.

したがって、添付図面及び詳細な説明に記載された構成要素の中には、課題解決のために必須な構成要素だけでなく、課題解決のためには必須でない構成要素も含まれ得る。そのため、それらの必須ではない構成要素が添付図面や詳細な説明に記載されていることをもって、直ちに、それらの必須ではない構成要素が必須であるとの認定をするべきではない。 Accordingly, the constituent elements described in the accompanying drawings and the detailed description may include not only constituent elements essential for solving the problem but also constituent elements not essential for solving the problem. Therefore, it should not be immediately recognized that these non-essential components are essential as those non-essential components are described in the accompanying drawings and detailed description.

また、上述の実施の形態は、本技術における技術を例示するためのものであるから、特許請求の範囲又はその均等の範囲において種々の変更、置き換え、付加、省略などを行うことができる。 Moreover, since the above-mentioned embodiment is for demonstrating the technique in this technique, a various change, replacement, addition, omission, etc. can be performed in a claim or its equivalent range.

本技術は、例えばインタラクティブに物体の全周囲画像を表示するための静止画像を抽出する画像処理装置に適用可能である。 The present technology can be applied to, for example, an image processing apparatus that extracts a still image for interactively displaying an entire surrounding image of an object.

１画像処理システム
１０画像処理装置
１１画像抽出部
１２撮影位置算出部
２０撮影部
２１カメラ
２２ロボットアーム
２３ターンテーブル
２４被写体
２５スライダー
３０制御部
４０撮影画像記憶部
５０多視点画像ＤＢ DESCRIPTION OF SYMBOLS 1 Image processing system 10 Image processing apparatus 11 Image extraction part 12 Shooting position calculation part 20 Shooting part 21 Camera 22 Robot arm 23 Turntable 24 Subject 25 Slider 30 Control part 40 Photographed image memory | storage part 50 Multiview image DB

Claims

A plurality of first frames are set such that shooting times are equidistant from a moving image stream which is taken continuously in time by moving the camera relatively in one direction on the circumference or straight line centering on the subject. The action of extracting
An act of identifying a put that shooting positions on the plurality of first frames each shooting time from the plurality of first frame,
Operation and to estimate the respective imaging times plurality of frames obtained by the basis of the plurality of first frames each photographing time and photographing location, distance is taken at a plurality of photographing positions equally spaced,
A still image extraction method comprising: performing an operation of extracting each of a plurality of second frames corresponding to each of a plurality of estimated shooting times as a still image from the moving image stream.

The still image extraction method according to claim 1 , wherein the number of the plurality of first frames is smaller than the number of frames included in the moving image stream .

2. The moving image stream according to claim 1, wherein the moving image stream is a moving image stream that is taken continuously in time by moving the camera at a relatively constant speed around the subject. The still image extraction method described.

An image processing apparatus and a photographing position calculation unit images extractor,
The image extraction unit moves the camera relatively in one direction on a circumference or a straight line with the subject as the center so that shooting times are equidistant from a moving image stream that is taken continuously in time. An operation of extracting a plurality of first frames is performed,
The photographing position calculating unit, the imaging position location was identified from the plurality of first frame, the shooting time of each of the plurality of first frame and the imaging position in the plurality of first frames each shooting time based distance performs an operation for estimating the respective imaging times plurality of frames obtained by photographing a plurality of photographing positions equally spaced,
Before Symbol image extracting unit includes a feature to perform an operation of extracting a plurality of second frame corresponding to each of the photographing positions plurality of imaging times estimated by the calculation unit, from the moving image stream as a still image An image processing apparatus.

An operation of moving a camera relative to a subject and extracting a plurality of first frames from a moving image stream captured continuously in time;
An operation of acquiring a photographing time of each of the plurality of first frames;
An operation of specifying the shooting position of each of the plurality of first frames from the plurality of first frames;
An operation of estimating the shooting time of each of a plurality of frames obtained by shooting at a plurality of shooting positions at equal intervals based on the shooting time and shooting position of each of the plurality of first frames;
An operation of extracting a plurality of second frames corresponding to each of a plurality of estimated shooting times;
An act of identifying on the basis of the shooting position of each of the plurality of second frame to the plurality of second frames,
And imaging position of each of the plurality of second frames, the operation of positional deviation of the distance is equidistant photographing position to determine whether it is within a predetermined range,
When the positional deviation is not within the predetermined of the previous SL by using a plurality of second frames each photographing position and photographing time, distance and the operation of estimating the shooting time having equal intervals photographing position again ,
A still image extraction method, comprising: extracting a plurality of second frames corresponding to a plurality of re-estimated shooting times as still images from the moving image stream.

An image processing apparatus including an image extraction unit and a shooting position calculation unit,
The image extraction unit moves a camera relative to a subject, extracts a plurality of first frames from a moving image stream captured continuously in time, and extracts each of the plurality of first frames. Take action to get the shooting time,
The shooting position calculation unit specifies a shooting position at each shooting time of the plurality of first frames from the plurality of first frames, and based on the shooting time and the shooting position of each of the plurality of first frames, Performing an operation to estimate the shooting time of each of a plurality of frames obtained by shooting at a plurality of shooting positions at equal distances,
The image extraction unit performs an operation of extracting a plurality of second frames corresponding to a plurality of shooting times estimated by the shooting position calculation unit,
The shooting position calculation unit specifies a shooting position at a shooting time of each of the plurality of second frames from the plurality of second frames, and the shooting positions of the plurality of second frames are equally spaced from each other. It is determined whether or not the positional deviation with respect to the photographing position is within a predetermined range. If the positional deviation is not within the predetermined range, the photographing position and photographing time of each of the plurality of second frames are determined. Use it to re-estimate the shooting time at which the shooting positions are equally spaced,
The image processing apparatus, wherein the image extraction unit performs an operation of extracting each of a plurality of second frames corresponding to a plurality of re-estimated shooting times from the moving image stream as a still image.

An operation of moving a camera relative to a subject and extracting a plurality of first frames from a moving image stream captured continuously in time;
An operation of acquiring a photographing time of each of the plurality of first frames;
An operation of specifying the shooting position of each of the plurality of first frames from the plurality of first frames;
By linear interpolation based on the photographing time and photographing location of each of the plurality of first frame, operation for estimating a plurality of frames each imaging time obtained by the distance to shoot at a plurality of photographing positions equally spaced,
A still image extraction method comprising: performing an operation of extracting each of a plurality of second frames corresponding to each of a plurality of estimated shooting times as a still image from the moving image stream.

An image processing apparatus including an image extraction unit and a shooting position calculation unit,
The image extraction unit moves a camera relative to a subject, extracts a plurality of first frames from a moving image stream captured continuously in time, and extracts each of the plurality of first frames. Take action to get the shooting time,
The shooting position calculation unit specifies shooting positions at the shooting times of the plurality of first frames from the plurality of first frames, and is linear based on the shooting times and shooting positions of the plurality of first frames. By complementing, perform the operation to estimate the shooting time of each of multiple frames obtained by shooting at multiple shooting positions with equal distances,
The image extraction unit performs an operation of extracting each of a plurality of second frames corresponding to each of a plurality of shooting times estimated by the shooting position calculation unit as a still image from the moving image stream. Image processing device.