JP2019164783A

JP2019164783A - Image processing apparatus, image capturing system, image processing method, and program

Info

Publication number: JP2019164783A
Application number: JP2019042289A
Authority: JP
Inventors: 浩水藤; Hiroshi Mizufuji; 啓一河口; Keiichi Kawaguchi; 洋介川村; Yosuke Kawamura
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2018-03-15
Filing date: 2019-03-08
Publication date: 2019-09-26

Abstract

To solve the problem in which the difference in the projection type between an equirectangular projection image EC (spherical image CE) and a planar image P causes poor visibility when the high-definition planar image P captured separately from the spherical image CE is superimposed on a partial area of the low-definition spherical image CE.SOLUTION: An image processing apparatus calculates in a first image EC a first corresponding area CA1 corresponding to a second image P, determines a peripheral area PA including the area CA1, and converts the area PA to the projection type of the image P (S140). The image processing apparatus calculates in an image PI after the conversion a second corresponding area CA2 corresponding to the image P (S160). The image processing apparatus generates a rectangular default shape (S170), and converts the default shape to the projection type of the image EC thereby creating a reference shape (S200). The image processing apparatus calculates reference shape conversion information for converting the reference shape on the basis of a relation of the image P to the default shape (S180).SELECTED DRAWING: Figure 19

Description

本発明は、画像処理装置、撮影システム、画像処理方法、及びプログラムに関するものである。 The present invention relates to an image processing apparatus, a photographing system, an image processing method, and a program.

従来、広角の平面画像のうちの一部の領域に、広角の平面画像とは別に撮影することで得られた拡大の平面画像を嵌め込むことで、上記一部の領域を拡大しても鮮明な画像を表示することができる技術が開示されている（特許文献１参照）。 Conventionally, an enlarged planar image obtained by photographing separately from a wide-angle planar image is inserted into a partial area of the wide-angle planar image, so that the partial area is enlarged even when the partial area is enlarged. A technique capable of displaying a simple image is disclosed (see Patent Document 1).

ところで、近年、一度の撮影で、３６０°の全天球画像の元になる２つの半球画像データを得る特殊なデジタルカメラが提供されている（特許文献２参照）。このデジタルカメラは、２つの半球画像データに基づいて１つの正距円筒射影画像データを作成し、スマートフォン等の通信端末に正距円筒射影画像データを送信する。正距円筒射影画像データを得た通信端末は、正距円筒射影画像データに基づいて全天球画像を作成する。但し、そのままでは画像が湾曲して利用者が見えづらいため、通信端末に全天球画像の一部の所定領域を示す所定領域画像を表示させることで、利用者は一般のデジタルカメラで撮影された平面画像と同じ感覚で閲覧することができる。 By the way, in recent years, there has been provided a special digital camera that obtains two hemispherical image data based on a 360-degree omnidirectional image by one shooting (see Patent Document 2). This digital camera creates one equirectangular projection image data based on two hemispherical image data, and transmits the equirectangular projection image data to a communication terminal such as a smartphone. The communication terminal that has obtained the equirectangular projection image data creates an omnidirectional image based on the equirectangular projection image data. However, since the image is curved as it is, it is difficult for the user to see. Therefore, by displaying a predetermined area image indicating a predetermined area of the omnidirectional image on the communication terminal, the user is photographed with a general digital camera. Can be viewed with the same feeling as a flat image.

しかしながら、例えば、全天球画像の一部の領域に、全天球画像とは別に撮影することで得られた平面画像を重畳等する場合のように、射影方式が異なる一方の画像に他方の画像を重畳等すると、見えづらいという課題が生じる。 However, for example, in the case where a planar image obtained by shooting separately from the omnidirectional image is superimposed on a partial area of the omnidirectional image, one image having a different projection method is combined with the other image. When images are superimposed, there is a problem that it is difficult to see.

請求項１に係る発明は、第１の射影方式の第１の画像と、第１の射影方式とは異なる第２の射影方式の第２の画像とを取得する取得手段と、前記第１の画像を前記第２の射影方式の第３の画像に変換する第１の射影方式変換手段と、前記第２の画像及び前記第３の画像からそれぞれ複数の特徴点を抽出する抽出手段と、前記抽出手段でそれぞれ抽出した前記第２の画像の複数の特徴点及び前記第３の画像の複数の特徴点に基づいて前記第３の画像における前記第２の画像に対応する第２の対応領域を求める対応領域算出手段と、前記第２の射影方式であって前記第２の画像の形状に対して射影変換の関係にある初期設定形状を生成する初期設定形状生成手段と、前記初期設定形状を前記第１の射影方式に変換することで基準形状を作成する第２の射影方式変換手段と、前記第２の画像から前記第２の対応領域に射影変換する場合の情報を用いて、当該第２の対応領域に対して前記初期設定形状を変換する情報を算出することによって、前記基準形状を回転及び変倍の少なくとも一方により変換させるための基準形状変換情報を算出する基準形状変換情報算出手段と、前記基準形状を示す基準形状情報及び前記基準形状変換情報を関連付けて記憶する記憶手段と、を有することを特徴とする画像処理装置である。 The invention according to claim 1 is an acquisition means for acquiring a first image of the first projection method and a second image of a second projection method different from the first projection method; First projection method conversion means for converting an image into a third image of the second projection method, extraction means for extracting a plurality of feature points from the second image and the third image, and Based on a plurality of feature points of the second image and a plurality of feature points of the third image respectively extracted by the extraction means, a second corresponding region corresponding to the second image in the third image is obtained. Corresponding area calculation means to be obtained; initial setting shape generating means for generating an initial setting shape that is in the second projection method and has a projective transformation relationship with respect to the shape of the second image; and the initial setting shape. A reference shape is created by converting to the first projection method. The information for converting the initial setting shape for the second corresponding area is calculated using the projection method conversion means and the information for performing the projective conversion from the second image to the second corresponding area. The reference shape conversion information calculating means for calculating the reference shape conversion information for converting the reference shape by at least one of rotation and scaling, and the reference shape information indicating the reference shape and the reference shape conversion information are associated with each other. And an image processing apparatus characterized by comprising storage means for storing.

以上説明したように本発明によれば、射影方式が異なる一方の画像に他方の画像を合わせても、見えづらさを抑制することができるという効果を奏する。 As described above, according to the present invention, it is possible to suppress the difficulty of seeing even if the other image is combined with one image having a different projection method.

（ａ）は特殊撮影装置の左側面図であり、（ｂ）は特殊撮影装置の背面図であり、（ｃ）は特殊撮影装置の平面図であり、（ｄ）は特殊撮影装置の底面図である。(A) is a left side view of the special photographing apparatus, (b) is a rear view of the special photographing apparatus, (c) is a plan view of the special photographing apparatus, and (d) is a bottom view of the special photographing apparatus. It is. 特殊撮影装置の使用イメージ図である。It is a usage image figure of a special imaging device. （ａ）は特殊撮影装置で撮影された半球画像（前）、（ｂ）は特殊撮影装置で撮影された半球画像（後）、（ｃ）は正距円筒図法により表された画像を示した図である。(A) is a hemispheric image (front) photographed by a special photographing device, (b) is a hemispheric image (rear) photographed by a special photographing device, and (c) is an image represented by equirectangular projection. FIG. （ａ）は正距円筒射影画像で球を被う状態を示した概念図、（ｂ）は全天球画像を示した図である。(A) is the conceptual diagram which showed the state which covered the sphere with an equirectangular projection image, (b) is the figure which showed the omnidirectional image. 全天球画像を３次元の立体球とした場合の仮想カメラ及び所定領域の位置を示した図である。It is the figure which showed the position of the virtual camera and predetermined | prescribed area | region at the time of making an omnidirectional image into a three-dimensional solid sphere. （ａ）は図５の立体斜視図、（ｂ）は通信端末のディスプレイに所定領域の画像が表示されている状態を示す図である。(A) is a three-dimensional perspective view of FIG. 5, (b) is a figure which shows the state in which the image of the predetermined area is displayed on the display of a communication terminal. 所定領域情報と所定領域Ｔの画像との関係を示した図である。FIG. 6 is a diagram illustrating a relationship between predetermined area information and an image of a predetermined area T. 本発明の第１の実施形態に係る撮影システムの概略図である。1 is a schematic diagram of an imaging system according to a first embodiment of the present invention. 撮影システムの使用イメージ図である。It is a usage image figure of an imaging system. 特殊撮影装置のハードウェア構成図である。It is a hardware block diagram of a special imaging device. 一般撮影装置のハードウェア構成図である。It is a hardware block diagram of a general imaging device. スマートフォンのハードウェア構成図である。It is a hardware block diagram of a smart phone. 第１の実施形態に係る撮影システムの機能ブロック図である。It is a functional block diagram of the imaging system concerning a 1st embodiment. （ａ）は連携撮影装置管理テーブルの概念図、（ｂ）連携撮影装置設定画面を示す概念図である。(A) is a conceptual diagram of a cooperative photographing device management table, (b) is a conceptual diagram showing a cooperative photographing device setting screen. 第１の実施形態に係るメタデータ作成部の詳細な機能ブロック図である。It is a detailed functional block diagram of the metadata creation part which concerns on 1st Embodiment. 第１の実施形態に係る重畳部の詳細な機能ブロック図である。It is a detailed functional block diagram of the superimposition part which concerns on 1st Embodiment. 重畳表示メタデータの構成図である。It is a block diagram of superimposition display metadata. 第１の実施形態に係る撮影方法を示したシーケンス図である。It is the sequence diagram which showed the imaging | photography method which concerns on 1st Embodiment. 重畳表示パラメータの作成処理の過程における画像の概念図である。It is a conceptual diagram of the image in the process of the production | generation process of a superimposition display parameter. 周辺領域画像を特定する際の概念図である。It is a conceptual diagram at the time of specifying a peripheral region image. （ａ）は初期設定形状を示した概念図、（ｂ）は周辺領域画像及び第２の対応領域を示した概念図、（ｃ）は領域分割した初期設定形状の概念図である。(A) is a conceptual diagram showing an initial setting shape, (b) is a conceptual diagram showing a peripheral region image and a second corresponding region, and (c) is a conceptual diagram of an initial setting shape divided into regions. （ａ）基準形状の中心を全天球画像の中心に配置したときの概念図、（ｂ）基準形状の中心を全天球画像の中心以外の点に配置したときの概念図である。(A) Conceptual diagram when the center of the reference shape is arranged at the center of the omnidirectional image, (b) Conceptual diagram when the center of the reference shape is arranged at a point other than the center of the omnidirectional image. 重畳表示の処理の課程における画像の概念図である。It is a conceptual diagram of the image in the process of the superimposition display process. 角度パラメータの補間方法を示す概念図である。It is a conceptual diagram which shows the interpolation method of an angle parameter. 位置パラメータ算出部における処理の流れを示した三次元の概念図である。It is the three-dimensional conceptual diagram which showed the flow of the process in a position parameter calculation part. （ａ）は第３の対応領域の格子領域を示した概念図、（ｂ）は平面画像の格子領域を示した概念図である。(A) is the conceptual diagram which showed the lattice area | region of the 3rd corresponding | compatible area | region, (b) is the conceptual diagram which showed the lattice area | region of the plane image. ４つの格子領域と角の１点である共通の格子点を示した概念図である。It is the conceptual diagram which showed the four grid | lattice area | regions and the common grid point which is one point of a corner. 重畳の処理の課程における画像の概念図である。It is a conceptual diagram of the image in the process of the superimposition process. 全天球画像に平面画像を重畳した場合の二次元の概念図である。It is a two-dimensional conceptual diagram when a planar image is superimposed on an omnidirectional image. 全天球画像に平面画像を重畳した場合の三次元の概念図である。It is a three-dimensional conceptual diagram when a planar image is superimposed on an omnidirectional image. 本実施形態の位置パラメータを用いずに、全天球画像に平面画像を重畳した場合の二次元の概念図である。It is a two-dimensional conceptual diagram when a planar image is superimposed on an omnidirectional image without using the position parameter of the present embodiment. 本実施形態の位置パラメータを用いて、全天球画像に平面画像を重畳した場合の二次元の概念図である。It is a two-dimensional conceptual diagram when a planar image is superimposed on an omnidirectional image using the position parameters of the present embodiment. （ａ）重畳表示しない場合のワイド画像の表示例、（ｂ）重畳表示しない場合のテレ画像の表示例、（ｃ）重畳表示する場合のワイド画像の表示例、（ｄ）重畳表示する場合のテレ画像の表示例を示した概念図である。(A) Display example of wide image without superimposing display, (b) Display example of tele image without superimposing display, (c) Display example of wide image with superimposing display, (d) In case of superimposing display It is the conceptual diagram which showed the example of a display of a tele image. 動画の所定領域画像内に動画の平面画像が表示されている例を示す図である。It is a figure which shows the example in which the plane image of a moving image is displayed within the predetermined area image of a moving image. 本発明の第２の実施形態に係る撮影システムの概略図である。It is the schematic of the imaging | photography system which concerns on the 2nd Embodiment of this invention. 画像処理サーバのハードウェア構成図である。It is a hardware block diagram of an image processing server. 第２の実施形態に係る撮影システムの機能ブロック図である。It is a functional block diagram of the imaging system which concerns on 2nd Embodiment. 第２の実施形態に係るメタデータ作成部の詳細な機能ブロック図である。It is a detailed functional block diagram of the metadata creation part which concerns on 2nd Embodiment. 第２の実施形態に係る重畳部の詳細な機能ブロック図である。It is a detailed functional block diagram of the superimposition part which concerns on 2nd Embodiment. 第２の実施形態に係る撮影方法を示したシーケンス図である。It is the sequence diagram which showed the imaging | photography method which concerns on 2nd Embodiment.

以下、図面を用いて、本発明の実施形態について説明する。なお、後述の全天球画像は第１の画像の一例であり、重畳画像は第２の画像の一例である。周辺領域画像は第３の画像の一例である。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. Note that the omnidirectional image described later is an example of the first image, and the superimposed image is an example of the second image. The peripheral area image is an example of a third image.

〔実施形態の概要〕
以下、本実施形態の概要について説明する。 [Outline of Embodiment]
Hereinafter, an outline of the present embodiment will be described.

まずは、図１乃至図７を用いて、全天球画像の生成方法について説明する。 First, a method for generating an omnidirectional image will be described with reference to FIGS.

まず、図１を用いて、特殊撮影装置１の外観を説明する。特殊撮影装置１は、全天球（３６０°）パノラマ画像の元になる撮影画像を得るためのデジタルカメラである。なお、図１（ａ）は特殊撮影装置の左側面図であり、図１（ｂ）は特殊撮影装置の背面図であり、図１（ｃ）は特殊撮影装置の平面図であり、図１（ｄ）は特殊撮影装置の底面図である。 First, the external appearance of the special photographing apparatus 1 will be described with reference to FIG. The special photographing apparatus 1 is a digital camera for obtaining a photographed image that is the basis of a panoramic image of a celestial sphere (360 °). 1A is a left side view of the special photographing apparatus, FIG. 1B is a rear view of the special photographing apparatus, and FIG. 1C is a plan view of the special photographing apparatus. (D) is a bottom view of the special photographing apparatus.

図１（ａ），図１（ｂ），図１（ｃ），図（ｄ）に示されているように、特殊撮影装置１の上部には、正面側（前側）に魚眼型のレンズ１０２ａ及び背面側（後側）に魚眼型のレンズ１０２ｂが設けられている。特殊撮影装置１の内部には、後述の撮像素子（画像センサ）１０３ａ，１０３ｂが設けられており、それぞれレンズ１０２ａ、１０２ｂを介して被写体や風景を撮影することで、半球画像（画角１８０°以上）を得ることができる。特殊撮影装置１の正面側と反対側の面には、シャッターボタン１１５ａが設けられている。また、特殊撮影装置１の側面には、電源ボタン１１５ｂ、Wi-Fi(Wireless Fidelity)ボタン１１５ｃ、及び撮影モード切替ボタン１１５ｄが設けられている。電源ボタン１１５ｂ、及びWi-Fiボタン１１５ｃは、いずれも押下される度に、オンとオフが切り替えられる。また、撮影モード切替ボタン１１５ｄは、押下される度に、静止画の撮影モードと動画の撮影モードが切り替えられる。なお、シャッターボタン１１５ａ、電源ボタン１１５ｂ、Wi-Fiボタン１１５ｃ、及び撮影モード切替ボタン１１５ｄは、操作部１１５の一部であり、操作部１１５は、これらのボタンに限られない。 As shown in FIGS. 1 (a), 1 (b), 1 (c), and (d), a fish-eye lens is placed on the front side (front side) of the special photographing apparatus 1 at the top. A fisheye lens 102b is provided on the rear side (rear side) 102a. Inside the special imaging device 1, imaging elements (image sensors) 103a and 103b, which will be described later, are provided, and a hemispherical image (angle of view 180 °) is obtained by photographing a subject or a landscape through the lenses 102a and 102b, respectively. Above). A shutter button 115 a is provided on the surface opposite to the front side of the special imaging device 1. In addition, a power button 115b, a Wi-Fi (Wireless Fidelity) button 115c, and a shooting mode switching button 115d are provided on the side surface of the special imaging device 1. Each time the power button 115b and the Wi-Fi button 115c are pressed, they are switched on and off. The shooting mode switching button 115d switches between a still image shooting mode and a moving image shooting mode each time the button is pressed. Note that the shutter button 115a, the power button 115b, the Wi-Fi button 115c, and the shooting mode switching button 115d are part of the operation unit 115, and the operation unit 115 is not limited to these buttons.

また、特殊撮影装置１の底部１５０の中央には、カメラ用三脚に特殊撮影装置１や一般撮影装置３を取り付けるための三脚ねじ穴１５１が設けられている。また、底部１５０の左端側には、Micro USB(Universal Serial Bus)端子１５２が設けられている。底部１５０の右端側には、HDMI(High-Definition Multimedia Interface)端子１５３が設けられている。なお、HDMIは登録商標である。 A tripod screw hole 151 for attaching the special photographing device 1 or the general photographing device 3 to a camera tripod is provided at the center of the bottom 150 of the special photographing device 1. A micro USB (Universal Serial Bus) terminal 152 is provided on the left end side of the bottom 150. An HDMI (High-Definition Multimedia Interface) terminal 153 is provided on the right end side of the bottom 150. HDMI is a registered trademark.

次に、図２を用いて、特殊撮影装置１の使用状況を説明する。なお、図２は、特殊撮影装置の使用イメージ図である。特殊撮影装置１は、図２に示されているように、例えば、利用者が手に持って利用者の周りの被写体を撮影するために用いられる。この場合、図１に示されている撮像素子１０３ａ及び撮像素子１０３ｂによって、それぞれ利用者の周りの被写体が撮像されることで、２つの半球画像を得ることができる。 Next, the use situation of the special photographing apparatus 1 will be described with reference to FIG. In addition, FIG. 2 is a use image figure of a special imaging device. As shown in FIG. 2, the special imaging device 1 is used, for example, for a user to take a picture of a subject around the user with his / her hand. In this case, two hemispherical images can be obtained by imaging the subject around the user by the imaging device 103a and the imaging device 103b shown in FIG.

次に、図３及び図４を用いて、特殊撮影装置１で撮影された画像から正距円筒射影画像ＥＣ及び全天球画像ＣＥが作成されるまでの処理の概略を説明する。なお、図３（ａ）は特殊撮影装置１で撮影された半球画像（前側）、図３（ｂ）は特殊撮影装置で撮影された半球画像（後側）、図３（ｃ）は正距円筒図法により表された画像（以下、「正距円筒射影画像」という）を示した図である。図４（ａ）は正距円筒射影画像で球を被う状態を示した概念図、図４（ｂ）は全天球画像を示した図である。 Next, the outline of processing from the image captured by the special imaging device 1 to the creation of the equirectangular projection image EC and the omnidirectional image CE will be described with reference to FIGS. 3 and 4. 3A is a hemispheric image (front side) photographed by the special photographing apparatus 1, FIG. 3B is a hemispheric image photographed by the special photographing apparatus (rear side), and FIG. It is the figure which showed the image (henceforth "an equirectangular projection image") represented by the cylindrical projection. FIG. 4A is a conceptual diagram showing a state where a sphere is covered with an equirectangular projection image, and FIG. 4B is a diagram showing an omnidirectional image.

図３（ａ）に示されているように、撮像素子１０３ａによって得られた画像は、後述の魚眼レンズ１０２ａによって湾曲した半球画像（前側）となる。また、図３（ｂ）に示されているように、撮像素子１０３ｂによって得られた画像は、後述の魚眼レンズ１０２ｂによって湾曲した半球画像（後側）となる。そして、半球画像（前側）と、１８０度反転された半球画像（後側）とは、特殊撮影装置１によって合成され、図３（ｃ）に示されているように、正距円筒射影画像ＥＣが作成される。 As shown in FIG. 3A, the image obtained by the image sensor 103a is a hemispherical image (front side) curved by a fish-eye lens 102a described later. Also, as shown in FIG. 3B, the image obtained by the image sensor 103b is a hemispherical image (rear side) curved by a fish-eye lens 102b described later. Then, the hemispherical image (front side) and the hemispherical image reversed 180 degrees (rear side) are synthesized by the special imaging device 1, and as shown in FIG. 3C, the equirectangular projection image EC. Is created.

そして、OpenGL ES（Open Graphics Library for Embedded Systems）が利用されることで、図４（ａ）に示されているように、正距円筒射影画像が球面を覆うように貼り付けられ、図４（ｂ）に示されているような全天球画像ＣＥが作成される。このように、全天球画像ＣＥは、正距円筒射影画像ＥＣが球の中心を向いた画像として表される。なお、OpenGL ESは、２Ｄ(2-Dimensions)および３Ｄ(3-Dimensions)のデータを視覚化するために使用するグラフィックスライブラリである。なお、全天球画像ＣＥは、静止画であっても動画であってもよい。 Then, by using OpenGL ES (Open Graphics Library for Embedded Systems), as shown in FIG. 4A, the equirectangular projection image is pasted so as to cover the spherical surface, and FIG. An omnidirectional image CE as shown in b) is created. In this way, the omnidirectional image CE is represented as an image in which the equirectangular projection image EC faces the center of the sphere. OpenGL ES is a graphics library used for visualizing 2D (2-Dimensions) and 3D (3-Dimensions) data. Note that the omnidirectional image CE may be a still image or a moving image.

以上のように、全天球画像ＣＥは、球面を覆うように貼り付けられた画像であるため、人間が見ると違和感を持ってしまう。そこで、全天球画像ＣＥの一部の所定領域（以下、「所定領域画像」という）を湾曲の少ない平面画像として表示することで、人間に違和感を与えない表示をすることができる。これに関して、図５及び図６を用いて説明する。 As described above, since the omnidirectional image CE is an image that is pasted so as to cover the spherical surface, it is uncomfortable when viewed by a human. Therefore, by displaying a predetermined area (hereinafter referred to as “predetermined area image”) of a part of the omnidirectional image CE as a flat image with little curvature, a display that does not give a sense of discomfort to humans can be achieved. This will be described with reference to FIGS.

なお、図５は、全天球画像を三次元の立体球とした場合の仮想カメラ及び所定領域の位置を示した図である。仮想カメラＩＣは、三次元の立体球として表示されている全天球画像ＣＥに対して、その画像を見るユーザの視点の位置に相当するものである。また、図６（ａ）は図５の立体斜視図、図６（ｂ）はディスプレイに表示された場合の所定領域画像を表す図である。また、図６（ａ）では、図４に示されている全天球画像ＣＥが、三次元の立体球ＣＳで表わされている。このように生成された全天球画像ＣＥが、立体球ＣＳであるとすると、図５に示されているように、仮想カメラＩＣが全天球画像ＣＥの内部に位置している。全天球画像ＣＥにおける所定領域Ｔは、仮想カメラＩＣの撮影領域であり、全天球画像ＣＥを含む三次元の仮想空間における仮想カメラＩＣの撮影方向と画角を示す所定領域情報によって特定される。 FIG. 5 is a diagram showing the positions of the virtual camera and the predetermined area when the omnidirectional image is a three-dimensional solid sphere. The virtual camera IC corresponds to the position of the viewpoint of the user who views the omnidirectional image CE displayed as a three-dimensional solid sphere. FIG. 6A is a three-dimensional perspective view of FIG. 5, and FIG. 6B is a diagram showing a predetermined area image when displayed on the display. In FIG. 6A, the omnidirectional image CE shown in FIG. 4 is represented by a three-dimensional solid sphere CS. Assuming that the omnidirectional image CE generated in this way is a three-dimensional sphere CS, the virtual camera IC is positioned inside the omnidirectional image CE as shown in FIG. The predetermined area T in the omnidirectional image CE is a shooting area of the virtual camera IC, and is specified by predetermined area information indicating the shooting direction and angle of view of the virtual camera IC in the three-dimensional virtual space including the omnidirectional image CE. The

そして、図６（ａ）に示されている所定領域画像Ｑは、図６（ｂ）に示されているように、所定のディスプレイに、仮想カメラＩＣの撮影領域の画像として表示される。図６（ｂ）に示されている画像は、初期設定（デフォルト）された所定領域情報によって表された所定領域画像である。以下では、仮想カメラＩＣの撮影方向（ea，aa）と画角（α）を用いて説明する。 Then, the predetermined area image Q shown in FIG. 6A is displayed on the predetermined display as an image of the photographing area of the virtual camera IC as shown in FIG. 6B. The image shown in FIG. 6B is a predetermined area image represented by the predetermined (default) predetermined area information. The following description will be made using the shooting direction (ea, aa) and the angle of view (α) of the virtual camera IC.

図７を用いて、所定領域情報と所定領域Ｔの画像の関係について説明する。なお、図７は、所定領域情報と所定領域Ｔの画像の関係との関係を示した図である。図７に示されているように、「ea」はelevation angle、「aa」はazimuth angle、「α」は画角(Angle)を示す。即ち、撮影方向（ea，aa）で示される仮想カメラＩＣの注視点が、仮想カメラＩＣの撮影領域である所定領域Ｔの中心点ＣＰとなるように、仮想カメラＩＣの姿勢を変更することになる。所定領域画像Ｑは、全天球画像ＣＥにおける所定領域Ｔの画像である。ｆは仮想カメラＩＣから中心点ＣＰまでの距離である。Ｌは所定領域Ｔの任意の頂点と中心点ＣＰとの距離である（２Ｌは対角線）。そして、図７では、一般的に以下の（式１）で示される三角関数が成り立つ。 The relationship between the predetermined area information and the image of the predetermined area T will be described with reference to FIG. FIG. 7 is a diagram showing the relationship between the predetermined region information and the relationship between the images of the predetermined region T. As shown in FIG. 7, “ea” represents an elevation angle, “aa” represents an azimuth angle, and “α” represents an angle of view. That is, the attitude of the virtual camera IC is changed so that the gazing point of the virtual camera IC indicated by the shooting direction (ea, aa) becomes the center point CP of the predetermined area T that is the shooting area of the virtual camera IC. Become. The predetermined area image Q is an image of the predetermined area T in the omnidirectional image CE. f is the distance from the virtual camera IC to the center point CP. L is a distance between an arbitrary vertex of the predetermined region T and the center point CP (2L is a diagonal line). In FIG. 7, a trigonometric function represented by the following (formula 1) is generally established.

Ｌ／ｆ＝ｔａｎ（α／２）・・・（式１）
〔第１の実施形態〕
続いて、図８乃至図３４を用いて、本発明の第１の実施形態について説明する。 L / f = tan (α / 2) (Formula 1)
[First Embodiment]
Subsequently, a first embodiment of the present invention will be described with reference to FIGS.

＜＜撮影システムの概略＞＞
まずは、図８を用いて、本実施形態の撮影システムの構成の概略について説明する。図８は、本実施形態の撮影システムの構成の概略図である。 << Summary of shooting system >>
First, an outline of the configuration of the photographing system of the present embodiment will be described with reference to FIG. FIG. 8 is a schematic diagram of the configuration of the photographing system of the present embodiment.

図８に示されているように、本実施形態の撮影システムは、特殊撮影装置１、一般撮影装置３、スマートフォン５、及びアダプタ９によって構成されている。特殊撮影装置１はアダプタ９を介して一般撮影装置３に接続されている。 As shown in FIG. 8, the photographing system of this embodiment includes a special photographing device 1, a general photographing device 3, a smartphone 5, and an adapter 9. The special photographing device 1 is connected to the general photographing device 3 through an adapter 9.

これらのうち、特殊撮影装置１は、上述のように、被写体や風景等を撮影して全天球（パノラマ）画像の元になる２つの半球画像を得るための特殊なデジタルカメラである。 Among these, the special photographing device 1 is a special digital camera for photographing a subject, a landscape, and the like to obtain two hemispherical images that are the basis of the omnidirectional (panoramic) image as described above.

一般撮影装置３は、コンパクトデジタルカメラであるが、デジタル一眼レフカメラであってもよい。 The general photographing device 3 is a compact digital camera, but may be a digital single-lens reflex camera.

スマートフォン５は、Wi-Fi、Bluetooth（登録商標）、NFC（Near Field Communication）等の近距離無線通信技術を利用して、特殊撮影装置１及び一般撮影装置３と無線通信を行なうことができる。また、スマートフォン５では、自装置に設けられた後述のディスプレイ５１７に、特殊撮影装置１及び一般撮影装置３からそれぞれ取得した画像を表示することができる。 The smartphone 5 can perform wireless communication with the special imaging device 1 and the general imaging device 3 using a short-range wireless communication technology such as Wi-Fi, Bluetooth (registered trademark), or NFC (Near Field Communication). Further, the smartphone 5 can display images respectively acquired from the special imaging device 1 and the general imaging device 3 on a display 517 described later provided in the device itself.

なお、スマートフォン５は、近距離無線通信技術を利用せずに、有線ケーブルによって特殊撮影装置１及び一般撮影装置３と通信を行なうようにしてもよい。また、スマートフォン５は、画像処理装置の一例であり、画像処理装置には、タブレット型ＰＣ（Personal Computer：パーソナルコンピュータ）、ノートＰＣ、デスクトップＰＣも含まれる。なお、スマートフォンは、後述の通信端末の一例でもある。 Note that the smartphone 5 may communicate with the special imaging device 1 and the general imaging device 3 through a wired cable without using the short-range wireless communication technology. The smartphone 5 is an example of an image processing apparatus, and the image processing apparatus includes a tablet PC (Personal Computer), a notebook PC, and a desktop PC. Note that the smartphone is also an example of a communication terminal described later.

アダプタ９は、ブラケット９ａ及び回動機構９ｂによって構成されている。ブラケット９ａの先端側には特殊撮影装置１の三脚ねじ穴１５１に取り付けるための三脚ねじ９ｃが取り付けられている。ブラケット９ａの基端側には回動機構９ｂが取り付けられている。回転機構９ｂには、一般撮影装置３が取り付けられており、図８に示されているように、一般撮影装置３を３軸により、Pitech，Yaw，Rollの回動を行うことができる。 The adapter 9 includes a bracket 9a and a rotation mechanism 9b. A tripod screw 9c for attaching to the tripod screw hole 151 of the special photographing apparatus 1 is attached to the distal end side of the bracket 9a. A rotation mechanism 9b is attached to the base end side of the bracket 9a. The general photographing device 3 is attached to the rotation mechanism 9b, and as shown in FIG. 8, the general photographing device 3 can be rotated by Pitech, Yaw, and Roll about three axes.

図９は、撮影システムの使用イメージ図である。図９に示されているように、利用者は、三脚２の上に特殊撮影装置１及び一般撮影装置３が取り付けられたアダプタ９を取り付ける。そして、利用者は、スマートフォン５を操作して、遠隔的に、回転機構９ｂを駆動させたり、特殊撮影装置１及び一般撮影装置３の撮影の開始及び終了を行ったりすることができる。なお、三脚２を用いずに、設置台等を用いてもよい。 FIG. 9 is a usage image diagram of the photographing system. As shown in FIG. 9, the user attaches the adapter 9 to which the special photographing device 1 and the general photographing device 3 are attached on the tripod 2. Then, the user can operate the smartphone 5 to remotely drive the rotation mechanism 9b or start and end shooting of the special imaging device 1 and the general imaging device 3. Note that an installation table or the like may be used without using the tripod 2.

＜＜実施形態のハードウェア構成＞＞
次に、図１０及び図１２を用いて、本実施形態の特殊撮影装置１、一般撮影装置３及びスマートフォン５のハードウェア構成を詳細に説明する。 << Hardware Configuration of Embodiment >>
Next, the hardware configuration of the special imaging device 1, the general imaging device 3, and the smartphone 5 according to the present embodiment will be described in detail with reference to FIGS.

＜特殊撮影装置のハードウェア構成＞
まず、図１０を用いて、特殊撮影装置１のハードウェア構成を説明する。図１０は、特殊撮影装置１のハードウェア構成図である。以下では、特殊撮影装置１は、２つの撮像素子を使用した全天球（全方位）特殊撮影装置とするが、撮像素子は２つ以上いくつでもよい。また、必ずしも全方位撮影専用の装置である必要はなく、通常のデジタルカメラやスマートフォン等に後付けの全方位の撮像ユニットを取り付けることで、実質的に特殊撮影装置１と同じ機能を有するようにしてもよい。 <Hardware configuration of special imaging device>
First, the hardware configuration of the special photographing apparatus 1 will be described with reference to FIG. FIG. 10 is a hardware configuration diagram of the special imaging device 1. In the following, the special imaging device 1 is an omnidirectional (omnidirectional) special imaging device using two image sensors, but any number of two or more image sensors may be used. In addition, it is not always necessary to use a dedicated device for omnidirectional photography. By attaching a retrofit omnidirectional imaging unit to a normal digital camera or smartphone, it has substantially the same function as the special photography device 1. Also good.

図１０に示されているように、特殊撮影装置１は、撮像ユニット１０１、画像処理ユニット１０４、撮像制御ユニット１０５、マイク１０８、音処理ユニット１０９、ＣＰＵ(Central Processing Unit)１１１、ＲＯＭ(Read Only Memory)１１２、ＳＲＡＭ(Static Random Access Memory)１１３、ＤＲＡＭ(Dynamic Random Access Memory)１１４、操作部１１５、ネットワークＩ／Ｆ１１６、通信部１１７、アンテナ１１７ａ、電子コンパス１１８、ジャイロセンサ１１９、加速度センサ１２０、及び端子１２１によって構成されている。 As shown in FIG. 10, the special imaging apparatus 1 includes an imaging unit 101, an image processing unit 104, an imaging control unit 105, a microphone 108, a sound processing unit 109, a CPU (Central Processing Unit) 111, and a ROM (Read Only). Memory) 112, SRAM (Static Random Access Memory) 113, DRAM (Dynamic Random Access Memory) 114, operation unit 115, network I / F 116, communication unit 117, antenna 117a, electronic compass 118, gyro sensor 119, acceleration sensor 120, And a terminal 121.

このうち、撮像ユニット１０１は、各々半球画像を結像するための１８０°以上の画角を有する広角レンズ（いわゆる魚眼レンズ）１０２ａ，１０２ｂと、各広角レンズに対応させて設けられている２つの撮像素子１０３ａ，１０３ｂを備えている。撮像素子１０３ａ，１０３ｂは、魚眼レンズ１０２ａ，１０２ｂによる光学像を電気信号の画像データに変換して出力するＣＭＯＳ(Complementary Metal Oxide Semiconductor)センサやＣＣＤ(Charge Coupled Device)センサなどの画像センサ、この画像センサの水平又は垂直同期信号や画素クロックなどを生成するタイミング生成回路、この撮像素子の動作に必要な種々のコマンドやパラメータなどが設定されるレジスタ群などを有している。 Among these, the imaging unit 101 includes wide-angle lenses (so-called fish-eye lenses) 102a and 102b each having an angle of view of 180 ° or more for forming a hemispherical image, and two imaging units provided corresponding to the wide-angle lenses. Elements 103a and 103b are provided. The image sensors 103a and 103b are image sensors such as a CMOS (Complementary Metal Oxide Semiconductor) sensor and a CCD (Charge Coupled Device) sensor that convert an optical image obtained by the fisheye lenses 102a and 102b into image data of an electric signal and output the image data. A timing generation circuit for generating a horizontal or vertical synchronization signal, a pixel clock, and the like, and a register group in which various commands and parameters necessary for the operation of the image sensor are set.

撮像ユニット１０１の撮像素子１０３ａ，１０３ｂは、各々、画像処理ユニット１０４とパラレルＩ／Ｆバスで接続されている。一方、撮像ユニット１０１の撮像素子１０３ａ，１０３ｂは、撮像制御ユニット１０５とは、シリアルＩ／Ｆバス（Ｉ２Ｃバス等）で接続されている。画像処理ユニット１０４、撮像制御ユニット１０５及び音処理ユニット１０９は、バス１１０を介してＣＰＵ１１１と接続される。さらに、バス１１０には、ＲＯＭ１１２、ＳＲＡＭ１１３、ＤＲＡＭ１１４、操作部１１５、ネットワークＩ／Ｆ１１６、通信部１１７、及び電子コンパス１１８なども接続される。 The imaging elements 103a and 103b of the imaging unit 101 are each connected to the image processing unit 104 via a parallel I / F bus. On the other hand, the imaging elements 103a and 103b of the imaging unit 101 are connected to the imaging control unit 105 through a serial I / F bus (I2C bus or the like). The image processing unit 104, the imaging control unit 105, and the sound processing unit 109 are connected to the CPU 111 via the bus 110. Further, ROM 112, SRAM 113, DRAM 114, operation unit 115, network I / F 116, communication unit 117, and electronic compass 118 are connected to the bus 110.

画像処理ユニット１０４は、撮像素子１０３ａ，１０３ｂから出力される画像データをパラレルＩ／Ｆバスを通して取り込み、それぞれの画像データに対して所定の処理を施した後、これらの画像データを合成処理して、図３（ｃ）に示されているような正距円筒射影画像のデータを作成する。 The image processing unit 104 takes in the image data output from the image sensors 103a and 103b through the parallel I / F bus, performs predetermined processing on the respective image data, and then combines these image data. The data of the equirectangular projection image as shown in FIG. 3C is created.

撮像制御ユニット１０５は、一般に撮像制御ユニット１０５をマスタデバイス、撮像素子１０３ａ，１０３ｂをスレーブデバイスとして、Ｉ２Ｃバスを利用して、撮像素子１０３ａ，１０３ｂのレジスタ群にコマンド等を設定する。必要なコマンド等は、ＣＰＵ１１１から受け取る。また、撮像制御ユニット１０５は、同じくＩ２Ｃバスを利用して、撮像素子１０３ａ，１０３ｂのレジスタ群のステータスデータ等を取り込み、ＣＰＵ１１１に送る。 In general, the imaging control unit 105 sets a command or the like in a register group of the imaging elements 103a and 103b using the I2C bus with the imaging control unit 105 as a master device and the imaging elements 103a and 103b as slave devices. Necessary commands and the like are received from the CPU 111. The imaging control unit 105 also uses the I2C bus to capture status data and the like of the register groups of the imaging elements 103a and 103b and send them to the CPU 111.

また、撮像制御ユニット１０５は、操作部１１５のシャッターボタンが押下されたタイミングで、撮像素子１０３ａ，１０３ｂに画像データの出力を指示する。特殊撮影装置１によっては、ディスプレイ（例えば、スマートフォン５のディスプレイ５１７）によるプレビュー表示機能や動画表示に対応する機能を持つ場合もある。この場合は、撮像素子１０３ａ，１０３ｂからの画像データの出力は、所定のフレームレート（フレーム／分）によって連続して行われる。 The imaging control unit 105 instructs the imaging elements 103a and 103b to output image data at the timing when the shutter button of the operation unit 115 is pressed. Some special imaging devices 1 may have a preview display function or a function corresponding to a moving image display by a display (for example, the display 517 of the smartphone 5). In this case, output of image data from the image sensors 103a and 103b is continuously performed at a predetermined frame rate (frame / min).

また、撮像制御ユニット１０５は、後述するように、ＣＰＵ１１１と協働して撮像素子１０３ａ，１０３ｂの画像データの出力タイミングの同期をとる同期制御手段としても機能する。なお、本実施形態では、特殊撮影装置１にはディスプレイが設けられていないが、表示部を設けてもよい。 Further, as will be described later, the imaging control unit 105 also functions as a synchronization control unit that synchronizes the output timing of image data of the imaging elements 103a and 103b in cooperation with the CPU 111. In the present embodiment, the special imaging apparatus 1 is not provided with a display, but a display unit may be provided.

マイク１０８は、音を音（信号）データに変換する。音処理ユニット１０９は、マイク１０８から出力される音データをＩ／Ｆバスを通して取り込み、音データに対して所定の処理を施す。 The microphone 108 converts sound into sound (signal) data. The sound processing unit 109 takes in the sound data output from the microphone 108 through the I / F bus and performs predetermined processing on the sound data.

ＣＰＵ１１１は、特殊撮影装置１の全体の動作を制御すると共に必要な処理を実行する。ＲＯＭ１１２は、ＣＰＵ１１１のための種々のプログラムを記憶している。ＳＲＡＭ１１３及びＤＲＡＭ１１４はワークメモリであり、ＣＰＵ１１１で実行するプログラムや処理途中のデータ等を記憶する。特にＤＲＡＭ１１４は、画像処理ユニット１０４での処理途中の画像データや処理済みの正距円筒射影画像のデータを記憶する。 The CPU 111 controls the overall operation of the special photographing apparatus 1 and executes necessary processes. The ROM 112 stores various programs for the CPU 111. The SRAM 113 and the DRAM 114 are work memories, and store programs executed by the CPU 111, data being processed, and the like. In particular, the DRAM 114 stores image data being processed by the image processing unit 104 and processed equirectangular projection image data.

操作部１１５は、シャッターボタン１１５ａなどの操作ボタンの総称である。ユーザは操作部１１５を操作することで、種々の撮影モードや撮影条件などを入力する。 The operation unit 115 is a general term for operation buttons such as the shutter button 115a. The user operates the operation unit 115 to input various shooting modes and shooting conditions.

ネットワークＩ／Ｆ１１６は、ＳＤカード等の外付けのメディアやパーソナルコンピュータなどとのインターフェース回路（ＵＳＢＩ／Ｆ等）の総称である。また、ネットワークＩ／Ｆ１１６としては、無線、有線を問わない。ＤＲＡＭ１１４に記憶された正距円筒射影画像のデータは、このネットワークＩ／Ｆ１１６を介して外付けのメディアに記録されたり、必要に応じてネットワークＩ／Ｆ１１６を介してスマートフォン５等の外部端末（装置）に送信されたりする。 The network I / F 116 is a general term for an interface circuit (USB I / F or the like) with an external medium such as an SD card or a personal computer. The network I / F 116 may be wireless or wired. Data of the equirectangular projection image stored in the DRAM 114 is recorded on an external medium via the network I / F 116, or an external terminal (device) such as the smartphone 5 via the network I / F 116 as necessary. ).

通信部１１７は、特殊撮影装置１に設けられたアンテナ１１７ａを介して、Wi-Fi、NFC、Bluetooth等の近距離無線通信技術によって、スマートフォン５等の外部端末（装置）と通信を行う。この通信部１１７によっても、正距円筒射影画像のデータをスマートフォン５等の外部端末（装置）に送信することができる。 The communication unit 117 communicates with an external terminal (device) such as the smartphone 5 through the antenna 117a provided in the special imaging device 1 by a short-range wireless communication technology such as Wi-Fi, NFC, or Bluetooth. The communication unit 117 can also transmit the equirectangular projection image data to an external terminal (device) such as the smartphone 5.

電子コンパス１１８は、地球の磁気から特殊撮影装置１の方位を算出し、方位情報を出力する。この方位情報はExifに沿った関連情報（メタデータ）の一例であり、撮影画像の画像補正等の画像処理に利用される。なお、関連情報には、画像の撮影日時、及び画像データのデータ容量の各データも含まれている。 The electronic compass 118 calculates the azimuth of the special imaging device 1 from the earth's magnetism and outputs azimuth information. This orientation information is an example of related information (metadata) along Exif, and is used for image processing such as image correction of a captured image. Note that the related information includes each data of the image capturing date and time and the data capacity of the image data.

ジャイロセンサ１１９は、全天球カメラ２０の移動に伴う角度の変化（Roll角、Pitch角、Yaw角）を検出するセンサである。角度の変化はExifに沿った関連情報（メタデータ）の一例であり、撮像画像の画像補正等の画像処理に利用される。 The gyro sensor 119 is a sensor that detects a change in angle (Roll angle, Pitch angle, Yaw angle) accompanying the movement of the omnidirectional camera 20. The change in angle is an example of related information (metadata) along Exif, and is used for image processing such as image correction of a captured image.

加速度センサ１２０は、３軸方向の加速度を検出するセンサである。特殊撮影装置３ａは、加速度センサ１２０が検出した加速度に基づいて、自装置（特殊撮影装置３ａ）の姿勢（重力方向に対する角度）を算出する。特殊撮影装置３ａに、ジャイロセンサ１１９と加速度センサ１２０の両方が設けられることによって、画像補正の精度が向上する。 The acceleration sensor 120 is a sensor that detects acceleration in three axial directions. The special imaging device 3a calculates the attitude (angle with respect to the direction of gravity) of the own device (special imaging device 3a) based on the acceleration detected by the acceleration sensor 120. By providing both the gyro sensor 119 and the acceleration sensor 120 in the special imaging device 3a, the accuracy of image correction is improved.

端子１２１は、Micro USB用の凹状の端子である。 The terminal 121 is a concave terminal for Micro USB.

＜一般撮影装置のハードウェア構成＞
次に、図１１を用いて、一般撮影装置のハードウェアについて説明する。図１１は、一般撮影装置３のハードウェア構成図である。図１１に示されているように、一般撮影装置３は、撮像ユニット３０１、画像処理ユニット３０４、撮像制御ユニット３０５、マイク３０８、音処理ユニット３０９、バス３１０、ＣＰＵ３１１、ＲＯＭ３１２、ＳＲＡＭ３１３、ＤＲＡＭ３１４、操作部３１５、ネットワークＩ／Ｆ３１６、通信部３１７、アンテナ３１７ａ、電子コンパス３１８、及びディスプレイ３１９によって構成されている。画像処理ユニット３０４及び撮像制御ユニット３０５は、バス３１０を介してＣＰＵ３１１と接続される。 <Hardware configuration of general imaging device>
Next, the hardware of the general photographing apparatus will be described with reference to FIG. FIG. 11 is a hardware configuration diagram of the general photographing apparatus 3. As shown in FIG. 11, the general imaging device 3 includes an imaging unit 301, an image processing unit 304, an imaging control unit 305, a microphone 308, a sound processing unit 309, a bus 310, a CPU 311, a ROM 312, an SRAM 313, a DRAM 314, an operation The unit 315 includes a network I / F 316, a communication unit 317, an antenna 317 a, an electronic compass 318, and a display 319. The image processing unit 304 and the imaging control unit 305 are connected to the CPU 311 via the bus 310.

各構成３０４、３１０、３１１、３１２、３１３、３１４、３１５、３１６、３１７、３１７ａ、３１８は、それぞれ、図１０の特殊撮影装置１における各構成１０４、１１０、１１１、１１２、１１３、１１４、１１５、１１６、１１７、１１７ａ、１１８と同様の構成であるため、その説明を省略する。 Configurations 304, 310, 311, 312, 313, 314, 315, 316, 317, 317a, and 318 are the configurations 104, 110, 111, 112, 113, 114, and 115 in the special imaging apparatus 1 of FIG. , 116, 117, 117a, and 118, the description thereof is omitted.

更に、一般撮影装置３の撮像ユニット３０１は、図１１に示されているように、撮像素子３０３の前面にレンズユニット３０６、及びメカニカルシャッタ３０７が外部から撮像素子３０３の方向に順に設けられている。 Further, as shown in FIG. 11, the imaging unit 301 of the general imaging apparatus 3 is provided with a lens unit 306 and a mechanical shutter 307 in order from the outside in the direction of the imaging element 303 on the front surface of the imaging element 303. .

撮像制御ユニット３０５は、基本的に撮像制御ユニット１０５と同様の構成及び処理を行なうが、更に、操作部３１５によって受け付けられた利用者の操作に基づいて、レンズユニット３０６、及びメカニカルシャッタ３０７の駆動を制御する。 The imaging control unit 305 basically performs the same configuration and processing as the imaging control unit 105, but further drives the lens unit 306 and the mechanical shutter 307 based on a user operation received by the operation unit 315. To control.

また、ディスプレイ３１９は、操作メニュー、撮影中又は撮影後の画像を表示させる表示手段の一例である。 The display 319 is an example of a display unit that displays an operation menu and an image during or after shooting.

＜スマートフォンのハードウェア構成＞
次に、図１２を用いて、スマートフォンのハードウェアについて説明する。図１２は、スマートフォンのハードウェア構成図である。図１２に示されているように、スマートフォン５は、ＣＰＵ５０１、ＲＯＭ５０２、ＲＡＭ５０３、ＥＥＰＲＯＭ５０４、ＣＭＯＳセンサ５０５、撮像素子Ｉ／Ｆ５１３ａ、加速度・方位センサ５０６、メディアＩ／Ｆ５０８、ＧＰＳ受信部５０９を備えている。 <Smartphone hardware configuration>
Next, the hardware of the smartphone will be described with reference to FIG. FIG. 12 is a hardware configuration diagram of the smartphone. As shown in FIG. 12, the smartphone 5 includes a CPU 501, a ROM 502, a RAM 503, an EEPROM 504, a CMOS sensor 505, an image sensor I / F 513a, an acceleration / direction sensor 506, a media I / F 508, and a GPS receiver 509. Yes.

これらのうち、ＣＰＵ５０１は、スマートフォン５全体の動作を制御する。ＲＯＭ５０２は、ＣＰＵ５０１やＩＰＬ(Initial Program Loader)等のＣＰＵ５０１の駆動に用いられるプログラムを記憶する。ＲＡＭ５０３は、ＣＰＵ５０１のワークエリアとして使用される。ＥＥＰＲＯＭ５０４は、ＣＰＵ５０１の制御にしたがって、スマートフォン用プログラム等の各種データの読み出し又は書き込みを行う。ＣＭＯＳセンサ５０５は、ＣＰＵ５０１の制御に従って被写体（主に自画像）を撮像し画像データを得る。撮像素子Ｉ／Ｆ５１３ａは、ＣＭＯＳセンサ５１２の駆動を制御する回路である。加速度・方位センサ５０６は、地磁気を検知する電子磁気コンパスやジャイロコンパス、加速度センサ等の各種センサである。メディアＩ／Ｆ５０８は、フラッシュメモリ等の記録メディア５０７に対するデータの読み出し又は書き込み（記憶）を制御する。ＧＰＳ受信部５０９は、ＧＰＳ衛星からＧＰＳ信号を受信する。 Among these, the CPU 501 controls the operation of the entire smartphone 5. The ROM 502 stores programs used for driving the CPU 501 such as the CPU 501 and IPL (Initial Program Loader). The RAM 503 is used as a work area for the CPU 501. The EEPROM 504 reads or writes various data such as a smartphone program under the control of the CPU 501. The CMOS sensor 505 captures a subject (mainly a self-portrait) under the control of the CPU 501 and obtains image data. The image sensor I / F 513 a is a circuit that controls driving of the CMOS sensor 512. The acceleration / direction sensor 506 is various sensors such as an electronic magnetic compass, a gyrocompass, and an acceleration sensor that detect geomagnetism. A media I / F 508 controls reading or writing (storage) of data with respect to a recording medium 507 such as a flash memory. The GPS receiver 509 receives GPS signals from GPS satellites.

また、スマートフォン５は、遠距離通信回路５１１、アンテナ５１１ａ、ＣＭＯＳセンサ５１２、撮像素子Ｉ／Ｆ５１３ｂ、マイク５１４、スピーカ５１５、音入出力Ｉ／Ｆ５１６、ディスプレイ５１７、外部機器接続Ｉ／Ｆ５１８、近距離通信回路５１９、近距離通信回路５１９のアンテナ５１９ａ、及びタッチパネル５２１を備えている。 The smartphone 5 includes a long-distance communication circuit 511, an antenna 511a, a CMOS sensor 512, an image sensor I / F 513b, a microphone 514, a speaker 515, a sound input / output I / F 516, a display 517, an external device connection I / F 518, and a short distance. A communication circuit 519, an antenna 519a of the short-range communication circuit 519, and a touch panel 521 are provided.

これらのうち、遠距離通信回路５１１は、後述の通信ネットワーク１００を介して、他の機器と通信する回路である。ＣＭＯＳセンサ５１２は、ＣＰＵ５０１の制御に従って被写体を撮像して画像データを得る内蔵型の撮像手段の一種である。撮像素子Ｉ／Ｆ５１３ｂは、ＣＭＯＳセンサ５１２の駆動を制御する回路である。マイク５１４は、音声を入力する内蔵型の集音手段の一種である。音入出力Ｉ／Ｆ５１６は、ＣＰＵ５０１の制御に従ってマイク５１４及びスピーカ５１５との間で音信号の入出力を処理する回路である。ディスプレイ５１７は、被写体の画像や各種アイコン等を表示する液晶や有機ＥＬなどの表示手段の一種である。外部機器接続Ｉ／Ｆ５１８は、各種の外部機器を接続するためのインターフェースである。近距離通信回路５１９は、Wi-Fi、NFC、Bluetooth等の通信回路である。タッチパネル５２１は、利用者がディスプレイ５１７を押下することで、スマートフォン５を操作する入力手段の一種である。 Among these, the long-distance communication circuit 511 is a circuit that communicates with other devices via a communication network 100 described later. The CMOS sensor 512 is a kind of built-in imaging unit that captures an image of a subject under the control of the CPU 501 and obtains image data. The image sensor I / F 513b is a circuit that controls driving of the CMOS sensor 512. The microphone 514 is a kind of built-in sound collecting means for inputting sound. The sound input / output I / F 516 is a circuit that processes input / output of a sound signal between the microphone 514 and the speaker 515 under the control of the CPU 501. The display 517 is a kind of display means such as a liquid crystal display or an organic EL display that displays a subject image, various icons, and the like. The external device connection I / F 518 is an interface for connecting various external devices. The short-range communication circuit 519 is a communication circuit such as Wi-Fi, NFC, or Bluetooth. The touch panel 521 is a kind of input means for operating the smartphone 5 when the user presses the display 517.

また、スマートフォン５は、バスライン５１０を備えている。バスライン５１０は、ＣＰＵ５０１等の各構成要素を電気的に接続するためのアドレスバスやデータバス等である。 In addition, the smartphone 5 includes a bus line 510. The bus line 510 is an address bus, a data bus, or the like for electrically connecting each component such as the CPU 501.

＜＜実施形態の機能構成＞＞
次に、図１０乃至図１３を用いて、本実施形態の機能構成について説明する。図１３は、本実施形態の撮影システムの一部を構成する、特殊撮影装置１、一般撮影装置３、及びスマートフォン５の各機能ブロック図である。 << Functional Configuration of Embodiment >>
Next, the functional configuration of the present embodiment will be described with reference to FIGS. 10 to 13. FIG. 13 is a functional block diagram of the special imaging device 1, the general imaging device 3, and the smartphone 5 that constitute a part of the imaging system of the present embodiment.

＜特殊撮影装置の機能構成＞
まず、図１０及び図１３を用いて、特殊撮影装置１の機能構成について詳細に説明する。図１３に示されているように、特殊撮影装置１は、受付部１２、撮像部１３、集音部１４、画像・音処理部１５、判断部１７、近距離通信部１８、及び記憶・読出部１９を有している。これら各部は、図１０に示されている各構成要素のいずれかが、ＳＲＡＭ１１３からＤＲＡＭ１１４上に展開された特殊撮影装置用のプログラムに従ったＣＰＵ１１１からの命令によって動作することで実現される機能又は手段である。 <Functional configuration of special imaging device>
First, the functional configuration of the special imaging device 1 will be described in detail with reference to FIGS. 10 and 13. As illustrated in FIG. 13, the special imaging device 1 includes a reception unit 12, an imaging unit 13, a sound collection unit 14, an image / sound processing unit 15, a determination unit 17, a short-range communication unit 18, and a storage / readout. It has a part 19. Each of these units is a function realized by any one of the constituent elements shown in FIG. 10 being operated by a command from the CPU 111 according to a program for the special photographing apparatus developed from the SRAM 113 onto the DRAM 114, or Means.

また、特殊撮影装置１は、図１０に示されているＲＯＭ１１２、ＳＲＡＭ１１３、及びＤＲＡＭ１１４によって構築される記憶部１０００を有している。 The special photographing apparatus 1 has a storage unit 1000 constructed by the ROM 112, the SRAM 113, and the DRAM 114 shown in FIG.

（特殊撮影装置の各機能構成）
次に、図１０及び図１３を用いて、特殊撮影装置１の各機能構成について更に詳細に説明する。 (Functional configuration of special imaging equipment)
Next, each functional configuration of the special imaging device 1 will be described in more detail with reference to FIGS. 10 and 13.

特殊撮影装置１の受付部１２は、主に、図１０に示されている操作部１１５及びＣＰＵ１１１の処理によって実現され、利用者からの操作入力を受け付ける。 The reception unit 12 of the special imaging device 1 is mainly realized by the processing of the operation unit 115 and the CPU 111 illustrated in FIG. 10 and receives operation input from the user.

撮像部１３は、主に、図１０に示されている撮像ユニット１０１、画像処理ユニット１０４、及び撮像制御ユニット１０５、並びにＣＰＵ１１１の処理によって実現され、被写体や風景等を撮像し、撮影画像データを得る。この撮影画像データは、図３（ａ），（ｂ）に示されているように、全天球画像データの元になる２つの半球画像データである。 The imaging unit 13 is realized mainly by the processing of the imaging unit 101, the image processing unit 104, the imaging control unit 105, and the CPU 111 shown in FIG. obtain. As shown in FIGS. 3A and 3B, the captured image data is two hemispherical image data that is the basis of the omnidirectional image data.

集音部１４は、図１０に示されているマイク１０８及び音処理ユニット１０９、並びにＣＰＵ１１１の処理によって実現され、特殊撮影装置１の周囲の音を集音する。 The sound collecting unit 14 is realized by the processing of the microphone 108 and the sound processing unit 109 and the CPU 111 shown in FIG. 10 and collects sounds around the special photographing apparatus 1.

画像・音処理部１５は、主にＣＰＵ１１１からの命令によって実現され、撮像部１３によって得られた撮影画像データ、又は集音部１４によって得られた音データに対して、各種処理を行なう。例えば、画像・音処理部１５は、２つの撮像素子１０３ａ，１０３ｂのそれぞれによって得られた２つの半球画像データ（図３（ａ），（ｂ）参照）に基づいて、正距円筒射影画像データ（図３（ｃ）参照）を作成する。 The image / sound processing unit 15 is realized mainly by a command from the CPU 111 and performs various processes on the captured image data obtained by the imaging unit 13 or the sound data obtained by the sound collecting unit 14. For example, the image / sound processing unit 15 performs equirectangular projection image data based on two hemispherical image data (see FIGS. 3A and 3B) obtained by the two image sensors 103a and 103b, respectively. (See FIG. 3C).

判断部１７は、ＣＰＵ１１１の処理によって実現され、各種判断を行なう。 The determination unit 17 is realized by the processing of the CPU 111 and makes various determinations.

近距離通信部１８は、主に、ＣＰＵ１１１の処理、並びに通信部１１７及びアンテナ１１７ａによって実現され、スマートフォン５の近距離通信部５８等と、Wi-Fi等による近距離無線通信技術によって通信することができる。 The short-range communication unit 18 is mainly realized by the processing of the CPU 111, the communication unit 117, and the antenna 117a, and communicates with the short-range communication unit 58 of the smartphone 5 by a short-range wireless communication technique such as Wi-Fi. Can do.

記憶・読出部１９は、主に、図１０に示されているＣＰＵ１１１の処理によって実現され、記憶部１０００に各種データ（または情報）を記憶したり、記憶部１０００から各種データ（または情報）を読み出したりする。 The storage / reading unit 19 is realized mainly by the processing of the CPU 111 shown in FIG. 10, and stores various data (or information) in the storage unit 1000 or stores various data (or information) from the storage unit 1000. Read out.

＜一般撮影装置の機能構成＞
続いて、図１１及び図１３を用いて、一般撮影装置３の機能構成について詳細に説明する。図１３に示されているように、一般撮影装置３は、受付部３２、撮像部３３、集音部３４、画像・音処理部３５、表示制御部３６、判断部３７、近距離通信部３８、及び記憶・読出部３９を有している。これら各部は、図１１に示されている各構成要素のいずれかが、ＳＲＡＭ３１３からＤＲＡＭ３１４上に展開された特殊撮影装置用のプログラムに従ったＣＰＵ３１１からの命令によって動作することで実現される機能又は手段である。 <Functional configuration of general imaging device>
Next, the functional configuration of the general imaging device 3 will be described in detail with reference to FIGS. 11 and 13. As illustrated in FIG. 13, the general imaging device 3 includes a reception unit 32, an imaging unit 33, a sound collection unit 34, an image / sound processing unit 35, a display control unit 36, a determination unit 37, and a short-range communication unit 38. And a storage / reading unit 39. Each of these units is a function realized by any one of the constituent elements shown in FIG. 11 being operated by a command from the CPU 311 according to a program for the special photographing apparatus developed from the SRAM 313 to the DRAM 314, or Means.

また、一般撮影装置３は、図１１に示されているＲＯＭ３１２、ＳＲＡＭ３１３、及びＤＲＡＭ３１４によって構築される記憶部３０００を有している。 The general photographing apparatus 3 includes a storage unit 3000 configured by the ROM 312, the SRAM 313, and the DRAM 314 shown in FIG. 11.

（一般撮影装置の各機能構成）
一般撮影装置３の受付部３２は、主に、図１１に示されている操作部３１５及びＣＰＵ３１１の処理によって実現され、利用者からの操作入力を受け付ける。 (Functional configuration of general imaging equipment)
The reception unit 32 of the general photographing apparatus 3 is mainly realized by the processing of the operation unit 315 and the CPU 311 illustrated in FIG. 11 and receives operation input from the user.

撮像部３３は、主に、図１１に示されている撮像ユニット３０１、画像処理ユニット３０４、及び撮像制御ユニット３０５、並びにＣＰＵ３１１の処理によって実現され、被写体や風景等を撮像し、撮影画像データを得る。この撮影画像データは、透視射影方式で撮影された平面画像データである。 The imaging unit 33 is realized mainly by the processing of the imaging unit 301, the image processing unit 304, the imaging control unit 305, and the CPU 311 shown in FIG. obtain. This photographed image data is planar image data photographed by the perspective projection method.

集音部３４は、図１１に示されているマイク３０８及び音処理ユニット３０９、並びにＣＰＵ３１１の処理によって実現され、一般撮影装置３の周囲の音を集音する。 The sound collection unit 34 is realized by the processing of the microphone 308 and the sound processing unit 309 and the CPU 311 illustrated in FIG. 11 and collects sounds around the general photographing apparatus 3.

画像・音処理部３５は、主にＣＰＵ３１１からの命令によって実現され、撮像部３３によって得られた撮影画像データ、又は集音部３４によって得られた音データに対して、各種処理を行なう。 The image / sound processing unit 35 is realized mainly by a command from the CPU 311, and performs various processes on the captured image data obtained by the imaging unit 33 or the sound data obtained by the sound collection unit 34.

表示制御部３６は、図１１に示されているＣＰＵ３１１の処理によって実現され、ディスプレイ３１９に、撮影中又は撮影後の撮影画像データに係る平面画像Ｐを表示させる。 The display control unit 36 is realized by the processing of the CPU 311 illustrated in FIG. 11, and causes the display 319 to display the planar image P related to the captured image data during or after capturing.

判断部３７は、ＣＰＵ３１１の処理によって実現され、各種判断を行なう。例えば、判断部３７は、利用者によって、シャッターボタン３１５ａが押下されたかを判断する。 The determination unit 37 is realized by the processing of the CPU 311 and performs various determinations. For example, the determination unit 37 determines whether the user has pressed the shutter button 315a.

近距離通信部３８は、主に、ＣＰＵ３１１、並びに通信部３１７及びアンテナ３１７ａによって実現され、スマートフォン５の近距離通信部５８等と、Wi-Fi等による近距離無線通信技術によって通信することができる。 The short-range communication unit 38 is mainly realized by the CPU 311, the communication unit 317, and the antenna 317 a, and can communicate with the short-range communication unit 58 of the smartphone 5 by a short-range wireless communication technique such as Wi-Fi. .

記憶・読出部３９は、主に、図１１に示されているＣＰＵ３１１の処理によって実現され、記憶部３０００に各種データ（または情報）を記憶したり、記憶部３０００から各種データ（または情報）を読み出したりする。 The storage / reading unit 39 is realized mainly by the processing of the CPU 311 shown in FIG. 11, and stores various data (or information) in the storage unit 3000 or stores various data (or information) from the storage unit 3000. Read out.

＜スマートフォンの機能構成＞
次に、図１２乃至図１６を用いて、スマートフォン５の機能構成について詳細に説明する。図１３に示されているように、スマートフォン５は、遠距離通信部５１、受付部５２、撮像部５３、集音部５４、画像・音処理部５５、表示制御部５６、判断部５７、近距離通信部５８、及び記憶・読出部５９を有している。これら各部は、図１２に示されている各構成要素のいずれかが、ＥＥＰＲＯＭ５０４からＲＡＭ５０３上に展開されたスマートフォン５用プログラムに従ったＣＰＵ５０１からの命令によって動作することで実現される機能又は手段である。 <Functional configuration of smartphone>
Next, the functional configuration of the smartphone 5 will be described in detail with reference to FIGS. As illustrated in FIG. 13, the smartphone 5 includes a long-distance communication unit 51, a reception unit 52, an imaging unit 53, a sound collection unit 54, an image / sound processing unit 55, a display control unit 56, a determination unit 57, A distance communication unit 58 and a storage / reading unit 59 are provided. Each of these units is a function or means realized by any of the constituent elements shown in FIG. 12 being operated by a command from the CPU 501 according to the smartphone 5 program expanded from the EEPROM 504 to the RAM 503. is there.

また、スマートフォン５は、図１２に示されているＲＯＭ５０２、ＲＡＭ５０３、及びＥＥＰＲＯＭ５０４によって構築される記憶部５０００を有している。この記憶部５０００には、連携撮影装置管理ＤＢ５００１が構築されている。この連携撮影装置管理ＤＢ５００１は、図１４（ａ）連携撮影装置管理テーブルによって構成されている。図１４（ａ）は連携撮影装置管理テーブルの概念図である。 The smartphone 5 includes a storage unit 5000 configured by the ROM 502, the RAM 503, and the EEPROM 504 illustrated in FIG. In this storage unit 5000, a cooperative photographing apparatus management DB 5001 is constructed. This cooperative photographing apparatus management DB 5001 is configured by a cooperative photographing apparatus management table in FIG. FIG. 14A is a conceptual diagram of the cooperative photographing apparatus management table.

（連携撮影装置管理テーブル）
次に、図１４（ａ）を用いて、連携撮影装置管理テーブルについて説明する。図１４（ａ）に示されているように、撮影装置毎に、各撮影装置の連携関係を示す関連関係情報、撮影装置のＩＰアドレス、及び撮影装置の装置名が関連付けて管理されている。このうち、関連関係情報は、自装置のシャッターが押下されることで撮影を開始する一の撮影装置を「メイン」とし、「メイン」の撮影装置でシャッターが押下されることに応じて撮影を開始する他の撮影装置を「サブ」として示している。なお、ＩＰアドレスは、Wi-Fiによる通信の場合であって、ＵＳＢの有線ケーブルを用いた通信の場合には製造者ＩＤ(Identification)及び製品ＩＤに代わり、Bluetoothを用いた無線通信の場合には、ＢＤ(Bluetooth Device Address）に代わる。 (Cooperative shooting device management table)
Next, the cooperative photographing apparatus management table will be described with reference to FIG. As shown in FIG. 14A, for each image capturing device, related relationship information indicating the cooperation relationship of each image capturing device, the IP address of the image capturing device, and the device name of the image capturing device are managed in association with each other. Of these, the related relationship information indicates that one imaging device that starts imaging when the shutter of its own device is pressed is “main”, and shooting is performed in response to the shutter being pressed by the “main” imaging device. Another imaging device to start is indicated as “sub”. Note that the IP address is for Wi-Fi communication, and in the case of wireless communication using Bluetooth instead of the manufacturer ID (Identification) and product ID in the case of communication using a USB wired cable. Replaces BD (Bluetooth Device Address).

（スマートフォンの各機能構成）
スマートフォン５の遠距離通信部５１は、主に、図１２に示されている遠距離通信回路５１１及びＣＰＵ５０１の処理によって実現され、インターネット等の通信ネットワークを介して、他の装置（例えば、他のスマートフォン、サーバ）との間で各種データ（または情報）の送受信を行う。 (Each smartphone functional configuration)
The long-distance communication unit 51 of the smartphone 5 is mainly realized by the processing of the long-distance communication circuit 511 and the CPU 501 illustrated in FIG. 12, and is connected to another device (for example, other network) via a communication network such as the Internet. Various data (or information) is transmitted to and received from a smartphone or server.

受付部５２は、主にタッチパネル５２１及びＣＰＵ５０１の処理によって実現され、利用者から各種の選択又は入力を受け付ける。タッチパネル５２１はディスプレイ５１７と共用であってもよい。また、タッチパネル以外の入力手段（ボタン）等でもよい。 The accepting unit 52 is realized mainly by the processes of the touch panel 521 and the CPU 501 and accepts various selections or inputs from the user. The touch panel 521 may be shared with the display 517. Further, input means (buttons) other than the touch panel may be used.

撮像部５３は、主に、図１２に示されているＣＭＯＳセンサ５０５，５１２、及びＣＰＵ５０１の処理によって実現され、被写体や風景等を撮像し、撮影画像データを得る。この撮影画像データは、透視射影方式で撮影された平面画像データである。 The imaging unit 53 is realized mainly by the processing of the CMOS sensors 505 and 512 and the CPU 501 shown in FIG. 12, and images a subject, a landscape, and the like to obtain captured image data. This photographed image data is planar image data photographed by the perspective projection method.

集音部５４は、図１２に示されているマイク５１４、及びＣＰＵ５０１の処理によって実現され、スマートフォン５の周囲の音を集音する。 The sound collection unit 54 is realized by the processing of the microphone 514 and the CPU 501 illustrated in FIG. 12 and collects sounds around the smartphone 5.

画像・音処理部５５は、主にＣＰＵ５０１からの命令によって実現され、撮像部５３によって得られた撮影画像データ、又は集音部５４によって得られた音データに対して、各種処理を行なう。 The image / sound processing unit 55 is realized mainly by a command from the CPU 501, and performs various processes on the captured image data obtained by the imaging unit 53 or the sound data obtained by the sound collection unit 54.

表示制御部５６は、図１２に示されているＣＰＵ５０１の処理によって実現され、ディスプレイ５１７に、撮像部５３による撮影中又は撮影後の撮影画像データに係る平面画像Ｐを表示させる。また、表示制御部５６は、画像・音処理部５５によって作成された重畳表示メタデータを利用し、基準形状情報（基準位置パラメータ）に対して基準形状変換情報で特定される背景動画（全天球画像ＣＥ）上の変換位置パラメータ、及び補正パラメータで示された明るさ値及び色値に基づいて、全天球画像ＣＥ上に平面画像Ｐを重畳して表示する。なお、変換位置パラメータは「変換位置情報」の一例である。補正パラメータは「補正情報」の一例である。 The display control unit 56 is realized by the processing of the CPU 501 illustrated in FIG. 12, and causes the display 517 to display the planar image P related to the captured image data during or after the imaging by the imaging unit 53. In addition, the display control unit 56 uses the superimposed display metadata created by the image / sound processing unit 55 and uses a background moving image (entire sky) specified by the reference shape conversion information with respect to the reference shape information (reference position parameter). The planar image P is superimposed and displayed on the omnidirectional image CE on the basis of the brightness value and the color value indicated by the conversion position parameter on the spherical image CE) and the correction parameter. The conversion position parameter is an example of “conversion position information”. The correction parameter is an example of “correction information”.

判断部５７は、図１２に示されているＣＰＵ５０１の処理によって実現され、各種判断を行なう。 The determination unit 57 is realized by the processing of the CPU 501 shown in FIG. 12, and makes various determinations.

近距離通信部５８は、主に、ＣＰＵ５０１の処理、並びに近距離通信回路５１９及びアンテナ５１９ａによって実現され、特殊撮影装置１の近距離通信部１８、一般撮影装置３の近距離通信部３８等と、Wi-Fi等による近距離無線通信技術によって通信することができる。 The short-range communication unit 58 is mainly realized by the processing of the CPU 501 and the short-range communication circuit 519 and the antenna 519a. It is possible to communicate by a short-range wireless communication technology such as Wi-Fi.

記憶・読出部５９は、主に、図１２に示されているＣＰＵ５０１の処理によって実現され、記憶部５０００に、重畳表示メタデータ等の各種データ（または情報）を記憶したり、記憶部５０００から重畳表示メタデータ等の各種データ（または情報）を読み出したりする。また、記憶・読出部５９は、記憶部５０００から各種データを取得する取得部の役割を果たす。 The storage / reading unit 59 is mainly realized by the processing of the CPU 501 shown in FIG. 12, and stores various data (or information) such as superimposed display metadata in the storage unit 5000 or from the storage unit 5000. Various data (or information) such as superimposed display metadata is read out. The storage / reading unit 59 serves as an acquisition unit that acquires various data from the storage unit 5000.

（画像・音処理部の詳細な各機能構成）
ここで、図１５及び図１６を用いて、画像・音処理部５５の各機能構成について詳細に説明する。図１５は、第１の実施形態に係るメタデータ作成部の詳細な機能ブロック図である。図１６は、第１の実施形態に係る重畳部の詳細な機能ブロック図である。 (Detailed functional configuration of image / sound processor)
Here, the functional configuration of the image / sound processing unit 55 will be described in detail with reference to FIGS. 15 and 16. FIG. 15 is a detailed functional block diagram of the metadata creation unit according to the first embodiment. FIG. 16 is a detailed functional block diagram of the superposition unit according to the first embodiment.

画像・音処理部５５は、大きく分けて、エンコードを行なうメタデータ作成部５５ａとデコードを行なう重畳部５５ｂを有している。メタデータ作成部５５ａは、図１８に示されている後述のステップＳ１９の処理を実行する。また、重畳部５５ｂは、図１８に示されている後述のステップＳ２０の処理を実行する。 The image / sound processing unit 55 includes a metadata creating unit 55a that performs encoding and a superimposing unit 55b that performs decoding. The metadata creation unit 55a executes a process of step S19 described later shown in FIG. In addition, the superimposing unit 55b executes a process of step S20 described later shown in FIG.

｛メタデータ作成部の各機能構成｝
まずは、メタデータ作成部５５ａの各機能構成について説明する。メタデータ作成部５５ａは、（メタデータ）作成用動画フレーム抽出部５４８、抽出部５５０、第１の対応領域算出部５５２、注視点特定部５５４、射影方式変換部５５６、第２の対応領域算出部５５８、初期設定形状生成部５５９、領域分割部５６０、射影方式逆変換部５６２、基準形状変換情報算出部５６８、及び重畳表示メタデータ作成部５７０を有している。また、以下に説明する画像や領域を示す符号は図１９に示されている。図１９は、重畳表示パラメータの作成処理の過程における画像の概念図である。 {Each functional configuration of the metadata creation unit}
First, each functional configuration of the metadata creation unit 55a will be described. The metadata creation unit 55a includes a (metadata) creation moving image frame extraction unit 548, an extraction unit 550, a first corresponding region calculation unit 552, a gazing point identification unit 554, a projection method conversion unit 556, and a second corresponding region calculation. A unit 558, an initial setting shape generation unit 559, a region division unit 560, a projection method inverse conversion unit 562, a reference shape conversion information calculation unit 568, and a superimposed display metadata creation unit 570. Reference numerals indicating images and regions described below are shown in FIG. FIG. 19 is a conceptual diagram of an image in the process of creating a superimposed display parameter.

作成用動画フレーム抽出部５４８は、背景動画となる一方の動画像と、前景動画となる他方の動画像のそれぞれから、それぞれ指定された時刻に対応するフレーム画像を抽出する。動画像は様々な記録フォーマットがあることは周知のことであるが、時系列に沿って複数の静止画像が並べられた画像としてとらえることができる。そして、１秒間の動画を構成する静止画像の枚数がフレームレートであり、例えば、秒１コマ（１コマ／１秒）などで表現される。したがって、動画記録開始の最初のフレームを時刻０とすると、時刻Ｔにおけるフレーム画像はフレームレートと時刻の関係から何番目のフレーム画像となるかを算出及び抽出することができる。もちろん指定された時刻ｔｉと同時刻となるフレーム画像は存在しない場合があり、前後のフレーム画像で近いものを時刻ｔｉのフレームとするなどの処理が考えられる。本実施形態では背景動画となる動画と、前景動画となる動画の２つの動画を取り扱うが、それぞれの動画開始時刻がずれている場合がある。本実施形態では背景動画に前景動画を重畳するための重畳位置を算出するため、取り扱う画像は時刻の差が小さい方が扱いやく、前景動画および背景動画の記録開始時刻がずれている場合は、前景動画又は背景動画のどちらかに時刻の基準を合わせると良い。前景動画及び背景動画としての２つの動画の開始画像フレームを合わせるための時刻のずれを算出する方法は、例えば、それぞれ動画において画像とともに音が記録されている場合は、それぞれの動画における音データの相互相関を算出し、相互相関のずれが最小になる時刻のずれを求める方法や、音データが無い場合は、それぞれのフレーム画像間での画像相関から求める方法などある。 The creation moving image frame extraction unit 548 extracts a frame image corresponding to each designated time from one moving image that is a background moving image and the other moving image that is a foreground moving image. Although it is well known that moving images have various recording formats, it can be regarded as an image in which a plurality of still images are arranged in time series. The number of still images constituting a moving image for one second is a frame rate, and is represented by, for example, one frame per second (1 frame / 1 second). Therefore, when the first frame at the start of moving image recording is time 0, it is possible to calculate and extract the number of the frame image at the time T from the relationship between the frame rate and the time. Of course, there may be no frame image having the same time as the designated time ti, and processing such as making a frame close to the previous and next frame images into a frame at the time ti can be considered. In this embodiment, two moving images, a moving image that becomes a background moving image and a moving image that becomes a foreground moving image, are handled. In this embodiment, since the superimposition position for superimposing the foreground video on the background video is calculated, it is easier to handle the image with a smaller time difference, and when the recording start times of the foreground video and the background video are shifted, The time reference should be adjusted to either the foreground video or the background video. The method for calculating the time lag for matching the start image frames of the two moving images as the foreground moving image and the background moving image is, for example, when sound is recorded together with the image in each moving image, There are a method of calculating cross-correlation and obtaining a time shift at which the cross-correlation shift is minimized, and a method of obtaining from the image correlation between frame images when there is no sound data.

ここで、背景動画と前景動画の同期に関して説明を補足する。例えば、それぞれの動画のフレームレートが３０ｆｐｓと２４ｆｐｓとして記録された背景動画と前景動画があったとする。この場合、上述の音データによる相互相関を算出し、背景動画が前景動画よりも０.５秒早く撮影された場合、背景動画と前景動画の開始フレームをあわせるために、背景動画を０．５秒分、即ち（３０×０．５＝）１５フレーム分オフセットさせることで時刻の同期を実現する。よって同期後の１秒間隔の動画フレームはそれぞれの動画のフレームを考慮し、背景動画のフレームカウントは１５，４５，７５，１０５・・・となり、前景動画のフレームカウントは０，２４，４８，７２・・・となる。ここでは、前景動画及び背景動画の各動画開始フレームのずれを考慮した上で、時刻ｔｉにおける背景動画のフレームを正距円筒射影画像ＥＣｉ、同じ時刻ｔｉにおける前景動画のフレームを平面画像Ｐｉとする。即ち、正距円筒射影画像ＥＣｉと平面画像Ｐｉは、同じ時間に撮影されることで得られた動画の一フレームである。なお、相互相関の算出方法に関しては、公知の技術を使えば良く本実施形態の本質部分ではないため、詳細な説明は省略する。以上のように、相互に画像間の同期については、公知の技術を使えば良く本実施形態の本質部分ではないため、詳細な説明は省略する。 Here, a supplementary explanation will be given regarding the synchronization of the background video and the foreground video. For example, it is assumed that there are a background moving image and a foreground moving image recorded with frame rates of 30 fps and 24 fps, respectively. In this case, when the cross-correlation based on the above sound data is calculated and the background moving image is shot 0.5 seconds earlier than the foreground moving image, the background moving image is set to 0.5 to match the background moving image and the starting frame of the foreground moving image. Time synchronization is realized by offsetting by 15 seconds, that is, (30 × 0.5 =) 15 frames. Therefore, the video frames at 1-second intervals after synchronization take into account the frames of the respective videos, the frame count of the background video is 15, 45, 75, 105, etc., and the frame count of the foreground video is 0, 24, 48, 72... Here, taking into account the deviation of the moving image start frames of the foreground moving image and the background moving image, the frame of the background moving image at time ti is the equirectangular projection image ECi, and the frame of the foreground moving image at the same time ti is the plane image Pi. . That is, the equirectangular projection image ECi and the planar image Pi are one frame of a moving image obtained by shooting at the same time. Note that the cross-correlation calculation method is not an essential part of the present embodiment as long as a known technique is used, and detailed description thereof is omitted. As described above, the synchronization between the images is not an essential part of the present embodiment as long as a known technique is used, and detailed description thereof is omitted.

抽出部５５０は、各画像ＥＣｉ，Ｐｉの局所特徴に基づき特徴点を抽出する。局所特徴とは、エッジやブロブなど画像内に見られるパターンや構造である。局所特徴を数値化したものが特徴量である。本実施形態では、抽出部５５０は、異なる画像で各特徴点を抽出する。抽出部５５０が用いられる２つの画像は、歪みが著しく大きくない限り、異なる射影方式であってもよい。例えば、抽出部５５０は、正距円筒射影方式によって得られた長方形の正距円筒射影画像ＥＣｉと、透視射影方式によって得られた長方形の平面画像Ｐｉとの間、及び、平面画像Ｐｉと、射影方式変換部５５６によって変換された後の周辺領域画像ＰＩｉとの間で用いられる。なお、正距円筒射影方式は第１の射影方式の一例であり、透視射影方式は第２の射影方式の一例である。また、正距円筒射影画像は第１の射影画像の一例であり、平面画像は第２の射影画像の一例である。 The extraction unit 550 extracts feature points based on the local features of the images ECi and Pi. A local feature is a pattern or structure found in an image, such as an edge or a blob. A feature value is a numerical value of a local feature. In the present embodiment, the extraction unit 550 extracts each feature point from different images. The two images used by the extraction unit 550 may have different projection methods as long as the distortion is not significantly large. For example, the extraction unit 550 includes, between the rectangular equirectangular projection image ECi obtained by the equirectangular projection method and the rectangular planar image Pi obtained by the perspective projection method, and the projection image Pi. It is used between the peripheral region image PIi after being converted by the method conversion unit 556. The equirectangular projection method is an example of the first projection method, and the perspective projection method is an example of the second projection method. The equirectangular projection image is an example of a first projection image, and the planar image is an example of a second projection image.

第１の対応領域算出部５５２は、最初に正距円筒射影画像ＥＣｉにおける複数の特徴点ｆｐ１に基づいた各特徴量ｆｖ１を求めるとともに、平面画像Ｐｉにおける複数の特徴点ｆｐ２に基づいた各特徴量ｆｖ２を求める。特徴量の記述方法はいくつかの方式が提案されているが、本実施形態においては、スケールや回転に対して不変又は頑強であることが望ましい。第１の対応領域算出部５５２は、続いて算出した正距円筒射影画像ＥＣの複数の特徴点ｆｐ１の特徴量ｆｖ１と、平面画像Ｐｉにおける複数の特徴点ｆｐ２に対する特徴量ｆｖ２の類似度に基づき画像間の対応点を算出し、算出した画像間の対応点の関係から、正距円筒射影画像ＥＣにおいて、平面画像Ｐに対応するホモグラフィを算出し、このホモグラフィを変換に用いることで、第１のホモグラフィ変換を行なう。その結果、第１の対応領域算出部５５２は、第１の対応領域ＣＡ１を算出する。この場合、平面画像Ｐｉの４頂点から成る四角形（矩形）の中心点ＣＰ１は、第１のホモグラフィ変換によって、正距円筒射影画像ＥＣにおける注視点ＧＰ１に変換される。 The first corresponding region calculation unit 552 first obtains each feature amount fv1 based on the plurality of feature points fp1 in the equirectangular projection image ECi, and each feature amount based on the plurality of feature points fp2 in the planar image Pi. Find fv2. Several methods have been proposed as a description method of the feature amount. In the present embodiment, it is desirable that the feature amount is invariable or robust to scale and rotation. The first corresponding area calculation unit 552 is based on the similarity between the feature quantity fv1 of the plurality of feature points fp1 of the equirectangular projection image EC and the feature quantity fv2 of the feature points fp2 in the planar image Pi. By calculating the corresponding points between the images, and calculating the homography corresponding to the planar image P in the equirectangular projection image EC from the calculated relationship between the corresponding points, and using this homography for conversion, A first homography transformation is performed. As a result, the first corresponding area calculation unit 552 calculates the first corresponding area CA1. In this case, a quadrangle (rectangular) center point CP1 composed of the four vertices of the planar image Pi is converted into a gazing point GP1 in the equirectangular projection image EC by the first homography conversion.

なお、平面画像Ｐｉの４頂点の頂点座標を、ｐ１＝（ｘ１，ｙ１）、ｐ２＝（ｘ２，ｙ２）、ｐ３＝（ｘ３，ｙ３）、ｐ４＝（ｘ４，ｙ４）とすると、第１の対応領域算出部５５２は、以下に示す（式２）に基づいて、中心点ＣＰ１（ｘ，ｙ）を定めることができる。 If the vertex coordinates of the four vertices of the planar image Pi are p1 = (x1, y1), p2 = (x2, y2), p3 = (x3, y3), and p4 = (x4, y4), the first The corresponding region calculation unit 552 can determine the center point CP1 (x, y) based on (Expression 2) shown below.

図１９では平面画像Ｐｉの画像形状が長方形であるが、対角線の交点を用いることによって正方形、台形、菱形等、さまざまな四角形の部分画像に対しても中心座標を算出することができる。平面画像Ｐｉの画像形状が、長方形、正方形に限定される場合は、計算の省略化のため、対角線の中点を部分画像の中心座標ＰＣとしてよい。対角線Ｐ１Ｐ３の中点の算出する場合の（式３）を以下に示す。 In FIG. 19, the image shape of the planar image Pi is a rectangle, but the center coordinates can be calculated even for partial images of various rectangles such as a square, a trapezoid, and a rhombus by using the intersections of diagonal lines. When the image shape of the planar image Pi is limited to a rectangle or a square, the midpoint of the diagonal line may be used as the center coordinate PC of the partial image in order to omit the calculation. (Equation 3) in the case of calculating the midpoint of the diagonal line P1P3 is shown below.

注視点特定部５５４は、平面画像Ｐｉの中心点ＣＰ１が第１のホモグラフィ変換後に位置する正距円筒射影画像ＥＣｉ上の点（本実施形態では「注視点」という）を特定する。 The gazing point specifying unit 554 specifies a point (referred to as “gazing point” in the present embodiment) on the equirectangular projection image ECi where the center point CP1 of the planar image Pi is located after the first homography transformation.

ところで、注視点ＧＰ１の座標は、正距円筒射影画像ＥＣ上の座標であるため、緯度及び経度の表現に変換すると都合が良い。具体的には、正距円筒射影画像ＥＣｉの垂直方向を−90度(−0.5π)から+90度(+0.5π)の緯度座標として表現し、水平方向を−180度(−π)から+180度(+π)の経度座標として表現する。このようにすることで、緯度・経度座標から、正距円筒射影画像ＥＣｉの画像サイズに対応した画素位置座標を算出することができる。 By the way, since the coordinates of the gazing point GP1 are coordinates on the equirectangular projection image EC, it is convenient to convert them into expressions of latitude and longitude. Specifically, the vertical direction of the equirectangular projection image ECi is expressed as latitude coordinates from −90 degrees (−0.5π) to +90 degrees (+ 0.5π), and the horizontal direction from −180 degrees (−π). Expressed as longitude coordinates of +180 degrees (+ π). In this way, pixel position coordinates corresponding to the image size of the equirectangular projection image ECi can be calculated from the latitude / longitude coordinates.

射影方式変換部５５６は、正距円筒射影画像ＥＣｉ内の注視点ＧＰ１を中心とした周辺領域ＰＡを、平面画像Ｐと同じ透視射影方式に変換することで、周辺領域画像ＰＩｉを作成する。この場合、注視点ＧＰ１が変換された後の点を中心点ＣＰ２として、平面画像Ｐｉの対角画角αと同じ画角を垂直画角（又は水平画角）とした場合に特定することができる正方形の周辺領域画像ＰＩｉを結果的に作成することができるように、射影変換元の周辺領域ＰＡを特定する。以下、更に詳細に説明する。 The projection method conversion unit 556 generates the peripheral region image PIi by converting the peripheral region PA around the gazing point GP1 in the equirectangular projection image ECi into the same perspective projection method as that of the planar image P. In this case, the point after the gazing point GP1 is converted is set as the center point CP2, and it is specified when the same angle of view as the diagonal angle of view α of the planar image Pi is set as the vertical angle of view (or horizontal angle of view). The peripheral area PA of the projective transformation source is specified so that a possible square peripheral area image PIi can be created as a result. This will be described in more detail below.

（射影方式の変換）
まず、射影方式の変換について説明する。図３乃至図５を用いて説明したように、正距円筒射影画像ＥＣによって立体球ＣＳを被うことで、全天球画像ＣＥを作成している。よって、正距円筒射影画像ＥＣの各画素データは、３次元の全天球画像の立体球ＣＳの表面における各画素データに対応させることができる。そこで、射影方式変換部５５６による変換式は、正距円筒射影画像ＥＣｉにおける座標を(緯度，経度)＝（ｅａ，ａａ）と表現し、３次元の立体球ＣＳ上の座標を直行座標（ｘ，ｙ，ｚ）で表わすと、以下の（式４）で表わすことができる。 (Projection method conversion)
First, the conversion of the projection method will be described. As described with reference to FIGS. 3 to 5, the omnidirectional image CE is created by covering the solid sphere CS with the equirectangular projection image EC. Therefore, each pixel data of the equirectangular projection image EC can correspond to each pixel data on the surface of the solid sphere CS of the three-dimensional omnidirectional image. Therefore, the conversion formula by the projection system conversion unit 556 expresses the coordinates in the equirectangular projection image ECi as (latitude, longitude) = (ea, aa), and the coordinates on the three-dimensional solid sphere CS are orthogonal coordinates (x , Y, z), it can be expressed by the following (formula 4).

(x, y, z) = (cos(ea) × cos(aa), cos(ea) × sin(aa), sin(ea)) ・・・(式４)
但し、このときの立体球ＣＳの半径は１とする。 (x, y, z) = (cos (ea) × cos (aa), cos (ea) × sin (aa), sin (ea)) (Equation 4)
However, the radius of the solid sphere CS at this time is 1.

一方で、透過射影画像である平面画像Ｐｉは２次元画像であるが、これを２次元の極座標（動径，偏角）＝（ｒ，ａ）で表現すると、動径ｒは対角画角αに対応し、取り得る範囲は0 ≦ r ≦ tan(対角画角/2)となる。また、平面画像Ｐを２次元の直交座標系（ｕ，ｖ）で表わすと、極座標（動径，偏角）＝（ｒ，ａ）との変換関係は、以下の（式５）で表わすことができる。 On the other hand, the planar image Pi, which is a transmission projection image, is a two-dimensional image. When this is expressed by two-dimensional polar coordinates (radial radius, declination) = (r, a), the radial radius r is a diagonal field angle. Corresponding to α, the possible range is 0 ≦ r ≦ tan (diagonal angle of view / 2). Further, when the planar image P is represented by a two-dimensional orthogonal coordinate system (u, v), the conversion relationship with polar coordinates (radial radius, declination) = (r, a) is represented by the following (formula 5). Can do.

u = r × cos(a), v = r × sin(a) ・・・(式５)
次に、この（式５）を３次元の座標（動径，極角，方位角）に対応させることを考える。今、立体球ＣＳの表面のみを考えているため、３次元極座標における動径は「１」である。また、立体球ＣＳの表面に張り付けた正距円筒射影画像ＥＣを透視射影変換する射影は、立体球ＣＳの中心に仮想カメラＩＣがあると考えると、上述の２次元極座標（動径，偏角）＝（ｒ，ａ）を使うと、以下の（式６）、（式７）で表わすことができる。 u = r × cos (a), v = r × sin (a) (Equation 5)
Next, let us consider that this (Formula 5) corresponds to three-dimensional coordinates (radial radius, polar angle, azimuth angle). Since only the surface of the solid sphere CS is considered now, the moving radius in the three-dimensional polar coordinate is “1”. Further, the projection for perspective projection transformation of the equirectangular projection image EC attached to the surface of the solid sphere CS is considered to be the above-described two-dimensional polar coordinate (radial radius, declination) when the virtual camera IC is located at the center of the solid sphere CS. ) = (R, a) can be expressed by the following (Expression 6) and (Expression 7).

r = tan(極角) ・・・（式６）
a = 方位角・・・（式７）
ここで極角をtとすると、t = arctan(r)となるため、３次元極座標（動径、極角、方位角)は、(動径、極角、方位角)＝(1, arctan(r), a)と表現することができる。 r = tan (polar angle) (Formula 6)
a = Azimuth angle (Formula 7)
If the polar angle is t, then t = arctan (r), so the three-dimensional polar coordinates (radial radius, polar angle, azimuth) are (radial radius, polar angle, azimuth) = (1, arctan ( r), a).

また３次元極座標から、直行座標系（ｘ，ｙ，ｚ）へ変換するための変換式は、以下の（式８）で表わすことができる。 A conversion formula for converting from the three-dimensional polar coordinate to the orthogonal coordinate system (x, y, z) can be expressed by the following (Formula 8).

(x, y, z) = (sin(t) × cos(a), sin(t) × sin(a), cos(t)) ・・・（式８）
上記の（式８）により、正距円筒射影方式による正距円筒射影画像ＥＣｉと、透視射影方式による平面画像Ｐｉの相互変換ができるようになった。即ち、作成すべき平面画像Ｐの対角画角αに対応する動径rを用いることで、平面画像Ｐの各画素が、正距円筒射影画像ＥＣのどの座標に対応するかを表す変換マップ座標を算出でき、この変換マップ座標に基づいて、正距円筒射影画像ＥＣｉから、透視射影画像である周辺領域画像ＰＩｉを作成することができる。 (x, y, z) = (sin (t) x cos (a), sin (t) x sin (a), cos (t)) (Equation 8)
According to the above (Equation 8), the equirectangular projection image ECi by the equirectangular projection method and the planar image Pi by the perspective projection method can be mutually converted. That is, by using the radius r corresponding to the diagonal angle of view α of the planar image P to be created, the conversion map indicating which coordinate of the equirectangular projection image EC corresponds to each pixel of the planar image P. The coordinates can be calculated, and the peripheral area image PIi, which is a perspective projection image, can be created from the equirectangular projection image ECi based on the conversion map coordinates.

ところで、上記射影方式の変換は、正距円筒射影画像ＥＣの（緯度，経度）が（９０°，０°)となる位置が、透視射影画像である周辺領域画像ＰＩの中心点ＣＰ２となるような変換を示している。そこで、正距円筒射影画像ＥＣｉの任意の点を注視点として透視射影変換をする場合は、正距円筒射影画像ＥＣを貼り付けた立体球ＣＳを回転させることで、注視点の座標(緯度、経度)が（９０°,０°）の位置に配置されるような座標回転を行えば良い。 By the way, in the projection method conversion, the position where the (latitude, longitude) of the equirectangular projection image EC is (90 °, 0 °) becomes the center point CP2 of the peripheral region image PI which is a perspective projection image. Shows the conversion. Therefore, when performing perspective projection conversion with an arbitrary point of the equirectangular projection image ECi as a gazing point, the coordinates of the gazing point (latitude, Coordinate rotation may be performed such that (longitude) is arranged at a position of (90 °, 0 °).

この立体球ＣＳの回転に関する変換公式は、一般の座標回転公式であるため、説明を省略する。 Since the conversion formula relating to the rotation of the solid sphere CS is a general coordinate rotation formula, description thereof is omitted.

（周辺領域画像の特定）
次に、図２０を用いて、周辺領域画像ＰＩｉの領域を特定する方法について説明する。なお、図２０は、周辺領域画像を特定する際の概念図である。 (Identification of surrounding area image)
Next, a method for specifying the area of the peripheral area image PIi will be described with reference to FIG. FIG. 20 is a conceptual diagram when specifying the peripheral region image.

第１の対応領域算出部５５２が、平面画像Ｐｉと周辺領域画像ＰＩｉの対応関係を判断するにあたって、周辺領域画像ＰＩｉ内に第２の対応領域ＣＡ２ができるだけ広く含まれていることが望ましい。よって、周辺領域画像ＰＩｉを広い領域に設定すれば、第２の対応領域ＣＡ２を含まないという事態は生じない。しかし、周辺領域画像ＰＩｉを、あまりにも広い領域に設定すると、その分だけ類似度を算出する対象の画素が増えるため、処理に時間が掛かってしまう。そのため、極力、周辺領域画像ＰＩｉの領域は第２の対応領域ＣＡ２を含む範囲内で小さい方が良い。そこで、本実施形態では、以下に示すような方法で、周辺領域画像ＰＩを特定する。 When the first corresponding area calculation unit 552 determines the correspondence relationship between the planar image Pi and the peripheral area image PIi, it is desirable that the second corresponding area CA2 is included as widely as possible in the peripheral area image PIi. Therefore, if the peripheral area image PIi is set to a wide area, a situation in which the second corresponding area CA2 is not included does not occur. However, if the peripheral area image PIi is set to an excessively large area, the number of pixels for which the similarity is calculated increases, and thus processing takes time. Therefore, it is preferable that the area of the peripheral area image PIi is as small as possible within the range including the second corresponding area CA2. Therefore, in the present embodiment, the peripheral area image PI is specified by the following method.

本実施形態では、周辺領域画像ＰＩｉの特定に、平面画像の３５ｍｍ換算焦点距離を使用するが、これは撮影時に記録されるExifデータから取得される。３５ｍｍ換算焦点距離は、いわゆる24mm x 36mmのフィルムサイズを基準とした焦点距離であるため、このフィルムの対角と、焦点距離から以下の算出式（式９）、（式１０）で対応する対角画角を算出することができる。 In the present embodiment, the 35 mm equivalent focal length of the planar image is used for specifying the peripheral area image PIi, and this is acquired from Exif data recorded at the time of photographing. Since the 35 mm equivalent focal length is a focal length based on a so-called 24 mm × 36 mm film size, the following calculation formulas (Equation 9) and (Equation 10) correspond to the diagonal of the film and the focal length. The angle of view can be calculated.

フィルム対角 = sqrt(24*24 + 36*36) ・・・（式９）
合成用画像画角/2 = arctan((フィルム対角/2)/合成用画像35mm換算焦点距離)・・（式１０）
ところで、この画角をカバーする画像は円形となるのだが、実際の撮像素子（フィルム）は長方形なので円に内接する長方形画像となっている。本実施形態では、周辺領域画像ＰＩｉの垂直画角αを、平面画像Ｐｉの対角画角αと同じになるように設定する。これにより、図２０（ｂ）に示されている周辺領域画像ＰＩｉは、図２０（ａ）に示されている平面画像Ｐｉの対角画角αをカバーする円に外接する正方形となり、垂直画角αは、下記（式１１）、(式１２)で示されているように、正方形の対角と平面画像Ｐの焦点距離から算出できる。
正方形対角=sqrt(フィルム対角*フィルム対角+フィルム対角*フィルム対角)・・・（式１１）
垂直画角α/2 = arctan((正方形対角/2) / 平面画像の35mm換算焦点距離))・・・（式１２）
このような垂直画角αで射影変換することで、注視点を中心に平面画像Ｐｉの対角画角αにおける画像をできるだけ広くカバーでき、かつ垂直画角αが大きくなりすぎない周辺領域画像ＰＩｉ（透視射影画像）を作成することができる。 Film diagonal = sqrt (24 * 24 + 36 * 36) (Equation 9)
Composite image angle of view / 2 = arctan ((film diagonal / 2) / 35mm equivalent focal length of composite image) ... (Equation 10)
By the way, the image covering this angle of view is circular, but since the actual image sensor (film) is rectangular, it is a rectangular image inscribed in the circle. In the present embodiment, the vertical field angle α of the peripheral area image PIi is set to be the same as the diagonal field angle α of the planar image Pi. As a result, the peripheral area image PIi shown in FIG. 20B becomes a square circumscribing a circle covering the diagonal angle of view α of the planar image Pi shown in FIG. The angle α can be calculated from the diagonal of the square and the focal length of the planar image P, as shown in (Expression 11) and (Expression 12) below.
Square diagonal = sqrt (film diagonal * film diagonal + film diagonal * film diagonal) ... (Formula 11)
Vertical angle of view α / 2 = arctan ((square square / 2) / 35mm equivalent focal length of planar image)) ... (Formula 12)
By performing projective transformation at such a vertical angle of view α, the peripheral region image PIi that can cover the image at the diagonal angle of view α of the planar image Pi as wide as possible with the gazing point as the center, and the vertical angle of view α is not too large. (Perspective projection image) can be created.

（位置情報の算出）
続いて、図１５に戻り、第２の対応領域算出部５５８は、平面画像Ｐｉにおける複数の特徴点ｆｐ２と周辺領域画像ＰＩｉにおける複数の特徴点ｆｐ３の特徴量ｆｖ３を算出する。算出された各特徴量ｆｖ２，ｆｖ３の類似度に基づき画像間の対応点を算出し、算出した画像間の対応点の関係から、周辺領域画像ＰＩｉにおいて、平面画像Ｐに対応するホモグラフィを算出し、このホモグラフィを変換に用いることで、第２のホモグラフィ変換を行なう。その結果、第２の対応領域算出部５５８は、第２の対応領域ＣＡ２を算出する。 (Calculation of location information)
Subsequently, returning to FIG. 15, the second corresponding region calculation unit 558 calculates the feature amount fv3 of the plurality of feature points fp2 in the planar image Pi and the plurality of feature points fp3 in the peripheral region image PIi. Corresponding points between images are calculated based on the calculated similarities of the respective feature quantities fv2 and fv3, and a homography corresponding to the planar image P is calculated in the peripheral region image PIi from the calculated relationship between the corresponding points between the images. The second homography conversion is performed by using this homography for the conversion. As a result, the second corresponding area calculation unit 558 calculates the second corresponding area CA2.

なお、第１のホモグラフィ変換の前に、第１のホモグラフィの算出時間を早めるために、平面画像Ｐｉ及び正距円筒射影画像ＥＣｉのうちの少なくとも一方の画像サイズをリサイズしてもよい。例えは、平面画像Ｐｉが４０００万画素で、正距円筒射影画像ＥＣｉが３０００万画素の場合、平面画像Ｐｉを３０００万画素にリサイズしたり、両画像を１０００万画素になるようにそれぞれの画像をリサイズしたりする。同様に、第２のホモグラフィ算出の前に、平面画像Ｐｉ及び周辺領域画像ＰＩｉの少なくとも一方の画像サイズをリサイズしてもよい。 Note that before the first homography conversion, the image size of at least one of the planar image Pi and the equirectangular projection image ECi may be resized in order to advance the calculation time of the first homography. For example, if the planar image Pi is 40 million pixels and the equirectangular projection image ECi is 30 million pixels, the planar image Pi is resized to 30 million pixels, or both images are adjusted to 10 million pixels. Or resize it. Similarly, the image size of at least one of the planar image Pi and the peripheral area image PIi may be resized before the second homography calculation.

ここで、一般的にホモグラフィとは、ある平面を射影変換を用いて別の平面に射影することを言う。また、本実施形態でのホモグラフィについて、以下に概要を示す。第１のホモグラフィ変換は、正距円筒射影画像ＥＣｉと平面画像Ｐｉとの射影関係を表すものとして、変換行列として示しており、平面画像Ｐｉにおける座標に、ホモグラフィ算出処理で算出したホモグラフィ変換行列を乗算することで、正距円筒射影画像ＥＣｉ（全天球画像ＣＥ）上での対応座標を算出することができる。具体的には、平面画像Ｐｉの４頂点から成る四角形（矩形）の中心点ＣＰ１は、第１のホモグラフィ変換によって、正距円筒射影画像ＥＣにおける注視点ＧＰ１を算出することができる。次に、第２のホモグラフィ変換は、正距円筒射影画像ＥＣｉを所定の射影方式変換した周辺領域画像ＰＩｉと平面画像Ｐｉとの射影関係を表すものとして、変換行列として示しており、平面画像Ｐｉにおける座標に、ホモグラフィ算出処理で算出したホモグラフィによる射影変換の行列を乗算することで、周辺領域画像ＰＩｉ上での対応座標を算出することができる。 Here, in general, homography refers to projecting one plane onto another plane using projective transformation. In addition, the outline of the homography in the present embodiment is shown below. The first homography transformation is shown as a transformation matrix as representing the projection relationship between the equirectangular projection image ECi and the plane image Pi, and the homography calculated by the homography calculation process is set to the coordinates in the plane image Pi. By multiplying the transformation matrix, the corresponding coordinates on the equirectangular projection image ECi (the omnidirectional image CE) can be calculated. Specifically, a gazing point GP1 in the equirectangular projection image EC can be calculated from a square (rectangular) center point CP1 composed of four vertices of the planar image Pi by the first homography transformation. Next, the second homography transformation is shown as a transformation matrix as representing a projection relationship between the peripheral area image PIi obtained by converting the equirectangular projection image ECi by a predetermined projection method and the planar image Pi. By multiplying the coordinates in Pi by the matrix of the projective transformation by homography calculated in the homography calculation process, the corresponding coordinates on the peripheral area image PIi can be calculated.

図２１（ａ）は初期設定形状を示した概念図、図２１（ｂ）は周辺領域画像及び第２の対応領域を示した概念図、図２１（ｃ）は領域分割した初期設定形状の概念図である。 FIG. 21A is a conceptual diagram showing an initial shape, FIG. 21B is a conceptual diagram showing a peripheral region image and a second corresponding region, and FIG. 21C is a concept of an initial shape that is divided into regions. FIG.

初期設定形状生成部５５９は、正距円筒射影方式であって平面画像Ｐｉの形状に対して射影変換（ホモグラフィ）の関係にあり、予め定められた画角と射影方式によって決まる初期設定形状ＤＦを生成する。ここで、初期設定形状ＤＦを適切に設定することで、回転、移動や変倍を行い表示がなされる際に、表示のズレを抑制することにつながる。なお、平面画像Ｐ１は、動画の最初のフレームを示すが、任意のフレームであってもよく、動画中の特定のフレームを示す。この初期設定形状生成部５５９は、初期設定形状ＤＦを生成するとしているが、この生成は、予め設定している、または、新たに作り出しても良い。つまり、この初期設定形状は、予め定められた画角と射影方式による所定形状、例えば四角形や円形に設定されている場合がある。また、本実施形態のように予め定めているが、他の場合では、平面画像Ｐｉの形状に応じて、画角や射影方式並びに形状を生成しても良い。なお、画角については、後述する焦点距離により求められることは、当該技術分野において、周知である。具体的には、平面画像Ｐｉのうち、平面画像Ｐ１の形状に応じて、まずは画角や射影方式並びに形状から初期設定形状を生成し、後の平面画像Ｐ２から平面画像Ｐｎに対する初期設定形状は、この平面画像Ｐ１に対して生成された初期設定形状を用いる。なお、この初期設定形状は、表示の際に適切に表示するために設けられるものであり、この初期設定形状と後述する基準形状変換情報とを用いて、合成表示するための位置と形状を定めることができる。 The initial setting shape generation unit 559 is an equirectangular projection method, has a projective transformation (homography) relationship with the shape of the planar image Pi, and has an initial setting shape DF determined by a predetermined angle of view and the projection method. Is generated. Here, by appropriately setting the initial setting shape DF, it is possible to suppress display displacement when display is performed by rotation, movement, or scaling. The planar image P1 indicates the first frame of the moving image, but may be an arbitrary frame and indicates a specific frame in the moving image. The initial shape generation unit 559 generates the initial shape DF, but this generation may be preset or newly generated. That is, the initial shape may be set to a predetermined shape based on a predetermined angle of view and a projection method, for example, a quadrangle or a circle. In addition, although it is predetermined as in the present embodiment, in other cases, an angle of view, a projection method, and a shape may be generated according to the shape of the planar image Pi. Note that it is well known in the technical field that the angle of view is obtained from a focal length described later. Specifically, in the planar image Pi, an initial setting shape is first generated from the angle of view, the projection method, and the shape according to the shape of the planar image P1, and the initial setting shape for the planar image Pn from the subsequent planar image P2 is The initial setting shape generated for the planar image P1 is used. The initial shape is provided for proper display during display, and the position and shape for composite display are determined using the initial shape and reference shape conversion information described later. be able to.

図２１（ａ）に示されているように、初期設定形状ＤＦは、前景動画と同じ射影方式として扱い、ここでは一般的なデジタルカメラで撮影された画像を前景動画として重畳することを考慮し、透視射影方式として予め初期設定形状ＤＦを定めておく。 As shown in FIG. 21A, the default shape DF is treated as the same projection method as that of the foreground moving image. Here, in consideration of superimposing an image taken with a general digital camera as the foreground moving image. The initial shape DF is determined in advance as a perspective projection method.

初期設定形状ＤＦは、図２１（ａ）のように長方形の中心を中心とした円に内接する長方形となるが、ここでは簡単のために長方形の縦、横の辺はそれぞれＸ軸、Ｙ軸に平行とし、中心を原点とする。対角画角は０度より大きく１８０度より小さい範囲で任意で決めることが可能であるが、極端な値を指定すると計算の誤差が大きくなる可能性があるため、例えば５０度などとする良い。また、前景動画となる動画において、実際に動画を記録した時に利用している記録画像の画角を採用しても良い。 The initial shape DF is a rectangle inscribed in a circle centered on the center of the rectangle as shown in FIG. 21A. Here, for the sake of simplicity, the vertical and horizontal sides of the rectangle are the X axis and the Y axis, respectively. Parallel to the center and the center as the origin. The diagonal field angle can be arbitrarily determined within a range larger than 0 degree and smaller than 180 degrees. However, if an extreme value is specified, a calculation error may increase. . Further, the angle of view of the recorded image that is used when the moving image is actually recorded may be employed in the moving image that becomes the foreground moving image.

次に、基準形状変換情報算出部５６８は、平面画像Ｐｉから第２の対応領域ＣＡ２に射影変換（ホモグラフィ）する場合の情報（パラメータ）を用いて、第２の対応領域ＣＡ２に対して初期設定形状ＤＦを変換する情報を算出する。この算出された情報は、基準形状ＢＦを回転及び変倍の少なくとも一方により変換させるための基準形状変換情報である。この回転に関する基準形状変換情報が後述の「基準形状回転情報」であり、変倍に関する基準形状変換情報が後述の「基準形状変倍情報」である。つまり、この基準形状変換情報は、初期設定形状ＤＦの位置や形状を変換させるたに用いられる情報であり、具体的には、回転、変倍、移動の３つを意味する情報が含まれている。 Next, the reference shape conversion information calculation unit 568 uses the information (parameter) when performing projective transformation (homography) from the planar image Pi to the second corresponding area CA2, and performs initial processing on the second corresponding area CA2. Information for converting the set shape DF is calculated. This calculated information is reference shape conversion information for converting the reference shape BF by at least one of rotation and scaling. The reference shape conversion information regarding this rotation is “reference shape rotation information” described later, and the reference shape conversion information regarding zooming is “reference shape scaling information” described later. In other words, this reference shape conversion information is information used to convert the position and shape of the initial shape DF, and specifically includes information meaning three of rotation, scaling, and movement. Yes.

更に、基準形状変換情報算出部５６８は、射影方式変換によって第２の対応領域ＣＡ２の中心点ＣＰ２を正距円筒射影方式に変換することで、正距円筒射影画像ＥＣｉ上で対応する対応点ＣＰ３の位置座標を算出する。この位置座標に関する基準形状変換情報が後述の「基準形状移動情報」であり、基準形状ＢＦの位置を移動により変換せるための情報である。 Further, the reference shape conversion information calculation unit 568 converts the center point CP2 of the second corresponding area CA2 to the equirectangular projection method by the projection method conversion, thereby corresponding points CP3 corresponding on the equirectangular projection image ECi. The position coordinates of are calculated. The reference shape conversion information regarding the position coordinates is “reference shape movement information” to be described later, and is information for converting the position of the reference shape BF by movement.

ここで、図２１（ｂ）を用いて、基準形状変換情報算出部５６８の処理について詳細に説明する。図２１（ｂ）は周辺領域画像及び第２の対応領域を示した概念図である。 Here, the processing of the reference shape conversion information calculation unit 568 will be described in detail with reference to FIG. FIG. 21B is a conceptual diagram showing a peripheral area image and a second corresponding area.

図２１（ｂ）では、周辺領域画像ＰＩｉと第２の対応領域ＣＡ２を示している。図２０で示したように、周辺領域画像ＰＩｉの垂直画角αは、平面画像Ｐｉの対角画角αと同じになるように設定されている。そして、第２の対応領域ＣＡ２と、平面画像Ｐｉとの間には、第２のホモグラフィ変換という関係がある。ところで、この場合のホモグラフィは、平面を平面に射影する変換となっているため、初期設定形状ＤＦと、第２の対応領域ＣＡ２の間にもホモグラフィによる射影関係で表現することを考える。第２の対応領域ＣＡ２の４頂点の取り方によっては、初期設定形状ＤＦで表現される平面からホモグラフィで完全に記述できる訳ではなく、最小二乗法等により４点のずれが一番小さくなるような近似が行われる。ホモグラフィの算出方法は、公知の算出方法なので省略するが、例えばOpenCV(Open Source Computer Vision Library)を利用すると、４組の対応点を入力としてホモグラフィ行列を算出する関数があるので利用しても良い。 FIG. 21B shows the peripheral area image PIi and the second corresponding area CA2. As shown in FIG. 20, the vertical field angle α of the peripheral area image PIi is set to be the same as the diagonal field angle α of the planar image Pi. Then, there is a relationship called second homography transformation between the second corresponding area CA2 and the planar image Pi. By the way, since the homography in this case is conversion for projecting a plane onto a plane, it is considered that the initial set shape DF and the second corresponding area CA2 are also expressed by a projection relationship by homography. Depending on how the four vertices of the second corresponding area CA2 are taken, it is not possible to completely describe by homography from the plane expressed by the default shape DF, and the shift of the four points is minimized by the least square method or the like. Such an approximation is performed. The calculation method of homography is omitted because it is a known calculation method. For example, when using OpenCV (Open Source Computer Vision Library), there is a function that calculates a homography matrix using four pairs of corresponding points as inputs. Also good.

また、基準形状変換情報算出部５６８は、算出されたホモグラフィから基準形状ＢＦを変換するための「基準形状回転情報」と「基準形状変倍情報」を算出する。算出されたホモグラフィから回転情報と変倍情報を求める方法としては公知のホモグラフィ分解法を用いることができる。ホモグラフィを回転マトリクスと、並進ベクトルに分解する方法の詳細は例えば、以下の文献に開示されているため、説明を省略する。 Further, the reference shape conversion information calculation unit 568 calculates “reference shape rotation information” and “reference shape scaling information” for converting the reference shape BF from the calculated homography. A known homography decomposition method can be used as a method for obtaining rotation information and scaling information from the calculated homography. Details of the method of decomposing the homography into a rotation matrix and a translation vector are disclosed in, for example, the following documents, and thus description thereof is omitted.

〔非特許文献〕
Zhang, Z. “A Flexible New Technique for Camera Calibration.” IEEE Transactions on Pattern Analysis and Machine Intelligence. Vol. 22, No. 11, 2000, pp. 1330-1334.
なお、ここでは、基準形状回転情報は、３次元モデル空間の各軸に対するオイラー角によって示されている。また、基準形状移動情報は、正距円筒射影画像ＥＣ上の注視点ＧＰ１により特定される。 [Non-patent literature]
Zhang, Z. “A Flexible New Technique for Camera Calibration.” IEEE Transactions on Pattern Analysis and Machine Intelligence. Vol. 22, No. 11, 2000, pp. 1330-1334.
Here, the reference shape rotation information is indicated by Euler angles with respect to the respective axes in the three-dimensional model space. Further, the reference shape movement information is specified by the gazing point GP1 on the equirectangular projection image EC.

次に、図１５に戻って、領域分割部５６０は、初期設定形状ＤＦの領域を複数の格子領域に分割する。ここで、図２１（ａ）、（ｃ）を用いて、基準形状の領域を複数の格子領域に分割する方法を説明する。図２１（ｃ）は、領域分割した基準形状の概念図である。 Next, returning to FIG. 15, the region dividing unit 560 divides the region of the initial shape DF into a plurality of lattice regions. Here, a method of dividing the reference shape region into a plurality of lattice regions will be described with reference to FIGS. FIG. 21C is a conceptual diagram of a reference shape divided into regions.

領域分割部５６０は、図２１（ａ）に示されているように、基準形状ＤＦの生成で説明したように対角画角と縦横比を与えることで生成された基準形状の４つの頂点から成る長方形を、図２１（ｃ）に示されているように、複数の格子領域に分割する。例えば、水平方向に８、垂直方向に８ほど均等分割する。 As illustrated in FIG. 21A, the region dividing unit 560 uses the four vertices of the reference shape generated by giving the diagonal angle of view and the aspect ratio as described in the generation of the reference shape DF. The rectangle formed is divided into a plurality of lattice regions as shown in FIG. For example, it is divided equally into 8 in the horizontal direction and 8 in the vertical direction.

次に、複数の格子領域の具体的な分割方法について説明する。 Next, a specific method for dividing a plurality of lattice regions will be described.

まず、基準形状の領域を均等に分割する算出式を示す。２点をA(X1，Y1)、B(X2，Y2)として、その２点間の線分を等間隔にｎ個に分割する場合、Ａからｍ番目にあたる点Ｐｍの座標は、以下に示す（式１３）によって算出される。 First, a calculation formula for equally dividing the reference shape region is shown. When the two points are A (X1, Y1) and B (X2, Y2) and the line segment between the two points is divided into n pieces at equal intervals, the coordinates of the point Pm corresponding to the mth point from A are as follows: Calculated by (Equation 13).

Pm = ( X1 + (X2 − X1) ×m ／ n， Y1 + (Y2 − Y1) × m ／ n )・・・（式１３）
上記の（式１３）によって、線分を均等に分割した座標が算出できるため、長方形の上辺、下辺をそれぞれ分割した座標を求め、分割した座標から成る線分をさらに分割すればよい。図２１（ａ）に示されているように、長方形である初期設定形状ＤＦの左上、右上、右下、左下の各座標をそれぞれＴＬ，ＴＲ，ＢＲ，ＢＬとした場合、線分ＴＬ−ＴＲおよび線分ＢＲ−ＢＬを均等に８分割した座標を求める。次に、０から８番目まで分割された各座標において、同じ順番に対応する座標同士から成る線分に対し、均等に８分割した座標を求める。これにより、四角形領域を８×８個の小領域に分割するための座標を求めることができる。 Pm = (X1 + (X2−X1) × m / n, Y1 + (Y2−Y1) × m / n) (Equation 13)
Since the coordinates obtained by equally dividing the line segment can be calculated by the above (Formula 13), the coordinates obtained by dividing the upper side and the lower side of the rectangle may be obtained, and the line segment formed by the divided coordinates may be further divided. As shown in FIG. 21A, when the upper left, upper right, lower right, and lower left coordinates of the rectangular initial setting shape DF are TL, TR, BR, and BL, respectively, a line segment TL-TR. And the coordinate which divided | segmented line segment BR-BL into 8 equally is calculated | required. Next, in each of the coordinates divided from the 0th to the 8th, coordinates obtained by equally dividing the line segment composed of coordinates corresponding to the same order are obtained. Thereby, coordinates for dividing the quadrangular area into 8 × 8 small areas can be obtained.

次に、図１５に戻り、射影方式逆変換部５６２は、領域分割部５６０によって領域分割された初期設定形状ＤＦの座標を、背景動画と同じ射影方式である正距円筒射影画像ＥＣｉと同じ正距円筒射影方式に逆変換することで、正距円筒射影画像ＥＣｉにおいて、基準形状の各格子点の対応点を算出する。 Next, returning to FIG. 15, the projection method inverse transformation unit 562 converts the coordinates of the initial shape DF divided by the region division unit 560 into the same normal projection image ECi as the equirectangular projection image ECi which is the same projection method as the background moving image. By inversely converting to the distance cylindrical projection method, the corresponding points of the grid points of the reference shape are calculated in the equirectangular projection image ECi.

図２２（ａ）は図２１（ｃ）で示した初期設定形状ＤＦの各格子点を、正距円筒射影画像ＥＣｉの中心（注視点（緯度０，経度０））の位置に配置した時に算出される各格子点を示した概念図である。射影方式の変換の説明において上述したように、射影方式が異なる画像の対応関係は、それぞれを３次元の立体球にマッピングすることで算出することが可能である。図２２（ａ）では、座標の一例として、ＴＬの座標(LO_00,00，LA_00,00)を示している。 FIG. 22A is calculated when each grid point of the initial setting shape DF shown in FIG. 21C is arranged at the center (gazing point (latitude 0, longitude 0)) of the equirectangular projection image ECi. It is the conceptual diagram which showed each lattice point performed. As described above in the description of the conversion of the projection method, the correspondence between the images having different projection methods can be calculated by mapping each of them to a three-dimensional solid sphere. In FIG. 22A, TL coordinates (LO _00,00 , LA _00,00 ) are shown as an example of coordinates.

ここで、基準形状ＢＦは透視射影方式として画角と縦横比を与えて生成したが、正距円筒射影画像ＥＣｉとの関係を記述するには、基準形状ＢＦをどの位置に配置するかという情報が必要となる。そこで、基準形状ＢＦを正距円筒射影画像ＥＣｉ上の緯度・経度座標を使って、配置する座標を定義することを考える。これは、すなわち、立体球ＣＳの中心からどこを向いているかという注視点として表すことができる。 Here, the reference shape BF is generated by giving an angle of view and an aspect ratio as a perspective projection method, but in order to describe the relationship with the equirectangular projection image ECi, information on where the reference shape BF is arranged Is required. Therefore, it is considered to define coordinates for arranging the reference shape BF using latitude / longitude coordinates on the equirectangular projection image ECi. That is, it can be expressed as a gazing point as to where it is facing from the center of the solid sphere CS.

図２２（ｂ）は図２２（ａ）と同じ基準形状ＢＦを、注視点を変更し、緯度-45度, 経度90度の位置に配置したときの概念図である。正距円筒射影は赤道付近から極方向に向かって水平方向が引き伸ばされた射影方式となっているので基準形状は図のように扇型のような形となる。 FIG. 22B is a conceptual diagram when the same reference shape BF as FIG. 22A is arranged at a position of −45 degrees latitude and 90 degrees longitude by changing the gazing point. Since the equirectangular projection is a projection system in which the horizontal direction is extended from the vicinity of the equator toward the polar direction, the reference shape is like a fan shape as shown in the figure.

図５及び図６で示したように基準形状ＢＦをどこに配置するかの情報を、本実施形態では「基準形状移動情報」と示し、正距円筒射影画像上の緯度・経度座標として表現する。図２２（ａ）は図２２（ｂ）で示された基準形状ＢＦを、基準形状移動情報により移動したと捉えることができる。ここで第２の対応領域ＣＡ２の中心点ＣＰ２を射影方式逆変換することで、正距円筒射影画像ＥＣｉ上で対応する対応点ＣＰ３を求め、この座標位置を「基準形状移動情報」とし、既に算出した「基準形状回転情報」および「基準形状変倍情報」を含んだ「基準形状変換情報」を利用することで、基準形状ＢＦを回転・変倍することと、回転・変倍された基準形状を正距円筒射影上の注視点ＧＰ１で示される位置に移動させることが可能となる。つまり、第２の対応領域に対応する第１の射影画像における対応領域が、基準形状変換情報と、基準形状を格子分割することで得られた座標を第一の射影方式に逆変換することで得られる位置パラメータとで表現できるということである。なお、格子点は複数の点の一例である。 As shown in FIG. 5 and FIG. 6, information on where the reference shape BF is arranged is indicated as “reference shape movement information” in the present embodiment, and is expressed as latitude / longitude coordinates on the equirectangular projection image. FIG. 22A can be considered that the reference shape BF shown in FIG. 22B has been moved according to the reference shape movement information. Here, the corresponding point CP3 on the equirectangular projection image ECi is obtained by inversely transforming the center point CP2 of the second corresponding area CA2 by the projection method, and this coordinate position is set as “reference shape movement information”. By using the “reference shape conversion information” including the calculated “reference shape rotation information” and “reference shape scaling information”, the reference shape BF is rotated and scaled, and the rotated and scaled reference The shape can be moved to the position indicated by the gazing point GP1 on the equirectangular projection. That is, the corresponding area in the first projection image corresponding to the second corresponding area is obtained by inversely converting the reference shape conversion information and the coordinates obtained by dividing the reference shape into the first projection method. It can be expressed by the obtained position parameter. Note that the grid point is an example of a plurality of points.

ところで、基準形状移動情報は立体球ＣＳの中心にある仮想カメラＩＣをパン（経度方向）、チルト（緯度方向）することにより、仮想カメラＩＣの視線軸を移動することと似ているため、「パン・チルト情報」と示すこともできる。 By the way, the reference shape movement information is similar to moving the visual axis of the virtual camera IC by panning (longitude direction) and tilting (latitude direction) the virtual camera IC at the center of the solid sphere CS. Pan / tilt information ”can also be indicated.

これにより、後述の重畳部５５ｂが、基準形状変換情報を利用して、平面画像Ｐｉの基準形状ＢＦを回転、変倍、及び移動することで、正距円筒射影画像ＥＣｉによって作成された全天球画像ＣＥに重畳表示させることができる。以上、基準位置パラメータと、動画フレームカウント値（図１５参照）、基準形状変換情報を作成することにより、正距円筒射影画像ＥＣｉと平面画像Ｐｉの位置関係を算出することができる。 As a result, the superimposing unit 55b described later uses the reference shape conversion information to rotate, scale, and move the reference shape BF of the planar image Pi, so that the whole sky created by the equirectangular projection image ECi is created. It can be displayed superimposed on the spherical image CE. As described above, the positional relationship between the equirectangular projection image ECi and the planar image Pi can be calculated by creating the reference position parameter, the moving image frame count value (see FIG. 15), and the reference shape conversion information.

続いて、図１５に戻り、重畳表示メタデータ作成部５７０は、図１７に示されているような重畳表示メタデータを作成する。 Subsequently, returning to FIG. 15, the superimposed display metadata creation unit 570 creates superimposed display metadata as shown in FIG. 17.

（重畳表示メタデータ）
ここで、図１７を用いて、重畳表示メタデータのデータ構造について説明する。図１７は、重畳表示メタデータのデータ構造である。 (Superimposed display metadata)
Here, the data structure of the superimposed display metadata will be described with reference to FIG. FIG. 17 shows the data structure of the superimposed display metadata.

図１７に示されているように、重畳表示メタデータは、正距円筒射影画像情報、平面画像情報、基準形状情報、複数の基準形状変換情報１〜Ｎ（Ｎ：正数）、及びメタデータ作成情報によって構成されている。 As shown in FIG. 17, the superimposed display metadata includes equirectangular projection image information, planar image information, reference shape information, a plurality of reference shape conversion information 1 to N (N: positive number), and metadata. Consists of creation information.

これらのうち、正距円筒射影画像情報は、特殊撮影装置１で撮影された動画データに付随しているメタデータの情報である。正距円筒射影画像情報は、画像識別情報及び付属情報を含んでいる。正距円筒射影画像情報における画像識別情報は、正距円筒射影画像を識別するための画像識別情報である。図１７では、正距円筒射影画像情報における画像識別情報の一例として、画像のファイル名が示されているが、画像を一意に識別するための画像ＩＤであってもよい。 Among these, the equirectangular projection image information is metadata information attached to the moving image data photographed by the special photographing apparatus 1. The equirectangular projection image information includes image identification information and attached information. The image identification information in the equirectangular projection image information is image identification information for identifying the equirectangular projection image. In FIG. 17, the file name of the image is shown as an example of the image identification information in the equirectangular projection image information, but an image ID for uniquely identifying the image may be used.

また、正距円筒射影画像情報における付属情報は、正距円筒射影画像情報に付属する関連情報である。図１７では、付属情報の一例として、特殊撮影装置１で撮影された際に得られた正距円筒射影画像データの姿勢補正情報(Pitch，Yaw，Roll)が示されている。この姿勢補正情報は、特殊撮影装置１の画像記録フォーマットとして規格化されているExif(Exchangeable image file format)で格納されている場合があり、GPano(Google Photo Sphere schema)で提供されている各種フォーマットで格納されている場合もある。全天球画像は、同じ位置で撮影すれば、姿勢が異なっていても３６０°全方位の画像が撮影できるという特徴があるが、全天球画像ＣＥを表示する場合に、姿勢情報や、どこを画像の中心にするか（注視点）を指定しなければ表示位置が決まらない。そのため、一般的には天頂が撮影者の真上に来るように全天球画像ＣＥを補正して表示する。これにより、水平線が真っ直ぐに補正された自然な表示が可能となる。なお、本実施形態で扱う画像は動画像であるため、動画の各フレームに対して姿勢情報を保存しておき、フレームごとに補正するようにしてもよい。 The attached information in the equirectangular projection image information is related information attached to the equirectangular projection image information. In FIG. 17, posture correction information (Pitch, Yaw, Roll) of equirectangular projection image data obtained when captured by the special imaging device 1 is shown as an example of attached information. This posture correction information may be stored in the Exif (Exchangeable image file format) standardized as the image recording format of the special imaging device 1, and various formats provided by GPano (Google Photo Sphere schema) It may be stored in. An omnidirectional image has a feature that if it is taken at the same position, an image of 360 ° omnidirectional image can be taken even if the posture is different. The display position is not determined unless the center of the image is specified (gaze point). Therefore, generally, the omnidirectional image CE is corrected and displayed so that the zenith is directly above the photographer. As a result, a natural display in which the horizon is straightly corrected is possible. Note that since the image handled in the present embodiment is a moving image, posture information may be stored for each frame of the moving image and corrected for each frame.

次に、平面画像情報は、一般撮影装置３で撮影された動画データに基づく情報である。平面画像情報は、画像識別情報及び付属情報を含んでいる。平面画像情報における画像識別情報は、平面画像Ｐを識別するための画像識別情報である。図１７では、画像識別情報の一例として、画像のファイル名が示されているが、画像を識別するための画像ＩＤであってもよい。 Next, the planar image information is information based on moving image data photographed by the general photographing device 3. The planar image information includes image identification information and attached information. The image identification information in the planar image information is image identification information for identifying the planar image P. In FIG. 17, the file name of the image is shown as an example of the image identification information, but an image ID for identifying the image may be used.

また、平面画像情報における付属情報は、平面画像情報に付属する関連情報である。図１７では、平面画像情報における付属情報の一例として、３５ｍｍ換算焦点距離の値が示されている。３５ｍｍ換算焦点距離の値は、全天球画像ＣＥに平面画像Ｐを重畳して表示するために必須ではないが、重畳表示を行う場合に表示する画角を決めるための参考情報となるため、例として挙げている。また動画においては、ズーム処理を行うと焦点距離が変化するため、動画のフレームごとに対応する焦点距離を付属情報として記録しても良い。 The attached information in the planar image information is related information attached to the planar image information. In FIG. 17, the value of the 35 mm equivalent focal length is shown as an example of the attached information in the planar image information. The value of the 35 mm equivalent focal length is not essential for displaying the celestial sphere image CE by superimposing the planar image P, but serves as reference information for determining the angle of view to be displayed when superimposing display is performed. Take as an example. In addition, since the focal length of a moving image changes when zoom processing is performed, the focal length corresponding to each frame of the moving image may be recorded as attached information.

次に、基準形状情報は、基準形状を構成する情報として、その領域分割情報と、各格子領域の格子点の位置（基準位置パラメータ）を含んでいる。これらのうち、領域分割情報は基準形状ＢＦを複数の格子領域に分割する際の水平（経度）方向の分割数及び垂直（緯度）方向の分割数を示している。 Next, the reference shape information includes the area division information and the position of the lattice point (reference position parameter) of each lattice area as information constituting the reference shape. Among these, the region division information indicates the number of divisions in the horizontal (longitude) direction and the number of divisions in the vertical (latitude) direction when dividing the reference shape BF into a plurality of lattice regions.

また、基準位置パラメータは、後述する基準形状変換情報１〜Ｎと共に利用され、基準形状変換情報ｉにおける動画フレーム特定情報により抽出することができる平面画像Ｐｉを格子状の複数の領域に分割した場合の各格子点が、同じく基準形状変換情報ｉにより全天球動画ＣＥから抽出することができる正距円筒射影画像ＥＣｉのどの位置に配置されるかを示す頂点マッピング情報を算出するためのパラメータである。 The reference position parameter is used together with reference shape conversion information 1 to N described later, and a planar image Pi that can be extracted by moving picture frame specifying information in the reference shape conversion information i is divided into a plurality of grid-like regions. Is a parameter for calculating vertex mapping information indicating in which position in the equirectangular projection image ECi that can be extracted from the omnidirectional video CE by the reference shape conversion information i. is there.

次に、基準形状変換情報は、基準形状回転情報、基準形状変倍情報、基準形状移動情報、動画フレーム特定情報を含んでいる。基準形状回転情報、基準形状変倍情報、及び基準形状移動情報については、上述した通りである。 Next, the reference shape conversion information includes reference shape rotation information, reference shape scaling information, reference shape movement information, and moving image frame specifying information. The reference shape rotation information, the reference shape scaling information, and the reference shape movement information are as described above.

動画フレーム特定情報は、作成用動画フレーム抽出部５４８によって生成され、正距円筒射影画像情報の画像識別情報で特定される動画と、平面画像情報の画像識別情報で特定される動画とから、フレーム画像をそれぞれ抽出するために使用される。基準形状の回転、変倍、移動の各情報は基準形状情報の位置パラメータで示される正距円筒射影画像上に対応する位置を、回転、変倍、又は移動させることで、平面画像Ｐｉを格子状の複数の領域に分割した場合の各格子点を、正距円筒射影画像ＥＣｉ（全天球動画ＣＥから抽出される１つのフレーム画像）のどの位置に配置されるかを示す上述の頂点マッピング情報に変換する。基準形状変換情報算出部５６８は、動画の全てのフレームに対して基準形状変換情報を算出するのではなく、前景動画を適当にサンプリングした動画のフレーム（平面画像Ｐｉ）に対して基準形状変換情報を算出するため、この例では１〜Ｎまでの基準形状変換情報を持つ構成となっている。また、基準形状変換情報算出部５６８は、前景動画において、フレーム間の変化が大きいときは、前景動画のフレームを時間軸で細かくサンプリングし、フレーム間の変化が小さいときは、前記第２の動画のフレームを粗くサンプリングする。 The moving image frame specifying information is generated by the creating moving image frame extracting unit 548, and the frame is determined from the moving image specified by the image identification information of the equirectangular projection image information and the moving image specified by the image identification information of the planar image information. Used to extract each image. Each information on rotation, scaling, and movement of the reference shape is obtained by rotating, scaling, or moving the corresponding position on the equirectangular projection image indicated by the position parameter of the reference shape information, so that the planar image Pi is latticed. The above-described vertex mapping indicating where each lattice point when divided into a plurality of regions is arranged in an equirectangular projection image ECi (one frame image extracted from the omnidirectional video CE) Convert to information. The reference shape conversion information calculation unit 568 does not calculate the reference shape conversion information for all the frames of the moving image, but the reference shape conversion information for the frame of the moving image (planar image Pi) appropriately sampled from the foreground moving image. In this example, reference shape conversion information of 1 to N is provided. Further, the reference shape conversion information calculation unit 568 samples the frames of the foreground moving image finely on the time axis when the change between frames is large in the foreground moving image, and the second moving image when the change between frames is small. Roughly sample the frames.

次に、メタデータ作成情報は、重畳表示メタデータのバージョンを示すバージョン情報を示している。重畳表示メタデータの構成を、上述した構成とすることで、動画の各フレームに係る画像における重畳位置の算出に必要となるパラメータのデータサイズを少なくし、さらに動画の全てのフレームに対しパラメータを持つのではなく、適当にサンプリングしたフレームに対する基準形状変換情報を記録することで、必要となるパラメータのデータ量を劇的に減らすことができる。そして、そのパラメータはビューア等で行われる重畳表示を行う際に、画像生成処理のための演算処理が従来処理よりも軽減できるようなパラメータセットとして提供することが考慮されている。具体例としては、２Ｄ(2-Dimensions)及び３Ｄ(3-Dimensions)のデータを視覚化するために使用するグラフィックスライブラリであるOpenGLで取り扱うことができるデータを算出するための演算処理を考慮している。このように、外部プログラムで取り扱うことが可能なメタデータとして記録しているため、外部プログラムが時系列の回転、移動、変倍の基準形状変換情報を読み込み、補間して利用することで連続的な重畳処理がリアルタイムの処理で実現することができるという効果を奏する。 Next, the metadata creation information indicates version information indicating the version of the superimposed display metadata. By configuring the superimposed display metadata as described above, the data size of the parameters required for calculating the superimposed position in the image related to each frame of the moving image is reduced, and the parameters are set for all the frames of the moving image. By recording the reference shape conversion information for an appropriately sampled frame instead of having it, the required parameter data amount can be dramatically reduced. Then, it is considered that the parameter is provided as a parameter set that can reduce the arithmetic processing for the image generation processing as compared with the conventional processing when performing superimposed display performed by a viewer or the like. As a specific example, consider an arithmetic process for calculating data that can be handled by OpenGL, a graphics library used to visualize 2D (2-Dimensions) and 3D (3-Dimensions) data. ing. In this way, since it is recorded as metadata that can be handled by an external program, the external program reads the reference shape conversion information of time series rotation, movement, and scaling, and uses it by interpolating it continuously. The effect that the superimposition processing can be realized by real-time processing is achieved.

｛重畳部の機能構成｝
続いて、図１６を用い、重畳部５５ｂの機能構成について説明する。重畳部５５ｂは、再生時間管理部５７１、再生用動画フレーム抽出部５７２、補間部５７４、変換位置パラメータ算出部５７６、形状変換部５７８、補正パラメータ作成部５８０、貼付領域作成部５８２、補正部５８４、画像作成部５８６、画像重畳部５８８、及び射影変換部５９０を有している。 {Functional structure of superimposition unit}
Subsequently, the functional configuration of the superimposing unit 55b will be described with reference to FIG. The superimposing unit 55b includes a reproduction time management unit 571, a reproduction moving image frame extraction unit 572, an interpolation unit 574, a conversion position parameter calculation unit 576, a shape conversion unit 578, a correction parameter creation unit 580, a pasting region creation unit 582, and a correction unit 584. , An image creating unit 586, an image superimposing unit 588, and a projective transformation unit 590.

これらのうち、再生時間管理部５７１は、基準となる動画再生時間を管理して、現在の時間である再生時刻（又は、動画フレームカウント値）を出力する。例えば30fpsで動画を再生したい場合は、再生時間管理部５７１は、１秒間に１／３０秒ずつ時間を増加させて３０回出力する。ここで、背景動画と前景動画の同期について、簡単に例示を用いて説明する。いずれの画像であっても良いが、基準となる画像を特定し、当該基準となる画像から、他方の画像の時間的な差分を用いて、同期を合わせることができる。例えば、背景動画（正距離円筒射影画像EC）の一枚目を基準となる画像として、前景動画（平面画像Ｐ）の記録開始が0.5秒早ければ時間的な差分は−0.5秒とし、逆に平面動画の記録開始が0.5秒遅ければ時間の差分は＋0.5秒として、メタデータに記録させることもできる。この時間的な差分を元にフレーム数のオフセットを計算し、同期させて再生表示させることが可能となる。なお、再生表示する際に、記録時間が遅い方に合わせて、動画の読み出しをスキップして、同期させて再生表示させたり、背景動画と前景動画の時間的な差分をゼロとなった画像から表示させたりするようにしても良い。 Among these, the playback time management unit 571 manages the video playback time serving as a reference, and outputs the playback time (or video frame count value) that is the current time. For example, when it is desired to reproduce a moving image at 30 fps, the reproduction time management unit 571 increases the time by 1/30 second per second and outputs it 30 times. Here, the synchronization of the background moving image and the foreground moving image will be briefly described using an example. Any image may be used, but a reference image can be specified, and synchronization can be synchronized from the reference image using a temporal difference between the other images. For example, if the recording start of the foreground moving image (planar image P) is 0.5 seconds earlier as the first image of the background moving image (positive-distance cylindrical projection image EC), the temporal difference is -0.5 seconds, conversely If the recording start of a planar video is delayed by 0.5 seconds, the time difference can be set to +0.5 seconds and recorded in the metadata. An offset of the number of frames can be calculated based on this temporal difference, and can be reproduced and displayed in synchronization. In addition, when playing back and displaying, in accordance with the later recording time, skip the reading of the video and display it synchronously, or from the image where the temporal difference between the background video and the foreground video is zero You may make it display.

再生用動画フレーム抽出部５７２は、再生時間管理部５７１で管理されている時間に基づき、背景動画（正距円筒射影画像ＥＣ）となる一方の動画像と、前景動画（平面画像Ｐ）となる他方の動画像のそれぞれから、それぞれ指定された時刻に対応するフレームを抽出する。 Based on the time managed by the reproduction time management unit 571, the reproduction moving image frame extraction unit 572 becomes one moving image that becomes a background moving image (an equirectangular projection image EC) and a foreground moving image (a planar image P). A frame corresponding to each designated time is extracted from each of the other moving images.

補間部５７４は、再生時間管理部５７１による動画におけるフレーム再生の時間（又は、動画フレームカウント値）に合わせて、メタデータ生成部５５ａで生成された各基準形状変換情報を読み取り、必要に応じて補間（内挿）することで、動画のフレームの再生時間毎に形状変換パラメータを算出する。この形状変換パラメータの形式は、上述の基準形状変換情報と同じであるが、値は補間結果を示す。補間方法は、再生時間に一番近い前後の基準形状変換情報を読出し、例えば線形補間をして求める。メタデータ作成部５５ａでは、例えば、作成用動画フレーム抽出部５４８が１fpsで動画のフレームを抽出することで、重畳表示メタデータ作成部５７０が１秒間に１つの基準形状変換情報を作成している場合であって、再生時間管理部５７１が１秒間に３０フレームを出力している場合、補間部５７４は２９フレーム分の補間を行う。 The interpolation unit 574 reads each reference shape conversion information generated by the metadata generation unit 55a in accordance with the frame reproduction time (or moving image frame count value) in the moving image by the reproduction time management unit 571, and if necessary, By performing interpolation (interpolation), a shape conversion parameter is calculated for each reproduction time of a moving image frame. The form of this shape conversion parameter is the same as the above-described reference shape conversion information, but the value indicates the interpolation result. The interpolation method is obtained by reading the reference shape conversion information before and after the reproduction time closest to the reproduction time, for example, by linear interpolation. In the metadata creation unit 55a, for example, the creation moving image frame extraction unit 548 extracts a moving image frame at 1 fps, so that the superimposed display metadata creation unit 570 creates one reference shape conversion information per second. In this case, when the reproduction time management unit 571 outputs 30 frames per second, the interpolation unit 574 performs interpolation for 29 frames.

変換位置パラメータ算出部５７６は、補間部５７４で得られた各形状変換パラメータを用いて、メタデータ生成部５５ａで生成された基準形状情報（基準位置パラメータ）を変換した結果である変換位置パラメータを算出する。この算出は、動画の各フレームの再生時間毎に行われる。 The conversion position parameter calculation unit 576 uses the shape conversion parameters obtained by the interpolation unit 574 to convert the conversion position parameter that is the result of converting the reference shape information (reference position parameter) generated by the metadata generation unit 55a. calculate. This calculation is performed for each playback time of each frame of the moving image.

ところで、変換位置パラメータは算出されたが、このまま重畳表示を行う場合、正距円筒射影画像ＥＣと平面画像Ｐとで明るさや色味が大きく異なる場合に、不自然な重畳表示となることがある。そのため、以下に示す形状変換部５７８及び補正パラメータ作成部５８０は、明るさや色味が大きく異なる場合であっても、不自然な重畳表示となることを防止する役割を果たす。 By the way, although the conversion position parameter has been calculated, when superimposing display is performed as it is, when the equirectangular projection image EC and the planar image P are greatly different in brightness and color, an unnatural superimposition display may occur. . Therefore, the shape conversion unit 578 and the correction parameter creation unit 580 described below play a role of preventing unnatural superimposition display even when the brightness and color tone are greatly different.

形状変換部５７８は、変換位置パラメータ算出部５７６で算出された変換位置パラメータで特定される正距円筒射影画像ＥＣ上の領域を、平面画像Ｐと同じ矩形の形状に変換することで、第３の対応領域ＣＡ３を作成する。 The shape conversion unit 578 converts the region on the equirectangular projection image EC specified by the conversion position parameter calculated by the conversion position parameter calculation unit 576 into the same rectangular shape as the planar image P, thereby Corresponding area CA3 is created.

補正パラメータ作成部５８０は、図２６（ａ）に示されているように、第３の対応領域ＣＡ３を、領域分割部５６０の処理と同様に重畳表示メタデータの領域分割数情報に基づいて分割することで各格子領域ＬＡ３を作成する。また、補正パラメータ作成部５８０は、図２６（ｂ）に示されているように、平面画像Ｐを、領域分割部５６０の処理と同様に重畳表示メタデータの領域分割数情報に基づいて分割することで各格子領域ＬＡ０を作成する。図２６（ｂ）では、（８×８＝）６４個の格子領域ＬＡ０が作成されている。更に、補正パラメータ作成部５８０は、同じ形状に変換後の第３の対応領域ＣＡ３における各格子領域ＬＡ２’の明るさ値及び色値に対して、各格子領域ＬＡ３と同じ形状である平面画像Ｐの各格子領域ＬＡ０の明るさ値及び色値を合わせるための補正パラメータ（補正情報の一例）を作成する。具体的には、補正パラメータ作成部５８０は、各格子領域ＬＡ０の１点を共通に格子点としてもつ４つの格子領域を構成する全ての画素の明るさ値及び色値（Ｒ，Ｇ，Ｂ)の平均値ａ= (Ｒ_ave，Ｇ_ave，Ｂ_ave）を算出し、各格子領域ＬＡ３’の角の１点を共通に格子点としてもつ４つの格子領域を構成する全ての画素の明るさ値及び色値（Ｒ’，Ｇ’，Ｂ’）の平均値ａ’ = (Ｒ’_ave，Ｇ’_ave，Ｂ’_ave）を算出する。図２７には、これら４つの格子領域（ＬＡ３ａ’，ＬＡ３ｂ’，ＬＡ３ｃ’，ＬＡ３ｄ’）と角の１点である共通の格子点ＬＰ３’が示されている。なお、上記各格子領域ＬＡ０の１点および各格子領域ＬＡ３の１点が第３の対応領域ＣＡ３および第３の対応領域ＣＡ３の４隅の場合、補正パラメータ作成部５８０は、１つの格子領域から明るさ値および色値の平均値ａおよび平均値ａ’を算出する。また、上記各格子領域ＬＡ０の１点および各格子領域ＬＡ３の１点が第３の対応領域ＣＡ３の外周の点の場合、補正パラメータ作成部５６６は、内側２つの格子領域から明るさ値および色値の平均値aおよび平均値ａ’を算出する。そして、補正パラメータは、本実施形態では、平面画像Ｐの明るさ値及び色値を補正するためのゲインデータとなるため、以下の（式１４）に示されているように、平均値ａ’を平均値ａで除算することで、補正パラメータＰａを求める。
Pa = a’/ a・・・（式１４）
これにより、後述の重畳表示において、補正パラメータが示すゲイン値を、格子領域ＬＡ０毎に乗算することで、平面画像Ｐの色味および明るさが、正距円筒射影画像ＥＣ（全天球画像ＣＥ）の画素値と近くなり、見た目が違和感なく重畳表示することが可能となる。なお、補正パラメータは、平均値から算出されるだけでなく、平均値に代えて又は平均値に加えて、中央値、最頻値等を使って算出されるようにしてもよい。 As shown in FIG. 26A, the correction parameter creation unit 580 divides the third corresponding area CA3 based on the area division number information of the superimposed display metadata as in the process of the area division unit 560. Thus, each lattice area LA3 is created. Further, as illustrated in FIG. 26B, the correction parameter creation unit 580 divides the planar image P based on the area division number information of the superimposed display metadata as in the process of the area division unit 560. Thus, each lattice area LA0 is created. In FIG. 26B, (8 × 8 =) 64 lattice areas LA0 are created. Furthermore, the correction parameter creation unit 580 has the same shape as that of each lattice region LA3 with respect to the brightness value and the color value of each lattice region LA2 ′ in the third corresponding region CA3 after being converted into the same shape. A correction parameter (an example of correction information) for matching the brightness value and the color value of each lattice area LA0 is created. Specifically, the correction parameter creation unit 580 has brightness values and color values (R, G, B) of all the pixels constituting the four grid areas having one grid area LA0 as a grid point in common. The average value a = (R _ave , G _ave , B _ave ) is calculated, and the brightness values of all the pixels constituting the four lattice areas having one common corner point of each lattice area LA3 ′. The average value a ′ = (R ′ _ave , G ′ _ave , B ′ _ave ) of the color values (R ′, G ′, B ′) is calculated. FIG. 27 shows these four lattice regions (LA3a ′, LA3b ′, LA3c ′, LA3d ′) and a common lattice point LP3 ′ that is one corner. When one point of each lattice area LA0 and one point of each lattice area LA3 are the four corners of the third corresponding area CA3 and the third corresponding area CA3, the correction parameter creation unit 580 An average value a and an average value a ′ of the brightness value and the color value are calculated. When one point of each of the lattice areas LA0 and one point of each of the lattice areas LA3 is a point on the outer periphery of the third corresponding area CA3, the correction parameter creation unit 566 calculates brightness values and colors from the two inner lattice areas. An average value a and an average value a ′ are calculated. In this embodiment, the correction parameter is gain data for correcting the brightness value and the color value of the planar image P. Therefore, as shown in the following (Expression 14), the average value a ′ Is divided by the average value a to obtain the correction parameter Pa.
Pa = a '/ a (Formula 14)
As a result, in the superimposed display described later, the gain value indicated by the correction parameter is multiplied for each lattice area LA0, so that the color and brightness of the planar image P become the equirectangular projection image EC (the omnidirectional image CE). ) And the pixel value are close to each other, and the display can be superimposed and displayed without a sense of incongruity. The correction parameter is not only calculated from the average value, but may be calculated using a median value, a mode value, or the like instead of or in addition to the average value.

また、本実施形態では明るさ値および色値の補正値の算出に画素値（R，G，B）を使用したが、輝度および色差信号であるYUVフォーマットや、JPEGのsYCC(YCbCr)フォーマット等における輝度値、色差値を用いて、同様の方法にて格子領域を構成する全ての画素の輝度値および色差値の平均値を求め、その平均値を除算することにより後述の重畳表示における補正パラメータとしてもよい。なお、RGB値からYUV、sYCC(YCbCr)に変換する方法は公知であるため詳細は省略するが、参考として（式１５）を用いて、JPEG圧縮画像のフォーマットJFIF(JPEG file interchange format)形式のRGBからYCbCrへ変換する例を挙げる。 In this embodiment, the pixel values (R, G, B) are used for calculating the correction values for the brightness value and the color value. However, the YUV format for luminance and color difference signals, the JPEG sYCC (YCbCr) format, etc. Using the luminance value and color difference value in, the average value of the luminance value and color difference value of all the pixels constituting the grid area is obtained in the same way, and the average value is divided to correct the correction parameter in the superimposed display described later It is good. The method of converting RGB values to YUV, sYCC (YCbCr) is well known and will not be described in detail. However, using (Equation 15) as a reference, JPEG compressed image format JFIF (JPEG file interchange format) format is used. Here is an example of converting from RGB to YCbCr.

補正部５８４は、補正パラメータ作成部５８０で算出された補正パラメータに基づいて、平面画像Ｐの明るさ値及び色値を、正距円筒射影画像ＥＣの明るさ値及び色値に合わせる補正を行なうことで、補正画像Ｃを作成する。なお、補正部５８４は、必ずしも明るさや色の補正を行なう必要はない。また、補正パラメータが補正を行なう場合、明るさの補正を行なっても、色の補正を行なわなくてもよい。 Based on the correction parameter calculated by the correction parameter creation unit 580, the correction unit 584 performs correction to match the brightness value and color value of the planar image P with the brightness value and color value of the equirectangular projection image EC. Thus, the corrected image C is created. Note that the correction unit 584 does not necessarily have to correct brightness and color. In addition, when the correction parameter is corrected, the brightness may be corrected or the color may not be corrected.

一方、貼付領域作成部５８２は、変換位置パラメータ算出部５７６によって算出された変換位置パラメータに基づいて、仮想の立体球ＣＳにおいて平面画像Ｐを貼り付ける領域部分（以下、「部分立体球」という）ＰＳを作成する。 On the other hand, the pasting region creation unit 582 is a region portion (hereinafter, referred to as “partial solid sphere”) to which the planar image P is pasted in the virtual solid sphere CS based on the conversion position parameter calculated by the conversion position parameter calculation unit 576. Create PS.

画像作成部５８６は、部分立体球ＰＳに対して、平面画像Ｐ（または、平面画像Ｐを補正した後の補正画像Ｃ）を貼り付けることで、重畳画像Ｓを作成する。また、画像作成部５８６は、部分立体球ＰＳの領域に基づいて、マスクデータＭを作成する。更に、画像作成部５８６は、立体球ＣＳに対して、正距円筒射影画像ＥＣを貼り付けることで全天球画像ＣＥを作成する。マスクデータＭは、重畳画像Ｓを全天球画像ＣＥに重畳した場合の境界周辺の明るさ及び色を、内側の重畳画像Ｓ側から外側の全天球画像ＣＥ側に徐々に近づけるために、マスク周辺の透過度が、内側から外側に向けて徐々に重畳画像Ｓ寄りから全天球画像ＣＥ寄りに高くなっている。これにより、全天球画像ＣＥに重畳画像Ｓが重畳されても、極力、重畳されたことが分からないようにすることができる。なお、重畳画像Ｓを全天球画像ＣＥに重畳した場合の境界周辺の明るさを徐々に変更しない場合には、マスクデータＭの作成は、必須ではない。 The image creating unit 586 creates the superimposed image S by pasting the planar image P (or the corrected image C after correcting the planar image P) to the partial solid sphere PS. The image creation unit 586 creates mask data M based on the region of the partial solid sphere PS. Furthermore, the image creation unit 586 creates the omnidirectional image CE by pasting the equirectangular projection image EC to the solid sphere CS. The mask data M is used to gradually bring the brightness and color around the boundary when the superimposed image S is superimposed on the omnidirectional image CE from the inner superimposed image S side to the outer omnidirectional image CE side. The transmittance around the mask gradually increases from the superimposed image S to the omnidirectional image CE from the inside to the outside. Thereby, even if the superimposition image S is superimposed on the omnidirectional image CE, it is possible to prevent the superimposition from being recognized as much as possible. Note that the creation of the mask data M is not essential when the brightness around the boundary when the superimposed image S is superimposed on the omnidirectional image CE is not gradually changed.

画像重畳部５８８は、全天球画像ＣＥに対して重畳画像Ｓ及びマスクデータＭを重畳する。これにより、境目が目立たないように高精細の重畳画像Ｓが重畳された低精細の全天球画像ＣＥが完成する。 The image superimposing unit 588 superimposes the superimposed image S and the mask data M on the omnidirectional image CE. Thereby, the low-definition omnidirectional image CE on which the high-definition superimposed image S is superimposed so that the boundary is not conspicuous is completed.

射影変換部５９０は、図７に示されているように、予め定められた仮想カメラＩＣの視線方向（所定領域Ｔの中心点ＣＰ）と所定領域Ｔの画角αに基づいて、重畳画像Ｓが重畳された状態の全天球画像ＣＥにおける所定領域Ｔをディスプレイで見ることができるように射影変換する。また、射影変換部５９０は、射影変換する際に、所定領域Ｔをディスプレイにおける表示領域の解像度に合わせる処理も行なう。具体的には、所定領域Ｔの解像度がディスプレイの表示領域の解像度よりも小さい場合には、射影変換部５９０は、所定領域Ｔをディスプレイの表示領域に合わせるように拡大する。一方、所定領域Ｔの解像度がディスプレイの表示領域の解像度よりも大きい場合には、射影変換部５９０は、所定領域Ｔをディスプレイの表示領域に合わせるように縮小する。これにより、表示制御部は、ディスプレイの表示領域の全体に亘って、所定領域Ｔを示す所定領域画像Ｑを表示することができる。 As shown in FIG. 7, the projective transformation unit 590 generates a superimposed image S based on a predetermined line-of-sight direction of the virtual camera IC (a center point CP of the predetermined area T) and the angle of view α of the predetermined area T. Projective transformation is performed so that the predetermined area T in the omnidirectional image CE in a state where is superimposed can be seen on the display. Projection conversion unit 590 also performs processing for matching predetermined area T to the resolution of the display area on the display when performing projective conversion. Specifically, when the resolution of the predetermined area T is smaller than the resolution of the display area of the display, the projection conversion unit 590 enlarges the predetermined area T so as to match the display area of the display. On the other hand, when the resolution of the predetermined area T is larger than the resolution of the display area of the display, the projective transformation unit 590 reduces the predetermined area T so as to match the display area of the display. Thereby, the display control unit can display the predetermined area image Q indicating the predetermined area T over the entire display area of the display.

＜＜実施形態の処理又は動作＞＞
続いて、図１８乃至図３４を用いて、本実施形態の処理又は動作について説明する。まず、図１８を用いて、撮影システムが実行する撮影方法を説明する。図１８は、本実施形態に係る撮影方法を示したシーケンス図である。なお、以下では、被写体や風景等の撮影を行なう場合について説明するが、撮影と同時に集音部１４によって周囲の音を録音してもよい。 << Processing or Operation of Embodiment >>
Subsequently, the processing or operation of the present embodiment will be described with reference to FIGS. First, a photographing method executed by the photographing system will be described with reference to FIG. FIG. 18 is a sequence diagram illustrating the photographing method according to the present embodiment. In the following description, a case where a subject, landscape, or the like is photographed will be described. However, ambient sounds may be recorded by the sound collection unit 14 simultaneously with photographing.

図１８に示されているように、スマートフォン５の受付部５２は、利用者から連携撮影開始を受け付ける（ステップＳ１１）。この場合、図１４（ｂ）に示されているように、表示制御部５６が、連携撮影装置設定画面をディスプレイ５１７に表示させる。この画面には、撮影装置毎に、連携撮影する際のメインの撮影装置を指定するためのラジオボタン、連携撮影する際のサブの撮影装置を指定（選択）するためのチェックボックスが表示されている。更に、撮影装置毎に、撮影装置の装置名及び電波の受信強度が表示されている。そして、利用者が所望の撮影装置をメイン及びサブとして指定（選択）して、「確定」ボタンを押下することで、受付部が連携撮影開始を受け付ける。なお、サブの撮影装置は複数であってもよいため、チェックボックスにして複数の撮影装置を指定（選択）することができるようになっている。 As illustrated in FIG. 18, the reception unit 52 of the smartphone 5 receives the start of cooperative shooting from the user (step S11). In this case, as shown in FIG. 14B, the display control unit 56 causes the display 517 to display the cooperative photographing apparatus setting screen. On this screen, a radio button for designating a main imaging device for linked shooting and a check box for designating (selecting) a sub imaging device for linked shooting are displayed for each imaging device. Yes. Further, for each photographing apparatus, the name of the photographing apparatus and the reception intensity of the radio wave are displayed. Then, when the user designates (selects) a desired photographing apparatus as a main and a sub and presses a “confirm” button, the accepting unit accepts the start of cooperative photographing. Since there may be a plurality of sub photographing apparatuses, a plurality of photographing apparatuses can be designated (selected) by using check boxes.

そして、スマートフォン５の近距離通信部５８は、一般撮影装置３の近距離通信部３８に対して、撮影開始を要求するための撮影開始要求情報を送信する（ステップＳ１２）。また、また、スマートフォン５の近距離通信部５８は、特殊撮影装置１の近距離通信部１８に対して、撮影開始を要求するための撮影開始要求情報を送信する（ステップＳ１３）。 And the short distance communication part 58 of the smart phone 5 transmits the imaging | photography start request information for requesting a photography start with respect to the short distance communication part 38 of the general imaging device 3 (step S12). Further, the short-range communication unit 58 of the smartphone 5 transmits shooting start request information for requesting the start of shooting to the short-range communication unit 18 of the special imaging device 1 (step S13).

次に、一般撮影装置３は、撮影を開始する（ステップＳ１４）。この撮影の処理は、撮像部３３が被写体や風景等を撮像することで動画を構成するフレームデータである撮影画像データ（ここでは、平面画像データ）を取得し、記憶・読出部３９が記憶部３０００に撮影画像データを記憶するまでの処理である。そして、一般撮影装置３の近距離通信部３８は、スマートフォン５に対して、ステップＳ１４によって得られた平面画像データを送信する（ステップＳ１５）。この際、送信される平面画像データを識別するための画像識別情報、及び付属情報も送信される。これにより、スマートフォン５の近距離通信部５８は、平面画像データ、画像識別情報、及び付属情報を受信する。 Next, the general photographing apparatus 3 starts photographing (step S14). In this photographing process, the image capturing unit 33 captures photographed image data (here, planar image data) which is frame data constituting a moving image by capturing an object, a landscape, and the like, and the storage / reading unit 39 stores the storage unit. This is a process until the captured image data is stored in 3000. And the short distance communication part 38 of the general imaging device 3 transmits the plane image data obtained by step S14 with respect to the smart phone 5 (step S15). At this time, image identification information for identifying the plane image data to be transmitted, and attached information are also transmitted. Thereby, the near field communication part 58 of the smart phone 5 receives plane image data, image identification information, and attached information.

一方、特殊撮影装置１は、撮影を開始する（ステップＳ１６）。この撮影の処理は、撮像部１３が被写体や風景等を撮影することで動画を構成するフレームデータである撮影画像データ（図３（ａ），（ｂ）に示されているような２つの半球画像データ）を生成し、画像・音処理部１５が、２つの半球画像データに基づいて、図３（ｃ）に示されているような単一の正距円筒射影画像データを作成して、記憶・読出部１９が記憶部１０００に正距円筒射影画像データを記憶するまでの処理である。そして、特殊撮影装置１の近距離通信部１８は、スマートフォン５に対して、ステップＳ１６によって得られた正距円筒射影画像データを送信する（ステップＳ１７）。この際、送信される正距円筒射影画像データを識別するための画像識別情報、及び付属情報も送信される。これにより、スマートフォン５の近距離通信部５８は、正距円筒射影画像データ、画像識別情報、及び付属情報を受信する。なお、正距円筒射影画像データは、特殊撮影装置１で作成されるのではなく、スマートフォン５で作成されるようにしてもよい。この場合、特殊撮影装置１からスマートフォン５には、２つの半球画像データ、画像識別情報、及び付属情報が送信される。 On the other hand, the special photographing apparatus 1 starts photographing (step S16). This shooting process is performed by shooting image data (two hemispheres as shown in FIGS. 3A and 3B) that are frame data that constitutes a moving image by the imaging unit 13 shooting a subject, a landscape, or the like. Image / sound processing unit 15 creates a single equirectangular projection image data as shown in FIG. 3C based on the two hemispherical image data, This is processing until the storage / reading unit 19 stores equirectangular projection image data in the storage unit 1000. Then, the short-range communication unit 18 of the special imaging device 1 transmits the equirectangular projection image data obtained in step S16 to the smartphone 5 (step S17). At this time, image identification information for identifying the equirectangular projection image data to be transmitted, and attached information are also transmitted. Thereby, the near field communication part 58 of the smart phone 5 receives equirectangular projection image data, image identification information, and attached information. The equirectangular projection image data may be created by the smartphone 5 instead of being created by the special imaging device 1. In this case, two hemispherical image data, image identification information, and attached information are transmitted from the special imaging device 1 to the smartphone 5.

次に、スマートフォン５の記憶・読出部５９は、ステップＳ１５によって受信された平面画像データの電子ファイルと、及びステップＳ１７によって受信された正距円筒射影画像データの電子ファイルを同一の電子フォルダに格納して、記憶部５０００に記憶する（ステップＳ１８）。 Next, the storage / reading unit 59 of the smartphone 5 stores the electronic file of planar image data received in step S15 and the electronic file of equirectangular projection image data received in step S17 in the same electronic folder. And it memorize | stores in the memory | storage part 5000 (step S18).

次に、スマートフォン５の画像・音処理部５５は、低精細画像である全天球画像ＣＥの一部の領域に高精細画像である平面画像Ｐを重畳して表示する際に利用する、重畳表示用メタデータを作成する（ステップＳ１９）。この際、記憶・読出部５９が、記憶部５０００に重畳表示用メタデータを記憶する。 Next, the image / sound processing unit 55 of the smartphone 5 uses the superimposition that is used when the planar image P that is a high-definition image is superimposed and displayed on a partial region of the omnidirectional image CE that is a low-definition image. Display metadata is created (step S19). At this time, the storage / reading unit 59 stores the superimposed display metadata in the storage unit 5000.

ここで、主に、図１９乃至図２２を用いて、重畳表示メタデータの作成処理について詳細に説明する。なお、一般撮影装置３と特殊撮影装置１の撮像素子の解像度が、たとえ同じであっても、特殊撮影装置１の撮像素子は３６０°全天球画像ＣＥの元になる正距円筒射影画像を全て網羅しなければならないため、撮影画像における一定領域あたりの精細度が低くなる。 Here, the process of creating superimposed display metadata will be described in detail mainly with reference to FIGS. Even if the resolutions of the imaging devices of the general imaging device 3 and the special imaging device 1 are the same, the imaging device of the special imaging device 1 generates an equirectangular projection image that is the basis of the 360 ° omnidirectional image CE. Since all must be covered, the definition per fixed area in the captured image is low.

｛重畳表示メタデータの作成処理｝
まず、低精細な正距円筒射影画像ＥＣによって作成される全天球画像ＣＥに対して、高精細な平面画像Ｐを重畳してディスプレイ５１７に表示するための重畳表示メタデータの作成処理について説明する。 {Process for creating superimposed display metadata}
First, a description will be given of a process for creating superimposition display metadata for superimposing a high-definition plane image P on the omnidirectional image CE created by the low-definition equirectangular projection image EC and displaying it on the display 517. To do.

抽出部５５０は、正距円筒射影方式によって得られた長方形の正距円筒射影画像ＥＣにおける複数の特徴点ｆｐ１、及び透視射影方式によって得られた長方形の平面画像Ｐにおける複数の特徴点ｆｐ２を抽出する（ステップＳ１１０）。 The extraction unit 550 extracts a plurality of feature points fp1 in the rectangular equirectangular projection image EC obtained by the equirectangular projection method and a plurality of feature points fp2 in the rectangular planar image P obtained by the perspective projection method. (Step S110).

次に、第１の対応領域算出部５５２は、第１のホモグラフィ変換によって、正距円筒射影画像ＥＣにおける複数の特徴点ｆｐ１の特徴量ｆｖ１と、平面画像Ｐにおける複数の特徴点ｆｐ２の特徴量ｆｖ２との類似度に基づき、図１９に示されているように、正距円筒射影画像ＥＣにおいて、平面画像Ｐに対応する四角形の第１の対応領域ＣＡ１を算出する（ステップＳ１２０）。より具体的には、第１の対応領域算出部５５２は、算出した正距円筒射影画像ＥＣの複数の特徴点ｆｐ１の特徴量ｆｖ１と、平面画像Ｐにおける複数の特徴点ｆｐ２の特徴量ｆｖ２の類似度に基づき画像間の対応点を算出し、正距円筒射影画像ＥＣにおいて、平面画像Ｐに対応するホモグラフィを求めることで得られる第１のホモグラフィ変換によって、図１９に示されているように、正距円筒射影画像ＥＣにおいて、平面画像Ｐに対応する四角形の第１の対応領域ＣＡ１を算出する。この処理は、正距円筒射影画像ＥＣに対して射影方式が異なる平面画像Ｐを正確に対応付けることができないが、ひとまず大まかに対応位置を推定するための処理（仮決め処理）である。 Next, the first corresponding region calculation unit 552 performs the first homography conversion to feature amounts fv1 of the plurality of feature points fp1 in the equirectangular projection image EC and features of the plurality of feature points fp2 in the planar image P. Based on the similarity to the quantity fv2, as shown in FIG. 19, in the equirectangular projection image EC, a rectangular first corresponding area CA1 corresponding to the planar image P is calculated (step S120). More specifically, the first corresponding area calculation unit 552 calculates the feature amounts fv1 of the plurality of feature points fp1 of the equirectangular projection image EC and the feature amounts fv2 of the plurality of feature points fp2 in the planar image P. FIG. 19 shows a first homography transformation obtained by calculating corresponding points between images based on the similarity and obtaining a homography corresponding to the planar image P in the equirectangular projection image EC. As described above, in the equirectangular projection image EC, a quadrangular first corresponding area CA1 corresponding to the planar image P is calculated. This process is a process (provisional decision process) for roughly estimating the corresponding position for the time being, although it is impossible to accurately associate the flat image P having a different projection method with the equirectangular projection image EC.

次に、注視点特定部５５４は、平面画像Ｐの中心点ＣＰ１が第１のホモグラフィ変換後に位置する正距円筒射影画像ＥＣ上の点（注視点ＧＰ１）を特定する（ステップＳ１３０）。 Next, the gazing point identification unit 554 identifies a point (gaze point GP1) on the equirectangular projection image EC where the center point CP1 of the planar image P is located after the first homography transformation (step S130).

次に、射影方式変換部５５６は、図２０に示されているように、周辺領域画像ＰＩの垂直画角αが平面画像Ｐの対角画角αと同じになるようにすることで、結果的に周辺領域画像ＰＩが作成できるように、正距円筒射影画像ＥＣ内で注視点ＧＰ１を中心とした周辺領域ＰＡを、平面画像Ｐと同じ透視射影方式に変換する（ステップＳ１４０）。 Next, the projection method conversion unit 556 makes the result by making the vertical field angle α of the peripheral region image PI the same as the diagonal field angle α of the planar image P as shown in FIG. Thus, the peripheral area PA centered on the gazing point GP1 in the equirectangular projection image EC is converted into the same perspective projection method as that of the planar image P so that the peripheral area image PI can be created (step S140).

次に、抽出部５５０は、射影方式変換部５５６によって得られた周辺領域画像ＰＩにおける複数の特徴点ｆｐ３を抽出する（ステップＳ１５０）。 Next, the extraction unit 550 extracts a plurality of feature points fp3 in the peripheral area image PI obtained by the projection method conversion unit 556 (step S150).

次に、第２の対応領域算出部５５８は、第２のホモグラフィ変換によって、平面画像Ｐにおける複数の特徴点ｆｐ２の特徴量ｆｖ２と、周辺領域画像ＰＩにおける複数の特徴点ｆｐ３の特徴量ｆｖ３との類似度に基づき、周辺領域画像ＰＩにおいて、平面画像Ｐに対応する四角形の第２の対応領域ＣＡ２を算出する（ステップＳ１６０）。なお、平面画像Ｐは、例えば４０００万画素の高精細な画像であるため、予め適切な大きさにリサイズしておく。 Next, the second corresponding region calculation unit 558 performs the feature amount fv2 of the plurality of feature points fp2 in the planar image P and the feature amount fv3 of the plurality of feature points fp3 in the peripheral region image PI by the second homography transformation. The second corresponding area CA2 having a square shape corresponding to the planar image P is calculated in the peripheral area image PI based on the similarity to (step S160). Note that the planar image P is a high-definition image of 40 million pixels, for example, and is resized to an appropriate size in advance.

次に、初期設定形状生成部５５９は、図２１（ａ）に示されているように、長方形の対角画角βと、長方形の縦横比を定めることで、初期設定形状ＤＦを生成する（ステップＳ１７０）。 Next, as shown in FIG. 21A, the initial setting shape generation unit 559 generates the initial setting shape DF by determining the rectangular diagonal angle of view β and the rectangular aspect ratio ( Step S170).

次に、領域分割部５６０は、図２１（ｃ）に示されているように、初期設定形状ＤＦの領域を複数の格子領域に分割する（ステップＳ１８０）。 Next, the region dividing unit 560 divides the region of the initial shape DF into a plurality of lattice regions as shown in FIG. 21C (step S180).

次に、射影方式逆変換部５６２は、領域分割部５６０によって領域分割された初期設定形状ＤＦの座標を、背景動画と同じ射影方式である正距円筒射影画像ＥＣｉと同じ正距円筒射影方式に逆変換することで、正距円筒射影画像ＥＣｉにおいて、基準形状の各格子点の対応点を算出する（ステップＳ１９０）。この射影方式逆変換部５６２の処理により、各格子領域の各格子点の座標を示す基準形状情報（基準位置パラメータ）が作成される。 Next, the projection method inverse transform unit 562 converts the coordinates of the initial shape DF divided by the region dividing unit 560 into the equirectangular projection method that is the same as the equirectangular projection image ECi that is the same projection method as the background moving image. By performing inverse transformation, corresponding points of each grid point of the reference shape are calculated in the equirectangular projection image ECi (step S190). By the process of the projection method inverse transform unit 562, reference shape information (reference position parameter) indicating the coordinates of each lattice point of each lattice region is created.

一方、基準形状変換情報算出部５６８は、基準形状ＢＦを回転、変倍（拡大若しくは縮小）、又は移動することにより基準形状ＢＦを変換するための基準形状変換情報を算出する（ステップＳ２００）。 On the other hand, the reference shape conversion information calculation unit 568 calculates reference shape conversion information for converting the reference shape BF by rotating, scaling (enlarging or reducing), or moving the reference shape BF (step S200).

最後に、重畳表示メタデータ作成部５７０は、特殊撮影装置１から取得した正距円筒射影画像情報、一般撮影装置３から取得した平面画像情報、上記ステップＳ１９０で作成された基準形状情報、ステップＳ２００で算出された各基準形状変換情報、及び作成用動画フレーム抽出部５４８から出力された動画フレームカウント値に基づいて、重畳表示メタデータを作成する（ステップＳ２１０）。この重畳表示メタデータは、記憶・読出部５９によって、記憶部５０００に記憶される。 Finally, the superimposed display metadata creation unit 570 includes equirectangular projection image information acquired from the special imaging device 1, planar image information acquired from the general imaging device 3, the reference shape information generated in step S190, and step S200. Superimposition display metadata is created based on the reference shape conversion information calculated in step S4 and the moving picture frame count value output from the creation moving picture frame extraction unit 548 (step S210). The superimposed display metadata is stored in the storage unit 5000 by the storage / readout unit 59.

以上より、図１８に示されているステップＳ１９の処理が終了する。そして、記憶・読出部５９及び表示制御部５６は、重畳表示メタデータを用いて、重畳の処理を行なう（ステップＳ２０）。 Thus, the process of step S19 shown in FIG. 18 ends. Then, the storage / reading unit 59 and the display control unit 56 perform superimposition processing using the superimposition display metadata (step S20).

｛重畳の処理｝
続いて、図２３乃至図３４を用いて、重畳の処理について詳細に説明する。図２３及び図２８は、重畳表示の処理の課程における画像の概念図である。図２３は、重畳の処理の前半部分である位置パラメータ及び補正パラメータを算出する処理を示している。図２８は、重畳の処理の後半部分である全天球画像ＣＥに平面画像Ｐを重畳する処理を示している。 {Superposition processing}
Next, the superimposition process will be described in detail with reference to FIGS. 23 and 28 are conceptual diagrams of images in the process of superimposed display. FIG. 23 shows a process for calculating the position parameter and the correction parameter, which is the first half of the superimposition process. FIG. 28 shows a process of superimposing the planar image P on the omnidirectional image CE, which is the latter half of the superimposition process.

まず、記憶・読出部（取得部）が、予め、記憶部５０００から、基準形状情報及び各基準形状変換情報を読み出して取得しておく。また、記憶・読出部（取得部）が、動画再生の時間経過に合わせて、正距円筒射影方式によって得られた動画のフレームに係る正距円筒射影画像ＥＣのデータ、透視射影方式によって得られた動画のフレームに係る平面画像Ｐのデータを読み出して取得しておく。 First, the storage / reading unit (acquisition unit) reads and acquires the reference shape information and each reference shape conversion information from the storage unit 5000 in advance. In addition, the storage / reading unit (acquiring unit) is obtained by data of the equirectangular projection image EC related to the frame of the moving image obtained by the equirectangular projection method and the perspective projecting method as the moving image playback time elapses. The data of the plane image P related to the frame of the moving image is read and acquired.

そして、補間部５７４は、再生時間管理部５７１による動画再生の時間経過に合わせて、メタデータ作成部５５ａで生成された各基準形状変換情報を読み取り、必要に応じて補間することで、動画のフレームの再生時間毎に形状変換パラメータを算出する（Ｓ２７０）。 Then, the interpolation unit 574 reads each reference shape conversion information generated by the metadata creation unit 55a in accordance with the time lapse of the video playback by the playback time management unit 571, and interpolates the video as necessary, thereby interpolating the video. A shape conversion parameter is calculated for each frame reproduction time (S270).

ここで、図２４を用いて、形状変換パラメータについて詳細に説明する。図２４は、補間方法に関する説明を行う。本実施形態において基準形状変換情報は、基準形状変倍情報を除いて全て角度で表されている。ホモグラフィ行列等から回転角を求めると、図２４（ａ）に示されているように、角度の値は例えば±１８０度の範囲となる。補間の際に±１８０度の境界をまたぐと、意図しない動作になる場合がある。例えば、動画のフレームが１秒間隔で１３０度、１５０度、１７０度、−１７０度、−１５０度、−１３０度と設定されていた場合を考える。図２４（ｂ）で示すように１３０度から−１３０度に向けて反時計周りで等速度に動いていると考えるのが普通である。しかしながら、この動画のフレームを０．５秒間隔で補間をすると、１７０と−１７０の単純な補間結果は０となり、結果として、図２４（ｃ）に示されているように、突然逆回りに大きく動くこととなる。そこで、補間部５７４は、補間するフレームデータ同士でどちら回りが近いかを判断して補間を実行する。例えば、図２４（ｄ）のように、基準位置パラメータで示される基準形状を１３０度（Ａ点）から−１３０度（Ｂ点）に移動又は回転させる場合、反時計周りが近くなり±１８０度の境界を跨ぐため、この場合はＡ点と（Ｂ点+３６０度）で補間を行う。図２４（ｅ）のように、１３０度（Ｃ点）から−２０度（Ｄ点）に移動又は回転させる場合は、時計回りが近くなり、±１８０度の境界をまたがないためＣ点とＤ点で単純に補間を行う。図２４（ｆ）のように、−１３０度（Ｅ点）から１３０度（Ｆ点）に移動又は回転させる場合、時計周りが近くなり、±１８０度の境界を跨ぐためＥ点と（Ｄ点−３６０度）で補間すればよい。このような方法で補間することで、移動又は回転の速さに対して動画のフレームの設定間隔が大きい場合等を除き、おおむね意図通りの動きになる。 Here, the shape conversion parameters will be described in detail with reference to FIG. FIG. 24 explains the interpolation method. In the present embodiment, the reference shape conversion information is all expressed in angles except for the reference shape scaling information. When the rotation angle is obtained from a homography matrix or the like, as shown in FIG. 24A, the value of the angle is in a range of ± 180 degrees, for example. If the boundary of ± 180 degrees is crossed during interpolation, an unintended operation may occur. For example, consider a case where the frames of a moving image are set to 130 degrees, 150 degrees, 170 degrees, -170 degrees, -150 degrees, and -130 degrees at 1 second intervals. As shown in FIG. 24 (b), it is normal to think that it is moving at a constant speed counterclockwise from 130 degrees to -130 degrees. However, if this video frame is interpolated at 0.5 second intervals, the simple interpolation result of 170 and -170 is 0, and as a result, as shown in FIG. It will move greatly. Therefore, the interpolation unit 574 determines which direction is closer between the interpolated frame data and executes the interpolation. For example, as shown in FIG. 24D, when the reference shape indicated by the reference position parameter is moved or rotated from 130 degrees (point A) to -130 degrees (point B), the counterclockwise direction is close and ± 180 degrees. In this case, interpolation is performed between point A and (point B + 360 degrees). As shown in FIG. 24 (e), when moving or rotating from 130 degrees (point C) to -20 degrees (point D), the clockwise direction is close and does not cross the ± 180 degree boundary. Simple interpolation is performed at point D. When moving or rotating from -130 degrees (point E) to 130 degrees (point F) as shown in FIG. 24 (f), the clockwise direction becomes closer and the point E and point (D) are crossed over the ± 180 degree boundary. -360 degrees). By interpolating in this way, the movement is almost as intended except when the setting interval of moving image frames is large with respect to the speed of movement or rotation.

続いて、図２３に戻り、変換位置パラメータ算出部５７６は、メタデータ生成部５５ａで生成された基準形状情報（基準位置パラメータ）、及び補間部５７４で得られた基準形状変換情報を用いて、この時間の変換位置パラメータを算出する（ステップＳ２８０）。 Subsequently, returning to FIG. 23, the conversion position parameter calculation unit 576 uses the reference shape information (reference position parameter) generated by the metadata generation unit 55a and the reference shape conversion information obtained by the interpolation unit 574. The conversion position parameter for this time is calculated (step S280).

ここで、図２５を用いて、変換位置パラメータの算出方法について説明する。図２５は、変換位置パラメータ算出部における処理の流れを示した三次元の概念図である。
図２５（ａ）は、基準位置パラメータに基づき、３次元の立体球ＣＳ上の座標（部分立体球ＰＳ）を作成した図である。ここでも、上記（式４）の考えが用いられる。 Here, the calculation method of the conversion position parameter will be described with reference to FIG. FIG. 25 is a three-dimensional conceptual diagram showing the flow of processing in the conversion position parameter calculation unit.
FIG. 25A is a diagram in which the coordinates (partial solid sphere PS) on the three-dimensional solid sphere CS are created based on the reference position parameter. Again, the idea of (Equation 4) above is used.

次に、変換位置パラメータ算出部５７６は、図２５（ｂ）に示されているように、部分立体球ＰＳの４つ角のうちの３点で構成される平面を求め、その平面に対して各点を立体球ＣＳの中心を視点として透視射影する。３点の選択は任意で良い。３次元上で３点を通る平面の式は公知の技術なので割愛する。また、部分立体球ＰＳから、平面への射影は部分立体球ＰＳ上の点Ｐ(ａ，ｂ，ｃ)とすると原点（視点）から点Ｐへの直線の式は媒介変数tを用いて（ｘ，ｙ，ｚ）＝ｔ（ａ，ｂ，ｃ）と表される。平面と直線の交点を求めればよい。 Next, as shown in FIG. 25B, the conversion position parameter calculation unit 576 obtains a plane composed of three points of the four corners of the partial solid sphere PS, and with respect to the plane. Each point is projected through the center of the solid sphere CS as a viewpoint. The selection of three points is arbitrary. Since the formula of a plane passing through three points on three dimensions is a known technique, it is omitted. If the projection from the partial solid sphere PS to the plane is a point P (a, b, c) on the partial solid sphere PS, the equation of the straight line from the origin (viewpoint) to the point P uses the parameter t ( x, y, z) = t (a, b, c). What is necessary is just to obtain | require the intersection of a plane and a straight line.

次に、変換位置パラメータ算出部５７６は、図２５（ｃ）に示されているように、平面の中心が原点に来るように各点を平行移動する。平面の中心は例えば４隅の点の位置座標の平均としてその点を原点に移動するベクトルを求め、各点に対してそのベクトルを加算する。 Next, the conversion position parameter calculation unit 576 translates each point so that the center of the plane comes to the origin, as shown in FIG. For the center of the plane, for example, a vector for moving the point to the origin is obtained as an average of the position coordinates of the four corner points, and the vector is added to each point.

次に、変換位置パラメータ算出部５７６は、図２５（ｄ）に示されているように、図２５（ｃ）平面の各点に対して補間部５７４で取得された情報（補間後の基準形状変換情報）のうち回転情報と変倍情報を作用させる。先ほど原点に移動したのは局所座標で回転及び変倍情報を作用させるためである。 Next, as shown in FIG. 25 (d), the conversion position parameter calculation unit 576 receives the information (reference shape after interpolation) obtained by the interpolation unit 574 for each point on the plane of FIG. 25 (c). Rotation information and scaling information are applied to the conversion information. The reason for moving to the origin earlier is to apply rotation and scaling information in local coordinates.

次に、変換位置パラメータ算出部５７６は、図２５（ｅ）に示されているように、上記原点へ平行移動したベクトルと逆のベクトルを各点に加算して元の位置に戻す。 Next, as shown in FIG. 25 (e), the conversion position parameter calculation unit 576 adds a vector opposite to the vector translated to the origin to each point to return to the original position.

次に、変換位置パラメータ算出部５７６は、図２５（ｆ）に示されているように、平面の各点を立体球ＣＳに戻す。立体球ＣＳへ戻す方法は、例えば、平面状のある点をＱ（ｄ，ｅ，ｆ）とする。これらの点を半径１の球上に射影するためには原点から点Ｑまでのベクトルの長が１になればよい。 Next, the conversion position parameter calculation unit 576 returns each point on the plane to the solid sphere CS as shown in FIG. In the method of returning to the solid sphere CS, for example, a certain point on the plane is defined as Q (d, e, f). In order to project these points onto a sphere having a radius of 1, the length of the vector from the origin to point Q need only be 1.

次に、変換位置パラメータ算出部５７６は、図２５（ｅ）に示されているように、立体球ＣＳ上に射影された各点に対して補間部５７４で取得された基準形状変換情報のうち残りの基準形状移動情報を作用させる。基準形状移動情報は原点を中心とした回転情報になるので、各点の（ｘ，ｙ，ｚ）座標に回転行列を掛ければよい。基準位置パラメータを作用させた３次元座標から、再度、正距円筒射影画像ＥＣにおける座標に戻すことにより、変換位置パラメータが算出される。なお、今回は、一度、正距円筒射影画像ＥＣにおける座標に戻した後、部分立体球ＰＳの作成を行う例を示したが、基準形状移動情報を作用させたものをそのまま部分立体球ＰＳとしてもよい。 Next, as shown in FIG. 25 (e), the conversion position parameter calculation unit 576 includes reference shape conversion information acquired by the interpolation unit 574 for each point projected onto the solid sphere CS. The remaining reference shape movement information is applied. Since the reference shape movement information is rotation information centered on the origin, the (x, y, z) coordinates of each point may be multiplied by a rotation matrix. The converted position parameter is calculated by returning again from the three-dimensional coordinates to which the reference position parameter is applied to the coordinates in the equirectangular projection image EC. In addition, although the example which produces the partial solid sphere PS once after returning to the coordinate in the equirectangular projection image EC once this time was shown, what made reference shape movement information act as a partial solid sphere PS as it is Also good.

続いて、図２３に戻り、形状変換部５７８は、変換位置パラメータ算出部５７６で算出された変換位置パラメータで特定される正距円筒射影画像ＥＣ上の領域を、平面画像Ｐと同じ矩形の形状に変換することで、第３の対応領域ＣＡ３を作成する（ステップＳ２９０）。そして、補正パラメータ作成部５８０は、第３の対応領域ＣＡ３と、再生用動画フレーム抽出部５７２から抽出された平面画像Ｐとを比較して、補正パラメータの算出を行うことで、補正パラメータを作成する（ステップＳ３００）。 Subsequently, returning to FIG. 23, the shape conversion unit 578 converts the region on the equirectangular projection image EC specified by the conversion position parameter calculated by the conversion position parameter calculation unit 576 into the same rectangular shape as the planar image P. The third corresponding area CA3 is created by converting to (step S290). Then, the correction parameter creation unit 580 compares the third corresponding area CA3 with the planar image P extracted from the reproduction moving image frame extraction unit 572, and calculates the correction parameter, thereby creating the correction parameter. (Step S300).

続いて、図２８を用いて、まず、図１３に示されている記憶・読出部５９（取得部）が、予め、記憶部５０００から、正距円筒射影方式によって得られた正距円筒射影画像ＥＣのデータ、透視射影方式によって得られた平面画像Ｐのデータ、及び変換位置パラメータを算出しておく。そして、再生フレーム選択部５７２が、映像のフレームである正距円筒射影画像ＥＣのデータ及び平面画像Ｐのデータを選択する。 28, first, the equirectangular projection image obtained by the storage / reading unit 59 (acquisition unit) shown in FIG. 13 in advance by the equirectangular projection method from the storage unit 5000 is used. EC data, data of a planar image P obtained by the perspective projection method, and conversion position parameters are calculated. Then, the playback frame selection unit 572 selects data of the equirectangular projection image EC and the data of the planar image P that are video frames.

次に、貼付領域作成部５８２は、図２８に示されているように、変換位置パラメータによって基準形状パラメータを変換することで、仮想の立体球ＣＳにおいて、部分立体球ＰＳを作成する（ステップＳ３１０）。 Next, as shown in FIG. 28, the pasting region creation unit 582 creates the partial solid sphere PS in the virtual solid sphere CS by converting the reference shape parameter using the conversion position parameter (step S310). ).

次に、補正部５８４は、補正パラメータ作成部５８０で作成された補正パラメータに基づいて、平面画像Ｐの明るさ値及び色値を、正距円筒射影画像ＥＣの明るさ値及び色値に合わせる補正を行なう（ステップＳ３２０）。以降、補正後の平面画像Ｐは、「補正画像Ｃ」という。 Next, the correction unit 584 matches the brightness value and color value of the planar image P with the brightness value and color value of the equirectangular projection image EC based on the correction parameters created by the correction parameter creation unit 580. Correction is performed (step S320). Hereinafter, the corrected planar image P is referred to as “corrected image C”.

次に、画像作成部５８６は、部分立体球ＰＳに対して、補正画像Ｃを貼り付けることで、重畳画像Ｓを作成する（ステップＳ３３０）。また、画像作成部５８６は、部分立体球ＰＳに基づいて、マスクデータＭを作成する（ステップＳ３４０）。更に、画像作成部５８６は、立体球ＣＳに対して、正距円筒射影画像ＥＣを貼り付けることで全天球画像ＣＥを作成する（ステップＳ３５０）。そして、画像重畳部５８８は、全天球画像ＣＥに対して、重畳画像Ｓ及びマスクデータＭを重畳する（ステップＳ３６０）。これにより、境目が目立たないように高精細の重畳画像Ｓが重畳された低精細の全天球画像ＣＥが完成する。 Next, the image creation unit 586 creates the superimposed image S by pasting the corrected image C on the partial solid sphere PS (step S330). The image creation unit 586 creates mask data M based on the partial solid sphere PS (step S340). Furthermore, the image creation unit 586 creates the omnidirectional image CE by pasting the equirectangular projection image EC on the solid sphere CS (step S350). Then, the image superimposing unit 588 superimposes the superimposed image S and the mask data M on the omnidirectional image CE (step S360). Thereby, the low-definition omnidirectional image CE on which the high-definition superimposed image S is superimposed so that the boundary is not conspicuous is completed.

次に、射影変換部５９０は、図７に示されているように、予め定められた仮想カメラＩＣの視線方向（所定領域Ｔの中心点ＣＰ）と所定領域Ｔの画角αに基づいて、重畳画像Ｓが重畳された状態の全天球画像ＣＥにおける所定領域Ｔをディスプレイ５１７で見ることができるように射影変換する（ステップＳ３７０）。この際、射影変換部５９０は、所定領域Ｔをディスプレイ５１７における表示領域の解像度に合わせる処理も行なう。これにより、表示制御部５６は、ディスプレイ５１７の表示領域の全体に亘って、所定領域Ｔを示す所定領域画像Ｑの表示の処理を行なうことができる（ステップＳ２４）。ここでは、所定領域画像Ｑ内に、平面画像Ｐが重畳された状態の平面画像Ｐ’である重畳画像Ｓが含まれている。 Next, as shown in FIG. 7, the projective transformation unit 590 is based on the predetermined visual line direction of the virtual camera IC (the center point CP of the predetermined area T) and the angle of view α of the predetermined area T. Projective transformation is performed so that the predetermined area T in the omnidirectional image CE with the superimposed image S superimposed can be seen on the display 517 (step S370). At this time, the projective transformation unit 590 also performs processing for matching the predetermined area T with the resolution of the display area on the display 517. Thereby, the display control part 56 can perform the process of the display of the predetermined area image Q which shows the predetermined area T over the whole display area of the display 517 (step S24). Here, a superimposed image S that is a planar image P ′ in which the planar image P is superimposed is included in the predetermined region image Q.

続いて、図２９乃至図３３を用いて、重畳表示された状態について詳細に説明する。図２９は、全天球画像に平面画像を重畳した場合の二次元の概念図である。ここでは、図５に対して、平面画像Ｐを重畳している場合を示している。図２９に示されているように、高精細な重畳画像Ｓは、立体球ＣＳに張り付けられた低精細な全天球画像ＣＥに対し、変換位置パラメータに従って、立体球ＣＳの内側に重畳されている。 Subsequently, the state of the superimposed display will be described in detail with reference to FIGS. 29 to 33. FIG. 29 is a two-dimensional conceptual diagram when a planar image is superimposed on an omnidirectional image. Here, the case where the planar image P is superimposed on FIG. 5 is shown. As shown in FIG. 29, the high-definition superimposed image S is superimposed on the inside of the solid sphere CS according to the conversion position parameter with respect to the low-definition omnidirectional image CE attached to the solid sphere CS. Yes.

図３０は、全天球画像に平面画像を重畳した場合の三次元の概念図である。図３０では、立体球ＣＳに全天球画像ＣＥ及び重畳画像Ｓが貼り付けられ、重畳画像Ｓを含む画像が所定領域画像Ｑとなっている状態を表している。 FIG. 30 is a three-dimensional conceptual diagram when a planar image is superimposed on an omnidirectional image. FIG. 30 illustrates a state in which the omnidirectional image CE and the superimposed image S are pasted on the solid sphere CS and the image including the superimposed image S is the predetermined region image Q.

図３１は、本実施形態の位置パラメータを用いずに、全天球画像に平面画像を重畳した場合の二次元の概念図である。図３２は、本実施形態の位置パラメータを用いて、全天球画像に平面画像を重畳した場合の二次元の概念図である。 FIG. 31 is a two-dimensional conceptual diagram when a planar image is superimposed on an omnidirectional image without using the position parameter of the present embodiment. FIG. 32 is a two-dimensional conceptual diagram when a planar image is superimposed on an omnidirectional image using the position parameters of the present embodiment.

図３１（ａ）に示されているように、仮想カメラＩＣが立体球ＣＳの中心点に位置している場合を基準にすると、被写体ｐｏ１は、全天球画像ＣＥ上で像ｐｏ２として表され、重畳画像Ｓ上で像ｐｏ３として表されている。図３１（ａ）に示されているように、像ｐｏ２及び像ｐｏ３は、仮想カメラＩＣと被写体ｐｏ１とを結ぶ直線上に位置しているため、全天球画像ＣＥに重畳画像Ｓが重畳された状態で表示されても、全天球画像ＣＥと重畳画像Ｓにズレが生じない。しかし、図３１（ｂ）に示されているように、仮想カメラＩＣが立体球ＣＳの中心点から離れると、仮想カメラＩＣと被写体ｐｏ１とを結ぶ直線上に、像ｐｏ２は位置しているが、像ｐｏ３はやや内側に位置している。このため、仮想カメラＩＣと被写体ｐｏ１とを結ぶ直線上における重畳画像Ｓ上の像を像ｐｏ３’とすると、全天球画像ＣＥと重畳画像Ｓに、像ｐｏ３と像ｐｏ３’との間のズレ量ｇ分のズレが生じてしまう。これにより、全天球画像ＣＥに対して重畳画像Ｓがズレて表示されてしまう。 As shown in FIG. 31A, when the virtual camera IC is located at the center point of the solid sphere CS, the subject po1 is represented as an image po2 on the omnidirectional image CE. And represented as an image po3 on the superimposed image S. As shown in FIG. 31A, the image po2 and the image po3 are located on a straight line connecting the virtual camera IC and the subject po1, and thus the superimposed image S is superimposed on the omnidirectional image CE. Even if displayed in the state, the omnidirectional image CE and the superimposed image S are not displaced. However, as illustrated in FIG. 31B, when the virtual camera IC is separated from the center point of the solid sphere CS, the image po2 is positioned on a straight line connecting the virtual camera IC and the subject po1. The image po3 is located slightly inside. For this reason, if an image on the superimposed image S on the straight line connecting the virtual camera IC and the subject po1 is an image po3 ′, the omnidirectional image CE and the superimposed image S are shifted from the image po3 and the image po3 ′. Deviation of the amount g occurs. Thereby, the superimposed image S is shifted from the omnidirectional image CE and displayed.

これに対して、本実施形態では、複数の格子領域によって示された位置パラメータを用いているため、図３２（ａ）、（ｂ）に示されているように、重畳画像Ｓを全天球画像ＣＥに沿って重畳することができる。これにより、図３２（ａ）に示されているように、仮想カメラＩＣが立体球ＣＳの中心点に位置する場合だけでなく、図３２（ｂ）に示されているように、仮想カメラが立体球ＣＳの中心点から離れた場合であっても、像ｐｏ２及び像ｐｏ３は、仮想カメラＩＣと被写体ｐｏ１とを結ぶ直線上に位置することになる。よって、全天球画像ＣＥに重畳画像Ｓが重畳された状態で表示されても、全天球画像ＣＥと重畳画像Ｓにズレが生じない。
On the other hand, in the present embodiment, since the position parameter indicated by the plurality of lattice regions is used, the superimposed image S is represented as an omnidirectional ball as shown in FIGS. 32 (a) and 32 (b). It can be superimposed along the image CE. Thereby, as shown in FIG. 32A, not only when the virtual camera IC is located at the center point of the solid sphere CS, but also as shown in FIG. Even when it is away from the center point of the three-dimensional sphere CS, the image po2 and the image po3 are located on a straight line connecting the virtual camera IC and the subject po1. Therefore, even if the superimposed image S is displayed in a state of being superimposed on the omnidirectional image CE, there is no deviation between the omnidirectional image CE and the superimposed image S.

図３３（ａ）は重畳表示しない場合のワイド画像の表示例、図３３（ｂ）は重畳表示しない場合のテレ画像の表示例、図３３（ｃ）は重畳表示する場合のワイド画像の表示例、図３３（ｄ）は重畳表示する場合のテレ画像の表示例を示した概念図である。なお、図中の波線は、説明の便宜上表しただけであり、実際にディスプレイ５１７上には表示されてもよく、表示されなくてもよい。 33A shows a display example of a wide image without superimposing display, FIG. 33B shows a display example of tele image without superimposition display, and FIG. 33C shows a display example of wide image with superimposition display. FIG. 33D is a conceptual diagram showing a display example of a tele image in the case of superimposed display. Note that the wavy lines in the figure are merely shown for convenience of explanation, and may or may not be actually displayed on the display 517.

図３３（ａ）に示されているように、全天球画像ＣＥに対して平面画像Ｐを重畳して表示しない場合、図３３（ａ）における波線で示される領域まで拡大表示すると、図３３（ｂ）に示されているように、低精細の画像のままとなっており、利用者は鮮明でない画像を見ることになってしまう。これに対して、図３３（ｃ）に示されているように、全天球画像ＣＥに対して平面画像Ｐを重畳して表示する場合、図３３（ｃ）における波線で示される領域まで拡大表示すると、図３３（ｄ）に示されているように、高精細の画像が表示され、利用者は鮮明な画像を見ることができる。特に、波線で示されている領域に、文字が描かれた看板等が表示されている場合、高精細な平面画像Ｐを重畳表示しなければ、拡大表示させても文字がぼやけてしまい、何が書かれてあるのか分からない。しかし、高精細な平面画像Ｐを重畳表示すれば、拡大表示させても文字が鮮明に見えるため、利用者は何が書かれているのかを把握することができる。 As shown in FIG. 33 (a), when the planar image P is not superimposed and displayed on the omnidirectional image CE, if it is enlarged and displayed up to the area indicated by the wavy line in FIG. 33 (a), FIG. As shown in (b), the low-definition image remains as it is, and the user will see an unclear image. On the other hand, as shown in FIG. 33 (c), when the planar image P is displayed superimposed on the omnidirectional image CE, it is enlarged to the area indicated by the wavy line in FIG. 33 (c). When displayed, a high-definition image is displayed as shown in FIG. 33D, and the user can see a clear image. In particular, when a signboard or the like on which characters are drawn is displayed in the area indicated by the wavy line, if the high-definition plane image P is not superimposed and displayed, the characters are blurred even if enlarged and displayed. I don't know if is written. However, if the high-definition planar image P is displayed in a superimposed manner, the characters can be clearly seen even when enlarged and displayed, so that the user can grasp what is written.

また、本実施形態のように、動画のフレームデータである全天球画像ＣＥ内に動画のフレームデータである平面画像Ｐが重畳されている場合には、図３４に示されているような見え方になる。図３４は、動画の所定領域画像内に動画の平面画像が表示されている例を示す図である。 In addition, as in the present embodiment, when the planar image P that is the frame data of the moving image is superimposed on the omnidirectional image CE that is the frame data of the moving image, the appearance as shown in FIG. Become. FIG. 34 is a diagram illustrating an example in which a moving image plane image is displayed in a predetermined area image of the moving image.

ディスプレイ５１７上には、図３４（ａ）から順に、図３４（ｂ）、図３４（ｃ）、及び図３４（ｄ）で時系列に、所定領域画像Ｑ１，Ｑ２，Ｑ３，Ｑ４は、それぞれ平面画像Ｐ１，Ｐ２，Ｐ３，Ｐ４が含まれた状態で表示される。所定領域画像Ｑ１，Ｑ２，Ｑ３，Ｑ４は、低精細の画像であるが、平面画像Ｐ１，Ｐ２，Ｐ３，Ｐ４は高精細であるため、平面画像Ｐ１，Ｐ２，Ｐ３，Ｐ４を拡大表示しても、利用者は鮮明な画像を見ることができる。 On the display 517, the predetermined area images Q1, Q2, Q3, and Q4 are respectively shown in time series in FIG. 34 (b), FIG. 34 (c), and FIG. 34 (d) in order from FIG. 34 (a). The flat images P1, P2, P3, and P4 are displayed. The predetermined area images Q1, Q2, Q3, and Q4 are low-definition images, but the planar images P1, P2, P3, and P4 are high-definition images, so that the planar images P1, P2, P3, and P4 are enlarged and displayed. However, the user can see a clear image.

また、背景画像（正距円筒射影画像）及び前景画像（平面画像）が静止画の場合、前景画像における複数の位置（点）、輝度及び色を補正する補正パラメータ（補正情報）を、背景画像の複数の位置（点）に対応付けて記憶してもよい。しかし、この方式をそのまま動画像に適応すると重畳表示メタデータのデータ量が重畳する動画の記録時間に比例して大きくなってしまうという課題が生じる。つまり、背景画像に対し、ある時刻における前景画像の背景画像に対する重畳位置と、別のある時刻における前景画像の背景画像に対する重畳位置は静止画と異なり変化することがあるため、各時刻の前景画像となる動画像のフレーム単位で重畳位置を記録することが必要となる。そのため、動画のフレーム枚数分だけ重畳位置を示す位置情報が必要となり、重畳表示メタデータが大量の位置情報を保持する必要がある。これに伴い、重畳部５５ｂは、重畳表示メタデータを用いて背景動画及び前景画像の一フレーム毎に重畳の処理を行う必要があるため、処理負荷が非常に大きくなってしまう。 In addition, when the background image (equal-angle cylindrical projection image) and the foreground image (planar image) are still images, a plurality of positions (points), luminance, and color correction parameters (correction information) in the foreground image are set as the background image. These may be stored in association with a plurality of positions (points). However, if this method is applied to a moving image as it is, there arises a problem that the data amount of the superimposed display metadata increases in proportion to the recording time of the moving image to be superimposed. That is, the foreground image at a certain time and the overlay position of the foreground image at a certain time and the overlay position of the foreground image at another certain time may change differently from a still image. It is necessary to record the superimposed position in units of frames of the moving image. For this reason, position information indicating the superimposed position is required for the number of frames of the moving image, and the superimposed display metadata needs to hold a large amount of position information. Accordingly, the superimposition unit 55b needs to perform superimposition processing for each frame of the background moving image and the foreground image using the superimposition display metadata, so that the processing load becomes very large.

これに対して、本実施形態では、図２２（ａ）に示しているように、基準位置パラメータによって背景画像上に基準形状ＢＦを定め、基準形状ＢＦを３次元モデル空間上で、回転、変倍、移動させるための基準形状変換情報を用いることで、動画における各フレームにおいて重畳位置の算出に必要となる重畳表示メタデータのデータサイズを小さくし、さらに動画像の全てのフレームに対しパラメータを持つのではなく、適当にサンプリングした動画のフレームに対する基準位置パラメータを記録することで、必要となるパラメータのデータ量を減らすことができる。さらに、そのパラメータはビューア等で行われる重畳表示を行う際に、重畳処理のための演算処理が従来処理よりも軽減できるようなパラメータセットとして提供することを考慮している。具体例としては、２Ｄ(2-Dimensions)および３Ｄ(3-Dimensions)のデータを視覚化するために使用するグラフィックスライブラリであるOpenGLで取り扱うことができるデータを算出するための演算処理を考慮している。このようにして、外部プログラムで取り扱うことが可能なメタデータとして記録しているため、外部プログラムの実行により実現される補間部５７４が、時系列の回転、変倍、移動を示す基準形状変換情報を読み込み、補間して利用することで連続的な重畳処理がリアルタイムの処理で実現できる。 On the other hand, in this embodiment, as shown in FIG. 22A, the reference shape BF is defined on the background image by the reference position parameter, and the reference shape BF is rotated and changed in the three-dimensional model space. By using the reference shape conversion information for moving the image, the data size of the superimposed display metadata necessary for calculating the superimposed position in each frame in the moving image is reduced, and parameters are set for all frames of the moving image. By recording the reference position parameters for the appropriately sampled moving image frames instead of having them, the data amount of the necessary parameters can be reduced. Further, it is considered that the parameter is provided as a parameter set that can reduce the arithmetic processing for the superimposition processing compared to the conventional processing when performing the superimposition display performed by a viewer or the like. As a specific example, consider an arithmetic process for calculating data that can be handled by OpenGL, a graphics library used to visualize 2D (2-Dimensions) and 3D (3-Dimensions) data. ing. In this way, since it is recorded as metadata that can be handled by an external program, the interpolation unit 574 realized by executing the external program performs reference shape conversion information indicating time series rotation, scaling, and movement. Can be realized by real-time processing.

＜＜本実施形態の主な効果＞＞
以上説明したように本実施形態によれば、射影方式が異なる一方の画像に他方の画像を合わせても、見えづらさを抑制することができるという効果を奏する。また、射影方式が異なる一方の画像（例えば、正距円筒射影画像ＥＣ）に他方の画像（例えば、平面画像Ｐ）を重畳しても、画像のずれを抑制することができるという効果を奏する。 << Main effects of this embodiment >>
As described above, according to the present embodiment, it is possible to suppress the difficulty of seeing even if the other image is combined with one image having a different projection method. Moreover, even if the other image (for example, the planar image P) is superimposed on one image (for example, the equirectangular projection image EC) having a different projection method, an effect of suppressing the image shift can be achieved.

また、図３３（ｃ）に示されているように、低精細の全天球画像ＣＥの一部である所定領域画像の一部の領域に、高精細の平面画像Ｐを重畳して表示することができるため、全天球画像と平面画像の画像が合い、平面画像が全天球画像に上手く溶け込むことができるという効果を奏する。 Further, as shown in FIG. 33 (c), a high-definition plane image P is superimposed and displayed on a partial area of a predetermined area image that is a part of the low-definition omnidirectional image CE. Therefore, there is an effect that the omnidirectional image and the image of the planar image are matched, and the planar image can be well blended into the omnidirectional image.

更に、本実施形態によれば、基準形状情報で示される基準形状ＢＦを定め、基準形状ＢＦを３次元モデル空間上で、回転、変倍、移動させるための基準形状変換情報を用いることで、射影方式が異なる複数の動画像において、一方の動画像の各フレームに他方の動画像の各フレームを重畳するための位置パラメータの算出に必要となるパラメータのデータサイズをできるだけ少なくすることができる。また、動画像の全てのフレームに対しパラメータを持つのではなく、適当にサンプリングした任意のフレームに対するパラメータを記録することで、必要となるパラメータのデータ量を減らすことができる。 Furthermore, according to the present embodiment, by defining the reference shape BF indicated by the reference shape information and using the reference shape conversion information for rotating, scaling, and moving the reference shape BF on the three-dimensional model space, In a plurality of moving images with different projection methods, it is possible to reduce the data size of a parameter necessary for calculating a position parameter for superimposing each frame of the other moving image on each frame of one moving image. Further, by recording parameters for an arbitrarily sampled frame instead of having parameters for all the frames of the moving image, it is possible to reduce the required parameter data amount.

〔第２の実施形態〕
続いて、図３５乃至図４０を用いて、本発明の第２の実施形態について説明する。 [Second Embodiment]
Subsequently, a second embodiment of the present invention will be described with reference to FIGS.

＜撮影システムの概略＞
まずは、図３５を用いて、本実施形態の撮影システムの構成の概略について説明する。図３５は、本実施形態の撮影システムの構成の概略図である。 <Outline of shooting system>
First, an outline of the configuration of the photographing system of this embodiment will be described with reference to FIG. FIG. 35 is a schematic diagram of the configuration of the imaging system of the present embodiment.

図３５に示されているように、本実施形態の撮影システムは、第１の実施形態に係る各構成に対して、更に画像処理サーバ７が追加されている。第１の実施形態と同一の構成は、同一の符号を付して、その説明を省略する。スマートフォン５と画像処理サーバ７は、インターネットやイントラネット等の通信ネットワーク１００を介して、画像処理サーバ７と相互通信することができる。 As shown in FIG. 35, in the imaging system of the present embodiment, an image processing server 7 is further added to each configuration according to the first embodiment. The same configurations as those of the first embodiment are denoted by the same reference numerals, and description thereof is omitted. The smartphone 5 and the image processing server 7 can mutually communicate with the image processing server 7 via a communication network 100 such as the Internet or an intranet.

第１の実施形態では、スマートフォン５が、重畳表示メタデータの作成や画像の重畳の処理を行なっているのに対して、本実施形態では、画像処理サーバ７がこれらの処理を行なう。
なお、本実施形態のスマートフォン５は、通信端末の一例となり、画像処理サーバが画像処理装置の一例となる。 In the first embodiment, the smartphone 5 performs the process of creating superimposed display metadata and the process of superimposing images, whereas in the present embodiment, the image processing server 7 performs these processes.
Note that the smartphone 5 of the present embodiment is an example of a communication terminal, and the image processing server is an example of an image processing apparatus.

画像処理サーバ７は、サーバコンピュータであり、複数台のサーバコンピュータで分散して画像処理を行なう場合も含まれる。 The image processing server 7 is a server computer, and includes a case where image processing is performed in a distributed manner by a plurality of server computers.

＜＜実施形態のハードウェア構成＞＞
次に、図３６を用いて、本実施形態の画像処理サーバ７のハードウェア構成を詳細に説明する。図３６は、画像処理サーバのハードウェア構成図である。なお、本実施形態の特殊撮影装置１、一般撮影装置３及びスマートフォン５のハードウェア構成は、第１の実施形態と同様であるため、それらの説明を省略する。 << Hardware Configuration of Embodiment >>
Next, the hardware configuration of the image processing server 7 of this embodiment will be described in detail with reference to FIG. FIG. 36 is a hardware configuration diagram of the image processing server. Note that the hardware configurations of the special imaging device 1, the general imaging device 3, and the smartphone 5 according to the present embodiment are the same as those in the first embodiment, and thus the description thereof is omitted.

＜画像処理サーバのハードウェア構成＞
図３６は、画像処理サーバのハードウェア構成図である。画像処理サーバ７は、コンピュータによって構築されており、図６に示されているように、ＣＰＵ７０１、ＲＯＭ７０２、ＲＡＭ７０３、ＨＤ７０４、ＨＤＤ(Hard Disk Drive)７０５、記録メディア７０６、メディアＩ／Ｆ７０７、ディスプレイ７０８、ネットワークＩ／Ｆ７０９、キーボード７１１、マウス７１２、ＣＤ−ＲＷドライブ７１４、及び、バスライン７１０を備えている。なお、画像処理サーバ７は、サーバとして機能するため、キーボード７１１やマウス７１２等の入力装置や、ディスプレイ７０８等の出力装置はなくてもよい。 <Hardware configuration of image processing server>
FIG. 36 is a hardware configuration diagram of the image processing server. The image processing server 7 is constructed by a computer, and as shown in FIG. 6, a CPU 701, ROM 702, RAM 703, HD 704, HDD (Hard Disk Drive) 705, recording medium 706, media I / F 707, display 708. A network I / F 709, a keyboard 711, a mouse 712, a CD-RW drive 714, and a bus line 710. Since the image processing server 7 functions as a server, an input device such as a keyboard 711 and a mouse 712 and an output device such as a display 708 may not be provided.

これらのうち、ＣＰＵ７０１は、画像処理サーバ７全体の動作を制御する。ＲＯＭ７０２は、ＣＰＵ７０１の駆動に用いられるプログラムを記憶する。ＲＡＭ７０３は、ＣＰＵ７０１のワークエリアとして使用される。ＨＤ７０４は、プログラム等の各種データを記憶する。ＨＤＤ７０５は、ＣＰＵ７０１の制御にしたがってＨＤ７０４に対する各種データの読み出し又は書き込みを制御する。メディアＩ／Ｆ７０７は、フラッシュメモリ等の記録メディア７０６に対するデータの読み出し又は書き込み（記憶）を制御する。ディスプレイ７０８は、カーソル、メニュー、ウィンドウ、文字、又は画像などの各種情報を表示する。ネットワークＩ／Ｆ７０９は、通信ネットワーク１００を利用してデータ通信をするためのインターフェースである。キーボード７１１は、文字、数値、各種指示などの入力のための複数のキーを備えた入力手段の一種である。マウス７１２は、各種指示の選択や実行、処理対象の選択、カーソルの移動などを行う入力手段の一種である。ＣＤ−ＲＷドライブ７１４は、着脱可能な記録媒体の一例としてのＣＤ−ＲＷ(Compact Disc-ReWritable)７１３に対する各種データの読み出し等を制御する。 Among these, the CPU 701 controls the overall operation of the image processing server 7. The ROM 702 stores a program used for driving the CPU 701. The RAM 703 is used as a work area for the CPU 701. The HD 704 stores various data such as programs. The HDD 705 controls reading or writing of various data with respect to the HD 704 according to the control of the CPU 701. The media I / F 707 controls reading or writing (storage) of data with respect to a recording medium 706 such as a flash memory. The display 708 displays various information such as a cursor, menu, window, character, or image. A network I / F 709 is an interface for performing data communication using the communication network 100. The keyboard 711 is a kind of input means having a plurality of keys for inputting characters, numerical values, various instructions, and the like. The mouse 712 is a kind of input means for selecting and executing various instructions, selecting a processing target, moving a cursor, and the like. The CD-RW drive 714 controls reading of various data with respect to a CD-RW (Compact Disc-ReWritable) 713 as an example of a removable recording medium.

また、画像処理サーバ７は、バスライン７１０を備えている。バスライン７１０は、図３６に示されているＣＰＵ７０１等の各構成要素を電気的に接続するためのアドレスバスやデータバス等である。 Further, the image processing server 7 includes a bus line 710. The bus line 710 is an address bus, a data bus, or the like for electrically connecting each component such as the CPU 701 shown in FIG.

＜＜実施形態の機能構成＞＞
次に、図３７及び図３９を用いて、本実施形態の機能構成について説明する。図３７は、本実施形態に係る撮影システムの機能ブロック図である。なお、本実施形態の特殊撮影装置１、一般撮影装置３及びスマートフォン５の機能構成は、第１の実施形態と同様であるため、それらの説明を省略する。本実施形態の場合、スマートフォン５の画像・音処理部５５は、図１５に示されている各機能構成を有していてもよいし、有していなくてもよい。 << Functional Configuration of Embodiment >>
Next, the functional configuration of this embodiment will be described with reference to FIGS. 37 and 39. FIG. 37 is a functional block diagram of the photographing system according to the present embodiment. In addition, since the functional structure of the special imaging device 1, the general imaging device 3, and the smart phone 5 of this embodiment is the same as that of 1st Embodiment, those description is abbreviate | omitted. In the case of the present embodiment, the image / sound processing unit 55 of the smartphone 5 may or may not have each functional configuration shown in FIG.

＜画像処理サーバの機能構成＞
図３７に示されているように、画像処理サーバ７は、遠距離通信部７１、受付部７２、画像・音処理部７５、表示制御部７６、判断部７７、及び記憶・読出部７９を有している。これら各部は、図３６に示されている各構成要素のいずれかが、ＨＤ７０４からＲＡＭ７０３上に展開された画像処理サーバ７用プログラムに従ったＣＰＵ７０１からの命令によって動作することで実現される機能又は手段である。 <Functional configuration of image processing server>
As shown in FIG. 37, the image processing server 7 includes a long-distance communication unit 71, a reception unit 72, an image / sound processing unit 75, a display control unit 76, a determination unit 77, and a storage / reading unit 79. is doing. Each of these units is a function realized by any of the constituent elements shown in FIG. 36 being operated by an instruction from the CPU 701 according to the image processing server 7 program expanded from the HD 704 onto the RAM 703, or Means.

また、画像処理サーバ７は、図３６に示されているＲＯＭ７０２、ＲＡＭ７０３、及びＨＤ７０４によって構築される記憶部７０００を有している。 Further, the image processing server 7 has a storage unit 7000 constructed by the ROM 702, the RAM 703, and the HD 704 shown in FIG.

（画像処理サーバの各機能構成）
画像処理サーバ７の遠距離通信部７１は、主に、図３６に示されているネットワークＩ／Ｆ７０７及びＣＰＵ７０１の処理によって実現され、通信ネットワークを介して、他の装置（例えば、他のサーバ、スマートフォン）との間で各種データ（または情報）の送受信を行う。 (Functional configuration of image processing server)
The long-distance communication unit 71 of the image processing server 7 is mainly realized by the processing of the network I / F 707 and the CPU 701 shown in FIG. 36, and other devices (for example, other servers, Send and receive various data (or information) to and from your smartphone.

受付部７２は、主にキーボード７１１、マウス７１２、及びＣＰＵ７０１の処理によって実現され、利用者から各種の選択又は入力を受け付ける。 The accepting unit 72 is realized mainly by the processing of the keyboard 711, the mouse 712, and the CPU 701, and accepts various selections or inputs from the user.

画像・音処理部７５は、主にＣＰＵ７０１からの命令によって実現され、スマートフォン５から送られて来た各種データに対して、各種処理を行なう。 The image / sound processing unit 75 is realized mainly by instructions from the CPU 701 and performs various processes on various data sent from the smartphone 5.

表示制御部７６は、主にＣＰＵ７０１の処理によって実現され、第１の実施形態の表示制御部５６と異なり、スマートフォン５のディスプレイ５１７に平面画像Ｐを表示させるための所定領域画像Ｑのデータを作成する。また、表示制御部７６は、画像・音処理部７５によって作成された重畳表示メタデータを利用して、平面画像Ｐの各格子領域ＬＡ０を、位置パラメータで示された位置、並びに補正パラメータで示された明るさ値及び色値に合わせることで、全天球画像ＣＥに平面画像Ｐを重畳して表示させるためのデータを作成する。 The display control unit 76 is realized mainly by the processing of the CPU 701, and unlike the display control unit 56 of the first embodiment, creates data of a predetermined area image Q for displaying the planar image P on the display 517 of the smartphone 5. To do. Further, the display control unit 76 uses the superimposed display metadata created by the image / sound processing unit 75 to indicate each lattice area LA0 of the planar image P by the position indicated by the position parameter and the correction parameter. By matching with the brightness value and the color value, data for displaying the planar image P superimposed on the omnidirectional image CE is created.

判断部７７は、図３６に示されているＣＰＵ７０１の処理によって実現され、各種判断を行なう。 The determination unit 77 is realized by the processing of the CPU 701 shown in FIG. 36 and makes various determinations.

記憶・読出部７９は、主に、図３６に示されているＣＰＵ７０１の処理によって実現され、記憶部７０００に、重畳表示メタデータ等の各種データ（または情報）を記憶したり、記憶部７０００から重畳表示メタデータ等の各種データ（または情報）を読み出したりする。また、記憶・読出部７９は、記憶部７０００から各種データを取得する取得部の役割を果たす。 The storage / reading unit 79 is realized mainly by the processing of the CPU 701 shown in FIG. 36, and stores various data (or information) such as superimposed display metadata in the storage unit 7000, or from the storage unit 7000. Various data (or information) such as superimposed display metadata is read out. The storage / reading unit 79 serves as an acquisition unit that acquires various data from the storage unit 7000.

（画像・音処理部の詳細な各機能構成）
ここで、図３９を用いて、画像・音処理部７５の各機能構成について詳細に説明する。図３９は、画像・音処理部の詳細な機能ブロック図である。 (Detailed functional configuration of image / sound processor)
Here, the functional configuration of the image / sound processor 75 will be described in detail with reference to FIG. FIG. 39 is a detailed functional block diagram of the image / sound processing unit.

画像・音処理部７５は、大きく分けて、エンコードを行なうメタデータ作成部７５ａとデコードを行なう重畳部７５ｂを有している。メタデータ作成部７５ａは、図４０に示されている後述のステップＳ１２１の処理を実行する。また、重畳部７５ｂは、図１８に示されている後述のステップＳ１２２の処理を実行する。 The image / sound processing unit 75 is roughly divided into a metadata creating unit 75a that performs encoding and a superimposing unit 75b that performs decoding. The metadata creation unit 75a executes the process of step S121 described later shown in FIG. In addition, the superimposing unit 75b performs a process of step S122 described later illustrated in FIG.

｛メタデータ作成部の各機能構成｝
まずは、メタデータ作成部７５ａの各機能構成について説明する。メタデータ作成部７５ａは、動画フレーム抽出部７４８、抽出部７５０、第１の対応領域算出部７５２、注視点特定部７５４、射影方式変換部７５６、第２の対応領域算出部７５８、初期設定形状生成部７５９、領域分割部７６０、射影方式逆変換部７６２、基準形状変換情報算出部７６８、及び重畳表示メタデータ作成部７７０を有している。これらは、それぞれ第１の実施形態における、作成用動画フレーム抽出部５４８、抽出部５５０、第１の対応領域算出部５５２、注視点特定部５５４、射影方式変換部５５６、第２の対応領域算出部５５８、初期設定形状生成部５５９、領域分割部５６０、射影方式逆変換部５６２、基準形状変換情報算出部５６８、及び重畳表示メタデータ作成部５７０と同様の機能を有するため、これらの説明は省略する。 {Each functional configuration of the metadata creation unit}
First, each functional configuration of the metadata creation unit 75a will be described. The metadata creation unit 75a includes a moving picture frame extraction unit 748, an extraction unit 750, a first corresponding region calculation unit 752, a gazing point identification unit 754, a projection method conversion unit 756, a second corresponding region calculation unit 758, and an initial setting shape. A generation unit 759, a region division unit 760, a projection method inverse conversion unit 762, a reference shape conversion information calculation unit 768, and a superimposed display metadata creation unit 770 are included. These are the moving image frame extraction unit 548, the extraction unit 550, the first corresponding region calculation unit 552, the gaze point specification unit 554, the projection method conversion unit 556, and the second corresponding region calculation in the first embodiment, respectively. Unit 558, initial setting shape generation unit 559, region division unit 560, projection method inverse conversion unit 562, reference shape conversion information calculation unit 568, and superimposed display metadata creation unit 570, and these descriptions are Omitted.

｛重畳部の機能構成｝
続いて、重畳部７５ｂの機能構成について説明する。重畳部７５ｂは、再生時間管理部７７１、再生動画フレーム選択部７７２、補間部７７４、位置パラメータ算出部７７６、形状変換部７７８、補正パラメータ作成部７８０、貼付領域作成部７８２、補正部７８４、画像作成部７８６、画像重畳部７８８、及び射影変換部７９０を有している。これらは、それぞれ第１の実施形態における、再生時間管理部５７１、再生用動画フレーム抽出部５７２、補間部５７４、変位位置パラメータ算出部５７６、形状変換部５７８、補正パラメータ作成部５８０、貼付領域作成部５８２、補正部５８４、画像作成部５８６、画像重畳部５８８、及び射影変換部５９０と同様の機能を有するため、これらの説明は省略する。 {Functional structure of superimposition unit}
Next, the functional configuration of the superimposing unit 75b will be described. The superimposing unit 75b includes a reproduction time management unit 771, a reproduction moving image frame selection unit 772, an interpolation unit 774, a position parameter calculation unit 776, a shape conversion unit 778, a correction parameter creation unit 780, a pasting region creation unit 782, a correction unit 784, an image. A creation unit 786, an image superimposing unit 788, and a projective transformation unit 790 are included. These are the playback time management unit 571, the playback video frame extraction unit 572, the interpolation unit 574, the displacement position parameter calculation unit 576, the shape conversion unit 578, the correction parameter creation unit 580, and the paste area creation, respectively, in the first embodiment. Since the functions are the same as those of the unit 582, the correction unit 584, the image creation unit 586, the image superimposing unit 588, and the projective transformation unit 590, their descriptions are omitted.

＜＜実施形態の処理又は動作＞＞
続いて、図４０を用いて、本実施形態の処理又は動作について説明する。図４０を用いて、撮影システムが実行する撮影方法を説明する。図４０は、本実施形態に係る撮影方法を示したシーケンス図である。なお、ステップＳ１１１〜Ｓ１１８の処理は、第１の実施形態のステップＳ１１〜Ｓ１８と同様の処理であるため、これらの説明を省略する。 << Processing or Operation of Embodiment >>
Subsequently, the processing or operation of the present embodiment will be described with reference to FIG. A photographing method executed by the photographing system will be described with reference to FIG. FIG. 40 is a sequence diagram illustrating the photographing method according to the present embodiment. In addition, since the process of step S111-S118 is a process similar to step S11-S18 of 1st Embodiment, these description is abbreviate | omitted.

スマートフォン５では、遠距離通信部５１が通信ネットワーク１００を介して画像処理サーバ７に、射影方式が異なる一方の画像に他方の画像を重畳するための重畳要求を示す重畳要求情報を送信する（ステップＳ１１９）。この重畳要求情報には、記憶部５０００に記憶された電子フォルダ内のデータ（平面画像データ、正距円筒射影画像データ）を送信する。これにより、画像処理サーバ７の送受信部７１は、電子フォルダ内のデータを受信する。 In the smartphone 5, the long-distance communication unit 51 transmits, to the image processing server 7 via the communication network 100, superimposition request information indicating a superimposition request for superimposing the other image on one image having a different projection method (step). S119). Data in the electronic folder (planar image data, equirectangular projection image data) stored in the storage unit 5000 is transmitted to the superimposition request information. Thereby, the transmission / reception unit 71 of the image processing server 7 receives the data in the electronic folder.

次に、画像処理サーバ７では、記憶・読出部７９が、記憶部７０００に、ステップＳ１１９で受信された電子フォルダ内のデータを記憶する（ステップＳ１２０）。そして、図３８に示されているメタデータ作成部７５ａが、重畳表示用メタデータの作成を行なう（ステップＳ１２１）。更に、重畳部７５ｂが、重畳の処理を行なう（ステップＳ１２２）。ステップＳ１２１，Ｓ１２２の処理は、それぞれステップＳ２１，Ｓ２２の処理と同様の内容であるため、これらの説明を省略する。 Next, in the image processing server 7, the storage / readout unit 79 stores the data in the electronic folder received in step S119 in the storage unit 7000 (step S120). Then, the metadata creating unit 75a shown in FIG. 38 creates superimposed display metadata (step S121). Furthermore, the superimposing unit 75b performs a superimposing process (step S122). Since the processes of steps S121 and S122 have the same contents as the processes of steps S21 and S22, respectively, their descriptions are omitted.

次に、表示制御部７６は、スマートフォン５のディスプレイ５１７の表示領域の全体に亘って、所定領域Ｔを示す所定領域画像Ｑを表示させる所定領域画像Ｑのデータを作成する。ここでは、所定領域画像Ｑ内に、平面画像Ｐが重畳された状態の平面画像Ｐ’である重畳画像Ｓが含まれている。遠距離通信部５１は、スマートフォン５に対して、表示制御部７６によって作成された所定領域画像Ｑのデータを送信する（ステップＳ１２３）。これにより、スマートフォンの遠距離通信部５１は、所定領域画像Ｑのデータを受信する。 Next, the display control unit 76 creates data of the predetermined area image Q that displays the predetermined area image Q indicating the predetermined area T over the entire display area of the display 517 of the smartphone 5. Here, a superimposed image S that is a planar image P ′ in which the planar image P is superimposed is included in the predetermined region image Q. The long-distance communication unit 51 transmits the data of the predetermined area image Q created by the display control unit 76 to the smartphone 5 (step S123). Thereby, the long-distance communication part 51 of a smart phone receives the data of the predetermined area image Q.

次に、スマートフォン５では、表示制御部５６が、ディスプレイ５１７上に、重畳画像Ｓが含まれた所定領域画像Ｑを表示させる（ステップＳ１２４）。 Next, in the smartphone 5, the display control unit 56 displays the predetermined area image Q including the superimposed image S on the display 517 (step S124).

＜＜本実施形態の主な効果＞＞
以上説明したように本実施形態によれば、上記第１の実施形態と同様の効果を奏する。 << Main effects of this embodiment >>
As described above, according to the present embodiment, the same effects as those of the first embodiment can be obtained.

また、本実施形態では、連携撮影の処理はスマートフォン５で行い、重畳表示用メタデータの作成及び重畳の処理は画像処理サーバ７で行なうため、スマートフォン５の処理能力が比較的低い場合であっても、画像のずれを抑制することができるという効果を奏する。 In the present embodiment, the cooperative shooting process is performed by the smartphone 5, and the superimposition display metadata creation and the superimposition process are performed by the image processing server 7. Therefore, the processing capability of the smartphone 5 is relatively low. In addition, there is an effect that image shift can be suppressed.

〔補足〕
上記各実施形態では、図１３に示されているように、スマートフォン５が、正距円筒射影画像データ、平面画像データ、及び重畳表示パラメータデータを有しているが、これに限るものではない。例えば、インターネット等の通信ネットワークを介して通信可能な管理サーバが、正距円筒射影画像データ、平面画像データ、及び重畳表示パラメータデータのうち少なくとも１つを記憶してもよい。 [Supplement]
In each of the above embodiments, as shown in FIG. 13, the smartphone 5 has equirectangular projection image data, planar image data, and superimposed display parameter data. However, the present invention is not limited to this. For example, a management server that can communicate via a communication network such as the Internet may store at least one of equirectangular projection image data, planar image data, and superimposed display parameter data.

また、上記各本実施形態では、全天球画像ＣＥに平面画像Ｐが重畳されているが、これに限るものではない。例えば、全天球画像ＣＥの一部の画像を平面画像Ｐに置き換えてもよいし、全天球画像ＣＥの一部の画像を削除して、この削除した部分に平面画像Ｐを嵌め込んでもよい。 In each of the above embodiments, the planar image P is superimposed on the omnidirectional image CE, but the present invention is not limited to this. For example, a partial image of the omnidirectional image CE may be replaced with the planar image P, or a partial image of the omnidirectional image CE may be deleted and the planar image P may be inserted into the deleted portion. Good.

また、上記第２の実施形態では、重畳の処理が画像処理サーバ７で行なわれているが（ステップＳ４５参照）、これに限るものではない。例えば、画像処理サーバ７からスマートフォン５に重畳表示用メタデータを送信し、スマートフォン５側で重畳の処理及び表示の処理を行なってもよい。この場合、画像処理サーバ７では、図３９に示されているメタデータ作成部７５ａが重畳表示用メタデータの作成を行なう。一方、スマートフォン５では、図１５に示されている重畳部５５ｂが重畳の処理を行い、図１３に示されている表示制御部５６が表示の処理を行なう。 In the second embodiment, the superimposing process is performed by the image processing server 7 (see step S45), but the present invention is not limited to this. For example, the superimposed display metadata may be transmitted from the image processing server 7 to the smartphone 5, and the overlapping process and the display process may be performed on the smartphone 5 side. In this case, in the image processing server 7, the metadata creation unit 75a shown in FIG. 39 creates metadata for superimposed display. On the other hand, in the smartphone 5, the superimposing unit 55b illustrated in FIG. 15 performs the superimposing process, and the display control unit 56 illustrated in FIG. 13 performs the display process.

また、上記では、全天球画像に平面画像を重畳する場合について説明したが、重畳は、合成の一例である。合成には、重畳の他に、貼り付け、嵌め込み、重ね合わせ等も含まれる。また、上記重畳画像は、合成画像の一例である。合成画像には、重畳画像の他に、貼り付け画像、嵌め込み画像、重ね合わせ画像等も含まれる。更に、画像重畳部５８８，７８８は、画像合成部の一例である。 Moreover, although the case where a planar image is superimposed on an omnidirectional image has been described above, the superimposition is an example of synthesis. In addition to superposition, the composition includes pasting, fitting, superposition, and the like. The superimposed image is an example of a composite image. In addition to the superimposed image, the composite image includes a pasted image, a fitted image, a superimposed image, and the like. Furthermore, the image superimposing units 588 and 788 are examples of an image synthesizing unit.

また、正距円筒射影画像ＥＣと平面画像Ｐは、両方とも静止画の場合、両方とも動画のフレームの場合、一方が静止画で他方が動画のフレームの場合のいずれであってもよい。 The equirectangular projection image EC and the planar image P may be either a still image, a case where both are moving image frames, a case where one is a still image and the other is a moving image frame.

図１３、図１５、図１６、図３６、図３７、図３８に示されている各機能構成は、ソフトウェア機能ユニットの形で実現し、且つ、独立製品として販売または使用する場合に、コンピュータ読取り可能な記憶媒体に記憶することができる。この場合に、本実施形態の技術方案は、本質的、または従来技術に貢献する部分若しくは上記技術方案の部分はソフトウェア製品の形で表現される。上記コンピュータソフトウェア製品は、記憶媒体に記憶され、コンピュータ装置(パーソナルコンピュータ、サーバ、或はネットワークデバイスなど)に上記各実施形態にかかる上記方法の全部或は一部のステップを実行させる複数の指令を含む。なお、上述の記憶媒体は、USBメモリ、リムーバブルディスク、ROM、RAM、磁気ディスク、あるいは光ディスクなど、プログラムコードを格納できる様々な媒体を含む。 Each of the functional configurations shown in FIGS. 13, 15, 16, 36, 37, and 38 is realized in the form of a software functional unit and is read by a computer when sold or used as an independent product. It can be stored in a possible storage medium. In this case, the technical plan of the present embodiment is expressed in the form of a software product, which is essential, or a part that contributes to the prior art or a part of the technical plan. The computer software product is stored in a storage medium, and a plurality of commands for causing a computer apparatus (personal computer, server, network device, etc.) to execute all or some steps of the method according to each of the embodiments. Including. The above-described storage medium includes various media that can store program codes, such as a USB memory, a removable disk, a ROM, a RAM, a magnetic disk, or an optical disk.

また、上記実施形態にかかる方法は、プロセッサに適用され、またはプロセッサによって実現される。プロセッサは信号を処理する能力を持つ集積回路基板である。上記各実施形態の方法の各ステップはプロセッサにおけるハードウェアである集積論理回路又はソフトウェア形式の指令によって実現される。上記プロセッサ42は汎用プロセッサ、デジタル信号プロセッサ(DSP)、専用集積回路(ASIC)、既製プログラマブルゲートアレイ(FPGA)または他のプログラマブル論理デバイス、ディスクリートゲートまたはトランジスタ論理デバイス、ディスクリートハードウェア部品であり、上記各実施形態に開示される各方法、ステップ及び論理ボックスを実現又は実行可能なものである。汎用処理器はマイクロプロセッサ又は任意の一般処理器などである。上記各実施形態にかかる方法の各ステップは、ハードウェアであるデコーダにより実行されることで実現されてもよく、または、デコーダにお行けるハードウェアとソフトウェアとの組み合わせによって実現されても良い。ソフトウェアモジュールはランダムメモリ、フラッシュメモリ、読出し専用メモリ、プログラマブル読出し専用メモリ、あるいは電気的消去可能なプログラマブルメモリ、レジスタなど、本分野で成熟した記憶媒体に記憶される。このソフトウェアが記憶される記憶媒体を備えるメモリから、プロセッサは情報を読み取り、ハードウェアに合わせて上記方法のステップを実現させる。 In addition, the method according to the above embodiment is applied to or realized by the processor. A processor is an integrated circuit board capable of processing signals. Each step of the method of each of the above embodiments is realized by an integrated logic circuit, which is hardware in the processor, or a software type instruction. The processor 42 is a general purpose processor, digital signal processor (DSP), dedicated integrated circuit (ASIC), off-the-shelf programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware component, and Each method, step, and logic box disclosed in each embodiment can be realized or executed. The general-purpose processor is a microprocessor or any general processor. Each step of the method according to each of the above embodiments may be realized by being executed by a decoder that is hardware, or may be realized by a combination of hardware and software that can be performed by the decoder. The software modules are stored in storage media mature in the field, such as random memory, flash memory, read-only memory, programmable read-only memory, or electrically erasable programmable memory, registers. The processor reads information from a memory having a storage medium in which the software is stored, and implements the steps of the method according to the hardware.

以上に説明した実施形態は、ハードウェア、ソフトウェア、ファームウェア、ミドルウェア、マイクロコード、またはそれらの組み合わせで実現される。その中に、ハードウェアの実現に関して、処理ユニットは一つまたは複数の専用集積回路(ASIC)、デジタル信号処理プロセッサ（DSP）、デジタル信号プロセッサ(DSPD)、プログラム可能論理回路 (PLD)、フィールドプログラマブルゲートアレイ(FPGA)、汎用プロセッサ、コントローラ、マイクロコントローラ、マイクロプロセッサ、本発明の機能を実行する他の電子ユニット或はその組合せにより実現される。また、ソフトウェアの実現に関しては、上述した機能を実現するモジュール(たとえばプロセス、関数など)により上記技術が実現される。ソフトウェアコードは、メモリに保存され、プロセッサによって実行される。なお、メモリはプロセッサの内部または外部で実現される。 The embodiment described above is realized by hardware, software, firmware, middleware, microcode, or a combination thereof. Among them, regarding hardware implementation, the processing unit can be one or more dedicated integrated circuits (ASIC), digital signal processor (DSP), digital signal processor (DSPD), programmable logic circuit (PLD), field programmable It can be implemented by a gate array (FPGA), a general purpose processor, a controller, a microcontroller, a microprocessor, other electronic units that perform the functions of the present invention, or a combination thereof. As for software implementation, the above technique is implemented by modules (for example, processes, functions, etc.) that implement the above-described functions. The software code is stored in memory and executed by the processor. The memory is realized inside or outside the processor.

１特殊撮影装置（第１の撮影装置の一例）
３一般撮影装置（第２の撮影装置の一例）
５スマートフォン（画像処理装置の一例）
７画像処理サーバ（画像処理装置の一例）
５１送受信部
５２受付部
５５ａメタデータ作成部
５５ｂ重畳部
５６表示制御部
５８近距離通信部
５９記憶・読出部（取得手段の一例）
７２受付部
７５画像・音処理部
７５ａメタデータ作成部
７５ｂ重畳部
７６表示制御部
７８近距離通信部
７９記憶・読出部（取得手段の一例）
５１７ディスプレイ
５４８（メタデータ）作成用動画フレーム抽出部
５５０抽出部（抽出手段の一例）
５５２第１の対応領域算出部（第１の対応領域算出手段の一例）
５５４注視点特定部（注視点特定手段の一例）
５５６射影方式変換部（射影方式変換手段の一例）
５５８第２の対応領域算出部（対応領域算出手段の一例／第２の対応領域算出手段の一例）
５５９初期設定形状生成部（初期設定形状生成手段の一例）
５６０領域分割部（領域分割手段の一例）
５６２射影方式逆変換部（射影方式逆変換手段の一例）
５６８基準形状変換情報算出部（基準形状変換情報算出手段の一例）
５７０重畳表示メタデータ作成部（位置算出手段の一例）
５７１再生時間管理部
５７２再生用動画フレーム抽出部
５７４補間部（補間手段の一例）
５７５変換位置パラメータ算出部
５７８形状変換部
５８０補正パラメータ作成部
５８２添付領域作成部（添付領域作成手段の一例）
５８４補正部（補正手段の一例）
５８６画像作成部（画像作成手段の一例）
５８８画像重畳部（画像重畳手段の一例）
５９０射影変換部（射影変換手段の一例）
７５０抽出部（抽出手段の一例）
７５２第１の対応領域算出部（第１の対応領域算出手段の一例）
７５４注視点特定部（注視点特定手段の一例）
７５６射影方式変換部（射影方式変換手段の一例）
７５８第２の対応領域算出部（対応領域算出手段の一例／第２の対応領域算出手段の一例）
７６０領域分割部（領域分割手段の一例）
７６２射影方式逆変換部（射影方式逆変換手段の一例）
７６４形状変換部（形状変換手段の一例）
７６６補正パラメータ作成部（補正情報作成手段の一例）
７７０重畳表示メタデータ作成部（位置算出手段の一例）
７８２添付領域作成部（添付領域作成手段の一例）
７８４補正部（補正手段の一例）
７８６画像作成部（画像作成手段の一例）
７８８画像重畳部（画像重畳手段の一例）
７９０射影変換部（射影変換手段の一例）
５０００記憶部（記憶手段の一例）
５００１連携撮影装置ＤＢ（連携撮影装置手段の一例）
７０００記憶部（記憶手段の一例） 1 Special shooting device (example of first shooting device)
3 General photographing device (an example of a second photographing device)
5 Smartphone (an example of an image processing device)
7 Image processing server (an example of an image processing device)
51 Transmission / Reception Unit 52 Reception Unit 55a Metadata Creation Unit 55b Superimposition Unit 56 Display Control Unit 58 Near Field Communication Unit 59 Storage / Readout Unit (Example of Acquisition Unit)
72 Reception unit 75 Image / sound processing unit 75a Metadata creation unit 75b Superimposition unit 76 Display control unit 78 Short-range communication unit 79 Storage / reading unit (an example of an acquisition unit)
517 Display 548 (Metadata) Creation Movie Frame Extraction Unit 550 Extraction Unit (Example of Extraction Unit)
552 1st corresponding area calculation part (an example of the 1st corresponding area calculation means)
554 Gaze point identification part (an example of gaze point identification means)
556 Projection method conversion unit (an example of projection method conversion means)
558 Second Corresponding Area Calculation Unit (Example of Corresponding Area Calculation Unit / Example of Second Corresponding Area Calculation Unit)
559 initial setting shape generation unit (an example of initial setting shape generation means)
560 area dividing unit (an example of area dividing means)
562 Projection method reverse conversion unit (an example of projection method reverse conversion means)
568 Reference shape conversion information calculation unit (an example of reference shape conversion information calculation means)
570 Superimposed display metadata creation unit (an example of position calculation means)
571 Playback time management unit 572 Playback video frame extraction unit 574 Interpolation unit (an example of interpolation means)
575 Conversion position parameter calculation unit 578 Shape conversion unit 580 Correction parameter creation unit 582 Attachment region creation unit (an example of attachment region creation means)
584 Correction unit (an example of correction means)
586 Image creation unit (an example of image creation means)
588 Image superimposing unit (an example of image superimposing means)
590 Projection conversion unit (an example of projection conversion means)
750 extraction unit (an example of extraction means)
752 First corresponding area calculation unit (an example of first corresponding area calculation means)
754 Gaze point identification part (an example of gaze point identification means)
756 Projection method conversion unit (an example of projection method conversion means)
758 Second corresponding area calculation unit (an example of a corresponding area calculation unit / an example of a second corresponding area calculation unit)
760 area dividing unit (an example of area dividing means)
762 Projection method inversion unit (an example of projection method inversion unit)
764 Shape converter (an example of shape converter)
766 Correction parameter creation unit (an example of correction information creation means)
770 superimposed display metadata creation unit (an example of position calculation means)
782 Attachment region creation unit (an example of attachment region creation means)
784 Correction unit (an example of correction means)
786 Image creation unit (an example of image creation means)
788 image superimposing unit (an example of image superimposing means)
790 Projection conversion unit (an example of projection conversion means)
5000 storage unit (an example of storage means)
5001 Collaborative photographing apparatus DB (an example of cooperative photographing apparatus means)
7000 storage unit (an example of storage means)

特開２０１６−９６４８７号公報Japanese Patent Laid-Open No. 2006-96487 特開２０１２−１７８１３５号公報JP 2012-178135 A

Claims

Acquisition means for acquiring a first image of the first projection method and a second image of a second projection method different from the first projection method;
First projection method conversion means for converting the first image into a third image of the second projection method;
Extraction means for extracting a plurality of feature points from each of the second image and the third image;
A second corresponding region corresponding to the second image in the third image based on the plurality of feature points of the second image and the plurality of feature points of the third image respectively extracted by the extraction unit; Corresponding area calculation means for obtaining
An initial setting shape generating means for generating an initial setting shape which is the second projecting method and has a projective transformation relationship with respect to the shape of the second image;
Second projection method conversion means for creating a reference shape by converting the initial setting shape to the first projection method;
By using the information in the case of performing projective transformation from the second image to the second corresponding area, calculating the information for converting the default shape for the second corresponding area, the reference shape is Reference shape conversion information calculating means for calculating reference shape conversion information for conversion by at least one of rotation and scaling;
Storage means for storing the reference shape information indicating the reference shape and the reference shape conversion information in association with each other;
An image processing apparatus comprising:

The reference shape conversion information calculating means converts the center point of the second corresponding area to the first projection method by the second projection method conversion means, thereby corresponding to the corresponding on the first image. The image processing apparatus according to claim 1, wherein the reference shape conversion information for converting the position of the reference shape by movement is calculated by calculating a point.

The image processing apparatus according to claim 2, wherein the rotation of the reference shape is performed by an Euler angle with respect to each axis of the three-dimensional model space.

The image processing apparatus according to claim 1, wherein the first image or the second image is a predetermined frame of a moving image.

5. The reference shape conversion information calculating unit calculates the reference shape conversion information based on an image obtained by appropriately sampling a frame image of a moving image from the second image. The image processing apparatus described.

In the second moving image, the reference shape conversion information calculating unit samples the frames of the second moving image finely on the time axis when the change between frames is large, and when the change between the frames is small, The image processing apparatus according to claim 5, wherein the frame of the second moving image is largely sampled.

The image processing apparatus according to claim 1, wherein the image processing apparatus is a smartphone, a tablet personal computer, a notebook personal computer, a desktop personal computer, or a server computer.

An image processing apparatus according to claim 1;
A first imaging device that images a subject and obtains the first image of the first projection method;
A second imaging device that images a subject and obtains the second image of the second projection method;
An imaging system comprising:

The imaging system according to claim 8, wherein the first imaging device is a camera that captures a subject and obtains an omnidirectional image.

An acquisition step of acquiring a first image of the first projection method and a second image of a second projection method different from the first projection method;
A first projection method conversion step of converting the first image into a third image of the second projection method;
An extraction step of extracting a plurality of feature points from each of the second image and the third image;
A second corresponding region corresponding to the second image in the third image based on the plurality of feature points of the second image and the plurality of feature points of the third image respectively extracted in the extraction step; A corresponding area calculating step for obtaining
An initial setting shape generating step for generating an initial setting shape which is the second projecting method and has a projective transformation relationship with respect to the shape of the second image;
A second projection method conversion step of creating a reference shape by converting the initial shape to the first projection method;
By using the information in the case of performing projective transformation from the second image to the second corresponding area, calculating the information for converting the default shape for the second corresponding area, the reference shape is A reference shape conversion information calculation step for calculating reference shape conversion information for conversion by at least one of rotation and scaling;
A storage step of storing the reference shape information indicating the reference shape and the reference shape conversion information in association with each other;
The image processing method characterized by performing.

The reference shape conversion information calculating step converts the center point of the second corresponding region into the first projection method by the second projection method conversion step, thereby corresponding to the corresponding on the first image. The image processing method according to claim 10, further comprising: calculating the reference shape conversion information for converting the position of the reference shape by movement by calculating a point.

A program causing a computer to execute the method according to claim 10 or 11.