JP6388532B2

JP6388532B2 - Image providing system and image providing method

Info

Publication number: JP6388532B2
Application number: JP2014242551A
Authority: JP
Inventors: 水谷　政美; 政美水谷
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2014-11-28
Filing date: 2014-11-28
Publication date: 2018-09-12
Anticipated expiration: 2034-11-28
Also published as: JP2016103248A

Description

本発明は、画像提供システムおよび画像提供方法に関する。 The present invention relates to an image providing system and an image providing method.

被写体の画像に撮影位置の位置情報を関連付けて、表示する被写体の画像を特定し、特定された被写体の画像を表示する技術がある。例えば、撮影位置と画像データとを関連付けて格納するデータベースから所望の検索対象物が写り込んでいる画像を検索する技術が提案されている。 There is a technique for associating position information of a shooting position with a subject image, specifying a subject image to be displayed, and displaying the specified subject image. For example, there has been proposed a technique for searching for an image in which a desired search object is reflected from a database that stores a shooting position and image data in association with each other.

また、コラージュを作成するために、デジタル画像の自動アノテーションのための、および画像をスラッチするための場所データに基づいて、デジタル画像により取得されたオブジェクトを識別する技術が提案されている（例えば、特許文献１および２参照）。 Also, techniques have been proposed to identify objects acquired by digital images based on location data for automatic annotation of digital images and for latching images to create collages (eg, (See Patent Documents 1 and 2).

国際公開第２０１３／１１４４７３号公報International Publication No. 2013/114473 特表２０１０−５０９６６８号公報Special table 2010-509668 gazette

１つの側面として、本発明は、複数の画像の中から被写体を捉えた区間の一連の画像を簡単な操作で表示することを目的とする。 As one aspect, an object of the present invention is to display a series of images of a section in which a subject is captured from a plurality of images with a simple operation.

１つの態様では、画像提供システムは、サーバと表示端末とを含む画像提供システムであって、前記サーバは、前記表示端末が指定した被写体に関する情報に基づいて、前記被写体の存在地点の位置を特定する特定部と、前記被写体を撮影した撮影位置と前記被写体の存在地点の位置とに基づいて、前記被写体を含む区間の一連の周囲画像を特定する特定データを生成する生成部と、前記特定データを前記表示端末に送信する送信部と、を備え、前記表示端末は、複数の周囲画像が記憶される記憶装置から、前記特定データに基づいて、前記被写体を含む区間の一連の周囲画像を抽出する抽出部と、前記特定データに基づいて、前記抽出部が抽出した一連の周囲画像のうち前記被写体の画像を表示する表示部と、を備える。 In one embodiment, the image providing system, an image providing system including a server and a display terminal, wherein the server, based on the information about the object which the display terminal is specified, identifies the position of existence point of the object A generating unit that generates specific data for specifying a series of surrounding images of a section including the subject, based on a shooting position where the subject is shot and a position of the location where the subject exists, and the specific data And a transmission unit that transmits a series of surrounding images of a section including the subject based on the specific data from a storage device that stores a plurality of surrounding images. And a display unit that displays an image of the subject in a series of surrounding images extracted by the extraction unit based on the specific data.

１つの側面によれば、複数の画像の中から被写体を捉えた区間の一連の画像を簡単な操作で表示することができる。 According to one aspect, a series of images of a section in which a subject is captured from a plurality of images can be displayed with a simple operation.

画像提供システムの一例を示す図である。It is a figure which shows an example of an image provision system. 車両に搭載したカメラと視野との関係の一例を示す図である。It is a figure which shows an example of the relationship between the camera mounted in the vehicle and a visual field. 車両に含まれる制御部の一例を示す機能ブロック図である。It is a functional block diagram which shows an example of the control part contained in a vehicle. 車載画像メタデータの一例を示す図である。It is a figure which shows an example of vehicle-mounted image metadata. 制御部が行う処理の一例を示すフローチャートである。It is a flowchart which shows an example of the process which a control part performs. 画像蓄積サーバおよびメタデータサーバの処理の一例を示すフローチャートである。It is a flowchart which shows an example of a process of an image storage server and a metadata server. 表示端末と画像検索サーバとが行う処理の一例を示すシーケンスチャートである。It is a sequence chart which shows an example of the process which a display terminal and an image search server perform. 合成画像メタデータの生成の一例を示すフローチャートである。It is a flowchart which shows an example of a production | generation of synthetic | combination image metadata. 車載画像メタデータの区間の一例を示す図である。It is a figure which shows an example of the area of vehicle-mounted image metadata. 被写体を囲うバウンディングボックスの一例を示す図である。It is a figure which shows an example of the bounding box surrounding a to-be-photographed object. 被写体を含むポリゴンデータの一例を示す図である。It is a figure which shows an example of the polygon data containing a to-be-photographed object. 車載画像メタデータと合成映像メタデータとの関係の一例を示す図である。It is a figure which shows an example of the relationship between vehicle-mounted image metadata and synthetic | combination video metadata. 合成画像メタデータの一例を示す図である。It is a figure which shows an example of composite image metadata. 選択画面の一例を示す図である。It is a figure which shows an example of a selection screen. 視野を設定して表示する処理の一例を示すフローチャートである。It is a flowchart which shows an example of the process which sets and displays a visual field. テクスチャが設定された全周囲画像の一例を示す図である。It is a figure which shows an example of the omnidirectional image to which the texture was set. 被写体を捉えた画像の一例を示す図である。It is a figure which shows an example of the image which caught the to-be-photographed object. 視点、視野を変更した場合の仮想カメラの仮想視野の一例を示す図である。It is a figure which shows an example of the virtual visual field of the virtual camera at the time of changing a viewpoint and a visual field. 表示部に表示される動画の一例を示す図である。It is a figure which shows an example of the moving image displayed on a display part. 可視性に基づく画像選択を行うための画面例を示す図である。It is a figure which shows the example of a screen for performing the image selection based on visibility. 画像検索サーバのハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware constitutions of an image search server. 表示端末のハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware constitutions of a display terminal.

＜発明者の知見＞
例えば、全周囲を撮影可能なカメラを搭載した車両が移動しながら所定のタイミングで全周囲画像の撮影を行う。従って、カメラが撮影する全周囲画像は、異なる位置で撮影した画像になる。車両が被写体の近傍を通過しながら撮影を行う場合、被写体が含まれる全周囲画像は一連の全周囲画像になる。 <Inventor's knowledge>
For example, an image of the entire periphery is captured at a predetermined timing while a vehicle equipped with a camera capable of capturing the entire periphery moves. Therefore, the entire surrounding image captured by the camera is an image captured at a different position. When shooting while the vehicle passes near the subject, the omnidirectional image including the subject becomes a series of omnidirectional images.

ユーザの端末（以下、表示端末と称する）が全周囲画像のうち車両の進行方向の視野の画像を表示するように設定されている場合、被写体が全周囲画像に含まれていたとしても、表示端末には被写体が表示されない場合がある。例えば、車両の進行方向に対して後方に被写体がある場合には、全周囲画像に被写体が含まれていたとしても、表示端末には被写体が表示されない。 When the user's terminal (hereinafter referred to as a display terminal) is set to display an image of the field of view in the traveling direction of the vehicle among the all-around image, even if the subject is included in the all-around image, the display The subject may not be displayed on the terminal. For example, when there is a subject behind the traveling direction of the vehicle, the subject is not displayed on the display terminal even if the subject is included in the all-around image.

この場合、ユーザは、表示端末を操作して、手動で視野を設定する。これにより、表示端末は、車両の進行方向に対して後方の被写体を表示することができ、表示端末は、被写体を中心に捉えたカメラワークで撮影した被写体を表示することができる。ただし、この場合、ユーザの手動操作により被写体を中心に捉えたカメラワークで被写体を表示するため、操作性が煩雑になる。 In this case, the user manually sets the field of view by operating the display terminal. As a result, the display terminal can display a subject behind the vehicle in the traveling direction, and the display terminal can display a subject photographed by camera work centered on the subject. However, in this case, since the subject is displayed with camera work centered on the subject by manual operation of the user, the operability becomes complicated.

また、被写体の動画再生を行う場合、視野を手動で設定しながら、被写体を中心に捉えたカメラワークで動画再生を行うことは難しい。従って、被写体を含む区間の一連の周囲画像のうち被写体を中心に捉えたカメラワークの画像は、ユーザによる視野設定の操作が行われることなく、表示されることが望ましい。 In addition, when reproducing a moving image of a subject, it is difficult to reproduce the moving image with camera work centered on the subject while manually setting the field of view. Therefore, it is desirable that the camerawork image captured around the subject in the series of surrounding images including the subject is displayed without the user performing a field-of-view setting operation.

＜画像提供システムの全体構成の一例＞
以下、図面を参照して、実施形態について説明する。図１は、画像提供システム１の一例を示している。画像提供システム１において、複数の車両２（２Ａ、２Ｂ、・・・）と画像蓄積サーバ３と画像検索サーバ４とメタデータサーバ５と地図サーバ６と表示端末７とがネットワーク８を介して接続されている。 <Example of overall configuration of image providing system>
Hereinafter, embodiments will be described with reference to the drawings. FIG. 1 shows an example of an image providing system 1. In the image providing system 1, a plurality of vehicles 2 (2A, 2B,...), An image storage server 3, an image search server 4, a metadata server 5, a map server 6, and a display terminal 7 are connected via a network 8. Has been.

車両２は、例えば自動車である。車両２は、全周囲を撮影可能なカメラを搭載している。実施形態では、複数の車両２は移動しながら、所定のタイミングで全周囲を視野とする全周囲画像を撮影する。複数の車両２が移動しながら全周囲画像を撮影しているため、異なる車両２が同じ被写体を撮影することがある。 The vehicle 2 is an automobile, for example. The vehicle 2 is equipped with a camera capable of photographing the entire periphery. In the embodiment, the plurality of vehicles 2 captures an all-around image having the entire view as a field of view at a predetermined timing while moving. Since the plurality of vehicles 2 capture the entire surrounding image while moving, different vehicles 2 may capture the same subject.

各車両２は、所定のタイミングで、ネットワーク８を介して、撮影した全周囲画像に画像ＩＤを付与して画像蓄積サーバ３に送信する。なお、ＩＤはIdentificationの略称である。また、各車両２は、ネットワーク８を介して、撮影した全周囲画像に関するデータ（以下、車載画像メタデータと称する）をメタデータサーバ５に送信する。 Each vehicle 2 gives an image ID to the captured all-around image and transmits it to the image storage server 3 via the network 8 at a predetermined timing. ID is an abbreviation for Identification. In addition, each vehicle 2 transmits data regarding the captured all-around image (hereinafter referred to as in-vehicle image metadata) to the metadata server 5 via the network 8.

画像蓄積サーバ３は、各車両２から送信される全周囲画像を画像ＩＤと関連付けて記憶する。また、画像蓄積サーバ３は、表示端末７から指定された被写体を含む一連の全周囲画像を抽出して、表示端末７に送信する。画像蓄積サーバ３は、記憶装置の一例である。 The image storage server 3 stores the all-around image transmitted from each vehicle 2 in association with the image ID. Further, the image storage server 3 extracts a series of all-around images including the subject designated from the display terminal 7 and transmits the extracted images to the display terminal 7. The image storage server 3 is an example of a storage device.

画像検索サーバ４は、サーバ通信部１１と被写体位置特定部１２と合成画像メタデータ生成部１３と演算部１４とを備える。画像検索サーバ４は、サーバの一例である。サーバ通信部１１は、ネットワーク８との間でデータの通信を行う。サーバ通信部１１は、送信部の一例である。 The image search server 4 includes a server communication unit 11, a subject position specifying unit 12, a composite image metadata generation unit 13, and a calculation unit 14. The image search server 4 is an example of a server. The server communication unit 11 performs data communication with the network 8. The server communication unit 11 is an example of a transmission unit.

被写体位置特定部１２は、指定された被写体の位置を特定する。実施形態では、被写体位置特定部１２は、表示端末７から送信された被写体に関するキーワードに基づいて、地図サーバ６からキーワードに対応する被写体の位置を特定する。被写体位置特定部１２は、特定部の一例である。 The subject position specifying unit 12 specifies the position of the designated subject. In the embodiment, the subject position specifying unit 12 specifies the position of the subject corresponding to the keyword from the map server 6 based on the keyword related to the subject transmitted from the display terminal 7. The subject position specifying unit 12 is an example of a specifying unit.

合成画像メタデータ生成部１３は、合成画像メタデータを生成する。合成画像メタデータは、車載画像メタデータの一連の全周囲画像のうち、表示端末７が指定した被写体を含む区間の全周囲画像を特定するデータである。合成画像メタデータは、特定データの一例である。また、合成画像メタデータ生成部１３は、生成部の一例である。 The composite image metadata generation unit 13 generates composite image metadata. The composite image metadata is data that identifies the all-around image of the section including the subject specified by the display terminal 7 in the series of all-around images of the in-vehicle image metadata. The composite image metadata is an example of specific data. The composite image metadata generation unit 13 is an example of a generation unit.

実施形態では、周囲画像に被写体が含まれるか否かは、被写体の位置と撮影位置との距離が所定の閾値以下であるか否かに基づいて判定される。ただし、周囲画像に被写体が含まれているか否かの判定基準は、上記の例には限定されない。 In the embodiment, whether or not the subject is included in the surrounding image is determined based on whether or not the distance between the position of the subject and the shooting position is equal to or less than a predetermined threshold. However, the criterion for determining whether or not the surrounding image includes a subject is not limited to the above example.

演算部１４は、合成画像メタデータに含まれる各パラメータの値を演算する。例えば、演算部１４は、上述した撮影位置と被写体の位置との間の距離を演算する。また、演算部１４は、撮影位置から見た被写体の位置の方位や仰角等を演算する。 The calculation unit 14 calculates the value of each parameter included in the composite image metadata. For example, the calculation unit 14 calculates the distance between the above-described shooting position and the subject position. Further, the calculation unit 14 calculates the azimuth and elevation angle of the position of the subject viewed from the shooting position.

メタデータサーバ５は、上述した車載画像メタデータおよび合成画像メタデータを記憶する。メタデータサーバ５は、複数の車載画像メタデータおよび複数の合成画像メタデータを記憶する。 The metadata server 5 stores the above-described vehicle-mounted image metadata and composite image metadata. The metadata server 5 stores a plurality of in-vehicle image metadata and a plurality of composite image metadata.

地図サーバ６は、地図上の位置とキーワードとを対応付けて記憶している。キーワードは、被写体を特定する情報である。例えば、キーワードは、施設の名称や住所等であってもよい。地図上の位置は、例えば、経度と緯度とにより特定されてもよい。また、地図サーバ６が記憶する地図は３次元地図であってもよい。 The map server 6 stores a map position and a keyword in association with each other. The keyword is information for specifying the subject. For example, the keyword may be a facility name or address. The position on the map may be specified by, for example, longitude and latitude. The map stored in the map server 6 may be a three-dimensional map.

表示端末７は、ユーザが所有する端末である。例えば、パーソナルコンピュータやスマートフォン、携帯電話、タブレット端末等であってもよい。表示端末７は、端末通信部２１と画像抽出部２２と視野設定部２３と画像処理部２４と表示部２５とを備える。 The display terminal 7 is a terminal owned by the user. For example, a personal computer, a smart phone, a mobile phone, a tablet terminal, or the like may be used. The display terminal 7 includes a terminal communication unit 21, an image extraction unit 22, a visual field setting unit 23, an image processing unit 24, and a display unit 25.

端末通信部２１は、ネットワーク８を介して通信を行う。画像抽出部２２は、被写体を含む区間の合成画像メタデータに基づいて、画像蓄積サーバ３から複数の全周囲画像の抽出を行う。 The terminal communication unit 21 performs communication via the network 8. The image extraction unit 22 extracts a plurality of omnidirectional images from the image storage server 3 based on the composite image metadata of the section including the subject.

視野設定部２３は、画像検索サーバ４から受信する合成画像メタデータに基づいて、全周囲画像の中での視野（仮想視野とも称することがある）を設定する。画像処理部２４は、全周囲画像のうち視野設定部２３が設定した視野の画像を生成する処理を行う。 The visual field setting unit 23 sets a visual field (also referred to as a virtual visual field) in the entire surrounding image based on the composite image metadata received from the image search server 4. The image processing unit 24 performs processing for generating a field-of-view image set by the field-of-view setting unit 23 among the entire surrounding images.

表示部２５は、画像処理部２４が処理した画像を表示する。例えば、表示部２５は、タッチパネルディスプレイであってもよい。表示部２５がタッチパネルディスプレイの場合、表示部２５は操作機能を有する。表示端末７が不図示の操作手段（例えば、ボタン等）を有する場合、表示部２５は操作機能を有しなくてもよい。 The display unit 25 displays the image processed by the image processing unit 24. For example, the display unit 25 may be a touch panel display. When the display unit 25 is a touch panel display, the display unit 25 has an operation function. When the display terminal 7 has an operation unit (not illustrated) (for example, a button), the display unit 25 may not have an operation function.

図１の例では、画像蓄積サーバ３と画像検索サーバ４とメタデータサーバ５とを別個のサーバとして設けているが、これらのサーバを１つのサーバとしてもよい。例えば、画像検索サーバ４が画像蓄積サーバ３およびメタデータサーバ５の機能を有していてもよい。 In the example of FIG. 1, the image storage server 3, the image search server 4, and the metadata server 5 are provided as separate servers, but these servers may be a single server. For example, the image search server 4 may have the functions of the image storage server 3 and the metadata server 5.

＜車両の一例＞
図２は、全周囲を撮影可能なカメラを搭載した車両２の一例を示している。車両２の前後左右にはカメラＣ１〜Ｃ４が設けられている。カメラＣ１〜Ｃ４に対応する視野をＶ１〜Ｖ４とする。 <Example of vehicle>
FIG. 2 shows an example of a vehicle 2 equipped with a camera capable of photographing the entire periphery. Cameras C 1 to C 4 are provided on the front, rear, left and right of the vehicle 2. The visual fields corresponding to the cameras C1 to C4 are V1 to V4.

カメラＣ１〜Ｃ４はそれぞれ広角レンズを用いており、視野Ｖ１〜Ｖ４は３次元的に広い範囲となる。視野Ｖ１および視野Ｖ４は、視野Ｖ２および視野Ｖ２およびＶ３と重なる視野範囲となる。カメラＣ１〜Ｃ４が撮影した画像を合成すると、車両２を中心とした全周囲を視野とした全周囲画像が生成される。 Each of the cameras C1 to C4 uses a wide-angle lens, and the visual fields V1 to V4 are three-dimensionally wide. The visual field V1 and the visual field V4 have a visual field range that overlaps the visual field V2 and the visual fields V2 and V3. When the images captured by the cameras C1 to C4 are combined, an all-around image with the entire view centered on the vehicle 2 as a field of view is generated.

実施形態では、全周囲画像は、カメラＣ１〜Ｃ４が撮影した４チャンネルの画像に基づいて生成することができる。ただし、全周囲画像を生成する図２の例には限定されない。例えば、３６０度全天周レンズを用いた１台のカメラを車両２の屋根に設置して、このカメラが全周囲画像を撮影してもよい。 In the embodiment, the omnidirectional image can be generated based on images of four channels taken by the cameras C1 to C4. However, the present invention is not limited to the example of FIG. For example, a single camera using a 360-degree omnidirectional lens may be installed on the roof of the vehicle 2 and this camera may shoot an all-around image.

また、実施形態では、車両２に搭載するカメラにより生成される画像は全周囲画像であるものとして説明するが、全周囲画像には限定されない。例えば、車両２に搭載したカメラは３５０度の周囲画像を撮影してもよい。 In the embodiment, the image generated by the camera mounted on the vehicle 2 is described as an all-around image, but is not limited to the all-around image. For example, a camera mounted on the vehicle 2 may capture a surrounding image of 350 degrees.

図３は、車両２に含まれる制御部３０の一例を示している。カメラＣ１〜Ｃ４が撮影した画像は、時刻同期がされている。時刻同期がされた４枚の画像は、一次記憶部３１に記憶される。一次記憶部３１は、例えばメモリ等であってもよい。 FIG. 3 shows an example of the control unit 30 included in the vehicle 2. The images taken by the cameras C1 to C4 are time synchronized. The four images synchronized in time are stored in the primary storage unit 31. The primary storage unit 31 may be a memory, for example.

時刻同期がされた４枚の画像は、合成されているか否かにかかわらず、全周囲画像と称する。実施形態では、表示端末７の画像処理部２４で画像処理がされるまで、全周囲画像は合成されていない画像であるものとする。ただし、時刻同期された４枚の画像は、制御部３０で合成されてもよい。 The four images that are time-synchronized are referred to as an all-around image regardless of whether they are synthesized. In the embodiment, it is assumed that the omnidirectional image is an image that has not been synthesized until the image processing unit 24 of the display terminal 7 performs image processing. However, the four images synchronized in time may be combined by the control unit 30.

圧縮符号化部３２は、一次記憶部３１から全周囲画像を読み出して、読み出した全周囲画像を圧縮符号化する。例えば、全周囲画像の４枚の画像（解像度：６４０×４８０）のそれぞれについて、時刻が異なる４枚の全周囲画像を１つにした全周囲画像（解像度：１２８０×９６０）に対して、圧縮符号化部３２は、圧縮符号化を行ってもよい。圧縮符号化には、例えば、Ｈ．２６４等を適用してもよい、圧縮符号化部３２が圧縮符号化を行った全周囲画像は、車載記憶部３６に記憶される。 The compression encoding unit 32 reads the all-around image from the primary storage unit 31 and compress-encodes the read all-around image. For example, for each of four images (resolution: 640 × 480) of the omnidirectional image, compression is performed on the omnidirectional image (resolution: 1280 × 960) obtained by combining four omnidirectional images at different times. The encoding unit 32 may perform compression encoding. Examples of compression encoding include H.264. H.264 or the like, to which the compression coding unit 32 performs compression coding, is stored in the in-vehicle storage unit 36.

時刻計測部３３は、時刻を計測する。位置情報取得部３４は、現在位置の情報を取得する。例えば、位置情報取得部３４は、Global Positioning System(GPS)であってもよい。位置情報取得部３４と一次記憶部３１が記憶する全周囲画像とは時刻同期がされている。 The time measuring unit 33 measures time. The position information acquisition unit 34 acquires information on the current position. For example, the position information acquisition unit 34 may be a Global Positioning System (GPS). The position information acquisition unit 34 and the all-around image stored in the primary storage unit 31 are time-synchronized.

車載画像メタデータ生成部３５は、車載画像メタデータを生成する。車載画像メタデータは、カメラＣ１〜Ｃ４が撮影した全周囲画像に関するデータである。図４の例に示される車載画像メタデータは、画像ＩＤ、撮影時刻、撮影位置、カメラ固有情報およびカメラ設置情報等のパラメータを含む。 The in-vehicle image metadata generation unit 35 generates in-vehicle image metadata. The in-vehicle image metadata is data related to the all-around image captured by the cameras C1 to C4. The in-vehicle image metadata shown in the example of FIG. 4 includes parameters such as an image ID, a shooting time, a shooting position, camera specific information, and camera installation information.

画像ＩＤは、全周囲画像を特定する識別子である。撮影時刻は、カメラＣ１〜Ｃ４が撮影を行ったときに時刻計測部３３が計測した時刻である。撮影位置は、全周囲画像を取得したときに位置情報取得部３４が取得する位置情報である。 The image ID is an identifier that identifies the entire surrounding image. The photographing time is a time measured by the time measuring unit 33 when the cameras C1 to C4 perform photographing. The shooting position is position information acquired by the position information acquisition unit 34 when an all-around image is acquired.

カメラ固有情報は、カメラＣ１〜Ｃ４の解像度やレンズ歪み等に関する情報である。カメラ設置情報は、カメラＣ１〜Ｃ４の設置位置や姿勢等に関する情報である。カメラ固有情報およびカメラ設置情報は、既知の情報である。 The camera specific information is information relating to the resolution, lens distortion, and the like of the cameras C1 to C4. The camera installation information is information related to the installation positions and postures of the cameras C1 to C4. The camera specific information and the camera installation information are known information.

車載画像メタデータは、図４の例に示されたパラメータ以外のパラメータを有していてもよい。車両２に搭載されたカメラＣ１〜Ｃ４は、所定のタイミングで全周囲画像を撮影する。車載画像メタデータ生成部３５は、全周囲画像が生成されるごとに、それぞれ固有の画像ＩＤを付与して、車載画像メタデータを生成する。 The in-vehicle image metadata may have parameters other than those shown in the example of FIG. Cameras C 1 to C 4 mounted on the vehicle 2 capture an all-around image at a predetermined timing. The in-vehicle image metadata generation unit 35 assigns a unique image ID to generate in-vehicle image metadata each time an all-around image is generated.

車載画像メタデータの画像ＩＤと該画像ＩＤに対応する全周囲画像とは関連付けがされて、車載記憶部３６に記憶される。車両通信部３７は、所定のタイミングごとに、車載画像メタデータをメタデータサーバに送信し、全周囲画像を画像蓄積サーバ３に送信する。 The image ID of the in-vehicle image metadata and the all-around image corresponding to the image ID are associated and stored in the in-vehicle storage unit 36. The vehicle communication unit 37 transmits the in-vehicle image metadata to the metadata server and transmits the entire surrounding image to the image storage server 3 at every predetermined timing.

なお、車載記憶部３６に記憶されている全周囲画像および車載画像メタデータのうち、送信済みの情報は削除されることが望ましい。これにより、車載記憶部３６に記憶される情報量を削減することができる。 In addition, it is desirable that the transmitted information is deleted from the all-around image and the in-vehicle image metadata stored in the in-vehicle storage unit 36. Thereby, the information amount memorize | stored in the vehicle-mounted memory | storage part 36 can be reduced.

＜制御部の処理の一例を示すフローチャート＞
次に、制御部３０の各部が行う処理のフローチャートについて、図５を参照して説明する。車両２に搭載されたカメラＣ１〜Ｃ４は撮影を行う（ステップＳ１）。カメラＣ１〜Ｃ４が撮影した４枚の画像は一次記憶部３１に記憶される。 <Flowchart showing an example of processing of the control unit>
Next, a flowchart of processing performed by each unit of the control unit 30 will be described with reference to FIG. The cameras C1 to C4 mounted on the vehicle 2 perform shooting (step S1). The four images captured by the cameras C1 to C4 are stored in the primary storage unit 31.

これら４枚の画像は時刻同期がされており、一次記憶部３１には、カメラＣ１〜Ｃ４が撮影した画像が記憶される（ステップＳ２）。上述したように、実施形態では、時刻同期された４枚の画像を総称して全周囲画像と称する。 These four images are time-synchronized, and images taken by the cameras C1 to C4 are stored in the primary storage unit 31 (step S2). As described above, in the embodiment, the four images synchronized in time are collectively referred to as an all-around image.

圧縮符号化部３２は、全周囲画像の圧縮符号化を行う（ステップＳ３）。制御部３０は、圧縮符号化された全周囲画像に対して画像ＩＤを付与する（ステップＳ４）。制御部３０は画像ＩＤが付与された全周囲画像を車載記憶部３６に記憶する（ステップＳ５）。 The compression encoding unit 32 performs compression encoding of the entire surrounding image (step S3). The control unit 30 assigns an image ID to the omnidirectional image that has been compression-encoded (step S4). The control part 30 memorize | stores the omnidirectional image to which image ID was provided in the vehicle-mounted memory | storage part 36 (step S5).

ステップＳ１〜Ｓ５の処理と並行して、以下のステップＳ６〜Ｓ９の処理が行われる。位置情報取得部３４は、カメラＣ１〜Ｃ４が撮影したときの位置情報を取得する（ステップＳ６）。車載画像メタデータ生成部３５は、時刻計測部３３が計測した時刻と位置情報取得部３４が取得した位置情報とを関連付ける（ステップＳ７）。 In parallel with the processes of steps S1 to S5, the following processes of steps S6 to S9 are performed. The position information acquisition unit 34 acquires position information when the cameras C1 to C4 are photographed (step S6). The in-vehicle image metadata generation unit 35 associates the time measured by the time measurement unit 33 with the position information acquired by the position information acquisition unit 34 (step S7).

車載画像メタデータ生成部３５は、位置情報取得部３４から取得した位置情報に対応する画像ＩＤを取得する（ステップＳ８）。これにより、時刻情報と位置情報とを含む車載画像メタデータが全周囲画像と関連付けられて車載記憶部３６に記憶される（ステップＳ９）。 The in-vehicle image metadata generation unit 35 acquires an image ID corresponding to the position information acquired from the position information acquisition unit 34 (step S8). Thereby, the vehicle-mounted image metadata including the time information and the position information is associated with the all-around image and stored in the vehicle-mounted storage unit 36 (step S9).

車両２は移動しながら、カメラＣ１〜Ｃ４を用いて、所定タイミングで撮影を行う。従って、異なる地点で撮影された経時的な一連の全周囲画像が生成される。そして、時刻ごとに、全周囲画像と車載画像メタデータとは関連付けられて車載記憶部３６に記憶される。 While the vehicle 2 is moving, the camera 2 uses the cameras C 1 to C 4 to take an image at a predetermined timing. Therefore, a series of omnidirectional images over time taken at different points is generated. Then, for each time, the all-around image and the in-vehicle image metadata are associated with each other and stored in the in-vehicle storage unit 36.

車両通信部３７は、画像蓄積サーバ３とコネクションが確立されているか否かを判定する（ステップＳ１０）。コネクションが確立されている場合（ステップＳ１０でＹＥＳ）、車両通信部３７は全周囲画像を画像蓄積サーバ３に送信する（ステップＳ１１）。一方、コネクションが確立されていない場合（ステップＳ１０でＮＯ）、全周囲画像は送信されない。 The vehicle communication unit 37 determines whether or not a connection with the image storage server 3 has been established (step S10). When the connection is established (YES in step S10), the vehicle communication unit 37 transmits the all-around image to the image storage server 3 (step S11). On the other hand, when the connection is not established (NO in step S10), the all-around image is not transmitted.

車両通信部３７は、メタデータサーバ５とコネクションが確立されているか否かを判定する（ステップＳ１２）。コネクションが確立されている場合（ステップＳ１２でＹＥＳ）、車両通信部３７は車載画像メタデータをメタデータサーバ５に送信する（ステップＳ１３）。一方、コネクションが確立されていない場合（ステップＳ１２でＮＯ）、車載画像メタデータは送信されない。 The vehicle communication part 37 determines whether the connection with the metadata server 5 is established (step S12). When the connection is established (YES in step S12), the vehicle communication unit 37 transmits the in-vehicle image metadata to the metadata server 5 (step S13). On the other hand, when the connection is not established (NO in step S12), the in-vehicle image metadata is not transmitted.

制御部３０は、送信済みの全周囲画像および車載画像メタデータを車載記憶部３６から削除してもよい。カメラＣ１〜Ｃ４は所定タイミングで撮影を行うため、全周囲画像および車載映像メタデータは、撮影を行うごとに増えていく。従って、制御部３０が送信済みの全周囲画像および車載画像メタデータを車載記憶部３６から削除することで、車載記憶部３６に記憶される情報量を削減することができる。 The control unit 30 may delete the transmitted all-around image and in-vehicle image metadata from the in-vehicle storage unit 36. Since the cameras C1 to C4 shoot at a predetermined timing, the all-around image and the in-vehicle video metadata increase every time shooting is performed. Therefore, the amount of information stored in the in-vehicle storage unit 36 can be reduced by deleting the entire surrounding image and in-vehicle image metadata that have been transmitted by the control unit 30 from the in-vehicle storage unit 36.

＜画像蓄積サーバおよびメタデータサーバの処理の一例＞
図６（Ａ）は、画像蓄積サーバ３の処理の一例を示している。画像蓄積サーバ３は、車両２の車両通信部３７との間でコネクションを確立しているか否かを判定する（ステップＳ２１）。 <Example of processing of image storage server and metadata server>
FIG. 6A shows an example of processing of the image storage server 3. The image storage server 3 determines whether or not a connection is established with the vehicle communication unit 37 of the vehicle 2 (step S21).

コネクションが確立されている場合（ステップＳ２１でＹＥＳ）、画像蓄積サーバ３は、車両通信部３７から全周囲画像を受信したか否かを判定する（ステップＳ２２）。画像蓄積サーバ３は、全周囲画像を受信したと判定した場合（ステップＳ２２でＹＥＳ）、受信した全周囲画像を画像ＩＤと関連付けて記憶する（ステップＳ２３）。これにより、画像蓄積サーバ３に全周囲画像が蓄積される。 When the connection is established (YES in step S21), the image storage server 3 determines whether or not an all-around image has been received from the vehicle communication unit 37 (step S22). If it is determined that the omnidirectional image has been received (YES in step S22), the image storage server 3 stores the received omnidirectional image in association with the image ID (step S23). As a result, the all-around image is accumulated in the image accumulation server 3.

画像蓄積サーバ３は、車両通信部３７との間でコネクションが確立されていない場合（ステップＳ２１でＮＯ）、または画像を受信しない場合（ステップＳ２２でＮＯ）、ステップＳ２３の処理を行わない。 If the connection with the vehicle communication unit 37 is not established (NO in step S21) or if no image is received (NO in step S22), the image storage server 3 does not perform the process of step S23.

図６（Ｂ）は、メタデータサーバ５の処理の一例を示している。メタデータサーバ５は、車両２の車両通信部３７との間でコネクションを確立しているか否かを判定する（ステップＳ２４）。 FIG. 6B shows an example of processing of the metadata server 5. The metadata server 5 determines whether or not a connection is established with the vehicle communication unit 37 of the vehicle 2 (step S24).

コネクションが確立されている場合（ステップＳ２４でＹＥＳ）、メタデータサーバ５は、車両通信部３７から車載映像メタデータを受信したか否かを判定する（ステップＳ２５）。メタデータサーバ５は、車載映像メタデータを受信したと判定した場合（ステップＳ２５でＹＥＳ）、受信した車載映像メタデータを記憶する（ステップＳ２３）。 When the connection is established (YES in step S24), the metadata server 5 determines whether or not the in-vehicle video metadata is received from the vehicle communication unit 37 (step S25). When it is determined that the in-vehicle video metadata has been received (YES in step S25), the metadata server 5 stores the received in-vehicle video metadata (step S23).

メタデータサーバ５は、車両通信部３７との間でコネクションが確立されていない場合（ステップＳ２４でＮＯ）、または車載映像メタデータを受信しない場合（ステップＳ２５でＮＯ）、ステップＳ２６の処理を行わない。 If the connection with the vehicle communication unit 37 is not established (NO in step S24), or if the in-vehicle video metadata is not received (NO in step S25), the metadata server 5 performs the process of step S26. Absent.

＜表示端末と画像検索サーバとの間で行われる処理の一例＞
次に、表示端末７と画像検索サーバ４とが行う処理の一例について、図７のシーケンスチャートを参照して説明する。表示端末７のユーザは、対象となる被写体の情報を表示端末７に入力する。例えば、表示部２５が操作機能を有する場合、ユーザは、表示部２５を用いて、対象となる被写体の情報を入力する。 <Example of processing performed between display terminal and image search server>
Next, an example of processing performed by the display terminal 7 and the image search server 4 will be described with reference to the sequence chart of FIG. The user of the display terminal 7 inputs information on the subject to be displayed on the display terminal 7. For example, when the display unit 25 has an operation function, the user uses the display unit 25 to input information on a subject to be processed.

表示端末７は、入力された被写体（ユーザにより指定された被写体）の情報を画像検索サーバ４に送信する。実施形態では、ユーザにより指定された被写体の情報は、キーワードであるものとする。 The display terminal 7 transmits information on the input subject (subject specified by the user) to the image search server 4. In the embodiment, it is assumed that the subject information designated by the user is a keyword.

キーワードは、上述したように、施設の名称や住所等であってもよい。従って、端末通信部２１は、入力されたキーワードを画像検索サーバ４に送信する（ステップＳＣ１）。なお、ユーザにより指定される被写体の情報は、キーワードには限定されない。例えば、被写体の情報は、被写体の経度および緯度を示す位置情報であってもよい。 As described above, the keyword may be the name or address of the facility. Accordingly, the terminal communication unit 21 transmits the input keyword to the image search server 4 (step SC1). Note that the subject information specified by the user is not limited to keywords. For example, the subject information may be position information indicating the longitude and latitude of the subject.

画像検索サーバ４は、受信したキーワードを地図サーバ６に送信する。地図サーバ６は、キーワードに対応付けられている位置情報を画像検索サーバ４に送信する。これにより、画像検索サーバ４は、キーワードに基づく位置情報を取得する（ステップＳＣ２）。取得した位置情報は、ユーザにより指定された被写体の位置情報である。 The image search server 4 transmits the received keyword to the map server 6. The map server 6 transmits the position information associated with the keyword to the image search server 4. Thereby, the image search server 4 acquires position information based on the keyword (step SC2). The acquired position information is the position information of the subject specified by the user.

画像検索サーバ４は、メタデータサーバ５に記憶されている複数の車載映像メタデータのうち、取得した位置情報（図４の例では撮影位置）を含む車載映像メタデータをメタデータサーバ５から抽出する（ステップＳＣ３）。抽出される車載映像メタデータは複数の場合もある。 The image search server 4 extracts, from the metadata server 5, in-vehicle video metadata including the acquired position information (shooting position in the example of FIG. 4) from among the plurality of in-vehicle video metadata stored in the metadata server 5. (Step SC3). There may be a plurality of in-vehicle video metadata to be extracted.

画像検索サーバ４の合成画像メタデータ生成部１３は、車載映像メタデータから被写体を含む区間の画像を特定し、合成画像メタデータを生成する（ステップＳＣ４）。合成画像メタデータの生成の一例について、図８のフローチャートを参照して説明する。 The composite image metadata generation unit 13 of the image search server 4 specifies the image of the section including the subject from the in-vehicle video metadata, and generates composite image metadata (step SC4). An example of the generation of the composite image metadata will be described with reference to the flowchart in FIG.

演算部１４は、抽出された車載映像メタデータに含まれる撮影位置と被写体の位置との間の距離Ｄを算出する（ステップＳ３１）。距離Ｄは、２つの地点間の距離を算出する以下の式（１）を用いてもよい。 The calculation unit 14 calculates a distance D between the shooting position and the subject position included in the extracted in-vehicle video metadata (step S31). As the distance D, the following formula (1) for calculating the distance between two points may be used.

なお、以下の式（１）において、撮影位置は（経度ｘ１、緯度ｙ１）とし、被写体の位置は（経度ｘ２、緯度ｙ２）とする。撮影位置は、車載映像メタデータに含まれ、被写体の位置は、ステップＳＣ２で取得した位置情報により特定される。ｒは地球の赤道半径である。
「Ｄ＝ｒ×ｃｏｓ^−１（ｓｉｎ（ｙ１）×ｓｉｎ（ｙ２）＋ｃｏｓ（ｙ１）×ｃｏｓ（ｙ２）×ｃｏｓ（ｘ２−ｘ１））」・・・（式１） In the following formula (1), the shooting position is (longitude x1, latitude y1), and the subject position is (longitude x2, latitude y2). The shooting position is included in the in-vehicle video metadata, and the position of the subject is specified by the position information acquired in step SC2. r is the equator radius of the earth.
“D = r × cos ⁻¹ (sin (y1) × sin (y2) + cos (y1) × cos (y2) × cos (x2−x1))” (Formula 1)

次に、演算部１４は、上記の式（１）で得られた撮影位置と被写体の位置との距離Ｄが閾値Ｄｔｈ以下となる区間Ｔｓｅｇを算出する。区間Ｔｓｅｇは、全周囲画像に被写体が含まれている区間を示す。 Next, the calculation unit 14 calculates a section Tseg in which the distance D between the shooting position obtained by the above equation (1) and the subject position is equal to or less than the threshold value Dth. The section Tseg indicates a section in which the subject is included in the entire surrounding image.

図９は、車載画像メタデータのうち区間Ｔｓｅｇ（ハッチングを施した区間）を特定した一例を示している。撮影位置と被写体の位置との距離が短ければ、全周囲画像に被写体は鮮明に写る。一方、撮影位置と被写体の位置との距離が長ければ、全周囲画像に写る被写体の鮮明度は低下する。 FIG. 9 shows an example in which the section Tseg (the hatched section) is specified in the in-vehicle image metadata. If the distance between the shooting position and the subject position is short, the subject appears clearly in the entire surrounding image. On the other hand, if the distance between the shooting position and the subject position is long, the sharpness of the subject in the all-around image decreases.

そこで、全周囲画像に被写体が含まれるか否かを判定する基準を上記の閾値Ｄｔｈとする。閾値Ｄｔｈは任意の値が設定されてもよい。画像検索サーバ４に対して、閾値Ｄｔｈが入力されてもよい。 Therefore, the above-mentioned threshold value Dth is used as a reference for determining whether or not a subject is included in the all-around image. An arbitrary value may be set as the threshold value Dth. A threshold value Dth may be input to the image search server 4.

また、後述する図１０の被写体のバウンディングボックスＢＢが得られる場合には、車載画像メタデータに含まれるカメラ固有情報に基づいて、全周囲画像に被写体が所定画素以上で写るために必要な距離をＤｔｈとしてもよい。 When the bounding box BB of the subject shown in FIG. 10 to be described later is obtained, the distance necessary for the subject to appear in the entire surrounding image at a predetermined pixel or more is determined based on the camera-specific information included in the in-vehicle image metadata. It may be Dth.

図９の場合、時刻Ｔ１から始まる一連の全周囲画像のうち、時刻Ｔ４からＴ７の区間における撮影位置と被写体の位置とが「Ｄ≧Ｄｔｈ」を満たす区間である。時刻Ｔ３または時刻Ｔ８における全周囲画像にも被写体が含まれている可能性がある。ただし、図９の例では、時刻Ｔ３および時刻Ｔ８は「Ｄ≧Ｄｔｈ」を満たさないため、合成画像メタデータ生成部１３は、全周囲画像に被写体が含まれていないと判定する。 In the case of FIG. 9, among the series of all-around images starting from time T1, the shooting position and the subject position in the section from time T4 to T7 satisfy “D ≧ Dth”. There is a possibility that the entire surrounding image at the time T3 or the time T8 also includes the subject. However, in the example of FIG. 9, since the time T3 and the time T8 do not satisfy “D ≧ Dth”, the composite image metadata generation unit 13 determines that the subject is not included in the all-around image.

撮影位置は、図９の例に示すように、経時的に変化する。図９の例では、時刻Ｔ４においては、撮影位置は位置Ｐ４であり、時刻Ｔ５においては、位置Ｐ５である。つまり、車両２に搭載されたカメラＣ１〜Ｃ４が所定のタイミングで撮影を行うため、撮影したタイミングごとの一連の全周囲画像が生成される。 The shooting position changes with time as shown in the example of FIG. In the example of FIG. 9, the shooting position is position P4 at time T4, and is position P5 at time T5. That is, since the cameras C1 to C4 mounted on the vehicle 2 perform shooting at a predetermined timing, a series of all-around images for each shooting timing is generated.

時刻Ｔにおける距離をＤｔとする。上記の式（１）は、Ｄｔ＝Ｆ（Ｘ、Ｐｔ）としてもよい。Ｘは、被写体の位置であり、Ｐｔは時刻ｔにおける撮影位置である。そして、時刻Ｔにおける距離Ｄｔは、上記の式（１）の関数Ｆにより求めることができる。 The distance at time T is Dt. The above equation (1) may be Dt = F (X, Pt). X is the position of the subject, and Pt is the shooting position at time t. Then, the distance Dt at time T can be obtained by the function F of the above equation (1).

また、上述したように、「Ｄ≧Ｄｔｈ」となる開始時刻をＴｓ、終了時刻Ｔｅとすると、上記の区間Ｔｓｅｇは「Ｔｓ，Ｔｅ」として表すことができる。図９の例の場合、区間Ｔｓｅｇの開始時刻ＴｓはＴ４であり、終了時刻ＴｅはＴ７である。 Further, as described above, if the start time when “D ≧ Dth” is Ts and the end time Te, the section Tseg can be expressed as “Ts, Te”. In the example of FIG. 9, the start time Ts of the section Tseg is T4, and the end time Te is T7.

図８の例に示すように、区間Ｔｓｅｇが算出された後、合成画像メタデータ生成部１３は、方位情報θを算出する（ステップＳ３３）。方位情報θは、撮影位置から被写体の位置を撮影したときの方位である。方位情報θは、例えば、以下の式（２）により得られる。なお、ｘ１、ｘ２、ｙ１およびｙ２は、上述した式（１）と同様である。
「θ＝９０−ａｒｃｔａｎ２（ｓｉｎ（ｘ２−ｘ１）、ｃｏｓ（ｙ１）×ｔａｎ（ｙ２）−ｓｉｎ（ｙ１）×ｃｏｓ（ｘ２−ｘ１））」・・・式（２） As shown in the example of FIG. 8, after the section Tseg is calculated, the composite image metadata generation unit 13 calculates the azimuth information θ (step S33). The azimuth information θ is the azimuth when the subject position is photographed from the photographing position. The azimuth information θ is obtained by the following equation (2), for example. Note that x1, x2, y1, and y2 are the same as those in the above-described formula (1).
“Θ = 90−arctan2 (sin (x2−x1), cos (y1) × tan (y2) −sin (y1) × cos (x2−x1))” (2)

これにより、撮影位置から見た被写体の方位情報θが得られる。車両２が移動するごとに撮影位置が変化するため、方位情報θも撮影位置によって変化する。撮影位置は撮影時刻によって変化するため、方位情報θも時刻によって変化する。従って、時刻ｔの方位情報θｔはθｔ＝Ｇ（Ｘ、Ｐｔ）として表すことができる。つまり、式（２）を関数Ｇとして表せば、θｔ＝Ｇ（Ｘ、Ｐｔ）となる。 Thereby, the azimuth information θ of the subject viewed from the photographing position is obtained. Since the shooting position changes each time the vehicle 2 moves, the azimuth information θ also changes depending on the shooting position. Since the shooting position changes with the shooting time, the azimuth information θ also changes with the time. Therefore, the azimuth information θt at time t can be expressed as θt = G (X, Pt). That is, if Expression (2) is expressed as a function G, θt = G (X, Pt).

演算部１４は、仰角情報φを算出する（ステップＳ３４）。仰角情報φは、水平面方向を基準とした被写体の仰角を示す。仰角情報φも時刻と共に変化するため、仰角情報φｔとすることができる。仰角情報φｔは、例えば、図１０の破線で示すバウンディングボックスＢＢに基づいて得ることができる。 The computing unit 14 calculates elevation angle information φ (step S34). The elevation angle information φ indicates the elevation angle of the subject relative to the horizontal plane direction. Since the elevation angle information φ also changes with time, the elevation angle information φt can be obtained. The elevation angle information φt can be obtained, for example, based on the bounding box BB indicated by the broken line in FIG.

仰角情報φｔは、被写体を３次元的に囲う直方体形状のバウンディングボックスＢＢが設定されている場合、バウンディングボックスＢＢの８つの頂点Ｂ１〜Ｂ８の中心座標Ｂｃに基づいて、以下の式（３）により得ることができる。なお、中心座標Ｂｃは（ｘｃ、ｙｃ、ｚｃ）とする。ｘｃ、ｙｘ、ｚｘは、バウンディングボックスＢＢのｘ軸、ｙ軸、ｚ軸の中心座標であることを示す。
「φｔ＝ａｒｃｔａｎ（ｚｃ／Ｄｔ）」・・・（式３） The elevation angle information φt is obtained by the following equation (3) based on the center coordinates Bc of the eight vertices B1 to B8 of the bounding box BB when a rectangular parallelepiped bounding box BB surrounding the subject three-dimensionally is set. Can be obtained. The center coordinate Bc is (xc, yc, zc). xc, yx, and zx indicate the center coordinates of the x-axis, y-axis, and z-axis of the bounding box BB.
“Φt = arctan (zc / Dt)” (Formula 3)

演算部１４は、視野情報ＦＯＶを算出する（ステップＳ３５）。視野情報ＦＯＶは、図１０に一例として示すバウンディングボックスＢＢを規定する。視野情報ＦＯＶは、水平方向の視野角（ｆｏｖＨ）および垂直方向の視野角（ｆｏｖＶ）により規定される。 The computing unit 14 calculates the visual field information FOV (step S35). The visual field information FOV defines a bounding box BB shown as an example in FIG. The visual field information FOV is defined by a horizontal viewing angle (fovH) and a vertical viewing angle (fovV).

例えば、視野情報ＦＯＶは、バウンディングボックスＢＢをちょうど包含するように設定されてもよいし、固定値としてもよい。視野情報ＦＯＶが固定値の場合、演算部１４は視野情報ＦＯＶを求める演算は行わない。次に、演算部１４は、可視性Ｖを算出する（ステップＳ３６）。 For example, the visual field information FOV may be set to just include the bounding box BB, or may be a fixed value. When the visual field information FOV is a fixed value, the calculation unit 14 does not perform calculation for obtaining the visual field information FOV. Next, the calculating part 14 calculates visibility V (step S36).

可視性Ｖも、時刻ごとに変化するため、時刻ｔにおける可視性Ｖを可視性Ｖｔとする。可視性Ｖｔは、撮影位置から被写体の立体領域（例えば、バウンディングボックスＢＢ）が見える程度を示す。つまり、可視性Ｖｔは、時刻ｔにおける全周囲画像に写っている被写体の率を示す。 Since the visibility V also changes with time, the visibility V at time t is defined as visibility Vt. Visibility Vt indicates the degree to which a three-dimensional area (for example, a bounding box BB) of a subject can be seen from the shooting position. That is, the visibility Vt indicates the rate of the subject that appears in the all-around image at time t.

例えば、図１０に示すように、撮影位置（Ｐｔ）からバウンディングボックスＢＢに向けて所定の角度分解能で視線ベクトルを発したときの該視線ベクトルの本数をＮ本とする。図１１の例は、被写体を含むポリゴンデータの一例である。演算部１４は、視線ベクトルが被写体に衝突する率を演算する。演算部１４は、被写体のポリゴンにヒット（衝突）した総数がＭ本の場合、可視性Ｖｔを以下の式（４）で演算する。
「Ｖｔ＝Ｍ／Ｎ」・・・（式４） For example, as shown in FIG. 10, the number of line-of-sight vectors when a line-of-sight vector is emitted with a predetermined angular resolution from the shooting position (Pt) toward the bounding box BB is N. The example of FIG. 11 is an example of polygon data including a subject. The calculation unit 14 calculates the rate at which the line-of-sight vector collides with the subject. When the total number of hits (collisions) on the polygons of the subject is M, the calculation unit 14 calculates the visibility Vt by the following expression (4).
“Vt = M / N” (Formula 4)

例えば、撮影位置と被写体との間に何らかの遮蔽物があると、撮影位置から発した視線ベクトルのうち遮蔽物にヒットした視線ベクトルは、被写体にはヒットしない。このため、Ｍが低下するため、可視性Ｖｔも低下する。 For example, if there is any shielding object between the shooting position and the subject, the line-of-sight vector that hits the shielding object among the line-of-sight vectors emitted from the shooting position does not hit the subject. For this reason, since M decreases, visibility Vt also decreases.

演算部１４は、可視良好性Ｑを演算する（ステップＳ３７）。可視良好性Ｑは、ステップＳ３２で求められた区間Ｔｓｅｇにおける可視性Ｖｔから統合的に評価される値である。例えば、可視良好性Ｑは、以下の式（５）のように、区間Ｔｓｅｇにおける全ての可視性Ｖの平均値として演算してもよい。
The computing unit 14 computes the visibility goodness Q (step S37). The visibility goodness Q is a value that is comprehensively evaluated from the visibility Vt in the section Tseg obtained in step S32. For example, the visibility goodness Q may be calculated as an average value of all the visibility V in the section Tseg as in the following formula (5).

なお、式（５）において、ｆｓは総フレーム数を示す。例えば、車載画像メタデータと合成映像メタデータとの関係の例を示す図１２の例の場合、区間Ｔｓｅｇにおける可視良好性Ｑは、以下の式（６）のようになる。
「Ｑ＝（Ｖ４＋Ｖ５＋Ｖ６＋Ｖ７）／４」・・・（式６） In equation (5), fs indicates the total number of frames. For example, in the case of the example of FIG. 12 showing an example of the relationship between the in-vehicle image metadata and the composite video metadata, the visibility goodness Q in the section Tseg is expressed by the following formula (6).
“Q = (V4 + V5 + V6 + V7) / 4” (Formula 6)

以上により、図７のステップＳＣ４の処理が終了し、合成画像メタデータが生成される。図１２の例に示されるように、車載画像メタデータのうち、時刻Ｔ４からＴ７までの間が区間Ｔｓｅｇ（ハッチングを施してある区間）である。 Thus, the process of step SC4 in FIG. 7 ends, and composite image metadata is generated. As shown in the example of FIG. 12, in the in-vehicle image metadata, a section from time T4 to T7 is a section Tseg (section where hatching is performed).

演算部１４は、区間Ｔｓｅｇの間の方位θ４からθ７、仰角φ４からφ７、視野ＦＯＶ４からＦＯＶ７および可視性Ｖ４からＶ７を演算する。図１３は、合成画像メタデータに含まれるパラメータの一例である。 The calculation unit 14 calculates the azimuths θ4 to θ7, the elevation angles φ4 to φ7, the visual fields FOV4 to FOV7, and the visibility V4 to V7 during the section Tseg. FIG. 13 is an example of parameters included in the composite image metadata.

合成画像メタデータＩＤは、合成画像メタデータを識別する識別子である。画像ＩＤ、撮影時刻、撮影位置、カメラ固有情報およびカメラ設置情報は、合成画像メタデータを生成する元となる車載画像メタデータに含まれるデータである。 The composite image metadata ID is an identifier for identifying composite image metadata. The image ID, shooting time, shooting position, camera-specific information, and camera installation information are data included in the in-vehicle image metadata that is a source for generating the composite image metadata.

また、区間情報は、合成画像メタデータが区間Ｔｓｅｇの開始時刻および終了時刻を示す。上述したように、区間情報は「Ｔｓ、Ｔｅ」であるため、図１２の例の場合、区間情報は「Ｔ４、Ｔ７」になる。 The section information indicates the start time and end time of the section Tseg for the composite image metadata. Since the section information is “Ts, Te” as described above, the section information is “T4, T7” in the example of FIG.

合成画像メタデータ生成部１３は、区間Ｔｓｅｇに含まれる各時刻について、合成画像メタデータを生成する。図１２の例の場合、合成画像メタデータ生成部１３は、時刻Ｔ４からＴ７の４つの時刻について、合成画像メタデータを生成する。 The composite image metadata generation unit 13 generates composite image metadata for each time included in the section Tseg. In the case of the example in FIG. 12, the composite image metadata generation unit 13 generates composite image metadata for four times from time T4 to T7.

上述したように、指定された被写体を含む複数の合成画像メタデータが生成されることがある。サーバ通信部１１は、生成された複数の合成画像メタデータをそれぞれ特定する合成画像メタデータＩＤをリスト形式で表示端末７に送信する（ステップＳＣ５）。表示端末７の端末通信部２１は、合成画像メタデータＩＤのリストを受信する。 As described above, a plurality of composite image metadata including a designated subject may be generated. The server communication unit 11 transmits the composite image metadata ID for specifying each of the generated composite image metadata to the display terminal 7 in a list format (step SC5). The terminal communication unit 21 of the display terminal 7 receives the list of composite image metadata IDs.

表示端末７は、リストに含まれる合成画像メタデータＩＤで特定される複数の合成画像メタデータをメタデータサーバ５から取得する（ステップＳＣ６）。このために、端末通信部２１は、リストに含まれる合成画像メタデータＩＤをメタデータサーバ５に送信する。 The display terminal 7 acquires a plurality of composite image metadata specified by the composite image metadata ID included in the list from the metadata server 5 (step SC6). For this purpose, the terminal communication unit 21 transmits the composite image metadata ID included in the list to the metadata server 5.

メタデータサーバ５は、合成画像メタデータＩＤで特定される複数の合成画像メタデータを表示端末５に送信する。これにより、表示端末７は、複数の合成画像メタデータを取得する。 The metadata server 5 transmits a plurality of composite image metadata specified by the composite image metadata ID to the display terminal 5. Thereby, the display terminal 7 acquires a plurality of composite image metadata.

表示端末７は、取得した複数の合成画像メタデータのうち何れの合成画像メタデータを表示するかを選択する画面（以下、選択画面と称する）を表示する（ステップＳＣ７）。なお、表示端末７が受信した合成画像メタデータが１つの場合は、ステップＳＣ７の処理は行われなくてもよい。 The display terminal 7 displays a screen (hereinafter referred to as a selection screen) for selecting which composite image metadata is to be displayed among the plurality of obtained composite image metadata (step SC7). Note that when the composite image metadata received by the display terminal 7 is one, the process of step SC7 may not be performed.

図１４は、選択画面の一例を示している。選択画面は、１つの合成画像メタデータＩＤごとに、可視良好性Ｑを表示する。図１４の例では、１つの合成画像メタデータＩＤについて、撮影時刻とサムネイルとを表示している。 FIG. 14 shows an example of the selection screen. The selection screen displays the visibility goodness Q for each composite image metadata ID. In the example of FIG. 14, the shooting time and thumbnail are displayed for one composite image metadata ID.

可視良好性Ｑは、区間Ｔｓｅｇにおける１つの全周囲画像についての可視性Ｖの統合的な評価値であり、動画表示したときの平均的な品質を示す。ユーザは、表示部２５に表示されている選択画面の中から所望の合成画像メタデータＩＤを選択する。従って、ユーザは、可視良好性Ｑの値が高い合成画像メタデータＩＤを選択することで、指定した被写体を含む高品質な動画を視聴することができる。 The visual goodness Q is an integrated evaluation value of the visibility V for one entire surrounding image in the section Tseg, and indicates an average quality when a moving image is displayed. The user selects a desired composite image metadata ID from the selection screen displayed on the display unit 25. Therefore, the user can view a high-quality moving image including the designated subject by selecting the composite image metadata ID having a high visibility goodness Q value.

なお、図１４の例において、表示部２５は、可視良好性Ｑではなく、可視性Ｖの最大値を表示してもよい。この場合、ユーザは、合成画像メタデータＩＤで特定される複数の静止画のうち、可視性Ｖが高い静止画を選択することができる。これにより、表示部２５は、指定した被写体を含む高品質な静止画を表示することができる。 In the example of FIG. 14, the display unit 25 may display the maximum value of the visibility V instead of the visibility goodness Q. In this case, the user can select a still image with high visibility V among a plurality of still images specified by the composite image metadata ID. Thereby, the display unit 25 can display a high-quality still image including the designated subject.

表示端末７の端末通信部２１は、選択された合成画像メタデータＩＤをメタデータサーバ５に送信する。メタデータサーバ５は、選択された合成画像メタデータを表示端末７に送信する。これにより、表示端末７は、選択された合成画像メタデータを取得する（ステップＳＣ８）。 The terminal communication unit 21 of the display terminal 7 transmits the selected composite image metadata ID to the metadata server 5. The metadata server 5 transmits the selected composite image metadata to the display terminal 7. Thereby, the display terminal 7 acquires the selected composite image metadata (step SC8).

図７の例に示すように、画像抽出部２２は、合成画像メタデータＩＤが選択されると、選択された合成画像メタデータＩＤに基づく一連の画像を画像蓄積サーバ３から抽出する（ステップＳＣ９）。合成画像メタデータは、画像ＩＤと区間情報とを含む。図１２の例では、区間情報は、時刻Ｔ４から時刻Ｔ７を示す情報になる。 As shown in the example of FIG. 7, when the composite image metadata ID is selected, the image extraction unit 22 extracts a series of images based on the selected composite image metadata ID from the image storage server 3 (step SC9). ). The composite image metadata includes an image ID and section information. In the example of FIG. 12, the section information is information indicating time T4 to time T7.

従って、表示端末７は、選択された合成画像メタデータに含まれる画像ＩＤと区間情報とを画像蓄積サーバ３に送信する。画像蓄積サーバ３は、記憶している複数の全周囲画像のうち、画像ＩＤで特定される全周囲画像について、区間情報で特定される一連の全周囲画像を抽出する。画像蓄積サーバ３は、抽出した一連の全周囲画像を端末通信部２１に送信する。このときに抽出される一連の全周囲画像は、区間Ｔｓｅｇの一連の全周囲画像になる。 Accordingly, the display terminal 7 transmits the image ID and the section information included in the selected composite image metadata to the image storage server 3. The image storage server 3 extracts a series of all-around images specified by the section information for all-around images specified by the image ID from among the plurality of stored all-around images. The image storage server 3 transmits the extracted series of all-around images to the terminal communication unit 21. A series of all-around images extracted at this time becomes a series of all-around images in the section Tseg.

端末通信部２１は、区間Ｔｓｅｇの一連の全周囲画像を受信する。図７の例に示すように、表示端末７は、全周囲画像の中で視野を設定し、設定された視野で画像処理部２４が画像処理を行う。そして、画像処理された画像が表示部２５に表示される（ステップＳＣ１０）。 The terminal communication unit 21 receives a series of all-around images in the section Tseg. As shown in the example of FIG. 7, the display terminal 7 sets a field of view in the entire surrounding image, and the image processing unit 24 performs image processing with the set field of view. Then, the image processed image is displayed on the display unit 25 (step SC10).

図１５は、ステップＳＣ８の処理の例を示す。表示端末７が受信する全周囲画像は、上述したように、カメラＣ１〜Ｃ４が撮影した４枚の画像である。画像処理部２４は、取得した全周囲画像を合成して、車両２を中心とした３次元曲面を設定する（ステップＳ４１）。３次元曲面は、例えば、ポリゴンで定義されてもよい。 FIG. 15 shows an example of the process of step SC8. As described above, the all-around image received by the display terminal 7 is four images captured by the cameras C1 to C4. The image processing unit 24 synthesizes the acquired all-around image and sets a three-dimensional curved surface centered on the vehicle 2 (step S41). The three-dimensional curved surface may be defined by a polygon, for example.

画像処理部２４は、テクスチャの設定を行う（ステップＳ４２）。例えば、画像処理部２４は、合成画像メタデータに含まれるカメラ固有情報およびカメラ設置情報に基づいて、仮想空間に配置したカメラから３次元曲面のポリゴン頂点を向く視線ベクトルを算出する。 The image processing unit 24 performs texture setting (step S42). For example, the image processing unit 24 calculates a line-of-sight vector facing the polygon vertex of the three-dimensional curved surface from the camera arranged in the virtual space based on the camera specific information and the camera installation information included in the composite image metadata.

そして、画像処理部２４は、視線ベクトルに対応したカメラの画素位置をカメラ固有情報に基づいて算出する。これにより、画像処理部２４は、ポリゴン頂点とテクスチャの画素位置との対応関係を算出することができる。 Then, the image processing unit 24 calculates the pixel position of the camera corresponding to the line-of-sight vector based on the camera specific information. Thereby, the image processing unit 24 can calculate the correspondence between the polygon vertex and the texture pixel position.

テクスチャが設定された全周囲画像の一例を図１６に示す。図１６では、楕円形で全周囲画像４１を示しているが、上述したように全周囲画像は３次元曲面である。例えば、テクスチャが設定された全周囲画像は、３次元的に湾曲したお椀型の形状をしている。 An example of the all-around image in which the texture is set is shown in FIG. In FIG. 16, the omnidirectional image 41 is shown as an ellipse. However, as described above, the omnidirectional image is a three-dimensional curved surface. For example, the all-around image in which the texture is set has a bowl-shaped shape that is curved three-dimensionally.

表示端末７は、図１６の例の全周囲画像に対応する合成画像メタデータを受信している。従って、視野設定部２３は、合成画像メタデータに含まれる方位情報θおよび仰角情報φに基づいて、図１６の例の全周囲画像に仮想カメラ４２の仮想視野を設定できる。図１６では、仮想視野を一点鎖線で示している。この仮想視野には、被写体４４が含まれる。 The display terminal 7 receives the composite image metadata corresponding to the omnidirectional image in the example of FIG. Therefore, the visual field setting unit 23 can set the virtual visual field of the virtual camera 42 for the entire peripheral image in the example of FIG. 16 based on the orientation information θ and the elevation angle information φ included in the composite image metadata. In FIG. 16, the virtual visual field is indicated by a one-dot chain line. This virtual visual field includes a subject 44.

画像処理部２４は、設定された仮想視野で描画処理を行う（ステップＳ４４）。これにより、図１７の例で示されるような被写体４４を捉えた画像（被写体４４を含む画像）が表示部２５に表示される。図１７の例の被写体４４は、建物を示している。 The image processing unit 24 performs a drawing process with the set virtual visual field (step S44). Thereby, an image (an image including the subject 44) capturing the subject 44 as shown in the example of FIG. 17 is displayed on the display unit 25. The subject 44 in the example of FIG. 17 shows a building.

上述したように、車両２が移動することで、撮影地点は変化し、被写体に対する視点、視野が変化する。つまり、区間Ｔｓｅｇにおいて、一連の全周囲画像の撮影位置は刻々と変化していく。そこで、視野設定部２３は、区間Ｔｓｅｇの次の時刻の全周囲画像に対して、この全周囲画像に対応する合成画像メタデータに基づいて、視点、視野の設定を変更する（ステップＳ４５）。 As described above, as the vehicle 2 moves, the shooting location changes, and the viewpoint and field of view of the subject change. That is, in the section Tseg, the shooting positions of a series of all-around images change every moment. Therefore, the visual field setting unit 23 changes the setting of the viewpoint and the visual field for the omnidirectional image at the next time in the section Tseg based on the composite image metadata corresponding to the omnidirectional image (step S45).

図１８は、視点、視野を変更した場合の仮想カメラ４２の仮想視野の例を示している。視野設定部２３は、区間Ｔｓｅｇの次の時刻の方位情報θおよび仰角情報φに基づいて、仮想視野を変更している。 FIG. 18 shows an example of the virtual field of view of the virtual camera 42 when the viewpoint and field of view are changed. The visual field setting unit 23 changes the virtual visual field based on the azimuth information θ and the elevation angle information φ at the next time of the section Tseg.

区間Ｔｓｅｇの一連の全周囲画像は、それぞれ撮影位置が異なる。従って、区間Ｔｓｅｇの一連の全周囲画像の方位情報θおよび仰角情報φは、全周囲画像によって異なる。つまり、全周囲画像によってカメラワークが異なる。 A series of all-around images in the section Tseg has different shooting positions. Therefore, the azimuth information θ and the elevation angle information φ of the series of all-around images in the section Tseg vary depending on the all-around image. That is, camera work differs depending on the entire surrounding image.

画像処理部２４は、表示を終了するか否かを判定する（ステップＳ４６）。例えば、区間Ｔｓｅｇの全周囲画像に基づく画像表示が全て終了したとき（ステップＳ４６でＹＥＳ）、表示部２５は表示を終了する。または、表示端末７に対して、表示を終了する操作がされた場合に、表示部２５は表示を終了する。 The image processing unit 24 determines whether or not to end the display (step S46). For example, when the image display based on the entire surrounding image in the section Tseg is completed (YES in step S46), the display unit 25 ends the display. Alternatively, when the display terminal 7 is operated to end the display, the display unit 25 ends the display.

一方、表示を終了しない場合（ステップＳ４６でＮＯ）、ステップＳ４３〜Ｓ４５の処理が繰り返される。従って、区間Ｔｓｅｇの間、ステップＳ４３およびＳ４４の処理が繰り返される。視野設定部２３は、合成画像メタデータに含まれる方位情報θおよび仰角情報φに基づいて、随時、被写体を捉えるカメラワークで撮影された画像を表示する。 On the other hand, when the display is not terminated (NO in step S46), the processes in steps S43 to S45 are repeated. Accordingly, the processes in steps S43 and S44 are repeated during the section Tseg. The field-of-view setting unit 23 displays an image captured by camera work that captures the subject as needed based on the orientation information θ and the elevation angle information φ included in the composite image metadata.

これにより、ユーザは、特別な操作を行うことなく、被写体を捉えたカメラワークで撮影された区間Ｔｓｅｇの一連の画像を視聴することができる。視野設定部２３は、方位情報θおよび仰角情報φを含むカメラワーク情報に基づいて、仮想カメラ４２の位置を変化させる。 Thereby, the user can view a series of images of the section Tseg captured by camera work capturing the subject without performing a special operation. The visual field setting unit 23 changes the position of the virtual camera 42 based on the camera work information including the azimuth information θ and the elevation angle information φ.

従って、区間Ｔｓｅｇの一連の画像を視聴するときに、各画像は、常に被写体を中心に捉えた画像になっている。例えば、区間Ｔｓｅｇの一連の画像を表示端末７が動画表示する場合、ユーザは単にキーワードを指定するだけで、簡単な操作で常に被写体を中心に捉えたカメラワークの動画を視聴することができる。 Therefore, when viewing a series of images in the section Tseg, each image is an image that is always captured around the subject. For example, when the display terminal 7 displays a series of images in the section Tseg as a moving image, the user can simply view a camerawork moving image that always captures the subject with a simple operation simply by specifying a keyword.

図１９は、表示部２５に表示される動画の一例を示している。図１９（Ａ）は、被写体４４を正面から撮影する位置に到達する前にカメラＣ１〜Ｃ４が撮影した全周囲画像に基づく画像の一例である。 FIG. 19 shows an example of a moving image displayed on the display unit 25. FIG. 19A is an example of an image based on the entire surrounding image captured by the cameras C1 to C4 before reaching the position where the subject 44 is captured from the front.

図１９（Ａ）の場合、遮蔽物４５により、被写体４４の一部が非表示の状態になる。遮蔽物４５としては、例えば、電柱や樹木等がある。図１９（Ａ）の例では、遮蔽物４５を要因として被写体４４の可視性がＶ＝０．７となる。 In the case of FIG. 19A, a part of the subject 44 is hidden by the shield 45. Examples of the shield 45 include a utility pole and a tree. In the example of FIG. 19A, the visibility of the subject 44 is V = 0.7 due to the shield 45.

図１９（Ｂ）は、被写体４４を正面から撮影した全周囲画像に基づく画像の一例である。この場合、カメラワークが変化しており、遮蔽物４５により被写体４４の可視性Ｖはそれほど影響を受けない。図１９（Ｂ）の場合、可視性ＶはＶ＝０．９である。 FIG. 19B is an example of an image based on an all-around image obtained by photographing the subject 44 from the front. In this case, the camera work has changed, and the visibility V of the subject 44 is not significantly affected by the shield 45. In the case of FIG. 19B, the visibility V is V = 0.9.

図１９（Ｃ）は、被写体４４を正面から通過した位置で撮影した全周囲画像に基づく画像の一例である。この場合、カメラワークが変化しており、遮蔽物４５を要因として被写体４４の可視性Ｖが低下する。図１９（Ｃ）の場合、可視性ＶはＶ＝０．７である。 FIG. 19C is an example of an image based on an all-around image captured at a position passing through the subject 44 from the front. In this case, the camera work has changed, and the visibility V of the subject 44 is reduced due to the shield 45. In the case of FIG. 19C, the visibility V is V = 0.7.

図２０は、可視性Ｖに基づく画像選択を行うための画面例を示している。表示端末７の表示部２５は、画像表示領域５１と可視性グラフ５２とスライダバー５３とを有している。画像表示領域５１は、被写体４４を含む画像を表示する領域である。 FIG. 20 shows an example of a screen for performing image selection based on the visibility V. The display unit 25 of the display terminal 7 includes an image display area 51, a visibility graph 52, and a slider bar 53. The image display area 51 is an area for displaying an image including the subject 44.

可視性グラフ５２は、区間Ｔｓｅｇの中で経時的に変化する可視性Ｖを示したグラフである。スライダバー５３は、可視性グラフ５２の任意の時刻Ｔを指定するためのバーである。ユーザは、スライダバー５３を時刻Ｔの方向に沿って移動させることができる。 The visibility graph 52 is a graph showing the visibility V that changes with time in the section Tseg. The slider bar 53 is a bar for designating an arbitrary time T of the visibility graph 52. The user can move the slider bar 53 along the direction of time T.

従って、最も可視性Ｖが高い位置にスライダバー５３のバーを位置させることで、画像表示領域５１には、最も可視性Ｖが高い静止画が表示される。従って、上述したように、カメラワークが適用された画像において、どの部分が被写体４４を良好に捉えているかの指標をユーザに与えることができる。 Accordingly, by positioning the bar of the slider bar 53 at a position where the visibility V is the highest, a still image with the highest visibility V is displayed in the image display area 51. Therefore, as described above, it is possible to give an index to the user as to which part of the image to which camerawork is applied that is capturing the subject 44 well.

＜画像検索サーバのハードウェア構成の一例＞
次に、図２１の例を参照して、画像検索サーバ４のハードウェア構成の一例を説明する。図２１の例に示すように、バス１００に対して、ＣＰＵ１１１とＲＡＭ１１２とＲＯＭ１１３と補助記憶装置１１４と媒体接続部１１５と通信インタフェース１１６とが接続されている。 <Example of hardware configuration of image search server>
Next, an example of the hardware configuration of the image search server 4 will be described with reference to the example of FIG. As illustrated in the example of FIG. 21, a CPU 111, a RAM 112, a ROM 113, an auxiliary storage device 114, a medium connection unit 115, and a communication interface 116 are connected to the bus 100.

ＣＰＵ１１１は任意の処理回路である。ＣＰＵ１１１はＲＡＭ１１２に展開されたプログラムを実行する。実行されるプログラムとしては、実施形態の処理を行うプログラムを適用することができる。ＲＯＭ１１３はＲＡＭ１１２に展開されるプログラムを記憶する不揮発性の記憶装置である。 The CPU 111 is an arbitrary processing circuit. The CPU 111 executes the program expanded in the RAM 112. As a program to be executed, a program for performing the processing of the embodiment can be applied. The ROM 113 is a non-volatile storage device that stores programs developed in the RAM 112.

補助記憶装置１１４は、種々の情報を記憶する記憶装置であり、例えばハードディスクドライブや半導体メモリ等を補助記憶装置１１４に適用することができる。媒体接続部１１５は、可搬型記録媒体１１８と接続可能に設けられている。 The auxiliary storage device 114 is a storage device that stores various information. For example, a hard disk drive, a semiconductor memory, or the like can be applied to the auxiliary storage device 114. The medium connection unit 115 is provided so as to be connectable to the portable recording medium 118.

可搬型記録媒体１１８としては、可搬型のメモリや光学式ディスク（例えば、Compact Disk(CD)やDigital Versatile Disk(DVD)等）を適用することができる。この可搬型記録媒体１１８に実施形態の画像検索サーバ４が行う処理のプログラムが記録されていてもよい。 As the portable recording medium 118, a portable memory or an optical disk (for example, Compact Disk (CD), Digital Versatile Disk (DVD), etc.) can be applied. A program for processing performed by the image search server 4 of the embodiment may be recorded on the portable recording medium 118.

画像検索サーバ４のサーバ通信部１１以外の各部は、ＣＰＵ１１１により実現されてもよい。また、サーバ通信部１１は、通信インタフェース１１６により実現されてもよい。ＲＡＭ１１２、ＲＯＭ１１３および補助記憶装置１１４は、何れもコンピュータ読み取り可能な有形の記憶媒体の一例である。これらの有形な記憶媒体は、信号搬送波のような一時的な媒体ではない。 Each unit other than the server communication unit 11 of the image search server 4 may be realized by the CPU 111. The server communication unit 11 may be realized by the communication interface 116. The RAM 112, the ROM 113, and the auxiliary storage device 114 are all examples of a tangible storage medium that can be read by a computer. These tangible storage media are not temporary media such as signal carriers.

＜表示端末のハードウェア構成の一例＞
次に、図２２を参照して、表示端末７のハードウェア構成の一例を説明する。図２２の例に示すように、バス２００に対して、ＣＰＵ２１１とＲＡＭ２１２とＲＯＭ２１３と補助記憶装置２１４と媒体接続部２１５と通信インタフェース２１６とが接続されている。 <Example of hardware configuration of display terminal>
Next, an example of the hardware configuration of the display terminal 7 will be described with reference to FIG. As illustrated in the example of FIG. 22, a CPU 211, a RAM 212, a ROM 213, an auxiliary storage device 214, a medium connection unit 215, and a communication interface 216 are connected to the bus 200.

ＣＰＵ２１１とＲＡＭ２１２とＲＯＭ２１３と補助記憶装置２１４と媒体接続部２１５とは上述した例と同様である。ＣＰＵ２１１が実行するプログラムは、実施形態の処理を行うプログラムであってもよい。通信インタフェース２１６は、外部との通信を行う。 The CPU 211, RAM 212, ROM 213, auxiliary storage device 214, and medium connection unit 215 are the same as those described above. The program executed by the CPU 211 may be a program that performs the processing of the embodiment. The communication interface 216 performs communication with the outside.

入出力インタフェース２１７は、例えば表示部２５との間でデータを入出力するインタフェースである。端末通信部２１は、通信インタフェース２１６により実現されてもよい。表示端末７のうち、端末通信部２１および表示部２５以外の各部はＣＰＵ２１１により実現されてもよい。 The input / output interface 217 is an interface for inputting / outputting data to / from the display unit 25, for example. The terminal communication unit 21 may be realized by the communication interface 216. Of the display terminal 7, each unit other than the terminal communication unit 21 and the display unit 25 may be realized by the CPU 211.

ＲＡＭ２１２、ＲＯＭ２１３および補助記憶装置２１４は、何れもコンピュータ読み取り可能な有形の記憶媒体の一例である。これらの有形な記憶媒体は、信号搬送波のような一時的な媒体ではない。 The RAM 212, the ROM 213, and the auxiliary storage device 214 are all examples of a tangible storage medium that can be read by a computer. These tangible storage media are not temporary media such as signal carriers.

＜その他＞
従って、実施形態では、表示端末７が指定した被写体を含む区間の一連の全周囲画像を特定する合成画像メタデータを画像検索サーバ４が生成し、表示端末７が合成画像メタデータに基づいて画像蓄積サーバ３から全周囲画像の抽出を行っている。これにより、簡単な操作で、表示端末７は被写体を含む区間Ｔｓｅｇの一連の画像を表示することができる。 <Others>
Therefore, in the embodiment, the image search server 4 generates composite image metadata that specifies a series of all-around images of a section including the subject designated by the display terminal 7, and the display terminal 7 generates an image based on the composite image metadata. The entire surrounding image is extracted from the accumulation server 3. Thereby, the display terminal 7 can display a series of images of the section Tseg including the subject with a simple operation.

特に、表示端末７にキーワードが指定されるだけで、表示端末７は、指定した被写体が含まれる区間Ｔｓｅｇの一連の画像を動画表示することができる。従って、ユーザは、簡単な操作で、被写体が含まれる区間の動画を視聴することができる。撮影位置を変化させた静止画を連続して表示する場合も同様である。 In particular, only by specifying a keyword on the display terminal 7, the display terminal 7 can display a series of images in the section Tseg including the specified subject as a moving image. Therefore, the user can view the moving image of the section including the subject with a simple operation. The same applies when still images with different shooting positions are displayed continuously.

また、視野設定部２３は、区間Ｔｓｅｇの一連の画像について、合成画像メタデータに含まれる方位情報θおよび仰角情報φに基づいて、被写体が常に画像の中心に位置するように視野を設定することができる。これにより、例えば、区間Ｔｓｅｇの一連の画像について、表示端末７は、被写体が常に中心に位置する動画を再生することができる。 The field of view setting unit 23 sets the field of view of the series of images in the section Tseg so that the subject is always located at the center of the image based on the orientation information θ and the elevation angle information φ included in the composite image metadata. Can do. Thereby, for example, for a series of images in the section Tseg, the display terminal 7 can reproduce a moving image in which the subject is always located at the center.

方位情報θおよび仰角情報φは、カメラワーク情報の一例である。カメラワーク情報は、方位情報θだけであってもよい。この場合、仰角は固定されている。従って、被写体は画像の中心に位置するとは限らない。表示端末７は、方位情報θおよび仰角情報φに基づくカメラワークで被写体を表示すると、被写体が常に中心に位置した動画または静止画を表示することができる。 The azimuth information θ and the elevation angle information φ are examples of camera work information. The camera work information may be only the orientation information θ. In this case, the elevation angle is fixed. Therefore, the subject is not always located at the center of the image. When the display terminal 7 displays the subject by camera work based on the azimuth information θ and the elevation angle information φ, the display terminal 7 can display a moving image or a still image in which the subject is always located at the center.

また、合成画像メタデータは、可視性Ｖを含む。図２０の例に示したように、ユーザは、スライダバー５３を操作することで、表示端末７は、可視性Ｖが良好な画像を選択的に表示することができる。 The composite image metadata includes visibility V. As shown in the example of FIG. 20, the user can selectively display an image with good visibility V by operating the slider bar 53.

また、表示端末７が指定する被写体を多数の車両２のカメラＣ１〜Ｃ４が撮影している場合、メタデータサーバ５は、表示端末７が指定する被写体についての合成画像メタデータを多数蓄積する。 In addition, when the cameras C1 to C4 of a large number of vehicles 2 photograph the subject specified by the display terminal 7, the metadata server 5 stores a large amount of composite image metadata regarding the subject specified by the display terminal 7.

このため、図１４のように、表示端末７は、可視良好性Ｑの一覧から選択する選択画面を表示することで、可視良好性Ｑが高い高品質な画像を選択的に抽出することができる。これにより、表示する画像（例えば、動画）の合成画像メタデータＩＤを絞り込むことができる。 For this reason, as shown in FIG. 14, the display terminal 7 can selectively extract a high-quality image with a high visibility goodness Q by displaying a selection screen to be selected from the list of the visibility goodness Q. . Thereby, the composite image metadata ID of the image (for example, moving image) to display can be narrowed down.

上述した実施形態では、可視良好性Ｑは、可視性Ｖを統合的に評価する値とする例について説明したが、可視良好性Ｑは、被写体の見え易さの度合いを示す値であってもよい。例えば、カメラＣ１〜Ｃ４が撮影した画像（つまり、全周囲画像）のうち、画像の中で被写体が写っている位置が中心からの距離に基づく値を可視良好性Ｑとしてもよい。 In the embodiment described above, the example in which the visibility goodness Q is a value that evaluates the visibility V in an integrated manner has been described, but the visibility goodness Q may be a value that indicates the degree of visibility of the subject. Good. For example, among images captured by the cameras C1 to C4 (that is, all-around images), a value based on the distance from the center where the subject is captured in the image may be used as the visibility goodness Q.

画像の中心に被写体が写っている場合、被写体の見え易さの度合いは高くなる。このため、可視良好性Ｑの値も高くなる。一方、画像の中心から被写体が離間している場合、被写体の見え易さの度合いは低くなるため、可視良好性Ｑの値も低くなる。 When the subject is shown in the center of the image, the degree of visibility of the subject increases. For this reason, the value of visibility goodness Q also becomes high. On the other hand, when the subject is separated from the center of the image, the degree of visibility of the subject is low, and the visibility goodness Q is also low.

従って、ユーザが可視良好性Ｑの高い合成映像メタデータＩＤを選択することで、画像抽出部２２は、画像の中心に被写体が写っている全周囲画像に絞って抽出することができる。 Therefore, when the user selects a composite video metadata ID having a high visual quality Q, the image extraction unit 22 can extract only the entire surrounding image in which the subject is captured at the center of the image.

本実施形態は、以上に述べた実施の形態に限定されるものではなく、本実施形態の要旨を逸脱しない範囲内で種々の構成または実施形態を取ることができる。 The present embodiment is not limited to the above-described embodiment, and various configurations or embodiments can be taken without departing from the gist of the present embodiment.

１画像提供システム
２車両
３画像蓄積サーバ
４画像検索サーバ
５メタデータサーバ
６地図サーバ
１１サーバ通信部
１２被写体位置特定部
１３合成画像メタデータ生成部
１４演算部
２１端末通信部
２２画像抽出部
２３視野設定部
２４画像処理部
２５表示部
３０制御部
３３時刻計測部
３４位置情報取得部
３５車載画像メタデータ生成部
３６車載記憶部
１１１、２１１ＣＰＵ
１１２、２１２ＲＡＭ
１１３、２１３ＲＯＭ DESCRIPTION OF SYMBOLS 1 Image provision system 2 Vehicle 3 Image storage server 4 Image search server 5 Metadata server 6 Map server 11 Server communication part 12 Subject position specification part 13 Composite image metadata production | generation part 14 Computation part 21 Terminal communication part 22 Image extraction part 23 View field Setting unit 24 Image processing unit 25 Display unit 30 Control unit 33 Time measurement unit 34 Position information acquisition unit 35 In-vehicle image metadata generation unit 36 In-vehicle storage unit 111, 211 CPU
112, 212 RAM
113, 213 ROM

Claims

An image providing system including a server and a display terminal,
The server
Based on the information about the object which the display terminal is specified, a specification unit for specifying a position of presence point of the object,
A generating unit that generates specific data for identifying a series of surrounding images of a section including the subject, based on a photographing position where the subject is photographed and a position of the existence point of the subject;
A transmission unit for transmitting the specific data to the display terminal;
With
The display terminal is
An extraction unit that extracts a series of surrounding images of a section including the subject based on the specific data from a storage device that stores a plurality of surrounding images;
A display unit for displaying an image of the subject in a series of surrounding images extracted by the extraction unit based on the specific data;
An image providing system comprising:

The generating unit, the distance between the position of existence point shooting position and the object of the peripheral image when less than a predetermined distance, is identified as a peripheral image including the object,
The image providing system according to claim 1.

It said specific data includes direction information indicating the orientation of the position of existence point of the object as viewed from the imaging position,
The display unit displays an image of a visual field based on the orientation information among the surrounding images.
The image providing system according to claim 1 or 2.

The specific data further includes elevation angle information indicating an elevation angle of the position of the subject existing from the shooting position,
The display unit displays an image of a visual field based on the azimuth information and the elevation angle information among the surrounding images.
The image providing system according to claim 3.

The specific data includes visibility indicating a rate of the subject in the surrounding image,
The display terminal displays a screen on which any one of the visibility of the plurality of specific data can be selected;
The image providing system according to any one of claims 1 to 4.

The specific data includes visibility goodness indicating an integrated value of the visibility of each surrounding image of a series of surrounding images of a section including a subject specified by the display terminal,
The display terminal displays a screen on which any one of the visual goodnesses of the plurality of specific data can be selected;
The image providing system according to claim 5.

The extraction unit extracts a series of surrounding images in which the subject is reflected in the center of the surrounding image among a series of surrounding images of a section including the plurality of subjects.
The image providing system according to any one of claims 1 to 6.

An image providing method in which a server and a display terminal communicate with each other,
The server
Based on the information about the object which the display terminal is specified, to locate the presence point of the object,
Based on the shooting position at which the subject is shot and the position of the location of the subject , specific data for identifying a series of surrounding images of a section including the subject is generated,
Transmitting the specific data to the display terminal;
The display terminal is
From a storage device that stores a plurality of surrounding images, a series of surrounding images of a section including the subject is extracted based on the specific data,
On the basis of the specific data, and displays an image of the object among the extracted the series of surrounding images,
Image providing method.