JP2003179867A

JP2003179867A - Data processing apparatus, data processing method, information storage medium, and computer program

Info

Publication number: JP2003179867A
Application number: JP2001375223A
Authority: JP
Inventors: Kazunori Hayashi; 和慶林
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2001-12-10
Filing date: 2001-12-10
Publication date: 2003-06-27

Abstract

PROBLEM TO BE SOLVED: To provide a data processing apparatus and an information storage medium capable of switching viewpoints in a multi-viewpoint image at high speed and without interruption.

SOLUTION: Coded data blocks from different viewpoints are composed of frame data shifted relative to each other, and the blocks are stored on a storage medium such as a DVD. With this configuration, when the data processing apparatus reads data from the medium and decodes and reproduces it, the time from the occurrence of a viewpoint switching request until reproduction of the first reproducible block data after the switch is reduced, so that the switched-viewpoint image is displayed at high speed. Further, the apparatus can reliably determine, from the timing of the viewpoint switching request, which block data is reproducible after the switch, which prevents errors such as interruption of the displayed image caused by viewpoint switching.

COPYRIGHT: (C)2003,JPO

Description

Detailed Description of the Invention

[0001]

[Technical Field] The present invention relates to a data processing device, a data processing method, an information storage medium, and a computer program. More specifically, it relates to a data processing device, a data processing method, an information storage medium, and a computer program that enable smooth viewpoint switching when reproducing image data captured from different viewpoints.

[0002]

[Description of the Related Art] In recent years, image data whose viewpoint can be moved in various ways, such as panoramas and full spherical images, has come into increasingly wide use. For example, systems have been realized in which images of a subject captured from multiple viewpoint positions and line-of-sight directions are stored on a storage medium such as a DVD or CD, and, when the stored images are displayed on a CRT, liquid crystal display, or the like, the user operates a controller to move the viewpoint to any position and observe the subject. Systems have also been built in which images of a subject captured from multiple viewpoint positions and line-of-sight directions are distributed over a communication system such as the Internet, and the user displays the image from a preferred viewpoint position and line-of-sight direction by operating a mouse on a PC or the like.

[0003] Image data whose viewpoint can be moved in various ways is generated, for example, by stitching together and compositing multiple images captured by cameras placed at different viewpoint positions. Images between cameras can be generated from the images captured by adjacent cameras, using a technique such as view interpolation as image data correction processing. For moving images, the same generation is possible after temporally synchronizing the moving image data captured by the multiple cameras. Such a method of generating omnidirectional video is described, for example, in S. E. Chen, "QuickTime VR - An Image-Based Approach to Virtual Environment Navigation", Computer Graphics (SIGGRAPH), pp. 29-38, 1995.

[0004] When an image whose viewpoint can be freely changed is read from a storage medium such as a DVD and reproduced on a display, either the user selects the viewpoint position from which data is displayed, or the data is displayed according to a program attached to the image data in advance. In a data distribution system such as the Internet or satellite broadcasting, either the user selects and displays one viewpoint image from the multiple viewpoint images transmitted from the data distribution source, or only a representative image predetermined by the distribution source is displayed.

[0005] As content intended for use with such viewpoint switching, new video content is being developed and proposed that, unlike conventional movies and TV drama broadcasts with a single story, lets viewers select the story and offers varied video expression that increases viewer enjoyment. In particular, for sports and music concert video, envisioned uses include switching among cameras at different viewpoints, or viewing a 360-degree panoramic video from a single viewpoint along the user's preferred line of sight (angle).

[0006] Because such video content requires multiple videos, the amount of video information to be recorded increases. In addition, interruptions in video and audio should be avoided when the viewer switches among videos of multiple angles. That is, display processing for such multi-view image content must display the video with the desired viewpoint or angle, out of a large amount of video data, without delay or interruption.

[0007] Besides having the viewer switch among multiple viewpoints or angles, there is also a display method that shows the viewer the video along preprogrammed viewpoints and angles. With such programmed viewpoint switching, for example when several singers are singing on stage at the same time and the viewer wants to watch a series of scenes in which a particular singer is always at the center of the image, the optimum viewpoint or angle is switched automatically according to the program, so the viewer can keep watching the preferred singer without switching viewpoints or angles manually. In this case, by including multiple programs, one for each singer, and making the program selectable, each user's preference can be accommodated and varied video expression becomes possible.

[0008] FIG. 1 shows an example of storing moving image data streams from multiple viewpoints on a recording medium such as a DVD. FIG. 1 shows a data storage configuration in which video data from viewpoint A, video data from viewpoint B, and video data from viewpoint C are each stored serially in units of multiple frames. For example, video data A storage portion 101 stores frame data 0 to 120 captured from viewpoint A, video data B storage portion 102 stores frames 0 to 120 captured from viewpoint B, and video data C storage portion 103 stores frames 0 to 120 captured from viewpoint C.

[0009] Consider viewpoint switching during reproduction of a disc storing such multi-view image data. The viewpoint switching process is described with reference to FIG. 2. FIG. 2 shows video data A storage portion 101, which stores frames 0 to 120 captured from viewpoint A, and video data B storage portion 102, which stores frames 0 to 120 captured from viewpoint B.

[0010] While video data A is being reproduced, if a user input requests switching from viewpoint A to viewpoint B during reproduction of frame 60 of viewpoint A, the reproducing head must move from data storage position 201 of frame 60 of viewpoint A to data storage position 202 of frame 61 of viewpoint B. Data storage positions 201 and 202 are physically separated, and the head needs a travel time, that is, a seek time, that depends on the distance. Because of this seek time and the subsequent time needed to decode the data for display, a gap, that is, an interruption, occurs between reproduction of frame 60 of viewpoint A and reproduction of frame 61 of viewpoint B, and the viewer is presented with an unnatural moving image.

[0011] Let th be the travel time of the reproducing head, tr the data read time, td the processing time for data display, for example the decoding time, and tf the display time of one frame of the video data as given by the frame rate. If Expression 1 below is satisfied, viewpoint-switching reproduction without interruption between frames is possible.

[0012]

[Formula 1] tf >= th + tr + td ... (Expression 1)
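
As an illustration of Expression 1, the following minimal sketch checks whether a switch can be seamless for given timings. The function name and the sample numbers are hypothetical and do not come from the specification; only the inequality itself does.

```python
def switch_is_seamless(tf, th, tr, td):
    """Return True when Expression 1 (tf >= th + tr + td) holds.

    tf: display time of one frame
    th: travel (seek) time of the reproducing head
    tr: data read time
    td: decode time
    All values are in seconds.
    """
    return tf >= th + tr + td


# Hypothetical numbers: a 30 frame/s stream has tf of about 33.3 ms.
tf = 1.0 / 30.0
print(switch_is_seamless(tf, th=0.010, tr=0.008, td=0.010))  # short seek
print(switch_is_seamless(tf, th=0.150, tr=0.008, td=0.010))  # long seek
```

With these assumed values the first call satisfies the inequality and the second does not, which corresponds to the interruption described for the serially stored layout of FIG. 2.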

[0013] Prior art on recording media and reproduction methods for reproducing multi-story, multi-angle video without interruption includes, for example, Japanese Patent Publication Nos. 2857119 and 2857123. These publications disclose data recording structures that store moving image data from multiple viewpoints on an optical disc and shorten the seek time to image data of a different viewpoint, in order to prevent video interruption during viewpoint switching. In these publications, video data from multiple viewpoints is recorded mixed together within an interleave block, which is a single block area on the disc, control information is provided in each interleave block, and when the viewer selects a scene, a scene jump is performed based on that control information. Because an interleave block is set as a single narrow area, the seek time is short, the time to switch between images of different viewpoints can be reduced, and switching between stories can be performed without interruption.

[0014] That is, as shown in FIG. 3, the streams are interleaved at a fixed time interval (every n frames) and arranged alternately. The temporal length of each interleave block (the value of n in FIG. 3) is determined so that Expression 1 is satisfied. Furthermore, rather than allowing the video stream to be switched at an arbitrary position, the interleave blocks are divided only at predetermined switchable positions, which allows switching without video interruption.
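
The interleaved arrangement described above can be sketched as follows. This is a minimal illustration, assuming that blocks of n frames from each viewpoint are simply placed one after another in viewpoint order; FIG. 3 is not reproduced here, so the exact ordering, the function name, and the values n = 30 and 120 frames are assumptions.

```python
def interleave_blocks(views, total_frames, n):
    """Return the assumed on-disc block order as (view, first_frame, last_frame).

    Blocks of n frames from each view are placed alternately, so blocks of
    all views covering the same time span sit next to each other on the disc.
    """
    layout = []
    for start in range(0, total_frames, n):
        end = min(start + n, total_frames) - 1
        for view in views:
            layout.append((view, start, end))
    return layout


layout = interleave_blocks(["A", "B", "C"], total_frames=120, n=30)
for block in layout[:6]:
    print(block)
```

In this model a switch from one viewpoint to another only has to skip over neighboring blocks rather than an entire stream, which is why a suitably chosen n can keep the head travel time within the bound of Expression 1.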

[0015] However, the methods described in these publications set predetermined portions where viewpoint switching is possible, and record video data from multiple viewpoints in an interleave block, as a single area on the disc, only for those scene portions. They do not consider viewpoint switching at an arbitrary scene. Nor do they sufficiently consider the decoding time required when the data is encoded.

[0016] The techniques described in the above publications cannot handle the case where the user interactively moves the viewpoint at any desired time. If viewpoint switching is performed at an arbitrary position, the scene is interrupted by the switch, as in the conventional configuration.

[0017] Moving image data stored on a storage medium such as a disc is usually stored after its data volume has been reduced by encoding (compression). Data transmitted over a network such as the Internet is likewise often sent with its data volume reduced by encoding (compression); the receiving side stores the encoded data on a storage medium and performs decoding (decompression) at reproduction time.

[0018] The best-known image compression technique is MPEG (Moving Picture Experts Group) compression. MPEG streams generated by MPEG compression are stored on recording media such as DVDs, or are stored in IP packets conforming to IP (Internet Protocol) and transferred over the Internet.

[0019] MPEG is a technique that achieves high-quality image compression. MPEG2, currently the most widely used, is a compression method that combines the discrete cosine transform (DCT), which compresses using correlation within a picture; motion compensation, which compresses using correlation between pictures; and Huffman coding, which compresses using correlation in the code sequence. To perform predictive coding using motion compensation, MPEG2 has a GOP (Group Of Pictures) structure, a group of multiple frames made up of three kinds of elements called I-pictures, P-pictures, and B-pictures.

[0020] To reproduce frame data organized into such groups, decoding must be performed on the group data, that is, in GOP units. Therefore, to MPEG-compress a multi-view image stream, store it on a storage medium, and still allow smooth viewpoint movement, a configuration that takes GOP-unit decoding into account is required.
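
To make the GOP-unit constraint concrete, the sketch below computes which GOP contains a given frame and how many earlier frames of that GOP precede it. It is a simplified model, assuming fixed-length GOPs numbered in display order and ignoring the reordering of B-pictures in a real MPEG stream; the function names and the GOP length of 15 are hypothetical.

```python
def gop_span(frame, gop_len):
    """Return (first_frame, last_frame) of the GOP that contains `frame`."""
    first = (frame // gop_len) * gop_len
    return first, first + gop_len - 1


def frames_decoded_before_display(frame, gop_len):
    """Number of frames in the same GOP that precede `frame`.

    In this simplified model the decoder starts at the first frame of the
    GOP, so these frames are processed before `frame` can be shown.
    """
    first, _ = gop_span(frame, gop_len)
    return frame - first


GOP_LEN = 15  # hypothetical GOP length in frames
print(gop_span(61, GOP_LEN))                       # (60, 74)
print(frames_decoded_before_display(70, GOP_LEN))  # 10
```

Under these assumptions a switch that lands in the middle of a GOP cannot display its target frame immediately, which is the decoding cost that frame-level seek analysis alone does not capture.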

[0021]

[Problems to Be Solved by the Invention] An object of the present invention is to provide a data processing device, a data processing method, an information storage medium, and a computer program that, in a configuration that reproduces data obtained by compressing a multi-view image stream and storing it on a storage medium such as a DVD, enable viewpoint switching at arbitrary timing without causing video interruption.

[0022]

[Means for Solving the Problems] A first aspect of the present invention is a data processing device that executes reproduction processing of multi-view image data, comprising: storage means that stores, in encoded form, a plurality of moving image data captured from different viewpoints; decoding means for decoding encoded data read from the storage means; display means for displaying the data decoded by the decoding means; input means for inputting a viewpoint switching command for the moving image data displayed on the display means; and control means for determining the data to be read from the storage means based on the viewpoint switching command from the input means, wherein the storage means stores each of the plurality of moving image data in units of encoded data blocks each consisting of multiple frames, with the frames stored in the encoded data blocks of moving image data that stand in a before-and-after relationship under the viewpoint switching command shifted relative to each other, and the control means determines the encoded data block to be read based on the state of data read processing and data display processing at the time input of the viewpoint switching command is detected.

[0023] Further, in one embodiment of the data processing device of the present invention, the control means determines the encoded data block to be read by comparing (a) the sum of the remaining time required to finish reading the encoded data block being read from the storage means when input of the viewpoint switching command is detected and the time required to read one subsequent encoded data block, with (b) the sum of the remaining frame display time of the data block being displayed on the display means when input of the viewpoint switching command is detected and the frame display time corresponding to the frame shift between the encoded data blocks before and after the viewpoint switch.
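
The comparison in this embodiment can be sketched as below. The paragraph states which two totals are compared but not, in this passage, which block is selected for each outcome, so the sketch only evaluates the comparison; treating "read total no greater than display total" as meaning the reads finish in time is an assumption, and all names and numbers are hypothetical.

```python
def reads_finish_before_display_runs_out(remaining_read_time,
                                         one_block_read_time,
                                         remaining_display_time,
                                         shift_frames,
                                         frame_time):
    """Compare the two totals named in the embodiment (times in seconds).

    Read side:    time left on the block being read + one more block read.
    Display side: display time left in the block being shown + display time
                  of the frames corresponding to the inter-view frame shift.
    Returns True when the read side is no longer than the display side
    (assumed interpretation of the comparison result).
    """
    read_total = remaining_read_time + one_block_read_time
    display_total = remaining_display_time + shift_frames * frame_time
    return read_total <= display_total


FRAME_TIME = 1.0 / 30.0  # hypothetical 30 frame/s stream
print(reads_finish_before_display_runs_out(0.10, 0.20, 0.20, 7, FRAME_TIME))
print(reads_finish_before_display_runs_out(0.30, 0.30, 0.10, 7, FRAME_TIME))
```

With the assumed numbers the first case has a read total of 0.30 s against a display total of about 0.43 s, and the second has 0.60 s against about 0.33 s, so the two calls fall on opposite sides of the comparison.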

[0024] Further, in one embodiment of the data processing device of the present invention, the storage means stores each of the plurality of moving image data in MPEG-compressed form, with encoded data blocks formed in units of GOPs (Group Of Pictures) each composed of multiple frames.

[0025] Further, in one embodiment of the data processing device of the present invention, the storage means forms each of the plurality of moving image data into encoded data blocks each composed of multiple frames, and arranges the encoded data blocks corresponding to the moving image data from different viewpoints in an interleaved manner.

[0026] Further, in one embodiment of the data processing device of the present invention, the storage means stores each of the plurality of moving image data in units of encoded data blocks each consisting of multiple frames, and the shift between the frames stored in the encoded data blocks of moving image data that stand in a before-and-after relationship under the viewpoint switching command is set to a predetermined amount.

[0027] Further, in one embodiment of the data processing device of the present invention, the shift amount is one half of the number of frames stored in an encoded data block.
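
A half-block shift between two viewpoints can be pictured with the following sketch, which lists the first frame of each encoded data block for an unshifted view and for a view shifted by half a block. The short leading block for the shifted view, the function name, and the block length of 30 frames are assumptions made for illustration; this passage does not specify how the start of the shifted stream is handled.

```python
def block_starts(total_frames, block_len, shift):
    """First frame of each block for a view whose blocks are shifted by `shift`."""
    starts = list(range(shift, total_frames, block_len))
    if shift > 0:
        # Assumed short leading block covering frames 0 .. shift-1.
        starts.insert(0, 0)
    return starts


BLOCK_LEN = 30  # hypothetical block (GOP) length in frames
starts_a = block_starts(120, BLOCK_LEN, shift=0)
starts_b = block_starts(120, BLOCK_LEN, shift=BLOCK_LEN // 2)
print(starts_a)  # [0, 30, 60, 90]
print(starts_b)  # [0, 15, 45, 75, 105]
```

Because the block boundaries of the two views no longer coincide, a block of the other view begins midway through each block of the current view in this model.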

[0028] Further, in one embodiment of the data processing device of the present invention, the control means determines the encoded data block to be read based on the state of data read processing and data display processing when input of the viewpoint switching command is detected, and stops displaying the pre-switch image in response to the start of display of the post-switch image based on the viewpoint switching command.

[0029] Further, a second aspect of the present invention is a data processing method for executing reproduction processing of multi-view image data, comprising: a decoding step of decoding encoded data read from storage means; a display step of displaying the data decoded in the decoding step; a command input step of inputting a viewpoint switching command for the moving image data displayed in the display step; and a control step of determining the data to be read from the storage means based on the viewpoint switching command from the command input step, wherein the storage means stores each of the plurality of moving image data in units of encoded data blocks each consisting of multiple frames, with the frames stored in the encoded data blocks of moving image data that stand in a before-and-after relationship under the viewpoint switching command shifted relative to each other, and the control step determines the encoded data block to be read based on the state of data read processing and data display processing at the time input of the viewpoint switching command is detected.

[0030] Further, in one embodiment of the data processing method of the present invention, the control step determines the encoded data block to be read by comparing (a) the sum of the remaining time required to finish reading the encoded data block being read from the storage means when input of the viewpoint switching command is detected and the time required to read one subsequent encoded data block, with (b) the sum of the remaining frame display time of the data block being displayed on the display means when input of the viewpoint switching command is detected and the frame display time corresponding to the frame shift between the encoded data blocks before and after the viewpoint switch.

[0031] Further, in one embodiment of the data processing method of the present invention, the control step determines the encoded data block to be read based on the state of data read processing and data display processing when input of the viewpoint switching command is detected, and stops displaying the pre-switch image in response to the start of display of the post-switch image based on the viewpoint switching command.

[0032] Further, a third aspect of the present invention is an information storage medium that stores, in encoded form, a plurality of moving image data captured from different viewpoints, wherein each of the plurality of moving image data captured from the different viewpoints is stored in units of encoded data blocks each consisting of multiple frames, and at least the frames stored in the encoded data blocks of moving image data that stand in a before-and-after relationship under a viewpoint switching command are shifted relative to each other.

[0033] Further, in one embodiment of the information storage medium of the present invention, the information storage medium stores each of the plurality of moving image data in MPEG-compressed form, with encoded data blocks formed in units of GOPs (Group Of Pictures) each composed of multiple frames.

[0034] Further, in one embodiment of the information storage medium of the present invention, the information storage medium forms each of the plurality of moving image data into encoded data blocks each composed of multiple frames, and arranges the encoded data blocks corresponding to the moving image data from different viewpoints in an interleaved manner.

[0035] Further, in one embodiment of the information storage medium of the present invention, the information storage medium stores each of the plurality of moving image data in units of encoded data blocks each consisting of multiple frames, and the shift between the frames stored in the encoded data blocks of moving image data that stand in a before-and-after relationship under the viewpoint switching command is set to a predetermined amount.

[0036] Further, in one embodiment of the information storage medium of the present invention, the shift amount is one half of the number of frames stored in an encoded data block.

[0037] Further, a fourth aspect of the present invention is a computer program that executes data processing for reproducing multi-view image data, comprising: a decoding step of decoding encoded data read from storage means; a display step of displaying the data decoded in the decoding step; a command input step of inputting a viewpoint switching command for the moving image data displayed in the display step; and a step of executing, in response to the viewpoint switching command from the command input step, processing that determines the encoded data block to be read based on the state of data read processing and data display processing at the time input of the command is detected.

[0038] The computer program of the present invention is a computer program that can be provided, for example, to a general-purpose computer system capable of executing various program codes, by a storage medium or communication medium that supplies it in a computer-readable format, for example a recording medium such as a CD, FD, or MO, or a communication medium such as a network. By providing such a program in a computer-readable format, processing according to the program is realized on the computer system.

【００３９】本発明のさらに他の目的、特徴や利点は、
後述する本発明の実施例や添付する図面に基づくより詳
細な説明によって明らかになるであろう。なお、本明細
書においてシステムとは、複数の装置の論理的集合構成
であり、各構成の装置が同一筐体内にあるものには限ら
ない。Further objects, features, and advantages of the present invention will become apparent from the more detailed description based on the embodiments of the present invention described below and the accompanying drawings. In this specification, a system is a logical collection of a plurality of devices, and the constituent devices are not necessarily housed in the same enclosure.

【００４０】[0040]

【発明の実施の形態】図４に本発明のデータ処理装置の
一実施例構成のブロック図を示す。本実施例におけるデ
ータ処理装置は、ＤＶＤ、ＣＤ等の記録媒体に格納され
た多視点画像データの再生処理、あるいはインターネッ
ト等のデータ通信網を介して配信される多視点画像デー
タをＤＶＤ、ＣＤ、ハードディスク等の書き込み可能な
記憶媒体に格納し、これを再生する処理を実行する。FIG. 4 is a block diagram showing the configuration of an embodiment of the data processing apparatus of the present invention. The data processing apparatus of this embodiment reproduces multi-view image data stored on a recording medium such as a DVD or CD, or stores multi-view image data distributed via a data communication network such as the Internet on a writable storage medium such as a DVD, CD, or hard disk and reproduces it from there.

【００４１】本発明のシステムで再生処理対象となるデ
ータは、符号化データであり、デコード（復号）処理が
実行された後、ディスプレイにおいて再生される。従っ
て、図４に示すデータ処理装置４５０は、デコード（復
号）処理を実行するコーデック４５１を有する。なお、
図４に示す構成例は、ビデオカメラ４３３、マイク４３
４等のＡＶデータ入力機器からの入力データの符号化処
理もコーデック４５１によって可能な構成であり、コー
デック４５１によって生成した符号化データをＤＶＤ、
ＣＤ、ハードディスク等に対して書き込み可能な構成を
持つ。The data to be reproduced in the system of the present invention is encoded data, which is reproduced on the display after decoding. Accordingly, the data processing device 450 shown in FIG. 4 has a codec 451 that executes the decoding process. In the configuration example shown in FIG. 4, the codec 451 can also encode input data from AV data input devices such as the video camera 433 and the microphone 434, and the encoded data generated by the codec 451 can be written to a DVD, CD, hard disk, or the like.

【００４２】図４に示すデータ処理装置４５０の構成に
ついて説明する。ＣＰＵ(Central processing Unit)４
５６は、各種アプリケーションプログラムや、ＯＳ（Op
erating System）を実行する演算処理装置であり、本発
明のシステムにおいては、後段で詳細に説明するが、コ
ントローラ等から入力される視点切り換えコマンドの入
力検出時における記憶手段からの読み出しデータの決定
処理あるいは表示処理の制御を実行する制御手段として
機能する。メモリ４５７は、ＲＯＭ（Read-Only-Memor
y）、ＲＡＭ（Random Access Memory）等から構成さ
れ、ＣＰＵ４５６が実行するプログラム、あるいは演算
パラメータとしての固定データの格納、ＣＰＵ４５６の
処理において実行されるプログラム、およびプログラム
処理において適宜変化するパラメータの格納エリア、ワ
ーク領域として使用される。記録メディア４５８はＤＶ
Ｄ、ハードディスク、ＣＤ等の記録メディアであり、再
生対象となる多視点画像の符号化データを蓄積する。The configuration of the data processing device 450 shown in FIG. 4 will be described. The CPU (Central Processing Unit) 456 is an arithmetic processing unit that executes various application programs and the OS (Operating System). In the system of the present invention, as described in detail later, it functions as a control means that determines the data to be read from the storage means and controls display processing when input of a viewpoint switching command from a controller or the like is detected. The memory 457 is composed of a ROM (Read-Only Memory), a RAM (Random Access Memory), and the like, and is used to store the programs executed by the CPU 456 and fixed data serving as calculation parameters, and as a storage area and work area for the programs executed in the processing of the CPU 456 and the parameters that change as appropriate during program processing. The recording medium 458 is a recording medium such as a DVD, hard disk, or CD, and stores the encoded data of the multi-view images to be reproduced.

【００４３】さらに、データ処理装置４５０は、通信ネ
ットワークとのインタフェースとして機能するネットワ
ークインタフェース４５２を有し、ネットワークを介し
て多視点画像符号化データの受信を行ない、受信された
データは、ＤＶＤ、ハードディスク等の記録メディア４
５８に格納される。あるいは符号化されていないデータ
を受信し、受信データをコーデック４５１によって符号
化し、生成した符号化データをＤＶＤ、ＣＤ、ハードデ
ィスク等の記録メディア４５８に格納する。Further, the data processing device 450 has a network interface 452 that functions as an interface to a communication network. It receives encoded multi-view image data via the network, and the received data is stored on the recording medium 458 such as a DVD or hard disk. Alternatively, it receives unencoded data, encodes the received data with the codec 451, and stores the generated encoded data on the recording medium 458 such as a DVD, CD, or hard disk.

【００４４】ユーザからのデータ処理コマンド、あるい
はディスプレイ４３２における表示画像データについて
の視点切り換えコマンドは、マウス４３７、キーボード
４３６、コントローラ４３８の各種入力機器から入力イ
ンタフェース４５３を介して入力される。また、ビデオ
カメラ４３３、マイク４３４等のＡＶデータ入力機器か
らＡＶインタフェース４５４を介して入力されるデータ
は、コーデック４５１によって符号化（ＭＰＥＧ）さ
れ、ＤＶＤ、ハードディスク等の記録メディア４５８に
格納される。A data processing command from the user or a viewpoint switching command for display image data on the display 432 is input through the input interface 453 from various input devices such as the mouse 437, the keyboard 436, and the controller 438. Data input from the AV data input device such as the video camera 433 and the microphone 434 via the AV interface 454 is encoded (MPEG) by the codec 451 and stored in the recording medium 458 such as a DVD or a hard disk.

【００４５】ＤＶＤ、ハードディスク等の記録メディア
４５８に格納された符号化データは、コーデック４５１
において復号処理が実行され、画像データは、フレーム
単位でフレームメモリ４６１に格納された後、Ｄ／Ａ変
換器４６２を介した変換処理の後、ディスプレイ４３２
において表示される。一方、音声データもコーデックに
おいて復号された後、音声バッファ４６３に格納された
後、Ｄ／Ａ変換器４６４を介した変換処理の後、スピー
カ４３５において再生出力される。The encoded data stored on the recording medium 458 such as a DVD or hard disk is decoded by the codec 451. The image data is stored frame by frame in the frame memory 461, converted by the D/A converter 462, and then displayed on the display 432. The audio data is likewise decoded by the codec, stored in the audio buffer 463, converted by the D/A converter 464, and then reproduced and output by the speaker 435.

【００４６】本実施例のデータ処理装置４５０での再生
処理対象データの１つは多視点動画像データの圧縮デー
タであり、高品位な画像圧縮処理を実現する技術として
知られるＭＰＥＧ圧縮画像データである。ＭＰＥＧ２の
圧縮方法は、画面内の相関を利用した圧縮である離散コ
サイン変換（Discrete Cosine Transform; DCT）、画面
間の相関に基づく圧縮としての動き補償、符号列の相関
に基づく圧縮としてのハフマン符号化を組み合わせた圧
縮手法である。ＭＰＥＧ２では、動き補償を用いた予測
符号化を行うために、図５に示すように動画像を構成す
る画像フレームをＩピクチャ、Ｐピクチャ、Ｂピクチャ
と呼ぶ３つの要素に分類し、所定単位のＩピクチャ、Ｐ
ピクチャ、Ｂピクチャのフレームからなるグループとし
てのＧＯＰ（Group Of Pictures）構造を採用してい
る。One type of data reproduced by the data processing device 450 of this embodiment is compressed multi-view moving image data, specifically MPEG compressed image data, MPEG being known as a technique for high-quality image compression. The MPEG2 compression method combines the discrete cosine transform (DCT), which compresses using correlation within a picture; motion compensation, which compresses using correlation between pictures; and Huffman coding, which compresses using correlation within the code string. To perform predictive coding with motion compensation, MPEG2 classifies the image frames that make up a moving image into three types called I pictures, P pictures, and B pictures, as shown in FIG. 5, and adopts a GOP (Group Of Pictures) structure, a group consisting of a predetermined number of I-picture, P-picture, and B-picture frames.

【００４７】Ｉピクチャ（Intra 符号化画像）は、フィ
ールド内符号化により作られるもので、前画像からの予
測符号化を行わない画像フレームデータである。予測符
号化を使って作った画像ばかり並んでいると、ランダム
アクセスが行われた場合、それに応じて瞬時に画面を出
すことができない。そこで、定期的にアクセスの基準と
なるものを作ってランダムアクセスにも対応できるよう
にしている。Ｉピクチャは、いわば、ＧＯＰの独立性を
持つため存在する。An I picture (intra-coded picture) is image frame data created by intra-field coding, without predictive coding from a preceding picture. If a sequence consisted only of pictures created with predictive coding, a picture could not be displayed immediately in response to random access. Reference points for access are therefore created periodically so that random access can be supported. The I picture exists, so to speak, to keep each GOP independent.

【００４８】Ｉピクチャの出現する頻度は、それぞれの
アプリケーションに必要とされるランダムアクセスの性
能によって決定されるが、普通１フィールドに１枚（１
フレームに２枚）、即ち画像１５枚に１枚の割合であ
る。Ｉピクチャ１枚のデータ量は、Ｐピクチャ１枚の２
〜３倍、Ｂピクチャ１枚の５〜６倍に相当する。ＧＯＰ
とは、１つのＩピクチャから次のＩピクチャまでの間の
ピクチャのグループのことである。従って、このグルー
プ内のピクチャ間で画像予測が行われることになる。The frequency at which I pictures appear is determined by the random access performance required by each application, but is normally one per field (two per frame), that is, one in every 15 pictures. The data amount of one I picture is two to three times that of one P picture and five to six times that of one B picture. A GOP is the group of pictures from one I picture up to the next I picture. Picture prediction is therefore performed among the pictures within this group.

【００４９】Ｐピクチャ（Predictive符号化画像）は、
１つ前の画像から予測符号化を行って作られる画像で、
Ｉピクチャに基づいて作られる。“フレーム内符号化画
像”であるＩピクチャに対して、Ｐピクチャは“フレー
ム間準方向予測符号化画像”と定義づけられる。A P picture (predictive-coded picture) is created by predictive coding from the preceding picture and is based on an I picture. Whereas the I picture is an “intra-frame coded picture”, the P picture is defined as an “inter-frame forward predictive coded picture”.

【００５０】Ｂピクチャ（Bidirectionally predictive
符号化画像）は、“双方向予測符号化画像”である。Ｂ
ピクチャは、前後の２枚のＩピクチャまたはＰピクチャ
からの予測を行うことで作られる。A B picture (bidirectionally predictive-coded picture) is a “bidirectional predictive coded picture”. A B picture is created by prediction from two I pictures or P pictures, one preceding and one following it.

【００５１】本発明のデータ処理装置では、複数の異な
る視点から撮影した動画像データのＭＰＥＧ圧縮データ
の再生処理を実行する構成である。例えば図６に示すよ
うに、視点Ａ〜Ｃの３つの異なる視点からの撮影画像に
ついて、それぞれＭＰＥＧ圧縮データを格納したＤＶＤ
等の記録媒体からデータを読み取り、復号処理を実行
し、復号データの再生処理を実行する。The data processing apparatus of the present invention is configured to reproduce MPEG compressed data of moving image data captured from a plurality of different viewpoints. For example, as shown in FIG. 6, for images captured from three different viewpoints A to C, it reads the data from a recording medium such as a DVD on which the MPEG compressed data of each viewpoint is stored, decodes it, and reproduces the decoded data.

【００５２】ＭＰＥＧのような時間的に連続な圧縮方法
では、時間的に連続している画像ブロック、すなわち、
上述した複数フレームからなるフレームグループとして
のＧＯＰ単位でまとめて読み込みデコードする必要があ
るため、そのブロック内で他の視点へ瞬時に切り換える
ことは難しい。データ読み出し、復号処理、再生処理
は、ＧＯＰ単位で行われることになり、各視点への切り
換えタイミングがＧＯＰ単位のフレーム区切りで行なわ
れれば、途切れのない視点切り換え処理が可能になる
が、ＧＯＰ単位のフレーム区切り以外のフレームで視点
切り換えを行なおうとした場合には処理時間の問題から
表示画像の途切れが発生する。In a temporally continuous compression method such as MPEG, temporally continuous image blocks, that is, the GOPs described above as frame groups made up of multiple frames, must be read and decoded as a unit, so it is difficult to switch instantly to another viewpoint in the middle of a block. Data reading, decoding, and reproduction are performed in GOP units. If the switch to another viewpoint occurs at a GOP boundary, seamless viewpoint switching is possible, but if a viewpoint switch is attempted at a frame that is not a GOP boundary, the display image is interrupted because of the processing time required.

【００５３】図７を参照して再生画像の途切れについて
説明する。２つの視点Ａ、視点Ｂの動画像データがそれ
ぞれ１５フレーム単位のＧＯＰ毎に処理されるものとす
る。視点Ａから視点Ｂの切り換えをＧＯＰ単位のフレー
ム区切り以外のフレームで行なおうとした場合を想定す
る。図７の視点Ａのフレーム１５〜２９の再生途中、例
えばフレーム２０において、ユーザが視点Ａから視点Ｂ
への視点切り換え処理コマンドをコントローラ等の入力
装置から入力すると、再生対象となる視点Ｂのフレーム
１５〜２９の復号処理データは生成されておらず、ま
た、復号処理は、ＧＯＰの開始フレームから順に実行さ
れるので、再生データとして必要なデータである視点Ｂ
のフレーム２１を表示するまでに処理時間がかかってし
まう。その結果、図に示すような再生画像の途切れが発
生する。The interruption of the reproduced image will be described with reference to FIG. 7. Assume that the moving image data of two viewpoints, A and B, is each processed in GOPs of 15 frames, and that a switch from viewpoint A to viewpoint B is attempted at a frame that is not a GOP boundary. If, during reproduction of frames 15 to 29 of viewpoint A in FIG. 7, for example at frame 20, the user inputs a command to switch from viewpoint A to viewpoint B from an input device such as a controller, the decoded data for frames 15 to 29 of viewpoint B has not yet been generated. Moreover, because decoding proceeds in order from the first frame of the GOP, it takes time before frame 21 of viewpoint B, the frame actually needed for reproduction, can be displayed. As a result, the reproduced image is interrupted as shown in the figure.

【００５４】図７の例において１５フレーム単位のＧＯ
Ｐ毎のデータ読み出し、復号、表示処理を実行してＧＯ
Ｐの開始フレームを表示するまでに要する時間をＴとす
ると、視点Ａのフレーム１５〜２９の再生途中、フレー
ム２０において、ユーザが視点Ａから視点Ｂへの視点切
り換え処理コマンドをコントローラ等の入力装置から入
力した場合、再生対象となる視点Ｂのフレーム１５〜２
９の復号処理によって視点Ｂのフレーム１５が再生可能
になるまでの時間：Ｔを要し、さらに、フレーム１５か
ら順に復号が実行され、視点Ｂのフレーム２１の再生が
可能になるまでの時間：αを要することになる。結果と
して、視点切り換えコマンド入力から視点Ｂのフレーム
２１の再生が可能になるまでの時間としてＴ＋αの時間
を要することになり、図に示すような再生画像の途切れ
が発生する。In the example of FIG. 7, let T be the time required to read, decode, and display a 15-frame GOP up to the point at which its first frame is displayed. If, during reproduction of frames 15 to 29 of viewpoint A, the user inputs a command to switch from viewpoint A to viewpoint B at frame 20 from an input device such as a controller, a time T is required before frame 15 of viewpoint B becomes reproducible through decoding of frames 15 to 29 of viewpoint B. Decoding then proceeds in order from frame 15, and a further time α is required before frame 21 of viewpoint B becomes reproducible. Consequently, a time of T + α elapses between input of the viewpoint switching command and the point at which frame 21 of viewpoint B can be reproduced, and the reproduced image is interrupted as shown in the figure.
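The delay T + α described above can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that α grows linearly with the number of frames that must be decoded before the target frame. The function name, the per-frame decode cost, and the value used for T are hypothetical and are not taken from the specification.

```python
def switch_delay_aligned(target_frame, gop_len, t_first, t_decode_per_frame):
    """Estimate T + alpha when every viewpoint uses the same GOP boundaries.

    t_first: assumed time T until the first frame of the GOP is displayable (s).
    t_decode_per_frame: assumed decode cost of each further frame (s).
    """
    gop_start = (target_frame // gop_len) * gop_len    # first frame of the GOP holding the target
    frames_before_target = target_frame - gop_start     # frames decoded only to reach the target
    alpha = frames_before_target * t_decode_per_frame
    return t_first + alpha


# FIG. 7 case: 15-frame GOPs, switch requested at frame 20, so frame 21 of viewpoint B is needed.
delay = switch_delay_aligned(target_frame=21, gop_len=15, t_first=0.3, t_decode_per_frame=0.01)
print(f"estimated delay: {delay:.2f} s")
```

Under these assumed numbers, six frames (15 to 20) are decoded only to reach frame 21, which is the overhead the later embodiment aims to reduce.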

【００５５】２つの視点画像のディスクに対する格納構
成例を図８に示す。ＭＰＥＧ２では、上述したようにＧ
ＯＰ単位で符号化（エンコード）する。図８では１２フ
レームをＧＯＰの単位とした例を示している。図８で
は、Ｃａｍ０１はカメラ０１、Ｃａｍ０２はカメラ０２
によって撮影された異なる視点からの動画像データであ
ることを示し、Ｂｌｏｃｋ００１は、ＧＯＰブロック０
０１であることを示しており、［ｃａｍ０１００００
（Ｉ）］は、カメラ０１によって撮影されたフレーム０
０００の符号化データでありかつＩピクチャであること
を示している。（Ｐ）はＰピクチャ、（Ｂ）はＢピクチ
ャである。FIG. 8 shows an example of how two viewpoint images are stored on a disc. As described above, MPEG2 encodes in GOP units. FIG. 8 shows an example in which a GOP consists of 12 frames. In FIG. 8, Cam01 and Cam02 denote moving image data captured from different viewpoints by camera 01 and camera 02, respectively; Block001 denotes GOP block 001; and [cam01 0000 (I)] denotes the encoded data of frame 0000 captured by camera 01, which is an I picture. (P) denotes a P picture and (B) a B picture.

【００５６】［Ｈｅａｄｅｒ００１］は、各ＧＯＰブロ
ックに対して設定されるヘッダ情報である。［Ａｕｄｉ
ｏ０１００００−００１１］は、各ブロックの画像デ
ータに対応して再生される音声データを示している。[Header001] is header information set for each GOP block. [Audio01 0000-0011] indicates the audio data reproduced in correspondence with the image data of each block.

【００５７】ここで図８に示すようにカメラ０１の映像
を第００００フレームから第０００５フレームまで表示
し、第０００６フレームはカメラ０２の映像を表示した
いとする。各カメラ毎の画像データは、１２フレームを
ＧＯＰ単位としてＭＰＥＧ２符号化処理がなされてい
る。Suppose, as shown in FIG. 8, that the video of camera 01 is to be displayed from frame 0000 through frame 0005 and the video of camera 02 at frame 0006. The image data of each camera is MPEG2-encoded with 12 frames per GOP.

【００５８】従って、カメラ０２の第０００６フレーム
目から表示しようとしても、ＭＰＥＧ２は時間軸方向に
圧縮しているため、カメラ０２の第０００６フレームの
属しているＧＯＰ、すなわち［Ｃａｍ０２，Ｂｌｏｃｋ
０００１］のＧＯＰ単位で復号する必要がある。また、
復号処理は、［Ｃａｍ０２，Ｂｌｏｃｋ０００１］の復
号において、フレーム００００から順に実行されること
になり、本来再生データとして必要なデータであるカメ
ラ０２の第０００６フレームを表示するまで処理時間が
かかることになる。Accordingly, even if display is to begin from frame 0006 of camera 02, MPEG2 compresses along the time axis, so decoding must be performed in units of the GOP to which frame 0006 of camera 02 belongs, namely [Cam02, Block0001]. In decoding [Cam02, Block0001], the frames are decoded in order from frame 0000, so processing time is needed before frame 0006 of camera 02, the frame actually needed for reproduction, can be displayed.

【００５９】ＧＯＰの単位を少ないフレーム単位として
設定すれば、必然的に余分なデータを読み込んだり、余
分なデータをデコードすることも無く、視点移動の自由
度が増す。ＧＯＰの単位を少ないフレーム単位として設
定した例を図９に示す。図９の例は、６フレーム単位で
ＧＯＰを設定した例である。ところがＧＯＰの単位を小
さくし、その単位で読み出し、デコードを可能にする
と、音声、管理情報を各ＧＯＰごとに付ける必要があり
冗長となる。If the GOP is set to a small number of frames, reading and decoding of unneeded data is naturally avoided, and the freedom of viewpoint movement increases. FIG. 9 shows an example in which the GOP is set to a small number of frames, here 6 frames per GOP. However, if the GOP is made smaller and reading and decoding are made possible in that unit, audio and management information must be attached to every GOP, which is redundant.

【００６０】そこで、本発明の構成においては、図１０
に示すように、ＧＯＰの単位を小さくするのではなく、
各視点カメラ映像ごとの符号化データブロックに相当す
るＧＯＰのスタートフレーム番号を変える構成とした。
図１０に示す例ではカメラ０１の視点Ａ画像の符号化デ
ータブロックの最初のＧＯＰは、第００００フレームか
らスタートし第００１１フレームで終了する１２フレー
ム構成であるが、カメラ０２の視点Ｂ画像の符号化デー
タの最初ＧＯＰは第０００６フレームからスタートし第
００１７フレームで終了する１２フレームによって構成
される。すなわち、視点切り換えコマンドによる切り換
え前後の関係となる複数の動画像データに対応する符号
化データブロックの格納フレームにずれを持たせて格納
した構成を持つ。Therefore, in the configuration of the present invention, as shown in FIG. 10, instead of making the GOP smaller, the start frame number of the GOP, which corresponds to the encoded data block of each viewpoint camera's video, is varied from viewpoint to viewpoint. In the example shown in FIG. 10, the first GOP of the encoded data blocks of the viewpoint A image of camera 01 is a 12-frame GOP that starts at frame 0000 and ends at frame 0011, whereas the first GOP of the encoded data of the viewpoint B image of camera 02 consists of the 12 frames starting at frame 0006 and ending at frame 0017. In other words, the encoded data blocks corresponding to the moving image data that are in a before-and-after relationship with respect to switching by the viewpoint switching command are stored with their storage frames shifted relative to each other.

【００６１】カメラ０１の視点Ａ画像符号化データと、
カメラ０２の視点Ｂ画像符号化データのＧＯＰは、６フ
レームずれたフレームによって構成される。すなわちＧ
ＯＰのフレーム長（１２フレーム）の１／２だけそれぞ
れのカメラ間でＧＯＰのスタートがずれていることにな
る。ここではカメラ２台の例しか示さないが、３台以上
のときは、カメラ３はカメラ１と同じスタートフレーム
番号、カメラ４はカメラ２と同じフレームからスタート
するようにして、カメラ１台ごとに時間軸方向に互い違
いに配置することにより、複数カメラ間を連続的に視点
移動、すなわち、カメラ１→カメラ２→カメラ３→カメ
ラ４→カメラ１のように視点移動する場合であっても、
ＧＯＰ長が短くなって画質が劣化することなく、またＧ
ＯＰ長の１／２の時間間隔の小さな時間で処理できるよ
うになる。これらの例については後述する。The GOPs of the viewpoint A encoded image data of camera 01 and of the viewpoint B encoded image data of camera 02 consist of frames shifted by 6 frames relative to each other. That is, the GOP start is shifted between the cameras by 1/2 of the GOP frame length (12 frames). Only an example with two cameras is shown here, but with three or more cameras, camera 3 starts at the same start frame number as camera 1 and camera 4 at the same frame as camera 2, so that the cameras alternate along the time axis. With this arrangement, even when the viewpoint moves continuously across multiple cameras, for example camera 1 → camera 2 → camera 3 → camera 4 → camera 1, picture quality does not deteriorate as it would with a shortened GOP, and each switch can be processed within a short interval of 1/2 the GOP length. These examples are described later.
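The alternating start frames described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function names are hypothetical, cameras and blocks are numbered from 1, and frames of an even-numbered camera that precede its first shifted GOP are simply not modeled.

```python
def gop_offset(camera, gop_len=12):
    """Start frame of a camera's first GOP: 0 for odd cameras, gop_len / 2 for even cameras."""
    return 0 if camera % 2 == 1 else gop_len // 2


def block_frames(camera, block, gop_len=12):
    """Inclusive (first, last) frame numbers of a 1-based block for a 1-based camera."""
    first = gop_offset(camera, gop_len) + (block - 1) * gop_len
    return first, first + gop_len - 1


# First block of cameras 1 to 4 with 12-frame GOPs.
for cam in (1, 2, 3, 4):
    print(cam, block_frames(cam, 1))
```

With 12-frame GOPs this gives frames 0000-0011 for the first block of cameras 1 and 3 and frames 0006-0017 for cameras 2 and 4, so adjacent cameras always differ by half a GOP.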

【００６２】図１１に図１０に示した２つの異なる視点
からの画像データのディスクへの格納構成を示す。図１
１では、［Ｃａｍ０１］はカメラ０１、［Ｃａｍ０２］
はカメラ０１とは異なる視点位置のカメラ０２によって
撮影された動画像データであることを示し、Ｂｌｏｃｋ
００１は、各符号化データブロックであり、ここでは１
２フレームのＧＯＰによって構成される。［ｃａｍ０１
００００（Ｉ）］は、カメラ０１によって撮影された
フレーム００００の符号化フレームデータでありかつＩ
ピクチャであることを示している。（Ｐ）はＰピクチ
ャ、（Ｂ）はピクチャである。［Ｈｅａｄｅｒ００１］
は、各ＧＯＰブロックに対して設定されるヘッダ情報で
ある。［Ａｕｄｉｏ０１００００−００１１］は、各
ブロックの画像データに対応して再生される音声データ
を示している。図１１に示すように、異なる視点からの
複数の動画像データの各々は、複数フレームにより構成
される符号化データブロックとされ、かつ異なる視点か
らの複数の動画像データに対応する符号化データブロッ
クがインターリーブして配置された構成としてディスク
に格納される。FIG. 11 shows how the image data from the two different viewpoints shown in FIG. 10 is stored on the disc. In FIG. 11, [Cam01] denotes moving image data captured by camera 01 and [Cam02] moving image data captured by camera 02, which is at a viewpoint position different from that of camera 01. Block001 denotes an encoded data block, here a GOP of 12 frames. [cam01 0000 (I)] denotes the encoded frame data of frame 0000 captured by camera 01, which is an I picture. (P) denotes a P picture and (B) a B picture. [Header001] is header information set for each GOP block. [Audio01 0000-0011] indicates the audio data reproduced in correspondence with the image data of each block. As shown in FIG. 11, each of the moving image data from the different viewpoints is divided into encoded data blocks each composed of a plurality of frames, and the encoded data blocks corresponding to the different viewpoints are stored on the disc in an interleaved arrangement.

【００６３】ここで図１１に示すようにカメラ０１の映
像を第００００フレームから第０００５フレームまで表
示し、第０００６フレームはカメラ０２の映像を表示し
たいとする。各カメラ毎の画像データは、１２フレーム
をＧＯＰ単位としてＭＰＥＧ２符号化処理がなされてい
るが、カメラ０１の映像およびカメラ０２の映像は、そ
れぞれ６フレームずれたＧＯＰ構成となっている。Suppose, as shown in FIG. 11, that the video of camera 01 is to be displayed from frame 0000 through frame 0005 and the video of camera 02 at frame 0006. The image data of each camera is MPEG2-encoded with 12 frames per GOP, but the GOPs of the camera 01 video and the camera 02 video are offset from each other by 6 frames.

【００６４】従って、カメラ０２の第０００６フレーム
目から表示しようとした場合、カメラ０２の第０００６
フレームの属しているＧＯＰ、すなわち［Ｃａｍ０２，
Ｂｌｏｃｋ００１］の符号化データブロックのＧＯＰ単
位で復号を行なうと、カメラ０２の視点Ｂ画像符号化デ
ータの最初の符号化データブロックとしてのＧＯＰは第
０００６フレームからスタートしているため、視点切り
換え要求からカメラ０２の視点Ｂ画像符号化データの第
０００６フレーム復号完了までの時間が短縮されること
になる。その結果、再生画像の途切れを発生させること
のないスムーズな視点切り換え処理が可能となる。デー
タ格納距離の物理的位置も近接することになり、読み出
しヘッドのシーク時間の短縮も達成されることになる。Accordingly, when display is to begin from frame 0006 of camera 02, decoding is performed in units of the GOP to which frame 0006 of camera 02 belongs, that is, the encoded data block [Cam02, Block001]. Because the first GOP of the viewpoint B encoded image data of camera 02 starts at frame 0006, the time from the viewpoint switching request to completion of decoding of frame 0006 of the viewpoint B encoded image data is shortened. As a result, smooth viewpoint switching without interruption of the reproduced image becomes possible. The physical storage positions of the data are also closer together, so the seek time of the read head is shortened as well.
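The benefit described above can be made concrete by counting how many frames must be decoded ahead of the target frame in each layout. The sketch below is only an illustration under the 12-frame GOP assumption of FIGS. 8 and 11; the function name and the `offset` parameter are hypothetical.

```python
def frames_to_decode_before(target_frame, gop_len, offset):
    """Number of frames decoded ahead of target_frame inside the GOP that contains it.

    offset is the start frame of the viewpoint's first GOP; frames earlier than
    offset are outside this simple model.
    """
    if target_frame < offset:
        raise ValueError("target frame precedes the first GOP in this model")
    return (target_frame - offset) % gop_len


aligned = frames_to_decode_before(6, gop_len=12, offset=0)    # FIG. 8 layout
staggered = frames_to_decode_before(6, gop_len=12, offset=6)  # FIG. 11 layout
print(aligned, staggered)
```

In the aligned layout six frames (0000 to 0005) are decoded only to reach frame 0006, while in the staggered layout frame 0006 is the first frame of its GOP.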

【００６５】図１０および図１１に示した２つの異なる
視点からの画像データを格納したディスクからのデータ
読み出し、復号、再生処理の具体例を、図１２のタイム
チャートを使って説明する。A specific example of data reading, decoding, and reproduction processing from a disk storing image data from two different viewpoints shown in FIGS. 10 and 11 will be described with reference to the time chart of FIG.

【００６６】図１２のタイムチャートは、最下段の時間
軸に示すように右方向に時間経過を示しており、上段の
（ａ）がディスプレイにおける各フレームのデコードお
よび表示処理時間を示し、中段の（ｂ）が、ディスクに
格納された各符号化データブロックとしてのＧＯＰの読
み出し処理タイミングを示している。中段の（ｂ）の各
符号化データブロックとしてのＧＯＰの読み出し処理後
に、上段の（ａ）の復号、表示処理が開始される。The time chart of FIG. 12 shows the passage of time in the right direction as shown on the time axis at the bottom, the upper part (a) shows the decoding and display processing time of each frame in the display, and the middle part of the time chart. (B) shows the read processing timing of GOP as each encoded data block stored in the disk. After the GOP as each encoded data block in the middle row (b) is read out, the decoding and display processing in the upper row (a) is started.

【００６７】図１２の処理例において、以下のような前
提条件を設定する。１フレームの表示時間は３３ｍｓとする。１つの符号化データブロックは１２フレームの画像
から構成されている（１２フレームの表示処理に３９６
ｍｓかかる）。隣接するカメラ間の符号化データとして設定される
符号化データブロック（ＧＯＰ）は、６フレーム分スタ
ートがずれている。１ブロックのデータ読み出しには３００ｍｓ必要で
あるとする。安全のために１ブロック分のデータをバッファして
おくようにする。ユーザは１回だけ視点変更スイッチを押すIn the processing example of FIG. 12, the following preconditions are set. The display time of one frame is 33 ms. One encoded data block consists of 12 frames of images (displaying 12 frames takes 396 ms). The encoded data blocks (GOPs) set as the encoded data of adjacent cameras have their starts shifted by 6 frames. Reading one block of data takes 300 ms. For safety, one block of data is kept buffered. The user presses the viewpoint change switch only once.

【００６８】図１２の横軸は時間である。まず時刻ｔ＝
０においてカメラ０１の第００１ブロックのデータを読
み出す。ｔ＝３００ｍｓでカメラ０１の第００１ブロッ
クの読み出しが完了した後、カメラ０１の第００２ブロ
ックのデータを読み出す。ｔ＝６００ｍｓでこれらが完
了すると、デコード処理と表示処理がスタートする。そ
れと共にカメラ０１の第００３ブロックの読み出し処理
が並行して実行される。The horizontal axis of FIG. 12 is time. First, at time t = 0, reading of block 001 of camera 01 begins. When reading of block 001 of camera 01 completes at t = 300 ms, block 002 of camera 01 is read. When these reads are complete at t = 600 ms, decoding and display start, and reading of block 003 of camera 01 proceeds in parallel.

【００６９】順調にデコード処理および表示処理が進ん
でいるとき、ｔ＝１．１ｓにおいてユーザからカメラ視
点移動のスイッチが押され、表示画像のカメラ視点をカ
メラ０１の画像からカメラ０２の画像に切り換え要求が
入力されたとする。このキー入力のタイミングでは、図
から理解されるように、カメラ０１ブロック００４の
ＧＯＰ符号化データの読み出し中である。この読み出し
が完了しないと、ユーザによって要求されたカメラ０２
の画像データを読み出すことはできない。Suppose that, while decoding and display are proceeding smoothly, the user presses the camera viewpoint switch at t = 1.1 s, requesting that the displayed camera viewpoint be switched from the image of camera 01 to the image of camera 02. As can be seen from the figure, at the time of this key input the GOP encoded data of camera 01, block 004 is being read. The image data of camera 02 requested by the user cannot be read until this read completes.
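The playback state at the moment of the switch request can be reconstructed from the preconditions with a short sketch. It assumes that block reads run back to back from t = 0 and that display starts at 600 ms, as in the timeline above; the function and constant names are hypothetical, and the model is only meaningful for times at or after the start of display.

```python
FRAME_MS = 33            # display time of one frame
GOP_LEN = 12             # frames per encoded data block
READ_MS = 300            # time to read one block
DISPLAY_START_MS = 600   # display starts once two blocks have been read


def playback_state(t_ms):
    """Return (block being read, frame on display, frames left in the displayed block).

    Blocks are 1-based and frames 0-based. Reads are assumed to run back to back.
    """
    reading_block = t_ms // READ_MS + 1
    frame = (t_ms - DISPLAY_START_MS) // FRAME_MS
    frames_left = GOP_LEN - frame % GOP_LEN   # counts the frame currently on display
    return reading_block, frame, frames_left


print(playback_state(1100))
```

At t = 1.1 s this simple model reports that block 004 is being read, in agreement with the description, and that nine frames of the block on display remain to be shown.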

【００７０】ここでカメラ０２のどのフレームを格納し
たＧＯＰデータを読み出せば良いかを判定する処理が必
要となる。視点切り換え入力がなされた時点では、カメ
ラ０１ブロック００４、すなわち、フレーム番号００
３６−００４７を読み出している。処理時間を考慮した
場合、最も安全なのは、カメラ０２ブロック００４、
すなわち、フレーム番号００４２−００５３を含むカメ
ラ０２のＧＯＰブロックを次に読み出し、カメラ０１
ブロック００４の再生を００４１フレームで中断し、カ
メラ０２の画像再生に移ることである。It is now necessary to determine which GOP of camera 02, that is, the GOP storing which frames, should be read. At the time of the viewpoint switching input, camera 01, block 004 (frame numbers 0036-0047) is being read. Taking processing time into account, the safest course is to read next the GOP block of camera 02 that contains frame numbers 0042-0053, namely camera 02, block 004, interrupt reproduction of camera 01, block 004 at frame 0041, and move to reproduction of the camera 02 image.

【００７１】しかしこれではユーザからの視点切り換え
スイッチが押されてから、カメラが切り換わるまでの時
間が長くなる。そこで図に示すように、カメラ０２ブ
ロック００３、すなわちフレーム番号００３０−００４
１を格納したカメラ０２の符号化データブロック（ＧＯ
Ｐ）を、次に読み出して、カメラ０１ブロック００３
に属するフレーム番号００２４−００３５の再生をフレ
ーム番号００２４−００２９だけ行った後、カメラ０２
のブロック００３、すなわちフレーム番号００３０−０
０４１の再生に切り換えることが可能かどうかの判定処
理を行なう。判定処理は、図４に示す制御手段としての
ＣＰＵ４５６により実行される。以下、判定処理につい
て説明する。However, this lengthens the time from the user pressing the viewpoint switch until the camera actually switches. Therefore, as shown in the figure, a determination is made as to whether it is possible to read next the encoded data block (GOP) of camera 02 that stores frame numbers 0030-0041, namely camera 02, block 003, reproduce only frame numbers 0024-0029 of frame numbers 0024-0035 belonging to camera 01, block 003, and then switch to reproduction of camera 02, block 003, that is, frame numbers 0030-0041. This determination is executed by the CPU 456 serving as the control means shown in FIG. 4. The determination process is described below.

【００７２】１フレームあたりの表示時間：ｔ（ｄｓ
ｐ）＝０．０３３ｓ視点切り換えスイッチが押されたときの時刻：ｔ（ｓ
ｗ）＝１．１ｓ視点切り換えスイッチが押されたときに読み出している
符号化データブロックの予想残り読み出し時間：ｔ（ｒ
ｄｅｎｄ）＝０．１５ｓ視点切り換えスイッチが押されたときに表示している符
号化データブロックの表示残りフレーム数：ｆ（ｒｅｓ
ｔ）＝９符号化データブロックの予想読み出し時間：ｔ（ｂｌ
ｋ）＝０．３ｓ切り換え視点のＧＯＰのフレームずれ量：ｆ（ａｂ）＝
６としたとき、下式２が成立するか否かを判定する。With the following values, it is determined whether Equation 2 below holds.
Display time per frame: t(dsp) = 0.033 s
Time at which the viewpoint switch is pressed: t(sw) = 1.1 s
Expected remaining read time of the encoded data block being read when the viewpoint switch is pressed: t(rdend) = 0.15 s
Number of frames remaining to be displayed in the encoded data block on display when the viewpoint switch is pressed: f(rest) = 9
Expected read time of one encoded data block: t(blk) = 0.3 s
GOP frame shift amount of the viewpoint being switched to: f(ab) = 6

【００７３】[0073]

【数２】ｔ（ｒｄｅｎｄ）＋ｔ（ｂｌｋ）＜（ｆ（ｒｅｓｔ）＋ｆ（ａｂ））×ｔ（ｄｓｐ）……（式２）[Equation 2] t(rdend) + t(blk) < (f(rest) + f(ab)) × t(dsp) ... (Equation 2)

【００７４】上記式２は、左辺が、視点切り換えスイッ
チが押された時点での読み出し途中の符号化データブロ
ック読み出しに必要な残り時間と、その後の１ブロック
の読み出し時間の総時間を示し、右辺が視点切り換えス
イッチが押された時点での表示ブロックの残りフレーム
の表示時間と、切り換え視点の符号化データブロック
（ＧＯＰ）のフレームずれ量分のフレームの表示時間と
の総時間を示している。上記式２に示す条件が成り立て
ば、カメラ０２ブロック００３のフレーム番号００３
０−００４１を次に読み出して、カメラ０１ブロック
００３のフレーム番号００２４−００３５の再生をフレ
ーム番号００２４−００２９だけ行い、その後カメラ０
２に切り換えて、カメラ０２ブロック００３のフレー
ム番号００３０−００４１の再生を実行することが可能
である。In Equation 2, the left side is the total of the remaining time needed to finish reading the encoded data block that is partway through being read when the viewpoint switch is pressed and the time to read one further block. The right side is the total of the display time of the remaining frames of the block on display when the viewpoint switch is pressed and the display time of the number of frames equal to the frame shift amount of the encoded data blocks (GOPs) of the viewpoint being switched to. If the condition of Equation 2 holds, it is possible to read frame numbers 0030-0041 of camera 02, block 003 next, reproduce only frame numbers 0024-0029 of frame numbers 0024-0035 of camera 01, block 003, then switch to camera 02 and reproduce frame numbers 0030-0041 of camera 02, block 003.

【００７５】上記式２に、上述した条件に基づく値を入
れて計算すると、０．１５＋０．３＜（９＋６）×０．０３３となり、左辺＝０．４５ｓであり、スイッチが押された
時刻が１．１ｓだとすると、時刻１．５５ｓにはカメラ
０２ブロック００３の読み出しが完了すると予測でき
る。また、上記式２の右辺は、（９＋６）×０．０３３
＝０．４９５となり、スイッチが押された時刻が１．１
ｓなので、時刻１．５９５ｓに表示の切り換えが実行さ
れる。Substituting the values given above into Equation 2 gives 0.15 + 0.3 < (9 + 6) × 0.033. The left side is 0.45 s, and since the switch was pressed at time 1.1 s, reading of camera 02, block 003 can be predicted to complete at time 1.55 s. The right side is (9 + 6) × 0.033 = 0.495 s, and since the switch was pressed at time 1.1 s, the display is switched at time 1.595 s.

【００７６】従って、０．３３＜０．４９５が成立し、
式２を適用した判定結果に基づいて、表示切り換わり前
にデータが用意できると結論づけられる。このときすで
に読み出したカメラ０１ブロック００４のデータは破
棄してよい。Therefore, 0.45 < 0.495 holds, and based on the result of applying Equation 2 it is concluded that the data can be ready before the display switches. The data of camera 01, block 004 that has already been read may be discarded at this point.
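The check expressed by Equation 2 can be written as a small function. This is a sketch for illustration only: the function name and return shape are hypothetical, while the parameter names mirror the symbols t(rdend), t(blk), f(rest), f(ab), and t(dsp) defined in the text.

```python
def can_switch_early(t_rdend, t_blk, f_rest, f_ab, t_dsp):
    """Evaluate Equation 2: t(rdend) + t(blk) < (f(rest) + f(ab)) * t(dsp).

    Returns (ok, read_time, display_time), with all times in seconds.
    """
    read_time = t_rdend + t_blk               # finish the current read, then read the new block
    display_time = (f_rest + f_ab) * t_dsp    # frames still shown before the switch point
    return read_time < display_time, read_time, display_time


ok, lhs, rhs = can_switch_early(t_rdend=0.15, t_blk=0.3, f_rest=9, f_ab=6, t_dsp=0.033)
t_sw = 1.1  # time at which the viewpoint switch is pressed
print(ok, round(t_sw + lhs, 3), round(t_sw + rhs, 3))
```

With the values from the text, the read is predicted to finish at about 1.55 s and the display switches at about 1.595 s, so the earlier block of camera 02 can be used.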

【００７７】実際にはディスクの予想読み出し時間は、
ディスク上の位置やシーク量によって異なってくるた
め、それらのパラメータを入れ込んだ予想アクセス時間
を求める必要がある。またブロック内のフレーム毎のデ
コード処理時間も考慮する必要がある。In practice, the expected read time of the disc varies with the position on the disc and the seek distance, so an expected access time that incorporates those parameters must be obtained. The decoding time of each frame within the block must also be taken into account.

【００７８】図１２に示した例は、異なる視点画像デー
タは２つのみであり、ユーザによる視点切り換えコマン
ドの入力を行なうスイッチ入力は１回のみの例を示した
が、次に、図１３および図１４を参照して、５つの異な
る視点画像データを符号化データブロック（ＧＯＰ）と
したデータ格納構成を持つディスクからの再生処理にお
いて、ユーザによる視点切り換えコマンドの入力を連続
して実行した場合の処理について説明する。The example shown in FIG. 12 involved only two different viewpoint image data and a single switch input by the user for the viewpoint switching command. Next, with reference to FIGS. 13 and 14, processing is described for the case in which the user inputs viewpoint switching commands in succession during reproduction from a disc whose data storage structure holds five different viewpoint image data as encoded data blocks (GOPs).

【００７９】図１３に、５つの異なる視点画像データを
符号化データブロック（ＧＯＰ）としたデータ格納構成
を示す。図１３に示す例ではカメラ０１の視点Ａ画像の
符号化データ、カメラ０３の視点Ｃ画像の符号化デー
タ、およびカメラ０５の視点Ｅ画像の符号化データの最
初の符号化データブロック（ＧＯＰ）は、第００００フ
レームからスタートし第００１１フレームで終了する１
２フレーム構成であるが、カメラ０２の視点Ｂ画像の符
号化データブロック、およびカメラ０４の視点Ｄ画像の
符号化データブロックの最初ＧＯＰは第０００６フレー
ムからスタートし第００１７フレームで終了する１２フ
レームによって構成される。FIG. 13 shows a data storage structure in which five different viewpoint image data are stored as encoded data blocks (GOPs). In the example shown in FIG. 13, the first encoded data block (GOP) of the viewpoint A image of camera 01, the viewpoint C image of camera 03, and the viewpoint E image of camera 05 is a 12-frame block that starts at frame 0000 and ends at frame 0011, whereas the first GOP of the encoded data blocks of the viewpoint B image of camera 02 and the viewpoint D image of camera 04 consists of the 12 frames starting at frame 0006 and ending at frame 0017.

[0080] The encoded data blocks (GOPs) of the viewpoint images of camera 01, camera 03, and camera 05 and those of camera 02 and camera 04 are thus composed of frames shifted by six frames relative to each other. In other words, the start frame of the encoded data block (GOP) is offset between adjacent cameras by 1/2 of the number of frames stored in one block (12 frames).
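The staggered layout of FIG. 13 can be illustrated with a short sketch. It assumes 1-based camera numbers, with odd-numbered cameras starting at frame 0000 and even-numbered cameras starting half a GOP later. The names `GOP_LENGTH`, `SHIFT`, and `gop_ranges` are hypothetical and do not appear in the specification.

```python
GOP_LENGTH = 12           # frames per encoded data block (GOP), as in FIG. 13
SHIFT = GOP_LENGTH // 2   # half-GOP offset between adjacent cameras


def gop_ranges(camera, gop_count):
    """Return (first_frame, last_frame) for the first gop_count GOPs of a camera.

    Assumption: cameras are numbered from 1; even-numbered cameras are
    shifted by SHIFT frames, odd-numbered cameras are not.
    """
    offset = SHIFT if camera % 2 == 0 else 0
    return [
        (offset + n * GOP_LENGTH, offset + (n + 1) * GOP_LENGTH - 1)
        for n in range(gop_count)
    ]


if __name__ == "__main__":
    for cam in range(1, 6):
        print(f"camera {cam:02d}:", gop_ranges(cam, 3))
```

Under these assumptions the third block of camera 02 covers frames 0030 to 0041, which agrees with the frame numbers quoted for block 003 of camera 02 in the description of FIG. 14.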

[0081] With reference to FIG. 14, processing is described for the case where the user operates the viewpoint switch continuously on the data of FIG. 13, in which the five sets of viewpoint image data are stored as encoded data blocks (GOPs) with shifted frames. The user's viewpoint switch input is assumed to switch the view in the order camera 01 → camera 02 → camera 03 → camera 04 → camera 05 → camera 01. When, for example, a switch on the controller is held in the ON state, the input is processed as a series of intermittent commands issued every Δt and, as shown in FIG. 14, is recognized as inputs (a) to (d) at the predetermined interval Δt.

[0082] In the example of FIG. 14, the user presses the viewpoint switch at time 1.1 s (input (a)) and then holds it on until time 1.7 s, so that the input is identified as inputs (a) to (d) at the predetermined interval Δt. As described earlier with reference to FIG. 12, Expression 2 is judged to be satisfied for the first viewpoint switching command (a). Block 003 of camera 02 can therefore be read, and the display can be switched from frames 0024-0029 of block 003 of camera 01 to frame 0030 onward of block 003 of camera 02. This switching process is executed.

[0083] If input of viewpoint switching command (b) is detected while block 003 of camera 02 is being read, that block is not read in its entirety. Only the first half of the GOP block of block 003 of camera 02, frames 0030-0035, is read, after which reading starts from the first half of the image block of block 003 of camera 03, which is the destination of the viewpoint move requested by command (b).

[0084] If input of viewpoint switching command (c) is then detected while the image block of block 003 of camera 03 is being read, reading starts from the first half of the image block of block 004 of camera 04, which is the destination of the viewpoint move requested by command (c). Likewise, if input of viewpoint switching command (d) is detected while the image block of block 004 of camera 04 is being read, reading starts from the first half of the image block of block 005 of camera 05, which is the destination of the viewpoint move requested by command (d).

[0085] This processing allows the viewpoint to move between cameras continuously. If no viewpoint switch input is detected while an encoded data block is being read, that is, once the user releases the viewpoint switch on the controller or the like, the next block is read in full rather than only half of it. As a result, the image block of block 005 of camera 05 is read as the complete frame data (f: 0048-0059). These data read and data display controls are executed under the control of the CPU 456, which serves as the control means shown in FIG. 4.
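The half-block and full-block reads described for FIG. 14 can be sketched as follows. The sketch assumes that each command arrives while the previous block is still being read and that consecutive cameras are offset by half a GOP, so that each half-block read ends exactly where the next camera's block begins. The function name `plan_reads` and its parameters are hypothetical.

```python
def plan_reads(cameras, start_frame, gop_length=12):
    """Return (camera, first_frame, last_frame) for each read in a held-switch sequence.

    Assumption: a switch command interrupts every read except the last one,
    so all reads but the last cover only the first half of a GOP; the last
    read, made after the switch is released, covers a whole GOP.
    """
    half = gop_length // 2
    plan = []
    frame = start_frame
    for index, camera in enumerate(cameras):
        is_last = index == len(cameras) - 1
        count = gop_length if is_last else half
        plan.append((camera, frame, frame + count - 1))
        frame += count
    return plan


if __name__ == "__main__":
    # Cameras 02 to 05, starting at frame 0030 as in the FIG. 14 example.
    for camera, first, last in plan_reads([2, 3, 4, 5], 30):
        print(f"camera {camera:02d}: frames {first:04d}-{last:04d}")
```

With these inputs the final read covers frames 0048-0059 of camera 05, matching the full-block read described above.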

[0086] A GOP produced by MPEG compression may hold not only the frame image data of the I, B, and P pictures but also audio data (Audio), as in the data structure of FIG. 11 shown earlier. In that case, even when only the first-half frames of a camera's video block are read, the audio data for the whole block comes with them. Consequently, even if a data block arrives later than its expected read time and the picture pauses momentarily, the audio alone can continue to be output (for six frames' worth).

[0087] The configuration in which the frames stored in the blocks of the respective viewpoint image data are shifted, as in the present configuration, therefore has the further effect of reducing the likelihood that audio output stops when the expected read time fluctuates during video switching between adjacent cameras.

[0088] FIG. 15 shows how image data from three different viewpoints is stored on the disc. In FIG. 15, [Cam01], [Cam02], and [Cam03] denote moving image data captured by camera 01, camera 02, and camera 03, respectively. [Block001] denotes GOP block 001, and [cam01 0000 (I)] denotes the encoded data of frame 0000 captured by camera 01, which is an I picture. (P) denotes a P picture and (B) a B picture. [Header001] is the header information set for each GOP block. [Audio01 0000-0011] denotes the audio data reproduced together with the image data of each block.

[0089] In the configuration of FIG. 15, one encoded data block (GOP) consists of 12 frames. The first GOP of the encoded data of the viewpoint images of camera 01 and camera 03 starts at frame 0000 and ends at frame 0011, whereas the first GOP of the encoded data of the viewpoint image of camera 02 consists of the 12 frames from frame 0006 to frame 0017. The subsequent GOPs are shifted by six frames in the same way.

[0090] Because the GOPs of the encoded viewpoint image data of camera 01 and camera 03 and those of camera 02 are composed of frames shifted by six frames, switching in the order camera 01 → camera 02 → camera 03 → camera 01 is executed smoothly.

[0091] FIG. 16 likewise shows a configuration for storing image data from three different viewpoints on the disc, but with GOPs of six frames. In FIG. 16 as well, Cam01, Cam02, and Cam03 denote moving image data captured by camera 01, camera 02, and camera 03, but each encoded data block contains two GOPs. That is, two six-frame GOPs are stored per block, so that one block consists of 12 frames.

[0092] When data is stored in the configuration of FIG. 16, each six-frame GOP, in both the first half and the second half of every block, contains an I picture. Accurate decoding is therefore possible even when only the second-half frames of a block are read.

[0093] Next, with reference to FIG. 17, the procedure by which the data processing device of the present invention decodes and reproduces data stored in GOP units is described.

[0094] First, in step S101, the encoded data block holding image data from the viewpoint specified by the user, or from the default viewpoint programmed in advance, is read. In step S102, it is determined whether a viewpoint switching command has been input. If no command has been input, the process proceeds to step S105, where the read encoded data block of the viewpoint image is decoded and displayed.

[0095] If it is determined in step S102 that a viewpoint switching command has been input, the process proceeds to step S103, where the block to be read after the viewpoint switch is determined. This determination uses Expression 2 described earlier and is computed from the following values at the time the input of the viewpoint switching command is detected: the display time per frame, t(dsp); the time at which the viewpoint switch was pressed, t(sw); the expected remaining read time of the data block being read when the viewpoint switch was pressed, t(rdend); the number of frames still to be displayed in the data block being displayed when the viewpoint switch was pressed, f(rest); the expected read time of a data block, t(blk); and the frame shift amount of the encoded data block of the destination viewpoint, f(ab).
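The determination in step S103 can be sketched as a single comparison. Expression 2 itself is defined earlier in the specification and is not reproduced in this part, so the inequality below is an assumption reconstructed from the wording of claim 2: the remaining read time plus one block read time is compared with the display time of the remaining frames plus the shifted frames. The function name and the sample values are hypothetical.

```python
def can_switch_in_time(t_dsp, t_rdend, t_blk, f_rest, f_ab):
    """Return True if the destination block can be read before it must be displayed.

    Assumed form of the check (not quoted from the specification):
        t_rdend + t_blk < (f_rest + f_ab) * t_dsp
    t_dsp   : display time per frame, in seconds
    t_rdend : expected remaining read time of the block being read
    t_blk   : expected read time of one data block
    f_rest  : frames still to be displayed in the current block
    f_ab    : frame shift between the blocks before and after the switch
    """
    time_needed = t_rdend + t_blk
    time_available = (f_rest + f_ab) * t_dsp
    return time_needed < time_available


if __name__ == "__main__":
    # Hypothetical inputs chosen only to illustrate the comparison.
    print(can_switch_in_time(t_dsp=0.033, t_rdend=0.11, t_blk=0.22, f_rest=9, f_ab=6))
```

In this assumed form the press time t(sw) does not appear explicitly. It would enter through t_rdend and f_rest, which are both measured at the moment the switch is pressed.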

[0096] In step S104, the post-switch encoded data block selected by the determination in step S103 is read. The process then proceeds to step S105, where the read encoded data block of the viewpoint image is decoded and displayed.

[0097] By executing the above processing, data is read from a storage medium such as a DVD on which multi-view image data is stored as encoded data, then decoded and reproduced smoothly, without any interruption of the displayed image.
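The flow of steps S101 to S105 can be sketched as one pass of a playback cycle. The callables passed in (`read_block`, `switch_requested`, `choose_block`, `decode_and_display`) are hypothetical stand-ins for the disc reader, the input interface, the Expression 2 determination, and the codec and display path. None of these names comes from the specification.

```python
def playback_cycle(block_id, read_block, switch_requested, choose_block, decode_and_display):
    """Run one pass of the FIG. 17 flow and return the block that was displayed."""
    block = read_block(block_id)          # S101: read the block for the current viewpoint
    if switch_requested():                # S102: was a viewpoint switching command input?
        target_id = choose_block(block)   # S103: determine the post-switch block to read
        block = read_block(target_id)     # S104: read the selected post-switch block
    decode_and_display(block)             # S105: decode and display the block
    return block


if __name__ == "__main__":
    shown = []
    playback_cycle(
        "cam01/block003",
        read_block=lambda block_id: f"data({block_id})",
        switch_requested=lambda: True,
        choose_block=lambda current: "cam02/block003",
        decode_and_display=shown.append,
    )
    print(shown)
```

Passing the stages in as callables is only a convenience for this sketch. In the device described here these stages run under the control of the CPU 456.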

[0098] The embodiments described above showed configurations in which the viewpoint is switched in a set order, such as camera 01 → camera 02 → camera 03 → camera 01. With the block configuration shown in FIG. 18, for example, smooth display is also possible when the viewpoint moves at random, without a set switching order.

[0099] In the example of FIG. 18, the encoded data blocks of the viewpoint A image of camera 01 are stored on a storage medium such as a disc as two streams. One stream has a 12-frame structure that starts at frame 0000 and ends at frame 0011; the other consists of the 12 frames that start at frame 0006 and end at frame 0017. Likewise, for the encoded data of the viewpoint B image of camera 02 and of the viewpoint C image of camera 03, both an encoded data block with the 12-frame structure from frame 0000 to frame 0011 and an encoded data block consisting of the 12 frames from frame 0006 to frame 0017 are stored on the disc.

[0100] With this configuration, the time required for switching can be shortened not only when the viewpoint is switched in a set order, such as camera 01 → camera 02 → camera 03 → camera 01, but also when it is switched at random, such as camera 01 → camera 03 → camera 02 → camera 01. Smooth image display without interruption at the time of viewpoint switching thus becomes possible.
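One way to exploit the two-stream layout of FIG. 18 is sketched below. The specification does not state a selection rule, so the rule here is an assumption: for the destination camera, choose the stream whose next GOP boundary falls soonest at or after the frame where the switch should take effect. The names `next_gop_start` and `pick_stream` are hypothetical.

```python
def next_gop_start(frame, offset, gop_length=12):
    """Return the first GOP start frame at or after `frame` for a stream with this offset."""
    if frame <= offset:
        return offset
    # Ceiling division on the distance from the stream's first GOP start.
    gops_ahead = -(-(frame - offset) // gop_length)
    return offset + gops_ahead * gop_length


def pick_stream(frame, offsets=(0, 6), gop_length=12):
    """Return the offset of the stream whose next GOP starts soonest (assumed rule)."""
    return min(offsets, key=lambda offset: next_gop_start(frame, offset, gop_length))


if __name__ == "__main__":
    for frame in (30, 31, 36):
        offset = pick_stream(frame)
        print(frame, "-> stream offset", offset, "GOP start", next_gop_start(frame, offset))
```

Because either stream of any camera can be chosen, the wait for a GOP boundary is bounded by half a GOP regardless of which camera was displayed before the switch.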

[0101] The series of processes described in this specification can be executed by hardware, by software, or by a combination of both. When the processes are executed by software, a program recording the processing sequence can be installed in the memory of a computer built into dedicated hardware and executed there, or it can be installed on a general-purpose computer capable of executing various processes and executed there.

[0102] For example, the program can be recorded in advance on a hard disk or a ROM (Read Only Memory) serving as a recording medium. Alternatively, the program can be stored (recorded) temporarily or permanently on a removable recording medium such as a floppy (registered trademark) disk, a CD-ROM (Compact Disc Read Only Memory), an MO (Magneto Optical) disk, a DVD (Digital Versatile Disc), a magnetic disk, or a semiconductor memory. Such a removable recording medium can be provided as so-called packaged software.

[0103] Besides being installed on a computer from a removable recording medium as described above, the program can be transferred to the computer wirelessly from a download site, or by wire over a network such as a LAN (Local Area Network) or the Internet. The computer receives the program transferred in this way and installs it on a recording medium such as a built-in hard disk.

[0104] The various processes described in this specification need not be executed only in time series in the order described; they may also be executed in parallel or individually, according to the processing capability of the device executing them or as required. In this specification, a system is a logical collection of a plurality of devices, and the constituent devices are not necessarily housed in the same enclosure.

[0105] The present invention has been described in detail above with reference to specific embodiments. It is self-evident, however, that those skilled in the art can modify or substitute the embodiments without departing from the gist of the present invention. The present invention has been disclosed by way of example and should not be interpreted restrictively. To determine the gist of the present invention, the claims set out at the beginning should be taken into consideration.

[0106]

[Effects of the Invention] As described above, according to the data processing device, data processing method, and information storage medium of the present invention, in a configuration that reads, decodes, and reproduces data from a medium on which image data from a plurality of viewpoints is encoded and stored, the block data that can be reproduced after a viewpoint switch is determined reliably once a viewpoint switching request occurs. Errors such as interruption of the displayed image caused by viewpoint switching can therefore be prevented.

[0107] Furthermore, according to the data processing device, data processing method, and information storage medium of the present invention, the encoded data blocks from different viewpoints are composed of frame data shifted relative to each other and are stored on a storage medium such as a DVD. When data is read from the medium, decoded, and reproduced, the time from the occurrence of a switching request to the reproduction of the reproducible post-switch block data can therefore be shortened, and the image can be displayed more quickly after a viewpoint switch.

[Brief description of drawings]

FIG. 1 is a diagram illustrating a conventional disc storage configuration for multi-view image data.

FIG. 2 is a diagram illustrating seek time in conventional reproduction of multi-view image data from a disc.

FIG. 3 is a diagram illustrating seek time in conventional reproduction of multi-view image data from a disc.

FIG. 4 is a diagram showing the configuration of the data processing device of the present invention.

FIG. 5 is a diagram illustrating the structure of MPEG image data.

FIG. 6 is a diagram illustrating the structure of MPEG image data for a multi-view image.

FIG. 7 is a diagram illustrating how video interruption occurs in the decoding and reproduction of encoded image data for a multi-view image.

FIG. 8 is a diagram illustrating an example of a disc storage configuration for MPEG image data of a multi-view image.

FIG. 9 is a diagram illustrating an example of a disc storage configuration for MPEG image data of a multi-view image.

FIG. 10 is a diagram illustrating a configuration example of encoded data blocks of a multi-view image in the configuration of the present invention.

FIG. 11 is a diagram illustrating an example of a disc storage configuration for encoded data blocks of a multi-view image in the configuration of the present invention.

FIG. 12 is a diagram illustrating a reproduction processing sequence for encoded data blocks of a multi-view image in the configuration of the present invention.

FIG. 13 is a diagram illustrating a configuration example of encoded data blocks of a multi-view image in the configuration of the present invention.

FIG. 14 is a diagram illustrating a reproduction processing sequence for encoded data blocks of a multi-view image in the configuration of the present invention.

FIG. 15 is a diagram illustrating an example of a disc storage configuration for encoded data blocks of a multi-view image in the configuration of the present invention.

FIG. 16 is a diagram illustrating an example of a disc storage configuration for encoded data blocks of a multi-view image in the configuration of the present invention.

FIG. 17 is a processing flow diagram of the data processing device according to the present invention, which reproduces encoded data blocks of a multi-view image.

FIG. 18 is a diagram illustrating a configuration example of encoded data blocks of a multi-view image in the configuration of the present invention.

[Explanation of symbols]

101, 102, 103: video data
301 to 304: video data
432: display
433: video camera
434: microphone
435: speaker
436: keyboard
437: mouse
438: controller
450: data processing device
451: codec
452: network interface
453: input interface
454: AV interface
456: CPU
457: memory
458: recording medium
461: frame memory
462: D/A converter
463: audio buffer
464: D/A converter

Continuation of front page

F-terms (reference): 5C052 AA01 AB03 CC11 DD04; 5C053 FA23 GB01 GB06 GB11 GB37 JA01 KA04 KA24 KA25 LA01 LA06; 5D044 AB07 BC01 BC04 CC04 DE14 DE37 FG10 GK08

Claims

[Claims]

1. A data processing device that executes reproduction processing of multi-view image data, comprising: storage means that stores, in encoded form, a plurality of moving image data captured from different viewpoints; decoding means for decoding encoded data read from the storage means; display means for displaying the data decoded by the decoding means; input means for inputting a viewpoint switching command for the moving image data displayed on the display means; and control means for determining the data to be read from the storage means on the basis of the viewpoint switching command from the input means, wherein the storage means stores each of the plurality of moving image data in units of encoded data blocks each composed of a plurality of frames, with the frames stored in the encoded data blocks of moving image data that precede and follow one another in a switch made by the viewpoint switching command being shifted relative to each other, and the control means executes processing to determine the encoded data block to be read on the basis of the states of data read processing and data display processing at the time the input of the viewpoint switching command is detected.

2. The data processing device according to claim 1, wherein the control means executes the processing to determine the encoded data block to be read by comparing (i) the sum of the remaining time required to finish reading the encoded data block being read from the storage means when the input of the viewpoint switching command is detected and the time required to read the one subsequent encoded data block with (ii) the sum of the display time of the remaining frames of the data block being displayed on the display means when the input of the viewpoint switching command is detected and the frame display time corresponding to the shift amount of the stored frames between the encoded data blocks before and after the viewpoint switch.

3. The data processing device according to claim 1, wherein the storage means stores each of the plurality of moving image data as MPEG-compressed encoded data blocks in units of GOPs (Groups of Pictures), each composed of a plurality of frames.

4. The data processing device according to claim 1, wherein the storage means holds each of the plurality of moving image data as encoded data blocks each composed of a plurality of frames, and the encoded data blocks corresponding to the plurality of moving image data from different viewpoints are arranged in an interleaved manner.

5. The data processing device according to claim 1, wherein the storage means stores each of the plurality of moving image data in units of encoded data blocks each composed of a plurality of frames, and the shift amount of the frames stored in the encoded data blocks of moving image data that precede and follow one another in a switch made by the viewpoint switching command is set to a predetermined amount.

6. The data processing device according to claim 5, wherein the shift amount is 1/2 of the number of frames stored in the encoded data block.

7. The data processing device according to claim 1, wherein the control means executes the processing to determine the encoded data block to be read on the basis of the states of data read processing and data display processing at the time the input of the viewpoint switching command is detected, and also executes processing to stop the image display of the pre-switch viewpoint in accordance with the start of the image display of the post-switch viewpoint based on the viewpoint switching command.

8. A data processing method for executing reproduction processing of multi-view image data, comprising: a decoding step of decoding encoded data read from storage means; a display step of displaying the data decoded in the decoding step; a command input step of inputting a viewpoint switching command for the moving image data displayed in the display step; and a control step of determining the data to be read from the storage means on the basis of the viewpoint switching command from the command input step, wherein the storage means stores each of a plurality of moving image data in units of encoded data blocks each composed of a plurality of frames, with the frames stored in the encoded data blocks of moving image data that precede and follow one another in a switch made by the viewpoint switching command being shifted relative to each other, and the control step executes processing to determine the encoded data block to be read on the basis of the states of data read processing and data display processing at the time the input of the viewpoint switching command is detected.

9. The data processing method according to claim 8, wherein the control step executes the processing to determine the encoded data block to be read by comparing (i) the sum of the remaining time required to finish reading the encoded data block being read from the storage means when the input of the viewpoint switching command is detected and the time required to read the one subsequent encoded data block with (ii) the sum of the display time of the remaining frames of the data block being displayed on the display means when the input of the viewpoint switching command is detected and the frame display time corresponding to the shift amount of the stored frames between the encoded data blocks before and after the viewpoint switch.

10. The data processing method according to claim 8, wherein the control step executes the processing to determine the encoded data block to be read on the basis of the states of data read processing and data display processing at the time the input of the viewpoint switching command is detected, and also executes processing to stop the image display of the pre-switch viewpoint in accordance with the start of the image display of the post-switch viewpoint based on the viewpoint switching command.

11. An information storage medium on which a plurality of moving image data captured from different viewpoints is encoded and stored, wherein each of the plurality of moving image data captured from different viewpoints is stored in units of encoded data blocks each composed of a plurality of frames, with the frames stored in the encoded data blocks of moving image data that precede and follow one another in a switch made by a viewpoint switching command being shifted relative to each other.

12. The information storage medium according to claim 11, wherein each of the plurality of moving image data is stored as MPEG-compressed encoded data blocks in units of GOPs (Groups of Pictures), each composed of a plurality of frames.

13. The information storage medium according to claim 11, wherein each of the plurality of moving image data is held as encoded data blocks each composed of a plurality of frames, and the encoded data blocks corresponding to the plurality of moving image data from different viewpoints are arranged in an interleaved manner.

14. The information storage medium according to claim 11, wherein each of the plurality of moving image data is stored in units of encoded data blocks each composed of a plurality of frames, and the shift amount of the frames stored in the encoded data blocks of moving image data that precede and follow one another in a switch made by a viewpoint switching command is set to a predetermined amount.

15. The data processing apparatus according to claim 14, wherein the shift amount is 1/2 of the number of frames stored in the encoded data block.

16. A computer program that causes a computer to execute data processing for reproducing multi-view image data, the program comprising: a decoding step of decoding encoded data read from storage means; a display step of displaying the data decoded in the decoding step; a command input step of inputting a viewpoint switching command for the moving image data displayed in the display step; and a step of executing, on the basis of the viewpoint switching command from the command input step, processing to determine the encoded data block to be read in accordance with the states of data read processing and data display processing at the time the input of the command is detected.