JP5484276B2

JP5484276B2 - Data compression apparatus, data decoding apparatus, data compression method, data decoding method, and data structure of compressed video file

Info

Publication number: JP5484276B2
Application number: JP2010204806A
Authority: JP
Inventors: 徹悟稲田; 章男大場; 博之勢川
Original assignee: Sony Interactive Entertainment Inc; Sony Computer Entertainment Inc
Current assignee: Sony Interactive Entertainment Inc
Priority date: 2010-09-13
Filing date: 2010-09-13
Publication date: 2014-05-07
Anticipated expiration: 2030-09-13
Also published as: JP2012060612A

Description

The present invention relates to an information processing apparatus and an information processing method for encoding and decoding three-dimensional data such as moving images.

Home entertainment systems have been proposed that can play back moving images as well as execute game programs. In such a home entertainment system, a GPU generates three-dimensional images using polygons (see, for example, Patent Document 1).

Whether the content is a moving image or a still image, displaying images efficiently is always an important issue. For this reason, various technologies have been developed and put into practical use in many fields, including image data compression, transmission, image processing, and display, so that high-definition images can now be readily enjoyed in a wide range of situations.

Patent Document 1: U.S. Patent No. 6,563,999

There is always a demand to display high-definition images responsively in accordance with user requests. For example, to responsively realize image display that offers freedom with respect to the user's viewpoint, such as enlarging a region of the displayed whole image that the user wants to focus on or moving to another region, large image data must be processed in a short time while still allowing random access. Further technical progress is therefore required.

The present invention has been made in view of these problems, and an object thereof is to provide an information processing technique capable of outputting three-dimensional data such as moving images responsively in answer to various requests.

One aspect of the present invention relates to a data compression apparatus. This data compression apparatus comprises: a data dividing unit that divides a data sequence in a three-dimensional space, which is the compression target, in the three-dimensional directions to form coding units; and a compression encoding unit that generates, for each coding unit formed by the data dividing unit, a palette that holds two of the data values as representative values and an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linearly interpolating the representative values, and that uses the palette and the index as compressed data.
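The palette-and-index scheme described above can be illustrated with a minimal sketch. It is not the patented implementation: the function names `split_units` and `compress_unit`, the choice of the minimum and maximum as the two representative values, and the use of four levels (two representative values plus two intermediate values) are all assumptions made for illustration. The text only states that two values are kept and that the index selects among them and their linear interpolations.

```python
def split_units(frames, bt, by, bx):
    """Yield the values of each bt x by x bx block of frames[t][y][x].

    Hypothetical helper: divides the data in the time, vertical and
    horizontal directions to form coding units.
    """
    T, H, W = len(frames), len(frames[0]), len(frames[0][0])
    for t0 in range(0, T, bt):
        for y0 in range(0, H, by):
            for x0 in range(0, W, bx):
                yield [frames[t][y][x]
                       for t in range(t0, min(t0 + bt, T))
                       for y in range(y0, min(y0 + by, H))
                       for x in range(x0, min(x0 + bx, W))]


def compress_unit(values, levels=4):
    """Return (palette, index) for one coding unit.

    Assumption: the representative values are the unit's min and max,
    and `levels` counts the representatives plus the intermediate values.
    """
    lo, hi = min(values), max(values)
    palette = (lo, hi)
    if hi == lo:
        # Flat block: every element maps to the first representative.
        return palette, [0] * len(values)
    step = (hi - lo) / (levels - 1)
    # Each element is replaced by the number of the nearest level.
    index = [min(levels - 1, max(0, round((v - lo) / step))) for v in values]
    return palette, index


frames = [[[0, 1], [2, 3]], [[4, 5], [6, 7]]]  # 2 frames of 2 x 2 values
units = list(split_units(frames, bt=2, by=2, bx=1))
compressed = [compress_unit(u) for u in units]
```

With four levels each element needs only a 2-bit index, so the cost per coding unit is two stored values plus two bits per element, regardless of the original bit depth.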

Another aspect of the present invention relates to a data decoding apparatus. This moving image data decoding apparatus comprises: a compressed data reading unit that reads compressed data from a storage device, the compressed data associating, for each coding unit formed by dividing a data sequence in a three-dimensional space in the three-dimensional directions, a palette that holds two of the pixel values as representative values with an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linearly interpolating the representative values; and a decoding unit that generates the intermediate values by linearly interpolating the representative values held by the palette, determines each data item in each coding unit to be one of the representative values and the intermediate values in accordance with the information held by the index, and then reconstructs the original data sequence based on the arrangement of the coding units.
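The decoding side can be sketched in the same spirit. The function name `decode_unit` and the four-level assumption (two representative values plus two intermediate values) are hypothetical. The text only specifies that intermediate values are obtained by linear interpolation of the two palette values and selected through the index.

```python
def decode_unit(palette, index, levels=4):
    """Rebuild the values of one coding unit from its palette and index.

    Assumption: `levels` evenly spaced values span the two representatives,
    with level 0 and level `levels - 1` being the representatives themselves.
    """
    lo, hi = palette
    # Linear interpolation between the two representative values.
    table = [lo + (hi - lo) * i / (levels - 1) for i in range(levels)]
    return [table[i] for i in index]


restored = decode_unit((10, 100), [0, 1, 2, 3])
```

Because each coding unit is decoded from its own palette and index alone, any unit can be decoded without touching its neighbours, which is what makes random access in the image plane and along the time axis cheap. Placing the decoded units back at their positions in the original sequence is omitted here.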

Yet another aspect of the present invention relates to a data compression method. This data compression method comprises the steps of: reading a data sequence in a three-dimensional space, which is the compression target, from a storage device; dividing the data sequence in the three-dimensional directions to form coding units; and generating, for each coding unit, a palette that holds two of the data values as representative values and an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linearly interpolating the representative values, and storing the palette and the index in a storage device as compressed data.

Yet another aspect of the present invention relates to a data decoding method. This data decoding method comprises the steps of: reading compressed data from a storage device, the compressed data associating, for each coding unit formed by dividing a data sequence in a three-dimensional space in the three-dimensional directions, a palette that holds two of the pixel values as representative values with an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linearly interpolating the representative values; generating the intermediate values by linearly interpolating the representative values held by the palette, determining each data item in each coding unit to be one of the representative values and the intermediate values in accordance with the information held by the index, and then reconstructing the original data sequence based on the arrangement of the coding units; and outputting the generated data sequence to an output device.

Yet another aspect of the present invention relates to a data structure of a compressed moving image file. In this data structure, for an image frame sequence constituting a moving image, a Y image sequence whose pixel values are luminance Y, a Cb image sequence whose pixel values are color difference Cb, and a Cr image sequence whose pixel values are color difference Cr are each divided in space and time to form coding units. For each coding unit, a palette that holds two of the pixel values as representative values and an index that holds, for each pixel, information designating one of the representative values and a plurality of intermediate values determined by linearly interpolating the representative values are associated with each other and arranged in correspondence with the image regions of the image frames.

Any combination of the above components, and any conversion of the expression of the present invention among a method, an apparatus, a system, a computer program, and the like, are also effective as aspects of the present invention.

According to the present invention, three-dimensional data can be output with random access and high throughput.

FIG. 1 is a diagram showing the use environment of an image processing system to which the present embodiment can be applied.
FIG. 2 is a diagram showing an example of the external configuration of an input device applicable to the image processing system of FIG. 1.
FIG. 3 is a diagram conceptually showing hierarchical data of a moving image to be processed in the present embodiment.
FIG. 4 is a diagram showing the configuration of the image processing apparatus in the present embodiment.
FIG. 5 is a diagram showing in detail the configuration of a control unit having a function of displaying a moving image using moving image data having a hierarchical structure in the present embodiment.
FIG. 6 is a diagram showing an example of the structure of moving image data to be processed in the present embodiment.
FIG. 7 is a diagram showing an example of the structure of moving image data to be processed in the present embodiment.
FIG. 8 is a diagram showing an example of the structure of moving image data to be processed in the present embodiment.
FIG. 9 is a diagram showing an example of the structure of moving image data to be processed in the present embodiment.
FIG. 10 is a diagram schematically showing the data structure of a moving image when the moving image streams of some layers are substituted with the moving image stream of another layer in the present embodiment.
FIG. 11 is a diagram showing in detail the configuration of a control unit having a moving image data compression function and of a hard disk drive in the present embodiment.
FIG. 12 is a diagram schematically showing the moving image stream compression procedure performed by an image processing apparatus including the control unit shown in FIG. 11.
FIG. 13 is a diagram schematically showing a method of generating palette and index data from a coding unit of a Y image sequence in the present embodiment.
FIG. 14 is a diagram schematically showing a method of generating palette and index data from a coding unit of a CbCr image sequence in the present embodiment.
FIG. 15 is a diagram showing variations of the patterns for dividing one processing unit in the present embodiment.
FIG. 16 is a diagram showing an example of the data structure of a division pattern map in the present embodiment.
FIG. 17 is a diagram for explaining the arrangement of compressed data in the compressed data storage unit of the present embodiment.
FIG. 18 is a diagram schematically showing the transition of data when the compression encoding process is applied to an entire moving image stream in the present embodiment.
FIG. 19 is a diagram for explaining a method of embedding the identification number of a division pattern in two palettes in the present embodiment.

In the present embodiment, the display area of a displayed moving image can be moved in response to a viewpoint movement request from the user. Viewpoint movement here includes moving the viewpoint closer to or farther from the image plane, in response to which the moving image is enlarged or reduced while it is being played back. In the present embodiment, therefore, the moving image data to be processed has a hierarchical structure in which a plurality of moving image streams, each consisting of an image frame sequence representing the same moving image at a different resolution, are layered in order of resolution. In response to a request to move the viewpoint in the near-far direction, the moving image stream used for display is switched to a different layer, so that enlarged and reduced display is performed quickly. Hereinafter, moving image data having such a hierarchical structure is also referred to as "hierarchical data".

First, a basic display mode for such hierarchical data will be described. FIG. 1 shows the use environment of an image processing system 1 to which the present embodiment can be applied. The image processing system 1 includes an image processing apparatus 10 that executes image processing software and a display device 12 that outputs the processing results of the image processing apparatus 10. The display device 12 may be a television having a display that outputs images and a speaker that outputs sound.

The display device 12 may be connected to the image processing apparatus 10 by a wired cable, or may be connected wirelessly by a wireless LAN (Local Area Network) or the like. In the image processing system 1, the image processing apparatus 10 may connect to an external network such as the Internet via a cable 14 and acquire hierarchical data by downloading it. The image processing apparatus 10 may also connect to the external network by wireless communication.

The image processing apparatus 10 may be, for example, a game device or a personal computer, and may realize its image processing function by loading an application program for image processing. In response to a viewpoint movement request from the user, the image processing apparatus 10 enlarges or reduces the moving image displayed on the display of the display device 12, scrolls it in the up, down, left, and right directions, and so on. Hereinafter, such changes to the display area, including enlargement and reduction, are referred to as "movement of the display area". When the user operates the input device while viewing the image displayed on the display, the input device transmits a display area movement request signal to the image processing apparatus 10.

FIG. 2 shows an example of the external configuration of the input device 20. As operation means that the user can operate, the input device 20 includes a cross key 21, analog sticks 27a and 27b, and four types of operation buttons 26. The four types of operation buttons 26 consist of a circle button 22, a cross button 23, a square button 24, and a triangle button 25.

In the image processing system 1, functions for inputting requests to enlarge or reduce the display image and requests to scroll up, down, left, and right are assigned to the operation means of the input device 20. For example, the function for inputting display image enlargement and reduction requests is assigned to the right analog stick 27b. The user can input a request to reduce the display image by pulling the analog stick 27b toward themselves, and can input a request to enlarge the display image by pushing it away from themselves.

The function for inputting scroll requests is assigned to the cross key 21. By pressing the cross key 21, the user can input a request to scroll in the direction in which the cross key 21 is pressed. The function for inputting display area movement requests may be assigned to other operation means. For example, the function for inputting scroll requests may be assigned to the analog stick 27a.

The input device 20 has a function of transmitting the input display area movement request signal to the image processing apparatus 10, and in the present embodiment is configured to communicate wirelessly with the image processing apparatus 10. The input device 20 and the image processing apparatus 10 may establish a wireless connection using the Bluetooth (registered trademark) protocol, the IEEE802.11 protocol, or the like. The input device 20 may instead be connected to the image processing apparatus 10 via a cable and transmit the display area movement request signal over that cable.

FIG. 3 conceptually shows the hierarchical data of a moving image to be processed in the present embodiment. The hierarchical data has a hierarchical structure consisting of a 0th layer 30, a first layer 32, a second layer 34, and a third layer 36 along the z axis, which runs from the top to the bottom of the figure. Although only four layers are shown in the figure, the number of layers is not limited to four. As described above, each layer consists of moving image data representing the same moving image at a different resolution, that is, data in which a plurality of image frames are arranged in time-series order. In the figure each layer is represented symbolically by four image frames, but the number of image frames naturally depends on the playback time and frame rate of the moving image.

As will be described later, the present embodiment offers excellent random access to the three-dimensional space formed by the image plane and the time axis of moving image data. Therefore, by regarding the time axis as "depth", for example, three-dimensional volume data may be processed instead of moving image data. Likewise, the type of parameter is not particularly limited as long as the data can have redundancy in the three-dimensional directions.

The hierarchical data has, for example, a quadtree hierarchical structure. When the image frames constituting each layer are divided into "tile images" of the same size, the 0th layer 30 consists of one tile image, the first layer 32 of 2 × 2 tile images, the second layer 34 of 4 × 4 tile images, the third layer 36 of 8 × 8 tile images, and so on. In this case, the resolution of the Nth layer (N is an integer of 0 or more) is 1/2 of the resolution of the (N+1)th layer in both the left-right (x-axis) direction and the up-down (y-axis) direction on the image plane. The hierarchical data can be generated, for example, by reducing the image frames in multiple stages starting from the moving image of the third layer 36, which has the highest resolution.
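As a rough illustration of the quadtree layout described above, the following sketch computes the tile grid and frame size of each layer. The function name `layer_geometry` and the 256-pixel tile size are assumptions for the example only. The text does not specify a tile size.

```python
def layer_geometry(n, tile_size=256):
    """Tile grid and frame size for layer n of a quadtree hierarchy.

    tile_size is a hypothetical value chosen only for illustration.
    """
    tiles_per_side = 2 ** n  # the tile count doubles per axis with each layer
    return {
        "tiles_per_side": tiles_per_side,
        "tiles": tiles_per_side ** 2,
        "frame_side_px": tiles_per_side * tile_size,
    }


for n in range(4):
    print(n, layer_geometry(n))
```

Under these assumptions layer N has half the per-axis resolution of layer N+1, matching the relationship stated in the text.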

As shown in FIG. 3, the viewpoint coordinates during moving image display and the corresponding display area can be expressed in a virtual three-dimensional space consisting of an x axis representing the left-right direction of the image, a y axis representing the up-down direction, and a z axis representing the resolution. As described above, in the present embodiment each layer is prepared as moving image data consisting of a sequence of image frames, so the image actually displayed also depends on the time elapsed since the start of playback. In the figure, a time axis t is therefore shown for each layer.

The image processing apparatus 10 basically draws the image frames of one of the layers sequentially along the time axis t at a predetermined frame rate. For example, it displays the moving image at the resolution of the 0th layer 30 as a reference image. When a display area movement request signal is supplied from the input device 20 during this process, the apparatus derives the amount of change of the display image from the signal and uses that amount to derive the coordinates of the four corners of the next frame in the virtual space (frame coordinates). It then draws the image frame corresponding to those frame coordinates. By providing layer switching boundaries along the z axis, the layer of moving image data used for frame drawing is switched as appropriate according to the z value of the frame coordinates.

Instead of frame coordinates in the virtual space, the image processing apparatus 10 may derive information specifying a layer together with texture coordinates (UV coordinates) within that layer. Hereinafter, the combination of layer specifying information and texture coordinates is also referred to as frame coordinates.

In the image processing apparatus 10, the hierarchical data is held in a storage device in a state compressed in a predetermined compression format. The data required for drawing a frame is read from the storage device and decoded. Note that FIG. 3 represents the hierarchical data conceptually and does not limit the storage order or format of the data stored in the storage device. For example, as long as positions of the hierarchical data in the virtual space are associated with the storage areas of the actual moving image data, the moving image data can be stored in any area. In addition, as will be described later, the image frame sequence constituting each layer may be divided in space and in time and compression-encoded in those units.

FIG. 4 shows the configuration of the image processing apparatus 10. The image processing apparatus 10 includes a wireless interface 40, a switch 42, a display processing unit 44, a hard disk drive 50, a recording medium mounting unit 52, a disk drive 54, a main memory 60, a buffer memory 70, and a control unit 100. The display processing unit 44 has a frame memory that buffers the data to be displayed on the display of the display device 12.

The switch 42 is an Ethernet switch (Ethernet is a registered trademark), a device that connects to external equipment by wire or wirelessly to transmit and receive data. The switch 42 connects to an external network via the cable 14 and is configured to receive hierarchical data from an image server. The switch 42 is also connected to the wireless interface 40, and the wireless interface 40 connects to the input device 20 using a predetermined wireless communication protocol. A display area movement request signal input by the user on the input device 20 is supplied to the control unit 100 via the wireless interface 40 and the switch 42.

The hard disk drive 50 functions as a storage device that stores data. The hierarchical data may be stored in the hard disk drive 50. When a removable recording medium such as a memory card is mounted, the recording medium mounting unit 52 reads data from the removable recording medium. When a read-only ROM disk is loaded, the disk drive 54 drives and recognizes the ROM disk and reads data from it. The ROM disk may be an optical disk, a magneto-optical disk, or the like. The hierarchical data may be stored on these recording media.

The control unit 100 includes a multi-core CPU that has one general-purpose processor core and a plurality of simple processor cores in a single CPU. The general-purpose processor core is called a PPU (Power Processing Unit), and the remaining processor cores are called SPUs (Synergistic-Processing Units). The control unit 100 may further include a GPU (Graphics Processing Unit).

The control unit 100 includes a memory controller connected to the main memory 60 and the buffer memory 70. The PPU has registers and a main processor as the entity that executes operations, and efficiently assigns tasks, which are the basic processing units of the application being executed, to the SPUs. The PPU itself may also execute tasks. Each SPU has registers, a sub-processor as the entity that executes operations, and a local memory as a local storage area. The local memory may be used as the buffer memory 70.

The main memory 60 and the buffer memory 70 are storage devices configured as RAM (random access memory). Each SPU has a dedicated DMA (Direct Memory Access) controller as a control unit, which enables high-speed data transfer between the main memory 60 and the buffer memory 70 and between the frame memory in the display processing unit 44 and the buffer memory 70. The control unit 100 of the present embodiment realizes a high-speed image processing function by operating a plurality of SPUs in parallel. The display processing unit 44 is connected to the display device 12 and outputs image processing results in accordance with requests from the user.

To enlarge, reduce, and scroll the display image smoothly, the image processing apparatus 10 successively loads moving image data that is spatially and temporally close to the currently displayed frame from the hard disk drive 50 into the main memory 60. It also decodes part of the moving image data loaded into the main memory 60 and stores it in the buffer memory 70. This allows the display area to be moved smoothly while moving image playback proceeds. The data to be loaded or decoded may be determined by reading ahead the areas that will be needed next, based on the direction in which the display area has moved so far.

In the hierarchical data shown in FIG. 3, the position in the z-axis direction indicates the resolution: the closer a position is to the 0th layer 30, the lower the resolution, and the closer it is to the third layer 36, the higher the resolution. In terms of the size of the image shown on the display, the position in the z-axis direction corresponds to the scale ratio. If the scale ratio of the display image at the third layer 36 is 1, the scale ratio is 1/4 at the second layer 34, 1/16 at the first layer 32, and 1/64 at the 0th layer 30.
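The scale ratios listed above follow a simple power-of-four progression, which can be checked with the short sketch below. The function name `scale_ratio` is hypothetical. The factor of 4 per layer is presumably an area ratio, given that each layer halves the resolution along both axes, but that reading is an inference rather than something the text states.

```python
from fractions import Fraction


def scale_ratio(layer, top_layer=3):
    """Scale ratio of `layer` relative to the highest-resolution layer.

    Assumes each step down the hierarchy divides the ratio by 4.
    """
    return Fraction(1, 4 ** (top_layer - layer))


ratios = {layer: scale_ratio(layer) for layer in range(4)}
```

Using `Fraction` keeps the values exact, so they can be compared directly with the 1, 1/4, 1/16, and 1/64 given in the text.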

Accordingly, when the display image changes along the z axis in the direction from the 0th layer 30 toward the third layer 36, the display image is enlarged, and when it changes in the direction from the third layer 36 toward the 0th layer 30, the display image is reduced. For example, when the scale ratio of the display image is close to that of the second layer 34, the display image is created using the image data of the second layer 34.

Specifically, as described above, a switching boundary is provided, for example, at a scale ratio midway between adjacent layers. For example, when the scale ratio of the image to be displayed lies between the switching boundary between the first layer 32 and the second layer 34 and the switching boundary between the second layer 34 and the third layer 36, the frame is drawn using the image data of the second layer 34. At a scale ratio between the switching boundary between the first layer 32 and the second layer 34 and the scale ratio of the second layer 34, the image frame of the second layer 34 is reduced for display. At a scale ratio between the scale ratio of the second layer 34 and the switching boundary between the second layer 34 and the third layer 36, the image frame of the second layer 34 is enlarged for display.
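
The layer selection described above can be sketched as follows. This is only an illustration: the text says the switching boundary lies at an intermediate scale ratio, and the geometric mean used here is an assumption, as are the function names.

```python
import math

# Scale ratio of layers 0..3 relative to layer 3, as given in the description.
LAYER_SCALES = [1 / 64, 1 / 16, 1 / 4, 1.0]

def switching_boundaries(scales):
    # Assumption: each boundary is the geometric mean of two adjacent layer scales.
    return [math.sqrt(a * b) for a, b in zip(scales, scales[1:])]

def select_layer(requested_scale, scales=LAYER_SCALES):
    """Return the index of the layer whose data is used at this scale ratio."""
    layer = 0
    for boundary in switching_boundaries(scales):
        if requested_scale >= boundary:
            layer += 1
    return layer

def resize_factor(requested_scale, scales=LAYER_SCALES):
    """Return (layer, factor); factor < 1 reduces and factor > 1 enlarges the layer image."""
    layer = select_layer(requested_scale, scales)
    return layer, requested_scale / scales[layer]
```

With these assumed boundaries, a requested scale ratio of 0.2 uses the second layer reduced by a factor of 0.8, and 0.3 uses the same layer enlarged by a factor of 1.2.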

On the other hand, when a region predicted to be needed in the future is identified from the display area movement request signal and decoded, the scale ratio of each layer, for example, is set as a prefetch boundary. For example, when the scale ratio requested by the display area movement request signal crosses the scale ratio of the second layer 34, at least part of the image data of the first layer 32, which lies in the reduction direction, is prefetched from the hard disk drive 50 or the main memory 60, decoded, and written to the buffer memory 70.

画像の上下左右方向の先読み処理についても同様である。具体的には、バッファメモリ７０に展開されている画像データに先読み境界を設定しておき、画像変更要求信号による表示位置が先読み境界をまたいだときに、先読み処理が開始されるようにする。このようにすることで、ユーザの表示領域移動の要求に応じ、円滑に解像度および表示位置を変化させつつ動画再生も進んでいく態様を実現できる。 The same applies to the prefetch processing in the vertical and horizontal directions of the image. Specifically, a prefetch boundary is set for the image data developed in the buffer memory 70, and the prefetch process is started when the display position by the image change request signal crosses the prefetch boundary. By doing so, it is possible to realize a mode in which the moving image reproduction proceeds while smoothly changing the resolution and the display position in response to the user's request for moving the display area.
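
A minimal sketch of the prefetch-boundary test in the image plane is given below. The `Rect` type, the function name, and the idea of placing the boundary a fixed `margin` inside the buffered region are assumptions made for illustration; the description only states that a boundary is set on the data held in the buffer memory 70.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Rect:
    left: int
    top: int
    right: int
    bottom: int

def crossed_boundaries(display, buffered, margin):
    """Return the directions in which the display area has crossed the prefetch boundary.

    The boundary is assumed to lie `margin` pixels inside each edge of the buffered region.
    """
    crossed = set()
    if display.left < buffered.left + margin:
        crossed.add("left")
    if display.right > buffered.right - margin:
        crossed.add("right")
    if display.top < buffered.top + margin:
        crossed.add("top")
    if display.bottom > buffered.bottom - margin:
        crossed.add("bottom")
    return crossed
```

A non-empty result would be the trigger for starting prefetch processing in the returned directions.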

FIG. 5 shows in detail the configuration of the control unit 100a, which in the present embodiment has a function of displaying a moving image using moving image data having a hierarchical structure. The control unit 100a includes an input information acquisition unit 102 that acquires information input by the user from the input device 20, a frame coordinate determination unit 110 that determines the frame coordinates of the region to be newly displayed, a load stream determination unit 106 that determines the compressed data of the moving image stream to be newly loaded, and a load unit 108 that loads the necessary moving image stream from the hard disk drive 50. The control unit 100a further includes a decoding unit 112 that decodes the compressed data of the moving image stream, and a display image processing unit 114 that draws image frames.

In FIG. 5 and in FIG. 10 described later, each element described as a functional block that performs various processes can be configured, in terms of hardware, by a CPU (Central Processing Unit), memory, and other LSIs, and is realized, in terms of software, by a program loaded into memory or the like. As already described, the control unit 100 has one PPU and a plurality of SPUs, and the PPU and the SPUs can form each functional block either individually or in cooperation. It will therefore be understood by those skilled in the art that these functional blocks can be realized in various forms by hardware only, software only, or a combination thereof, and they are not limited to any one of these.

The input information acquisition unit 102 acquires the content of requests that the user inputs to the input device 20, such as starting or ending moving image reproduction and moving the display area, and notifies the frame coordinate determination unit 110 of them. The frame coordinate determination unit 110 determines the frame coordinates of the region to be newly displayed according to the frame coordinates of the current display area and the display area movement request signal input by the user, and notifies the load stream determination unit 106, the decoding unit 112, and the display image processing unit 114 of them.

Based on the frame coordinates notified from the frame coordinate determination unit 110, the load stream determination unit 106 identifies the compressed moving image data to be newly loaded from the hard disk drive 50 into the main memory 60 and issues a load request to the load unit 108. As will be described later, the hierarchical data of the present embodiment holds a separate moving image stream for each tile image sequence obtained by spatially dividing the frame image sequence of each layer into tiles of the same size.

そのため、縮尺率とその表示に用いる階層との対応関係以外に、各階層における空間座標と、その座標に対応する画像データを含む動画ストリームの識別情報およびその格納領域とをあらかじめ対応づけておく。ロードストリーム決定部１０６はその情報を元に、必要な動画ストリームの識別情報を取得する。そして該当する動画ストリームの圧縮データがロード済みでなければ、ロード部１０８にロード要求を発行する。また、フレーム座標が変化しない場合であっても、動画の進捗に応じて逐次、必要な動画ストリームの圧縮データがロードされるように要求する。 Therefore, in addition to the correspondence between the scale ratio and the hierarchy used for display, the spatial coordinates in each hierarchy, the identification information of the moving picture stream including the image data corresponding to the coordinates, and the storage area thereof are associated in advance. Based on the information, the load stream determination unit 106 acquires necessary moving image stream identification information. If the compressed data of the corresponding video stream has not been loaded, a load request is issued to the load unit 108. Further, even if the frame coordinates do not change, it is requested that the compressed data of the necessary moving image stream is sequentially loaded according to the progress of the moving image.
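
The correspondence table and the load decision can be sketched as below. The table layout, the tile size, and all names here are hypothetical; the description only states that coordinates in each layer are associated in advance with stream identification information and a storage area.

```python
TILE = 256  # hypothetical tile size in pixels

def tiles_covering(x0, y0, x1, y1, tile=TILE):
    """Tile indices (tx, ty) overlapped by the half-open pixel rectangle [x0, x1) x [y0, y1)."""
    return [(tx, ty)
            for ty in range(y0 // tile, (y1 - 1) // tile + 1)
            for tx in range(x0 // tile, (x1 - 1) // tile + 1)]

def streams_to_load(layer, rect, stream_table, loaded):
    """Return IDs of streams needed for `rect` in `layer` that are not yet in main memory.

    stream_table maps (layer, tx, ty) -> (stream_id, storage_offset); both fields are
    illustrative stand-ins for the identification information and storage area.
    """
    needed = [stream_table[(layer, tx, ty)][0] for tx, ty in tiles_covering(*rect)]
    return [stream_id for stream_id in needed if stream_id not in loaded]
```

Only the streams returned by `streams_to_load` would be passed on in a load request; streams already loaded are skipped.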

In addition to the moving image streams needed to draw the frame at that time, the load stream determination unit 106 may identify moving image streams predicted to be needed later, for example by the prefetch processing described above, and issue a load request to the load unit 108. The load stream determination unit 106 may issue load requests at a predetermined timing while the load unit 108 is not performing load processing, for example at predetermined time intervals or when the user inputs a display area movement request. The load unit 108 performs load processing from the hard disk drive 50 in accordance with requests from the load stream determination unit 106. Specifically, it identifies the storage area from the identification information of the moving image stream to be loaded and stores the data read from that storage area in the main memory 60.

デコード部１１２は各時刻のフレーム座標に基づき、メインメモリ６０から必要な動画ストリームのデータを読み出しデコードし、バッファメモリ７０に逐次格納していく。デコード対象は動画ストリーム単位でよく、フレーム座標決定部１１０が決定したフレーム座標の領域が複数の動画ストリームにまたがる場合は当該複数の動画ストリームをデコードしていく。表示画像処理部１１４は、各時刻のフレーム座標に基づきバッファメモリ７０から対応する画像フレームのデータを読み出し、表示処理部４４のフレームメモリに描画していく。 Based on the frame coordinates at each time, the decoding unit 112 reads and decodes the necessary moving picture stream data from the main memory 60 and sequentially stores the data in the buffer memory 70. The decoding target may be a moving image stream unit, and when the frame coordinate area determined by the frame coordinate determining unit 110 extends over a plurality of moving image streams, the plurality of moving image streams are decoded. The display image processing unit 114 reads the data of the corresponding image frame from the buffer memory 70 based on the frame coordinates at each time, and draws it in the frame memory of the display processing unit 44.

In a mode that allows the display area to be moved, including enlargement and reduction, during reproduction of a single moving image, it is desirable that all layers share a time axis and that frame drawing progress seamlessly regardless of whether the layer of moving image data in use has been switched. For this reason, as described above, the hierarchical data is generated with the image frames organized into moving image streams in units of tile images. Because the region needed for a given display and the data predicted to be needed afterward can then be loaded and decoded preferentially, the processing required before frame drawing becomes more efficient. It is also desirable to prepare the data so that random access is possible in the time direction as well.

Since the moving image data processed in the present embodiment has four-dimensional parameters, namely three-dimensional frame coordinates including the scale ratio direction, plus time, the unit in which moving image streams are generated and the overall configuration can be changed as appropriate according to the compression method, the content of the moving image, and so on. FIGS. 6 to 9 show examples of the structure of the moving image data processed in the present embodiment.

これらの図において三角形は動画の階層データを表し、直方体は１つの動画ストリームを表している。また各階層データは第０階層、第１階層、第２階層の３階層からなるが、階層の数をそれに限る趣旨ではない。上述のとおり１つの動画ストリームは各階層の画像フレームを同じサイズに分割してなるタイル画像ごとに生成され、これらの例では第０階層の画像のサイズをタイル画像のサイズとしている。 In these figures, a triangle represents moving image hierarchical data, and a rectangular parallelepiped represents one moving image stream. Each hierarchical data consists of three hierarchies of the zeroth hierarchy, the first hierarchy, and the second hierarchy, but the number of hierarchies is not limited to that. As described above, one moving image stream is generated for each tile image obtained by dividing the image frame of each layer into the same size. In these examples, the size of the 0th layer image is set as the size of the tile image.

First, the moving image data structure 200 shown in FIG. 6 consists of one set of hierarchical data 201 in which, in each layer, each tile image has one moving image stream running from the start to the end of the moving image. Since the tile images that form the image frames of each moving image stream have the same size as described above, the 0th layer is composed of one moving image stream 202a, the first layer of four moving image streams 202b, the second layer of 16 moving image streams 202c, and so on.

In the moving image data structure 200 of FIG. 6, the length of a moving image stream in the time direction varies with the length of the original moving image, that is, with the number of original image frames. This structure is therefore advantageous when the number of image frames is small to begin with, or when using a compression method that can compress long sequences while still allowing random access, such as MPEG (Moving Picture Experts Group) with every frame coded as an I picture.

The moving image data structure 204 shown in FIG. 7 consists of one set of hierarchical data 205 in which the moving image data is divided every predetermined number of image frames, so that in each layer each tile image has a plurality of moving image streams along the time axis. That is, the moving image streams in FIG. 7 are obtained by dividing each moving image stream shown in FIG. 6 along the time axis, which is the vertical direction in the figure. In this example, each moving image stream of FIG. 6 is divided into six moving image streams. The 0th layer is therefore composed of 1 × 6 moving image streams 206a, the first layer of 4 × 6 moving image streams 206b, the second layer of 16 × 6 moving image streams 206c, and so on. This structure results when a compression method that compresses in units of a fixed number of image frames is used.
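
The stream counts quoted for FIG. 6 and FIG. 7 follow from simple arithmetic, sketched here. The function names and the frame counts in the usage note are illustrative assumptions; the factor of four per layer follows from the tile size being the size of the 0th-layer image in these examples.

```python
import math

def temporal_segments(total_frames, frames_per_stream):
    """Number of streams along the time axis when the video is cut every fixed number of frames."""
    return math.ceil(total_frames / frames_per_stream)

def streams_per_layer(num_layers, segments=1):
    """Stream count per layer: layer k holds 4**k tiles, each with `segments` streams."""
    return [(4 ** k) * segments for k in range(num_layers)]
```

`streams_per_layer(3)` gives the 1, 4, 16 of FIG. 6, and with a hypothetical 48-frame video cut every 8 frames, `streams_per_layer(3, temporal_segments(48, 8))` gives the 1 × 6, 4 × 6, 16 × 6 of FIG. 7.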

図８に示す動画データ構造２０８は、動画データを所定の画像フレーム数で区切り、その単位で生成した動画ストリームごとに別の階層データ２１０ａ、２１０ｂ、２１０ｃを生成した構成を有する。すなわち各階層データ２１０ａ、２１０ｂ、２１０ｃは、図６と同様に、階層ごとに時間軸方向に１つの動画ストリームで構成されるが、各動画ストリームは固定数の画像フレームを有する。例えば階層データ２１０ａは、第０階層が１個の動画ストリーム２１２ａ、第１階層が４個の動画ストリーム２１２ｂ、第２階層が１６個の動画ストリーム２１２ｃで構成されている。 The moving image data structure 208 shown in FIG. 8 has a configuration in which moving image data is divided by a predetermined number of image frames, and different hierarchical data 210a, 210b, and 210c are generated for each moving image stream generated in that unit. That is, each hierarchical data 210a, 210b, 210c is composed of one moving image stream in the time axis direction for each layer as in FIG. 6, but each moving image stream has a fixed number of image frames. For example, in the hierarchical data 210a, the 0th layer is composed of one moving image stream 212a, the first layer is composed of 4 moving image streams 212b, and the second layer is composed of 16 moving image streams 212c.

In the moving image data structure 208 of FIG. 8, the data consists of a plurality of sets of hierarchical data along the time axis, so moving image editing in the time direction is easy: for example, only a certain scene can be replaced with other hierarchical data, or hierarchical data can be inserted or deleted. In addition, since the number of image frames in each moving image stream is fixed, the data size is easy to estimate. For example, when the compression method described later is applied, the data of each moving image stream can be given the same structure as the tile image data obtained when a still image is organized into a similar hierarchical structure. This makes it easy to reuse a still image display mechanism for moving image display, and to mix in still images, for example by making part of the area a still image.

The moving image data structure 214 shown in FIG. 9 has a configuration in which the moving image data is divided every predetermined number of image frames, and the moving image streams generated in those units are further grouped a predetermined number at a time into separate hierarchical data 216a, 216b, and 216c. That is, each of the hierarchical data 216a, 216b, and 216c is composed, as in FIG. 7, of a plurality of moving image streams along the time axis in each layer, but the number of such streams is fixed regardless of the length of the moving image. In FIG. 9 the number is two, and the hierarchical data is divided accordingly.

例えば階層データ２１６ａは、第０階層が１×２個の動画ストリーム２１８ａ、第１階層は４×２個の動画ストリーム２１８ｂ、第２階層は１６×２個の動画ストリーム２１８ｃで構成されている。この場合も、１つの階層データを構成する各階層のデータサイズの見積もりおよび調整が容易であるとともに、階層データを差し替えることにより時間軸方向での動画編集が容易である。 For example, in the hierarchical data 216a, the 0th layer includes 1 × 2 moving image streams 218a, the first layer includes 4 × 2 moving image streams 218b, and the second layer includes 16 × 2 moving image streams 218c. Also in this case, it is easy to estimate and adjust the data size of each layer constituting one layer data, and it is easy to edit a moving image in the time axis direction by replacing the layer data.

図６から図９に示した動画データ構造は全て、各階層で画像フレームの全領域を網羅するように動画ストリームを保持していたが、動画像が有する冗長性に応じて一部の動画ストリームを動画データから省き、別の階層の動画ストリームで代替するようにしてもよい。図１０は一部の階層の動画ストリームを別の階層の動画ストリームで代替させる場合の動画のデータ構造を模式的に示している。データ構造の表し方は図６と同様である。同図に示す階層データ２２２は、領域２２４に対応する動画ストリームが省かれている。 The moving image data structures shown in FIGS. 6 to 9 all retain the moving image stream so as to cover the entire area of the image frame in each layer. However, some moving image streams depend on the redundancy of the moving image. May be omitted from the moving image data and replaced with a moving image stream of a different level. FIG. 10 schematically shows the data structure of a moving image when a moving image stream of a part of the hierarchy is replaced with a moving image stream of another layer. The way of representing the data structure is the same as in FIG. In the hierarchical data 222 shown in the figure, the moving picture stream corresponding to the area 224 is omitted.

Compared with the hierarchical data 201 shown in FIG. 6, the first layer 228 and the second layer 230 contain fewer moving image streams. The difference corresponds to the omitted moving image streams. In this case, no data exists in that layer for the region represented by an omitted moving image stream. Therefore, when that region is to be displayed at a scale ratio that would normally use data of such a layer, the layers are traced back to one that holds data for the region, the 0th layer 226 in the example of FIG. 10, and the corresponding region is enlarged and drawn.

このような態様は、画像フレーム中に詳細な情報を必要としない領域、例えば空、海、芝生などほぼ単色で構成される領域などが存在する場合に適用できる。このように画像フレームにおける冗長性の有無は、画像解析によって検出できる。例えば各時刻の画像フレームごとに、低解像度側の階層の画像フレームを拡大した画像と高解像度側の画像との差分画像を生成し、差分値が所定のしきい値以下となる領域を検出する。そしてその領域に含まれる動画ストリームのうち、高解像度側の階層の動画ストリームを動画データから除外する。 Such an aspect can be applied to a case where there is a region that does not require detailed information in the image frame, for example, a region composed of almost a single color such as sky, sea, or lawn. Thus, the presence or absence of redundancy in the image frame can be detected by image analysis. For example, for each image frame at each time, a difference image between an image obtained by enlarging an image frame on the low resolution side and an image on the high resolution side is generated, and an area where the difference value is equal to or less than a predetermined threshold is detected. . Then, among the video streams included in the area, the high-resolution layer video stream is excluded from the video data.
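
One way to picture this redundancy check is the sketch below, which works on plain 2-D lists of pixel values. Nearest-neighbour enlargement, the maximum absolute difference as the per-tile measure, and all names are assumptions; the description only requires a difference image and a threshold.

```python
def upscale2x(img):
    """Nearest-neighbour 2x enlargement of a 2-D list of pixel values."""
    out = []
    for row in img:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out

def redundant_tiles(low, high, tile, threshold):
    """Return (tx, ty) of high-resolution tiles whose difference from the enlarged
    low-resolution image never exceeds `threshold`."""
    up = upscale2x(low)
    h, w = len(high), len(high[0])
    result = []
    for ty in range(0, h, tile):
        for tx in range(0, w, tile):
            max_diff = max(abs(high[y][x] - up[y][x])
                           for y in range(ty, min(ty + tile, h))
                           for x in range(tx, min(tx + tile, w)))
            if max_diff <= threshold:
                result.append((tx // tile, ty // tile))
    return result
```

Tiles returned by `redundant_tiles` are candidates for exclusion from the higher-resolution layer, because the enlarged lower layer already reproduces them within the threshold.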

このようにすることで動画データのサイズを小さく抑えることができるとともに、動画ストリームのロード処理の一部を省略することができる。このような場合、前述の階層データが定める３次元座標と動画ストリームとを対応づけた情報において、除外した動画ストリームに対応する領域の座標に対し、拡大して用いる上の階層の動画ストリームの識別情報を対応づけ、さらに拡大倍率などの情報を付加することによって描画が可能となる。 By doing so, the size of the moving image data can be kept small, and a part of the loading processing of the moving image stream can be omitted. In such a case, in the information associating the three-dimensional coordinates determined by the hierarchical data and the moving image stream, identification of the moving image stream of the upper layer used in an enlarged manner with respect to the coordinates of the area corresponding to the excluded moving image stream Drawing is possible by associating information and adding information such as an enlargement ratio.
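
The substitution of a lower-resolution stream for an omitted one can be sketched as a walk toward the 0th layer. The set-based table and the function name are hypothetical, and the factor of two per layer assumes, as in FIG. 6, that each layer doubles the tile count in each direction.

```python
def resolve_stream(layer, tx, ty, available):
    """Find the stream that actually holds data for tile (tx, ty) of `layer`.

    `available` is a set of (layer, tx, ty) keys for streams present in the data.
    Returns the key of the stream to use and the linear magnification to apply.
    """
    steps = 0
    while (layer, tx, ty) not in available:
        if layer == 0:
            raise KeyError("no layer holds data for this region")
        # Move to the parent tile in the next lower-resolution layer.
        layer, tx, ty = layer - 1, tx // 2, ty // 2
        steps += 1
    return (layer, tx, ty), 2 ** steps
```

In practice the result of this walk could be precomputed and stored in the correspondence table together with the magnification, which is what the paragraph above describes.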

図１０の例は、本実施の形態が、動画データを階層構造にする特徴と、フレーム画像を空間分割し、個別に動画ストリームを生成する、という特徴を併せ持つことによって成り立つ態様である。すなわちフレーム画像をタイル画像に分割することによって、局所的にデータの保持形式を異ならせることができ、さらに解像度の低い階層のデータを代替利用することができるため、一部のデータを省略してデータサイズを抑えることができる。同様の発想で、一部の動画ストリームのみ、構成する画像フレームを間引いてその数を減らし、データサイズを抑えてもよい。 The example of FIG. 10 is an aspect realized by the present embodiment having both the feature that the moving image data has a hierarchical structure and the feature that the frame image is spatially divided and the moving image stream is individually generated. In other words, by dividing the frame image into tile images, the data retention format can be locally changed, and data with a lower resolution can be used instead, so some data is omitted. Data size can be reduced. With the same idea, only a part of moving image streams may be thinned out to reduce the number of image frames, thereby reducing the data size.

このようにすると、当該動画ストリームが担当する領域は時間解像度が低下することになるが、背景など時間的に変化の少ない領域が含まれる動画では有効である。このときの時間冗長性も上述の空間冗長性と同様、例えば隣接する画像フレーム同士の差分画像のうち所定のしきい値以下の差分値を有する領域を検出するなどして特定できる。同様に、一部の動画ストリームを静止画像に置き換えることもできる。 In this way, the time resolution of the area handled by the moving picture stream is reduced, but it is effective for moving pictures including areas with little temporal change such as the background. Similar to the above-described spatial redundancy, the temporal redundancy at this time can be specified by, for example, detecting a region having a difference value equal to or less than a predetermined threshold among the difference images between adjacent image frames. Similarly, some moving picture streams can be replaced with still images.

The compression method may also differ from one moving image stream to another. Furthermore, instead of sharing a time axis within the hierarchical data, the time axis may be deliberately shifted in predetermined units, such as per layer, per moving image stream, or per pixel column in the image, to enable a variety of image expressions.

As described above, the hierarchical structure of the moving image data displayed in the present embodiment places no particular restriction on the compression method of the individual moving image streams, and any existing method such as JPEG (Joint Photographic Experts Group), MPEG, or S3TC (S3 Texture Compression) may be applied. However, for movement of the display area, including layer switching, to be seamless, it is desirable that random access be possible both spatially and temporally, and that both image quality and decoding throughput be maintained even for high-definition images.

Next, a method for compressing a moving image stream in units of a fixed number of image frames, applicable to the moving image data structures shown in FIGS. 7 to 9, will be described. This compression method can be applied not only to the plurality of moving image streams constituting hierarchical data but also to a single moving image stream. An apparatus that implements this compression method can also be realized with the same configuration as the image processing apparatus 10 shown in FIG. 4. The following description focuses on the configuration of the control unit 100.

図１１は本実施の形態において、動画データ圧縮機能を有する制御部１００ｂおよびハードディスクドライブ５０の構成を詳細に示している。制御部１００ｂは圧縮対象の動画ストリームを構成する画像フレームの色空間をＹＣｂＣｒへ変換するＹＣｂＣｒ変換部１２０、変換後の画像列を時空間分割して符号化単位を生成する画像分割部１２２、および分割された符号化単位ごとに画像データを量子化することで圧縮符号化処理を行う圧縮符号化部１２４を含む。 FIG. 11 shows in detail the configuration of the control unit 100b having the moving image data compression function and the hard disk drive 50 in the present embodiment. The control unit 100b includes a YCbCr conversion unit 120 that converts a color space of an image frame constituting a moving image stream to be compressed into YCbCr, an image division unit 122 that generates a coding unit by space-time dividing the converted image sequence, and A compression encoding unit 124 that performs compression encoding processing by quantizing the image data for each of the divided encoding units is included.

ハードディスクドライブ５０は、個々の画像フレーム列からなる圧縮対象の動画ストリームを格納した動画ストリーム記憶部１２６、画像分割部１２２が画像列を分割する際の分割パターンを記憶する分割パターン記憶部１２８、および圧縮符号化部１２４が圧縮符号化して生成した圧縮データを格納する圧縮データ記憶部１３０を含む。 The hard disk drive 50 includes a moving image stream storage unit 126 that stores a moving image stream to be compressed including individual image frame sequences, a division pattern storage unit 128 that stores division patterns when the image dividing unit 122 divides the image sequence, and A compressed data storage unit 130 for storing compressed data generated by compression encoding by the compression encoding unit 124 is included.

ＹＣｂＣｒ変換部１２０は、動画ストリーム記憶部１２６から圧縮対象の動画ストリームを構成する画像フレームのデータを順次読み出す。そして各画像フレームの画素値であるＲＧＢ値を輝度Ｙ、色差ＣｂおよびＣｒに変換することにより、それぞれの値を画素値とするＹ画像、Ｃｂ画像、Ｃｒ画像を生成する。ＲＧＢからＹＣｂＣｒへの色空間の変換は既存の手法を適用することができる。１つの画像フレームからＹ画像、Ｃｂ画像、Ｃｒ画像が生成されるため、動画ストリームを構成する複数の画像フレームに対し、Ｙ画像列、Ｃｂ画像列、Ｃｒ画像列が生成されることになる。 The YCbCr conversion unit 120 sequentially reads out the data of the image frames constituting the moving image stream to be compressed from the moving image stream storage unit 126. Then, by converting the RGB value, which is the pixel value of each image frame, into luminance Y and color differences Cb and Cr, a Y image, a Cb image, and a Cr image having the respective values as pixel values are generated. An existing method can be applied to the conversion of the color space from RGB to YCbCr. Since a Y image, a Cb image, and a Cr image are generated from one image frame, a Y image sequence, a Cb image sequence, and a Cr image sequence are generated for a plurality of image frames that form a moving image stream.
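
The description leaves the conversion to existing methods. As one example, the sketch below uses the full-range BT.601 coefficients familiar from JPEG/JFIF; this choice, and the function names, are assumptions rather than something the text specifies.

```python
def rgb_to_ycbcr(r, g, b):
    """Convert one 8-bit RGB pixel to (Y, Cb, Cr) using full-range BT.601 coefficients."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = -0.168736 * r - 0.331264 * g + 0.5 * b + 128
    cr = 0.5 * r - 0.418688 * g - 0.081312 * b + 128
    return y, cb, cr

def to_planes(frame):
    """Split a frame (2-D list of (r, g, b) tuples) into separate Y, Cb and Cr images."""
    converted = [[rgb_to_ycbcr(*px) for px in row] for row in frame]
    y_img = [[px[0] for px in row] for row in converted]
    cb_img = [[px[1] for px in row] for row in converted]
    cr_img = [[px[2] for px in row] for row in converted]
    return y_img, cb_img, cr_img
```

Applying `to_planes` to every frame of a stream yields the Y, Cb, and Cr image sequences mentioned above.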

画像分割部１２２は、ＹＣｂＣｒ変換部１２０が生成したＹ画像列、Ｃｂ画像列、Ｃｒ画像列のうち、まず各Ｃｂ画像およびＣｒ画像を所定の割合で縮小する。そしてＹ画像列、Ｃｂ画像列、Ｃｒ画像列を、分割パターン記憶部１２８に格納された分割パターンで時空間分割する。分割によって生成された単位を「符号化単位」と呼ぶ。 The image dividing unit 122 first reduces each Cb image and Cr image at a predetermined ratio among the Y image sequence, Cb image sequence, and Cr image sequence generated by the YCbCr conversion unit 120. Then, the Y image sequence, the Cb image sequence, and the Cr image sequence are spatiotemporally divided by the division patterns stored in the division pattern storage unit 128. A unit generated by the division is referred to as “coding unit”.

As will be described in detail later, the optimal division pattern depends on the content of the image, so the image dividing unit 122 may select the optimal pattern from a plurality of division patterns stored in the division pattern storage unit 128. In the subsequent processing, the reduced Cb image and Cr image are handled as a pair for each corresponding frame. Hereinafter, such a pair of a Cb image and a Cr image is simply referred to as a "CbCr image".

For each coding unit of the Y image and the CbCr image, the compression encoding unit 124 generates a palette representing two representative values, and an index that specifies, for each pixel, one of the two representative values or of a plurality of intermediate values obtained by linearly interpolating between them. The image data is thereby quantized and compression-encoded. As a result, a palette and an index are generated for each coding unit of the Y image sequence and each coding unit of the CbCr image sequence.
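
The palette-and-index idea can be illustrated for scalar values, such as the luminance values of one coding unit, as follows. Choosing the minimum and maximum as the two representative values, using four levels, and numbering the indices in ascending order are all simplifying assumptions, not the method of the embodiment.

```python
def encode_unit(values, levels=4):
    """Quantize a flat list of pixel values to a 2-entry palette plus one index per pixel."""
    lo, hi = min(values), max(values)  # assumed choice of representative values
    candidates = [lo + (hi - lo) * i / (levels - 1) for i in range(levels)]
    indices = [min(range(levels), key=lambda i: abs(candidates[i] - v)) for v in values]
    return (lo, hi), indices

def decode_unit(palette, indices, levels=4):
    """Rebuild pixel values by interpolating between the two palette entries."""
    lo, hi = palette
    return [lo + (hi - lo) * i / (levels - 1) for i in indices]
```

With four levels each pixel costs two bits of index, and the whole unit shares one pair of representative values, which is where the compression comes from.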

FIG. 12 schematically shows the moving image stream compression procedure performed by the image processing apparatus 10 including the control unit 100b. The moving image stream 250 to be compressed may correspond, for example, to a moving image stream shown as a rectangular parallelepiped in FIGS. 6 to 9. The moving image stream 250 is composed of image frames that are RGB images. In this compression method, the moving image stream 250 is compressed every predetermined number of image frames, every eight frames in the example of FIG. 12.

まずＹＣｂＣｒ変換部１２０は、８フレーム分の画像フレームをさらに所定のサイズに空間分割して画像平面（ｘ，ｙ）および時間軸ｔの３次元空間で処理単位を定める。図の例では８画素×８画素×８フレームのデータを処理単位２５２としている。次に当該処理単位２５２に含まれる８枚のＲＧＢ画像から、８枚のＹ画像列２５４、およびＣｂＣｒ画像列２５６を生成する（Ｓ１０）。 First, the YCbCr converter 120 further divides the image frame for eight frames into a predetermined size and determines a processing unit in a three-dimensional space of the image plane (x, y) and the time axis t. In the illustrated example, data of 8 pixels × 8 pixels × 8 frames is used as a processing unit 252. Next, eight Y image sequences 254 and CbCr image sequences 256 are generated from the eight RGB images included in the processing unit 252 (S10).

Here, as described above, the CbCr image sequence 256 is obtained by reducing the Cb image and Cr image derived directly from the original RGB image to 1/2 size in both the vertical and horizontal directions. The Y image sequence 254 therefore consists of eight image frames of 8 × 8 pixels, and the CbCr image sequence 256 consists of eight frames, each an image in which a 4 × 4 pixel Cb image and a 4 × 4 pixel Cr image are joined.
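
The reduction and joining of the chroma images might look like the sketch below. Averaging each 2 × 2 block and placing the Cb and Cr images side by side are assumptions; the description fixes only the 1/2 reduction in both directions and the fact that the two images are joined.

```python
def halve(img):
    """Reduce a 2-D list with even dimensions to half size by averaging 2x2 blocks."""
    h, w = len(img), len(img[0])
    return [[(img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 4
             for x in range(0, w, 2)]
            for y in range(0, h, 2)]

def make_cbcr(cb, cr):
    """Halve both chroma images and join them row by row (Cb on the left, Cr on the right)."""
    return [cb_row + cr_row for cb_row, cr_row in zip(halve(cb), halve(cr))]
```

For an 8 × 8 processing unit this yields a 4-row image holding a 4 × 4 Cb block next to a 4 × 4 Cr block.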

次に画像分割部１２２は、Ｙ画像列２５４およびＣｂＣｒ画像列２５６を、分割パターン記憶部１２８に格納された分割パターンのうちいずれかのパターンで時空間分割して符号化単位を形成する（Ｓ１２）。同図の例では、Ｙ画像列２５４およびＣｂＣｒ画像列２５６の各画像フレームを横４画素×縦２画素の同じサイズで空間分割して得られた画像ブロックを、時間方向に隣接する２つの画像フレームごとに分割してなる、４画素×２画素×２枚のデータを符号化単位としている。 Next, the image dividing unit 122 performs space-time division on the Y image sequence 254 and the CbCr image sequence 256 with any one of the division patterns stored in the division pattern storage unit 128 to form a coding unit (S12). ). In the example of the figure, an image block obtained by spatially dividing each image frame of the Y image sequence 254 and the CbCr image sequence 256 with the same size of 4 horizontal pixels × 2 vertical pixels is divided into two images adjacent in the time direction. Data of 4 pixels × 2 pixels × 2 pieces divided for each frame is used as an encoding unit.

Since each frame of the Y image sequence 254 is 8 pixels × 8 pixels as described above, each image frame is divided into eight image blocks "A", "B", "C", "D", "E", "F", "G", and "H". The image block "A" of the first frame and the image block "A" of the second frame together form a coding unit 258 (the shaded area). The same applies to the other image blocks and image frames, so that 8 spatial divisions × 4 temporal divisions = 32 coding units are formed for the Y image sequence 254.

In the CbCr image sequence 256, on the other hand, the Cb image and the Cr image are each 4 pixels × 4 pixels, so the former is divided into the two image blocks "I" and "J" and the latter into the two image blocks "K" and "L". The image blocks "I" and "K" of the first frame and the image blocks "I" and "K" of the second frame together form a coding unit 260 (the shaded area). The same applies to the other image blocks and image frames, so that 2 spatial divisions × 4 temporal divisions = 8 coding units are formed for the CbCr image sequence 256.
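The unit counts above can be checked with a minimal sketch. The function name `split_into_coding_units` and the coordinate-list representation are illustrative assumptions, not part of the described apparatus.

```python
def split_into_coding_units(width, height, frames, bw, bh, bt):
    """Split a width x height x frames volume into blocks of bw x bh x bt.

    Returns a list of coding units; each unit is a list of (x, y, t) coordinates.
    """
    units = []
    for t0 in range(0, frames, bt):
        for y0 in range(0, height, bh):
            for x0 in range(0, width, bw):
                units.append([(x, y, t)
                              for t in range(t0, t0 + bt)
                              for y in range(y0, y0 + bh)
                              for x in range(x0, x0 + bw)])
    return units

# Y image sequence: 8 x 8 pixels x 8 frames, divided into 4 x 2 x 2 blocks.
y_units = split_into_coding_units(8, 8, 8, 4, 2, 2)
# One chroma plane (Cb or Cr): 4 x 4 pixels x 8 frames, same block size.
# A CbCr coding unit pairs a Cb block with the co-located Cr block.
c_units = split_into_coding_units(4, 4, 8, 4, 2, 2)
print(len(y_units), len(c_units))
```

Each returned unit holds 16 coordinates, which is the sample count used in the quantization step that follows.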

The compression encoding unit 124 generates palette and index data for each coding unit. The palette and index are basically the same as those generated from an RGB image in the S3TC texture compression scheme. In the present embodiment, however, the dimensionality of the parameters differs from that of ordinary S3TC. FIG. 13 schematically shows how palette and index data are generated from the coding unit 258 of the Y image sequence 254.

When divided with the pattern shown in FIG. 12, the coding unit 258 contains 4 × 2 × 2 = 16 pixels. In the figure, pixels are drawn schematically as circles. Plotting the luminance Y sample value of each pixel on the luminance Y axis gives a distribution 262. Two representative values are selected from the 16 samples plotted in the distribution 262. For example, the minimum value (min) and the maximum value (max) are selected as the representative values, and the data holding these two values is the palette. Further, on the luminance Y axis, let the luminance that internally divides the segment between the minimum and maximum values at 1:2 be the first intermediate value (mid1), and the luminance that divides it at 2:1 be the second intermediate value (mid2). The index is the data that holds, for each pixel, information designating one of the four values: minimum, first intermediate, second intermediate, or maximum.

That is, for one coding unit 258 of the Y image sequence 254, the palette is 8 bits representing luminance Y × 2 values = 2 bytes, and the index is 2 bits, representing the identification numbers 0 to 3 of the four values, × 16 pixels = 4 bytes. Since the Y image sequence 254 of one processing unit consists of 32 coding units as described above, the Y image sequence 254 as a whole has 32 × 2 bytes = 64 bytes of palette data and 32 × 4 bytes = 128 bytes of index data.
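The luminance quantization described above can be sketched as follows. The function names, the nearest-level index assignment, and the little-endian bit packing are assumptions made for illustration; the text specifies only the four levels and the 2-byte and 4-byte sizes.

```python
def encode_luma_unit(samples):
    """Quantize 16 luminance samples into a 2-value palette and 2-bit indices."""
    lo, hi = min(samples), max(samples)
    # mid1 divides [lo, hi] internally at 1:2, mid2 at 2:1.
    levels = (lo, (2 * lo + hi) / 3, (lo + 2 * hi) / 3, hi)
    # Assumption: each pixel is assigned the nearest of the four levels.
    indices = [min(range(4), key=lambda k: abs(levels[k] - s)) for s in samples]
    return (lo, hi), indices

def pack_indices(indices):
    """Pack sixteen 2-bit indices into 4 bytes (bit order is an assumption)."""
    packed = 0
    for i, idx in enumerate(indices):
        packed |= idx << (2 * i)
    return packed.to_bytes(4, "little")

samples = [16, 89, 162, 235] * 4
palette, indices = encode_luma_unit(samples)
index_bytes = pack_indices(indices)
print(palette, len(bytes(palette)), len(index_bytes))
```

With these sample values the four levels fall exactly on 16, 89, 162, and 235, so the unit is reproduced without error; in general the two intermediate levels only approximate the samples between the minimum and maximum.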

FIG. 14 schematically shows how palette and index data are generated from the coding unit 260 of the CbCr image sequence 256. When divided with the pattern shown in FIG. 12, the coding unit 260 contains 4 × 2 × 2 = 16 pixels in each of the Cb image and the Cr image. Treating the pixel values of corresponding pixels in the two images as a color difference sample with elements (color difference Cb, color difference Cr), and plotting these samples on a two-dimensional plane whose axes are Cb and Cr, gives a distribution 264.

Two representative values are selected from the 16 samples plotted in the distribution 264. For example, when the distribution 264 is approximated by a straight line, the color differences at the left and right ends of the line are taken as the minimum value (min) and the maximum value (max), respectively, and used as the representative values. The data holding these two values is the palette. Each representative value here is a two-dimensional parameter with elements (color difference Cb, color difference Cr). Further, on the approximating line, let the color difference that internally divides the segment between the minimum and maximum values at 1:2 be the first intermediate value (mid1), and the color difference that divides it at 2:1 be the second intermediate value (mid2). The index is the data that holds, for each pixel, information designating one of the four values: minimum, first intermediate, second intermediate, or maximum.

That is, for one coding unit 260 of the CbCr image sequence 256, the palette is 2 elements (Cb and Cr) × 8 bits per color difference × 2 values = 4 bytes, and the index is 2 bits, representing the identification numbers 0 to 3 of the four values, × 16 pixels = 4 bytes. Since the CbCr image sequence 256 of one processing unit consists of 8 coding units as described above, the CbCr image sequence 256 as a whole has 8 × 4 bytes = 32 bytes of palette data and 8 × 4 bytes = 32 bytes of index data.
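The text does not say how the approximating line is obtained. The sketch below assumes the principal axis of the 16 (Cb, Cr) samples (a least-squares line through their mean) and takes the extreme projections onto it as the two representative values; `encode_chroma_unit` is a hypothetical name.

```python
import math

def encode_chroma_unit(samples):
    """samples: 16 (Cb, Cr) pairs -> ((min_pair, max_pair), indices)."""
    n = len(samples)
    mean_cb = sum(cb for cb, _ in samples) / n
    mean_cr = sum(cr for _, cr in samples) / n
    sxx = sum((cb - mean_cb) ** 2 for cb, _ in samples)
    syy = sum((cr - mean_cr) ** 2 for _, cr in samples)
    sxy = sum((cb - mean_cb) * (cr - mean_cr) for cb, cr in samples)
    # Assumption: the approximating line is the principal axis of the samples.
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    dx, dy = math.cos(theta), math.sin(theta)
    # Position of each sample along the line, measured from the mean.
    proj = [(cb - mean_cb) * dx + (cr - mean_cr) * dy for cb, cr in samples]
    t_min, t_max = min(proj), max(proj)
    levels = (t_min, (2 * t_min + t_max) / 3, (t_min + 2 * t_max) / 3, t_max)
    indices = [min(range(4), key=lambda k: abs(levels[k] - t)) for t in proj]

    def to_byte_pair(t):
        # Convert a position on the line back to an 8-bit (Cb, Cr) pair.
        clamp = lambda v: max(0, min(255, round(v)))
        return (clamp(mean_cb + t * dx), clamp(mean_cr + t * dy))

    return (to_byte_pair(t_min), to_byte_pair(t_max)), indices

samples = [(100 + i, 120 + i) for i in range(16)]
palette, indices = encode_chroma_unit(samples)
print(palette, indices)
```

Samples that lie off the line are represented by their projection onto it, so the residual error is confined to the direction normal to the line.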

Compressed in this way, the RGB images of one processing unit of 8 pixels × 8 pixels × 8 frames become 64 bytes of palette and 128 bytes of index for the Y image sequence plus 32 bytes of palette and 32 bytes of index for the CbCr image sequence, 256 bytes in total. That is, the data amounts to 0.5 bytes per pixel.

When a 4 pixel × 4 pixel RGB image is compressed with S3TC, the palette is 2 bytes representing an RGB value × 2 values = 4 bytes, and the index is 2 bits, representing the identification numbers 0 to 3 of the four RGB values, × 16 pixels = 4 bytes. The compressed data is therefore 8 bytes / 16 pixels = 0.5 bytes per pixel, the same as the data size obtained with the compression method described above. Consequently, by compressing moving image data in such processing units, still images and moving images can be handled in the same way with respect to the unit of data loaded from the hard disk drive 50 into the main memory 60, the cache line size in the main memory 60, and so on.
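The byte budget stated above can be tallied directly. The variable names are illustrative; all figures come from the text.

```python
# Per coding unit sizes in bytes, as stated in the text.
Y_PALETTE, Y_INDEX = 2, 4
C_PALETTE, C_INDEX = 4, 4
Y_UNITS, C_UNITS = 32, 8  # coding units per 8 x 8 x 8 processing unit

total_bytes = Y_UNITS * (Y_PALETTE + Y_INDEX) + C_UNITS * (C_PALETTE + C_INDEX)
pixels = 8 * 8 * 8
bytes_per_pixel = total_bytes / pixels

# S3TC on a 4 x 4 RGB block: 4-byte palette + 4-byte index for 16 pixels.
s3tc_bytes_per_pixel = (4 + 4) / 16

print(total_bytes, bytes_per_pixel, s3tc_bytes_per_pixel)
```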

In the present embodiment, moreover, the RGB image is decomposed into a Y image holding a one-dimensional parameter and a CbCr image holding a two-dimensional parameter before the palette and index are generated. For the one-dimensional Y image, all sample values therefore lie on a straight line, and for the two-dimensional CbCr image, samples deviate from the approximating line only in the direction normal to that line. The quantization error can thus be kept smaller than with the ordinary S3TC method, which approximates an RGB image holding three-dimensional parameters by a straight line and quantizes it.

In the division pattern of FIG. 12, the moving image stream is divided into coding units of 4 pixels wide × 2 pixels high × 2 frames. As described above, this division pattern may be changed adaptively according to the image content. FIG. 15 shows variations of the pattern for dividing one processing unit. From the left of the figure these are pattern (A), pattern (B), pattern (C), and pattern (D). For both the Y image sequence in the upper row and the CbCr image sequence in the lower row, the spatial division boundaries are drawn as straight lines and one representative coding unit is shaded.

Pattern (A) divides the data into units of 4 pixels wide × 4 pixels high × 1 frame. Pattern (B) is the same as the pattern shown in FIG. 12. Pattern (C) divides the data into units of 2 pixels wide × 2 pixels high × 4 frames, and pattern (D) into units of 2 pixels wide × 1 pixel high × 8 frames.

In all of these patterns, one coding unit contains 16 pixels for the Y image sequence and 16 pixels × 2 for the CbCr image sequence, so the number of samples to be quantized is the same as shown in FIGS. 13 and 14. Moving from pattern (D) toward pattern (A), the temporal division becomes finer; moving from pattern (A) toward pattern (D), the spatial division becomes finer. Such division patterns are prepared in advance, and a pattern is selected according to whether the image is redundant in the spatial direction or in the temporal direction.
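As a quick check on the four patterns, the sketch below tabulates them and counts pixels and coding units. The dictionary `PATTERNS` and the (width, height, frames) tuple layout are illustrative assumptions; the identification numbers 0 to 3 follow the example numbering given later for the division pattern storage unit 128.

```python
# (width, height, frames) of one coding unit for patterns (A) to (D).
PATTERNS = {0: (4, 4, 1), 1: (4, 2, 2), 2: (2, 2, 4), 3: (2, 1, 8)}

Y_VOLUME = 8 * 8 * 8       # luminance samples per processing unit
CHROMA_VOLUME = 4 * 4 * 8  # samples per reduced Cb (or Cr) plane sequence

pixels_per_unit = {pid: w * h * f for pid, (w, h, f) in PATTERNS.items()}
y_unit_count = {pid: Y_VOLUME // n for pid, n in pixels_per_unit.items()}
c_unit_count = {pid: CHROMA_VOLUME // n for pid, n in pixels_per_unit.items()}

print(pixels_per_unit)
print(y_unit_count, c_unit_count)
```

Every pattern yields 32 luminance units and 8 chroma units, which is why the compressed size is independent of the pattern chosen.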

Specifically, when an image has spatial redundancy, for example when it contains many nearly uniform regions such as sky or grass, its pixel values tend to be uniform in space, and reducing the number of spatial divisions introduces little quantization error, so a division pattern close to pattern (A) is selected. When an image has temporal redundancy, for example when a scene with little motion is captured from a fixed viewpoint, its pixel values tend to be uniform in the time direction, and reducing the number of temporal divisions introduces little quantization error, so a division pattern close to pattern (D) is selected.

In pattern (D), for example, one coding unit has only two pixels in the spatial direction. If there is no temporal change over the eight frames included in the coding unit, the two representative values held in the palette are exactly the original pixel values, so the quantization error is zero. When an RGB image is compressed with the S3TC method, the RGB data held in the palette is reduced from the original 24 bits to 16 bits, which can degrade image quality, for example by leaving insufficient gradation after decoding. In the present embodiment, an 8-bit palette is prepared for each of luminance Y and color differences Cb and Cr, so the original image quality is likely to be preserved.

The division pattern storage unit 128 stores the four division patterns (A) to (D) in association with information identifying them, for example the four identification numbers 0, 1, 2, and 3. The image division unit 122 applies every division pattern stored in the division pattern storage unit 128 to each image sequence generated by the YCbCr conversion unit 120, and selects the division pattern that gives the smallest error relative to the original image.

In practice, this is done by having the compression encoding unit 124 compress and encode the image sequence as divided by each division pattern, and then comparing, frame by frame, the image decoded from each set of compressed data with the image before compression. The division pattern with the smallest difference is then selected. The image division unit 122 notifies the compression encoding unit 124 of the identification number of the selected division pattern, and the compression encoding unit 124 includes that identification number in the generated compressed data to produce the final compressed data, which it stores in the compressed data storage unit 130.
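A minimal sketch of this trial-and-compare selection for the luminance data of one processing unit follows. The squared-error metric, the nearest-level assignment, and the names `unit_error`, `pattern_error`, and `select_pattern` are assumptions; the text says only that the pattern with the smallest difference is chosen.

```python
PATTERNS = {0: (4, 4, 1), 1: (4, 2, 2), 2: (2, 2, 4), 3: (2, 1, 8)}  # (w, h, frames)

def unit_error(samples):
    """Squared error after quantizing to min, mid1, mid2, max and decoding."""
    lo, hi = min(samples), max(samples)
    levels = (lo, (2 * lo + hi) / 3, (lo + 2 * hi) / 3, hi)
    return sum(min((s - v) ** 2 for v in levels) for s in samples)

def pattern_error(volume, pattern):
    """volume[t][y][x] is an 8 x 8 x 8 block of luminance values."""
    bw, bh, bt = pattern
    total = 0.0
    for t0 in range(0, 8, bt):
        for y0 in range(0, 8, bh):
            for x0 in range(0, 8, bw):
                samples = [volume[t][y][x]
                           for t in range(t0, t0 + bt)
                           for y in range(y0, y0 + bh)
                           for x in range(x0, x0 + bw)]
                total += unit_error(samples)
    return total

def select_pattern(volume):
    """Return the identification number of the pattern with the least error."""
    return min(PATTERNS, key=lambda pid: pattern_error(volume, PATTERNS[pid]))

# Spatially detailed but static over time: the finest spatial division wins.
static = [[[x * 7 + y * 29 for x in range(8)] for y in range(8)] for t in range(8)]
# Spatially flat but changing every frame: the finest temporal division wins.
flicker = [[[t * 30 for x in range(8)] for y in range(8)] for t in range(8)]
print(select_pattern(static), select_pattern(flicker))
```

When several patterns give the same error, this sketch keeps the lowest identification number; the text does not specify a tie-breaking rule.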

The division pattern may differ from region to region within an image. The procedure for selecting a division pattern for each region may be the same as described above. The image division unit 122 then generates a map associating the identification numbers of the selected division patterns with the regions, and includes it in the final compressed data. FIG. 16 shows an example of the data structure of such a division pattern map. The example in the figure is for a moving image stream composed of image frames of 256 pixels × 256 pixels. When the four division patterns shown in FIG. 15 are available, the smallest unit for which a division pattern can be set is one processing unit, that is, 8 pixels × 8 pixels × 8 frames.

When a division pattern is set for each such smallest unit, an identification number of a division pattern, that is, a value from 0 to 3, is associated with each 8 pixel × 8 pixel region of the 256 pixel × 256 pixel image frame, as shown in FIG. 16. The division pattern map 270 is consequently 32 × 32 × 2 bits = 256 bytes of information. Adding such a division pattern map 270 every eight frames allows the division pattern to vary in the time direction as well.
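The map size can be confirmed by packing 32 × 32 two-bit identification numbers. The raster ordering and the bit layout within each byte are assumptions made for this sketch, as are the function names.

```python
def pack_pattern_map(ids):
    """Pack a flat list of 2-bit pattern IDs, four per byte."""
    out = bytearray((len(ids) * 2 + 7) // 8)
    for i, pid in enumerate(ids):
        out[i // 4] |= (pid & 3) << (2 * (i % 4))
    return bytes(out)

def unpack_pattern_id(packed, i):
    """Read back the ID of the i-th 8 x 8 region."""
    return (packed[i // 4] >> (2 * (i % 4))) & 3

# One ID per 8 x 8 region of a 256 x 256 frame: 32 x 32 regions.
ids = [(i * 7) % 4 for i in range(32 * 32)]
packed = pack_pattern_map(ids)
print(len(packed))
```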

In the example of FIG. 16 the division pattern is set for each smallest unit of 8 pixels × 8 pixels, but it may equally be set for each region formed by joining 8 pixel × 8 pixel regions vertically and horizontally, such as every 16 pixels × 16 pixels or every 64 pixels × 32 pixels. The setting unit itself can be chosen in various ways, for example by setting a single division pattern for the entire area. Besides being generated from the error between the original image and the image decoded from actually compressed data, as described above, the division pattern map may be prepared in advance by determining the setting units and the division patterns assigned to them from test images with similar content.

Next, the procedure by which the compression encoding unit 124 stores the compressed and encoded data in the compressed data storage unit 130 is described. The compressed data generated in the present embodiment consists of palettes and indexes, as in the S3TC texture compression scheme. The decoding process can therefore use, as it is, the shading function of an ordinary GPU included in the control unit 100 of the image processing apparatus 10 of FIG. 4.

For this reason it is desirable that the index and palette generated by quantizing the Y image sequence and the index and palette generated by quantizing the CbCr image sequence can be read out and decoded in the same way as an ordinary texture image. When the compressed data is stored, the quantized data of the Y image sequence and the quantized data of the CbCr image sequence representing the same region are therefore grouped together, so that pixels can be restored with few data accesses.

FIG. 17 is a diagram for explaining the arrangement of compressed data in the compressed data storage unit 130. As noted above, to handle data that has been decomposed into a Y image sequence and a CbCr image sequence and then quantized in the same way as compressed data of an RGB image, it is desirable to store the data representing the same region together. In the present embodiment, therefore, the compressed data 280 for the Y image sequence and the compressed data 282 for the CbCr image sequence representing the same region are grouped into one storage unit.

In the figure, within the compressed data 280 for the Y image sequence, each rectangular solid labeled "I" is an index generated from one coding unit, and each rectangular solid labeled "P" is a palette generated from one coding unit. The same applies to the compressed data 282 for the CbCr image sequence. As described above, the index and palette of the Y image sequence are 4 bytes and 2 bytes per coding unit, respectively. The index and palette of the CbCr image sequence are both 4 bytes per coding unit.

As shown in FIG. 17, the data of four coding units of the Y image sequence and one coding unit of the CbCr image sequence representing the same region are arranged together in a storage area 4 bytes deep. Since each palette in the compressed data 280 for the Y image sequence is 2 bytes, placing two palettes along the depth direction as shown in the figure yields data of 2 (vertical) × 4 (horizontal) × 4 bytes. The Y image sequence and CbCr image sequence representing the same region are, for example, the image blocks "A", "B", "C", and "D" of the Y image, the image block "I" of the Cb image, and the image block "K" of the Cr image in FIG. 12.

Grouped in this way, the compressed data can be stored as it is in a storage area 284 that holds RGBA image data of 2 pixels vertically × 4 pixels horizontally. As described above, each processing unit of 8 pixels × 8 pixels × 8 frames yields 32 coding units for the Y image sequence and 8 coding units for the CbCr image sequence, so eight such storage units are formed per processing unit. Since one storage unit has the same data size as an RGBA image of 2 pixels vertically × 4 pixels horizontally, one processing unit amounts to the data of an 8 pixel × 8 pixel RGBA image. This holds for every division pattern shown in FIG. 15.
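The size of one storage unit can be checked with a small sketch. The byte order used here (Y indexes, then Y palettes, then the CbCr index and palette) is an assumption, since the exact arrangement of FIG. 17 is not reproduced in the text; only the sizes are taken from it.

```python
def build_storage_unit(y_indexes, y_palettes, c_index, c_palette):
    """Concatenate 4 Y coding units and 1 CbCr coding unit into one storage unit."""
    assert len(y_indexes) == 4 and all(len(b) == 4 for b in y_indexes)
    assert len(y_palettes) == 4 and all(len(b) == 2 for b in y_palettes)
    assert len(c_index) == 4 and len(c_palette) == 4
    return b"".join(y_indexes) + b"".join(y_palettes) + c_index + c_palette

# Dummy payloads with the sizes given in the text.
unit = build_storage_unit([bytes(4)] * 4, [bytes(2)] * 4, bytes(4), bytes(4))

RGBA_BYTES_PER_PIXEL = 4
rgba_2x4_bytes = 2 * 4 * RGBA_BYTES_PER_PIXEL
processing_unit_bytes = 8 * len(unit)  # eight storage units per processing unit
print(len(unit), rgba_2x4_bytes, processing_unit_bytes)
```

Eight such units total 256 bytes, the size of one 8 pixel × 8 pixel RGBA image at 4 bytes per pixel.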

FIG. 18 schematically shows how the data changes when the compression encoding process described so far is applied to an entire moving image stream. Assume that the moving image stream consists of image frames of 256 pixel × 256 pixel RGB images and is compressed in units of eight frames. First, the eight image frames are divided into processing units of 8 pixels × 8 pixels (S20). This forms 32 processing units in each of the vertical and horizontal directions.

Next, as shown in FIG. 12, YCbCr conversion is applied to each processing unit to generate a Y image and a reduced CbCr image, each of which is divided into coding units from which indexes and palettes are generated. These are grouped to produce eight storage units per processing unit (S22). As a result, eight frames of RGB images are compressed into one frame of an RGBA image with the same number of pixels.

A method of embedding the division pattern map described above in the compressed data is now explained. As shown in FIG. 17, one storage unit stores four palettes of the Y image sequence. Each palette stores two values, the representative values of luminance Y. Of the four palettes included in one storage unit, two palettes placed side by side in the depth direction are therefore used to embed the 2 bits of information that identify the four division patterns.

FIG. 19 is a diagram for explaining the method of embedding the identification number of a division pattern in these two palettes. Suppose that the two values held by the first palette 290 are "Pa0" and "Pa1", in order from the leading address at the front of the figure, and that the two values held by the second palette 292 are "Pb0" and "Pb1" in address order. A total of 2 bits of information is then expressed by the magnitude relationship between "Pa0" and "Pa1" and the magnitude relationship between "Pb0" and "Pb1".

For example, one bit is expressed by taking the value "1" if "Pa0" of the first palette 290 is larger than "Pa1", and "0" otherwise. Likewise, a second bit is expressed by taking the value "1" if "Pb0" of the second palette 292 is larger than "Pb1", and "0" otherwise. Which of the two values held by a palette is stored at the lower address does not affect the decoding process. The identification number of the division pattern can therefore be embedded in the palettes by choosing, according to that identification number, the address at which the larger value of each palette is stored.
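The ordering trick can be sketched as follows. Which bit of the identification number goes to which palette is an assumption, as are the function names. The sketch also assumes the decoder sorts the two palette values before building the four levels, so that the stored order carries no pixel information; and a palette whose two values are equal cannot express a "1" bit, a case the text does not address.

```python
def embed_pattern_id(pal_a, pal_b, pattern_id):
    """Reorder two 2-value palettes so their ordering encodes a 2-bit ID."""
    def order(pal, bit):
        lo, hi = sorted(pal)
        # bit 1: larger value first; bit 0: smaller (or equal) value first.
        return (hi, lo) if bit else (lo, hi)
    return order(pal_a, (pattern_id >> 1) & 1), order(pal_b, pattern_id & 1)

def extract_pattern_id(pal_a, pal_b):
    """Recover the 2-bit ID from the magnitude relation inside each palette."""
    return (int(pal_a[0] > pal_a[1]) << 1) | int(pal_b[0] > pal_b[1])

for pattern_id in range(4):
    a, b = embed_pattern_id((10, 200), (30, 40), pattern_id)
    print(pattern_id, a, b, extract_pattern_id(a, b))
```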

In this way, the division pattern map can be included in the compressed data without being generated separately from the body of the compressed data, which keeps the overall data size down. Because the information is embedded in the compressed data of the corresponding region, it is also efficient to look up. Since the division pattern is set per processing unit (= 8 storage units) at the finest, as described above, it suffices to embed one division pattern in any one pair of palettes among the eight storage units. Alternatively, the same division pattern may be embedded in all 16 pairs of palettes included in the eight storage units.

To decode compressed data in which division patterns are embedded in this way, the palettes of the Y image sequence carrying the embedded division pattern are first read for each processing unit, and the identification number of the division pattern set for that processing unit is determined. This associates each pixel with the storage locations of the index and palette containing the data needed to draw it. The index and palette of the Y image sequence and the index and palette of the CbCr image sequence corresponding to the pixel to be drawn are then read and decoded accordingly.

The decoding process can basically be performed in the same way as for S3TC. That is, intermediate values interpolating the representative values held by each palette are generated, and the representative value or intermediate value designated by the index becomes the pixel value of each pixel. In the present embodiment, however, a palette and an index are generated for each coding unit, so the Y image sequence and the CbCr image sequence are restored by rearranging the determined pixel values in the spatial and temporal directions according to the arrangement of coding units in the image sequence that corresponds to the division pattern. The CbCr image is then enlarged to generate the Cb image and the Cr image, giving a YCbCr image corresponding to the original image frame.
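A minimal sketch of decoding one luminance coding unit and placing it back into the 8 × 8 × 8 volume follows. The bit packing, the rounding of intermediate values, and the pixel scan order within a unit (x fastest, then y, then t) are assumptions; the names `decode_luma_unit` and `place_unit` are hypothetical.

```python
def decode_luma_unit(palette, packed_index):
    """Expand a 2-value palette and a 4-byte index into 16 luminance values."""
    lo, hi = sorted(palette)  # stored order may carry the pattern ID instead
    levels = (lo, round((2 * lo + hi) / 3), round((lo + 2 * hi) / 3), hi)
    bits = int.from_bytes(packed_index, "little")
    return [levels[(bits >> (2 * i)) & 3] for i in range(16)]

def place_unit(volume, values, origin, pattern):
    """Write decoded values into volume[t][y][x] for one coding unit."""
    x0, y0, t0 = origin
    bw, bh, bt = pattern
    it = iter(values)
    for t in range(t0, t0 + bt):
        for y in range(y0, y0 + bh):
            for x in range(x0, x0 + bw):
                volume[t][y][x] = next(it)

volume = [[[0] * 8 for _ in range(8)] for _ in range(8)]
values = decode_luma_unit((235, 16), bytes([0xE4] * 4))
place_unit(volume, values, origin=(4, 2, 2), pattern=(4, 2, 2))
print(values)
```

The chroma planes would be handled the same way with two-element levels, followed by enlargement of the reduced Cb and Cr planes to full size.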

According to the present embodiment described above, hierarchical data is generated by layering a plurality of moving image streams that represent the image frames of a moving image at different resolutions, and the moving image is displayed while the display area moves in response to viewpoint movement requests from the user. By switching the layer of data used for frame rendering according to the required scale, even ordinary high-definition images or moving images of still higher resolution can be enlarged to check details or reduced to give an overview, with such requests accepted continuously and displayed responsively.

The moving image streams that make up each layer of the hierarchical data consist of image frames of the same size in every layer. As a result, the higher the resolution of a layer, the more moving image streams make up that layer. Making the image frame sizes uniform in this way allows processing at display time, such as loading and decoding, to be made uniform regardless of layer, and enables efficient rendering suited to the locality of the display target area.

In addition, composing one image from a plurality of moving image streams makes it possible to make adjustments that reflect the spatial locality of the image, such as using a different frame rate for each moving image stream or making some regions still images. Furthermore, if the image of a layer at a given resolution contains a region that can be substituted by enlarging the image of a lower-resolution layer, the moving image stream covering that region can be omitted from the data altogether.

Instead of composing a moving image from a single set of hierarchical data for its entire length, it may be divided on the time axis and composed of a plurality of sets of hierarchical data. The moving image stream of each layer included in one set of hierarchical data may be a single set of compressed moving image data for the entire length, or separate compressed moving image data for each predetermined number of image frames. By allowing the number of sets of hierarchical data, the data structure of the moving image streams within the hierarchical data, and the compression encoding format to be selected as appropriate according to the content of the moving image, its playback time, and so on, an optimal display mode can be realized from multiple viewpoints, such as the processing load during moving image display and the required image quality.

Furthermore, in this embodiment the video stream is compression-encoded every predetermined number of image frames. The RGB image of each image frame in the original video stream is first converted into separate images represented by luminance Y and color differences Cb and Cr. The Cb and Cr images are then reduced, and each image sequence is divided by a predetermined size and a predetermined number of image frames to generate coding units. Palette and index data are then generated for each of the Y image sequence and the CbCr image sequence. The palette is data holding two values that serve as the representative values of each image, and the index is data that designates, for each pixel, one of the representative values or one of the intermediate values obtained by linearly interpolating them.
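
The palette and index idea can be sketched for a single coding unit of luminance samples. This is a minimal illustration under stated assumptions, not the method of the specification: the minimum and maximum are used as the two representative values, two intermediate values are interpolated (so each index needs 2 bits), and the names `candidate_values` and `encode_unit` are hypothetical.

```python
def candidate_values(first, second):
    """Two representative values plus two linearly interpolated intermediates.

    Assumption: two intermediate values, giving four candidates in total.
    """
    return [first, (2 * first + second) / 3, (first + 2 * second) / 3, second]

def encode_unit(values):
    """Compress a flat list of 8-bit samples from one coding unit.

    Returns (palette, indices). The palette holds the two representative
    values (assumed here to be the minimum and maximum), and each index
    selects the candidate closest to the original sample.
    """
    palette = (min(values), max(values))
    candidates = candidate_values(*palette)
    indices = [
        min(range(len(candidates)), key=lambda i: abs(candidates[i] - v))
        for v in values
    ]
    return palette, indices

palette, indices = encode_unit([16, 20, 36, 64, 16, 64, 48, 32])
print(palette)  # (16, 64)
print(indices)  # [0, 0, 1, 3, 0, 3, 2, 1]
```

In this example eight 8-bit samples (64 bits) are replaced by two 8-bit palette values and eight 2-bit indices (32 bits); larger coding units amortize the palette over more samples.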

The concepts of a palette and an index were introduced in the S3TC compression method for RGB texture images, but in this embodiment the two palette values retain 8 bits each for luminance Y, color difference Cb, and color difference Cr, so image quality is less prone to degradation. In addition, because the Y image sequence and the CbCr image sequence are quantized separately, the parameter dimensionality is lower and the quantization error smaller than when three-dimensional RGB parameters are quantized. Further, by varying the combination of the number of spatial divisions and the number of temporal divisions used when forming coding units, a data structure that adapts to the spatial and temporal redundancy of the image can be provided flexibly.
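
How varying the spatial and temporal division can adapt to redundancy is sketched below. The sketch tries several coding-unit shapes on a small block of samples and keeps the shape with the smallest quantization error. The three shapes, the four-level quantizer, and all function names are assumptions made for illustration only.

```python
from itertools import product

def unit_error(samples):
    """Squared error after quantizing samples to 4 levels between min and max."""
    lo, hi = min(samples), max(samples)
    levels = [lo, (2 * lo + hi) / 3, (lo + 2 * hi) / 3, hi]
    return sum(min((s - q) ** 2 for q in levels) for s in samples)

def pattern_error(block, unit_shape):
    """Total error when block[t][y][x] is split into units of unit_shape.

    unit_shape is (frames, rows, cols); it must divide the block evenly.
    """
    frames, rows, cols = len(block), len(block[0]), len(block[0][0])
    ut, uy, ux = unit_shape
    total = 0.0
    for t0, y0, x0 in product(range(0, frames, ut),
                              range(0, rows, uy),
                              range(0, cols, ux)):
        samples = [block[t][y][x]
                   for t in range(t0, t0 + ut)
                   for y in range(y0, y0 + uy)
                   for x in range(x0, x0 + ux)]
        total += unit_error(samples)
    return total

def select_pattern(block, patterns):
    """Return the unit shape with the smallest total quantization error."""
    return min(patterns, key=lambda shape: pattern_error(block, shape))

# Hypothetical division patterns; each unit holds 16 samples.
PATTERNS = [(1, 4, 4), (2, 4, 2), (4, 2, 2)]

# A block of 4 frames of 4 x 4 samples: static in time, detailed in space.
block = [[[(y * 4 + x) * 16 for x in range(4)] for y in range(4)]
         for _ in range(4)]
print(select_pattern(block, PATTERNS))  # (4, 2, 2)
```

For this block, which does not change over time, the shape that spans all four frames but only 2 x 2 pixels gives the smallest error, because each unit then has to cover a narrower range of values.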

With the above compression method, rendering can be performed in the same way as texture mapping on a GPU. High throughput can therefore be expected, sufficient even for the hierarchically structured video data of this embodiment, in which the video streams for the display area are read while layers are switched and images are rendered at a predetermined frame rate. By comparison with existing compression encoding schemes, decoding each image frame with JPEG, for example, tends to increase the decoding load depending on the image content. With MPEG, I pictures must be decoded for each of the multiple video streams, so the processing load tends to increase, while reducing the number of I pictures tends to introduce latency for random access in the time direction.
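
The reason decoding is cheap can be seen in a CPU-side sketch: each sample is recovered by one interpolation and one table lookup, with no dependency on neighboring units or frames. The sketch assumes two interpolated intermediate values per palette and uses hypothetical function names; it is not GPU code and is not taken from the specification.

```python
def decode_unit(palette, indices):
    """Reconstruct the samples of one coding unit from its palette and indices.

    Assumption: four candidates per unit, namely the two representative
    values and two values linearly interpolated between them.
    """
    first, second = palette
    candidates = [first,
                  (2 * first + second) / 3,
                  (first + 2 * second) / 3,
                  second]
    return [round(candidates[i]) for i in indices]

def decode_sample(units, unit_number, offset):
    """Random access: decode one sample without touching any other unit."""
    palette, indices = units[unit_number]
    return decode_unit(palette, [indices[offset]])[0]

units = [((16, 64), [0, 0, 1, 3, 0, 3, 2, 1]),
         ((100, 130), [3, 2, 1, 0, 0, 1, 2, 3])]
print(decode_unit(*units[0]))        # [16, 16, 32, 64, 16, 64, 48, 32]
print(decode_sample(units, 1, 2))    # 110
```

Since every unit is self-contained, any region of any frame can be decoded directly, which is the property that low-latency spatial and temporal random access relies on.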

By performing decoding on the GPU, the compression encoding technique of this embodiment enables faster rendering than the existing techniques described above. As a result, high-definition video can be displayed while the processing load on the CPU is kept low. The CPU is therefore free to perform additional processing, and the risk of dropped frames is reduced even on devices with limited CPU performance, such as portable terminals. This characteristic suits a coming technical trend in which, as SSDs (Solid State Drives) and similar devices become widespread, reading data from storage becomes faster and decoding is more likely to become the bottleneck.

As a result, this compression encoding technique achieves high-throughput rendering while preserving image quality and allows temporal and spatial random access with low latency. Applying it to hierarchically structured moving image data, which is used to display high-definition video while the display area changes, therefore yields a more effective moving image display technique.

The present invention has been described above based on an embodiment. Those skilled in the art will understand that the embodiment is illustrative, that various modifications are possible in the combinations of its constituent elements and processing steps, and that such modifications also fall within the scope of the present invention.

Description of symbols: 1 image processing system, 10 image processing apparatus, 12 display device, 20 input device, 30 0th layer, 32 1st layer, 34 2nd layer, 36 3rd layer, 44 display processing unit, 50 hard disk drive, 60 main memory, 70 buffer memory, 100 control unit, 102 input information acquisition unit, 106 load stream determination unit, 108 load unit, 110 frame coordinate determination unit, 112 decoding unit, 114 display image processing unit, 120 YCbCr conversion unit, 122 image division unit, 124 compression encoding unit, 126 video stream storage unit, 128 division pattern storage unit, 130 compressed data storage unit.

Claims

A data dividing unit that divides a data string in a three-dimensional parameter space to be compressed in the three-dimensional direction to form a coding unit;
A compression encoding unit that generates, for each coding unit formed by the data dividing unit, a palette that holds two values of the data as representative values and an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linear interpolation of the representative values, and that sets the palette and the index as compressed data;
A data compression apparatus comprising:

The data compression apparatus according to claim 1, further comprising a YCbCr conversion unit that generates, as the data string, a Y image sequence, a Cb image sequence, and a Cr image sequence by converting the pixel values of each image frame of moving image data, which has pixel values over the image frame plane and the time axis, into luminance Y and two color differences Cb and Cr, and by arranging in chronological order the resulting Y images having luminance Y as pixel values, Cb images having color difference Cb as pixel values, and Cr images having color difference Cr as pixel values,
wherein the data dividing unit forms coding units by dividing each of the Y image sequence, the Cb image sequence, and the Cr image sequence generated by the YCbCr conversion unit in space and time.

The data compression apparatus according to claim 1, wherein the data dividing unit forms a plurality of patterns of coding units by dividing the data string with a plurality of division patterns prepared in advance, causes the compression encoding unit to compress the data with each pattern, and selects the division pattern that yields the smallest error.

The data compression apparatus according to any one of claims 1 to 3, wherein the data dividing unit selects a division pattern for each predetermined unit region of the three-dimensional parameter space and generates a division pattern map in which identification information of the selected division pattern is associated with the predetermined unit region, and
the compression encoding unit generates the palette and the index for each coding unit formed by the selected division pattern and includes the division pattern map in the compressed data.

The data compression apparatus according to claim 2, wherein the compression encoding unit generates a palette and an index for luminance Y for each coding unit of the Y image sequence, and generates a palette and an index for the parameter (color difference Cb, color difference Cr) for each coding unit of the Cb image sequence and the Cr image sequence.

The data compression apparatus according to claim 2, wherein the data dividing unit reduces the Cb images and the Cr images by a predetermined factor in the image plane direction before dividing them, and performs the space-time division so that the number of pixels contained in a coding unit is equal among the Y image sequence, the Cb image sequence, and the Cr image sequence.

The data compression apparatus according to claim 1, wherein the compression encoding unit generates storage units, each grouping the palettes and indexes generated for the coding units within a predetermined region of the original three-dimensional parameter space, and stores the compressed data in a storage device storage unit by storage unit.

The data compression apparatus according to claim 4, wherein the compression encoding unit embeds the division pattern map in the compressed data by expressing the identification information of the division pattern, for each coding unit, by the magnitude relationship of the two values held in the palette and their storage order.

The data compression apparatus according to claim 2, wherein the moving image data is a video stream in units of tile images obtained by dividing, into a predetermined size, the image frames of each layer of hierarchical moving image data formed by layering, in order of resolution, a plurality of image frame sequences that represent the image frames of one moving image at different resolutions.

A compressed data reading unit that reads, from a storage device, compressed data in which, for each coding unit formed by dividing a data string in a three-dimensional parameter space in the three-dimensional directions, a palette that holds two of the pixel values as representative values is associated with an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linear interpolation of the representative values;
A decoding unit that generates the intermediate values by linearly interpolating the representative values held in the palette, determines the data contained in each coding unit to be one of the representative values or intermediate values according to the information held in the index, and then reconstructs the original data string based on the arrangement of the coding units;
An output unit for outputting the generated data string;
A data decoding apparatus comprising:

The data decoding apparatus according to claim 10, wherein the compressed data reading unit reads compressed data whose data strings are a Y image sequence having luminance Y as pixel values, a Cb image sequence having color difference Cb as pixel values, and a Cr image sequence having color difference Cr as pixel values, each corresponding to an image frame sequence constituting a moving image,
the decoding unit generates data of the Y image sequence, the Cb image sequence, and the Cr image sequence by reconstructing the arrangement of pixels based on the arrangement of the coding units, and
the output unit outputs to a display device data of a YCbCr image sequence representing the image frame sequence, based on the data of the Y image sequence, the Cb image sequence, and the Cr image sequence.

The data decoding apparatus according to claim 10 or 11, wherein the compressed data further includes a division pattern map that holds, for each predetermined region of the three-dimensional parameter space, information identifying the division pattern in the three-dimensional directions, and
the decoding unit identifies the arrangement of the coding units based on the division pattern indicated by the division pattern map and reconstructs the original data arrangement based on that arrangement.

The data decoding apparatus according to any one of claims 10 to 12, wherein the compressed data is stored in the storage device in storage units, each grouping the palettes and indexes of the coding units within a predetermined region of the original three-dimensional parameter space, and
the compressed data reading unit identifies the storage unit for each region to be decoded and reads the palettes and indexes contained in that storage unit.

Reading a data string in a three-dimensional parameter space to be compressed from a storage device;
Dividing the data string in the three-dimensional direction to form a coding unit;
Generating, for each coding unit, a palette that holds two values of the data as representative values and an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linear interpolation of the representative values, and storing the palette and the index in the storage device as compressed data;
A data compression method comprising:

Reading, from a storage device, compressed data in which, for each coding unit formed by dividing a data string in a three-dimensional parameter space in the three-dimensional directions, a palette that holds two of the pixel values as representative values is associated with an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linear interpolation of the representative values;
Generating the intermediate values by linearly interpolating the representative values held in the palette, determining the data contained in each coding unit to be one of the representative values or intermediate values according to the information held in the index, and then reconstructing the original data string based on the arrangement of the coding units;
Outputting the generated data string to an output device;
A data decoding method comprising:

A function for reading a data string in a three-dimensional parameter space to be compressed from a storage device;
A function of dividing the data string in the three-dimensional direction to form a coding unit;
A function of generating, for each coding unit, a palette that holds two values of the data as representative values and an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linear interpolation of the representative values, and storing the palette and the index in the storage device as compressed data;
A computer program for causing a computer to realize the above.

A function of reading, from a storage device, compressed data in which, for each coding unit formed by dividing a data string in a three-dimensional parameter space in the three-dimensional directions, a palette that holds two of the pixel values as representative values is associated with an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linear interpolation of the representative values;
A function of generating the intermediate values by linearly interpolating the representative values held in the palette, determining the data contained in each coding unit to be one of the representative values or intermediate values according to the information held in the index, and then reconstructing the original data string based on the arrangement of the coding units;
A function of outputting the generated data string to an output device;
A computer program for causing a computer to realize the above.

A function for reading a data string in a three-dimensional parameter space to be compressed from a storage device;
A function of dividing the data string in the three-dimensional direction to form a coding unit;
A function of generating, for each coding unit, a palette that holds two values of the data as representative values and an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linear interpolation of the representative values, and storing the palette and the index in the storage device as compressed data;
A recording medium on which is recorded a computer program for causing a computer to realize the above.

A function of reading, from a storage device, compressed data in which, for each coding unit formed by dividing a data string in a three-dimensional parameter space in the three-dimensional directions, a palette that holds two of the pixel values as representative values is associated with an index that holds, in place of the original data of the coding unit, information designating one of the representative values and a plurality of intermediate values determined by linear interpolation of the representative values;
A function of generating the intermediate values by linearly interpolating the representative values held in the palette, determining the data contained in each coding unit to be one of the representative values or intermediate values according to the information held in the index, and then reconstructing the original data string based on the arrangement of the coding units;
A function of outputting the generated data string to an output device;
A recording medium on which is recorded a computer program for causing a computer to realize the above.

A data structure of a compressed moving image file, in which a palette and an index, generated for each coding unit formed by dividing in space and time each of a Y image sequence having luminance Y as pixel values, a Cb image sequence having color difference Cb as pixel values, and a Cr image sequence having color difference Cr as pixel values, the sequences corresponding to an image frame sequence constituting a moving image, are associated with each other and arranged in correspondence with image regions of the image frames, the palette holding two of the pixel values as representative values and the index holding, for each pixel, information designating one of the representative values and a plurality of intermediate values determined by linear interpolation of the representative values.

The data structure of a compressed moving image file according to claim 20, further comprising a division pattern map that holds, for each predetermined image region of the image frames, information identifying the division pattern used in the space-time division.

A recording medium on which is recorded a compressed moving image file having the data structure according to claim 20 or 21.