JP2009164937A

JP2009164937A - Motion image multiplexing method, file reading method and apparatus, program thereof and computer-readable recording medium

Info

Publication number: JP2009164937A
Application number: JP2008001010A
Authority: JP
Inventors: Hideaki Kimata; 英明木全; Shinya Shimizu; 信哉志水; Kazuto Kamikura; 一人上倉; Yoshiyuki Yajima; 由幸八島
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2008-01-08
Filing date: 2008-01-08
Publication date: 2009-07-23
Anticipated expiration: 2028-01-08
Also published as: JP5128963B2

Abstract

<P>PROBLEM TO BE SOLVED: To reduce the burden in taking out coded data of motion images of desired color space, etc., from a file storing multiplexed coded data created by prediction coding among a plurality of motion images having different color spaces, etc. <P>SOLUTION: A multiplexer 10 allows a management information multiplexing section 116 to multiplex the position information in a file of frame header of a motion image set by a frame header position information setting section 114, a motion image number for discriminating a motion image set by a motion image number setting section 112 and color space information indicating the color space of the frame set by the color space information setting section 113, and sets the multiplexed information at a specific position of the coded data. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は，フレーム間予測符号化方式を使った画像符号化データの多重化に関する技術である。 The present invention is a technique related to multiplexing of encoded image data using an inter-frame predictive encoding method.

映像符号化には，スケーラブル符号化や多視点映像符号化といった，レイヤ符号化を適用する符号化方式がある。スケーラブル符号化では，ＳＮＲスケーラビリティや空間スケーラビリティ等の機能を実現できる。ＳＮＲスケーラビリティとは，ベースレイヤの符号化データに対して符号化データを追加することで画像の画品質を向上する機能であり，空間スケーラビリティとは，ベースレイヤの符号化データに対して符号化データを追加することで画像の空間解像度を向上する機能である。 Video coding includes coding methods that apply layer coding, such as scalable coding and multi-view video coding. In scalable coding, functions such as SNR scalability and spatial scalability can be realized. The SNR scalability is a function that improves the image quality of an image by adding the encoded data to the base layer encoded data. The spatial scalability is the encoded data for the base layer encoded data. This is a function that improves the spatial resolution of an image by adding.

スケーラブル符号化の概念図を図１３に示す。図１３に示すように，復号画像あるいは符号化情報をレイヤ間で再利用することにより，符号量の削減が図られている。 A conceptual diagram of scalable encoding is shown in FIG. As shown in FIG. 13, the amount of code is reduced by reusing a decoded image or encoded information between layers.

近年では，映像符号化方式Ｈ．２６４の拡張方式としてＳＶＣと呼ぶ方式がＡｎｎｅｘＧとして標準化されている（非特許文献１参照）。 In recent years, the video encoding method H.264 has been improved. As a H.264 extension method, a method called SVC is standardized as Annex G (see Non-Patent Document 1).

また，多視点映像符号化においてもレイヤ符号化が適用される。近年では，やはり映像符号化方式Ｈ．２６４の拡張方式としてＭＶＣと呼ぶ方式が標準化されようとしている。ＭＶＣでは，複数の視点映像をまとめて符号化する仕組みが採用されている。動画像の符号化では動き補償符号化が利用されるが，多視点映像の場合には，動き補償に加えて，視点映像間でフレーム間予測符号化を適用する視差補償符号化を利用する。ＭＶＣでは視差補償を利用することで，個別に視点映像を符号化するよりも圧縮効率を向上できる。また，ＭＶＣでは，アプリケーションとして自由視点映像といった，視聴者が自由に視点位置を選択できる映像表現を想定している（非特許文献２，非特許文献３参照）。 Layer coding is also applied in multi-view video coding. In recent years, the video encoding system H.264 A method called MVC is being standardized as an H.264 extension method. In MVC, a mechanism for collectively encoding a plurality of viewpoint videos is employed. In motion picture encoding, motion compensation encoding is used. In the case of multi-view video, in addition to motion compensation, disparity compensation encoding that applies inter-frame prediction encoding between viewpoint videos is used. In MVC, by using parallax compensation, the compression efficiency can be improved as compared to encoding the viewpoint video individually. In addition, MVC assumes a video expression that allows a viewer to freely select a viewpoint position, such as a free viewpoint video as an application (see Non-Patent Document 2 and Non-Patent Document 3).

このため，映像シーンを非常に多くのカメラで撮影しておき，視聴者が見たい視点位置の映像を選択するような利用方法が想定されている。すなわち，撮影した多くの視点映像データのうち，いくつかを選択して復号して再生することが想定されている。このように非常に多くの視点映像からいくつかの視点映像を選択して復号する機能を視点映像スケーラビリティと呼ぶ。ＭＶＣは多視点映像を想定した符号化方式ではあるが，まとめて符号化する対象はカメラ映像に限定されない。すなわち，一般的に複数の動画像をまとめて圧縮する符号化方式と考えることができる。 For this reason, it is assumed that the video scene is photographed by a large number of cameras and the video at the viewpoint position that the viewer wants to view is selected. That is, it is assumed that some of the captured viewpoint video data are selected, decoded, and reproduced. Such a function of selecting and decoding several viewpoint videos from a very large number of viewpoint videos is called viewpoint video scalability. MVC is an encoding method that assumes multi-view video, but the target to be encoded together is not limited to camera video. That is, it can be considered as a coding method that generally compresses a plurality of moving images together.

このＭＶＣでは，前述のとおり，非常に多くの動画像からいくつかの動画像を選択して復号することを実現するため，スケーラブル符号化のように１つのベースレイヤに対してレイヤ構造を持つのではなく，複数のベースレイヤに対してレイヤ構造を持つような構成となる。 As described above, this MVC has a layer structure for one base layer as in scalable coding in order to select and decode several moving images from a large number of moving images. Instead, the configuration is such that a plurality of base layers have a layer structure.

図１４はＭＶＣの参照関係の例，図１５はＭＶＣの符号化データの構成例を示す図である。例えば，図１４に示すような参照関係がある場合には，ＭＶＣの符号化データは同じ時刻のフレームについて図１５に示すような構成になる。この同じ時刻のフレームの固まりをアクセスユニットと呼ぶ。ＭＶＣではベースレイヤに相当するレイヤが複数存在する。ＭＶＣでは動画像間の参照関係をサブセットＳＰＳ（subset＿seq ＿parameter ＿set ）の中のＭＶＣ拡張部分で記述する。サブセットＳＰＳは符号化データ中に含まれ，複数設定できる。各フレームでは，利用するサブセットＳＰＳを指定することができる。 FIG. 14 is a diagram showing an example of reference relationship of MVC, and FIG. 15 is a diagram showing a configuration example of encoded data of MVC. For example, when there is a reference relationship as shown in FIG. 14, the encoded data of MVC has a configuration as shown in FIG. 15 for frames at the same time. This group of frames at the same time is called an access unit. In MVC, there are a plurality of layers corresponding to the base layer. In MVC, a reference relationship between moving images is described by an MVC extension in a subset SPS (subset_seq_parameter_set). The subset SPS is included in the encoded data, and a plurality of subsets can be set. In each frame, a subset SPS to be used can be designated.

さらに，ＭＶＣでは非常に多くの視点映像を扱うため，高い圧縮効率が望まれる。撮影した映像を圧縮するのではなく，ある一部の視点映像の空間解像度やビット深度を小さくすることで情報量を減らして圧縮できれば，圧縮効率を向上できる。また，色空間についても，全ての視点映像で同一にする必要はない。撮影に利用するカメラによっては，得られる映像信号の色空間は異なる場合がある。ここで想定する「色空間が異なる」ことの意味は，３色であればＲＧＢやＹＵＶ等で色空間が正に異なることや，また，グレースケールと３色のように色数が異なることや，色が同じであっても特定の色だけサンプリング周波数が異なることを指す。このように，ＭＶＣでは，空間解像度，ビット深度，色空間の異なる複数の視点映像をまとめて符号化する場合がある。視点映像ごとに空間解像度やビット深度や色空間を指定できるが，このような映像信号の表現情報はパラメータセットとして符号化データの中に含まれる（非特許文献４参照）。 Furthermore, since MVC handles a large number of viewpoint videos, high compression efficiency is desired. The compression efficiency can be improved if the amount of information can be reduced by reducing the spatial resolution and bit depth of a certain viewpoint video rather than compressing the shot video. Also, the color space need not be the same for all viewpoint videos. Depending on the camera used for shooting, the color space of the obtained video signal may be different. The meaning of “different color spaces” assumed here means that if there are three colors, the color spaces are exactly different for RGB, YUV, etc., and that the number of colors is different, such as grayscale and three colors. , It means that the sampling frequency differs by a specific color even if the colors are the same. As described above, in MVC, a plurality of viewpoint videos having different spatial resolution, bit depth, and color space may be encoded together. Although spatial resolution, bit depth, and color space can be specified for each viewpoint video, such video signal expression information is included in the encoded data as a parameter set (see Non-Patent Document 4).

一方，このような映像符号化データをファイル化して保存するために，ファイルフォーマット技術が利用される。ファイルフォーマットとは，符号化データを多重化するための技術でもあり，符号化データストリームを構造化しておくことで，特定の情報に関する符号化データを低遅延で取り出す機能を有する。一般的にファイルフォーマットでは，各種のデータストリームをトラックに格納して，トラックの情報を特定の時刻を単位に分割して多重化する。各トラックには，映像または音声の符号化データの他に，ＣＧデータや著作権情報などのメタデータが格納される。 On the other hand, a file format technique is used to store such encoded video data as a file. The file format is also a technique for multiplexing encoded data, and has a function of extracting encoded data related to specific information with low delay by structuring the encoded data stream. Generally, in a file format, various data streams are stored in a track, and track information is divided and multiplexed in units of specific times. Each track stores metadata such as CG data and copyright information in addition to video or audio encoded data.

もっとも簡単なファイルフォーマットとしては，ＡＶＩフォーマットが挙げられる。ＡＶＩフォーマットは，映像データと音声データを１フレーム単位に多重化する。そのほかにも国際標準化されているフォーマットとして，ＩＳＯメディアファイルフォーマットが挙げられる。ＩＳＯフォーマットでは，フレームに相当するものとしてサンプルを定義し，サンプルを単位にトラックを分割して多重化する。 The simplest file format is the AVI format. In the AVI format, video data and audio data are multiplexed in units of one frame. Another ISO standard file format is an international standardized format. In the ISO format, a sample is defined as being equivalent to a frame, and a track is divided and multiplexed in units of samples.

ＩＳＯメディアファイルフォーマットでは，符号化データの他に，フレーム番号等の時刻情報とフレーム先頭のファイル中の位置情報を管理情報として，別のトラックとして構成することができる。ファイルを再生する場合には，この管理情報を記録したトラックを調べることで，フレームの先頭位置情報を知ることができ，所望のフレームの先頭に低遅延でアクセスすることが可能となる。 In the ISO media file format, in addition to the encoded data, time information such as a frame number and position information in the file at the head of the frame can be configured as separate tracks as management information. When playing back a file, by checking the track in which this management information is recorded, it is possible to know the start position information of the frame, and to access the start of the desired frame with low delay.

ファイル中におけるトラックや管理情報の関係の例を図１６に示す。符号化データトラックには，各動画像の符号化データが格納され，管理情報トラックには，各時刻におけるフレームの位置情報等が格納される。前述したＳＶＣ用にもファイルフォーマットが提案されている。ＳＶＣもトラックを単位にファイルを構成する（非特許文献５参照）。
Heiko Schwarz, Detlev Marpe, and Thomas Wiegand,“Overview of the Scalable Video Coding Extension of the H.264/AVC Standard ”, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL.17, NO.9, SEPTEMBER 2007, pp.1103-1120. 木全英明，“ＭＰＥＧ３ＤＡＶ国際標準化の動向”, 映像情報メディア学会誌 60, 2, pp.143-149, (2006)．木全英明，“多視点映像符号化ＭＶＣの国際標準化動向”，映像情報メディア学会誌 61, 4, pp.426-430, (April 2007)．木全英明, 志水信哉, 上倉一人, 八島由幸, “縮小参照画像を用いた多視点映像符号化の一検討”, 信学技報 MVE2007-25, pp.13-18, 2007 ． Peter Amon, Thomas Rathgen, and David Singer, “File Format for Scalable Video Coding ”, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL.17, NO.9, SEPTEMBER 2007, pp.1174-1185. An example of the relationship between tracks and management information in a file is shown in FIG. The encoded data track stores encoded data of each moving image, and the management information track stores frame position information at each time. A file format has also been proposed for the SVC described above. The SVC also forms a file in units of tracks (see Non-Patent Document 5).
Heiko Schwarz, Detlev Marpe, and Thomas Wiegand, “Overview of the Scalable Video Coding Extension of the H.264 / AVC Standard”, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL.17, NO.9, SEPTEMBER 2007, pp .1103-1120. Hideaki Kizen, “MPEG 3DAV international standardization trends”, Journal of the Institute of Image Information and Television Engineers 60, 2, pp.143-149, (2006). Hideaki Kizen, “International standardization trend of multi-view video coding MVC”, Journal of the Institute of Image Information and Television Engineers 61, 4, pp.426-430, (April 2007). Hideaki Kizen, Shinya Shimizu, Hitoshi Uekura, Yoshiyuki Yashima, “A Study on Multi-View Video Coding Using Reduced Reference Images”, IEICE Technical Report MVE2007-25, pp.13-18, 2007. Peter Amon, Thomas Rathgen, and David Singer, “File Format for Scalable Video Coding”, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL.17, NO.9, SEPTEMBER 2007, pp.1174-1185.

ＭＶＣで空間解像度やビット深度や色空間が異なる視点映像がある場合に，特定の空間解像度やビット深度や色空間の視点映像のみを取り出すことができることは好適である。例えば，特定の空間解像度やビット深度や色空間の符号化データのみを復号できるような復号器を持っている場合に，一部の視点映像だけでも復号して画像を表示できるからである。 When there is a viewpoint video with different spatial resolution, bit depth, and color space in MVC, it is preferable that only a viewpoint video with a specific spatial resolution, bit depth, or color space can be extracted. For example, when a decoder capable of decoding only encoded data in a specific spatial resolution, bit depth, or color space is provided, an image can be displayed by decoding only a part of viewpoint videos.

各視点映像の空間解像度やビット深度や色空間の情報はパラメータセットに記述されて符号化データの中に多重化されている。したがって，所望の空間解像度やビット深度や色空間の視点映像の符号化データを取り出す場合には，該当するパラメータセットの情報を符号化データ中から探し出す必要がある。このため，特にファイルに蓄積されている場合には，ファイルの先頭からパラメータセットを順次探して復号する必要があり，大きな負荷がかかっていた。 Information on the spatial resolution, bit depth, and color space of each viewpoint video is described in a parameter set and multiplexed in the encoded data. Therefore, when extracting encoded data of a viewpoint video in a desired spatial resolution, bit depth, or color space, it is necessary to search for information of the corresponding parameter set from the encoded data. For this reason, particularly when the data is stored in a file, it is necessary to sequentially search and decode the parameter set from the beginning of the file, which is a heavy load.

すなわち，本発明の課題は以下のとおりである。複数の動画像をまとめて符号化する場合に，色空間や空間解像度やビット深度の異なる動画像をまとめて符号化することを想定する。例えば撮影されるカメラの性能によって，これらの値は異なることがあり得る。このような動画像の符号化において，符号化データ中に個々の色空間や空間解像度やビット深度の情報を含めることができる。しかしながら，従来の符号化データの構成では，復号側では，これらの情報は符号化データを復号しなければ分からないため，所望の色空間や空間解像度やビット深度の動画像の符号化データを取り出すためには，大きな負荷がかかるという問題があった。 That is, the subject of this invention is as follows. When a plurality of moving images are encoded together, it is assumed that moving images having different color spaces, spatial resolutions, and bit depths are encoded together. For example, these values can vary depending on the performance of the camera being photographed. In such moving image encoding, information on individual color space, spatial resolution, and bit depth can be included in the encoded data. However, in the conventional encoded data configuration, since the decoding side does not know this information unless the encoded data is decoded, the encoded data of the moving image having the desired color space, spatial resolution, and bit depth is extracted. Therefore, there was a problem that a large load was applied.

本発明は，このような所望の色空間や空間解像度やビット深度の動画像の符号化データを取り出すときの負荷を低減する新しい技術手段を提供することを目的とする。 An object of the present invention is to provide a new technical means for reducing the load when extracting encoded data of a moving image having such a desired color space, spatial resolution, and bit depth.

本発明は，以上の課題を次のように解決する。符号化データを多重化するファイルフォーマット技術がある。このファイルフォーマット技術では，符号化データと管理情報に分けて，管理情報に各フレームの先頭位置情報を格納することが行われている。本発明では，このような管理情報に，フレーム先頭位置情報のほかに，色空間情報や空間解像度情報やビット深度情報も含める。これにより，管理情報を読み取るだけで，所望の色空間や空間解像度やビット深度のフレームの先頭位置を知ることができるため，そのようなフレームの符号化データを低負荷で取り出すことができる。 The present invention solves the above problems as follows. There is a file format technique for multiplexing encoded data. In this file format technique, the head position information of each frame is stored in the management information separately into encoded data and management information. In the present invention, such management information includes color space information, spatial resolution information, and bit depth information in addition to the frame head position information. As a result, it is possible to know the head position of a frame having a desired color space, spatial resolution, and bit depth simply by reading the management information, so that encoded data of such a frame can be extracted with a low load.

具体的には，本発明は，異なる色空間を持つ複数の動画像の間で予測符号化して作成された符号化データを多重化してファイルを構成する多重化方法であって，動画像のフレーム先頭のファイル中の位置情報と，動画像を区別する動画像番号と，該フレームの色空間を示す色空間情報とを多重化し，符号化データの特定の位置に設定することを特徴とする。 Specifically, the present invention is a multiplexing method for multiplexing encoded data created by predictive encoding between a plurality of moving images having different color spaces to form a file, and comprising a frame of a moving image Position information in the first file, a moving picture number for distinguishing moving pictures, and color space information indicating the color space of the frame are multiplexed and set at a specific position of encoded data.

この多重化方法によれば，符号化データ中の特定の位置にある情報を復号することで，すべての符号化データを復号することなく，ファイルから所望の色空間の動画像のフレームの先頭位置を求めて，そのフレームの符号化データを取り出すことができる。同時に，符号化データを復号することなく，その動画像の動画像番号も求めることができる。これにより，所望の色空間の動画像の動画像番号が予め分かっている場合に，符号化データ中に含まれる色空間情報を探索して復号することなく，低遅延で，所望の動画像の符号化データを復号して再生することができる。 According to this multiplexing method, by decoding information at a specific position in the encoded data, the start position of the frame of the moving image frame in the desired color space from the file without decoding all the encoded data. And the encoded data of the frame can be extracted. At the same time, the moving image number of the moving image can be obtained without decoding the encoded data. As a result, when the moving image number of the moving image in the desired color space is known in advance, the desired moving image can be recorded with low delay without searching for and decoding the color space information included in the encoded data. The encoded data can be decoded and reproduced.

また，本発明は，異なる色空間を持つ複数の動画像の間で予測符号化して作成された符号化データを多重化してファイルを構成する多重化方法であって，動画像のフレーム先頭のファイル中の位置情報と，動画像を区別する動画像番号と，該フレームの色空間を示す色空間情報を第一のトラックとし，符号化データを第二のトラックとして，第一のトラックと第二のトラックをそれぞれ独立に多重化することを特徴とする。 The present invention also relates to a multiplexing method for forming a file by multiplexing encoded data created by predictive encoding between a plurality of moving images having different color spaces, and comprising a file at the head of a frame of a moving image. The position information in the image, the moving image number for distinguishing the moving image, and the color space information indicating the color space of the frame are set as the first track, the encoded data as the second track, the first track and the second track The tracks are multiplexed independently of each other.

この多重化方法によれば，第一トラックに含まれる色空間情報を復号することで，第二トラックの符号化データを復号することなく，ファイルから所望の色空間の動画像のフレームの先頭位置を求めて，そのフレームの符号化データを取り出すことができる。同時に，符号化データを復号することなく，その動画像の動画像番号も求めることができる。これにより，所望の色空間の動画像の動画像番号が予め分かっている場合に，符号化データ中に含まれる色空間情報を復号することなく，低遅延で，所望の動画像の符号化データを復号して再生することができる。 According to this multiplexing method, by decoding the color space information included in the first track, the start position of the frame of the moving image frame in the desired color space from the file without decoding the encoded data of the second track. And the encoded data of the frame can be extracted. At the same time, the moving image number of the moving image can be obtained without decoding the encoded data. As a result, when the moving image number of the moving image in the desired color space is known in advance, the encoded data of the desired moving image can be obtained with low delay without decoding the color space information included in the encoded data. Can be decoded and played back.

また，本発明は，異なる空間解像度を持つ複数の動画像の間で予測符号化して作成された符号化データを多重化してファイルを構成する多重化方法であって，動画像のフレーム先頭のファイル中の位置情報と，動画像を区別する動画像番号と，該フレームの空間解像度情報を多重化し，符号化データの特定の位置に設定することを特徴とする。 The present invention also relates to a multiplexing method for forming a file by multiplexing encoded data created by predictive encoding between a plurality of moving images having different spatial resolutions. It is characterized in that the position information in it, the moving picture number for distinguishing moving pictures, and the spatial resolution information of the frame are multiplexed and set at a specific position of the encoded data.

この多重化方法によれば，符号化データ中の特定の位置にある情報を復号することで，すべての符号化データを復号することなく，ファイルから所望の空間解像度の動画像のフレームの先頭位置を求めて，そのフレームの符号化データを取り出すことができる。同時に，符号化データを復号することなく，その動画像の動画像番号も求めることができる。これにより，所望の空間解像度の動画像の動画像番号が予め分かっている場合に，符号化データ中に含まれる空間解像度情報を探索して復号することなく，低遅延で，所望の動画像の符号化データを復号して再生することができる。 According to this multiplexing method, by decoding information at a specific position in the encoded data, the start position of the frame of the moving image frame having a desired spatial resolution from the file without decoding all the encoded data. And the encoded data of the frame can be extracted. At the same time, the moving image number of the moving image can be obtained without decoding the encoded data. As a result, when the moving image number of a moving image having a desired spatial resolution is known in advance, the desired moving image can be recorded with low delay without searching for and decoding the spatial resolution information included in the encoded data. The encoded data can be decoded and reproduced.

また，本発明は，異なる空間解像度を持つ複数の動画像の間で予測符号化して作成された符号化データを多重化してファイルを構成する多重化方法であって，動画像のフレーム先頭のファイル中の位置情報と，動画像を区別する動画像番号と，該フレームの空間解像度情報を第一のトラックとし，符号化データを第二のトラックとして，第一のトラックと第二のトラックをそれぞれ独立に多重化することを特徴とする。 The present invention also relates to a multiplexing method for forming a file by multiplexing encoded data created by predictive encoding between a plurality of moving images having different spatial resolutions. The position information in the image, the moving image number for distinguishing the moving image, the spatial resolution information of the frame as the first track, the encoded data as the second track, the first track and the second track, respectively. It is characterized by multiplexing independently.

この多重化方法によれば，第一トラックに含まれる空間解像度情報を復号することで，第二トラックの符号化データを復号することなく，ファイルから所望の空間解像度の動画像のフレームの先頭位置を求めて，そのフレームの符号化データを取り出すことができる。同時に，符号化データを復号することなく，その動画像の動画像番号も求めることができる。これにより，所望の空間解像度の動画像の動画像番号が予め分かっている場合に，符号化データ中に含まれる空間解像度情報を復号することなく，低遅延で，所望の動画像の符号化データを復号して再生することができる。 According to this multiplexing method, by decoding the spatial resolution information contained in the first track, the start position of the frame of the moving image frame having a desired spatial resolution from the file without decoding the encoded data of the second track. And the encoded data of the frame can be extracted. At the same time, the moving image number of the moving image can be obtained without decoding the encoded data. As a result, when the moving image number of a moving image having a desired spatial resolution is known in advance, the encoded data of the desired moving image can be obtained with low delay without decoding the spatial resolution information included in the encoded data. Can be decoded and played back.

また，本発明は，異なるビット深度を持つ複数の動画像の間で予測符号化して作成された符号化データを多重化してファイルを構成する多重化方法であって，動画像のフレーム先頭のファイル中の位置情報と，動画像を区別する動画像番号と，該フレームのビット深度情報を多重化し，符号化データの特定の位置に設定することを特徴とする。 The present invention also relates to a multiplexing method for forming a file by multiplexing encoded data created by predictive coding between a plurality of moving images having different bit depths, and comprising a file at the head of a frame of a moving image. It is characterized in that the position information in it, the moving image number for distinguishing moving images, and the bit depth information of the frame are multiplexed and set at a specific position of the encoded data.

この多重化方法によれば，符号化データ中の特定の位置にある情報を復号することで，すべての符号化データを復号することなく，ファイルから所望のビット深度の動画像のフレームの先頭位置を求めて，そのフレームの符号化データを取り出すことができる。同時に，符号化データを復号することなく，その動画像の動画像番号も求めることができる。これにより，所望のビット深度の動画像の動画像番号が予め分かっている場合に，符号化データ中に含まれるビット深度情報を探索して復号することなく，低遅延で，所望の動画像の符号化データを復号して再生することができる。 According to this multiplexing method, by decoding the information at a specific position in the encoded data, the start position of the frame of the moving image frame of the desired bit depth from the file without decoding all the encoded data. And the encoded data of the frame can be extracted. At the same time, the moving image number of the moving image can be obtained without decoding the encoded data. As a result, when the moving image number of a moving image having a desired bit depth is known in advance, the desired moving image can be recorded with low delay without searching for and decoding the bit depth information included in the encoded data. The encoded data can be decoded and reproduced.

また，本発明は，異なるビット深度を持つ複数の動画像の間で予測符号化して作成された符号化データを多重化してファイルを構成する多重化方法であって，動画像のフレーム先頭のファイル中の位置情報と，動画像を区別する動画像番号と，該フレームのビット深度情報を第一のトラックとし，符号化データを第二のトラックとして，第一のトラックと第二のトラックをそれぞれ独立に多重化することを特徴とする。 The present invention also relates to a multiplexing method for forming a file by multiplexing encoded data created by predictive encoding between a plurality of moving images having different bit depths, and comprising a file at the head of a frame of a moving image Position information, moving image number for distinguishing moving images, bit depth information of the frame as the first track, encoded data as the second track, first track and second track respectively It is characterized by multiplexing independently.

この多重化方法によれば，第一トラックに含まれるビット深度情報を復号することで，第二トラックの符号化データを復号することなく，ファイルから所望のビット深度の動画像のフレームの先頭位置を求めて，そのフレームの符号化データを取り出すことができる。同時に，符号化データを復号することなく，その動画像の動画像番号も求めることができる。これにより，所望のビット深度の動画像の動画像番号が予め分かっている場合に，符号化データ中に含まれるビット深度情報を復号することなく，低遅延で，所望の動画像の符号化データを復号して再生することができる。 According to this multiplexing method, by decoding the bit depth information included in the first track, the start position of the frame of the moving image frame having a desired bit depth from the file without decoding the encoded data of the second track. And the encoded data of the frame can be extracted. At the same time, the moving image number of the moving image can be obtained without decoding the encoded data. As a result, when the moving image number of a moving image having a desired bit depth is known in advance, the encoded data of the desired moving image can be obtained with low delay without decoding the bit depth information included in the encoded data. Can be decoded and played back.

なお，以上では，フレーム単位に管理情報を設定する例について説明したが，本発明において，フレームを分割したスライス単位に管理情報を設定してもよい。この場合には，管理情報はフレーム先頭位置の代わりに，スライス先頭位置情報を含む。また，多重化や読み取りもフレーム単位ではなく，スライス単位で行う。 Although an example in which management information is set for each frame has been described above, in the present invention, management information may be set for each slice obtained by dividing a frame. In this case, the management information includes slice head position information instead of the frame head position. In addition, multiplexing and reading are performed not on a frame basis but on a slice basis.

本発明によれば，色空間や空間解像度やビット深度の異なる複数の動画像をまとめて符号化した符号化データを多重化したファイルの中から，符号化データを復号することなく低演算負荷で，特定の色空間や空間解像度やビット深度の動画像の符号化データを取り出すことができる。 According to the present invention, from a file obtained by multiplexing encoded data obtained by collectively encoding a plurality of moving images having different color spaces, spatial resolutions, and bit depths, the encoded data can be reduced without decoding. , Encoded data of a moving image having a specific color space, spatial resolution, or bit depth can be extracted.

以下，図面を用いて本発明の多重化方法と，多重化されたファイルを読み込んで再生する方法の実施例を説明する。 Embodiments of a multiplexing method of the present invention and a method of reading and reproducing a multiplexed file will be described below with reference to the drawings.

第１の実施例では，フレーム先頭の位置情報と動画像番号と色空間情報を管理情報として多重化し，全ての符号化データを多重化した最後に設定する場合の例を示す。動画像はＡ，Ｂ，Ｃ，Ｄの４つあり，予め符号化されていて，各動画像におけるフレーム先頭の位置情報は分かっているものとする。 The first embodiment shows an example in which the frame head position information, moving image number, and color space information are multiplexed as management information, and all encoded data are set at the end of multiplexing. It is assumed that there are four moving images A, B, C, and D, which are encoded in advance, and the position information of the frame head in each moving image is known.

図１に多重化装置の構成を示す。図１に示す多重化装置１０は，動画像Ａ，Ｂ，Ｃ，Ｄの符号化データをそれぞれ入力する動画像Ａ符号化データ入力部１０１，動画像Ｂ符号化データ入力部１０２，動画像Ｃ符号化データ入力部１０３，動画像Ｄ符号化データ入力部１０４と，各動画像のフレーム先頭の動画像中の位置情報をそれぞれ入力する動画像Ａフレーム位置情報入力部１０６，動画像Ｂフレーム位置情報入力部１０７，動画像Ｃフレーム位置情報入力部１０８，動画像Ｄフレーム位置情報入力部１０９と，動画像間の参照関係を示す情報を入力する参照関係データ入力部１０５と，動画像の色空間情報を入力する色空間データ入力部１１７と，動画像の符号化データやフレーム位置情報の入力を切り替えるデータ切り替え部１１０と，符号化データを多重化する符号化データ多重化部１１１と，動画像番号を設定する動画像番号設定部１１２と，色空間情報を設定する色空間情報設定部１１３と，フレーム位置の先頭情報を設定するフレーム先頭位置情報設定部１１４と，管理情報を多重化する管理情報多重化部１１６と，符号化データと管理情報を多重化するファイル多重化部１１５とから構成される。 FIG. 1 shows the configuration of the multiplexing apparatus. A multiplexing apparatus 10 shown in FIG. 1 includes a moving image A encoded data input unit 101, a moving image B encoded data input unit 102, and a moving image C that input encoded data of moving images A, B, C, and D, respectively. Encoded data input unit 103, moving image D encoded data input unit 104, moving image A frame position information input unit 106 for inputting position information in the moving image at the head of each moving image, moving image B frame position An information input unit 107, a moving image C frame position information input unit 108, a moving image D frame position information input unit 109, a reference relationship data input unit 105 for inputting information indicating a reference relationship between moving images, and a moving image color A color space data input unit 117 for inputting spatial information, a data switching unit 110 for switching input of encoded data of moving images and frame position information, and a code for multiplexing the encoded data A data multiplexing unit 111, a moving image number setting unit 112 that sets a moving image number, a color space information setting unit 113 that sets color space information, and a frame head position information setting unit 114 that sets head information of a frame position And a management information multiplexing unit 116 that multiplexes management information, and a file multiplexing unit 115 that multiplexes encoded data and management information.

以上のうち，動画像番号設定部１１２では，動画像Ａ，Ｂ，Ｃ，Ｄに対してそれぞれ動画像番号０，１，２，３を設定するものとする。動画像ＡからＤまでのフレーム位置情報入力部１０６〜１０９は，次のフレームの位置情報を入力するものとする。フレーム先頭位置情報設定部１１４は，各動画像のフレーム位置情報をカウントし，符号化データ多重化部１１１で符号化データを多重化した後における各フレームの位置情報を設定するものとする。色空間情報設定部１１３は，色空間がＹＵＶの３色の場合には色空間情報として１を設定し，グレースケールの場合には色空間情報として０を設定するものとする。ファイル多重化部１１５は，符号化データ多重化部１１１で作成された符号化データの後に，管理情報多重化部１１６で作成された管理情報を設定するものとする。符号化データ多重化部１１１は，参照関係データがある場合には各動画像の符号化データの前に多重化するものとする。 Among the above, the moving image number setting unit 112 sets moving image numbers 0, 1, 2, and 3 for moving images A, B, C, and D, respectively. Frame position information input units 106 to 109 for moving images A to D are assumed to input position information of the next frame. The frame head position information setting unit 114 counts the frame position information of each moving image, and sets the position information of each frame after the encoded data multiplexing unit 111 multiplexes the encoded data. The color space information setting unit 113 sets 1 as the color space information when the color space is three colors of YUV, and sets 0 as the color space information when the color space is grayscale. The file multiplexing unit 115 sets the management information created by the management information multiplexing unit 116 after the coded data created by the coded data multiplexing unit 111. When there is reference relation data, the encoded data multiplexing unit 111 multiplexes the encoded data before the encoded data of each moving image.

データ切り替え部１１０は，参照関係データに従い，次のように符号化データとファイル先頭位置情報を切り替えるものとする。動画像ＡからＤまで順に，まず独立して復号できる動画像かどうかを判定する。独立して復号できる場合には，そのまま選択する。独立して復号できない場合には，参照する動画像の符号化データを既に選択したかどうかを判定する。既に選択した場合には，そのまま選択し，まだ選択していない場合には，参照する動画像が全て選択されるまで，その動画像を選択しない。 The data switching unit 110 switches between encoded data and file head position information as follows according to the reference relation data. In order from the moving images A to D, it is first determined whether the moving images can be decoded independently. If it can be decoded independently, select it as it is. If it cannot be decoded independently, it is determined whether the encoded data of the moving image to be referred to has already been selected. If it has already been selected, it is selected as it is. If it has not been selected yet, the moving image is not selected until all the moving images to be referenced are selected.

動画像間の参照関係は全て動画像ＡとＣは独立して復号でき，動画像Ｂは動画像Ａを参照し，動画像Ｄは動画像Ｃを参照するものとする。 As for the reference relationship between the moving images, the moving images A and C can be decoded independently, the moving image B refers to the moving image A, and the moving image D refers to the moving image C.

動画像の色空間は動画像ＡとＢはＹＵＶ３色であり，動画像ＣとＤはグレースケールであるとする。 As for the color space of the moving image, the moving images A and B are YUV three colors, and the moving images C and D are grayscale.

以上の前提で動画像Ａ，Ｂ，Ｃ，Ｄの各フレームを多重化する場合には，動画像の各フレームは，次のように多重化される。 When the frames of moving images A, B, C, and D are multiplexed on the above assumption, the frames of the moving images are multiplexed as follows.

まず，先頭フレームについては，次のように多重化される。参照関係データ入力部１０５は，動画像間の参照関係データを入力する。データ切り替え部１１０は，参照関係データに従って符号化データを多重化する順序を決定する。ここでは，次の順序に決定する。動画像Ａは独立して復号できるため，まず動画像Ａを選択する。動画像Ｂは動画像Ａのみを参照するため，次に動画像Ｂを選択する。動画像Ｃは独立して復号できるため，次に動画像Ｃを選択する。そして動画像Ｄを選択する。符号化データ多重化部１１１は，参照関係データを多重化する。フレーム先頭位置情報設定部１１４は，参照関係データのデータサイズ分だけファイルサイズを加算する。 First, the first frame is multiplexed as follows. The reference relationship data input unit 105 inputs reference relationship data between moving images. The data switching unit 110 determines the order in which the encoded data is multiplexed according to the reference relation data. Here, the order is determined as follows. Since the moving image A can be decoded independently, the moving image A is first selected. Since the moving image B refers only to the moving image A, the moving image B is selected next. Since the moving image C can be decoded independently, the moving image C is selected next. Then, the moving image D is selected. The encoded data multiplexing unit 111 multiplexes the reference relation data. The frame head position information setting unit 114 adds the file size by the data size of the reference relation data.

多重化する順序に従い，データ切り替え部１１０は，符号化データとフレーム先頭位置情報を切り替える。まず動画像Ａを選択する。動画像Ａ符号化データ入力部１０１は，符号化データを１フレーム分入力し，動画像Ａフレーム位置情報入力部１０６は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号０を設定する。色空間情報設定部１１３は，色空間データ入力部１１７からの入力に基づき，色空間情報として１を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号と色空間情報を多重化する。 In accordance with the order of multiplexing, the data switching unit 110 switches between encoded data and frame head position information. First, the moving image A is selected. The moving image A encoded data input unit 101 inputs encoded data for one frame, and the moving image A frame position information input unit 106 inputs frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 0. The color space information setting unit 113 sets 1 as the color space information based on the input from the color space data input unit 117. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the color space information.

続いてデータ切り替え部１１０は，動画像Ｂを選択する。動画像Ｂ符号化データ入力部１０２は，符号化データを１フレーム分入力し，動画像Ｂフレーム位置情報入力部１０７は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号１を設定する。色空間情報設定部１１３は，色空間情報として１を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号と色空間情報を多重化する。 Subsequently, the data switching unit 110 selects the moving image B. The moving image B encoded data input unit 102 inputs encoded data for one frame, and the moving image B frame position information input unit 107 inputs frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 1. The color space information setting unit 113 sets 1 as the color space information. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the color space information.

続いてデータ切り替え部１１０は，動画像Ｃを選択する。動画像Ｃ符号化データ入力部１０３は，符号化データを１フレーム分入力し，動画像Ｃフレーム位置情報入力部１０８は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号２を設定する。色空間情報設定部１１３は，色空間情報として０を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号と色空間情報を多重化する。 Subsequently, the data switching unit 110 selects the moving image C. The moving image C encoded data input unit 103 inputs encoded data for one frame, and the moving image C frame position information input unit 108 inputs the frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 2. The color space information setting unit 113 sets 0 as the color space information. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the color space information.

続いてデータ切り替え部１１０は，動画像Ｄを選択する。動画像Ｄ符号化データ入力部１０４は，符号化データを１フレーム分入力し，動画像Ｄフレーム位置情報入力部１０９は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号３を設定する。色空間情報設定部１１３は，色空間情報として０を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号と色空間情報を多重化する。 Subsequently, the data switching unit 110 selects the moving image D. The moving image D encoded data input unit 104 inputs encoded data for one frame, and the moving image D frame position information input unit 109 inputs frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 3. The color space information setting unit 113 sets 0 as the color space information. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the color space information.

以上の動画像Ａ，Ｂ，Ｃ，Ｄの処理を多重化するフレーム全てについて続ける。最後に，ファイル多重化部１１５は，符号化データ多重化部１１１で得られる符号化データと，管理情報多重化部１１６で得られる管理情報を多重化してファイルを作成する。以上の処理でファイルに多重化することができる。 The above processing of the moving images A, B, C, and D is continued for all the frames to be multiplexed. Finally, the file multiplexing unit 115 multiplexes the encoded data obtained by the encoded data multiplexing unit 111 and the management information obtained by the management information multiplexing unit 116 to create a file. By the above processing, it can be multiplexed into a file.

本実施例では，予め符号化して得られた符号化データとファイル先頭位置情報から多重化してファイルを作成する例を示したが，符号化データを作成しつつファイルを構成してもよい。 In the present embodiment, an example is shown in which a file is created by multiplexing from encoded data obtained by encoding in advance and file head position information, but a file may be configured while creating encoded data.

次に，多重化されたファイルから特定の動画像で所望の色空間の符号化データを取り出すファイル読み込み方法について動作を説明する。図２にファイル読み込み装置の構成を示す。図２に示すように，ファイル読み込み装置２０は，ファイルを読み込むファイル入力部２０１と，管理情報を読み取る管理情報読み取り部２０２と，管理情報から所望の色空間情報を探索する色空間情報探索部２０４と，管理情報から動画像番号を判定する動画像番号判定部２０３と，管理情報からフレーム先頭位置情報を読み取るフレーム先頭位置情報設定部２０５と，フレーム先頭位置情報に従ってファイルから符号化データを読み取る符号化データ読み取り部２０６とで構成される。 Next, the operation of a file reading method for extracting encoded data of a desired color space with a specific moving image from a multiplexed file will be described. FIG. 2 shows the configuration of the file reading apparatus. As shown in FIG. 2, the file reading device 20 includes a file input unit 201 that reads a file, a management information reading unit 202 that reads management information, and a color space information search unit 204 that searches for desired color space information from the management information. A moving image number determination unit 203 that determines a moving image number from management information, a frame head position information setting unit 205 that reads frame head position information from management information, and a code that reads encoded data from a file according to the frame head position information And a digitized data reading unit 206.

図１の多重化装置１０で作成されたファイルからグレースケールの動画像Ｃの符号化データを読み取る場合の動作を示す。 The operation when reading the encoded data of the grayscale moving image C from the file created by the multiplexing apparatus 10 of FIG. 1 will be described.

ファイル入力部２０１は，ファイルを入力する。管理情報読み取り部２０２は，ファイルの後半にある管理情報を読み取る。色空間情報探索部２０４は，第１フレームで色空間情報がグレースケールの動画像の管理情報を探索する。動画像番号判定部２０３は，探索して得られた管理情報の動画像番号が２であるかどうかを判定する。最初に探索される動画像番号は２であるので，フレーム先頭位置情報設定部２０５は，このフレームのフレーム先頭位置情報を設定する。符号化データ読み取り部２０６は，設定されたファイル位置から符号化データを読み取る。このとき，フレーム先頭位置情報設定部２０５で得られる次のフレームの先頭位置までを読み取る。次に第１フレームで色空間情報探索部２０４で探索されて動画像番号判定部２０３で判定される管理情報の動画像番号は３であるので，そのフレームは復号しない。 The file input unit 201 inputs a file. The management information reading unit 202 reads management information in the second half of the file. The color space information search unit 204 searches for management information of a moving image whose color space information is grayscale in the first frame. The moving image number determination unit 203 determines whether or not the moving image number of the management information obtained by searching is 2. Since the moving image number searched first is 2, the frame head position information setting unit 205 sets the frame head position information of this frame. The encoded data reading unit 206 reads encoded data from the set file position. At this time, the frame head position information setting unit 205 reads up to the head position of the next frame. Next, since the moving image number of the management information searched by the color space information searching unit 204 in the first frame and determined by the moving image number determining unit 203 is 3, the frame is not decoded.

以後のフレームについても，同様に動画像Ｃの符号化データを読み取る。 Similarly, the encoded data of the moving image C is read for the subsequent frames.

ここで動画像Ｃの符号化データのみを読み取るようにしたが，特定の色空間の動画像の符号化データを全て読み取るようにしてもよい。この場合のファイル読み込み装置の構成は図３のファイル読み込み装置２１のようになり，図２の構成から動画像番号判定部２０３を省いた構成となる。この場合に，グレースケールの動画像の符号化データを読み取るときには，次のように処理される。 Although only the encoded data of the moving image C is read here, all the encoded data of the moving image in a specific color space may be read. The configuration of the file reading device in this case is the same as that of the file reading device 21 in FIG. 3, and is a configuration in which the moving image number determination unit 203 is omitted from the configuration in FIG. In this case, when reading the encoded data of the gray scale moving image, the following processing is performed.

ファイル入力部２０１は，ファイルを入力する。管理情報読み取り部２０２は，ファイルの後半にある管理情報を読み取る。色空間情報探索部２０４は，第１フレームで色空間情報がグレースケールの動画像の管理情報を探索する。最初に探索される動画像番号２のフレームについて，フレーム先頭位置情報設定部２０５は，このフレームのフレーム先頭位置情報を設定する。符号化データ読み取り部２０６は，設定されたファイル位置から符号化データを読み取る。このとき，フレーム先頭位置情報設定部２０５で得られる次のフレームの先頭位置までを読み取る。次に第１フレームで探索される動画像番号３のフレームについて，フレーム先頭位置情報設定部２０５は，このフレームのフレーム先頭位置情報を設定する。符号化データ読み取り部２０６は，設定されたファイル位置から符号化データを読み取る。このとき，フレーム先頭位置情報設定部２０５で得られる次のフレームの先頭位置までを読み取る。 The file input unit 201 inputs a file. The management information reading unit 202 reads management information in the second half of the file. The color space information search unit 204 searches for management information of a moving image whose color space information is grayscale in the first frame. For the frame of moving image number 2 to be searched first, the frame head position information setting unit 205 sets the frame head position information of this frame. The encoded data reading unit 206 reads encoded data from the set file position. At this time, the frame head position information setting unit 205 reads up to the head position of the next frame. Next, for the frame of moving image number 3 searched in the first frame, the frame head position information setting unit 205 sets the frame head position information of this frame. The encoded data reading unit 206 reads encoded data from the set file position. At this time, the frame head position information setting unit 205 reads up to the head position of the next frame.

以後のフレームについても，同様にグレースケールの動画像の符号化データを読み取る。 For the subsequent frames, the encoded data of the gray scale moving image is similarly read.

このように本実施例によれば，符号化データ中に含まれる色空間情報を復号することなく，特定の色空間情報の符号化データを取り出すことができる。 As described above, according to the present embodiment, encoded data of specific color space information can be extracted without decoding the color space information included in the encoded data.

なお本実施例では，グレースケールの動画像の符号化データを読み込む場合の処理を示したが，ＹＵＶ３色の動画像の符号化データを読み込むようにしてもよい。 In this embodiment, the processing for reading the encoded data of the grayscale moving image is shown, but the encoded data of the YUV three-color moving image may be read.

なお，本実施例の多重化装置１０では，符号化データの後に管理情報を設定したが，ファイルにトラックの構造が設定されている場合には，管理情報を第一トラックに設定し，符号化データを第二トラックに設定してもよい。 In the multiplexing apparatus 10 of this embodiment, the management information is set after the encoded data. However, when the track structure is set in the file, the management information is set in the first track and the encoding is performed. Data may be set in the second track.

図４に，ファイル中におけるトラックや管理情報の構成例を示す。作成されたファイルには，符号化データトラック３０と管理情報トラック４０とがあり，符号化データトラック３０には，動画像Ａ符号化データ３１，動画像Ｂ符号化データ３２，…が格納される。また，管理情報トラック４０には，各時刻Ｔｎごとの管理情報４１，４２，４３，…が格納され，これらには各々，フレーム先頭位置情報，動画像番号，色空間情報などの情報が格納される。 FIG. 4 shows a configuration example of tracks and management information in a file. The created file includes an encoded data track 30 and a management information track 40. In the encoded data track 30, moving image A encoded data 31, moving image B encoded data 32,... Are stored. . Further, management information 41, 42, 43,... For each time Tn is stored in the management information track 40, and information such as frame head position information, moving image number, color space information, etc. is stored in each of these. The

第２の実施例では，異なる空間解像度の動画像データを多重化し，特定の空間解像度の符号化データを読み込む場合の例を示す。第２の実施例では，フレーム先頭の位置情報と動画像番号と空間解像度情報を管理情報として多重化し，全ての符号化データを多重化した最後に設定する場合の例を示す。第１の実施例と同様に，動画像はＡ，Ｂ，Ｃ，Ｄの４つあり，予め符号化されていて，各動画像におけるフレーム先頭の位置情報は分かっているものとする。 The second embodiment shows an example in which moving image data having different spatial resolutions are multiplexed and encoded data having a specific spatial resolution is read. The second embodiment shows an example in which the position information at the beginning of the frame, the moving image number, and the spatial resolution information are multiplexed as management information and set at the end of multiplexing all the encoded data. As in the first embodiment, it is assumed that there are four moving images A, B, C, and D, which are encoded in advance, and the position information of the frame head in each moving image is known.

図５に多重化装置の構成を示す。図５に示す多重化装置１１は，動画像Ａ，Ｂ，Ｃ，Ｄの符号化データをそれぞれ入力する動画像Ａ符号化データ入力部１０１，動画像Ｂ符号化データ入力部１０２，動画像Ｃ符号化データ入力部１０３，動画像Ｄ符号化データ入力部１０４と，各動画像のフレーム先頭の動画像中の位置情報をそれぞれ入力する動画像Ａフレーム位置情報入力部１０６，動画像Ｂフレーム位置情報入力部１０７，動画像Ｃフレーム位置情報入力部１０８，動画像Ｄフレーム位置情報入力部１０９と，動画像間の参照関係を示す情報を入力する参照関係データ入力部１０５と，動画像の空間解像度情報を入力する空間解像度データ入力部１１９と，動画像の符号化データやフレーム位置情報の入力を切り替えるデータ切り替え部１１０と，符号化データを多重化する符号化データ多重化部１１１と，動画像番号を設定する動画像番号設定部１１２と，空間解像度情報を設定する空間解像度情報設定部１１８と，フレーム位置の先頭情報を設定するフレーム先頭位置情報設定部１１４と，管理情報を多重化する管理情報多重化部１１６と，符号化データと管理情報を多重化するファイル多重化部１１５とから構成される。 FIG. 5 shows the configuration of the multiplexing apparatus. The multiplexing apparatus 11 shown in FIG. 5 has a moving image A encoded data input unit 101, a moving image B encoded data input unit 102, and a moving image C that input encoded data of moving images A, B, C, and D, respectively. Encoded data input unit 103, moving image D encoded data input unit 104, moving image A frame position information input unit 106 for inputting position information in the moving image at the head of each moving image, moving image B frame position An information input unit 107, a moving image C frame position information input unit 108, a moving image D frame position information input unit 109, a reference relationship data input unit 105 for inputting information indicating a reference relationship between moving images, and a moving image space Spatial resolution data input unit 119 for inputting resolution information, data switching unit 110 for switching input of encoded data of moving images and frame position information, and multiplexed encoded data An encoded data multiplexing unit 111, a moving image number setting unit 112 that sets a moving image number, a spatial resolution information setting unit 118 that sets spatial resolution information, and frame head position information that sets head information of a frame position The setting unit 114, a management information multiplexing unit 116 that multiplexes management information, and a file multiplexing unit 115 that multiplexes encoded data and management information.

以上のうち，動画像番号設定部１１２では，動画像Ａ，Ｂ，Ｃ，Ｄに対してそれぞれ動画像番号０，１，２，３を設定するものとする。動画像ＡからＤまでのフレーム位置情報入力部１０６〜１０９は，次のフレームの位置情報を入力するものとする。フレーム先頭位置情報設定部１１４は，各動画像のフレーム位置情報をカウントし，符号化データ多重化部１１１で符号化データを多重化した後における各フレームの位置情報を設定するものとする。空間解像度情報設定部１１８は，空間解像度がＶＧＡ（横６４０縦４８０画素）の場合には空間解像度情報として１を設定し，ＱＶＧＡ（横３２０縦２４０画素）の場合には空間解像度情報として０を設定するものとする。ファイル多重化部１１５は，符号化データ多重化部１１１で作成された符号化データの後に，管理情報多重化部１１６で作成された管理情報を設定するものとする。符号化データ多重化部１１１は，参照関係データがある場合には各動画像の符号化データの前に多重化するものとする。 Among the above, the moving image number setting unit 112 sets moving image numbers 0, 1, 2, and 3 for moving images A, B, C, and D, respectively. Frame position information input units 106 to 109 for moving images A to D are assumed to input position information of the next frame. The frame head position information setting unit 114 counts the frame position information of each moving image, and sets the position information of each frame after the encoded data multiplexing unit 111 multiplexes the encoded data. The spatial resolution information setting unit 118 sets 1 as the spatial resolution information when the spatial resolution is VGA (horizontal 640 vertical 480 pixels), and sets 0 as the spatial resolution information when QVGA (horizontal 320 vertical 240 pixels). Shall be set. The file multiplexing unit 115 sets the management information created by the management information multiplexing unit 116 after the coded data created by the coded data multiplexing unit 111. When there is reference relation data, the encoded data multiplexing unit 111 multiplexes the encoded data before the encoded data of each moving image.

動画像間の参照関係は，全て動画像ＡとＣは独立して復号でき，動画像Ｂは動画像Ａを参照し，動画像Ｄは動画像Ｃを参照するものとする。 Regarding the reference relationship between the moving images, all of the moving images A and C can be decoded independently, the moving image B refers to the moving image A, and the moving image D refers to the moving image C.

動画像の空間解像度は動画像ＡとＢはＶＧＡであり，動画像ＣとＤはＱＶＧＡであるとする。 As for the spatial resolution of the moving image, the moving images A and B are VGA, and the moving images C and D are QVGA.

多重化する順序に従い，データ切り替え部１１０は，符号化データとフレーム先頭位置情報を切り替える。まず動画像Ａを選択する。動画像Ａ符号化データ入力部１０１は，符号化データを１フレーム分入力し，動画像Ａフレーム位置情報入力部１０６は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号０を設定する。空間解像度情報設定部１１８は，空間解像度データ入力部１１９からの入力に基づき，空間解像度情報として１を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号と空間解像度情報を多重化する。 In accordance with the order of multiplexing, the data switching unit 110 switches between encoded data and frame head position information. First, the moving image A is selected. The moving image A encoded data input unit 101 inputs encoded data for one frame, and the moving image A frame position information input unit 106 inputs frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 0. The spatial resolution information setting unit 118 sets 1 as the spatial resolution information based on the input from the spatial resolution data input unit 119. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the spatial resolution information.

続いてデータ切り替え部１１０は，動画像Ｂを選択する。動画像Ｂ符号化データ入力部１０２は，符号化データを１フレーム分入力し，動画像Ｂフレーム位置情報入力部１０７は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号１を設定する。空間解像度情報設定部１１８は，空間解像度情報として１を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号と空間解像度情報を多重化する。 Subsequently, the data switching unit 110 selects the moving image B. The moving image B encoded data input unit 102 inputs encoded data for one frame, and the moving image B frame position information input unit 107 inputs frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 1. The spatial resolution information setting unit 118 sets 1 as the spatial resolution information. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the spatial resolution information.

続いてデータ切り替え部１１０は，動画像Ｃを選択する。動画像Ｃ符号化データ入力部１０３は，符号化データを１フレーム分入力し，動画像Ｃフレーム位置情報入力部１０８は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号２を設定する。空間解像度情報設定部１１８は，空間解像度情報として０を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号と空間解像度情報を多重化する。 Subsequently, the data switching unit 110 selects the moving image C. The moving image C encoded data input unit 103 inputs encoded data for one frame, and the moving image C frame position information input unit 108 inputs the frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 2. The spatial resolution information setting unit 118 sets 0 as the spatial resolution information. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the spatial resolution information.

続いてデータ切り替え部１１０は，動画像Ｄを選択する。動画像Ｄ符号化データ入力部１０４は，符号化データを１フレーム分入力し，動画像Ｄフレーム位置情報入力部１０９は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号３を設定する。空間解像度情報設定部１１８は，空間解像度情報として０を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号と空間解像度情報を多重化する。 Subsequently, the data switching unit 110 selects the moving image D. The moving image D encoded data input unit 104 inputs encoded data for one frame, and the moving image D frame position information input unit 109 inputs frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 3. The spatial resolution information setting unit 118 sets 0 as the spatial resolution information. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the spatial resolution information.

次に，多重化されたファイルから特定の動画像で所望の空間解像度の符号化データを取り出すファイル読み込み方法について動作を説明する。図６にファイル読み込み装置の構成を示す。図６に示すファイル読み込み装置２２は，ファイルを読み込むファイル入力部２０１と，管理情報を読み取る管理情報読み取り部２０２と，管理情報から所望の空間解像度情報を探索する空間解像度情報探索部２０７と，管理情報から動画像番号を判定する動画像番号判定部２０３と，管理情報からフレーム先頭位置情報を読み取るフレーム先頭位置情報設定部２０５と，フレーム先頭位置情報に従ってファイルから符号化データを読み取る符号化データ読み取り部２０６とで構成される。 Next, the operation of a file reading method for extracting encoded data having a desired spatial resolution from a multiplexed file with a specific moving image will be described. FIG. 6 shows the configuration of the file reading apparatus. 6 includes a file input unit 201 that reads a file, a management information reading unit 202 that reads management information, a spatial resolution information search unit 207 that searches for desired spatial resolution information from the management information, a management A moving image number determination unit 203 that determines a moving image number from information, a frame head position information setting unit 205 that reads frame head position information from management information, and an encoded data reading that reads encoded data from a file according to the frame head position information Part 206.

図５の多重化装置１１で作成されたファイルからＱＶＧＡの動画像Ｃの符号化データを読み取る場合の動作を示す。 The operation when reading the encoded data of the moving image C of QVGA from the file created by the multiplexing device 11 of FIG.

ファイル入力部２０１は，ファイルを入力する。管理情報読み取り部２０２は，ファイルの後半にある管理情報を読み取る。空間解像度情報探索部２０７は，第１フレームで空間解像度情報がＱＶＧＡの動画像の管理情報を探索する。動画像番号判定部２０３は，探索して得られた管理情報の動画像番号が２であるかどうかを判定する。最初に探索される動画像番号は２であるので，フレーム先頭位置情報設定部２０５は，このフレームのフレーム先頭位置情報を設定する。符号化データ読み取り部２０６は，設定されたファイル位置から符号化データを読み取る。このとき，フレーム先頭位置情報設定部２０５で得られる次のフレームの先頭位置までを読み取る。次に第１フレームで空間解像度情報探索部２０７で探索されて動画像番号判定部２０３で判定される管理情報の動画像番号は３であるので，そのフレームは復号しない。 The file input unit 201 inputs a file. The management information reading unit 202 reads management information in the second half of the file. The spatial resolution information search unit 207 searches for management information of moving images whose spatial resolution information is QVGA in the first frame. The moving image number determination unit 203 determines whether or not the moving image number of the management information obtained by searching is 2. Since the moving image number searched first is 2, the frame head position information setting unit 205 sets the frame head position information of this frame. The encoded data reading unit 206 reads encoded data from the set file position. At this time, the frame head position information setting unit 205 reads up to the head position of the next frame. Next, since the moving image number of the management information searched by the spatial resolution information searching unit 207 in the first frame and determined by the moving image number determining unit 203 is 3, the frame is not decoded.

ここで動画像Ｃの符号化データのみを読み取るようにしたが，特定の空間解像度の動画像の符号化データを全て読み取るようにしてもよい。この場合のファイル読み込み装置の構成は図７のファイル読み込み装置２３のようになり，図６の構成から動画像番号判定部２０３を省いた構成となる。この場合に，ＱＶＧＡの動画像の符号化データを読み取るときには，次のように処理される。 Although only the encoded data of the moving picture C is read here, all the encoded data of the moving picture having a specific spatial resolution may be read. The configuration of the file reading device in this case is the same as that of the file reading device 23 in FIG. 7, and the moving image number determination unit 203 is omitted from the configuration in FIG. In this case, when reading the encoded data of the QVGA moving image, the following processing is performed.

ファイル入力部２０１は，ファイルを入力する。管理情報読み取り部２０２は，ファイルの後半にある管理情報を読み取る。空間解像度情報探索部２０７は，第１フレームで空間解像度情報がＱＶＧＡの動画像の管理情報を探索する。最初に探索される動画像番号２のフレームについて，フレーム先頭位置情報設定部２０５は，このフレームのフレーム先頭位置情報を設定する。符号化データ読み取り部２０６は，設定されたファイル位置から符号化データを読み取る。このとき，フレーム先頭位置情報設定部２０５で得られる次のフレームの先頭位置までを読み取る。次に第１フレームで探索される動画像番号３のフレームについて，フレーム先頭位置情報設定部２０５は，このフレームのフレーム先頭位置情報を設定する。符号化データ読み取り部２０６は，設定されたファイル位置から符号化データを読み取る。このとき，フレーム先頭位置情報設定部２０５で得られる次のフレームの先頭位置までを読み取る。 The file input unit 201 inputs a file. The management information reading unit 202 reads management information in the second half of the file. The spatial resolution information search unit 207 searches for management information of moving images whose spatial resolution information is QVGA in the first frame. For the frame of moving image number 2 to be searched first, the frame head position information setting unit 205 sets the frame head position information of this frame. The encoded data reading unit 206 reads encoded data from the set file position. At this time, the frame head position information setting unit 205 reads up to the head position of the next frame. Next, for the frame of moving image number 3 searched in the first frame, the frame head position information setting unit 205 sets the frame head position information of this frame. The encoded data reading unit 206 reads encoded data from the set file position. At this time, the frame head position information setting unit 205 reads up to the head position of the next frame.

以後のフレームについても，同様にＱＶＧＡの動画像の符号化データを読み取る。 For the subsequent frames, the encoded data of the QVGA moving image is similarly read.

このように本実施例によれば，符号化データ中に含まれる空間解像度情報を復号することなく，特定の空間解像度情報の符号化データを取り出すことができる。 As described above, according to the present embodiment, encoded data of specific spatial resolution information can be extracted without decoding the spatial resolution information included in the encoded data.

なお本実施例では，ＱＶＧＡの動画像の符号化データを読み込む場合の処理を示したが，ＶＧＡの動画像の符号化データを読み込むようにしてもよい。 In this embodiment, the process for reading the encoded data of the QVGA moving image is shown. However, the encoded data of the VGA moving image may be read.

なお，本実施例の多重化装置１１では，符号化データの後に管理情報を設定したが，ファイルにトラックの構造が設定されている場合には，管理情報を第一トラックに設定し，符号化データを第二トラックに設定してもよい。 In the multiplexing apparatus 11 of this embodiment, management information is set after the encoded data. However, when the track structure is set in the file, the management information is set in the first track and the encoding is performed. Data may be set in the second track.

図８に，ファイル中におけるトラックや管理情報の構成例を示す。作成されたファイルには，符号化データトラック３０と管理情報トラック４０とがあり，符号化データトラック３０には，動画像Ａ符号化データ３１，動画像Ｂ符号化データ３２，…が格納される。また，管理情報トラック４０には，各時刻Ｔｎごとの管理情報４１，４２，４３，…が格納され，これらには各々，フレーム先頭位置情報，動画像番号，空間解像度情報などの情報が格納される。 FIG. 8 shows a configuration example of tracks and management information in a file. The created file includes an encoded data track 30 and a management information track 40. In the encoded data track 30, moving image A encoded data 31, moving image B encoded data 32,... Are stored. . Further, management information 41, 42, 43,... For each time Tn is stored in the management information track 40, and information such as frame head position information, moving image number, spatial resolution information is stored in each of these. The

第３の実施例では，異なるビット深度の動画像データを多重化し，特定のビット深度の符号化データを読み込む場合の例を示す。第３の実施例では，フレーム先頭の位置情報と動画像番号とビット深度情報を管理情報として多重化し，全ての符号化データを多重化した最後に設定する場合の例を示す。第１の実施例と同様に，動画像はＡ，Ｂ，Ｃ，Ｄの４つあり，予め符号化されていて，各動画像におけるフレーム先頭の位置情報は分かっているものとする。 The third embodiment shows an example in which moving image data having different bit depths is multiplexed and encoded data having a specific bit depth is read. The third embodiment shows an example in which the frame head position information, moving picture number, and bit depth information are multiplexed as management information and set at the end of multiplexing all encoded data. As in the first embodiment, it is assumed that there are four moving images A, B, C, and D, which are encoded in advance, and the position information of the frame head in each moving image is known.

図９に多重化装置の構成を示す。図９に示す多重化装置１２は，動画像Ａ，Ｂ，Ｃ，Ｄの符号化データをそれぞれ入力する動画像Ａ符号化データ入力部１０１，動画像Ｂ符号化データ入力部１０２，動画像Ｃ符号化データ入力部１０３，動画像Ｄ符号化データ入力部１０４と，各動画像のフレーム先頭の動画像中の位置情報をそれぞれ入力する動画像Ａフレーム位置情報入力部１０６，動画像Ｂフレーム位置情報入力部１０７，動画像Ｃフレーム位置情報入力部１０８，動画像Ｄフレーム位置情報入力部１０９と，動画像間の参照関係を示す情報を入力する参照関係データ入力部１０５と，動画像のビット深度情報を入力するビット深度データ入力部１２１と，動画像の符号化データやフレーム位置情報の入力を切り替えるデータ切り替え部１１０と，符号化データを多重化する符号化データ多重化部１１１と，動画像番号を設定する動画像番号設定部１１２と，ビット深度情報を設定するビット深度情報設定部１２０と，フレーム位置の先頭情報を設定するフレーム先頭位置情報設定部１１４と，管理情報を多重化する管理情報多重化部１１６と，符号化データと管理情報を多重化するファイル多重化部１１５とから構成される。 FIG. 9 shows the configuration of the multiplexing apparatus. The multiplexing device 12 shown in FIG. 9 includes a moving image A encoded data input unit 101, a moving image B encoded data input unit 102, and a moving image C that input encoded data of moving images A, B, C, and D, respectively. Encoded data input unit 103, moving image D encoded data input unit 104, moving image A frame position information input unit 106 for inputting position information in the moving image at the head of each moving image, moving image B frame position An information input unit 107, a moving image C frame position information input unit 108, a moving image D frame position information input unit 109, a reference relationship data input unit 105 for inputting information indicating a reference relationship between moving images, and a moving image bit Bit depth data input unit 121 that inputs depth information, data switching unit 110 that switches between input of encoded data of moving images and frame position information, and multiplexed encoded data Encoded data multiplexing section 111, moving picture number setting section 112 for setting moving picture numbers, bit depth information setting section 120 for setting bit depth information, and frame head position information for setting head information of frame positions The setting unit 114, a management information multiplexing unit 116 that multiplexes management information, and a file multiplexing unit 115 that multiplexes encoded data and management information.

以上のうち，動画像番号設定部１１２では，動画像Ａ，Ｂ，Ｃ，Ｄに対してそれぞれ動画像番号０，１，２，３を設定するものとする。動画像ＡからＤまでのフレーム位置情報入力部１０６〜１０９は，次のフレームの位置情報を入力するものとする。フレーム先頭位置情報設定部１１４は，各動画像のフレーム位置情報をカウントし，符号化データ多重化部１１１で符号化データを多重化した後における各フレームの位置情報を設定するものとする。ビット深度情報設定部１２０は，ビット深度が１０ビットの場合にはビット深度情報として１を設定し，８ビットの場合にはビット深度情報として０を設定するものとする。ファイル多重化部１１５は，符号化データ多重化部１１１で作成された符号化データの後に，管理情報多重化部１１６で作成された管理情報を設定するものとする。符号化データ多重化部１１１は，参照関係データがある場合には各動画像の符号化データの前に多重化するものとする。 Among the above, the moving image number setting unit 112 sets moving image numbers 0, 1, 2, and 3 for moving images A, B, C, and D, respectively. Frame position information input units 106 to 109 for moving images A to D are assumed to input position information of the next frame. The frame head position information setting unit 114 counts the frame position information of each moving image, and sets the position information of each frame after the encoded data multiplexing unit 111 multiplexes the encoded data. The bit depth information setting unit 120 sets 1 as the bit depth information when the bit depth is 10 bits, and sets 0 as the bit depth information when the bit depth is 8 bits. The file multiplexing unit 115 sets the management information created by the management information multiplexing unit 116 after the coded data created by the coded data multiplexing unit 111. When there is reference relation data, the encoded data multiplexing unit 111 multiplexes the encoded data before the encoded data of each moving image.

動画像のビット深度は動画像ＡとＢは１０ビットであり，動画像ＣとＤは８ビットであるとする。 The bit depth of moving images is 10 bits for moving images A and B, and 8 bits for moving images C and D.

以上の前提で動画像Ａ，Ｂ，Ｃ，Ｄの各フレームを多重化する場合には，次のように多重化される。 When the frames of the moving images A, B, C, and D are multiplexed based on the above premise, they are multiplexed as follows.

まず，先頭フレームについては次のように多重化される。参照関係データ入力部１０５は，動画像間の参照関係データを入力する。データ切り替え部１１０は，参照関係データに従って符号化データを多重化する順序を決定する。ここでは，次の順序に決定する。動画像Ａは独立して復号できるため，まず動画像Ａを選択する。動画像Ｂは動画像Ａのみを参照するため，次に動画像Ｂを選択する。動画像Ｃは独立して復号できるため，次に動画像Ｃを選択する。そして動画像Ｄを選択する。符号化データ多重化部１１１は，参照関係データを多重化する。フレーム先頭位置情報設定部１１４は，参照関係データのデータサイズ分だけファイルサイズを加算する。 First, the first frame is multiplexed as follows. The reference relationship data input unit 105 inputs reference relationship data between moving images. The data switching unit 110 determines the order in which the encoded data is multiplexed according to the reference relation data. Here, the order is determined as follows. Since the moving image A can be decoded independently, the moving image A is first selected. Since the moving image B refers only to the moving image A, the moving image B is selected next. Since the moving image C can be decoded independently, the moving image C is selected next. Then, the moving image D is selected. The encoded data multiplexing unit 111 multiplexes the reference relation data. The frame head position information setting unit 114 adds the file size by the data size of the reference relation data.

多重化する順序に従い，データ切り替え部１１０は，符号化データとフレーム先頭位置情報を切り替える。まず動画像Ａを選択する。動画像Ａ符号化データ入力部１０１は，符号化データを１フレーム分入力し，動画像Ａフレーム位置情報入力部１０６は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号０を設定する。ビット深度情報設定部１２０は，ビット深度データ入力部１２１からの入力に基づき，ビット深度情報として１を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号とビット深度情報を多重化する。 In accordance with the order of multiplexing, the data switching unit 110 switches between encoded data and frame head position information. First, the moving image A is selected. The moving image A encoded data input unit 101 inputs encoded data for one frame, and the moving image A frame position information input unit 106 inputs frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 0. The bit depth information setting unit 120 sets 1 as the bit depth information based on the input from the bit depth data input unit 121. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the bit depth information.

続いてデータ切り替え部１１０は，動画像Ｂを選択する。動画像Ｂ符号化データ入力部１０２は，符号化データを１フレーム分入力し，動画像Ｂフレーム位置情報入力部１０７は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号１を設定する。ビット深度情報設定部１２０は，ビット深度情報として１を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号とビット深度情報を多重化する。 Subsequently, the data switching unit 110 selects the moving image B. The moving image B encoded data input unit 102 inputs encoded data for one frame, and the moving image B frame position information input unit 107 inputs frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 1. The bit depth information setting unit 120 sets 1 as the bit depth information. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the bit depth information.

続いてデータ切り替え部１１０は，動画像Ｃを選択する。動画像Ｃ符号化データ入力部１０３は，符号化データを１フレーム分入力し，動画像Ｃフレーム位置情報入力部１０８は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号２を設定する。ビット深度情報設定部１２０は，ビット深度情報として０を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号とビット深度情報を多重化する。 Subsequently, the data switching unit 110 selects the moving image C. The moving image C encoded data input unit 103 inputs encoded data for one frame, and the moving image C frame position information input unit 108 inputs the frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 2. The bit depth information setting unit 120 sets 0 as the bit depth information. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the bit depth information.

続いてデータ切り替え部１１０は，動画像Ｄを選択する。動画像Ｄ符号化データ入力部１０４は，符号化データを１フレーム分入力し，動画像Ｄフレーム位置情報入力部１０９は，次のフレームのフレーム位置情報を入力する。符号化データ多重化部１１１は，符号化データを多重化する。フレーム先頭位置情報設定部１１４は，入力されたフレーム位置情報をファイルサイズに加算し，ファイルのフレーム位置情報として設定する。動画像番号設定部１１２は，動画像番号３を設定する。ビット深度情報設定部１２０は，ビット深度情報として０を設定する。管理情報多重化部１１６は，フレーム位置情報と動画像番号とビット深度情報を多重化する。 Subsequently, the data switching unit 110 selects the moving image D. The moving image D encoded data input unit 104 inputs encoded data for one frame, and the moving image D frame position information input unit 109 inputs frame position information of the next frame. The encoded data multiplexing unit 111 multiplexes encoded data. The frame head position information setting unit 114 adds the input frame position information to the file size and sets it as the frame position information of the file. The moving image number setting unit 112 sets moving image number 3. The bit depth information setting unit 120 sets 0 as the bit depth information. The management information multiplexing unit 116 multiplexes the frame position information, the moving image number, and the bit depth information.

次に，多重化されたファイルから特定の動画像で所望のビット深度の符号化データを取り出すファイル読み込み方法について動作を説明する。図１０にファイル読み込み装置の構成を示す。図１０に示すファイル読み込み装置２４は，ファイルを読み込むファイル入力部２０１と，管理情報を読み取る管理情報読み取り部２０２と，管理情報から所望のビット深度情報を探索するビット深度情報探索部２０８と，管理情報から動画像番号を判定する動画像番号判定部２０３と，管理情報からフレーム先頭位置情報を読み取るフレーム先頭位置情報設定部２０５と，フレーム先頭位置情報に従ってファイルから符号化データを読み取る符号化データ読み取り部２０６とで構成される。 Next, the operation of a file reading method for extracting encoded data having a desired bit depth from a multiplexed file with a specific moving image will be described. FIG. 10 shows the configuration of the file reading apparatus. 10 includes a file input unit 201 that reads a file, a management information reading unit 202 that reads management information, a bit depth information search unit 208 that searches for desired bit depth information from the management information, and a management A moving image number determination unit 203 that determines a moving image number from information, a frame head position information setting unit 205 that reads frame head position information from management information, and an encoded data reading that reads encoded data from a file according to the frame head position information Part 206.

図９の多重化装置１２で作成されたファイルから８ビットの動画像Ｃの符号化データを読み取る場合の動作を示す。 The operation when reading the encoded data of the 8-bit moving image C from the file created by the multiplexing device 12 of FIG. 9 is shown.

ファイル入力部２０１は，ファイルを入力する。管理情報読み取り部２０２は，ファイルの後半にある管理情報を読み取る。ビット深度情報探索部２０８は，第１フレームでビット深度情報が８ビットの動画像の管理情報を探索する。動画像番号判定部２０３は，探索して得られた管理情報の動画像番号が２であるかどうかを判定する。最初に探索される動画像番号は２であるので，フレーム先頭位置情報設定部２０５は，このフレームのフレーム先頭位置情報を設定する。符号化データ読み取り部２０６は，設定されたファイル位置から符号化データを読み取る。このとき，フレーム先頭位置情報設定部２０５で得られる次のフレームの先頭位置までを読み取る。次に第１フレームでビット深度情報探索部２０８で探索されて動画像番号判定部２０３で判定される管理情報の動画像番号は３であるので，そのフレームは復号しない。 The file input unit 201 inputs a file. The management information reading unit 202 reads management information in the second half of the file. The bit depth information search unit 208 searches for management information of a moving image whose bit depth information is 8 bits in the first frame. The moving image number determination unit 203 determines whether or not the moving image number of the management information obtained by searching is 2. Since the moving image number searched first is 2, the frame head position information setting unit 205 sets the frame head position information of this frame. The encoded data reading unit 206 reads encoded data from the set file position. At this time, the frame head position information setting unit 205 reads up to the head position of the next frame. Next, since the moving image number of the management information searched by the bit depth information searching unit 208 in the first frame and determined by the moving image number determining unit 203 is 3, the frame is not decoded.

ここで動画像Ｃの符号化データのみを読み取るようにしたが，特定のビット深度の動画像の符号化データを全て読み取るようにしてもよい。この場合のファイル読み込み装置の構成は図１１のファイル読み込み装置２５のようになり，図１０の構成から動画像番号判定部２０３を省いた構成となる。この場合に，８ビットの動画像の符号化データを読み取るときには，次のように処理される。 Although only the encoded data of the moving picture C is read here, all the encoded data of the moving picture having a specific bit depth may be read. In this case, the configuration of the file reading device is the same as that of the file reading device 25 in FIG. In this case, when reading encoded data of an 8-bit moving image, the following processing is performed.

ファイル入力部２０１は，ファイルを入力する。管理情報読み取り部２０２は，ファイルの後半にある管理情報を読み取る。ビット深度情報探索部２０８は，第１フレームでビット深度情報が８ビットの動画像の管理情報を探索する。最初に探索される動画像番号２のフレームについて，フレーム先頭位置情報設定部２０５は，このフレームのフレーム先頭位置情報を設定する。符号化データ読み取り部２０６は，設定されたファイル位置から符号化データを読み取る。このとき，フレーム先頭位置情報設定部２０５で得られる次のフレームの先頭位置までを読み取る。次に第１フレームで探索される動画像番号３のフレームについて，フレーム先頭位置情報設定部２０５は，このフレームのフレーム先頭位置情報を設定する。符号化データ読み取り部２０６は，設定されたファイル位置から符号化データを読み取る。このとき，フレーム先頭位置情報設定部２０５で得られる次のフレームの先頭位置までを読み取る。 The file input unit 201 inputs a file. The management information reading unit 202 reads management information in the second half of the file. The bit depth information search unit 208 searches for management information of a moving image whose bit depth information is 8 bits in the first frame. For the frame of moving image number 2 to be searched first, the frame head position information setting unit 205 sets the frame head position information of this frame. The encoded data reading unit 206 reads encoded data from the set file position. At this time, the frame head position information setting unit 205 reads up to the head position of the next frame. Next, for the frame of moving image number 3 searched in the first frame, the frame head position information setting unit 205 sets the frame head position information of this frame. The encoded data reading unit 206 reads encoded data from the set file position. At this time, the frame head position information setting unit 205 reads up to the head position of the next frame.

以後のフレームについても，同様に８ビットの動画像の符号化データを読み取る。 Similarly, for the subsequent frames, the encoded data of the 8-bit moving image is read.

このように本実施例によれば，符号化データ中に含まれるビット深度情報を復号することなく，特定のビット深度情報の符号化データを取り出すことができる。 As described above, according to the present embodiment, encoded data of specific bit depth information can be extracted without decoding the bit depth information included in the encoded data.

なお本実施例では，８ビットの動画像の符号化データを読み込む場合の処理を示したが，１０ビットの動画像の符号化データを読み込むようにしてもよい。 In the present embodiment, the process for reading encoded data of 8-bit moving images is shown. However, encoded data of 10-bit moving images may be read.

なお，本実施例の多重化装置１２では，符号化データの後に管理情報を設定したが，ファイルにトラックの構造が設定されている場合には，管理情報を第一トラックに設定し，符号化データを第二トラックに設定してもよい。 In the multiplexing device 12 of this embodiment, management information is set after the encoded data. However, when the track structure is set in the file, the management information is set in the first track and the encoding is performed. Data may be set in the second track.

図１２に，ファイル中におけるトラックや管理情報の構成例を示す。作成されたファイルには，符号化データトラック３０と管理情報トラック４０とがあり，符号化データトラック３０には，動画像Ａ符号化データ３１，動画像Ｂ符号化データ３２，…が格納される。また，管理情報トラック４０には，各時刻Ｔｎごとの管理情報４１，４２，４３，…が格納され，これらには各々，フレーム先頭位置情報，動画像番号，ビット深度情報などの情報が格納される。 FIG. 12 shows a configuration example of tracks and management information in a file. The created file includes an encoded data track 30 and a management information track 40. In the encoded data track 30, moving image A encoded data 31, moving image B encoded data 32,... Are stored. . Further, management information 41, 42, 43,... For each time Tn is stored in the management information track 40, and information such as frame head position information, moving picture number, bit depth information is stored in each of these. The

以上の実施例で異なる色空間や空間解像度やビット深度の動画像の符号化データを多重化する装置と，特定の動画像を読み取る装置の動作の例を示した。本発明は，色空間や空間解像度やビット深度のうち，いずれかが異なるのではなく，組み合わせて異なる場合にも適用できる。例えば，色空間と空間解像度が同時に異なる場合の動画像の符号化データを多重化してもよい。この場合の多重化では，管理情報として，色空間情報と空間解像度情報が同時に多重化される。 In the above embodiment, an example of the operation of a device that multiplexes encoded data of moving images having different color spaces, spatial resolutions, and bit depths and a device that reads a specific moving image has been shown. The present invention can also be applied to a case where any one of color space, spatial resolution, and bit depth is not different but is different in combination. For example, encoded data of moving images in the case where the color space and the spatial resolution are different may be multiplexed. In this case, the color space information and the spatial resolution information are multiplexed simultaneously as management information.

また，以上の実施例の説明では，フレーム単位に管理情報を設定する例について説明したが，フレームを分割したスライス単位に管理情報を設定してもよい。この場合には，管理情報はフレーム先頭位置の代わりに，スライス先頭位置情報を含む。また，多重化や読み取りもフレーム単位ではなく，スライス単位で行う。すなわち，上記実施例の説明において，フレームをスライスに読み替えて本発明を実施することも可能である。 In the above description of the embodiment, an example in which management information is set in units of frames has been described. However, management information may be set in units of slices obtained by dividing a frame. In this case, the management information includes slice head position information instead of the frame head position. In addition, multiplexing and reading are performed not on a frame basis but on a slice basis. That is, in the description of the above embodiment, the present invention can be implemented by replacing a frame with a slice.

以上の多重化装置およびファイル読み込み装置による処理は，コンピュータとソフトウェアプログラムとによって実現することもでき，そのプログラムをコンピュータ読み取り可能な記録媒体に記録して提供することも，ネットワークを通して提供することも可能である。 The above processing by the multiplexing device and the file reading device can be realized by a computer and a software program, and the program can be provided by being recorded on a computer-readable recording medium or provided through a network. It is.

第１の実施例における多重化装置の構成を示す図である。It is a figure which shows the structure of the multiplexing apparatus in 1st Example. 第１の実施例におけるファイル読み込み装置の構成１を示す図である。It is a figure which shows the structure 1 of the file reading apparatus in a 1st Example. 第１の実施例におけるファイル読み込み装置の構成２を示す図である。It is a figure which shows the structure 2 of the file reading apparatus in a 1st Example. 第１の実施例のファイル中におけるトラックと管理情報の構成例を示す図である。It is a figure which shows the structural example of the track | truck in the file of a 1st Example, and management information. 第２の実施例における多重化装置の構成を示す図である。It is a figure which shows the structure of the multiplexing apparatus in a 2nd Example. 第２の実施例におけるファイル読み込み装置の構成１を示す図である。It is a figure which shows the structure 1 of the file reading apparatus in a 2nd Example. 第２の実施例におけるファイル読み込み装置の構成２を示す図である。It is a figure which shows the structure 2 of the file reading apparatus in a 2nd Example. 第２の実施例のファイル中におけるトラックと管理情報の構成例を示す図である。It is a figure which shows the structural example of the track | truck in the file of 2nd Example, and management information. 第３の実施例における多重化装置の構成を示す図である。It is a figure which shows the structure of the multiplexing apparatus in a 3rd Example. 第３の実施例におけるファイル読み込み装置の構成１を示す図である。It is a figure which shows the structure 1 of the file reading apparatus in a 3rd Example. 第３の実施例におけるファイル読み込み装置の構成２を示す図である。It is a figure which shows the structure 2 of the file reading apparatus in a 3rd Example. 第３の実施例のファイル中におけるトラックと管理情報の構成例を示す図である。It is a figure which shows the structural example of the track | truck and management information in the file of 3rd Example. スケーラブル符号化の概念を示す図である。It is a figure which shows the concept of scalable encoding. ＭＶＣの参照関係の例を示す図である。It is a figure which shows the example of the reference relationship of MVC. ＭＶＣの符号化データの構成例を示す図である。It is a figure which shows the structural example of the encoding data of MVC. ファイル中におけるトラックや管理情報の構成例を示す図である。It is a figure which shows the structural example of the track | truck and management information in a file.

Explanation of symbols

１０，１１，１２多重化装置
１０１動画像Ａ符号化データ入力部
１０２動画像Ｂ符号化データ入力部
１０３動画像Ｃ符号化データ入力部
１０４動画像Ｄ符号化データ入力部
１０５参照関係データ入力部
１０６動画像Ａフレーム位置情報入力部
１０７動画像Ｂフレーム位置情報入力部
１０８動画像Ｃフレーム位置情報入力部
１０９動画像Ｄフレーム位置情報入力部
１１０データ切り替え部
１１１符号化データ多重化部
１１２動画像番号設定部
１１３色空間情報設定部
１１４フレーム先頭位置情報設定部
１１５ファイル多重化部
１１６管理情報多重化部
１１７色空間データ入力部
１１８空間解像度情報設定部
１１９空間解像度データ入力部
１２０ビット深度情報設定部
１２１ビット深度データ入力部
２０，２１，２２，２３，２４，２５ファイル読み込み装置
２０１ファイル入力部
２０２管理情報読み取り部
２０３動画像番号判定部
２０４色空間情報探索部
２０５フレーム先頭位置情報設定部
２０６符号化データ読み取り部
２０７空間解像度情報探索部
２０８ビット深度情報探索部 10, 11, 12 Multiplexer 101 Moving image A encoded data input unit 102 Moving image B encoded data input unit 103 Moving image C encoded data input unit 104 Moving image D encoded data input unit 105 Reference relation data input unit 106 moving image A frame position information input unit 107 moving image B frame position information input unit 108 moving image C frame position information input unit 109 moving image D frame position information input unit 110 data switching unit 111 encoded data multiplexing unit 112 moving image Number setting unit 113 Color space information setting unit 114 Frame head position information setting unit 115 File multiplexing unit 116 Management information multiplexing unit 117 Color space data input unit 118 Spatial resolution information setting unit 119 Spatial resolution data input unit 120 Bit depth information setting Part 121 bit depth data input part 20, 21, 2, 23, 24, 25 File reading device 201 File input unit 202 Management information reading unit 203 Moving image number determination unit 204 Color space information searching unit 205 Frame head position information setting unit 206 Encoded data reading unit 207 Spatial resolution information searching unit 208-bit depth information search unit

Claims

A multiplexing method for constructing a file by multiplexing encoded data created by predictive encoding between a plurality of moving images having different color spaces,
A process of multiplexing position information in a frame or slice head file of a moving image, a moving image number for distinguishing the moving image, and color space information indicating a color space of the frame or slice;
A process of configuring the multiplexed information at a specific position different from the encoded data to be decoded of each moving image frame or slice, and configuring a file;
A method for multiplexing moving images, comprising:

A multiplexing method for constructing a file by multiplexing encoded data created by predictive encoding between a plurality of moving images having different color spaces,
The position information in the frame or slice head file of the moving image, the moving image number for distinguishing the moving image, and the color space information indicating the color space of the frame or slice are set as the first track,
The encoded data of each frame or slice as the second track
A method for multiplexing moving images, wherein the first track and the second track are multiplexed independently to form a file.

A multiplexing method for constructing a file by multiplexing encoded data created by predictive encoding between a plurality of moving images having different spatial resolutions,
A process of multiplexing position information in a frame or slice head file of a moving image, a moving image number for distinguishing the moving image, and spatial resolution information of the frame or slice;
A process of configuring the multiplexed information at a specific position different from the encoded data to be decoded of each moving image frame or slice, and configuring a file;
A method for multiplexing moving images, comprising:

A multiplexing method for constructing a file by multiplexing encoded data created by predictive encoding between a plurality of moving images having different spatial resolutions,
The first track is the position information in the frame or slice head file of the moving image, the moving image number for distinguishing the moving image, and the spatial resolution information of the frame or slice.
The encoded data of each frame or slice as the second track
A method for multiplexing moving images, wherein the first track and the second track are multiplexed independently to form a file.

A multiplexing method for constructing a file by multiplexing encoded data created by predictive encoding between a plurality of moving images having different bit depths,
A process of multiplexing position information in a frame or slice head file of a moving image, a moving image number for distinguishing the moving image, and bit depth information of the frame or slice;
A process of configuring the multiplexed information at a specific position different from the encoded data to be decoded of each moving image frame or slice, and configuring a file;
A method for multiplexing moving images, comprising:

A multiplexing method for constructing a file by multiplexing encoded data created by predictive encoding between a plurality of moving images having different bit depths,
The position information in the frame or slice head file of the moving image, the moving image number for distinguishing the moving image, and the bit depth information of the frame or slice as the first track,
The encoded data of each frame or slice as the second track
A method for multiplexing moving images, wherein the first track and the second track are multiplexed independently to form a file.

A multiplexing device that multiplexes encoded data created by predictive encoding between a plurality of moving images having different color spaces to constitute a file,
Management information multiplexing for creating management information by multiplexing position information in the frame or slice head file of a moving image, a moving image number for distinguishing the moving image, and color space information indicating the color space of the frame or slice Part,
A file multiplexing unit for setting the management information created by the management information multiplexing unit at a specific position different from the encoded data to be decoded for each moving image frame or slice;
An apparatus for multiplexing moving images, comprising:

A multiplexing device that multiplexes encoded data created by predictive encoding between a plurality of moving images having different color spaces to constitute a file,
Management information multiplexing that multiplexes the position information in the frame or slice head file of the moving image, the moving image number for distinguishing the moving image, and the color space information indicating the color space of the frame or slice as the first track. Part,
An encoded data multiplexing unit that multiplexes encoded data of each frame or slice as a second track;
A file multiplexing unit configured to multiplex the first track and the second track to form a file;
An apparatus for multiplexing moving images, comprising:

A multiplexing device that multiplexes encoded data created by predictive encoding between a plurality of moving images having different spatial resolutions to form a file,
A management information multiplexing unit for creating management information in which position information in a moving image frame or slice head file, a moving image number for distinguishing moving images, and spatial resolution information of the frame or slice are multiplexed;
A file multiplexing unit for setting the management information created by the management information multiplexing unit at a specific position different from the encoded data to be decoded for each moving image frame or slice;
An apparatus for multiplexing moving images, comprising:

A multiplexing device that multiplexes encoded data created by predictive encoding between a plurality of moving images having different spatial resolutions to form a file,
A management information multiplexing unit for multiplexing position information in a frame or slice head file of a moving image, a moving image number for distinguishing the moving image, and spatial resolution information of the frame or slice as a first track;
An encoded data multiplexing unit that multiplexes encoded data of each frame or slice as a second track;
A file multiplexing unit configured to multiplex the first track and the second track to form a file;
An apparatus for multiplexing moving images, comprising:

A multiplexing device that multiplexes encoded data created by predictive encoding between a plurality of moving images having different bit depths to form a file,
A management information multiplexing unit that creates position information in a frame or slice head file of a moving image, a moving image number that distinguishes the moving image, and management information that multiplexes the bit depth information of the frame or slice;
A file multiplexing unit for setting the management information created by the management information multiplexing unit at a specific position different from the encoded data to be decoded for each moving image frame or slice;
An apparatus for multiplexing moving images, comprising:

A multiplexing device that multiplexes encoded data created by predictive encoding between a plurality of moving images having different bit depths to form a file,
A management information multiplexing unit that multiplexes the position information in the frame or slice head file of the moving image, the moving image number for distinguishing the moving image, and the bit depth information of the frame or slice as the first track;
An encoded data multiplexing unit that multiplexes encoded data of each frame or slice as a second track;
A file multiplexing unit configured to multiplex the first track and the second track to form a file;
An apparatus for multiplexing moving images, comprising:

A method for reading encoded data to be decoded from a file in which encoded data created by predictive encoding between a plurality of moving images having different color spaces and management information data thereof are multiplexed and stored. There,
Management information data including position information in the frame or slice head file of the moving image, a moving image number for distinguishing the moving image, and color space information indicating the color space of the frame or slice is read from the file. Process,
Searching for management information having color space information set in advance from the management information data;
The process of reading the start position information of the frame or slice that becomes the preset color space information in the searched management information,
The process of reading the encoded data of the read frame or slice head position,
A method of reading a moving image file, comprising:

A method for reading encoded data to be decoded from a file in which encoded data created by predictive encoding between a plurality of moving images having different spatial resolutions and management information data thereof are multiplexed and stored. There,
Management information data including position information in the frame or slice head file of the moving image, a moving image number for distinguishing the moving image, and spatial resolution information indicating the spatial resolution of the frame or slice is read from the file. Process,
Searching for management information having spatial resolution information set in advance from the management information data;
The process of reading the start position information of the frame or slice that becomes the preset spatial resolution information in the searched management information,
The process of reading the encoded data of the read frame or slice head position,
A method of reading a moving image file, comprising:

A method for reading encoded data to be decoded from a file in which encoded data created by predictive encoding between a plurality of moving images having different bit depths and management information data thereof are multiplexed and stored. There,
Management information data including position information in a moving image frame or slice head file, a moving image number for distinguishing moving images, and bit depth information indicating the bit depth of the frame or slice is read from the file. Process,
Searching for management information having a preset bit depth information from the management information data;
The process of reading the head position information of the frame or slice that becomes the preset bit depth information in the searched management information,
The process of reading the encoded data of the read frame or slice head position,
A method of reading a moving image file, comprising:

A device that reads encoded data to be decoded from a file in which encoded data created by predictive encoding between a plurality of moving images having different color spaces and management information data thereof are multiplexed and stored. There,
Management information data including position information in the frame or slice head file of the moving image, a moving image number for distinguishing the moving image, and color space information indicating the color space of the frame or slice is read from the file. A management information reading unit;
A color space information search unit for searching for management information having preset color space information from the management information data;
A frame or slice head position information setting unit for reading the head position information of a frame or slice that is preset color space information in the searched management information;
An encoded data reading unit for reading the encoded data of the read frame or slice head position;
A moving image file reading apparatus comprising:

A device that reads encoded data to be decoded from a file in which encoded data created by predictive encoding between a plurality of moving images having different spatial resolutions and management information data thereof are multiplexed and stored. There,
Management information data including position information in the frame or slice head file of the moving image, a moving image number for distinguishing the moving image, and spatial resolution information indicating the spatial resolution of the frame or slice is read from the file. A management information reading unit;
A spatial resolution information search unit for searching for management information having preset spatial resolution information from the management information data;
A frame or slice head position information setting section for reading the head position information of a frame or slice that is preset spatial resolution information in the searched management information;
An encoded data reading unit for reading the encoded data of the read frame or slice head position;
A moving image file reading apparatus comprising:

A device that reads encoded data to be decoded from a file in which encoded data created by predictive encoding between a plurality of moving images having different bit depths and management information data thereof are multiplexed and stored. There,
Management information data including position information in a moving image frame or slice head file, a moving image number for distinguishing moving images, and bit depth information indicating the bit depth of the frame or slice is read from the file. A management information reading unit;
A bit depth information search unit for searching for management information having preset bit depth information from the management information data;
A frame or slice head position information setting unit for reading the head position information of a frame or a slice that is a preset bit depth information in the searched management information;
An encoded data reading unit for reading the encoded data of the read frame or slice head position;
A moving image file reading apparatus comprising:

A moving picture multiplexing program for causing a computer to execute the moving picture multiplexing method according to any one of claims 1 to 6.

A computer-readable recording medium on which the moving picture multiplexing program according to claim 19 is recorded.

A moving image file reading program for causing a computer to execute the moving image file reading method according to any one of claims 13 to 15.

A computer-readable recording medium on which the moving image file reading program according to claim 21 is recorded.