JP2005020339A

JP2005020339A - Video decoding device

Info

Publication number: JP2005020339A
Application number: JP2003182112A
Authority: JP
Inventors: Kengo Nishimura; 憲吾西村; Junko Yagi; 順子八木; Michihiro Matsumoto; 道弘松本; Takaharu Morohashi; 隆治諸橋
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2003-06-26
Filing date: 2003-06-26
Publication date: 2005-01-20

Abstract

【課題】フレーム情報を含まない多重化データに対するシーク動作を可能とする。
【解決手段】多重化データ１００（Ａ，Ｖ，Ｔ）を入出力インターフェイス１１を介して蓄積メモリ１３へ格納し蓄積メモリ１３に格納する（ステップＳＴ１）。次に、多重化データ自身のデータサイズＳａｌｌとそのデータ全体を再生する全体再生時間Ｔａｌｌと指定時刻までの再生時間Ｔｓｅｅｋとを取得する（ステップＳＴ２）。全体再生時間Ｔａｌｌと指定時刻までの再生時間Ｔｓｅｅｋとの比率を用いて多重化データ１００の中から指定時刻に対応するデータ位置Ｓｓｅｅｋを検索する（ステップＳＴ３）。求めたデータ位置Ｓｓｅｅｋに対応するフレーム（符号化された動画像データ（Ａ４））を多重化データより分離し復号化して入出力インターフェイス１５を介して表示装置２へ出力する。
【選択図】図４
[Problem] To enable a seek operation on multiplexed data that does not include frame information.
[Solution] Multiplexed data 100 (A, V, T) is stored in a storage memory 13 via an input/output interface 11 (step ST1). Next, the data size Sall of the multiplexed data itself, the total playback time Tall needed to play back the entire data, and the playback time Tseek up to the specified time are acquired (step ST2). The data position Sseek corresponding to the specified time is then located in the multiplexed data 100 using the ratio of the playback time Tseek to the total playback time Tall (step ST3). The frame (encoded moving image data (A4)) corresponding to the obtained data position Sseek is separated from the multiplexed data, decoded, and output to the display device 2 via the input/output interface 15.
[Selected drawing] FIG. 4

Description

【０００１】
【発明の属する技術分野】
この発明は、符号化された動画像データ（符号化された動画像データ）を復号化する装置に関し、さらに詳しくは、符号化動画像データのシーク動作の制御に関する。
【０００２】
【従来の技術】
今日、情報技術の発展により、ＡＳＦやＭＰ４などの多重化規格に基づいて蓄積メディアに圧縮保存された音声や動画像（映像）を再生して楽しむことが可能となった。このようなＡＶデータ（多重化データ）を復号化して再生するにあたって、シーク動作（ユーザによって指定された時刻（指定時刻）に対応するフレームを検索する動作）は、内容検索を容易にかつ迅速に行うために重要な機能である。このシーク動作を行う際、多重化データに含まれるフレーム情報を用いてシーク対象となるフレームを検索していた。多重化データの中にフレーム情報が含まれている場合は、フレーム情報に含まれる各基準画像のフレームに対する表示時刻であるＰＴＳ情報と指定時刻を比較することで、最も近い時刻でのフレームでの符号化動画像データが多重化データのどの位置に存在しているか一意に検索することが可能である。
【０００３】
【発明が解決しようとする課題】
また、ユーザは自作した音声や動画像（映像）を独自に多重化し蓄積メディアに圧縮保存することも可能となった。しかし、ユーザが自作した多重化データ（ＡＶデータ）の中には、フレーム情報を含まないものが存在する。フレーム情報が存在しない多重化データに対して上述の手法でシーク動作を行うことは困難である。
【０００４】
この発明の目的は、フレーム情報が存在しない符号化動画像データに対してシーク動作を行うことができる動画像復号化装置を提供することである。
【０００５】
【課題を解決するための手段】
この発明の１つの局面に従うと、動画像復号化装置は、符号化された動画像データを復号化する装置であって、取得部と、検索部と、復号化部とを備える。符号化動画像データは、複数のフレームを含む。また、符号化動画像データは、ＡＶデータに多重化されている。取得部は、ＡＶデータのデータサイズとＡＶデータの再生時間と指定時刻とを取得する。指定時刻は、外部より指定される表示時刻を示す。検索部は、取得部で得られた情報（ＡＶデータのデータサイズ，ＡＶデータの再生時間，指定時刻）を用いてＡＶデータの中から指定時刻に対応するデータ位置を検索する。復号化部は、検索部によって得られたデータ位置に対応するフレームを復号化する。
【０００６】
上記動画像復号化装置では、ＡＶデータのデータサイズ，ＡＶデータの再生時間，指定時刻を用いて、ＡＶデータの再生時間における指定時刻の時間軸上での位置を求める。具体的には、ＡＶデータ全体のデータサイズと、ＡＶデータ全体の再生時間と指定時刻との比率とにより、指定時刻に対応するフレームのデータが含まれるデータ位置を計算する。よって、フレーム情報の存在しない符号化動画像データに対してシーク動作を行うことができる。
【０００７】
この発明のもう１つの局面に従うと、動画像復号化装置は、符号化された動画像データを復号化する装置であって、取得部と、検索部と、復号化部とを備える。符号化動画像データは、複数のフレームを含む。また、符号化動画像データは、ＡＶデータに多重化されている。取得部は、ビットレートと指定時刻とを取得する。ビットレートは、単位時間当たりに消費されるＡＶデータのデータ量を示す。検索部は、ビットレートを用いて指定時刻に対応するデータ位置を検索する。
【０００８】
上記動画像復号化装置では、ビットレートは、単位時間当たりに上記動画像復号化装置に入力することによって消費されるＡＶデータのデータ量を示す。検索部は、ビットレートと指定時刻とを用いて指定時刻に対応するフレームデータのデータ位置（ＡＶデータにおける先頭からのデータ位置）を計算する。よって、フレーム情報の存在しない符号化動画像データに対してシーク動作を行うことができる。
【０００９】
好ましくは、前記ＡＶデータは、符号化動画像データと他の符号化データとが所定の配列単位で多重化されている。上記動画像復号化装置は、判定部と、再構成部とをさらに備える。判定部は、ＡＶデータの配列単位のデータサイズが所定のサイズを超えているか否かを判定する。再構成部は、ＡＶデータの配列単位のデータサイズが所定のサイズを超えていると判定部で判定されたとき、ＡＶデータの配列単位のデータサイズが所定のサイズ以下になるようにＡＶデータ内の各符号化データの並びを変更する。
【００１０】
フレーム情報が存在しないＡＶデータに対してシーク動作を行う場合、ＡＶデータに含まれる各符号化データの結合順序によって不可能な場合がある。例えば、ＡＶデータがデータの先頭から半分までに符号化動画像データ（Ａ）を含み、データの半分から最後までに他の符号化データ（Ｂ）を含むように多重化されているとする。このＡＶデータに対して上記全体再生時間と上記指定時刻との関係を用いてデータ位置を検索した。結果、データの先頭からデータ全体に対して３／４の位置がデータ位置として検索されたとする。すると、復号化されるのは、後者の他の符号化データ（Ｂ）のみで符号化動画像データは全く復号化されない。このように、ＡＶデータに含まれる符号化データの結合順序が偏ってしまうとシーク動作をうまく行うことができない。
【００１１】
上記動画像復号化装置では、ＡＶデータに含まれる各符号化データにおける配列単位のデータサイズが所定のサイズを超えるとき、配列単位のデータサイズが所定のサイズ以下になるようにＡＶデータ内の各符号化データの並びを変更する。つまり、ＡＶデータに含まれる各符号化データの結合順序の偏りが大きい時ＡＶデータ内の各符号化データの並びを変更する。よって上記のような問題を解消することができ、シーク動作を行うことができる。
【００１２】
この発明のさらにもう１つの局面に従うと、動画像復号化装置は、ＡＶデータに多重化されている符号化動画像データを復号化する装置であって、判定部と、再構成部と、フレーム情報作成部とを備える。ＡＶデータは、符号化動画像データと他の符号化動画像データとが所定の配列単位で多重化されている。符号化動画像データは、複数のフレームを含む。判定部は、ＡＶデータの配列単位のデータサイズが所定のサイズを超えているか否かを判定する。再構成部は、ＡＶデータの配列単位のデータサイズが所定のサイズを超えていると判定部で判定されたとき、ＡＶデータの配列単位のデータサイズが所定のサイズ以下になるようにＡＶデータ内の各符号化データの並びを変更する。フレーム情報作成部は、再構成部によって並びが変更されたＡＶデータについてのフレーム情報を作成し、作成したフレーム情報を当該ＡＶデータに付加する。フレーム情報は、ＡＶデータに含まれているフレームの時間軸上における並びを示す情報を含む。
【００１３】
上記動画像復号化装置では、各符号化データの結合順序を変更したＡＶデータに対してフレーム情報を付加する。よって、フレーム情報の表示時刻とユーザの指定時刻とを比較して、時間軸上で最も近いフレームの表示時刻を検索することによってシーク動作を行うことができる。
【００１４】
【発明の実施の形態】
以下、この発明の実施の形態を図面を参照して詳しく説明する。なお、図中同一または相当部分には同一の符号を付しその説明は繰り返さない。
【００１５】
（第１の実施形態）
第１の実施形態による動画像再生システムの全体構成を図１に示す。このシステムは、所定の多重化規格（例えばＡＳＦやＭＰ４など）に従って多重化されているＡＶデータ（多重化データ１００）に含まれる符号化動画像データの再生を行う。このシステムでは、ユーザによって指定された位置（時刻）から符号化動画像データを再生することができる（シーク動作）。このシステムは、動画像復号化装置１と、表示装置２とを備える。動画像復号化装置１は、入出力インターフェイス１１，１５と、ＣＰＵ１２と、蓄積メモリ１３と、フレームバッファ１４とを備える。入出力インターフェイス１１は、外部からの多重化データ１００の入力処理を行う。ＣＰＵ１２は、多重化データ１００の解析、多重化データ１００の分離、符号化動画像データの復号化、および指定時刻情報２００に応じて全体のコントロールを行う。指定時刻情報２００は、ユーザによって指示される表示時刻（指定時刻）を示す。蓄積メモリ１３は、多重化データ１００、および符号化動画像データを蓄積する。フレームバッファ１４は、復号化された動画像データ（フレームデータ）を蓄積する。入出力インターフェイス１５は、フレームバッファ１４に蓄積されたフレームデータを表示装置２へ出力する。
【００１６】
次に、図１に示した動画像再生システムの動作について説明する。ここではシーク動作について説明する。シーク動作は、多重化データ１００（ＡＶデータ）の中からユーザに指定された表示時刻に対応するフレームを表示する処理である。以下、シーク動作について図２を参照しつつ説明する。
【００１７】
〔ステップＳＴ１〕
入出力インターフェイス１１を介して多重化データ１００が動画像復号化装置１に入力される。多重化データ１００の一例を図３に示す。多重化データ１００は、符号化動画像データ（Ｖ），符号化音声・オーディオデータ（Ａ），符号化テキストデータ（Ｔ）などが所定の配列単位（図３に示した多重化データ１００の場合、符号化動画像データ（Ｖ）・符号化音声・オーディオデータ（Ａ）・符号化テキストデータ（Ｔ）の配列単位はそれぞれ１パケットである。１パケットには少なくとも１つ以上のフレームが含まれている。）で多重化されたデータ（ＡＶデータ）である。
【００１８】
〔ステップＳＴ２〕
次に、ＣＰＵ１２は、多重化データ１００に含まれるストリーム情報の中から、多重化データ自身のデータサイズＳａｌｌとそのデータ全体を再生する全体再生時間Ｔａｌｌとを取得する。また、指定時刻情報２００が示す指定時刻を取得して、データの先頭から指定時刻までの再生時間Ｔｓｅｅｋを求める。
【００１９】
〔ステップＳＴ３〕
次に、ＣＰＵ１２は、データサイズＳａｌｌと全体再生時間Ｔａｌｌと指定時刻までの再生時間Ｔｓｅｅｋとを用いて、多重化データ１００の中から指定時刻に対応するデータ位置Ｓｓｅｅｋを検索する。データ位置Ｓｓｅｅｋはデータの先頭からの位置を示す。
【００２０】
以下に、図４を参照し、データ位置Ｓｓｅｅｋの検索方法を説明する。
【００２１】
図４のように、多重化データ自身のデータサイズＳａｌｌに対して全体再生時間Ｔａｌｌと指定時刻までの再生時間Ｔｓｅｅｋとの比率を用いて、数１のようにデータ位置Ｓｓｅｅｋを検索する。
【００２２】
【数１】

【００２３】
〔ステップＳＴ４〕
次に、ＣＰＵ１２は、多重化データの中から検索したデータ位置Ｓｓｅｅｋ以降の符号化動画像データのパケットＶ４を分離する。次に、そのパケットＶ４の中から一番先頭のフレームを分離する。分離されたフレームは蓄積メモリ１３に蓄積される。
【００２４】
〔ステップＳＴ５〕
次に、ＣＰＵ１２は、蓄積されたフレームを復号化する。
【００２５】
〔ステップＳＴ６〕
次に、ＣＰＵ１２は、復号化したフレームＶ４をフレームバッファ１４に蓄積する。
【００２６】
〔ステップＳＴ７〕
次に、入出力インターフェイス１５は、復号化したフレームを表示装置２へ出力する。
【００２７】
以上のように、第１の実施形態では、シーク動作時にデータサイズＳａｌｌと、全体再生時間Ｔａｌｌと、指定時刻までの再生時間Ｔｓｅｅｋとを用いて、データ位置Ｓｓｅｅｋを決定することによって、フレーム情報が存在しない場合もシーク動作を行うことができる。
【００２８】
なお、シーク動作後に通常再生を行う場合では、シーク動作によって表示したフレーム以降の符号化動画像データをデータの先頭に近い順番にて復号化して、復号化した順番にて表示装置２へ出力する。
【００２９】
（第２の実施形態）
第２の実施形態による動画像再生システムの全体構成は図１に示したものと同じであるがＣＰＵ１２の動作が異なる。第２の実施形態は、データ位置Ｓｓｅｅｋを検索する際、データサイズＳａｌｌと全体再生時間Ｔａｌｌとに代えてビットレートＲｍｕｘを用いる。
【００３０】
次に、第２の実施形態による動画像再生システムにおけるシーク動作について、図５を参照しつつ説明する。
【００３１】
〔ステップＳＴ１〕
入出力インターフェイス１１を介して多重化データ１００が動画像復号化装置１に入力される。
【００３２】
〔ステップＳＴ１１〕
次に、ＣＰＵ１２は、多重化データ１００に含まれるストリーム情報の中から、ビットレートＲｍｕｘを取得する。また、指定時刻情報２００が示す指定時刻を取得して、データの先頭から指定時刻までの再生時間Ｔｓｅｅｋを求める。ビットレートＲｍｕｘは、単位時間あたりに動画像復号化装置１へ入力される多重化データ１００のデータ量を示す。
【００３３】
〔ステップＳＴ１２〕
次に、ＣＰＵ１２は、ビットレートＲｍｕｘと、指定時刻までの再生時間Ｔｓｅｅｋとを用いて多重化データ１００の中から指定時刻に対応するデータ位置Ｓｓｅｅｋを検索する。データ位置Ｓｓｅｅｋは多重化データ１００の先頭に対しての位置を示す。
【００３４】
以下に、図６を参照してデータ位置Ｓｓｅｅｋの検索方法を説明する。
【００３５】
ビットレートＲｍｕｘは単位時間あたりに出力するデータ量を示すので、指定時刻までを再生する時間Ｔｓｅｅｋとの積を求めるとデータ位置Ｓｓｅｅｋが算出される。つまり、数２のように示すことができる。
【００３６】
【数２】

【００３７】
次に、第１の実施形態と同様に、ステップＳＴ４〜ステップＳＴ７における処理が行われる。
【００３８】
以上のように、第２の実施形態によれば、ビットレート情報Ｒｍｕｘと、指定時刻までの再生時間Ｔｓｅｅｋとを用いてデータ位置Ｓｓｅｅｋを検索することによって、フレーム情報がない場合もシーク動作を行うことができる。
【００３９】
（第３の実施形態）
第３の実施形態による動画像再生システムの全体構成は図１に示したものと同じであるがＣＰＵ１２の動作が異なる。多重化データの中にフレーム情報がない場合で、かつ動画像データ、音声・オーディオデータ、テキストデータのどれか１つのデータが、それぞれに設定された一定のサイズ以上同じデータが並んで配置されていた場合、第１〜第２の実施形態で説明した手法を用いても、ユーザから指示された表示時刻のフレームデータを表示するシーク動作を行うことが困難となってしまう。そこで第３の実施形態では、そのような多重化データに対しても動画像データ、音声・オーディオデータ、テキストデータがそれぞれ一定サイズを超えないような形式へと再構成することで、容易にシーク動作を行うことを可能とする。
【００４０】
次に、第３の実施形態による動画像再生システムにおけるシーク動作について、図７を参照しつつ説明する。
【００４１】
〔ステップＳＴ１〕
入出力インターフェイス１１を介して多重化データ１００が動画像復号化装置１に入力される。
【００４２】
〔ステップＳＴ２１〕
次に、ＣＰＵ１２は、多重化データ１００の先頭からデータを走査し多重化データ１００に含まれている各符号化データ（動画像データ，音声・オーディオデータ，テキストデータ）の配列単位（多重化単位）を調べる。各符号化データの配列単位は、同じ種類の符号化データのパケットが連続する区間を示す。具体的には、図８のように音声・オーディオデータ（Ａ）の配列単位はＡ１〜Ａ６の区間（６パケット）であり、動画像データ（Ｖ）の配列単位はＶ１〜Ｖ５の区間（５パケット）であり、テキストデータ（Ｔ）の配列単位はＴ１〜Ｔ５（５パケット）である。各データの配列単位（Ａ，Ｖ，Ｔ）を調べた結果が図８のように各データに定められたサイズ（Ａｓ，Ｖｓ，Ｔｓ）を超える場合にはステップＳＴ２２へ進み、各データに定められたサイズを超えない場合には、ステップＳＴ２に進む。
【００４３】
〔ステップＳＴ２２〕
次に、ＣＰＵ１２は、動画像データ、音声オーディオデータ、テキストデータをそれぞれのデータに対して定められたサイズ以下で連続するように分割し、分割されたそれぞれのデータに対してデータの種類に関係なく表示時間の順番に並べて多重化する。すなわち、各符号化データの配列単位を定められたサイズ以下になるように分割し、分割した各データを、図９のように各符号化データの先頭に存在するものから順に多重化する。
【００４４】
〔ステップＳＴ２〜ステップＳＴ７〕
次に、第１の実施形態と同様に、ステップＳＴ２からステップＳＴ７の処理が行われる。
【００４５】
以上のように、多重化データ１００を再構成することによって、多重化データ１００に含まれる各データの配列単位が定められたサイズを超える場合でかつフレーム情報が存在しない場合でもシーク動作を行うことができる。
【００４６】
なお、ステップＳＴ２１より後の処理（ステップＳＴ２〜ステップＳＴ７）は、第２の実施形態のステップＳＴ１１〜ステップＳＴ７における処理を代わりに行ってもよい。
【００４７】
（第４の実施形態）
第４の実施形態による動画像再生システムの全体構成は図１に示したものと同じであるがＣＰＵ１２の動作が異なる。第４の実施形態は、第３の実施形態において構成を新たにした多重化データに対してフレーム情報を作成し付加する。
【００４８】
次に、第４の実施形態による動画像再生システムにおけるシーク動作について、図１０を参照しつつ説明する。
【００４９】
〔ステップＳＴ１〕
入出力インターフェイス１１を介して多重化データ１００が動画像復号化装置１に入力される。
【００５０】
〔ステップＳＴ２１，ステップＳＴ２２〕
次に、ステップＳＴ２１における処理を行い、多重化データ１００に含まれている各符号化データの配列単位（Ａ，Ｖ，Ｔ）を調べる。各符号化データの配列単位を調べた結果が図８のように定められたサイズ（Ａｓ，Ｖｓ，Ｔｓ）を超える場合にはステップＳＴ２２における処理を行う。定められたサイズを超えない場合にはステップＳＴ３１に進む。
【００５１】
〔ステップＳＴ３１〕
次に、ＣＰＵ１２は、多重化データ１００またはステップＳＴ２２で新たに構成した多重化データを解析する。つまり、多重化データ（多重化データ２００または新たに構築された多重化データ）に含まれる各符号化データの配列単位の並びを調べて表示時刻情報を取得する。ＣＰＵ１２は、表示時刻情報を基にフレーム情報（図１１参照）を作成して付加する。
【００５２】
〔ステップＳＴ３２〕
次に、ＣＰＵ１２は、フレーム情報の表示時刻（ＰＴＳ情報）とユーザによって指示された指定時刻とを比較する。そして、最も時間軸上で近い表示時刻に対応する符号化動画像データのフレームをフレーム情報を付加した多重化データの中から検索する。
【００５３】
〔ステップＳＴ３３〕
次に、ＣＰＵ１２は、検索したフレーム以降の符号化動画像データを多重化データ１００またはステップＳＴ２２で新たに構成した多重化データより分離する。分離された符号化動画像データのフレームは蓄積メモリ１３に蓄積される。
【００５４】
〔ステップＳＴ５からステップＳＴ７〕
次に、ステップＳＴ５〜ステップＳＴ７における処理が行われる。
【００５５】
以上のように、フレーム情報を作成し多重化データ１００に付加することによって、ユーザが指示した時刻に対応するデータ位置をフレーム情報を参照するだけで取得でき、容易にシーク動作を行うことができる。
【００５６】
【発明の効果】
この発明による動画像復号化装置では、フレーム情報が含まれていない多重化データに対してユーザが指示する時刻のフレームを表示するシーク動作を行う場合、従来の技術ではシーク動作自体を行うことが困難であったが、多重化データのファイルサイズ、もしくはビットレートを用いることでシーク動作が可能となる。また、多重化データを再構成する手段を備えることにより、より容易にシーク動作を行うことができる形式へと再構成することができる。
【図面の簡単な説明】
【図１】この発明の第１の実施形態における動画像復号化装置の全体構成を示すブロック図である。
【図２】図１に示した動画像復号化装置による処理の手順を示したフローチャートである。
【図３】図１に示した多重化データの一例である。
【図４】図２に示した指定時刻に対応するデータ位置の検索方法を示す図である。
【図５】この発明の第２の実施形態における処理の手順を示したフローチャートである。
【図６】図５に示した指定時刻に対応するデータ位置の検索方法を示す図である。
【図７】この発明の第３の実施形態における処理の手順を示したフローチャートである。
【図８】各データに定められたサイズを超えて連続して各データが並んでいる多重化データの一例である。
【図９】図７に示した多重化データを並び替える方法について示す図である。
【図１０】この発明の第４の実施形態における処理の手順を示したフローチャートである。
【図１１】フレーム情報の一例である。
【符号の説明】
１動画像復号化装置
２表示装置
１１，１５入出力インターフェイス
１２ＣＰＵ
１３蓄積メモリ
１４フレームバッファ
１００多重化データ
２００指定時刻情報
[0001]
[Technical field of the invention]
The present invention relates to an apparatus for decoding encoded moving image data, and more particularly to controlling a seek operation on encoded moving image data.
[0002]
[Prior art]
Today, advances in information technology have made it possible to play back and enjoy audio and moving images (video) that are compressed and stored on storage media in accordance with multiplexing standards such as ASF and MP4. When such AV data (multiplexed data) is decoded and played back, the seek operation (an operation that searches for the frame corresponding to a time specified by the user, hereinafter the specified time) is an important function for searching content easily and quickly. Conventionally, the frame targeted by a seek has been located using frame information included in the multiplexed data. When the multiplexed data includes frame information, comparing the specified time with the PTS information in the frame information, which gives the display time of each reference-image frame, makes it possible to determine uniquely where in the multiplexed data the encoded moving image data of the frame closest in time is located.
[0003]
[Problems to be solved by the invention]
Users can now also multiplex audio and moving images (video) that they have created themselves and store them in compressed form on storage media. However, some user-created multiplexed data (AV data) does not include frame information. It is difficult to perform a seek operation on such multiplexed data using the method described above.
[0004]
An object of the present invention is to provide a moving picture decoding apparatus capable of performing a seek operation on encoded moving picture data having no frame information.
[0005]
[Means for Solving the Problems]
According to one aspect of the present invention, a moving picture decoding apparatus is an apparatus that decodes encoded moving image data, and includes an acquisition unit, a search unit, and a decoding unit. The encoded moving image data includes a plurality of frames. The encoded moving image data is multiplexed into AV data. The acquisition unit acquires the data size of the AV data, the playback time of the AV data, and a specified time. The specified time indicates a display time specified from outside. The search unit searches the AV data for the data position corresponding to the specified time using the information obtained by the acquisition unit (the data size of the AV data, the playback time of the AV data, and the specified time). The decoding unit decodes the frame corresponding to the data position obtained by the search unit.
[0006]
In the moving picture decoding apparatus, the position of the designated time on the time axis in the reproduction time of the AV data is obtained using the data size of the AV data, the reproduction time of the AV data, and the designated time. Specifically, the data position including the frame data corresponding to the specified time is calculated based on the data size of the entire AV data and the ratio between the reproduction time of the entire AV data and the specified time. Therefore, it is possible to perform a seek operation on encoded moving image data having no frame information.
[0007]
According to another aspect of the present invention, a moving picture decoding apparatus is an apparatus that decodes encoded moving image data, and includes an acquisition unit, a search unit, and a decoding unit. The encoded moving image data includes a plurality of frames. The encoded moving image data is multiplexed into AV data. The acquisition unit acquires a bit rate and a specified time. The bit rate indicates the amount of AV data consumed per unit time. The search unit searches for the data position corresponding to the specified time using the bit rate.
[0008]
In the moving picture decoding apparatus, the bit rate indicates the amount of AV data consumed by being input to the moving picture decoding apparatus per unit time. The search unit calculates the data position of the frame data corresponding to the specified time (data position from the beginning in the AV data) using the bit rate and the specified time. Therefore, it is possible to perform a seek operation on encoded moving image data having no frame information.
[0009]
Preferably, in the AV data, the encoded moving image data and other encoded data are multiplexed in a predetermined arrangement unit. The moving picture decoding apparatus further includes a determination unit and a reconstruction unit. The determination unit determines whether the data size of the arrangement unit of the AV data exceeds a predetermined size. When the determination unit determines that the data size of the arrangement unit exceeds the predetermined size, the reconstruction unit changes the order of the encoded data within the AV data so that the data size of the arrangement unit is equal to or less than the predetermined size.
[0010]
When a seek operation is performed on AV data that has no frame information, the operation may be impossible depending on the order in which the encoded data included in the AV data is combined. For example, suppose that the AV data is multiplexed so that the encoded moving image data (A) occupies the first half of the data and other encoded data (B) occupies the second half. Suppose further that a data position is searched for in this AV data using the relationship between the total playback time and the specified time, and that the position three quarters of the way from the beginning of the data is returned. In that case only the other encoded data (B) is decoded, and no encoded moving image data is decoded at all. Thus, when the order in which the encoded data is combined in the AV data is heavily biased, the seek operation cannot be performed properly.
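The failure case described in this paragraph can be illustrated with a small sketch. The layout model below (a list of labelled segments with byte sizes) and the function name `next_video_offset` are assumptions made for illustration only; they do not appear in the specification.

```python
from typing import List, Optional, Tuple

# Hypothetical layout model: (data type label, size in bytes).
Segment = Tuple[str, int]


def next_video_offset(segments: List[Segment], fraction: float,
                      video_label: str = "A") -> Optional[int]:
    """Return the offset of the first video segment that starts at or after
    fraction * total size, or None if no video segment starts there."""
    total = sum(size for _, size in segments)
    target = fraction * total
    offset = 0
    for label, size in segments:
        if label == video_label and offset >= target:
            return offset
        offset += size
    return None


# Video (A) in the first half, other data (B) in the second half.
biased = [("A", 500), ("B", 500)]
# Same totals, but finely interleaved.
interleaved = [("A", 100), ("B", 100)] * 5

print(next_video_offset(biased, 0.75))       # None: only (B) lies beyond the 3/4 point
print(next_video_offset(interleaved, 0.75))  # 800
```

With the biased layout no video data can be found beyond the three-quarter point, whereas the interleaved layout yields a nearby video segment.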
[0011]
In the moving picture decoding apparatus, when the data size of the arrangement unit of any encoded data included in the AV data exceeds the predetermined size, the order of the encoded data within the AV data is changed so that the data size of the arrangement unit is equal to or less than the predetermined size. In other words, the order of the encoded data within the AV data is changed when the ordering of the encoded data is heavily biased. This resolves the problem described above and allows the seek operation to be performed.
[0012]
According to yet another aspect of the present invention, a moving picture decoding apparatus is an apparatus that decodes encoded moving image data multiplexed into AV data, and includes a determination unit, a reconstruction unit, and a frame information creation unit. In the AV data, the encoded moving image data and other encoded moving image data are multiplexed in a predetermined arrangement unit. The encoded moving image data includes a plurality of frames. The determination unit determines whether the data size of the arrangement unit of the AV data exceeds a predetermined size. When the determination unit determines that the data size of the arrangement unit exceeds the predetermined size, the reconstruction unit changes the order of the encoded data within the AV data so that the data size of the arrangement unit is equal to or less than the predetermined size. The frame information creation unit creates frame information for the AV data whose order has been changed by the reconstruction unit, and adds the created frame information to that AV data. The frame information includes information indicating the order, on the time axis, of the frames included in the AV data.
[0013]
In the video decoding apparatus, frame information is added to AV data in which the order of combining encoded data is changed. Therefore, the seek operation can be performed by comparing the display time of the frame information with the user-specified time and searching for the display time of the closest frame on the time axis.
[0014]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. In the drawings, the same or corresponding parts are denoted by the same reference numerals, and description thereof will not be repeated.
[0015]
(First embodiment)
FIG. 1 shows the overall configuration of a moving image playback system according to the first embodiment. This system reproduces encoded moving image data included in AV data (multiplexed data 100) multiplexed according to a predetermined multiplexing standard (for example, ASF or MP4). In this system, encoded moving image data can be reproduced from a position (time) designated by the user (seek operation). This system includes a moving picture decoding apparatus 1 and a display apparatus 2. The moving picture decoding apparatus 1 includes input/output interfaces 11 and 15, a CPU 12, a storage memory 13, and a frame buffer 14. The input/output interface 11 performs input processing of multiplexed data 100 from the outside. The CPU 12 analyzes the multiplexed data 100, separates the multiplexed data 100, decodes the encoded moving image data, and performs overall control according to the specified time information 200. The designated time information 200 indicates a display time (designated time) designated by the user. The storage memory 13 stores the multiplexed data 100 and the encoded moving image data. The frame buffer 14 stores the decoded moving image data (frame data). The input/output interface 15 outputs the frame data stored in the frame buffer 14 to the display device 2.
[0016]
Next, the operation of the moving image reproduction system shown in FIG. 1 will be described. Here, the seek operation will be described. The seek operation is a process of displaying the frame in the multiplexed data 100 (AV data) that corresponds to the display time designated by the user. Hereinafter, the seek operation will be described with reference to FIG. 2.
[0017]
[Step ST1]
Multiplexed data 100 is input to the moving picture decoding apparatus 1 via the input/output interface 11. An example of the multiplexed data 100 is shown in FIG. 3. The multiplexed data 100 is data (AV data) in which encoded moving image data (V), encoded voice/audio data (A), encoded text data (T), and the like are multiplexed in a predetermined arrangement unit. In the multiplexed data 100 shown in FIG. 3, the arrangement unit of each of the encoded moving image data (V), the encoded voice/audio data (A), and the encoded text data (T) is one packet, and each packet contains at least one frame.
[0018]
[Step ST2]
Next, the CPU 12 acquires from the stream information included in the multiplexed data 100 the data size Sall of the multiplexed data itself and the total playback time Tall for playing back the entire data. Also, the designated time indicated by the designated time information 200 is acquired, and the reproduction time Tseek from the beginning of the data to the designated time is obtained.
[0019]
[Step ST3]
Next, the CPU 12 searches the multiplexed data 100 for a data position Sseek corresponding to the specified time using the data size Sall, the total playback time Tall, and the playback time Tseek until the specified time. The data position Sseek indicates the position from the beginning of the data.
[0020]
Hereinafter, a method of searching for the data position Sseek will be described with reference to FIG. 4.
[0021]
As shown in FIG. 4, the data position Sseek is obtained, as in Equation 1, by multiplying the data size Sall of the multiplexed data itself by the ratio of the playback time Tseek up to the specified time to the total playback time Tall.
[0022]
[Expression 1]
$$S_{\mathrm{seek}} = S_{\mathrm{all}} \times \frac{T_{\mathrm{seek}}}{T_{\mathrm{all}}}$$
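The ratio calculation described in paragraph [0021] can be sketched as follows. This is a minimal illustration rather than the apparatus's actual implementation: the function name, the use of bytes and seconds as units, and the clamping of out-of-range times are all assumptions not stated in the specification.

```python
def seek_position_by_ratio(s_all: int, t_all: float, t_seek: float) -> int:
    """Estimate the data position Sseek (offset from the start of the data)
    as Sall * Tseek / Tall."""
    if s_all < 0 or t_all <= 0:
        raise ValueError("s_all must be >= 0 and t_all must be > 0")
    # Assumption: a requested time outside [0, Tall] is clamped to that range.
    t = min(max(t_seek, 0.0), t_all)
    return int(s_all * t / t_all)


# A 1,000,000-byte file lasting 100 s, seeking to the 25 s mark.
print(seek_position_by_ratio(1_000_000, 100.0, 25.0))  # 250000
```

The estimate is only as good as the assumption that data is consumed at a roughly constant rate over the whole playback time.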

[0023]
[Step ST4]
Next, the CPU 12 separates, from the multiplexed data, the packet V4 of encoded moving image data located after the retrieved data position Sseek. It then separates the first frame from the packet V4. The separated frame is stored in the storage memory 13.
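Step ST4 can be sketched as a scan for the first video packet after the computed position. The `Packet` record, its fields, and the rule that a packet qualifies when it starts at or after Sseek are assumptions for illustration; the specification does not define how packet boundaries are represented.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Packet:
    kind: str    # "V" (video), "A" (audio) or "T" (text); hypothetical labels
    offset: int  # byte offset of the packet from the start of the data
    size: int    # packet size in bytes


def first_video_packet_after(packets: Sequence[Packet], s_seek: int) -> Optional[Packet]:
    """Return the first video packet that starts at or after s_seek, or None."""
    for packet in packets:
        if packet.kind == "V" and packet.offset >= s_seek:
            return packet
    return None


stream = [
    Packet("A", 0, 10), Packet("V", 10, 10), Packet("T", 20, 10),
    Packet("A", 30, 10), Packet("V", 40, 10),
]
print(first_video_packet_after(stream, 15))  # Packet(kind='V', offset=40, size=10)
```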
[0024]
[Step ST5]
Next, the CPU 12 decodes the accumulated frame.
[0025]
[Step ST6]
Next, the CPU 12 stores the decoded frame V4 in the frame buffer 14.
[0026]
[Step ST7]
Next, the input/output interface 15 outputs the decoded frame to the display device 2.
[0027]
As described above, in the first embodiment, the data position Sseek is determined during a seek operation using the data size Sall, the total playback time Tall, and the playback time Tseek up to the specified time, so the seek operation can be performed even when no frame information exists.
[0028]
When normal playback is performed after the seek operation, the encoded moving image data following the frame displayed by the seek operation is decoded in order of proximity to the beginning of the data and is output to the display device 2 in the order in which it was decoded.
[0029]
(Second Embodiment)
The overall configuration of the moving image playback system according to the second embodiment is the same as that shown in FIG. 1, but the operation of the CPU 12 is different. In the second embodiment, when searching for the data position Sseek, the bit rate Rmux is used instead of the data size Sall and the total playback time Tall.
[0030]
Next, a seek operation in the moving image reproduction system according to the second embodiment will be described with reference to FIG. 5.
[0031]
[Step ST1]
Multiplexed data 100 is input to the moving picture decoding apparatus 1 via the input/output interface 11.
[0032]
[Step ST11]
Next, the CPU 12 acquires the bit rate Rmux from the stream information included in the multiplexed data 100. Also, the designated time indicated by the designated time information 200 is acquired, and the reproduction time Tseek from the beginning of the data to the designated time is obtained. The bit rate Rmux indicates the amount of multiplexed data 100 input to the video decoding device 1 per unit time.
[0033]
[Step ST12]
Next, the CPU 12 searches the multiplexed data 100 for the data position Sseek corresponding to the specified time using the bit rate Rmux and the reproduction time Tseek until the specified time. The data position Sseek indicates a position with respect to the head of the multiplexed data 100.
[0034]
Hereinafter, a method for searching for the data position Sseek will be described with reference to FIG. 6.
[0035]
Because the bit rate Rmux indicates the amount of data output per unit time, the data position Sseek is obtained as the product of Rmux and the playback time Tseek up to the specified time. That is, it can be expressed as in Equation 2.
[0036]
[Expression 2]
$$S_{\mathrm{seek}} = R_{\mathrm{mux}} \times T_{\mathrm{seek}}$$
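The product described in paragraph [0035] can be sketched as follows. The function name is hypothetical, and the units are assumptions: the specification only says that Rmux is a data amount per unit time, so this sketch takes Rmux in bits per second and returns a byte offset. The optional clamp to the file size is also an addition for illustration.

```python
from typing import Optional


def seek_position_by_bitrate(r_mux_bps: float, t_seek: float,
                             s_all: Optional[int] = None) -> int:
    """Estimate the data position Sseek as Rmux * Tseek.

    Assumptions: r_mux_bps is in bits per second, t_seek is in seconds,
    and the result is a byte offset from the start of the data.
    """
    if r_mux_bps < 0:
        raise ValueError("bit rate must be non-negative")
    offset = int(r_mux_bps * max(t_seek, 0.0) / 8)  # convert bits to bytes
    if s_all is not None:
        offset = min(offset, s_all)  # do not point past the end of the data
    return offset


# 800 kbit/s stream, seeking to the 10 s mark.
print(seek_position_by_bitrate(800_000, 10.0))  # 1000000
```

Unlike the ratio method, this variant needs neither the total data size nor the total playback time, which matches the motivation given for the second embodiment.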

[0037]
Next, similarly to the first embodiment, the processes in steps ST4 to ST7 are performed.
[0038]
As described above, according to the second embodiment, the data position Sseek is searched for using the bit rate information Rmux and the playback time Tseek up to the specified time, so the seek operation can be performed even when there is no frame information.
[0039]
(Third embodiment)
The overall configuration of the moving image playback system according to the third embodiment is the same as that shown in FIG. 1, but the operation of the CPU 12 is different. When the multiplexed data contains no frame information and any one of the moving image data, voice/audio data, and text data is laid out contiguously over more than the fixed size set for that type of data, it is difficult to perform a seek operation that displays the frame data at the display time specified by the user, even with the methods described in the first and second embodiments. The third embodiment therefore reconstructs such multiplexed data into a format in which the moving image data, voice/audio data, and text data each stay within a fixed size, which makes the seek operation easy to perform.
[0040]
Next, a seek operation in the moving image reproduction system according to the third embodiment will be described with reference to FIG. 7.
[0041]
[Step ST1]
Multiplexed data 100 is input to the moving picture decoding apparatus 1 via the input/output interface 11.
[0042]
[Step ST21]
Next, the CPU 12 scans the multiplexed data 100 from its beginning and examines the arrangement unit (multiplexing unit) of each type of encoded data (moving image data, voice/audio data, text data) included in the multiplexed data 100. The arrangement unit of each type of encoded data is a section in which packets of the same type of encoded data are contiguous. Specifically, as shown in FIG. 8, the arrangement unit of the voice/audio data (A) is the section A1 to A6 (6 packets), the arrangement unit of the moving image data (V) is the section V1 to V5 (5 packets), and the arrangement unit of the text data (T) is the section T1 to T5 (5 packets). If an arrangement unit (A, V, T) exceeds the size (As, Vs, Ts) determined for that type of data, as in FIG. 8, the process proceeds to step ST22; if no arrangement unit exceeds its determined size, the process proceeds to step ST2.
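The check in step ST21 amounts to measuring runs of consecutive same-type packets and comparing each run with a per-type limit. The sketch below assumes a hypothetical representation of the stream as `(kind, size)` pairs and a `limits` mapping standing in for As, Vs, and Ts; the packet sizes and limit values are made up for the example.

```python
from itertools import groupby
from typing import Dict, List, Tuple

PacketInfo = Tuple[str, int]  # (data type label, packet size in bytes); hypothetical


def arrangement_units(packets: List[PacketInfo]) -> List[Tuple[str, int, int]]:
    """Group consecutive packets of the same type.

    Returns one (kind, total size, packet count) tuple per contiguous run.
    """
    units = []
    for kind, run in groupby(packets, key=lambda p: p[0]):
        sizes = [size for _, size in run]
        units.append((kind, sum(sizes), len(sizes)))
    return units


def needs_reconstruction(packets: List[PacketInfo], limits: Dict[str, int]) -> bool:
    """True if any arrangement unit exceeds the limit set for its data type."""
    return any(total > limits[kind] for kind, total, _ in arrangement_units(packets))


# Layout resembling FIG. 8: A1..A6, then V1..V5, then T1..T5.
fig8_like = [("A", 100)] * 6 + [("V", 100)] * 5 + [("T", 100)] * 5
limits = {"A": 200, "V": 200, "T": 200}  # stand-ins for As, Vs, Ts

print(arrangement_units(fig8_like))             # [('A', 600, 6), ('V', 500, 5), ('T', 500, 5)]
print(needs_reconstruction(fig8_like, limits))  # True
```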
[0043]
[Step ST22]
Next, the CPU 12 divides the moving image data, the voice/audio data, and the text data into runs no larger than the size determined for each type of data, and multiplexes the divided pieces in order of display time regardless of data type. That is, the arrangement unit of each type of encoded data is divided so as not to exceed its determined size, and the divided pieces are multiplexed in order, starting from the pieces at the head of each type of encoded data, as shown in FIG. 9.
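One possible reading of step ST22 is sketched below: each data type is split into chunks no larger than its limit, and the chunks are then interleaved round-robin, starting with the chunk at the head of each type. The `(kind, size)` packet model and the function names are hypothetical. Round-robin interleaving is a stand-in for display-time ordering and matches it only if the chunks of each type cover comparable time spans, which is an assumption. A single packet larger than its limit is kept whole.

```python
from itertools import zip_longest
from typing import Dict, List, Tuple

PacketInfo = Tuple[str, int]  # (data type label, packet size in bytes); hypothetical


def split_into_chunks(sizes: List[int], limit: int) -> List[List[int]]:
    """Split packet sizes into consecutive chunks whose totals stay within limit."""
    chunks, current, total = [], [], 0
    for size in sizes:
        if current and total + size > limit:
            chunks.append(current)
            current, total = [], 0
        current.append(size)
        total += size
    if current:
        chunks.append(current)
    return chunks


def reconstruct(packets: List[PacketInfo], limits: Dict[str, int]) -> List[PacketInfo]:
    """Re-multiplex packets so that no same-type run exceeds its limit."""
    by_kind: Dict[str, List[int]] = {}
    for kind, size in packets:
        by_kind.setdefault(kind, []).append(size)
    chunked = [
        [(kind, chunk) for chunk in split_into_chunks(sizes, limits[kind])]
        for kind, sizes in by_kind.items()
    ]
    result: List[PacketInfo] = []
    for round_items in zip_longest(*chunked):
        for item in round_items:
            if item is not None:
                kind, chunk = item
                result.extend((kind, size) for size in chunk)
    return result


packets = [("A", 100)] * 2 + [("V", 100)] * 2
print(reconstruct(packets, {"A": 100, "V": 100}))
# [('A', 100), ('V', 100), ('A', 100), ('V', 100)]
```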
[0044]
[Step ST2 to Step ST7]
Next, similarly to the first embodiment, the processing from step ST2 to step ST7 is performed.
[0045]
As described above, by reconstructing the multiplexed data 100, a seek operation can be performed even when the arrangement unit of each type of data included in the multiplexed data 100 exceeds its determined size and no frame information exists.
[0046]
Note that the processing after step ST21 (step ST2 to step ST7) may be replaced by the processing in step ST11 through step ST7 of the second embodiment.
[0047]
(Fourth embodiment)
The overall configuration of the moving image playback system according to the fourth embodiment is the same as that shown in FIG. 1, but the operation of the CPU 12 is different. In the fourth embodiment, frame information is created and added to the multiplexed data that has been reconstructed as in the third embodiment.
[0048]
Next, a seek operation in the moving image reproduction system according to the fourth embodiment will be described with reference to FIG. 10.
[0049]
[Step ST1]
Multiplexed data 100 is input to the moving picture decoding apparatus 1 via the input/output interface 11.
[0050]
[Step ST21, Step ST22]
Next, the process in step ST21 is performed, and the array unit (A, V, T) of each encoded data included in the multiplexed data 100 is examined. If the result of checking the array unit of each encoded data exceeds the size (As, Vs, Ts) defined as shown in FIG. 8, the process in step ST22 is performed. If the predetermined size is not exceeded, the process proceeds to step ST31.
[0051]
[Step ST31]
Next, the CPU 12 analyzes the multiplexed data 100 or the multiplexed data newly constructed in step ST22. That is, it examines the order of the arrangement units of each type of encoded data included in the multiplexed data (the multiplexed data 100 or the newly constructed multiplexed data) and obtains display time information. The CPU 12 then creates frame information (see FIG. 11) based on the display time information and adds it to the multiplexed data.
[0052]
[Step ST32]
Next, the CPU 12 compares the display time (PTS information) of the frame information with the designated time designated by the user. Then, the frame of the encoded moving image data corresponding to the display time closest on the time axis is searched from the multiplexed data to which the frame information is added.
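The comparison in step ST32 is a nearest-value lookup over the display times in the frame information. The sketch below assumes the frame information is available as a list of `(pts, offset)` pairs sorted by display time; this layout is hypothetical, since the format shown in FIG. 11 is not reproduced in the text. Ties are resolved in favour of the earlier frame, which is also an assumption.

```python
import bisect
from typing import List, Tuple

FrameEntry = Tuple[float, int]  # (display time in seconds, byte offset); hypothetical


def nearest_frame(frame_info: List[FrameEntry], t_seek: float) -> FrameEntry:
    """Return the entry whose display time is closest to t_seek.

    frame_info must be non-empty and sorted by display time.
    """
    if not frame_info:
        raise ValueError("frame_info must not be empty")
    times = [pts for pts, _ in frame_info]
    i = bisect.bisect_left(times, t_seek)
    if i == 0:
        return frame_info[0]
    if i == len(frame_info):
        return frame_info[-1]
    before, after = frame_info[i - 1], frame_info[i]
    # On a tie, prefer the earlier frame.
    return before if t_seek - before[0] <= after[0] - t_seek else after


table = [(0.0, 0), (1.0, 4000), (2.0, 9000)]
print(nearest_frame(table, 1.4))  # (1.0, 4000)
print(nearest_frame(table, 1.6))  # (2.0, 9000)
```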
[0053]
[Step ST33]
Next, the CPU 12 separates the encoded moving image data after the searched frame from the multiplexed data 100 or the multiplexed data newly configured in step ST22. The separated frame of encoded moving image data is stored in the storage memory 13.
[0054]
[Step ST5 to Step ST7]
Next, processing in step ST5 to step ST7 is performed.
[0055]
As described above, by creating the frame information and adding it to the multiplexed data 100, the data position corresponding to the time designated by the user can be obtained simply by referring to the frame information, and the seek operation can be performed easily.
[0056]
[Effects of the invention]
With the conventional technique, it is difficult to perform a seek operation that displays the frame at a user-specified time on multiplexed data that does not include frame information. The moving picture decoding apparatus according to the present invention makes such a seek operation possible by using the file size or the bit rate of the multiplexed data. In addition, by providing means for reconstructing the multiplexed data, the data can be reconstructed into a format on which a seek operation can be performed more easily.
[Brief description of the drawings]
FIG. 1 is a block diagram showing an overall configuration of a moving picture decoding apparatus according to a first embodiment of the present invention.
FIG. 2 is a flowchart showing a processing procedure performed by the video decoding device shown in FIG. 1.
FIG. 3 is an example of multiplexed data shown in FIG. 1.
FIG. 4 is a diagram showing a method for searching for a data position corresponding to the designated time shown in FIG. 2.
FIG. 5 is a flowchart showing a processing procedure in the second embodiment of the present invention.
FIG. 6 is a diagram showing a method for searching for a data position corresponding to the designated time.
FIG. 7 is a flowchart showing a processing procedure in the third embodiment of the present invention.
FIG. 8 is an example of multiplexed data in which each data is continuously arranged exceeding the size determined for each data.
FIG. 9 is a diagram showing a method for rearranging multiplexed data shown in FIG. 7.
FIG. 10 is a flowchart showing a processing procedure in the fourth embodiment of the present invention.
FIG. 11 is an example of frame information.
[Explanation of symbols]
1 Moving image decoding apparatus
2 Display apparatus
11, 15 Input/output interface
12 CPU
13 Storage memory
14 Frame buffer
100 Multiplexed data
200 Designated time information

Claims

An apparatus for decoding encoded moving image data multiplexed into AV data, wherein
the encoded moving image data includes a plurality of frames, and
the apparatus comprises:
an acquisition unit that acquires a data size of the AV data, a reproduction time of the AV data, and a designated time indicating a display time designated from outside;
a search unit that searches the AV data for a data position corresponding to the designated time, using the data size, the reproduction time, and the designated time acquired by the acquisition unit; and
a decoding unit that decodes a frame corresponding to the data position obtained by the search unit.

An apparatus for decoding encoded moving image data multiplexed into AV data, wherein
the encoded moving image data includes a plurality of frames,
the apparatus comprises:
an acquisition unit that acquires a bit rate of the AV data and a designated time indicating a display time designated from outside;
a search unit that searches the AV data for a data position corresponding to the designated time, using the bit rate and the designated time acquired by the acquisition unit; and
a decoding unit that decodes a frame corresponding to the data position obtained by the search unit, and
the bit rate indicates the amount of AV data consumed per unit time.

The moving picture decoding apparatus according to claim 1 or claim 2, wherein
the AV data is data in which
the encoded moving image data and other encoded data are multiplexed in predetermined array units, and
the moving picture decoding apparatus further comprises:
a determination unit that determines whether or not the data size of an array unit of the AV data exceeds a predetermined size; and
a reconstruction unit that, when the determination unit determines that the data size of the array unit of the AV data exceeds the predetermined size, changes the arrangement of the encoded data in the AV data so that the data size of the array unit of the AV data is equal to or less than the predetermined size.

An apparatus for decoding encoded moving image data multiplexed into AV data, wherein
the AV data is data in which
the encoded moving image data and other encoded data are multiplexed in predetermined array units,
the apparatus comprises:
a determination unit that determines whether or not the data size of an array unit of the AV data exceeds a predetermined size;
a reconstruction unit that, when the determination unit determines that the data size of the array unit of the AV data exceeds the predetermined size, changes the order of the encoded data in the AV data so that the data size of the array unit of the AV data is equal to or less than the predetermined size; and
a frame information creation unit that creates frame information for the AV data whose arrangement has been changed by the reconstruction unit, and adds the created frame information to the AV data, and
the frame information includes
information indicating the sequence, on the time axis, of the frames included in the AV data.