JP3664678B2

JP3664678B2 - Video playback device

Info

Publication number: JP3664678B2
Application number: JP2001383010A
Authority: JP
Inventors: 正晃兵頭; 寛草尾
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2001-12-17
Filing date: 2001-12-17
Publication date: 2005-06-29
Anticipated expiration: 2020-06-29
Also published as: JP2002238028A

Description

【０００１】
【発明の属する技術分野】
本発明は、ＣＤ−ＲＯＭなどの読み出し専用記録メディアを用いたディジタルビデオ再生装置や、光磁気ディスクなどの書き換え可能な記録メディアを用いたディジタルビデオ再生装置に適用して好適な動画像再生装置に関するものである。
【０００２】
【従来の技術】
磁気ディスク，光ディスク，光磁気ディスクといったディスク媒体は、高速なランダムアクセスが可能である。高速なランダムアクセスを用いれば、不連続な領域に記録されているデータを連続したデータのように再生することが可能になる。
【０００３】
一方、動画像データはデータレートが非常に高いため、データを加工することなく、前記ディスク媒体に記録するのは困難である。しかしながら、動画像データや音声データを高能率符号化することにより、視覚的あるいは聴覚的に劣化することなく、データレートをディスク媒体に記録可能なレートまで下げることができる。
【０００４】
高能率符号化方式の例としては、ＭＰＥＧ（Moving Picture Experts Group：エムペグ）方式がある。ＭＰＥＧは、ISO-IEC／JTC1／SC29／WG11 で規格化が進められており、ビデオ，オーディオの高能率符号化と、それらの同期再生のための多重化方式に関するものである。ＭＰＥＧでのビデオ符号化方式は、１２ビデオフレームや１５ビデオフレームといった、まとまった単位をＧＯＰ（Group Of Pictures）と呼び、ＧＯＰ内で予測符号化を行う。
【０００５】
図２０は、ＧＯＰの構造の例を示す図である。ＧＯＰ中のビデオフレームはＩフレームとＰフレームとＢフレームに分類される。Ｉフレームはフレーム内符号化する。ＰフレームはＩフレームから前方向の予測符号化するか、Ｐフレームから前方向の予測符号化する。ＢフレームはＩフレームとＰフレームの間もしくはＰフレーム間にあり、ＩフレームやＰフレームの両方向から予測符号化する。
【０００６】
このように、ＰフレームやＢフレームはＩフレームもしくはＩフレームから予測符号化されたビデオフレームから予測符号化するため、復号時には、まずＩフレームを復号しないとＧＯＰ内の全てのビデオフレームが復号できなくなる。すなわち、ランダムアクセスなどで、ビデオシーケンスの途中から再生する場合には、Ｉフレームから復号する必要がある。また、Ｐフレームは前方向からの予測のみ用いるため、比較的容易に復号することができる。
【０００７】
以下の説明では、ＩフレームとＰフレームをキーフレームと呼ぶことにする。また、ビデオシーケンスとは、記録が開始されてから記録が終了するまでの時間的に連続したビデオフレーム及び対応する符号化データを示す。
【０００８】
ＭＰＥＧのビデオ符号化では、平均の符号量はほぼ一定になるが、短時間では符号量は時間に比例しない。そのため、各フレームのビデオ符号化データの間隔も一定にならず、各フレームの符号化データの記録位置は一意的に定まらない。このため、高速再生のように連続しないビデオフレームを再生するのが難しい。
【０００９】
高能率に符号化された符号化データを用いて高速再生を実現する従来の例としては、例えば、特開平５−１５３５７７号公報に記載の方法がある。これは、再生時にディスクメディアのデータ読出速度を通常より高くし、得られた連続データから高速再生に必要なデータを選択し、高速再生を実現しようとするものである。
【００１０】
【発明が解決しようとする課題】
しかしながら、前記公報に記載の方法では、ランダムアクセスを用いずにデータ読出速度を上げ、通常より多く得られるデータから高速再生に必要なデータを選択しているため、データ読出速度に比例して高速再生の速度は高くなるが、データ読出速度には限界がある。すなわち、高速再生の速度も限界がある。
【００１１】
より高速な再生を実現するためには、ランダムアクセスを用いて、とびとびのデータを読み出すことが考えられる。すなわち、データの読み出しと高速なジャンプを繰り返し、ジャンプする量によって高速再生の速度を変化させることができる。
【００１２】
しかしながら、高能率符号化されたデータを読み出す場合は、ジャンプ時に以下のような問題点が生じる。すなわち、ジャンプし、ビデオシーケンスの途中から再生する場合には、前述したように、Ｉフレームから復号する必要があるが、符号量と時間が比例していないため、Ｉフレームの符号化データが記録されている開始点を見い出すことができないという問題点がある。
【００１３】
前述のように、前記公報に記載の方法では、所定の時間のデータにランダムアクセスすることができないので、後述する頭出しやポインタ編集の再生といったディスクのランダムアクセスを用いた機能も実現できない。頭出しとは、指定された時間からビデオ出力とオーディオ出力を同期させながら再生する機能で、指定された時間から再生するために、ランダムアクセスする必要がある。
【００１４】
従来の技術では、Ｉフレームの位置が不明なため、再生に必要な多重符号化データを読み出すまでの不要なデータを読み出すことがあり、頭出しに時間がかかるという問題点があった。また、頭出し位置も正確には規定できず、大まかな位置にアクセスするしかなかった。
【００１５】
ディスクメディアの高速再生では、先に述べたようにビデオフレームを間引きながらキーフレームだけを再生することで実現できる。この場合、ビデオ符号化データの読み出しと高速なランダムアクセスを繰り返すが、従来の技術では、キーフレームのビデオ符号化データの記録位置がわからず、ビデオ符号化データを読み出す際に、キーフレームを含めた広範囲の多重符号化データを読み出すため、単位時間当たりに再生できるビデオフレームの数が少なくなるという問題点があった。
【００１６】
ポインタ編集とは、再生開始点と終了点を複数指定し、時間的に離れた複数のデータをコピーすることなく論理的なポインタでつなぎ合わせる機能であり、再生時にはビデオ出力とオーディオ出力を同期させながら再生する。
【００１７】
この場合も、再生時に指定された領域から次の領域へランダムアクセスする必要がある。デコーダでは、データが得られない間も再生データを出力し、連続した再生データとして出力する必要がある。そのため、符号バッファを備え、符号化データが得られない間は、符号バッファに蓄えられている符号化データを用いて再生データを出力しなければならない。
【００１８】
前述したように、従来の技術では、ランダムアクセス時に不要なデータを読み出すことがあるため、復号に必要な符号が得られない時間が長く、より多くの符号バッファが必要になるという問題点があった。なお、ディスクメディアから読み出すデータは、多重化のためのヘッダが付加された多重符号化データのため、ビデオ出力とオーディオ出力を同期して再生する場合には、該ヘッダから読み出す必要がある。
【００１９】
図２１は、ＭＰＥＧ方式で記録される多重符号化データの概要を示す図である。多重符号化データは、パックと呼ばれる単位に分割され、パック毎に第１のヘッダが付加されている。パックは、分割されたビデオ符号化データやオーディオ符号化データ毎に第２のヘッダが付加されたデータの集まりである。
【００２０】
第１のヘッダには、ビデオ符号化データとオーディオ符号化データとの同期をとるための情報が含まれ、第２のヘッダには、そのヘッダ以降に続くデータの種類を示す情報が含まれる。従って、ビデオとオーディオを同期再生するためには、第１のヘッダから読み出す必要がある。
【００２１】
本発明は、このような実情に鑑みてなされたもので、符号量と時間が比例しない符号化データが記録されている場合においても、ランダムアクセス時に、必要なデータだけを読み出すことのできる動画像再生装置及び動画像記録装置を提供することを目的としている。
【００２２】
【課題を解決するための手段】
本願の第１の発明は、ビデオ符号化データとオーディオ符号化データとを含む多重符号化データが、該多重符号化データに対応する管理データとは別に記録されている記録メディアから、ビデオまたはオーディオの少なくとも一方を再生する動画像再生装置であって、前記ビデオ符号化データは、フレーム内符号化されたフレームまたは前方向予測符号化されたフレームで構成されるキーフレームの符号化データを含み、前記管理データは、前記キーフレームに関する前記多重符号化データ上での位置を示す位置情報を含み、前記記録メディアをコントロールして、前記記録メディアから前記管理データを読み出す手段と、前記記録メディアから読み出した管理データを保持する手段と、前記管理データに含まれる前記位置を可能とする手段と情報に基づいて、ランダムアクセスを可能とする手段とを備え、前記管理データを保持する手段が、前記管理データが記録される、第１のメモリ領域、第２のメモリ領域、第３のメモリ領域を含むメモリを有し、前記第１のメモリ領域に記録される管理データは、多重符号化データからなるシーケンスの再生順を示し、前記第２のメモリ領域に記録される管理データは、多重符号化データが記録された記録メディアの記録領域を示し、前記第３のメモリ領域に記録される管理データは、前記キーフレームに関する前記多重符号化データ上での位置を示すことを特徴とする。
本願の第２の発明は、前記管理データに含まれる前記位置情報は、キーフレームのビデオ符号化データの位置を示すビデオ位置情報、キーフレームに対応するオーディオ符号化データの位置を示すオーディオ位置情報、キーフレームからビデオとオーディオとを同期して再生するために読み出す必要がある多重符号化データの位置を示す多重化位置情報の少なくとも１つを含むことを特徴とする。
本願の第３の発明は、前記ビデオ位置情報が、キーフレームのビデオ符号化データの先頭位置または後端位置を含み、前記オーディオ位置情報が、キーフレームに対応するオーディオ符号化データの先頭位置または後端位置を含み、前記多重化位置情報が、キーフレームからビデオとオーディオとを同期して再生するために読み出す必要がある多重符号化データの先頭位置または後端位置を含むことを特徴とする。
本願の第４の発明は、ビデオとオーディオとを同期させて再生する場合は、前記多重化位置情報を用い、ビデオのみを再生する場合は、前記ビデオ位置情報を用いることを特徴とする。
本願の第５の発明は、前記記録メディアから多重符号化データを読み出し、管理データを構築することを特徴とする。
【００２５】
【発明の実施の形態】
本発明の実施例について、図面を参照して以下に説明する。
【００２６】
図１は、本発明による動画像記録再生装置の一実施例を説明するためのブロック図であり、図１中、１はデータ処理部、２は記録（ディスク）メディア、３は記録メディアコントローラ、４は符号化データ管理部、５はメモリコントローラ、５ａはビデオ記録情報管理部、５ｂはオーディオ記録情報管理部、５ｃは多重化記録情報管理部、６はメモリである。
【００２７】
データ処理部１は、エンコーダから入力される多重符号化データや符号化データ管理部４から入力されるデータをセクタ単位に分割し、記録メディア２の所定のセクタに記録するように出力する。セクタ番号は、記録メディアコントローラ３から入力される。
【００２８】
該記録メディアコントローラ３は、記録メディア２の読み出し，書き込みを制御する記録メディア制御データを出力するとともに、データ処理部１にデータを書き込みあるいは読み出す記録メディア２上でのセクタ番号を出力する。符号化データ管理部４には、予め記録メディアに記録されているあるいは新たに記録メディアに記録するビデオシーケンスに対応する管理データが記録されている。
【００２９】
符号化データ管理部４は、ビデオ記録情報管理部５ａとオーディオ記録情報管理部５ｂと多重化記録情報管理部５ｃとから成るメモリコントローラ５を有し、該メモリコントローラ５はメモリ６と接続されており、該メモリ６に格納されている管理データを制御する。
【００３０】
また、符号化データ管理部４は、ビデオデータを高能率に符号化したビデオ符号化データと、オーディオデータを高能率に符号化したオーディオ符号化データと、前記ビデオ符号化データと前記オーディオ符号化データとを多重化するための付加データとからなる多重符号化データに対して、該多重符号化データを記録メディアに蓄積するための管理を行うもので、フレーム内符号化されたビデオフレームもしくは前方向から予測符号化されたビデオフレームをキーフレームとして管理に用い、該キーフレームに対応する前記ビデオ符号化データと、前記オーディオ符号化データと、前記多重符号化データとのうちの１つもしくは２つもしくは全ての符号化データの記録メディア上での記録位置情報及び前後の符号化データとの連結情報を管理データして与えるものである。
【００３１】
以下に説明する実施例では、符号化方式として従来の符号化方式の例で挙げたＭＰＥＧ方式を用い、エンコーダから入力される多重符号化データは、図２１に示した構造であるとする。また、キーフレームのビデオ符号化データの先頭位置とキーフレームに対応するオーディオ符号化データの先頭位置のうち、先に記録されている先頭位置を含むパックをキーパックと呼ぶことにする。
【００３２】
まず、記録，再生装置に記録メディアが挿入されたときの動作について説明する。記録メディアコントローラ３は、記録メディア２に記録されている管理データを読み出すように記録メディア制御データを出力する。データ処理部１には記録メディア２から管理データが入力され、符号化データ管理部４に出力される。該符号化データ管理部４では、入力される管理データを所定のアドレスに記録する。
【００３３】
次に、符号化データの記録時の動作について説明する。この動作は、動画像再生専用装置の場合ではなく、記録可能な装置の場合の動作である。まず、データ処理部１では、エンコーダから入力される多重符号化データをセクタ単位に分割する。そして、記録メディアコントローラ３から入力されるセクタ番号に従って、記録メディア２上の所定のセクタに記録されるように符号化データを出力する。
【００３４】
また、データ処理部１は、エンコーダから入力される多重符号化データを記録メディア２に記録するに際して、記録領域が連続した部分について、最初のセクタ番号と最後のセクタ番号を符号化データ管理部４に出力する。記録時には、時間的に連続したビデオシーケンスを複数の領域に分割して記録できる記録装置もあれば、連続した領域にしか記録できない記録装置もある。前者の場合は、複数の領域に分割して記録した場合、それぞれの領域に対して、最初のセクタ番号と最後のセクタ番号が符号化データ管理部４に出力されることになる。
【００３５】
さらに、データ処理部１では、エンコーダで付加される図２１の第１のヘッダを検出する検出部と、キーフレームのビデオ符号化データ及びキーフレームに対応するオーディオ符号化データの開始と終了を検出する検出部を設けておき、キーフレームに対応するビデオ符号化データ，多重符号化データが記録される記録メディア上での位置情報を符号化データ管理部４に出力する。
【００３６】
ここで、キーフレームのビデオ符号化データの開始と終了は、ビデオ符号化データにおいて、ビデオフレーム単位に存在するヘッダを検出することで可能である。また、キーフレームに対応するオーディオ符号化データの開始と終了は、ビデオ符号化データ及びオーディオ符号化データに付加される時間情報から知ることができる。
【００３７】
また、第１のヘッダの開始を示すフラグ，キーフレームのビデオ符号化データ及びキーフレームに対応するオーディオ符号化データの開始と終了を示すフラグを出力するようにエンコーダを構成すれば、データ処理部１で第１のヘッダを検出する出力部及びキーフレームのビデオ符号化データ及びキーフレームに対応するオーディオ符号化データの開始と終了を検出する検出部を備える必要はない。
【００３８】
符号化データ管理部４内のメモリコントーラ５では、データ処理部１から入力されるデータから連結情報を生成する。このとき、ビデオ符号化データの記録位置情報はビデオ記録情報管理部５ａで管理し、オーディオ符号化データの記録位置情報はオーディオ記録位置情報管理部５ｂで管理し、多重符号化データの記録位置情報は多重化記録情報管理部５ｃで管理する。
【００３９】
そして、メモリ６の内容からビデオシーケンス，記録メディア上での記録領域，記録領域に記録されているキーフレームの連結情報を生成し、記録位置情報とともにメモリ６に記録する。この動作で、記録メディア２に符号化データが新たに記録された場合に、符号化データ管理部４には常に記録メディア２に記録されている符号化データに対応する最新の管理データが記録されていることになる。
【００４０】
記録メディア２に新たなデータを記録した場合、符号化データ管理部４には最新の管理データが記録されるが、記録メディア２に記録されている管理データは更新されていない。そこで、例えば、記録が終了した時点や装置から記録メディア２を取り出す時点で、符号化データ管理部４に記録されている管理データを記録メディア２に記録する必要がある。また、記録メディア２に符号化データ記録中であっても、定期的に管理データを更新すべく記録メディア２を制御してもよい。
【００４１】
この手順は、記録メディアコントローラ３が記録メディア２に管理データを書き込むように記録メディア２を制御し、符号化データ管理部４のデータを順次データ処理部１に入力するように制御する。データ処理部１では、符号化データ管理部４から入力されるデータを記録メディア２に出力する。
【００４２】
次に、通常再生時の動作を説明する。ここで、符号化データ管理部４には、既に記録メディア２から読み出した管理データが記録されているものとする。まず、符号化データ管理部４から再生しようとする多重符号化データが記録されている最初のセクタ番号と最後のセクタ番号が読み出され、記録メディアコントローラ３に入力される。
【００４３】
符号化データは、記録メディア２上で複数の領域に分割して記録していることもあるが、その場合には、各記録領域毎に領域の最初のセクタ番号と最後のセクタ番号が符号化データ管理部４から記録メディアコントローラ３に入力される。
【００４４】
図２は、符号化データが斜線で示した２つの領域に分割して記録されている場合の例を示す図である。実際の記録メディア２の記録領域は、複数の同心円状又はスパイラル状であるが、ここでは、横軸に時間軸を取って表している。
【００４５】
図２で符号化データを１,２の順番で読み出すとすると、符号化データ管理部４からは最初にａ,ｂの位置のセクタ番号が出力され、次にｃ,ｄの位置のセクタ番号が出力される。記録メディアコントローラ３は、符号化データ管理部４から入力されるセクタ番号に従って、記録メディア２から多重符号化データを読み出すように記録メディア制御データを出力する。データ処理部１には、記録メディア２から連続した多重符号化データが入力され、デコーダに出力される。
【００４６】
次に、頭出し時の動作について説明する。頭出し時には、符号化データ管理部４から再生すべきキーフレームに対応する多重符号化データの記録メディア上での記録位置情報が記録メディアコントローラ３に出力される。該記録メディアコントローラ３では、入力される記録位置情報を用いて、記録メディア２をキーパックの符号化データが記録されているセクタにアクセスするように制御する。データ処理部１の動作は通常再生と同一である。
【００４７】
次に、高速再生時の動作について説明する。高速再生時には、記録メディアコントローラ３は、キーフレームに対応する符号化データに連続してアクセスするように記録メディアを制御する。ＭＰＥＧ方式では、キーフレーム間に前方向，後方向の両方向から予測符号化するＢフレームがあるが、高速再生時には、Ｂフレームのビデオ符号化データは読み出さない。
【００４８】
まず、高速再生が指定された場合、符号化データ管理部４から再生すべきキーフレームの記録メディア上での位置情報が順序に記録メディアコントローラ３に出力される。該記録メディアコントローラ３では、入力される記録位置情報を用いて、記録メディア２をキーフレームのビデオ符号化データが記録されているセクタにアクセスするように制御する。
【００４９】
なお、ポインタ編集の再生時の動作は、高速再生とほぼ同一であり、高速再生の場合にはキーフレームに対応する符号化データに順序にランダムアクセスするが、ポインタ編集の再生の場合は、管理データで示された連結情報に従って、不連続な領域に記録されている符号化データにランダムアクセスする。
【００５０】
このように、本実施例では、ランダムアクセス時に読み出すべきデータの記録メディア上での記録位置を管理データとして符号化データ管理部４に格納しているため、不要なデータをほとんど読み出すことがなく、高速なランダムアクセスが可能になる。
【００５１】
次に、本発明の他の実施例について説明する。本実施例のブロック図は、図１に示したブロック図と同一である。本実施例では、符号化データ管理部４に記録される管理データは、多重符号化データの記録メディア上での記録位置情報として、キーパックの先頭位置を示すセクタ番号、ビデオ符号化データのメディア上での記録位置情報として、キーフレームのビデオ符号化データの先頭位置及び後尾位置を示すセクタ番号を含むものである。
【００５２】
図３（ａ）,（ｂ）は、キーパックの先頭位置の例を示す図である。キーパックとは、キーフレームのビデオ符号化データの先頭位置とキーフレームに対応するオーディオ符号化データの先頭位置のうち、先に記録されている先頭位置を含むパックを示す。図３（ａ）は、キーフレームのビデオ符号化データの先頭が先に記録されている場合、図３（ｂ）は、キーフレームに対応するオーディオ符号化データの先頭が先に記録されている場合の例である。なお、図中ｍ1，ｍ2，…，ｍ8は多重化用付加データである。
【００５３】
図４は、図１におけるデータ処理部のヘッダ検出回路のブロック図で、図４中、７は第１のヘッダ検出部、８はビデオヘッダ検出部、９はオーディオヘッダ検出部、１０は判定回路である。本実施例のデータ処理部１では、図４に示すようなヘッダを検出するヘッダ検出回路を有し、符号化データ記録時には、キーパックが記録される最初のセクタ番号及びキーフレームの符号化データの先頭と後尾が記録されるセクタ番号を符号化データ管理部４に出力する。
【００５４】
図４に示すヘッダ検出回路は、第１のヘッダ検出部７とビデオヘッダ検出部８とオーディオヘッダ検出部９と判定回路１０とで構成されており、第１のヘッダ検出部７，ビデオヘッダ検出部８，オーディオヘッダ検出部９には、エンコーダから多重符号化データが入力され、それぞれ図２１で示した第１のヘッダ、ビデオフレームの先頭に付加されているヘッダ，オーディオフレームの先頭に付加されているヘッダを検出する。
【００５５】
そして、ヘッダが検出された場合、判定回路１０にフラグを入力する。該判定回路１０では、第１のヘッダ検出部７，ビデオヘッダ検出部８，オーディオヘッダ検出部９から入力されるヘッダと、記録メディアコントローラ３から入力されるセクタ番号から、キーパックが記録される最初のセクタ番号及びキーフレームのビデオ符号化データの先頭と後尾が記録されるセクタ番号を符号化データ管理部４に出力する。
【００５６】
なお、図４の例で検出されるヘッダは、データ処理部１の前段にあるエンコーダで付加される。そこで、エンコーダからデータ処理部１に図２１で示した第１のヘッダ、ビデオフレームの先頭に付加されているヘッダ、オーディオフレームの先頭に付加されているヘッダが入力されることを示すフラグを入力する構成としておけば、データ処理部１に図４の第１のヘッダ検出部７，ビデオヘッダ検出部８，オーディオヘッダ検出部９を備える必要はない。
【００５７】
また、再生時において、通常は記録メディア２から読み出すデータはセクタ単位で得られるため、再生に必要な多重符号化データの先頭がセクタの途中に記録されている場合には、最初に不要なデータが存在することになる。そこで、データ処理部１の後段のデコーダでは、図４に示した第１のヘッダ検出部７，ビデオヘッダ検出部８，オーディオヘッダ検出部９と同一の回路を有し、ヘッダを検出し、ヘッダ以前に得られる不要なデータを取り除く必要がある。
【００５８】
しかしながら、本実施例のように、データ処理部１に図４で示した回路を備えている場合は、データ処理部１でヘッダ以前に得られる不要なデータを取り除くことができるので、後段のデコーダにヘッダを検出する検出回路を備える必要はない。
【００５９】
このように、本実施例では、キーフレームに対応するビデオ符号化データ，オーディオ符号化データ，多重符号化データの管理に、キーフレームに対応したそれぞれの符号化データの先頭もしくは後尾が記録されるセクタ番号を用いるため、キーフレームへのアクセスが効率良く行える。すなわち、頭出し，高速再生，ポインタ編集を行う際に必要となるキーフレームに対応した記録位置へのアクセスが、前記管理データを利用することにより可能となる。
【００６０】
この際、ビデオとオーディオを共に出力するのであれば、前記管理データのうち多重符号化データに関するものだけを使用すればよく、ビデオ符号化データ及びオーディオ符号化データに関する管理データは不要である。一方、ビデオ符号化データのみ、もしくはオーディオ符号化データのみにアクセスする場合は、管理データとしてビデオ符号化データ及びオーディオ符号化データに関するものを用いることになる。
【００６１】
次に、本発明の別の実施例について説明する。本実施例のブロック図は、図１に示したブロック図を同一であるが、符号化データ管理部４の動作が異なる。本実施例では、ランダムアクセスを伴う再生のうち、頭出し，ポインタ編集の再生の場合は、多重符号化データの記録メディア上での記録位置情報を用いて多重符号化データを読み出し、高速再生の場合は、ビデオ符号化データの記録メディア上での記録位置情報を用いてビデオ符号化データを読み出す。
【００６２】
従って、頭出し，ポインタ編集の再生の場合は、符号化データ管理部４から記録メディアコントローラ３にキーパックの先頭位置を示すセクタ番号を入力し、高速再生の場合は、キーフレームのビデオ符号化データの先頭位置と後尾位置を示すセクタ番号を入力する。
【００６３】
これは、頭出し，ポインタ編集の再生の場合は、オーディオ出力とビデオ出力を同期して再生する必要があり、高速再生の場合は、ビデオ出力だけのためである。本実施例によると、高速再生時には、不要なオーディオ符号化データを読み出すことがなくなり、単位時間当たりに再生できる画像の枚数がより多くなる。
【００６４】
前述した実施例においては、管理データに多重符号化データの記録メディア上での位置情報として、キーパックの先頭位置が記録されるセクタ番号を、ビデオ符号化データの記録メディア上での位置情報として、キーフレームのビデオ符号化データの先頭位置と後尾位置を示すセクタ番号を用いるものである。
【００６５】
さらに、他の実施例として、管理データにキーパックの先頭位置とキーフレームのビデオ符号化データの後尾位置を示すセクタ番号を用い、キーフレームのビデオ符号化データの先頭位置を示すセクタ番号を用いない実施例も考えられる。この場合、オーディオ出力とビデオ出力を同期再生する場合も、ビデオ出力だけの場合も、ランダムアクセス時にキーパックの先頭位置にアクセスする。
【００６６】
従って、ビデオ出力だけ必要とする場合は、不要なオーディオ符号化データを読み出す場合があり、例えば、高速再生の場合には、単位時間当たりに再生できるビデオフレーム数が少なくなるが、管理データの容量は少なくすることができる。
【００６７】
また、再生時にビデオデコーダからビデオフレームの終了を示すフラグを入力するようにしておけば、キーフレームのビデオ符号化データの後尾位置を記録する必要がなくなり、管理データをさらに少なくすることができる。この場合、管理データとして用いるのは、キーパックの先頭位置を示すセクタ番号だけになる。
【００６８】
次に、本発明の他の実施例について説明する。図５は、図１の符号化データ管理部のブロック図で、図中、１１はメモリコントローラ、１２〜１４はメモリである。
【００６９】
符号化データ管理部４は、図５に示すように、管理データを格納する３つのメモリ１２，１３，１４と、それらを制御するメモリコントローラ１１とで構成される。メモリ１２，１３，１４には、それぞれビデオシーケンスを管理する第１のテーブル（以下、シーケンステーブルと呼ぶ）、記録メディア上での記録領域を管理する第２のテーブル（以下、領域テーブルと呼ぶ）、キーフレームに対応するビデオ符号化データもしくはオーディオ符号化データもしくは多重符号化データを管理する第３のテーブル（以下、キーフレームテーブルと呼ぶ）が記録される。
【００７０】
図６は、図５に示す各メモリに記録される管理データの内容の一例を示す図で、図５中、２１は次のシーケンスアドレス、２２は領域テーブルアドレス、２３は次の領域のアドレス、２４は開始セクタ番号、２５は終了セクタ番号、２６は開始フレームアドレス、２７は次のフレームアドレス、２８はパック開始セクタ番号、２９はキー開始セクタ番号、３０はキー終了セクタ番号であり、アドレス２１，２２，２３，２６，２７はポインタ（連結情報）で、セクタ番号２４，２５，２８，２９，３０は記録位置情報である。
【００７１】
メモリ１２のシーケンステーブルは、１つのビデオシーケンスに対して１ワードのデータがある。１ワードデータは、次のワードを示すポインタ「次のシーケンスアドレス」２１及び領域テーブルの位置を示すポインタ「領域テーブルアドレス」２２からなる。
【００７２】
「次のシーケンスアドレス」２１は、ビデオシーケンスの再生順序を示し、「領域テーブルアドレス」２２はビデオシーケンスに対応する領域テーブルのアドレスを示す。ここで、「次のシーケンスアドレス」２１は、メモリ１２のアドレスを示すが、ポインタ「次のシーケンスアドレス」２１でつながれた最後のデータは‘終了’を示す特別な値を有する。
【００７３】
メモリ１３の領域テーブルには、記録メディア上の連続した記録領域毎に１ワードのデータがある。１ワードデータは、次のワードを示すポインタ「次の領域のアドレス」２３、連続した記録領域の最初のセクタ番号である「開始セクタ番号」２４、連続した記録領域の最後のセクタ番号である「終了セクタ番号」２５、連続した記録領域に含まれるキーフレームに対応するキーフレームテーブルの開始アドレスを示すポインタ「開始フレームアドレス」２６の４つからなる。
【００７４】
シーケンステーブルの１ワードデータに対応する領域テーブルは、シーケンステーブルの「領域テーブルアドレス」２２で示された領域テーブルアドレスから、ポインタ「次の領域のアドレス」２３でつながった領域テーブルの一連となる。
【００７５】
「次の領域のアドレス」２３は、メモリ１３のアドレスを示すが、ポインタ「次の領域のアドレス」２３でつながれた最後のデータは‘終了’を示す特別な値を持つ。また、領域テーブルの１ワードデータで示された開始セクタ番号から終了セクタ番号の領域にキーフレームが記録されない場合は、「開始フレームアドレス」２６は‘データなし’を示す特別な値を有する。
【００７６】
メモリ１４のキーフレームテーブルには、１つのキーフレーム毎に１ワードのデータがある。１ワードのデータは、次のワードを示すポインタ「次のフレームアドレス」２７，キーパックの先頭が記録されているセクタ番号を示す「パック開始セクタ番号」２８，キーフレームのビデオ符号化データの先頭が記録されているセクタ番号を示す「キー開始セクタ番号」２９，後端が記録されているセクタ番号を示す「キー終了セクタ番号」３０である。
【００７７】
「次のフレームアドレス」２７は、メモリ１４のアドレスを示すが、ポインタ「次のフレームアドレス」２７でつながれた最後のデータは‘終了’を示す特別な値を有する。
【００７８】
図６の例では、図５のメモリ１４のキーフレームテーブルに記録されているデータがＩフレームの符号化データを示すのか、Ｐフレームの符号化データを示すのかを示す情報はない。そこで、後述するように、キーフレームテーブルの１ワードデータ毎にＩフレームのビデオ符号化データを示すのか、Ｐフレームのビデオ符号化データを示すのかを示すフラグを付加する、あるいはキーフレームテーブルをＩフレームのテーブルとＰフレームのテーブルに分離するという手法を用いる。
【００７９】
図７は、図５に示すメモリの各テーブルに記録される管理データと記録領域との関係を示す図である。記録メディア２上には、複数のビデオシーケンスを記録することができ、１つのビデオシーケンスが記録メディア２上で連続した記録領域に記録されることもあれば、複数の記録領域に分割して記録されることもある。
【００８０】
図７の例では、３つのビデオシーケンスが記録されており、ビデオシーケンス１は、３つの領域に分割して記録されている。再生順序はビデオシーケンス１,２,３の順であるとする。
【００８１】
図７の例の場合、シーケンステーブルにはビデオシーケンス１,２,３それぞれに対応した３つの１ワードデータがあることになる。最初の１ワードデータは、ビデオシーケンス１に対応するデータであり、ビデオシーケンス１の「次のシーケンスアドレス」がビデオシーケンス２に対応する１ワードデータを示し、ビデオシーケンス２の「次のシーケンスアドレス」がビデオシーケンス３に対応する１ワードデータを示すことになる。そして、シーケンステーブルの「領域テーブルアドレス」がビデオシーケンスに対応する最初の領域テーブルデータのアドレスを示す。
【００８２】
領域テーブルは、記録メディア２上で連続した記録領域に１ワードデータが対応する。すなわち、図７の場合、ビデオシーケンス１に対応する領域テーブルの１ワードデータは３つあることになる。このように、複数の領域テーブルがある場合は、領域テーブルの「次の領域のアドレス」が次の領域テーブルデータを示す。
【００８３】
領域テーブルの１ワードデータのうち、「開始セクタ番号」は連続する記録領域の最初のセクタ番号を示し、「終了セクタ番号」は最後のセクタ番号を示す。この連続した記録領域には、複数のキーフレームの符号化データが記録される。このうち、最初のキーフレームの符号化データが記録されるセクタ番号は、「開始フレームアドレス」によって示されるキーフレームテーブルにより管理される。
【００８４】
キーフレームテーブルの１ワードデータのうち、「パック開始セクタ番号」がキーパックの先頭が記録されるセクタ番号を示し、「キー開始セクタ番号」がキーフレームのビデオ符号化データの先頭が記録されるセクタ番号を示し、「キー終了セクタ番号」がビデオ符号化データの後尾が記録されるセクタ番号を示す。そして、「次のフレームアドレス」が次のキーフレームテーブルデータを示す。
【００８５】
図８は、同一のキーフレームの符号化データが複数の領域テーブルで管理された領域にまたがる特別な場合の例を示す図である。図８の例では、キーフレームｉの符号化データが記録領域１，記録領域２，記録領域３の３つの記録領域にまたがって記録されている。
【００８６】
図８の場合、記録領域１に対応する最後のキーフレームテーブルの「キー開始セクタ番号」には、キーフレームｉのビデオ符号化データの先頭を示すセクタ番号が、「キー終了セクタ番号」には後尾を示すセクタ番号が記録されており、「キー終了セクタ番号」は記録領域３のセクタ番号である。
【００８７】
また、記録領域２に対応するキーフレームテーブルはないので、記録領域２の「開始フレームアドレス」には‘データなし’を示すデータを記録する。そして、記録領域３の「開始フレームアドレス」は、キーフレームｉ＋１のキーフレームテーブルを示す。
【００８８】
高速再生時には、キーフレームテーブルに記録されている「キー開始セクタ番号」と「キー終了セクタ番号」が記録メディアコントローラ３に入力され、該記録メディアコントローラ３は「キー開始セクタ番号」から「キー終了セクタ番号」までのデータを読み出すように、記録メディア２を制御する。
【００８９】
このとき、図８で示したキーフレームｉを再生する場合は、前述したように、キーフレームテーブルのキーフレームｉに対応する「キー開始セクタ番号」と「キー終了セクタ番号」が異なる記録領域を示すことになる。しかし、このような場合でも、領域テーブルに記録された情報によって記録領域１，記録領域２，記録領域３の順に再生することは知ることができる。
【００９０】
そこで、記録メディアコントローラ３は記録領域１，記録領域２，記録領域３の順に記録メディア２にアクセスし、「キー終了セクタ番号」までのデータを読み出す。
【００９１】
なお、ビデオシーケンスを複数の領域に分割して記録せず連続した領域のみに記録する装置で、かつ再生時も編集を行わず、順序を変更しない場合には、例えば、図６において、領域テーブルの「次の領域のアドレス」２３がビデオシーケンスの再生順序を示すことにすれば、シーケンステーブルは不要になる。
【００９２】
次に、図５に示すメモリコントローラ及びメモリの動作についてさらに詳細に説明する。装置に記録メディア２が挿入された場合は、該記録メディア２に記録されている管理データが順次入力されるので、メモリコントローラ１１は、管理データを所定のメモリに記録するようにメモリ１２，１３，１４を制御する。
【００９３】
図９は、記録メディアに符号化データを記録する場合のメモリの動作を説明するためのフローチャートである。ここで、各テーブルは図６の構造に基づくものであるとし、管理データの変化の様子を図１０に図示する。
【００９４】
図６のメモリ１２のシーケンステーブルでは、既に記録されているデータの後端に１ワードのデータを追加し、追加されたシーケンステーブルのデータに対応するメモリ１３の領域テーブル、メモリ１４のキーフレームテーブルのデータを追加記録する。図１０では、実線部分が既に格納されている管理データ、点線部分が追加記録される管理データである。
【００９５】
なお、以下の説明では、ａｄ１２，ａｄｌ１２，はメモリ１２のアドレスを、ａｄ１３，ａｄｎ１３はメモリ１３のアドレスを、ａｄ１４，ａｄｎ１４はメモリ１４のアドレスを各々示す。また、記録に際しては、予め図１の記録メディアコントローラ３が管理データに基づき、記録メディア２の空き領域を管理し、空き領域の各セクタにデータを記録する制御を行うものとする。
【００９６】
以下、図９のフローチャートに基づき、各ステップ（Ｓ）に従って順に説明する。まず、ステップＳ１０１〜ステップＳ１０４でメモリ１２のシーケンステーブルを設定する。メモリ１２の空き領域とメモリ１３の空き領域を探索し、それぞれａｄ１２，ａｄ１３とする（Ｓ１０１）。そして、ａｄ１２の「領域テーブルアドレス」をａｄ１３、「次のシーケンスアドレス」を‘終了’と設定する（Ｓ１０２）。
【００９７】
次に、メモリ１２のシーケンステーブルのポインタ「次のシーケンスアドレス」でつながったデータのうち、最後のデータのアドレスをａｄｌ１２とする（Ｓ１０３）。そして、ａｄｌ１２の「次のシーケンスアドレス」をａｄ１２と設定する（Ｓ１０４）。以上の操作でシーケンステーブルの後端に新たな１ワードデータａｄ１２が追加されたことになる。
【００９８】
次に、メモリ１３の領域テーブルを設定する。符号化データ管理部２には、記録メディア２上での連続した記録領域の最初のセクタ番号が入力されるので、この入力データをａｄ１３の「開始セクタ番号」とする（Ｓ１０５）。そして、メモリ１４のキーフレームテーブルの空き領域を探索し、ａｄ１４とする（Ｓ１０６）。そして、ａｄ１３の「開始フレームアドレス」をａｄ１４と設定する（Ｓ１０７）。
【００９９】
次に、キーパックの先頭が記録されるセクタ番号、キーフレームのビデオ符号化データの先頭及び後尾を示すセクタ番号の３つのセクタ番号が入力されるので、この３つのセクタ番号をａｄ１４の「パック開始セクタ番号」,「キー開始セクタ番号」,「キー終了セクタ番号」とする（Ｓ１０８）。
【０１００】
そして、記録メディア２の連続した記録領域が終了かどうかを判定し（Ｓ１０９）、終了でない場合はメモリ１４の空き領域を探索し、ａｄｎ１４とする（Ｓ１１０）。そして、ａｄ１４の「次のフレームアドレス」をａｎｄ１４と設定し、ａｄ１４がａｄｎ１４を示すように更新する（Ｓ１１１）。
【０１０１】
次に、ステップＳ１０８に戻る。ステップＳ１０８〜ステップＳ１１１のループで記録メディア２上で連続した記録領域に記録されるキーフレームの符号化データを示すセクタ番号をキーフレームテーブルに書き込むことになる。
【０１０２】
ステップＳ１０９で記録メディア２の連続した記録領域が終了である場合は、入力セクタ番号である記録領域の最後のセクタ番号を領域テーブルａｄ１３の「終了セクタ番号」に記録し（Ｓ１１２）、キーフレームテーブルａｄ１４の「次のフレームアドレス」を‘終了’とする。ステップＳ１０５〜ステップＳ１１３で１ワードの領域テーブルのデータが設定されることになる。
【０１０３】
次に、ステップＳ１１４で入力データが終了かどうかを判定し、終了でない場合は、領域テーブルの空き領域を探索し、ａｄｎ１３とし（Ｓ１１５）、ａｄ１３の「次の領域のアドレス」をａｄｎ１３と設定し、ａｄ１３がａｄｎ１３を示すように更新する（Ｓ１１６）。
【０１０４】
そして、ステップＳ１０５に戻る。ステップＳ１０５〜ステップＳ１１６のループで前の領域テーブルデータからポインタで接続された新たな領域テーブルデータが作成され、その領域テーブルデータが示す一連のキーフレームテーブルにセクタ番号が記録されることになる。そして、ステップＳ１１４でデータが終了と判定された場合は、ａｄ１３の「次の領域のアドレス」を‘終了’と設定し（Ｓ１１７）、動作を終了する。
【０１０５】
次に、通常再生時の動作を説明する。通常再生時には、記録メディアコントローラ３に再生するビデオシーケンスに対応する領域テーブルの「開始セクタ番号」と「終了セクタ番号」が出力される。図１１は、通常再生時の動作をフローチャートで示す図である。
【０１０６】
ここでは、再生中のデータに対応するシーケンステーブル，領域テーブルそれぞれのアドレスを示す「シーケンスアドレス」，「領域アドレス」という２つのポインタを用いる。また、各テーブルは図６の構造に基づくものである。
【０１０７】
以下、各ステップ（Ｓ）に従って順に説明する。まず、「シーケンスアドレス」，「領域アドレス」の設定をする（Ｓ１２１〜Ｓ１２２）。次に、「領域アドレス」の「開始セクタ番号」の値と「終了セクタ番号」の値を記録メディアコントローラ３に出力する（Ｓ１２３，Ｓ１２４）。
【０１０８】
記録メディアコントローラ３では、符号化データ管理部４から入力される連続した記録領域の最初のセクタ番号と最後のセクタ番号を用いて、連続した記録領域に記録されているデータを読み出すように記録メディア２を制御する。
【０１０９】
そして、「領域アドレス」の「次の領域のアドレス」が‘終了’かどうかを判定する（Ｓ１２５）。‘終了’でないと判定された場合は、「シーケンスアドレス」に対応する領域テーブルデータが残っているので、「領域アドレス」を「領域アドレス」の「次の領域のアドレス」とし、ポインタを進め（Ｓ１２６）、ステップＳ１２３に戻る。
【０１１０】
ステップＳ１２３〜ステップＳ１２６のループでシーケンステーブルの１ワードデータに対応する記録メディア２上での複数の連続した記録領域毎に、記録領域の最初のセクタ番号と最後のセクタ番号が順次出力される。
【０１１１】
ステップ１２５で‘終了’と判定された場合は、「シーケンスアドレス」の「次のシーケンスアドレス」が‘終了’かどうかを判定する（Ｓ１２７）。‘終了’でないと判定された場合は、記録メディア２に記録されているビデオシーケンスが残っているので、「シーケンスアドレス」を「シーケンスアドレス」の「次のシーケンスアドレス」とし、ポインタを進め（Ｓ１２８）、ステップＳ１２２に戻る。
【０１１２】
ステップＳ１２２〜ステップＳ１２８のループで記録メディア２に記録されているビデオシーケンスが終了するまで連続した記録領域毎に最初のセクタ番号と最後のセクタ番号が順次出力される。
【０１１３】
図１２は、高速再生の場合の動作を説明するためのフローチャートである。高速再生時には、キーフレームテーブルに記録されているキーフレームのビデオ符号化データの先頭と後尾が記録されているセクタ番号を読み出し、記録メディアコントローラ３に出力する。
【０１１４】
ここでは、再生中のデータに対応するシーケンステーブル，領域テーブル，キーフレームテーブルそれぞれのアドレスを示す「シーケンスアドレス」，「領域アドレス」，「キーフレームアドレス」という３つのポインタを用いる。
【０１１５】
以下、各ステップ（Ｓ）に従って順に説明する。まず、「シーケンスアドレス」，「領域アドレス」，「キーフレームアドレス」の設定をする（Ｓ１３１〜Ｓ１３３）。次に、「キーフレームアドレス」の「キー開始セクタ番号」と「キー終了セクタ番号」を記録メディアコントローラ３に出力する（Ｓ１３４）。
【０１１６】
記録メディアコントローラ３では、符号化データ管理部４から入力される最初のセクタ番号と最後のセクタ番号から、キーフレームのビデオ符号化データを読み出すように、記録メディア２を制御する。
【０１１７】
そして、「キーフレームアドレス」の「次のキーフレームアドレス」が‘終了’かどうかを判定する（Ｓ１３５）。‘終了’でない場合は、「次のキーフレームテーブル」に次にアクセスすべきセクタ番号が記録されているので、「キーフレームアドレス」を「キーフレームアドレス」の「次のキーフレームアドレス」とし、ポインタを進め（Ｓ１３６）、ステップＳ１３４に戻る。
【０１１８】
ステップＳ１３４〜ステップＳ１３６のループで領域テーブルの１ワードデータに対応するキーフレームテーブルに記録されている複数のセクタ番号が記録メディアコントローラ３に出力される。
【０１１９】
ステップＳ１３５で‘終了’と判定された場合は、「領域アドレス」の「次の領域アドレス」が‘終了’かどうかを判定する（Ｓ１３７）。‘終了’でないと判定された場合は、「シーケンスアドレス」に対応する領域テーブルデータが残っているので、「領域アドレス」を「領域アドレス」の「次の領域アドレス」とし、ポインタを進め（Ｓ１３８）、ステップＳ１３３に戻る。
【０１２０】
ステップＳ１３３〜ステップＳ１３８のループでシーケンステーブルの１ワードデータに対応するキーフレームのビデオ符号化データに対するセクタ番号が順次出力される。
【０１２１】
ステップＳ１３７で‘終了’と判定された場合は、「シーケンスアドレス」の「次のシーケンスアドレス」が‘終了’かどうかを判定する（Ｓ１３９）。‘終了’でないと判定された場合は、記録メディア２に記録されているビデオシーケンスが残っているので、「シーケンスアドレス」を「シーケンスアドレス」の「次のシーケンスアドレス」とし、ポインタを進め（Ｓ１４０）、ステップＳ１３２に戻る。
【０１２２】
ステップＳ１３２〜ステップＳ１４０のループで記録メディア２に記録されているビデオシーケンスが終了するまで、キーフレームのビデオ符号化データに対応するセクタ番号が順次出力される。
【０１２３】
図１２の例では、キーフレームテーブルに記録されているセクタ番号を全て出力する。すなわち、全てのキーフレームを再生することになるが、高速再生の速度が速い場合は、いくつかのフレームを間引いた出力してもよい。
【０１２４】
ここで、Ｐフレームを復号するためには、過去のＩフレーム又はＰフレームの復号画像が必要になる。従って、ＧＯＰ内のＩフレームやＰフレームを間引いた場合、そのＧＯＰ内で間引いたフレーム以降のＰフレームは復号できないことになる。
【０１２５】
例えば、図１３に示すようなＧＯＰ構造の場合、フレーム（ｃ）を間引いた場合、フレーム（ｄ）は参照するフレーム（ｃ）の復号画像が得られないため、復号でないことになる。同様に、フレーム（ｂ）を間引いた場合は、フレーム（ｃ）,（ｄ）は復号できず、フレーム（ａ）を間引いた場合、フレーム（ｂ）,（ｃ）,（ｄ）は復号できないことになる。このように、間引くフレームによって復号できなくなるフレーム数が異なるため、高速再生の速度からどのフレームを間引くかを計算することになる。
【０１２６】
図１４は、図１の符号化データ管理部に記録される管理データのその他の例を示す図で、図中の参照番号は図６と同じである。これは、デコーダからキーフレームの終了を示す信号が入力される場合の例で、図６に示した符号化データ管理部４に記録される管理データの一例から、キーフレームテーブルの要素「キー終了セクタ番号」を除いたものである。従って、本実施例の方が符号化データ管理部４の容量はより小さくなる。
【０１２７】
この例の場合の動作は、図６の例に比べ、高速再生の場合だけ異なる。すなわち、図６の例では、高速再生時にキーフレームテーブルの「キー終了セクタ番号」を出力し、記録メディアコントローラ３はキーフレームの終了をこのデータで検知できるが、この例においては、キーフレームテーブルからキーフレームの終了を示すデータは出力せず、記録メディアコントローラ３はデコーダからの入力信号でキーフレームの終了を検知する。通常再生やポインタ編集の再生の場合は、キーフレームの終了位置は用いる必要がなく、図６の例と同じ動作となる。
【０１２８】
図１５は、管理データのその他の例を示す図で、図１５中の参照番号は図１４と同じである。これは、デコーダからキーフレームの終了を示す信号が入力され、オーディオ出力とビデオ出力を同期再生する場合も、ビデオ出力だけの場合もランダムアクセス時にキーパックの先頭位置にアクセスするものである。
【０１２９】
前述したように、この例の場合は、ビデオ出力だけ必要とする場合は、不要なオーディオ符号化データを読み出す場合があるので、例えば、高速再生の場合には、単位時間当たりに再生できるビデオフレーム数が少なくなるが、管理データの容量はより少なくすることができる。
【０１３０】
頭出しや高速再生，ポインタ編集の再生では、管理しているキーフレームのうち、どのフレームがＩフレームかを知る必要がある。キーフレームテーブルでＩフレームだけを管理する場合は、図６，図１４，図１５の例でどのキーフレームがＩフレームであるかを知ることができるが、キーフレームテーブルでＩフレームとＰフレームを管理する場合は、図６，図１４，図１５の例では、キーフレームがＩフレームであるかＰフレームであるかを知ることは困難である。
【０１３１】
そこで、例えば、図１６に示すように、キーフレームテーブルの１ワードデータ毎にＩフレームのデータかＰフレームのデータであるかを示す１ビットのフラグ３１を付加すれば、キーフレームがＩフレームを示すのかＰフレームを示すのかがわかる。
【０１３２】
図１６は、図６で示したキーフレームテーブルに１ビットのフラグ３１を付加したものであるが、図１４，図１５のキーフレームテーブルについても同様に１ビットのフラグを付加することで、ＩフレームとＰフレームを区別できる。
【０１３３】
次に、本発明のさらに別の実施例について説明する。図１７は、図１の符号化データ管理部の他の例を示すブロック図で、図中、３２はメモリコントローラ、３３，３４はメモリで、その他、図５と同じ作用をする部分は同一の符号を付してある。符号化データ管理部４は４つのメモリ１２，１３，３３，３４とそれらを制御するメモリコントローラ３２で構成される。
【０１３４】
メモリ１２，１３は、図５のそれと同一で、メモリ３３にはキーフレームのうち、Ｉフレームの記録メディア上での記録位置情報を管理する第４のテーブル（以下、Ｉフレーム用キーフレームテーブルと呼ぶ）、メモリ３４にはＰフレームの記録メディア上での記録位置情報を管理する第５のテーブル（以下、Ｐフレーム用キーフレームテーブルと呼ぶ）が格納される。
【０１３５】
図１８は、Ｉフレーム用キーフレームテーブルとＰフレーム用キーフレームテーブルに記録するデータの一例を示す図で、図中、４１はＩの次のフレームアドレス、４２はパック開始セクタ番号、４３はＩのキー開始セクタ番号、４４はＩのキー終了セクタ番号、４５はＰのフレームアドレス、４６はＰの次のフレームアドレス、４７はＰのキー開始セクタ番号、４８はＰのキー終了セクタ番号であり、アドレス４１，４５，４６はポインタ（連結情報）で、セクタ番号４２，４３，４４，４７，４８は記録位置情報である。
【０１３６】
各テーブルの構成は、図６のキーフレームテーブルと同様であり、Ｉフレーム用キーフレームテーブルは、Ｐフレーム用キーフレームテーブルのアドレスを示すポインタを備えている点だけが異なる。Ｐフレーム用キーフレームテーブルはパック開始セクタ番号がない点だけが異なる。Ｐフレーム用キーフレームテーブルは高速再生の場合だけ用いることを前提としているため、キーパックの位置情報は必要ない。
【０１３７】
図１８の例では、１枚のＩフレームに対応するキーフレームテーブルが、Ｉフレーム用キーフレームテーブルの１ワードデータとそのデータの「Ｐのフレームアドレス」４５で示されたＰフレーム用キーフレームテーブルのデータから「Ｐの次のフレームアドレス」４６でつながった一連のＰフレーム用キーフレームテーブルのワードデータとなっている。
【０１３８】
本発明を用いた動画像記録再生装置では、ビデオシーケンスやその一部の再生順序を変更する場合や消去する場合は、メモリ１２のシーケンステーブルやメモリ１３の領域テーブル，メモリ１４のキーフレームテーブルの変更だけで済み、記録メディアに記録されている符号化データを変更する必要はない。例えば、１つのビデオシーケンスを消去する場合は、消去するビデオシーケンスに対応するシーケンステーブルの１ワードデータを「次のシーケンスアドレス」でつながっているポインタからはずすだけでよい。
【０１３９】
ビデオシーケンスの再生順序を変更する場合も、シーケンステーブルの「次のシーケンスアドレス」でつながっているポインタの順序を変更するだけでよい。また、シーケンス内の一部分を残して、それ以外を消去する場合は、消去する部分に対応する領域テーブルのデータを「次の領域のテーブルアドレス」でつながったポインタからはずし、残す部分に対応した「開始セクタ番号」と「終了セクタ番号」を新たに設定するだけでよい。
【０１４０】
また、ポインタ編集の場合には、もとの管理データから編集後の新たな再生順序を示す管理データを再構築し、もとの管理データと別に格納しておけば、もとの再生順序を保存したまま、様々な再生が可能である。
【０１４１】
なお、各テーブルに新たなデータを書き込むためには、各テーブルで未使用のアドレスを管理する必要がある。未使用のアドレスを管理する方法は、テーブル毎に空きアドレスを管理するテーブルを備える、各テーブルのポインタをたどり使用のアドレスを調べることで、未使用のアドレスを知るといった方法が考えられるが、図１９のように、未使用のアドレスを管理することもできる。
【０１４２】
次に、本発明の他の実施例について説明する。図１９は、各テーブルの１ワードデータ毎に使用か未使用かを示す１ビットのフラグを付加した図で、図１９中、５１〜５３は使用か未使用かを示すフラグで、その他、図６と同じ作用をする部分は同一の符号を付してある。
【０１４３】
図１９の実施例は、図６で示した構成例の各ワードに使用か未使用かを示すフラグ５１〜５３を付加したものであるが、図１４，図１５，図１６，図１８で示した例にも同様に適用できる。
【０１４４】
前述の実施例は、符号化データ記録時に管理データを構築し、その管理データを用いてランダムアクセスを伴う再生を高速に行うものである。従って、符号化データ記録時に管理データが記録されていないメディアの場合は、管理データを構築する必要がある。
【０１４５】
次に、図１の実施例において、管理データを構築する手段について、以下に説明する。まず、記録メディアコントローラ３は、記録メディア２に記録されているデータを最初から最後まで読み出すように記録メディア制御データを出力する。
【０１４６】
データ処理部１では、ビデオシーケンス，パック，キーフレームそれぞれの開始と終了を検出する検出回路を設けておき、ビデオシーケンスの先頭，後尾が記録されているセクタ番号，キーパックの先頭が記録されているセクタ番号，キーフレームのビデオ符号化データの先頭と後尾が記録されているセクタ番号を符号化データ管理部４に出力する。該符号化データ管理部４では、符号化データ記録時と同様に、順序に管理データを追加記録する。
【０１４７】
なお、前述の実施例では、記録メディア上での記録位置情報としてセクタ番号を用いているが、これに限定するものではない。記録メディアによっては、トラック番号とセクタ番号の組など、別の表記の場合もあるが、その場合は記録メディアに応じた表記を用いればよい。
【０１４８】
以上の説明から明らかなように、本発明によると、以下のような効果がある。キーフレームに対応するビデオ符号化データ，オーディオ符号化データ，多重符号化データの記録メディア上での記録位置情報を管理データとして格納するため、ランダムアクセスを伴う再生の場合に、アクセスするキーフレームの記録メディア上での位置がわかる。従って、ランダムアクセス時に不要なデータを読み出すことがなく、高速なランダムアクセスが可能になる。
【０１４９】
管理データとして、キーフレームに対応するビデオ符号化データ及び多重符号化データそれぞれの先頭位置や後尾位置を利用することにより、ランダムアクセス時に不要なデータを読み出すことがなく、高速な頭出しが可能になる。すなわち、高速再生時には、単位時間当たりに再生できるビデオフレームが多くなり、ポインタ編集の再生時には、シーンのつなぎ位置へのアクセスを効率良く行うことができる。
【０１５０】
ビデオ出力とオーディオ出力を同期して再生する場合は、同期再生に必要な多重符号化データにアクセスし、ビデオ出力だけの再生の場合には、ビデオ符号化データのみにアクセスするようにすれば、高速再生などビデオ出力だけが要求される際に、単位時間当たりより多くの画像を再生することができる。
【０１５１】
管理データとして、第１階層がビデオシーケンス、第２階層が記録メディア上での記録領域、第３階層が記録領域に記録されるキーフレーム位置とする階層構造を用いれば、新たにビデオシーケンスを記録，消去する場合、あるいは再生順序を入れ換える場合でも、容易に管理データを更新することができる。
【０１５２】
前記記録領域に記録されるキーフレームに対応する管理データを、ＩフレームとＩフレームに続くＰフレームの階層構造で記録すれば、ＩフレームとＰフレームを分離して管理することができる。そのため、例えば、高速再生時には、Ｉフレーム及びＰフレームの種別及び記録メディア上での記録位置がわかるので、高速再生の速度に応じて使用するフレームを柔軟に選択することができる。
【０１５３】
また、管理データの各階層内および各階層間の連結情報として、各テーブル内の再生順序を示すポインタ及び各テーブル間の連結関係を示すポインタを用いるので、これらのポインタを変更することで、ビデオシーケンスの記録，消去，再生順序の変更，ビデオシーケンス内の一部の消去，再生順序の変更等が容易に行える。
【０１５４】
管理データの各ワード毎に、そのワードが使用か未使用かを示す１ビットのフラグを付加すれば、管理データを記録するメモリが使われているのか空いているのかがわかるので、メモリの空き領域管理が容易になる。
【０１５５】
管理データが記録されていない場合でも、管理データを新たに構築すれば、管理データが記録されていない記録メディアの場合でも、ランダムアクセスを伴う再生を高速に行うことができる。
【０１５６】
以上のように、ＭＰＥＧ方式など動画像符号化にフレーム間予測を用いるために、ランダムアクセスが特定のキーフレームに対してしか許されない場合に、キーフレームに対するビデオ符号化データやオーディオ符号化データ及びその多重符号化データの記録位置を管理しているため、所望のキーフレームに直接アクセスすることが可能となる。このため、高速再生やポインタ編集を容易かつ効率良く行えることになり、また、オーディオを伴った動画を扱えるため、その効果は顕著である。
【０１５７】
【発明の効果】
本発明によれば、ビデオ符号化データ及びオーディオ符号化データを含む多重化データを記録メディアに記録する際、多重化データ上でキーとなるフレームの位置を示す位置情報を管理データとしてメディアに記録し、再生時には管理データを用いてキーとなるフレームに正確にアクセスすることで、頭出し，高速再生，編集等の機能を実現することができる。
【図面の簡単な説明】
【図１】本発明による動画像記録再生装置の一実施例を説明するためのブロック図である。
【図２】図１における符号化データ管理部から入力されるセクタ番号の記録メディア上での位置を説明するための図である。
【図３】本発明におけるキーパックの先頭を示す図である。
【図４】図１におけるデータ処理部のヘッダ検出回路を示す図である。
【図５】図１における符号化データ管理部のブロック図である。
【図６】図５におけるメモリに記録される管理データの内容の一例を示す図である。
【図７】本発明における符号化データ管理部に記録する管理データと記録メディア上での記録領域との関係を示す図である。
【図８】本発明における符号化データ管理部に記録する管理データと記録メディア上での記録領域との関係を示すその他の図である。
【図９】本発明による動画像記録再生装置の符号化データ記録時の符号化データ管理部の動作を説明するためのフローチャートである。
【図１０】本発明における符号化データ記録時の符号化データ管理部の更新を示す図である。
【図１１】本発明における通常再生時の符号化データ管理部の動作を示すフローチャートである。
【図１２】本発明における高速再生時の符号化データ管理部の動作を示すフローチャートである。
【図１３】本発明におけるＧＯＰ内で間引くフレームと復号できなくなるフレームを説明するための図である。
【図１４】本発明における符号化データ管理部に記録する管理データのその他の例を示す図である。
【図１５】本発明における符号化データ管理部に記録する管理データのその他の例を示す図である。
【図１６】本発明におけるキーフレームテーブルのその他の構成例を示す図である。
【図１７】図１における符号化データ管理部の他の実施例を示すブロック図である。
【図１８】本発明におけるＩフレーム用キーフレームテーブルとＰフレーム用キーフレームテーブルに記録するデータの一例を示す図である。
【図１９】本発明における各テーブルの１ワードデータ毎に使用か未使用かを示す１ビットのフラグを付加した図である。
【図２０】従来のＧＯＰの構成例を示す図である。
【図２１】従来の多重符号化データの一例を示す図である。
【符号の説明】
１…データ処理部
２…記録メディア
３…記録メディアコントローラ
４…符号化データ管理部
５…メモリコントローラ
５ａ…ビデオ記録情報管理部
５ｂ…オーディオ記録情報管理部
５ｃ…多重化記録情報管理部
６…メモリ
７…第１のヘッダ検出部
８…ビデオヘッダ検出部
９…オーディオヘッダ検出部
１０…判定回路
１１…メモリコントローラ
１２，１３，１４…メモリ
３２メモリコントローラ
３３，３４…メモリ
２１…次のシーケンスアドレス
２２…領域テーブルアドレス
２３…次の領域のアドレス
２４…開始セクタ番号
２５…終了セクタ番号
２６…開始フレームアドレス
２７…次のフレームアドレス
２８…パック開始セクタ番号
２９…キー開始セクタ番号
３０…キー終了セクタ番号
３１…ＩフレームかＰフレームかを示すフラグ
４１…Ｉの次のフレームアドレス
４２…パック開始セクタ番号
４３…Ｉのキー開始セクタ番号
４４…Ｉのキー終了セクタ番号
４５…Ｐのフレームアドレス
４６…Ｐの次のフレームアドレス
４７…Ｐのキー開始セクタ番号
４８…Ｐのキー終了セクタ番号
５１〜５３…使用か未使用かを示すフラグ[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a digital video using a read-only recording medium such as a CD-ROM. Oh again Digital video using rewritable recording media such as live devices and magneto-optical disks Oh again Video playback device suitable for application to a live device In place It is related.
[0002]
[Prior art]
A disk medium such as a magnetic disk, an optical disk, or a magneto-optical disk can be accessed at high speed. If high-speed random access is used, data recorded in a discontinuous area can be reproduced as continuous data.
[0003]
On the other hand, since moving image data has a very high data rate, it is difficult to record it on the disk medium without processing the data. However, high-efficiency encoding of moving image data and audio data can reduce the data rate to a rate that can be recorded on a disk medium without visual or auditory degradation.
[0004]
As an example of the high-efficiency encoding method, there is a moving picture experts group (MPEG) method. MPEG is being standardized by ISO-IEC / JTC1 / SC29 / WG11, and relates to high-efficiency encoding of video and audio and a multiplexing system for synchronous playback of them. In the MPEG video encoding method, a unit such as 12 video frames or 15 video frames is called GOP (Group Of Pictures), and predictive encoding is performed in the GOP.
[0005]
FIG. 20 is a diagram illustrating an example of a GOP structure. Video frames in the GOP are classified into I frames, P frames, and B frames. The I frame is intra-frame encoded. The P frame is subjected to predictive encoding in the forward direction from the I frame, or is subjected to predictive encoding in the forward direction from the P frame. The B frame is between the I frame and the P frame or between the P frames, and is predictively encoded from both directions of the I frame and the P frame.
[0006]
In this way, since P frames and B frames are predictively encoded from I frames or video frames that are predictively encoded from I frames, all video frames in the GOP can be decoded without decoding the I frames first. Disappear. That is, when playing back from the middle of a video sequence by random access or the like, it is necessary to decode from an I frame. In addition, since the P frame uses only prediction from the forward direction, it can be decoded relatively easily.
[0007]
In the following description, the I frame and the P frame are called key frames. Further, the video sequence indicates a temporally continuous video frame and corresponding encoded data from the start of recording to the end of recording.
[0008]
In MPEG video encoding, the average code amount is almost constant, but the code amount is not proportional to time in a short time. For this reason, the interval between the encoded video data of each frame is not constant, and the recording position of the encoded data of each frame is not uniquely determined. For this reason, it is difficult to reproduce non-consecutive video frames as in high-speed reproduction.
[0009]
As a conventional example for realizing high-speed reproduction using encoded data encoded with high efficiency, for example, there is a method described in JP-A-5-1553577. This is intended to realize high-speed playback by increasing the data reading speed of the disk medium during playback and selecting data necessary for high-speed playback from the obtained continuous data.
[0010]
[Problems to be solved by the invention]
However, in the method described in the above publication, the data reading speed is increased without using random access, and data necessary for high-speed reproduction is selected from data obtained more than usual, so that the speed is increased in proportion to the data reading speed. Although the reproduction speed is increased, the data reading speed is limited. That is, the speed of high-speed playback is also limited.
[0011]
In order to realize higher speed reproduction, it is conceivable to read out random data using random access. That is, it is possible to repeat the data reading and the high-speed jump, and to change the high-speed playback speed according to the jump amount.
[0012]
However, when reading high-efficiency encoded data, the following problems occur at the time of jump. That is, when playing back from the middle of the video sequence, it is necessary to decode from the I frame as described above, but since the code amount and the time are not proportional, the encoded data of the I frame is recorded. There is a problem that it is not possible to find the starting point.
[0013]
As described above, since the method described in the above publication cannot randomly access data for a predetermined time, a function using random access of the disc such as cueing and reproduction of pointer editing described later cannot be realized. Cueing is a function of reproducing video output and audio output in synchronization from a specified time, and random access is required to reproduce from the specified time.
[0014]
In the conventional technique, since the position of the I frame is unknown, there is a problem that unnecessary data is read out until the multiplexed encoded data necessary for reproduction is read out, and it takes time to cue. In addition, the cueing position could not be accurately defined, and there was no choice but to access a rough position.
[0015]
High-speed playback of disk media can be realized by playing back only key frames while thinning out video frames as described above. In this case, reading of the video encoded data and high-speed random access are repeated. However, in the conventional technology, the recording position of the video encoded data of the key frame is not known, and the key frame is included when reading the video encoded data. In addition, since a wide range of multiplexed encoded data is read, there is a problem that the number of video frames that can be reproduced per unit time is reduced.
[0016]
Pointer editing is a function that specifies multiple playback start points and end points, and connects them with a logical pointer without copying multiple data that are separated in time. During playback, the video output and audio output are synchronized. Play while.
[0017]
Also in this case, it is necessary to randomly access the next area from the area designated at the time of reproduction. The decoder needs to output reproduction data even when data is not obtained, and output it as continuous reproduction data. For this reason, while the code buffer is provided and the encoded data cannot be obtained, the reproduction data must be output using the encoded data stored in the code buffer.
[0018]
As described above, in the conventional technique, unnecessary data may be read at the time of random access, so there is a problem that a longer time for which a code necessary for decoding cannot be obtained and a larger number of code buffers are required. It was. Since the data read from the disk medium is multiplexed encoded data to which a header for multiplexing is added, it is necessary to read from the header when reproducing video output and audio output in synchronization.
[0019]
FIG. 21 is a diagram showing an outline of multiplexed encoded data recorded by the MPEG method. The multiplex encoded data is divided into units called packs, and a first header is added to each pack. The pack is a collection of data in which a second header is added for each of the divided video encoded data and audio encoded data.
[0020]
The first header includes information for synchronizing video encoded data and audio encoded data, and the second header includes information indicating the type of data following the header. Therefore, in order to play back video and audio synchronously, it is necessary to read from the first header.
[0021]
The present invention has been made in view of such a situation, and even when encoded data whose time is not proportional to the amount of code is recorded, a moving image capable of reading out only necessary data at the time of random access An object of the present invention is to provide a playback device and a moving image recording device.
[0022]
[Means for Solving the Problems]
According to a first aspect of the present application, video or audio is recorded from a recording medium in which multiplex encoded data including video encoded data and audio encoded data is recorded separately from management data corresponding to the multiplex encoded data. The video encoded data includes encoded data of a key frame composed of an intra-frame encoded frame or a forward-predictive encoded frame, The management data includes position information indicating a position of the key frame on the multiplex encoded data, and controls the recording medium to read the management data from the recording medium; Means for holding the management data, and means for enabling the position included in the management data Based on the information, and means for enabling random access , The means for holding the management data has a memory including a first memory area, a second memory area, and a third memory area in which the management data is recorded, and is recorded in the first memory area. Management data indicates the playback order of a sequence of multiplex encoded data, management data recorded in the second memory area indicates a recording area of a recording medium on which multiplex encoded data is recorded, and The management data recorded in the memory area 3 indicates a position on the multiplexed encoded data with respect to the key frame.
According to a second aspect of the present application, the position information included in the management data includes video position information indicating a position of video encoded data of a key frame, and audio position information indicating a position of audio encoded data corresponding to the key frame. , Including at least one of multiplexed position information indicating a position of multiplexed encoded data that needs to be read out in order to reproduce video and audio synchronously from a key frame.
In a third invention of the present application, the video position information includes a head position or a rear end position of video encoded data of a key frame, and the audio position information includes a head position of audio encoded data corresponding to the key frame or Including a rear end position, wherein the multiplexed position information includes a head position or a rear end position of multiplexed encoded data that needs to be read out in order to reproduce video and audio in synchronization from a key frame. .
The fourth invention of the present application is characterized in that the multiplexed position information is used when video and audio are reproduced in synchronization, and the video position information is used when only video is reproduced.
The fifth invention of the present application is: Reads multiplexed encoded data from the recording medium and constructs management data It is characterized by that.
[0025]
DETAILED DESCRIPTION OF THE INVENTION
Embodiments of the present invention will be described below with reference to the drawings.
[0026]
FIG. 1 is a block diagram for explaining an embodiment of a moving image recording / playback apparatus according to the present invention. In FIG. 1, 1 is a data processing unit, 2 is a recording (disc) medium, 3 is a recording media controller, Reference numeral 4 denotes an encoded data management unit, 5 denotes a memory controller, 5a denotes a video recording information management unit, 5b denotes an audio recording information management unit, 5c denotes a multiplexed recording information management unit, and 6 denotes a memory.
[0027]
The data processing unit 1 divides the multiplex encoded data input from the encoder and the data input from the encoded data management unit 4 into sectors and outputs the data so as to be recorded in a predetermined sector of the recording medium 2. The sector number is input from the recording media controller 3.
[0028]
The recording medium controller 3 outputs recording medium control data for controlling reading and writing of the recording medium 2 and outputs a sector number on the recording medium 2 for writing or reading data to the data processing unit 1. The encoded data management unit 4 stores management data corresponding to a video sequence that is recorded in advance on the recording medium or newly recorded on the recording medium.
[0029]
The encoded data management unit 4 includes a memory controller 5 including a video recording information management unit 5a, an audio recording information management unit 5b, and a multiplexed recording information management unit 5c. The memory controller 5 is connected to the memory 6. The management data stored in the memory 6 is controlled.
[0030]
The encoded data management unit 4 also includes video encoded data obtained by encoding video data with high efficiency, audio encoded data obtained by encoding audio data with high efficiency, the video encoded data, and the audio encoded data. Management for storing the multiplexed encoded data in the recording medium with respect to the multiplexed encoded data composed of the additional data for multiplexing with the data. A video frame predictively encoded from the direction is used as a key frame for management, and one or two of the video encoded data, the audio encoded data, and the multiplexed encoded data corresponding to the key frame are used. Information on the recording position of one or all of the encoded data on the recording medium and the link information of the encoded data before and after. It is intended to give you data.
[0031]
In the embodiment described below, it is assumed that the MPEG system exemplified in the example of the conventional encoding system is used as the encoding system, and the multiplex encoded data input from the encoder has the structure shown in FIG. Further, a pack including a head position recorded first among the head position of video encoded data of a key frame and the head position of audio encoded data corresponding to the key frame is referred to as a key pack.
[0032]
First, the operation when a recording medium is inserted into the recording / reproducing apparatus will be described. The recording media controller 3 outputs the recording media control data so as to read the management data recorded on the recording media 2. Management data is input to the data processing unit 1 from the recording medium 2 and output to the encoded data management unit 4. The encoded data management unit 4 records input management data at a predetermined address.
[0033]
Next, an operation at the time of recording encoded data will be described. This operation is not a case of a moving image reproduction dedicated device but an operation of a recordable device. First, the data processing unit 1 divides multiplex encoded data input from the encoder into sectors. Then, according to the sector number input from the recording media controller 3, the encoded data is output so as to be recorded in a predetermined sector on the recording medium 2.
[0034]
In addition, when the multiplex encoded data input from the encoder is recorded on the recording medium 2, the data processing unit 1 specifies the first sector number and the last sector number of the encoded data management unit 4 for the portion where the recording area is continuous. Output to. At the time of recording, there are recording apparatuses that can record a temporally continuous video sequence by dividing it into a plurality of areas, and there are recording apparatuses that can record only in a continuous area. In the former case, when recording is divided into a plurality of areas, the first sector number and the last sector number are output to the encoded data management unit 4 for each area.
[0035]
Further, the data processing unit 1 detects the first header of FIG. 21 added by the encoder, and detects the start and end of the video encoded data of the key frame and the audio encoded data corresponding to the key frame. A detection unit is provided, and position information on a recording medium on which video encoded data and multiplexed encoded data corresponding to the key frame are recorded is output to the encoded data management unit 4.
[0036]
Here, the start and end of the video encoded data of the key frame can be performed by detecting a header existing in video frame units in the video encoded data. Also, the start and end of the audio encoded data corresponding to the key frame can be known from the time information added to the video encoded data and the audio encoded data.
[0037]
If the encoder is configured to output a flag indicating the start of the first header, video encoded data of the key frame, and a flag indicating the start and end of the audio encoded data corresponding to the key frame, a data processing unit It is not necessary to include an output unit that detects the first header in 1 and a detection unit that detects the start and end of the video encoded data of the key frame and the audio encoded data corresponding to the key frame.
[0038]
The memory controller 5 in the encoded data management unit 4 generates concatenation information from the data input from the data processing unit 1. At this time, the recording position information of the video encoded data is managed by the video recording information management unit 5a, the recording position information of the audio encoded data is managed by the audio recording position information management unit 5b, and the recording position information of the multiplexed encoded data is recorded. Is managed by the multiplexed record information management unit 5c.
[0039]
Then, the video sequence, the recording area on the recording medium, and the key frame recorded information in the recording area are generated from the contents of the memory 6 and recorded in the memory 6 together with the recording position information. With this operation, when encoded data is newly recorded on the recording medium 2, the latest management data corresponding to the encoded data recorded on the recording medium 2 is always recorded on the encoded data management unit 4. Will be.
[0040]
When new data is recorded on the recording medium 2, the latest management data is recorded in the encoded data management unit 4, but the management data recorded on the recording medium 2 is not updated. Therefore, for example, when the recording is completed or when the recording medium 2 is taken out from the apparatus, the management data recorded in the encoded data management unit 4 needs to be recorded in the recording medium 2. Further, even when the encoded data is being recorded on the recording medium 2, the recording medium 2 may be controlled so as to periodically update the management data.
[0041]
In this procedure, the recording medium controller 3 controls the recording medium 2 so as to write management data to the recording medium 2 and controls the data of the encoded data management unit 4 to be sequentially input to the data processing unit 1. The data processing unit 1 outputs the data input from the encoded data management unit 4 to the recording medium 2.
[0042]
Next, the operation during normal playback will be described. Here, it is assumed that management data read from the recording medium 2 has already been recorded in the encoded data management unit 4. First, the first sector number and the last sector number in which multiplex encoded data to be reproduced is recorded are read from the encoded data management unit 4 and input to the recording media controller 3.
[0043]
The encoded data may be divided and recorded in a plurality of areas on the recording medium 2. In that case, the first sector number and the last sector number of the area are encoded for each recording area. Input from the data management unit 4 to the recording media controller 3.
[0044]
FIG. 2 is a diagram illustrating an example in which encoded data is divided and recorded in two areas indicated by hatching. The actual recording area of the recording medium 2 has a plurality of concentric or spiral shapes, but here, the horizontal axis represents the time axis.
[0045]
If the encoded data is read in the order of 1 and 2 in FIG. 2, the encoded data management unit 4 first outputs the sector numbers at the positions a and b, and then the sector numbers at the positions c and d. Is output. The recording media controller 3 outputs the recording media control data so as to read the multiplexed encoded data from the recording media 2 in accordance with the sector number input from the encoded data management unit 4. The data processor 1 receives continuous encoded data from the recording medium 2 and outputs it to the decoder.
[0046]
Next, the operation at the time of cueing will be described. At the time of cueing, the encoded data management unit 4 outputs the recording position information on the recording medium of the multiplexed encoded data corresponding to the key frame to be reproduced to the recording medium controller 3. The recording medium controller 3 controls the recording medium 2 to access the sector in which the encoded data of the key pack is recorded using the input recording position information. The operation of the data processing unit 1 is the same as the normal reproduction.
[0047]
Next, the operation during high-speed playback will be described. During high-speed playback, the recording media controller 3 controls the recording media so as to continuously access the encoded data corresponding to the key frame. In the MPEG system, there is a B frame that is predictively encoded from both the forward and backward directions between key frames, but video encoded data of the B frame is not read during high-speed playback.
[0048]
First, when high-speed playback is designated, positional information on the recording medium of key frames to be reproduced is output from the encoded data management unit 4 to the recording medium controller 3 in order. The recording medium controller 3 controls the recording medium 2 to access the sector in which the video encoded data of the key frame is recorded using the input recording position information.
[0049]
Note that the operation during playback of pointer editing is almost the same as that of high-speed playback. In the case of high-speed playback, the encoded data corresponding to the key frame is randomly accessed in order. According to the connection information indicated by the data, the encoded data recorded in the discontinuous area is randomly accessed.
[0050]
As described above, in the present embodiment, since the recording position on the recording medium of data to be read at the time of random access is stored in the encoded data management unit 4 as management data, unnecessary data is hardly read out. Fast random access is possible.
[0051]
Next, another embodiment of the present invention will be described. The block diagram of this embodiment is the same as the block diagram shown in FIG. In this embodiment, the management data recorded in the encoded data management unit 4 includes the sector number indicating the head position of the key pack, the media of the video encoded data, as the recording position information of the multiplexed encoded data on the recording medium. The recording position information above includes the sector number indicating the head position and the tail position of the video encoded data of the key frame.
[0052]
FIGS. 3A and 3B are diagrams showing examples of the leading position of the key pack. The key pack refers to a pack including the first recorded first position among the start position of the video encoded data of the key frame and the start position of the audio encoded data corresponding to the key frame. 3A shows a case where the head of the video encoded data of the key frame is recorded first, and FIG. 3B shows a case where the head of the audio encoded data corresponding to the key frame is recorded first. This is an example. In the figure, m1, m2,..., M8 are multiplexing additional data.
[0053]
4 is a block diagram of the header detection circuit of the data processing unit in FIG. 1. In FIG. 4, 7 is a first header detection unit, 8 is a video header detection unit, 9 is an audio header detection unit, and 10 is a determination circuit. It is. The data processing unit 1 of this embodiment has a header detection circuit for detecting a header as shown in FIG. 4, and when encoded data is recorded, the first sector number in which the key pack is recorded and the encoded data of the key frame Is output to the encoded data management unit 4.
[0054]
The header detection circuit shown in FIG. 4 includes a first header detection unit 7, a video header detection unit 8, an audio header detection unit 9, and a determination circuit 10. The first header detection unit 7, the video header detection The unit 8 and the audio header detection unit 9 receive multiplexed encoded data from the encoder, and are added to the first header, the header added to the beginning of the video frame, and the beginning of the audio frame, respectively, shown in FIG. Is detected.
[0055]
When the header is detected, a flag is input to the determination circuit 10. In the determination circuit 10, the key pack is recorded from the header input from the first header detection unit 7, the video header detection unit 8, and the audio header detection unit 9 and the sector number input from the recording media controller 3. The first sector number and the sector number where the head and tail of the video encoded data of the key frame are recorded are output to the encoded data management unit 4.
[0056]
It should be noted that the header detected in the example of FIG. Therefore, a flag indicating that the first header shown in FIG. 21, the header added to the beginning of the video frame, and the header added to the beginning of the audio frame are input from the encoder to the data processing unit 1 is input. In this configuration, the data processing unit 1 does not need to include the first header detection unit 7, the video header detection unit 8, and the audio header detection unit 9 shown in FIG.
[0057]
In addition, since data read from the recording medium 2 is usually obtained in units of sectors at the time of reproduction, if the head of multiplexed encoded data necessary for reproduction is recorded in the middle of the sector, unnecessary data is first obtained. Will exist. Therefore, the decoder at the subsequent stage of the data processing unit 1 has the same circuits as the first header detection unit 7, the video header detection unit 8, and the audio header detection unit 9 shown in FIG. It is necessary to remove unnecessary data obtained previously.
[0058]
However, when the data processing unit 1 includes the circuit shown in FIG. 4 as in the present embodiment, unnecessary data obtained before the header can be removed by the data processing unit 1, so that the decoder at the subsequent stage It is not necessary to provide a detection circuit for detecting the header.
[0059]
Thus, in this embodiment, the head or tail of each encoded data corresponding to the key frame is recorded in the management of the video encoded data, the audio encoded data, and the multiplex encoded data corresponding to the key frame. Since the sector number is used, the key frame can be accessed efficiently. That is, access to a recording position corresponding to a key frame required when performing cueing, high-speed playback, and pointer editing becomes possible by using the management data.
[0060]
At this time, if both video and audio are output, only the management data related to the multiplex encoded data need be used, and the management data related to the video encoded data and the audio encoded data is unnecessary. On the other hand, when only video encoded data or only audio encoded data is accessed, management data related to video encoded data and audio encoded data is used.
[0061]
Next, another embodiment of the present invention will be described. The block diagram of the present embodiment is the same as the block diagram shown in FIG. 1, but the operation of the encoded data management unit 4 is different. In the present embodiment, in the reproduction accompanied by random access, in the case of the reproduction of the head search and pointer editing, the multiplexed encoded data is read using the recording position information on the recording medium of the multiplexed encoded data, and the high-speed reproduction is performed. In this case, the video encoded data is read using the recording position information of the video encoded data on the recording medium.
[0062]
Therefore, in the case of playback by cueing or pointer editing, a sector number indicating the head position of the key pack is input from the encoded data management unit 4 to the recording media controller 3, and in the case of high-speed playback, video encoding of key frames is performed. Enter the sector number indicating the head position and tail position of the data.
[0063]
This is because the audio output and the video output need to be reproduced in synchronism in the case of cueing and pointer editing reproduction, and only in the video output in the case of high-speed reproduction. According to the present embodiment, unnecessary audio encoded data is not read during high-speed reproduction, and the number of images that can be reproduced per unit time is increased.
[0064]
In the embodiment described above, the sector number where the head position of the key pack is recorded as the position information on the recording medium of the video encoded data as the position information on the recording medium of the multiplex encoded data in the management data. The sector number indicating the head position and the tail position of the video encoded data of the key frame is used.
[0065]
Further, as another embodiment, the sector number indicating the head position of the key frame video encoded data is used as the management data, and the sector number indicating the head position of the key frame video encoded data is used. Examples that are not conceivable are also conceivable. In this case, the head position of the key pack is accessed at the time of random access, both when the audio output and the video output are reproduced synchronously or when only the video output is performed.
[0066]
Therefore, when only video output is required, unnecessary audio encoded data may be read. For example, in the case of high-speed playback, the number of video frames that can be played back per unit time decreases, but the capacity of management data Can be less.
[0067]
If a flag indicating the end of the video frame is input from the video decoder at the time of reproduction, it is not necessary to record the tail position of the video encoded data of the key frame, and management data can be further reduced. In this case, only the sector number indicating the head position of the key pack is used as management data.
[0068]
Next, another embodiment of the present invention will be described. FIG. 5 is a block diagram of the encoded data management unit of FIG. 1, in which 11 is a memory controller, and 12 to 14 are memories.
[0069]
As shown in FIG. 5, the encoded data management unit 4 includes three memories 12, 13, and 14 that store management data, and a memory controller 11 that controls them. The memories 12, 13, and 14 each have a first table for managing video sequences (hereinafter referred to as a sequence table) and a second table for managing recording areas on a recording medium (hereinafter referred to as an area table). A third table (hereinafter referred to as a key frame table) for managing video encoded data, audio encoded data, or multiplexed encoded data corresponding to the key frame is recorded.
[0070]
6 is a diagram showing an example of the contents of the management data recorded in each memory shown in FIG. 5. In FIG. 5, 21 is the next sequence address, 22 is the area table address, 23 is the address of the next area, 24 is the start sector number, 25 is the end sector number, 26 is the start frame address, 27 is the next frame address, 28 is the pack start sector number, 29 is the key start sector number, 30 is the key end sector number, and the address 21 , 22, 23, 26 and 27 are pointers (concatenation information), and sector numbers 24, 25, 28, 29 and 30 are recording position information.
[0071]
The sequence table in the memory 12 has one word of data for one video sequence. One word data includes a pointer “next sequence address” 21 indicating the next word and a pointer “area table address” 22 indicating the position of the area table.
[0072]
“Next sequence address” 21 indicates the playback sequence of the video sequence, and “area table address” 22 indicates the address of the area table corresponding to the video sequence. Here, the “next sequence address” 21 indicates the address of the memory 12, but the last data connected by the pointer “next sequence address” 21 has a special value indicating “end”.
[0073]
In the area table of the memory 13, there is one word of data for each continuous recording area on the recording medium. One word data is a pointer “next area address” 23 indicating the next word, “start sector number” 24 which is the first sector number of the continuous recording area, and last sector number of the continuous recording area “ It consists of four elements: “end sector number” 25 and pointer “start frame address” 26 indicating the start address of the key frame table corresponding to the key frames included in the continuous recording area.
[0074]
The area table corresponding to 1-word data in the sequence table is a series of area tables connected by the pointer “next area address” 23 from the area table address indicated by “area table address” 22 in the sequence table.
[0075]
The “next area address” 23 indicates the address of the memory 13, but the last data connected by the pointer “next area address” 23 has a special value indicating “end”. If no key frame is recorded in the area from the start sector number to the end sector number indicated by 1-word data in the area table, the “start frame address” 26 has a special value indicating “no data”.
[0076]
In the key frame table of the memory 14, there is one word of data for each key frame. One-word data includes a pointer “next frame address” 27 indicating the next word, “pack start sector number” 28 indicating the sector number in which the head of the key pack is recorded, and the head of the video encoded data of the key frame. “Key start sector number” 29 indicating the sector number in which “B” is recorded, and “key end sector number” 30 indicating the sector number in which the rear end is recorded.
[0077]
The “next frame address” 27 indicates the address of the memory 14, but the last data connected by the pointer “next frame address” 27 has a special value indicating “end”.
[0078]
In the example of FIG. 6, there is no information indicating whether the data recorded in the key frame table of the memory 14 of FIG. 5 indicates the encoded data of the I frame or the encoded data of the P frame. Therefore, as will be described later, a flag indicating whether I-frame video encoded data or P-frame video encoded data is indicated for each word data of the key frame table is added, or the key frame table is changed to I A method of separating into a frame table and a P frame table is used.
[0079]
FIG. 7 is a diagram showing the relationship between the management data recorded in each table of the memory shown in FIG. 5 and the recording area. A plurality of video sequences can be recorded on the recording medium 2, and one video sequence may be recorded in a continuous recording area on the recording medium 2, or may be divided into a plurality of recording areas and recorded. Sometimes it is done.
[0080]
In the example of FIG. 7, three video sequences are recorded, and the video sequence 1 is divided and recorded in three areas. The playback order is assumed to be the order of video sequences 1, 2, and 3.
[0081]
In the case of the example in FIG. 7, there are three 1-word data corresponding to the video sequences 1, 2, and 3 in the sequence table. The first one-word data is data corresponding to the video sequence 1, the “next sequence address” of the video sequence 1 indicates one-word data corresponding to the video sequence 2, and the “next sequence address” of the video sequence 2 Indicates one word data corresponding to the video sequence 3. The “area table address” in the sequence table indicates the address of the first area table data corresponding to the video sequence.
[0082]
In the area table, one word data corresponds to a continuous recording area on the recording medium 2. That is, in the case of FIG. 7, there are three 1-word data in the area table corresponding to the video sequence 1. In this way, when there are a plurality of area tables, the “next area address” of the area table indicates the next area table data.
[0083]
Of the 1-word data in the area table, “start sector number” indicates the first sector number of consecutive recording areas, and “end sector number” indicates the last sector number. In this continuous recording area, encoded data of a plurality of key frames is recorded. Among these, the sector number in which the encoded data of the first key frame is recorded is managed by a key frame table indicated by “start frame address”.
[0084]
Of the 1-word data in the key frame table, “pack start sector number” indicates the sector number where the head of the key pack is recorded, and “key start sector number” records the head of the video encoded data of the key frame. The sector number is indicated, and the “key end sector number” indicates the sector number in which the tail of the video encoded data is recorded. The “next frame address” indicates the next key frame table data.
[0085]
FIG. 8 is a diagram illustrating an example of a special case in which encoded data of the same key frame extends over an area managed by a plurality of area tables. In the example of FIG. 8, the encoded data of the key frame i is recorded across the three recording areas of the recording area 1, the recording area 2, and the recording area 3.
[0086]
In the case of FIG. 8, the “key start sector number” of the last key frame table corresponding to the recording area 1 has a sector number indicating the beginning of the video encoded data of the key frame i, and “key end sector number” has A sector number indicating the tail is recorded, and the “key end sector number” is the sector number of the recording area 3.
[0087]
Further, since there is no key frame table corresponding to the recording area 2, data indicating “no data” is recorded in the “start frame address” of the recording area 2. The “start frame address” of the recording area 3 indicates the key frame table of the key frame i + 1.
[0088]
During high-speed playback, the “key start sector number” and the “key end sector number” recorded in the key frame table are input to the recording media controller 3, and the recording media controller 3 starts from the “key start sector number” to the “key end”. The recording medium 2 is controlled so as to read data up to the “sector number”.
[0089]
At this time, when reproducing the key frame i shown in FIG. 8, as described above, recording areas having different “key start sector number” and “key end sector number” corresponding to the key frame i in the key frame table are recorded. Will show. However, even in such a case, it can be known that the information is recorded in the order of the recording area 1, the recording area 2, and the recording area 3 according to the information recorded in the area table.
[0090]
Therefore, the recording medium controller 3 accesses the recording medium 2 in the order of the recording area 1, the recording area 2, and the recording area 3, and reads data up to “key end sector number”.
[0091]
In the case where the video sequence is divided into a plurality of areas and is recorded only in a continuous area without being recorded, and editing is not performed during reproduction and the order is not changed. For example, in FIG. If the “next area address” 23 indicates the playback sequence of the video sequence, the sequence table becomes unnecessary.
[0092]
Next, the operation of the memory controller and the memory shown in FIG. 5 will be described in more detail. When the recording medium 2 is inserted into the apparatus, the management data recorded in the recording medium 2 is sequentially input, so that the memory controller 11 stores the memory 12, 13 so as to record the management data in a predetermined memory. , 14 are controlled.
[0093]
FIG. 9 is a flowchart for explaining the operation of the memory when the encoded data is recorded on the recording medium. Here, it is assumed that each table is based on the structure of FIG. 6, and FIG. 10 illustrates how the management data changes.
[0094]
In the sequence table of the memory 12 in FIG. 6, 1-word data is added to the rear end of already recorded data, the area table of the memory 13 corresponding to the added data of the sequence table, and the key frame table of the memory 14 Record additional data. In FIG. 10, the solid line portion is management data already stored, and the dotted line portion is management data additionally recorded.
[0095]
In the following description, ad12 and adl12 indicate addresses of the memory 12, ad13 and adn13 indicate addresses of the memory 13, and ad14 and adn14 indicate addresses of the memory 14, respectively. In recording, the recording medium controller 3 in FIG. 1 manages the free area of the recording medium 2 based on the management data in advance, and performs control to record data in each sector of the free area.
[0096]
Hereinafter, based on the flowchart of FIG. 9, it demonstrates in order according to each step (S). First, the sequence table of the memory 12 is set in steps S101 to S104. The free space in the memory 12 and the free space in the memory 13 are searched for ad12 and ad13, respectively (S101). Then, the “area table address” of ad12 is set to ad13, and the “next sequence address” is set to “end” (S102).
[0097]
Next, of the data connected by the pointer “next sequence address” in the sequence table of the memory 12, the address of the last data is set to adl12 (S103). Then, the “next sequence address” of adl12 is set to ad12 (S104). With the above operation, new one-word data ad12 is added to the rear end of the sequence table.
[0098]
Next, an area table of the memory 13 is set. Since the first sector number of the continuous recording area on the recording medium 2 is input to the encoded data management unit 2, this input data is set as the “start sector number” of ad13 (S105). Then, an empty area of the key frame table in the memory 14 is searched and set to ad14 (S106). Then, the “start frame address” of ad13 is set to ad14 (S107).
[0099]
Next, since the three sector numbers are input, the sector number in which the head of the key pack is recorded and the sector number indicating the head and the tail of the video encoded data of the key frame are input. The “start sector number”, “key start sector number”, and “key end sector number” are set (S108).
[0100]
Then, it is determined whether or not the continuous recording area of the recording medium 2 is finished (S109). If not, the empty area of the memory 14 is searched and set to adn14 (S110). Then, the “next frame address” of ad14 is set to and14, and the ad14 is updated to indicate adn14 (S111).
[0101]
Next, the process returns to step S108. The sector number indicating the encoded data of the key frame recorded in the continuous recording area on the recording medium 2 in the loop of step S108 to step S111 is written in the key frame table.
[0102]
If the continuous recording area of the recording medium 2 is completed in step S109, the last sector number of the recording area which is the input sector number is recorded in the “end sector number” of the area table ad13 (S112), and the key frame table The “next frame address” of ad14 is set to “end”. In step S105 to step S113, the data of the one-word area table is set.
[0103]
Next, in step S114, it is determined whether or not the input data is complete. If not, the free space in the area table is searched and set to adn13 (S115), and the “next area address” of ad13 is set to adn13. , Ad13 is updated to indicate adn13 (S116).
[0104]
Then, the process returns to step S105. In the loop from step S105 to step S116, new area table data connected by the pointer is created from the previous area table data, and the sector numbers are recorded in a series of key frame tables indicated by the area table data. If it is determined in step S114 that the data is complete, the “next area address” of ad13 is set to “end” (S117), and the operation is terminated.
[0105]
Next, the operation during normal playback will be described. During normal playback, the “start sector number” and “end sector number” of the area table corresponding to the video sequence to be played are output to the recording media controller 3. FIG. 11 is a flowchart illustrating the operation during normal playback.
[0106]
Here, two pointers “sequence address” and “area address” indicating addresses of the sequence table and area table corresponding to the data being reproduced are used. Each table is based on the structure of FIG.
[0107]
Hereinafter, it demonstrates in order according to each step (S). First, “sequence address” and “area address” are set (S121 to S122). Next, the value of “start sector number” and “end sector number” of “area address” are output to the recording media controller 3 (S123, S124).
[0108]
The recording medium controller 3 uses the first sector number and the last sector number of the continuous recording area input from the encoded data management unit 4 to read out the data recorded in the continuous recording area. 2 is controlled.
[0109]
Then, it is determined whether the “address of the next area” of the “area address” is “end” (S125). If it is determined that it is not 'end', the area table data corresponding to "sequence address" remains, so "area address" is set to "address of next area" of "area address" and the pointer is advanced ( S126), the process returns to step S123.
[0110]
In the loop from step S123 to step S126, the first sector number and the last sector number of the recording area are sequentially output for each of a plurality of continuous recording areas on the recording medium 2 corresponding to one word data of the sequence table.
[0111]
If “end” is determined in step 125, it is determined whether “next sequence address” of “sequence address” is “end” (S127). If it is determined that it is not “end”, the video sequence recorded on the recording medium 2 remains, so that “sequence address” is set to “next sequence address” of “sequence address” and the pointer is advanced (S128). ), The process returns to step S122.
[0112]
The first sector number and the last sector number are sequentially output for each continuous recording area until the video sequence recorded on the recording medium 2 is completed in the loop of step S122 to step S128.
[0113]
FIG. 12 is a flowchart for explaining the operation in the case of high-speed playback. At the time of high-speed playback, the sector number in which the head and tail of the video encoded data of the key frame recorded in the key frame table is read and output to the recording media controller 3.
[0114]
Here, three pointers “sequence address”, “area address”, and “key frame address” indicating addresses of the sequence table, area table, and key frame table corresponding to the data being reproduced are used.
[0115]
Hereinafter, it demonstrates in order according to each step (S). First, “sequence address”, “area address”, and “key frame address” are set (S131 to S133). Next, the “key start sector number” and the “key end sector number” of the “key frame address” are output to the recording media controller 3 (S134).
[0116]
The recording medium controller 3 controls the recording medium 2 so as to read the video encoded data of the key frame from the first sector number and the last sector number input from the encoded data management unit 4.
[0117]
Then, it is determined whether the “next key frame address” of the “key frame address” is “end” (S135). If it is not 'End', since the sector number to be accessed next is recorded in the “Next Key Frame Table”, “Key Frame Address” is set to “Next Key Frame Address” of “Key Frame Address”, The pointer is advanced (S136), and the process returns to step S134.
[0118]
In the loop from step S134 to step S136, a plurality of sector numbers recorded in the key frame table corresponding to one word data in the area table are output to the recording media controller 3.
[0119]
If “end” is determined in step S135, it is determined whether “next area address” of “area address” is “end” (S137). If it is determined that it is not “end”, the area table data corresponding to “sequence address” remains, so that “area address” is set to “next area address” of “area address” and the pointer is advanced (S138). ), The process returns to step S133.
[0120]
In the loop from step S133 to step S138, the sector number for the video encoded data of the key frame corresponding to one word data of the sequence table is sequentially output.
[0121]
If “end” is determined in step S137, it is determined whether “next sequence address” of “sequence address” is “end” (S139). If it is determined that it is not “end”, the video sequence recorded on the recording medium 2 remains, so that “sequence address” is set to “next sequence address” of “sequence address” and the pointer is advanced (S140). ), The process returns to step S132.
[0122]
The sector number corresponding to the video encoded data of the key frame is sequentially output until the video sequence recorded on the recording medium 2 is completed in the loop of step S132 to step S140.
[0123]
In the example of FIG. 12, all sector numbers recorded in the key frame table are output. That is, all key frames are reproduced, but when the speed of high-speed reproduction is high, some frames may be output after being thinned out.
[0124]
Here, in order to decode a P frame, a decoded image of a past I frame or P frame is required. Therefore, when I frames and P frames in a GOP are thinned out, P frames after the frame thinned out in the GOP cannot be decoded.
[0125]
For example, in the case of the GOP structure as shown in FIG. 13, when the frame (c) is thinned out, the frame (d) is not decoded because the decoded image of the frame (c) to be referenced cannot be obtained. Similarly, when the frame (b) is thinned out, the frames (c) and (d) cannot be decoded, and when the frame (a) is thinned out, the frames (b), (c) and (d) cannot be decoded. It will be. Thus, since the number of frames that cannot be decoded differs depending on the thinned frame, it is calculated which frame is thinned from the speed of high-speed playback.
[0126]
FIG. 14 is a diagram showing another example of management data recorded in the encoded data management unit of FIG. 1, and the reference numerals in the figure are the same as those in FIG. This is an example in which a signal indicating the end of the key frame is input from the decoder. From the example of the management data recorded in the encoded data management unit 4 shown in FIG. It excludes the “sector number”. Therefore, the capacity of the encoded data management unit 4 is smaller in this embodiment.
[0127]
The operation in this example differs from that in FIG. 6 only in the case of high-speed playback. That is, in the example of FIG. 6, the “key end sector number” of the key frame table is output during high-speed playback, and the recording media controller 3 can detect the end of the key frame from this data, but in this example, the key frame table The recording media controller 3 detects the end of the key frame from the input signal from the decoder. In normal playback or pointer editing playback, the end position of the key frame does not need to be used, and the operation is the same as the example of FIG.
[0128]
FIG. 15 is a diagram showing another example of management data, and the reference numbers in FIG. 15 are the same as those in FIG. In this case, a signal indicating the end of the key frame is input from the decoder, and the head position of the key pack is accessed at the time of random access both when the audio output and the video output are synchronously reproduced and when only the video output is performed.
[0129]
As described above, in this example, if only video output is required, unnecessary audio encoded data may be read. For example, in the case of high-speed playback, video frames that can be played back per unit time. Although the number is reduced, the capacity of management data can be reduced.
[0130]
In cueing, high-speed playback, and pointer editing playback, it is necessary to know which of the managed key frames is an I frame. When managing only I frames in the key frame table, it is possible to know which key frame is the I frame in the examples of FIGS. 6, 14, and 15. In the case of management, it is difficult to know whether the key frame is an I frame or a P frame in the examples of FIGS.
[0131]
Therefore, for example, as shown in FIG. 16, if a 1-bit flag 31 indicating whether data is I frame data or P frame data is added to each word data of the key frame table, the key frame changes the I frame. It can be seen whether it shows a P frame or a P frame.
[0132]
FIG. 16 is obtained by adding a 1-bit flag 31 to the key frame table shown in FIG. 6, but similarly by adding a 1-bit flag to the key frame tables of FIGS. A frame and a P frame can be distinguished.
[0133]
Next, still another embodiment of the present invention will be described. FIG. 17 is a block diagram showing another example of the encoded data management unit of FIG. 1. In FIG. 17, 32 is a memory controller, 33 and 34 are memories, and the other parts having the same operation as FIG. 5 are the same. The code | symbol is attached | subjected. The encoded data management unit 4 includes four memories 12, 13, 33, and 34 and a memory controller 32 that controls them.
[0134]
The memories 12 and 13 are the same as those in FIG. 5, and the memory 33 stores a fourth table (hereinafter referred to as an I frame key frame table) for managing the recording position information of the I frame on the recording medium. The memory 34 stores a fifth table (hereinafter referred to as a P frame key frame table) for managing the recording position information of the P frame on the recording medium.
[0135]
FIG. 18 is a diagram showing an example of data recorded in the I-frame key frame table and the P-frame key frame table. In the figure, 41 is the frame address next to I, 42 is the pack start sector number, and 43 is I Key start sector number, 44 is the key end sector number of I, 45 is the frame address of P, 46 is the next frame address of P, 47 is the key start sector number of P, and 48 is the key end sector number of P Addresses 41, 45, and 46 are pointers (concatenation information), and sector numbers 42, 43, 44, 47, and 48 are recording position information.
[0136]
The configuration of each table is the same as that of the key frame table of FIG. 6 except that the I frame key frame table includes a pointer indicating the address of the P frame key frame table. The key frame table for P frame is different only in that there is no pack start sector number. Since the key frame table for P frame is assumed to be used only for high-speed playback, no key pack position information is required.
[0137]
In the example of FIG. 18, the key frame table corresponding to one I frame is a P frame key frame table indicated by 1-word data of the I frame key frame table and “P frame address” 45 of the data. Is the word data of the key frame table for a series of P frames connected by “the next frame address of P” 46.
[0138]
In the moving image recording / playback apparatus using the present invention, when changing or deleting the playback sequence of a video sequence or a part thereof, the sequence table of the memory 12, the area table of the memory 13, and the key frame table of the memory 14 are stored. There is no need to change the encoded data recorded on the recording medium. For example, when erasing one video sequence, it is only necessary to remove one word data of the sequence table corresponding to the video sequence to be erased from the pointer connected by the “next sequence address”.
[0139]
Even when the playback order of the video sequence is changed, it is only necessary to change the order of the pointers connected by the “next sequence address” in the sequence table. Also, if you want to erase a part of the sequence and leave the rest, remove the data in the area table corresponding to the part to be erased from the pointer connected by the “table address of the next area” and select “ It is only necessary to newly set “start sector number” and “end sector number”.
[0140]
In the case of pointer editing, if the management data indicating the new playback order after editing is reconstructed from the original management data and stored separately from the original management data, the original playback order is changed. Various playbacks are possible while being stored.
[0141]
In order to write new data in each table, it is necessary to manage unused addresses in each table. As a method for managing unused addresses, there is a method in which a table for managing free addresses is provided for each table, and by checking the used addresses by following the pointers of each table, it is possible to know the unused addresses. As in 19, unused addresses can be managed.
[0142]
Next, another embodiment of the present invention will be described. FIG. 19 is a diagram in which a 1-bit flag indicating whether the table is used or not used is added for each word data of each table. In FIG. 19, reference numerals 51 to 53 indicate flags used or unused. Parts having the same function as 6 are denoted by the same reference numerals.
[0143]
In the embodiment of FIG. 19, flags 51 to 53 indicating whether the word is used or not used are added to each word of the configuration example shown in FIG. 6, but are shown in FIG. 14, FIG. 15, FIG. The same applies to other examples.
[0144]
In the above-described embodiment, management data is constructed at the time of recording encoded data, and reproduction with random access is performed at high speed using the management data. Therefore, in the case of a medium on which no management data is recorded at the time of recording encoded data, it is necessary to construct management data.
[0145]
Next, means for constructing management data in the embodiment of FIG. 1 will be described below. First, the recording media controller 3 outputs recording media control data so that data recorded on the recording media 2 is read from the beginning to the end.
[0146]
The data processing unit 1 is provided with a detection circuit for detecting the start and end of each of the video sequence, pack, and key frame, the sector number where the head and tail of the video sequence are recorded, and the head of the key pack are recorded. And the sector number in which the head and tail of the video encoded data of the key frame are recorded are output to the encoded data management unit 4. The encoded data management unit 4 additionally records management data in order as in the case of recording encoded data.
[0147]
In the above-described embodiment, the sector number is used as the recording position information on the recording medium. However, the present invention is not limited to this. Depending on the recording medium, there may be other notations such as a combination of a track number and a sector number. In that case, a notation according to the recording medium may be used.
[0148]
As is clear from the above description, the present invention has the following effects. Since the recording position information on the recording medium of the video encoded data, audio encoded data, and multiplexed encoded data corresponding to the key frame is stored as management data, in the case of reproduction accompanied by random access, the key frame to be accessed The position on the recording medium is known. Therefore, unnecessary random data is not read out during random access, and high-speed random access is possible.
[0149]
By using the start position and tail position of each of the video encoded data and multiplexed encoded data corresponding to the key frame as management data, high-speed cueing is possible without reading unnecessary data during random access. Become. In other words, the number of video frames that can be played back per unit time increases during high-speed playback, and the scene connection position can be efficiently accessed during playback of pointer editing.
[0150]
If the video output and the audio output are played back synchronously, the multiplex encoded data necessary for synchronous playback is accessed. If the video output is played back only, only the video encoded data is accessed. When only video output is required, such as high-speed playback, more images can be played back per unit time.
[0151]
As management data, a new video sequence is recorded by using a hierarchical structure in which the first hierarchy is a video sequence, the second hierarchy is a recording area on the recording medium, and the third hierarchy is a key frame position recorded in the recording area. The management data can be easily updated even when erasing or changing the reproduction order.
[0152]
If management data corresponding to a key frame recorded in the recording area is recorded in a hierarchical structure of an I frame and a P frame following the I frame, the I frame and the P frame can be managed separately. Therefore, for example, at the time of high speed reproduction, the type of I frame and P frame and the recording position on the recording medium can be known, so that the frame to be used can be flexibly selected according to the speed of high speed reproduction.
[0153]
Further, since the pointer indicating the reproduction order in each table and the pointer indicating the connection relationship between the tables are used as the connection information within each layer and between each layer of the management data, the video can be changed by changing these pointers. It is possible to easily record, delete, change the playback order of the sequence, erase part of the video sequence, change the playback order, and the like.
[0154]
For each word of management data, if a 1-bit flag indicating whether the word is used or not is added, it can be determined whether the memory for recording the management data is used or not. Area management becomes easy.
[0155]
Even when management data is not recorded, if management data is newly constructed, even with a recording medium on which management data is not recorded, playback with random access can be performed at high speed.
[0156]
As described above, in order to use inter-frame prediction for moving picture coding such as the MPEG system, when random access is allowed only for a specific key frame, video coded data or audio coded data for the key frame and Since the recording position of the multiplex encoded data is managed, it is possible to directly access a desired key frame. For this reason, high-speed playback and pointer editing can be performed easily and efficiently, and since moving images with audio can be handled, the effect is remarkable.
[0157]
【The invention's effect】
According to the present invention, when multiplexed data including video encoded data and audio encoded data is recorded on a recording medium, position information indicating the position of a key frame on the multiplexed data is recorded on the medium as management data. However, by accurately accessing the key frame using the management data during playback, functions such as cueing, high-speed playback, and editing can be realized.
[Brief description of the drawings]
FIG. 1 is a block diagram for explaining an embodiment of a moving image recording / reproducing apparatus according to the present invention.
FIG. 2 is a diagram for explaining a position on a recording medium of a sector number input from an encoded data management unit in FIG. 1;
FIG. 3 is a diagram showing a head of a key pack in the present invention.
4 is a diagram showing a header detection circuit of the data processing unit in FIG. 1. FIG.
5 is a block diagram of an encoded data management unit in FIG. 1. FIG.
6 is a diagram showing an example of the contents of management data recorded in the memory in FIG. 5. FIG.
FIG. 7 is a diagram showing a relationship between management data recorded in an encoded data management unit and a recording area on a recording medium according to the present invention.
FIG. 8 is another diagram showing the relationship between the management data recorded in the encoded data management unit and the recording area on the recording medium in the present invention.
FIG. 9 is a flowchart for explaining the operation of the encoded data management unit when recording encoded data in the moving image recording / reproducing apparatus according to the present invention;
FIG. 10 is a diagram showing an update of an encoded data management unit at the time of recording encoded data in the present invention.
FIG. 11 is a flowchart showing the operation of the encoded data management unit during normal reproduction in the present invention.
FIG. 12 is a flowchart showing the operation of the encoded data management unit during high-speed playback in the present invention.
FIG. 13 is a diagram for explaining a frame to be thinned out in a GOP and a frame that cannot be decoded in the present invention.
FIG. 14 is a diagram illustrating another example of management data recorded in the encoded data management unit according to the present invention.
FIG. 15 is a diagram illustrating another example of management data recorded in the encoded data management unit according to the present invention.
FIG. 16 is a diagram showing another configuration example of the key frame table in the present invention.
FIG. 17 is a block diagram showing another embodiment of the encoded data management unit in FIG. 1;
FIG. 18 is a diagram showing an example of data recorded in an I frame key frame table and a P frame key frame table in the present invention;
FIG. 19 is a diagram in which a 1-bit flag indicating whether each table is used or unused is added for each word data of the table according to the present invention.
FIG. 20 is a diagram illustrating a configuration example of a conventional GOP.
FIG. 21 is a diagram illustrating an example of conventional multiplexed encoded data.
[Explanation of symbols]
1. Data processing unit
2. Recording media
3. Recording media controller
4 ... Encoded data management unit
5 ... Memory controller
5a ... Video recording information management section
5b ... Audio recording information management unit
5c: Multiplexed record information management unit
6 ... Memory
7: First header detector
8 ... Video header detector
9 ... Audio header detector
10: Determination circuit
11 ... Memory controller
12, 13, 14 ... memory
32 memory controller
33, 34 memory
21 ... Next sequence address
22 ... Area table address
23 ... Next area address
24: Start sector number
25 ... End sector number
26: Start frame address
27: Next frame address
28: Pack start sector number
29 ... Key start sector number
30 ... Key end sector number
31. Flag indicating whether I frame or P frame
41 ... Next frame address of I
42 ... Pack start sector number
43 ... I key start sector number
44 ... I key end sector number
45 ... P frame address
46 ... Next frame address of P
47 ... P key start sector number
48 ... P key end sector number
51 to 53: Flag indicating whether used or not used

Claims

A moving image for reproducing at least one of video and audio from a recording medium in which multiplexed encoded data including encoded video data and encoded audio data is recorded separately from management data corresponding to the multiplexed encoded data A playback device,
The encoded video data includes encoded data of a key frame composed of an intra-frame encoded frame or a forward predictive encoded frame,
The management data includes position information indicating a position on the multiple encoded data with respect to the key frame,
Means for controlling the recording medium and reading the management data from the recording medium;
Means for holding management data read from the recording medium;
Means for enabling random access based on the location information included in the management data ;
The means for holding the management data has a memory including a first memory area, a second memory area, and a third memory area in which the management data is recorded,
The management data recorded in the first memory area indicates the reproduction order of a sequence composed of multiple encoded data,
The management data recorded in the second memory area indicates a recording area of a recording medium on which multiplexed encoded data is recorded,
The moving picture reproducing apparatus according to claim 3, wherein the management data recorded in the third memory area indicates a position on the multiplexed encoded data with respect to the key frame .

The moving image reproducing apparatus according to claim 1, wherein
The position information included in the management data includes video position information indicating a position of video encoded data of a key frame, audio position information indicating a position of audio encoded data corresponding to the key frame, and video and audio from the key frame. A moving picture reproducing apparatus comprising at least one piece of multiplexed position information indicating a position of multiplexed encoded data that needs to be read out in order to reproduce the video in synchronization with each other.

The moving image reproducing apparatus according to claim 2, wherein
The video position information includes a head position or a rear end position of video encoded data of a key frame,
The audio position information includes a head position or a rear end position of audio encoded data corresponding to a key frame,
The moving picture reproduction apparatus characterized in that the multiplexed position information includes a head position or a rear end position of multiplexed encoded data that needs to be read out in order to reproduce video and audio in synchronization from a key frame.

In the moving image reproducing device according to claim 2 or 3,
When playing back video and audio in synchronization, use the multiplexed position information,
A moving picture reproducing apparatus using the video position information when reproducing only a video.

In the moving image reproducing device according to any one of claims 1 to 4,
A moving picture reproducing apparatus for reading out multiplexed encoded data from the recording medium and constructing management data.