JP3603868B2

JP3603868B2 - Audio signal encoding method, audio signal transmission / reception method, recording medium for computer program, and audio signal decoding method

Info

Publication number: JP3603868B2
Application number: JP2002022905A
Authority: JP
Inventors: 美昭田中; 昭治植野; 徳彦渕上
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 1997-11-28
Filing date: 2002-01-31
Publication date: 2004-12-22
Anticipated expiration: 2018-08-12
Also published as: JP2002304850A

Description

【０００１】
【発明の属する技術分野】
本発明は、オーディオ信号のエンコード方法、オーディオ信号送信受信方法、コンピュータプログラムの記録媒体、及びオーディオ信号のデコード方法に関する。
【０００２】
【従来の技術】
従来のオーディオ再生用光ディスクとしてはＣＤ（コンパクトディスク）が知られている。また、ＣＤより高密度な光ディスクとしてＤＶＤ（デジタルビデオディスク）が知られている。
【０００３】
【発明が解決しようとする課題】
しかしながら、ＤＶＤ（以下、ＤＶＤ−ビデオ）ではビデオ信号が主、オーディオ信号が従として記録されるので、次のような問題点がある。
（１）オーディオ信号がビデオ信号と一体化されており、オーディオ信号の記録容量が少ない。
（２）オーディオ信号の時間を管理することができない。
（３）曲名などの簡単な文字情報を取り出すことができない。
【０００４】
また、ビデオに比べて、オーディオのユーザは使い方の層が幅広いので、ＣＤのようにＴＯＣ（テーブルオブコンテンツ）の領域を設けることにより簡易な再生方法が求められる。しかしながら、ＤＶＤ−ビデオでは、ナビゲーションコントロールパック（ＣＯＮＴパック）と複数のビデオ（Ｖ）パック及びオーディオ（Ａ）パックによりビデオコンテンツブロックユニットを構成してＶ、Ａパックの再生などをＣＯＮＴパックにより制御するので、オーディオ信号を主として記録しようとしてもユーザにとって簡易に再生することができず、使い勝手が悪いという問題点がある。
【０００５】
また、ＤＶＤ−ビデオでは、時間管理をビデオフレーム単位でのみ行うので、オーディオ信号を主として記録しようとしても、ビデオに比べてオーディオ信号は連続性が重要であるので実時間の管理が困難であるという問題点がある。
【０００６】
そこで、本発明は、オーディオ信号を主として記録する場合にユーザにとって簡易に再生することができて使い勝手がよく、また、実時間の管理を簡単にすることができるオーディオ信号のエンコード方法、オーディオ信号送信受信方法、コンピュータプログラムの記録媒体、及びオーディオ信号のデコード方法を提供することを目的とする。
【０００７】
【課題を解決するための手段】
本発明は上記目的を達成するために、以下の１）〜４）に記載の手段よりなる。
すなわち、
【０００８】
１）マルチチャネルのアナログオーディオ信号の第１のグループのチャネルと第２のグループのチャネルとでそれぞれ同一または異なる第１及び第２のサンプリング周波数（ｆｓ１、ｆｓ２）によりグループ別にＡ／Ｄ変換してオーディオデータを生成するステップと、
複数のオーディオオブジェクト（ＡＯＢ）を含むオーディオタイトルセット（ＡＴＳ）と、スチルピクチャセット（ＳＰＳ）とを有し、
前記オーディオオブジェクト（ＡＯＢ）が、
実データとして前記オーディオデータを有する第１のパックのみにより構成された第１のタイプのオーディオ・セル（ＡＴＳ−Ｃ）から構成される第１のオーディオオブジェクト（ＡＯＢ）と、
前記第１のパック、及び実データとして前記オーディオデータに関するリアルタイムインフォメーションデータを有する第２のパックにより構成された第２のタイプのオーディオ・セル（ＡＴＳ−Ｃ）から構成される第２のオーディオオブジェクト（ＡＯＢ）の２種類で構成され、
さらに、前記スチルピクチャセット（ＳＰＳ）が前記オーディオデータに関するスチルピクチャデータを含む第３のパックを有し、
前記オーディオタイトルセット（ＡＴＳ）は前記オーディオ・セル（ＡＴＳ−Ｃ）のタイプを識別するタイプ識別データ（ＡＴＳ−Ｃ−ＴＹ）を含むと共に前記複数のオーディオオブジェクト（ＡＯＢ）が所定の再生順に再生されるための制御情報であってオーディオ・セル単位で再生制御するためのオーディオ・セル・プレイバック・インフォメーション（ＡＴＳ−Ｃ−ＰＢＩ）を有する制御情報（ＡＴＳ−ＰＧＣＩＴ）を含むオーディオタイトルセット・インフォメーション（ＡＴＳＩ）が配置され、このオーディオタイトルセット・インフォメーション（ＡＴＳＩ）は、前記スチルピクチャセット（ＳＰＳ）内のスチルピクチャデータをページ制御する制御情報であってシーケンシャル再生モード（シーケンシャル）とランダム再生モード（ランダム）とを識別するモードデータ（ディスプレイオーダーモード）を含むスチルピクチャ制御情報（ＳＰＣＩＴ）を有しており、
さらに、前記第１のパック内には前記オーディオデータが記録されるパケットを有し、前記パケットにはさらに前記オーディオデータの第１及び第２のサンプリング周波数（ｆｓ１、ｆｓ２）が記録されるプライベートヘッダを有するように構成される、
データ構造にフォーマット化するステップと、
からなるオーディオ信号のエンコード方法。
２）１）に記載のオーディオ信号のエンコード方法によりエンコードされて生成されたデータ構造のデータを送信または受信するオーディオ信号送信受信方法であって、
前記データを所定の通信フォーマットに変換して送信し、または前記所定の通信フォーマットに変換されたデータを受信して元のデータに復元するステップを有するオーディオ信号送信受信方法。
３）１）に記載のオーディオ信号のエンコード方法によりエンコードされて生成されたデータ構造が記録されたデータをデコードするコンピュータプログラムが記録されたコンピュータプログラムの記録媒体であって、
前記記録されたオーディオタイトルセット（ＡＴＳ）に配置されるオーディオタイトルセット・インフォメーション（ＡＴＳＩ）からスチルピクチャのモードデータ（ディスプレイオーダーモード）を含むスチルピクチャ制御情報（ＳＰＣＩＴ）をデコードしてモードデータ（ディスプレイオーダーモード）を取り出す第１のデコードのステップと、
前記記録されたオーディオタイトルセット（ＡＴＳ）に配置される第１のパック、及び第２のパック、さらにスチルピクチャセット（ＳＰＳ）に配置される第３のパックを識別する第２のデコードのステップと、
前記第２のデコードのステップから取り出されたそれぞれのパックからオーディオデータ、リアルタイムインフォメーションデータ、及びスチルピクチャデータをデコードする第３のデコードのステップと、
少なくとも前記第１のデコードのステップから取り出されたモードデータ（ディスプレイオーダーモード）に基づいて前記第２のデコードのステップで取り出されたスチルピクチャデータを、制御して単独で／または前記オーディオデータと共に出力するステップとを、有するコンピュータプログラムの記録媒体。
４）１）に記載のオーディオ信号のエンコード方法によりエンコードされて生成されたデータ構造が記録されたデータをデコードするオーディオ信号のデコード方法であって、
前記記録されたオーディオタイトルセット（ＡＴＳ）に配置されるオーディオタイトルセット・インフォメーション（ＡＴＳＩ）からスチルピクチャのモードデータ（ディスプレイオーダーモード）を含むスチルピクチャ制御情報（ＳＰＣＩＴ）をデコードしてモードデータ（ディスプレイオーダーモード）を取り出す第１のデコードのステップと、
前記記録されたオーディオタイトルセット（ＡＴＳ）に配置される第１のパック、及び第２のパック、さらにスチルピクチャセット（ＳＰＳ）に配置される第３のパックを識別する第２のデコードのステップと、
前記第２のデコードのステップから取り出されたそれぞれのパックからオーディオデータ、リアルタイムインフォメーションデータ、及びスチルピクチャデータをデコードする第３のデコードのステップと、
少なくとも前記第１のデコードのステップから取り出されたモードデータ（ディスプレイオーダーモード）に基づいて前記第２のデコードのステップで取り出されたスチルピクチャデータを、制御して単独で／または前記オーディオデータと共に出力するステップとを、有するオーディオ信号のデコード方法。
【０００９】
【発明の実施の形態】
以下、図面を参照して本発明の実施の形態を説明する。図１はＤＶＤ−ビデオのフォーマットと、本発明に適用されるＤＶＤ−オーディオのフォーマットの一実施形態を示す説明図、図２は図１のオーディオマネージャ（ＡＭＧ）のフォーマットを詳しく示す説明図、図３は図１のオーディオタイトルセット（ＡＴＳ）のフォーマットを詳しく示す説明図、図４は図２のオーディオマネージャインフォメーション（ＡＭＧＩ）のフォーマットを詳しく示す説明図、図５は図４のオーディオタイトルセット・アトリビュートテーブル（ＡＴＳ−ＡＴＲＴ）のフォーマットを詳しく示す説明図、図６は図５のオーディオタイトルセット・アトリビュートデータ（ＡＴＳ−ＡＴＲ）のフォーマットを詳しく示す説明図、図７は図３のオーディオタイトルセットインフォメーション（ＡＴＳＩ）のフォーマットを詳しく示す説明図、図８は図７のオーディオタイトルセットインフォメーション・マネージメントテーブル（ＡＴＳＩ−ＭＡＴ）のフォーマットを詳しく示す説明図、図９は図８のオーディオタイトルセットメニュー・オーディオストリーム・アトリビュートデータ（ＡＴＳＭ−ＡＳＴ−ＡＴＲ）を詳しく示す説明図、図１０は図８のオーディオタイトルセット・オーディオストリーム・アトリビュートテーブル（ＡＴＳ−ＡＳＴ−ＡＴＲＴ）のフォーマットを詳しく示す説明図、図１１は図１０の各オーディオストリームのアトリビュートデータ（ＡＴＳ−ＡＳＴ−ＡＴＲ）を詳しく示す説明図である。
【００１０】
また、図１２は図１のオーディオコンテンツブロックユニット（ＡＣＢＵ）を示す説明図、図１３は図１２のオーディオパックとビデオパックのフォーマットを詳しく示す説明図、図１４は図１２のオーディオコントロール（Ａ−ＣＯＮＴ）パックのフォーマットを詳しく示す説明図、図１５は図１４のオーディオキャラクタディスプレイ（ＡＣＤ）エリアのフォーマットを詳しく示す説明図、図１６は図１５のネームスペース情報により表示される例を示す説明図、図１７は図１４のオーディオサーチデータ（ＡＳＤ）エリアのフォーマットを詳しく示す説明図、図１８は図１のオーディオコンテンツブロックユニットの変形例を示す説明図である。
【００１１】
ここで、この説明のＤＶＤ−オーディオディスクには、ＣＤ世代からＤＶＤ−オーディオ世代に移行する際の過渡期に対応するように、オーディオ信号としてステレオ用２チャネルと５／６／８チャネルのマルチチャネルの両方の信号が記録される。また、この過渡期が経過したときには５／６／８チャネルのマルチチャネル信号のみが記録されるようになると考えられる。また、マルチチャネル信号のみが記録された場合であっても、再生時にはダウンミックスの係数によりマルチチャネル信号から２チャネル信号を生成することが可能である。この生成された２チャネル信号は簡易再生として位置づけられる。
【００１２】
図１（ａ）、（ｂ）はそれぞれＤＶＤ−ビデオ、ＤＶＤ−オーディオの各フォーマットを示し、ＤＶＤ−オーディオのフォーマットはエリアの名称が異なるがＤＶＤ−ビデオと互換性を有する。まず、大別してＤＶＤ−ビデオのフォーマットは先頭のビデオマネージャ（ＶＭＧ）と、それに続く複数のビデオタイトルセット（ＶＴＳ）の各エリアにより構成され、他方、ＤＶＤ−オーディオのフォーマットはこれに対応して図２に詳しく示すオーディオマネージャ（ＡＭＧ）と、図３に詳しく示すようにＡＭＧに続く複数のオーディオタイトルセット（ＡＴＳ）の各エリアにより構成されている。
【００１３】
ＶＴＳの各々は先頭のＶＴＳインフォメーション（ＶＴＳＩ）と、それに続く１以上のビデオコンテンツブロックセット（ＶＣＢＳ）と最後のＶＴＳＩにより構成され、他方、ＡＴＳの各々はこれに対応して先頭のＡＴＳインフォメーション（ＡＴＳＩ）と、それに続く１以上のオーディオコンテンツブロックセット（ＡＣＢＳ）と最後のＡＴＳＩにより構成されている。ＡＴＳＩには、ＡＣＢＳ内の各曲の演奏時間が実時間でセットされる。
本発明では、最初のＡＣＢＳにはメニュー画面を表示するためのメニュー情報が記録される。これはＤＶＤビデオと同様のものであり説明を省く。
【００１４】
ＶＣＢＳの各々は複数のＶＣＢにより構成され、他方、ＡＣＢＳの各々は複数のＡＣＢにより構成されている。ＶＣＢの各々はビデオの１タイトル（Ｔｉｔｌｅ）分であり、ＡＣＢの各々はこれに対応してオーディオの１タイトル分である。ＶＣＢの各々（１タイトル）は複数のチャプタ（Ｃｈａｐｔｅｒ）により構成され、他方、ＡＣＢの各々（１タイトル）はこれに対応して複数のトラック（Ｔｒａｃｋ）により構成されている。チャプタはパートオブタイトル（ＰＴＴ）を含み、トラックはパートオブタイトル（ＰＴＴ）を含む。
【００１５】
チャプタの各々は複数のセル（ＣＥＬＬ）により構成され、他方、トラックの各々はこれに対応して複数のインデックス（Ｉｎｄｅｘ）により構成されている。セルの各々は複数のＶＣＢユニット（ＶＣＢＵ）により構成され、他方、インデックスの各々はこれに対応して複数のＡＣＢユニット（ＡＣＢＵ）により構成されている。ＶＣＢユニットとＡＣＢユニットの各々は、複数のパックにより構成され、１パックは２０４８バイトで構成されている。
【００１６】
ＶＣＢユニットの各々は、先頭のコントロールパック（以下、ＣＯＮＴパック）と、それに続くオーディオ（Ａ）パック、複数のビデオ（Ｖ）パック、及びサブピクチャ（ＳＰ）パックにより構成され、他方、ＡＣＢユニットの各々は、これに対応して先頭のオーディオコントロールパック（以下、Ａ−ＣＯＮＴパック）と、それに続く複数のＡパックとＶパックにより構成されている。
【００１７】
ＣＯＮＴパックには後続のＶパックを制御する情報が配置され、Ａ−ＣＯＮＴパックにはＣＤのＴＯＣ情報のように後続のＡパックのオーディオ信号を管理するための情報が配置される。Ａパックにはオーディオデータが配置され、Ｖパックにはビデオデータの他、オーディオデータ以外の例えばクローズドキャプション（ＣＣ）データが配置される。
【００１８】
ＡＭＧ（オーディオマネージャ）は図２に示すように、
・図４に詳しく示すオーディオマネージャインフォメーション（ＡＭＧＩ）と、
・ＡＭＧメニュー用のオーディオコンテンツブロックセット（ＡＭＧＭ−ＡＣＢＳ）と、
・バックアップ用のＡＭＧＩ
を有する。ＡＭＧＭ−ＡＣＢＳはコントロール情報として
・プレゼンテーションコントロールインフォメーション（ＰＣＩ）と、
・データサーチインフォメーション（ＤＳＩ）
を有する。
【００１９】
ＡＴＳ（オーディオタイトルセット）は図３に示すように、
・図７に詳しく示すオーディオタイトルセットインフォメーション（ＡＴＳＩ）と、
・ＡＴＳメニュー用のオーディオコンテンツブロックセット（ＡＴＳＭ−ＡＣＢＳ）と、
・ＡＴＳタイトル用のオーディオコンテンツブロックセット（ＡＴＳＡ−ＡＣＢＳ）と、
・バックアップ用のＡＴＳＩ
を有する。ＡＴＳＭ−ＡＣＢＳとＡＴＳＡ−ＡＣＢＳは共に、前述（図２）したＰＣＩとＤＳＩを有する。
【００２０】
ＡＭＧＩ（オーディオマネージャインフォメーション）は図４に詳しく示すように、
・ＡＭＧＩのマネージメントテーブル（ＡＭＧＩ−ＭＡＴ）と、
・タイトルのサーチポインタテーブル（Ｔ−ＳＲＰＴ）と、
・オーディオマネージャメニューＰＧＣＩユニットテーブル（ＡＭＧＭ−ＰＧＣＩ−ＵＴ）と、
・ペアレンタルマネージメントインフォメーションテーブル（ＰＴＬ−ＭＡＩＴ）と、
・図５に詳しくオーディオタイトルセット・アトリビュートテーブル（ＡＴＳ−ＡＴＲＴ）と、
・テキストデータマネージャ（ＴＸＴＤＴ−ＭＧ）と、
・オーディオマネージャメニューセル（インデックス）アドレステーブル（ＡＭＧＭ−Ｃ−ＡＤＴ）と、
・オーディオマネージャメニュー・オーディオコンテンツブロックユニット・アドレスレスマップ（ＡＭＧＭ−ＡＣＢＵ−ＡＤＭＡＰ）
を有する。
【００２１】
ＡＴＳ−ＡＴＲＴ（オーディオタイトルセット・アトリビュートテーブル）は図５に詳しく示すように、
・オーディオタイトルセットアトリビュートテーブルインフォメーション（ＡＴＳ−ＡＴＲＴＩ）と、
・複数（ｎ）個のＡＴＳの各々のオーディオタイトルセットアトリビュートサーチポインタ（ＡＴＳ−ＡＴＲ−ＳＲＰ＃１〜＃ｎ）と、
・図６に詳しく示すような複数（ｎ）個のＡＴＳの各々のオーディオタイトルセット・アトリビュートデータ（ＡＴＳ−ＡＴＲ−＃１〜＃ｎ）
を有する。
【００２２】
オーディオタイトルセット・アトリビュートデータ（ＡＴＳ−ＡＴＲ−＃１〜＃ｎ）の各々は、図６に詳しく示すように
・ＡＴＳ−ＡＴＲ−ＥＡ（エンドアドレス）と、
・ＡＴＳ−ＣＡＴ（カテゴリ）と、
・ＡＴＳ−ＡＴＲＩ（インフォメーション）
を有する。
【００２３】
図３に示すＡＴＳＩ（ＡＴＳインフォメーション）は図７に詳しく示すように、
・図８に詳しく示すオーディオタイトルセットインフォメーション・マネージメントテーブル（ＡＴＳＩ−ＭＡＴ）と、
・オーディオタイトルセット・パートオブタイトル・サーチポインタテーブル（ＡＴＳ−ＰＴＴ−ＳＲＰＴ）と、
・オーディオタイトルセット・プログラムチェーンインフォメーションテーブル（ＡＴＳ−ＰＧＣＩＴ）と、
・オーディオタイトルセットメニュー・ＰＧＣＩ・ユニットテーブル（ＡＴＳＭ−ＰＧＣＩ−ＵＴ）と、
・オーディオタイトルセット・タイムマップテーブル（ＡＴＳ−ＴＭＡＰＴ）と、
・オーディオタイトルセットメニュー・セル・アドレステーブル（ＡＴＳＭ−Ｃ−ＡＤＴ）と、
・オーディオタイトルセットメニュー・オーディオコンテンツブロックユニット・アドレスマップ（ＡＴＳＭ−ＡＣＢＵ−ＡＤＭＡＰ）と、
・オーディオタイトルセット・セル・アドレステーブル（ＡＴＳ−Ｃ−ＡＤＴ）と、
・オーディオタイトルセット・オーディオコンテンツブロックユニット・アドレスマップ（ＡＴＳ−ＡＣＢＵ−ＡＤＭＡＰ）
を有する。
【００２４】
図７に示すＡＴＳＩ−ＭＡＴ（オーディオタイトルセットインフォメーション・マネージメントテーブル）は図８に詳しく示すように、
・ＡＴＳ−ＩＤ（識別子）と、
・ＡＴＳ−ＥＡ（エンドアドレス）と、
・ＡＴＳＩ−ＥＡと、
・ＶＥＲＮ（ＤＶＤオーディオスペックのバージョン番号）と、
・ＡＴＳ−ＣＡＴ（カテゴリ）と、
・ＡＴＳＩ−ＭＡＴ−ＥＡと、
・ＡＴＳＭ−ＡＣＢＳ−ＳＡ（スタートアドレス）と、
・ＡＴＳＡ−ＡＣＢＳ−ＳＡと、
・ＡＴＳ−ＰＴＴ−ＳＲＰＴ−ＳＡと、
・ＡＴＳ−ＰＧＣＩＴ−ＳＡと、
・ＡＴＳＭ−ＰＧＣＩ−ＵＴ−ＳＡと、
・ＡＴＳ−ＴＭＡＰ−ＳＡと、
・ＡＴＳＭ−Ｃ−ＡＤＴ−ＳＡと、
・ＡＴＳＭ−ＡＣＢＵ−ＡＤＭＡＰ−ＳＡと、
・図９に詳しく示すようなＡＴＳＭ−ＡＳＴ−ＡＴＲ（ＡＴＳＭのオーディオストリーム・アトリビュート）と、
・ＡＴＳ−ＡＳＴ−Ｎｓ（ＡＴＳのオーディオストリームの数）と、
・図１０に詳しく示すようなＡＴＳ−ＡＳＴ−ＡＴＲＴ（ＡＴＳのオーディオストリーム・アトリビュートテーブル）
を有する。
【００２５】
ＡＴＳＭ−ＡＳＴ−ＡＴＲは図９に詳しく示すように８バイト（ビットｂ６３〜ｂ０）により構成され、このディスクに記録されている符号化オーディオ信号の属性として次のようなデータ（１）〜（４）が配置される（他のビットは保留）。
【００２６】
（１）オーディオ符号化モード（３ビットｂ６３〜ｂ６１）
０００ｂ：ドルビーＡＣ−３
０１０ｂ：ＭＰＥＧ−１又はＭＰＥＧ−２（拡張ビットストリーム無し）
０１１ｂ：ＭＰＥＧ−２（拡張ビットストリーム有り）
１００ｂ：リニヤＰＣＭオーディオ
１０１ｂ：リニヤＰＣＭオーディオ（２ｃｈ＋５ｃｈ、２ｃｈ＋６ｃｈ、２ｃｈ＋８ｃｈを含む。）
【００２７】
（２）量子化／ＤＲＣ（ダイナミックレンジコントロール）情報（２ビットｂ５５、ｂ５４）
・オーディオ符号化モードが「０００ｂ」の場合には「１１ｂ」
・オーディオ符号化モードが「０１０ｂ」又は「０１１ｂ」の場合、
００ｂ：ＭＰＥＧオーディオストリーム内にダイナミックレンジコントロールデータが存在しない
０１ｂ：ＭＰＥＧオーディオストリーム内にダイナミックレンジコントロールデータが存在する
１０ｂ，１１ｂ：保留
・オーディオ符号化モードが「１００ｂ」、「１０１ｂ」の場合、ステレオ２ｃｈに対して
００ｂ：１６ビット
０１ｂ：２０ビット
１０ｂ：２４ビット
１１ｂ：保留
【００２８】
（３）サンプリング周波数ｆｓ（２ビットｂ５３、ｂ５２）
ステレオ２ｃｈに対して
００ｂ：４８ｋＨｚ
０１ｂ：９６ｋＨｚ
１０ｂ：１９２ｋＨｚ
（４）オーディオチャネル数（３ビットｂ５０〜ｂ４８）
０００ｂ：１ｃｈ（モノラル）
００１ｂ：２ｃｈ（ステレオ）
０１０ｂ：３ｃｈ
０１１ｂ：４ｃｈ
１００ｂ：（ステレオ２ｃｈ＋５ｃｈ）
１０１ｂ：（ステレオ２ｃｈ＋６ｃｈ）
１１０ｂ：７ｃｈ
１１１ｂ：（ステレオ２ｃｈ＋８ｃｈ）
【００２９】
図１０に示すＡＴＳ−ＡＳＴ−ＡＴＲＴ（ＡＴＳのオーディオストリーム・アトリビュートテーブル）は図１１に詳しく示すように、オーディオストリーム＃０〜＃７毎のＡＴＳ−ＡＳＴ−ＡＴＲを有し、ＡＴＳ−ＡＳＴ−ＡＴＲの各々は８バイトで構成されている（合計６４バイト）。
【００３０】
１つのオーディオストリームのＡＴＳ−ＡＳＴ−ＡＴＲは図１１に示すように、図９に示すオーディオタイトルセットメニュー・オーディオストリーム・アトリビュートデータ（ＡＴＳＭ−ＡＳＴ−ＡＴＲ）と同様な８バイト（ビットｂ６３〜ｂ０）で構成され、上記属性データ（１）〜（４）の他に、
（５）マルチチャネル・イクステンション（１ビットｂ６０）と、
（６）オーディオタイプ（２ビットｂ５９、ｂ５８）と、
（７）オーディオアプリケーションモード（２ビットｂ５７、ｂ５６）と、
（８）そのストリーム（ＡＳＴ）の間引き情報（２ビットｂ４７、ｂ４６）と、
（９）ＬＦＥ（ＬｏｗＦｒｅｑｕｅｎｃｙＥｆｆｅｃｔ）１ｃｈのみの間引き情報（２ビットｂ４５、ｂ４４）
の各データを有する。そして、このＤＶＤオーディオディスクの（７）オーディオアプリケーションモードには、
１１ｂ：２ｃｈ＋サラウンドモード
が記録され、また、（８）そのストリームの間引き情報と、（９）ＬＦＥ１ｃｈのみの間引き情報には共に、帯域情報として
００ｂ：フル（１／１）
０１ｂ：ハーフ（１／２）
１０ｂ：クオータ（１／４）
が記録される。
【００３１】
ただし、このＡＴＳＭ−ＡＳＴ−ＡＴＲにおける（４）オーディオチャネル数は、オーディオストリーム＃０では必ず２ｃｈとなり、また、オーディオストリーム＃１はフロントの３ｃｈを含む。すなわち、例えば１つのタイトルのオーディオ信号を２＋６ｃｈで記録する場合、２ｃｈのステレオ信号をオーディオストリーム＃０に割り当て、６ｃｈの内、３ｃｈのフロント信号をオーディオストリーム＃１に割り当て、２ｃｈのリヤ信号とＬＦＥ１ｃｈ信号をオーディオストリーム＃２に割り当てる。そして、図４に示すオーディオマネージャインフォメーション・マネージメントテーブル（ＡＭＧＩ−ＭＡＴ）と図８に示すオーディオタイトルセットインフォメーション・マネージメントテーブル（ＡＴＳＩ−ＭＡＴ）には共に、ストリーム＃０〜＃２の利用データとして「３」が記録される。
【００３２】
また、この２＋６ｃｈのアナログオーディオ信号を例えば次のようなサンプリング周波数ｆｓでサンプリングし、次のような量子化ビット数で量子化して記録する場合、
ステレオ２ｃｈ：４８ｋＨｚ、２０ビット
フロント３ｃｈ：９６ｋＨｚ、１６ビット
リヤ２ｃｈ、ＬＦＥ１ｃｈ：４８ｋＨｚ、１６ビット（間引きなし）
図９に示すオーディオタイトルセットメニュー・オーディオストリーム・アトリビュートデータ（ＡＴＳＭ−ＡＳＴ−ＡＴＲ）にはステレオ２ｃｈの属性として
（１）オーディオ符号化モード
１０１ｂ：リニヤＰＣＭオーディオ（２ｃｈ＋５ｃｈ、２ｃｈ＋６ｃｈ、２ｃｈ＋８ｃｈを含む。）
（２）量子化／ＤＲＣ
０１ｂ：２０ビット
（３）サンプリング周波数ｆｓ
００ｂ：４８ｋＨｚ
（４）オーディオチャネル数
１０１ｂ：（ステレオ２ｃｈ＋６ｃｈ）
が記録される。
【００３３】
また、オーディオストリーム＃０のＡＴＳ−ＡＳＴ−ＡＴＲには
（１）オーディオ符号化モード
１０１ｂ：リニヤＰＣＭオーディオ（２ｃｈ＋５ｃｈ、２ｃｈ＋６ｃｈ、２ｃｈ＋８ｃｈを含む。）
（２）量子化／ＤＲＣ
０１ｂ：２０ビット
（３）サンプリング周波数ｆｓ
００ｂ：４８ｋＨｚ
（４）オーディオチャネル数
００１ｂ：２ｃｈ（ステレオ）
（７）オーディオアプリケーションモード
１１ｂ：２ｃｈ＋サラウンドモード
（８）そのストリームの間引き情報
００ｂ：フル（１／１）
（９）ＬＦＥ１ｃｈのみ間引き情報
００ｂ：フル（１／１）
が記録される。
【００３４】
また、オーディオストリーム＃１のＡＴＳ−ＡＳＴ−ＡＴＲには
（１）オーディオ符号化モード
１０１ｂ：リニヤＰＣＭオーディオ（２ｃｈ＋５ｃｈ、２ｃｈ＋６ｃｈ、２ｃｈ＋８ｃｈを含む。）
（２）量子化／ＤＲＣ
００ｂ：１６ビット
（３）サンプリング周波数ｆｓ
０１ｂ：９６ｋＨｚ
（４）オーディオチャネル数
０１０ｂ：３ｃｈ
（７）オーディオアプリケーションモード
１１ｂ：２ｃｈ＋サラウンドモード
（８）そのストリームの間引き情報
００ｂ：フル（１／１）
（９）ＬＦＥ１ｃｈのみ間引き情報
００ｂ：フル（１／１）
が記録される。
【００３５】
また、オーディオストリーム＃２のＡＴＳ−ＡＳＴ−ＡＴＲには
（１）オーディオ符号化モード
１０１ｂ：リニヤＰＣＭオーディオ（２ｃｈ＋５ｃｈ、２ｃｈ＋６ｃｈ、２ｃｈ＋８ｃｈを含む。）
（２）量子化／ＤＲＣ
００ｂ：１６ビット
（３）サンプリング周波数ｆｓ
００ｂ：４８ｋＨｚ
（４）オーディオチャネル数
０１０ｂ：３ｃｈ
（７）オーディオアプリケーションモード
１１ｂ：２ｃｈ＋サラウンドモード
（８）そのストリームの間引き情報
００ｂ：フル（１／１）
（９）ＬＦＥ１ｃｈのみ間引き情報
００ｂ：フル（１／１）
が記録される。
【００３６】
次に、オーディオストリームが記録されるＡパックとその制御パックについて説明する。図１２に示すようにＶＣＢユニットは０．４〜１．０秒分の任意の数のパックにより構成され、ＡＣＢユニットは０．５〜１．０秒分の任意の数のパックにより構成されている。また、ＤＶＤ−オーディオのＡＣＢユニットにおけるＡ−ＣＯＮＴパックは、ＤＶＤ−ビデオのＶＣＢユニットにおける第３パックに配置される。
【００３７】
Ａ−ＣＯＮＴパックは基本的にオーディオ時間の０．５秒単位に配置され、インデックスの切れ目では０．５〜１．０秒の範囲で完結するように配置される。また、オーディオの時間（ＧＯＦ：ＧｒｏｕｐｏｆＡｕｄｉｏＦｒａｍｅ単位）はＡ−ＣＯＮＴパックにより示され、そのデータ位置はオーディオフレームナンバと、ファーストアクセスユニットポインタとフレームヘッダの数により決まる。また、Ａ−ＣＯＮＴパック直前のＡパックは、オーディオ時間の０．５秒単位でパディングすることを強制しない。
【００３８】
隣接するＡパックは、オーディオ信号がお互いに関連するように配置され、例えばステレオの場合にはＬチャネルパックとＲチャネルパックが隣接して配置され、また、５／６／８チャネルのマルチチャネルの場合にも同様に隣接して配置される。Ｖパックはオーディオ信号の再生時に映像を表示する場合にそのＡパックに隣接して配置される。ＡパックとＶパックは、図１３に示すように２０３４バイトのユーザデータ（Ａデータ、Ｖデータ）に対して４バイトのパックスタート情報と、６バイトのＳＣＲ（ＳｙｓｔｅｍＣｌｏｃｋＲｅｆｅｒｅｎｃｅ：システム時刻基準参照値）情報と、３バイトのＭｕｘレート（ｒａｔｅ）情報と１バイトのスタッフィングの合計１４バイトのパックヘッダが付加されて構成されている（１パック＝合計２０４８バイト）。この場合、タイムスタンプであるＳＣＲ情報を、ＡＣＢユニット内の先頭パックでは「１」として同一タイトル内で連続とすることにより同一タイトル内のＡパックの時間を管理することができる。
【００３９】
これに対し、Ａ−ＣＯＮＴパックは図１４に示すように、１４バイトのパックヘッダと、２４バイトのシステムヘッダと、１００３バイトのＡＣＤ（オーディオキャラクタディスプレイ）パケットと、１００７バイトのＡＳＤ（オーディオサーチデータ）パケットにより構成されている。また、ＡＣＤパケットは６バイトのパケットヘッダと、１バイトのサブストリームＩＤと、図１５に詳しく示すような６３６バイトのＡＣＤ（オーディオキャラクタディスプレイ）情報と、３６０バイトの保留エリアにより構成されている。ＡＳＤパケットは同じく６バイトのパケットヘッダ及び１バイトのサブストリームＩＤと、図１７に詳しく示すような１０００バイトのＡＳＤ（オーディオサーチデータ）により構成されている。
【００４０】
６３６バイトのＡＣＤ情報エリアは、図１５に詳しく示すように４８バイトのジェネラル情報エリアと、第１の言語の文字「１」及び第２の言語の文字「２」毎に２９４バイトのエリアを有し、この各エリアは９３バイトのネームスペースエリア、各々９３バイトの２つのフリースペースエリアと１５バイトのデータポインタエリアにより構成されている。第１の言語の文字「１」と第２の言語の文字「２」の一方のネームスペースエリアには例えば図１６に示すように楽曲名を日本語で表示するためのデータが配置され、他方のネームスペースエリアには英語で表示するためのデータが配置される。なお、この表示言語はディスク発行元が決定してよい。
【００４１】
４８バイトのジェネラル情報は、例えば１６バイトのサービスレベル情報と、１２バイトの言語コード情報と、６バイトの文字セットコード情報と、６バイトの表示アイテム情報と、２バイトの「前のＡＣＤ情報との相違」情報と、６バイトの保留情報により構成される。１６バイトのサービスレベル情報は、表示サイズ、表示の種類、オーディオ／ビデオ／ＳＰの区別、ストリームなどを示し、また、文字はマンダトリー（必須）、ビットマップはオプション（随意）である。１２バイトの言語コード情報はビデオファイルと同様に文字「１」「２」の言語をそれぞれ２バイトで示し、１ファイル中最大８言語分を示す。英語はマンダトリーである。
【００４２】
６バイトの文字セットコード情報は、言語コードに対応した文字コードを最大１５個持つことが可能であり、文字「１」「２」の言語の有無と種類を１バイトで示す。コード例を以下に示す。
１．ＩＳＯ６４６
２．ＩＳＯ８８５９−１
３．ＭＳ−ＪＩＳ
６バイトの表示アイテム情報は、図１５に示すフリースペース「１」「２」、データポインタの有無、ＩＤを示す。ネームスペースはマンダトリーであり、タイトルネーム、ミュージックネーム、アーティストネームは必ず記述する。
【００４３】
１０００バイトのＡＳＤ（オーディオサーチデータ）は、図１７に詳しく示すように１６バイトのジェネラル情報と、８バイトの現在の番号（Ｎｏ．）情報と、１６バイトの現在時刻情報と、８バイトのタイトルセットサーチ情報と、８バイトのタイトルサーチ情報と、４０４バイトのトラックサーチ情報と、４０８バイトのインデックスサーチ情報と、８０バイトのハイライトサーチ情報と、５２バイトの保留エリアにより構成されている。
【００４４】
８バイトの現在の番号情報は、タイトルセットの現在のタイトル番号（２バイト：ＢＣＤ）と、タイトルセットの現在のトラック番号（２バイト：ＢＣＤ）と、トラックの現在のインデックス番号（２バイト：ＢＣＤ）と保留領域（２バイト）により構成されている。１６バイトの現在時刻情報は、トラックのプレイバック時間（４バイト：ＢＣＤ）と、トラックの残りのプレイバック時間（４バイト：ＢＣＤ）と、タイトルの絶対時間（４バイト：ＢＣＤ）とタイトルの残りの絶対時間（４バイト：ＢＣＤ）により構成されている。
【００４５】
８バイトのタイトルセットサーチ情報は、タイトルセットの最初のセクタ番号（４バイト）と、タイトルセットの最後のセクタ番号（４バイト）により構成されている。８バイトのタイトルサーチ情報は、タイトルの最初のセクタ番号（４バイト）と、タイトルの最後のセクタ番号（４バイト）により構成されている。４０４バイトのトラックサーチ情報は、タイトルのトラック及びセクタ番号（４バイト×９９）と、タイトルの最初のトラック番号（４バイト）とタイトルの最後のトラック番号（４バイト）により構成されている。
【００４６】
４０８バイトのインデックスサーチ情報は、トラックのインデックス及びセクタ番号（４バイト×１００）と、トラックの最初のインデックス番号（４バイト）とトラックの最後のインデックス番号（４バイト）により構成されている。８０バイトのハイライトサーチ情報は、トラックのインセクタ番号（４バイト×１０）とトラックのアウトセクタ番号（４バイト×１０）により構成されている。
【００４７】
このようなフォーマットによれば、複数のＡパックの先頭に、ＣＤのＴＯＣ情報のように後続のＡパックのオーディオ信号を管理するためのＡ−ＣＯＮＴパックが配置されるので、オーディオデータはビデオデータなどとは一体化されず、記録容量を多くすることができる。また、Ａ−ＣＯＮＴパックによりオーディオ時間を管理することができ、また、Ａ−ＣＯＮＴパックによりオーディオデータに関する曲名などの簡単な文字情報を取り出すことができる。
【００４８】
また、Ａ−ＣＯＮＴパック内にタイトル、スタートアドレス、演奏時間などのＴＯＣ情報を配置するので、オーディオ再生中であってもユーザの操作に応じた情報をＡ−ＣＯＮＴパックから取り出して再生を開始することができる。また、オーディオマネージャインフォメーション（ＡＭＧＩ）とオーディオタイトルセットインフォメーション（ＡＴＳＩ）内にＴＯＣ情報を配置することにより、必要なＴＯＣ情報を再生装置内のメモリに記憶させて、ユーザの操作に応じた情報をメモリから即座に読み出して再生を開始することができる。また、ＤＶＤ−ビデオにおけるプログラムチェーンインフォメーション（ＰＧＣＩ）のような大きな容量の情報を記憶する必要がないので、ディスクを効率的に管理することができる。
【００４９】
さらに、
１．コンテンツ内に画像（Ｖ）データがない場合、
（１）タイトル、曲、インデックスの３階層に対するサーチ、ランダムアクセスが可能になる。
（２）ＧＯＦ（オーディオフレーム）単位の頭出し、タイムサーチ、ランダムアクセスが可能になる。
（３）タイトル、曲、インデックスの時間を実時間で管理することができる。
【００５０】
また、
２．コンテンツ内に画像（Ｖ）データがある場合、
オーディオデータに関しては、
上記（１）〜（３）の他に、
（４）タイトル、曲中の現在時間、残り時間を実時間で表示、管理することができる。
【００５１】
ビデオデータに関しては、
（１）タイトル、ＰＴＴ、セルの３階層に対するサーチ、ランダムアクセスが可能になる。
（２）ビデオフレーム単位の頭出し、タイムサーチ、ランダムアクセスが可能になる。
（３）タイトル、ＰＴＴ、セルの時間を実時間で管理することができる。
（４）ＰＴＴ又はタイトル中の現在時間、残り時間をビデオフレーム単位時間で表示、管理することができる。
【００５２】
なお、図１（ｂ）のＡＣＢＵは、Ａ−ＣＯＮＴパックとＣＯＮＴパックを含んでいるが、図１８に示すようにＶパックとＣＯＮＴパックは含まないように構成してもよい。この場合にはビデオ信号は記録されないが、オーディオ信号の記録容量が割り増しになる特徴があり、ディスクサイズを小型化することができ、また、再生機能を簡略化することができるのでポータブル用の再生装置に適するものを提供することができる。
【００５３】
図１９は第２の実施形態におけるオーディオマネージャインフォメーション（ＡＭＧＩ）のフォーマットを詳しく示す説明図、図２０は図１９のＴＯＣ情報を詳しく示す説明図、図２１は第３の実施形態におけるオーディオタイトルセットインフォメーション（ＡＴＳＩ）のフォーマットを詳しく示す説明図である。
次に、ＴＯＣ（ＴａｂｌｅＯｆＣｏｎｔｅｎｔｓ）情報を用いた第２の実施形態について説明する。図１９に示すように、ＡＭＧＩ（オーディオマネージャインフォメーション）の空きエリアに対して、図２０に詳しく示すようなＴＯＣを追加して記録し、再生装置はこのＴＯＣ情報にアクセスして曲の頭出しを行う。図２０は一例として、ＣＤのリードインエリアに記録されている一般的なＴＯＣ情報を示し、同じ情報が３回繰り返して記録されている。なお、本発明のＤＶＤオーディオディスク１に記録する場合にはこのように繰り返してもよく、また、繰り返さなくてもよい。
【００５４】
ここで、ＣＤにおいて用いられているＴＯＣ情報では、ポイント＝００〜９９のときにその数字で示される各楽章が始まる絶対時間が分（ＰＭＩＮ）、秒（ＰＳＥＣ）及びフレーム（ＰＦＲＡＭＥ）で表される。また、ポイント＝Ａ０のときにＰＭＩＮが最初の楽章を示し、ＰＳＥＣ＝ＰＦＲＡＭＥ＝０となる。ポイント＝Ａ１のときにはＰＭＩＮが最後の楽章を示し、ＰＳＥＣ＝ＰＦＲＡＭＥ＝０となる。ポイント＝Ａ２のときにはリードアウトエリアが始まる絶対時間が分（ＰＭＩＮ）、秒（ＰＳＥＣ）及びフレーム（ＰＦＲＡＭＥ）で表される。したがって、図２０に示すＴＯＣ情報はＤＶＤオーディオディスク１に対して６曲（又は６楽章）分が記録されていることを示している（ポイント＝０１〜０６）。なお、このＴＯＣ情報はＡＭＧＩの代わりに、図２１に示すようにＡＴＳＩ（オーディオタイトルセットインフォメーション）の空きエリアに記録するようにしてもよく、また、図１４に示すＡ−ＣＯＮＴパックのＡＣＤパケット内の保留エリア（３６０バイト分）に記録するようにしてもよい。
【００５５】
次に、第４の実施形態について説明する。図２２は本発明に適用されるＤＶＤオーディオディスクの第４の実施形態のフォーマットを示し、図２４ないし図２６に示すようなＶＴＳは含まず、ＡＴＳのみにより構成されている。そして、このＡＴＳ（ディレクトリ）は、ＳＡＭＧ（ＳｔｒｕｃｔｕｒｅｏｆＳｉｍｐｌｅＡｕｄｉｏＭａｎａｇｅｒ）と、図１（ｂ）に示すオーディオマネージャ（ＡＭＧ）と、ビデオ及びオーディオのオーディオマネージャメニュー（ＡＭＧＭ）と、ＡＭＧ内のＡＭＧＩにより管理されるＡＴＳ＜１＞及びＡＴＳ＜２＞により構成され、また、ＡＴＳ＜１＞及びＡＴＳ＜２＞は図２３に示すように、Ａ−ＣＯＮＴパックを含まず、ＡパックとＲＴＩパックにより構成されている。また、このＲＴＩパックはＡパックに対して多く配置されず、０．５秒毎に１パック程度が配置される。また、静止画パックが所定の位置に配置される。ＳＡＭＧはＡＴＳ＜１＞及びＡＴＳ＜２＞の頭出しのためのＳＡＰＰテーブル（ＴＯＣ）が繰り返し８回記述される領域である。この領域は１つの独立したファイルとして定義できる。
【００５６】
ここで、参考までに、図２４はＤＶＤ−Ｖａｎ（ビデオ＋オーディオナビゲーション）ディスクのフォーマットを示し、このフォーマットは概略的にはＤＶＤ−ビデオデータとしてビデオタイトルセット（ＶＴＳ）と、オーディオナビ（ナビゲーション）データとしてＡＮＶタイトルセット（ＡＮＶ−ＴＳ）により構成されている。また、詳しくは、ＶＴＳは図１（ａ）及び後述する図２５に示すＤＶＤビデオディスクと同じ構成であり、他方、ＡＮＶ−ＴＳは図１（ｂ）に示すオーディオマネージャ（ＡＭＧ）と、ＶＴＳ側のＶＴＳ＜１＞及びＶＴＳ＜２＞とそれぞれ対を成してＡＭＧ内のＡＭＧＩにより管理されるＡＴＳ＜１＞及びＡＴＳ＜２＞により構成されている。
また、ＤＶＤビデオディスクのフォーマットは図２５及び図１（ａ）に示すようにＡＴＳやＡＮＶ−ＴＳを含まず、ＶＴＳのみにより構成されている。
【００５７】
また、図２６はＤＶＤ−Ａｖｄ（オーディオ＋ＡＶデータ）ディスクのフォーマットを示し、このフォーマットは概略的にＤＶＤ−ビデオデータとしてビデオタイトルセット（ＶＴＳ）と、ＤＶＤ−オーディオデータとしてオーディオタイトルセット（ＡＴＳ）により構成されている。また、詳しくは、ＶＴＳは図１（ａ）に示すビデオマネージャ（ＶＭＧ）と、ビデオ及びオーディオのビデオマネージャメニュー（ＶＭＧＭ）と、ＶＭＧ内のＶＭＧＩにより管理されるＶＴＳ＜１＞により構成されている。
【００５８】
他方、ＡＴＳはＳＡＭＧと、図１（ｂ）に示すオーディオマネージャ（ＡＭＧ）と、ビデオ及びオーディオのオーディオマネージャメニュー（ＡＭＧＭ）と、ＶＴＳ側のＶＴＳ＜１＞内のオーディオデータと対を成し、かつＡＭＧ内のＡＭＧＩにより管理されるＡＴＳ＜１＞と、ＶＴＳ側とは対をなさず、同じくＡＭＧ内のＡＭＧＩにより管理されるＡＴＳ＜２＞により構成されている。また、このＡＴＳ＜２＞は図２３に示すように、Ａ−ＣＯＮＴパックを含まず、ＡパックとＲＴＩパックにより構成されている。
【００５９】
図２７は第４の実施形態のディスクのオーディオデータの内容を示す属性データとしてディスクに記録されるオーディオ・オンリ・タイトル・オーディオ・オブジェクト・アトリビュート（ＡＯＴＴ−ＡＯＢ−ＡＴＲ）を示している。この属性データは８バイト（６４ビットｂ６３〜ｂ０）により構成され、ＭＳＢ側から順に詳しく説明すると
・４ビット（ｂ６３〜ｂ６０）のオーディオ符号化モードと、
・１ビット（ｂ５９）のダウンミックス（Ｄ−Ｍ）モードと、
・３ビット（ｂ５８〜ｂ５６）のマルチチャネル構造と、
・４ビット（ｂ５５〜ｂ５２）のチャネルグループ１の量子化ビット数Ｑ１と、
・４ビット（ｂ５１〜ｂ４８）のチャネルグループ２の量子化ビット数Ｑ２と、
・４ビット（ｂ４７〜ｂ４４）のチャネルグループ１のサンプリング周波数ｆｓ１と、
・４ビット（ｂ４３〜ｂ４０）のチャネルグループ２のサンプリング周波数ｆｓ２と、
・３ビット（ｂ３９〜ｂ３７）の保留領域と、
・５ビット（ｂ３６〜ｂ３２）のチャネル割り当てと、
・残り３２ビット（ｂ３１〜ｂ０）の保留領域により構成されている。なお、残りの３２ビット（ｂ３１〜ｂ０）は各チャネルの属性データ用として用いられる。
【００６０】
上記データを以下に更に詳しく説明する。
（１）オーディオ符号化モード（ｂ６３〜ｂ６０）
００００ｂ：リニアＰＣＭモード
０００１ｂ：圧縮オーディオ（ドルビーデジタル）用に保留
００１０ｂ：圧縮オーディオ（ＭＰＥＧ２拡張無し）用に保留
００１１ｂ：圧縮オーディオ（ＭＰＥＧ２拡張有り）用に保留
０１００ｂ：圧縮オーディオ（ＤＴＳ）用に保留
０１０１ｂ：圧縮オーディオ（ＳＤＤＳ）用に保留
その他：その他の符号化モード用に保留
（２）ダウンミックスモード（ｂ５９）
０ｂ：ダウンミックスステレオ出力許可
１ｂ：ダウンミックスステレオ出力禁止
（３）マルチチャネル構造のタイプ（ｂ５８〜ｂ５６）
０００ｂ：タイプ１
その他：保留
【００６１】
（４）チャネルグループ１の量子化ビット数Ｑ（ｂ５５〜ｂ５２）
００００ｂ：１６ビット
０００１ｂ：２０ビット
００１０ｂ：２４ビット
その他：保留
（５）チャネルグループ２の量子化ビット数Ｑ（ｂ５１〜ｂ４８）
・チャネルグループ１の量子化ビット数Ｑが「００００ｂ」の場合には「００００ｂ」
・チャネルグループ１の量子化ビット数Ｑが「０００１ｂ」の場合には「００００ｂ」又は「０００１ｂ」
・チャネルグループ１の量子化ビット数Ｑが「００１０ｂ」の場合には「００００ｂ」、「０００１ｂ」又は「００１０ｂ」
ただし、００００ｂ：１６ビット
０００１ｂ：２０ビット
００１０ｂ：２４ビット
その他：保留
【００６２】
（６）チャネルグループ１のサンプリング周波数ｆｓ１（ｂ４７〜ｂ４４）
００００ｂ：４８ｋＨｚ
０００１ｂ：９６ｋＨｚ
００１０ｂ：１９２ｋＨｚ
１０００ｂ：４４．１ｋＨｚ
１００１ｂ：８８．２ｋＨｚ
１０１０ｂ：１７６．４ｋＨｚ
その他：保留
【００６３】
（７）チャネルグループ２のサンプリング周波数ｆｓ２（ｂ４３〜ｂ４０）
・チャネルグループ１のサンプリング周波数ｆｓ１が「００００ｂ」の場合には「００００ｂ」
・チャネルグループ１のサンプリング周波数ｆｓ１が「０００１ｂ」の場合には「００００ｂ」又は「０００１ｂ」
・チャネルグループ１のサンプリング周波数ｆｓ１が「００１０ｂ」の場合には「００００ｂ」、「０００１ｂ」又は「００１０ｂ」
・チャネルグループ１のサンプリング周波数ｆｓ１が「１０００ｂ」の場合には「１０００ｂ」
・チャネルグループ１のサンプリング周波数ｆｓ１が「１００１ｂ」の場合には「１０００ｂ」又は「１００１ｂ」
・チャネルグループ１のサンプリング周波数ｆｓ１が「１０１０ｂ」の場合には「１０００ｂ」、「１００１ｂ」又は「１０１０ｂ」
【００６４】
この第４の実施形態のディスクではリニアＰＣＭモードが使用される。リニアＰＣＭのプライベートヘッダは、図２８に示すように
・８ビットのサブストリームＩＤと、
・４ビットの保留領域と、
・４ビットのＩＳＲＣ番号と、
・８ビットのＩＳＲＣデータと、
・８ビットのプライベートヘッダ長と、
・１６ビットの第１アクセスユニットポインタと、
・１ビットのオーディオ・エンファシス・フラグＦ１と、
・１ビットのオーディオ・エンファシス・フラグＦ２などにより構成されている。
【００６５】
図２９、図３０はエンコード装置を示す。図２９は本発明に適用されるオーディオ信号のエンコード装置の一実施形態を示すブロック図、図３０は図２９の信号処理回路を詳細に示すブロック図である。
【００６６】
図２９においてアナログオーディオ信号ＡはＡ／Ｄコンバータ３１により十分高いサンプリング周波数（サンプリング周期Δｔ）、例えば１９２ｋＨｚでサンプリングされて、例えば２４ビットの高分解能のＰＣＭ信号に変換され、高分解能の曲線αに対応するデータ列
ｘｂ１，ｘ１，ｘａ１，ｘ２，ｘｂ２，ｘ３，ｘａ２，・・・，ｘｂｉ，ｘ２ｉ−１，ｘａｉ，ｘ２ｉ，・・・
に変換される。このデータ列（ｘｂｉ，ｘ２ｉ−１，ｘａｉ，ｘ２ｉ）は図３０に詳しく示す信号処理回路３２及びメモリ３３によりエンコードされ、次いでＤＶＤオーディオフォーマット化部３４に印加される。
【００６７】
図３０を参照して信号処理回路３２の構成を詳しく説明する。まず、１／２の帯域を通過させるローパスフィルタ３６、例えばＦＩＲフィルタにより、高分解能の曲線αに対応するデータ列（ｘｂｉ，ｘ２ｉ−１，ｘａｉ，ｘ２ｉ）から、帯域制限された低分解能の曲線βに対応するデータ列
ｘｃ１，＊，＊，＊，ｘｃ２，＊，＊，＊，ｘｃ３，＊，＊，＊，・・・，ｘｃｉ，＊，＊，＊，・・・
を得、次にこのデータ列の内、データ「＊」を間引き回路３７により間引くことによりデータ列
ｘｃ１，ｘｃ２，ｘｃ３，・・・，ｘｃｉ，・・・
を生成する。ここで、データ列ｘｃｉはＡ／Ｄコンバータ３１によりＡ／Ｄ変換されたデジタルデータを帯域制限してサンプリング周波数を１／４に低減したデータ列となっている。
【００６８】
また、データ列（ｘｂｉ，ｘ２ｉ−１，ｘａｉ，ｘ２ｉ）の内、データｘｉを間引き回路３８により間引くことによりデータ列
ｘｂ１，ｘａ１，ｘｂ２，ｘａ２，・・・，ｘｂｉ，ｘａｉ，・・・
を生成する。
【００６９】
そして、これらのデータ列ｘｃｉ、ｘｂｉ、ｘａｉに基づいて、差分計算器として作用する加算器３９により差分
ｘｂｉ−ｘｃｉ＝Δ１ｉ
ｘａｉ−ｘｃｉ＝Δ２ｉ
を演算する。ここで、差分データΔ１ｉ、Δ２ｉは、例えば２４ビット又はそれ以下であり、また、ビット数は固定でも可変でもよい。
【００７０】
アロケーション回路４０はデータ列ｘｃｉ及び差分データΔ１ｉ、Δ２ｉをユーザデータ（図１３参照）にパッキングし（１パケット＝２０３４バイト）、そのユーザデータをＤＶＤフォーマット化部３４に出力する。
【００７１】
また、ビデオ信号ＶはＡ／Ｄ変換器３１Ｖによりデジタル信号に変換され、次いでこのデジタルビデオ信号がＶエンコーダ３２ＶによりＭＰＥＧフォーマットにエンコードされ、次いで図１３に示すユーザデータにパッキングされてＤＶＤフォーマット化部３４に印加される。そしてＤＶＤフォーマット化部３４は、例えば図１〜図１８に示すようなフォーマットにパッキングする。このＤＶＤフォーマット化部３４によりフォーマット化されたデータは、変調回路３５によりディスクに応じた変調方式で変調され、この変調データに基づいてディスクが製造される。
【００７２】
次に図３１〜図９０を参照して第５の実施形態のＤＶＤ−オーディオディスクについて説明する。まず、図３１（Ａ）に示すようにこの第５の実施形態のデータ構造は、概略的にＳＡＭＧと、ＡＭＧ（オーディオマネージャ）と、ＳＰＳ（スチルピクチャセット）と複数のＡＴＳ（オーディオ・タイトルセット）を有する。なお、ＳＰＳ（スチルピクチャセット）は、サブピクチャ（ＳＰ）との混同を避けるために、以下の説明ではＡＳＶＳ（オーディオ・スチル・ビデオ・セット）とも言う。
【００７３】
ＡＴＳは、先頭から順に
・ＡＴＳＩ（ＡＴＳインフォメーション）と、
・図３２〜図３７に詳しく示すオーディオ・オンリ・タイトル用のオーディオ・オブジェクト・セット（ＡＯＴＴ−ＡＯＢＳ）と
・バックアップ用のＡＴＳＩにより構成されている。ＡＴＳＩは先頭から順に
・図３９〜図４４に詳しく示すＡＴＳＩ−ＭＡＴ（ＡＴＳＩマネージメント・テーブル）と
・図４５〜図５７に詳しく示すＡＴＳ−ＰＧＣＩＴ（ＡＴＳプログラム・チェーン・インフォメーション・テーブル）により構成されている。
【００７４】
ＡＯＴＴ−ＡＯＢＳは図３２に詳しく示すように、複数のオーディオ・オンリ・タイトル用のオーディオ・オブジェクト（ＡＯＴＴ−ＡＯＢ）により構成されている。ＡＯＴＴ−ＡＯＢの各々は複数のプログラム（ＰＧ）により構成され、プログラムの各々は複数のセル（ＡＴＳ−Ｃ）により構成されている。
【００７５】
ＡＯＴＴ−ＡＯＢは、図３２（１）に詳しく示すようにオーディオデータのみを含むものと、図３２（２）に詳しく示すようにオーディオデータ及びリアル・タイム・インフォメーション・データ（ＲＴＩデータ）を含むものの２種類のＡＯＴＴ−ＡＯＢにより構成されている。そして、１枚のディスク中や１曲中に１種類以上のＡＯＴＴ−ＡＯＢが配置される。
【００７６】
オーディオデータのみを含む第１のＡＯＴＴ−ＡＯＢの各プログラムは複数のオーディオセル（ＡＴＳ−Ｃ）により構成され、このオーディオセルは複数のオーディオパックのみにより構成されている。オーディオデータ及びＲＴＩデータを含む第２のＡＯＴＴ−ＡＯＢの各プログラムは複数のオーディオセル（ＡＴＳ−Ｃ）により構成され、このオーディオセルは２番目のパック位置に配置されたＲＴＩパックと、他のパック位置に配置されたオーディオパックにより構成されている。
【００７７】
リニアＰＣＭのＡパックは２０４８バイト以下で構成され、その内訳は図３３に示すように１４バイトのパックヘッダとＡパケットにより構成されている。Ａパケットは１７、９又は１４バイトのパケットヘッダと、図３４に詳しく示すプライベートヘッダと、１ないし２０１１バイトのオーディオＰＣＭデータにより構成されている。
【００７８】
プライベートヘッダは、図３４に示すように
・８ビットのサブストリームＩＤと、
・３ビットの保留領域と、
・５ビットのＵＰＣ／ＥＡＮ−ＩＳＲＣ（ユニバーサル・プロダクト・コード：ＵｎｉｖｅｒｓａｌＰｒｏｄｕｃｔＣｏｄｅ／ヨーロピアン・アーティクル・ナンバー・インターナショナル・スタンダード・レコーディング・コード：ＥｕｒｏｐｅａｎＡｒｔｉｃｌｅＮｕｍｂｅｒ−ＩｎｔｅｒｎａｔｉｏｎａｌＳｔａｎｄａｒｄＲｅｃｏｒｄｉｎｇＣｏｄｅ）番号と、
・８ビットのＵＰＣ／ＥＡＮ−ＩＳＲＣデータと、
・８ビットのプライベートヘッダ長と、
・１６ビットの第１アクセスユニットポインタと、
・８バイトのオーディオデータインフォメーション（ＡＤＩ）と
・０〜８バイトのスタッフィングバイト
により構成されている。
【００７９】
ＡＤＩは
・１ビットのオーディオ・エンファシス・フラグと、
・１ビットの保留領域と、
・１ビットのダウンミックスモードと、
・１ビットのダウンミックスコード有効性と、
・４ビットのダウンミックスコードと、
・４ビットのグループ「１」の量子化ワード長「１」と、
・４ビットのグループ「２」の量子化ワード長「２」と、
・４ビットのグループ「１」のオーディオ・サンプリング周波数ｆｓ１と、
・４ビットのグループ「２」のオーディオ・サンプリング周波数ｆｓ２と、
・４ビットの保留領域と、
・４ビットのマルチチャネルタイプと、
・３ビットのチャネルグループ「２」のビットシフトデータ（図３６参照）と
・５ビットのチャネル割り当て情報（図４２参照）と、
・８ビットのダイナミックレンジ制御情報
・８×２ビットの保留領域
により構成されている。
【００８０】
８ビット（ｂ７〜ｂ０）のＵＰＣ／ＥＡＮ−ＩＳＲＣデータエリアには、図３５に示すようにＵＰＣ／ＥＡＮ−ＩＳＲＣ番号に応じて異なるデータが配置される。すなわち、
（１）ＵＰＣ／ＥＡＮ−ＩＳＲＣ番号＝１の場合
上位２ビットｂ７、ｂ６：保留
下位６ビットｂ５〜ｂ０：カントリコード（ＩＳＲＣ＃１）
（２）ＵＰＣ／ＥＡＮ−ＩＳＲＣ番号＝２の場合
上位２ビットｂ７、ｂ６：保留
下位６ビットｂ５〜ｂ０：カントリコード（ＩＳＲＣ＃２）
（３）ＵＰＣ／ＥＡＮ−ＩＳＲＣ番号＝３の場合
上位２ビットｂ７、ｂ６：保留
下位６ビットｂ５〜ｂ０：コピーライトホルダコード（ＩＳＲＣ＃３）
（４）ＵＰＣ／ＥＡＮ−ＩＳＲＣ番号＝４の場合
上位２ビットｂ７、ｂ６：保留
下位６ビットｂ５〜ｂ０：コピーライトホルダコード（ＩＳＲＣ＃４）
（５）ＵＰＣ／ＥＡＮ−ＩＳＲＣ番号＝５の場合
上位２ビットｂ７、ｂ６：保留
下位６ビットｂ５〜ｂ０：コピーライトホルダコード（ＩＳＲＣ＃５）
（６）ＵＰＣ／ＥＡＮ−ＩＳＲＣ番号＝６の場合
上位４ビットｂ７〜ｂ４：保留
下位４ビットｂ３〜ｂ０：レコーディングイヤー（ＩＳＲＣ＃６）
（７）ＵＰＣ／ＥＡＮ−ＩＳＲＣ番号＝７の場合
上位４ビットｂ７〜ｂ４：保留
下位４ビットｂ３〜ｂ０：レコーディングイヤー（ＩＳＲＣ＃７）
【００８１】
Ａパック内の実データであるリニヤＰＣＭデータのエリアには、Ｓ／Ｎ比の向上とビット削減のためにグループ「２」の各チャネルのデータのビットが削減されて配置される。図３６（ａ）は一例として６チャネル（グループ「１」＝Ｃｈ１〜Ｃｈ３、グループ「２」＝Ｃｈ４〜Ｃｈ６）のＰＣＭデータを示し、レベル範囲がＭＡＸ＝０ｄＢ〜ＭＩＮ＝−１４４ｄＢ（２４ビット）であって、各チャネルＣｈの値が以下の通りである。
Ｌｍａｘ２＞Ｌｍａｘ１＝Ｌｍａｘ３＞Ｌｍａｘ４＞Ｌｍａｘ５＞Ｌｍａｘ６
そして、グループ「１」のＣｈ１〜Ｃｈ３のワード長はそのままにして、この例ではＣｈ２の値が最も大きいので、グループ「２」のＣｈ４〜Ｃｈ６の各レベルを（０−Ｌｍａｘ２）ｄＢだけアップシフトしてＬＳＢ側０〜４ビットを削減する。なお、図３６に示す例ではＣｈ４〜Ｃｈ６の各レベルが最大ビット数＝４だけアップシフトされて２０ビットに削減されたことを示している。
【００８２】
次に図３７を参照してＲＴＩパックの構成を詳しく説明する。このパックは１４バイトのパックヘッダとＲＴＩパケットにより構成され、ＲＴＩパケットは１７又は１４バイトのパケットヘッダと、プライベートヘッダと、１ないし２０１５バイトのＲＴＩデータにより構成されている。ＲＴＩデータはオーディオデータに関する文字情報や再生制御情報である。
【００８３】
ＲＴＩパケットのプライベートヘッダは、
・１バイトのサブストリームＩＤと、
・２バイトのＵＰＣ／ＥＡＮ−ＩＳＲＣ番号及びデータ（図ではこれらを単にＩＳＲＣと表記）と、
・１バイトのプライベートヘッダ長と、
・１バイトのＲＴＩ情報ＩＤと、
・０〜７バイトのスタッフィングバイト
により構成されている。
上記ＵＰＣ／ＥＡＮ−ＩＳＲＣ番号及びデータは、ＳＰＣＴパックに収められるスチルピクチャの著作権に関するＵＰＣ／ＥＡＮ−ＩＳＲＣ番号及びデータである。
【００８４】
ちなみに、図３１に示すスチルピクチャセット（オーディオ・スチル・ビデオ・セット）にはＳＰＣＴパックが配置され、このＳＰＣＴパックは図３８に詳しく示すように、１４バイトのパックヘッダとＳＰＣＴパケットにより構成され、ＳＰＣＴパケットは２２又は１９又は９バイトのパケットヘッダと２０２５バイト以下のＳＰＣＴデータにより構成されている。ここで、１枚の静止画はＭＰＥＧ１又はＭＰＥＧ２方式で圧縮されてＩピクチャとイントラ・コーデッド・ピクチャにより構成され、１つのピクチャセル内で分割されてＳＰＣＴパックのＳＰＣＴデータとして配置される。なお、ＳＰＣＴパックのパケットヘッダ内にも同様に、ＲＴＩパックで説明したようにスチルピクチャの著作権に関するＵＰＣ／ＥＡＮ−ＩＳＲＣ番号及びデータを含めてもよい。
【００８５】
図３１（Ａ）に示したＡＴＳＩ−ＭＡＴは、図３９に詳しく示すように２０４８バイト（リラティブ・バイト・ポジションＲＢＰ０〜２０４７）で構成され、先頭から順に
・１２バイト（ＲＢＰ０〜１１）のＡＴＳ識別子（ＡＴＳ−ＩＤ）と、
・４バイト（ＲＢＰ１２〜１５）のＡＴＳのエンドアドレス（ＡＴＳ−ＥＡ）と、
・１２バイト（ＲＢＰ１６〜２７）の保留領域と、
・４バイト（ＲＢＰ２８〜３１）のＡＴＳＩのエンドアドレス（ＡＴＳＩ−ＥＡ）と、
・２バイト（ＲＢＰ３２〜３３）のバージョン番号（ＶＥＲＮ）と、
・９４バイト（ＲＢＰ３４〜１２７）の保留領域と、
・４バイト（ＲＢＰ１２８〜１３１）のＡＴＳＩ−ＭＡＴのエンドアドレスと、
・６０バイト（ＲＢＰ１３２〜１９１）の保留領域と、
・４バイト（ＲＢＰ１９２〜１９５）のＡＯＴＴ用のＶＴＳのスタートアドレスと、
・４バイト（ＲＢＰ１９６〜１９９）のＡＯＴＴ用のＡＯＢＳのスタートアドレス又はＡＯＴＴ用のＶＯＢＳのスタートアドレスと、
・４バイト（ＲＢＰ２００〜２０３）の保留領域と、
・４バイト（ＲＢＰ２０４〜２０７）のＡＴＳ−ＰＧＣＩＴのスタートアドレスと、
・４８バイト（ＲＢＰ２０８〜２５５）の保留領域と、
・１２８（１６×８）バイト（ＲＢＰ２５６〜３８３）のＡＯＴＴ用のＡＯＢのアトリビュート（ＡＯＴＴ−ＡＯＢ−ＡＴＲ）又はＡＯＴＴ用のＶＯＢのオーディオストリームのアトリビュート（ＡＯＴＴ−ＶＯＢ−ＡＳＴ−ＡＴＲ）と、
・２８８（１８×８）バイト（ＲＢＰ３８４〜６６１）の、マルチチャネルオーディオデータを２チャネルにダウンミックスするための係数（ＡＴＳ−ＤＭ−ＣＯＥＦＴ＃０〜＃１５）と、
・３２バイト（ＲＢＰ６７２〜７０３）の保留領域と、
・２バイト（ＲＢＰ７０４〜７０５）の、ＡＯＴＴ用のＡＯＢＳにおけるスチルピクチャデータのアトリビュート（ＡＴＳ−ＳＰＣＴ−ＡＴＲ）と、
・１３４２バイト（ＲＢＰ７０６〜２０４７）の保留領域により構成されている。
【００８６】
１２８（１６×８）バイト（ＲＢＰ２５６〜３８３）のエリアには、このＡＴＳがＡＯＴＴ用のＡＯＢＳを有する場合には、図４０に詳しく示すＡＯＴＴ−ＡＯＢ−ＡＴＲが記述される。このＡＯＴＴ−ＡＯＢ−ＡＴＲ（ｂ１２７〜ｂ０）は、ＭＳＢ側から順に
・８ビット（ｂ１２７〜ｂ１２０）のオーディオ符号化モードと、
・８ビット（ｂ１１９〜ｂ１１２）の保留領域と、
・４ビット（ｂ１１１〜ｂ１０８）のチャネルグループ「１」の量子化ビット数Ｑ１と、
・４ビット（ｂ１０７〜ｂ１０４）のチャネルグループ「２」の量子化ビット数Ｑ２と、
・４ビット（ｂ１０３〜ｂ１００）のチャネルグループ「１」のサンプリング周波数ｆｓ１と、
・４ビット（ｂ９９〜ｂ９６）のチャネルグループ「２」のサンプリング周波数ｆｓ２と、
・３ビット（ｂ９５〜ｂ９３）のマルチチャネル構造のタイプと、
・５ビット（ｂ９２〜ｂ８８）のチャネル割り当てと、
・８ビット×１１（ｂ８７〜ｂ０）の保留領域により構成されている。
【００８７】
これに対し、このＡＴＳがＡＯＴＴ用のＡＯＢＳを有しない場合には、図４１に示すＡＯＴＴ−ＶＯＢ−ＡＳＴ−ＡＴＲが記述される。このＡＯＴＴ−ＶＯＢ−ＡＳＴ−ＡＴＲ（ｂ１２７〜ｂ０）は、ＭＳＢ側から順に
・８ビット（ｂ１２７〜ｂ１２０）のオーディオ符号化モードと、
・８ビット（ｂ１１９〜ｂ１１２）の保留領域と、
・４ビット（ｂ１１１〜ｂ１０８）の量子化ビット数Ｑと、
・４ビット（ｂ１０７〜ｂ１０４）の保留領域と、
・４ビット（ｂ１０３〜ｂ１００）のサンプリング周波数ｆｓと、
・４ビット（ｂ９９〜ｂ９６）の保留領域と、
・３ビット（ｂ９５〜ｂ９３）のマルチチャネル構造のタイプと、
・５ビット（ｂ９２〜ｂ８８）のチャネル割り当てと、
・３ビット（ｂ８７〜ｂ８５）のデコーディング・オーディオ・ストリーム数と、
・５ビット（ｂ８４〜ｂ８０）の保留領域と、
・２ビット（ｂ７９、ｂ７８）のＭＰＥＧオーディオ用ＤＲＣと、
・２ビット（ｂ７７、ｂ７６）の保留領域と、
・４ビット（ｂ７５〜ｂ７２）の圧縮オーディオチャネル数と、
・８ビット×９（ｂ７１〜ｂ０）の保留領域により構成されている。
【００８８】
上記データを以下に詳しく示す。ただし、量子化ビット数、サンプリング周波数、マルチチャネルタイプは図２７と同じであるので説明を省略する。
（１）オーディオ符号化モード（ｂ１２７〜ｂ１２０）
００００００００ｂ：リニアＰＣＭモード
０００００００１ｂ：圧縮オーディオ（ドルビーデジタル）用に保留
００００００１０ｂ：圧縮オーディオ（ＭＰＥＧ２拡張無し）用に保留
００００００１１ｂ：圧縮オーディオ（ＭＰＥＧ２拡張有り）用に保留
０００００１００ｂ：圧縮オーディオ（ＤＴＳ）用に保留
０００００１０１ｂ：圧縮オーディオ（ＳＤＤＳ）用に保留
その他：その他の符号化モード用に保留
【００８９】
（８）チャネル割り当て（ｂ９２〜ｂ８８）
図４２は１チャネル（モノラル）から６チャネルまでのグループ「１」、「２」のチャネル割当情報を示している。ちなみに、図に示す記号を以下に説明する。
Ｃ（ｍｏｎｏ）：モノラル
Ｌ，Ｒ：２チャネルステレオ
Ｌｆ：マルチチャネルのレフトフロント
Ｒｆ：マルチチャネルのライトフロント
Ｃ：マルチチャネルのセンター
ＬＦＥ：マルチチャネルのＬｏｗＦｒｅｑｕｅｎｃｙＥｆｆｅｃｔ
Ｓ：マルチチャネルのサラウンド
Ｌｓ：マルチチャネルのレフトサラウンド
Ｒｓ：マルチチャネルのライトサラウンド
【００９０】
（９）デコーディング・オーディオ・ストリーム数（ｂ８７〜ｂ８５）の「０」又は「１」
（１０）ＭＰＥＧオーディオ用ＤＲＣ（ｂ７９、ｂ７８）
００ｂ：ＭＰＥＧオーディオストリーム内にＤＲＣデータが存在しない。
０１ｂ：ＭＰＥＧオーディオストリーム内にＤＲＣデータが存在する。
【００９１】
（１１）圧縮オーディオチャネル数（ｂ７５〜ｂ７２）
オーディオ符号化モードがリニヤＰＣＭオーディオの場合には「１１１１ｂ」
００００ｂ：１ｃｈ（モノ）
０００１ｂ：２ｃｈ（ステレオ）
００１０ｂ：３ｃｈ
００１１ｂ：４ｃｈ
０１００ｂ：５ｃｈ
０１０１ｂ：６ｃｈ
０１１０ｂ：７ｃｈ
０１１１ｂ：８ｃｈ
その他：保留
【００９２】
図３９に示した２８８（１８×１６）バイトのエリア（ＲＢＰ３８４〜６７１）には、マルチチャネルオーディオデータを２チャネルにダウンミックスするために図４３に示すようにテーブル番号「０」〜「１５」の各ダウンミックス係数（ＡＴＳ−ＤＭ−ＣＯＥＦＴ＃０〜＃１５）が１８ビットで記述される。
【００９３】
図３９に示した２バイト（ＲＢＰ７０４、７０５）のエリアは、ＡＯＴＴ用のＡＯＢＳにおけるスチルピクチャデータのアトリビュート（ＡＴＳ−ＳＰＣＴ−ＡＴＲ）を記述するために、図４４に詳しく示すようにＭＳＢ側から順に
・２ビット（ｂ１５、ｂ１４）のビデオ圧縮モードと、
・２ビット（ｂ１３、ｂ１２）のＴＶシステムと、
・２ビット（ｂ１１、ｂ１０）のアスペクト比と、
・２ビット（ｂ９、ｂ８）のディスプレイモードと、
・２ビット（ｂ７、ｂ６）の保留領域と、
・３ビット（ｂ５〜ｂ３）のソースピクチャの解像度と、
・３ビット（ｂ２〜ｂ０）の保留領域により構成されている。
【００９４】
上記ＡＴＳ−ＳＰＣＴ−ＡＴＲの内容を以下に詳しく示す。
（１）ビデオ圧縮モード（ｂ１５、ｂ１４）
００ｂ：ＭＰＥＧ１対応
０１ｂ：ＭＰＥＧ２対応
その他：保留
（２）ＴＶシステム（ｂ１３、ｂ１２）
００ｂ：５２５／６０
０１ｂ：６２５／６０
その他：保留
（３）アスペクト比（ｂ１１、ｂ１０）
００ｂ：４：３
１１ｂ：１６：９
その他：保留
（４）ディスプレイモード（ｂ９、ｂ８）
００ｂ：保留
０１ｂ：保留
１０ｂ：レターボックスのみ許可
１１ｂ：記述しない。
（５）ソースピクチャの解像度（ｂ５〜ｂ３）

その他：保留
【００９５】
図３１（Ａ）に示したＡＴＳ−ＰＧＣＩＴ（ＡＴＳプログラム・チェーン・インフォメーション・テーブル）は、図４５に詳しく示すように先頭から順に
・図４６に詳しく示すオーディオ・タイトルセットＰＧＣＩテーブル・インフォメーション（ＡＴＳ−ＰＧＣＩＴＩ）と、
・図４７、図４８に詳しく示すｎ個のオーディオ・タイトルセットＰＧＣＩサーチポインタ（ＡＴＳ−ＰＧＣＩ−ＳＲＰ＃１〜＃ｎ）と
・図４９に詳しく示す複数のオーディオ・タイトルセットＰＧＣＩにより構成されている。
【００９６】
ＡＴＳ−ＰＧＣＩＴＩは図４６に詳しく示すように８バイトで構成され、先頭から順に
・２バイトのＡＴＳ−ＰＧＣＩ−ＳＲＰ＃１〜＃ｎの数と、
・２バイトの保留領域と、
・４バイトのＡＴＳ−ＰＧＣＩＴのエンドアドレスにより構成されている。ＡＴＳ−ＰＧＣＩ−ＳＲＰ＃１〜＃ｎの各々は、図４７に詳しく示すように８バイトで構成され、先頭から順に
・図４８に詳しく示す４バイトのＡＴＳ−ＰＧＣのカテゴリ（ＡＴＳ−ＰＧＣ−ＣＡＴ）と
・４バイトのＡＴＳ−ＰＧＣＩのエンドアドレスにより構成されている。
【００９７】
上記の４バイト（ｂ３１〜ｂ０）のＡＴＳ−ＰＧＣのカテゴリは、図４８に詳しく示すように先頭から順に
・１ビット（ｂ３１）のエントリータイプと、
・７ビット（ｂ３０〜ｂ２４）のＡＴＳオーディオタイトル数（ＡＴＳ−ＴＴＮ）と、
・２ビット（ｂ２３、ｂ２２）のブロックモードと、
・２ビット（ｂ２１、ｂ２０）のブロックタイプと、
・４ビット（ｂ１９〜ｂ１６）のオーディオチャネル数と、
・８ビット（ｂ１５〜ｂ８）のオーディオ符号化モードと、
・８ビット（ｂ７〜ｂ０）の保留領域により構成されている。
【００９８】
上記カテゴリ（ＡＴＳ−ＰＧＣ−ＣＡＴ）の内容を以下に詳しく示す。
（１）エントリータイプ（ｂ３１）
０ｂ：エントリーＰＧＣでない
１ｂ：エントリーＰＧＣ
（２）ＡＴＳオーディオタイトル数（ｂ３０〜ｂ２４）
このＡＴＳのオーディオタイトル数を「１」〜「９９」の範囲で記述する。
（３）ブロックモード（ｂ２３、ｂ２２）
００ｂ：ＡＴＳ−ＰＧＣブロックのＡＴＳ−ＰＧＣでない
０１ｂ：ＡＴＳ−ＰＧＣブロックの最初のＡＴＳ−ＰＧＣ
１０ｂ：保留
１１ｂ：ＡＴＳ−ＰＧＣブロックの最後のＡＴＳ−ＰＧＣ
（４）ブロックタイプ（ｂ２１、ｂ２０）
００ｂ：このブロックの一部でない
０１ｂ：オーディオコーディングモードのみの差分のブロック
１０ｂ：オーディオチャネルのみの差分のブロック
１１ｂ：オーディオコーディングモードとオーディオチャネルの両方の差分のブロック
（５）オーディオチャネル数（ｂ１９〜ｂ１６）
００００ｂ：２チャネル以下
０００１ｂ：２チャネルを超える
【００９９】
図４５に示したオーディオ・タイトルセットＰＧＣＩ（ＡＴＳ−ＰＧＣＩ）の各々は、図４９に詳しく示すように先頭から順に
・図５０、図５１に詳しく示すＡＴＳ−ＰＧＣジェネラル・インフォメーション（ＡＴＳ−ＰＧＣ−ＧＩ）と、
・図５２〜図５６に詳しく示すＡＴＳプログラム・インフォメーション・テーブル（ＡＴＳ−ＰＧＩＴ）と
・図５５〜図５７に詳しく示すＡＴＳセル・プレイバック・インフォメーション・テーブル（ＡＴＳ−Ｃ−ＰＢＩＴ）により構成されている。
【０１００】
ＡＴＳ−ＰＧＣ−ＧＩは図５０に詳しく示すように１６バイト（ＲＢＰ０〜１５）で構成され、先頭から順に
・図５１に詳しく示す４バイト（ＲＢＰ０〜３）のＡＴＳ−ＰＧＣコンテンツ（ＡＴＳ−ＰＧＣ−ＣＮＴ）と、
・４バイト（ＲＢＰ４〜７）のＡＴＳ−ＰＧＣプレイバック・タイム（ＡＴＳ−ＰＧＣ−ＰＢ−ＴＭ）と、
・２バイト（ＲＢＰ８、９）の保留領域と、
・２バイト（ＲＢＰ１０、１１）のＡＴＳ−ＰＧＩＴのスタートアドレスと、
・２バイト（ＲＢＰ１２、１３）のＡＴＳ−Ｃ−ＰＢＩＴのスタートアドレスと、
・２バイト（ＲＢＰ１４、１５）の保留領域により構成されている。
【０１０１】
上記の４バイト（ｂ３１〜ｂ０）のＡＴＳ−ＰＧＣコンテンツは図５１に詳しく示すように先頭から順に
・１７ビット（ｂ３１〜ｂ１５）の保留領域と、
・７ビット（ｂ１４〜ｂ８）のプログラム数と
・８ビット（ｂ７〜ｂ０）のセル数により構成されている。プログラム数は「１」〜「９９」の範囲であり、セル数は「１」〜「２５５」の範囲である。
【０１０２】
図４９に示したＡＴＳプログラム・インフォメーション・テーブル（ＡＴＳ−ＰＧＩＴ）は、図５２に詳しく示すようにｎ個のＡＴＳプログラム・インフォメーション（ＡＴＳ−ＰＧＩ）＃１〜＃ｎにより構成されている。ＡＴＳ−ＰＧＩ＃１〜＃ｎの各々は図５３に詳しく示すように２０バイト（ＲＢＰ０〜１９）で構成され、先頭から順に
・図５４に詳しく示す４バイト（ＲＢＰ０〜３）のＡＴＳ−ＰＧコンテンツ（ＡＴＳ−ＰＧ−ＣＮＴ）と、
・１バイト（ＲＢＰ４）のＡＴＳ−ＰＧのエントリセル番号と、
・１バイト（ＲＢＰ５）の保留領域と、
・４バイト（ＲＢＰ６〜９）のＡＴＳ−ＰＧの最初のオーディオセルのスタート・プレゼンテーション・タイム（ＦＡＣ−Ｓ−ＰＴＭ）と、
・４バイト（ＲＢＰ１０〜１３）のＡＴＳ−ＰＧプレイバック・タイムと、
・４バイト（ＲＢＰ１４〜１７）のＡＴＳ−ＰＧポーズ・タイムと、
・１バイト（ＲＢＰ１８）の保留領域（著作権管理データＣＭＩ用）と、
・１バイト（ＲＢＰ１９）の保留領域
により構成されている。
【０１０３】
上記２バイト（ｂ３１〜０）のＡＴＳ−ＰＧコンテンツは、図５４に詳しく示すように先頭から順に
・１ビット（ｂ３１）の、前回と今回のＰＧの関係（Ｒ／Ａ）と、
・１ビット（ｂ３０）のＳＴＣ不連続性フラグ（ＳＴＣ−Ｆ）と、
・３ビット（ｂ２９〜ｂ２７）のアトリビュート数（ＡＴＲＮ）と、
・３ビット（ｂ２６〜ｂ２４）のチャネルグループ（ＣｈＧｒ）「２」のビットシフトデータと、
・２ビット（ｂ２３、ｂ２２）の保留領域と、
・１ビット（ｂ２１）のダウンミックスモード（Ｄ−Ｍ）と、
・１ビット（ｂ２０）のダウンミックス係数の有効性（図示※）と、
・４ビット（ｂ１９〜ｂ１６）のダウンミックス係数テーブル番号（ＤＭ−ＣＯＥＦＴＮ）と、
・各々が１ビット、合計１６ビット（ｂ１５〜ｂ０）のＲＴＩフラグＦ１５〜Ｆ０により構成されている。
【０１０４】
図４９に示したＡＴＳセル・プレイバック・インフォメーション・テーブル（ＡＴＳ−Ｃ−ＰＢＩＴ）は、図５５に詳しく示すようにｎ個のＡＴＳセル・プレイバック・インフォメーション（ＡＴＳ−Ｃ−ＰＢＩ）＃１〜＃ｎにより構成されている。ＡＴＳ−Ｃ−ＰＢＩ＃１〜＃ｎの各々は、図５６に詳しく示すように１２バイト（ＲＢＰ０〜１１）により構成され、先頭から順に
・１バイト（ＲＢＰ０）のＡＴＳ−Ｃのインデックス番号と、
・図５７に詳しく示す１バイト（ＲＢＰ１）のＡＴＳ−Ｃタイプ（ＡＴＳ−Ｃ−ＴＹ）と、
・２バイト（ＲＢＰ２、３）の保留領域と、
・４バイト（ＲＢＰ４〜７）のＡＴＳ−Ｃのスタートアドレスと、
・４バイト（ＲＢＰ８〜１１）のＡＴＳ−Ｃのエンドアドレスにより構成されている。
【０１０５】
１バイト（ｂ７〜ｂ０）のＡＴＳ−Ｃタイプは、図５７に詳しく示すように先頭から順に
・２ビット（ｂ７、ｂ６）のＡＴＳセル要素（ＡＴＳ−Ｃ−ＣＯＭＰ）と、
・２ビット（ｂ５、ｂ４）の保留領域と
・４ビット（ｂ３〜ｂ０）のＡＴＳセル用途（ＡＴＳ−Ｃ−Ｕｓａｇｅ）により構成されている。
【０１０６】
上記データの内容を以下に詳しく示す。
（１）ＡＴＳセル要素（ｂ７、ｂ６）
００ｂ：オーディオデータのみから成るオーディオセル
０１ｂ：オーディオデータとリアルタイムインフォメーションから成るオーディオセル
１０ｂ：サイレンス用のオーディオデータのみから成るサイレンスセル
１１ｂ：スチルピクチャのみから成るピクチャセル
（２）ＡＴＳセル用途（ｂ３〜ｂ０）
００００ｂ：記述無し
０００１ｂ：スポットライトパート
その他：保留
【０１０７】
次の第５の実施形態のエンコード装置について説明する。図５８、図５９はそれぞれエンコード装置の構成と処理を示している。アナログオーディオ信号ＡはＡ／Ｄコンバータ３１により十分高いサンプリング周波数（サンプリング周期Δｔ）、例えば１９２ｋＨｚでサンプリングされて、例えば２４ビットの高分解能のＰＣＭ信号に変換される。続くビットシフト／信号処理回路３２では、圧縮を行わない場合には、Ａ／Ｄコンバータ３１により変換されたＰＣＭデータがそのままＤＶＤフォーマット化部３４に印加される。これに対し、圧縮を行う場合には、Ａ／Ｄコンバータ３１により変換されたＰＣＭデータがその符号化モードに応じてビットシフト／信号処理回路３２により圧縮され、次いでＤＶＤフォーマット化部３４に印加される（ステップＳ５、Ｓ６）。ビットシフト／信号処理回路３２ではまた、グループ「２」の各チャネルがビットシフトされる。
【０１０８】
また、ビデオ信号ＶはＡ／Ｄ変換器３１Ｖによりデジタル信号に変換され、次いでこのデジタルビデオ信号がＶエンコーダ３２ＶによりＭＰＥＧフォーマットにエンコードされ、ＤＶＤフォーマット化部３４に印加される（ステップＳ１、Ｓ２）。また、静止画信号ＳＰはＡ／Ｄ変換器３１ＳＰによりデジタル信号に変換され、次いでこのデジタル静止画信号ＳＰが圧縮エンコーダ３２ＳＰによりＭＰＥＧフォーマットにエンコードされ、ＤＶＤフォーマット化部３４に印加される（ステップＳ３、Ｓ４）。また、著作権情報とリアルタイムテキスト情報（ＲＴＩ）がインタフェース（Ｉ／Ｆ）４０を介して（ステップＳ７、Ｓ８）、また、文字情報とディスク識別子ＥＸがＤＶＤフォーマット化部３４に印加される（ステップＳ９、Ｓ１０）。
【０１０９】
そしてＤＶＤフォーマット化部３４は、前述したようなフォーマットにパッキングする（ステップＳ１１）。このＤＶＤフォーマット化部３４によりフォーマット化されたデータは、変調回路３５によりディスクに応じた変調方式で変調されてこの変調データに基づいてディスクが製造されたり、記録部３８にいったん記録されたり、通信Ｉ／Ｆ３９を介して伝送される（ステップＳ１２）。
【０１１０】
図６０は第５の実施形態のデコード装置の具体的構成を示し、図６１は図６０の構成を機能的に示している。また、図６２はその処理を示している。図６０、図６１において、まず、操作部１８やリモコン装置１９により曲目選択、再生、早送り、停止操作が行われると、制御部２３はその操作に応じてドライブ装置２と再生装置１７を制御し、再生時にはＤＶＤオーディオディスク１に記録されたピットデータがドライブ装置２により読み取られた後、ＥＦＭ復調される。
【０１１１】
再生装置１７では、この信号が静止画及びＶパック検出部３とＡ及びＲＴＩパック検出部９に送られる。静止画パック、Ｖパックがディスク１に記録されている場合には、静止画及びＶパック検出部３はこの再生データ中の静止画パック、Ｖパックを検出して制御パラメータをパラメータ部８に設定するとともに静止画パック、Ｖパックを静止画及びＶパックバッファ４に順次書き込む。静止画及びＶパックバッファ４に書き込まれた静止画パック、Ｖパック内のユーザデータ（ビデオ信号、静止画情報）は、バッファ取り出し部５により静止画パック、Ｖパック内のＳＣＲ（図１３参照）に基づいてパック順に、また、出力時刻順に取り出され、次いで伸長及び画像変換部６、Ｄ／Ａ変換部７、ビデオ出力端子１５、１５’を介してアナログビデオ信号として出力される。
【０１１２】
また、Ａ及びＲＴＩパック検出部９は再生データ中のＡパックとＲＴＩパックを検出して制御パラメータをパラメータ部１４に設定するとともに、ＡパックとＲＴＩパックをＡ及びＲＴＩパックバッファ１０に順次書き込む。Ａ及びＲＴＩパックバッファ１０に書き込まれたＡパック、ＲＴＩパック内のユーザデータ（オーディオ信号、リアルタイム・インフォメーション）は、バッファ取り出し部１１によりパック順に、また、出力時刻順に取り出される。そして、オーディオ信号はＰＣＭ変換及びビットシフト／信号処理部１２、Ｄ／Ａ変換部１３、オーディオ出力端子１６を介してアナログオーディオ信号として出力される。また、リアルタイム・インフォメーションは表示信号生成部２０に送られて表示信号が生成され、この表示信号は表示信号出力端子２２を介して出力されたり、内蔵の文字表示部２１に出力される。
【０１１３】
図６２を参照してこのデコード装置の処理を説明する。まず、ディスク１にアクセスして記録データを読み出し（ステップＳ２０）、次いで各分離ステップＳ２１〜Ｓ２９においてビデオ信号と、静止画信号と、オーディオ信号と、著作権情報及びリアルタイム情報（ＲＴＩ）と、文字情報及びディスク識別子（ＥＸ）が分離される。次いで各デコードステップＳ２２〜Ｓ３０においてそれぞれ各分離データがデコードされ、次いで同期再生される（ステップＳ３１、Ｓ３２）。
【０１１４】
ここで、静止画ＳＰを再生する処理には次の３通りがある。
１）静止画ＳＰが得られると、オーディオ信号Ａの再生を中断してミュートする。
２）静止画ＳＰが得られると、時間制御信号に基づいてオーディオ信号Ａと共に再生する。これをスライドショーと呼ぶ。
３）静止画ＳＰが得られると、ユーザに指示されたページめくりコマンドに基づいてページめくり再生する。このときオーディオ信号Ａはそのまま再生する。これをブラウザブルと呼ぶ。
【０１１５】
静止画を音声に同期させる必要がある場合は、リアルタイムの同期のための時間制御信号は、図３１（Ｂ）のＡＴＳＩに追加して設けるスチルピクチャ・コントロール・インフォメーション・テーブル（ＳＰＣＩＴ）の下のタイム・コントロール・データ・インフォメーション（ＳＰＣＩＴ−ＴＣＤＩ）に置くようにする。
【０１１６】
また、さらにページめくりコマンドを収めたスチルピクチャ・ページ制御コマンド・インフォメーション（ＳＰＰＩ）をＳＰＣＩＴの下に置くようにする。このようにＳＰＣＩＴは、一般情報のＳＰＣＩＴジェネラル・インフォメーション（ＳＰＣＩＴ−ＧＩ）と、タイム・コントロール・データ・インフォメーション（ＳＰＣＩＴ−ＴＣＤＩ）と、スチルピクチャ・ページ制御コマンド・インフォメーション（ＳＰＰＩ）とから構成される。
【０１１７】
また、ここで、図３８のＳＰＣＴパックのスチル・ピクチャ・データの中に、スチルピクチャのページ制御するためのサイド情報を含むようにすることができる。このサイド情報により規定されたページ制御データをＳＰＰＩを参照しながら解釈して行うようにする。なお、スチル・ピクチャ・データに収めるには容量に余裕がない場合は、ＲＴＩパックのＲＴＩデータの中に、上記したスチルピクチャのページ制御するためのサイド情報を含むようにすることも許容できる。
【０１１８】
次に、上記のようにフォーマット化されたデジタルオーディオ信号を通信回線を介して伝送する実施例について説明する。まず、図６３〜図６７を参照して送信側であるパッキング装置について説明する。パッキング装置は図６３に示すようにパッキング処理部３０と、バッファメモリ３０Ｂと、コントロール回路２９と、操作部２７とディスプレイ２８を有する。そして、図６４〜図６７において、まず、ビデオ信号Ｖと、静止画信号ＳＰと、オーディオ信号Ａとリアルタイム情報ＲＴＩとディスク識別子（ＥＸ）が入力すると、ステップＳ１００では図６５に詳しく示すようにオーディオパックを生成し（ステップＳ１０１）、次いでビデオパックを生成し（ステップＳ１０２）、次いで静止画パックを生成し（ステップＳ１０３）、次いでリアルタイムテキストを生成する（ステップＳ１０４）。
【０１１９】
次いでセル（ＡＴＳ−Ｃ）を管理し（ステップＳ２００）、次いでＰＴＴ（パートオブタイトル）を管理し（ステップＳ３００）、次いでタイトル（ＡＯＴＴ−ＡＯＢ）を管理し（ステップＳ４００）、次いでタイトルセット（ＡＯＴＴ−ＡＯＢＳを管理する（ステップＳ５００）。続くステップＳ６００ではＡＴＳを生成するために、図６６に詳しく示すようにタイトルセットを生成し（ステップＳ６０１）、次いでメニューを生成する（ステップＳ６０２）。次いでＡＴＳ−ＰＧＣＩのカテゴリを記載し（ステップＳ６０３）、次いでビットシフトを含むＰＧコンテンツから成るＰＧＩＴを生成してＰＧＣＩを生成することによりＡＴＳ−ＰＧＣＩＴを生成する（ステップＳ６０４）。次いで属性、係数のＭＡＴを生成することによりＡＴＳＩを生成する（ステップＳ６０５）。次いでＡＭＧを生成し（ステップＳ７００）、最後にＴＯＣを生成する（ステップＳ８００）。
【０１２０】
次に、上記のようにフォーマット化されたデジタルオーディオ信号を通信回線を介して伝送する場合には、図６７に示すように、送信バッファに蓄えられている送信データを所定長に分割してパケット化し（ステップＳ４１）、次いでパケットの先頭には宛て先アドレスを含むヘッダを付与し（ステップＳ４２）、次いでこれをネットワーク上に出力する（ステップＳ４３）。
【０１２１】
次に図６８〜図７２を参照してデータ受信側について説明する。図６８に示すようにデータ受信側のアンパッキング装置は、アンパッキング処理部６０と、バッファメモリ６０Ｂと、パラメータメモリ５６と、コントロール回路５９と、操作部５７とディスプレイ５８を有する。まず、図６９に示すように、ネットワークから受信したパケットからヘッダを除去し（ステップＳ５１）、次いで受信データを復元し（ステップＳ５２）、次いでこれをメモリに転送する（ステップＳ５３）。
【０１２２】
次に図７０〜図７２に示すように、まず、ＡＭＧをデコードしてＡＴＳを検出し（ステップＳ１１００）、続くステップＳ１２００では目的のＡＴＳのＡＴＳＩをデコードするために、図７１に詳しく示すようにＡＴＳ−ＰＧＣＩのカテゴリをデコードし（ステップＳ１２０１）、次いでビットシフトを含むＰＧコンテンツから成るＰＧＩＴをデコードし（ステップＳ１２０２）、次いでＭＡＴの属性、係数をデコードし（ステップＳ１２０３）、次いでこれらのデコードした各パラメータをパラメータメモリ５６に設定する（ステップＳ１２０４）。
【０１２３】
次いで再生が開始されると、パックを識別し（ステップＳ１３００）、続くステップＳ１４００ではパックをデコードするために、図７２に詳しく示すようにオーディオパックをデコードし（ステップＳ１４０１）、次いでビデオパックをデコードし（ステップＳ１４０２）、次いで静止画パックをデコードし（ステップＳ１４０３）、次いでリアルタイムテキストをデコードする（ステップＳ１４０４）。そしてこれらの各パックからデコードしたオーディオ信号と、ビデオ信号と、静止画信号とリアルタイムテキスト信号を出力し（ステップＳ１５００）、再生中にはステップＳ１３００〜ステップＳ１５００の処理を繰り返す。
【０１２４】
次に図３１に示したＳＰＳ（スチルピクチャセット）すなわちＡＳＶＳ（オーディオ・スチル・ビデオ・セット）について図７３〜図８０を参照して詳しく説明する。ここで、以下に示すスチル・ピクチャ・オブジェクト・セット（ＳＰＯＢＳ）についても、サブピクチャ（ＳＰ）との混同を避けるためにオーディオ・スチル・ビデオ・オブジェクト・セット（ＡＳＶＯＢＳ）とも言う。図７３に示すようにＡＳＶＳ（オーディオ・スチル・ビデオ・セット）は、図７４、図７５に詳しく示すＡＳＶＳインフォメーション（ＡＳＶＳＩ）と、図７６に詳しく示すオーディオ・スチル・ビデオ・オブジェクト・セット（ＡＳＶＯＢＳ）とバックアップＡＳＶＳＩにより構成されている。
【０１２５】
ＡＳＶＳインフォメーション（ＡＳＶＳＩ）は、図７４に詳しく示すオーディオ・スチル・ビデオ・ユニット・インフォメーション（ＡＳＶＵＩ）と、図７５に詳しく示すＡＳＶアドレスマップ（ＡＳＶ−ＡＤＭＡＰ）とスタッフィングエリア（００ｈ）により構成されている。
【０１２６】
ＡＳＶＵＩ（合計８８８バイト）は図７４に示すように
・１２バイトのＡＳＶＳ−ＩＤと、
・２バイトのＡＳＶＵの数と、
・２バイトの保留エリアと、
・４バイトのＡＳＶＯＢＳスタートアドレスと、
・４バイトのＡＳＶＯＢＳエンドアドレスと、
・２バイト×４のＡＳＶＵアトリビュート＃０〜＃３と、
・４バイト×１６のＡＳＶＯＢＳサブピクチャパレット＃０〜＃１５と、
・８バイト×９９のＡＳＶＵ＃１〜＃９９一般情報
により構成されている。
【０１２７】
図７３に示したＡＳＶアドレスマップ（ＡＳＶ−ＡＤＭＡＰ）は、図７５に詳しく示すようにｍ（≦９９）個のＡＳＶＵ＃１〜＃ｍにより構成され、ＡＳＶＵ＃１〜＃ｍの各々はＡＳＶＵ＃１〜＃９９のスタートアドレスにより構成されている。
【０１２８】
次に図７３に示したオーディオ・スチル・ビデオ・オブジェクト・セット（ＡＳＶＯＢＳ）について説明する。オーディオ・スチル・ビデオ・オブジェクト（ＡＳＶＯＢ）は１つのオーディオ・スチル・ビデオ（ＡＳＶ）のプレゼンテーションデータであって、ボタン用のハイライト（ＨＬＴ）インフォメーションデータと、同じくボタン用のサブピクチャ（ＳＰ）データとスチルピクチャ（ＳＰＣＴ）データを含む。ただし、
・１つのＡＳＶＯＢ内には１つのスチルピクチャ（ＳＰＣＴ）データのみが含まれる。
・１つのＡＳＶＯＢ内には１つのハイライト（ＨＬＴ）インフォメーションデータのみを含むことができる。ハイライトインフォメーションはスチルピクチャのボタンを操作するために用いられる。
・１つのＡＳＶＯＢ内にはスチルピクチャモードに応じて１ないし３のサブピクチャ（ＳＰ）データを含むことができる。ＳＰデータはスチルピクチャのボタンを表示するために用いられる。
【０１２９】
図７３に示したオーディオ・スチル・ビデオ・オブジェクト・セット（ＡＳＶＯＢＳ）は図７６に示すように上記のＡＳＶＯＢの集合体である。ＡＳＶＯＢは図７６（ａ）に示すように、ハイライト（ＨＬＴ）パックを先頭に配置するもののハイライト情報を含まない空のパックとしてＨＬＴパックとして機能させないために、実質的にスチルピクチャ（ＳＰＣＴ）パックのみを含むものと、図７６（ｂ）に示すようにハイライト（ＨＬＴ）パックと、サブピクチャ（ＳＰ）パックとスチルピクチャ（ＳＰＣＴ）パックを含むものの２種類がある。
【０１３０】
ハイライト（ＨＬＴ）パックは図７７（ａ）に示すように、１４バイトのパックヘッダと、システムヘッダと２０１３バイトのハイライト情報パケットにより構成されている。ハイライト情報パケットは６バイトのパケットヘッダと、１バイトのサブストリームＩＤと６９４バイトのハイライト情報（ＡＳＶ−ＨＬＩ）により構成されている。システムヘッダは４バイトのシステムスタートコードと、２バイトのヘッダ長と、３バイトのレートバウンドと、２バイトのオーディオバウンドと、１バイトの制限フラグと、９バイトのストリームＩＤの各エリアにより構成されている。
【０１３１】
ハイライト情報（ＡＳＶ−ＨＬＩ）は図７８に示すように、ＡＳＶハイライト一般情報（２２バイト）と、ＡＳＶボタンカラー情報テーブル（８バイト×３）と、ＡＳＶボタン情報テーブル（１８バイト×３６）から構成される。ＡＳＶボタン情報テーブルはＡＳＶボタン情報＃１〜＃ｎから成り、各ＡＳＶボタン情報＃ｉはピクチャ制御コマンドであるＡＳＶボタンコマンドを含む。このＡＳＶボタンコマンドには図８１に示すボタンが操作されたときのナビコマンドが記述される。
【０１３２】
これに対し、ＳＰＣＴパックとＳＰパックは図７７（ｂ）に示すように、１４バイトのパックヘッダと２０２５バイトのスチルピクチャパケット又はサブピクチャパケットにより構成され、パックヘッダは４バイトのパックスタートコードと、６バイトのＳＣＲと、３バイトのプログラムｍｕｘレートと、９又は２２バイトのスタッフィング長により構成されている。
【０１３３】
スチルピクチャ（ＳＰＣＴ）パケットは図７９に詳しく示すようにＳＰＣＴパケットヘッダとスチルピクチャ用のビデオデータを含む。このＳＰＣＴパケットヘッダは、先頭において必ず設けられる９バイトのＳＰＣＴパケット情報と、静止画の最初のＳＰＣＴパケットの場合にのみ設けられる５＋５バイトのＳＰＣＴパケット情報と、ＡＳＶＯＢの最初のＳＰＣＴパケットの場合にのみ設けられる３バイトのＳＰＣＴパケット情報を含む。
【０１３４】
サブピクチャ（ＳＰ）パケットは図８０に詳しく示すようにＳＰパケットヘッダとサブピクチャデータを含む。このＳＰパケットヘッダは、先頭において必ず設けられる９バイトのＳＰパケット情報と、ＳＰユニットの最初のＳＰパケットの場合にのみ設けられる５バイトのＳＰパケット情報と、ＡＳＶＯＢの最初のＳＰパケットの場合にのみ設けられる３バイトのＳＰパケット情報を含む。そして、このようなデータ構造に基づいてデコーダ側では、図８１に示すようにメインピクチャと、サブピクチャとハイライト情報が合成されて表示される。
【０１３５】
次に図８２以下を参照して第５の実施形態の変形例のデータ構造について説明する。ここで、図３１（Ｂ）に示したデータ構造では、スチルピクチャ・コントロール・インフォメーション・テーブル（ＳＰＣＩＴ）がＡＴＳ内においてＡＴＳＩ−ＭＡＴ及びＡＴＳ−ＰＧＣＩＴとは独立して設けられているが、この変形例では図８２に示すようにＡＴＳ−ＰＧＣＩＴ内に設けられている。なお、以下の説明ではサブピクチャ（ＳＰ）との混同を避けるために、ＳＰＣＩＴをＡＴＳ−ＡＳＶ−ＰＢＩＴ（ＡＴＳオーディオスチルビデオ・プレイバックインフォメーションテーブル）と言う。
【０１３６】
このＡＴＳ−ＡＳＶ−ＰＢＩＴは図８３に詳しく示すように、図４９に示したＡＴＳ−ＰＢＩＴにおいて追加して設けられ、図８４、図８５に詳しく示すｍ個のプログラム＃１〜＃ｍの各ＡＴＳ−ＡＳＶ−プレイバックインフォメーション・サーチポインタ（ＡＴＳ−ＰＧ−ＡＳＶ−ＰＢＩ−ＳＲＰ＃１〜＃ｍ）と、図８６〜図９０に詳しく示すｎ個のＡＴＳ−ＡＳＶ−ＰＢＩ＃１〜＃ｎを有する（ｎ≦ｍ≦９９）。このＳＲＰ＃１〜＃ｍの各々は、図８４に示すように
・１バイトのＡＳＶＵ番号と、
・１バイトのＡＳＶディスプレイモード（ＡＳＶ−ＤＭＯＤ）と、
・２バイトのＡＴＳ−ＡＳＶ−ＰＢＩスタートアドレスと、
・２バイトのＡＴＳ−ＡＳＶ−ＰＢＩエンドアドレスと
により構成されている。
【０１３７】
ＡＳＶＵ番号は「１」から「９９」の範囲の値である。ＡＳＶディスプレイモードは、図８５に詳しく示すように
・４ビット（ｂ７〜ｂ４）の保留エリアと、
・２ビット（ｂ３、ｂ２）のディスプレイタイミングモードと、
・２ビット（ｂ１、ｂ０）のディスプレイオーダモードと
により構成されている。上記データを以下に詳しく示す。
（１）ディスプレイタイミングモード
００ｂ：スライドショー
０１ｂ：ブラウザブル
その他：保留
（２）ディスプレイオーダモード
００ｂ：シーケンシャル
０１ｂ：ランダム
１１ｂ：シャッフル
その他：保留
【０１３８】
ＡＴＳ−ＡＳＶ−ＰＢＩ＃１〜＃ｎの各々は、図８６に示すように１０バイト×ｋ個（ｋ≦９９）のＡＳＶディスプレイリストを含む。図８７はディスプレイタイミングモード（ｂ３、ｂ２）がスライドショー（００ｂ）であって、ディスプレイオーダモード（ｂ１、ｂ０）がシーケンシャル（００ｂ）の場合のＡＳＶディスプレイリストを示し、このリストは
・８ビット（ｂ７９〜ｂ７２）のＡＳＶ番号と、
・８ビット（ｂ７１〜ｂ６４）の保留領域と、
・８ビット（ｂ６３〜ｂ５６）の、ＡＳＶのスタート時に強制的に選択されるボタン番号（ＦＯＳＬ−ＢＴＮＮ）と、
・８ビット（ｂ５５〜ｂ４８）の、ＡＳＶのスタート時にプレーバックされるプログラム番号と、
・８ビット×４（ｂ４７〜ｂ１６）のディスプレイスタートタイミング（３１〜０）と、
・４ビット（ｂ１５〜ｂ１２）のスタートエフェクトモードと、
・４ビット（ｂ１１〜ｂ８）のスタートエフェクト期間と、
・４ビット（ｂ７〜ｂ４）のエンドエフェクトモードと、
・４ビット（ｂ３〜ｂ０）のエンドエフェクト期間と
により構成されている。
【０１３９】
上記データの内容を以下に詳しく示す。
（１）ディスプレイスタートタイミング（３１〜０）は、スタートＰＴＳからのディスプレイのスタートタイミングを示し、３１〜０／９００００（秒）を表す。
（２）スタートエフェクトモード（ｂ１５〜ｂ１２）
００００ｂ：カットイン
０００１ｂ：フェードイン
００１０ｂ：ディゾルブ
００１１ｂ：ワイプ・フロム・トップ
０１００ｂ：ワイプ・フロム・ボトム
０１０１ｂ：ワイプ・フロム・レフト
０１１０ｂ：ワイプ・フロム・ライト
０１１１ｂ：ワイプ・ダイアゴナル・レフト
１０００ｂ：ワイプ・ダイアゴナル・ライト
その他：保留
（３）エンドエフェクトモード（ｂ７〜ｂ４）
００００ｂ：カットアウト
０００１ｂ：フェードアウト
その他：（２）スタートエフェクトモード（ｂ１５〜ｂ１２）と同じ
【０１４０】
図８８は図８５のディスプレイモードにおいてディスプレイタイミングモード（ｂ３、ｂ２）がスライドショー（００ｂ）であって、ディスプレイオーダモード（ｂ１、ｂ０）がランダム（０１ｂ）の場合のＡＳＶディスプレイリストを示し、このリストでは８ビット（ｂ７９〜ｂ７２）が保留となる他は図８７に示すリストと同一となる。図８９は図８５のディスプレイモードにおいてディスプレイタイミングモード（ｂ３、ｂ２）がブラウザブル（０１ｂ）であって、ディスプレイオーダモード（ｂ１、ｂ０）がシーケンシャル（００ｂ）の場合のＡＳＶディスプレイリストを示し、このリストでは８ビット（ｂ５５〜ｂ４８）が保留となる他は図９７に示すリストと同一となる。図９０は図８５のディスプレイモードにおいてディスプレイタイミングモード（ｂ３、ｂ２）がブラウザブル（０１ｂ）であって、ディスプレイオーダモード（ｂ１、ｂ０）がランダム（０１ｂ）の場合のＡＳＶディスプレイリストを示し、このリストでは８ビット（ｂ７９〜ｂ７２）と８ビット（ｂ５５〜ｂ４８）が保留となる他は図８７に示すリストと同一となる。
【０１４１】
上記エンコード装置及びデコード装置は、上記エンコード方法及びデコード方法をコンピュータプログラムとしてＲＯＭなどのＩＣチップに記憶しておき、このプログラムによりコンピュータのＣＰＵ（中央演算処理装置）を作動させることによっても実現できる。本発明はまた、ＤＶＤなどの記録媒体を介して伝送するのみならず、インターネットやカラオケ通信回線などの通信回線を介して伝送して再生側ではハードウエアやＰＣ上のアプリケーションにより処理する場合にも適用することができる。
【０１４２】
【発明の効果】
以上説明したように本発明によれば、オーディオ信号を主として記録する場合にユーザにとって簡易に再生することができて使い勝手がよく、また、実時間の管理を簡単にすることができる。
【図面の簡単な説明】
【図１】ＤＶＤ−ビデオのフォーマットと、本発明に適用されるＤＶＤ−オーディオのフォーマットの第１実施形態を示す説明図である。
【図２】図１のオーディオマネージャ（ＡＭＧ）のフォーマットを詳しく示す説明図である。
【図３】図１のオーディオタイトルセット（ＡＴＳ）のフォーマットを詳しく示す説明図である。
【図４】図２のオーディオマネージャインフォメーション（ＡＭＧＩ）のフォーマットを詳しく示す説明図である。
【図５】図４のオーディオタイトルセット・アトリビュートテーブル（ＡＴＳ−ＡＴＲＴ）のフォーマットを詳しく示す説明図である。
【図６】図５のオーディオタイトルセット・アトリビュートデータ（ＡＴＳ−ＡＴＲ）のフォーマットを詳しく示す説明図である。
【図７】図３のオーディオタイトルセットインフォメーション（ＡＴＳＩ）のフォーマットを詳しく示す説明図である。
【図８】図７のオーディオタイトルセットインフォメーション・マネージメントテーブル（ＡＴＳＩ−ＭＡＴ）のフォーマットを詳しく示す説明図である。
【図９】図８のオーディオタイトルセットメニュー・オーディオストリーム・アトリビュートデータ（ＡＴＳＭ−ＡＳＴ−ＡＴＲ）を詳しく示す説明図である。
【図１０】図８のオーディオタイトルセット・オーディオストリーム・アトリビュートテーブル（ＡＴＳ−ＡＳＴ−ＡＴＲＴ）のフォーマットを詳しく示す説明図である。
【図１１】図１０の各オーディオストリームのアトリビュートデータ（ＡＴＳ−ＡＳＴ−ＡＴＲ）を詳しく示す説明図である。
【図１２】図１のオーディオコンテンツブロックユニット（ＡＣＢＵ）を示す説明図である。
【図１３】図１２のオーディオパックとビデオパックのフォーマットを詳しく示す説明図である。
【図１４】図１２のオーディオコントロール（Ａ−ＣＯＮＴ）パックのフォーマットを詳しく示す説明図である。
【図１５】図１４のオーディオキャラクタディスプレイ（ＡＣＤ）エリアのフォーマットを詳しく示す説明図である。
【図１６】図１５のネームスペース情報により表示される例を示す説明図である。
【図１７】図１４のオーディオサーチデータ（ＡＳＤ）エリアのフォーマットを詳しく示す説明図である。
【図１８】図１のオーディオコンテンツブロックユニットの変形例を示す説明図である。
【図１９】第２の実施形態におけるオーディオマネージャインフォメーション（ＡＭＧＩ）のフォーマットを詳しく示す説明図である。
【図２０】図１９のＴＯＣ情報を詳しく示す説明図である。
【図２１】第３の実施形態のオーディオタイトルセットインフォメーション（ＡＴＳＩ）のフォーマットを詳しく示す説明図である。
【図２２】第４の実施形態のＤＶＤ−オーディオディスクの基本フォーマットを示す説明図である。
【図２３】図２２のＤＶＤ−オーディオディスクのオーディオデータ構造を示す説明図である。
【図２４】ＤＶＤ−Ｖａｎディスクの基本フォーマットを示す説明図である。
【図２５】ＤＶＤビデオディスクの基本フォーマットを示す説明図である。
【図２６】ＤＶＤ−Ａｖｄディスクの基本フォーマットを示す説明図である。
【図２７】第４の実施形態のＤＶＤ−オーディオディスクにおけるＡＯＴＴ−ＡＯＢ−ＡＴＲを示す説明図である。
【図２８】第４の実施形態のＤＶＤ−ＡｖｄディスクにおけるリニアＰＣＭのプライベートヘッダを示す説明図である。
【図２９】本発明に適用されるオーディオ信号のエンコード装置の一実施形態を示すブロック図である。
【図３０】図２９の信号処理回路を詳細に示すブロック図である。
【図３１】第５の実施形態のデータ構造を示す説明図である。
【図３２】図３１のオーディオ・オンリ・タイトル用オーディオ・オブジェクト・セット（ＡＯＴＴ−ＡＯＢＳ）を詳しく示す説明図である。
【図３３】図３２のオーディオパックの一例を詳しく示す説明図である。
【図３４】図３３のプライベートヘッダを詳しく示す説明図である。
【図３５】図３４のＵＰＣ／ＥＡＮ−ＩＳＲＣデータを詳しく示す説明図である。
【図３６】図３３のオーディオデータのビットシフトを示す説明図である。
【図３７】図３２のリアルタイム・インフォメーション（ＲＴＩ）パックを詳しく示す説明図である。
【図３８】スチルピクチャ（ＳＰＣＴ）パックを詳しく示す説明図である。
【図３９】図３１のオーディオ・タイトルセット・インフォメーション・マネージメント・テーブル（ＡＴＳＩ−ＭＡＴ））を詳しく示す説明図である。
【図４０】図３９のオーディオ・オンリ・タイトル用オーディオ・オブジェクト・アトリビュート（ＡＯＴＴ−ＡＯＢ−ＡＴＲ）を詳しく示す説明図である。
【図４１】図３９のオーディオ・オンリ・タイトル用ビデオ・オブジェクト・オーディオストリーム・アトリビュート（ＡＯＴＴ−ＶＯＢ−ＡＳＴ−ＡＴＲ）を詳しく示す説明図である。
【図４２】図４０及び図４１のチャネル割当情報を詳しく示す説明図である。
【図４３】図３９のダウンミックス係数（ＡＴＳ−ＤＭ−ＣＯＥＦＴ）を詳しく示す説明図である。
【図４４】図３９のスチルピクチャ・データ・アトリビュート（ＡＴＳ−ＳＰＣＴ−ＡＴＲ）を詳しく示す説明図である。
【図４５】図３１のオーディオ・タイトルセット・プログラム・チェーン・インフォメーション・テーブル（ＡＴＳ−ＰＧＣＩＴ）を詳しく示す説明図である。
【図４６】図４５のＡＴＳ−ＰＧＣＩＴインフォメーション（ＡＴＳ−ＰＧＣＩＴＩ）を詳しく示す説明図である。
【図４７】図４５のＡＴＳ−ＰＧＣＩサーチポインタ（ＡＴＳ−ＰＧＣＩ−ＳＲＰ）を詳しく示す説明図である。
【図４８】図４７のＡＴＳ−ＰＧＣカテゴリ（ＡＴＳ−ＰＧＣＩ−ＣＡＴ）を詳しく示す説明図である。
【図４９】図４５のオーディオ・タイトルセット・プログラム・チェーン・インフォメーション（ＡＴＳ−ＰＧＣＩ）を詳しく示す説明図である。
【図５０】図４９のＡＴＳ−ＰＧＣジェネラル・インフォメーション（ＡＴＳ−ＰＧＣ−ＧＩ）を詳しく示す説明図である。
【図５１】図５０のＡＴＳ−ＰＧＣコンテンツ（ＡＴＳ−ＰＧＣ−ＣＮＴ）を詳しく示す説明図である。
【図５２】図４９のＡＴＳプログラム・インフォメーション・テーブル（ＡＴＳ−ＰＧＩＴ）を詳しく示す説明図である。
【図５３】図５２のＡＴＳプログラム・インフォメーション（ＡＴＳ−ＰＧＩ）を詳しく示す説明図である。
【図５４】図５３のＡＴＳ−ＰＧコンテンツ（ＡＴＳ−ＰＧ−ＣＮＴ）を詳しく示す説明図である。
【図５５】図５２のＡＴＳセル・プレイバック・インフォメーション・テーブル（ＡＴＳ−Ｃ−ＰＢＩＴ）を詳しく示す説明図である。
【図５６】図５５のＡＴＳセル・プレイバック・インフォメーション（ＡＴＳ−Ｃ−ＰＢＩ）を詳しく示す説明図である。
【図５７】図５６のＡＴＳ−Ｃタイプ（ＡＴＳ−Ｃ−ＴＹ）を詳しく示す説明図である。
【図５８】第５の実施形態のエンコード装置を示すブロック図である。
【図５９】図５８のエンコード装置の処理を示すフローチャートである。
【図６０】第５の実施形態のデコード装置を示すブロック図である。
【図６１】図６０のデコード装置を機能的に示すブロック図である。
【図６２】図６０、図６１のデコード装置の処理を示すフローチャートである。
【図６３】第５の実施形態のオーディオ信号を伝送する場合のパッキング装置を示すブロック図である。
【図６４】図６３のパッキング装置のパッキング処理を示すフローチャートである。
【図６５】図６４のパック生成処理を詳しく示すフローチャートである。
【図６６】図６４のＡＴＳ生成処理を詳しく示すフローチャートである。
【図６７】図６３のパッキング装置の送信処理を示すフローチャートである。
【図６８】第５の実施形態のオーディオ信号を伝送する場合のアンパッキング装置を示すブロック図である。
【図６９】図６８のアンパッキング装置の受信処理を示すフローチャートである。
【図７０】図６８のアンパッキング装置のアンパッキング処理を示すフローチャートである。
【図７１】図７０のＡＴＳＩデコード処理を詳しく示すフローチャートである。
【図７２】図７０のパックデコード処理を詳しく示すフローチャートである。
【図７３】図３１のＳＰＳ（スチルピクチャセット）すなわちＡＳＶＳ（オーディオ・スチル・ビデオ・セット）を詳しく示す説明図である。
【図７４】図７３のＡＳＶＵＩ（オーディオ・スチル・ビデオ・ユニット・インフォメーション）を詳しく示す説明図である。
【図７５】図７３のＡＳＶ−ＡＤＭＡＰ（オーディオ・スチル・ビデオ・アドレスマップ）を詳しく示す説明図である。
【図７６】図７３のＡＳＶＯＢＳ（オーディオ・スチル・ビデオ・オブジェクト・セット）を詳しく示す説明図である。
【図７７】図７６のハイライト情報パック、スチルピクチャパック及びサブピクチャパックを詳しく示す説明図である。
【図７８】図７７のハイライト情報を詳しく示す説明図である。
【図７９】図７７のスチルピクチャパケットを詳しく示す説明図である。
【図８０】図７７のサブピクチャパケットを詳しく示す説明図である。
【図８１】図７８のハイライト情報パック、スチルピクチャパック及びサブピクチャパックによる表示画面を示す説明図である。
【図８２】図３１のデータ構造の変形例を示す説明図である。
【図８３】図８２のスチルピクチャ・コントロール・インフォメーション・テーブル（ＳＰＣＩＴ）すなわちＡＴＳ−ＡＳＶ−ＰＢＩＴ（ＡＴＳオーディオスチルビデオ・プレイバックインフォメーションテーブル）を詳しく示す説明図である。
【図８４】図８３のＡＴＳ−ＡＳＶ−プレイバックインフォメーション・サーチポインタ（ＡＴＳ−ＰＧ−ＡＳＶ−ＰＢＩＴ−ＳＲＰ＃１〜＃ｍ）を詳しく示す説明図である。
【図８５】図８４のＡＳＶディスプレイモードを詳しく示す説明図である。
【図８６】図８３のＡＴＳ−ＡＳＶ−ＰＢＩ（ＡＴＳオーディオスチルビデオ・プレイバックインフォメーション）を詳しく示す説明図である。
【図８７】図８６のＡＳＶディスプレイリストを詳しく示す説明図である。
【図８８】図８６の他のＡＳＶディスプレイリストを詳しく示す説明図である。
【図８９】図８６の他のＡＳＶディスプレイリストを詳しく示す説明図である。
【図９０】図８６の他のＡＳＶディスプレイリストを詳しく示す説明図である。
【符号の説明】
Ａパック第１のパック
ＡＯＢオーディオオブジェクト
ＡＯＢＳオーディオオブジェクトセット
ＡＴＳＩオーディオタイトルセット・インフォメーション
ＡＴＳＩ−ＭＡＴオーディオタイトルセット・インフォメーション・マネージメント・テーブル
ＲＴＩパック第２のパック
ＳＰＳスチルピクチャセット
ＡＳＶＳオーディオ・スチル・ビデオ・セット
ＳＰＣＴパック第３のパック[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to encoding an audio signal.Method, Audio signal transmission / reception method, and computer program recording medium, And decoding method of audio signalAbout.
[0002]
[Prior art]
A CD (compact disc) is known as a conventional optical disc for audio reproduction. A DVD (digital video disk) is known as an optical disk having a higher density than a CD.
[0003]
[Problems to be solved by the invention]
However, in a DVD (hereinafter, DVD-video), a video signal is recorded as a main signal, and an audio signal is recorded as a sub signal.
(1) The audio signal is integrated with the video signal, and the recording capacity of the audio signal is small.
(2) The time of the audio signal cannot be managed.
(3) Simple character information such as a song title cannot be extracted.
[0004]
Also, since audio users have a wider range of usage layers than video, a simple reproduction method is required by providing a TOC (table of contents) area like a CD. However, in the case of DVD-video, a navigation control pack (CONT pack), a plurality of video (V) packs and an audio (A) pack constitute a video content block unit, and playback of the V and A packs is controlled by the CONT pack. Therefore, there is a problem in that even if an audio signal is mainly recorded, it cannot be easily reproduced for the user, and the usability is poor.
[0005]
Further, in DVD-Video, time management is performed only in video frame units. Therefore, even if an audio signal is mainly recorded, continuity of the audio signal is more important than video, so that real-time management is difficult. There is a problem.
[0006]
Accordingly, the present invention provides an audio signal encoding that can be easily reproduced and convenient for a user when mainly recording an audio signal, and can simplify real-time management.Method, Audio signal transmission and reception method, and computer program recording medium, And decoding method of audio signalThe purpose is to provide.
[0007]
[Means for Solving the Problems]
The present invention comprises the following means 1) to 4) to achieve the above object.
That is,
[0008]
1)Multi-channelAnalog audio signalAnd the first and second sampling frequencies (fs1, fs2) which are the same or different for the first group of channels and the second group of channels, respectively.A / D conversion to generate audio dataStepsWhen,
An audio title set (ATS) including a plurality of audio objects (AOB) and a still picture set (SPS);
SaidAudio object (AOB)But,
As actual dataSaidConstituted only by the first pack having audio dataConsists of a first type of audio cell (ATS-C)FirstAudio object (AOB)When,
The first pack and a second pack having real-time information data on the audio data as actual data.Consists of a second type of audio cell (ATS-C)SecondAudio object (AOB)It consists of two types,
In addition,Still picture set (SPS)Has a third pack containing still picture data for said audio data,
SaidThe audio title set (ATS) includes type identification data (ATS-C-TY) for identifying the type of the audio cell (ATS-C), and the plurality of audio objects (AOB) are reproduced in a predetermined reproduction order. Information (ATS-PGCIT) having audio cell playback information (ATS-C-PBI) for controlling reproduction in units of audio cells.Audio title set information (ATSI) is placed,The audio title set information (ATSI) is control information for page-controlling still picture data in the still picture set (SPS), and is a mode for identifying a sequential playback mode (sequential) and a random playback mode (random). Has still picture control information (SPCIT) including data (display order mode),
The first pack further includes a packet in which the audio data is recorded, and the packet further includes a private header in which first and second sampling frequencies (fs1, fs2) of the audio data are recorded. Is configured to have
Format to data structureStepsWhen,
Encoding of audio signals consisting ofMethod.
2)1)Encoding of audio signal described inGenerated by encodingAn audio signal transmission / reception method for transmitting or receiving data having a data structure,
Converting the data into a predetermined communication format and transmitting the data, or receiving the data converted into the predetermined communication format and restoring the original dataHavingAudio signal transmission and reception method.
3)1)Encoding of audio signal described inGenerated by encodingA recording medium of a computer program in which a computer program for decoding data in which the data structure is recorded is recorded,
A still picture is obtained from audio title set information (ATSI) arranged in the recorded audio title set (ATS).Picture control including mode data (display order mode)Decoding information (SPCIT)Mode data (display order mode)A first decoding step of extracting
Said recordedAudio title set (ATS), A second pack, and a third pack arranged in a still picture set (SPS).A second decoding step of identifying
A third decoding step of decoding audio data, real-time information data, and still picture data from each of the packs extracted from the second decoding step;
At least taken from the first decoding stepBased on mode data (display order mode)The still picture data extracted in the second decoding step isControlled alone and / or with the audio dataOutputting a computer program.
4)1)Encoding of audio signal described inGenerated by encodingThe recorded data to decode the recorded dataAudio signal decoding methodAnd
A still picture is obtained from audio title set information (ATSI) arranged in the recorded audio title set (ATS).Picture control including mode data (display order mode)Decode information (SPCIT)To retrieve mode data (display order mode)A first decoding step;
Said recordedAudio title set (ATS), A second pack, and a third pack arranged in a still picture set (SPS).IdentifyA second decoding step;
A third decoding step of decoding audio data, real-time information data, and still picture data from each of the packs extracted from the second decoding step;
At least taken from the first decoding stepBased on mode data (display order mode)Controlling the still picture data extracted in the second decoding step,Alone / or as aboveOutputting together with the audio data.Audio signal decoding method.
[0009]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is an explanatory diagram showing an embodiment of a DVD-video format and a DVD-audio format applied to the present invention. FIG. 2 is an explanatory diagram showing the format of an audio manager (AMG) in FIG. 1 in detail. 3 is an explanatory diagram showing the format of the audio title set (ATS) of FIG. 1 in detail, FIG. 4 is an explanatory diagram showing the format of the audio manager information (AMGI) of FIG. 2 in detail, and FIG. 5 is an audio title set attribute of FIG. FIG. 6 is an explanatory diagram showing the format of the table (ATS-ATRT) in detail, FIG. 6 is an explanatory diagram showing the format of the audio title set attribute data (ATS-ATR) of FIG. 5 in detail, and FIG. 7 is the audio title set information (ATS-ATR) of FIG. ATSI) Former FIG. 8 is an explanatory diagram showing the format of the audio title set information management table (ATSI-MAT) shown in FIG. 7 in detail, and FIG. 9 is an audio title set menu audio stream attribute data (ATSM) shown in FIG. FIG. 10 is an explanatory diagram showing the format of the audio title set audio stream attribute table (ATS-AST-ATRT) of FIG. 8 in detail. FIG. 11 is an explanatory diagram showing each audio stream of FIG. FIG. 4 is an explanatory diagram showing in detail attribute data (ATS-AST-ATR).
[0010]
FIG. 12 is an explanatory diagram showing the audio content block unit (ACBU) of FIG. 1, FIG. 13 is an explanatory diagram showing the format of the audio pack and video pack of FIG. 12 in detail, and FIG. FIG. 15 is an explanatory diagram showing the format of the audio character display (ACD) area in FIG. 14 in detail, and FIG. 16 is an explanatory diagram showing an example displayed by the namespace information in FIG. FIG. 17 is an explanatory diagram showing the format of the audio search data (ASD) area in FIG. 14 in detail, and FIG. 18 is an explanatory diagram showing a modification of the audio content block unit in FIG.
[0011]
Here, the DVD-audio disc described in this description has a stereo 2-channel and a 5/6 / 8-channel multi-channel as an audio signal so as to correspond to a transition period when shifting from the CD generation to the DVD-audio generation. Are recorded. When the transition period has elapsed, it is considered that only the multi-channel signal of 5/6/8 channel is recorded. Even when only a multi-channel signal is recorded, a two-channel signal can be generated from the multi-channel signal by a downmix coefficient during reproduction. The generated two-channel signal is positioned as simple reproduction.
[0012]
FIGS. 1A and 1B show DVD-video and DVD-audio formats, respectively. The DVD-audio format has different area names but is compatible with DVD-video. First, the DVD-video format is roughly composed of a video manager (VMG) at the head and each area of a plurality of video title sets (VTS) following the format. On the other hand, the DVD-audio format corresponds to the figure. 2 and an audio title set (ATS) following the AMG, as shown in detail in FIG.
[0013]
Each of the VTSs is composed of the first VTS information (VTSI), followed by one or more video content block sets (VCBS) and the last VTSI, while each of the ATSs corresponds to the first ATS information (ATSI). ), Followed by one or more audio content block sets (ACBS) and the last ATSI. In ATSI, the playing time of each song in the ACBS is set in real time.
In the present invention, menu information for displaying a menu screen is recorded in the first ACBS. This is the same as a DVD video, and the description is omitted.
[0014]
Each of the VCBSs is made up of a plurality of VCBs, while each of the ACBSs is made up of a plurality of ACBs. Each of the VCBs corresponds to one title of the video, and each of the ACBs corresponds to one title of the audio. Each VCB (one title) is constituted by a plurality of chapters (Chapter), while each ACB (one title) is correspondingly constituted by a plurality of tracks (Track). A chapter contains a part of title (PTT) and a track contains a part of title (PTT).
[0015]
Each of the chapters is constituted by a plurality of cells (CELL), while each of the tracks is correspondingly constituted by a plurality of indexes (Index). Each of the cells is composed of a plurality of VCBU units (VCBU), while each of the indexes is correspondingly composed of a plurality of ACB units (ACBU). Each of the VCB unit and the ACB unit is composed of a plurality of packs, and one pack is composed of 2048 bytes.
[0016]
Each VCB unit is composed of a head control pack (hereinafter referred to as a CONT pack), followed by an audio (A) pack, a plurality of video (V) packs, and sub-picture (SP) packs. Each of them is constituted by a head audio control pack (hereinafter, A-CONT pack) corresponding thereto, followed by a plurality of A packs and V packs.
[0017]
In the CONT pack, information for controlling the subsequent V pack is arranged. In the A-CONT pack, information for managing the audio signal of the subsequent A pack, such as the TOC information of the CD, is arranged. Audio data is arranged in the A pack, and closed caption (CC) data other than audio data, for example, is arranged in the V pack in addition to video data.
[0018]
AMG (Audio Manager), as shown in FIG.
Audio manager information (AMGI) shown in detail in FIG.
An audio content block set (AMGM-ACBS) for the AMG menu;
・ AMGI for backup
Having. AMGM-ACBS is used as control information.
・ Presentation control information (PCI)
・ Data search information (DSI)
Having.
[0019]
ATS (Audio Title Set) is as shown in FIG.
Audio title set information (ATSI) shown in detail in FIG.
An audio content block set (ATSM-ACBS) for the ATS menu;
An audio content block set (ATSA-ACBS) for ATS titles;
・ ATSI for backup
Having. Both ATSM-ACBS and ATSA-ACBS have the PCI and DSI described above (FIG. 2).
[0020]
AMGI (Audio Manager Information), as shown in detail in FIG.
・ AMGI management table (AMGI-MAT)
A title search pointer table (T-SRPT);
An audio manager menu PGCI unit table (AMGM-PGCI-UT);
-Parental management information table (PTL-MAIT)
An audio title set attribute table (ATS-ATRT);
A text data manager (TXTDT-MG);
An audio manager menu cell (index) address table (AMGM-C-ADT);
-Audio manager menu-Audio content block unit-Addressless map (AMGM-ACBU-ADMAP)
Having.
[0021]
The ATS-ATRT (audio title set attribute table), as shown in detail in FIG.
Audio title set attribute table information (ATS-ATRTI);
An audio title set attribute search pointer (ATS-ATR-SRP # 1 to #n) of each of a plurality (n) of ATSs;
-Audio title set attribute data (ATS-ATR- # 1- # n) of each of a plurality (n) of ATSs as shown in detail in FIG.
Having.
[0022]
Each of the audio title set attribute data (ATS-ATR- # 1 to #n) is, as shown in detail in FIG.
ATS-ATR-EA (end address)
-ATS-CAT (category),
・ ATS-ATRI (information)
Having.
[0023]
The ATSI (ATS information) shown in FIG. 3 is, as shown in detail in FIG.
An audio title set information management table (ATSI-MAT) shown in detail in FIG.
An audio title set, a part of title, a search pointer table (ATS-PTT-SRPT),
-Audio title set-Program chain information table (ATS-PGCIT)
・ Audio title set menu ・ PGCI ・ Unit table (ATSM-PGCI-UT)
Audio title set time map table (ATS-TMAPT)
・ Audio title set menu ・ Cell address table (ATSM-C-ADT)
-Audio title set menu-Audio content block unit-Address map (ATSM-ACBU-ADMAP)
An audio title set cell address table (ATS-C-ADT);
-Audio title set-Audio content block unit-Address map (ATS-ACBU-ADMAP)
Having.
[0024]
The ATSI-MAT (Audio Title Set Information Management Table) shown in FIG.
ATS-ID (identifier);
ATS-EA (end address),
・ ATSI-EA,
・ VERN (DVD audio specification version number)
-ATS-CAT (category),
・ ATSI-MAT-EA,
ATSM-ACBS-SA (start address)
ATSA-ACBS-SA,
ATS-PTT-SRPT-SA,
ATS-PGCIT-SA,
ATSM-PGCI-UT-SA,
ATS-TMAP-SA,
ATSM-C-ADT-SA,
ATSM-ACBU-ADMAP-SA;
ATSM-AST-ATR (ATSM audio stream attribute) as shown in detail in FIG.
ATS-AST-Ns (the number of ATS audio streams);
ATS-AST-ATRT (ATS audio stream attribute table) as shown in detail in FIG.
Having.
[0025]
The ATSM-AST-ATR is composed of 8 bytes (bits b63 to b0) as shown in detail in FIG. 9, and the following data (1) to (4) as attributes of the encoded audio signal recorded on this disc. ) Is placed (other bits are reserved).
[0026]
(1) Audio encoding mode (3 bits b63 to b61)
000b: Dolby AC-3
010b: MPEG-1 or MPEG-2 (no extended bit stream)
011b: MPEG-2 (with extended bit stream)
100b: Linear PCM audio
101b: Linear PCM audio (including 2ch + 5ch, 2ch + 6ch, 2ch + 8ch)
[0027]
(2) Quantization / DRC (Dynamic Range Control) information (2 bits b55, b54)
"11b" when the audio encoding mode is "000b"
When the audio encoding mode is “010b” or “011b”,
00b: Dynamic range control data does not exist in MPEG audio stream
01b: Dynamic range control data exists in the MPEG audio stream
10b, 11b: pending
When the audio encoding mode is “100b” or “101b”, for stereo 2ch
00b: 16 bits
01b: 20 bits
10b: 24 bits
11b: pending
[0028]
(3) Sampling frequency fs (2 bits b53, b52)
For stereo 2ch
00b: 48 kHz
01b: 96 kHz
10b: 192 kHz
(4) Number of audio channels (3 bits b50 to b48)
000b: 1ch (monaural)
001b: 2ch (stereo)
010b: 3ch
011b: 4ch
100b: (stereo 2ch + 5ch)
101b: (stereo 2ch + 6ch)
110b: 7ch
111b: (stereo 2ch + 8ch)
[0029]
As shown in detail in FIG. 11, the ATS-AST-ATRT (ATS audio stream attribute table) shown in FIG. 10 has ATS-AST-ATR for each of audio streams # 0 to # 7, and ATS-AST-ATR. Are composed of 8 bytes (total of 64 bytes).
[0030]
As shown in FIG. 11, the ATS-AST-ATR of one audio stream is 8 bytes (bits b63 to b0) similar to the audio title set menu audio stream attribute data (ATSM-AST-ATR) shown in FIG. And in addition to the attribute data (1) to (4),
(5) Multi-channel extension (1 bit b60)
(6) audio type (2-bit b59, b58);
(7) Audio application mode (2-bit b57, b56)
(8) thinning information (2 bits b47, b46) of the stream (AST);
(9) Thinning-out information for only 1 channel of LFE (Low Frequency Effect) (2 bits b45, b44)
Of each data. And, in (7) audio application mode of this DVD audio disc,
11b: 2ch + surround mode
Are recorded as band information in both (8) thinning information of the stream and (9) thinning information of only LFE1ch.
00b: full (1/1)
01b: Half (1/2)
10b: Quota (1/4)
Is recorded.
[0031]
However, (4) the number of audio channels in the ATSM-AST-ATR is always 2ch in the audio stream # 0, and the audio stream # 1 includes the front 3ch. That is, for example, when an audio signal of one title is recorded in 2 + 6 channels, a stereo signal of 2 channels is allocated to the audio stream # 0, and a front signal of 3 channels is allocated to the audio stream # 1 among the 6 channels, and a rear signal of 2 channels and LFE1ch are allocated. Assign the signal to audio stream # 2. The audio manager information management table (AMGI-MAT) shown in FIG. 4 and the audio title set information management table (ATSI-MAT) shown in FIG. Is recorded.
[0032]
When the 2 + 6 ch analog audio signal is sampled at, for example, the following sampling frequency fs and quantized with the following quantization bit number and recorded,
Stereo 2ch: 48 kHz, 20 bits
Front 3ch: 96kHz, 16 bit
Rear 2ch, LFE1ch: 48kHz, 16 bits (no thinning)
The audio title set menu, audio stream, and attribute data (ATSM-AST-ATR) shown in FIG.
(1) Audio encoding mode
101b: Linear PCM audio (including 2ch + 5ch, 2ch + 6ch, 2ch + 8ch)
(2) Quantization / DRC
01b: 20 bits
(3) Sampling frequency fs
00b: 48 kHz
(4) Number of audio channels
101b: (stereo 2ch + 6ch)
Is recorded.
[0033]
Also, ATS-AST-ATR of audio stream # 0 has
(1) Audio encoding mode
101b: Linear PCM audio (including 2ch + 5ch, 2ch + 6ch, 2ch + 8ch)
(2) Quantization / DRC
01b: 20 bits
(3) Sampling frequency fs
00b: 48 kHz
(4) Number of audio channels
001b: 2ch (stereo)
(7) Audio application mode
11b: 2ch + surround mode
(8) Thinning information of the stream
00b: full (1/1)
(9) LFE1ch only thinning information
00b: full (1/1)
Is recorded.
[0034]
Also, ATS-AST-ATR of audio stream # 1 has
(1) Audio encoding mode
101b: Linear PCM audio (including 2ch + 5ch, 2ch + 6ch, 2ch + 8ch)
(2) Quantization / DRC
00b: 16 bits
(3) Sampling frequency fs
01b: 96 kHz
(4) Number of audio channels
010b: 3ch
(7) Audio application mode
11b: 2ch + surround mode
(8) Thinning information of the stream
00b: full (1/1)
(9) LFE1ch only thinning information
00b: full (1/1)
Is recorded.
[0035]
Also, ATS-AST-ATR of audio stream # 2 has
(1) Audio encoding mode
101b: Linear PCM audio (including 2ch + 5ch, 2ch + 6ch, 2ch + 8ch)
(2) Quantization / DRC
00b: 16 bits
(3) Sampling frequency fs
00b: 48 kHz
(4) Number of audio channels
010b: 3ch
(7) Audio application mode
11b: 2ch + surround mode
(8) Thinning information of the stream
00b: full (1/1)
(9) LFE1ch only thinning information
00b: full (1/1)
Is recorded.
[0036]
Next, the A pack in which an audio stream is recorded and its control pack will be described. As shown in FIG. 12, the VCB unit is constituted by an arbitrary number of packs for 0.4 to 1.0 seconds, and the ACB unit is constituted by an arbitrary number of packs for 0.5 to 1.0 seconds. I have. The A-CONT pack in the DVD-Audio ACB unit is arranged in the third pack in the DVD-Video VCB unit.
[0037]
The A-CONT pack is basically arranged in units of 0.5 second of the audio time, and is arranged so as to be completed within the range of 0.5 to 1.0 second at the break of the index. The audio time (GOF: Group of Audio Frame unit) is indicated by the A-CONT pack, and its data position is determined by the audio frame number, the number of the first access unit pointer, and the number of frame headers. The A-pack immediately before the A-CONT pack does not force padding in units of 0.5 seconds of the audio time.
[0038]
Adjacent A-packs are arranged so that audio signals are related to each other. For example, in the case of stereo, an L-channel pack and an R-channel pack are arranged adjacently, and a multi-channel of 5/6/8 channel is used. In such a case, they are similarly arranged adjacently. The V pack is arranged adjacent to the A pack when displaying an image during reproduction of an audio signal. As shown in FIG. 13, the A-pack and the V-pack have 4-byte pack start information and 6-byte SCR (System Clock Reference: system time reference value) for 2034 bytes of user data (A data and V data). ) Information, 3-byte mux rate information, and 1-byte stuffing, and a pack header of a total of 14 bytes is added (1 pack = 2048 bytes in total). In this case, the time of the A pack in the same title can be managed by setting the SCR information as the time stamp to be “1” in the first pack in the ACB unit so as to be continuous in the same title.
[0039]
On the other hand, as shown in FIG. 14, the A-CONT pack has a 14-byte pack header, a 24-byte system header, a 1003 byte ACD (Audio Character Display) packet, and a 1007 byte ASD (Audio Search Data). ) It is composed of packets. The ACD packet includes a 6-byte packet header, a 1-byte substream ID, 636-byte ACD (audio character display) information as shown in detail in FIG. 15, and a 360-byte reserved area. The ASD packet includes a 6-byte packet header, a 1-byte substream ID, and a 1000-byte ASD (audio search data) as shown in detail in FIG.
[0040]
The 636-byte ACD information area has a 48-byte general information area and a 294-byte area for each of the first language character "1" and the second language character "2" as shown in detail in FIG. Each of these areas is composed of a 93-byte name space area, two 93-byte free space areas, and a 15-byte data pointer area. For example, as shown in FIG. 16, data for displaying a song name in Japanese is arranged in one namespace area of the character “1” of the first language and the character “2” of the second language, and In the name space area, data to be displayed in English is arranged. This display language may be determined by the disc publisher.
[0041]
The 48-byte general information includes, for example, 16-byte service level information, 12-byte language code information, 6-byte character set code information, 6-byte display item information, and 2-byte “previous ACD information. Difference information and 6-byte hold information. The 16-byte service level information indicates display size, display type, audio / video / SP distinction, stream, and the like. Characters are mandatory (mandatory), and bitmaps are optional (optional). The 12-byte language code information indicates the language of the characters "1" and "2" in 2 bytes each as in the case of the video file, and indicates up to 8 languages in one file. English is mandatory.
[0042]
The 6-byte character set code information can have a maximum of 15 character codes corresponding to the language code, and indicates the presence / absence and type of the language of the characters "1" and "2" in 1 byte. A code example is shown below.
1. ISO646
2. ISO8859-1
3. MS-JIS
The 6-byte display item information indicates the free spaces “1” and “2”, the presence / absence of a data pointer, and the ID shown in FIG. The namespace is mandatory, and the title name, music name, and artist name must be described.
[0043]
As shown in detail in FIG. 17, the 1000-byte ASD (audio search data) has 16-byte general information, 8-byte current number (No.) information, 16-byte current time information, and 8-byte title. It is composed of set search information, 8-byte title search information, 404-byte track search information, 408-byte index search information, 80-byte highlight search information, and a 52-byte reserved area.
[0044]
The 8-byte current number information includes the current title number of the title set (2 bytes: BCD), the current track number of the title set (2 bytes: BCD), and the current index number of the track (2 bytes: BCD). ) And a reserved area (2 bytes). The 16-byte current time information includes the playback time of the track (4 bytes: BCD), the remaining playback time of the track (4 bytes: BCD), the absolute time of the title (4 bytes: BCD), and the rest of the title. (4 bytes: BCD).
[0045]
The 8-byte title set search information includes the first sector number (4 bytes) of the title set and the last sector number (4 bytes) of the title set. The 8-byte title search information includes a first sector number (4 bytes) of the title and a last sector number (4 bytes) of the title. The 404-byte track search information includes a track and sector number of the title (4 bytes × 99), a first track number of the title (4 bytes), and a last track number of the title (4 bytes).
[0046]
The 408-byte index search information includes a track index and a sector number (4 bytes × 100), a first index number of the track (4 bytes), and a last index number of the track (4 bytes). The 80-byte highlight search information includes a track in-sector number (4 bytes × 10) and a track out-sector number (4 bytes × 10).
[0047]
According to such a format, the A-CONT pack for managing the audio signal of the succeeding A pack, such as the TOC information of the CD, is arranged at the head of the plurality of A packs. However, the recording capacity can be increased. The A-CONT pack can manage audio time, and the A-CONT pack can extract simple character information such as a song title related to audio data.
[0048]
Also, since TOC information such as a title, a start address, and a playing time is arranged in the A-CONT pack, even during audio reproduction, information according to a user operation is taken out from the A-CONT pack and reproduction is started. be able to. Further, by arranging the TOC information in the audio manager information (AMGI) and the audio title set information (ATSI), the necessary TOC information is stored in the memory in the playback device, and the information according to the user operation is stored in the memory. And can immediately start playback. Also, since there is no need to store a large amount of information such as program chain information (PGCI) in DVD-Video, it is possible to efficiently manage the disk.
[0049]
further,
1. If there is no image (V) data in the content,
(1) Search and random access to three levels of title, music, and index become possible.
(2) Cueing, time search, and random access in GOF (audio frame) units can be performed.
(3) The time of title, music, and index can be managed in real time.
[0050]
Also,
2. If there is image (V) data in the content,
For audio data,
In addition to the above (1) to (3),
(4) The title, the current time in the music, and the remaining time can be displayed and managed in real time.
[0051]
For video data,
(1) Search and random access to the three layers of title, PTT, and cell become possible.
(2) Cueing, time search, and random access in video frame units become possible.
(3) Title, PTT, and cell time can be managed in real time.
(4) The current time and the remaining time in the PTT or the title can be displayed and managed in video frame unit time.
[0052]
The ACBU in FIG. 1B includes the A-CONT pack and the CONT pack, but may be configured so as not to include the V pack and the CONT pack as shown in FIG. In this case, the video signal is not recorded, but the recording capacity of the audio signal is increased, the disc size can be reduced, and the reproduction function can be simplified, so that the portable reproduction is possible. Anything suitable for the device can be provided.
[0053]
19 is an explanatory diagram showing the format of the audio manager information (AMGI) in the second embodiment in detail, FIG. 20 is an explanatory diagram showing the TOC information in FIG. 19 in detail, and FIG. 21 is an audio title set information in the third embodiment. FIG. 3 is an explanatory diagram showing the format of (ATSI) in detail.
Next, a second embodiment using TOC (Table Of Contents) information will be described. As shown in FIG. 19, a TOC such as that shown in detail in FIG. 20 is additionally recorded in a free area of AMGI (Audio Manager Information), and the playback device accesses the TOC information to find the beginning of the music. Do. FIG. 20 shows, as an example, general TOC information recorded in a lead-in area of a CD, and the same information is repeatedly recorded three times. When recording on the DVD audio disk 1 of the present invention, the above-described operation may be repeated, or may not be repeated.
[0054]
Here, in the TOC information used in the CD, when point = 00 to 99, the absolute time at which each movement indicated by the numeral starts is represented by minutes (PMIN), seconds (PSEC) and frames (PFRAME). You. When point = A0, PMIN indicates the first movement, and PSEC = PFRAME = 0. When point = A1, PMIN indicates the last movement, and PSEC = PFRAME = 0. When point = A2, the absolute time at which the lead-out area starts is expressed in minutes (PMIN), seconds (PSEC) and frames (PFRAME). Therefore, the TOC information shown in FIG. 20 indicates that six songs (or six movements) are recorded on the DVD audio disc 1 (points = 01 to 06). This TOC information may be recorded in an empty area of ATSI (audio title set information) as shown in FIG. 21 instead of AMGI, and may be recorded in the ACD packet of the A-CONT pack shown in FIG. May be recorded in the reserved area (360 bytes).
[0055]
Next, a fourth embodiment will be described. FIG. 22 shows a format of a fourth embodiment of a DVD audio disk applied to the present invention, which does not include a VTS as shown in FIGS. 24 to 26, and is constituted only by an ATS. The ATS (directory) is composed of a SAMG (Structure of Simple Audio Manager), an audio manager (AMG) shown in FIG. 1B, a video and audio audio manager menu (AMGM), and an AMGI in the AMG. ATS <1> and ATS <2> are managed, and ATS <1> and ATS <2> do not include an A-CONT pack, but are configured by an A pack and an RTI pack, as shown in FIG. Have been. Also, this RTI pack is not arranged more frequently than the A pack, and about one pack is arranged every 0.5 seconds. Further, the still image pack is arranged at a predetermined position. SAMG is an area in which a SAPP table (TOC) for locating ATS <1> and ATS <2> is repeatedly described eight times. This area can be defined as one independent file.
[0056]
Here, for reference, FIG. 24 shows the format of a DVD-Van (video + audio navigation) disc. This format is roughly a DVD-video data including a video title set (VTS) and an audio navigation (navigation). The data is composed of an ANV title set (ANV-TS). More specifically, the VTS has the same configuration as the DVD video disk shown in FIG. 1A and FIG. 25 described later, while the ANV-TS has an audio manager (AMG) shown in FIG. And ATS <1> and ATS <2> managed by AMGI in AMG in a pair with VTS <1> and VTS <2>.
As shown in FIG. 25 and FIG. 1A, the format of the DVD video disc does not include the ATS or the ANV-TS, and is composed of only the VTS.
[0057]
FIG. 26 shows the format of a DVD-Avd (audio + AV data) disc. This format is based on a video title set (VTS) as DVD-video data and an audio title set (ATS) as DVD-audio data. It is configured. More specifically, the VTS includes a video manager (VMG) shown in FIG. 1A, a video and audio video manager menu (VMGM), and VTS <1> managed by a VMGI in the VMG. .
[0058]
On the other hand, the ATS pairs with the SAMG, the audio manager (AMG) shown in FIG. 1B, the audio manager menu (AMGM) for video and audio, and the audio data in the VTS <1> on the VTS side. ATS <1> managed by AMGI in AMG and ATS <2> managed by AMGI in AMG without forming a pair with the VTS side. As shown in FIG. 23, this ATS <2> does not include the A-CONT pack, but is composed of the A pack and the RTI pack.
[0059]
FIG. 27 shows audio-only title audio object attributes (AOTT-AOB-ATR) recorded on the disc as attribute data indicating the contents of audio data of the disc of the fourth embodiment. This attribute data is composed of 8 bytes (64 bits b63 to b0).
A 4-bit (b63 to b60) audio encoding mode;
1-bit (b59) downmix (DM) mode;
A multi-channel structure of 3 bits (b58 to b56);
A 4-bit (b55 to b52) quantization bit number Q1 of the channel group 1;
A 4-bit (b51 to b48) channel group 2 quantization bit number Q2;
A 4-bit (b47 to b44) sampling frequency fs1 of channel group 1;
A sampling frequency fs2 of channel group 2 of 4 bits (b43 to b40);
A 3-bit (b39 to b37) reserved area;
A 5-bit (b36 to b32) channel assignment;
-Consists of a reserved area of the remaining 32 bits (b31 to b0). The remaining 32 bits (b31 to b0) are used for attribute data of each channel.
[0060]
The above data is described in more detail below.
(1) Audio encoding mode (b63 to b60)
0000b: Linear PCM mode
0001b: Reserved for compressed audio (Dolby Digital)
0010b: Reserved for compressed audio (no MPEG2 extension)
0011b: Reserved for compressed audio (with MPEG2 extension)
0100b: Reserved for compressed audio (DTS)
0101b: Reserved for compressed audio (SDDS)
Other: reserved for other encoding modes
(2) Downmix mode (b59)
0b: Downmix stereo output enabled
1b: Downmix stereo output prohibited
(3) Multi-channel structure type (b58 to b56)
000b: Type 1
Others: Hold
[0061]
(4) Number of quantization bits Q of channel group 1 (b55 to b52)
0000b: 16 bits
0001b: 20 bits
0010b: 24 bits
Others: Hold
(5) Number of quantization bits Q of channel group 2 (b51 to b48)
"0000b" when the quantization bit number Q of the channel group 1 is "0000b"
"0000b" or "0001b" when the quantization bit number Q of the channel group 1 is "0001b"
"0000b", "0001b" or "0010b" when the quantization bit number Q of the channel group 1 is "0010b"
However, 0000b: 16 bits
0001b: 20 bits
0010b: 24 bits
Others: Hold
[0062]
(6) Sampling frequency fs1 of channel group 1 (b47 to b44)
0000b: 48 kHz
0001b: 96 kHz
0010b: 192 kHz
1000b: 44.1 kHz
1001b: 88.2 kHz
1010b: 176.4 kHz
Others: Hold
[0063]
(7) Sampling frequency fs2 of channel group 2 (b43 to b40)
"0000b" when the sampling frequency fs1 of the channel group 1 is "0000b"
"0000b" or "0001b" when the sampling frequency fs1 of the channel group 1 is "0001b"
"0000b", "0001b" or "0010b" when the sampling frequency fs1 of the channel group 1 is "0010b"
"1000b" when the sampling frequency fs1 of the channel group 1 is "1000b"
"1000b" or "1001b" when the sampling frequency fs1 of the channel group 1 is "1001b"
When the sampling frequency fs1 of the channel group 1 is “1010b”, “1000b”, “1001b”, or “1010b”
[0064]
The disk of the fourth embodiment uses the linear PCM mode. The private header of the linear PCM is as shown in FIG.
An 8-bit substream ID;
A 4-bit reserved area;
A 4-bit ISRC number;
-8-bit ISRC data;
An 8-bit private header length,
A first access unit pointer of 16 bits;
A 1-bit audio emphasis flag F1,
-It is composed of a 1-bit audio emphasis flag F2 and the like.
[0065]
29 and 30 show an encoding device. FIG. 29 is a block diagram showing an embodiment of an audio signal encoding apparatus applied to the present invention, and FIG. 30 is a block diagram showing a signal processing circuit of FIG. 29 in detail.
[0066]
In FIG. 29, the analog audio signal A is sampled by the A / D converter 31 at a sufficiently high sampling frequency (sampling period Δt), for example, 192 kHz, and is converted into a 24-bit high-resolution PCM signal, for example, to obtain a high-resolution curve α. Corresponding data column
xb1, x1, xa1, x2, xb2, x3, xa2,..., xbi, x2i-1, xai, x2i,.
Is converted to This data string (xbi, x2i-1, xai, x2i) is encoded by the signal processing circuit 32 and the memory 33 shown in detail in FIG. 30 and then applied to the DVD audio formatting unit 34.
[0067]
The configuration of the signal processing circuit 32 will be described in detail with reference to FIG. First, a band-limited low-resolution curve is converted from a data string (xbi, x2i-1, xai, x2i) corresponding to the high-resolution curve α by a low-pass filter 36 that passes a half band, for example, an FIR filter. Data string corresponding to β
xc1, *, *, *, xc2, *, *, *, xc3, *, *, *, ..., xci, *, *, *, ...
And then thinning out the data “*” from the data string by the thinning circuit 37 to obtain the data string
xc1, xc2, xc3, ..., xci, ...
Generate Here, the data sequence xci is a data sequence in which the digital data subjected to A / D conversion by the A / D converter 31 is band-limited and the sampling frequency is reduced to 1/4.
[0068]
The data string (xbi, x2i-1, xai, x2i) is thinned out by the thinning circuit 38 in the data string (xbi, x2i-1, xai, x2i).
xb1, xa1, xb2, xa2,..., xbi, xai,.
Generate
[0069]
Then, based on these data strings xci, xbi, xai, the adder 39 acting as a difference calculator calculates the difference.
xbi-xci = Δ1i
xai−xci = Δ2i
Is calculated. Here, the difference data Δ1i and Δ2i are, for example, 24 bits or less, and the number of bits may be fixed or variable.
[0070]
The allocation circuit 40 packs the data string xci and the difference data Δ1i and Δ2i into user data (see FIG. 13) (1 packet = 2034 bytes), and outputs the user data to the DVD formatting unit 34.
[0071]
The video signal V is converted into a digital signal by an A / D converter 31V, and then this digital video signal is encoded into an MPEG format by a V encoder 32V, and then packed into user data shown in FIG. 34. Then, the DVD formatter 34 packs the data into a format as shown in FIGS. The data formatted by the DVD formatter 34 is modulated by a modulation circuit 35 in a modulation method according to the disk, and a disk is manufactured based on the modulated data.
[0072]
Next, a DVD-Audio disc according to a fifth embodiment will be described with reference to FIGS. First, as shown in FIG. 31A, the data structure of the fifth embodiment roughly includes a SAMG, an AMG (audio manager), an SPS (still picture set), and a plurality of ATSs (audio title sets). ). The SPS (still picture set) is also referred to as an ASVS (audio still video set) in the following description in order to avoid confusion with the sub-picture (SP).
[0073]
ATS is
・ ATSI (ATS information)
Audio object set (AOTT-AOBS) for audio-only titles shown in detail in FIGS.
-It is composed of ATSI for backup. ATSI starts from the top
ATSI-MAT (ATSI management table) shown in detail in FIGS.
It is composed of ATS-PGCIT (ATS program chain information table) shown in detail in FIGS.
[0074]
The AOTT-AOBS is composed of a plurality of audio-only title audio objects (AOTT-AOB), as shown in detail in FIG. Each of the AOTT-AOBs is constituted by a plurality of programs (PG), and each of the programs is constituted by a plurality of cells (ATS-C).
[0075]
The AOTT-AOB includes audio data only as shown in detail in FIG. 32 (1) and audio data and real time information data (RTI data) as shown in detail in FIG. 32 (2). It is composed of two types of AOTT-AOB. Then, one or more types of AOTT-AOB are arranged in one disc or one song.
[0076]
Each program of the first AOTT-AOB including only audio data is composed of a plurality of audio cells (ATS-C), and these audio cells are composed of only a plurality of audio packs. Each program of the second AOTT-AOB including audio data and RTI data is composed of a plurality of audio cells (ATS-C), and this audio cell is composed of an RTI pack arranged at a second pack position and another pack. It is composed of audio packs arranged at positions.
[0077]
The A pack of the linear PCM is composed of 2048 bytes or less, and is composed of a 14 byte pack header and an A packet as shown in FIG. The A packet is composed of a packet header of 17, 9, or 14 bytes, a private header shown in detail in FIG. 34, and audio PCM data of 1 to 2011 bytes.
[0078]
The private header is as shown in FIG.
An 8-bit substream ID;
A 3-bit reserved area;
5-bit UPC / EAN-ISRC (Universal Product Code / European Article Number International Standard Recording Code: European Article Number-International Standard Recording Code)
-8-bit UPC / EAN-ISRC data;
An 8-bit private header length,
A first access unit pointer of 16 bits;
・ 8 bytes of audio data information (ADI)
・ Stuffing byte of 0 to 8 bytes
It consists of.
[0079]
ADI
A 1-bit audio emphasis flag,
A 1-bit reserved area;
1-bit downmix mode,
1-bit downmix code validity,
・ 4 bit downmix code,
A quantization word length “1” of a 4-bit group “1”;
A quantization word length “2” of a 4-bit group “2”;
An audio sampling frequency fs1 of a 4-bit group "1";
An audio sampling frequency fs2 of a 4-bit group "2";
A 4-bit reserved area;
・ 4-bit multi-channel type,
• Bit shift data of channel group “2” of 3 bits (see FIG. 36)
-5-bit channel assignment information (see Fig. 42);
• 8-bit dynamic range control information
.8 × 2 bit reserved area
It consists of.
[0080]
In the 8-bit (b7 to b0) UPC / EAN-ISRC data area, different data is arranged according to the UPC / EAN-ISRC number as shown in FIG. That is,
(1) When UPC / EAN-ISRC number = 1
Upper 2 bits b7, b6: reserved
Lower 6 bits b5 to b0: Country code (ISRC # 1)
(2) When UPC / EAN-ISRC number = 2
Upper 2 bits b7, b6: reserved
Lower 6 bits b5 to b0: Country code (ISRC # 2)
(3) When UPC / EAN-ISRC number = 3
Upper 2 bits b7, b6: reserved
Lower 6 bits b5 to b0: copyright holder code (ISRC # 3)
(4) When UPC / EAN-ISRC number = 4
Upper 2 bits b7, b6: reserved
Lower 6 bits b5 to b0: copyright holder code (ISRC # 4)
(5) When UPC / EAN-ISRC number = 5
Upper 2 bits b7, b6: reserved
Lower 6 bits b5 to b0: copyright holder code (ISRC # 5)
(6) When UPC / EAN-ISRC number = 6
Upper 4 bits b7 to b4: reserved
Lower 4 bits b3 to b0: recording ear (ISRC # 6)
(7) When UPC / EAN-ISRC number = 7
Upper 4 bits b7 to b4: reserved
Lower 4 bits b3 to b0: recording ear (ISRC # 7)
[0081]
In the area of the linear PCM data which is the actual data in the A-pack, the bits of the data of each channel of the group “2” are reduced and arranged in order to improve the S / N ratio and reduce bits. FIG. 36A shows PCM data of six channels (group “1” = Ch1 to Ch3, group “2” = Ch4 to Ch6) as an example, and the level range is MAX = 0 dB to MIN = −144 dB (24 bits). And the value of each channel Ch is as follows.
Lmax2> Lmax1 = Lmax3> Lmax4> Lmax5> Lmax6
Then, while the word length of Ch1 to Ch3 of the group "1" is kept as it is, and the value of Ch2 is the largest in this example, each level of Ch4 to Ch6 of the group "2" is up-shifted by (0-Lmax2) dB. Then, the LSB side 0 to 4 bits are reduced. Note that the example shown in FIG. 36 shows that each level of Ch4 to Ch6 is up-shifted by the maximum number of bits = 4 and reduced to 20 bits.
[0082]
Next, the configuration of the RTI pack will be described in detail with reference to FIG. This pack is composed of a 14-byte pack header and an RTI packet, and the RTI packet is composed of a 17- or 14-byte packet header, a private header, and 1 to 2015 bytes of RTI data. The RTI data is character information and reproduction control information relating to audio data.
[0083]
The private header of the RTI packet is
A 1-byte substream ID,
A 2-byte UPC / EAN-ISRC number and data (these are simply referred to as ISRC in the figure);
A 1-byte private header length,
1-byte RTI information ID;
・ 0-7 byte stuffing byte
It consists of.
The UPC / EAN-ISRC number and data are the UPC / EAN-ISRC number and data relating to the copyright of the still picture contained in the SPCT pack.
[0084]
Incidentally, an SPCT pack is arranged in the still picture set (audio still video set) shown in FIG. 31, and as shown in detail in FIG. 38, the SPCT pack is composed of a 14-byte pack header and SPCT packet. The SPCT packet is composed of a packet header of 22 or 19 or 9 bytes and SPCT data of 2025 bytes or less. Here, one still image is compressed according to the MPEG1 or MPEG2 system, is composed of I pictures and intra-coded pictures, is divided in one picture cell, and is arranged as SPCT data of an SPCT pack. Similarly, the packet header of the SPCT pack may include the UPC / EAN-ISRC number and data relating to the copyright of the still picture as described in the RTI pack.
[0085]
The ATSI-MAT shown in FIG. 31A is composed of 2048 bytes (relative byte positions RBP0 to 2047) as shown in detail in FIG.
An ATS identifier (ATS-ID) of 12 bytes (RBP0 to 11);
ATS end address (ATS-EA) of 4 bytes (RBP12 to 15);
A reserved area of 12 bytes (RBP 16 to 27);
An ATSI end address (ATSI-EA) of 4 bytes (RBP 28 to 31);
A version number (VERN) of 2 bytes (RBP32 to 33);
A reserved area of 94 bytes (RBP34 to 127);
4 bytes (RBP128 to 131) of ATSI-MAT end address;
A reserved area of 60 bytes (RBP132 to 191);
A 4-byte (RBP 192 to 195) AOTT VTS start address;
4 bytes (RBP196 to 199) of the start address of AOBS for AOTT or the start address of VOBS for AOTT;
A 4-byte (RBP 200 to 203) reserved area;
A 4-byte (RBP 204 to 207) ATS-PGCIT start address;
A reserved area of 48 bytes (RBP 208 to 255);
128 bytes (16 × 8) bytes (RBP 256 to 383) of AOB attributes for AOTT (AOTT-AOB-ATR) or attributes of AOTT VOB audio stream (AOTT-VOB-AST-ATR);
A coefficient (ATS-DM-COEFT # 0 to # 15) of 288 (18 × 8) bytes (RBP384 to 661) for downmixing multi-channel audio data into two channels;
A reserved area of 32 bytes (RBP672 to 703);
2 bytes (RBP 704 to 705) of still picture data attribute (ATS-SPCT-ATR) in AOBS for AOTT;
It is composed of a 1342 byte (RBP 706-2047) reserved area.
[0086]
In the area of 128 (16 × 8) bytes (RBP 256 to 383), when the ATS has AOBS for AOTT, AOTT-AOB-ATR shown in detail in FIG. 40 is described. This AOTT-AOB-ATR (b127 to b0) is sequentially from the MSB side
An audio encoding mode of 8 bits (b127 to b120);
An 8-bit (b119 to b112) reserved area;
A 4-bit (b111 to b108) channel group “1” quantization bit number Q1;
A 4-bit (b107 to b104) quantization bit number Q2 of the channel group “2”;
A sampling frequency fs1 of a 4-bit (b103 to b100) channel group "1";
A sampling frequency fs2 of a 4-bit (b99 to b96) channel group “2”;
A 3-bit (b95 to b93) multi-channel structure type;
Channel assignment of 5 bits (b92 to b88);
-Consists of a reserved area of 8 bits x 11 (b87 to b0).
[0087]
On the other hand, if the ATS does not have an AOTT AOBS, the AOTT-VOB-AST-ATR shown in FIG. 41 is described. This AOTT-VOB-AST-ATR (b127 to b0) is sequentially from the MSB side.
An audio encoding mode of 8 bits (b127 to b120);
An 8-bit (b119 to b112) reserved area;
A quantization bit number Q of 4 bits (b111 to b108);
A 4-bit (b107 to b104) reserved area;
A sampling frequency fs of 4 bits (b103 to b100);
A 4-bit (b99 to b96) reserved area;
A 3-bit (b95 to b93) multi-channel structure type;
Channel assignment of 5 bits (b92 to b88);
A 3-bit (b87-b85) decoding audio stream number;
A 5-bit (b84 to b80) reserved area;
A 2-bit (b79, b78) MPEG audio DRC;
A 2-bit (b77, b76) reserved area;
The number of 4-bit (b75 to b72) compressed audio channels;
-Consists of a reserved area of 8 bits x 9 (b71 to b0).
[0088]
The above data is shown in detail below. However, the number of quantization bits, the sampling frequency, and the multi-channel type are the same as those in FIG.
(1) Audio encoding mode (b127 to b120)
00000000b: Linear PCM mode
00000001b: Reserved for compressed audio (Dolby Digital)
00000010b: Reserved for compressed audio (without MPEG2 extension)
00000011b: Reserved for compressed audio (with MPEG2 extension)
00000100b: Reserved for compressed audio (DTS)
00000101b: Reserved for compressed audio (SDDS)
Other: reserved for other encoding modes
[0089]
(8) Channel assignment (b92 to b88)
FIG. 42 shows channel assignment information of groups “1” and “2” from 1 channel (monaural) to 6 channels. Incidentally, the symbols shown in the figure will be described below.
C (mono): monaural
L, R: 2 channel stereo
Lf: Multi-channel left front
Rf: Multi-channel light front
C: Multi-channel center
LFE: Multi-channel Low Frequency Effect
S: Multi-channel surround
Ls: Multi-channel left surround
Rs: Multi-channel light surround
[0090]
(9) “0” or “1” of the number of decoding audio streams (b87 to b85)
(10) DRC for MPEG audio (b79, b78)
00b: DRC data does not exist in the MPEG audio stream.
01b: DRC data exists in the MPEG audio stream.
[0091]
(11) Number of compressed audio channels (b75 to b72)
"1111b" when the audio encoding mode is linear PCM audio
0000b: 1ch (mono)
0001b: 2ch (stereo)
0010b: 3ch
0011b: 4ch
0100b: 5ch
0101b: 6ch
0110b: 7ch
0111b: 8ch
Others: Hold
[0092]
In the 288 (18 × 16) byte area (RBP 384 to 671) shown in FIG. 39, table numbers “0” to “15” as shown in FIG. 43 for downmixing multi-channel audio data into two channels. Are described in 18 bits (ATS-DM-COEFT # 0 to # 15).
[0093]
The 2-byte (RBP 704, 705) area shown in FIG. 39 is used to describe the still picture data attribute (ATS-SPCT-ATR) in the AOBS for AOTT.
A 2-bit (b15, b14) video compression mode;
A 2-bit (b13, b12) TV system;
An aspect ratio of 2 bits (b11, b10);
A display mode of 2 bits (b9, b8);
A 2-bit (b7, b6) reserved area;
A 3-bit (b5 to b3) source picture resolution;
-It is composed of a 3-bit (b2 to b0) reserved area.
[0094]
The contents of the ATS-SPCT-ATR will be described in detail below.
(1) Video compression mode (b15, b14)
00b: MPEG1 compatible
01b: MPEG2 compatible
Other: pending
(2) TV system (b13, b12)
00b: 525/60
01b: 625/60
Other: pending
(3) Aspect ratio (b11, b10)
00b: 4: 3
11b: 16: 9
Other: pending
(4) Display mode (b9, b8)
00b: pending
01b: Hold
10b: Only letterbox allowed
11b: Not described.
(5) Resolution of source picture (b5 to b3)

Other: pending
[0095]
The ATS-PGCIT (ATS program chain information table) shown in FIG. 31A is, as shown in detail in FIG.
An audio title set PGCI table information (ATS-PGCITI) detailed in FIG. 46;
47. n audio title set PGCI search pointers (ATS-PGCI-SRP # 1 to #n) shown in detail in FIGS.
-It is composed of a plurality of audio title sets PGCI shown in detail in FIG.
[0096]
The ATS-PGCITI is composed of 8 bytes as shown in detail in FIG.
The number of 2-byte ATS-PGCI-SRPs # 1 to #n;
A 2-byte reserved area,
-Consists of a 4-byte ATS-PGCIT end address. Each of the ATS-PGCI-SRPs # 1 to #n is composed of 8 bytes as shown in detail in FIG.
-The 4-byte ATS-PGC category (ATS-PGC-CAT) shown in detail in FIG.
-Consists of a 4-byte ATS-PGCI end address.
[0097]
The above 4-byte (b31 to b0) ATS-PGC categories are arranged in order from the top as shown in FIG.
1-bit (b31) entry type;
The number of ATS audio titles (ATS-TTN) of 7 bits (b30 to b24);
A 2-bit (b23, b22) block mode;
A 2-bit (b21, b20) block type;
The number of 4-bit (b19 to b16) audio channels;
An 8-bit (b15 to b8) audio encoding mode;
-Consists of an 8-bit (b7 to b0) reserved area.
[0098]
The contents of the category (ATS-PGC-CAT) will be described in detail below.
(1) Entry type (b31)
0b: Not entry PGC
1b: Entry PGC
(2) Number of ATS audio titles (b30 to b24)
The number of audio titles of this ATS is described in the range of “1” to “99”.
(3) Block mode (b23, b22)
00b: Not ATS-PGC of ATS-PGC block
01b: First ATS-PGC of ATS-PGC block
10b: pending
11b: last ATS-PGC of ATS-PGC block
(4) Block type (b21, b20)
00b: not part of this block
01b: Difference block only in audio coding mode
10b: Difference block of audio channel only
11b: Block of difference between audio coding mode and audio channel
(5) Number of audio channels (b19 to b16)
0000b: 2 channels or less
0001b: More than 2 channels
[0099]
As shown in detail in FIG. 49, each of the audio title sets PGCI (ATS-PGCI) shown in FIG.
-ATS-PGC general information (ATS-PGC-GI) shown in detail in FIGS.
ATS program information table (ATS-PGIT) shown in detail in FIGS.
-It is composed of an ATS cell playback information table (ATS-C-PBIT) shown in detail in Figs.
[0100]
The ATS-PGC-GI is composed of 16 bytes (RBP0 to 15) as shown in detail in FIG.
-ATS-PGC content (ATS-PGC-CNT) of 4 bytes (RBP0 to 3) shown in detail in FIG.
ATS-PGC playback time (ATS-PGC-PB-TM) of 4 bytes (RBP4 to 7);
A 2-byte (RBP8, 9) reserved area;
A 2-byte (RBP10, 11) ATS-PGIT start address;
A 2-byte (RBP12, 13) ATS-C-PBIT start address;
-Consists of a 2-byte (RBP14, 15) reserved area.
[0101]
The ATS-PGC contents of 4 bytes (b31 to b0) are sequentially arranged from the top as shown in FIG.
A 17-bit (b31 to b15) reserved area;
· Number of 7-bit (b14 to b8) programs
-It is constituted by the number of cells of 8 bits (b7 to b0). The number of programs is in the range of “1” to “99”, and the number of cells is in the range of “1” to “255”.
[0102]
The ATS program information table (ATS-PGIT) shown in FIG. 49 includes n ATS program information (ATS-PGI) # 1 to #n as shown in detail in FIG. Each of the ATS-PGIs # 1 to #n is composed of 20 bytes (RBP0 to RBP19) as shown in detail in FIG.
-ATS-PG content (ATS-PG-CNT) of 4 bytes (RBP0 to 3) shown in detail in FIG.
1-byte (RBP4) ATS-PG entry cell number;
A 1-byte (RBP5) reserved area;
The start presentation time (FAC-S-PTM) of the first audio cell of the ATS-PG of 4 bytes (RBP 6 to 9);
An ATS-PG playback time of 4 bytes (RBP10 to 13);
ATS-PG pause time of 4 bytes (RBP14-17),
A 1-byte (RBP18) reserved area (for copyright management data CMI);
• 1-byte (RBP19) reserved area
It consists of.
[0103]
As shown in detail in FIG. 54, the ATS-PG contents of 2 bytes (b31 to 0) are sequentially arranged from the top as shown in FIG.
-1 bit (b31) of the relationship between the previous and current PG (R / A);
1-bit (b30) STC discontinuity flag (STC-F);
The number of attributes (ATRN) of 3 bits (b29 to b27);
3 bits (b26 to b24) of a channel group (ChGr) “2” bit shift data;
A 2-bit (b23, b22) reserved area;
1-bit (b21) downmix mode (DM);
・ Effectiveness (illustration *) of 1-bit (b20) downmix coefficient,
A 4-bit (b19 to b16) downmix coefficient table number (DM-COEFTN);
-Each bit is composed of RTI flags F15 to F0 each having 1 bit and a total of 16 bits (b15 to b0).
[0104]
The ATS cell playback information table (ATS-C-PBIT) shown in FIG. 49 includes n ATS cell playback information (ATS-C-PBI) # 1 to ATS-C-PBI, as shown in detail in FIG. #N. Each of the ATS-C-PBIs # 1 to #n is composed of 12 bytes (RBP0 to 11) as shown in detail in FIG.
An ATS-C index number of 1 byte (RBP0);
An ATS-C type (ATS-C-TY) of 1 byte (RBP1) shown in detail in FIG.
A 2-byte (RBP2, 3) reserved area;
A 4-byte (RBP4 to 7) ATS-C start address;
-Consists of 4 bytes (RBP8 to 11) of ATS-C end address.
[0105]
As shown in detail in FIG. 57, the ATS-C type of 1 byte (b7 to b0) is
A 2-bit (b7, b6) ATS cell element (ATS-C-COMP);
A 2-bit (b5, b4) reserved area;
-It is configured for 4-bit (b3 to b0) ATS cell use (ATS-C-Usage).
[0106]
The contents of the above data are shown in detail below.
(1) ATS cell element (b7, b6)
00b: audio cell consisting of audio data only
01b: Audio cell composed of audio data and real-time information
10b: Silence cell consisting only of audio data for silence
11b: Picture cell consisting only of still pictures
(2) ATS cell use (b3 to b0)
0000b: No description
0001b: Spotlight part
Others: Hold
[0107]
Next, an encoding device according to a fifth embodiment will be described. 58 and 59 show the configuration and processing of the encoding device, respectively. The analog audio signal A is sampled by the A / D converter 31 at a sufficiently high sampling frequency (sampling period Δt), for example, 192 kHz, and is converted into a 24-bit high-resolution PCM signal, for example. In the subsequent bit shift / signal processing circuit 32, when compression is not performed, the PCM data converted by the A / D converter 31 is directly applied to the DVD formatting unit. On the other hand, when performing compression, the PCM data converted by the A / D converter 31 is compressed by the bit shift / signal processing circuit 32 according to the encoding mode, and then applied to the DVD formatting unit 34. (Steps S5 and S6). In the bit shift / signal processing circuit 32, each channel of the group “2” is bit-shifted.
[0108]
The video signal V is converted into a digital signal by the A / D converter 31V, and then the digital video signal is encoded into the MPEG format by the V encoder 32V and applied to the DVD formatter 34 (steps S1, S2). . Further, the still image signal SP is converted into a digital signal by the A / D converter 31SP, and then the digital still image signal SP is encoded into the MPEG format by the compression encoder 32SP and applied to the DVD formatter 34 (step S3). , S4). The copyright information and the real-time text information (RTI) are applied to the DVD formatting unit 34 via the interface (I / F) 40 (steps S7 and S8), and the character information and the disc identifier EX are applied to the DVD formatting unit 34 (step S7). S9, S10).
[0109]
Then, the DVD formatting unit 34 performs packing in the format as described above (step S11). The data formatted by the DVD formatter 34 is modulated by a modulation circuit 35 according to a modulation method corresponding to the disk, and a disk is manufactured based on the modulated data, recorded once in the recording unit 38, or communicated. The data is transmitted via the I / F 39 (step S12).
[0110]
FIG. 60 shows a specific configuration of the decoding device of the fifth embodiment, and FIG. 61 functionally shows the configuration of FIG. FIG. 62 shows the processing. 60 and 61, first, when a music selection, reproduction, fast-forward, and stop operation are performed by the operation unit 18 or the remote control device 19, the control unit 23 controls the drive device 2 and the reproduction device 17 according to the operation. During reproduction, the pit data recorded on the DVD audio disk 1 is read by the drive device 2 and then EFM demodulated.
[0111]
In the reproducing device 17, this signal is sent to the still image and V pack detecting section 3 and the A and RTI pack detecting sections 9. When a still image pack and a V pack are recorded on the disk 1, the still image and V pack detection unit 3 detects the still image pack and the V pack in the reproduced data and sets the control parameters in the parameter unit 8. At the same time, the still image pack and the V pack are sequentially written into the still image and V pack buffer 4. The still image pack and the user data (video signal, still image information) in the V pack written in the V pack buffer 4 are transferred to the still image pack and the SCR in the V pack by the buffer extracting unit 5 (see FIG. 13). Are extracted in the order of packs and in order of output time, and then output as analog video signals via the decompression and image conversion unit 6, the D / A conversion unit 7, and the video output terminals 15, 15 '.
[0112]
The A and RTI pack detector 9 detects the A and RTI packs in the reproduced data, sets the control parameters in the parameter section 14, and writes the A and RTI packs into the A and RTI pack buffers 10 sequentially. The user data (audio signal, real-time information) in the A pack and the RTI pack written in the A and RTI pack buffers 10 are extracted by the buffer extracting unit 11 in the order of packs and in the order of output time. The audio signal is output as an analog audio signal via the PCM conversion and bit shift / signal processing unit 12, the D / A conversion unit 13, and the audio output terminal 16. The real-time information is sent to the display signal generation unit 20 to generate a display signal. The display signal is output through the display signal output terminal 22 or is output to the built-in character display unit 21.
[0113]
The processing of this decoding device will be described with reference to FIG. First, the recording data is read out by accessing the disk 1 (step S20), and then, in each of the separation steps S21 to S29, a video signal, a still image signal, an audio signal, copyright information and real-time information (RTI), and text Information and disc identifier (EX) are separated. Next, each of the separated data is decoded in each of the decoding steps S22 to S30, and is then synchronously reproduced (steps S31 and S32).
[0114]
Here, there are the following three types of processing for reproducing the still image SP.
1) When the still image SP is obtained, the reproduction of the audio signal A is interrupted and muted.
2) When the still picture SP is obtained, the still picture SP is reproduced together with the audio signal A based on the time control signal. This is called a slide show.
3) When a still image SP is obtained, a page turning reproduction is performed based on a page turning command instructed by the user. At this time, the audio signal A is reproduced as it is. This is called browserable.
[0115]
When it is necessary to synchronize a still image with audio, a time control signal for real-time synchronization is provided under a still picture control information table (SPCIT) provided in addition to the ATSI shown in FIG. It is placed in the time control data information (SPCIT-TCDI).
[0116]
Still picture page control command information (SPPI) containing a page turning command is placed below SPCIT. As described above, SPCIT is composed of SPCIT general information (SPCIT-GI) of general information, time control data information (SPCIT-TCDI), and still picture page control command information (SPPI). .
[0117]
In addition, the still picture data of the SPCT pack in FIG. 38 may include side information for controlling a still picture page. The page control data specified by the side information is interpreted with reference to the SPPI. If there is not enough space to store the still picture data, it is acceptable to include the side information for controlling the still picture page in the RTI data of the RTI pack.
[0118]
Next, an embodiment for transmitting the digital audio signal formatted as described above via a communication line will be described. First, the packing device on the transmitting side will be described with reference to FIGS. As shown in FIG. 63, the packing device includes a packing processing unit 30, a buffer memory 30B, a control circuit 29, an operation unit 27, and a display 28. 64 to 67, first, when the video signal V, the still image signal SP, the audio signal A, the real-time information RTI, and the disc identifier (EX) are input, in step S100, as shown in FIG. A pack is generated (Step S101), a video pack is generated (Step S102), a still image pack is generated (Step S103), and then a real-time text is generated (Step S104).
[0119]
Next, the cell (ATS-C) is managed (step S200), then the PTT (part of title) is managed (step S300), then the title (AOTT-AOB) is managed (step S400), and then the title set (AOTT) -Manage AOBS (step S500) In the following step S600, to generate an ATS, a title set is generated as shown in detail in Fig. 66 (step S601), and then a menu is generated (step S602), and then the ATS is generated. -Describe the category of PGCI (step S603), and then generate ATS-PGCIT by generating PGIT composed of PG contents including bit shift and generate PGCI (step S604). By generating A Generating a SI (step S605). Then generate AMG (step S700), finally to produce a TOC (step S800).
[0120]
Next, when transmitting the digital audio signal formatted as described above via a communication line, as shown in FIG. 67, the transmission data stored in the transmission buffer is divided into predetermined lengths and divided into packets. (Step S41), a header including the destination address is added to the head of the packet (Step S42), and this is output on the network (Step S43).
[0121]
Next, the data receiving side will be described with reference to FIGS. As shown in FIG. 68, the unpacking device on the data receiving side includes an unpacking processing unit 60, a buffer memory 60B, a parameter memory 56, a control circuit 59, an operation unit 57, and a display 58. First, as shown in FIG. 69, the header is removed from the packet received from the network (step S51), the received data is restored (step S52), and the data is transferred to the memory (step S53).
[0122]
Next, as shown in FIGS. 70 to 72, first, the AMG is decoded to detect the ATS (step S1100). In the subsequent step S1200, in order to decode the ATSI of the target ATS, as shown in FIG. ATS-PGCI category is decoded (step S1201), PGIT composed of PG content including bit shift is decoded (step S1202), then MAT attributes and coefficients are decoded (step S1203), and these are decoded. Each parameter is set in the parameter memory 56 (step S1204).
[0123]
Next, when the reproduction is started, the pack is identified (step S1300), and in the subsequent step S1400, the audio pack is decoded (step S1401) as shown in detail in FIG. 72 to decode the pack, and then the video pack is decoded. (Step S1402), then decode the still image pack (Step S1403), and then decode the real-time text (Step S1404). Then, an audio signal, a video signal, a still image signal, and a real-time text signal decoded from each of these packs are output (step S1500), and the processing of steps S1300 to S1500 is repeated during reproduction.
[0124]
Next, the SPS (still picture set) shown in FIG. 31, that is, the ASVS (audio still video set) will be described in detail with reference to FIGS. Here, a still picture object set (SPOBS) described below is also called an audio still video object set (ASVOBS) to avoid confusion with a subpicture (SP). As shown in FIG. 73, the ASVS (audio still video set) includes an ASVS information (ASVSI) shown in detail in FIGS. 74 and 75, and an audio still video object set (ASVOBS) shown in detail in FIG. And a backup ASVSI.
[0125]
The ASVS information (ASVSI) includes audio still video unit information (ASVUI) shown in detail in FIG. 74, an ASV address map (ASV-ADMAP) shown in FIG. 75, and a stuffing area (00h). .
[0126]
ASVUI (888 bytes in total) is as shown in FIG.
A 12-byte ASVS-ID,
The number of 2-byte ASVUs,
A 2-byte holding area;
A 4-byte ASVOBS start address;
A 4-byte ASVOBS end address;
2 bytes × 4 ASVU attributes # 0 to # 3,
4 bytes × 16 ASVOBS sub-picture palettes # 0 to # 15,
-8 bytes x 99 ASVU # 1 to # 99 general information
It consists of.
[0127]
The ASV address map (ASV-ADMAP) shown in FIG. 73 is composed of m (≦ 99) ASVUs # 1 to #m as shown in detail in FIG. 75, and each of the ASVUs # 1 to #m is an ASVU # 1 to # 99.
[0128]
Next, the audio still video object set (ASVOBS) shown in FIG. 73 will be described. The audio still video object (ASVOB) is presentation data of one audio still video (ASV), and highlight (HLT) information data for a button and sub-picture (SP) data for the button. And still picture (SPCT) data. However,
-One ASVOB contains only one still picture (SPCT) data.
One ASVOB can include only one highlight (HLT) information data. The highlight information is used to operate a still picture button.
One ASVOB can include one to three sub-picture (SP) data according to the still picture mode. The SP data is used to display a still picture button.
[0129]
The audio still video object set (ASVOBS) shown in FIG. 73 is an aggregate of the above ASVOBs as shown in FIG. As shown in FIG. 76 (a), the ASVOB places a highlight (HLT) pack at the top but does not function as an HLT pack as an empty pack containing no highlight information. As shown in FIG. 76 (b), there are two types: a pack including only a pack, a highlight (HLT) pack, and a pack including a sub-picture (SP) pack and a still picture (SPCT) pack.
[0130]
As shown in FIG. 77A, the highlight (HLT) pack includes a 14-byte pack header, a system header, and a 2013-byte highlight information packet. The highlight information packet includes a 6-byte packet header, a 1-byte substream ID, and 694-byte highlight information (ASV-HLI). The system header is composed of a 4-byte system start code, a 2-byte header length, a 3-byte rate bound, a 2-byte audio bound, a 1-byte restriction flag, and a 9-byte stream ID area. ing.
[0131]
As shown in FIG. 78, the highlight information (ASV-HLI) includes general ASV highlight information (22 bytes), an ASV button color information table (8 bytes × 3), and an ASV button information table (18 bytes × 36). Consists of The ASV button information table includes ASV button information # 1 to #n, and each ASV button information #i includes an ASV button command which is a picture control command. In this ASV button command, a navigation command when the button shown in FIG. 81 is operated is described.
[0132]
On the other hand, as shown in FIG. 77 (b), the SPCT pack and the SP pack are composed of a 14-byte pack header and a 2025-byte still picture packet or sub-picture packet. , 6 byte SCR, 3 byte program mux rate, and 9 or 22 byte stuffing length.
[0133]
The still picture (SPCT) packet includes an SPCT packet header and video data for a still picture as shown in detail in FIG. The SPCT packet header has 9 bytes of SPCT packet information always provided at the beginning, 5 + 5 bytes of SPCT packet information provided only for the first SPCT packet of a still image, and only for the first SPCT packet of ASVOB. Contains the provided 3-byte SPCT packet information.
[0134]
A sub-picture (SP) packet includes an SP packet header and sub-picture data as shown in detail in FIG. This SP packet header has 9 bytes of SP packet information always provided at the head, 5 bytes of SP packet information provided only in the case of the first SP packet of the SP unit, and only in the case of the first SP packet of the ASVOB. It contains the provided 3-byte SP packet information. On the decoder side based on such a data structure, the main picture, the sub-picture, and the highlight information are combined and displayed as shown in FIG.
[0135]
Next, a data structure of a modification of the fifth embodiment will be described with reference to FIG. Here, in the data structure shown in FIG. 31B, the still picture control information table (SPCIT) is provided in the ATS independently of the ATSI-MAT and the ATS-PGCIT. In the example, as shown in FIG. 82, it is provided in the ATS-PGCIT. In the following description, SPCIT is referred to as ATS-ASV-PBIT (ATS audio still video playback information table) in order to avoid confusion with a sub-picture (SP).
[0136]
This ATS-ASV-PBIT is additionally provided in the ATS-PBIT shown in FIG. 49, as shown in detail in FIG. 83, and each ATS of m programs # 1 to #m shown in FIGS. 84, 85 in detail. It has an ASV-playback information search pointer (ATS-PG-ASV-PBI-SRP # 1 to #m) and n ATS-ASV-PBIs # 1 to #n shown in detail in FIGS. (N ≦ m ≦ 99). Each of the SRPs # 1 to #m is, as shown in FIG.
A 1-byte ASVU number,
1-byte ASV display mode (ASV-DMOD)
A 2-byte ATS-ASV-PBI start address;
• ATS-ASV-PBI end address of 2 bytes
It consists of.
[0137]
The ASVU number is a value in a range from “1” to “99”. The ASV display mode is as shown in FIG.
A 4-bit (b7 to b4) reserved area;
A 2-bit (b3, b2) display timing mode;
・ Display order mode of 2 bits (b1, b0)
It consists of. The above data is shown in detail below.
(1) Display timing mode
00b: Slide show
01b: Browserable
Other: pending
(2) Display order mode
00b: Sequential
01b: random
11b: Shuffle
Other: pending
[0138]
Each of the ATS-ASV-PBIs # 1 to #n includes an ASV display list of 10 bytes × k (k ≦ 99) as shown in FIG. FIG. 87 shows an ASV display list when the display timing mode (b3, b2) is the slide show (00b) and the display order mode (b1, b0) is the sequential (00b).
An 8-bit (b79 to b72) ASV number;
An 8-bit (b71 to b64) reserved area;
An 8-bit (b63 to b56) button number (FOSL-BTNN) forcibly selected at the start of ASV;
An 8-bit (b55 to b48) program number to be played back at the start of ASV;
Display start timing (31 to 0) of 8 bits × 4 (b47 to b16);
A 4-bit (b15-b12) start effect mode;
A 4-bit (b11 to b8) start effect period;
A 4-bit (b7 to b4) end effect mode;
・ 4 bits (b3 to b0) end effect period
It consists of.
[0139]
The contents of the above data are shown in detail below.
(1) Display start timing (31 to 0) indicates the display start timing from the start PTS, and represents 31 to 0/90000 (seconds).
(2) Start effect mode (b15 to b12)
0000b: Cut-in
0001b: Fade in
0010b: Dissolve
0011b: Wipe from Top
0100b: Wipe from bottom
0101b: Wipe from Left
0110b: Wipe from Light
0111b: Wipe Diagonal Left
1000b: Wipe Diagonal Light
Others: Hold
(3) End effect mode (b7 to b4)
0000b: Cut out
0001b: Fade out
Others: (2) Same as start effect mode (b15 to b12)
[0140]
FIG. 88 shows an ASV display list when the display timing mode (b3, b2) is the slide show (00b) and the display order mode (b1, b0) is random (01b) in the display mode of FIG. 87 is the same as the list shown in FIG. 87 except that 8 bits (b79 to b72) are reserved. FIG. 89 shows an ASV display list when the display timing mode (b3, b2) is browsable (01b) and the display order mode (b1, b0) is sequential (00b) in the display mode of FIG. The list is the same as the list shown in FIG. 97 except that 8 bits (b55 to b48) are reserved. FIG. 90 shows an ASV display list when the display timing mode (b3, b2) is browsable (01b) and the display order mode (b1, b0) is random (01b) in the display mode of FIG. The list is the same as the list shown in FIG. 87 except that 8 bits (b79 to b72) and 8 bits (b55 to b48) are reserved.
[0141]
The encoding device and the decoding device can also be realized by storing the encoding method and the decoding method as a computer program on an IC chip such as a ROM, and operating a CPU (Central Processing Unit) of the computer with the program. The present invention can be applied not only to transmission via a recording medium such as a DVD, but also to transmission via a communication line such as the Internet or a karaoke communication line and processing on a reproduction side by hardware or an application on a PC. Can be applied.
[0142]
【The invention's effect】
As described above, according to the present invention, when an audio signal is mainly recorded, the audio signal can be easily reproduced for the user, the usability is good, and the management in real time can be simplified.
[Brief description of the drawings]
FIG. 1 is an explanatory diagram showing a first embodiment of a DVD-video format and a DVD-audio format applied to the present invention.
FIG. 2 is an explanatory diagram showing a format of an audio manager (AMG) of FIG. 1 in detail.
FIG. 3 is an explanatory diagram showing a format of an audio title set (ATS) of FIG. 1 in detail.
FIG. 4 is an explanatory diagram showing the format of audio manager information (AMGI) in FIG. 2 in detail;
FIG. 5 is an explanatory diagram showing a format of an audio title set attribute table (ATS-ATRT) of FIG. 4 in detail.
FIG. 6 is an explanatory diagram showing the format of the audio title set attribute data (ATS-ATR) of FIG. 5 in detail.
FIG. 7 is an explanatory diagram showing the format of the audio title set information (ATSI) of FIG. 3 in detail.
FIG. 8 is an explanatory diagram showing the format of the audio title set information management table (ATSI-MAT) of FIG. 7 in detail.
9 is an explanatory diagram showing the audio title set menu, audio stream, and attribute data (ATSM-AST-ATR) of FIG. 8 in detail.
FIG. 10 is an explanatory diagram showing the format of an audio title set audio stream attribute table (ATS-AST-ATRT) of FIG. 8 in detail.
FIG. 11 is an explanatory diagram showing in detail attribute data (ATS-AST-ATR) of each audio stream in FIG. 10;
FIG. 12 is an explanatory diagram showing an audio content block unit (ACBU) of FIG. 1;
13 is an explanatory diagram showing the format of the audio pack and the video pack of FIG. 12 in detail.
FIG. 14 is an explanatory diagram showing the format of the audio control (A-CONT) pack of FIG. 12 in detail.
FIG. 15 is an explanatory diagram showing in detail a format of an audio character display (ACD) area in FIG. 14;
FIG. 16 is an explanatory diagram showing an example displayed by the namespace information of FIG. 15;
FIG. 17 is an explanatory diagram showing a format of an audio search data (ASD) area in FIG. 14 in detail;
FIG. 18 is an explanatory diagram showing a modification of the audio content block unit of FIG. 1;
FIG. 19 is an explanatory diagram showing in detail a format of audio manager information (AMGI) in the second embodiment.
20 is an explanatory diagram showing the TOC information of FIG. 19 in detail.
FIG. 21 is an explanatory diagram showing the format of audio title set information (ATSI) of the third embodiment in detail.
FIG. 22 is an explanatory diagram showing a basic format of a DVD-Audio disc according to the fourth embodiment.
FIG. 23 is an explanatory diagram showing an audio data structure of the DVD-audio disc of FIG. 22.
FIG. 24 is an explanatory diagram showing a basic format of a DVD-Van disc.
FIG. 25 is an explanatory diagram showing a basic format of a DVD video disk.
FIG. 26 is an explanatory diagram showing a basic format of a DVD-Avd disk.
FIG. 27 is an explanatory diagram showing AOTT-AOB-ATR in the DVD-Audio disc of the fourth embodiment.
FIG. 28 is an explanatory diagram showing a private header of the linear PCM on the DVD-Avd disc of the fourth embodiment.
FIG. 29 is a block diagram showing an embodiment of an audio signal encoding device applied to the present invention.
FIG. 30 is a block diagram showing the details of the signal processing circuit of FIG. 29;
FIG. 31 is an explanatory diagram showing a data structure of the fifth embodiment.
FIG. 32 is an explanatory diagram showing the audio object set for audio-only title (AOTT-AOBS) of FIG. 31 in detail;
FIG. 33 is an explanatory diagram showing an example of the audio pack of FIG. 32 in detail;
FIG. 34 is an explanatory diagram showing the private header of FIG. 33 in detail.
FIG. 35 is an explanatory diagram showing the UPC / EAN-ISRC data of FIG. 34 in detail.
FIG. 36 is an explanatory diagram showing a bit shift of the audio data of FIG. 33;
FIG. 37 is an explanatory diagram showing in detail the real-time information (RTI) pack of FIG. 32;
FIG. 38 is an explanatory diagram showing a still picture (SPCT) pack in detail.
FIG. 39 is an explanatory diagram showing in detail the audio title set information management table (ATSI-MAT) of FIG. 31;
FIG. 40 is an explanatory diagram showing in detail an audio object attribute (AOTT-AOB-ATR) for audio-only title of FIG. 39;
FIG. 41 is an explanatory diagram showing in detail an audio-only title video object audio stream attribute (AOTT-VOB-AST-ATR) of FIG. 39;
FIG. 42 is an explanatory diagram showing the channel assignment information of FIGS. 40 and 41 in detail.
FIG. 43 is an explanatory diagram showing the downmix coefficient (ATS-DM-COEFT) of FIG. 39 in detail;
FIG. 44 is an explanatory diagram showing the still picture data attribute (ATS-SPCT-ATR) of FIG. 39 in detail.
FIG. 45 is an explanatory diagram showing in detail an audio title set program chain information table (ATS-PGCIT) of FIG. 31;
FIG. 46 is an explanatory diagram showing the ATS-PGCIT information (ATS-PGCITI) of FIG. 45 in detail;
FIG. 47 is an explanatory diagram showing an ATS-PGCI search pointer (ATS-PGCI-SRP) in FIG. 45 in detail;
FIG. 48 is an explanatory diagram showing an ATS-PGC category (ATS-PGCI-CAT) in FIG. 47 in detail;
FIG. 49 is an explanatory diagram showing the audio title set program chain information (ATS-PGCI) in FIG. 45 in detail;
50 is an explanatory diagram showing the ATS-PGC general information (ATS-PGC-GI) of FIG. 49 in detail.
FIG. 51 is an explanatory diagram showing the ATS-PGC content (ATS-PGC-CNT) of FIG. 50 in detail;
FIG. 52 is an explanatory diagram showing an ATS program information table (ATS-PGIT) in FIG. 49 in detail;
FIG. 53 is an explanatory diagram showing the ATS program information (ATS-PGI) of FIG. 52 in detail;
FIG. 54 is an explanatory diagram showing the ATS-PG content (ATS-PG-CNT) of FIG. 53 in detail;
FIG. 55 is an explanatory diagram showing an ATS cell playback information table (ATS-C-PBIT) of FIG. 52 in detail;
FIG. 56 is an explanatory diagram showing the ATS cell playback information (ATS-C-PBI) of FIG. 55 in detail;
FIG. 57 is an explanatory diagram showing the ATS-C type (ATS-C-TY) of FIG. 56 in detail.
FIG. 58 is a block diagram illustrating an encoding device according to a fifth embodiment.
FIG. 59 is a flowchart showing the processing of the encoding device in FIG. 58;
FIG. 60 is a block diagram illustrating a decoding device according to a fifth embodiment.
FIG. 61 is a block diagram functionally showing the decoding device of FIG. 60.
FIG. 62 is a flowchart showing processing of the decoding device in FIGS. 60 and 61.
FIG. 63 is a block diagram showing a packing device for transmitting an audio signal according to the fifth embodiment.
FIG. 64 is a flowchart showing a packing process of the packing device of FIG. 63.
FIG. 65 is a flowchart showing the pack generation processing of FIG. 64 in detail.
FIG. 66 is a flowchart showing the ATS generation processing of FIG. 64 in detail.
FIG. 67 is a flowchart showing a transmission process of the packing device of FIG. 63.
FIG. 68 is a block diagram showing an unpacking device for transmitting an audio signal according to the fifth embodiment.
FIG. 69 is a flowchart showing a receiving process of the unpacking device of FIG. 68.
70 is a flowchart showing an unpacking process of the unpacking device of FIG. 68.
FIG. 71 is a flowchart showing the ATSI decoding process of FIG. 70 in detail.
FIG. 72 is a flowchart showing the pack decoding process of FIG. 70 in detail.
FIG. 73 is an explanatory diagram showing an SPS (still picture set), that is, an ASVS (audio still video set) in FIG. 31 in detail;
FIG. 74 is an explanatory diagram showing ASVUI (Audio Still Video Unit Information) of FIG. 73 in detail;
FIG. 75 is an explanatory diagram showing in detail an ASV-ADMAP (audio still video address map) shown in FIG. 73;
FIG. 76 is an explanatory diagram showing ASVOBS (Audio Still Video Object Set) of FIG. 73 in detail;
FIG. 77 is an explanatory diagram showing the highlight information pack, the still picture pack, and the sub-picture pack in FIG. 76 in detail;
FIG. 78 is an explanatory diagram showing the highlight information of FIG. 77 in detail;
FIG. 79 is an explanatory diagram showing the still picture packet of FIG. 77 in detail;
FIG. 80 is an explanatory diagram showing the sub-picture packet of FIG. 77 in detail;
FIG. 81 is an explanatory diagram showing a display screen using the highlight information pack, still picture pack, and sub-picture pack shown in FIG. 78;
FIG. 82 is an explanatory diagram showing a modification of the data structure of FIG. 31;
FIG. 83 is an explanatory diagram specifically showing a still picture control information table (SPCIT) of FIG. 82, that is, an ATS-ASV-PBIT (ATS audio still video playback information table).
FIG. 84 is an explanatory diagram showing in detail ATS-ASV-playback information search pointers (ATS-PG-ASV-PBIT-SRP # 1- # m) of FIG. 83;
FIG. 85 is an explanatory diagram showing the ASV display mode of FIG. 84 in detail;
86 is an explanatory diagram showing the ATS-ASV-PBI (ATS audio still video playback information) of FIG. 83 in detail;
FIG. 87 is an explanatory diagram showing the ASV display list in FIG. 86 in detail;
FIG. 88 is an explanatory diagram showing another ASV display list in FIG. 86 in detail.
FIG. 89 is an explanatory diagram showing another ASV display list in FIG. 86 in detail.
FIG. 90 is an explanatory diagram showing another ASV display list in FIG. 86 in detail;
[Explanation of symbols]
A pack 1st pack
AOB audio object
AOBS Audio Object Set
ATSI Audio Title Set Information
ATSI-MAT audio title set information management table
RTI pack second pack
SPS Still Picture Set
ASVS Audio Still Video Set
SPCT Pack Third Pack

Claims

A / D conversion is performed for each group using the same or different first and second sampling frequencies (fs1, fs2) for the channels of the first group and the channels of the second group of the multi-channel analog audio signal , and audio data is obtained. the method comprising the steps of: generating a,
An audio title set (ATS) including a plurality of audio objects (AOB) and a still picture set (SPS);
The audio object (AOB) is
The first and the audio object (AOB) formed from a first type of the audio cell composed by only the first pack having the audio data as real data (ATS-C),
A second audio object ( ATS-C) composed of a second type of audio cell (ATS-C) composed of the first pack and a second pack having real-time information data on the audio data as actual data. AOB)
Further, the still picture set (SPS) has a third pack including still picture data related to the audio data,
The audio title set (ATS) includes type identification data (ATS-C-TY) for identifying the type of the audio cell (ATS-C), and the plurality of audio objects (AOB) are reproduced in a predetermined reproduction order. Title information including control information (ATS-PGCIT) having audio cell playback information (ATS-C-PBI) for controlling playback in units of audio cells. ATSI) are arranged, the audio title set information (ATSI), the still picture set (SPS) in the still picture data and a control information for page control of the sequential playback mode (sequential) and the random reproduction mode De has a (random) and identifying the mode data still picture control information including (Display Order Mode) (SPCIT),
The first pack further includes a packet in which the audio data is recorded, and the packet further includes a private header in which first and second sampling frequencies (fs1, fs2) of the audio data are recorded. Is configured to have
A step of formatting the data structure,
An audio signal encoding method consisting of :

An audio signal transmission / reception method for transmitting or receiving data having a data structure encoded and generated by the audio signal encoding method according to claim 1,
An audio signal transmission / reception method, comprising: converting the data into a predetermined communication format and transmitting the data; or receiving the data converted into the predetermined communication format and restoring the original data.

A recording medium of a computer program recording a computer program for decoding data recorded with a data structure encoded and generated by the audio signal encoding method according to claim 1,
The recorded audio title set audio title set information (ATSI) decodes the mode data of the still picture control information (SPCIT) including mode data of a still picture (display order mode) from (display arranged in (ATS) Order mode) , a first decoding step to retrieve
A second decoding step for identifying a first pack and a second pack arranged in the recorded audio title set (ATS) , and a third pack arranged in a still picture set (SPS) ; ,
A third decoding step of decoding audio data , real-time information data, and still picture data from each of the packs extracted from the second decoding step;
The still picture data extracted in the second decoding step is controlled and output alone and / or together with the audio data based on at least the mode data (display order mode) extracted from the first decoding step. And a recording medium for a computer program.

An audio signal decoding method for decoding data in which a data structure encoded and generated by the audio signal encoding method according to claim 1 is recorded,
The recorded audio title set audio title set information (ATSI) decodes the mode data of the still picture control information (SPCIT) including mode data of a still picture (display order mode) from (display arranged in (ATS) Order mode), a first decoding step to retrieve
A second decoding step for identifying a first pack and a second pack arranged in the recorded audio title set (ATS) , and a third pack arranged in a still picture set (SPS); ,
A third decoding step of decoding audio data, real-time information data, and still picture data from each of the packs extracted from the second decoding step;
The still picture data extracted in the second decoding step is controlled and output alone and / or together with the audio data based on at least the mode data (display order mode) extracted from the first decoding step. And an audio signal decoding method .