JP4207109B2

JP4207109B2 - Data conversion method, data conversion apparatus, data reproduction method, data restoration method, and program

Info

Publication number: JP4207109B2
Application number: JP2002114783A
Authority: JP
Inventors: 直也羽田; 京弥筒井
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2002-04-17
Filing date: 2002-04-17
Publication date: 2009-01-14
Anticipated expiration: 2022-04-17
Also published as: JP2003308096A

Description

【０００１】
【発明の属する技術分野】
本発明は、データ変換方法およびデータ変換装置、データ再生方法およびデータ再生装置、データ復元方法およびデータ復元装置、データフォーマット、記録媒体、並びにプログラムに関し、特に、コンテンツの試聴データをユーザに配布する場合に用いて好適なデータ変換方法およびデータ変換装置、データ再生方法およびデータ再生装置、データ復元方法およびデータ復元装置、データフォーマット、記録媒体、並びにプログラムに関する。
【０００２】
【従来の技術】
例えば、インターネットなどの通信ネットワーク技術の普及、情報圧縮技術の向上、更に、情報記録媒体の高集積化、あるいは高密度化が進んだことなどにより、オーディオ、静止画像、動画像、あるいは、オーディオと動画像からなる例えば映画など、様々なマルチメディアデータから構成されるデジタルコンテンツが、通信ネットワークを介して、視聴者に有料で配信されるという販売形態が実施されるようになった。
【０００３】
例えば、ＣＤ（Compact Disk）やＭＤ（Mini-Disk）（商標）などのパッケージメディア、すなわち、デジタルコンテンツが予め記録された記録媒体を販売する店舗などは、例えば、音楽データをはじめとする多数のデジタルコンテンツが蓄積された、いわゆるＭＭＫ（Multi Media KIOSK）などの情報端末を設置することにより、パッケージメディアを販売するのみならず、デジタルコンテンツを販売することが可能である。
【０００４】
ユーザは、ＭＭＫに、持参したＭＤなどの記録媒体を挿入し、メニュー画面などを参照して、購入したいデジタルコンテンツのタイトルを選択して、要求されるコンテンツの代金を支払う。代金の支払方法は、現金の投入であっても、電子マネーのやり取りであっても、あるいは、クレジットカードやプリペイドカードを用いた電子決済であっても良い。ＭＭＫは、所定の処理により、ユーザが挿入した記録媒体に、選択されたデジタルコンテンツデータを記録する。
【０００５】
デジタルコンテンツの販売者は、上述したように、ＭＭＫを用いてデジタルコンテンツをユーザに販売する以外にも、例えば、インターネットを介して、デジタルコンテンツをユーザに配信することも可能である。
【０００６】
このように、コンテンツが予め記録されたパッケージメディアを販売するのみならず、デジタルコンテンツそのものを販売する手法を取り入れることにより、更に効果的にコンテンツが流通されるようになった。
【０００７】
著作権を保護しながら、デジタルコンテンツを流通させるために、例えば、特開平２００１−１０３０４７、あるいは、特開平２００１−３２５４６０などの技術を用いることにより、デジタルコンテンツの試聴可能な部分以外を暗号化して配信し、暗号化に対する復号鍵を購入したユーザにのみ、コンテンツ全ての試聴を許可するようにすることができる。暗号化の方法としては、例えば、ＰＣＭ（Pulse Code Modulation）のデジタル音声データのビット列に対する鍵信号となる乱数系列の初期値を与え、発生した０／１の乱数系列と、配信するＰＣＭデータとの排他的論理和を、暗号化されたビット列とする方法が知られている。このように暗号化されたデジタルコンテンツが、例えば、上述したＭＭＫなどを用いて記録媒体に記録されたり、ネットワークを介して配信されることにより、ユーザに配布される。暗号化されたデジタルコンテンツデータを取得したユーザは、鍵を手に入れなければ、暗号化されていない試聴可能な部分しか試聴することができず、暗号化されている部分を復号せずに再生しても、雑音しか試聴することができない。
【０００８】
また、音声データなどを圧縮して放送したり、ネットワークを介して配信したり、圧縮されたデータを、例えば光磁気ディスクなどの、様々な形態の記録媒体に記録する技術も向上している。
【０００９】
音声データの高能率符号化には、様々な方法があるが、例えば、時間軸上のオーディオ信号をブロック化せず、複数の周波数帯域に分割して符号化する帯域分割符号化（ＳＢＣ（Sub Band Coding））や、時間軸上の信号を周波数軸上の信号にスペクトル変換して、複数の周波数帯域に分割し、帯域毎に符号化するブロック化周波数帯域分割方式（いわゆる、変換符号化）などがある。また、帯域分割符号化で帯域分割を行った後、各帯域において、信号を周波数軸上の信号にスペクトル変換し、スペクトル変換された帯域毎に符号化を施す手法も考えられている。
【００１０】
ここで利用されるフィルタには、例えば、ＱＭＦ（Quadrature Mirror Filter）があり、ＱＭＦについては、R. E. Crochiereによる"Digital coding of speech in subbands"（Bell Ｓyst. Tech. J. Vol.55,No.8 1974）の文献に記載されている。また、Joseph H. Rothweilerによる"Polyphase Quadrature Filters-A new subband coding technique"（ICASSP 83, BOSTON）などの文献には、等しいバンド幅のフィルタ分割手法について記載されている。
【００１１】
また、上述したスペクトル変換としては、例えば、入力オーディオ信号を所定の単位時間（フレーム）でブロック化し、そのブロック毎に、離散フーリエ変換（ＤＦＴ；Discrete Fourier Transform）、離散コサイン変換（ＤＣＴ；Discrete Cosine Transform）、モデファイドＤＣＴ変換（ＭＤＣＴ；modified Discrete Cosine Transform）などを行う方法がある。例えば、ＭＤＣＴについての詳細は、J. P. Princen, A. B. Bradley（Univ. of Surrey Royal Melbourne Inst. of Tech.）らによる"Subband / Transform Cording Using Filter Bank Designs Based on Time Domain Aliasing Cancellation"（ICASSP 1987）の論文に述べられている。
【００１２】
また、波形信号をスペクトル変換する方法として、上述したＤＦＴやＤＣＴが用いられた場合、Ｍ個のサンプルからなる時間ブロックで変換を行うと、Ｍ個の独立した実数データが得られる。時間ブロック間の接続ひずみを軽減するために、通常、両隣のブロックと、それぞれＮ／２個ずつ、すなわち、両側合わせてＮ個のサンプルをオーバーラップさせるので、ＤＦＴやＤＣＴにおいては、平均して、（Ｍ＋Ｎ）個のサンプルに対して、独立したＭ個の実数データを量子化して符号化することになる。
【００１３】
これに対して、スペクトル変換する方法として、上述したＭＤＣＴが用いられた場合には、Ｍ個のサンプルからなる時間ブロックで変換を行うと、両隣のブロックとそれぞれＭ／２個ずつ、すなわち、両側合わせてＭ個オーバーラップさせた２Ｍ個のサンプルから、Ｍ個の独立した実数データが得られるので、ＭＤＣＴでは、平均して、Ｍ個のサンプルに対して、Ｍ個の実数データを、量子化して符号化することになる。
【００１４】
復号装置においては、ＭＤＣＴを用いて得られた符号から、各ブロックを逆変換して得られた波形要素を、お互いに干渉させながら加え合わせることにより、波形信号を再構成することができる。
【００１５】
一般に、変換のための時間ブロックを長くすることによって、スペクトルの周波数分解能が高まり、特定のスペクトル成分にエネルギが集中する。従って、両隣のブロックと半分ずつオーバーラップさせることにより、長いブロック長で変換を行い、しかも、得られたスペクトル信号の個数が、基となった時間サンプルの個数に対して増加しないＭＤＣＴを用いて変換を施すことにより、変換にＤＦＴやＤＣＴを用いた場合より、効率よく符号化を行うことができる。また、隣接するブロック同士に十分長いオーバーラップを持たせることにより、波形信号のブロック間歪みを軽減することができる。
【００１６】
上述したように、フィルタリングやスペクトル変換によって、帯域毎に分割された信号を量子化することにより、量子化雑音が発生する帯域を制御することができ、マスキング効果などの性質を利用して、聴覚的に、より高能率な符号化を行うことができる。また、量子化を行う前に、帯域毎に、例えば、その帯域における信号成分の絶対値の最大値で正規化を行うようにすることにより、更に、高能率な符号化を行うことができる。
【００１７】
周波数帯域分割された各周波数成分を量子化する場合、例えば、人間の聴覚特性を考慮して、周波数分割幅が決定されるようにしても良い。すなわち、一般に臨界帯域（クリティカルバンド）と称される高域ほど帯域幅が広くなるように、オーディオ信号が複数の帯域（例えば、２５バンド）に分割されるようにしても良い。
【００１８】
また、クリティカルバンドが広くなるように帯域が分割されている場合に、帯域毎のデータが符号化されるとき、帯域毎に所定のビット配分が行われるようにしても良いし、帯域毎に適応的にビットが割り当てられる（ビットアロケーションが行われる）ようにしても良い。
【００１９】
例えば、ＭＤＣＴされて得られた係数データが、ビットアロケーションによって符号化される場合、ブロック毎のＭＤＣＴにより得られる帯域毎のＭＤＣＴ係数データに対して、それぞれ、適応的にビット数が割り当てられて、符号化が行われる。ビット割り当て手法としては、例えば、次にあげる２つの手法が知られている。
【００２０】
R. Zelinski, P. Nollらによる、"Adaptive Transform Coding of Speech Signals"（IEEE Transactions of Acoustics, Speech, and Signal Processing, Vol. ASSP-25, No. 4, August 1977）の論文では、帯域毎の信号の大きさを基に、ビット割り当てが行われることについて述べられている。この方式によると、量子化雑音スペクトルが平坦となり、雑音エネルギは最小となるが、聴覚的に考慮した場合、マスキング効果が利用されていないため、人間の耳に実際聞こえる雑音を減少する点では最適ではない。
【００２１】
また、M. A. Kransner（Massachusetts Institute of Technology）による、"The critical band coder digital encoding of the perceptual requirements of the auditory system"（ICASSP 1980）の論文には、聴覚マスキングを利用することで、各帯域毎に必要な信号対雑音比を得て、固定的なビット割り当てを行う手法が記載されている。しかしながら、この手法では、サイン波入力で特性を測定する場合においても、ビット割り当てが固定的であるために、その特性値は、それほど良い値とはならない。
【００２２】
これらの問題を解決するために、ビット割り当てに使用できる全ビットが、小ブロック毎に予め定められた固定ビット割り当てパターン分と、各ブロックの信号の大きさに依存したビット割り当てを行う分とに分割使用され、その分割比が、入力信号に関係する信号に依存され、その信号のスペクトルが滑らかなほど、固定ビット割り当てパターン分への分割比率が大きくされるようになされている高能率符号化装置が提案されている。
【００２３】
この方法を用いることにより、サイン波入力のように、特定のスペクトルにエネルギが集中する場合には、そのスペクトルを含むブロックに多くのビット数を割り当てることができるので、全体的な信号対雑音特性を著しく改善することができる。一般的に、急峻なスペクトル成分を持つ信号に対する人間の聴覚は、極めて敏感であるため、このような方法を用いて信号対雑音特性を改善することは、測定上の特性値のみならず、人間が実際に聞く音の音質を改善するのに有効である。
【００２４】
ビット割り当ての方法には、上述した以外にも、多くの方法が提案されている。更に、聴覚に関するモデルが精緻化され、符号化装置の能力が向上したことにより、測定上の特性値のみならず、人間の聴覚に対してより高能率な符号化を行うことが可能となっている。これらの方法においては、計算によって求められた信号対雑音特性を、なるべく忠実に実現するような実数のビット割り当て基準値が求められ、それを近似する整数値が求められて、割り当てビット数に設定されるのが一般的である。
【００２５】
また、本発明者が先に出願した、特願平５−１５２８６５、もしくは、WO９４／２８６３３には、生成されたスペクトル信号から、聴覚上、特に重要なトーン性の成分、すなわち、特定の周波数周辺にエネルギが集中しているような成分を分離して、他のスペクトル成分とは別に符号化する方法について記載されている。この方法により、オーディオ信号などを、聴覚上の劣化を殆ど感じさせずに、高い圧縮率で効果的に符号化することが可能となっている。
【００２６】
実際の符号列を生成する場合、まず、正規化および量子化が行われる帯域毎に、量子化精度情報および正規化係数情報が、所定のビット数で符号化され、次に、正規化、および量子化されたスペクトル信号が符号化される。また、ISO／IEC 11172-3；（1993（E）, a933）では、帯域によって量子化精度情報を表すビット数が異なるように設定された高能率符号化方式が記述されており、帯域が高域になるにともなって、量子化精度情報を表すビット数が少なくなるように規格化されている。
【００２７】
量子化精度情報を直接符号化する代わりに、復号装置において、例えば、正規化係数情報から量子化精度情報を決定する方法も知られているが、この方法では、規格を設定した時点で、正規化係数情報と、量子化精度情報との関係が決まってしまうので、将来的に、更に高度な聴覚モデルに基づいた量子化精度を用いる制御を導入することができなくなってしまう。また、実現する圧縮率に幅がある場合には、圧縮率毎に正規化係数情報と量子化精度情報との関係を定める必要が生じてしまう。
【００２８】
量子化されたスペクトル信号を、より効率的に符号化する方法として、例えば、D. A. Huffmanによる"A Method for Construction of Minimum Redundancy Codes"（Proc. I. R. E. , 40, p.1098, 1952）の論文に記載されている可変長符号を用いて効率的に符号化を行う方法も知られている。
【００２９】
以上説明したような方法で符号化された信号を、ＰＣＭ信号の場合と同様にして暗号化して配布することも可能であり、このスクランブル方法が用いられた場合には、鍵信号を入手していないものは、元の信号を再生することが出来ない。また、符号化ビット列を暗号化するのではなく、ＰＣＭ信号をランダム信号に変換した後、圧縮のために符号化を行う方法もあるが、このスクランブル方法が用いられた場合には、鍵信号を入手していないものは、雑音しか再生することが出来ない。
【００３０】
また、コンテンツデータの試聴データを配布することにより、コンテンツデータの販売を促進することができる。試聴データには、例えば、オリジナルデータよりも低音質で再生されるデータや、オリジナルデータのうちの一部（例えば、さびの部分のみ）などを再生することが出来るデータなどがある。ユーザは、試聴データを再生して、気に入った場合に、暗号を復号する鍵を購入して、オリジナルの音声を再生することができるようにしたり、オリジナルの音声データが記録された記録媒体を新たに購入しようとする。
【００３１】
しかしながら、上述したスクランブル方法では、データ全体が再生できないか、もしくは、全てが雑音として再生されるので、例えば、比較的低音質で音声を録音した記録媒体を、試聴データとして配布するという用途に利用することが出来なかった。これらの方法によりスクランブルされたデータをユーザに配布しても、ユーザは、そのデータの全体の概要を把握することができない。
【００３２】
また、従来の方法では、高能率符号化を施した信号を暗号化する場合に、通常、広く用いられている再生装置にとって、意味のある符号列を与えながら、その圧縮効率を下げないようにすることは非常に困難であった。すなわち、上述したように、高能率符号化を施すことによって生成された符号列にスクランブルをかけた場合、その符号列をデスクランブルしないまま再生しても、雑音が発生するばかりではなく、スクランブルによって生成された符号列が、元となる高能率符号の規格に適合していない場合には、再生処理が全く実行できない可能性がある。
【００３３】
また、逆に、ＰＣＭ信号にスクランブルをかけた後に高能率符号化が施された場合、例えば、聴覚の性質を利用して情報量を削ると、不可逆符号化となってしまう。従って、このような高能率符号を復号しても、ＰＣＭ信号にスクランブルをかけた信号が正しく再現できない。すなわち、このような信号は、デスクランブルを正しく行うことが非常に困難なものとなってしまう。
【００３４】
従って、たとえ、圧縮の効率が下がってしまっても、スクランブルが正しく解除できる方法が選択されてきた。
【００３５】
このような課題に対して、本発明者等は、特開平１０−１３５９４４において、例えば、音楽データをスペクトル信号に変換して符号化したもののうち、高帯域に対応する符号のみが暗号化されたデータを、試聴データとして配布することにより、鍵を保有していないユーザであっても、暗号化されていない狭帯域の信号を復号して再生することができるオーディオ符号化方式について開示した。この方式においては、高域側の符号が暗号化されるとともに、高域側のビット割り当て情報が、ダミーデータに置き換えられ、高域側の真のビット割り当て情報が、再生処理を行うデコーダが再生処理時に情報を読み取らない（無視する）位置に記録されるようになされている。
【００３６】
この方式を採用することにより、ユーザは、試聴データの配布を受けて、試聴データを再生し、試聴の結果、気に入った試聴データをオリジナルデータに復号するための鍵を有償で購入して、所望の音楽などを全ての帯域で正しく再生して、高音質で楽しむことが可能となる。
【００３７】
【発明が解決しようとする課題】
特開平１０−１３５９４４号公報において開示されている技術では、鍵を保有していないユーザは、無償で配布されるデータの狭帯域の信号しか復号されないようになされている。しかしながら、その安全性は、暗号化にのみ依存されているため、暗号が解読されてしまった場合、ユーザは、料金を支払うことなく、高音質の音楽を再生することが可能となるので、音楽データの配信者（コンテンツ提供者）は、正当に料金を徴収することができない。
【００３８】
また、コンテンツ提供者は、試聴データとして、コンテンツ全体にわたって品質を制限したものではなく、コンテンツの一部、あるいは数箇所のみ、品質を限定して、試聴を可能とし、他の部分は、品質を制限したデータであっても、試聴することができないようにしたい場合がある。
【００３９】
例えば、試聴データを無償で配布するにあたって、その楽曲のさびの部分の数十秒のみを品質を制限して再生可能として、ユーザが試聴することが可能なようにしたい場合、そのコンテンツのうちの試聴可能な箇所以外を試聴不可としなければならず、更に、試聴可能な部分を不自然なく再生することが可能な試聴データを作成することが望まれる。
【００４０】
上述したように、試聴不可の部分を試聴できないようにする技術を用いることにより、そのデータを再生しても、対応する部分（さび以外の部分）は、無音となるようにすることができる。しかしながら、さびの部分以外を無音とし、さびの部分のみが低品質で再生されるような試聴データを復号して再生する場合、データの先頭から、順次復号処理を行うため、試聴可能な部分までは、無音のまま再生されてしまい、試聴可能な部分のデータの復号処理が行われるまで、音声が全く再生出力されない不自然なものとなってしまう。
【００４１】
本発明はこのような状況に鑑みてなされたものであり、コンテンツデータの１部のみを低音質で再生可能とし、他の部分を再生不可とした試聴データを配布する場合において、再生可能部分のみが自然に再生され、かつ、試聴データのヘッダ長が変更されない試聴データを生成することができるようにし、更に、このような試聴データをオリジナルデータに復元することが出来るようにするものである。
【００４２】
【課題を解決するための手段】
本発明のデータ変換方法は、第１のデータ列に含まれ、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第１のデータを、所定の情報を再生するために用いられたときに所定の情報の再生品質を第１のデータ列における場合よりも劣化させる第２のデータに置き換える第１の置き換えステップと、第１のデータとは異なる領域に含まれ、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第３のデータのうちの少なくとも一部を、第２のデータ列を再生する場合に参照される制御情報である第４のデータに置き換えるとともに、他の部分を、第２のデータ列が再生される場合に再生データとして参照されない非参照領域とする第２の置き換えステップと、第１の置き換えステップの処理、および第２の置き換えステップの処理により生成されたデータを用いて、第２のデータ列を生成する第１の生成ステップとを含むことを特徴とする。
【００４４】
第１のデータ列および第２のデータ列は、それぞれ複数のフレームにより構成されるものとすることができ、第３のデータは、複数のフレームにそれぞれ記録されるものとすることができ、第４のデータは、対応するフレームが再生可能か否かを示す制御情報であるものとすることができる。
【００４５】
第１の生成ステップの処理により生成された第２のデータ列を第１のデータ列に復元するために必要な第３のデータ列を生成する第２の生成ステップを更に含ませるようにすることができ、第２の生成ステップの処理により生成される第３のデータ列には、第１のデータを含むとともに、第３のデータのうち第２の置き換えステップの処理により第４のデータに置き換えられた部分を含ませるようにすることができる。
【００５４】
入力されたデータを周波数成分に変換する周波数成分変換ステップと、周波数成分変換ステップの処理により周波数成分に変換されたデータを符号化する符号化ステップとを更に含ませるようにすることができ、第１の置き換えステップの処理では、符号化ステップの処理により符号化された符号化データを第１のデータ列として、第１のデータを第２のデータと置き換えさせるようにすることができ、第２の置き換えステップの処理では、符号化ステップの処理により符号化された符号化データを第１のデータ列として、第３のデータのうちの少なくとも一部を第２のデータと置き換えさせるようにすることができる。
第３のデータには、周波数成分変換ステップの処理により変換された周波数成分のスペクトル係数情報のうちの少なくとも一部を含ませるようにすることができる。
第１のデータは、符号化ステップの処理による符号化処理の正規化係数情報のうちの少なくとも一部を含むものであるとすることができる。
第１のデータは、符号化ステップの処理による符号化処理の量子化精度情報のうちの少なくとも一部を含むものであるとすることができる。
第１のデータは、符号化ステップの処理による符号化処理の量子化ユニット数を表す情報のうちの少なくとも一部を含むものであるとすることができる。
第１のデータは、所定の数値を示すデータであるとすることができ、第２のデータは、第１のデータの数値を最小化したものである
第２のデータは、第１のデータの少なくとも一部をランダムなデータに置き換えたものであるとすることができる。
【００６０】
本発明のデータ変換装置は、第１のデータ列に含まれ、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第１のデータを、所定の情報を再生するために用いられたときに所定の情報の再生品質を第１のデータ列における場合よりも劣化させる第２のデータに置き換える第１の置き換え手段と、第１のデータとは異なる領域に含まれ、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第３のデータのうちの少なくとも一部を、第２のデータ列を再生する場合に参照される制御情報である第４のデータに置き換えるとともに、他の部分を、第２のデータ列が再生される場合に再生データとして参照されない非参照領域とする第２の置き換え手段と、第１の置き換え手段、および第２の置き換え手段により生成されたデータを用いて、第２のデータ列を生成する生成手段とを備えることを特徴とする。
【００６２】
本発明の第1のプログラムは、第１のデータ列に含まれ所定の情報を再生するために用いられるデータのうちの少なくとも一部である第１のデータを、所定の情報の再生品質を劣化して再生するための第２のデータに置き換える第１の置き換えステップと、第１のデータとは異なる領域に含まれ、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第３のデータのうちの少なくとも一部を、第２のデータ列を再生する場合に参照される制御情報である第４のデータに置き換えるとともに、他の部分を、第２のデータ列が再生される場合に再生データとして参照されない非参照領域とする第２の置き換えステップと、第１の置き換えステップの処理、および第２の置き換えステップの処理により生成されたデータを用いて、第２のデータ列を生成する生成ステップとを含むことを特徴とする。
【００６３】
本発明のデータ再生方法は、第１のデータ列には、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第１のデータが含まれているとともに、第１のデータとは異なる領域に含まれ、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第３のデータが更に含まれており、第２のデータ列は、第１のデータ列における第１のデータが、所定の情報を再生するために用いられたときに所定の情報の再生品質を第１のデータ列における場合よりも劣化させる第２のデータに置き換えられ、第１のデータ列における第３のデータのうちの少なくとも一部が、第２のデータ列を再生する場合に参照される制御情報である第４のデータに置き換えられるとともに、他の部分が、第２のデータ列が再生される場合に再生データとして参照されない非参照領域とされたものであり、第２のデータ列に含まれる第４のデータを基に、第２のデータ列の再生を制御する再生制御ステップを含むことを特徴とする。
【００６４】
第４のデータは、再生制御ステップの処理により第２のデータ列の再生が制御される場合に、対応するフレームが再生可能であるか否かを指定する情報を含むものとすることができる。
【００６７】
第４のデータには、第２のデータに置き換えられた第１のデータのうちの少なくとも一部が含まれるようにすることができる。
【００６８】
第４のデータには、再生制御ステップの処理により第２のデータ列の再生が制御される場合に、制御情報に含まれる第１のデータのうちの少なくとも一部を用いて、第２のデータを復元することが可能か否かを示す情報が更に含まれているものとすることができる。
【００７４】
第２のデータ列の周波数成分を逆変換する逆変換ステップと、逆変換ステップの処理により逆変換された第２のデータ列を復号する復号ステップとを更に備えさせるようにすることができる。
第３のデータは、周波数成分変換ステップの処理により変換された周波数成分のスペクトル係数情報のうちの少なくとも一部を含むものとすることができる。
第１のデータは、符号化ステップの処理による符号化処理の正規化係数情報のうちの少なくとも一部を含むものとすることができる。
第１のデータは、符号化ステップの処理による符号化処理の量子化精度情報のうちの少なくとも一部を含むものとすることができる。
第１のデータは、符号化ステップの処理による符号化処理の量子化ユニット数を表す情報のうちの少なくとも一部を含むものとすることができる。
【００７５】
第１のデータは、所定の数値を示すデータであるものとすることができ、第２のデータは、第１のデータの数値を最小化したものであるとすることができる。
第２のデータは、第１のデータの少なくとも一部をランダムなデータに置き換えたものであるとすることができる。
【００８０】
本発明の第２のプログラムは、第１のデータ列には、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第１のデータが含まれているとともに、第１のデータとは異なる領域に含まれ、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第３のデータが更に含まれており、第２のデータ列は、第１のデータ列における第１のデータが、所定の情報を再生するために用いられたときに所定の情報の再生品質を第１のデータ列における場合よりも劣化させる第２のデータに置き換えられ、第１のデータ列における第３のデータのうちの少なくとも一部が、第２のデータ列を再生する場合に参照される制御情報である第４のデータに置き換えられるとともに、他の部分が、第２のデータ列が再生される場合に再生データとして参照されない非参照領域とされたものであるときに、第２のデータ列に含まれる第４のデータを基に、第２のデータ列の再生を制御する再生制御ステップを含むことを特徴とする。
【００８１】
本発明のデータ復元方法は、第１のデータ列には、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第１のデータが含まれており、第２のデータ列は、第１のデータ列における第１のデータが、所定の情報を再生するために用いられたときに所定の情報の再生品質を第１のデータ列における場合よりも劣化させる第２のデータに置き換えられたものであり、第２のデータ列を第１のデータ列に復元するために必要な情報を含む第３のデータ列の取得を制御する取得制御ステップと、取得制御ステップの処理により取得が制御された第３のデータ列を基に、第２のデータ列から第１のデータ列を復元する復元ステップとを含み、第３のデータ列は、第２のデータ列を復元するための第１のデータを含み、復元ステップの処理では、取得制御ステップの処理により取得が制御された第３のデータ列に含まれている第１のデータを、第２のデータ列に含まれている第２のデータと置き換えることにより、第１のデータ列を復元し、第２のデータの一部は、第２のデータ列を再生させる場合に参照される制御情報であることを特徴とする。
【０１０１】
本発明のデータ変換方法およびデータ変換装置、ならびに第1のプログラムにおいては、第１のデータ列に含まれ、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第１のデータが、所定の情報を再生するために用いられたときに所定の情報の再生品質を第１のデータ列における場合よりも劣化させる第２のデータに置き換えられ、第１のデータとは異なる領域に含まれ、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第３のデータのうちの少なくとも一部が、第２のデータ列を再生する場合に参照される制御情報である第４のデータに置き換えられるととともに、他の部分が、第２のデータ列が再生される場合に再生データとして参照されない非参照領域とされ、データの置き換えにより生成されたデータを用いて、第２のデータ列が生成される。
【０１０２】
本発明のデータ再生方法および第２のプログラムにおいては、第１のデータ列には、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第１のデータが含まれているとともに、第１のデータとは異なる領域に含まれ、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第３のデータが更に含まれており、第２のデータ列は、第１のデータ列における第１のデータが、所定の情報を再生するために用いられたときに所定の情報の再生品質を第１のデータ列における場合よりも劣化させる第２のデータに置き換えられ、第１のデータ列における第３のデータのうちの少なくとも一部が、第２のデータ列を再生する場合に参照される制御情報である第４のデータに置き換えられるとともに、他の部分が、第２のデータ列が再生される場合に再生データとして参照されない非参照領域とされたものであり、第２のデータ列に含まれる第４のデータを基に、第２のデータ列が再生される。
【０１０３】
本発明のデータ復元方法においては、第１のデータ列には、所定の情報を再生するために用いられるデータのうちの少なくとも一部である第１のデータが含まれており、第２のデータ列は、第１のデータ列における第１のデータが、所定の情報を再生するために用いられたときに所定の情報の再生品質を第１のデータ列における場合よりも劣化させる第２のデータに置き換えられたものであり、第２のデータ列を第１のデータ列に復元するために必要な情報を含む第３のデータ列が取得され、第３のデータ列を基に、第２のデータ列から第１のデータ列が復元され、第３のデータ列は、第１のデータ列を復元するための第１のデータを含み、第３のデータ列に含まれている第１のデータを、第２のデータ列に含まれている第２のデータと置き換えることにより、第１のデータ列が復元され、第２のデータの一部は、第２のデータ列を再生させる場合に参照される制御情報である。
【０１０５】
【発明の実施の形態】
以下、図を参照して、本発明の実施の形態について説明する。
【０１０６】
ここでは、オーディオＰＣＭ信号などのデジタル信号の入力を受け、帯域分割符号化（ＳＢＣ）、適応変換符号化（ＡＴＣ：Adaptive Transform cording）および、適応ビット割り当てを行うことにより、高能率符号化を行う場合について説明する。適応変換符号化とは、離散コサイン変換（ＤＣＴ）などをベースに、ビット配分を適応化した符号化方法であり、入力信号を時間ブロック毎にスペクトル信号に変換し、所定の帯域毎に、各スペクトル信号をまとめて正規化、すなわち、最大信号成分を近似する正規化係数で、各信号成分を除算してから、信号の性質によって適時定められた量子化精度で量子化して符号化するものである。
【０１０７】
図１は、音響波形信号の入力を受けて、試聴データを作成する符号化装置１の構成例を示すブロック図である。
【０１０８】
変換部１１は、音響波形信号の入力を受けて、信号周波数成分に変換し、信号成分符号化部１２に出力する。信号成分符号化部１２は、入力された信号周波数成分を符号化し、符号列生成部１３に出力する。符号列生成部１３は、信号成分符号化部１２により符号化された信号周波数成分から符号列を生成し、試聴データ生成部１４に出力する。試聴データ生成部１４は、符号列生成部１３から入力された符号列に対して、正規化係数情報の書き換え、制御情報の挿入などの所定の処理を行って、高音質で再生可能な音声データ（オリジナルデータ）を、試聴データに変換するとともに、オリジナルデータの再生を希望するユーザに対して販売される、試聴データに対応する追加データ（復元用データ）を生成して出力する。
【０１０９】
図２は、変換部１１の更に詳細な構成を示すブロック図である。
【０１１０】
変換部１１に入力された音響波形信号は、帯域分割フィルタ２１によって２つの帯域に分割され、それぞれの信号が、順スペクトル変換部２２−１および２２−２に出力される。順スペクトル変換部２２−１および２２−２は、例えばＭＤＣＴなどを用いて、入力された信号を、スペクトル信号成分に変換して信号成分符号化部１２に出力する。順スペクトル変換部２２−１および２２−２に入力される信号は、帯域分割フィルタ２１に入力される信号の帯域幅の１／２であり、信号の入力も、それぞれ１／２に間引かれている。
【０１１１】
図２の変換部１１においては、帯域分割フィルタ２１によって２つの帯域に分割された信号が、ＭＤＣＴを用いてスペクトル信号成分に変換されるものとして説明したが、入力された信号をスペクトル信号成分に変換する方法は、いずれの方法を用いるようにしても良く、例えば、入力された信号を帯域分割せずに、ＭＤＣＴを用いてスペクトル信号成分に変換するようにしても良い。あるいは、順スペクトル変換部２２−１および２２−２は、ＤＣＴやＤＦＴを用いて、入力された信号をスペクトル信号に変換するようにしても良い。
【０１１２】
いわゆる帯域分割フィルタを用いることにより、入力された信号を帯域成分に分割することも可能であるが、多数の周波数成分を比較的少ない演算量で演算することが可能な、ＭＤＣＴ、ＤＣＴ、あるいは、ＤＦＴを用いてスペクトル変換を行うと好適である。
【０１１３】
また、図２においては、入力された音響波形信号が帯域分割フィルタ２１において、２つの帯域に分割されるものとして説明したが、帯域分割数は、２つでなくてもかまわないことは言うまでもない。帯域分割フィルタ２１における帯域分割数を示す情報は、信号成分符号化部１２を介して、符号列生成部１３に出力される。
【０１１４】
図３は、変換部１１によって得られるＭＤＣＴによるスペクトル信号の絶対値を、パワーレベルに変換して示した図である。変換部１１に入力された音響波形信号は、所定の時間ブロック毎に、例えば、６４個のスペクトル信号に変換される。これらのスペクトル信号は、信号成分符号化部１２によって、後述する処理により、例えば、図中の実線でかこまれた１６個の枠組みで示されるように、［１］乃至［１６］の、１６個の帯域に分けられ、それぞれの帯域毎に量子化および正規化が行われる。この１６個の帯域に分けられたスペクトル信号の集合、すなわち、量子化および正規化を行うスペクトル信号の集合が、量子化ユニットである。
【０１１５】
周波数成分の分布の仕方に基づいて、量子化精度を量子化ユニット毎に変化させることにより、人間に聞こえる音の質の劣化を最小限にとどめることが出来る効率の良い符号化が可能となる。
【０１１６】
図４は、信号成分符号化部１２の更に詳細な構成を示すブロック図である。ここでは、周波数成分符号化部１２は、入力されたスペクトル信号から、聴感上、特に重要なトーン部分、すなわち、特定の周波数周辺にエネルギが集中している信号成分を分離して、他のスペクトル成分とは別に符号化を行うようになされている場合について説明する。
【０１１７】
変換部１１から入力されたスペクトル信号は、トーン成分分離部３１により、トーン成分と、非トーン成分に分離され、トーン成分は、トーン成分符号化部３２に出力され、非トーン成分は、非トーン成分符号化部３３に出力される。
【０１１８】
図５を用いて、トーン成分と非トーン成分について説明する。例えば、トーン成分分離部３１に入力されたスペクトル信号が、図５のような信号である場合、特にパワーレベルが高い部分が、トーン成分４１乃至４３として、非トーン成分から分離される。なお、分離されたトーン成分４１乃至４３の位置を示す位置データＰ１乃至Ｐ３、およびトーン成分として抜き出された周波数の幅がそれぞれ検出されて、トーン成分とともに、トーン成分符号化部３２に出力される。
【０１１９】
トーン成分の分離方法は、例えば、本発明者が先に出願した、特願平５−１５２８６５号公報、もしくは、WO９４／２８６３３などに記載の方法を用いればよい。この方法により分離されたトーン成分および非トーン成分は、後述するトーン成分符号化部３２および非トーン成分符号化部３３の処理により、それぞれ、異なるビット数で量子化される。
【０１２０】
トーン成分符号化部３２および非トーン成分符号化部３３は、入力された信号を、それぞれ符号化するが、トーン成分符号化部３２は、トーン成分に対して、量子化ビット数を大きく、すなわち、量子化精度を高くして量子化を行い、非トーン成分符号化部３３は、非トーン成分に対して、量子化ビット数を小さく、すなわち、量子化精度を低くして量子化を行う。
【０１２１】
各トーン成分に関しては、トーン成分の位置情報や、トーン成分として抜き出された周波数の幅などの情報を新たに付け加える必要があるが、非トーン成分のスペクトル信号を少ないビット数で量子化することが可能となる。特に、符号化装置１に入力された音響波形信号が、特定のスペクトルにエネルギが集中するような信号である場合には、このような方法を取ることにより、聴覚上の劣化を殆ど感じさせずに、高い圧縮率で効果的に符号化することが可能である。
【０１２２】
図６は、図４のトーン成分符号化部３２の更に詳細な構成を示すブロック図である。
【０１２３】
正規化部５１は、量子化ユニット毎にトーン成分のスペクトル信号の入力を受けて、正規化を行い、量子化部５２に出力する。量子化精度決定部５３は、入力された量子化ユニットを参照して、量子化精度を計算し、計算結果を量子化部５２に出力する。入力される量子化ユニットは、トーン成分であるから、量子化精度決定部５３は、量子化精度が高くなるように量子化精度を計算する。量子化部５２は、正規化部５１から入力された正規化結果を、量子化精度決定部５３により決定された量子化精度で量子化して、符号を生成するとともに、生成された符号に加えて、正規化係数情報や量子化精度情報などの、符号化情報を出力する。
【０１２４】
また、トーン成分符号化部３２は、トーン成分とともに入力されたトーン成分の位置情報なども、トーン成分とともに符号化して出力する。
【０１２５】
図７は、図４の非トーン成分符号化部３３の更に詳細な構成を示すブロック図である。
【０１２６】
正規化部５４は、量子化ユニット毎に非トーン成分のスペクトル信号の入力を受けて、正規化を行い、量子化部５５に出力する。量子化精度決定部５６は、入力された量子化ユニットを参照して、量子化精度を計算し、計算結果を量子化部５５に出力する。入力される量子化ユニットは、非トーン成分であるから、量子化精度決定部５６は、量子化精度が低くなるように量子化精度を計算する。量子化部５５は、正規化部５４から入力された正規化結果を、量子化精度決定部５６により決定された量子化精度で量子化して、符号を生成するとともに、生成された符号に加えて、正規化係数情報や量子化精度情報などの、符号化情報を出力する。
【０１２７】
上述した符号化方法に対して、更に符号化効率を高めることが可能である。例えば、可変長符号化を行い、量子化されたスペクトル信号のうち、頻度の高いものに対しては、比較的短い符号長を割り当て、頻度の低いものに対しては、比較的長い符号長を割り当てることにより、符号化効率を更に高めることが出来る。
【０１２８】
そして、図１の符号列生成部１３は、信号成分符号化部１２により出力された信号周波数成分の符号から、例えば、記録媒体に記録したり、データ伝送路を介して、他の情報処理装置などに送出可能な符号列、すなわち、複数のフレームにより構成された符号列を生成し、試聴データ生成部１４に出力する。符号列生成部１３により生成される符号列は、通常のデコーダによって高音質で再生可能な音声データである。図８に、符号列生成部１３において生成される高音質で再生可能な音声データのフレームのフォーマットを示す。
【０１２９】
各フレームの先頭には、同期信号を含む固定長のヘッダが配置されている。ヘッダには、図２を用いて説明した変換部１１の帯域分割フィルタ２１の帯域分割数等も記録される。
【０１３０】
各フレームには、ヘッダに続いて、分離されたトーン成分に関するトーン成分情報が記録される。トーン成分情報には、トーン成分数、トーン幅、および、図６を用いて説明したトーン成分符号化部３２がトーン成分に対して施した量子化の量子化精度情報が記録される。続いて、トーン成分４１乃至４３のデータとして、それぞれの正規化係数、トーン位置、およびスペクトル係数が記録されている。ここでは、例えば、トーン成分４１の正規化係数が３０であり、トーン成分４２の正規化係数が２７であり、トーン成分４３の正規化係数が２４であるものとする。
【０１３１】
そして、トーン成分情報に続いて、非トーン成分情報が記載される。非トーン成分情報には、量子化ユニット数（ここでは１６）、図７を用いて説明したトーン成分符号化部３３が、非トーン成分に対して施した量子化の量子化精度情報、１６個の量子化ユニットそれぞれの正規化係数情報、およびスペクトル係数情報が記録されている。正規化係数情報には、最低域の量子化ユニット［１］の４６という値から、最高域の量子化ユニット［１６］の８という値までが、量子化ユニット毎に記録されている。ここでは、正規化係数情報として、スペクトル信号のパワーレベルのｄＢ値に比例する値が用いられているものとする。また、コンテンツフレームの長さが固定長である場合、スペクトル係数情報の後に空き領域が設けられるようにしても良い。
【０１３２】
図９は、図１の試聴データ生成部１４の更に詳細な構成を示すブロック図である。
【０１３３】
試聴データ生成処理制御部６１は、図示しない外部の操作入力部などから入力される、試聴データの試聴領域や、試聴領域の試聴帯域などの設定データを基に、試聴領域判定部６２、および帯域制限処理部６３を制御する。試聴領域とは、試聴データのうち、追加データを用いることなく低品質での再生が可能な部分であり、例えば、さびの部分などが試聴領域として設定される。試聴領域は、試聴データ内で、複数箇所設定されるようにしても良い。また、試聴領域の試聴帯域とは、低品質再生される周波数帯域のことである。例えば、図５を用いて説明したスペクトルデータのうち、一部の量子化ユニットのみを指定することにより、一定の範囲の周波数帯域のみを再生可能として、再生される音声の品質を下げるようになされている。
【０１３４】
試聴領域判定部６２は、図８を用いて説明した各フレームに対して、試聴データ生成処理制御部６１の制御に従って、入力されたフレームが、試聴領域内であるか否かを判断し、判断結果を帯域制限処理部６３、および制御情報挿入部６４に出力する。試聴領域判定部６２は、試聴データ生成処理制御部６１の制御に従って、例えば、図１０に示されるように、入力されたフレーム列を、保護領域、すなわち、試聴を許可されない領域と、試聴領域に区別する。
【０１３５】
帯域制限処理部６３は、試聴領域判定部６２から入力される信号を基に、入力されたフレームが、試聴領域に含まれているフレームであるか否かに基づいて、試聴帯域が制限されたデータを生成する。試聴領域内の試聴帯域についての情報は、試聴データ生成処理制御部６１から入力される。
【０１３６】
例えば、試聴可能部分の試聴帯域として、量子化ユニット［１］乃至量子化ユニット［１２］が指定された場合、帯域制限処理部６３は、入力されたフレームが、試聴領域のフレームであるとき、図１１に示されるように、試聴帯域より高域側の量子化ユニット［１３］乃至量子化ユニット［１６］の正規化係数情報の値を、例えば、最小化して、正規化係数０とする。従って、量子化ユニット［１３］乃至量子化ユニット［１６］に対応する部分のスペクトル係数情報には、有効な値が記述されているが、再生時には、正規化係数情報が０であるので、対応する部分のスペクトルは、厳密には０にはならないが、可聴性という観点からは、実質的には０と同等の値となる。従って、図中Ａｄで示される位置より高域側のスペクトル係数情報は、参照されないのと同義である。
【０１３７】
非トーン成分と同様にして、トーン成分のうち、試聴帯域から外れている部分の正規化係数の値も、例えば０として最小化することにより、再生時には、対応するトーン成分のスペクトル信号も極小化（実質的には０と同等の値に変更）される。
【０１３８】
図１１に示された試聴データを再生した場合のスペクトル信号を図１２に示す。量子化ユニット［１３］乃至量子化ユニット［１６］の正規化係数情報は０に変更されているため、対応するスペクトル信号は極小化される。また、量子化ユニット［１３］乃至量子化ユニット［１６］に含まれている２つのトーン成分に対しても、同様に、対応するスペクトル信号は極小化される。すなわち、試聴データを復号して再生した場合、試聴可能部分においては、狭帯域のスペクトル信号のみが再生される。
【０１３９】
このようにすることにより、試聴データの試聴領域を再生した場合、狭帯域のデータしか再生されないので、図８を用いて説明したオリジナルデータと比較して、品質の低いデータが再生される。
【０１４０】
更に、帯域制限処理部６３は、試聴不可の部分に対応するフレームの量子化ユニット［１］乃至［１６］の全ての正規化係数情報の値を０とする。これにより、試聴不可のフレーム（保護領域）のデータは、再生されても無音となる。
【０１４１】
なお、帯域制限処理部６３は、試聴データを復号した場合の復号データ長が、オリジナルデータを復号した場合の復号データ長より長くなることがないような置き換え用のデータを用いて、試聴フレームを生成する。
【０１４２】
制御情報挿入部６４は、試聴データが再生されたときに、無音状態が続いた後、試聴部分が突然流れ出すといったような、不自然な再生が行われるようなことがないように、試聴領域のみが連続して再生されるように、全てのフレームに、再生可であるか否かを示す制御情報を挿入する。制御情報挿入部６４は、試聴領域判定部６２から入力される情報に従って、参照されないスペクトル係数情報の領域、すなわち、再生品質を制限したため、試聴データを再生する場合に無視される領域に、制御情報を挿入して、試聴データ生成部６６に出力する。そして、制御情報挿入部６４は、制御情報が挿入された部分に対応する、真のスペクトル係数情報と、必要に応じて、制御情報が挿入された位置を示す情報とを、追加フレーム生成部６５に出力する。
【０１４３】
制御情報が挿入された場合のフレームのフォーマットを図１３に示す。
【０１４４】
図１１を用いて説明したように、試聴帯域より高域側の量子化ユニット［１３］乃至量子化ユニット［１６］の正規化係数情報の値が０に変更されている場合、量子化ユニット［１３］乃至量子化ユニット［１６］に対応する、図中Ａｄで示される位置より高域側のスペクトル係数情報は、参照されないのと同義であるので、この範囲内に、任意の制御情報を記載することが可能である。
【０１４５】
更に、制御情報挿入部６４は、図中Ａｄで示される位置より高域側のスペクトル係数情報の範囲内で、制御情報が記載されない位置に、ランダムなダミーデータなどを記載するようにしても良い。この場合、制御情報挿入部６４は、ダミーデータが記載された部分に対応する真のスペクトル係数情報と、必要に応じて、ダミーデータが挿入された位置を示す情報とを、追加フレーム生成部６５に出力する。
【０１４６】
ここでは、試聴データの再生時に参照されないスペクトル係数情報に対応する位置に、制御情報を挿入するものとして説明したが、試聴データの再生時に参照されない部分であれば、制御情報を挿入する位置は、スペクトル係数情報に対応する位置以外であっても良い。
【０１４７】
制御情報のフォーマットの例を図１４に示す。
【０１４８】
制御情報には、試聴データの入力を受けて、これを再生する再生装置が、対応するフレームは試聴が許可されたフレームであるか、試聴が許可されていないフレームであるかを判断するための情報として、試聴許可フラグ、試聴開始フラグ、試聴終了フラグが記載されている。
【０１４９】
試聴許可フラグは、コンテンツの著作権を有する者によって設定され、試聴を禁止するフレームと許可するフレームとを区別するためのものである。試聴データを再生する再生装置においては、この試聴許可フラグを参照し、試聴が許可されている場合は通常の再生処理を行い、試聴が許可されていない場合は再生処理することなく次の試聴フレームに対する処理を開始する。これによって、試聴を禁止されているフレームは再生処理をスキップされるので、試聴領域の前後に試聴不可のフレームがあるような場合においても、無音の再生を行わないようにすることができる。
【０１５０】
試聴開始フラグおよび試聴終了フラグは、ひとかたまりの試聴領域に対して、試聴領域の開始位置後の所定時間内、および終了位置前の所定時間内の試聴フレームに設定する情報であり、例えば、所定時間を３秒とすると、試聴領域の開始から３秒間に再生される試聴フレームには、それを判別できるように試聴開始フラグを設定し、試聴領域の終了前の３秒以内に再生される試聴フレームにはそれを判別できるように試聴終了フラグを設定する。
【０１５１】
これによって、例えば、試聴データを再生装置で再生させる場合、試聴領域の前後でフェードインおよびフェードアウト等の再生処理を施すことが可能となる。すなわち、複数の試聴領域が存在する場合、本来のオリジナルデータでは、異なる箇所で再生されるはずの複数の試聴領域が、同じ音量で、連続して再生されるのに対し、複数の試聴領域のそれぞれの開始および終了で、試聴領域の前後でフェードインおよびフェードアウト等の再生処理を施すことができるので、試聴データを、ユーザにとってより違和感がなく、購買意欲を高めることができるものとすることができる。
【０１５２】
特に、スペクトル係数情報が可変長符号化されており、その可変長符号が、スペクトル係数情報の記載領域に、低域側から高域側に、順次記述されている場合、参照されないスペクトル係数情報の領域に、制御情報が記載されていることにより、中域の可変長符号の一部が欠落するので、その部分を含めた高域側のデータは、全く復号できなくなる。すなわち、試聴データに含まれるオリジナルデータに関わるスペクトル係数情報を、追加データを用いることなく復元することが困難となるので、試聴データの安全性が強化される。
【０１５３】
このように、正規化係数などのデータの一部が、０、あるいは制御情報で置き換えられている場合、置き換えられたデータに対する真のデータを推測することは、比較的鍵長の短い暗号鍵を解読することと比較して、非常に困難である。また、試聴データを不正に改変しようとすると、かえって音質を劣化させる原因となる。従って、オリジナルデータの試聴が許可されていないユーザが、試聴データを基に、オリジナルデータを推測することが非常に困難となり、コンテンツデータの著作者や配布者の権利をより強固に保護することが可能となる。
【０１５４】
また、万が一、ある試聴データにおいて、置き換えられたデータに対する真のデータが推測されてしまっても、暗号アルゴリズムを解読されてしまった場合と異なり、他のコンテンツにその被害が拡大することはないので、特定のアルゴリズムを用いて暗号化を施したコンテンツデータを試聴データとして配布するよりも安全性が高い。
【０１５５】
以上において、帯域制限処理部６３および制御情報挿入部６４により置き換えられた真の正規化係数情報および真のスペクトル係数情報は、後述する追加フレーム生成部６５に供給され、追加フレームに記載される。帯域制限処理部６３は、正規化係数情報以外に、例えば、量子化精度情報や量子化ユニット数などの値を、０あるいは、ランダムなダミーデータなどに変更するようにしても良いので、後述する追加フレーム生成部６５は、帯域制限処理部６３により変更された、例えば、量子化精度情報、量子化ユニット数などの値を示す情報の入力を受け、追加フレームにそれらの情報を記載する。
【０１５６】
ただし、正規化係数情報を変更する場合と、量子化精度情報を変更する場合とでは、追加データを用いずに試聴データから不正にオリジナルデータを推測するための困難さ、すなわち、試聴データの安全強度が異なってしまう。例えば、オリジナルデータの生成時に、正規化係数情報に基づいて量子化精度情報を算出するようなビット割り当てアルゴリズムが採用されている場合、量子化精度情報のみの値を変更しても、正規化係数情報を手掛かりにして、真の量子化精度情報を推測される危険性がある。
【０１５７】
これに対して、正規化係数情報のみを変更しても、量子化精度情報から正規化係数情報を推測するのは困難であるので、試聴データの安全強度は高いといえる。なお、正規化係数情報および量子化精度情報の両方の値を変更することで、不正にオリジナルデータを推測される危険性は更に低くなる。また、試聴データのコンテンツフレームによって、正規化係数情報または量子化精度情報の値を選択的に変更するようにしてもよい。
【０１５８】
追加フレーム生成部６５は、帯域制限処理部６３により、帯域が制限されて、変更された正規化係数などに関する情報から、試聴データを試聴したユーザが、高音質で楽曲を聞く場合に購入するオリジナルデータ復元用の追加データを構成する追加フレームを生成する。
【０１５９】
図１５に、生成される追加フレームのフォーマットを示す。図１３を用いて説明したように、試聴帯域として、量子化ユニット［１］乃至量子化ユニット［１２］が選択されている場合、試聴データの各フレームにおいて、量子化ユニット［１３］乃至量子化ユニット［１６］の４つの量子化ユニットに対応する、トーン成分および非トーン成分のそれぞれの正規化係数は、０に置き換えられている。また、参照されないスペクトル係数情報の一部が、制御情報に置き換えられている。追加フレーム生成部６５は、帯域制限処理部６３から、変更された正規化係数情報およびスペクトル係数情報の本来の値（真の正規化係数情報および真のスペクトル係数情報）を示す情報の供給を受け、図１５の追加フレームを生成する。
【０１６０】
追加フレームには、トーン成分に対応する追加情報と、非トーン成分に対応する追加情報が記載される。図１５は、試聴帯域として、量子化ユニット［１］乃至量子化ユニット［１２］が選択されている場合を示している。トーン成分に対応する追加情報としては、最小化されたトーン成分数（ここでは２成分最小化されている）と、それぞれのトーン成分の正規化係数情報の試聴フレーム上の位置と、真の正規化係数情報が記載される。また、非トーン成分に対応する追加情報としては、最小化された非トーン成分の正規化係数の数（ここでは４成分最小化されている）、その正規化係数の先頭の位置を示す情報（例えば、正規化係数が０に変更されている先頭の量子化ユニットの番号１３などでも良い）、書き換えられた真の正規化係数、および制御情報に書き換えられた部分の真のスペクトル係数情報が記載されている。
【０１６１】
ここで、制御情報に書き換えられた部分の真のスペクトル係数情報の位置情報は、生成される追加フレームに記載されていないが、試聴フレームにおいて、スペクトル係数情報を制御情報に書き換える部分を、参照されないスペクトル係数情報となる部分の先頭とすれば、正規化係数が０に置き換えられた量子化ユニットの領域の情報を基に、制御情報に書き換えられた部分の真のスペクトル係数情報の位置を求めることが可能である。制御情報に書き換える位置を、参照されないスペクトル係数情報となる部分の先頭と異なる位置とする場合は、追加フレームに、制御情報に書き換えられた部分の真のスペクトル係数情報の位置情報を記載する必要がある。
【０１６２】
また、帯域制限処理部６３が、正規化係数情報およびスペクトル係数情報以外に、例えば、量子化精度情報や量子化ユニット数などの値を変更した場合、追加フレーム生成部６５は、帯域制限処理部６３により変更された、量子化精度情報や量子化ユニット数などの値を示す情報の入力を受け、追加フレームにそれらの真の値の情報を記載する。
【０１６３】
また、追加フレームに記載される正規化係数情報の一部、あるいは全部を、制御情報に記載するようにしても良い。その場合の制御情報のフォーマットの例を図１６に示す。
【０１６４】
制御情報には、試聴データの再生時に参照する再生制御情報と、帯域を制限するために置き換えた正規化係数情報の全部、もしくは一部が記載される。図１６においては、追加フレームに記載される正規化係数情報のうち、非トーン成分に関する情報が全て記載されているが、ここに記載される情報は、例えば、非トーン成分のうちの一部の正規化係数情報であっても、トーン成分を含む、置き換えられた全ての正規化係数情報であっても良いし、試聴データの生成時に、正規化係数情報以外の情報が置き換えられている場合は、それらの情報を記載するようにしても良い。
【０１６５】
また、再生制御情報には、図１４を用いて説明した、再生許可フラグ、試聴開始フラグ、および試聴終了フラグに加えて、再生可能フラグが記載される。
【０１６６】
再生可能フラグは、制御情報内に記述されている正規化係数情報等を利用した再生が可能か否かを示すものである。試聴データの入力を受けた再生装置は、試聴フレーム内の制御情報の再生可能フラグを参照し、正規化係数情報が利用可能な場合、制御情報内の正規化係数情報を試聴フレーム内の本来の位置に復元して再生処理を行うことが可能である。また、試聴データの入力を受けた再生装置は、再生可能フラグに、正規化係数情報の利用が不可であることが示されている場合、制御情報内の正規化係数情報を利用せずに再生処理を行う。
【０１６７】
また、再生可能フラグでは、再生装置などのバージョンや種類を指定することも可能であり、例えば、指定されたバージョンを満たす再生装置では、制御情報に含まれている正規化係数情報を用いて、試聴データの一部を復元して、再生処理を行うことができ、それに対して、指定以外のバージョンの再生装置では、再生処理において、制御情報に含まれている正規化係数情報を利用することはできない。
【０１６８】
また、制御情報に、追加データに加えるべき情報の一部を記載することにより、追加データのデータ容量を少なくすることができるので、ユーザが、例えば、ＭＭＫなどで、追加データを記録媒体に記録しようとした場合に、処理時間を短くすることができたり、追加データをダウンロード処理により手に入れようとした場合に、通信時間を短くすることができる。
【０１６９】
図１６の制御情報を試聴フレームに挿入する場合、帯域制限処理部６３は、帯域を制限するために置き換えた正規化係数情報の全部、もしくは一部を、追加フレーム生成部６５ではなく、制御情報挿入部６４に出力する。制御情報挿入部６４は、帯域制限処理部６３から入力された真の正規化係数情報を用いて、図１６の制御情報を生成して、試聴フレームに挿入する。
【０１７０】
また、追加フレ―ムに記載される情報の一部、例えば、図１６を用いて説明したように、非トーン部分の真の正規化係数情報が、制御情報に記載された場合、対応する部分は、追加フレームに記載されない。
【０１７１】
試聴データ生成部６６は、試聴データのヘッダを生成し、入力された試聴データの符号化フレーム列に、生成したヘッダを付加して、試聴データを生成して出力する。試聴データのヘッダには、例えば、コンテンツを識別するためのコンテンツＩＤや、コンテンツの再生時間、コンテンツのタイトル、あるいは、符号化方式の情報などの情報が含まれている。
【０１７２】
追加データ生成部６７は、追加データのヘッダを生成し、入力された追加データの符号化フレーム列に、生成した追加データのヘッダを付加して、追加データを生成して出力する。追加データのヘッダには、コンテンツを識別して、試聴データと対応させるためのコンテンツＩＤ、コンテンツの再生時間、必要に応じて符号化方式に関する情報などが記載される。
【０１７３】
このようにして、試聴データの試聴領域が連続して再生されるように、制御情報を記載するようにしたので、保護領域が無音で再生されてしまうことを防ぐことができ、試聴データを試聴したユーザが違和感を覚えないようにすることができる。
【０１７４】
また、制御情報に、試聴データの再生に関する様々な情報を記述することにより、例えば、試聴領域の始めの部分と終わりの部分でフェードインやフェードアウトなどの処理が行えるようにして、本来異なる部分で再生されるはずの複数の試聴領域が連続して再生された場合にも、試聴データを試聴したユーザが、本来の試聴領域の連続する部分と連続しない部分を理解することができるようにしたり、再生装置のバージョンなどによって、試聴データの再生品質を変更することができるようにすることができる。
【０１７５】
更に、制御情報に、試聴フレームの生成時に置き換えられた真のデータを含ませるようにすることにより、追加データのデータ容量を小さくしたり、再生装置の種類やバージョンなどによって、試聴データの再生品質を変更することが可能となる。
【０１７６】
試聴データ生成部１４によって生成された試聴データと追加データとを用いて、後述する処理により、オリジナルデータを復元することが出来る。
【０１７７】
次に、図１７および図１８のフローチャートを参照して、試聴データ生成処理について説明する。
【０１７８】
ステップＳ１において、試聴データ生成処理制御部６１は、図示しない操作入力部などから入力された、試聴データの試聴領域の試聴帯域の設定値を取得する。ここでは、試聴帯域として、図１１および図１２を用いて説明したように、量子化ユニット［１］乃至量子化ユニット［１２］が設定されたものとして説明する。試聴データ生成処理制御部６１は、試聴帯域の設定値を、帯域制限処理部６３に供給する。
【０１７９】
ステップＳ２において、試聴データ生成処理制御部６１は、図示しない操作入力部などから入力された、試聴データの試聴領域を指定する情報を取得する。ここでは、試聴領域として、図１０を用いて説明した第４および第５フレームからなる試聴領域１と、第９および第１０フレームからなる試聴領域２が指定されたものとして説明する。試聴データ生成処理制御部６１は、試聴領域を指定する情報を、試聴領域判定部６２に供給する。
【０１８０】
ステップＳ３において、試聴領域判定部６２および帯域制限処理部６３は、オリジナルデータに相当するフレーム列に含まれるいずれかのフレーム、すなわち、図８を用いて説明した高音質再生可能なフレームの入力を受ける。
【０１８１】
ステップＳ４において、試聴領域判定部６２は、試聴データ生成処理制御部６１から供給された情報を基に、入力されたフレームは試聴フレーム（試聴領域に含まれるフレーム）であるか否かを判断する。
【０１８２】
ステップＳ４において、入力されたフレームは試聴フレームであると判断された場合、ステップＳ５において、試聴領域判定部６２は、判定結果を帯域制限処理部６３および制御情報挿入部６４に供給するので、帯域制限処理部６３は、入力されたフレームのトーン成分の正規化係数情報のうち、試聴データ生成処理制御部６１から供給された試聴帯域の設定値で指定されている帯域以外の部分を、例えば、値０などに変更する。
【０１８３】
ステップＳ６において、帯域制限処理部６３は、非トーン成分の正規化係数情報のうち、試聴データ生成処理制御部６１から供給された試聴帯域の設定値で指定されている帯域以外の部分を、例えば、値０に変更し、図１１を用いて説明した試聴フレームを生成して、制御情報挿入部６４に出力するとともに、ステップＳ５およびステップＳ６において置き換えられた真の正規化係数情報および、その位置などを示す各種情報を、全部、追加フレーム生成部６５に出力するか、その一部を制御情報挿入部６４に出力して、その他を追加フレーム生成部６５に出力する。
【０１８４】
ステップＳ７において、制御情報挿入部６４は、ステップＳ６の処理において書き換えられた非トーン成分の正規化係数情報に対応する、参照されないスペクトル係数情報の一部に、試聴可を示す制御情報を記載し、図１３を用いて説明した試聴フレームを生成する。ここで、記載される制御情報は、図１４を用いて説明したような、再生制御情報のみのものであっても、図１６を用いて説明したような、追加データに記載する情報の一部を含んだものであっても、どちらでもかまわない。制御情報挿入部６４は、生成された試聴フレームを試聴データ生成部６６に出力するとともに、ステップＳ７において制御情報と置き換えられた真のスペクトル係数情報を、追加フレーム生成部６５に出力する。
【０１８５】
ここで、スペクトル係数情報が可変長符号化されている場合、真のスペクトル係数情報が復号された場合のビット長より、制御情報が復号された場合のビット長が短くなるような制御情報を用いることにより、後述する復号処理において、符号列のフレーム長をオーバーしてしまうことを防ぐようにすることができる。
【０１８６】
また、制御情報挿入部６４は、参照されないスペクトル係数情報の一部に、制御情報に加えて、ダミーデータを置き換えるようにしても良い。スペクトル係数情報に置き換えられるダミーデータは、全て値０とするようにしても良いし、適当に値１および値０を混在させるようにしても良い。
【０１８７】
ステップＳ８において、追加フレーム生成部６５は、帯域制限処理部６３および制御情報挿入部６４から入力される信号を基に、試聴データを試聴したユーザが、高音質で楽曲を聞く場合に購入する追加データを構成する追加フレーム用のデータを生成する。
【０１８８】
ステップＳ４において、入力されたフレームは試聴フレームではない、すなわち、保護フレーム（保護領域に含まれるフレーム）であると判断された場合、ステップＳ９において、試聴領域判定部６２は、判定結果を帯域制限処理部６３および制御情報挿入部６４に供給する。帯域制限処理部６３は、トーン成分の正規化係数情報を、全て最小値（値０）に変更する。
【０１８９】
ステップＳ１０において、帯域制限処理部６３は、非トーン成分の正規化係数情報を、全て最小値（値０）に変更して、制御情報挿入部６４に出力するとともに、ステップＳ９およびステップＳ１０において置き換えられた真の正規化係数情報および、その位置などを示す各種情報を、全部、追加フレーム生成部６５に出力するか、その一部を制御情報挿入部６４に出力して、その他を追加フレーム生成部６５に出力する。
【０１９０】
ステップＳ１１において、制御情報挿入部６４は、ステップＳ１０の処理において書き換えられた非トーン成分の正規化係数情報に対応する、参照されないスペクトル係数情報の一部に、試聴不可を示す制御情報を記載する。制御情報挿入部６４は、生成された試聴フレームを試聴データ生成部６６に出力するとともに、ステップＳ１１において制御情報と置き換えられた真のスペクトル係数情報を、追加フレーム生成部６５に出力する。
【０１９１】
ここでも、同様にして、スペクトル係数情報が可変長符号化されている場合、真のスペクトル係数情報が復号された場合のビット長より、制御情報が復号された場合のビット長が短くなるような制御情報を用いて、置き換えを行うようにする。更に、制御情報に加えて、ダミーデータを用いて置き換えを行うようにしても良い。
【０１９２】
ステップＳ１２において、追加フレーム生成部６５は、帯域制限処理部６３および制御情報挿入部６４から入力される信号を基に、試聴データを試聴したユーザが、高音質で楽曲を聞く場合に購入する追加データを構成する追加フレーム用のデータを生成する。
【０１９３】
ステップＳ８の処理の終了後、もしくはステップＳ１２の処理の終了後、ステップＳ１３において、試聴データ生成部６４は、処理されたフレームは、最終フレームであるか否かを判断する。ステップＳ１３において、処理されたフレームは、最終フレームではないと判断された場合、処理は、ステップＳ３に戻り、それ以降の処理が繰り返される。
【０１９４】
ステップＳ１３において、処理されたフレームは、最終フレームであると判断された場合、ステップＳ１４において、試聴データ生成部６６は、試聴データのヘッダを生成して、試聴フレーム列に付加して試聴データを生成し、出力する。
【０１９５】
ステップＳ１５において、追加データ生成部６７は、入力された情報を用いて、追加データのヘッダを生成して追加フレーム列に付加して追加データを生成して出力し、処理が終了される。
【０１９６】
図１７および図１８のフローチャートを参照して説明した処理により、試聴領域のみが低品質で再生される試聴データと、試聴データからオリジナルデータを復元するための追加データが生成される。
【０１９７】
また、ユーザの購買意欲を喚起させるために、試聴領域のデータを、帯域制限せずに、オリジナルデータと同様の高音質のものとするようにしても良い。その場合、試聴領域のデータを帯域制限することなく、オリジナルデータの、試聴領域のフレームをそのまま試聴フレームにコピーし、その部分の追加フレームを生成しないようにすればよい。
【０１９８】
なお、符号化装置１の信号成分符号化部１２は、入力された信号を符号化する場合、トーン成分と非トーン成分を分離して、それぞれ別に符号化を行うものとして説明したが、信号成分符号化部１２に代わって、図７の非トーン成分符号化部３３を用いることにより、入力された信号をトーン成分と非トーン成分を分離せずに符号化するようにしても良い。
【０１９９】
図１９に、入力された信号をトーン成分と非トーン成分に分離しない場合に符号列生成部１３により生成される高音質のオリジナルデータフレームのフォーマットを示す。オリジナルデータフレームの先頭には、図８で説明した場合と同様に、同期信号を含む固定長のヘッダが配置されている。ヘッダには、図２を用いて説明した変換部１１の帯域分割フィルタ２１の帯域分割数なども記録される。ヘッダに続いて、量子化ユニット数（ここでは１６）、非トーン成分符号化部３３が施した量子化の量子化精度情報、１６個の量子化ユニットそれぞれの正規化係数情報、およびスペクトル係数情報が記録されている。正規化係数情報は、最低域の量子化ユニット［１］の４６という値から、最高域の量子化ユニット［１６］の８という値までが、量子化ユニット毎に記録されている。また、コンテンツフレームの長さが固定長である場合、スペクトル係数情報の後に空き領域が設けられるようにしても良い。
【０２００】
そして、図２０に、図１９を用いて説明したオリジナルデータフレームの入力を受けた試聴データ生成部１４により生成される試聴部分の音声データのフォーマットを示す。例えば、試聴可能部分の試聴帯域として、量子化ユニット［１］乃至量子化ユニット［１２］が指定された場合、図１１を用いて説明した場合と同様に、試聴帯域より高域側の量子化ユニット［１３］乃至量子化ユニット［１６］の正規化係数情報の値が０とされる。従って、量子化ユニット［１３］乃至量子化ユニット［１６］に対応する部分のスペクトル係数情報には、有効な値が記述されているが、再生時には、正規化係数情報が０であるので、対応する部分のスペクトルは極小化される。そして、参照されないスペクトル係数の一部に、図１４、もしくは図１６を用いて説明した制御情報が記載される。
【０２０１】
そして、図２１に、図１９を用いて説明したオリジナルデータフレームの入力を受けた試聴データ生成部１４の追加フレーム生成部６５により生成される追加フレームを示す。ここでは、制御情報として、図１４を用いて説明した再生制御情報のみが記載されている場合の追加フレームについて説明する。追加フレームには、最小化された量子化フレームの正規化係数の数（ここでは４成分最小化されている）、その正規化係数の先頭の位置、書き換えられた真の正規化係数、および制御情報に書き換えられた部分の真のスペクトル係数情報が記載されている。
【０２０２】
図１９乃至図２１を用いて説明したように、トーン成分が分離されない場合においても、同様の処理により、試聴領域のみが低品質で、試聴領域のみが再生される試聴データと、試聴データからオリジナルデータを復元するための追加データが生成される。
【０２０３】
このようにして生成された試聴データは、インターネットなどを介して、ユーザに配信されたり、店舗などに備えられたＭＭＫによって、ユーザが保有する各種の記録媒体に記録されて配布される。試聴データを再生して、気に入ったユーザは、所定の料金をコンテンツデータの配信事業者に支払うなどして、追加データを入手することが出来る。ユーザは、試聴データおよび追加データを用いて、オリジナルデータを復元させ、復号して再生したり、記録媒体に記録することが可能となる。
【０２０４】
次に、試聴データを復号して出力、あるいは再生する、もしくは、試聴データおよび追加フレームから、オリジナルデータを復号して出力、あるいは再生する場合の処理について説明する。
【０２０５】
図２２は、データ再生装置８１の構成を示すブロック図である。
【０２０６】
符号列分解部９１は、符号化された試聴データの入力を受け、符号列を分解して、各信号成分の符号を抽出し、符号列復元部９３に出力する。
【０２０７】
制御部９２は、図示しない操作入力部から、ユーザの操作を受け、入力されたデータを高音質再生するか否かを示す情報の入力を受けるとともに、追加データの入力を受け、符号列復元部９３の処理を制御する。また、制御部９２は、必要に応じて、真の符号化係数情報、あるいは、真のスペクトル係数情報などを、符号復元部９３に供給する。
【０２０８】
符号列復元部９３は、制御部９２の制御に基づいて、入力された試聴データが試聴、すなわち、そのまま再生される場合は、入力された符号化フレームをそのまま信号成分復号部９４に出力し、入力された試聴データがオリジナルデータに復元されて再生される場合は、制御部９２から供給される真の符号化係数情報、あるいは、真のスペクトル係数情報などの各種情報を基に、試聴データの符号化フレームを、オリジナルデータの符号化フレームに復元する処理を実行し、復元されたオリジナルデータの符号化フレームを、信号成分復号部９４に出力する。
【０２０９】
また、符号列復元部９３は、試聴データが再生される場合、その制御情報を参照して、試聴データの一部、あるいは全体を復元するようにしても良い。すなわち、試聴データに真の符号化係数情報などが含まれ、再生可能フラグに示される、それらを用いて試聴データを復元することが許可される条件（例えば、データ再生装置８１のバージョンの指定など）を満たしている場合、符号列復元部９３は、試聴データに含まれている真の符号化係数情報などを用いて、試聴データの一部、あるいは全体を復元することが可能である。
【０２１０】
信号成分復号部９４は、入力された試聴データ、もしくはオリジナルデータの符号化フレームを復号する。図２３は、入力された符号化フレームが、トーン成分と非トーン成分に分割されて符号化された場合、その符号化フレームを復号する信号成分復号部９４の更に詳細な構成を示すブロック図である。
【０２１１】
フレーム分離部１０１は、例えば、図１３を用いて説明したような符号化フレームの入力を受け、トーン成分と非トーン成分とに分割し、トーン成分は、トーン成分復号部１０２に、非トーン成分は、非トーン成分復号部１０３に出力する。
【０２１２】
図２４は、トーン成分復号部１０２の更に詳細な構成を示すブロック図である。逆量子化部１１１は、入力された符号化データを逆量子化し、逆正規化部１１２に出力する。逆正規化部１１２は、入力されたデータを逆正規化する。すなわち、逆量子化部１１１および逆正規化部１１２により、復号処理が行われて、トーン部分のスペクトル信号が出力される。
【０２１３】
図２５は、非トーン成分復号部１０３の更に詳細な構成を示すブロック図である。逆量子化部１２１は、入力された符号化データを逆量子化し、逆正規化部１２２に出力する。逆正規化部１２２は、入力されたデータを逆正規化する。すなわち、逆量子化部１２１および逆正規化部１２２により、復号処理が行われて、非トーン部分のスペクトル信号が出力される。
【０２１４】
スペクトル信号合成部１０４は、トーン成分復号部１０２および非トーン成分復号部１０３から出力されたスペクトル信号の入力を受け、それらの信号を合成し、オリジナルデータであれば図５、あるいは、試聴データであれば図１２を用いて説明したスペクトラム信号を生成して、逆変換部９５に出力する。
【０２１５】
なお、符号化データが、トーン成分と非トーン成分とに分割されて符号化されていない場合、フレーム分離部１０１を省略し、トーン成分復号部１０２、もしくは、非トーン成分復号部１０３のうちのいずれか一方のみを用いて、復号処理を行うようにしても良い。
【０２１６】
図２６は、逆変換部９５の更に詳細な構成を示すブロック図である。
【０２１７】
信号分離部１３１は、入力されたフレームのヘッダに記載されている帯域分割数に基づいて、信号を分離する。ここでは、帯域分割数が２であり、信号分離部１３１が、入力されたスペクトル信号を逆スペクトル変換部１３２−１および１３２−２に分離するものとする。
【０２１８】
逆スペクトル変換部１３２−１および１３２−２は、入力されたスペクトル信号に対して、逆スペクトル変換し、得られた各帯域の信号を帯域合成フィルタ１３３に出力する。帯域合成フィルタ１３３は、入力された各帯域の信号を合成して出力する。
【０２１９】
帯域合成フィルタ１３３から出力された信号（例えば、オーディオＰＣＭ信号）は、例えば、図示しないＤ／Ａ変換部でアナログデータに変換され、図示しないスピーカから、音声として再生出力される。また、帯域合成フィルタ１３３から出力された信号は、ネットワークなどを介して、他の装置に出力されるようにしても良い。
【０２２０】
次に、図２７のフローチャートを参照して、図２２のデータ再生装置８１が実行するデータ再生処理について説明する。
【０２２１】
符号列分解部９１は、ステップＳ３１において、試聴データの符号化フレームの入力を受け、ステップＳ３２において、入力された符号列を分解し、符号列復元部９３に出力する。
【０２２２】
ステップＳ３３において、符号列復元部９３は、制御部９２から入力される信号を基に、高音質再生、すなわち、オリジナルデータを復元して再生する処理が実行されるか否かを判断する。
【０２２３】
ステップＳ３３において、高音質再生が実行されると判断された場合、ステップＳ３４において、図２８のフローチャートを用いて後述する符号列復元処理が実行される。
【０２２４】
ステップＳ３３において、高音質再生が実行されないと判断された場合、入力されたフレームは、試聴データとして再生されるので、ステップＳ３５において、符号列復元部９３は、入力されたフレームに含まれる制御情報を参照して、このフレームは、試聴可能なフレームであるか否かを判断する。
【０２２５】
ステップＳ３５において、このフレームは、試聴可能なフレームではないと判断された場合、ステップＳ３６において、符号列復元部９３は、入力されたフレームは、入力されたフレームに対する処理をスキップする。すなわち、符号列復元部９３は、入力されたフレームを、信号成分復号部９４に出力しないで、破棄し、処理は、ステップＳ３９に進む。
【０２２６】
ステップＳ３４の処理の終了後、もしくは、ステップＳ３５において、このフレームは、試聴可能なフレームであると判断された場合、ステップＳ３７において、信号成分復号部９４は、入力された符号列を、トーン成分と非トーン成分とに分割し、それぞれ、逆量子化および逆正規化を施すことにより復号し、復号によって生成されたスペクトル信号を合成して、逆変換部９５に出力する。
【０２２７】
ステップＳ３８において、逆変換部９５は、入力されたスペクトル信号を、必要に応じて帯域分離し、それぞれ逆スペクトル変換した後、帯域合成して、時系列信号に逆変換する。
【０２２８】
ステップＳ３６、もしくは、ステップＳ３８の処理の終了後、ステップＳ３９において、制御部９２は、ステップＳ３８において、逆変換部９５によって逆変換されたのは、試聴データの最終フレームであるか否かを判断する。
【０２２９】
ステップＳ３９において、最終フレームではないと判断された場合、処理は、ステップＳ３１に戻り、それ以降の処理が繰り返される。ステップＳ３９において、最終フレームであると判断された場合、処理は終了される。
【０２３０】
逆変換部９５によって逆変換されて生成された時系列信号は、図示しないＤ／Ａ変換部によりアナログデータに変換されて、図示しないスピーカから再生出力されるようにしても良いし、図示しないネットワークを介して、他の装置などに出力されるようにしても良い。
【０２３１】
また、試聴データが再生される場合、必要に応じて、制御情報の試聴開始フラグおよび試聴終了フラグを参照することにより、例えば、フェードイン、フェードアウトなどの処理を行うことができる。
【０２３２】
なお、ここでは、トーン成分と非トーン成分とが分割されて符号化されている試聴データ、もしくは、その試聴データから復元されたオリジナルデータを復号する場合について説明しているが、トーン成分と非トーン成分とが分割されていない場合においても、同様にして、復元処理、および再生処理が可能である。
【０２３３】
次に、図２８のフローチャートを参照して、図２７のステップＳ３４において実行される符合列復元処理について説明する。
【０２３４】
ステップＳ５１において、符号列復元部９３は、制御部９２から、符号列を復元するために、試聴領域情報、真の符号化係数情報、真のスペクトル係数情報などの、追加データの情報を取得する。
【０２３５】
符号列復元部９３は、ステップＳ５２において、符号列分解部９１により分解されたフレームの入力を受け、ステップＳ５３において、入力されたフレームは試聴フレームであるか否かを判断する。
【０２３６】
ステップＳ５３において、入力されたフレームは、試聴フレームであると判断された場合、ステップＳ５４において、符号列復元部９３は、図１５を用いて説明した追加フレームに含まれるトーン成分の正規化係数情報の真の値を用いて、試聴データの試聴フレームにおいて０に置き換えられた部分のトーン成分の正規化係数情報を復元する。
【０２３７】
ステップＳ５５において、符号列復元部９３は、図１５を用いて説明した追加フレームに含まれる非トーン成分の正規化係数情報の真の値を用いて、試聴データの試聴フレームにおいて０に置き換えられた部分の非トーン成分の正規化係数情報を復元する。
【０２３８】
ステップＳ５６において、符号列復元部９３は、図１５を用いて説明した追加フレームに含まれる非トーン成分のスペクトル係数情報の真の値を用いて、試聴データの試聴フレームにおいて制御情報に置き換えられた部分の非トーン成分のスペクトル係数情報を復元して、処理は、図２７のステップＳ３７に戻る。
【０２３９】
ステップＳ５３において、入力されたフレームは、試聴フレームではない、すなわち保護フレームであると判断された場合、ステップＳ５７において、符号列復元部９３は、保護フレームに対応する追加フレームに含まれるトーン成分の正規化係数情報の真の値を用いて、試聴データの試聴フレームにおいて０に置き換えられているトーン成分全体の正規化係数情報を復元する。
【０２４０】
ステップＳ５８において、符号列復元部９３は、保護フレームに対応する追加フレームに含まれる非トーン成分の量子化フレームの正規化係数情報の真の値を用いて、試聴データの試聴フレームにおいて０に置き換えられている非トーン成分全体の正規化係数情報を復元する。
【０２４１】
ステップＳ５９において、符号列復元部９３は、保護フレームに対応する追加フレームに含まれる非トーン成分のスペクトル係数情報の真の値を用いて、試聴データの試聴フレームにおいて制御情報に置き換えられた部分の非トーン成分のスペクトル係数情報を復元して、処理は、図２７のステップＳ３７に戻る。
【０２４２】
図２８のフローチャートを用いて説明した処理により、試聴データと追加データを用いて、オリジナルデータが復元される。
【０２４３】
なお、図２８においては、図１５を用いて説明した追加データを用いてオリジナルデータを復元する、すなわち、真の正規化係数情報が、全て追加データに記載されているものとして説明したが、真のスペクトル係数情報の一部、もしくは全部が、制御情報に記載されている場合、ステップＳ５４およびステップＳ５５、もしくは、ステップＳ５７およびステップＳ５８において、符号列復元部９３は、試聴フレーム内の制御情報に記載されている真の正規化係数情報を用いて、置き換えられている部分の正規化係数情報を復元する。
【０２４４】
図２２乃至図２８を用いて説明した処理により、復号された試聴データ、あるいは復元されて復号されたオリジナルデータは、図示しないスピーカなどを用いて再生されても、例えば、ネットワークなどを介して、他の装置に出力されるようにしても良い。
【０２４５】
また、図２７においては、量子化ユニット１３乃至量子化ユニット１６の正規化係数が０に変更されている試聴データが再生される場合、量子化ユニット１乃至量子化ユニット１２に対応する帯域のデータのみが再生されるものとして説明した。これに対して、試聴フレームの制御情報に、例えば、量子化ユニット１３および量子化ユニット１４の真の正規化係数を含ませて、再生可能フラグで、制御情報に記載されている真の正規化係数情報を用いて再生処理をすることを許可し、追加データには、量子化ユニット１５および量子化ユニット１６の真の正規化係数を記載するようにしてもよい。そして、試聴データが再生される場合、符号列復元部９３は、制御情報を基に、量子化ユニット１３および量子化ユニット１４の真の正規化係数情報を復元して、量子化ユニット１乃至量子化ユニット１２に対応する帯域のデータのみが再生される場合よりも、やや品質の良い再生処理を行うことができるようにしても良い。
【０２４６】
通常で試聴データが再生される場合、符号列復元部９３は、試聴フレームの制御データに含まれる正規化係数情報を参照しないので、再生されるのは、量子化ユニット１乃至量子化ユニット１２に対応する帯域のデータのみである。例えば、データ再生装置が所定のバージョンであった場合、あるいは、図示しない操作入力部などから、例えば、所定のパスワードが入力された場合、制御部９２は、符号列復元部９３を制御して、試聴フレームの制御データに含まれる量子化ユニット１３および量子化ユニット１４の真の正規化係数を抽出させて、対応する部分の正規化係数を復元させる。これによって、量子化ユニット１乃至量子化ユニット１２に対応する帯域のデータのみが再生される場合よりも、やや品質の良い再生処理を行うことができる。
【０２４７】
なお、試聴フレームの制御データに含まれる量子化ユニット１３および量子化ユニット１４の真の正規化係数を参照するか否かの判断は、データ再生装置８１のバージョンの整合、あるいは、パスワードの入力以外の方法によって判断されるようにしても良く、例えば、データ再生装置８１の設定によって、予め決められるようにしても良い。また、高音質再生が実行される場合、符号列復元部９３は、制御部９２から供給される追加データに基づいて、全ての量子化ユニットの正規化係数を復元することができるので、オリジナルデータが再生されることは言うまでもない。
【０２４８】
次に、試聴データを記録媒体に記録する、もしくは、試聴データおよび追加フレームからオリジナルデータを復元して記録媒体に記録する場合の処理について説明する。
【０２４９】
図２９は、データ記録装置１４１の構成を示すブロック図である。
【０２５０】
なお、図２２のデータ再生装置８１の場合と対応する部分には同一の符号を付してあり、その説明は適宜省略する。
【０２５１】
すなわち、符号列分解部９１は、符号化された試聴データの入力を受け、符号列を分解して、各信号成分の符号を抽出し、制御部９２は、図示しない操作入力部から、ユーザの操作を受け、入力されたデータを高音質記録するか、すなわち、オリジナルデータを復元して記録する処理を実行するか否かを示す情報の入力を受けるとともに、追加データの入力を受け、符号列復元部９３の処理を制御する。
【０２５２】
符号列復元部９３は、制御部９２の制御に基づいて、入力された試聴データが記録される場合は、入力された符号化フレームをそのまま記録部１５１に出力し、オリジナルデータが復元されて記録される場合には、入力された試聴データを、制御部９２から供給される、試聴領域情報、真の符号化係数情報、真のスペクトル係数情報などの各種情報を基に、オリジナルデータの符号化フレームに復元する処理を実行し、復元されたオリジナルデータの符号化フレームを、記録部１５１に出力する。
【０２５３】
記録部１５１は、例えば、磁気ディスク、光ディスク、光磁気ディスク、半導体メモリ、あるいは、磁気テープなどの記録媒体に、所定の方法でデータを記録する。また、記録部１５１は、例えば、基板などに備えられているメモリや、ハードディスクなどのように、その内部に情報を記録するものであってもかまわない。例えば、記録部１５１が、光ディスクにデータを記録することが可能である場合、記録部１５１は、光ディスクに記録するために適したフォーマットにデータを変換するエンコーダ、レーザダイオードなどのレーザ光源、各種レンズ、および、偏向ビームスプリッタなどから構成される光学ユニット、光ディスクを回転駆動するスピンドルモータ、光学ユニットを光ディスクの所定のトラック位置に駆動する駆動部、並びにそれらを制御する制御部などから構成される。
【０２５４】
なお、記録部１５１に装着される記録媒体は、符号列分解部９１、あるいは、制御部９２に入力される試聴データ、あるいは、追加データが記録されていた記録媒体と同一のものであっても良い。
【０２５５】
次に、図３０のフローチャートを参照して、データ記録装置１４１が実行するデータ記録処理について説明する。
【０２５６】
符号列分解部９１は、ステップＳ８１において、試聴データの符号化フレームの入力を受け、ステップＳ８２において、入力された符号列を分解し、符号列復元部９３に出力する。
【０２５７】
ステップＳ８３において、符号列復元部９３は、制御部９２から入力される信号を基に、高音質記録が実行されるか否かを判断し、高音質記録が実行されると判断された場合、ステップＳ８４において、図２８のフローチャートを用いて説明した符号列復元処理が実行される。
【０２５８】
ステップＳ８３において、高音質記録が実行されると判断されなかった場合、もしくはステップＳ８４の処理の終了後、ステップＳ８５において、記録部１５１は、入力されたオリジナルデータ、もしくは試聴データに対応する符号列を、装着された記録媒体などに記録する。
【０２５９】
ステップＳ８６において、制御部９２は、ステップＳ８５において、記録部１５１によって記録されたのは、オリジナルデータ、もしくは試聴データに対応する符号列の最終フレームであるか否かを判断する。
【０２６０】
ステップＳ８６において、最終フレームではないと判断された場合、処理は、ステップＳ８１に戻り、それ以降の処理が繰り返される。ステップＳ８６において、最終フレームであると判断された場合、処理は終了される。
【０２６１】
本発明を適用することにより、正規化係数情報の値を変更し、更に、正規化係数情報の値を変更したことにより、再生処理時に参照されなくなるデータ（例えば、対応するスペクトル係数情報）を制御情報に置き換えて、試聴データを生成することができる。このような試聴データからオリジナルデータを推測することは非常に困難であり、また、試聴データを不正に改変しようとすると、かえって音質を劣化させる原因となるので、コンテンツの著作権や、コンテンツ販売者の権利を保護することが可能である。
【０２６２】
そして、試聴データが再生される場合、低音質で試聴可能な試聴領域が、１箇所のみならず複数設定されていても、試聴データを構成する試聴フレームのうち、再生処理に関係ない部分（例えば、参照されないスペクトル係数情報に対応する領域）に挿入され、対応する試聴フレームが、試聴可能であるか否かを示す制御情報が参照されることにより、試聴領域に対応する試聴フレームのみが選択的に再生される。従って、試聴データを再生した場合、再生開始直後に試聴領域が再生され、不自然な無音の再生が行われるようなことがない。
【０２６３】
更に、制御情報に試聴開始フラグおよび試聴終了フラグなどの情報を記載することにより、例えば、試聴領域が複数ある場合に、試聴データの再生時に、それぞれの開始位置および終了位置において、フェードインおよびフェードアウトなどの処理を施すことができるので、本来連続しないはずの試聴領域が、同じ音量で連続して再生されることがなくなり、ユーザにとって、聞きやすい試聴データを提供することができる。
【０２６４】
更に、制御情報に、置き換えられたデータに対応する真の値（例えば、真の正規化係数情報、真のスペクトル係数情報、真の量子化精度情報、あるいは、量子化ユニット数など）の一部、あるいは全部と、再生可能フラグなどの情報とを記載することにより、例えば、再生装置の種類やバージョンなどに応じて、あるいは、所定のキーワードの入力があるか否かなどに応じて、試聴データの品質を制御することが可能となる。
【０２６５】
そして、試聴データの生成時に変更された、あるいは置き換えられたデータに対応する真の値（例えば、真の正規化係数情報、真のスペクトル係数情報、真の量子化精度情報、あるいは、量子化ユニット数など）が記載された追加フレームにより構成される追加データを作成するようにしたので、追加データを用いて、試聴データからオリジナルデータを復元することが可能である。
【０２６６】
更に、追加フレームに記載される情報の一部、もしくは全部を、制御情報に含ませるようにした場合、追加データのデータ容量を少なくすることができるので、例えば、ユーザが、追加データをダウンロード処理により手に入れようとした場合など、通信時間を短くすることができる。
【０２６７】
本発明を適用することにより、試聴データ、および、復元されたオリジナルデータを、再生出力したり、記録媒体に記録したリ、ネットワークなどを介して他の機器に出力することが可能である。
【０２６８】
以上では、オーディオ信号によるコンテンツデータの試聴データおよび対応する追加データを生成したり、試聴データおよび追加データから、オリジナルデータを復元して、再生したり、記録する処理について説明したが、本発明は、画像信号、あるいは、画像信号とオーディオ信号からなるコンテンツデータにも適応することが可能である。
【０２６９】
例えば、画像信号によるコンテンツデータを、二次元ＤＣＴを用いて変換し、多様な量子化テーブルを用いて量子化する場合、ダミーの量子化テーブルとして、高域成分を欠落させたものを指定し、必要に応じて、ダミーに対応する高域部分のスペクトル係数情報の領域に、制御情報を記録して試聴データとする。追加データには、欠落された量子化テーブルの高域成分、および置き換えられたスペクトル係数情報が記載される。
【０２７０】
そして、オリジナルデータの復元時には、追加データを用いて、高域成分が欠落されていない真の量子化テーブルが復元され、真のスペクトル係数情報が復元されるので、オリジナルデータを復元して復号することができる。
【０２７１】
上述した一連の処理は、ハードウエアにより実行させることもできるが、ソフトウエアにより実行させることもできる。この場合、例えば、符号化装置１、データ再生装置８１、もしくは、データ記録装置１４１は、図３１に示されるようなパーソナルコンピュータ１６１により構成される。
【０２７２】
図３１において、CPU１７１は、ROM１７２に記憶されているプログラム、または記憶部１７８からRAM１７３にロードされたプログラムに従って、各種の処理を実行する。RAM１７３にはまた、CPU１７１が各種の処理を実行する上において必要なデータなども適宜記憶される。
【０２７３】
CPU１７１、ROM１７２、およびRAM１７３は、バス１７４を介して相互に接続されている。このバス１７４にはまた、入出力インタフェース１７５も接続されている。
【０２７４】
入出力インタフェース１７５には、キーボード、マウスなどよりなる入力部１７６、ディスプレイやスピーカなどよりなる出力部１７７、ハードディスクなどより構成される記憶部１７８、モデム、ターミナルアダプタなどより構成される通信部１７９が接続されている。通信部１７９は、インターネットを含むネットワークを介しての通信処理を行う。
【０２７５】
入出力インタフェース１７５にはまた、必要に応じてドライブ１８０が接続され、磁気ディスク１９１、光ディスク１９２、光磁気ディスク１９３、或いは半導体メモリ１９４などが適宜装着され、それらから読み出されたコンピュータプログラムが、必要に応じて記憶部１７８にインストールされる。
【０２７６】
一連の処理をソフトウエアにより実行させる場合には、そのソフトウエアを構成するプログラムが、専用のハードウエアに組み込まれているコンピュータ、または、各種のプログラムをインストールすることで、各種の機能を実行することが可能な、例えば汎用のパーソナルコンピュータなどに、ネットワークや記録媒体からインストールされる。
【０２７７】
この記録媒体は、図３１に示されるように、装置本体とは別に、ユーザにプログラムを供給するために配布される、プログラムが記憶されている磁気ディスク１９１（フロッピディスクを含む）、光ディスク１９２（ＣＤ-ＲＯＭ（Compact Disk-Read Only Memory），ＤＶＤ（Digital Versatile Disk）を含む）、光磁気ディスク１９３（ＭＤ（Mini-Disk）（商標）を含む）、もしくは半導体メモリ１９４などよりなるパッケージメディアにより構成されるだけでなく、装置本体に予め組み込まれた状態でユーザに供給される、プログラムが記憶されているROM１７２や、記憶部１７８に含まれるハードディスクなどで構成される。
【０２７８】
なお、本明細書において、記録媒体に記憶されるプログラムを記述するステップは、含む順序に沿って時系列的に行われる処理はもちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理をも含むものである。
【０２７９】
以上、本発明の実施の形態について説明したが、本発明はかかる構成に限定されない。すなわち、特許請求の範囲に記載された発明およびそれと均等な発明は、本発明に与えられるであろう特許の権利範囲に含まれるものと理解される。
【０２８０】
例えば、上記実施の形態では、図８に記載の符号化フォーマットを有する第１のデータ列を例示したが、本発明はかかる構成に限定されない。本発明では、第２のデータ列を生成可能な符号化フォーマットであれば、他の様々な符号化フォーマットを第１のデータ列に適用することができる。ここで、第２のデータ列とは、第１のデータ列の第１のデータを第２のデータに置き換えることにより生成され、第１のデータ列と実質的に同一の方法で再生可能なデータ列である。なお、本発明では、第1のデータ列の符号化フォーマットとして、例えば、符号列上で、少なくともコンテンツデータに係る部分（例えば、スペクトル係数、画素値等）と該符号の復号化に必要な符号化パラメータ（例えば、量子化精度情報、正規化係数情報等）とが多重化されるフォーマットを適用することが好適である。この場合、さらに、例えば、コンテンツの属性等を記述したメタ情報、著作権管理情報、或いは暗号化情報等を、当該符号列上に多重化することも可能である。
【０２８１】
【発明の効果】
以上のように、第１の本発明によれば、データ列を変換することができる。
また、第１の本発明によれば、例えば、オリジナルデータなどの元のデータ列を、例えば、その一部のみを再生可能とし、再生できない部分は再生処理時に処理されずにスキップされるような制御情報を含む試聴データなどのデータ列に変換し、更に、変換されたデータ列から元のデータ列に復元するために必要な、例えば、追加データなどのデータ列を生成することができる。
【０２８２】
第２の本発明によれば、データ列の入力を受けて再生することが出来る。
また、第２の本発明によれば、制御情報を基に、再生可能な部分と、再生不可能な部分を判断して、再生可能な部分だけを連続して再生することにより、例えば、無音の再生、あるいは、雑音のみの再生を行わないようにすることができる。
【０２８３】
第３の本発明によれば、データ列を復元することができる。
また、第３の本発明によれば、データ列を復元するための、例えば、追加データなどのデータ列を、別途取得することにより、試聴データなどのデータ列から、オリジナルデータなどの基となるデータ列を復元することができる。
【図面の簡単な説明】
【図１】本発明を適用した符号化装置の構成を示すブロック図である。
【図２】図１の変換部の構成を示すブロック図である。
【図３】スペクトル信号と量子化ユニットについて説明する図である。
【図４】図１の信号成分符号化部の構成を示すブロック図である。
【図５】トーン成分および非トーン成分について説明するための図である。
【図６】図４のトーン成分符号化部の構成を示すブロック図である。
【図７】図４の非トーン成分符号化部の構成を示すブロック図である。
【図８】オリジナルデータのフレームのフォーマットについて説明する図である。
【図９】図１の試聴データ生成部の構成を示すブロック図である。
【図１０】入力されるフレーム列の試聴領域および保護領域について説明する図である。
【図１１】試聴フレームのフォーマットについて説明する図である。
【図１２】図１１の試聴フレームに対応するスペクトル信号について説明する図である。
【図１３】制御情報を用いてスペクトル係数の一部を書き換えた試聴フレームについて説明する図である。
【図１４】制御情報について説明する図である。
【図１５】追加フレームを説明する図である。
【図１６】制御情報について説明する図である。
【図１７】試聴データ生成処理について説明するフローチャートである。
【図１８】試聴データ生成処理について説明するフローチャートである。
【図１９】トーン成分が分離されない場合のオリジナルデータのフレームについて説明する図である。
【図２０】トーン成分が分離されない場合の試聴フレームについて説明する図である。
【図２１】トーン成分が分離されない場合の追加フレームについて説明する図である。
【図２２】本発明を適用したデータ再生装置の構成を示すブロック図である。
【図２３】図２２の信号成分復号部の構成を示すブロック図である。
【図２４】図２３のトーン成分復号部の構成を示すブロック図である。
【図２５】図２３の非トーン成分復号部の構成を示すブロック図である。
【図２６】図２２の逆変換部の構成を示すブロック図である。
【図２７】データ再生処理について説明するフローチャートである。
【図２８】符号列復元処理について説明するフローチャートである。
【図２９】本発明を適用したデータ記録装置の構成を示すブロック図である。
【図３０】データ記録処理について説明するフローチャートである。
【図３１】パーソナルコンピュータの構成を示すブロック図である。
【符号の説明】
１符号化装置，１１変換部，１２信号成分符号化部，１３符号列生成部，１４試聴データ生成部，６１試聴データ生成処理制御部，６２試聴領域判定部，６３帯域制限処理部，６４制御情報挿入部，６５追加フレーム生成部，６６試聴データ生成部，６７追加データ生成部，９２制御部，９３符号列復元部[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a data conversion method and data conversion device, a data reproduction method and data reproduction device, a data restoration method and data restoration device, a data format, a recording medium, and a program, and in particular, when content preview data is distributed to users. The present invention relates to a data conversion method and a data conversion device, a data reproduction method and a data reproduction device, a data restoration method and a data restoration device, a data format, a recording medium, and a program that are suitable for use in a computer.
[0002]
[Prior art]
For example, due to the spread of communication network technologies such as the Internet, improvement of information compression technology, and the progress of higher integration or higher density of information recording media, audio, still images, moving images, or audio For example, digital contents composed of various multimedia data such as movies made of moving images are distributed to viewers for a fee via a communication network.
[0003]
For example, a store that sells package media such as CD (Compact Disk) and MD (Mini-Disk) (trademark), that is, a recording medium on which digital contents are recorded in advance, for example, a large number of music data and the like. By installing an information terminal such as a so-called MMK (Multi Media KIOSK) in which digital content is stored, it is possible to sell not only package media but also digital content.
[0004]
The user inserts a recording medium such as MD brought into the MMK, refers to the menu screen, selects the title of the digital content to be purchased, and pays for the requested content. The payment method may be cash insertion, electronic money exchange, or electronic payment using a credit card or prepaid card. The MMK records the selected digital content data on a recording medium inserted by the user by a predetermined process.
[0005]
As described above, a seller of digital contents can distribute digital contents to users via the Internet, for example, in addition to selling digital contents to users using MMK.
[0006]
In this way, not only selling package media in which content is recorded in advance, but also by introducing a technique for selling digital content itself, content has been distributed more effectively.
[0007]
In order to distribute digital contents while protecting the copyright, for example, by using a technique such as Japanese Patent Application Laid-Open No. 2001-103447 or Japanese Patent Application Laid-Open No. 2001-325460, a part other than the portion where the digital content can be auditioned is encrypted. Only the user who has distributed and purchased the decryption key for encryption can be allowed to listen to all the contents. As an encryption method, for example, an initial value of a random number sequence that is a key signal for a bit string of PCM (Pulse Code Modulation) digital audio data is given, and the generated random number sequence of 0/1 and PCM data to be distributed are A method is known in which an exclusive OR is used as an encrypted bit string. The digital content encrypted in this manner is distributed to users by being recorded on a recording medium using the above-described MMK or distributed via a network, for example. If the user who has obtained the encrypted digital content data does not get the key, he or she can only listen to the unencrypted part that can be auditioned, and plays back without decrypting the encrypted part. Even so, only noise can be auditioned.
[0008]
In addition, technologies for compressing and broadcasting audio data and the like, distributing it via a network, and recording compressed data on various types of recording media such as magneto-optical disks have also been improved.
[0009]
There are various methods for high-efficiency encoding of audio data. For example, the audio signal on the time axis is not blocked, but is divided into a plurality of frequency bands and encoded (SBC (Sub) Band Coding)), or a block frequency band division method (so-called transform coding) that performs spectrum conversion of a signal on the time axis into a signal on the frequency axis, divides the signal into multiple frequency bands, and encodes each band. and so on. In addition, after band division by band division coding, a method is conceived in which, in each band, a signal is spectrum-converted into a signal on the frequency axis and coding is performed for each spectrum-converted band.
[0010]
  The filter used here is, for example, QMF (Quadrature Mirror Filter). Regarding QMF, “Digital coding of speech in subbands” (Bell Syst. Tech. J. Vol. 55, No. 8) by RE Crochiere. 1974). Also, “Polyphase Quadrature by Joseph H. RothweilerFiltersDocuments such as “-A new subband coding technique” (ICASSP 83, BOSTON) describe filter division techniques with equal bandwidth.
[0011]
As the above-described spectral transformation, for example, the input audio signal is blocked in a predetermined unit time (frame), and discrete Fourier transform (DFT), discrete cosine transform (DCT) is used for each block. Transform), modified DCT transform (MDCT; modified Discrete Cosine Transform), and the like. For example, details of MDCT can be found in the paper "Subband / Transform Cording Using Filter Bank Designs Based on Time Domain Aliasing Cancellation" (ICASSP 1987) by JP Princen, AB Bradley (Univ. Of Surrey Royal Melbourne Inst. Of Tech.) And others. It is stated in.
[0012]
Further, when the above-described DFT or DCT is used as a method for spectrally converting a waveform signal, M independent real data can be obtained by performing conversion in a time block composed of M samples. In order to reduce the connection distortion between time blocks, N / 2 samples are usually overlapped with each adjacent block, that is, N samples are overlapped on both sides, so in DFT and DCT, on average, , (M + N) samples of independent M pieces of real number data are quantized and encoded.
[0013]
On the other hand, when the above-described MDCT is used as a spectrum conversion method, when the conversion is performed with a time block composed of M samples, M / 2 each of the adjacent blocks, that is, both sides, Since M independent real data is obtained from 2M samples that are overlapped by M in total, in MDCT, on average, M real data is quantized for M samples. Will be encoded.
[0014]
In the decoding apparatus, a waveform signal can be reconstructed by adding waveform elements obtained by inversely transforming each block from codes obtained using MDCT while interfering with each other.
[0015]
In general, lengthening the time block for conversion increases the frequency resolution of the spectrum and concentrates energy on specific spectral components. Therefore, by using the MDCT which performs conversion with a long block length by overlapping with both adjacent blocks by half, and the number of obtained spectrum signals does not increase with respect to the number of time samples as a basis. By performing conversion, encoding can be performed more efficiently than when DFT or DCT is used for conversion. Further, by providing a sufficiently long overlap between adjacent blocks, it is possible to reduce distortion between blocks of the waveform signal.
[0016]
As described above, by quantizing the signal divided for each band by filtering or spectral conversion, it is possible to control the band in which the quantization noise is generated. Therefore, more efficient encoding can be performed. In addition, before performing quantization, by performing normalization for each band, for example, with the maximum absolute value of the signal component in the band, further efficient encoding can be performed.
[0017]
When quantizing each frequency component divided into frequency bands, for example, the frequency division width may be determined in consideration of human auditory characteristics. In other words, the audio signal may be divided into a plurality of bands (for example, 25 bands) so that the higher the band generally called a critical band (critical band), the wider the bandwidth.
[0018]
In addition, when the band is divided so that the critical band becomes wide, when data for each band is encoded, predetermined bit allocation may be performed for each band, or adaptation may be performed for each band. Alternatively, bits may be assigned (bit allocation is performed).
[0019]
For example, when coefficient data obtained by MDCT is encoded by bit allocation, the number of bits is adaptively allocated to MDCT coefficient data for each band obtained by MDCT for each block, Encoding is performed. As the bit allocation method, for example, the following two methods are known.
[0020]
R. Zelinski, P. Noll et al., "Adaptive Transform Coding of Speech Signals" (IEEE Transactions of Acoustics, Speech, and Signal Processing, Vol. ASSP-25, No. 4, August 1977) It is described that bit allocation is performed based on the signal size. According to this method, the quantization noise spectrum is flattened and the noise energy is minimized, but it is optimal in terms of reducing the noise that can actually be heard by the human ear because the masking effect is not used when considered auditorily. is not.
[0021]
In addition, the paper of “The critical band coder digital encoding of the perceptual requirements of the auditory system” (ICASSP 1980) by MA Kransner (Massachusetts Institute of Technology) is required for each band by using auditory masking. A technique for obtaining a stable signal-to-noise ratio and performing fixed bit allocation is described. However, in this method, even when the characteristic is measured with a sine wave input, the bit allocation is fixed, and the characteristic value is not so good.
[0022]
In order to solve these problems, all bits that can be used for bit allocation are divided into fixed bit allocation patterns determined in advance for each small block and bit allocation depending on the signal size of each block. High-efficiency coding in which the division ratio is dependent on the signal related to the input signal, and the division ratio into the fixed bit allocation pattern is increased as the spectrum of the signal is smoother. A device has been proposed.
[0023]
By using this method, when energy is concentrated in a specific spectrum, such as a sine wave input, a large number of bits can be assigned to the block including the spectrum, so the overall signal-to-noise characteristics Can be significantly improved. In general, human hearing for signals with steep spectral components is very sensitive, so improving signal-to-noise characteristics using such methods is not only a characteristic value for measurement, but also human characteristics. Is effective in improving the quality of the sound that you actually hear.
[0024]
Many methods other than those described above have been proposed for bit allocation. In addition, the auditory model has been refined and the encoding device has been improved so that not only characteristic values for measurement but also human hearing can be encoded more efficiently. Yes. In these methods, a real bit allocation reference value is obtained so that the signal-to-noise characteristic obtained by calculation is realized as faithfully as possible, and an integer value approximating it is obtained and set to the number of allocated bits. It is common to be done.
[0025]
Further, Japanese Patent Application No. Hei 5-152865 or WO94 / 28633 filed earlier by the present inventor discloses a tone component that is particularly important from the viewpoint of the generated spectrum signal, that is, around a specific frequency. A method is described in which a component in which energy is concentrated is separated and encoded separately from other spectral components. By this method, it is possible to effectively encode an audio signal or the like at a high compression rate with almost no auditory degradation.
[0026]
When generating an actual code string, first, for each band in which normalization and quantization are performed, quantization accuracy information and normalization coefficient information are encoded with a predetermined number of bits, then normalization, and The quantized spectral signal is encoded. Also, ISO / IEC 11172-3; (1993 (E), a933) describes a high-efficiency encoding method that is set so that the number of bits representing quantization accuracy information differs depending on the band. It is standardized so that the number of bits representing quantization accuracy information decreases as it becomes a region.
[0027]
Instead of directly encoding quantization accuracy information, for example, a method of determining quantization accuracy information from normalized coefficient information in a decoding device is also known, but in this method, when the standard is set, Since the relationship between the quantization coefficient information and the quantization accuracy information is determined, it becomes impossible to introduce control using quantization accuracy based on a more advanced auditory model in the future. In addition, when there is a range in the compression rate to be realized, it becomes necessary to define the relationship between the normalization coefficient information and the quantization accuracy information for each compression rate.
[0028]
As a method for encoding quantized spectral signals more efficiently, for example, it is described in a paper of “A Method for Construction of Minimum Redundancy Codes” (Proc. IRE, 40, p. 1098, 1952) by DA Huffman. There is also known a method for efficiently performing encoding using a variable length code.
[0029]
The signal encoded by the method described above can be encrypted and distributed in the same manner as the PCM signal. When this scrambling method is used, the key signal is obtained. Those that do not can reproduce the original signal. In addition, there is a method of encoding for compression after converting a PCM signal into a random signal instead of encrypting the encoded bit string. However, when this scrambling method is used, the key signal is Those that are not available can only reproduce noise.
[0030]
Further, distribution of content data trial data can promote sales of content data. The sample data includes, for example, data reproduced with lower sound quality than the original data, data capable of reproducing a part of the original data (for example, only the rust portion), and the like. When the user plays the trial listening data and likes it, he / she can purchase a key to decrypt the encryption so that the original sound can be played back, or a new recording medium on which the original sound data is recorded can be obtained. Try to buy.
[0031]
However, in the above-described scramble method, the entire data cannot be reproduced or all is reproduced as noise. For example, a recording medium in which sound is recorded with a relatively low sound quality is used for distribution as audition data. I could not do it. Even if the data scrambled by these methods is distributed to the user, the user cannot grasp the outline of the entire data.
[0032]
In addition, in the conventional method, when encrypting a signal that has been subjected to high-efficiency encoding, the compression efficiency is not lowered while giving a meaningful code string to a reproduction apparatus that is usually widely used. It was very difficult to do. That is, as described above, when a code string generated by performing high-efficiency encoding is scrambled, even if the code string is reproduced without being descrambled, not only noise is generated, but also by scrambling. If the generated code string does not conform to the original high efficiency code standard, there is a possibility that the reproduction process cannot be executed at all.
[0033]
Conversely, when high-efficiency encoding is performed after scrambling a PCM signal, for example, if the amount of information is reduced using the auditory property, irreversible encoding is performed. Therefore, even when such a high-efficiency code is decoded, a signal obtained by scrambling a PCM signal cannot be reproduced correctly. That is, such a signal is very difficult to correctly descramble.
[0034]
Therefore, a method has been selected that can correctly scramble even if the compression efficiency decreases.
[0035]
In response to such a problem, the present inventors, for example, disclosed in Japanese Patent Laid-Open No. 10-135944, only the code corresponding to the high band was encoded out of the music data converted into a spectrum signal and encoded. Disclosed is an audio encoding system that enables even a user who does not have a key to decrypt and reproduce an unencrypted narrowband signal by distributing the data as trial listening data. In this method, the high frequency side code is encrypted, the high frequency bit allocation information is replaced with dummy data, and the high frequency true bit allocation information is reproduced by the decoder that performs the reproduction process. Information is recorded at a position where it is not read (ignored) during processing.
[0036]
By adopting this method, the user receives distribution of the audition data, reproduces the audition data, purchases a key for decrypting the favorite audition data into the original data as a result of the audition, and pays for it. It will be possible to play music of all types correctly and enjoy it with high sound quality.
[0037]
[Problems to be solved by the invention]
In the technique disclosed in Japanese Patent Laid-Open No. 10-135944, a user who does not have a key can decrypt only a narrowband signal of data distributed free of charge. However, since its security depends only on encryption, if the encryption is decrypted, the user can play high-quality music without paying a fee. A data distributor (content provider) cannot legitimately collect charges.
[0038]
In addition, the content provider does not limit the quality of the entire content of the audition data, but enables the audition by limiting the quality of only a part of the content or only a few places, and the quality of the other parts Even if the data is restricted, there are cases where it is desired that the audition cannot be performed.
[0039]
For example, when distributing audition data free of charge, if you want to allow the user to audition only by limiting the quality of the rusted part of the song to tens of seconds, It is desirable to make it impossible to listen to a part other than the part that can be auditioned, and to create trial data that can reproduce the part that can be auditioned unnaturally.
[0040]
As described above, by using a technique that prevents a part that cannot be auditioned from being auditioned, even if the data is reproduced, the corresponding part (part other than rust) can be silenced. However, when listening to sample data that is silent except for the rust portion and only the rust portion is reproduced with low quality, the decoding process is performed sequentially from the beginning of the data, so that the portion that can be auditioned Is reproduced without sound, and it is unnatural that no sound is reproduced and output until the part of the data that can be auditioned is decoded.
[0041]
The present invention has been made in view of such a situation, and when distributing audition data in which only one part of the content data can be reproduced with low sound quality and the other part cannot be reproduced, only the reproducible part is distributed. Can be reproduced naturally, and the audition data in which the header length of the audition data is not changed can be generated, and such audition data can be restored to the original data.
[0042]
[Means for Solving the Problems]
  The data conversion method of the present invention is included in the first data string., At least part of the data used to reproduce the predetermined informationThe first data, When used for reproducing the predetermined information, the reproduction quality of the predetermined information is deteriorated as compared with the case of the first data string.The first replacement step for replacement with the second data and the first data are included in different areas., At least part of the data used to reproduce the predetermined informationThird dataAt least some ofThe, Control information referred to when reproducing the second data stringReplace with 4th dataAt the same time, the other part is a non-reference area that is not referred to as reproduction data when the second data string is reproduced.A second replacement step, a first replacement step, and a first generation step for generating a second data string using the data generated by the second replacement step and the data generated by the second replacement step.IncludeIt is characterized by that.
[0044]
  The first data string and the second data string can each be composed of a plurality of frames, and the third data can be recorded in a plurality of frames,4th dataCan be control information indicating whether or not the corresponding frame can be reproduced.
[0045]
  A second generation step of generating a third data string necessary for restoring the second data string generated by the processing of the first generation step into the first data string; The third data string generated by the process of the second generation step can beA portion that includes the first data and is replaced with the fourth data by the processing of the second replacement step in the third dataCan be included.
[0054]
  A frequency component conversion step for converting the input data into frequency components, and an encoding step for encoding the data converted into frequency components by the processing of the frequency component conversion step can be further included. In the process of 1 replacement step, the encoded data encoded by the process of the encoding step can be used as the first data string, and the first data can be replaced with the second data.In the process of the second replacement step, the encoded data encoded by the process of the encoding step is used as the first data string, and at least a part of the third data is replaced with the second data. can do.
The third data includes spectral coefficient information of the frequency component converted by the processing of the frequency component conversion step.At least some ofCan be included.
The first data may include at least a part of the normalization coefficient information of the encoding process by the process of the encoding step.
The first data may include at least a part of the quantization accuracy information of the encoding process by the process of the encoding step.
The first data may include at least a part of information representing the number of quantization units of the encoding process by the process of the encoding step.
The first data may be data indicating a predetermined numerical value, and the second data is a value obtained by minimizing the numerical value of the first data.
The second data can be obtained by replacing at least part of the first data with random data.
[0060]
  The data converter of the present invention is included in the first data string., At least part of the data used to reproduce the predetermined informationThe first data, When used for reproducing the predetermined information, the reproduction quality of the predetermined information is deteriorated as compared with the case of the first data string.The first replacement means for replacing with the second data is included in a different area from the first data., At least part of the data used to reproduce the predetermined informationThird dataAt least some ofThe, Control information referred to when reproducing the second data stringReplace with 4th dataAt the same time, the other part is a non-reference area that is not referred to as reproduction data when the second data string is reproduced.A second replacing unit; a first replacing unit; and a generating unit configured to generate a second data string using the data generated by the second replacing unit.PrepareIt is characterized by that.
[0062]
  The first program of the present invention is included in the first data string.It is at least a part of data used for reproducing predetermined informationThe first data, In order to reproduce the reproduction quality of the predetermined informationThe first replacement step for replacement with the second data and the first data are included in different areas., At least part of the data used to reproduce the predetermined informationThird dataAt least some ofThe, Control information referred to when reproducing the second data stringReplace with 4th dataAt the same time, the other part is a non-reference area that is not referred to as reproduction data when the second data string is reproduced.A second replacement step, a process of the first replacement step, and a generation step of generating a second data string using the data generated by the process of the second replacement step.IncludeIt is characterized by that.
[0063]
  The data reproduction method of the present inventionThe first data string includes first data that is at least a part of data used to reproduce predetermined information, and is included in a different area from the first data. Third data that is at least a part of data used for reproducing the predetermined information is further included, and the second data string includes the first data in the first data string. Of the third data in the first data string is replaced with the second data that deteriorates the reproduction quality of the predetermined information as compared with the case of the first data string. Is replaced with the fourth data, which is control information referred to when the second data string is reproduced, and the other part is reproduced data when the second data string is reproduced. As Has been set to non-reference region which is not irradiation,Contained in the second data column4th dataA reproduction control step for controlling the reproduction of the second data string based onIncludeIt is characterized by that.
[0064]
  4th dataIs information for designating whether or not the corresponding frame can be reproduced when the reproduction of the second data string is controlled by the process of the reproduction control step.includingCan be.
[0067]
  4th dataCan include at least a part of the first data replaced with the second data.
[0068]
  4th dataWhen the reproduction of the second data string is controlled by the process of the reproduction control step, the second data is restored using at least a part of the first data included in the control information. It may be assumed that information indicating whether or not is possible is further included.
[0074]
  Of the second data columnAn inverse transformation step for inverse transformation of the frequency component, and a decoding step for decoding the second data string inversely transformed by the processing of the inverse transformation step may be further provided.it can.
The third data may include at least a part of the spectral coefficient information of the frequency component converted by the processing of the frequency component conversion step.
The first data may include at least a part of the normalization coefficient information of the encoding process by the process of the encoding step.
The first data may include at least a part of the quantization accuracy information of the encoding process by the process of the encoding step.
The first data may include at least a part of information indicating the number of quantization units of the encoding process by the process of the encoding step.
[0075]
  The first data may be data indicating a predetermined numerical value, and the second data may be data obtained by minimizing the numerical value of the first data.
  The second data can be obtained by replacing at least part of the first data with random data.
[0080]
  The second program of the present invention is:The first data string includes first data that is at least a part of data used to reproduce predetermined information, and is included in a different area from the first data. Third data that is at least a part of data used for reproducing the predetermined information is further included, and the second data string includes the first data in the first data string. Of the third data in the first data string is replaced with the second data that deteriorates the reproduction quality of the predetermined information as compared with the case of the first data string. Is replaced with the fourth data, which is control information referred to when the second data string is reproduced, and the other part is reproduced data when the second data string is reproduced. As When those which are the non-reference region which is not irradiation,Contained in the second data column4th dataA reproduction control step for controlling the reproduction of the second data string based onIncludeIt is characterized by that.
[0081]
  The data restoration method of the present inventionThe first data string includes first data that is at least a part of data used for reproducing predetermined information, and the second data string is the first data string. When the first data is used to reproduce the predetermined information, the first data is replaced with second data that deteriorates the reproduction quality of the predetermined information as compared with the case of the first data string. 2Data columnFirstBased on the acquisition control step for controlling the acquisition of the third data string including information necessary for restoring the data string, and the third data string whose acquisition is controlled by the processing of the acquisition control step, the second Data columnTo the first data columnAnd the third data string includes first data for restoring the second data string, and in the process of the restoration step, the acquisition is controlled by the process of the acquisition control step The first data included in the third data column isSecondBy replacing the second data contained in the data column,FirstThe data string is restored, and part of the second data isSecondThe control information is referred to when the data string is reproduced.
[0101]
  In the data conversion method and data conversion apparatus of the present invention, and the first program, they are included in the first data string., At least part of the data used to reproduce the predetermined informationThe first data, When used for reproducing the predetermined information, the reproduction quality of the predetermined information is deteriorated as compared with the case of the first data string.Replaced with the second data and included in a different area from the first data, At least part of the data used to reproduce the predetermined informationThird dataAt least some ofBut, Control information referred to when reproducing the second data stringReplaced by the fourth dataAt the same time, the other part is a non-reference area that is not referred to as reproduction data when the second data string is reproduced.Using the data generated by data replacement, the second data string is generatedBe done.
[0102]
  In the data reproduction method and the second program of the present invention,The first data string includes first data that is at least a part of data used to reproduce predetermined information, and is included in a different area from the first data. Third data that is at least a part of data used for reproducing the predetermined information is further included, and the second data string includes the first data in the first data string. Of the third data in the first data string is replaced with the second data that deteriorates the reproduction quality of the predetermined information as compared with the case of the first data string. Is replaced with the fourth data, which is control information referred to when the second data string is reproduced, and the other part is reproduced data when the second data string is reproduced. As Has been set to non-reference region which is not irradiation,Contained in the second data column4th dataThe second data string is played back based onBe done.
[0103]
  In the data restoration method of the present invention,The first data string includes first data that is at least a part of data used for reproducing predetermined information, and the second data string is the first data string. When the first data is used to reproduce the predetermined information, the first data is replaced with second data that deteriorates the reproduction quality of the predetermined information as compared with the case of the first data string. 2Data columnFirstA third data string including information necessary for restoring the data string is acquired, and the second data string is obtained based on the third data string.To the first data columnIs restored and the third data column isFirstIncluding the first data for restoring the data string, the first data included in the third data string,SecondBy replacing the second data contained in the data column,FirstThe data string is restored and part of the second data isSecondThis is control information referred to when reproducing a data string.
[0105]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described with reference to the drawings.
[0106]
Here, high efficiency coding is performed by receiving input of a digital signal such as an audio PCM signal and performing band division coding (SBC), adaptive transform coding (ATC) and adaptive bit allocation. The case will be described. Adaptive transform coding is a coding method in which bit allocation is adapted based on discrete cosine transform (DCT) or the like, and an input signal is converted into a spectrum signal for each time block, and for each predetermined band, Normalizes spectrum signals together, that is, normalizes coefficients that approximate the maximum signal component, divides each signal component, and then quantizes and encodes the signal with the quantization accuracy determined by the nature of the signal. is there.
[0107]
FIG. 1 is a block diagram illustrating a configuration example of an encoding device 1 that receives an input of an acoustic waveform signal and creates trial listening data.
[0108]
The converter 11 receives an acoustic waveform signal, converts it into a signal frequency component, and outputs it to the signal component encoder 12. The signal component encoding unit 12 encodes the input signal frequency component and outputs the encoded signal frequency component to the code string generation unit 13. The code sequence generation unit 13 generates a code sequence from the signal frequency component encoded by the signal component encoding unit 12 and outputs the code sequence to the trial listening data generation unit 14. The trial-listening data generation unit 14 performs predetermined processing such as rewriting of normalization coefficient information and insertion of control information on the code string input from the code string generation unit 13 and can be reproduced with high sound quality. (Original data) is converted into trial data, and additional data (restoration data) corresponding to the trial data that is sold to a user who wants to reproduce the original data is generated and output.
[0109]
FIG. 2 is a block diagram showing a more detailed configuration of the conversion unit 11.
[0110]
The acoustic waveform signal input to the conversion unit 11 is divided into two bands by the band division filter 21, and the respective signals are output to the forward spectrum conversion units 22-1 and 22-2. The forward spectrum conversion units 22-1 and 22-2 convert the input signal into a spectrum signal component using, for example, MDCT, and outputs the signal to the signal component encoding unit 12. The signals input to the forward spectrum conversion units 22-1 and 22-2 are ½ of the bandwidth of the signal input to the band division filter 21, and the signal input is also decimated to ½. ing.
[0111]
In the conversion unit 11 of FIG. 2, the signal divided into two bands by the band division filter 21 has been described as being converted into a spectrum signal component using MDCT, but the input signal is converted into a spectrum signal component. Any conversion method may be used. For example, the input signal may be converted into a spectrum signal component using MDCT without dividing the band. Alternatively, the forward spectrum conversion units 22-1 and 22-2 may convert an input signal into a spectrum signal using DCT or DFT.
[0112]
  By using a so-called band division filter, it is possible to divide an input signal into band components, but MDCT, DCT, orDFTIt is preferable to perform spectral conversion using
[0113]
In FIG. 2, the input acoustic waveform signal has been described as being divided into two bands by the band division filter 21, but it goes without saying that the number of band divisions may not be two. . Information indicating the number of band divisions in the band division filter 21 is output to the code string generation unit 13 via the signal component encoding unit 12.
[0114]
FIG. 3 is a diagram showing the absolute value of the spectrum signal obtained by the MDCT obtained by the conversion unit 11 converted into a power level. The acoustic waveform signal input to the conversion unit 11 is converted into, for example, 64 spectrum signals for each predetermined time block. These spectral signals are processed by the signal component encoding unit 12 by the processing described later, for example, 16 pieces [1] to [16] as shown by 16 frames surrounded by solid lines in the figure. Quantization and normalization are performed for each band. A set of spectrum signals divided into 16 bands, that is, a set of spectrum signals to be quantized and normalized is a quantization unit.
[0115]
By changing the quantization accuracy for each quantization unit based on the distribution method of the frequency components, it is possible to perform efficient coding that can minimize deterioration of sound quality heard by humans.
[0116]
FIG. 4 is a block diagram showing a more detailed configuration of the signal component encoding unit 12. Here, the frequency component encoding unit 12 separates a signal component in which energy is concentrated around a specific frequency from an input spectrum signal, in particular, a tone portion that is particularly important for hearing, A case will be described where encoding is performed separately from the components.
[0117]
The spectrum signal input from the conversion unit 11 is separated into a tone component and a non-tone component by the tone component separation unit 31, the tone component is output to the tone component encoding unit 32, and the non-tone component is converted into a non-tone component. This is output to the component encoding unit 33.
[0118]
The tone component and the non-tone component will be described with reference to FIG. For example, when the spectrum signal input to the tone component separation unit 31 is a signal as shown in FIG. 5, particularly high power level portions are separated from the non-tone components as tone components 41 to 43. The position data P1 to P3 indicating the positions of the separated tone components 41 to 43 and the width of the frequency extracted as the tone components are detected and output to the tone component encoding unit 32 together with the tone components. The
[0119]
As a method for separating tone components, for example, the method described in Japanese Patent Application No. 5-152865, WO94 / 28633, or the like filed previously by the present inventor may be used. The tone component and the non-tone component separated by this method are quantized with different numbers of bits, respectively, by the processing of the tone component encoding unit 32 and the non-tone component encoding unit 33 described later.
[0120]
The tone component encoding unit 32 and the non-tone component encoding unit 33 encode the input signals, respectively, but the tone component encoding unit 32 increases the number of quantization bits with respect to the tone component, that is, Then, the quantization accuracy is increased and quantization is performed, and the non-tone component encoding unit 33 performs quantization with respect to the non-tone components by reducing the number of quantization bits, that is, by reducing the quantization accuracy.
[0121]
For each tone component, it is necessary to add information such as the position information of the tone component and the width of the frequency extracted as the tone component, but the spectrum signal of the non-tone component should be quantized with a small number of bits. Is possible. In particular, when the acoustic waveform signal input to the encoding device 1 is a signal in which energy is concentrated in a specific spectrum, this method makes it possible to hardly perceive auditory degradation. In addition, it is possible to effectively encode at a high compression rate.
[0122]
FIG. 6 is a block diagram showing a more detailed configuration of the tone component encoding unit 32 of FIG.
[0123]
The normalizing unit 51 receives the spectrum signal of the tone component for each quantization unit, performs normalization, and outputs the normalized signal to the quantization unit 52. The quantization accuracy determination unit 53 calculates the quantization accuracy with reference to the input quantization unit, and outputs the calculation result to the quantization unit 52. Since the input quantization unit is a tone component, the quantization accuracy determination unit 53 calculates the quantization accuracy so as to increase the quantization accuracy. The quantization unit 52 quantizes the normalization result input from the normalization unit 51 with the quantization accuracy determined by the quantization accuracy determination unit 53 to generate a code, and in addition to the generated code , Encoding information such as normalization coefficient information and quantization accuracy information is output.
[0124]
The tone component encoding unit 32 also encodes and outputs the position information of the tone component input together with the tone component together with the tone component.
[0125]
FIG. 7 is a block diagram showing a more detailed configuration of the non-tone component encoding unit 33 of FIG.
[0126]
The normalization unit 54 receives the non-tone component spectrum signal for each quantization unit, performs normalization, and outputs the normalized signal to the quantization unit 55. The quantization accuracy determination unit 56 calculates the quantization accuracy with reference to the input quantization unit and outputs the calculation result to the quantization unit 55. Since the input quantization unit is a non-tone component, the quantization accuracy determination unit 56 calculates the quantization accuracy so that the quantization accuracy is lowered. The quantization unit 55 quantizes the normalization result input from the normalization unit 54 with the quantization accuracy determined by the quantization accuracy determination unit 56 to generate a code, and in addition to the generated code , Encoding information such as normalization coefficient information and quantization accuracy information is output.
[0127]
Compared to the above-described encoding method, it is possible to further increase the encoding efficiency. For example, variable length coding is performed. A relatively short code length is assigned to a frequency signal that is quantized, and a relatively long code length is assigned to an infrequent one. By assigning, the encoding efficiency can be further increased.
[0128]
Then, the code string generation unit 13 in FIG. 1 records, for example, on the recording medium from the code of the signal frequency component output by the signal component encoding unit 12 or other information processing apparatus via the data transmission path. Are generated, that is, a code string composed of a plurality of frames is generated and output to the audition data generation unit 14. The code string generated by the code string generator 13 is audio data that can be reproduced with high sound quality by a normal decoder. FIG. 8 shows a frame format of audio data that can be reproduced with high sound quality generated by the code string generator 13.
[0129]
A fixed-length header including a synchronization signal is arranged at the head of each frame. In the header, the number of band divisions of the band division filter 21 of the conversion unit 11 described with reference to FIG.
[0130]
In each frame, tone component information regarding the separated tone components is recorded following the header. In the tone component information, the number of tone components, the tone width, and quantization accuracy information of quantization applied to the tone components by the tone component encoding unit 32 described with reference to FIG. 6 are recorded. Subsequently, as the data of the tone components 41 to 43, the respective normalization coefficients, tone positions, and spectrum coefficients are recorded. Here, for example, it is assumed that the normalization coefficient of the tone component 41 is 30, the normalization coefficient of the tone component 42 is 27, and the normalization coefficient of the tone component 43 is 24.
[0131]
Then, the non-tone component information is described following the tone component information. The non-tone component information includes the number of quantization units (16 in this case), quantization accuracy information of quantization applied to the non-tone components by the tone component encoding unit 33 described with reference to FIG. Normalization coefficient information and spectral coefficient information of each quantization unit are recorded. In the normalization coefficient information, a value from 46 in the lowest quantization unit [1] to a value 8 in the highest quantization unit [16] is recorded for each quantization unit. Here, it is assumed that a value proportional to the dB value of the power level of the spectrum signal is used as the normalization coefficient information. Further, when the length of the content frame is a fixed length, an empty area may be provided after the spectrum coefficient information.
[0132]
FIG. 9 is a block diagram showing a more detailed configuration of the trial listening data generation unit 14 of FIG.
[0133]
The audition data generation processing control unit 61 is based on setting data such as the audition area of the audition data and the audition band of the audition area, which are input from an external operation input unit (not shown). The restriction processing unit 63 is controlled. The trial listening area is a portion of the trial listening data that can be reproduced with low quality without using additional data. For example, a rust portion is set as the trial listening area. A plurality of audition areas may be set in the audition data. The trial listening band in the trial listening area is a frequency band in which low quality reproduction is performed. For example, by designating only a part of the quantization units in the spectrum data described with reference to FIG. 5, only a certain range of frequency bands can be reproduced, and the quality of reproduced audio is lowered. ing.
[0134]
The audition area determination unit 62 determines whether or not the input frame is within the audition area for each frame described with reference to FIG. 8 according to the control of the audition data generation processing control unit 61. The result is output to the band limitation processing unit 63 and the control information insertion unit 64. As shown in FIG. 10, for example, as shown in FIG. 10, the trial area determination unit 62 converts the input frame sequence into a protected area, that is, an area where the audition is not permitted, and a trial area. Distinguish.
[0135]
Based on the signal input from the audition area determination unit 62, the band limitation processing unit 63 limits the audition band based on whether or not the input frame is a frame included in the audition area. Generate data. Information about the trial band in the trial area is input from the trial data generation processing control unit 61.
[0136]
For example, when the quantization unit [1] to the quantization unit [12] are designated as the trial listening band of the trial listening possible part, the band limitation processing unit 63, when the input frame is a frame in the trial listening area, As shown in FIG. 11, the value of the normalization coefficient information of the quantization unit [13] to the quantization unit [16] higher than the trial band is, for example, minimized to be the normalization coefficient 0. Accordingly, although effective values are described in the spectral coefficient information corresponding to the quantization units [13] to [16], the normalized coefficient information is 0 at the time of reproduction. Strictly speaking, the spectrum of the portion does not become zero, but from the viewpoint of audibility, it is substantially equal to zero. Accordingly, the spectral coefficient information on the higher frequency side than the position indicated by Ad in the figure is synonymous with not being referred to.
[0137]
Similar to the non-tone component, the normalization coefficient value of the tone component that is outside the listening band is also minimized, for example, to 0, thereby minimizing the spectrum signal of the corresponding tone component during playback. (Substantially change to a value equivalent to 0).
[0138]
FIG. 12 shows a spectrum signal when the sample data shown in FIG. 11 is reproduced. Since the normalization coefficient information of the quantization unit [13] to the quantization unit [16] is changed to 0, the corresponding spectrum signal is minimized. Similarly, the corresponding spectrum signal is minimized for the two tone components included in the quantization unit [13] to the quantization unit [16]. That is, when the trial listening data is decoded and reproduced, only the narrow-band spectrum signal is reproduced in the portion where the trial listening is possible.
[0139]
In this way, when the trial listening area of the trial listening data is reproduced, only narrow-band data is reproduced, so that data with lower quality than the original data described with reference to FIG. 8 is reproduced.
[0140]
Furthermore, the band limitation processing unit 63 sets the values of all the normalization coefficient information of the quantization units [1] to [16] of the frame corresponding to the part where the trial listening is impossible to 0. Thereby, the data of the frame (protection area) that cannot be listened to is silent even if it is reproduced.
[0141]
The band limitation processing unit 63 uses the replacement data so that the decoded data length when the sample data is decoded does not become longer than the decoded data length when the original data is decoded. Generate.
[0142]
The control information inserting unit 64 prevents only unsound reproduction such as a case where the trial listening part suddenly flows after the silent state continues when the trial listening data is reproduced. Control information indicating whether or not playback is possible is inserted into all frames so that the frames are continuously played back. The control information insertion unit 64 controls the control information into a region of spectrum coefficient information that is not referred to, that is, a region that is ignored when the sample data is reproduced because the reproduction quality is limited in accordance with the information input from the sample region determination unit 62. Is output to the trial data generator 66. Then, the control information insertion unit 64 adds the true spectrum coefficient information corresponding to the portion where the control information is inserted, and information indicating the position where the control information is inserted, if necessary, to the additional frame generation unit 65. Output to.
[0143]
FIG. 13 shows a frame format when control information is inserted.
[0144]
As described with reference to FIG. 11, when the value of the normalization coefficient information of the quantization unit [13] to the quantization unit [16] higher than the trial band is changed to 0, the quantization unit [ 13] to the quantization unit [16], the spectral coefficient information on the higher frequency side than the position indicated by Ad in the figure is synonymous with that it is not referred to, so any control information is described within this range. Is possible.
[0145]
Furthermore, the control information insertion unit 64 may describe random dummy data or the like at a position where control information is not described within the range of spectral coefficient information on the higher frequency side than the position indicated by Ad in the figure. . In this case, the control information insertion unit 64 sends the true spectrum coefficient information corresponding to the portion where the dummy data is described, and information indicating the position where the dummy data is inserted, if necessary, to the additional frame generation unit 65. Output to.
[0146]
Here, it has been described that the control information is inserted at a position corresponding to the spectral coefficient information that is not referred to when the sample data is reproduced, but if the part is not referred to when the sample data is reproduced, the position at which the control information is inserted is It may be other than the position corresponding to the spectral coefficient information.
[0147]
An example of the format of the control information is shown in FIG.
[0148]
In the control information, a playback device that receives input of audition data and reproduces it determines whether the corresponding frame is a frame for which audition is permitted or a frame for which audition is not permitted. As information, a trial listening permission flag, a trial listening start flag, and a trial listening end flag are described.
[0149]
The trial listening permission flag is set by a person who has the copyright of the content, and is for distinguishing between a frame that prohibits the trial listening and a frame that is permitted. In the playback device that plays the audition data, the audition permission flag is referred to, and if the audition is permitted, the normal playback process is performed, and if the audition is not permitted, the next audition frame is not processed. The process for is started. As a result, the playback process is skipped for frames for which trial listening is prohibited, so that silent playback can be prevented even when there are frames that cannot be auditioned before and after the trial listening area.
[0150]
The audition start flag and the audition end flag are information set in a predetermined time after the start position of the audition area and a predetermined time before the end position for a group of the audition area. Is set to 3 seconds, the audition start flag is set so that it can be discriminated in the trial frame that is played for 3 seconds from the start of the trial area, and the trial frame is played within 3 seconds before the end of the trial area. The audition end flag is set so that it can be discriminated.
[0151]
Thereby, for example, when the trial listening data is played back by the playback device, playback processing such as fade-in and fade-out can be performed before and after the trial listening area. In other words, when there are a plurality of audition areas, in the original original data, a plurality of audition areas that should be reproduced at different locations are continuously reproduced at the same volume, whereas a plurality of audition areas are reproduced. At each start and end, playback processing such as fade-in and fade-out can be performed before and after the trial listening area, so that the trial listening data can be more uncomfortable for the user and can increase purchase motivation. it can.
[0152]
In particular, when the spectral coefficient information is variable-length encoded and the variable-length code is sequentially described from the low-frequency side to the high-frequency side in the spectral coefficient information description area, Since the control information is described in the area, a part of the middle-range variable length code is lost, and therefore the high-frequency data including that part cannot be decoded at all. That is, since it becomes difficult to restore the spectral coefficient information related to the original data included in the trial listening data without using additional data, the safety of the trial listening data is enhanced.
[0153]
As described above, when a part of data such as a normalization coefficient is replaced with 0 or control information, the estimation of true data for the replaced data is performed by using an encryption key having a relatively short key length. Compared to deciphering, it is very difficult. In addition, if the trial listening data is illegally altered, sound quality is deteriorated. Therefore, it is very difficult for a user who is not permitted to audition the original data to guess the original data based on the audition data, and the rights of the authors and distributors of the content data can be further protected. It becomes possible.
[0154]
Also, in the unlikely event that the true data for the replaced data is inferred in some sample data, unlike the case where the encryption algorithm is decrypted, the damage will not spread to other content. The content data encrypted by using a specific algorithm is safer than distributing it as trial listening data.
[0155]
In the above, the true normalization coefficient information and the true spectrum coefficient information replaced by the band limitation processing unit 63 and the control information insertion unit 64 are supplied to the additional frame generation unit 65 described later and described in the additional frame. Since the band limitation processing unit 63 may change values such as quantization accuracy information and the number of quantization units to 0 or random dummy data in addition to the normalization coefficient information, for example, which will be described later. The additional frame generation unit 65 receives, for example, information indicating values such as quantization accuracy information and the number of quantization units changed by the band limitation processing unit 63, and describes the information in the additional frame.
[0156]
However, when the normalization coefficient information is changed and when the quantization accuracy information is changed, it is difficult to illegally infer the original data from the trial data without using additional data, that is, the safety of the trial data. Strength will be different. For example, when a bit allocation algorithm that calculates quantization accuracy information based on normalization coefficient information is employed when generating original data, even if only the quantization accuracy information is changed, the normalization coefficient There is a risk that true quantization accuracy information can be estimated using information.
[0157]
On the other hand, even if only the normalization coefficient information is changed, it is difficult to estimate the normalization coefficient information from the quantization accuracy information, and thus it can be said that the safety strength of the trial listening data is high. It should be noted that by changing both values of the normalization coefficient information and the quantization accuracy information, the risk of illegally guessing the original data is further reduced. Further, the value of the normalization coefficient information or the quantization accuracy information may be selectively changed according to the content frame of the trial listening data.
[0158]
The additional frame generation unit 65 is an original to be purchased when the user who listened to the audition data listens to the music with high sound quality from the information on the normalized normalization coefficient or the like whose band is limited by the band limitation processing unit 63 An additional frame constituting additional data for data restoration is generated.
[0159]
FIG. 15 shows a format of the generated additional frame. As described with reference to FIG. 13, when the quantization unit [1] to the quantization unit [12] are selected as the trial listening band, the quantization unit [13] to the quantization in each frame of the trial listening data. The normalization coefficients of the tone component and the non-tone component corresponding to the four quantization units of the unit [16] are replaced with 0. In addition, a part of the spectrum coefficient information that is not referred to is replaced with control information. The additional frame generation unit 65 receives supply of information indicating the original values (true normalization coefficient information and true spectrum coefficient information) of the changed normalization coefficient information and spectrum coefficient information from the band limitation processing unit 63. The additional frame in FIG. 15 is generated.
[0160]
In the additional frame, additional information corresponding to the tone component and additional information corresponding to the non-tone component are described. FIG. 15 shows a case where quantization unit [1] to quantization unit [12] are selected as the trial band. Additional information corresponding to tone components includes the number of tone components minimized (here, two components are minimized), the position of the normalization coefficient information of each tone component on the audition frame, and the true normality. The conversion factor information is described. Further, as additional information corresponding to the non-tone component, the number of normalized coefficients of the non-tone component minimized (here, four components are minimized) and information indicating the head position of the normalized coefficient ( For example, the number 13 of the first quantization unit whose normalization coefficient is changed to 0 may be used), the rewritten true normalization coefficient, and the true spectral coefficient information of the portion rewritten in the control information Has been.
[0161]
Here, the position information of the true spectral coefficient information of the portion rewritten with the control information is not described in the generated additional frame, but the portion of the trial listening frame where the spectral coefficient information is rewritten with the control information is not referred to. If the head of the part to be the spectral coefficient information is obtained, the position of the true spectral coefficient information of the part rewritten by the control information is obtained based on the information of the region of the quantization unit in which the normalization coefficient is replaced with 0. Is possible. When the position to be rewritten to the control information is set to a position different from the head of the part that becomes the non-referenced spectral coefficient information, it is necessary to describe the position information of the true spectral coefficient information of the part rewritten to the control information in the additional frame. is there.
[0162]
In addition, when the band limitation processing unit 63 changes values such as quantization accuracy information and the number of quantization units other than the normalization coefficient information and the spectrum coefficient information, the additional frame generation unit 65 In response to the input of information indicating values such as quantization accuracy information and the number of quantization units changed by 63, information on their true values is written in the additional frame.
[0163]
Further, part or all of the normalization coefficient information described in the additional frame may be described in the control information. An example of the format of the control information in that case is shown in FIG.
[0164]
The control information includes all or part of the reproduction control information to be referred to when reproducing the sample data and the normalization coefficient information replaced to limit the band. In FIG. 16, all the information regarding the non-tone component is described in the normalization coefficient information described in the additional frame, but the information described here is, for example, a part of the non-tone component. Even normalization coefficient information, all replaced normalization coefficient information including tone components may be used, or when information other than normalization coefficient information is replaced when generating audition data Such information may be described.
[0165]
Further, in the reproduction control information, a reproducible flag is described in addition to the reproduction permission flag, the trial listening start flag, and the trial listening end flag described with reference to FIG.
[0166]
The reproducible flag indicates whether or not reproduction using normalization coefficient information described in the control information is possible. The playback device that has received the input of the audition data refers to the reproducible flag of the control information in the audition frame, and when the normalization coefficient information is available, the normalization coefficient information in the control information is converted into the original information in the audition frame. It is possible to restore the position and perform the reproduction process. Also, the playback device that has received the input of the audition data plays back without using the normalization coefficient information in the control information when the playable flag indicates that the normalization coefficient information cannot be used. Process.
[0167]
In the reproducible flag, it is also possible to specify the version and type of the playback device, for example, in a playback device that satisfies the specified version, using the normalization coefficient information included in the control information, A part of the audition data can be restored and the playback process can be performed. On the other hand, in the playback device of a version other than the specified, the normalization coefficient information included in the control information must be used in the playback process. I can't.
[0168]
Also, by describing a part of the information to be added to the additional data in the control information, the data capacity of the additional data can be reduced, so that the user records the additional data on the recording medium using, for example, MMK. When trying to do so, the processing time can be shortened, or the communication time can be shortened when additional data is to be obtained through download processing.
[0169]
When the control information of FIG. 16 is inserted into the trial frame, the band limitation processing unit 63 uses the control information instead of the additional frame generation unit 65 as a whole or a part of the normalized coefficient information replaced to limit the band. The data is output to the insertion unit 64. The control information insertion unit 64 generates the control information of FIG. 16 using the true normalization coefficient information input from the band limitation processing unit 63, and inserts it into the trial listening frame.
[0170]
In addition, as described with reference to FIG. 16, a part of the information described in the additional frame, for example, when the true normalization coefficient information of the non-tone part is described in the control information, the corresponding part Is not described in the additional frame.
[0171]
The sample data generation unit 66 generates a sample data header, adds the generated header to the encoded frame sequence of the input sample data, and generates and outputs sample data. The header of the trial listening data includes, for example, information such as a content ID for identifying the content, a content reproduction time, a content title, or encoding method information.
[0172]
The additional data generation unit 67 generates a header of the additional data, adds the generated header of the additional data to the encoded frame sequence of the input additional data, and generates and outputs the additional data. In the header of the additional data, a content ID for identifying the content and making it correspond to the trial listening data, the playback time of the content, and information on the encoding method as necessary are described.
[0173]
In this way, since the control information is described so that the trial listening area of the trial listening data is continuously played back, it is possible to prevent the protected area from being played back silently, and to listen to the trial listening data. It is possible to prevent the user who has done this from feeling uncomfortable.
[0174]
In addition, by describing various information related to the playback of sample data in the control information, for example, processing such as fade-in and fade-out can be performed at the beginning and end of the sample area, so that it is different from the original part. Even when multiple preview areas that should be played are played back continuously, the user who auditioned the audition data can understand the continuous and non-continuous parts of the original audition area, The reproduction quality of the trial listening data can be changed depending on the version of the reproduction apparatus.
[0175]
Furthermore, by including the true data that was replaced when the sample frame was generated in the control information, the data volume of the additional data can be reduced, or the playback quality of the sample data depending on the type and version of the playback device. Can be changed.
[0176]
Using the trial listening data and the additional data generated by the trial listening data generation unit 14, the original data can be restored by the processing described later.
[0177]
Next, the trial listening data generation process will be described with reference to the flowcharts of FIGS. 17 and 18.
[0178]
In step S1, the audition data generation processing control unit 61 acquires a setting value of the audition band of the audition area of the audition data input from an operation input unit (not shown) or the like. Here, as described with reference to FIG. 11 and FIG. 12, the description will be made assuming that quantization unit [1] to quantization unit [12] are set as trial listening bands. The audition data generation processing control unit 61 supplies the setting value of the audition band to the band limitation processing unit 63.
[0179]
In step S2, the audition data generation processing control unit 61 acquires information specifying an audition area of the audition data input from an operation input unit (not shown) or the like. Here, description will be made assuming that the trial listening area 1 composed of the fourth and fifth frames and the trial listening area 2 composed of the ninth and tenth frames described with reference to FIG. 10 are designated as the trial listening areas. The audition data generation processing control unit 61 supplies information specifying the audition area to the audition area determination unit 62.
[0180]
In step S3, the trial listening area determination unit 62 and the band limitation processing unit 63 input any of the frames included in the frame sequence corresponding to the original data, that is, the frame capable of high-quality reproduction described with reference to FIG. receive.
[0181]
In step S <b> 4, the trial area determination unit 62 determines whether the input frame is a trial frame (a frame included in the trial area) based on the information supplied from the trial data generation processing control unit 61. .
[0182]
If it is determined in step S4 that the input frame is a trial listening frame, the trial listening area determination unit 62 supplies the determination result to the band limitation processing unit 63 and the control information insertion unit 64 in step S5. The restriction processing unit 63 includes, for example, a portion other than the band specified by the set value of the trial band supplied from the trial data generation processing control unit 61 in the normalization coefficient information of the tone component of the input frame, for example, Change to a value such as 0.
[0183]
In step S <b> 6, the band limitation processing unit 63 includes, for example, a portion other than the band specified by the set value of the trial band supplied from the trial data generation processing control unit 61 in the normalization coefficient information of the non-tone component. 11, the trial frame described with reference to FIG. 11 is generated and output to the control information insertion unit 64, and the true normalization coefficient information replaced in steps S 5 and S 6 and its position Are output to the additional frame generation unit 65, or a part of the information is output to the control information insertion unit 64 and the rest is output to the additional frame generation unit 65.
[0184]
In step S7, the control information insertion unit 64 describes the control information indicating whether or not the trial listening is possible in part of the non-referenced spectral coefficient information corresponding to the normalized coefficient information of the non-tone component rewritten in the process of step S6. The sample listening frame described with reference to FIG. 13 is generated. Here, even if the control information described is only for the reproduction control information as described with reference to FIG. 14, a part of the information described in the additional data as described with reference to FIG. It does not matter if it contains any. The control information insertion unit 64 outputs the generated trial listening frame to the trial listening data generation unit 66, and outputs the true spectral coefficient information replaced with the control information in step S7 to the additional frame generation unit 65.
[0185]
Here, when the spectrum coefficient information is variable-length encoded, control information is used such that the bit length when the control information is decoded is shorter than the bit length when the true spectrum coefficient information is decoded. As a result, it is possible to prevent the frame length of the code string from being exceeded in the decoding process described later.
[0186]
Further, the control information insertion unit 64 may replace dummy data in addition to the control information with part of the spectrum coefficient information that is not referred to. The dummy data to be replaced with the spectrum coefficient information may be all set to the value 0, or the value 1 and the value 0 may be appropriately mixed.
[0187]
In step S8, the additional frame generation unit 65 is added when the user who auditioned the audition data listens to the music with high sound quality based on the signals input from the band limitation processing unit 63 and the control information insertion unit 64. Data for an additional frame constituting the data is generated.
[0188]
In step S4, when it is determined that the input frame is not a trial listening frame, that is, a protection frame (a frame included in the protection area), in step S9, the trial listening area determination unit 62 limits the determination result to a band limit. The data is supplied to the processing unit 63 and the control information insertion unit 64. The band limitation processing unit 63 changes all the tone component normalization coefficient information to the minimum value (value 0).
[0189]
In step S10, the band limitation processing unit 63 changes all the normalization coefficient information of the non-tone components to the minimum value (value 0) and outputs it to the control information insertion unit 64, and replaces it in step S9 and step S10. All of the obtained normal normalization coefficient information and various kinds of information indicating the position thereof are output to the additional frame generation unit 65 or a part thereof is output to the control information insertion unit 64 and the other is generated as additional frames To the unit 65.
[0190]
In step S11, the control information insertion unit 64 describes the control information indicating that the trial listening is impossible in a part of the non-referenced spectral coefficient information corresponding to the normalized coefficient information of the non-tone component rewritten in the process of step S10. . The control information insertion unit 64 outputs the generated trial listening frame to the trial listening data generation unit 66, and outputs the true spectral coefficient information replaced with the control information in step S11 to the additional frame generation unit 65.
[0191]
Similarly, when the spectral coefficient information is variable-length encoded, the bit length when the control information is decoded is shorter than the bit length when the true spectral coefficient information is decoded. Replacement is performed using control information. Furthermore, replacement may be performed using dummy data in addition to the control information.
[0192]
In step S12, the additional frame generation unit 65 is added when the user who auditioned the audition data listens to the music with high sound quality based on the signals input from the band limitation processing unit 63 and the control information insertion unit 64. Data for an additional frame constituting the data is generated.
[0193]
After the process of step S8 ends, or after the process of step S12 ends, in step S13, the trial listening data generation unit 64 determines whether or not the processed frame is the final frame. If it is determined in step S13 that the processed frame is not the final frame, the process returns to step S3, and the subsequent processing is repeated.
[0194]
In step S13, when it is determined that the processed frame is the final frame, in step S14, the trial data generation unit 66 generates a header of the trial data and adds the sample data to the trial frame sequence. Generate and output.
[0195]
In step S15, the additional data generation unit 67 generates a header of additional data using the input information, adds the header to the additional frame sequence, generates and outputs the additional data, and the process ends.
[0196]
The process described with reference to the flowcharts of FIGS. 17 and 18 generates audition data in which only the audition area is reproduced with low quality, and additional data for restoring the original data from the audition data.
[0197]
In order to stimulate the user's willingness to purchase, the data in the trial listening area may have the same high sound quality as the original data without band limitation. In that case, without limiting the bandwidth of the data in the trial listening area, the frame in the trial listening area of the original data may be copied to the trial listening frame as it is, and an additional frame for that portion may not be generated.
[0198]
The signal component encoding unit 12 of the encoding apparatus 1 has been described as separating the tone component and the non-tone component and performing the encoding separately when encoding the input signal. By using the non-tone component encoding unit 33 in FIG. 7 instead of the encoding unit 12, the input signal may be encoded without separating the tone component and the non-tone component.
[0199]
FIG. 19 shows the format of an original data frame of high sound quality generated by the code string generator 13 when the input signal is not separated into tone components and non-tone components. As in the case described with reference to FIG. 8, a fixed-length header including a synchronization signal is arranged at the beginning of the original data frame. In the header, the number of band divisions of the band division filter 21 of the conversion unit 11 described with reference to FIG. Following the header, the number of quantization units (here 16), quantization accuracy information of quantization performed by the non-tone component encoding unit 33, normalization coefficient information of each of the 16 quantization units, and spectral coefficient information Is recorded. The normalization coefficient information is recorded for each quantization unit from a value of 46 in the lowest quantization unit [1] to a value of 8 in the highest quantization unit [16]. Further, when the length of the content frame is a fixed length, an empty area may be provided after the spectrum coefficient information.
[0200]
FIG. 20 shows the format of the audio data of the trial part generated by the trial data generation unit 14 that has received the input of the original data frame described with reference to FIG. For example, when the quantization unit [1] to the quantization unit [12] are designated as the trial listening band of the trial listening possible portion, the quantization on the higher frequency side than the trial listening band is performed as in the case described with reference to FIG. The value of the normalization coefficient information of units [13] to [16] is set to 0. Accordingly, although effective values are described in the spectral coefficient information corresponding to the quantization units [13] to [16], the normalized coefficient information is 0 at the time of reproduction. The portion of the spectrum is minimized. And the control information demonstrated using FIG. 14 or FIG. 16 is described in a part of spectrum coefficient which is not referred.
[0201]
FIG. 21 shows an additional frame generated by the additional frame generation unit 65 of the trial listening data generation unit 14 that has received the input of the original data frame described with reference to FIG. Here, an additional frame when only the reproduction control information described with reference to FIG. 14 is described as control information will be described. The additional frame includes the number of normalized coefficients of the minimized quantization frame (here, the four components are minimized), the position of the head of the normalized coefficient, the rewritten true normalization coefficient, and the control The true spectral coefficient information of the rewritten portion is written in the information.
[0202]
As described with reference to FIGS. 19 to 21, even when tone components are not separated, the same processing is performed, and the audition data in which only the audition area is reproduced and only the audition area is reproduced, and the original from the audition data is reproduced. Additional data for restoring the data is generated.
[0203]
The audition data generated in this way is distributed to the user via the Internet or the like, or recorded and distributed on various recording media owned by the user by the MMK provided in the store or the like. The user who likes the playback of the trial listening data can obtain additional data, for example, by paying a predetermined fee to a content data distribution company. The user can restore the original data by using the trial listening data and the additional data, decode and reproduce it, or record it on the recording medium.
[0204]
Next, a description will be given of a process in the case where the sample data is decoded and output or reproduced, or the original data is decoded and output or reproduced from the sample data and the additional frame.
[0205]
FIG. 22 is a block diagram showing the configuration of the data reproducing device 81. As shown in FIG.
[0206]
The code string decomposing unit 91 receives input of the encoded sample data, decomposes the code string, extracts the code of each signal component, and outputs the code to the code string restoring unit 93.
[0207]
The control unit 92 receives a user operation from an operation input unit (not shown), receives information indicating whether or not the input data is to be reproduced with high sound quality, receives input of additional data, and receives a code string restoration unit. 93 processing is controlled. Further, the control unit 92 supplies true coding coefficient information, true spectrum coefficient information, or the like to the code restoration unit 93 as necessary.
[0208]
Based on the control of the control unit 92, the code string restoration unit 93 outputs the input encoded frame as it is to the signal component decoding unit 94 when the input trial data is auditioned, that is, reproduced as it is. When the input audition data is restored to the original data and played back, the audition data of the audition data is based on various information such as true coding coefficient information or true spectrum coefficient information supplied from the control unit 92. The process of restoring the encoded frame to the encoded frame of the original data is executed, and the restored encoded frame of the original data is output to the signal component decoding unit 94.
[0209]
In addition, when the sample data is reproduced, the code string restoration unit 93 may restore a part or the whole of the sample data by referring to the control information. In other words, the trial data includes true coding coefficient information and the like, which is indicated by the reproducible flag, and is permitted to restore the trial data using them (for example, specification of the version of the data playback device 81, etc.) ) Is satisfied, the code string restoration unit 93 can restore a part or the whole of the trial listening data by using the true coding coefficient information included in the trial listening data.
[0210]
The signal component decoding unit 94 decodes the input sample data or the encoded frame of the original data. FIG. 23 is a block diagram showing a more detailed configuration of the signal component decoding unit 94 that decodes an encoded frame when the input encoded frame is encoded by being divided into a tone component and a non-tone component. is there.
[0211]
For example, the frame separation unit 101 receives an encoded frame as described with reference to FIG. 13 and divides it into a tone component and a non-tone component. The tone component is sent to the tone component decoding unit 102 as a non-tone component. Is output to the non-tone component decoding unit 103.
[0212]
FIG. 24 is a block diagram showing a more detailed configuration of the tone component decoding unit 102. The inverse quantization unit 111 inversely quantizes the input encoded data and outputs it to the inverse normalization unit 112. The denormalization unit 112 denormalizes the input data. That is, decoding processing is performed by the inverse quantization unit 111 and the inverse normalization unit 112, and the spectrum signal of the tone portion is output.
[0213]
FIG. 25 is a block diagram showing a more detailed configuration of the non-tone component decoding unit 103. The inverse quantization unit 121 inversely quantizes the input encoded data and outputs it to the inverse normalization unit 122. The denormalization unit 122 denormalizes the input data. That is, decoding processing is performed by the inverse quantization unit 121 and the inverse normalization unit 122, and the spectrum signal of the non-tone portion is output.
[0214]
The spectrum signal synthesis unit 104 receives the spectrum signals output from the tone component decoding unit 102 and the non-tone component decoding unit 103, synthesizes these signals, and if the original data, FIG. 5 or the audition data is used. If there is, the spectrum signal described with reference to FIG. 12 is generated and output to the inverse conversion unit 95.
[0215]
When the encoded data is not encoded by being divided into a tone component and a non-tone component, the frame separation unit 101 is omitted and the tone component decoding unit 102 or the non-tone component decoding unit 103 Decoding processing may be performed using only one of them.
[0216]
FIG. 26 is a block diagram showing a more detailed configuration of the inverse transform unit 95.
[0217]
The signal separation unit 131 separates the signal based on the number of band divisions described in the header of the input frame. Here, it is assumed that the number of band divisions is 2, and the signal separation unit 131 separates the input spectrum signal into the inverse spectrum conversion units 132-1 and 132-2.
[0218]
The inverse spectrum conversion units 132-1 and 132-2 perform inverse spectrum conversion on the input spectrum signal, and output the obtained signals of each band to the band synthesis filter 133. The band synthesis filter 133 synthesizes and outputs the input signals of the respective bands.
[0219]
A signal (for example, an audio PCM signal) output from the band synthesis filter 133 is converted into analog data by, for example, a D / A conversion unit (not shown), and reproduced and output as sound from a speaker (not shown). Further, the signal output from the band synthesis filter 133 may be output to another device via a network or the like.
[0220]
Next, the data reproduction process executed by the data reproduction device 81 in FIG. 22 will be described with reference to the flowchart in FIG.
[0221]
In step S31, the code string decomposing unit 91 receives input of the encoded frame of the trial listening data. In step S32, the code string decomposing unit 91 decomposes the input code string and outputs it to the code string restoring unit 93.
[0222]
In step S33, the code string restoration unit 93 determines, based on the signal input from the control unit 92, whether high-quality reproduction, that is, processing for restoring and reproducing the original data is executed.
[0223]
If it is determined in step S33 that high sound quality reproduction is to be executed, a code string restoration process, which will be described later, is executed in step S34 using the flowchart of FIG.
[0224]
If it is determined in step S33 that the high-quality sound reproduction is not executed, the input frame is reproduced as trial listening data. Therefore, in step S35, the code string restoration unit 93 controls the control information included in the input frame. Referring to FIG. 4, it is determined whether or not this frame is a frame that can be auditioned.
[0225]
If it is determined in step S35 that this frame is not a frame that can be auditioned, in step S36, the code string restoration unit 93 skips processing for the input frame. That is, the code string restoration unit 93 discards the input frame without outputting it to the signal component decoding unit 94, and the process proceeds to step S39.
[0226]
After the process of step S34 is completed or when it is determined in step S35 that this frame is a frame that can be auditioned, in step S37, the signal component decoding unit 94 converts the input code string into a tone component. And the non-tone component, respectively, are decoded by performing inverse quantization and inverse normalization, and the spectrum signal generated by the decoding is synthesized and output to the inverse transform unit 95.
[0227]
In step S38, the inverse conversion unit 95 performs band separation on the input spectrum signal as necessary, performs inverse spectrum conversion on each, and then performs band synthesis to inversely convert the signal into a time series signal.
[0228]
After the process of step S36 or step S38 is completed, in step S39, the control unit 92 determines whether or not the reverse frame converted by the reverse conversion unit 95 in step S38 is the final frame of the audition data. To do.
[0229]
If it is determined in step S39 that the frame is not the last frame, the process returns to step S31, and the subsequent processes are repeated. If it is determined in step S39 that the frame is the last frame, the process ends.
[0230]
The time series signal generated by the inverse conversion by the inverse conversion unit 95 may be converted to analog data by a D / A conversion unit (not shown) and reproduced and output from a speaker (not shown), or a network (not shown). It may be output to other devices via the.
[0231]
Further, when the audition data is reproduced, for example, processing such as fade-in and fade-out can be performed by referring to the audition start flag and the audition end flag of the control information as necessary.
[0232]
Note that, here, a case has been described in which the sample data that is encoded by dividing the tone component and the non-tone component, or the original data restored from the sample data, is decoded. Even when the tone component is not divided, restoration processing and reproduction processing can be performed in the same manner.
[0233]
Next, the code sequence restoration process executed in step S34 in FIG. 27 will be described with reference to the flowchart in FIG.
[0234]
In step S <b> 51, the code string restoration unit 93 acquires additional data information such as sample area information, true coding coefficient information, and true spectrum coefficient information from the control unit 92 in order to restore the code string. .
[0235]
In step S52, the code string restoration unit 93 receives an input of the frame decomposed by the code string decomposition unit 91. In step S53, the code string restoration unit 93 determines whether the input frame is a trial frame.
[0236]
When it is determined in step S53 that the input frame is a trial listening frame, in step S54, the code string restoration unit 93 performs normalization coefficient information of the tone component included in the additional frame described with reference to FIG. Is used to restore the normalization coefficient information of the tone component of the portion replaced with 0 in the trial listening frame of the trial listening data.
[0237]
In step S55, the code string restoration unit 93 uses the true value of the normalization coefficient information of the non-tone component included in the additional frame described with reference to FIG. The normalization coefficient information of the non-tone component of the part is restored.
[0238]
In step S56, the code string restoration unit 93 is replaced with the control information in the trial frame of the trial data using the true value of the spectral coefficient information of the non-tone component included in the additional frame described with reference to FIG. The spectral coefficient information of the partial non-tone components is restored, and the process returns to step S37 in FIG.
[0239]
If it is determined in step S53 that the input frame is not a trial listening frame, that is, a protected frame, in step S57, the code string restoration unit 93 determines the tone component included in the additional frame corresponding to the protected frame. Using the true value of the normalization coefficient information, the normalization coefficient information of the entire tone component replaced with 0 in the trial listening frame of the trial listening data is restored.
[0240]
In step S58, the code string restoration unit 93 uses the true value of the normalization coefficient information of the quantized frame of the non-tone component included in the additional frame corresponding to the protection frame, and replaces it with 0 in the trial data frame of the trial data. The normalization coefficient information of the entire non-tone component being restored is restored.
[0241]
In step S59, the code string restoration unit 93 uses the true value of the spectral coefficient information of the non-tone component included in the additional frame corresponding to the protection frame, and uses the true value of the spectral coefficient information of the non-tone component of the sample data. The spectral coefficient information of the non-tone component is restored, and the process returns to step S37 in FIG.
[0242]
By the process described with reference to the flowchart of FIG. 28, the original data is restored using the trial listening data and the additional data.
[0243]
In FIG. 28, the original data is restored using the additional data described with reference to FIG. 15, that is, the true normalization coefficient information is described as being all described in the additional data. In step S54 and step S55, or in step S57 and step S58, the code string restoration unit 93 adds control information in the trial frame to the control information in the case where part or all of the spectral coefficient information is described in the control information. Using the true normalization coefficient information described, the normalization coefficient information of the replaced part is restored.
[0244]
Even if the decoded trial listening data or the restored and decoded original data is reproduced using a speaker (not shown) or the like by the processing described with reference to FIGS. 22 to 28, for example, via a network or the like. You may make it output to another apparatus.
[0245]
In FIG. 27, when the audition data in which the normalization coefficient of the quantization unit 13 to the quantization unit 16 is changed to 0 is reproduced, the data in the band corresponding to the quantization unit 1 to the quantization unit 12 is reproduced. Only described as being played. On the other hand, for example, the true normalization coefficient of the quantization unit 13 and the quantization unit 14 is included in the control information of the audition frame, and the true normalization described in the control information is indicated by the playable flag. The reproduction processing may be permitted using the coefficient information, and the true normalization coefficients of the quantization unit 15 and the quantization unit 16 may be described in the additional data. When the trial listening data is reproduced, the code string restoration unit 93 restores the true normalization coefficient information of the quantization unit 13 and the quantization unit 14 based on the control information, and the quantization units 1 to Quantum It may be possible to perform a slightly higher quality reproduction process than when only the data of the band corresponding to the data conversion unit 12 is reproduced.
[0246]
When the trial listening data is normally reproduced, the code string restoration unit 93 does not refer to the normalization coefficient information included in the control data of the trial listening frame, so that the reproduction is performed by the quantization unit 1 to the quantization unit 12. Only the corresponding band data. For example, when the data reproducing apparatus is a predetermined version, or when a predetermined password is input from an operation input unit (not shown), the control unit 92 controls the code string restoration unit 93, The true normalization coefficient of the quantization unit 13 and the quantization unit 14 included in the control data of the trial listening frame is extracted, and the normalization coefficient of the corresponding part is restored. As a result, it is possible to perform a slightly higher quality reproduction process than when only the data in the band corresponding to the quantization unit 1 to the quantization unit 12 is reproduced.
[0247]
Note that whether or not to refer to the normalization coefficient of the quantization unit 13 and the quantization unit 14 included in the control data of the trial frame is other than the version matching of the data reproduction device 81 or the input of a password. For example, it may be determined in advance by the setting of the data reproducing device 81. Further, when high-quality sound reproduction is performed, the code string restoration unit 93 can restore the normalization coefficients of all the quantization units based on the additional data supplied from the control unit 92, so that the original data Needless to say, will be played.
[0248]
Next, a description will be given of a process in which the trial listening data is recorded on the recording medium, or the original data is restored from the trial listening data and the additional frame and recorded on the recording medium.
[0249]
FIG. 29 is a block diagram showing the configuration of the data recording device 141.
[0250]
Note that portions corresponding to those in the case of the data reproducing device 81 in FIG. 22 are denoted with the same reference numerals, and description thereof will be omitted as appropriate.
[0251]
That is, the code string decomposing unit 91 receives input of the encoded trial listening data, decomposes the code string and extracts the code of each signal component, and the control unit 92 receives the user's input from the operation input unit (not shown). In response to the operation, the input data is recorded with high sound quality, that is, information indicating whether or not to execute the process of restoring and recording the original data is received, and additional data is input, and the code string The processing of the restoration unit 93 is controlled.
[0252]
Based on the control of the control unit 92, the code string restoration unit 93 outputs the inputted encoded frame as it is to the recording unit 151, and restores and records the original data. In this case, the input audition data is encoded from the original data based on various information such as the audition area information, the true encoding coefficient information, and the true spectrum coefficient information supplied from the control unit 92. A process of restoring to a frame is executed, and the encoded frame of the restored original data is output to the recording unit 151.
[0253]
The recording unit 151 records data by a predetermined method on a recording medium such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or a magnetic tape. Further, the recording unit 151 may record information therein, such as a memory provided on a substrate or a hard disk. For example, when the recording unit 151 can record data on an optical disc, the recording unit 151 includes an encoder that converts data into a format suitable for recording on the optical disc, a laser light source such as a laser diode, and various lenses. And an optical unit composed of a deflection beam splitter, a spindle motor that rotationally drives the optical disc, a drive unit that drives the optical unit to a predetermined track position of the optical disc, and a control unit that controls them.
[0254]
Note that the recording medium mounted on the recording unit 151 may be the same as the recording medium on which the audition data input to the code string decomposition unit 91 or the control unit 92 or additional data is recorded. good.
[0255]
Next, data recording processing executed by the data recording device 141 will be described with reference to the flowchart of FIG.
[0256]
In step S 81, the code string decomposing unit 91 receives an input of the encoded frame of the trial listening data. In step S 82, the code string decomposing unit 91 decomposes the input code string and outputs it to the code string restoring unit 93.
[0257]
In step S83, the code string restoration unit 93 determines whether or not high sound quality recording is performed based on the signal input from the control unit 92, and if it is determined that high sound quality recording is performed, In step S84, the code string restoration process described with reference to the flowchart of FIG. 28 is executed.
[0258]
If it is not determined in step S83 that high sound quality recording is to be performed, or after the processing in step S84 is completed, in step S85, the recording unit 151 stores the code string corresponding to the input original data or trial data. Is recorded on the attached recording medium or the like.
[0259]
In step S86, the control unit 92 determines whether or not what is recorded by the recording unit 151 in step S85 is the last frame of the code string corresponding to the original data or the trial listening data.
[0260]
If it is determined in step S86 that the frame is not the last frame, the process returns to step S81, and the subsequent processes are repeated. If it is determined in step S86 that the frame is the last frame, the process ends.
[0261]
By applying the present invention, the value of the normalization coefficient information is changed, and further, the data (for example, the corresponding spectral coefficient information) that is not referred to during the reproduction process is controlled by changing the value of the normalization coefficient information. Audition data can be generated in place of information. It is very difficult to infer the original data from such sampled data, and if the sampled data is illegally altered, it may cause the sound quality to deteriorate. It is possible to protect the rights.
[0262]
When the sample data is reproduced, even if a plurality of sample areas that can be sampled with low sound quality are set in addition to one place, a part of the sample frame constituting the sample data not related to the reproduction process (for example, , A region corresponding to spectrum coefficient information that is not referred to) is inserted, and control information indicating whether or not the corresponding sample listening frame can be sampled is referred to, so that only the sample listening frame corresponding to the sample listening region is selectively selected. To be played. Accordingly, when the trial listening data is reproduced, the trial listening area is reproduced immediately after the reproduction is started, and unnatural silence reproduction is not performed.
[0263]
Further, by describing information such as a trial listening start flag and a trial listening end flag in the control information, for example, when there are a plurality of trial listening areas, fade-in and fade-out are performed at the respective start and end positions when reproducing the trial data. Thus, the audition area that should not be continuous is not reproduced continuously at the same volume, and it is possible to provide audition data that is easy to hear for the user.
[0264]
Furthermore, a part of the true value (for example, true normalization coefficient information, true spectrum coefficient information, true quantization accuracy information, or the number of quantization units) corresponding to the replaced data is included in the control information. Or by describing information such as a reproducible flag or the like, for example, depending on the type or version of the playback device, or whether there is an input of a predetermined keyword, etc. It becomes possible to control the quality.
[0265]
Then, a true value corresponding to the data changed or replaced at the time of generating the audition data (for example, true normalization coefficient information, true spectrum coefficient information, true quantization accuracy information, or quantization unit) Since the additional data composed of the additional frames in which the number is described is created, it is possible to restore the original data from the trial listening data using the additional data.
[0266]
Furthermore, when part or all of the information described in the additional frame is included in the control information, the data capacity of the additional data can be reduced. For example, the user downloads the additional data. The communication time can be shortened, for example, when trying to obtain it.
[0267]
By applying the present invention, the trial listening data and the restored original data can be reproduced and output, or can be output to other devices via a network or the like recorded on a recording medium.
[0268]
In the above, the process of generating the audition data of the content data by the audio signal and the corresponding additional data, restoring the original data from the audition data and the additional data, reproducing, and recording has been described. It is also possible to adapt to content data including an image signal or an image signal and an audio signal.
[0269]
For example, when content data by an image signal is converted using a two-dimensional DCT and quantized using various quantization tables, a dummy quantization table is designated with a high-frequency component missing, If necessary, control information is recorded in the region of the spectral coefficient information in the high frequency part corresponding to the dummy to obtain audition data. In the additional data, the missing high frequency component of the quantization table and the replaced spectral coefficient information are described.
[0270]
At the time of restoring the original data, the true quantization table in which the high frequency component is not lost is restored using the additional data, and the true spectral coefficient information is restored. Therefore, the original data is restored and decoded. be able to.
[0271]
The series of processes described above can be executed by hardware, but can also be executed by software. In this case, for example, the encoding device 1, the data reproducing device 81, or the data recording device 141 is configured by a personal computer 161 as shown in FIG.
[0272]
In FIG. 31, the CPU 171 executes various processes in accordance with a program stored in the ROM 172 or a program loaded from the storage unit 178 to the RAM 173. The RAM 173 also appropriately stores data necessary for the CPU 171 to execute various processes.
[0273]
The CPU 171, ROM 172, and RAM 173 are connected to each other via a bus 174. An input / output interface 175 is also connected to the bus 174.
[0274]
The input / output interface 175 includes an input unit 176 including a keyboard and a mouse, an output unit 177 including a display and a speaker, a storage unit 178 including a hard disk, and a communication unit 179 including a modem and a terminal adapter. It is connected. The communication unit 179 performs communication processing via a network including the Internet.
[0275]
A drive 180 is connected to the input / output interface 175 as necessary, and a magnetic disk 191, an optical disk 192, a magneto-optical disk 193, a semiconductor memory 194, or the like is appropriately mounted, and a computer program read from them is loaded. It is installed in the storage unit 178 as necessary.
[0276]
When a series of processing is executed by software, a program constituting the software executes various functions by installing a computer incorporated in dedicated hardware or various programs. For example, a general-purpose personal computer is installed from a network or a recording medium.
[0277]
As shown in FIG. 31, this recording medium is distributed to supply a program to a user separately from the apparatus main body, and includes a magnetic disk 191 (including a floppy disk) on which a program is stored, an optical disk 192 ( CD-ROM (including Compact Disk-Read Only Memory), DVD (including Digital Versatile Disk), magneto-optical disk 193 (including MD (Mini-Disk) (trademark)), or package media including semiconductor memory 194 In addition to being configured, it is configured by a ROM 172 storing a program, a hard disk included in the storage unit 178, and the like supplied to the user in a state of being incorporated in the apparatus main body in advance.
[0278]
In the present specification, the step of describing the program stored in the recording medium is not limited to the processing performed in chronological order in the order in which it is included, but is not necessarily processed in chronological order, either in parallel or individually. The process to be executed is also included.
[0279]
As mentioned above, although embodiment of this invention was described, this invention is not limited to this structure. In other words, it is understood that the invention described in the scope of claims and equivalent inventions are included in the scope of patent rights to be given to the present invention.
[0280]
For example, in the above-described embodiment, the first data string having the encoding format illustrated in FIG. 8 is illustrated, but the present invention is not limited to such a configuration. In the present invention, various other encoding formats can be applied to the first data sequence as long as the encoding format can generate the second data sequence. Here, the second data string is data that is generated by replacing the first data in the first data string with the second data, and can be reproduced in substantially the same manner as the first data string. Is a column. In the present invention, as an encoding format of the first data string, for example, at least a part (for example, a spectrum coefficient, a pixel value, etc.) related to content data on the code string and a code necessary for decoding the code It is preferable to apply a format in which quantization parameters (for example, quantization accuracy information, normalization coefficient information, etc.) are multiplexed. In this case, for example, meta information describing copyright attributes, copyright management information, encryption information, or the like can be multiplexed on the code string.
[0281]
【The invention's effect】
As described above, according to the first aspect of the present invention, a data string can be converted.
Further, according to the first aspect of the present invention, for example, only a part of an original data string such as original data can be reproduced, and a part that cannot be reproduced is skipped without being processed during the reproduction process. For example, it is possible to generate a data string such as additional data necessary for conversion to a data string such as trial listening data including control information and to restore the original data string from the converted data string.
[0282]
According to the second aspect of the present invention, it is possible to receive and reproduce a data string.
Further, according to the second aspect of the present invention, for example, silence can be obtained by determining a reproducible part and a non-reproducible part based on the control information and continuously reproducing only the reproducible part. Or reproduction of only noise can be avoided.
[0283]
According to the third aspect of the present invention, the data string can be restored.
In addition, according to the third aspect of the present invention, for example, by acquiring a data string such as additional data for restoring the data string separately, it becomes a basis of original data or the like from the data string such as the audition data. The data column can be restored.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a configuration of an encoding apparatus to which the present invention is applied.
FIG. 2 is a block diagram illustrating a configuration of a conversion unit in FIG. 1;
FIG. 3 is a diagram illustrating a spectrum signal and a quantization unit.
4 is a block diagram showing a configuration of a signal component encoding unit in FIG. 1. FIG.
FIG. 5 is a diagram for explaining a tone component and a non-tone component.
6 is a block diagram showing a configuration of a tone component encoding unit in FIG. 4;
7 is a block diagram showing a configuration of a non-tone component encoding unit in FIG. 4. FIG.
FIG. 8 is a diagram for explaining a format of a frame of original data.
9 is a block diagram showing a configuration of a trial listening data generation unit of FIG. 1. FIG.
FIG. 10 is a diagram illustrating a trial listening area and a protection area of an input frame sequence.
FIG. 11 is a diagram for explaining a format of a trial listening frame.
12 is a diagram illustrating a spectrum signal corresponding to the trial listening frame of FIG. 11;
FIG. 13 is a diagram for explaining a trial listening frame in which a part of a spectrum coefficient is rewritten using control information.
FIG. 14 is a diagram illustrating control information.
FIG. 15 is a diagram illustrating an additional frame.
FIG. 16 is a diagram illustrating control information.
FIG. 17 is a flowchart for explaining trial listening data generation processing;
FIG. 18 is a flowchart for explaining trial listening data generation processing;
FIG. 19 is a diagram illustrating a frame of original data when tone components are not separated.
FIG. 20 is a diagram for explaining a trial listening frame when tone components are not separated.
FIG. 21 is a diagram illustrating an additional frame when tone components are not separated.
FIG. 22 is a block diagram showing a configuration of a data reproducing apparatus to which the present invention is applied.
23 is a block diagram showing a configuration of a signal component decoding unit in FIG.
24 is a block diagram showing a configuration of a tone component decoding unit in FIG. 23. FIG.
25 is a block diagram showing a configuration of a non-tone component decoding unit in FIG. 23. FIG.
26 is a block diagram showing a configuration of an inverse conversion unit in FIG.
FIG. 27 is a flowchart for describing data reproduction processing;
FIG. 28 is a flowchart illustrating a code string restoration process.
FIG. 29 is a block diagram showing a configuration of a data recording apparatus to which the present invention is applied.
FIG. 30 is a flowchart for describing data recording processing;
FIG. 31 is a block diagram illustrating a configuration of a personal computer.
[Explanation of symbols]
DESCRIPTION OF SYMBOLS 1 Encoding apparatus, 11 Conversion part, 12 Signal component encoding part, 13 Code sequence generation part, 14 Audition data generation part, 61 Audition data generation process control part, 62 Audition area determination part, 63 Band-limiting process part, 64 Control Information insertion unit, 65 additional frame generation unit, 66 audition data generation unit, 67 additional data generation unit, 92 control unit, 93 code string restoration unit

Claims

In the data conversion method of the data conversion device for converting the first data string for reproducing the predetermined information into the second data string for reproducing the predetermined information by degrading the reproduction quality of the predetermined information ,
When the first data included in the first data string and used for reproducing the predetermined information is used for reproducing the predetermined information. A first replacement step of replacing the reproduction quality of the predetermined information with second data that deteriorates compared to the case of the first data sequence ;
Wherein the first data included in the different regions, at least a portion of the third data is at least a portion of the data used to reproduce the predetermined information, the second data The second data is replaced with fourth data which is control information to be referred to when the column is reproduced , and the other part is a non-reference area which is not referred to as reproduction data when the second data string is reproduced . A replacement step of
A first generation step of generating the second data string using the data generated by the processing of the first replacement step and the processing of the second replacement step;
Data conversion method, which comprises a.

Each of the first data string and the second data string is composed of a plurality of frames,
The third data is recorded in each of the plurality of frames,
The data conversion method according to claim 1, wherein the fourth data is control information indicating whether or not a corresponding frame can be reproduced.

A second generation step of generating a third data string necessary for restoring the second data string generated by the processing of the first generation step into the first data string;
The third data string generated by the process of the second generation step includes the first data and is converted into the fourth data by the process of the second replacement step among the third data. The data conversion method according to claim 1, further comprising a replaced part .

A frequency component conversion step for converting input data into frequency components;
An encoding step of encoding the data converted into frequency components by the processing of the frequency component conversion step;
In the process of the first replacement step, the encoded data encoded by the process of the encoding step is replaced with the first data string, and the first data is replaced with the second data.
In the process of the second replacement step, the encoded data encoded by the process of the encoding step is the first data string, and at least a part of the third data is the second data. data conversion method according to claim 1, characterized in that replace.

The third data includes at least a part of spectrum coefficient information of the frequency component converted by the processing of the frequency component conversion step.
The data conversion method according to claim 4, wherein:

The first data includes at least a part of normalization coefficient information of an encoding process by the process of the encoding step.
The data conversion method according to claim 4, wherein:

The first data includes at least a part of quantization accuracy information of an encoding process by the process of the encoding step.
The data conversion method according to claim 4, wherein:

The first data is encoded by the encoding step. Includes at least some of the information representing the number of quantization units
The data conversion method according to claim 4, wherein:

The first data is data indicating a predetermined numerical value,
The second data is obtained by minimizing the numerical value of the first data.
The data conversion method according to claim 1.

The second data is obtained by replacing at least a part of the first data with random data.
The data conversion method according to claim 1.

In the data conversion device for converting the first data string for reproducing the predetermined information into the second data string for reproducing the predetermined information with degraded reproduction quality ,
When the first data included in the first data string and used to reproduce the predetermined information is used to reproduce the predetermined information. First replacement means for replacing the reproduction quality of the predetermined information with second data that deteriorates compared to the case of the first data string ;
Wherein the first data included in the different regions, at least a portion of the third data is at least a portion of the data used to reproduce the predetermined information, the second data The second data is replaced with the fourth data which is control information referred to when the column is reproduced , and the other part is a non-reference area which is not referred to as reproduction data when the second data string is reproduced . Replacement means,
Generating means for generating the second data string using data generated by the first replacing means and the second replacing means;
Data conversion apparatus comprising: a.

A computer-executable program for controlling a data conversion apparatus for converting a first data string for reproducing predetermined information into a second data string for reproducing the predetermined information with degraded reproduction quality Because
When the first data included in the first data string and used for reproducing the predetermined information is used for reproducing the predetermined information. A first replacement step of replacing the reproduction quality of the predetermined information with second data that deteriorates compared to the case of the first data sequence ;
Wherein the first data included in the different regions, at least a portion of the third data is at least a portion of the data used to reproduce the predetermined information, the second data The second data is replaced with fourth data which is control information to be referred to when the column is reproduced , and the other part is a non-reference area which is not referred to as reproduction data when the second data string is reproduced . A replacement step of
A generation step for generating the second data string using the data generated by the processing of the first replacement step and the processing of the second replacement step;
The program characterized by including .

Generated on the basis of the first data string for reproducing the predetermined information, the data reproducing method of the data reproducing apparatus for reproducing second data string for reproducing deteriorated reproduction quality of the predetermined information In
The first data string includes first data which is at least a part of data used for reproducing the predetermined information, and is in a different area from the first data. Third data that is included and is at least part of data used to reproduce the predetermined information is further included.
The second data string indicates the reproduction quality of the predetermined information when the first data in the first data string is used for reproducing the predetermined information. Control information that is replaced with second data that deteriorates more than in the case of the above, and that is referred to when at least a part of the third data in the first data string reproduces the second data string And the other portion is a non-reference area that is not referred to as reproduction data when the second data string is reproduced ,
A data reproduction method comprising: a reproduction control step for controlling reproduction of the second data string based on the fourth data included in the second data string.

The fourth data includes information specifying whether or not a corresponding frame can be reproduced when reproduction of the second data string is controlled by the process of the reproduction control step. The data reproduction method according to claim 13 .

The data reproduction method according to claim 13 , wherein the fourth data includes at least a part of the first data replaced with the second data.

For the fourth data , when reproduction of the second data string is controlled by the process of the reproduction control step, at least a part of the first data included in the control information is used. The data reproducing method according to claim 15 , further comprising information indicating whether or not the second data can be restored.

An inverse transformation step for inversely transforming the frequency component of the second data string ;
Data reproducing method according to claim 13, wherein <br/> further comprising a decoding step of decoding the inverse transformed second data string by the processing of the inverse transform step.

The third data includes at least a part of spectrum coefficient information of the frequency component converted by the processing of the frequency component conversion step.
The data reproducing method according to claim 17, wherein:

The first data includes at least a part of normalization coefficient information of an encoding process by the process of the encoding step.
The data reproducing method according to claim 17, wherein:

The first data includes at least a part of quantization accuracy information of an encoding process by the process of the encoding step.
The data reproducing method according to claim 17, wherein:

The first data includes at least a part of information indicating the number of quantization units of the encoding process by the process of the encoding step.
The data reproducing method according to claim 17, wherein:

The first data is data indicating a predetermined numerical value,
The data reproduction method according to claim 13 , wherein the second data is obtained by minimizing a numerical value of the first data.

The second data is obtained by replacing at least a part of the first data with random data.
The data reproduction method according to claim 13, wherein:

A computer for controlling a data reproducing apparatus that reproduces a second data string that is generated based on a first data string for reproducing predetermined information and that reproduces the predetermined information with degraded reproduction quality. Is an executable program,
The first data string includes first data which is at least a part of data used for reproducing the predetermined information, and is in a different area from the first data. Third data that is included and is at least part of data used to reproduce the predetermined information is further included.
The second data string indicates the reproduction quality of the predetermined information when the first data in the first data string is used for reproducing the predetermined information. Control information that is replaced with second data that deteriorates more than in the case of the above, and that is referred to when at least a part of the third data in the first data string reproduces the second data string And the other part is a non-reference area that is not referred to as reproduction data when the second data string is reproduced.
sometimes,
Wherein based on said fourth data contained in the second data string, the program characterized by comprising a reproduction control step for controlling reproduction of the second data string.

A second data string that is generated based on the first data string for reproducing the predetermined information and is reproduced with the reproduction quality of the predetermined information deteriorated is restored to the first data string. In the data restoration method of the data restoration device to
The first data string includes first data that is at least a part of data used for reproducing the predetermined information;
The second data string indicates the reproduction quality of the predetermined information when the first data in the first data string is used for reproducing the predetermined information. Is replaced with second data that deteriorates more than in the case of
An acquisition control step for controlling acquisition of a third data string including information necessary to restore the second data string to the first data string;
A restoration step of restoring the first data string from the second data string based on the third data string whose acquisition is controlled by the processing of the acquisition control step;
Said third data string comprises the first data for restoring the first data string, wherein in the process of the restoration step, the acquisition by the processing of the control step acquisition controlled the third It said first data contained in the data string, by replacing the second data contained in the second data stream, restores the first data row,
A part of said 2nd data is control information referred when reproducing said 2nd data sequence. The data restoration method characterized by the above-mentioned.