JP2004094090A

JP2004094090A - System and method for compressing and expanding audio signal

Info

Publication number: JP2004094090A
Application number: JP2002257649A
Authority: JP
Inventors: Kiyohisa Azuma; 東　清久
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2002-09-03
Filing date: 2002-09-03
Publication date: 2004-03-25

Abstract

<P>PROBLEM TO BE SOLVED: To reduce a deterioration in sound quality of audio data, which occurs when the audio data once subjected to compression processing are once expanded to apply the compression processing again. <P>SOLUTION: The multiplexed audio data formed by compressing the audio data by a first compression system and by multiplexing the data together with parameters necessary for expansion processing are separated by reverse multiplexing into the quantized audio data and the parameters necessary for the expansion processing. The multiplexed data complying with the format of a second compression system are formed by respectively converting the encoded audio data and the parameter data to the data complying with the formation of the second compression system and multiplexing the data as it is. <P>COPYRIGHT: (C)2004,JPO

Description

【０００１】
【発明の属する技術分野】
本発明は所定の方式で圧縮されたオーディオデータと伸長に必要なパラメータとを多重化する方法および装置に関し、特に一度圧縮したオ−ディオデータを再度異なるもしくは同じ圧縮方式で圧縮し直す方法および装置に関する。
【０００２】
【従来の技術】
オーディオデータを記録したＣＤ等の光ディスクや磁気テープ等の記録媒体から再生装置でデータを読み出し、記録装置へデータを転送し、そのデータを記録装置で別の記録媒体へ記録するダビング操作は一般的に行われている。例えば、読み出したオーディオデータをデジタル化した後にＭＰＥＧ（Ｍｏｖｉｎｇ　Ｐｉｃｔｕｒｅ　Ｅｘｐｅｒｔｓ　Ｃｏｄｉｎｇ　Ｇｒｏｕｐ）やＡＴＲＡＣ（Ａｄａｐｔｉｖｅ　Ｔｒａｎｓｆｏｒｍ　Ａｃｏｕｓｔｉｃ　Ｃｏｄｉｎｇ）などのような圧縮処理を施し、圧縮データの形式で記録媒体に記録する方法も一般的になっている。
【０００３】
データ圧縮を行うと、同じ時間の音楽や画像データが数分の１から１０数分の１のデータ量に圧縮できるため記録媒体の使用量を少なくすることができ、また記録媒体のサイズを小さく出来るため、録音、録画、再生機器の大きさを小さくすることが可能となる。また通信回線を介したオーディオデータの配信においても圧縮したデータを用いることで通信コストを小さくすることが可能となる。
【０００４】
多くのオーディオデータ圧縮方式は、例えば、人間の聴覚特性を利用して人間には聞こえにくい信号成分を圧縮したり、可変長符号化方法や変換テーブルを用いて頻繁に現れる符号パターンには短い符号パターンに置きかえるエントロピー符号化等符号化時に冗長な部分を圧縮する方法を用いて、再生時の音質の劣化を最小限に抑えながら圧縮率を高める工夫がなされている。図４を用いて、オーディオデータの圧縮方法を簡単に説明する。
【０００５】
アナログオーディオデータ４００をサンプリング周波数４４．１ｋＨｚのＡ／Ｄコンバータ４０１によってデジタルオーディオデータ４０９に変換される。この５１２ワードのデジタルオーディオデータ４０９を１フレームとして処理が行われる。
【０００６】
デジタルオーディオデータ４０９は直交変換装置４０２によって５１２ワードのスペクトルデータ４１０に変換される。絶対値最大スペクトルデータ抽出装置４０３では、これらのスペクトルデータ４１０を所定のワード単位でユニットと呼ばれるグループに分けられ、ユニット毎に最大の絶対値を持つスペクトルデータを検出し、正規化基準データ４１４を出力する。量子化ビット割り当て装置４０８は、正規化基準データ４１４を用いて各グループに割り当てる量子化ビット数データ４１５を算出する。使用できる量子化ビット数は限られており、残りビット数記憶装置４０８に記憶されている残りビット数４１７を参照しながら量子化ビット数データを決めていく。正規化装置４０５では、正規化基準データ４１４を用いて、各ユニット毎にスペクトルデータ４１１に正規化を施して正規化スペクトルデータ４１２を生成する。量子化装置４０６は量子化ビット数データ４１５を用いて、各ユニット毎に正規化スペクトルデータ４１２を量子化し、符号化オーディオデータ４１３を生成する。多重化装置４０７は符号化オーディオデータ４１３と、伸長処理に必要な量子化ビット数データ４１５と正規化基準データ４１４とを、圧縮方式によってあらかじめ決められた形式に従って多重化データ４１８を生成し、出力する。
【０００７】
人間の聴覚にはある信号レベル以下になると信号そのものがあっても検知できなくなる境界が存在し、周波数によってそのレベルが変化する。この周波数特性を最小可聴限と言い、各グループの最大の絶対値を持つスペクトルデータの大きさが最小可聴限のレベル以下の場合には、人間には検知できない信号とみなすことができ量子化ビットを割り当てる必要がない。また、スペクトルデータの量子化の際には量子化ノイズが発生するが、量子化ノイズが残存していてもそのレベルが最小可聴限のレベル以下ならば人間が検知出来なければ聴感上問題がないため、信号レベルが大きくても最小可聴限のレベルが大きければ、量子化ビットを多く割り当てる必要がなくなる。このようにして、人間の聴覚特性を用いることで、割り当てる量子化ビットを減らすことが可能となる。
【０００８】
各グループのスペクトルデータは該当グループの最大スペクトルデータによって正規化され、先に決められた量子化ビット数によって量子化される。
【０００９】
この量子化オーディオデータ、割り当てビット数データと正規化基準データを一定のフォーマットに従って多重化される。この多重化データの１フレームあたりのデータ数は決まっており、通常は量子化ビットが足りない場合が多い。この時に、制限されたデータ量に収まるようにどのスペクトルデータのグループにどれだけ量子化ビットを割り振るかを明確に規定している圧縮フォーマットはほとんどなく、基本的には各圧縮装置によって異なる。
【００１０】
【発明が解決しようとする課題】
これまでは、前述のようにコンパクトディスクなど非圧縮のオーディオデータを入力としてその機器で対応している圧縮方式によって圧縮し、記録媒体に記録して音楽を楽しむことが多かったが、最近ではあらかじめ圧縮された音楽データが入力となる機会が増えてきている。例えば、電子配信によって配信された圧縮データをユーザが受信して記録媒体に記録する、というものである。配信された圧縮データの圧縮方式がユーザの保有する再生機器の対応する圧縮方式と同じならばそのままその機器で再生が可能であるが、圧縮データの圧縮方式とユーザが保有している機器が対応している圧縮方式とが異なる場合がある。
【００１１】
このような場合、例えば配信された圧縮データをコンピュータ等でその圧縮方式に準じて伸長し、更に機器が対応している圧縮方式で圧縮しなおさなければならない。
【００１２】
また圧縮方式には、同じ方式で複数の異なる圧縮率に対応しているものもあり、同じ時間のオーディオデータをより小さいデータ量で記録しなおしたい場合には上記と同様に一旦伸長処理を行った後、より圧縮率の高いモードで圧縮を行って記録する場合もある。
【００１３】
オーディオデータを圧縮する際には、前述のように所定の圧縮率を達成するために限定された量子化ビット数をより聴感上有効とみなされるデータに割り当てるが、このビット割り当て方法は圧縮する方式ではもちろんのこと、同じ方式でも実行する機器やソフトウェア等によって異なる。
【００１４】
例えば、ある入力信号に対して聴覚モデル等を用いて量子化ビット数を計算した結果、図５の５０１のような量子化ビットの分布になったが、多重化データの規定ワード数より多いために割り当てた量子化ビットを減らさなければならない。
第一の圧縮方式では図５の５０２のように全帯域から一様に量子化ビットを減らす方法をとっている。この場合、空白の升目分が量子化ノイズに相当する。一方、第二の圧縮方式では、図５の５０３のように高域のユニットから順に割当ビットを減らす方法をとっている。
【００１５】
第一の圧縮方式で圧縮した後に一旦伸長し、再度第二の圧縮方式で圧縮した場合、中低域の信号に多く量子化ビットを割り当てても、第一の圧縮時に既に量子化ノイズを含んでいるので、元にはもどらない。更に、第一の圧縮方式では残っていた高域部の信号が第二の圧縮方式によって量子化ビットが割り当てられなくなり、音質が劣化することになる。
【００１６】
本発明は上記の問題点を解決することを目的とするものであり、圧縮されたオーディオデータを伸長し、再び圧縮する場合に、再圧縮する際に起こる音質劣化を最小限に抑えることが可能なオーディオ信号圧縮伸長装置及び方法を提供することを目的とする。
【００１７】
【課題を解決するための手段】
本発明に係るオーディオ信号圧縮伸長装置及び方法は、次のような手段を講じることにより、上記の課題を解決するものである。すなわち、オーディオデータを第一の圧縮方式で圧縮し、伸長処理に必要なパラメータと共に多重化された多重化データを、逆多重化によって量子化オーディオデータと伸長処理に必要なパラメータとに分離する。その符号化オーディオデータとパラメータデータをそれぞれ第二の圧縮方式でのフォーマットに準ずるデータに変換し、そのまま多重化させて第二の圧縮方式のフォーマットに準じた多重化データを生成する。
また、パラメータ保持装置によって逆多重化の際に得られたパラメータデータを保持しておき、第一の圧縮方式で圧縮された符号化データを伸長した後、第二の圧縮方式での再圧縮時に保持したパラメータを参照しながら各ユニットに割り当てる量子化ビット数を決定する。
【００１８】
本発明では、圧縮されたオーディオデータを一旦伸長してオーディオデータを生成した後、他の圧縮方式、もしくは同じ圧縮方式で再度オーディオデータを圧縮する際に、最初の圧縮処理で残された成分を伸長されたパラメータや量子化オーディオデータ自身から取り出し、保持し、再圧縮時にそれらを利用することによって、再圧縮時に通常起こりうる音質劣化を低減させることが可能となる。
【００１９】
【発明の実施の形態】
以下、本発明に係るオーディオ信号圧縮伸長装置及び方法の具体的な実施の形態について図面を用いて詳細に説明する。
【００２０】
（実施の形態１）
実施の形態１は、第一の圧縮処理によって生成された多重化データを逆多重化して量子化オーディオデータとパラメータデータとに分離し、それぞれのデータを第二の圧縮方式に適合するように変換し、多重化することで、従来伸長処理、再圧縮処理を連続して行うことによって発生していた音質劣化を低減させるものである。
【００２１】
以下、図１、図３を用いて第１の実施の形態に係るオーディオ信号圧縮伸長装置を説明する。図１は本発明の実施の形態１におけるオーディオ信号圧縮伸長装置の構成を示すブロック図である。図３は各データ変換手段で使用するテーブルである。
【００２２】
逆多重化手段１０１は多重化データ１０６から、第一の圧縮処理方式によって圧縮された符号化オーディオデータ１０７と各ユニットの正規化基準データ１０９と、各ユニットに割り当てられた量子化ビット数データ１１１とに分離する。
【００２３】
第一のデータ変換手段１０５は、量子化ビット数データ１１１から第二の圧縮方式に準じた量子化ビット数データ１１２に変換する。まず、量子化ビット数データ変換テーブル３０１を用いて、第一の圧縮方式の量子化データに対応する第二の圧縮方式の量子化ビット数データを検索し、変換する。更に、ユニット変換テーブル３００によって、第一の圧縮方式のユニットが第二の圧縮方式においてどのユニットに含まれるかを確認し、ユニットがまたがる場合には、その複数ユニットの量子化ビット数データの中で最小値を採用する。
【００２４】
更新した量子化ビット数データから全割当ビット数を算出し、第二の圧縮方式による多重化データのワード数より大きいかどうかをチェックする。第一の圧縮方式による全割当ビット数の方が大きい場合には、第一の圧縮方式による全割当ビット数の方が小さくなるまで最も高域にあるユニットから順に割当ビット数を１つずつ減らして行く。
【００２５】
第二のデータ変換手段１０４では、第一の圧縮方式に準じた正規化基準データ１０９を第二の圧縮方式に準じた正規化基準データ１１０に変換する。まず、正規化基準データ変換テーブル３０２を用いて、第一の圧縮方式の正規化基準データに対応する第二の圧縮方式の正規化基準データを検索し、変換する。更に、ユニット変換テーブル２００によって、第一の圧縮方式のユニットが第二の圧縮方式においてどのユニットに含まれるかを確認し、複数のグループにまたがる場合には、その複数グループの正規化データの中で最大値を採用する。正規化データがここで変更した場合には、
第三のデータ変換手段１０２は、符号化データ１０７を第二の圧縮方式に準拠した符号化データ１０８に変換する。その際、第一のデータ変換手段１０５及び第二のデータ変換手段１０４で正規化データ１１０や量子化ビット数データ１１１が変更された場合には、変更前のデータを用いて逆量子化を行った後、変更後のデータによって量子化を行い、量子化データを生成する。
【００２６】
多重化手段１０３は第二の圧縮方式に準拠したオーディオ符号化データ１０８、正規化データ１１０、量子化ビット数データ１１１を多重化し、第二の圧縮方式に準拠した伸長装置によって伸長可能な多重化データ１１３を生成する。
【００２７】
このように実施の形態１では、ある圧縮方式で圧縮したオーディオデータを伸長し再度圧縮する際に、実際には伸長せずに符号化データとパラメータを再圧縮方式にあわせて変換することによって、最初の圧縮処理時の割当情報をほとんどそのまま用いることができ、再圧縮時に発生する音質劣化を軽減することができる。
【００２８】
（実施の形態２）
実施の形態２は、伸長処理で使用したパラメータを別途保持しておき、再圧縮時にそのデータを参照することによって音質劣化を軽減するものである。
【００２９】
以下、図２を用いて実施の形態２におけるオーディオ信号圧縮伸長装置を説明する。図２は本発明の実施の形態２におけるオーディオ信号圧縮伸長装置の構成を示すブロック図である。
【００３０】
逆多重化手段２０１は多重化データ２１１から、第一の圧縮処理方式によって圧縮された符号化オーディオデータ及び全ユニットの正規化データ２１２と、全ユニットの量子化ビット数データ２１９とに分離する。
【００３１】
伸長装置２０２は量子化オーディオデータ及び正規化基準データ２１２と量子化ビット数データ２１９によって伸長処理を行い、デジタルオーディオデータ２１３を生成する。これで、第一の圧縮方式に準じた伸長処理が終了する。
【００３２】
直交変換装置２０３で第二の圧縮方式に準じた方式でデジタルオーディオデータ２１３をスペクトルデータ２１４に変換する。
【００３３】
スペクトルデータ２１４を所定のワード単位でユニットに分けられ、絶対値最大スペクトルデータ抽出装置２０４によってユニット毎に最大の絶対値を持つスペクトルデータを検出し、最大絶対値のスペクトルデータから正規化基準データ２２０を算出し、スペクトルデータ２１６とを出力する。
【００３４】
正規化装置２０５は各ユニット毎に正規化基準データ２２０を用いてスペクトルデータ２１６の正規化を行い正規化スペクトルデータ２１６を出力する。
【００３５】
一方、データ変換装置２０８は逆多重化装置２０１から出力された量子化ビット数データ２１９からビット割当情報２２１を算出する。まず、量子化ビット数データ２１９から各スペクトルデータに割り当てられる割り当てビット数とその総計である全割り当てビット数を算出する。次に、各スペクトルデータが第二の圧縮方式で規定されたどのユニットに含まれるかを確認しながら該当するスペクトルデータに割り当てられる量子化ビット数を算出する。各ユニットに割り当てられる量子化ビット数を全割り当てビット数で割ることで、各ユニットに割り当てられた量子化ビット数の比を求め、これをビット割り当て情報２２１として出力する。
【００３６】
量子化ビット割り当て装置２０９は、通常ならば絶対値最大スペクトルデータ抽出装置２０４から出力される正規化基準データ２２０と、あらかじめ決められた算出ルールに従って各ユニットのスペクトルデータに割り当てる量子化割り当てビット数を算出する。しかし、データ変換装置２０８にビット割当情報２２１がある場合、各ユニットのビット割当情報２２１と残りビット数記憶装置２１０に初期値として保持している全割り当てビット数とを掛けることによって各ユニットへの最初の割り当てビット数を算出する。その後、各ユニットに含まれるスペクトルデータの数等を考慮して調整を行って、各ユニットに割り当てる量子化割り当てビット数データ２２４を出力する。
【００３７】
量子化装置２０６は正規化スペクトルデータ２１６をユニット毎に量子化割当ビットデータ２２４をもとにして量子化を行い、符号化オーディオデータ２１７を出力する。
【００３８】
多重化装置２０７は、符号化オーディオデータ２１７、量子化割当ビットデータ２２４、正規化基準データ２２０を第二の圧縮方式に準じた形で多重化し、多重化データ２１８を出力する。
【００３９】
このように実施の形態２では、通常行われる伸長処理と再圧縮処理との連続処理において、伸長処理の際に得られる量子化ビットの割当情報を保持しておき、再圧縮処理時に圧縮装置固有の量子化ビット割当アルゴリズムを用いずに先の割当情報を用いて圧縮処理を行うために、最初の圧縮処理時の割当情報をほぼそのまま用いることができ、再圧縮時に発生する音質劣化を軽減することができる。
【００４０】
（実施の形態３）
実施の形態３は、第一の圧縮処理によって生成された多重化データを逆多重化して量子化オーディオデータとパラメータデータとに分離し、それぞれのデータを第二の圧縮方式に適合するように変換し、多重化することで、従来伸長処理、再圧縮処理を連続して行うことによって発生していた音質劣化を低減させるものである。
【００４１】
以下、図６、図７、図８、図９、図１０、図１１を用いて実施の形態３におけるオーディオ信号圧縮伸長方法を説明する。図６は本発明の実施の形態３におけるオーディオ信号圧縮伸長方法の処理順序を示すフローチャートである。　図７はオーディオ信号圧縮伸長方法の処理の一部である、第一のデータ変換処理の処理内容を示すフローチャートである。図８は同じくオーディオ信号圧縮伸長方法の処理の一部である、第二のデータ変換処理の処理内容を示すフローチャートである。図９は同じくオーディオ信号圧縮伸長方法の処理の一部である、第三のデータ変換処理の処理内容を示すフローチャートである。図１０は第一の圧縮処理による多重化データを示す図である。
【００４２】
逆多重化処理Ｓ６０２は、多重化データ１０００から多重化されているデータを取り出す処理である。ユニット数を表すユニット数データ１００１、各ユニットの符号化スペクトルデータに割り当てられる量子化ビット数を表す量子化ビット数データ１００２、各ユニットのスペクトルデータが正規化された際に用いた正規化データ１００３、そして正規化、量子化された符号化オーディオデータ１１０４を第一の圧縮フォーマットに従ってそれぞれ抽出し、保持する。
【００４３】
第一のデータ変換処理Ｓ９０３は、逆多重化処理Ｓ９０２によって取り出した量子化ビット数データ１１０２を第二の圧縮方式に準じたデータ形式に変換する処理である。最初に、多重化データ１１００の総データビット数から、ユニット数データ１１０１、量子化ビット数データ１１０２、正規化データ１１０３に使用したビット数を引いた、符号化オーディオデータ１１０４に割当可能な量子化ビット数の総数（ＳＵＭ１）を算出する。次の処理Ｓ１００３は、第二の圧縮方式で符号化オーディオデータに用いることのできる量子化ビット数の総数ＳＵＭ２を求める。
【００４４】
次に、ＳＵＭ１とＳＵＭ２の大きさを比較し、ＳＵＭ１が大きい場合、すなわち今のままでは第二の圧縮方式では量子化ビット数が不足してしまう場合には、高域のユニットの量子化ビット数を１減らし、ＳＵＭ１からそのユニットに含まれるスペクトルの本数を引くことによって再計算する。これを、ＳＵＭ１がＳＵＭ２以下になるまで繰り返す。量子化ビット数を１減らしたユニットに対しては、割当ビット更新フラグをつけ、量子化ビット数データをこの処理で変更したユニットがこのフラグでわかるようにする。
【００４５】
ＳＵＭ１がＳＵＭ２以下になった時点で、それぞれの量子化ビット数データを第二の圧縮方式に準拠した形に変換する。第一の圧縮方式では、量子化ビット数データはそのユニットに割り当てるビット数をそのままデータにしていたが、第二の圧縮方式では、実際に割り当てるビット数に１を足したものをデータとしているため、ここでは各データに１を足して変換を行う。
【００４６】
第二のデータ変換処理Ｓ９０４は、逆多重化処理Ｓ９０２によって取り出した正規化基準データ１１０３を第二の圧縮方式に準拠したデータ形式に変換する処理である。図１１は第二のデータ変換処理Ｓ９０４を詳細に説明したフローチャートである。
【００４７】
あらかじめ、第二の圧縮方式における正規化基準データの最大値（ＭＡＸ＃ＳＦ２）と最小値（ＭＩＮ＃ＳＦ２）とを保持しておく。逆多重化処理Ｓ９０２によって多重化データから抽出した各ユニットの正規化基準データをＭＡＸ＃ＳＦ２と比較する。ＭＡＸ＃ＳＦ２より大きい場合、該当の正規化基準データは第二の圧縮方式でのデータとして扱うことができないため、ＭＡＸ＃ＳＦ２に置き換える。同様にＭＩＮ＃ＳＦ２と比較して、ＭＩＮ＃ＳＦ２よりも小さい場合にはＭＩＮ＃ＳＦ２に置き換える。このような置き換え処理が発生したユニットに対しては、正規化基準データ更新フラグをつけ、データをこの処理で変更したユニットがこのフラグでわかるようにする。この後、正規化基準データを第二の圧縮フォーマットに準じた形式に変換する。これを全ユニット行って終了となる。
【００４８】
第三のデータ変換処理Ｓ９０５は、符号化オーディオデータ１１０４を第二の圧縮方式に準拠したデータ形式に変換する処理である。図１２は第三のデータ変換処理Ｓ９０５を詳細に説明したフローチャートである。
【００４９】
各ユニットに対して、第一のデータ変換方法Ｓ９０３及び第二のデータ変換方法Ｓ９０４でデータの変更を行った際につけたデータ更新フラグ（ＳＦＦｉ，　ＢＩＴＦｉ）を読み出し、１かどうかを確認する。割当ビットデータか正規化基準データのどちらかが変更されていた場合、そのユニットに含まれる符号化オーディオデータを変更前の割当ビットデータと正規化基準データで一旦逆量子化してオーディオスペクトルデータに戻した後、変更後の割当ビットデータ及び正規化基準データで再度量子化を行う。しかし、ＳＦＦｉ，　ＢＩＴＦｉの両方とも０の場合は何も行わない。そして、符号化データを第二の圧縮方式に準じた形式に変換する。ここでは、該当するユニットの量子化ビット数データ、正規化基準データ、符号化データから求める符号化データが得られるようなテーブルを用意しておき、テーブル引きで変換できるようにする。
【００５０】
多重化処理Ｓ９０６は、逆多重化処理Ｓ９０２、第一のデータ変換処理Ｓ９０３、第二のデータ変換処理Ｓ９０４、第三のデータ変換処理Ｓ９０５から得られたパラメータによって第二の圧縮方式に準じた多重化データを生成し、圧縮処理を終了する。
【００５１】
このように実施の形態３では、ある圧縮方式で圧縮したオーディオデータを伸長し再度圧縮する際に、実際には伸長せずに符号化データとパラメータを再圧縮方式にあわせて変換することによって、最初の圧縮処理時の割当情報をほとんどそのまま用いることができ、再圧縮時に発生する音質劣化を軽減することができる。
（実施の形態４）
実施の形態４は、伸長処理で使用したパラメータを別途保持しておき、再圧縮時にそのデータを参照することによって音質劣化を軽減するものである。
【００５２】
以下、図１２、図１３、図１４、図１１を用いて実施の形態４におけるオーディオ信号圧縮伸長方法を説明する。図１２は本発明の実施の形態４におけるオーディオ信号圧縮伸長方法の処理順序を示すフローチャートである。　図１３はオーディオ信号圧縮伸長方法の処理の一部である、データ変換処理の処理内容を示すフローチャートである。図１４は同じくオーディオ信号圧縮伸長方法の処理の一部である、量子化ビット割当処理の処理内容を示すフローチャートである。図１１は第一の圧縮処理による多重化データを示す図である。
【００５３】
逆多重化処理Ｓ１２０２は、図９の逆多重化処理Ｓ９０２と同様の処理を行い、多重化データ１４００からユニット数を表すユニット数データ１４０１、各ユニットのスペクトルデータに割り当てられる量子化ビット数を表す量子化ビット数データ１４０２、各ユニットのスペクトルデータが正規化された際に用いた正規化基準データ１４０３、そして正規化、量子化された符号化オーディオデータ１４０４を分離する。
【００５４】
伸長処理Ｓ１２０３は、逆多重化処理Ｓ１２０２で得た量子化ビット数データ１４０２、正規化基準データ１４０３を用いて、符号化オーディオデータ１４０４を逆量子化、逆正規化、逆直交変換処理を行って、デジタルオーディオデータを生成する。これで、第一の圧縮方式によって圧縮されたオーディオデータの伸長処理が完了する。
【００５５】
直交変換Ｓ１２０４は、伸長処理Ｓ１２０３によって得られたデジタルオーディオデータに直交変換処理を施し、スペクトルオーディオデータを得る。
【００５６】
絶対値最大スペクトルデータ抽出処理Ｓ１２０５は、スペクトルオーディオデータを第二の圧縮方式で規定されているユニット毎にグループ分けし、各ユニットに含まれるスペクトルデータの中で最大の絶対値を算出する。
【００５７】
データ変換処理Ｓ１２０６は、逆多重化処理Ｓ１２０２によって得られた量子化ビットデータ１４０２からビット割当情報を出力する処理である。図１３はデータ変換処理Ｓ１２０６を詳細に説明したフローチャートである。まず、第一の圧縮処理においてオーディオデータに割り当てられた量子化ビット数の総和を求めておく。次に、スペクトルデータの本数分のメモリエリアを確保し、第一の圧縮処理におけるユニット毎に、該当するユニットに含まれるスペクトルデータの番号のメモリエリアに割当ビット数を書き込む。次に、第二の圧縮処理におけるユニット毎に、該当するユニットに含まれるスペクトルデータ番号のデータエリアに書き込まれたビット数を合計して、該当するユニットに与えられた量子化ビット数の総和を求める。各ユニットの量子化ビット数を、スペクトルデータに与えられた量子化ビットの総数で割って、総量子化ビット数に対する割合を算出し、これを保持しておく。
【００５８】
量子化ビット割当処理Ｓ１２０７は、データ変換処理Ｓ１２０６で得られた総量子化ビット数に対する割合データをもとに量子化ビット数を算出する。図１４は量子化ビット割当処理の詳細なフローチャートである。あらかじめ、第二の圧縮方式でスペクトルオーディオデータに割当可能な総割当ビット数を算出しておく。第二の圧縮方式における各ユニット毎に、まず総量子化ビット数に対する割合データを読み出し、先の総割当ビット数を掛けることで該当するユニットに割り当てる量子化ビット数を算出する。算出結果が、該当するユニットに含まれるスペクトルの本数の倍数かどうかを調べ、そうでなければ算出結果よりも大きな倍数に置き換える。
【００５９】
全てのユニットで計算を行った後、全ユニットに割り当てたビット数の総和を求め、割当可能な総ビット数と大きさを比較する。もし割当可能総ビット数の方が小さい場合、高域のユニットから各スペクトルデータに割り当てるビット数を１つ減らす。これを割当可能総ビット数の方が大きくなるまで繰り返す。これによって各ユニットに割り当てる量子化ビット数データを算出する。
【００６０】
正規化処理Ｓ１２０８は絶対値最大スペクトルデータ抽出処理Ｓ１２０５で算出した正規化基準データをもとにスペクトルオーディオデータを正規化し、正規化オーディオデータを算出する。
【００６１】
量子化処理Ｓ１２０９は量子化ビット割当処理Ｓ１２０７で算出した量子化ビット数データから、正規化オーディオデータを量子化して符号化オーディオデータを算出する。
【００６２】
多重化処理Ｓ１２１０は、量子化ビット数データ、正規化基準データ、符号化オーディオデータを第二の圧縮フォーマットに従って多重化処理を行い、多重化データを生成する。
【００６３】
このように実施の形態４では、通常行われる伸長処理と再圧縮処理との連続処理において、伸長処理の際に得られる量子化ビットの割当情報を保持しておき、再圧縮処理時に圧縮装置固有の量子化ビット割当アルゴリズムを用いずに先の割当情報を用いて圧縮処理を行うために、最初の圧縮処理時の割当情報をほとんどそのまま用いることができ、再圧縮時に発生する音質劣化を軽減することができる。
【００６４】
【発明の効果】
以上説明したように本発明によれば、一度オーディオデータに圧縮処理を施して得られた圧縮データを伸長して再圧縮を行う場合に、逆多重化によって得られた最初の圧縮処理時に各スペクトルデータに割り当てられた量子化ビット数等の情報を再圧縮処理時に利用することで、最初の圧縮処理で量子化ビットを割り当てた帯域に再圧縮時でも優先的に割り振ることができ、従来通り伸長処理後に再圧縮を行う場合よりも音質劣化を低減することが出来る。
【００６５】
最初の圧縮方式と再圧縮時の圧縮方式とで、圧縮処理の過程が比較的似ている場合や、同じ場合で圧縮率を変換するだけの場合には、伸長処理時に必要となるパラメータを再圧縮時に適用する圧縮方式に適合する形に変換し、多重化することで最初の圧縮処理の情報を使用することができ、再圧縮時の音質劣化を低減することができる。
【００６６】
また、二つの圧縮方式の間に類似性が少なくパラメータ変換などが困難な場合には、従来通り伸長処理を行った後、最圧縮処理を行うが、逆多重化で得られる前の圧縮処理でのパラメータを保持しておき、再圧縮時に参照することにより最初の圧縮処理で量子化ビットが割り当てられた帯域に量子化ビットを割り当てることができ、パラメータを参照しない場合に比較して音質劣化を低減することができる。
【００６７】
以上、本発明の内容は装置として実施が可能であり、一方で計算機のプログラムででも実行が可能である。
【図面の簡単な説明】
【図１】第１の実施の形態であるオーディオ信号圧縮伸長装置のブロック図
【図２】第２の実施の形態であるオーディオ信号圧縮伸長装置のブロック図
【図３】第一の圧縮方式のデータを第二の圧縮方式のデータに変換するためのテーブルを示す図
【図４】従来例のオーディオ信号圧縮装置のブロック図
【図５】各ユニットに割り当てられる理想的な量子化ビット数と、圧縮装置によって割り当て方法が異なることを示した図
【図６】第３の実施の形態であるオーディオ信号圧縮伸長方法の処理の手順を示したフローチャート
【図７】第３の実施の形態であるオーディオ信号圧縮伸長方法の第一のデータ変換処理の処理手順を示したフローチャート
【図８】第３の実施の形態であるオーディオ信号圧縮伸長方法の第二のデータ変換処理の処理手順を示したフローチャート
【図９】第３の実施の形態であるオーディオ信号圧縮伸長方法の第三のデータ変換処理の処理手順を示したフローチャート
【図１０】逆多重化処理を処理手順を示したフローチャート
【図１１】多重化データの構成を示す図
【図１２】第４の実施の形態であるオーディオ信号圧縮伸長方法の処理の手順を示したフローチャート
【図１３】第４の実施の形態であるオーディオ信号圧縮伸長方法のデータ変換処理の処理手順を示したフローチャート
【図１４】第４の実施の形態であるオーディオ信号圧縮伸長方法の量子化ビット割当処理の処理手順を示したフローチャート
【符号の説明】
１０１　逆多重化手段
１０２　第３のデータ変換手段
１０３　多重化手段
１０４　第２のデータ変換手段
１０５　第１のデータ変換手段[0001]
TECHNICAL FIELD OF THE INVENTION
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and apparatus for multiplexing audio data compressed by a predetermined method and parameters necessary for decompression, and more particularly to a method and apparatus for re-compressing audio data once compressed by a different or the same compression method. About.
[0002]
[Prior art]
A dubbing operation of reading data from a recording medium such as a CD or a magnetic tape or the like on which audio data is recorded by a reproducing device, transferring the data to the recording device, and recording the data on another recording medium by the recording device is common. Has been done. For example, a method of digitizing read audio data and then performing compression processing such as MPEG (Moving Picture Experts Coding Group) or ATRAC (Adaptive Transform Acoustic Coding) and recording the data on a recording medium in the form of compressed data is also common. It has become.
[0003]
When data compression is performed, music and image data of the same time can be compressed to several tenths to one tenth of a data amount, so that the amount of recording media used can be reduced, and the size of recording media can be reduced. Therefore, it is possible to reduce the size of the recording, recording, and playback devices. Also, in the distribution of audio data via a communication line, communication costs can be reduced by using compressed data.
[0004]
Many audio data compression methods use, for example, the human auditory characteristics to compress signal components that are difficult for humans to hear, and use short-length codes for code patterns that frequently appear using variable-length coding methods or conversion tables. By using a method of compressing a redundant portion at the time of encoding such as entropy encoding that replaces a pattern, a scheme has been devised to increase the compression ratio while minimizing deterioration of sound quality during reproduction. A compression method of audio data will be briefly described with reference to FIG.
[0005]
The analog audio data 400 is converted into digital audio data 409 by an A / D converter 401 having a sampling frequency of 44.1 kHz. The digital audio data 409 of 512 words is processed as one frame.
[0006]
The digital audio data 409 is converted by the orthogonal transformer 402 into 512-word spectral data 410. The maximum-absolute-value spectrum data extracting device 403 divides the spectrum data 410 into groups called units in predetermined word units, detects spectrum data having the maximum absolute value for each unit, and converts the normalized reference data 414 into units. Output. The quantization bit allocation device 408 calculates quantization bit number data 415 to be allocated to each group using the normalized reference data 414. The number of quantization bits that can be used is limited, and the quantization bit number data is determined with reference to the remaining bit number 417 stored in the remaining bit number storage device 408. The normalization device 405 performs normalization on the spectrum data 411 for each unit using the normalization reference data 414 to generate normalized spectrum data 412. The quantization device 406 quantizes the normalized spectrum data 412 for each unit using the quantization bit number data 415, and generates encoded audio data 413. The multiplexing device 407 generates multiplexed data 418 from the encoded audio data 413, the quantization bit number data 415 necessary for the decompression process, and the normalization reference data 414 according to a format predetermined by a compression method, and outputs the multiplexed data 418. I do.
[0007]
In human hearing, when the signal level falls below a certain signal level, there is a boundary that cannot be detected even if the signal itself exists, and the level changes depending on the frequency. This frequency characteristic is called minimum audibility, and when the magnitude of the spectrum data having the maximum absolute value of each group is equal to or less than the minimum audibility level, it can be regarded as a signal that cannot be detected by humans and the quantization bit No need to assign. In addition, quantization noise is generated when the spectrum data is quantized. However, even if the quantization noise remains, if its level is equal to or lower than the minimum audible level, there is no problem in the sense of hearing unless a human can detect it. Therefore, even if the signal level is large, if the minimum audible level is large, it is not necessary to allocate many quantization bits. In this way, by using the human auditory characteristics, it is possible to reduce the number of quantization bits to be allocated.
[0008]
The spectral data of each group is normalized by the maximum spectral data of the group, and is quantized by the previously determined number of quantization bits.
[0009]
The quantized audio data, the allocated bit number data and the normalized reference data are multiplexed according to a certain format. The number of data per frame of the multiplexed data is determined, and the number of quantization bits is usually insufficient in many cases. At this time, there is hardly any compression format clearly defining how many quantization bits are allocated to which spectral data group so as to fit in the limited data amount, and basically differs depending on each compression device.
[0010]
[Problems to be solved by the invention]
Until now, as described above, uncompressed audio data such as a compact disc was input and compressed by the compression method supported by the device, and it was often recorded on a recording medium to enjoy music. Opportunities for inputting compressed music data are increasing. For example, a user receives compressed data distributed by electronic distribution and records it on a recording medium. If the compression method of the distributed compressed data is the same as the corresponding compression method of the playback device owned by the user, playback can be performed on that device as it is, but the compression method of the compressed data and the device owned by the user correspond The compression method used may be different.
[0011]
In such a case, for example, the distributed compressed data must be decompressed by a computer or the like in accordance with the compression method, and further compressed again by a compression method supported by the device.
[0012]
Some compression methods use the same method and support a plurality of different compression ratios. If you want to re-record audio data of the same time with a smaller data amount, perform the decompression process once as described above. After recording, the data may be compressed and recorded in a mode having a higher compression rate.
[0013]
When compressing audio data, as described above, a limited number of quantization bits is allocated to data considered to be more perceptually effective in order to achieve a predetermined compression ratio. However, it goes without saying that the same method differs depending on a device or software to be executed.
[0014]
For example, as a result of calculating the number of quantization bits for a certain input signal using an auditory model or the like, a distribution of quantization bits as shown in 501 in FIG. 5 is obtained. Must be reduced.
The first compression method employs a method of uniformly reducing the number of quantization bits from the entire band as indicated by 502 in FIG. In this case, the blank cells correspond to the quantization noise. On the other hand, the second compression method employs a method of decreasing the number of bits to be allocated in order from the higher frequency unit as indicated by 503 in FIG.
[0015]
If the image is temporarily expanded after being compressed by the first compression method and then compressed again by the second compression method, even if a large number of quantization bits are assigned to the middle and low band signals, quantization noise is already included in the first compression. , So it does not return. Further, the remaining high-frequency signal in the first compression method is not assigned a quantization bit by the second compression method, and the sound quality is degraded.
[0016]
SUMMARY OF THE INVENTION An object of the present invention is to solve the above-described problems, and it is possible to minimize the sound quality deterioration that occurs when recompressing when expanding and recompressing compressed audio data. It is an object of the present invention to provide an audio signal compression / decompression apparatus and method.
[0017]
[Means for Solving the Problems]
An audio signal compression / decompression apparatus and method according to the present invention solves the above-described problem by taking the following means. That is, the audio data is compressed by the first compression method, and the multiplexed data multiplexed together with the parameters required for the decompression process are separated into quantized audio data and the parameters required for the decompression process by demultiplexing. The encoded audio data and the parameter data are respectively converted into data conforming to the format of the second compression method, and are multiplexed as they are to generate multiplexed data conforming to the format of the second compression method.
Further, the parameter holding device holds the parameter data obtained at the time of demultiplexing, expands the coded data compressed by the first compression method, and then re-compresses the data by the second compression method. The number of quantization bits to be allocated to each unit is determined with reference to the stored parameters.
[0018]
According to the present invention, after audio data is generated by temporarily expanding compressed audio data, when the audio data is compressed again by another compression method or the same compression method, the components remaining in the first compression processing are removed. By extracting and storing the decompressed parameters and the quantized audio data itself, and using them at the time of recompression, it is possible to reduce sound quality degradation that normally occurs at the time of recompression.
[0019]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, specific embodiments of an audio signal compression / decompression apparatus and method according to the present invention will be described in detail with reference to the drawings.
[0020]
(Embodiment 1)
In the first embodiment, the multiplexed data generated by the first compression processing is demultiplexed and separated into quantized audio data and parameter data, and each data is converted so as to be compatible with the second compression method. The multiplexing reduces the sound quality deterioration that has conventionally been caused by continuously performing the decompression processing and the recompression processing.
[0021]
The audio signal compression / decompression device according to the first embodiment will be described below with reference to FIGS. FIG. 1 is a block diagram showing a configuration of an audio signal compression / decompression device according to Embodiment 1 of the present invention. FIG. 3 is a table used by each data conversion means.
[0022]
The demultiplexing means 101 converts the multiplexed data 106 from the encoded audio data 107 compressed by the first compression processing method, the normalized reference data 109 of each unit, and the quantized bit number data 111 assigned to each unit. And separated into
[0023]
The first data converter 105 converts the quantized bit number data 111 into quantized bit number data 112 according to the second compression method. First, using the quantization bit number data conversion table 301, quantization bit number data of the second compression method corresponding to the quantization data of the first compression method is searched and converted. Further, the unit conversion table 300 is used to check which unit includes the unit of the first compression scheme in the second compression scheme. Adopts the minimum value.
[0024]
The total number of allocated bits is calculated from the updated quantized bit number data, and it is checked whether it is larger than the number of words of the multiplexed data according to the second compression method. If the total number of allocated bits by the first compression method is larger, the number of allocated bits is reduced by one from the highest band unit until the total number of allocated bits by the first compression method becomes smaller. Go.
[0025]
The second data conversion unit 104 converts the normalized reference data 109 according to the first compression method into the normalized reference data 110 according to the second compression method. First, using the normalization reference data conversion table 302, the normalization reference data of the second compression method corresponding to the normalization reference data of the first compression method is searched and converted. Further, the unit conversion table 200 confirms which unit of the first compression system is included in the second compression system. Adopts the maximum value. If the normalized data changes here,
The third data conversion means 102 converts the encoded data 107 into encoded data 108 conforming to the second compression method. At this time, when the normalized data 110 and the quantized bit number data 111 are changed by the first data conversion means 105 and the second data conversion means 104, inverse quantization is performed using the data before the change. After that, quantization is performed using the changed data to generate quantized data.
[0026]
The multiplexing means 103 multiplexes the audio encoded data 108, the normalized data 110, and the quantized bit number data 111 according to the second compression scheme, and can perform demultiplexing by a decompression device conforming to the second compression scheme. The data 113 is generated.
[0027]
As described above, in the first embodiment, when audio data compressed by a certain compression method is decompressed and re-compressed, the encoded data and parameters are converted according to the re-compression method without actually decompressing. Allocation information at the time of the first compression process can be used almost as it is, and sound quality degradation that occurs at the time of recompression can be reduced.
[0028]
(Embodiment 2)
In the second embodiment, the parameters used in the decompression process are separately stored, and the data is referred to at the time of recompression to reduce the sound quality deterioration.
[0029]
Hereinafter, the audio signal compression / decompression device according to the second embodiment will be described with reference to FIG. FIG. 2 is a block diagram showing a configuration of an audio signal compression / decompression apparatus according to Embodiment 2 of the present invention.
[0030]
The demultiplexing means 201 separates the multiplexed data 211 into coded audio data compressed by the first compression processing method, normalized data 212 of all units, and quantized bit number data 219 of all units.
[0031]
The decompression device 202 performs decompression processing using the quantized audio data and the normalized reference data 212 and the quantized bit number data 219, and generates digital audio data 213. This completes the decompression processing according to the first compression method.
[0032]
The orthogonal audio converter 203 converts the digital audio data 213 into spectrum data 214 by a method according to the second compression method.
[0033]
The spectrum data 214 is divided into units in units of a predetermined word, and the spectrum data having the largest absolute value is detected for each unit by the absolute value maximum spectrum data extracting device 204, and the normalized reference data 220 Is calculated, and the spectrum data 216 is output.
[0034]
The normalization device 205 normalizes the spectrum data 216 using the normalization reference data 220 for each unit, and outputs the normalized spectrum data 216.
[0035]
On the other hand, the data converter 208 calculates bit allocation information 221 from the quantized bit number data 219 output from the demultiplexer 201. First, the number of allocated bits allocated to each spectrum data and the total number of allocated bits, which is the total number of allocated bits, are calculated from the quantized bit number data 219. Next, the number of quantization bits allocated to the corresponding spectral data is calculated while confirming which unit specified by the second compression method is included in each spectral data. By dividing the number of quantization bits allocated to each unit by the total number of allocated bits, a ratio of the number of quantization bits allocated to each unit is obtained, and this is output as bit allocation information 221.
[0036]
The quantization bit allocation device 209 calculates the normalized reference data 220 normally output from the absolute maximum spectrum data extraction device 204 and the number of quantization allocation bits to be allocated to the spectrum data of each unit according to a predetermined calculation rule. calculate. However, when the data conversion device 208 has the bit allocation information 221, the bit allocation information 221 of each unit is multiplied by the total number of allocated bits held in the remaining bit number storage device 210 as an initial value, so that the number of bits allocated to each unit is increased. Calculate the initial number of allocated bits. Thereafter, adjustment is performed in consideration of the number of spectrum data included in each unit and the like, and quantization allocation bit number data 224 allocated to each unit is output.
[0037]
The quantizer 206 quantizes the normalized spectrum data 216 based on the quantization assignment bit data 224 for each unit, and outputs encoded audio data 217.
[0038]
The multiplexing device 207 multiplexes the coded audio data 217, the quantized allocation bit data 224, and the normalization reference data 220 according to the second compression method, and outputs multiplexed data 218.
[0039]
As described above, in the second embodiment, in the continuous processing of the decompression processing and the recompression processing which are normally performed, the allocation information of the quantization bits obtained at the time of the decompression processing is held, and the compression apparatus specific information is stored at the time of the recompression processing. Since the compression processing is performed using the previous allocation information without using the quantization bit allocation algorithm, the allocation information at the time of the first compression processing can be used almost as it is, and the sound quality deterioration occurring at the time of recompression is reduced. be able to.
[0040]
(Embodiment 3)
In the third embodiment, the multiplexed data generated by the first compression processing is demultiplexed and separated into quantized audio data and parameter data, and each data is converted so as to be compatible with the second compression method. The multiplexing reduces the sound quality deterioration that has conventionally been caused by continuously performing the decompression processing and the recompression processing.
[0041]
Hereinafter, an audio signal compression / decompression method according to the third embodiment will be described with reference to FIGS. 6, 7, 8, 9, 10, and 11. FIG. 6 is a flowchart showing the processing order of the audio signal compression / decompression method according to Embodiment 3 of the present invention. FIG. 7 is a flowchart showing the processing contents of the first data conversion processing, which is a part of the processing of the audio signal compression / expansion method. FIG. 8 is a flowchart showing the processing contents of a second data conversion processing, which is also a part of the processing of the audio signal compression / expansion method. FIG. 9 is a flowchart showing the processing contents of the third data conversion processing, which is also a part of the processing of the audio signal compression / expansion method. FIG. 10 shows multiplexed data obtained by the first compression processing.
[0042]
The demultiplexing process S602 is a process for extracting multiplexed data from the multiplexed data 1000. Unit number data 1001 indicating the number of units, quantization bit number data 1002 indicating the number of quantization bits allocated to the encoded spectrum data of each unit, normalized data 1003 used when the spectrum data of each unit is normalized , And the normalized and quantized encoded audio data 1104 are extracted and stored according to the first compression format.
[0043]
The first data conversion process S903 is a process of converting the quantized bit number data 1102 extracted in the demultiplexing process S902 into a data format according to the second compression method. First, the number of bits used for the unit number data 1101, the quantization bit number data 1102, and the normalized data 1103 is subtracted from the total number of data bits of the multiplexed data 1100, and the quantization that can be assigned to the encoded audio data 1104. The total number of bits (SUM1) is calculated. In the next processing S1003, the total number SUM2 of the number of quantization bits that can be used for the encoded audio data in the second compression method is obtained.
[0044]
Next, the sizes of SUM1 and SUM2 are compared, and if SUM1 is large, that is, if the number of quantization bits is insufficient in the second compression method as it is, quantization bits of the high-frequency unit are Recalculate by reducing the number by 1 and subtracting the number of spectra contained in the unit from SUM1. This is repeated until SUM1 becomes equal to or less than SUM2. An allocation bit update flag is attached to a unit in which the number of quantization bits has been reduced by 1, so that the unit whose quantization bit number data has been changed in this process can be identified by this flag.
[0045]
When SUM1 becomes equal to or less than SUM2, each quantized bit number data is converted into a form conforming to the second compression method. In the first compression method, the quantization bit number data is the data of the number of bits allocated to the unit as it is, but in the second compression method, the data is obtained by adding 1 to the actually allocated bit number. Here, conversion is performed by adding 1 to each data.
[0046]
The second data conversion processing S904 is processing for converting the normalized reference data 1103 extracted in the demultiplexing processing S902 into a data format conforming to the second compression method. FIG. 11 is a flowchart illustrating the second data conversion processing S904 in detail.
[0047]
The maximum value (MAX # SF2) and the minimum value (MIN # SF2) of the normalized reference data in the second compression method are stored in advance. The normalized reference data of each unit extracted from the multiplexed data in the demultiplexing process S902 is compared with MAX # SF2. If it is larger than MAX # SF2, the corresponding normalized reference data cannot be treated as data in the second compression method, and is therefore replaced with MAX # SF2. Similarly, if it is smaller than MIN # SF2 compared to MIN # SF2, it is replaced with MIN # SF2. A unit for which such replacement processing has occurred is provided with a normalization reference data update flag so that the unit whose data has been changed by this processing can be identified by this flag. Thereafter, the normalized reference data is converted into a format according to the second compression format. This is performed for all units, and the process ends.
[0048]
The third data conversion processing S905 is processing for converting the encoded audio data 1104 into a data format conforming to the second compression method. FIG. 12 is a flowchart illustrating the third data conversion processing S905 in detail.
[0049]
For each unit, a data update flag (SFFi, BITFi) attached when the data is changed by the first data conversion method S903 and the second data conversion method S904 is read, and whether it is 1 or not is checked. If either the assigned bit data or the normalized reference data has been changed, the coded audio data contained in that unit is once dequantized with the assigned bit data and the normalized reference data before the change and returned to the audio spectrum data. After that, the quantization is performed again using the changed assigned bit data and the normalized reference data. However, when both SFFi and BITFi are 0, nothing is performed. Then, the encoded data is converted into a format according to the second compression method. Here, a table is prepared so that the coded data obtained from the quantization bit number data, the normalized reference data, and the coded data of the corresponding unit is prepared, and conversion can be performed by table lookup.
[0050]
The multiplexing processing S906 is based on the parameters obtained from the demultiplexing processing S902, the first data conversion processing S903, the second data conversion processing S904, and the third data conversion processing S905. The compressed data is generated, and the compression process ends.
[0051]
As described above, in the third embodiment, when audio data compressed by a certain compression method is decompressed and re-compressed, the encoded data and parameters are converted according to the re-compression method without actually decompressing. Allocation information at the time of the first compression process can be used almost as it is, and sound quality degradation that occurs at the time of recompression can be reduced.
(Embodiment 4)
In the fourth embodiment, the parameters used in the decompression process are separately stored, and the sound quality is reduced by referring to the data at the time of recompression.
[0052]
Hereinafter, the audio signal compression / decompression method according to the fourth embodiment will be described with reference to FIGS. 12, 13, 14, and 11. FIG. 12 is a flowchart showing the processing order of the audio signal compression / decompression method according to Embodiment 4 of the present invention. FIG. 13 is a flowchart showing the processing contents of the data conversion processing, which is a part of the processing of the audio signal compression / expansion method. FIG. 14 is a flowchart showing the details of the quantization bit allocation process, which is a part of the process of the audio signal compression / expansion method. FIG. 11 is a diagram showing multiplexed data obtained by the first compression processing.
[0053]
The demultiplexing process S1202 performs the same process as the demultiplexing process S902 in FIG. 9 and indicates the number of units 1401 indicating the number of units from the multiplexed data 1400 and the number of quantization bits allocated to the spectrum data of each unit. The quantized bit number data 1402, the normalized reference data 1403 used when the spectral data of each unit is normalized, and the normalized and quantized coded audio data 1404 are separated.
[0054]
The decompression process S1203 performs inverse quantization, inverse normalization, and inverse orthogonal transform on the encoded audio data 1404 using the quantization bit number data 1402 and the normalization reference data 1403 obtained in the demultiplexing process S1202. And generate digital audio data. Thus, the decompression processing of the audio data compressed by the first compression method is completed.
[0055]
In the orthogonal transform S1204, the digital audio data obtained in the decompression process S1203 is subjected to an orthogonal transform to obtain spectral audio data.
[0056]
The absolute value maximum spectrum data extraction processing S1205 divides the spectrum audio data into groups defined by the second compression method, and calculates the maximum absolute value among the spectrum data included in each unit.
[0057]
The data conversion processing S1206 is processing for outputting bit allocation information from the quantized bit data 1402 obtained by the demultiplexing processing S1202. FIG. 13 is a flowchart illustrating the data conversion processing S1206 in detail. First, a total sum of the number of quantization bits assigned to audio data in the first compression processing is obtained. Next, a memory area for the number of spectrum data is secured, and for each unit in the first compression processing, the number of allocated bits is written to the memory area of the number of the spectrum data included in the corresponding unit. Next, for each unit in the second compression process, the number of bits written in the data area of the spectrum data number included in the corresponding unit is summed, and the sum of the number of quantization bits given to the corresponding unit is calculated. Ask. The number of quantization bits of each unit is divided by the total number of quantization bits given to the spectrum data to calculate a ratio to the total number of quantization bits, and this ratio is stored.
[0058]
The quantization bit allocation processing S1207 calculates the number of quantization bits based on the ratio data to the total number of quantization bits obtained in the data conversion processing S1206. FIG. 14 is a detailed flowchart of the quantization bit allocation process. The total number of allocated bits that can be allocated to spectrum audio data by the second compression method is calculated in advance. For each unit in the second compression method, first, ratio data with respect to the total number of quantization bits is read, and the number of quantization bits to be allocated to the corresponding unit is calculated by multiplying the data by the total number of allocated bits. It is checked whether the calculation result is a multiple of the number of spectra included in the corresponding unit, and if not, it is replaced with a multiple that is larger than the calculation result.
[0059]
After calculation is performed for all units, the total number of bits allocated to all units is calculated, and the total number of bits that can be allocated is compared with the size. If the total number of bits that can be allocated is smaller, the number of bits allocated to each spectrum data from the high-band unit is reduced by one. This is repeated until the total number of assignable bits becomes larger. Thereby, quantization bit number data to be allocated to each unit is calculated.
[0060]
The normalization processing S1208 normalizes the spectrum audio data based on the normalization reference data calculated in the absolute value maximum spectrum data extraction processing S1205, and calculates normalized audio data.
[0061]
A quantization process S1209 quantizes the normalized audio data from the quantization bit number data calculated in the quantization bit allocation process S1207 to calculate encoded audio data.
[0062]
The multiplexing processing S1210 performs multiplexing processing on the quantization bit number data, the normalization reference data, and the encoded audio data according to the second compression format to generate multiplexed data.
[0063]
As described above, in the fourth embodiment, in the continuous processing of the decompression processing and the recompression processing that are normally performed, the allocation information of the quantization bits obtained at the time of the decompression processing is held, and the compression apparatus unique Since the compression processing is performed using the previous allocation information without using the quantization bit allocation algorithm, the allocation information at the time of the first compression processing can be used almost as it is, and the sound quality deterioration occurring at the time of recompression is reduced. be able to.
[0064]
【The invention's effect】
As described above, according to the present invention, when decompressing and recompressing compressed data obtained by once performing compression processing on audio data, each spectrum is decompressed at the first compression processing obtained by demultiplexing. By using information such as the number of quantization bits assigned to data at the time of recompression processing, it is possible to preferentially allocate to the band to which quantization bits were assigned in the first compression processing even at the time of recompression, and expand as before Sound quality deterioration can be reduced as compared with the case where recompression is performed after the processing.
[0065]
If the compression process is relatively similar between the initial compression method and the compression method at the time of recompression, or if only the compression ratio is converted when the compression method is the same, the parameters necessary for the decompression process are reset. By converting the data into a form suitable for the compression method applied at the time of compression and multiplexing, the information of the first compression processing can be used, and the sound quality deterioration at the time of recompression can be reduced.
[0066]
If the two compression methods have little similarity and it is difficult to perform parameter conversion, etc., the decompression process is performed as before, and the re-compression process is performed, but the compression process before demultiplexing is performed. By holding the parameters and referencing them at the time of recompression, the quantization bits can be assigned to the band to which the quantization bits have been assigned in the first compression processing. Can be reduced.
[0067]
As described above, the contents of the present invention can be implemented as an apparatus, and can also be executed by a computer program.
[Brief description of the drawings]
FIG. 1 is a block diagram of an audio signal compression / decompression device according to a first embodiment;
FIG. 2 is a block diagram of an audio signal compression / decompression device according to a second embodiment;
FIG. 3 is a diagram showing a table for converting data of a first compression method into data of a second compression method;
FIG. 4 is a block diagram of a conventional audio signal compression device.
FIG. 5 is a diagram showing an ideal number of quantization bits allocated to each unit and a different allocation method depending on a compression device.
FIG. 6 is a flowchart showing the processing procedure of an audio signal compression / decompression method according to a third embodiment;
FIG. 7 is a flowchart showing a processing procedure of a first data conversion process of the audio signal compression / decompression method according to the third embodiment;
FIG. 8 is a flowchart showing a processing procedure of a second data conversion process of the audio signal compression / decompression method according to the third embodiment;
FIG. 9 is a flowchart showing a processing procedure of a third data conversion process of the audio signal compression / expansion method according to the third embodiment;
FIG. 10 is a flowchart showing a processing procedure of demultiplexing processing;
FIG. 11 is a diagram showing a configuration of multiplexed data.
FIG. 12 is a flowchart showing a procedure of processing of an audio signal compression / expansion method according to a fourth embodiment;
FIG. 13 is a flowchart showing a processing procedure of data conversion processing of an audio signal compression / expansion method according to a fourth embodiment;
FIG. 14 is a flowchart showing a procedure of a quantization bit allocation process in the audio signal compression / decompression method according to the fourth embodiment;
[Explanation of symbols]
101 demultiplexing means
102 Third data conversion means
103 multiplexing means
104 second data conversion means
105 First data conversion means

Claims

The digital audio signal on the time axis is converted into a spectrum signal on the frequency axis for each predetermined time frame, and the generated spectrum signal for each frame is divided into a plurality of units, and each unit is included in the unit. The spectral signal of the maximum level is detected from the spectral signals to be detected, and the spectral signal contained in each unit is compressed, encoded and expanded by allocating a predetermined number of quantization bits based on the auditory characteristics according to the maximum level. The multiplexed data obtained by multiplexing with the necessary parameters is transmitted or recorded on the recording medium, and the multiplexed data read or transmitted from the recording medium is demultiplexed to obtain the necessary parameters and codes at the time of decompression. Separates the encoded data, dequantizes the encoded data using parameters, and To generate a signal, an audio signal decompression apparatus for generating audio data in the time domain by an inverse transform,
Encoding audio data converting means for converting the first encoded audio data compressed and quantized by the first compression method generated by the demultiplexing means into encoded audio data conforming to the second compression method; ,
Parameter conversion means for converting a plurality of parameters required to decompress the first encoded audio data generated by the demultiplexing means into parameters compatible with the second compression method,
An audio signal compression / decompression device comprising:

After expanding the first encoded audio data into an audio signal using the plurality of parameters, when re-compressing by the second compression method,
Parameter holding means for holding the first plurality of parameters,
With reference to the first plurality of parameters held in the parameter holding means, quantization bit allocation means to determine the allocation of quantization bits,
An audio signal compression / decompression device comprising:

The audio signal compression / decompression device according to claim 1, wherein the parameter conversion device derives conversion data using a table.

The digital audio signal on the time axis is converted into a spectrum signal on the frequency axis for each predetermined time frame, and the generated spectrum signal for each frame is divided into a plurality of units, and each unit is included in the unit. The spectrum signal of the maximum level is detected from the spectrum signals to be detected, and the spectrum signal included in each unit is compressed, encoded and expanded by allocating a predetermined number of quantization bits based on the auditory characteristics according to the maximum level. The multiplexed data obtained by multiplexing with the necessary parameters is transmitted or recorded on the recording medium, and the multiplexed data read or transmitted from the recording medium is demultiplexed to encode the necessary parameters at the time of decompression. Separate the data and dequantize the encoded data using the parameters to obtain the spectrum It generates No., an audio signal decompression method for generating audio data in the time domain by an inverse transform,
An encoded audio data converting step of converting the first encoded audio data compressed and quantized by the first compression method generated by the demultiplexing means into encoded audio data conforming to the second compression method; ,
A parameter conversion step of converting a plurality of parameters necessary for decompressing the first encoded audio data generated by the demultiplexing means into parameters compatible with the second compression method;
An audio signal compression / expansion method characterized by comprising:

After expanding the first encoded audio data into an audio signal using the plurality of parameters, when re-compressing by the second compression method,
A parameter holding step of holding the first plurality of parameters,
An audio signal compression / expansion method, comprising: a quantization bit allocation step of determining the number of quantization bits allocated to each unit by referring to the first plurality of parameters held in the parameter holding means. .