JP4024784B2

JP4024784B2 - Audio decoding device

Info

Publication number: JP4024784B2
Application number: JP2004210342A
Authority: JP
Inventors: 弥章佐藤
Original assignee: United Module Corp
Current assignee: United Module Corp
Priority date: 2004-07-16
Filing date: 2004-07-16
Publication date: 2007-12-19
Anticipated expiration: 2016-01-12
Also published as: JP2004302493A

Description

本発明はオーディオ復号装置に関し、特に、時間／周波数変換技術を用いて周波数領域で符号化されたオーディオデータを復号する際のオーバーサンプリング方式に関するものである。 The present invention relates to an audio decoding apparatus, and more particularly to an oversampling method for decoding audio data encoded in the frequency domain using a time / frequency conversion technique.

従来、オーディオ信号の符号化方式については、様々な方式が知られている。その一例として、オーディオ信号を時間領域の信号から周波数領域の信号に変換し、周波数領域で符号化を行う方式がある。時間／周波数変換を行う方式としては、例えば、サブバンドフィルタやＭＤＣＴ(Modified Discrete Cosine Transform)を用いた方式があり、このような方式を用いた符号化方式としてＭＰＥＧ（Moving Picture Image Coding Experts Group ）オーディオが挙げられる。 Conventionally, various methods are known for encoding audio signals. As an example, there is a method in which an audio signal is converted from a time domain signal to a frequency domain signal and encoded in the frequency domain. As a method for performing time / frequency conversion, for example, there is a method using a subband filter or MDCT (Modified Discrete Cosine Transform), and MPEG (Moving Picture Image Coding Experts Group) is an encoding method using such a method. Audio.

上記ＭＰＥＧオーディオのレイヤＩでは、クリティカル・バンド（ある周波数スペクトルのピーク近傍の周波数では聴感度が低下するというマスキング効果の及ぶ周波数幅）などの聴覚心理モデルを効率よく利用するために、全帯域が３２の等間隔の周波数幅に分割される。そして、分割された各帯域内の信号が、元のサンプリング周波数の１／３２でサブサンプリングされて符号化される。 In the above-mentioned MPEG audio layer I, in order to efficiently use a psychoacoustic model such as a critical band (a frequency range with a masking effect that a hearing sensitivity decreases at a frequency in the vicinity of a peak of a certain frequency spectrum) Divided into 32 equally spaced frequency widths. Then, the divided signals in each band are subsampled at 1/32 of the original sampling frequency and encoded.

このようにして所定のサンプリングレートに従って符号化されたオーディオデータの復号化は、基本的には上記符号化と逆の操作によって行われる。
図６は、従来のＭＰＥＧオーディオ復号装置の構成を、処理の流れが分かりやすくなるように示したブロック図である。なお、この例では、サンプリング周波数が44.1KHz 、ビット幅が１６ビットでオーディオデータが符号化されているものとする。 The decoding of the audio data encoded according to the predetermined sampling rate in this way is basically performed by an operation reverse to the above encoding.
FIG. 6 is a block diagram showing the configuration of a conventional MPEG audio decoding apparatus so that the processing flow can be easily understood. In this example, it is assumed that audio data is encoded with a sampling frequency of 44.1 KHz and a bit width of 16 bits.

図６において、符号化されたオーディオデータは、まず最初にアンパック回路５１に入力される。一般に、ＭＰＥＧオーディオにより符号化されたオーディオデータは、主にアロケーション（Allocation）、スケールファクタ（Scale Factor）、サンプル（Sample）から構成されている。上記アンパック回路５１は、入力される符号化オーディオデータのビットストリームから上記アロケーション（Allocation）、スケールファクタ（Scale Factor）、サンプル（Sample）の各データを分離して抽出する。 In FIG. 6, encoded audio data is first input to the unpack circuit 51. In general, audio data encoded by MPEG audio mainly includes an allocation, a scale factor, and a sample. The unpack circuit 51 separates and extracts the allocation, scale factor, and sample data from the input encoded audio data bit stream.

上記アンパック回路５１により分離された各データは、次に周波数／時間変換回路５２に入力される。周波数／時間変換回路５２では、上記アンパック回路５１から入力される各データに基づいて周波数領域の信号であるサブバンド情報Ｓ_kが求められ、更に以下に示す（式１）に従って上記サブバンド情報Ｓ_kから時間領域の信号であるＶベクタＶ[i] が求められる。 Each data separated by the unpack circuit 51 is then input to the frequency / time conversion circuit 52. The frequency / time conversion circuit 52, the subband information S _k is a signal in the frequency domain based on the data inputted from the unpack circuit 51 is determined, the sub-band information S in accordance with further below (Equation 1) _A V vector V [i], which is a time domain signal, is _obtained from _k .

上記周波数／時間変換回路５２により求められたＶベクタは、Ｖバッファ５３に一時的に格納された後、フィルタ回路５４に与えられ、所定のフィルタ係数を用いてフィルタ処理が施されることにより、ディジタルのＰＣＭデータ（44.1KHz ）が生成される。そして、このようにして求められたＰＣＭデータが１６ビットＤＡＣ（ディジタル−アナログ・コンバータ）５５によりアナログ信号に変換されて出力される。 The V vector obtained by the frequency / time conversion circuit 52 is temporarily stored in the V buffer 53 and then given to the filter circuit 54, and is subjected to filter processing using a predetermined filter coefficient. Digital PCM data (44.1 KHz) is generated. The PCM data thus obtained is converted into an analog signal by a 16-bit DAC (digital-analog converter) 55 and output.

ところが、上記のように構成された従来のオーディオ復号装置では、サンプリング周波数の１／２の周波数の近傍において折り返し雑音が生じることがあり、再生されるアナログ信号の波形が歪んでしまうことがあった。このため、符号化されたオーディオデータを復号化して符号化前のオーディオ信号を再生する際に、音声の再現性が悪くなってしまうという問題があった。 However, in the conventional audio decoding apparatus configured as described above, aliasing noise may occur in the vicinity of half the sampling frequency, and the waveform of the reproduced analog signal may be distorted. . For this reason, when the encoded audio data is decoded and the audio signal before encoding is reproduced, there is a problem that sound reproducibility is deteriorated.

例えば、44.1KHz のサンプリングレートで20KHz のコサイン波をデコードした場合、図６の１６ビットＤＡＣ５５から出力されるアナログのオーディオ信号は、図９に示すような波形となる。符号化前の波形を示す図１０と比較すると、音声の再現性が著しく悪化していることが分かる。 For example, when a 20 KHz cosine wave is decoded at a sampling rate of 44.1 KHz, the analog audio signal output from the 16-bit DAC 55 of FIG. 6 has a waveform as shown in FIG. Compared with FIG. 10 showing the waveform before encoding, it can be seen that the reproducibility of speech is remarkably deteriorated.

従来、このような問題を解決するために、図６の１６ビットＤＡＣ５５の代わりに、図７に示すような１ビットＤＡＣシステム５６を用いるようにした技術が考えられている。上記１ビットＤＡＣシステム５６は、ＦＩＦＯメモリ５７および乗加算器５８から成る補間器５９と、ＤＡＣ６０とを備えている。 Conventionally, in order to solve such a problem, a technique in which a 1-bit DAC system 56 as shown in FIG. 7 is used instead of the 16-bit DAC 55 in FIG. 6 has been considered. The 1-bit DAC system 56 includes an interpolator 59 including a FIFO memory 57 and a multiplier / adder 58, and a DAC 60.

この１ビットＤＡＣシステム５６は、ＭＰＥＧオーディオデコーダ５０より出力されるＰＣＭデータをＦＩＦＯメモリ５７にある程度蓄積し、その蓄積したＰＣＭデータに対して、乗加算器５８によりディジタルフィルタ処理を施す。これにより、離散的な実データ間のデータ値を推測した補間データを得て、その補間データも含めてＤＡＣ６０によりＤ／Ａ変換を行うことにより、アナログのオーディオ信号を出力するものである。 This 1-bit DAC system 56 accumulates PCM data output from the MPEG audio decoder 50 to a certain extent in the FIFO memory 57, and applies digital filter processing to the accumulated PCM data by a multiplier / adder 58. As a result, interpolation data in which data values between discrete real data are estimated is obtained, and analog audio signals are output by performing D / A conversion by the DAC 60 including the interpolation data.

また、図８は、図７に示した機能ブロックの構成を、ハードウェアイメージに即して書き直した図である。なお、図８において、図７に示したブロックと同じブロックには同一の符号を付している。 FIG. 8 is a diagram in which the functional block configuration shown in FIG. 7 is rewritten according to the hardware image. In FIG. 8, the same blocks as those shown in FIG. 7 are denoted by the same reference numerals.

図８に示したＭＰＥＧオーディオデコーダ５０内にある乗加算器６１は、図７の周波数／時間変換回路５２における周波数／時間変換処理と、フィルタ回路５４における所定のフィルタ処理とを行うものである。それらの処理を行う際に必要な種々の係数は、係数ＲＯＭ／ＲＡＭ６２に記憶されているものが利用される。 The multiplier / adder 61 in the MPEG audio decoder 50 shown in FIG. 8 performs frequency / time conversion processing in the frequency / time conversion circuit 52 in FIG. 7 and predetermined filter processing in the filter circuit 54. As various coefficients necessary for performing these processes, those stored in the coefficient ROM / RAM 62 are used.

また、図８に示したメモリ６３は、上記周波数／時間変換処理および所定のフィルタ処理を行う際に使用するワークメモリ、および図７に示したＶバッファ５３を含むものである。ＰＣＭデータ出力部６４は、上記所定のフィルタ処理により生成されメモリ６３に格納されたＰＣＭデータをＭＰＥＧオーディオデコーダ５０の外部に出力するものである。 The memory 63 shown in FIG. 8 includes a work memory used when performing the frequency / time conversion process and the predetermined filter process, and the V buffer 53 shown in FIG. The PCM data output unit 64 outputs the PCM data generated by the predetermined filter process and stored in the memory 63 to the outside of the MPEG audio decoder 50.

一方、図８に示した１ビットＤＡＣシステム５６内にある係数ＲＯＭ／ＲＡＭ６５は、乗加算器５８によりディジタルフィルタ処理を施す際に使用するフィルタ係数等を記憶するものである。なお、フィルタ係数は複数種類記憶されていて、どれを利用するかによって再生音声の音質がある程度決められる。 On the other hand, the coefficient ROM / RAM 65 in the 1-bit DAC system 56 shown in FIG. 8 stores filter coefficients used when the digital adder 58 performs digital filter processing. A plurality of types of filter coefficients are stored, and the sound quality of the reproduced sound is determined to some extent depending on which filter coefficient is used.

図７あるいは図８に示したような１ビットＤＡＣシステム５６を用いれば、補間データの利用により元の波形に比較的近い波形を再現できるようになり、音質の劣化を少なくすることができる。 If the 1-bit DAC system 56 as shown in FIG. 7 or FIG. 8 is used, a waveform that is relatively close to the original waveform can be reproduced by using the interpolation data, and deterioration in sound quality can be reduced.

しかしながら、この１ビットＤＡＣシステム５６を用いた場合には、ＤＡＣ６０の他に、相当の演算能力を有する乗加算器５８や、ＦＩＦＯメモリ５７、係数ＲＯＭ／ＲＡＭ６５などの種々の構成が必要となるため、回路規模が大きくなってしまうとともに、高価になってしまうという問題があった。 However, when this 1-bit DAC system 56 is used, in addition to the DAC 60, various configurations such as a multiplier / adder 58, a FIFO memory 57, a coefficient ROM / RAM 65, and the like having considerable calculation capability are required. There is a problem that the circuit scale becomes large and the cost becomes high.

本発明はこのような問題を解決するために成されたものであり、１ビットＤＡＣシステムの機能を持たせる場合に、回路規模が大きくなるのを防ぎ、高価になるのを避けることを目的としている。 The present invention has been made to solve such problems, and it is intended to prevent an increase in circuit scale and avoid an increase in cost when a 1-bit DAC system function is provided. Yes.

本発明のオーディオ復号装置は、時間／周波数変換を用いて周波数領域で符号化されたオーディオデータを周波数領域の情報から時間領域の情報に変換して復号化し、上記復号化オーディオデータを用いた補間データを生成し、上記復号化オーディオデータおよび上記補間データを含むディジタルのオーディオデータをアナログのオーディオ信号に変換するようにしたオーディオ復号装置であって、上記復号化オーディオデータを得るための上記時間／周波数変換処理およびフィルタ処理、並びに、上記復号化オーディオデータを用いて上記補間データを生成するためのフィルタ処理を行う乗算器と、上記時間／周波数変換処理および上記フィルタ処理を行う際に使用するワークメモリ、並びに、上記復号化オーディオデータを蓄積するＦＩＦＯメモリとして機能するメモリとを備え、更に、上記時間／周波数変換処理を、符号化の規格に従った基本的な時間軸情報を生成する場合の処理レートよりも細かい処理レートを設定して行うことにより、上記基本的な時間軸情報と、上記基本的な時間軸情報を補間するための時間軸情報とを同時に生成するようにした点に特徴を有する。 The audio decoding apparatus of the present invention converts audio data encoded in the frequency domain using time / frequency conversion from information in the frequency domain to information in the time domain, decodes the audio data, and performs interpolation using the decoded audio data. An audio decoding device that generates data and converts digital audio data including the decoded audio data and the interpolated data into an analog audio signal, wherein the time / time for obtaining the decoded audio data is obtained. A multiplier for performing a frequency conversion process and a filter process, and a filter process for generating the interpolation data using the decoded audio data; and a work used when the time / frequency conversion process and the filter process are performed. FI for storing memory and decoded audio data And a memory that functions as O memory, further, performing the time / frequency conversion process, to set the fine processing rate than the processing rate when generating the basic time axis information in accordance with the coding standard Thus, the basic time axis information and the time axis information for interpolating the basic time axis information are generated at the same time.

本発明によれば、時間／周波数変換を用いて周波数領域で符号化されたオーディオデータを周波数領域の情報から時間領域の情報に変換して復号化し、上記復号化オーディオデータを用いた補間データを生成し、上記復号化オーディオデータおよび上記補間データを含むディジタルのオーディオデータをアナログのオーディオ信号に変換するようにしたオーディオ復号装置であって、上記時間／周波数変換の際に用いる乗加算器およびメモリと、上記補間データ生成の際に用いる乗加算器およびメモリとを、それぞれ１つの乗加算器およびメモリで共用する構成にしたので、Ｄ／Ａ変換の際に補間データを得て音声の再現性を向上させるための構成を少ないハードウェア量で実現することができる。
更に、上記時間／周波数変換処理を、符号化の規格に従った基本的な時間軸情報を生成する場合の処理レートよりも細かい処理レートを設定して行うことにより、上記基本的な時間軸情報と、上記基本的な時間軸情報を補間するための時間軸情報とを同時に生成するようにしたので、上記基本的な時間軸情報を補間するための時間軸情報および上記復号化オーディオデータを用いた補間データの両方を用いることができる。 According to the present invention, audio data encoded in the frequency domain using time / frequency conversion is converted from information in the frequency domain to information in the time domain and decoded, and interpolation data using the decoded audio data is converted. An audio decoding apparatus for generating and converting digital audio data including the decoded audio data and the interpolation data into an analog audio signal, and a multiplier / adder and a memory used for the time / frequency conversion And the multiplier / adder and the memory used for generating the interpolation data are shared by one multiplier / adder and the memory, respectively, so that the interpolation data is obtained during the D / A conversion and the sound reproducibility is obtained. It is possible to realize a configuration for improving the above with a small amount of hardware.
Further, the time / frequency conversion process is performed by setting a processing rate finer than the processing rate in the case of generating basic time axis information in accordance with the encoding standard. And the time axis information for interpolating the basic time axis information are generated at the same time, the time axis information for interpolating the basic time axis information and the decoded audio data are used. Both of the interpolated data can be used.

以下、本発明によるオーディオ復号装置の一実施形態を図面に基づいて詳細に説明する。 Hereinafter, an audio decoding apparatus according to an embodiment of the present invention will be described in detail with reference to the drawings.

図１は、本実施形態によるオーディオ復号装置の要素的特徴を示すブロック図である。なお、このオーディオ復号装置は、時間／周波数変換を用いて周波数領域で符号化されたオーディオデータを復号するためのものであり、図１には、その一連の復号化処理の中の１つである周波数／時間変換処理を行う部分のみを示している。 FIG. 1 is a block diagram showing elemental features of the audio decoding apparatus according to the present embodiment. This audio decoding apparatus is for decoding audio data encoded in the frequency domain using time / frequency conversion. FIG. 1 shows one of a series of decoding processes. Only the part which performs a certain frequency / time conversion process is shown.

図１において、（１）（図中では丸付きで表示する）は周波数／時間変換手段であり、上記周波数領域で符号化されたオーディオデータに周波数／時間変換処理を施して、符号化の規格に従った基本的な時間軸情報を生成する。例えば、符号化方式がＭＰＥＧオーディオである場合、この周波数／時間変換手段（１）は、上記（式１）に示した規格に基づく演算式に従って基本的な時間軸情報であるＶベクタＶ[i] を生成する。 In FIG. 1, (1) (indicated by a circle in the figure) is a frequency / time conversion means, which performs a frequency / time conversion process on the audio data encoded in the frequency domain, thereby encoding standards. Basic time axis information according to the above is generated. For example, when the encoding method is MPEG audio, the frequency / time conversion means (1) uses the V vector V [i which is basic time axis information according to the arithmetic expression based on the standard shown in the above (Expression 1). ] Is generated.

また、（２）（図中では丸付きで表示する）は補間データ生成手段であり、上記周波数／時間変換手段（１）により基本的な時間軸情報を生成する際に行う周波数／時間変換処理の演算と同様の演算によって、上記基本的な時間軸情報を補間するための補間データを生成する。例えば、符号化方式がＭＰＥＧオーディオである場合、この補間データ生成手段（２）は、以下に示す上記（式１）と同様の（式２）に従って補間データＶ[i] ′を生成する。 Further, (2) (displayed with circles in the figure) is an interpolation data generation means, and a frequency / time conversion process performed when basic time axis information is generated by the frequency / time conversion means (1). Interpolation data for interpolating the basic time axis information is generated by the same calculation as the above calculation. For example, when the encoding method is MPEG audio, the interpolation data generation means (2) generates interpolation data V [i] ′ according to (Expression 2) similar to (Expression 1) described below.

なお、図１に示したように、補間データ生成手段（２）において補間データを生成する際に使用する元データは、周波数／時間変換手段（１）で基本的な時間軸情報を生成する際に使用する元データと同じである。 As shown in FIG. 1, the original data used when generating the interpolation data in the interpolation data generation means (2) is used when the basic time axis information is generated in the frequency / time conversion means (1). It is the same as the original data used for.

（３）（図中では丸付きで表示する）はマルチプレクス手段であり、上記周波数／時間変換手段（１）により生成された基本的な時間軸情報と、上記補間データ生成手段（２）により生成された補間データとを合わせる処理を行う。その後、このマルチプレクス手段（３）より出力されるデータに対して所定の処理が施されて、ディジタルの復号化オーディオデータが生成される。そして、図示しないＤ／Ａ変換手段によりアナログのオーディオ信号に変換されて出力される。 (3) (displayed with a circle in the figure) is a multiplex means, and the basic time axis information generated by the frequency / time conversion means (1) and the interpolation data generation means (2). A process of matching with the generated interpolation data is performed. Thereafter, the data output from the multiplex means (3) is subjected to a predetermined process to generate digital decoded audio data. Then, it is converted into an analog audio signal by a D / A conversion means (not shown) and output.

このように、図１の実施形態によれば、一連の復号化処理の中の周波数／時間変換処理において、符号化の規格に従った基本的な時間軸情報の他に、その基本的な時間軸情報を補間するための補間データが上記周波数／時間変換処理の演算と同様の演算によって同時に生成されるようになるので、複雑な構成の１ビットＤＡＣシステムを用いなくても補間データを得ることができるようになり、その補間データの利用により音声の再現性を向上させることができる。 As described above, according to the embodiment of FIG. 1, in the frequency / time conversion process in the series of decoding processes, in addition to the basic time axis information according to the encoding standard, the basic time axis information is used. Interpolation data for interpolating axis information is generated simultaneously by the same calculation as the calculation of the frequency / time conversion process, so that interpolation data can be obtained without using a complicated 1-bit DAC system. Thus, the reproducibility of voice can be improved by using the interpolation data.

図２は、図１に示した本発明の特徴を実現する具体的なオーディオ復号装置の構成例を示すブロック図である。この図２は、時間／周波数変換を用いた符号化方式の例として、ＭＰＥＧオーディオを採用した場合のオーディオ復号装置について示したものであり、オーディオデータは、44.1KHz のサンプリング周波数で符号化されているものとする。 FIG. 2 is a block diagram showing an example of the configuration of a specific audio decoding apparatus that implements the features of the present invention shown in FIG. FIG. 2 shows an audio decoding apparatus when MPEG audio is used as an example of an encoding method using time / frequency conversion. Audio data is encoded at a sampling frequency of 44.1 KHz. It shall be.

図２に示すように、本実施形態のＭＰＥＧオーディオデコーダ１は、アンパック回路２、周波数／時間変換回路３、Ｖバッファ４およびフィルタ回路５により構成される。上記アンパック回路２は、入力される符号化オーディオデータのビットストリームからアロケーション（Allocation）、スケールファクタ（Scale Factor）、サンプル（Sample）の各データを分離するものである。 As shown in FIG. 2, the MPEG audio decoder 1 according to this embodiment includes an unpack circuit 2, a frequency / time conversion circuit 3, a V buffer 4, and a filter circuit 5. The unpack circuit 2 separates allocation, scale factor, and sample data from the input encoded audio data bit stream.

また、周波数／時間変換回路３は、上記アンパック回路２により分離された各データに基づいてシンセサイザ合成処理を行うことにより、Ｖベクタを求めるものである。すなわち、このシンセサイザ合成処理では、上記アンパック回路２により分離された各データから周波数領域の信号であるサブバンド情報Ｓ_kを求め、更に以下に示す（式３）に従って、上記サブバンド情報Ｓ_kから時間領域の信号であるＶベクタＶ[i] を求める。 The frequency / time conversion circuit 3 obtains a V vector by performing synthesizer synthesis processing based on each data separated by the unpacking circuit 2. That is, in this synthesizer synthesis process, subband information S _k that is a frequency domain signal is obtained from each data separated by the unpack circuit 2, and further, from the subband information S _k according to (Equation 3) shown below. A V vector V [i], which is a time domain signal, is obtained.

この（式３）では、サンプルｉのきざみ幅を従来の（式１）の場合よりも細かく設定している。すなわち、（式１）ではi=0,1,2,…のようにサンプルｉのきざみ幅が１であったのに対して、（式３）ではi=0,0.5,1,1.5,2,…のようにサンプルｉのきざみ幅を0.5 に設定している。これにより、i=0,1,2,…に対応する基本データの他に、i=0.5,1.5,…に対応する補間データをも同時に計算するようにしている。 In (Expression 3), the step width of the sample i is set finer than in the case of the conventional (Expression 1). That is, in (Equation 1), the step width of sample i is 1 as i = 0,1,2,..., Whereas in (Equation 3), i = 0,0.5,1,1.5,2 , ..., the step size of sample i is set to 0.5. As a result, in addition to basic data corresponding to i = 0, 1, 2,..., Interpolation data corresponding to i = 0.5, 1.5,.

このように、本実施形態では、１ビットＤＡＣシステムを用いてデコード後のＤ／Ａ変換処理の際に補間データを生成するのではなく、一連のデコード処理の中で行う周波数／時間変換処理の際に、サンプルのきざみ幅を細かくして演算することによってオーバーサンプリングを実行し、補間データを同時に生成するようにしている。 As described above, in this embodiment, interpolation data is not generated at the time of D / A conversion processing after decoding using a 1-bit DAC system, but frequency / time conversion processing performed in a series of decoding processing. At this time, oversampling is performed by calculating with a small step size of the sample, and interpolation data is generated at the same time.

また、Ｖバッファ４は、上記周波数／時間変換回路３により求められたＶベクタを一時的に格納するものである。フィルタ回路５は、上記Ｖバッファ４に格納されたＶベクタに対して、所定のフィルタ係数を用いてフィルタ処理を施すことにより、ディジタルのＰＣＭデータを生成するものである。（式３）に示したように、周波数／時間変換回路３では、レートを通常の１／２に細かく設定して処理を行っているので、生成されるＰＣＭデータの周波数は、88.2KHz となる。 The V buffer 4 temporarily stores the V vector obtained by the frequency / time conversion circuit 3. The filter circuit 5 generates digital PCM data by subjecting the V vector stored in the V buffer 4 to filter processing using a predetermined filter coefficient. As shown in (Equation 3), in the frequency / time conversion circuit 3, since the processing is performed with the rate set to 1/2 the normal rate, the frequency of the generated PCM data is 88.2KHz. .

このようにして構成されたＭＰＥＧオーディオデコーダ１の後段に接続されているＤＡＣ６は、上記ＭＰＥＧオーディオデコーダ１より出力されるディジタルのＰＣＭデータをアナログ信号に変換して出力するものである。本実施形態においては、Ｄ／Ａ変換の際に補間データを生成する必要がないので、構成が複雑な１ビットＤＡＣシステムを用いなくても良く、構成が簡単で安価なＤ／ＡコンバータをＤＡＣ６として使用することが可能である。 The DAC 6 connected to the subsequent stage of the MPEG audio decoder 1 thus configured converts the digital PCM data output from the MPEG audio decoder 1 into an analog signal and outputs the analog signal. In the present embodiment, since it is not necessary to generate interpolation data at the time of D / A conversion, it is not necessary to use a 1-bit DAC system having a complicated configuration, and a D / A converter having a simple configuration and an inexpensive configuration can be used. It can be used as

ここで、図１０に示した元のコサイン波形を符号化して得られるオーディオデータを、本実施形態のＭＰＥＧオーディオデコーダ１で復号化した場合にＤＡＣ６から出力されるアナログのオーディオ信号の波形を、図４に示す。 Here, when the audio data obtained by encoding the original cosine waveform shown in FIG. 10 is decoded by the MPEG audio decoder 1 of the present embodiment, the waveform of the analog audio signal output from the DAC 6 is shown in FIG. 4 shows.

この図４の波形と図９の波形とを比較すれば明らかなように、本実施形態によれば、従来に比べて、図１０に示した符号化前の波形により近い波形を得ることができ、音声の再現性を向上させることができている。しかも、本実施形態では、１ビットＤＡＣシステムのような複雑なＤＡＣを用いたり、その他の付加的な構成を設けたりすることなく音声の再現性を向上させることができる。 As is clear from the comparison of the waveform of FIG. 4 and the waveform of FIG. 9, according to the present embodiment, a waveform closer to the waveform before encoding shown in FIG. , The reproducibility of voice can be improved. Moreover, in this embodiment, it is possible to improve the reproducibility of voice without using a complicated DAC such as a 1-bit DAC system or providing any other additional configuration.

なお、以上の実施形態では、（式３）のようにサンプルｉのきざみ幅を通常の１／２に細かく設定することによって２倍のオーバーサンプリングを実現しているが、サンプルｉのきざみ幅を通常の１／Ｍに設定すれば、Ｍ倍のオーバーサンプリングを実現することができる。 In the above embodiment, double the oversampling is realized by finely setting the step width of the sample i to 1/2 of the normal width as shown in (Equation 3), but the step width of the sample i is reduced. If the normal 1 / M is set, M times oversampling can be realized.

図５は、サンプルのきざみ幅を0.125 に設定して８倍のオーバーサンプリングを実行した場合にＤＡＣ６から出力されるアナログのオーディオ信号の波形を示す図である。この図５を見れば明らかなように、２倍のオーバーサンプリングを行った場合に比べて、より原音に近い波形を再生することができ、音声の再現性を更に向上させることができる。 FIG. 5 is a diagram showing a waveform of an analog audio signal output from the DAC 6 when the sample step width is set to 0.125 and 8 times oversampling is executed. As apparent from FIG. 5, compared to the case where the oversampling is performed twice, a waveform closer to the original sound can be reproduced, and the reproducibility of the sound can be further improved.

このように、本実施形態では、１ビットＤＡＣシステムを用いて補間データを生成する場合に比べて、サンプルのきざみ幅を任意に設定することにより、より多くの補間データを生成することができるようになり、音声の再現性を著しく向上させることができるというメリットがある。また、周波数／時間変換処理を行うときに、その処理の演算式と同じ演算式に従って補間データを同時に生成することができるので、通常の復号化の処理プロセスを変更する必要もない。 As described above, in this embodiment, it is possible to generate more interpolation data by arbitrarily setting the step size of the sample as compared to the case of generating the interpolation data using the 1-bit DAC system. Therefore, there is a merit that the reproducibility of voice can be remarkably improved. In addition, when performing the frequency / time conversion process, the interpolation data can be generated simultaneously according to the same arithmetic expression as the arithmetic expression of the process, so that it is not necessary to change the normal decoding process.

図３は、本発明の他の実施形態を示すものであり、この他の実施形態によるオーディオ復号装置のハードウェア構成の例を示す図である。
図８に示したように、補間データを生成するために１ビットＤＡＣシステムを用いた場合、従来は、ＭＰＥＧオーディオデコーダ５０と１ビットＤＡＣシステム５６とが別々に設けられていた。 FIG. 3 shows another embodiment of the present invention, and is a diagram showing an example of a hardware configuration of an audio decoding device according to another embodiment.
As shown in FIG. 8, when a 1-bit DAC system is used to generate interpolation data, conventionally, an MPEG audio decoder 50 and a 1-bit DAC system 56 are provided separately.

これに対して、図３に示す実施形態では、上記ＭＰＥＧオーディオデコーダ５０と１ビットＤＡＣシステム５６とで重複して設けられていた構成を１つにまとめることにより、ハードウェア構成の簡略化を図っている。 On the other hand, in the embodiment shown in FIG. 3, the hardware configuration is simplified by combining the configurations provided redundantly in the MPEG audio decoder 50 and the 1-bit DAC system 56 into one. ing.

すなわち、図３の乗加算器１１は、図８のＭＰＥＧオーディオデコーダ５０内の乗加算器６１と、１ビットＤＡＣシステム５６内の乗加算器５８とを兼用するものである。つまり、図３の乗加算器１１は、図７の周波数／時間変換回路５２におけるシンセイザ合成処理（上記した（式１）に従う演算処理）と、フィルタ回路５４における所定のフィルタ処理と、乗加算器５８におけるディジタルフィルタ処理とを行う。 That is, the multiplier / adder 11 in FIG. 3 serves as both the multiplier / adder 61 in the MPEG audio decoder 50 in FIG. 8 and the multiplier / adder 58 in the 1-bit DAC system 56. That is, the multiplier / adder 11 in FIG. 3 includes a synthesizer synthesis process (arithmetic process according to the above (Equation 1)) in the frequency / time conversion circuit 52 in FIG. 7, a predetermined filter process in the filter circuit 54, and a multiplier / adder. The digital filter processing at 58 is performed.

また、図３の係数ＲＯＭ／ＲＡＭ１２は、図８のＭＰＥＧオーディオデコーダ５０内の係数ＲＯＭ／ＲＡＭ６２と、１ビットＤＡＣシステム５６内の係数ＲＯＭ／ＲＡＭ６５とを兼用するものである。すなわち、図７の周波数／時間変換回路５２における周波数／時間変換処理やフィルタ回路５４における所定のフィルタ処理、および乗加算器５８におけるディジタルフィルタ処理を行う際に必要な種々の係数を記憶している。 Also, the coefficient ROM / RAM 12 in FIG. 3 serves as both the coefficient ROM / RAM 62 in the MPEG audio decoder 50 in FIG. 8 and the coefficient ROM / RAM 65 in the 1-bit DAC system 56. That is, various coefficients necessary for frequency / time conversion processing in the frequency / time conversion circuit 52 of FIG. 7, predetermined filter processing in the filter circuit 54, and digital filter processing in the multiplier / adder 58 are stored. .

また、図３のメモリ１３は、図８のＭＰＥＧオーディオデコーダ５０内のメモリ６３と、１ビットＤＡＣシステム５６内のＦＩＦＯメモリ５７とを兼用するものである。つまり、上述した図３の乗加算器１１における各処理は、このメモリ１３をワークメモリとして使用しながら行うようになっている。 Further, the memory 13 in FIG. 3 serves as both the memory 63 in the MPEG audio decoder 50 in FIG. 8 and the FIFO memory 57 in the 1-bit DAC system 56. That is, each process in the multiplier / adder 11 of FIG. 3 described above is performed while using the memory 13 as a work memory.

図３のＰＣＭデータ出力部１４は、上記乗加算器１１における各処理によって生成されメモリ１３に格納されたＰＣＭデータを外部に出力するものである。また、ＤＡＣ１５は、ＰＣＭデータ出力部１４より出力されるディジタルのＰＣＭデータをアナログ信号に変換して出力するものであり、図７あるいは図８に示したＤＡＣ６０に対応するものである。 The PCM data output unit 14 shown in FIG. 3 outputs the PCM data generated by each process in the multiplier / adder 11 and stored in the memory 13 to the outside. The DAC 15 converts digital PCM data output from the PCM data output unit 14 into an analog signal and outputs the analog signal, and corresponds to the DAC 60 shown in FIG. 7 or FIG.

このように、図３に示す実施形態では、図８に示した従来のＭＰＥＧオーディオデコーダ５０と１ビットＤＡＣシステム５６とで重複して設けられていた構成を１つにまとめて共用しているので、ハードウェア量を削減することができる。なお、図３の場合と図８の場合とで係数ＲＯＭ／ＲＡＭのメモリ量の合計サイズは変化しないが、図３では１つのメモリにまとめたことで構成を簡単にすることができる。 As described above, in the embodiment shown in FIG. 3, the conventional MPEG audio decoder 50 and the 1-bit DAC system 56 shown in FIG. , Can reduce the amount of hardware. The total size of the coefficient ROM / RAM memory amount does not change between the case of FIG. 3 and the case of FIG. 8, but the configuration can be simplified by combining them into one memory in FIG.

また、図３に示す実施形態では、１ビットＤＡＣシステムの機能を有しているので、復号化されたオーディオデータを用いた補間データを生成することができ、音声の再現性が悪化するのを防ぐことができるのはもちろんである。 In addition, in the embodiment shown in FIG. 3, since it has the function of a 1-bit DAC system, it is possible to generate interpolation data using decoded audio data, and the sound reproducibility is deteriorated. Of course, it can be prevented.

更に他の実施形態としては、図２に示した実施形態と、図３に示した実施形態とを合わせたものが考えられる。すなわち、本実施形態は、図３のような構成において、乗加算器１１が行うシンセイザ合成処理を、（式１）ではなくて（式３）に従って行うようにしたものである。 Still another embodiment may be a combination of the embodiment shown in FIG. 2 and the embodiment shown in FIG. That is, in the present embodiment, the synthesizer combining process performed by the multiplier / adder 11 in the configuration as shown in FIG. 3 is performed according to (Expression 3) instead of (Expression 1).

このようにすれば、周波数／時間変換処理を行う際のサンプルのきざみ幅を細かく設定することによって得られる補間データと、１ビットＤＡＣシステムの機能に基づいて得られる補間データとの両方を利用してアナログオーディオ信号を再生することができ、簡単な構成で音声の再現性を更に向上させることが期待できる。 In this way, both the interpolation data obtained by finely setting the step size of the sample when performing the frequency / time conversion process and the interpolation data obtained based on the function of the 1-bit DAC system are used. Thus, it is possible to reproduce an analog audio signal, and it can be expected that the reproducibility of sound is further improved with a simple configuration.

なお、以上に述べた実施形態では、符号化方式の１つとしてＭＰＥＧオーディオを例に挙げたが、時間／周波数変換方式を採用する符号化方式であれば、復号化時における周波数／時間変換の際に上述したようなオーバーサンプリングを実行することができるので、その符号化方式は問わない。 In the embodiment described above, MPEG audio is taken as an example of one of the encoding methods, but if the encoding method adopts the time / frequency conversion method, the frequency / time conversion at the time of decoding is performed. In this case, since the oversampling as described above can be executed, the encoding method is not limited.

例えば、ＭＤＣＴ符号化方式、サブバンド符号化方式、ＡＣ−３符号化方式、あるいはＡＴＲＡＣ（Adaptive TRansform Acoustic cording ）などの変換符号化方式にも本発明を適用することが可能である。 For example, the present invention can also be applied to transform coding systems such as MDCT coding system, subband coding system, AC-3 coding system, or ATRAC (Adaptive TRansform Acoustic Cording).

要素的特徴を示すブロック図である。It is a block diagram which shows an element characteristic. 図１に示した特徴を実現する具体的なオーディオ復号装置の構成例を示すブロック図である。FIG. 2 is a block diagram illustrating a specific configuration example of an audio decoding device that realizes the characteristics illustrated in FIG. 1. 本発明を適用したオーディオ復号装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the audio decoding apparatus to which this invention is applied. 図２の実施形態において２倍のオーバーサンプリングを実行した場合に得られるアナログオーディオ信号の波形の例を示す図である。It is a figure which shows the example of the waveform of the analog audio signal obtained when 2 times oversampling is performed in embodiment of FIG. 図２の実施形態において８倍のオーバーサンプリングを実行した場合に得られるアナログオーディオ信号の波形の例を示す図である。It is a figure which shows the example of the waveform of the analog audio signal obtained when 8 times oversampling is performed in embodiment of FIG. 従来のオーディオ復号装置の構成を示すブロック図である。It is a block diagram which shows the structure of the conventional audio decoding apparatus. 従来の問題を解決するために１ビットＤＡＣシステムを用いた場合の構成を示すブロック図である。It is a block diagram which shows the structure at the time of using a 1 bit DAC system in order to solve the conventional problem. 図７に示したオーディオ復号装置のハードウェアイメージを示すブロック図である。It is a block diagram which shows the hardware image of the audio decoding apparatus shown in FIG. 図６のオーディオ復号装置で復号化処理を行った場合に得られるアナログオーディオ信号の波形の例を示す図である。It is a figure which shows the example of the waveform of the analog audio signal obtained when a decoding process is performed with the audio decoding apparatus of FIG. 符号化前の元の音声信号の波形の例を示す図である。It is a figure which shows the example of the waveform of the original audio | voice signal before an encoding.

Explanation of symbols

（１）周波数／時間変換手段
（２）補間データ生成手段
（３）マルチプレクス手段
１ＭＰＥＧオーディオデコーダ
２アンパック回路
３周波数／時間変換回路
４Ｖバッファ
５フィルタ回路
６ＤＡＣ
１１乗加算器
１２係数ＲＯＭ／ＲＡＭ
１３メモリ
１４ＰＣＭデータ出力部
１５ＤＡＣ (1) Frequency / time conversion means (2) Interpolation data generation means (3) Multiplex means 1 MPEG audio decoder 2 Unpack circuit 3 Frequency / time conversion circuit 4 V buffer 5 Filter circuit 6 DAC
11 Multiplier / adder 12 Coefficient ROM / RAM
13 Memory 14 PCM Data Output Unit 15 DAC

Claims

Audio data encoded in the frequency domain using time / frequency conversion is converted from frequency domain information to time domain information and decoded to generate interpolated data using the decoded audio data, and the decoding An audio decoding device configured to convert digital audio data including audio data and the interpolation data into an analog audio signal,
A multiplier for performing the time / frequency conversion process and the filter process for obtaining the decoded audio data, and the filter process for generating the interpolation data using the decoded audio data ;
A work memory used when performing the time / frequency conversion process and the filter process, and a memory functioning as a FIFO memory for storing the decoded audio data ;
Further, the time / frequency conversion process is performed by setting a processing rate finer than the processing rate in the case of generating basic time axis information in accordance with the encoding standard. And an audio decoding device characterized in that the time axis information for interpolating the basic time axis information is generated simultaneously.