JP2000151414A

JP2000151414A - Digital audio encoding device/method and recording medium recording encoding program

Info

Publication number: JP2000151414A
Application number: JP10322644A
Authority: JP
Inventors: Tatsuya Murata; 達也村田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1998-11-12
Filing date: 1998-11-12
Publication date: 2000-05-30

Abstract

PROBLEM TO BE SOLVED: To provide an encoding device whose constitution can be simplified and whose encoding processing of a digital audio signal can be executed at high speed by providing a sub-band division part setting all audio data of a sub-band pertaining to the specified and above frequency area of a frequency which is N times larger than that of the frequency after up-sampling. SOLUTION: An up-sampling part 11 generates PCM data sampled by a sampling frequency which is N times larger than a prescribed sampling frequency from input PCM data sampled by the prescribed sampling frequency based on up-sampling information N given from outside. A sub-band division part 12 sets all the samplings of a band pertaining to a frequency area which is 1/(2×N) of/above the up-sampled sampling frequency to zero. All samples pertaining to sub-band data are set to zero and are outputted to a bit allocation part 14.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、パソコンや放送な
どのマルチメディア技術分野で用いられているオーディ
オ信号を圧縮符号化して、ディジタルオーディオ符号化
信号を生成するディジタルオーディオ符号化装置、同符
号化方法、及び同符号化プログラムを記録した記録媒体
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a digital audio encoding apparatus which compresses and encodes an audio signal used in the field of multimedia technology such as personal computers and broadcasting to generate a digital audio encoded signal. The present invention relates to a method and a recording medium on which the encoding program is recorded.

【０００２】[0002]

【従来の技術】近年、ディジタル技術の進歩に伴って、
パーソナルコンピュータ（以下、”パソコン”と略称す
る）の性能が著しく向上している。このため、パソコン
においては、従来では困難であったＭＰＥＧ規格（ISO/
IEC 11172-3,13818-3）に基づいたビットストリームの
再生が可能となり、更にＭＰＥＧ規格に基づく符号化も
実施できるようになってきた。上記ＭＰＥＧ規格に代表
される高能率圧縮技術では、オーディオ信号の原信号を
帯域分割した後に、各帯域信号を符号化したサブバンド
符号化方式のディジタルオーディオ信号が用いられてい
る。さらに、ＭＰＥＧオーディオ規格では、ディジタル
オーディオ信号のサンプリング周波数として、３２ＫＨ
ｚ，４４.１ＫＨｚ，４８ＫＨｚを規定している。一
方、パソコンや放送などのマルチメディア技術分野で
は、２２.０５ＫＨｚや１１.０２５ＫＨｚのサンプリン
グ周波数でサンプリングされたＰＣＭデータをディジタ
ルオーディオ信号として一般的に使用してきた。それゆ
え、パソコンにおいて、上述のＭＰＥＧオーディオ規格
に基づくデータ処理を実行するためには、ディジタルオ
ーディオ符号化装置を用いて上記ＰＣＭデータを４４.
１ＫＨｚにアップサンプリングしたデータに変換してか
ら符号化する必要がある。2. Description of the Related Art In recent years, with the progress of digital technology,
The performance of personal computers (hereinafter abbreviated as "PCs") has been significantly improved. For this reason, in the personal computer, the MPEG standard (ISO / ISO
It has become possible to reproduce a bit stream based on IEC 11172-3, 13818-3), and to carry out encoding based on the MPEG standard. In the high-efficiency compression technology represented by the MPEG standard, a digital audio signal of a sub-band encoding method in which an original signal of an audio signal is divided into bands and each band signal is encoded. Further, according to the MPEG audio standard, the sampling frequency of a digital audio signal is 32 KH.
z, 44.1 KHz and 48 KHz are specified. On the other hand, in the field of multimedia technologies such as personal computers and broadcasting, PCM data sampled at a sampling frequency of 22.05 KHz or 11.025 KHz has been generally used as a digital audio signal. Therefore, in order to execute data processing based on the above-mentioned MPEG audio standard in a personal computer, the PCM data is converted to 44.
It is necessary to convert the data to data up-sampled to 1 KHz before encoding.

【０００３】以下、従来のディジタルオーディオ符号化
装置について、図３を用いて具体的に説明する。尚、以
下の説明では、２２.０５ＫＨｚでサンプリングされた
１チャンネル（モノラル）のＰＣＭデータを４４.１Ｋ
Ｈｚにアップサンプリングして、ＭＰＥＧ１オーディオ
規格に基づいたディジタルオーディオ信号を生成する場
合について説明する。図３は、従来のディジタルオーデ
ィオ符号化装置の主要部の構成を示すブロック図であ
る。図３において、この従来のディジタルオーディオ符
号化装置は、ＭＰＥＧ１オーディオ規格を用いたＭＰＥ
Ｇオーディオ符号化部３０と、その前段に設けられ、オ
ーディオ信号の原信号から入力ＰＣＭデータを生成する
入力データ生成部（図示せず）に分けられる。さらに、
従来のディジタルオーディオ符号化装置には、入力デー
タ生成部からの入力ＰＣＭデータをアップサンプリング
するアップサンプリング部３１、及び前記アップサンプ
リング部３１からのＰＣＭデータにフィルタ処理を施す
フィルタ処理部３２が設けられている。この従来のディ
ジタルオーディオ符号化装置では、入力データ生成部が
元のＰＣＭデータのサンプリング周波数に対して何倍に
アップサンプリングするかを設定して、アップサンプリ
ング情報Ｎ（Ｎは２の階乗）としてアップサンプリング
部３１とフィルタ処理部３２に出力する。この説明で
は、２２.０５ＫＨｚから４４.１ＫＨｚにアップサンプ
リングするので、アップサンプリング情報Ｎは２とな
る。Hereinafter, a conventional digital audio encoding device will be described in detail with reference to FIG. In the following description, one channel (monaural) PCM data sampled at 22.05 KHz is converted to 44.1K.
A case where a digital audio signal based on the MPEG1 audio standard is generated by upsampling to Hz will be described. FIG. 3 is a block diagram showing a configuration of a main part of a conventional digital audio encoding device. Referring to FIG. 3, this conventional digital audio encoding apparatus is an MPE using the MPEG1 audio standard.
It is divided into a G audio encoding unit 30 and an input data generation unit (not shown) which is provided at the preceding stage and generates input PCM data from an original signal of the audio signal. further,
The conventional digital audio encoding device is provided with an up-sampling unit 31 for up-sampling input PCM data from an input data generating unit, and a filter processing unit 32 for filtering PCM data from the up-sampling unit 31. ing. In this conventional digital audio encoding apparatus, the input data generation unit sets how many times the sampling frequency of the original PCM data is up-sampled, and sets up-sampling information N (N is a factor of 2). Output to the upsampling unit 31 and the filter processing unit 32. In this description, up-sampling is performed from 22.05 KHz to 44.1 KHz.

【０００４】アップサンプリング部３１は、入力データ
生成部から与えられるアップサンプリング情報Ｎ（＝
２）を用いて、所定のサンプリング周波数（２２.０５
ＫＨｚ）でサンプリングされた入力ＰＣＭデータからそ
のＮ倍のサンプリング周波数（４４.１ＫＨｚ）でサン
プリングされたＰＣＭデータを生成する。具体的には、
アップサンプリング部３１は、隣り合う入力ＰＣＭデー
タの平均値により補間して、アップサンプリング後のＰ
ＣＭデータをフィルタ処理部３２に出力する。フィルタ
処理部３２は、アップサンプリング部３１でアップサン
プリングされたＰＣＭデータに対して折り返しノイズを
除去するためにフィルタ処理を行い、ＭＰＥＧオーディ
オ符号化部３０に出力する。[0004] The upsampling section 31 provides upsampling information N (=) provided from an input data generating section.
2), a predetermined sampling frequency (22.05
KM), and generates PCM data sampled at N times the sampling frequency (44.1 KHz) from the input PCM data sampled at KHz. In particular,
The up-sampling unit 31 interpolates the average value of the adjacent input PCM data to obtain the Ps after the up-sampling.
The CM data is output to the filter processing unit 32. The filter processing unit 32 performs a filtering process on the PCM data up-sampled by the up-sampling unit 31 to remove aliasing noise, and outputs the result to the MPEG audio encoding unit 30.

【０００５】ここで、図４を参照して、従来のディジタ
ルオーディオ符号化装置におけるアップサンプリング部
３１での補間処理について、詳細に説明する。図４は、
図３に示したアップサンプリング部での補間処理を示す
説明図である。まずＭＰＥＧ１規格に基づいてオーディ
オ信号を符号化するためには、１チャンネル（モノラ
ル）のオーディオ信号では１１５２個のＰＣＭデータが
必要であり、２チャンネル（ステレオ）のオーディオ信
号ではその２倍の２３０４個のＰＣＭデータが必要であ
り、それぞれ１フレーム分のＰＣＭデータと呼ぶことに
する。この説明では１チャンネルの場合を説明している
ので、アップサンプリング部３１は１１５２個のＰＣＭ
データ毎にアップサンプリング後のＰＣＭデータをフィ
ルタ処理部３２を経てＭＰＥＧオーディオ符号化部３０
に出力する。従って、アップサンプリング部３１では、
図４に示すように、元の入力ＰＣＭデータにおいて隣り
合う２つの入力ＰＣＭデータの平均値で補間したアップ
サンプリング後のＰＣＭデータを生成する。ここでは、
アップサンプリング部３１は２倍のアップサンプリング
を行うので、同図に示すように、１フレーム分のＰＣＭ
データと次のフレームの先頭のＰＣＭデータの合計５７
７個のＰＣＭデータから１１５２個のＰＣＭデータを生
成している。このように、アップサンプリング部３１に
よって符号化に必要なＰＣＭデータを生成されるが、そ
のＰＣＭデータには、後段のＭＰＥＧオーディオ符号化
部３０で周波数分割する際に生じる折り返しノイズを含
んでいる。このため、上述のフィルタ処理部３２（図
３）は、帯域制限を行って折り返しノイズを除去する。
ＭＰＥＧオーディオ符号化部３０（図３）は、上記帯域
制限された１１５２個のＰＣＭデータを４４.１ＫＨｚ
でサンプリングしたデータとして符号化する。Here, with reference to FIG. 4, the interpolation processing in the up-sampling section 31 in the conventional digital audio encoding device will be described in detail. FIG.
FIG. 4 is an explanatory diagram illustrating an interpolation process in the upsampling unit illustrated in FIG. 3. First, in order to encode an audio signal based on the MPEG1 standard, 1 channel (monaural) audio signal requires 1152 pieces of PCM data, and 2 channel (stereo) audio signal is twice as large as 2304 pieces of data. PCM data is required, and each of them is referred to as one frame of PCM data. In this description, the case of one channel is described, so that the up-sampling unit 31 has 1152 PCMs.
The up-sampled PCM data for each data is passed through a filter processor 32 to an MPEG audio encoder 30.
Output to Therefore, in the up-sampling unit 31,
As shown in FIG. 4, upsampled PCM data interpolated by an average value of two adjacent input PCM data in the original input PCM data is generated. here,
Since the up-sampling unit 31 performs double up-sampling, as shown in FIG.
Total 57 of data and PCM data at the head of next frame
1152 pieces of PCM data are generated from the seven pieces of PCM data. As described above, the PCM data necessary for encoding is generated by the up-sampling unit 31, and the PCM data includes aliasing noise generated when the subsequent MPEG audio encoding unit 30 performs frequency division. Therefore, the above-described filter processing unit 32 (FIG. 3) performs band limitation to remove aliasing noise.
The MPEG audio encoder 30 (FIG. 3) converts the band-limited 1152 pieces of PCM data to 44.1 kHz.
Is encoded as data sampled by.

【０００６】図３に戻って、ＭＰＥＧオーディオ符号化
部３０を含んだ従来のディジタルオーディオ符号化装置
には、例えば特開平９−１３４２００号公報に記載され
たものが知られている。まずＭＰＥＧオーディオ符号化
部３０内で利用されている人間の聴覚心理モデルについ
て説明する。つまり、ＭＰＥＧ規格によるオーディオ信
号の圧縮符号化においては、オーディオ信号を受け取る
人間の感覚の性質を利用して、感度の低い細部の情報を
省略し符号（データ）量を低減（圧縮）する知覚符号化
方式を採用している。詳細には、聴覚特性のうちマスキ
ング現象と臨界帯域とを利用して聞き取れないオーディ
オ信号は取り除き、必要なオーディオ信号のみを符号化
してビットを割当てることにより、元のオーディオ信号
より少量のビットで符号化しても、その原音と殆ど同じ
水準の音質を得ることができる。ここで、マスキング現
象とは、オーディオ信号間の干渉により一つのオーディ
オ信号が他のオーディオ信号によりマスキングされて、
全く聞き取れない現象である。また、臨界帯域とは、人
間が音の周波数を区分する一種の単位であって、一般に
２４個の帯域に分けられる。これらの帯域のうち高周波
数信号であるほど、その帯域の幅はログ・スケール（対
数単位）で大きくなる。したがって、臨界帯域では、低
周波数信号よりは高周波数信号を区分しにくいものであ
る。ＭＰＥＧオーディオ符号化部３０では上述の聴覚特
性を用いてビットを割り当てを行うため、信号対雑音比
（ＳＮＲ）と信号対マスク・レベル比（ＳＭＲ）を求
め、これらの値から更にマスク・レベル対雑音比（ＭＮ
Ｒ）を計算する。ここで、マスク・レベルとは、聞き取
れない最小の信号レベルである。したがって、このマス
ク・レベル以下の信号にはビットを割り当てを行う必要
はない。Returning to FIG. 3, a conventional digital audio encoding device including an MPEG audio encoding unit 30 is known, for example, from Japanese Patent Application Laid-Open No. 9-134200. First, a human psychoacoustic model used in the MPEG audio encoding unit 30 will be described. In other words, in the compression encoding of an audio signal according to the MPEG standard, a perceptual code that reduces (compresses) the amount of code (data) by omitting information with low sensitivity and using details of the human sense of receiving the audio signal. Is adopted. More specifically, audio signals that cannot be heard are removed by using the masking phenomenon and the critical band in the auditory characteristics, and only necessary audio signals are encoded and bits are allocated, so that the bits can be encoded with fewer bits than the original audio signal. Even if the sound quality is changed, the same sound quality as the original sound can be obtained. Here, the masking phenomenon means that one audio signal is masked by another audio signal due to interference between audio signals,
It is a phenomenon that cannot be heard at all. Further, the critical band is a kind of unit for dividing the frequency of a sound by a human, and is generally divided into 24 bands. The higher the frequency of these bands, the greater the width of the band on a log scale (logarithmic unit). Therefore, in the critical band, it is more difficult to distinguish high frequency signals than low frequency signals. Since the MPEG audio encoding unit 30 allocates bits using the above-described auditory characteristics, a signal-to-noise ratio (SNR) and a signal-to-mask level ratio (SMR) are obtained. Noise ratio (MN
R) is calculated. Here, the mask level is the minimum signal level that cannot be heard. Therefore, there is no need to assign bits to signals below this mask level.

【０００７】ＭＰＥＧオーディオ符号化部３０は、フィ
ルタ処理部３２からの帯域制限されたＰＣＭデータに上
記知覚符号化を施すために、図３に示すサブバンド分割
部３３、ビット割り当て情報生成部３４、ビット割り当
て部３５、及び多重化部３６により構成されている。サ
ブバンド分割部３３は、フィルタ処理部３２からの時間
領域のＰＣＭデータを３２個の均等なサブバンドに分け
た周波数領域のサブバンドデータに変換する。サブバン
ドデータには、各サブバンドにおいて、１２個（レイヤ
Ｉの場合）、又は３６個（レイヤＩＩの場合）のサンプ
ルを含んでいる。さらに、サブバンド分割部３３は、再
生時での音圧レベルを示すスケール・ファクタを入力し
たＰＣＭデータから取得して符号化する。このスケール
・ファクタの個数は全部で６４個なので、この情報の符
号化に必要なビット数は６ビットである。さらに、この
情報の符号化方法は各レイヤによって異なり、レイヤＩ
では、各帯域（サブバンド）に存在する１２個のサンプ
ルのうち最大値を求めて、求めた最大値と一致するか、
やや大きい値をその帯域のスケール・ファクタとする。
一方、レイヤＩＩでは、各帯域に三つのスケール・ファ
クタが存在するため、各スケール・ファクタの類似性を
検討して三つのスケール・ファクタのうち、何個を符号
化するかを決める。すなわち、符号化するスケール・フ
ァクタの数は隣接するスケール・ファクタとの差値の範
囲に応じて異なる。したがって、レイヤＩＩでは、レイ
ヤＩとは異なり、スケール・ファクタを選択するための
付加情報を必要とするが、スケール・ファクタは２ビッ
トで符号化される。[0007] In order to perform the above-described perceptual encoding on the band-limited PCM data from the filter processing unit 32, the MPEG audio encoding unit 30 includes a sub-band division unit 33, a bit allocation information generation unit 34, It comprises a bit allocating unit 35 and a multiplexing unit 36. The sub-band division unit 33 converts the PCM data in the time domain from the filter processing unit 32 into sub-band data in the frequency domain divided into 32 equal sub-bands. The subband data includes 12 (in the case of layer I) or 36 (in the case of layer II) samples in each subband. Further, the sub-band dividing unit 33 obtains a scale factor indicating a sound pressure level at the time of reproduction from the input PCM data and encodes the scale factor. Since the number of scale factors is 64 in total, the number of bits required for encoding this information is 6 bits. Furthermore, the encoding method of this information differs for each layer,
Then, the maximum value of the twelve samples existing in each band (sub-band) is obtained, and it is determined whether the maximum value matches the obtained maximum value.
A slightly larger value is used as the scale factor of the band.
On the other hand, in Layer II, since there are three scale factors in each band, the similarity of each scale factor is examined to determine how many of the three scale factors are to be coded. That is, the number of scale factors to be encoded differs depending on the range of the difference value between adjacent scale factors. Thus, unlike Layer I, Layer II requires additional information to select the scale factor, but the scale factor is encoded with two bits.

【０００８】ビット割り当て情報生成部３４は、人間の
聴覚特性を利用して、フィルタ処理部３２から入力した
ＰＣＭデータから上述の信号対マスク・レベル比（ＳＭ
Ｒ）を生成する。ビット割り当て情報生成部３４の最終
出力値は、各帯域毎にビット割り当ての基準であるビッ
ト割り当て情報（ＳＭＲ値）としてビット割り当て部３
５に出力される。詳細にいえば、このＳＭＲ値は、下記
の一連の九つの段階によって計算される。第１段階では
高速フーリエ変換（ＦＦＴ）により時間領域のオーディ
オ信号（ＰＣＭデータ）を周波数領域に変換し、第２段
階では各帯域の音圧レベルを計算する。第３段階では絶
対マスキング・スレショルド（しきい）値を計算し、第
４段階ではオーディオ信号の有声音と無声音成分を決定
する。第５段階では有声音のうちマスクする音であるマ
スカー（Masker）を決定し、第６段階では各帯域のマス
キング・スレショルド値を計算する。第７段階では全て
の帯域のマスキング・スレショルド値を計算し、第８段
階では各帯域の最小マスキング・スレショルド値を計算
する。そして、第９段階では各帯域のＳＭＲ値を計算す
る。[0008] The bit allocation information generation unit 34 utilizes the human auditory characteristics to convert the PCM data input from the filter processing unit 32 into the aforementioned signal-to-mask level ratio (SM).
R). The final output value of the bit allocation information generator 34 is used as bit allocation information (SMR value) as a bit allocation reference for each band.
5 is output. Specifically, this SMR value is calculated by a series of nine steps as follows. In the first stage, the audio signal (PCM data) in the time domain is converted into the frequency domain by the fast Fourier transform (FFT), and in the second stage, the sound pressure level in each band is calculated. In a third step, an absolute masking threshold value is calculated, and in a fourth step, voiced and unvoiced sound components of the audio signal are determined. In the fifth step, a masker which is a sound to be masked among voiced sounds is determined, and in the sixth step, a masking threshold value of each band is calculated. In the seventh step, masking threshold values of all bands are calculated, and in the eighth step, the minimum masking threshold value of each band is calculated. In the ninth stage, the SMR value of each band is calculated.

【０００９】ビット割り当て部３５は、サブバンド分割
部３３からの各帯域のサブバンドデータにビット割り当
て処理と量子化処理を施して多重化部３６に出力する。
ビット割り当て処理では、ビット割り当て情報生成部３
４からのＳＭＲ値に基づいて、下記の一連の四つの段階
を繰り返して行い、各帯域の割り当てビット数を求め
る。第１段階では初期の割り当てビットを０とし、第２
段階では各帯域について上記マスク・レベル対雑音比で
あるＭＮＲ値を求める。この際、ＭＮＲ値は、各帯域毎
に信号対雑音比であるＳＮＲ値からＳＭＲ値を減算する
ことにより算出される。第３段階では各帯域に求められ
たＭＮＲ値のうち最小のＭＮＲ値をもつ帯域を探し出し
て、その帯域の割り当てビット数を例えば１つ増加し、
第４段階では増加した割り当てビット数が求められる割
り当てビット数を超えないとき、残り他の帯域について
第２及び第３段階を繰り返して行う。また、量子化処理
は、上記ビット割り当て処理に続いて、下記の一連の四
つの段階を繰り返して行われる。第１段階では、各帯域
内のサンプルをスケール・ファクタで割ってＸとする。
第２段階では、”Ａ×Ｘ＋Ｂ”（Ａ，Ｂは所定のテーブ
ル値）を計算する。第３段階では第２段階で計算された
値のうちビット割り当て処理から得られた割り当てビッ
ト数を求め、第４段階では求めた割り当てビット数の最
上位ビット（ＭＳＢ）を反転させて量子化データとして
出力する。このようにして得られた量子化データに対し
て多重化部３６は、ＭＰＥＧ規格に規定されたヘッダ部
等を付加して、ＭＰＥＧオーディオビットストリームと
して形成して出力する。[0009] The bit allocation unit 35 performs bit allocation processing and quantization processing on the sub-band data of each band from the sub-band division unit 33, and outputs the result to the multiplexing unit 36.
In the bit allocation process, the bit allocation information generation unit 3
Based on the SMR value from No. 4, the following series of four steps are repeated to determine the number of bits allocated to each band. In the first stage, the initial allocation bits are set to 0, and the second
In the step, an MNR value, which is the above-described mask level-to-noise ratio, is obtained for each band. At this time, the MNR value is calculated by subtracting the SMR value from the SNR value that is the signal-to-noise ratio for each band. In the third stage, a band having the smallest MNR value is searched for among the MNR values obtained for each band, and the number of bits allocated to the band is increased by, for example, one.
In the fourth step, when the increased number of allocated bits does not exceed the required number of allocated bits, the second and third steps are repeated for the remaining other bands. Further, the quantization process is performed by repeating the following four steps following the bit allocation process. In the first stage, the samples in each band are divided by the scale factor to give X.
In the second stage, “A × X + B” (A and B are predetermined table values) is calculated. In the third step, the number of allocated bits obtained from the bit allocation processing among the values calculated in the second step is obtained. In the fourth step, the most significant bit (MSB) of the obtained allocated bit number is inverted to obtain the quantized data. Output as The multiplexing unit 36 adds a header part and the like defined in the MPEG standard to the quantized data thus obtained, and forms and outputs the MPEG audio bit stream.

【００１０】以上のように、従来のディジタルオーディ
オ符号化装置では、アップサンプリングした後のＰＣＭ
データをフィルタ処理部３２で帯域制限して、ＭＰＥＧ
オーディオ符号化部３０を用いてＭＰＥＧオーディオビ
ットストリームを形成していた。As described above, in the conventional digital audio encoding device, the upsampled PCM
The data is band-limited by the filter processing unit 32 and MPEG
The MPEG audio bit stream was formed using the audio encoding unit 30.

【００１１】[0011]

【発明が解決しようとする課題】上記のような従来のデ
ィジタルオーディオ符号化装置では、折り返しノイズを
除去するために、ＭＰＥＧオーディオ符号化部の前段に
フィルタ処理部を接続して、アップサンプリング後のＰ
ＣＭデータの帯域制限（フィルタ処理）を行う必要があ
った。このため、従来のディジタルオーディオ符号化装
置では、当該装置の構成を簡略化することができないと
いう問題点を生じた。さらに、この従来のディジタルオ
ーディオ符号化装置では、ＭＰＥＧオーディオ符号化部
は周波数成分を含んでいないサブバンドデータについて
も、ビット割り当て処理や量子化処理を行っていた。そ
の結果、従来のディジタルオーディオ符号化装置では、
オーディオ信号の符号化処理の処理効率を向上すること
が困難であり、その符号化処理に多大な時間を要した。In the conventional digital audio encoding apparatus as described above, in order to remove aliasing noise, a filter processing section is connected in front of the MPEG audio encoding section, and after upsampling, P
It was necessary to limit the band of CM data (filter processing). For this reason, the conventional digital audio encoding device has a problem that the configuration of the device cannot be simplified. Further, in this conventional digital audio encoding device, the MPEG audio encoding unit performs bit allocation processing and quantization processing on subband data that does not include a frequency component. As a result, in the conventional digital audio encoding device,
It was difficult to improve the processing efficiency of the audio signal encoding process, and the encoding process required a great deal of time.

【００１２】さらに、従来のディジタルオーディオ符号
化装置では、アップサンプリング部でのアップサンプリ
ングの処理が複雑で、時間を要するという問題点があっ
た。詳細にいえば、従来のディジタルオーディオ符号化
装置では、アップサンプリング部は、１フレーム分のＰ
ＣＭデータを生成するとき、１フレーム分の入力ＰＣＭ
データだけでなく、次のフレームの先頭の入力ＰＣＭデ
ータを用いていた。具体的には、図４に示したように、
１１５２番目の最後のＰＣＭデータは、５７６番目の入
力ＰＣＭデータと次のフレームの１番目の入力ＰＣＭデ
ータとの平均値により生成していた。このため、このア
ップサンプリング部では、処理対象のフレームだけでな
く、次のフレームの先頭の入力ＰＣＭデータを先読み処
理を行って取得する必要があった。さらに、このアップ
サンプリング部では、上述の先読み処理だけでなく、ア
ップサンプリングの処理終了後に先読みした先頭の入力
ＰＣＭデータを元のフレームに移動する処理や読み出し
位置情報を先読み処理した分だけ戻すなどの処理を別に
行う必要があった。Further, the conventional digital audio encoding apparatus has a problem that the up-sampling process in the up-sampling section is complicated and takes time. More specifically, in the conventional digital audio encoding device, the up-sampling unit uses one frame of P
When generating CM data, input PCM for one frame
In addition to the data, the input PCM data at the head of the next frame is used. Specifically, as shown in FIG.
The 1152th last PCM data was generated by the average value of the 576th input PCM data and the 1st input PCM data of the next frame. For this reason, in this upsampling unit, it is necessary to acquire not only the processing target frame but also the leading input PCM data of the next frame by performing a pre-reading process. Further, this up-sampling unit performs not only the pre-reading process described above, but also a process of moving the first input PCM data pre-read after the end of the up-sampling process to the original frame, returning the read position information by the pre-reading process, and the like. Processing had to be performed separately.

【００１３】この発明は、上記のような問題点を解決す
るためになされたものであり、当該装置の構成を簡略化
することができ、ディジタルオーディオ信号の符号化処
理を軽減し高速に行うディジタルオーディオ符号化装
置、同符号化方法、及び同符号化プログラムを記録した
記録媒体を提供することを目的とする。SUMMARY OF THE INVENTION The present invention has been made to solve the above-mentioned problems, and can simplify the structure of the apparatus, reduce the encoding processing of digital audio signals, and perform high-speed digital processing. An object of the present invention is to provide an audio encoding device, an encoding method, and a recording medium on which the encoding program is recorded.

【００１４】[0014]

【課題を解決するための手段】本発明のディジタルオー
ディオ符号化装置は、アップサンプリング情報Ｎ（Ｎは
２の階乗）に基づいて、所定のサンプリング周波数でサ
ンプリングされた入力ディジタルオーディオデータをア
ップサンプリングして、Ｎ倍のサンプリング周波数でサ
ンプリングされたオーディオデータを生成するアップサ
ンプリング部、及び前記アップサンプリング部からのオ
ーディオデータを所定のサブバンドに分割する帯域分割
処理を施し、さらにアップサンプリング後のＮ倍のサン
プリング周波数の１／（２×Ｎ）以上の周波数領域に属
するサブバンドの全てのオーディオデータをゼロの値に
設定するサブバンド分割部を備えている。このように構
成することにより、当該装置の構成を簡略化することが
でき、ディジタルオーディオ信号の符号化処理を高速に
行うことができる。According to the present invention, there is provided a digital audio encoding apparatus for up-sampling input digital audio data sampled at a predetermined sampling frequency based on up-sampling information N (N is a factor of 2). Then, an up-sampling unit that generates audio data sampled at N times the sampling frequency and a band division process of dividing the audio data from the up-sampling unit into predetermined subbands are performed. A sub-band division unit is provided for setting all audio data of sub-bands belonging to a frequency region equal to or more than 1 / (2 × N) of the double sampling frequency to a value of zero. With this configuration, the configuration of the device can be simplified, and the encoding process of the digital audio signal can be performed at high speed.

【００１５】別の観点による発明のディジタルオーディ
オ符号化装置は、前記入力ディジタルオーディオデータ
の１フレーム分のデータ数をＭ個（Ｍは自然数）とした
とき、前記アップサンプリング部が、１番目からＭ番目
の前記入力ディジタルオーディオデータを用いて、１番
目から（Ｍ×Ｎ−Ｎ）番目までの前記オーディオデータ
を生成し、Ｍ番目の前記入力ディジタルオーディオデー
タを（Ｍ×Ｎ−Ｎ＋１）番目から（Ｍ×Ｎ）番目の前記
オーディオデータとするよう構成している。このように
構成することにより、アップサンプリング部での処理を
軽減することができ、ディジタルオーディオ信号の符号
化処理を高速に行うことができる。In a digital audio encoding apparatus according to another aspect of the present invention, when the number of data for one frame of the input digital audio data is M (M is a natural number), the up-sampling unit sets M data from the first to M. The first to (M × N−N) th audio data are generated by using the first input digital audio data, and the Mth input digital audio data is converted from the (M × N−N + 1) th to (M × N−N + 1) th. (M × N) th audio data. With this configuration, it is possible to reduce the processing in the upsampling unit, and to perform the encoding processing of the digital audio signal at high speed.

【００１６】本発明のディジタルオーディオ符号化装置
の符号化方法は、アップサンプリング情報Ｎ（Ｎは２の
階乗）に基づいて、所定のサンプリング周波数でサンプ
リングされた入力ディジタルオーディオデータをアップ
サンプリングして、Ｎ倍のサンプリング周波数でサンプ
リングされたオーディオデータを生成するアップサンプ
リングステップ、及び前記アップサンプリングステップ
で生成したオーディオデータを所定のサブバンドに分割
する帯域分割処理を施し、さらにアップサンプリング後
のＮ倍のサンプリング周波数の１／（２×Ｎ）以上の周
波数領域に属するサブバンドの全てのオーディオデータ
をゼロの値に設定するサブバンド分割ステップを備えて
いる。このように構成することにより、当該装置の構成
を簡略化することができ、ディジタルオーディオ信号の
符号化処理を高速に行うことができる。In the encoding method of the digital audio encoding apparatus according to the present invention, input digital audio data sampled at a predetermined sampling frequency is up-sampled based on up-sampling information N (N is a factor of 2). , An upsampling step of generating audio data sampled at N times the sampling frequency, and a band division process of dividing the audio data generated in the upsampling step into predetermined subbands. Sub-band division step of setting all audio data of sub-bands belonging to a frequency region equal to or more than 1 / (2 × N) of the sampling frequency of the sub-band to zero. With this configuration, the configuration of the device can be simplified, and the encoding process of the digital audio signal can be performed at high speed.

【００１７】別の観点による発明のディジタルオーディ
オ符号化装置の符号化方法は、前記入力ディジタルオー
ディオデータの１フレーム分のデータ数をＭ個（Ｍは自
然数）としたとき、前記アップサンプリングステップに
おいて、１番目からＭ番目の前記入力ディジタルオーデ
ィオデータを用いて、１番目から（Ｍ×Ｎ−Ｎ）番目ま
での前記オーディオデータを生成し、Ｍ番目の前記入力
ディジタルオーディオデータを（Ｍ×Ｎ−Ｎ＋１）番目
から（Ｍ×Ｎ）番目の前記オーディオデータとしてい
る。このように構成することにより、アップサンプリン
グ部での処理を軽減することができ、ディジタルオーデ
ィオ信号の符号化処理を高速に行うことができる。According to another aspect of the present invention, there is provided an encoding method for a digital audio encoding apparatus, wherein when the number of data of one frame of the input digital audio data is M (M is a natural number), The first to Mth input digital audio data are used to generate first to (M × N−N) th audio data, and the Mth input digital audio data is generated as (M × N−N + 1). ) -Th to (M × N) -th audio data. With this configuration, it is possible to reduce the processing in the upsampling unit, and to perform the encoding processing of the digital audio signal at high speed.

【００１８】本発明の符号化プログラムを記録した記録
媒体は、アップサンプリング情報Ｎ（Ｎは２の階乗）に
基づいて、所定のサンプリング周波数でサンプリングさ
れた入力ディジタルオーディオデータをアップサンプリ
ングして、Ｎ倍のサンプリング周波数でサンプリングさ
れたオーディオデータを生成するアップサンプリングス
テップ、及び前記アップサンプリングステップで生成し
たオーディオデータを所定のサブバンドに分割する帯域
分割処理を施し、さらにアップサンプリング後のＮ倍の
サンプリング周波数の１／（２×Ｎ）以上の周波数領域
に属するサブバンドの全てのオーディオデータをゼロの
値に設定するサブバンド分割ステップを備えている。こ
のように構成することにより、当該装置の構成を簡略化
することができ、ディジタルオーディオ信号の符号化処
理を高速に行うことができる。The recording medium on which the encoding program of the present invention has been recorded up-samples input digital audio data sampled at a predetermined sampling frequency based on up-sampling information N (N is a factor of 2). Performing an up-sampling step of generating audio data sampled at an N-times sampling frequency, and performing a band division process of dividing the audio data generated in the up-sampling step into predetermined subbands; The method includes a subband division step of setting all audio data of subbands belonging to a frequency region equal to or more than 1 / (2 × N) of the sampling frequency to a value of zero. With this configuration, the configuration of the device can be simplified, and the encoding process of the digital audio signal can be performed at high speed.

【００１９】別の観点による発明の符号化プログラムを
記録した記録媒体は、前記入力ディジタルオーディオデ
ータの１フレーム分のデータ数をＭ個（Ｍは自然数）と
したとき、前記アップサンプリングステップにおいて、
１番目からＭ番目の前記入力ディジタルオーディオデー
タを用いて、１番目から（Ｍ×Ｎ−Ｎ）番目までの前記
オーディオデータを生成し、Ｍ番目の前記入力ディジタ
ルオーディオデータを（Ｍ×Ｎ−Ｎ＋１）番目から（Ｍ
×Ｎ）番目の前記オーディオデータとしている。このよ
うに構成することにより、アップサンプリング部での処
理を軽減することができ、ディジタルオーディオ信号の
符号化処理を高速に行うことができる。According to another aspect of the present invention, there is provided a recording medium on which an encoding program according to the invention is recorded, wherein the number of data of one frame of the input digital audio data is M (M is a natural number),
The first to Mth input digital audio data are used to generate first to (M × N−N) th audio data, and the Mth input digital audio data is generated as (M × N−N + 1). ) To (M
× N) th audio data. With this configuration, it is possible to reduce the processing in the upsampling unit, and to perform the encoding processing of the digital audio signal at high speed.

【００２０】[0020]

【発明の実施の形態】以下、本発明のディジタルオーデ
ィオ符号化装置、及びその符号化方法を示す好ましい実
施例について、図面を参照しながら説明する。尚、以下
の説明では、従来例との比較を容易なものとするため
に、２２.０５ＫＨｚでサンプリングされた１チャンネ
ル（モノラル）のＰＣＭデータを４４.１ＫＨｚにアッ
プサンプリングして、ＭＰＥＧ１オーディオ規格（ISO/
IEC 11172-3）に基づき符号化するディジタルオーディ
オ符号化装置について例示して説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A preferred embodiment showing a digital audio encoding apparatus and an encoding method according to the present invention will be described below with reference to the drawings. In the following description, in order to facilitate comparison with the conventional example, 1-channel (monaural) PCM data sampled at 22.05 KHz is up-sampled to 44.1 KHz, and the MPEG1 audio standard ( ISO /
A digital audio encoding device that performs encoding based on IEC 11172-3) will be described as an example.

【００２１】《実施例》図１は、本発明の実施例である
ディジタルオーディオ符号化装置の主要部の構成を示す
ブロック図である。図１において、本実施例のディジタ
ルオーディオ符号化装置は、ＭＰＥＧ１オーディオ規格
を用いたＭＰＥＧオーディオ符号化部１０と、その前段
に設けられ、入力ＰＣＭデータをアップサンプリングす
るアップサンプリング部１１を備えている。入力ＰＣＭ
データは、図示しない入力データ生成部により、オーデ
ィオ信号の原信号をパルス符号変調して生成されたデー
タである。ＭＰＥＧオーディオ符号化部１０とアップサ
ンプリング部１１には、アップサンプリング情報Ｎ（Ｎ
は２の階乗）が入力される。このアップサンプリング情
報Ｎは、例えば入力データ生成部によって設定、通知さ
れるものであり、アップサンプリング部１１における、
元のＰＣＭデータのサンプリング周波数に対して何倍に
アップサンプリングするかを指定する情報である。この
アップサンプリング情報ＮをＭＰＥＧオーディオ符号化
部１０にも通知することにより、図３に示した従来例で
のフィルタ処理部を省略して、当該装置の構成を簡略化
することができ、さらに符号化処理を高速に行うことが
できる（詳細は後述）。尚、以下の説明では、アップサ
ンプリング部１１がサンプリング周波数を２２.０５Ｋ
Ｈｚから４４.１ＫＨｚにアップサンプリングするの
で、アップサンプリング情報Ｎは２となる。<< Embodiment >> FIG. 1 is a block diagram showing a configuration of a main part of a digital audio encoding apparatus according to an embodiment of the present invention. In FIG. 1, the digital audio encoding apparatus according to the present embodiment includes an MPEG audio encoding unit 10 using the MPEG1 audio standard, and an up-sampling unit 11 provided at a preceding stage for up-sampling input PCM data. . Input PCM
The data is data generated by an input data generator (not shown) by pulse code modulating the original signal of the audio signal. The up-sampling information N (N
Is the factorial of 2). The up-sampling information N is set and notified by, for example, an input data generation unit.
This information specifies how many times the sampling frequency of the original PCM data is to be upsampled. By notifying the up-sampling information N to the MPEG audio encoding unit 10, the filter processing unit in the conventional example shown in FIG. 3 can be omitted, and the configuration of the device can be simplified. Can be performed at high speed (details will be described later). In the following description, the upsampling unit 11 sets the sampling frequency to 22.05K.
Since the frequency is up-sampled from Hz to 44.1 KHz, the up-sampling information N is 2.

【００２２】アップサンプリング部１１は、外部から与
えられるアップサンプリング情報Ｎ（＝２）に基づい
て、所定のサンプリング周波数（２２.０５ＫＨｚ）で
サンプリングされた入力ＰＣＭデータからそのＮ倍のサ
ンプリング周波数（４４.１ＫＨｚ）でサンプリングさ
れたＰＣＭデータを生成する。このアップサンプリング
部１１は、後に詳述するように、フレーム単位に隣り合
う２つの入力ＰＣＭデータの平均値により補間しアップ
サンプリング後のＰＣＭデータを生成して、ＭＰＥＧオ
ーディオ符号化部１０に出力する。これにより、本実施
例のディジタルオーディオ符号化装置では、アップサン
プリング部１１の構成を簡略化することができ、その処
理負荷を軽減することができる。The up-sampling section 11 converts input PCM data sampled at a predetermined sampling frequency (22.05 KHz) based on up-sampling information N (= 2) supplied from the outside into an N-fold sampling frequency (44). Generate PCM data sampled at .1 KHz). As will be described in detail later, the up-sampling unit 11 generates PCM data after up-sampling by interpolating with an average value of two adjacent input PCM data in frame units, and outputs the PCM data to the MPEG audio encoding unit 10. . Thus, in the digital audio encoding device according to the present embodiment, the configuration of the upsampling unit 11 can be simplified, and the processing load thereof can be reduced.

【００２３】ＭＰＥＧオーディオ符号化部１０は、アッ
プサンプリング部１１に接続されたサブバンド分割部１
２及びビット割り当て情報生成部１３と、それらのサブ
バンド分割部１２及びビット割り当て情報生成部１３に
接続されたビット割り当て部１４と、前記ビット割り当
て部１４に接続された多重化部１５を備えている。サブ
バンド分割部１２は、アップサンプリング部１１から入
力したＰＣＭデータに帯域分割を施しサブバンドデータ
を生成する。詳細には、サブバンド分割部１２は、帯域
分割フィルタを用いて時間領域のＰＣＭデータを３２個
の均等なサブバンドに分ける帯域分割を施して、周波数
領域のサブバンドデータｓｂ０〜ｓｂ３１を生成する。
これらのサブバンドデータｓｂ０〜ｓｂ３１は入力ＰＣ
Ｍデータのサンプリング周波数（２２.５ＫＨｚ）を低
周波数帯域から順次帯域分割したものであり、各サブバ
ンドデータｓｂ０〜ｓｂ３１には１２個（レイヤＩの場
合）、又は３６個（レイヤＩＩの場合）のサンプルを含
んでいる。The MPEG audio encoding unit 10 includes a subband dividing unit 1 connected to the upsampling unit 11.
2 and a bit allocation information generation unit 13, a bit allocation unit 14 connected to the subband division unit 12 and the bit allocation information generation unit 13, and a multiplexing unit 15 connected to the bit allocation unit 14. I have. The sub-band division unit 12 performs sub-band division on the PCM data input from the up-sampling unit 11 to generate sub-band data. Specifically, the sub-band division unit 12 performs band division to divide time-domain PCM data into 32 equal sub-bands using a band division filter, and generates frequency-domain sub-band data sb0 to sb31. .
These subband data sb0 to sb31 are input PC
The sampling frequency (22.5 KHz) of M data is divided into bands in order from the low frequency band, and each subband data sb0 to sb31 has 12 (in the case of layer I) or 36 (in the case of layer II) Includes samples.

【００２４】さらに、サブバンド分割部１２は、上記ア
ップサンプリング情報Ｎ（＝２）を用いて、入力したＰ
ＣＭデータに含まれていない帯域の全てのサンプルをゼ
ロに設定する。言い換えれば、サブバンド分割部１２
は、アップサンプリング後のサンプリング周波数（４
４.１ＫＨｚ）の１／（２×Ｎ）以上の周波数領域、す
なわち１１.０２５ＫＨｚ以上の周波数領域に属する帯
域の全てのサンプルをゼロに設定する。これにより、サ
ブバンドデータｓｂ１６〜ｓｂ３１に属する全てのサン
プルがゼロに設定されて、ビット割り当て部１４に出力
される。さらに、入力ＰＣＭデータのサンプリング周波
数（２２.５ＫＨｚ）の１／２よりも高い周波数成分を
カットせずにデータ処理した場合に生じる折り返しノイ
ズの発生を防止することができ、従来例に示したフィル
タ処理部（図３）を省略することができる。Further, the subband division unit 12 uses the up-sampling information N (= 2) to
All samples in the band not included in the CM data are set to zero. In other words, the sub-band division unit 12
Is the sampling frequency (4
All samples in a band belonging to a frequency region of 1 / (2 × N) or more of (4.1 KHz), that is, a frequency region of 11.25 KHz or more, are set to zero. As a result, all the samples belonging to the sub-band data sb16 to sb31 are set to zero and output to the bit allocation unit 14. Further, it is possible to prevent the generation of aliasing noise that occurs when data processing is performed without cutting a frequency component higher than 1/2 of the sampling frequency (22.5 KHz) of the input PCM data. The processing unit (FIG. 3) can be omitted.

【００２５】ビット割り当て情報生成部１３は、アップ
サンプリング部１１からのＰＣＭデータに基づいて、ビ
ット割り当て情報（ＳＭＲ値）を生成する。詳細には、
ビット割り当て情報生成部１３は、人間の聴覚特性を利
用して、上記ＰＣＭデータからビット割り当て情報を生
成しビット割り当て部１４に出力する。ビット割り当て
部１４は、ビット割り当て情報生成部１３からのビット
割り当て情報に基づいて、サブバンド分割部１２からの
サブバンドデータｓｂ０〜ｓｂ３１にビットを割り当て
るビット割り当て処理を施し、さらにサブバンド（帯
域）毎に量子化処理を行って量子化した量子化データを
多重化部１５に出力する。尚、サブバンドデータｓｂ１
６〜ｓｂ３１はゼロに設定されているので、ビット割り
当て部１４はビット割り当て処理及び量子化処理を省略
することが可能となり、処理負荷を軽減することができ
る。多重化部１５は、ビット割り当て部１４からの量子
化データをＭＰＥＧオーディオビットストリームとして
形成して出力する。The bit allocation information generating section 13 generates bit allocation information (SMR value) based on the PCM data from the upsampling section 11. For details,
The bit allocation information generation unit 13 generates bit allocation information from the PCM data using the human auditory characteristics and outputs the generated bit allocation information to the bit allocation unit 14. The bit allocation unit 14 performs a bit allocation process of allocating bits to the subband data sb0 to sb31 from the subband division unit 12 based on the bit allocation information from the bit allocation information generation unit 13, and further performs a subband (band). The quantized data is quantized by performing a quantization process every time and output to the multiplexing unit 15. Note that the sub-band data sb1
Since 6 to sb31 are set to zero, the bit allocation unit 14 can omit the bit allocation processing and the quantization processing, and can reduce the processing load. The multiplexing unit 15 forms and outputs the quantized data from the bit allocation unit 14 as an MPEG audio bit stream.

【００２６】尚、上述のビット割り当て情報生成部１
３、ビット割り当て部１４、及び多重化部１５は、図３
に示した従来例のビット割り当て情報生成部、ビット割
り当て部、及び多重化部とそれぞれ同等な機能をもつよ
うにＭＰＥＧ１オーディオ規格に基づいて構成されたも
のである。また、上述の説明では、サブバンド分割部１
２は全てのサブバンドデータｓｂ０〜ｓｂ３１をビット
割り当て部１４に出力する構成について説明したが、ゼ
ロを設定したサブバンドデータｓｂ１６〜ｓｂ３１を除
いて、サブバンドデータｓｂ０〜ｓｂ１５だけをビット
割り当て部１４に出力するよう構成してもよい。The above-mentioned bit allocation information generating section 1
3, the bit allocation unit 14 and the multiplexing unit 15
Are configured based on the MPEG1 audio standard so as to have the same functions as those of the bit allocation information generation unit, bit allocation unit, and multiplexing unit of the conventional example shown in FIG. Also, in the above description, the subband division unit 1
2 describes the configuration in which all the sub-band data sb0 to sb31 are output to the bit allocating unit 14. However, except for the sub-band data sb16 to sb31 in which zero is set, only the sub-band data sb0 to sb15 are output to the bit allocating unit 14. May be output.

【００２７】以下、本実施例のディジタルオーディオ符
号化装置の動作について、図１及び図２を用いて説明す
る。図２は、図１に示したアップサンプリング部での補
間処理を示す説明図である。まずＭＰＥＧ１規格に基づ
いてオーディオ信号を符号化するためには、１チャンネ
ル（モノラル）のオーディオ信号では１１５２個のＰＣ
Ｍデータが必要であり、２チャンネル（ステレオ）のオ
ーディオ信号ではその２倍の２３０４個のＰＣＭデータ
が必要であり、それぞれ１フレーム分のＰＣＭデータと
呼ぶことにする。この説明では１チャンネルの場合を説
明しているので、アップサンプリング部１１は１１５２
個のＰＣＭデータ毎にアップサンプリング後のＰＣＭデ
ータをＭＰＥＧオーディオ符号化部１０に出力する。さ
らに、アップサンプリング部１１は、上述したように、
フレーム単位に隣り合う２つの入力ＰＣＭデータの平均
値により補間しアップサンプリング後のＰＣＭデータを
生成している。Hereinafter, the operation of the digital audio encoding device according to the present embodiment will be described with reference to FIGS. FIG. 2 is an explanatory diagram showing an interpolation process in the upsampling unit shown in FIG. First, in order to encode an audio signal based on the MPEG1 standard, 1152 PCs are used for a one-channel (monaural) audio signal.
M data is required. For a two-channel (stereo) audio signal, 2304 pieces of PCM data are required, which is twice as much, and each of them is called one frame of PCM data. In this description, the case of one channel is described, so that the up-sampling unit 11
The PCM data after upsampling is output to the MPEG audio encoding unit 10 for each piece of PCM data. Further, as described above, the up-sampling unit 11
Interpolation is performed using an average value of two input PCM data adjacent to each other in a frame unit to generate up-sampled PCM data.

【００２８】詳細にいえば、元の入力ＰＣＭデータの１
フレーム分の個数をＭ（Ｍは自然数）としたとき、アッ
プサンプリング部１１はアップサンプリング情報Ｎを用
いて、１番目からＭ番目の元の入力ＰＣＭデータによ
り、１番目から（Ｍ×Ｎ−Ｎ）番目までのＰＣＭデータ
を生成している。さらに、アップサンプリング部１１
は、Ｍ番目の最後の入力ＰＣＭデータを（Ｍ×Ｎ−Ｎ＋
１）番目から（Ｍ×Ｎ）番目のＰＣＭデータとして繰り
返し用いている。具体的には、図２に示すように、アッ
プサンプリング部１１は、１番目から５７６番目の元の
入力ＰＣＭデータと、隣り合う２つの入力ＰＣＭデータ
の各平均値を求めて、求めた各平均値を元の隣り合う２
つの入力ＰＣＭデータの間に補間して、１番目から１１
５０番目までのアップサンプリング後のＰＣＭデータと
して生成している。例えばアップサンプリング部１１
は、１番目、及び２番目の元の入力ＰＣＭデータを１番
目、及び３番目のＰＣＭデータとし、それらの平均値を
２番目のＰＣＭデータとしている。さらに、アップサン
プリング部１１は、同図に示すように、１１５１番目、
及び１１５２番目のＰＣＭデータには、５７６番目の最
後の入力ＰＣＭデータを用いている。More specifically, one of the original input PCM data
When the number of frames is M (M is a natural number), the up-sampling unit 11 uses the up-sampling information N and outputs the first to (M × N−N) based on the first to M-th original input PCM data. ) -Th PCM data is generated. Further, the up-sampling unit 11
Converts the M-th last input PCM data to (M × N−N +
It is repeatedly used as (1) -th to (M × N) -th PCM data. Specifically, as shown in FIG. 2, the up-sampling unit 11 calculates the average value of the first to 576th original input PCM data and the average value of two adjacent input PCM data, and Value of the original adjacent two
Interpolation between two input PCM data, 1st to 11th
It is generated as PCM data after up-sampling up to the 50th. For example, the upsampling unit 11
Uses the first and second original input PCM data as the first and third PCM data, and sets the average value thereof as the second PCM data. Further, as shown in FIG.
The 576th last input PCM data is used for the 1152th PCM data.

【００２９】以上のように、本実施例のアップサンプリ
ング部１１は、１フレーム分の入力ＰＣＭデータだけを
用いて、そのフレーム分のＰＣＭデータを生成してい
る。このため、本実施例のアップサンプリング部１１で
は、従来例のものと異なって次のフレームの先頭の入力
ＰＣＭデータが不要となり、そのアップサンプリングの
処理を高速に行うことができる。また、アップサンプリ
ング部１１では、最後の入力ＰＣＭデータだけを用い
て、（Ｍ×Ｎ−Ｎ＋１）番目から（Ｍ×Ｎ）番目のＰＣ
Ｍデータを生成しているので、音質の低下を抑制でき
る。As described above, the up-sampling unit 11 of this embodiment uses only one frame of input PCM data to generate PCM data for that frame. Therefore, unlike the conventional example, the upsampling unit 11 of the present embodiment does not need the input PCM data at the head of the next frame, and can perform the upsampling process at high speed. Further, the upsampling unit 11 uses only the last input PCM data to generate the (M × N−N + 1) th to (M × N) th PCM data.
Since the M data is generated, a decrease in sound quality can be suppressed.

【００３０】続いて、サブバンド分割部１２は、アップ
サンプリング部１１からのＰＣＭデータを３２個の均等
なサブバンドに分ける帯域分割を施して、サブバンドデ
ータｓｂ０〜ｓｂ３１を生成しビット割り当て部１４に
出力する。このとき、サブバンド分割部１２は、上記ア
ップサンプリング情報Ｎ（＝２）を用いて、入力したＰ
ＣＭデータに含まれていない帯域の全てのサンプルをゼ
ロに設定する。次に、ビット割り当て部１４は、ビット
割り当て情報生成部１３からのビット割り当て情報に基
づいて、サブバンド分割部１２から入力したサブバンド
データｓｂ０〜ｓｂ３１にビット割り当て処理、及び量
子化処理を施し、量子化データを生成する。その後、多
重化部１５が、ビット割り当て部１４からの量子化デー
タをＭＰＥＧオーディオビットストリームに形成して出
力する。尚、ビット割り当て情報生成部１３でのビット
割り当て情報を作成する詳細な段階、及びビット割り当
て部１４での人間の聴覚特性を利用したビット割り当て
処理と量子化処理の詳細な段階は、従来例のものとそれ
ぞれ同様であるのでそれらの重複した説明は省略する。Subsequently, the sub-band division unit 12 performs band division for dividing the PCM data from the up-sampling unit 11 into 32 equal sub-bands, generates sub-band data sb0 to sb31, and Output to At this time, the subband dividing unit 12 uses the up-sampling information N (= 2) to
All samples in the band not included in the CM data are set to zero. Next, the bit allocation unit 14 performs a bit allocation process and a quantization process on the sub-band data sb0 to sb31 input from the sub-band division unit 12, based on the bit allocation information from the bit allocation information generation unit 13, Generate quantized data. Thereafter, the multiplexing unit 15 forms the quantized data from the bit allocation unit 14 into an MPEG audio bit stream and outputs the MPEG audio bit stream. The detailed steps of creating bit allocation information in the bit allocation information generation unit 13 and the detailed steps of bit allocation processing and quantization processing using human auditory characteristics in the bit allocation unit 14 are the same as those in the conventional example. Since they are the same as those described above, their duplicate description will be omitted.

【００３１】以上のように、本実施例のディジタルオー
ディオ符号化装置では、サブバンド分割部１２がアップ
サンプリング情報Ｎを用いて、入力したＰＣＭデータに
含まれていない帯域（サブバンド）の全てのサンプルを
ゼロに設定している。これにより、本実施例のディジタ
ルオーディオ符号化装置では、図３に示した従来例での
フィルタ処理部を省略して、当該装置の構成を簡略化す
ることができる。さらに、本実施例のディジタルオーデ
ィオ符号化装置では、折り返しノイズを生じることな
く、符号化処理を高速に行うことができる。As described above, in the digital audio encoding apparatus according to the present embodiment, the sub-band division unit 12 uses the up-sampling information N to perform all of the bands (sub-bands) not included in the input PCM data. The sample is set to zero. As a result, in the digital audio encoding apparatus according to the present embodiment, the filter processing unit in the conventional example shown in FIG. 3 is omitted, and the configuration of the apparatus can be simplified. Furthermore, in the digital audio encoding device of the present embodiment, encoding processing can be performed at high speed without aliasing noise.

【００３２】尚、上述の説明では、１チャンネルのＰＣ
ＭデータをＭＰＥＧオーディオ符号化部によって符号化
しＭＰＥＧオーディオビットストリームに形成する構成
について説明したが、実施例は１チャンネルのＰＣＭデ
ータに限定されるものではなく、例えばＭＰＥＧ２オー
ディオ規格（ISO/IEC 13818-3）に規定されたＭＰＥＧ
マルチチャンネルオーディオ信号を複数チャンネルのオ
ーディオ信号から生成することも可能である。さらに、
本発明は、ＭＰＥＧオーディオ規格に限定されるもので
はなく、サブバンド符号化方式のディジタルオーディオ
信号であれば、それ以外の高能率圧縮手法に規定された
符号化処理を高速に、かつ簡単な構成で実現することが
可能である。また、上述の実施例におけるディジタルオ
ーディオ符号化装置の符号化方法は、いずれもコンピュ
ータ・プログラム化することができるので、コンピュー
タにより実行可能な記録媒体にて本願の符号化方法を提
供することも可能である。ここでいうところの記録媒体
とは、フロッピーディスク、ＣＤ−ＲＯＭ、ＤＶＤ、光
磁気ディスク、リムーバブル・ハードディスク、及びフ
ラッシュメモリを含むデータ記録装置である。In the above description, a one-channel PC
Although the configuration in which the M data is encoded by the MPEG audio encoding unit and formed into an MPEG audio bit stream has been described, the embodiment is not limited to the one-channel PCM data. For example, the MPEG2 audio standard (ISO / IEC 13818- MPEG specified in 3)
It is also possible to generate a multi-channel audio signal from a multi-channel audio signal. further,
The present invention is not limited to the MPEG audio standard. For digital audio signals of the subband encoding system, the encoding process specified by other high-efficiency compression methods can be performed at high speed and with a simple configuration. It is possible to realize with. Further, since any of the encoding methods of the digital audio encoding device in the above-described embodiments can be computer-programmed, it is also possible to provide the encoding method of the present application on a computer-executable recording medium. It is. Here, the recording medium is a data recording device including a floppy disk, a CD-ROM, a DVD, a magneto-optical disk, a removable hard disk, and a flash memory.

【００３３】[0033]

【発明の効果】以上のように、本発明のディジタルオー
ディオ符号化装置、及びその符号化方法では、サブバン
ド分割部がアップサンプリング情報を用いて、入力した
入力ディジタルオーディオデータに含まれていない帯域
（サブバンド）の全てのデータをゼロに設定している。
これにより、本発明のディジタルオーディオ符号化装
置、及びその符号化方法では、フィルタ処理部を設ける
ことなく、当該装置の構成を簡略化することができる。
さらに、本発明のディジタルオーディオ符号化装置、及
びその符号化方法では、折り返しノイズを生じることな
く、符号化処理を高速に行うことができる。As described above, according to the digital audio encoding apparatus and the encoding method of the present invention, the sub-band division unit uses the up-sampling information to use the band not included in the input digital audio data. All data of (subband) is set to zero.
Thus, in the digital audio encoding device and the encoding method of the present invention, the configuration of the device can be simplified without providing a filter processing unit.
Further, with the digital audio encoding device and the encoding method of the present invention, encoding processing can be performed at high speed without generating aliasing noise.

【００３４】別の観点による発明のディジタルオーディ
オ符号化装置、及びその符号化方法では、入力ディジタ
ルオーディオデータの１フレーム分のデータ数をＭ個
（Ｍは自然数）としたとき、アップサンプリング部が、
１番目からＭ番目の前記入力ディジタルオーディオデー
タを用いて、１番目から（Ｍ×Ｎ−Ｎ）番目までの前記
オーディオデータを生成し、Ｍ番目の前記入力ディジタ
ルオーディオデータを（Ｍ×Ｎ−Ｎ＋１）番目から（Ｍ
×Ｎ）番目の前記オーディオデータとしている。これに
より、この発明のディジタルオーディオ符号化装置、及
びその符号化方法では、アップサンプリング部での処理
を軽減することができ、ディジタルオーディオ信号の符
号化処理を高速に行うことができる。また、本発明の符
号化方法はコンピュータ・プログラム化することができ
るので、本発明の符号化方法をコンピュータにより実行
可能な記録媒体に記録して実施することもできる。In the digital audio encoding apparatus and the encoding method of the invention according to another aspect, when the number of data of one frame of input digital audio data is M (M is a natural number), the upsampling unit
The first to Mth input digital audio data are used to generate first to (M × N−N) th audio data, and the Mth input digital audio data is generated as (M × N−N + 1). ) To (M
× N) th audio data. Thus, in the digital audio encoding device and the encoding method according to the present invention, the processing in the upsampling unit can be reduced, and the encoding processing of the digital audio signal can be performed at high speed. Further, since the encoding method of the present invention can be made into a computer program, the encoding method of the present invention can be recorded on a computer-executable recording medium and executed.

[Brief description of the drawings]

【図１】本発明の実施例であるディジタルオーディオ符
号化装置の主要部の構成を示すブロック図FIG. 1 is a block diagram showing a configuration of a main part of a digital audio encoding device according to an embodiment of the present invention.

【図２】図１に示したアップサンプリング部での補間処
理を示す説明図FIG. 2 is an explanatory diagram showing an interpolation process in an up-sampling unit shown in FIG. 1;

【図３】従来のディジタルオーディオ符号化装置の主要
部の構成を示すブロック図FIG. 3 is a block diagram showing a configuration of a main part of a conventional digital audio encoding device.

【図４】図３に示したアップサンプリング部での補間処
理を示す説明図FIG. 4 is an explanatory diagram showing an interpolation process in the upsampling unit shown in FIG. 3;

[Explanation of symbols]

１１アップサンプリング部１２サブバンド分割部１３ビット割り当て情報生成部１４ビット割り当て部１５多重化部 DESCRIPTION OF SYMBOLS 11 Upsampling part 12 Subband division part 13 Bit allocation information generation part 14 Bit allocation part 15 Multiplexing part

Claims

[Claims]

1. An input digital audio data sampled at a predetermined sampling frequency is up-sampled based on up-sampling information N (N is a factor of 2), and audio data sampled at N times the sampling frequency And a band division process for dividing the audio data from the up-sampling unit into predetermined sub-bands. Further, the sampling frequency is 1 / (2 × N) or more of N times the sampling frequency after the up-sampling. A digital audio encoding device, comprising: a subband division unit that sets all audio data of a subband belonging to a frequency domain to a value of zero.

2. When the number of data for one frame of the input digital audio data is M (M is a natural number), the up-sampling unit uses the first to M-th input digital audio data, The first to (M × N−N) th audio data is generated, and the Mth input digital audio data is used as the (M × N−N + 1) th to (M × N) th audio data. 2. The digital audio encoding device according to claim 1, wherein the digital audio encoding device is configured as described above.

3. The digital audio encoding apparatus according to claim 1, wherein the sub-band dividing unit is configured to divide input audio data into bands based on the MPEG standard. .

4. Up-sampling of input digital audio data sampled at a predetermined sampling frequency based on up-sampling information N (N is a power of 2), and audio data sampled at N times the sampling frequency , And a band division process of dividing the audio data generated in the up-sampling step into predetermined sub-bands, and more than 1 / (2 × N) of N times the sampling frequency after the up-sampling A sub-band division step of setting all audio data of a sub-band belonging to the frequency domain to a value of zero.

5. When the number of data for one frame of the input digital audio data is M (M is a natural number), the first to M-th input digital audio data are used in the upsampling step. The first to (M × N−N) th audio data is generated, and the Mth input digital audio data is converted from the (M × N−N + 1) th to (M × N).
The encoding method of the digital audio encoding device according to claim 4, wherein the audio data is the first audio data.

6. In the sub-band division step,
6. The input audio data is band-divided based on the MPEG standard.
3. The encoding method of the digital audio encoding device according to item 1.

7. Up-sampling of input digital audio data sampled at a predetermined sampling frequency based on up-sampling information N (N is a power of 2), and audio data sampled at N times the sampling frequency , And a band division process of dividing the audio data generated in the up-sampling step into predetermined sub-bands, and more than 1 / (2 × N) of N times the sampling frequency after the up-sampling A sub-band division step of setting all audio data of a sub-band belonging to the frequency domain to a value of zero.

8. When the number of data for one frame of the input digital audio data is M (M is a natural number), in the upsampling step, the first to M-th input digital audio data are used. The first to (M × N−N) th audio data is generated, and the Mth input digital audio data is converted from the (M × N−N + 1) th to (M × N).
8. A recording medium on which the encoding program according to claim 7 is recorded, wherein the encoding data is the first audio data.

9. In the sub-band dividing step,
9. The input audio data is divided into bands based on the MPEG standard.
A recording medium on which the encoding program described in 1 is recorded.