JP2002014700A

JP2002014700A - Method and device for processing audio signal, and storage medium

Info

Publication number: JP2002014700A
Application number: JP2000199608A
Authority: JP
Inventors: Takayuki Kikuchi; 孝之菊池
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2000-06-30
Filing date: 2000-06-30
Publication date: 2002-01-18

Abstract

PROBLEM TO BE SOLVED: To speedily turn digital audio signals into high efficient signs. SOLUTION: A mask amount computing section 104 converts frequency analyzed digital audio signals into first and second information. The first information indicates the values located within a prescribed range and the second information indicates the exponent of the power of ten. The section 104 searches a conversion table 109 to obtain the logarithm of the first information. Moreover, the section 104 obtains the sound pressure level of the digital audio signals on the basis of the logarithm of the first information and the second information and computes the mask amount on the basis of the obtained sound pressure level.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、デジタルオーディ
オ信号を高能率符号化する技術に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a technique for encoding digital audio signals with high efficiency.

【０００２】[0002]

【従来の技術】デジタルオーディオ信号を高能率符号化
する手法には、ＩＳＯ／ＩＥＣ１１１７２−３に準拠し
たＭＰＥＧオーディオ符号化方式がある。2. Description of the Related Art As a technique for encoding digital audio signals with high efficiency, there is an MPEG audio encoding method based on ISO / IEC11172-3.

【０００３】図５は、ＭＰＥＧオーディオ符号化方式を
実現する音声符号化装置の構成を示すブロック図であ
る。同図において、１は入力端子、２はサブバンド分割
部、３はＦＦＴ（高速フーリエ変換）演算部、４はマス
ク量演算部、５は量子化ステップ演算部、６は量子化
部、７は伝送フォーマット形成部、８は出力端子であ
る。FIG. 5 is a block diagram showing a configuration of a speech encoding device for realizing the MPEG audio encoding method. In the figure, 1 is an input terminal, 2 is a sub-band division unit, 3 is an FFT (fast Fourier transform) operation unit, 4 is a mask amount operation unit, 5 is a quantization step operation unit, 6 is a quantization unit, and 7 is A transmission format forming unit 8 is an output terminal.

【０００４】入力端子１は、デジタルオーディオ信号を
入力する。サブバンド分割部２は、入力端子１からのデ
ジタルオーディオ信号を所定の時間間隔（オーディオフ
レーム）に分割し、分割したオーディオフレームを複数
の周波数帯域に写像する。ＦＦＴ演算部３は、高速フー
リエ変換（ＦＦＴ）を用いて入力端子１からのデジタル
オーディオ信号を周波数分析する。マスク量演算部４
は、ＦＦＴ演算部３の分析結果から音圧レベル[ｄＢ]を
求めた後、この音圧レベルと所定の聴覚心理モデルとを
用いてマスク量を演算する。量子化ステップ演算部５
は、マスク量演算部４で求めたマスク量に基づいて各サ
ブバンドに割り当てるビット数を見積る（即ち、各サブ
バンドの量子化ステップを設定する）。量子化部６は、
量子化ステップ演算部５で求めた量子化ステップに基づ
いて各サブバンドを量子化する。伝送フォーマット形成
部７は、量子化部６の出力（高能率符号化されたデジタ
ルオーディオ信号），量子化ステップ演算部５で求めた
量子化ステップ等の情報を、所定の伝送フォーマットに
変換する。出力端子８は、高能率符号化されたデジタル
オーディオ信号を出力する。[0004] An input terminal 1 inputs a digital audio signal. The sub-band division unit 2 divides the digital audio signal from the input terminal 1 into predetermined time intervals (audio frames), and maps the divided audio frames to a plurality of frequency bands. The FFT operation unit 3 analyzes the frequency of the digital audio signal from the input terminal 1 using the fast Fourier transform (FFT). Mask amount calculation unit 4
Calculates a sound pressure level [dB] from the analysis result of the FFT calculation unit 3, and then calculates a mask amount using the sound pressure level and a predetermined psychoacoustic model. Quantization step calculator 5
Estimates the number of bits to be allocated to each subband based on the mask amount calculated by the mask amount calculation unit 4 (that is, sets a quantization step for each subband). The quantization unit 6
Each sub-band is quantized based on the quantization step obtained by the quantization step calculator 5. The transmission format forming unit 7 converts information such as the output of the quantization unit 6 (a highly efficient encoded digital audio signal) and the quantization step obtained by the quantization step calculation unit 5 into a predetermined transmission format. The output terminal 8 outputs a highly efficient encoded digital audio signal.

【０００５】[0005]

【発明が解決しようとする課題】上記のＭＰＥＧオーデ
ィオ符号化方式では、ＦＦＴ演算部３の分析結果から音
圧レベルを求める場合、数式（１）に示す対数演算を実
行しなければならない。Ｘ（ｋ）＝１０Ｌｏｇ（ＦＦＴ（ｋ））・・・（１）In the above-mentioned MPEG audio coding system, when a sound pressure level is obtained from the analysis result of the FFT operation unit 3, a logarithmic operation represented by the equation (1) must be executed. X (k) = 10Log (FFT (k)) (1)

【０００６】ここで、ＦＦＴ（ｋ）は周波数位置ｋにお
けるＦＦＴ演算部１０３の分析結果、Ｘ（ｋ）は周波数
位置ｋにおける音圧レベル（デシベル量）である。上記
のＭＰＥＧオーディオ符号化方式をソフトウェアで実現
する場合には、このような対数演算を周波数位置ｋごと
にソフトウェアで処理しなければならない。ところが、
このような対数演算の全てをソフトウェアで処理する場
合には、演算量と処理時間とが増加し、実時間での処理
が困難になるという問題がある。Here, FFT (k) is the analysis result of the FFT operation unit 103 at the frequency position k, and X (k) is the sound pressure level (decibel amount) at the frequency position k. When the above-mentioned MPEG audio encoding system is realized by software, such a logarithmic operation must be processed by software for each frequency position k. However,
When all such logarithmic operations are processed by software, the amount of calculation and the processing time increase, and there is a problem that real-time processing becomes difficult.

【０００７】本発明は上記問題点を鑑みてなされたもの
であり、デジタルオーディオ信号を高速に高能率符号化
するための技術を提供することを目的とする。The present invention has been made in view of the above problems, and has as its object to provide a technique for encoding digital audio signals at high speed with high efficiency.

【０００８】[0008]

【課題を解決するための手段】本発明の目的を達成する
ために、本発明の音声信号処理方法は、周波数分析され
たデジタルオーディオ信号を第１の情報と第２の情報と
に変換する変換工程と、所定の範囲内にある複数の値の
対数を保持するテーブルを検索し、前記第１の情報の対
数を求める検索工程と、前記第１の情報の対数と前記第
２の情報とに基づいて前記デジタルオーディオ信号の音
圧レベルを求め、求めた音圧レベルに基づいてマスク量
を演算するマスク量演算工程とを有することを特徴とす
る。In order to achieve the object of the present invention, an audio signal processing method according to the present invention comprises a converter for converting a frequency-analyzed digital audio signal into first information and second information. A step of searching a table that holds a logarithm of a plurality of values within a predetermined range to obtain a logarithm of the first information; and searching for a logarithm of the first information and a logarithm of the first information and the second information. Calculating a sound pressure level of the digital audio signal based on the calculated sound pressure level, and calculating a mask amount based on the calculated sound pressure level.

【０００９】また、本発明の音声信号処理装置は、周波
数分析されたデジタルオーディオ信号を第１の情報と第
２の情報とに変換する変換手段と、所定の範囲内にある
複数の値の対数を保持するテーブルを検索し、前記第１
の情報の対数を求める検索手段と、前記第１の情報の対
数と前記第２の情報とに基づいて前記デジタルオーディ
オ信号の音圧レベルを求め、求めた音圧レベルに基づい
てマスク量を演算するマスク量演算手段とを有すること
を特徴とする。Further, the audio signal processing apparatus of the present invention comprises a conversion means for converting a frequency-analyzed digital audio signal into first information and second information, and a logarithm of a plurality of values within a predetermined range. Search for a table that holds
Searching means for calculating the logarithm of the information, calculating the sound pressure level of the digital audio signal based on the logarithm of the first information and the second information, and calculating the mask amount based on the calculated sound pressure level And a mask amount calculating means.

【００１０】また、本発明の記憶媒体は、上記音声信号
処理方法をコンピュータに実現するためのプログラムコ
ードを保持することを特徴とする。Further, the storage medium of the present invention is characterized by holding a program code for realizing the above-mentioned audio signal processing method in a computer.

【００１１】[0011]

【発明の実施の形態】以下、添付図面を参照して本発明
の好適な実施の形態を詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Preferred embodiments of the present invention will be described below in detail with reference to the accompanying drawings.

【００１２】図１は、本発明の実施の形態における音声
符号化装置の構成を示す図である。本実施の形態では、
デジタルオーディオ信号を高能率符号化する手法の一つ
であるＭＰＥＧオーディオ符号化方式（ＩＳＯ／ＩＥＣ
１１１７２−３に準拠）を実現する音声符号化装置につ
いて説明する。なお、本実施の形態の音声符号化装置
は、一部あるいは全てをソフトウェアで実現することが
可能である。この場合、音声符号化装置の制御部（ＣＰ
ＵやＭＰＵを含む）がこのソフトウェアを実現するプロ
グラムを記憶媒体から読み出し、読み出したプログラム
に従って各種の処理を実行する。FIG. 1 is a diagram showing a configuration of a speech coding apparatus according to an embodiment of the present invention. In the present embodiment,
The MPEG audio encoding method (ISO / IEC), which is one of the techniques for encoding digital audio signals with high efficiency
11172-3) will be described. The speech encoding device according to the present embodiment can be partially or entirely realized by software. In this case, the control unit (CP
U and MPU) read a program for realizing this software from the storage medium, and execute various processes according to the read program.

【００１３】図１において、１０１は入力端子、１０２
はサブバンド分割部、１０３はＦＦＴ（高速フーリエ変
換）演算部、１０４はマスク量演算部、１０５は量子化
ステップ演算部、１０６は量子化部、１０７は伝送フォ
ーマット形成部、１０８は出力端子、１０９は変換テー
ブルである。In FIG. 1, reference numeral 101 denotes an input terminal;
Is a subband division unit, 103 is an FFT (fast Fourier transform) operation unit, 104 is a mask amount operation unit, 105 is a quantization step operation unit, 106 is a quantization unit, 107 is a transmission format formation unit, 108 is an output terminal, 109 is a conversion table.

【００１４】入力端子１０１はデジタルオーディオ信号
を入力し、入力したデジタルオーディオ信号をサブバン
ド分割部１０２およびＦＦＴ演算部１０３に供給する。
サブバンド分割部１０２は、デジタルオーディオ信号を
所定の時間間隔（オーディオフレーム）に分割し、分割
したオーディオフレームをオーバラップ・ポリフェイズ
・フィルタ・バンク（ＯＰＦＢ）を用いて７５０Ｈｚ単
位の周波数帯域（サブバンド）に写像する。各サブバン
ドのデータは、後に量子化部１０６で量子化される。An input terminal 101 inputs a digital audio signal, and supplies the input digital audio signal to a subband division unit 102 and an FFT operation unit 103.
The sub-band dividing unit 102 divides the digital audio signal into predetermined time intervals (audio frames), and divides the divided audio frames into frequency bands (sub-bands) in units of 750 Hz using an overlapping polyphase filter bank (OPFB). Band). The data of each subband is quantized by the quantization unit 106 later.

【００１５】一方、ＦＦＴ演算部１０３は、高速フーリ
エ変換を用いて、入力端子１０１からのデジタルオーデ
ィオ信号を周波数分析する。この分析結果は、マスク量
演算部１０４に供給される。マスク量演算部１０４は、
変換テーブル１０９を用いてＦＦＴ演算部１０３の分析
結果から音圧レベル［ｄＢ］を求めた後、この音圧レベ
ルと所定の聴覚心理モデルとを用いてマスク量を演算す
る。このマスク量は、量子化ステップ演算部１０５に供
給される。On the other hand, the FFT operation unit 103 analyzes the frequency of the digital audio signal from the input terminal 101 using the fast Fourier transform. This analysis result is supplied to the mask amount calculation unit 104. The mask amount calculation unit 104
After obtaining the sound pressure level [dB] from the analysis result of the FFT calculation unit 103 using the conversion table 109, the mask amount is calculated using the sound pressure level and a predetermined psychoacoustic model. This mask amount is supplied to the quantization step calculation unit 105.

【００１６】量子化ステップ演算部１０５は、マスク量
演算部１０４で求めたマスク量に基づいて各サブバンド
に割り当てるビット数を適応的に見積る（即ち、各サブ
バンドの量子化ステップを適応的に設定する）。The quantization step calculator 105 adaptively estimates the number of bits to be allocated to each subband based on the mask amount obtained by the mask amount calculator 104 (ie, adaptively determines the quantization step of each subband). Set).

【００１７】量子化部１０６は、量子化ステップ演算部
１０５で求めた量子化ステップに基づいて各サブバンド
を量子化する量子化部である。伝送フォーマット形成部
１０７は、量子化部１０６の出力（高能率符号化された
デジタルオーディオ信号），量子化ステップ演算部１０
５で求めた量子化ステップ等の情報を、所定の伝送フォ
ーマットに変換する伝送フォーマット形成部である。出
力端子１０８は、高能率符号化されたデジタルオーディ
オ信号を出力する。The quantization unit 106 is a quantization unit that quantizes each subband based on the quantization step obtained by the quantization step calculation unit 105. The transmission format forming unit 107 outputs the output of the quantizing unit 106 (a highly efficient encoded digital audio signal) and the quantization step calculating unit 10
The transmission format forming unit converts the information such as the quantization step obtained in step 5 into a predetermined transmission format. The output terminal 108 outputs a highly efficient encoded digital audio signal.

【００１８】図２は、変換テーブル１０９を作成するア
ルゴリズムを説明するフローチャートである。変換テー
ブル１０９は、所定の範囲内に存在する複数個の値の対
数を保持する。本実施の形態では、対数の周期性を考慮
し、所定の範囲を１以上１０未満とする例について説明
する。また、本実施の形態では、最小可聴限界値の有効
数字がデシベル量で小数点以下２桁であることを考慮
し、１．００から０．０１ごとに増加する値の対数を求
め、それらを変換テーブル１０９に格納する例について
説明する。FIG. 2 is a flowchart illustrating an algorithm for creating the conversion table 109. The conversion table 109 holds logarithms of a plurality of values existing within a predetermined range. In the present embodiment, an example in which the predetermined range is set to 1 or more and less than 10 in consideration of log periodicity will be described. Also, in the present embodiment, considering that the significant digits of the minimum audible limit value are two decimal places in decibels, the logarithm of the value increasing by 0.01 from 1.00 is calculated and converted. An example of storing in the table 109 will be described.

【００１９】ステップＳ２０１では、変数ｓｅｅｄに１
００をセットする。ステップＳ２０２では、変数ｓｅｅ
ｄに０．０１を乗じ、この乗算の結果の対数を演算す
る。In step S201, the variable seed is set to 1
Set 00. In step S202, the variable seed
Multiply d by 0.01 and calculate the logarithm of the result of this multiplication.

【００２０】ステップＳ２０３では、ステップＳ２０２
で求めた対数ＴＢＬｅｘｐ［ｓｅｅｄ］は、変換テーブ
ル１０９の変数ｓｅｅｄに対応するアドレスに格納され
る。なお、変換テーブル１０９のアドレスは、変数ｓｅ
ｅｄと同様に、１００から９９９まで変化する。In step S203, step S202
The logarithm TBLexp [seed] obtained in step (1) is stored in the conversion table 109 at an address corresponding to the variable seed. Note that the address of the conversion table 109 is the variable se
Like ed, it changes from 100 to 999.

【００２１】ステップＳ２０４では、変数ｓｅｅｄが１
０００に等しいか否かを判別する。等しくない場合に
は、ステップＳ２０５に進み、変数ｓｅｅｄに１を加
え、ステップＳ２０１に進む。一方、等しい場合には、
処理を終了する。In step S204, the variable seed is set to 1
000 is determined. If not equal, the process proceeds to step S205, 1 is added to the variable seed, and the process proceeds to step S201. On the other hand, if they are equal,
The process ends.

【００２２】以上のように構成することにより、変換テ
ーブル１０９には、１．００から９．９９までの対数を
格納することができる。With the above configuration, the conversion table 109 can store logarithms from 1.00 to 9.99.

【００２３】図３は、本実施の形態におけるマスク量演
算部１０４の構成を示す図である。本実施の形態では、
マスク量演算部１０４をソフトウェアによって実現する
場合について説明する。FIG. 3 is a diagram showing a configuration of the mask amount calculation unit 104 according to the present embodiment. In the present embodiment,
A case where the mask amount calculation unit 104 is realized by software will be described.

【００２４】ＦＦＴ演算部１０３の分析結果から音圧レ
ベルを求める場合、マスク量演算部１０４は数式（１）
に示す演算を実行しなければならない。Ｘ（ｋ）＝１０Ｌｏｇ（ＦＦＴ（ｋ））・・・（１）When calculating the sound pressure level from the analysis result of the FFT operation unit 103, the mask amount operation unit 104 calculates the expression (1)
The operation shown in must be performed. X (k) = 10Log (FFT (k)) (1)

【００２５】ここで、ＦＦＴ（ｋ）は周波数位置ｋにお
けるＦＦＴ演算部１０３の分析結果、Ｘ（ｋ）は周波数
位置ｋにおける音圧レベル（デシベル量）である。本実
施の形態のマスク量演算部１０４では、数式（１）に示
す演算を高速に行うために、音圧レベルＸ（ｋ）を次の
ようにして求める。即ち、ＦＦＴ（ｋ）の対数を直接演
算するのではなく、ＦＦＴ（ｋ）を一度「Ａ×１０^Ｂに
変換し、「Ａ」の対数と「Ｂ」との和に基づいてＦＦＴ
（ｋ）の対数を求め、音圧レベルＸ（ｋ）を求める。こ
こで、「Ａ」は変換テーブル１０９を作成する際に設定
した所定の範囲（即ち、１以上１０未満）内の存在する
値を示し、「Ｂ」は１０のべき乗を示す。Here, FFT (k) is the analysis result of the FFT operation unit 103 at the frequency position k, and X (k) is the sound pressure level (decibel amount) at the frequency position k. In the mask amount calculation unit 104 of the present embodiment, the sound pressure level X (k) is obtained as follows in order to perform the calculation represented by Expression (1) at high speed. That is, instead of directly calculating the logarithm of FFT (k), FFT (k) is once converted to “A × 10 ^B ”, and the FFT (k) is calculated based on the sum of the logarithm of “A” and “B”.
The logarithm of (k) is found, and the sound pressure level X (k) is found. Here, “A” indicates a value that exists within a predetermined range (ie, 1 or more and less than 10) set when the conversion table 109 is created, and “B” indicates a power of 10.

【００２６】図３において、３０１は変換モジュール、
３０２はテーブル検索モジュール、３０３はデシベル値
演算モジュール、３０４はマスク量演算モジュールであ
る。In FIG. 3, reference numeral 301 denotes a conversion module;
302 is a table search module, 303 is a decibel value calculation module, and 304 is a mask amount calculation module.

【００２７】変換モジュール３０１は、ＦＦＴ演算部１
０３の分析結果であるＦＦＴ（ｋ）を「Ａ×１０^Ｂ」に
変換する。ここで求めた「Ａ」は第１の情報としてテー
ブル検索モジュール３０２に供給され、「Ｂ」は第２の
情報としてデシベル値演算モジュール３０３に供給され
る。The conversion module 301 includes an FFT operation unit 1
FFT (k), which is the analysis result of 03, is converted into “A × 10 ^B ”. “A” obtained here is supplied to the table search module 302 as first information, and “B” is supplied to the decibel value calculation module 303 as second information.

【００２８】テーブル検索モジュール３０２は、変換テ
ーブル１０９を検索し、変換モジュール３０１で求めた
第１の情報（即ち、「Ａ」）の対数を求める。The table search module 302 searches the conversion table 109 and obtains the logarithm of the first information (that is, "A") obtained by the conversion module 301.

【００２９】デシベル値演算モジュール３０３は、第１
の情報（即ち、「Ａ」）の対数と第２の情報（即ち、
「Ｂ」）とを用いて数式（１）に示す演算を行い、音圧
レベルＸ（ｋ）を求める。The decibel value calculation module 303 has a first
Of the information (ie, “A”) and the second information (ie,
(B)) to calculate the sound pressure level X (k).

【００３０】マスク量演算モジュール３０４は、デシベ
ル値演算モジュール３０３で求めた音圧レベルＸ（ｋ）
と所定の聴覚心理モデルとを用いてマスク量を演算す
る。このマスク量は、量子化ステップ演算部１０５に供
給される。The mask amount calculation module 304 calculates the sound pressure level X (k) obtained by the decibel value calculation module 303.
And a predetermined psychoacoustic model to calculate the mask amount. This mask amount is supplied to the quantization step calculation unit 105.

【００３１】図４は、変換テーブル１０９を用いてＦＦ
Ｔ演算部１０３の解析結果からマスク量を求めるアルゴ
リズムを説明するフローチャートである。FIG. 4 shows an FF using the conversion table 109.
5 is a flowchart illustrating an algorithm for obtaining a mask amount from an analysis result of a T operation unit 103.

【００３２】ステップＳ４０１において、変換モジュー
ル３０１はＦＦＴ演算部１０３の分析結果であるＦＦＴ
（ｋ）を入力する。ここで、変換テーブル１０９を小数
点以下第２位までの値から作成した場合、変換モジュー
ル３０１は切り捨て、切り上げまたは四捨五入のいずれ
かによってＦＦＴ（ｋ）を小数点以下第２位に丸める。
ステップＳ４０２において、変換モジュール３０１は、
変数ｃｏｎｓｔを０にリセットする。なお、変数ｃｏｎ
ｓｔは、上述の「Ｂ」に相当する。In step S 401, the transform module 301 executes the FFT
Enter (k). Here, when the conversion table 109 is created from values to the second decimal place, the conversion module 301 rounds the FFT (k) to the second decimal place by either rounding down, rounding up, or rounding.
In step S402, the conversion module 301
Reset the variable const to 0. Note that the variable con
st corresponds to “B” described above.

【００３３】ステップＳ４０３において、変換モジュー
ル３０１はＦＦＴ（ｋ）が１０．０未満か否かを判定す
る。ＦＦＴ（ｋ）が１０．０以上である場合にはステッ
プＳ４０４に進み、１０．０未満である場合にはステッ
プＳ４０７に進む。In step S403, the conversion module 301 determines whether FFT (k) is less than 10.0. If FFT (k) is 10.0 or more, the process proceeds to step S404. If FFT (k) is less than 10.0, the process proceeds to step S407.

【００３４】ステップＳ４０４では、変数ｃｏｎｓｔに
１を加算する。ステップＳ４０５では、ＦＦＴ（ｋ）に
０．１を乗じる。ステップＳ４０６では、ステップＳ４
０５で求めた乗算結果が１０．０未満か否かを判定す
る。ＦＦＴ（ｋ）が１０．０以上である場合にはステッ
プＳ４０４に戻り、１０．０未満である場合にはステッ
プＳ４１１に進む。In step S404, 1 is added to the variable const. In step S405, FFT (k) is multiplied by 0.1. In step S406, step S4
It is determined whether the multiplication result obtained in step 05 is less than 10.0. If FFT (k) is 10.0 or more, the process returns to step S404. If FFT (k) is less than 10.0, the process proceeds to step S411.

【００３５】このように、ＦＦＴ（ｋ）が１０．０以上
である場合、変換モジュール３０１はＦＦＴ（ｋ）が１
０．０未満になるまで、ＦＦＴ（ｋ）に０．１を乗じ、
変数ｃｏｎｓｔに１を加算する。As described above, when the FFT (k) is 10.0 or more, the conversion module 301 determines that the FFT (k) is 1
Multiply FFT (k) by 0.1 until it is less than 0.0,
Add 1 to the variable const.

【００３６】ステップＳ４０７において、変換モジュー
ル３０１はＦＦＴ（ｋ）が１．００未満か否かを判定す
る。ＦＦＴ（ｋ）が１．００未満である場合にはステッ
プＳ４０８に進む。一方、ＦＦＴ（ｋ）が１．００以上
である場合には、ＦＦＴ（ｋ）は１以上１０未満内の存
在する値であるので、ステップＳ４１１に進む。In step S407, the conversion module 301 determines whether FFT (k) is less than 1.00. If FFT (k) is less than 1.00, the process proceeds to step S408. On the other hand, if FFT (k) is equal to or greater than 1.00, FFT (k) is an existing value within the range of 1 to less than 10, and the process proceeds to step S411.

【００３７】ステップＳ４０８では、変数ｃｏｎｓｔか
ら１を減算する。ステップＳ４０９では、ＦＦＴ（ｋ）
に１０を乗じる。ステップＳ４１０では、ステップＳ４
０９で求めた乗算結果が１．００未満か否かを判定す
る。ＦＦＴ（ｋ）が１．００未満である場合にはステッ
プＳ４０８に戻り、１．００以上である場合にはステッ
プＳ４１１に進む。In step S408, 1 is subtracted from the variable const. In step S409, FFT (k)
Multiply by 10. In step S410, step S4
It is determined whether the multiplication result obtained in step 09 is less than 1.00. If FFT (k) is less than 1.00, the process returns to step S408; otherwise, the process proceeds to step S411.

【００３８】このように、ＦＦＴ（ｋ）が１．００未満
である場合、変換モジュール３０１はＦＦＴ（ｋ）が
１．００以上になるまで、ＦＦＴ（ｋ）に１０を乗じ、
変数ｃｏｎｓｔから１を減算する。Thus, if FFT (k) is less than 1.00, transform module 301 multiplies FFT (k) by 10 until FFT (k) is greater than or equal to 1.00,
Subtract 1 from the variable const.

【００３９】ステップＳ４１１において、テーブル検索
モジュール３０２は、１以上１０未満内の値に変換され
たＦＦＴ（ｋ）（上述の「Ａ」に相当する）に対応する
アドレスａｄｒを求める。具体的には、ＦＦＴ（ｋ）に
１００を乗じた結果の整数部からアドレスａｄｒを求め
る。そして、テーブル検索モジュール３０２は、求めた
アドレスａｄｒに基づいて変換テーブル１０９を検索
し、アドレスａｄｒが保持する対数ＴＢＬｅｘｐ［ａｄ
ｒ］（「Ａ」の対数に相当する）を求める。In step S411, the table search module 302 obtains an address adr corresponding to FFT (k) (corresponding to "A" described above) converted to a value of 1 or more and less than 10. Specifically, the address adr is obtained from the integer part of the result of multiplying FFT (k) by 100. Then, the table search module 302 searches the conversion table 109 based on the obtained address adr, and finds the logarithmic TBLexp [ad held by the address adr.
r] (corresponding to the logarithm of “A”).

【００４０】ステップＳ４１２において、デシベル値演
算モジュール３０３は、ステップＳ４１１で求めた対数
ＴＢＬｅｘｐ［ａｄｒ］と変数ｃｏｎｓｔとを加算し、
この加算結果に１０を乗じて音圧レベルＸ（ｋ）を求め
る。即ち、ステップＳ４１１で求めた対数ＴＢＬｅｘｐ
［ａｄｒ］と変数ｃｏｎｓｔに基づいて数式（１）に示
す対数演算を実現する。In step S412, the decibel value calculation module 303 adds the logarithmic TBLexp [adr] obtained in step S411 and the variable const, and
The sound pressure level X (k) is obtained by multiplying the addition result by 10. That is, the logarithmic TBLexp obtained in step S411
The logarithmic operation shown in Expression (1) is realized based on [adr] and the variable const.

【００４１】次の周波数位置ｋのＦＦＴ（ｋ）から音圧
レベルＸ（ｋ）を演算する場合、マスク値演算モジュー
ル３０４はステップＳ４０１に進み、それ以外の場合、
マスク値演算モジュール３０４はステップＳ４１４に進
む。When calculating the sound pressure level X (k) from the FFT (k) at the next frequency position k, the mask value calculating module 304 proceeds to step S401.
The mask value calculation module 304 proceeds to step S414.

【００４２】ステップＳ４１４において、マスク値演算
モジュール３０４は、ステップＳ４１２で求めた音圧レ
ベルＸ（ｋ）に基づいてマスク量を演算する。このよう
に得られたマスク量は、量子化ステップ演算部１０５に
供給される。量子化ステップ演算部１０５はこのマスク
量に基づいて各サブバンドの量子化ステップを適応的に
設定する。In step S414, the mask value calculation module 304 calculates a mask amount based on the sound pressure level X (k) obtained in step S412. The mask amount thus obtained is supplied to the quantization step calculation unit 105. The quantization step calculation unit 105 adaptively sets the quantization step of each subband based on the mask amount.

【００４３】以上説明したように本実施の形態によれ
ば、ＦＦＴ（ｋ）を直接対数演算するのではなく、ＦＦ
Ｔ（ｋ）を一度「Ａ×１０^Ｂ」に変換し、「Ａ」の対数
と「Ｂ」との和に基づいてＦＦＴ（ｋ）の対数を求める
ので、通常の対数演算よりもAs described above, according to the present embodiment, FFT (k) is not directly logarithmically calculated,
Since T (k) is once converted to “A × 10 ^B ”, and the logarithm of FFT (k) is obtained based on the sum of the logarithm of “A” and “B”, the logarithmic calculation is more efficient than the usual logarithmic calculation.

【００４４】高速に音圧レベルＸ（ｋ）を演算すること
ができ、デジタルオーディオ信号を高速に高能率符号化
することができる。また、本実施の形態によれば、最小
可聴限界値の有効数字を考慮して変換テーブル１０９を
作成するので、演算精度の劣化を少なくすることも可能
である。The sound pressure level X (k) can be calculated at high speed, and the digital audio signal can be encoded at high speed with high efficiency. Further, according to the present embodiment, since conversion table 109 is created in consideration of the significant figures of the minimum audible limit value, it is possible to reduce the deterioration of the calculation accuracy.

【００４５】本実施の形態では、小数点以下第２位まで
の値の対数を求め、それらを変換テーブル１０９に格納
する場合について説明したが、これに限るものではな
い。小数点以下第３位までの値の対数を求め、それらを
変換テーブル１０９に格納してもよい。In the present embodiment, a case has been described where the logarithms of the values up to the second decimal place are calculated and stored in the conversion table 109, but the present invention is not limited to this. The logarithms of the values up to the third decimal place may be obtained and stored in the conversion table 109.

【００４６】また、本実施の形態では、上述の指数解析
アルゴリズムをＩＳＯ／ＩＥＣ１１１７２−３に準拠し
たＭＰＥＧオーディオ符号化方式に適用する場合につい
て説明したが、これに限るものではない。ＡＴＲＡＣ
（アダプティブ・トランスフォーム・アコースティック
・コーディング）方式，ＰＡＳＣ（プリシジョン・アダ
プティブ・サブバンド・コーディング）方式等の聴覚心
理特性を利用した音声符号化方式に適用することも可能
である。In this embodiment, the case where the above-described exponent analysis algorithm is applied to the MPEG audio coding system conforming to ISO / IEC11172-3 has been described, but the present invention is not limited to this. ATRAC
(Adaptive Transform Acoustic Coding) method, PASC (Precision Adaptive Subband Coding) method and the like can be applied to a speech coding method using psychoacoustic characteristics.

【００４７】[0047]

【他の実施の形態】本実施の形態は、複数の機器から構
成されるシステムに適用することも可能である。[Other Embodiments] The present embodiment can be applied to a system including a plurality of devices.

【００４８】また、本実施の形態は前述したように、本
実施の形態の機能を実現するソフトウェアのプログラム
コードを記録した記憶媒体を、システムあるいは装置に
供給し、そのシステムあるいは装置の制御部（ＣＰＵや
ＭＰＵを含む）が記憶媒体に格納されたプログラムコー
ドを読み出し実行することによっても、達成されること
は言うまでもない。In the present embodiment, as described above, a storage medium storing software program codes for realizing the functions of the present embodiment is supplied to a system or an apparatus, and a control unit (control unit) of the system or the apparatus is provided. Needless to say, the present invention is also achieved when a CPU (including a CPU and an MPU) reads out and executes the program code stored in the storage medium.

【００４９】この場合、記憶媒体から読み出されたプロ
グラムコード自体が前述した実施形態の機能を実現する
ことになり、そのプログラムコードを記憶した記憶媒体
は本発明を構成することになる。また、コンピュータが
読み出したプログラムコードを実行することにより、前
述した実施形態の機能が実現されるだけでなく、そのプ
ログラムコードの指示に基づき、コンピュータ上で稼働
しているオペレーティングシステム（ＯＳ）などが実際
の処理の一部または全部を行い、その処理によって前述
した実施形態の機能が実現される場合も含まれることは
言うまでもない。In this case, the program code itself read from the storage medium implements the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention. When the computer executes the readout program codes, not only the functions of the above-described embodiments are realized, but also an operating system (OS) running on the computer based on the instructions of the program codes. It goes without saying that a case where some or all of the actual processing is performed and the functions of the above-described embodiments are realized by the processing is also included.

【００５０】さらに、記憶媒体から読み出されたプログ
ラムコードが、コンピュータに挿入された機能拡張カー
ドやコンピュータに接続された機能拡張ユニットに備わ
るメモリに書込まれた後、そのプログラムコードの指示
に基づき、その機能拡張カードや機能拡張ユニットに備
わるＣＰＵなどが実際の処理の一部または全部を行い、
その処理によって前述した実施形態の機能が実現される
場合も含まれることは言うまでもない。Further, after the program code read from the storage medium is written into a memory provided in a function expansion card inserted into the computer or a function expansion unit connected to the computer, the program code is read based on the instruction of the program code. , The CPU provided in the function expansion card or the function expansion unit performs part or all of the actual processing,
It goes without saying that a case where the function of the above-described embodiment is realized by the processing is also included.

【００５１】[0051]

【発明の効果】上記説明の通り本発明によれば、デジタ
ルオーディオ信号を高速に高能率符号化することができ
る。As described above, according to the present invention, digital audio signals can be encoded at high speed with high efficiency.

[Brief description of the drawings]

【図１】本実施の形態における音声符号化装置の構成を
説明するブロック図である。FIG. 1 is a block diagram illustrating a configuration of a speech encoding device according to the present embodiment.

【図２】本実施の形態における変換テーブルの作成手順
を説明するフローチャートである。FIG. 2 is a flowchart illustrating a procedure for creating a conversion table in the present embodiment.

【図３】本実施の形態におけるマスク量演算部の構成を
説明するブロック図である。FIG. 3 is a block diagram illustrating a configuration of a mask amount calculation unit according to the present embodiment.

【図４】本実施の形態におけるマスク量演算部の処理手
順を説明するフローチャートである。FIG. 4 is a flowchart illustrating a processing procedure of a mask amount calculation unit according to the present embodiment.

【図５】従来の音声符号化装置の構成を示すブロック図
である。FIG. 5 is a block diagram illustrating a configuration of a conventional speech encoding device.

Claims

[Claims]

A conversion step of converting a frequency-analyzed digital audio signal into first information and second information; and searching a table holding a logarithm of a plurality of values within a predetermined range, A search step of obtaining a logarithm of the first information; obtaining a sound pressure level of the digital audio signal based on the logarithm of the first information and the second information; and a mask amount based on the obtained sound pressure level. And calculating a mask amount.

2. The audio signal processing method according to claim 1, wherein the predetermined range is 1 or more and less than 10.

3. The converting step includes: converting a frequency-analyzed digital audio signal to a value within the predetermined range;
2. The method according to claim 1, wherein the data is converted into a product of a power of zero.
Or the audio signal processing method according to 2.

4. The audio signal processing method according to claim 1, wherein the first information indicates a value within the predetermined range.

5. The audio signal processing method according to claim 1, wherein the second information indicates an exponent of a power of 10.

6. The mask amount calculating step, wherein a sound pressure level of the digital audio signal is obtained based on a result obtained by adding a logarithm of the first information and the second information. The audio signal processing method according to any one of claims 1 to 5.

7. A conversion means for converting the frequency-analyzed digital audio signal into first information and second information, and searching a table holding a logarithm of a plurality of values within a predetermined range, Searching means for obtaining the logarithm of the first information; obtaining the sound pressure level of the digital audio signal based on the logarithm of the first information and the second information; An audio signal processing device comprising:

8. The audio signal processing device according to claim 7, wherein the predetermined range is 1 or more and less than 10.

9. The converting means converts the frequency-analyzed digital audio signal to a value within the predetermined range and 1
8. The method according to claim 7, wherein the data is converted to a product of a power of zero.
Or the audio signal processing device according to 8.

10. The audio signal processing apparatus according to claim 7, wherein the first information indicates a value within the predetermined range.

11. The audio signal processing apparatus according to claim 7, wherein the second information indicates an exponent of a power of 10.

12. The apparatus according to claim 1, wherein the mask amount calculating means obtains a sound pressure level of the digital audio signal based on a result obtained by adding a logarithm of the first information and the second information. The audio signal processing device according to any one of 7 to 11.

13. A storage medium storing program codes for realizing the audio signal processing method according to claim 1. Description: