JPH07202821A

JPH07202821A - High efficiency coding voice decoder

Info

Publication number: JPH07202821A
Application number: JP5348433A
Authority: JP
Inventors: Masami Aizawa; 雅己相沢
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1993-12-27
Filing date: 1993-12-27
Publication date: 1995-08-04

Abstract

PURPOSE:To eliminate the need for addition of hardware such as a distribute filter and a synthesis filter after complete decoding by providing various adjustment to a high efficiency coding voice signal on the way of voice decoding. CONSTITUTION:A de-format section 12 analyzes a bit stream subject to high efficiency coding to separate allocation information, a scale factor and a frequency sampling thereby implementing de-formatting and the result is fed to a reproduction section 14, in which inverse quantization of a frequency sampling and inverse scaling processing are implemented and its output is given to an inverse mapping section 16 and a decoded acoustic signal is outputted. The scale factor of the reproduction section 14 is fed to an output section 23, from which a maximum value for each prescribed period of the scale factor is outputted as range information. Furthermore, a quantization coefficient in the reproduction section 14 is optionally adjusted by an input section 21.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】高能率符号化された音声信号を復
号化する高能率符号化音声の復号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a high-efficiency coded speech decoding apparatus for decoding a high-efficiency coded speech signal.

【０００２】[0002]

【従来の技術】従来利用されている音声圧縮は時間軸上
の相関を利用したＡＤＰＣＭや、準瞬時圧縮などであ
り、ここでいう高効率符号化音声信号は、音響信号に対
してスペクトル解析を行ない、人の聴覚心理モデルにも
とずき、聞こえない成分についての情報を削減すること
で大幅な圧縮を行なうものである。2. Description of the Related Art Conventionally used speech compression is ADPCM utilizing correlation on a time axis, quasi-instantaneous compression, etc. The high-efficiency coded speech signal referred to here is a spectrum analysis of an acoustic signal. According to the human psychology model of human beings, a large amount of compression is performed by reducing the information about inaudible components.

【０００３】図２（Ａ）に示すように高能率符号化され
た信号、ビットストリームはヘッダーと、音声信号を周
波数帯域（バンド）に分割し、スケールファクターや、
周波数サンプルの情報量を示す割り当て情報（アロケー
ション情報）と、そのバンドのレンジ（指数情報）を示
すスケールファクターと、周波数サンプルとから構成さ
れている。As shown in FIG. 2 (A), a signal and a bit stream which are highly efficient coded are divided into a header and an audio signal into frequency bands, and scale factors and
It is composed of allocation information (allocation information) indicating the information amount of the frequency sample, a scale factor indicating the range (index information) of the band, and the frequency sample.

【０００４】図１１（Ａ）に従来の高能率音声復号化装
置を示す。FIG. 11A shows a conventional high efficiency speech decoding apparatus.

【０００５】入力されたビットストリーム１１１は、デ
フォーマット１１２において、アロケーション情報１１
３ａと、スケールファクター１１３ｂと、周波数サンプ
ル１１３ｃとに、アロケーション情報１１３ａを参考に
して分割される。分割された周波数サンプル１１３ｃ
は、再生部１１４により逆量子化、逆スケーリングされ
る。逆スケールリングされた周波数サンプル１１５は、
ＩＦＦＴやＩＭＤＣＴなどの周波数−時間軸変換の演算
を行う逆マッピング部１１６により、時間軸のデータに
変換され、オーバーラップ等の処理を施した後合成さ
れ、音響信号１１７として復元される。The input bit stream 111 is allocated to the allocation information 11 in the deformat 112.
3a, the scale factor 113b, and the frequency sample 113c are divided with reference to the allocation information 113a. Divided frequency samples 113c
Is inversely quantized and inversely scaled by the reproducing unit 114. The inverse scaled frequency samples 115 are
The inverse mapping unit 116, which performs a frequency-time axis conversion operation such as IFFT or IMDCT, converts the data into time axis data, performs processing such as overlap, and then synthesizes the data to restore the sound signal 117.

【０００６】復元した音響信号１１７は、分配フィルタ
ー１１８に送られる。分配フィルター１１８は音声を周
波数分解し、複数の帯域に分割し、レベル調整部１２０
に伝送する。入力部１２１は、各帯域の増減比率を与え
るための情報１２２を出力するもので、この情報１２２
により、レベル調整部１２０では帯域ごとに増幅、減衰
が行われ、その出力は、統合フィルター１２６に伝送さ
れる。また、調整を行われた各帯域の信号１２５はある
一定の期間、例えば１００ｍｓの間の最大値が求めら
れ、帯域ごとに出力部１２３に送られる。出力部１２３
では各帯域のレベルをインジケーター等で表示する。統
合フィルター１２６は周波数ごとに分割された信号１２
５を合成し、音響信号１２７として出力する。The restored acoustic signal 117 is sent to the distribution filter 118. The distribution filter 118 frequency-decomposes the sound and divides it into a plurality of bands, and the level adjusting unit 120
To transmit. The input unit 121 outputs information 122 for giving the increase / decrease ratio of each band.
As a result, the level adjusting unit 120 performs amplification and attenuation for each band, and the output thereof is transmitted to the integrated filter 126. Further, the adjusted signal 125 of each band is determined to have a maximum value for a certain period, for example, 100 ms, and is sent to the output unit 123 for each band. Output unit 123
Then, the level of each band is displayed by an indicator or the like. The integrated filter 126 divides the signal 12 into frequency components.
5 is synthesized and output as an acoustic signal 127.

【０００７】次に図１１（Ｂ）を用いて再生部１１４で
の処理について説明する。Next, the processing in the reproducing section 114 will be described with reference to FIG.

【０００８】逆量子化部１３４ではデフォーマットされ
た周波数サンプル１１３ｃをアロケーション情報１１３
ａにもとずき、逆量子化し、逆スケーリング部１３６に
伝送する。逆量子化は次の式により行う。The dequantization unit 134 converts the reformatted frequency samples 113c into allocation information 113.
Based on a, it is inversely quantized and transmitted to the inverse scaling unit 136. Inverse quantization is performed by the following formula.

【０００９】ｓ′＝ｃ×（ｓ＋ｄ）ここでｃ，ｄは逆量子化係数といわれ、アロケーション
情報が示す、量子化ステップ数により定まる。逆量子化
係数ｃ，ｄを表１に示す。S '= c * (s + d) Here, c and d are called inverse quantization coefficients and are determined by the number of quantization steps indicated by the allocation information. Table 1 shows the inverse quantized coefficients c and d.

【００１０】係数算出器１３７はレンジ情報であるスケ
ールファクター１１３ｂから逆スケーリングに必要な乗
数１３８を、テーブルを参照、あるいは演算により求
め、逆スケーリングに必要な乗数値１３８を伝送する。
スケールファクター１１３ｂは６ビット、２ｄＢステッ
プの離散値を表している場合、逆スケーリングに必要な
乗数値１３８をｄＢ、つまり指数の値からを、テーブル
等を利用し、乗数値を算出しなくてはならない。その一
例としてスケールファクター１１３ｂと乗数値との関係
を表２に示す。逆スケーリング部１３６では、逆量子化
された信号を係数算出器１３７により求められた各バン
ドの乗数値１３８を乗算し、出力する。The coefficient calculator 137 obtains a multiplier 138 required for inverse scaling from the scale factor 113b, which is range information, by referring to a table or by calculation, and transmits a multiplier value 138 required for inverse scaling.
When the scale factor 113b represents 6 bits and a discrete value of 2 dB steps, the multiplier value 138 required for inverse scaling is calculated in dB, that is, from the value of the exponent, using a table or the like to calculate the multiplier value. I won't. As an example, Table 2 shows the relationship between the scale factor 113b and the multiplier value. The inverse scaling unit 136 multiplies the inversely quantized signal by the multiplier value 138 of each band obtained by the coefficient calculator 137, and outputs the result.

【００１１】[0011]

【発明が解決しようとする課題】上記した高能率音声符
号化信号の復号装置によると、イコライザーなどの音質
を操作するアプリケーションを行う場合、図１１に示し
たように、一度、完全な復号を行いに音響信号１１７を
得てから、分配フィルターにて帯域毎に分配して調整や
レンジの表示を行い、最後に調整された各信号を合成す
るという手法が取られており、分配フィルタや統合フィ
ルターといったハードウェアが必要で、回路規模、及び
価格の面で不利である。According to the above-described highly efficient speech coded signal decoding apparatus, when an application for manipulating sound quality such as an equalizer is performed, complete decoding is performed once as shown in FIG. After the acoustic signal 117 has been obtained, the distribution filter distributes each band for adjustment and range display, and finally the adjusted signals are combined. Such hardware is required, which is disadvantageous in terms of circuit scale and price.

【００１２】そこでこの発明は、分配フィルタや統合フ
ィルターといったハードウェアを不要とし、復号の過程
において各種の音声調整が可能であり、またレンジ情報
の取りだしが可能な高能率符号化音声の復号化装置を提
供することを目的とする。Therefore, the present invention eliminates the need for hardware such as a distribution filter and an integrated filter, allows various voice adjustments in the decoding process, and is a highly efficient coded voice decoding device capable of extracting range information. The purpose is to provide.

【００１３】[0013]

[Means for Solving the Problems]

（１）高能率符号化されたビットストリームを解析し、
アロケーション情報、スケールファクター、周波数サン
プルを分離してデフォーマットする手段と、該スケール
ファクター、または周波数データの、一定期間ごとの最
大値または、平均値を出力する手段と、該最大値または
平均値を表示する手段と、該スケールファクターによ
り、周波数サンプルを逆量子化する手段と、逆量子化さ
れた周波数サンプルを周波数統合する手段とを有するこ
とを要旨とする。(1) Analyze the high efficiency coded bit stream,
Means for separating and reformatting allocation information, scale factor, frequency samples, means for outputting the maximum value or average value of the scale factor or frequency data for each fixed period, and the maximum value or average value The gist of the present invention is to have means for displaying, means for dequantizing frequency samples by the scale factor, and means for frequency integrating the dequantized frequency samples.

【００１４】（２）ビットストリームを解析し、アロケ
ーション情報、スケールファクター、周波数サンプルを
分離、デフォーマットする手段と、周波数サンプルを操
作する値を入力する手段と、入力手段のデータによりス
ケールファクターまたは周波数データを変更する手段
と、該スケールファクターにより、周波数サンプルを逆
量子化する手段と、逆量子化された周波数サンプルを周
波数統合する手段とを有することを要旨とする。(2) A means for analyzing a bit stream and separating and reformatting allocation information, scale factor, and frequency samples, means for inputting values for operating frequency samples, and scale factor or frequency depending on the data of the input means. The gist of the present invention is to have means for changing data, means for dequantizing frequency samples by the scale factor, and means for frequency integrating the dequantized frequency samples.

【００１５】[0015]

【作用】ビットストリームを解析し、アロケーション情
報、スケールファクター、周波数サンプルを分離、デフ
ォーマットし、逆量子化された周波数サンプルを周波数
統合する前段階の時点で、スケールファクター、または
周波数データの、一定期間ごとの最大値または、平均値
を出力し、表示を行うことにより、また周波数サンプル
を操作する値を入力し、スケールファクターまたは周波
数データを変更することにより、分析ファクター、統合
フィルターを不必要としている。Operation: Before the bit stream is analyzed, the allocation information, the scale factor, the frequency samples are separated and reformatted, and the dequantized frequency samples are frequency integrated, the scale factor or the frequency data is kept constant. By outputting the maximum value or average value for each period and displaying it, and by inputting the value for operating the frequency sample, and changing the scale factor or frequency data, the analysis factor and integrated filter are unnecessary. There is.

【００１６】[0016]

【実施例】以下、この発明の実施例について図面を参照
しながら説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００１７】図１（Ａ）、（Ｂ）はこの発明の実施例に
おける全体の構成を示すブロック図である。FIGS. 1A and 1B are block diagrams showing the overall construction of an embodiment of the present invention.

【００１８】図２（Ａ）に示すように高能率符号化され
た信号（ビットストリーム）は、ヘッダーと、音声信号
を周波数帯域（バンド）に分割し、スケールファクター
や周波数サンプルの情報量を示す割り当て情報（アロケ
ーション情報）と、そのバンドのレンジ（指数情報）を
示すスケールファクターと、周波数サンプルとから構成
される。As shown in FIG. 2 (A), the signal (bit stream) coded with high efficiency is obtained by dividing the header and the audio signal into frequency bands, and indicates the scale factor and the information amount of frequency samples. It is composed of allocation information (allocation information), a scale factor indicating the range of the band (index information), and frequency samples.

【００１９】図１（Ａ）は、この発明の高能率復号化装
置の第１の実施例である。FIG. 1A shows the first embodiment of the high-efficiency decoding device of the present invention.

【００２０】入力された上記ビットストリーム１１は、
デフォーマット部１２において、アロケーション情報１
３ａと、スケールファクター１３ｂと、アロケーション
情報１３ａを参考にして分割された周波数サンプルとに
分離されて、再生部１４に送られる。The input bit stream 11 is
In the reformatting unit 12, allocation information 1
3a, the scale factor 13b, and the frequency samples divided by referring to the allocation information 13a are sent to the reproducing unit 14.

【００２１】再生部１４では、分割された周波数サンプ
ル１３ｃを量子化、逆スケーリングし、逆マッピング部
１６に伝送する。逆マッピング部１６では周波数統合が
行われる。また、ある一定の周期、例えば１００ｍｓの
間の、スケールファクター１３ｂまたは逆スケーリング
された周波数サンプル１５の各帯域の最大値あるいは平
均値が求められて、出力部２３に送られる。出力部２３
では各帯域のレベルを表示手段であるインジケーター
（図示せず）等で表示する。The reproducing unit 14 quantizes and descales the divided frequency sample 13c, and transmits it to the inverse mapping unit 16. The inverse mapping unit 16 performs frequency integration. Further, the maximum value or the average value of each band of the scale factor 13b or the inversely scaled frequency sample 15 for a certain fixed period, for example, 100 ms, is obtained and sent to the output unit 23. Output unit 23
Then, the level of each band is displayed by an indicator (not shown) which is a display means.

【００２２】さらに逆スケーリングされた周波数サンプ
ル１５は、ＩＦＦＴ（逆フーリエ変換）やＩＭＤＣＴ
（逆離散コサイン変換）などの周波数−時間軸変換の演
算を行う逆マッピング部１６により、周波数統合されて
時間軸のデータに変換され、さらにオーバーラップ等の
処理が施されて合成され、音響信号１７として復元され
る。Further, the inversely scaled frequency sample 15 is subjected to IFFT (Inverse Fourier Transform) or IMDCT.
An inverse mapping unit 16 that performs a frequency-time axis conversion operation such as (inverse discrete cosine transform) frequency-integrates and converts the data into time-axis data, further performs processing such as overlap and synthesizes the acoustic signal. Restored as 17.

【００２３】上記のように第１の実施例が構成され、再
生部１４では後述するように、スケールファクターある
いは周波数データを用いて、再生部１４への入力信号あ
るいは再生部１４から出力される信号のレンジ情報を得
ることができる。つまり、従来の如く、完全な復号が済
んだ後で、レンジ情報を得るための手段を設ける必要が
ない。The first embodiment is configured as described above, and the reproducing section 14 uses a scale factor or frequency data, as will be described later, to input a signal to the reproducing section 14 or a signal output from the reproducing section 14. The range information of can be obtained. That is, unlike the conventional technique, it is not necessary to provide a means for obtaining range information after complete decoding.

【００２４】図１（Ｂ）には、この発明の高能率復号化
装置の第２の実施例を示す。FIG. 1B shows a second embodiment of the high efficiency decoding apparatus of the present invention.

【００２５】入力されたビットストリーム１１は、デフ
ォーマット部１２よりアロケーション情報１３ａと、ス
ケールファクター１３ｂと周波数サンプル１３ｃをアロ
ケーション情報１３ａを参考に、分割し、再生部１４に
送られる。The input bit stream 11 is divided by the deformatting unit 12 into allocation information 13a, scale factor 13b and frequency samples 13c with reference to the allocation information 13a and sent to the reproducing unit 14.

【００２６】入力部２１では各帯域の増減比率を入力
し、コントロール信号２２を再生部１４に送る。再生部
１４ではコントロール信号２２により帯域ごとに増幅、
減衰を行い、分割された周波数サンプル１３ｃを量子
化、逆スケーリングし、逆マッピング部６に伝送する。The input section 21 inputs the increase / decrease rate of each band and sends the control signal 22 to the reproducing section 14. The reproduction unit 14 amplifies each band by the control signal 22,
The frequency sample 13 c that has been attenuated and divided is quantized, inversely scaled, and transmitted to the inverse mapping unit 6.

【００２７】逆スケーリングされた周波数サンプル１５
はＩＦＦＴやＩＭＤＣＴなどの周波数−時間軸変換の演
算を行う逆マッピング部１６により、時間軸のデータに
変換され、オーバーラップ等の処理が施されて合成さ
れ、音響信号１７として復元される。Descaled frequency samples 15
Is converted into time axis data by an inverse mapping unit 16 such as IFFT or IMDCT, which performs frequency-time axis conversion calculation, is subjected to processing such as overlap, is synthesized, and is restored as an acoustic signal 17.

【００２８】上記のように第２の実施例が構成され、再
生部１４では後述するように、入力部２１からのコント
ロール信号２２を用いて、各種の音声調整を得ることが
できる。つまり、従来の如く、完全な復号が済んだ後
で、音声調整を行うための手段を設ける必要がない。The second embodiment is configured as described above, and the reproducing unit 14 can obtain various audio adjustments by using the control signal 22 from the input unit 21 as described later. That is, unlike the conventional case, it is not necessary to provide a means for performing voice adjustment after complete decoding.

【００２９】次に、図１（Ａ）、図１（Ｂ）で実現され
ている再生部１４の各種例について説明する。Next, various examples of the reproducing section 14 realized in FIGS. 1A and 1B will be described.

【００３０】図２（Ｂ）は、図１（Ａ）の再生部１４の
処理ブロックを示している。FIG. 2B shows processing blocks of the reproducing section 14 of FIG. 1A.

【００３１】逆量子化部３４では、デフォーマットされ
た周波数サンプル１３ｃをアロケーション情報１３ａに
もとずき、逆量子化し、逆スケーリング部３６に伝送す
る。逆量子化は次の式により行う。The dequantization section 34 dequantizes the frequency-formatted frequency sample 13c based on the allocation information 13a, dequantizes it, and transmits it to the descaling section 36. Inverse quantization is performed by the following formula.

【００３２】ｓ′＝ｃ×（ｓ＋ｄ）ここでｃ，ｄは逆量子化係数といわれ、アロケーション
情報が示す量子化ステップ数により定まる。逆量子化係
数ｃ，ｄの例を図３に示す。S '= c * (s + d) Here, c and d are called inverse quantization coefficients and are determined by the number of quantization steps indicated by the allocation information. An example of the inverse quantized coefficients c and d is shown in FIG.

【００３３】係数算出器３７は、レンジ情報であるスケ
ールファクター１３ｂから逆スケーリングに必要な乗数
３８を、テーブルを参照、あるいは演算により求め、逆
スケーリング部３６に必要な乗数値３８を伝送する。ス
ケールファクター１３ｂは、６ビット、２ｄＢステップ
の離散値を表している場合、逆スケーリングに必要な乗
数値３８をｄＢ、つまり指数の値からテーブル等を利用
し、乗数値を算出しなくてはならない。その一例として
スケールファクター１３ｂと乗数値との関係を図４に示
す。逆スケーリング部３６では、逆量子化された信号を
係数算出器３７により求められた各バンドの乗数値３８
を乗算し出力する。The coefficient calculator 37 obtains a multiplier 38 required for inverse scaling from the scale factor 13b, which is range information, by referring to a table or by calculation, and transmits the required multiplier value 38 to the inverse scaling unit 36. When the scale factor 13b represents 6 bits and a discrete value of 2 dB steps, the multiplier value 38 required for inverse scaling must be calculated from dB, that is, the value of the exponent, using a table or the like. . As an example thereof, FIG. 4 shows the relationship between the scale factor 13b and the multiplier value. In the inverse scaling unit 36, the inversely quantized signal is multiplied by the multiplier value 38 of each band obtained by the coefficient calculator 37.
Is multiplied and output.

【００３４】（インジケーターとしての機能）図２
（Ｂ）において、スケールファクターは、比較器／加算
器４０にも入力される。比較器／加算器４０は、ある一
定期間、例えば５フレーム（１フレームは２０ｍｓとし
て）のスケールファクターの比較を行い、最大値を得
て、その期間の各帯域のレンジ情報２４として出力す
る。(Function as Indicator) FIG. 2
In (B), the scale factor is also input to the comparator / adder 40. The comparator / adder 40 compares the scale factors for a certain fixed period, for example, 5 frames (1 frame is 20 ms), obtains the maximum value, and outputs it as the range information 24 of each band in the period.

【００３５】または比較器／加算器４０はある一定期
間、例えば５フレーム（１フレームは２０ｍｓとして）
のスケールファクターの加算を行い、平均値（合計値）
を得て、その期間の各帯域のレンジ情報２４として出力
する。Alternatively, the comparator / adder 40 has a fixed period of time, for example, 5 frames (one frame is 20 ms).
Add the scale factors of and average (total value)
Is obtained and is output as the range information 24 of each band in the period.

【００３６】上記の例は再生部１４へ入力する入力信号
のレンジ情報２４を得る例として説明した。しかし、レ
ンジ情報は、再生部１４の出力信号から得る場合もあ
る。The above example has been described as an example of obtaining the range information 24 of the input signal input to the reproducing unit 14. However, the range information may be obtained from the output signal of the reproducing unit 14.

【００３７】図２（Ｃ）に示すように、逆スケーリング
部３６の出力が、比較器／加算器４０に供給される。比
較器／加算器４０は、例えば５フレームの逆スケーリン
グされた周波数サンプル１５の二乗の加算を行い、平均
値（合計値）を得て、その期間の各帯域のレンジ情報２
４として出力する。As shown in FIG. 2C, the output of the inverse scaling section 36 is supplied to the comparator / adder 40. The comparator / adder 40 performs square addition of the inversely scaled frequency samples 15 of, for example, 5 frames to obtain an average value (total value), and the range information 2 of each band in the period.
Output as 4.

【００３８】（イコライザーとしての機能）図５には、
図１（Ｂ）の再生部１４について示している。(Function as Equalizer) FIG.
The reproduction unit 14 of FIG. 1B is shown.

【００３９】逆量子化部３４ではデフォーマットされた
周波数サンプル１３ｃをアロケーション情報１３ａにも
とずき、逆量子化し、逆スケーリング部３６に伝送す
る。逆量子化は次の式により行う。The dequantization section 34 dequantizes the frequency-formatted sample 13c based on the allocation information 13a, and transmits it to the descaling section 36. Inverse quantization is performed by the following formula.

【００４０】ｓ′＝ｃ×（ｓ＋ｄ）ここでｃ，ｄは逆量子化係数といわれ、アロケーション
情報が示す、量子化ステップ数により定まる。逆量子化
係数ｃ，ｄを図３に示す。S '= c * (s + d) Here, c and d are called inverse quantization coefficients and are determined by the number of quantization steps indicated by the allocation information. The inverse quantization coefficients c and d are shown in FIG.

【００４１】係数算出器３７はレンジ情報であるスケー
ルファクター１３ｂから逆スケーリングに必要な乗数３
８を、テーブルを参照、あるいは演算により求め、逆ス
ケーリングに必要な乗数値３８を伝送する。スケールフ
ァクター１３ｂは６ビット、２ｄＢステップの離散値を
表している場合、逆スケーリングに必要な乗数値３８を
ｄＢ、つまり指数の値からを、テーブル等を利用し、乗
数値を算出しなくてはならない。その一例としてスケー
ルファクター１３ｂと乗数値との関係を図４に示す。The coefficient calculator 37 calculates the multiplier 3 required for inverse scaling from the scale factor 13b which is range information.
8 is obtained by referring to a table or by calculation, and a multiplier value 38 required for inverse scaling is transmitted. When the scale factor 13b represents 6 bits and a discrete value of 2 dB steps, the multiplier value 38 required for inverse scaling is calculated in dB from the exponent value by using a table or the like. I won't. As an example thereof, FIG. 4 shows the relationship between the scale factor 13b and the multiplier value.

【００４２】ここで入力値２２に応じて、帯域毎にスケ
ールファクター１３ｂあるいは乗数値３８を増減させ
る。またスケールファクターは離散値であるため、入力
値２２により次の値に変化させる場合、急激に変化する
ので、スケールファクター値を変えるのでなく、乗数値
に窓関数をかけることで、滑らかに変化（図６（Ａ）参
照）させ、ノイズの発生を低減させるようにしている。Here, the scale factor 13b or the multiplier value 38 is increased or decreased for each band according to the input value 22. Also, since the scale factor is a discrete value, when it is changed to the next value by the input value 22, it changes abruptly. Therefore, instead of changing the scale factor value, it is possible to smoothly change it by applying a window function to the multiplier value ( 6A), the generation of noise is reduced.

【００４３】逆スケーリング部３６では、逆量子化され
た信号を係数算出器３７により求められた各バンドの乗
数値３８を乗算し、出力する。The inverse scaling unit 36 multiplies the inversely quantized signal by the multiplier value 38 of each band obtained by the coefficient calculator 37, and outputs it.

【００４４】また図６（Ｂ）に示すように、乗数変化の
調整値テーブル４１を持ち、各種モードに選択するだけ
で、個々の帯域のレベル調整値４２を決定し、これを係
数算出器３７に与えるようにしても良い。Further, as shown in FIG. 6B, a level change value 42 for each band is determined only by having an adjustment value table 41 for the multiplier change and selecting various modes, and this is calculated by the coefficient calculator 37. You may give it to.

【００４５】以上、説明したように、各帯域のスケール
ファクター値、または乗数値を変化させることで様々な
アプリケーションを実現できる。As described above, various applications can be realized by changing the scale factor value or the multiplier value of each band.

【００４６】次に、上述したようなイコライザー機能を
応用したアプリケーションの各種について説明する。Next, various kinds of applications to which the equalizer function as described above is applied will be described.

【００４７】（ＬＲバランス）図１（Ｂ）の構成におい
て、入力部２１を調整する場合、各帯域ごとの値でな
く、幾つかの帯域をグルーピングして同時に変化させる
ことが可能である。(LR balance) In the configuration of FIG. 1B, when adjusting the input section 21, it is possible to group not only the values for each band but also several bands and change them simultaneously.

【００４８】図７は、入力部２１の入力パネルの例を示
している。例えば、チャンネルごといっせいに全バンド
のスケールファクターを変化させるような場合、Ｌチャ
ンネルならＬチャンネルだけのｔｏｔａｌ調整８１を増
減させたり、またはＬＲ調整８２を増減することでＬチ
ャンネルを下げながら、Ｒチャンネルを上げたりなど
で、ＬＲのバランス調整を行うようなアプリケーション
が可能である。FIG. 7 shows an example of the input panel of the input section 21. For example, when changing the scale factors of all the bands at the same time for each channel, if the L channel is used, the total adjustment 81 of only the L channel may be increased or decreased, or the LR adjustment 82 may be increased or decreased to lower the L channel while changing the R channel. An application that adjusts the balance of the LR by raising or the like is possible.

【００４９】（バスブースト）図１（Ｂ）の入力部２１
に、バスブーストモードを選択した情報を送ると、図６
（Ｂ）における調整値テーブル４１は、図８（Ａ）に示
すテーブルを適用し、スケールファクターを補正する。
これにより低減を強調した効果が実現する。つまり、調
整値テーブルは、周波数バンドに応じて低域を強調する
ゲインを得られるようにデータが設定されている。(Bass Boost) Input unit 21 of FIG. 1 (B)
If you send the information that selected the bass boost mode to
As the adjustment value table 41 in (B), the table shown in FIG. 8A is applied to correct the scale factor.
As a result, the effect emphasizing the reduction is realized. That is, data is set in the adjustment value table so that a gain that emphasizes the low frequency range can be obtained according to the frequency band.

【００５０】（ラウドネス補正）人間の聴覚は音の周波
数により感度が異なり、音圧を変化させた場合、聴感上
の変化は帯域によって異なっている。例えばヴォーリュ
ームを絞った場合など、高域と低域が中域に比べ、より
音量が下がった感じになる。それを補正することも容易
である。(Loudness Correction) Human auditory sense has different sensitivity depending on the frequency of the sound, and when the sound pressure is changed, the change in the auditory sense differs depending on the band. For example, when you turn down the volume, the high range and low range feel more quiet than the mid range. It is easy to correct it.

【００５１】図１（Ｂ）の入力部２１に、出力音響信号
のヴォーリューム（音圧レベル調整）の値、またはラウ
ドネスモードを選択した情報を送るようにする。このと
き、図６（Ｂ）に示した調整値テーブル４１には、ヴォ
ーリュームが低い場合、またはラウドネスモードが選択
された場合は、図８（Ｂ）に示すようなテーブルを送
り、低域と高域の乗数値を上げるように設定される。あ
るいはヴォーリューム量に応じて、適切なテーブルを選
択するか、演算により算出するようになっている。The value of the volume (sound pressure level adjustment) of the output acoustic signal or the information for selecting the loudness mode is sent to the input section 21 of FIG. 1 (B). At this time, when the volume is low or the loudness mode is selected, the adjustment value table 41 shown in FIG. 6 (B) sends a table as shown in FIG. It is set to increase the multiplier value in the high range. Alternatively, an appropriate table is selected or calculated by calculation according to the volume amount.

【００５２】（サウンドエフェクト；周波数チェンジャ
ー、音質変化）周波数サンプルをバンド間で交換する
（図９（Ａ）参照）、またはほかのバンドに複写するこ
とで人工的に倍音を作る（図９（Ｂ）参照）、または周
波数サンプルを帯域をずらすことでピッチシフトを実現
する（図９（Ｃ）参照）などの各種の特殊効果を実現す
ることができる。これは、量子化信号が、周波数バンド
で分かれた量子化係数であるために、その係数を可変し
たり、または、各係数の加算等が容易であるからであ
る。(Sound Effect; Frequency Changer, Tone Change) Frequency samples are exchanged between bands (see FIG. 9A) or copied to another band to artificially create overtones (FIG. 9B). )), Or by shifting the band of the frequency samples to realize pitch shift (see FIG. 9C), and other various special effects can be realized. This is because the quantized signal is a quantized coefficient divided in frequency bands, so that it is easy to change the coefficient or to add each coefficient.

【００５３】また、以上の操作を逆量子化部３４の前、
つまり周波数サンプル１３ｃの段階で、または逆量子化
の後、つまり信号３５の段階で、行ってもかまわない。Further, the above operation is performed before the inverse quantizer 34,
That is, it may be performed at the stage of the frequency sample 13c or after the inverse quantization, that is, at the stage of the signal 35.

【００５４】（エコー機能）図１０（Ａ）に示すように
周波数サンプル１０１と、フィードバックされた１０３
とを加算器１０２で加算して出力信号１０４とするとエ
コー効果を得ることができる。出力信号１０４はバッフ
ァ１０６に送られ、バッファ１０６は、その入力を外部
より設定された遅延量だけ遅らせて出力する。バッファ
１０６の出力は、減衰器１０５に入力され、外部より設
定された減衰率だけ減衰され、加算器１０２に送られ
る。以上、バンドごとに行い、バンドごとに異なった遅
延量、減衰率を設定し、エコーを実現することができ
る。(Echo function) As shown in FIG. 10A, the frequency sample 101 and the feedback 103
The echo effect can be obtained by adding and with the adder 102 to form the output signal 104. The output signal 104 is sent to the buffer 106, and the buffer 106 delays its input by an externally set delay amount and outputs it. The output of the buffer 106 is input to the attenuator 105, attenuated by an externally set attenuation rate, and sent to the adder 102. As described above, it is possible to perform echo for each band by setting different delay amounts and attenuation rates for each band.

【００５５】以上のエコー器は、図２の再生部の信号１
３ｃまたは、信号３５、または信号１５のいずれに対し
て行ってもよい。The above echo device is the same as the signal 1 of the reproducing section of FIG.
3c, signal 35, or signal 15.

【００５６】（各種ホール再現機能）図１０（Ａ）にお
けるエコー器において、バンドごとに異なった遅延量、
減衰率をテーブルとして、いくつかの代表値（サンプ
ル）を用意しておき、外部の選択入力によりそのテーブ
ルを選択してもよい。(Various Hall Reproducing Functions) In the echo device shown in FIG. 10 (A), different delay amounts for each band,
It is also possible to prepare some representative values (samples) using the attenuation rate as a table and select the table by external selection input.

【００５７】このようにすると、各種コンサートホール
などのデータ（テーブル）を用意して、選択することだ
け各種コンサートホールなどの状態を実現する。In this way, the data (table) of various concert halls and the like are prepared and the state of various concert halls and the like is realized only by selecting.

【００５８】（ラップ機能）図１０（Ａ）において、あ
る一定期間、例えば０．３秒の音声データバッファ１０
６に取り込み、バッファ１０６に取り込んだデータをく
り返し、例えば３回出力し、出力された信号は減衰器１
０５を通り、加算器１０２で周波数サンプルを加算せず
に、信号１０３をそのまま、信号１０４として出力する
ことでラップ機能を実現できる。つまり加算器１０２を
選択器として利用する。その後（この場合、０．９秒）
加算器１０２は、周波数サンプル１０１を信号１０３を
加算せずに、信号１０４として出力する。その０．３秒
のデータをバッファ１０６で取り込むといった動作を繰
り返すことでラップ効果が行える。この期間や、回数は
任意に設定できるようにする。また常に一定値をとらな
くとも、時間経過や、音声データの状態、例えば音圧が
下がった時をフレーズとして認識し、取り込み時間や、
回数を変えたりしてもよい。(Wrap Function) In FIG. 10A, the audio data buffer 10 for a certain fixed period, for example, 0.3 seconds.
6, the data captured in the buffer 106 is repeatedly output, for example, three times, and the output signal is the attenuator 1
It is possible to realize the wrap function by outputting the signal 103 as it is as the signal 104 without passing the frequency sample in the adder 102 through 05. That is, the adder 102 is used as a selector. After that (0.9 seconds in this case)
The adder 102 outputs the frequency sample 101 as the signal 104 without adding the signal 103. The wrap effect can be achieved by repeating the operation of fetching the 0.3 second data in the buffer 106. This period and the number of times can be set arbitrarily. Also, even if it does not always take a constant value, it recognizes the passage of time, the state of voice data, such as when the sound pressure has decreased, as a phrase,
You may change the number of times.

【００５９】図１０（Ｂ）にそのタイミング図を示す。FIG. 10B shows the timing chart.

【００６０】通常、再生部はＤＳＰ（デジタルシグナル
プロセッサ）等で処理されるため、ソフトウェアの変更
だけで、上記、各種のアプリケーションを実現できる。Since the reproducing unit is usually processed by a DSP (digital signal processor) or the like, the above various applications can be realized only by changing the software.

【００６１】[0061]

【発明の効果】以上説明したように本発明によれば、分
配フィルター、合成フィルターというハードウェアの追
加がなくても、各種のアプリケーションを実行できる。As described above, according to the present invention, various applications can be executed without the addition of distribution filters and synthesis filters.

[Brief description of drawings]

【図１】この発明の実施例を示すブロック図。FIG. 1 is a block diagram showing an embodiment of the present invention.

【図２】高能率符号化音声のビットストリームを示す図
及び図１の再生部の詳細を示すブロック図。FIG. 2 is a diagram showing a bit stream of high-efficiency coded audio and a block diagram showing details of a reproducing unit in FIG.

【図３】逆量子化係数の例を示す図。FIG. 3 is a diagram showing an example of inverse quantization coefficients.

【図４】スケールファクターの例を示す図。FIG. 4 is a diagram showing an example of a scale factor.

【図５】図１（Ｂ）の再生部の詳細を示すブロック図。FIG. 5 is a block diagram showing details of a reproducing unit in FIG.

【図６】スケールファクターを変化させるときの状態を
説明するために示した図及び再生部の他の例を示す図。FIG. 6 is a diagram shown for explaining a state when a scale factor is changed and a diagram showing another example of a reproducing unit.

【図７】入力パネルの例を示す図。FIG. 7 is a diagram showing an example of an input panel.

【図８】再生部の音声調整機能の例を説明するために示
した図。FIG. 8 is a diagram shown for explaining an example of a sound adjusting function of a reproducing unit.

【図９】再生部の他の音声調整機能の例を説明するため
に示した図。FIG. 9 is a diagram shown for explaining an example of another audio adjusting function of the reproducing unit.

【図１０】再生部のさらに他の音声調整機能の例を説明
するために示した図。FIG. 10 is a diagram shown for explaining an example of still another audio adjusting function of the reproducing unit.

【図１１】従来の高能率符号化音声信号の復号装置を示
す図。FIG. 11 is a diagram showing a conventional high-efficiency coded speech signal decoding apparatus.

[Explanation of symbols]

１２…デフォーマット、１４…再生部、１６…逆マッピ
ング部、２１…入力部、２３…出力部、３４…逆量子化
部、３６…逆スケーリング部、３７…係数算出器、４０
…比較器／加算器。12 ... Deformat, 14 ... Reproducing unit, 16 ... Inverse mapping unit, 21 ... Input unit, 23 ... Output unit, 34 ... Inverse quantization unit, 36 ... Inverse scaling unit, 37 ... Coefficient calculator, 40
... comparator / adder.

Claims

[Claims]

1. Deformatting means for separating and reformatting allocation information, scale factor, and frequency samples from a high-efficiency coded bit stream, and range information for outputting the maximum value of the scale factor for each constant period. Output means, display means for displaying the maximum value, dequantization means for dequantizing the frequency samples according to the scale factor, frequency integration of frequency samples dequantized by the dequantization means And a frequency unifying means for performing high efficiency coding speech decoding apparatus.

2. The means for outputting the maximum value, further comprising means for outputting the maximum value of the dequantized frequency samples for each constant period instead of the scale factor. High-efficiency coded speech decoding device.

3. The decoding apparatus for high efficiency coded speech according to claim 1, further comprising means for outputting an average value instead of the maximum value in the means for outputting the maximum value.

4. The decoding apparatus for high efficiency coded speech according to claim 2, further comprising means for outputting an average value instead of the maximum value in the means for outputting the maximum value.

5. Deformatting means for separating and reformatting allocation information, scale factor, and frequency samples from a high-efficiency coded bit stream, and input means for inputting values for operating the frequency samples. Adjusting means for changing the scale factor according to the data of the input means, and the scale factor obtained from the adjusting means,
An apparatus for decoding highly efficient coded speech, comprising: an inverse quantizer for inversely quantizing the frequency sample; and a means for frequency integrating the frequency samples inversely quantized by the inverse quantizer.

6. Decoding of high-efficiency coded speech according to claim 5, further comprising means for changing the dequantized frequency sample instead of the scale factor in the adjusting means for changing the scale factor. Device.