JPH10105198A

JPH10105198A - Speech encoding device

Info

Publication number: JPH10105198A
Application number: JP8261094A
Authority: JP
Inventors: Hisami Kanbayashi; 久美神林
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1996-10-02
Filing date: 1996-10-02
Publication date: 1998-04-24

Abstract

PROBLEM TO BE SOLVED: To provide the speech encoding device which can perform efficient input speech level control making good use of existent software and hardware resources for a high-compressibility speech encoding process. SOLUTION: An input speech control means 21 performs a block flooding process for a digital speech signal inputted from a digital speech signal input device 1 to calculate scheduling bits, which are stored in a shift register 25. After quantization, encoding is carried out. A level detection part 26 detects whether or not a frame having a maximum frame energy code calculated by a frame energy quantizing means 22 is inputted successively more than a specific number of times on the basis of control information stored in a control information storage part 27. When the level detection part detects the successive high- level input state, the shift register 25 subtracts '1' from the value of the stored scaling bits.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は音声符号化装置に関
し、特に入力レベル調整機能を有する音声符号化装置に
関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coding apparatus, and more particularly to a speech coding apparatus having an input level adjusting function.

【０００２】[0002]

【従来の技術】従来、この種の音声符号化装置において
は、符号化処理の前段階で専用ハードウェアを追加する
ことで、音声符号化のアルゴリズムとは関係なく、入力
音声のレベル調整を実現している。2. Description of the Related Art Conventionally, in this type of speech coding apparatus, level adjustment of input speech has been realized by adding dedicated hardware before the encoding process, regardless of the speech coding algorithm. doing.

【０００３】入力音声のレベル調整としては、図９に示
すように、アナログ／デジタル変換の際に、レベル調整
のためのしきい値を所定の値に調整することができるよ
うにした音声信号用のレベル調整システムがある。この
レベル調整システムはアナログ音声信号入力手段７と、
入力音声信号レベル調整装置８と、ディジタル音声信号
出力手段９とを備えている。As shown in FIG. 9, for adjusting the level of an input audio signal, a threshold value for adjusting an audio signal is adjusted to a predetermined value at the time of analog / digital conversion. There is a level adjustment system. This level adjustment system comprises an analog audio signal input means 7;
An input audio signal level adjusting device 8 and digital audio signal output means 9 are provided.

【０００４】入力音声信号レベル調整装置８は入力信号
を調整する可変利得増幅器８１と、可変利得増幅器８１
への制御電圧を発生する応答・減衰タイミング発生器８
２と、応答・減衰タイミング発生器８２のトリガ出力を
発生するピーク値検出比較器８３と、ピーク値検出比較
器８３のしきい値を超える信号の持続時間を測定するし
きい値復帰タイマ８４と、入力信号に対する電力利得を
増幅する駆動増幅器８５とを備えている。The input audio signal level adjusting device 8 includes a variable gain amplifier 81 for adjusting an input signal, and a variable gain amplifier 81.
And decay timing generator 8 for generating control voltage to
2, a peak value detection comparator 83 for generating a trigger output of the response / decay timing generator 82, and a threshold value recovery timer 84 for measuring the duration of a signal exceeding the threshold value of the peak value detection comparator 83. And a drive amplifier 85 for amplifying the power gain with respect to the input signal.

【０００５】上記のレベル調整システムにおいて、ピー
ク値検出比較器８３はしきい値復帰タイマ８４によっ
て、最初は高しきい値状態に設定される。信号レベルと
入力信号のピーク値とが増大すると、ピーク値検出比較
器８３は応答・減衰タイミング発生器８２に対してトリ
ガパルスを出力し始める。In the above level adjustment system, the peak value detection comparator 83 is initially set to the high threshold state by the threshold value recovery timer 84. When the signal level and the peak value of the input signal increase, the peak value detection comparator 83 starts outputting a trigger pulse to the response / decay timing generator 82.

【０００６】同時に、しきい値復帰タイマ８４はピーク
値検出比較器８３のしきい値を超える信号の持続時間の
測定を始め、予め選択されている時間にわたって増力さ
れた出力のレベルがピーク値検出比較器８３のしきい値
を超えたことをしきい値復帰タイマ８４が検出すると、
ピーク値検出比較器８３の圧縮しきい値は低レベルに切
替えられる。At the same time, the threshold recovery timer 84 begins measuring the duration of the signal exceeding the threshold of the peak value detection comparator 83, and the level of the output, which has been boosted over a preselected period of time, is used to detect the peak value. When the threshold value recovery timer 84 detects that the threshold value of the comparator 83 has been exceeded,
The compression threshold value of the peak value detection comparator 83 is switched to a low level.

【０００７】ピーク値検出比較器８３のしきい値は連続
信号がなくなるまで低レベルにとどまる。この時点で回
路は正常動作状態に復帰する。この技術については、特
開平２−２７８９４５号公報に開示されている。[0007] The threshold value of the peak value detection comparator 83 remains at a low level until there is no continuous signal. At this point, the circuit returns to a normal operating state. This technique is disclosed in JP-A-2-278945.

【０００８】上記のレベル調整システムを高圧縮率音声
符号化装置に反映させた例を図１０に示す。図１０にお
いて、高圧縮率音声符号化装置はアナログ音声信号入力
手段１１と、上記のレベ調整システムに相当する入力音
声信号レベル調整装置８と、ディジタル音声信号入力手
段１２と、音声符号化装置１３と、ディジタル符号化音
声信号出力手段１４とを備えている。FIG. 10 shows an example in which the above-described level adjustment system is applied to a high-compression-rate speech coding apparatus. In FIG. 10, a high-compression-rate audio encoding device includes an analog audio signal input unit 11, an input audio signal level adjusting device 8 corresponding to the above-described level adjustment system, a digital audio signal input unit 12, and an audio encoding device 13. And digitally encoded audio signal output means 14.

【０００９】アナログ音声信号入力手段１１によって入
力されたアナログ音声信号は入力音声信号レベル調整装
置８によって入力レベルが調整された後、ディジタル音
声信号入力手段１２から音声符号化装置１３に入力され
る。このアナログ音声信号は音声符号化装置１３にて任
意の高圧縮率音声符号化処理が施された後、ディジタル
符号化音声信号出力手段１４から出力される。The analog audio signal input by the analog audio signal input means 11 is input to the audio encoding device 13 from the digital audio signal input means 12 after the input level is adjusted by the input audio signal level adjusting device 8. This analog audio signal is subjected to arbitrary high compression rate audio encoding processing in the audio encoding device 13 and then output from the digitally encoded audio signal output means 14.

【００１０】また、他の入力音声のレベル調整として
は、図１１に示すように、ディジタル音声信号を所定の
レベルに調整して出力するための入力音声レベル調整制
御方式がある。この入力音声レベル調整制御方式による
システムの構成はディジタル音声信号入力手段１５と、
入力音声信号レベル調整装置１６と、ディジタル音声信
号出力手段１７とを備えている。As another level adjustment of input audio, there is an input audio level adjustment control method for adjusting a digital audio signal to a predetermined level and outputting the same as shown in FIG. The system configuration based on the input audio level adjustment control system includes a digital audio signal input unit 15 and
An input audio signal level adjusting device 16 and digital audio signal output means 17 are provided.

【００１１】入力音声信号レベル調整装置１６はディジ
タル音声信号のレベルを増幅する増幅回路１６１と、デ
ィジタル音声信号の平均レベルを検出するレベル検出回
路１６２と、この増幅回路１６１の利得を設定する利得
設定部１６４と、ディジタル音声信号の平均レベルとい
き値設定レジスタ１６５に設定されたいき値とを比較す
る比較回路１６３とを備える。The input audio signal level adjusting device 16 includes an amplifier circuit 161 for amplifying the level of the digital audio signal, a level detection circuit 162 for detecting the average level of the digital audio signal, and a gain setting for setting the gain of the amplifier circuit 161. And a comparator 163 for comparing the average level of the digital audio signal with the threshold set in the threshold setting register 165.

【００１２】このシステムにおいて、利得設定部１６４
は比較回路１６３による比較結果に基づいて利得を設定
し、所定のレベルとなるようにディジタル音声信号を増
幅回路１６１によって増幅する。In this system, the gain setting section 164
Sets the gain based on the comparison result by the comparison circuit 163, and amplifies the digital audio signal by the amplification circuit 161 so as to have a predetermined level.

【００１３】また、増幅回路１６１から出力されるディ
ジタル音声信号の平均レベルをレベル検出回路１６２に
よって検出し、かついき値設定レジスタ１６５に設定す
るいき値を順次更新する。The average level of the digital audio signal output from the amplification circuit 161 is detected by the level detection circuit 162, and the threshold set in the threshold setting register 165 is sequentially updated.

【００１４】その値と平均レベル検出回路１６２による
平均レベルとを比較回路１６３で比較し、平均レベルが
いき値を超える回数の大小判定によって増幅回路１６１
の利得を設定する。The value and the average level detected by the average level detection circuit 162 are compared by a comparison circuit 163, and the amplification circuit 161 is determined based on the number of times the average level exceeds the threshold.
Set the gain of

【００１５】また、増幅回路１６１から出力されるディ
ジタル音声信号の平均レベルをレベル検出回路１６２に
よって検出し、かついき値設定レジスタ１６５に設定す
るいき値を固定し、その値と平均レベル検出回路１６２
による平均レベルとを比較回路１６３で比較し、平均レ
ベルがいき値を超える回数が所定数となるように、増幅
回路１６１の利得を設定する。この技術については、特
開平４−１２７７５８号公報に開示されている。The average level of the digital audio signal output from the amplifying circuit 161 is detected by a level detecting circuit 162, and a threshold value set in a threshold value setting register 165 is fixed.
The comparison circuit 163 compares the average level with the average level according to the above, and sets the gain of the amplification circuit 161 so that the number of times the average level exceeds the threshold value becomes a predetermined number. This technique is disclosed in Japanese Patent Application Laid-Open No. 4-127758.

【００１６】上記のシステムを高圧縮率音声符号化装置
に反映させた例を図１２に示す。図１２において、高圧
縮率音声符号化装置はディジタル音声信号入力手段１８
と、上記のシステムに相当する入力音声信号レベル調整
装置１６と、音声符号化装置１９と、ディジタル符号化
音声信号出力手段２０とを備えている。FIG. 12 shows an example in which the above system is applied to a high-compression-rate speech coding apparatus. In FIG. 12, a high-compression-ratio speech coding apparatus includes digital speech signal input means 18.
And an input audio signal level adjusting device 16 corresponding to the above system, an audio encoding device 19, and digitally encoded audio signal output means 20.

【００１７】ディジタル音声信号入力手段１８から入力
されたディジタル音声信号は入力音声信号レベル調整装
置１６によって入力レベルが調整された後、音声符号化
装置１９に入力される。ディジタル音声信号は音声符号
化装置１９において任意の高圧縮率音声符号化処理を施
された後、ディジタル符号化音声信号出力手段２０から
出力される。The digital audio signal input from the digital audio signal input means 18 is input to the audio encoding device 19 after the input level is adjusted by the input audio signal level adjusting device 16. The digital audio signal is subjected to an arbitrary high compression rate audio encoding process in the audio encoding device 19, and then output from the digital encoded audio signal output means 20.

【００１８】[0018]

【発明が解決しようとする課題】ディジタル携帯電話等
のように高圧縮率音声符号化を行う音声符号化装置にと
っては装置の小型化・低消費電力化と同時に、符号化理
論にマッチした符号化音声品質の向上が重要な課題であ
る。特に、高圧縮率音声符号化においては、入力音声の
レベルが符号化の限度を超えて高い場合、符号化音声の
品質が著しく劣化する。SUMMARY OF THE INVENTION For a speech coding apparatus such as a digital cellular phone which performs speech coding at a high compression rate, the coding is matched with the coding theory while the size and power consumption of the apparatus are reduced. Improving voice quality is an important issue. In particular, in high-compression-rate audio coding, if the level of the input audio is higher than the limit of the encoding, the quality of the encoded audio is significantly deteriorated.

【００１９】しかしながら、上述した従来の技術では、
入力音声のレベル調整方法において音声符号化処理プロ
グラムの処理内容を利用していないので、音声符号化方
法にマッチした方法で符号化音声の品質向上を図ること
はできない。However, in the above-mentioned conventional technology,
Since the processing content of the audio encoding processing program is not used in the input audio level adjustment method, the quality of the encoded audio cannot be improved by a method that matches the audio encoding method.

【００２０】また、上述した従来の技術では、入力音声
のレベル調整方法において音声符号化装置に入力制御処
理用のハードウェアを新たに追加して実現する構成とな
っているので、装置の小型化及び低消費電力化を図るこ
とができない。Further, in the above-mentioned conventional technique, the input audio level adjusting method is configured to newly realize hardware for input control processing in the audio encoding apparatus, so that the apparatus can be downsized. In addition, power consumption cannot be reduced.

【００２１】そこで、本発明の目的は上記の問題点を解
消し、高圧縮率音声符号化処理として既存のソフトウェ
ア及びハードウェア資源を利用した効率的な入力音声レ
ベル制御を行うことができる音声符号化装置を提供する
ことにある。Therefore, an object of the present invention is to solve the above-mentioned problems and to provide a speech codec capable of performing efficient input speech level control using existing software and hardware resources as speech processing at a high compression rate. To provide a chemical conversion device.

【００２２】[0022]

【課題を解決するための手段】本発明による音声符号化
装置は、入力音声データを高圧縮率音声符号化処理によ
って符号化する音声符号化装置であって、前記高圧縮率
音声符号化処理中に算出されかつ前記入力音声データの
分析フレーム単位の電力パワーを示すフレームエネルギ
コードを監視する監視手段と、前記監視手段の監視結果
を基に前記高圧縮率音声符号化処理において前記入力音
声データのレベル制御を行う手段とを備えている。According to the present invention, there is provided an audio encoding apparatus for encoding input audio data by high-compression-rate audio encoding processing. Monitoring means for monitoring a frame energy code which is calculated and indicates the power of the input audio data in analysis frame units; and a high compression rate audio encoding process based on the monitoring result of the monitoring means. Means for performing level control.

【００２３】上記の如く、本発明の音声符号化装置では
既存の音声符号化処理プログラム中で算出されるパラメ
ータを利用し、ソフトウェアによって入力音声レベル制
御を実現している。より具体的には、音声符号化処理プ
ログラム中でフレームエネルギを計算して量子化するフ
レームエネルギ量子化手段の算出結果であるフレームエ
ネルギコード（以下、Ｒ０コードとする）をレベル調整
手段で監視し、その監視結果に基づいて入力音声スケー
リングビット値を増減したり、入力音声データをシフト
することで入力音声データのレベル制御を行っている。As described above, in the speech coding apparatus of the present invention, the input speech level control is realized by software using the parameters calculated in the existing speech coding processing program. More specifically, a frame energy code (hereinafter, referred to as an R0 code), which is a calculation result of a frame energy quantizing unit that calculates and quantizes frame energy in a speech encoding processing program, is monitored by a level adjusting unit. The level of the input audio data is controlled by increasing or decreasing the input audio scaling bit value or shifting the input audio data based on the monitoring result.

【００２４】また、入力音声レベル制御を、既存の音声
符号化処理プログラム及びハードウェアを利用して実現
する。より具体的には、フレームエネルギ量子化手段の
算出結果であるＲ０コードをレベル調整手段で監視し、
その監視結果に基づいて入力音声データの次データに対
してアナログレベルでのレベル調整を行うことで入力音
声データのレベル制御を行っている。The input audio level control is realized by using an existing audio coding processing program and hardware. More specifically, the R0 code, which is the calculation result of the frame energy quantization means, is monitored by the level adjustment means,
Based on the monitoring result, the level control of the input audio data is performed by adjusting the level of the next data of the input audio data at the analog level.

【００２５】レベル調整手段では高圧縮率音声符号化処
理プログラム中で生成されるＲ０コードの最大値の連続
状態を監視しており、音声の連続的な高レベル入力状態
を示す指標としてＲ０コードの最大値が連続する時、レ
ベル調整手段が高圧縮率音声符号化処理プログラム内部
または音声入力手段であるハードウェアへの外部コマン
ド発行の形でレベル調整を行う。これによって、既存の
ソフトウェアプログラムを利用した効率的な入力音声レ
ベル制御を行うことが可能となる。The level adjusting means monitors the continuous state of the maximum value of the R0 code generated in the high-compression-rate audio encoding processing program, and uses the R0 code as an index indicating the continuous high-level input state of the audio. When the maximum value is continuous, the level adjusting means adjusts the level in the form of an external command issued to the high-compression-rate audio encoding processing program or to hardware as audio input means. This makes it possible to perform efficient input audio level control using an existing software program.

【００２６】[0026]

【発明の実施の形態】次に、本発明の実施例について図
面を参照して説明する。図１は本発明の一実施例の構成
を示すブロック図である。図において、本発明の一実施
例によるレベル調整システムはディジタル音声信号入力
手段１と、ＶＳＥＬＰ（ＶｅｃｔｏｒＳｕｍＥｘｃｉ
ｔｅｄＬｉｎｅａｒＰｒｅｄｉｃｔｉｏｎ）等の高
圧縮率音声符号化を実現する音声符号化装置２と、伝送
路誤り符号化手段（図示せず）への入力となるディジタ
ル符号化音声信号出力手段３とを含んで構成されてい
る。Next, an embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing the configuration of one embodiment of the present invention. In the figure, a level adjustment system according to an embodiment of the present invention includes a digital audio signal input means 1 and a VSELP (Vector Sum Exci).
and a digital coded voice signal output means 3 which is an input to a transmission path error coding means (not shown), which realizes a high compression rate voice coding such as ted Linear Prediction. It is configured.

【００２７】音声符号化装置２は入力音声制御手段２１
と、フレームエネルギ量子化手段２２と、線形予測分析
・ピッチ抽出・音源コードブック探索・ゲイン量子化等
の処理を行う音声処理手段２３と、コードマルチプレク
サ２４と、シフトレジスタ２５と、レベル検出部２６
と、制御情報記憶部２７とを備えている。The speech encoding device 2 has input speech control means 21
, A frame energy quantization means 22, a speech processing means 23 for performing processes such as linear prediction analysis, pitch extraction, sound source codebook search, and gain quantization, a code multiplexer 24, a shift register 25, and a level detection section 26.
And a control information storage unit 27.

【００２８】入力音声制御手段２１はディジタル音声信
号入力装置１から入力されたディジタル音声信号に対し
てブロックフローディング処理を行い、音声処理の単位
である１フレーム（＝１６０サンプル）のデータ毎のス
ケーリング（正規化）ビットを算出する。The input voice control means 21 performs block loading processing on the digital voice signal input from the digital voice signal input device 1, and performs scaling for each data of one frame (= 160 samples) as a unit of voice processing. Calculate (normalized) bits.

【００２９】ここで、ブロックフローディング処理とは
データメモリや乗算器の入力データのビット幅以上のビ
ット精度を持ち、例えば音声信号に代表されるような互
いにダイナミックレンジが相関を持ってまとまった数値
データ群に対して共通のスケール値（ブロックスケール
値）で正規化し、メモリに退避したりあるいは乗算器の
入力に使用する際に数値データの演算精度を保つ処理を
いう。Here, the block loading processing is a numerical value having a bit precision equal to or greater than the bit width of input data of a data memory or a multiplier and having a dynamic range correlated with each other as represented by, for example, an audio signal. This is a process of normalizing a data group by a common scale value (block scale value) and keeping the calculation accuracy of numerical data when saving the data in a memory or using it for an input of a multiplier.

【００３０】入力音声制御手段２１は算出したスケーリ
ングビットをシフトレジスタ２５に格納する。シフトレ
ジスタ２５に格納されたスケーリングビットはフレーム
エネルギ量子化手段２２及び音声処理手段２３において
利用される。The input voice control means 21 stores the calculated scaling bits in the shift register 25. The scaling bits stored in the shift register 25 are used in the frame energy quantization means 22 and the audio processing means 23.

【００３１】フレームエネルギ量子化手段２２は入力音
声データのフレーム毎の電力パワー（＝フレームエネル
ギ）を計算し、量子化した後に符号化する。すなわち、
エネルギ値Ｒ（０）は、Ｒ（０）＝｛［φ（０，０）＋φ（ＮP ，ＮP ）］／２
（ＮA −ＮP ）｝という式から求められる。ここで、φ（ｊ，ｋ）は共分
散（自己相関）行列を示しており、ＮA はサンプル数
を、ＮP は予測の次数を夫々示している。The frame energy quantization means 22 calculates the power (= frame energy) of the input voice data for each frame, and quantizes and then encodes. That is,
The energy value R (0) is given by: R (0) = {[φ (0,0) + φ (NP, NP)] / 2
(NA-NP)}. Here, φ (j, k) indicates a covariance (autocorrelation) matrix, NA indicates the number of samples, and NP indicates the order of prediction.

【００３２】フレームエネルギ量子化手段２２では上記
の式によって求められたエネルギ値Ｒ（０）をフルスケ
ールＲｍａｘを基準として次式のように変換する（フル
スケールＲｍａｘは最大サンプル振幅の二乗と定義す
る）。すなわち、ＲdB＝１０ｌｏｇ₁₀［Ｒ（０）／Ｒｍａｘという式でＲdBを算出し、このＲdBを３２レベルに量子
化してフレームエネルギコード（以下、Ｒ０コードとす
る）を得る。The frame energy quantizing means 22 converts the energy value R (0) obtained by the above equation into the following equation based on the full scale Rmax (the full scale Rmax is defined as the square of the maximum sample amplitude). ). That is, RdB is calculated by the following equation: RdB = ₁₀ log ₁₀ [R (0) / Rmax], and this RdB is quantized to 32 levels to obtain a frame energy code (hereinafter referred to as an R0 code).

【００３３】尚、上記のフレームエネルギ量子化手段２
２によるフレームエネルギの算出動作は「デジタル方式
自動車電話システム標準規格ＲＣＲＳＴＤ−２７Ｄ
第１分冊」（財団法人電波システム開発センター
刊、平成７年６月２７日改訂、ｐ．５９４〜６０４）に
詳述されている。The above frame energy quantization means 2
2 for calculating the frame energy is described in “Digital Car Phone System Standard RCR STD-27D
1st Volume "(published by the Radio System Development Center, revised on June 27, 1995, pp. 594 to 604).

【００３４】音声処理手段２３はＶＳＥＬＰ等の高圧縮
率音声符号化の音声処理の中心である線形予測分析、ピ
ッチ抽出、音源コードブック探索、ゲイン量子化の一連
の処理を行う。The voice processing means 23 performs a series of processes such as linear prediction analysis, pitch extraction, sound source codebook search, and gain quantization, which are the core of voice processing of high compression rate voice coding such as VSELP.

【００３５】コードマルチプレクサ２４はフレームエネ
ルギ量子化手段２２及び音声処理手段２３各々の出力で
ある符号を統合し、誤り訂正処理等の入力データとなる
符号化音声データを形成する。The code multiplexer 24 integrates the codes output from the frame energy quantizing means 22 and the sound processing means 23 to form coded sound data as input data for error correction processing and the like.

【００３６】シフトレジスタ２５は入力音声制御手段２
１で算出されたスケーリングビットを格納する。また、
シフトレジスタ２５はレべル検出部２６において連続す
る高レベル入力を検出した場合、格納しているスケーリ
ングビットの値から「１」を減ずる。フレームエネルギ
量子化手段２２及び音声処理手段２３ではこの再格納し
たスケーリングビット値を演算に使用する。The shift register 25 is provided for the input voice control means 2
The scaling bit calculated in step 1 is stored. Also,
When the level detection unit 26 detects a continuous high level input, the shift register 25 subtracts “1” from the stored scaling bit value. The frame energy quantizing unit 22 and the audio processing unit 23 use the restored scaling bit value for calculation.

【００３７】レベル検出部２６はＲ０コードが最大値
（＝３１）であるようなフレームが所定の回数以上に連
続的に入力されていることを検出する。レベル検出部２
６が連続的な高レベル入力状態を検出した場合、シフト
レジスタ２５は格納しているスケーリングビットの値か
ら「１」を減ずる。The level detecting section 26 detects that a frame whose R0 code has the maximum value (= 31) is continuously input more than a predetermined number of times. Level detector 2
When 6 detects a continuous high level input state, the shift register 25 subtracts "1" from the stored scaling bit value.

【００３８】制御情報記憶部２７はレベル検出部２６に
よる制御情報（レベル制御中／レベル非制御中）を記憶
する。これらの情報はレベル検出部２６で検出された入
力レベルの変化がレベル制御を引き起こすものであるか
否かの判定に用いられる。The control information storage section 27 stores control information (during level control / during level non-control) by the level detection section 26. These pieces of information are used to determine whether or not the change in the input level detected by the level detection section 26 causes level control.

【００３９】図２は図１の音声符号化装置２の動作を示
すフローチャートである。これら図１及び図２を参照し
て、音声符号化装置２の動作について説明する。まず、
ディジタル音声信号入力手段１から与えられた入力信号
は入力音声制御手段２１に供給される。FIG. 2 is a flowchart showing the operation of the speech coding apparatus 2 of FIG. The operation of the speech encoding device 2 will be described with reference to FIGS. First,
The input signal given from the digital audio signal input means 1 is supplied to the input audio control means 21.

【００４０】入力音声制御手段２１はこの入力信号に対
してブロックフローティング処理（入力音声データ群を
まとめて正規化するスケーリングビットを算出するこ
と）を行い、算出したスケーリングビットをシフトレジ
スタ２５に格納する（図２ステップＳ１）。The input voice control means 21 performs block floating processing (calculation of scaling bits for collectively normalizing the input voice data group) on the input signal, and stores the calculated scaling bits in the shift register 25. (Step S1 in FIG. 2).

【００４１】フレームエネルギ量子化手段２２は入力音
声のフレームエネルギを算出し、量子化して０から３１
までの３２レベルのＲ０コードに符号化する（図２ステ
ップＳ２）。The frame energy quantizing means 22 calculates the frame energy of the input voice, quantizes the frame energy from 0 to 31.
(Step S2 in FIG. 2).

【００４２】レベル検出部２６は制御情報記憶部２７で
記憶している制御情報（制御状態フラグＦｌａｇ）の値
を識別する（図２ステップＳ３）。すなわち、制御情報
記憶部２７の制御情報は現在の状態がレベル制御中であ
る場合にＦｌａｇ＝１、レベル制御中でない場合にＦｌ
ａｇ＝０となっている。尚、このＦｌａｇの初期値は０
とする。The level detector 26 identifies the value of the control information (control state flag) stored in the control information storage 27 (step S3 in FIG. 2). That is, the control information in the control information storage unit 27 is Flag = 1 when the current state is under level control, and FL1 when the current state is not under level control.
ag = 0. The initial value of this Flag is 0.
And

【００４３】レベル検出部２６はＦｌａｇの値が１ある
いは０であっても、引き続いてＲ０コードの値が最大値
の３１であるかどうかを判定する（図２ステップＳ４，
Ｓ１０）。The level detector 26 determines whether the value of the R0 code is the maximum value 31 even if the value of the flag is 1 or 0 (step S4 in FIG. 2).
S10).

【００４４】レベル検出部２６はステップＳ４において
Ｒ０コードの値が最大値の３１に満たない場合、Ｒ０カ
ウンタの値をクリアする（図２ステップＳ５）。尚、Ｒ
０カウンタの初期値は０に設定されている。When the value of the R0 code is less than the maximum value of 31 in step S4, the level detector 26 clears the value of the R0 counter (step S5 in FIG. 2). Note that R
The initial value of the 0 counter is set to 0.

【００４５】また、レベル検出部２６はステップＳ４に
おいてＲ０コードの値が最大値の３１であった場合、Ｒ
０カウンタの値をインクリメントし（図２ステップＳ
６）、Ｒ０カウンタの値が所定値Ｘｈ以上になったかど
うかを判定する（図２ステップＳ７）。ここで、所定値
Ｘｈは入力音声が定常的に高レベルであることを検出す
るためのＲ０コードのスレッショルド値である。If the value of the R0 code is the maximum value of 31 in step S4,
The value of the 0 counter is incremented (step S in FIG. 2).
6) It is determined whether the value of the R0 counter is equal to or greater than a predetermined value Xh (step S7 in FIG. 2). Here, the predetermined value Xh is a threshold value of the R0 code for detecting that the input voice is constantly at a high level.

【００４６】レベル検出部２６はＲ０カウンタの値が所
定値Ｘｈ以上になると、Ｆｌａｇの値を１に設定し（図
２ステップＳ８）、レベル調整機能を作動させる。すな
わち、レベル検出部２６はシフトレジスタ２５に格納さ
れているスケーリングビットの値を１デクリメント（−
１）する（図２ステップＳ９）。デクリメントしてシフ
トレジスタ２５に再格納されたスケーリングビットは音
声処理手段２３において用いられるので、結果的に入力
音声レベルを抑制した音声処理が行われる。When the value of the R0 counter exceeds a predetermined value Xh, the level detection unit 26 sets the value of Flag to 1 (step S8 in FIG. 2) and activates the level adjustment function. That is, the level detector 26 decrements the value of the scaling bit stored in the shift register 25 by 1 (−
1) Do (Step S9 in FIG. 2). Since the scaling bits that have been decremented and re-stored in the shift register 25 are used in the audio processing means 23, as a result, audio processing in which the input audio level is suppressed is performed.

【００４７】高レベル入力が連続して制御状態フラグＦ
ｌａｇが１になった後、Ｒ０コードの値がさらに連続し
て３１を示すと、レベル検出部２６はＲ０カウンタの値
を所定値Ｘｈに保ち（図２ステップＳ１１）、レベル調
整を続行する（図２ステップＳ９）。The control state flag F is continuously output at a high level.
After the lag becomes 1, if the value of the R0 code further continuously indicates 31, the level detector 26 keeps the value of the R0 counter at the predetermined value Xh (step S11 in FIG. 2) and continues the level adjustment (step S11 in FIG. 2). FIG. 2 step S9).

【００４８】一方、レベル検出部２６はステップＳ１０
において制御状態フラグＦｌａｇが１の状態でＲ０コー
ドが最大値３１未満になったことを検出すると、Ｒ０カ
ウンタの値を１デクリメントし（図２ステップＳ１
２）、所定値Ｘｌと比較する（図２ステップＳ１３）。
所定値ＸＩは入力音声が定常的に低レベルであることを
検出するためのＲ０コードのスレッショルド値である。On the other hand, the level detector 26 determines in step S10
When it is detected that the R0 code is less than the maximum value 31 while the control state flag Flag is 1, the value of the R0 counter is decremented by 1 (step S1 in FIG. 2).
2) Compare with a predetermined value Xl (step S13 in FIG. 2).
The predetermined value XI is a threshold value of the R0 code for detecting that the input voice is constantly at a low level.

【００４９】レベル検出部２６はＲ０カウンタの値が所
定値Ｘｌ以上である場合、そのままレべル調整を続行す
る（図２ステップＳ９）。また、レベル検出部２６はス
テップＳ１３において低レベル入力が連続し、Ｒ０カウ
ンタの値が所定値Ｘｌ未満になったことを検出すると、
Ｆｌａｇの値を０に設定してＲ０カウンタをクリアす
る。この状態になると、レベル検出部２６はレベル調整
機能を停止させ、レベル調整を行わない。When the value of the R0 counter is equal to or larger than the predetermined value X1, the level detector 26 continues the level adjustment (step S9 in FIG. 2). When the level detection unit 26 detects in step S13 that the low level input has continued and the value of the R0 counter has become less than the predetermined value Xl,
The value of Flag is set to 0, and the R0 counter is cleared. In this state, the level detection unit 26 stops the level adjustment function and does not perform the level adjustment.

【００５０】上記のように、高圧縮率音声符号化処理の
中で算出されるＲ０コードを用いてレベル検出部２６で
連続的な高レベル入力状態を判定し、その結果を同じく
音声符号化処理の中で用いられるスケーリングビットに
反映させることによって、既存のプログラムを利用した
効率的な入力音声レベル制御を行うことができる。ま
た、従来の入力制御処理用のハードウェアが不要となる
ので、装置の小型化及び低消費電力化を図ることができ
る。As described above, the continuous high-level input state is determined by the level detection unit 26 using the R0 code calculated in the high-compression-rate audio encoding process, and the result is similarly used in the audio encoding process. By reflecting the result on the scaling bits used in the above, efficient input voice level control using an existing program can be performed. In addition, since conventional input control processing hardware is not required, the size and power consumption of the device can be reduced.

【００５１】図３は本発明の一実施例によるレベル制御
の動作を説明するための図である。これら図１〜図３を
参照して、本発明の一実施例によるレベル制御の動作を
詳細に説明する。尚、所定値Ｘｈ，Ｘｌはその値を任意
に設定することが可能となっている。FIG. 3 is a diagram for explaining the level control operation according to one embodiment of the present invention. The operation of the level control according to the embodiment of the present invention will be described in detail with reference to FIGS. The predetermined values Xh and Xl can be set arbitrarily.

【００５２】図３の（１）以前では入力音声フレームの
レベルがＲ０コードの３１未満であり、このときの処理
は図２のステップＳ３→Ｓ４→Ｓ５というルートであ
る。続いて、図３の（１）においては、Ｒ０コードが最
大値の３１であるようなフレームの連続的な入力が開始
される。この場合の処理は図２のステップＳ３→Ｓ４→
Ｓ６→Ｓ７というルートである。Before (1) in FIG. 3, the level of the input voice frame is less than 31 of the R0 code, and the processing at this time is a route of steps S3 → S4 → S5 in FIG. Subsequently, in (1) of FIG. 3, continuous input of frames in which the R0 code is the maximum value of 31 is started. The process in this case is performed in steps S3 → S4 → FIG.
The route is S6 → S7.

【００５３】図３の（２）はＲ０カウンタの値が所定値
Ｘｈに達する前にＲ０コードが再び３１未満になった状
態を示す。このときの処理は再び図２のステップＳ３→
Ｓ４→Ｓ５というルートに戻る。FIG. 3 (2) shows a state in which the R0 code becomes less than 31 again before the value of the R0 counter reaches the predetermined value Xh. The process at this time is performed again in step S3 in FIG.
Return to the route of S4 → S5.

【００５４】図３の（３）は連続的な高レベル入力によ
り、Ｒ０カウンタの値が所定値Ｘｈに達したことを示
す。このときの処理は図２のステップＳ３→Ｓ４→Ｓ６
→Ｓ７→Ｓ８→Ｓ９というルートに進む。すなわち、Ｆ
ｌａｇを１に設定し、レベル制御機能として、シフトレ
ジスタ２５に格納されている現フレームのスケーリング
ビットを１デクリメントする。この後はステップＳ３→
Ｓ１０→Ｓ１１→Ｓ９というルートで処理が続行され
る。FIG. 3 (3) shows that the value of the R0 counter has reached the predetermined value Xh due to the continuous high level input. The processing at this time is performed in steps S3 → S4 → S6 in FIG.
Go to the route of → S7 → S8 → S9. That is, F
lag is set to 1 and the scaling bit of the current frame stored in the shift register 25 is decremented by 1 as a level control function. After this, step S3 →
The processing is continued along the route of S10 → S11 → S9.

【００５５】図３の（４）は連続的な高レベル入力状態
において一時的にＲ０コードが３１未満になった場合を
示す。このときの処理は図２のステップＳ１０→Ｓ１２
→Ｓ１３→Ｓ９というルートで実行される。FIG. 3D shows a case where the R0 code temporarily becomes less than 31 in a continuous high level input state. The process at this time is performed in steps S10 → S12 in FIG.
It is executed in the route of → S13 → S9.

【００５６】図３の（５）はＲ０カウンタの値が所定値
Ｘｌに達する前にＲ０コードが再び３１になった状態を
示す。このときの処理は再び図２のステップＳ３→Ｓ１
０→Ｓ１１→Ｓ９というルートに戻る。FIG. 3 (5) shows a state in which the R0 code becomes 31 again before the value of the R0 counter reaches the predetermined value X1. The process at this time is performed again in step S3 → S1 in FIG.
Return to the route of 0 → S11 → S9.

【００５７】図３の（６）は連続的な低レベル入力によ
り、Ｒ０カウンタの値が所定値Ｘｌに達したことを示
す。このときの処理は図２のステップＳ３→Ｓ１０→Ｓ
１２→Ｓ１３→Ｓ１４→Ｓ１５というルートに進む。す
なわち、Ｆｌａｇを０に設定し、レベル制御機能を行わ
ない。この後、ステップＳ３→Ｓ４→Ｓ５というルート
で処理が続行される。FIG. 3 (6) shows that the value of the R0 counter has reached the predetermined value Xl due to the continuous low level input. The process at this time is performed in steps S3 → S10 → S in FIG.
It goes to the route of 12 → S13 → S14 → S15. That is, Flag is set to 0, and the level control function is not performed. Thereafter, the processing is continued along the route of steps S3 → S4 → S5.

【００５８】図４は本発明の他の実施例の構成を示すブ
ロック図である。図において、本発明の他の実施例によ
るレベル調整システムはディジタル音声信号入力手段１
と、音声符号化装置４と、ディジタル符号化音声信号出
力手段３とを含んで構成されている。FIG. 4 is a block diagram showing the configuration of another embodiment of the present invention. In the figure, a level adjustment system according to another embodiment of the present invention is a digital audio signal input means 1.
, A voice coding device 4 and a digitally coded voice signal output means 3.

【００５９】音声符号化装置４は入力音声制御手段４１
と、フレームエネルギ量子化手段４２と、音声処理手段
４３と、コードマルチプレクサ４４と、シフトレジスタ
４５と、レベル検出部４６と、制御情報記憶部４７とを
備えている。The speech encoding device 4 is provided with an input speech control means 41.
, A frame energy quantizing unit 42, an audio processing unit 43, a code multiplexer 44, a shift register 45, a level detecting unit 46, and a control information storage unit 47.

【００６０】上記の構成において、音声符号化装置４で
はレベル検出部４６が入力音声制御手段４１を直接制御
する以外は、上記の図１に示す音声符号化装置２と同様
の構成となっており、その動作も同様である。In the above configuration, the speech encoding apparatus 4 has the same configuration as that of the speech encoding apparatus 2 shown in FIG. 1 except that the level detection unit 46 directly controls the input speech control means 41. The operation is the same.

【００６１】レベル検出部４６はＲ０コードが最大値
（＝３１）であるようなフレームが所定の回数以上に連
続的に入力されていることを検出する。レベル検出部４
６は連続的な高レベル入力状態を検出した場合、入力音
声制御手段４１において分析フレームの次のフレームの
入力データ群を左に１ビットシフトすることによってレ
ベル調整を行う。The level detecting section 46 detects that a frame in which the R0 code has the maximum value (= 31) is continuously input more than a predetermined number of times. Level detector 4
When a continuous high-level input state is detected, the input voice control means 41 performs level adjustment by shifting the input data group of the frame next to the analysis frame by one bit to the left.

【００６２】図５は図４の音声符号化装置４の動作を示
すフローチャートである。これら図４及び図５を参照し
て、音声符号化装置４の動作について説明する。尚、図
５においてステップＳ２１〜Ｓ２４及びＳ３８，Ｓ３９
で示されるディジタル音声信号入力手段１、音声符号化
装置４、ディジタル符号化音声信号出力手段３各々にお
ける動作は、図１に示すディジタル音声信号入力手段
１、音声符号化装置２、ディジタル符号化音声信号出力
手段３各々における動作（図２のステップＳ１〜Ｓ４及
びＳ１６，Ｓ１７で示される動作）と同一であるので、
その説明は省略する。FIG. 5 is a flowchart showing the operation of the speech coding apparatus 4 of FIG. The operation of the speech encoding device 4 will be described with reference to FIGS. In FIG. 5, steps S21 to S24 and S38 and S39 are performed.
The operation of each of the digital voice signal input means 1, voice coder 4 and digital coded voice signal output means 3 shown in FIG. Since the operation in each of the signal output means 3 (the operation shown in steps S1 to S4 and S16 and S17 in FIG. 2) is the same,
The description is omitted.

【００６３】レベル検出部４６はステップＳ２４におい
てＲ０コードが最大値の３１に満たない場合、Ｒ０ｈカ
ウンタの値をクリアする（図５ステップＳ２５）。尚、
Ｒ０ｈカウンタは高レベル入力のフレーム数をカウント
するためのカウンタであり、初期値は０とする。When the R0 code is less than the maximum value of 31 in step S24, the level detector 46 clears the value of the R0h counter (step S25 in FIG. 5). still,
The R0h counter is a counter for counting the number of high-level input frames, and its initial value is 0.

【００６４】レベル検出部４６はステップＳ２４におい
てＲ０コードの値が最大値の３１であった場合、Ｒ０ｈ
カウンタの値をインクリメント（＋１）し（図５ステッ
プＳ２６）、Ｒ０ｈカウンタの値が所定値Ｘｈ以上にな
ったかどうかを判定する（図５ステップＳ２７）。所定
値Ｘｈは入力音声が定常的に高レベルであることを検出
するためのＲ０コードのスレッショルド値である。If the value of the R0 code is the maximum value of 31 in step S24, the level detector 46 sets
The value of the counter is incremented (+1) (step S26 in FIG. 5), and it is determined whether or not the value of the R0h counter is equal to or more than a predetermined value Xh (step S27 in FIG. 5). The predetermined value Xh is a threshold value of the R0 code for detecting that the input voice is constantly at a high level.

【００６５】レベル検出部４６はＲ０ｈカウンタの値が
所定値Ｘｈ以上になった場合、Ｆｌａｇの値を１に設定
し（図５ステップＳ２８）、Ｒ０ｈカウンタの値をクリ
アする（図５ステップＳ２９）。続いて、レベル検出部
４６はステップＳ３６においてＦｌａｇの値を判定する
ことによってレベル調整機能を作動する。When the value of the R0h counter is equal to or more than the predetermined value Xh, the level detector 46 sets the value of Flag to 1 (step S28 in FIG. 5) and clears the value of the R0h counter (step S29 in FIG. 5). . Subsequently, the level detector 46 activates the level adjustment function by determining the value of Flag in step S36.

【００６６】本発明の一実施例ではシフトレジスタ２５
の値をデクリメントすることで、レベル調整を行ってい
る。これに対し、本発明の他の実施例では入力音声制御
手段４１にレベル検出部４６の判定結果をフィードバッ
クし、分析フレームの次のフレームの入力データ群を右
に１ビットシフトすることによってレベル調整を行って
いる（図５ステップＳ３７）。In one embodiment of the present invention, the shift register 25
The level is adjusted by decrementing the value of. On the other hand, in another embodiment of the present invention, the determination result of the level detection section 46 is fed back to the input voice control means 41, and the input data group of the frame next to the analysis frame is shifted by one bit to the right to adjust the level. (Step S37 in FIG. 5).

【００６７】レベル検出部４６は高レベル入力が連続し
て制御状態フラグＦｌａｇが１になった後、Ｒ０コード
の値が所定値Ｒ０Ｌｏｗであることを検出する（図５
ステップＳ３０）。所定値Ｒ０Ｌｏｗは入力音声が低
レベルであることを認識するための任意のＲ０コードの
値である。The level detector 46 detects that the value of the R0 code is a predetermined value R0 Low after the control state flag Flag is set to 1 after the high level input continues (FIG. 5).
Step S30). The predetermined value R0 Low is an arbitrary R0 code value for recognizing that the input voice is at a low level.

【００６８】レベル検出部４６は連続的な高レベル入力
状態（Ｆｌａｇ＝１）に続く低レベル入力により、Ｒ０
コード＜Ｒ０Ｌｏｗであったとき（図５ステップＳ３
０）、Ｒ０ｌカウンタの値をインクリメントし（ステッ
プＢ１２）、Ｒ０ｌカウンタの値が所定値Ｘｌ以上かど
うかを判定する（図５ステップＳ３３）。所定値Ｘｌは
入力音声信号が定常的に低レベルであることを検出する
ためのＲ０コードのスレッショルド値である。The level detector 46 outputs R0 by a low level input following a continuous high level input state (Flag = 1).
When code <R0 Low (step S3 in FIG. 5)
0), the value of the R01 counter is incremented (step B12), and it is determined whether the value of the R01 counter is equal to or more than a predetermined value X1 (step S33 in FIG. 5). The predetermined value Xl is a threshold value of the R0 code for detecting that the input audio signal is constantly at a low level.

【００６９】レベル検出部４６はＲ０ｌカウンタの値が
所定値Ｘｌ以上である場合、Ｆｌａｇの値を０に設定し
（図５ステップＳ３４）、Ｒ０ｌカウンタの値をクリア
する（図５ステップＳ３５）。続いて、レベル検出部４
６はステップＳ３６においてＦｌａｇの値を判定するこ
とによってレベル調整機能を停止する。When the value of the R01 counter is equal to or larger than the predetermined value X1, the level detector 46 sets the value of the Flag to 0 (step S34 in FIG. 5) and clears the value of the R01 counter (step S35 in FIG. 5). Subsequently, the level detector 4
6 stops the level adjustment function by determining the value of Flag in step S36.

【００７０】上述したように、本発明の他の実施例では
Ｒ０コードを用いるという本発明の一実施例の効果に加
え、シフトレジスタ４５を介することなく入力音声デー
タに対して直接的にレベル調整を行うという構成上、既
存のプログラム構成をそのまま利用し、そのプログラム
構成に一部条件分岐のルーチンを加えるだけで、効率的
な入力音声レベル制御を行うことができる。As described above, in another embodiment of the present invention, in addition to the effect of the embodiment of the present invention in which the R0 code is used, the level of the input audio data is directly adjusted without using the shift register 45. In this configuration, efficient input voice level control can be performed only by using an existing program configuration as it is and adding only a conditional branching routine to the program configuration.

【００７１】図６は本発明の他の実施例によるレベル制
御の動作を説明するための図である。これら図４〜図６
を参照して、本発明の他の実施例によるレベル制御の動
作を詳細に説明する。図６においてはＲ０コード、制御
状態フラグＦｌａｇ、Ｒ０ｈカウンタの値及びＲ０ｌカ
ウンタの値の時間推移の様子を示している。図５及び図
６中のＲ０Ｌｏｗ及び所定値Ｘｈ，Ｘｌの値は、任意
に設定することが可能である。また、図６の（１１）か
ら（１６）のタイミングは図３に示す（１）から（６）
のタイミングに相当する。FIG. 6 is a diagram for explaining an operation of level control according to another embodiment of the present invention. These FIGS. 4 to 6
The operation of the level control according to another embodiment of the present invention will be described in detail with reference to FIG. FIG. 6 shows how the R0 code, the control state flag Flag, the value of the R0h counter, and the value of the R01 counter change over time. The values of R0 Low and the predetermined values Xh and Xl in FIGS. 5 and 6 can be set arbitrarily. The timings (11) to (16) in FIG. 6 are shown in (1) to (6) in FIG.
Timing.

【００７２】図６が図３と異なる点はレベル調整がどの
分析フレームに対して行われるかという点及び連続的な
高レベル入力状態における低レベル入力の検出方法の２
点である。その他の詳細な内容は本発明の一実施例と同
様であるので、その説明は省略する。FIG. 6 is different from FIG. 3 in that the level adjustment is performed for which analysis frame and the method 2 for detecting the low level input in the continuous high level input state.
Is a point. The other details are the same as those of the embodiment of the present invention, and the description thereof is omitted.

【００７３】まず、レベル調整の対象フレームについ
て、本発明の一実施例では図３の「レべル調整」を現フ
レームに対して行う。それに対し、本発明の他の実施例
では図６の「レベル調整」を次フレームに対して行って
いる（図５ステップＳ３７）。First, for a frame to be subjected to level adjustment, in one embodiment of the present invention, "level adjustment" in FIG. 3 is performed on the current frame. On the other hand, in another embodiment of the present invention, the “level adjustment” in FIG. 6 is performed for the next frame (step S37 in FIG. 5).

【００７４】また、連続的高レベル入力状態における低
レベル入力の検出方法について、本発明の一実施例では
高レベル入力（Ｒ０コード＝３１）のフレーム数をカウ
ントするＲ０カウンタの値に対して状態を判定するため
の所定値Ｘｈ，Ｘｌが設けられている。それに対し、本
発明の他の実施例では高レベル入力（Ｒ０コード＝３
１）のフレーム数をカウントするＲ０ｈカウンタの値に
対して定常的状態を判別するための所定値Ｘｈを設け、
また低レベル入力（Ｒ０コード＝Ｒ０Ｌｏｗ）のフレ
ーム数をカウントするＲ０ｌカウンタの値に対して定常
的状態を判別するための所定値Ｘｌを設けている。In a method of detecting a low-level input in a continuous high-level input state, in one embodiment of the present invention, the state of the R0 counter for counting the number of frames of the high-level input (R0 code = 31) is determined according to the state. There are provided predetermined values Xh and Xl for judging. On the other hand, in another embodiment of the present invention, the high level input (R0 code = 3)
A predetermined value Xh for determining a steady state is provided for the value of the R0h counter for counting the number of frames in 1),
In addition, a predetermined value Xl for determining a steady state is provided for a value of an R01 counter for counting the number of frames of a low-level input (R0 code = R0 Low).

【００７５】図７は本発明の別の実施例の構成を示すブ
ロック図である。図において、本発明の別の実施例によ
るレベル調整システムはディジタル音声信号入力手段５
と、音声符号化装置６と、ディジタル符号化音声信号出
力手段３とを含んで構成されている。FIG. 7 is a block diagram showing the configuration of another embodiment of the present invention. In the figure, a level adjustment system according to another embodiment of the present invention includes digital audio signal input means 5.
, An audio encoding device 6 and digitally encoded audio signal output means 3.

【００７６】音声符号化装置６は入力音声制御手段６１
と、フレームエネルギ量子化手段６２と、音声処理手段
６３と、コードマルチプレクサ６４と、シフトレジスタ
６５と、レベル検出部６６と、制御情報記憶部６７とを
備えている。The speech encoding device 6 includes an input speech control means 61
, A frame energy quantizing unit 62, an audio processing unit 63, a code multiplexer 64, a shift register 65, a level detecting unit 66, and a control information storing unit 67.

【００７７】上記の構成において、音声符号化装置６で
はレベル検出部６６がディジタル音声信号入力手段５を
直接制御する以外は、上記の図４に示す音声符号化装置
４と同様の構成となっており、その動作も同様である。
尚、本発明の別の実施例ではディジタル音声信号入力手
段５が外部コマンド制御等によるレベル調整手段（図示
せず）を備えているものとする。In the above configuration, the configuration of the speech encoding apparatus 6 is the same as that of the speech encoding apparatus 4 shown in FIG. 4 except that the level detecting section 66 directly controls the digital speech signal input means 5. The operation is the same.
In another embodiment of the present invention, it is assumed that the digital audio signal input means 5 has a level adjusting means (not shown) by external command control or the like.

【００７８】レベル検出部６６はＲ０コードが最大値
（＝３１）であるようなフレームが所定の回数以上に連
続的に入力されているかどうかを判定する。レベル検出
部６６は連続的な高レベル入力状態を検出した場合、デ
ィジタル音声信号入力手段５に含まれるレベル調整手段
を直接制御することによってレベル調整を行う。The level detecting section 66 determines whether or not a frame in which the R0 code has the maximum value (= 31) is continuously input more than a predetermined number of times. When detecting a continuous high-level input state, the level detection unit 66 performs level adjustment by directly controlling the level adjustment unit included in the digital audio signal input unit 5.

【００７９】図８は図７の音声符号化装置６の動作を示
すフローチャートである。これら図７及び図８を参照し
て、音声符号化装置６の動作について説明する。尚、図
８においてステップＳ５７を除く部分で示されるディジ
タル音声信号入力手段５、音声符号化装置６、ディジタ
ル符号化音声信号出力手段３各々の動作は、本発明の他
の実施例のディジタル音声信号入力手段１、音声符号化
装置４、ディジタル符号化音声信号出力手段３各々の動
作と同一であるため、その説明は省略する。FIG. 8 is a flowchart showing the operation of the speech encoding device 6 of FIG. The operation of the speech encoding device 6 will be described with reference to FIGS. The operations of the digital audio signal input means 5, the audio encoding device 6, and the digitally encoded audio signal output means 3 shown in FIG. 8 except for step S57 are the same as those of the digital audio signal of the other embodiment of the present invention. The operations are the same as those of the input means 1, the voice coding device 4, and the digitally coded voice signal output means 3, and the description thereof will be omitted.

【００８０】本発明の一実施例では連続的な高レベル入
力を認識した場合、レベル調整機能としてシフトレジス
タ２５の値を１デクリメントしている（図２ステップＳ
９）。また、本発明の他の実施例ではこのような場合、
入力音声制御手段４１にその結果をフィードバックし、
分析フレームの次のフレームの入力データ群を右に１ビ
ットシフトすることによってレベル調整を行っている
（図５ステップＳ３７）。In the embodiment of the present invention, when a continuous high level input is recognized, the value of the shift register 25 is decremented by 1 as a level adjusting function (step S in FIG. 2).
9). Further, in another embodiment of the present invention, in such a case,
The result is fed back to the input voice control means 41,
The level adjustment is performed by shifting the input data group of the frame next to the analysis frame to the right by one bit (step S37 in FIG. 5).

【００８１】これに対し、本発明の別の実施例ではディ
ジタル音声信号入力手段５の外部コマンドにその結果を
フィードバックし、分析フレームの次のフレームに対し
てアナログレベルでの入力音声レベル調整を行っている
（図８ステップＳ５７）。On the other hand, in another embodiment of the present invention, the result is fed back to an external command of the digital audio signal input means 5, and the input audio level is adjusted at the analog level for the next frame of the analysis frame. (Step S57 in FIG. 8).

【００８２】上述したように、本発明の別の実施例では
Ｒ０コードを用いるという本発明の一実施例の効果に加
え、ディジタル音声信号入力手段５の外部コマンド制御
等によるレベル調整手段をＲ０コードと連携させて利用
することによって、ハードウェアの機能を有効的に活用
し、かつ既存のプログラム構成をそのまま用いて符号化
の性能を劣化させることなく、効率的な入力音声レベル
制御を行うことができる。As described above, in another embodiment of the present invention, in addition to the effect of the embodiment of the present invention in which the R0 code is used, the level adjusting means by external command control or the like of the digital audio signal input means 5 is used. By using this function in conjunction with, it is possible to effectively utilize the hardware functions and to perform efficient input audio level control without deteriorating the coding performance using the existing program configuration as it is. it can.

【００８３】この本発明の別の実施例によるレベル制御
の動作については、図６をそのまま使用して説明するこ
とができる。すなわち、本発明の別の実施例によるレベ
ル制御の動作についての詳細な内容は本発明の他の実施
例と同じであるので、その説明は省略する。The operation of the level control according to another embodiment of the present invention can be described with reference to FIG. That is, the detailed contents of the operation of the level control according to another embodiment of the present invention are the same as those of the other embodiments of the present invention, and the description thereof will be omitted.

【００８４】本発明の一実施例では図３の「レベル調
整」を現フレームに対して行っている。これに対して、
本発明の別の実施例では図６の「レベル調整」を次フレ
ームに対して行っている（図８ステップＳ５７）。その
際、レベル調整の方法はディジタル音声信号入力手段５
の外部コマンド入力によるレベル調整の機能及び構成に
依存する。In one embodiment of the present invention, "level adjustment" in FIG. 3 is performed on the current frame. On the contrary,
In another embodiment of the present invention, "level adjustment" in FIG. 6 is performed on the next frame (step S57 in FIG. 8). At this time, the level adjustment method is the digital audio signal input means 5.
Depends on the function and configuration of level adjustment by input of an external command.

【００８５】このように、ソフトウェアプログラムで実
行される高圧縮率音声符号化処理の中で算出されるＲ0
コードを入力レベル制御用のパラメータとして用い、Ｒ
0 コードを監視することで連続的な高レベル入力状態を
判定し、その結果を同じく音声符号化処理の中で用いら
れるスケーリングビットに反映させることによって、入
力レベルを監視するための手段を音声符号化装置の外部
に別途設けることなく、既存のソフトウェアプログラム
を利用した効率的な入力音声レベル制御を行うことがで
きる。よって、従来の入力制御処理用のハードウェアが
不要となるので、装置の小型化及び低消費電力化を図る
ことができる。As described above, R0 calculated in the high-compression-rate speech encoding process executed by the software program
Using the code as a parameter for input level control,
By monitoring the 0 code, a continuous high-level input state is determined, and the result is reflected in the scaling bits used in the voice coding process, thereby providing a means for monitoring the input level to the voice code. It is possible to perform efficient input sound level control using an existing software program without separately providing the same outside the coding device. This eliminates the need for conventional hardware for input control processing, thereby reducing the size and power consumption of the device.

【００８６】また、Ｒ0 コードを監視することで連続的
な高レベル入力状態を判定し、その結果に基づいて、シ
フトレジスタ４５を介することなく、入力音声データに
対して直接的にレベル調整を行うという構成をとること
によって、スケーリングビット算出を含む既存のプログ
ラム構成に、条件分岐のルーチンを追加することで効率
的な入力音声レベル制御を行うことができる。Also, by monitoring the R0 code, a continuous high level input state is determined, and based on the result, the level is directly adjusted for the input audio data without passing through the shift register 45. With this configuration, it is possible to perform efficient input voice level control by adding a conditional branch routine to an existing program configuration including scaling bit calculation.

【００８７】さらに、Ｒ0 コードを監視することで連続
的な高レベル入力状態を判定し、その結果に基づいて、
ディジタル音声信号入力手段５の外部コマンド制御等に
よるレベル調整手段をＲ0 コードと連携させて利用する
ことによって、ハードウェアの機能を有効的に活用し、
かつ既存のプログラム構成をそのまま用いて、符号化の
性能を劣化させることなく効率的な入力音声レベル制御
を行うことができる。Further, a continuous high-level input state is determined by monitoring the R0 code, and based on the result,
By using the level adjustment means by external command control or the like of the digital audio signal input means 5 in cooperation with the R0 code, the hardware functions can be effectively utilized,
In addition, it is possible to perform efficient input audio level control without deteriorating the encoding performance using the existing program configuration as it is.

【００８８】[0088]

【発明の効果】以上説明したように本発明によれば、入
力音声データを高圧縮率音声符号化処理によって符号化
する音声符号化装置において、高圧縮率音声符号化処理
中に算出されかつ入力音声データの分析フレーム単位の
電力パワーを示すフレームエネルギコードを監視し、そ
の監視結果を基に高圧縮率音声符号化処理において入力
音声データのレベル制御を行うことによって、高圧縮率
音声符号化処理として既存のソフトウェア及びハードウ
ェア資源を利用した効率的な入力音声レベル制御を行う
ことができるという効果がある。As described above, according to the present invention, in a speech encoding apparatus for encoding input speech data by high-compression-rate speech encoding processing, the speech data is calculated during the high-compression-rate speech encoding processing and input. Analysis of audio data The frame energy code indicating the power of each frame is monitored, and based on the monitoring result, the level control of the input audio data is performed in the high-compression-rate audio encoding processing, whereby the high-compression-rate audio encoding processing is performed. As a result, there is an effect that efficient input voice level control can be performed using existing software and hardware resources.

[Brief description of the drawings]

【図１】本発明の一実施例の構成を示すブロック図であ
る。FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention.

【図２】図１の音声符号化装置の動作を示すフローチャ
ートである。FIG. 2 is a flowchart illustrating an operation of the speech encoding device in FIG. 1;

【図３】本発明の一実施例によるレベル制御の動作を説
明するための図である。FIG. 3 is a diagram for explaining an operation of level control according to one embodiment of the present invention.

【図４】本発明の他の実施例の構成を示すブロック図で
ある。FIG. 4 is a block diagram showing a configuration of another embodiment of the present invention.

【図５】図４の音声符号化装置の動作を示すフローチャ
ートである。FIG. 5 is a flowchart showing an operation of the speech encoding device in FIG. 4;

【図６】本発明の他の実施例によるレベル制御の動作を
説明するための図である。FIG. 6 is a diagram for explaining an operation of level control according to another embodiment of the present invention.

【図７】本発明の別の実施例の構成を示すブロック図で
ある。FIG. 7 is a block diagram showing a configuration of another embodiment of the present invention.

【図８】図７の音声符号化装置の動作を示すフローチャ
ートである。FIG. 8 is a flowchart showing an operation of the speech encoding device in FIG. 7;

【図９】従来例の構成の一例を示すブロック図である。FIG. 9 is a block diagram showing an example of a configuration of a conventional example.

【図１０】図９のレベル調整システムを高圧縮率音声符
号化装置に反映させた例を示す図である。FIG. 10 is a diagram showing an example in which the level adjustment system of FIG. 9 is applied to a high-compression-rate audio encoding device.

【図１１】従来例の構成の他の例を示すブロック図であ
る。FIG. 11 is a block diagram showing another example of the configuration of the conventional example.

【図１２】図１１のシステムを高圧縮率音声符号化装置
に反映させた例を示す図である。FIG. 12 is a diagram showing an example in which the system of FIG. 11 is reflected on a high-compression-rate audio encoding device.

[Explanation of symbols]

１，５ディジタル音声信号入力手段２，４，６音声符号化装置３ディジタル符号化音声信号出力手段２１，４１，６１入力音声制御手段２２，４２，６２フレームエネルギ量子化手段２３，４３，６３音声処理手段２４，４４，６４コードマルチプレクサ２５，４５，６５シフトレジスタ２６，４６，６６レベル検出部２７，４７，６７制御情報記憶部 1,5 Digital audio signal input means 2,4,6 Audio encoding device 3 Digitally encoded audio signal output means 21,41,61 Input audio control means 22,42,62 Frame energy quantization means 23,43,63 Audio Processing means 24, 44, 64 Code multiplexer 25, 45, 65 Shift register 26, 46, 66 Level detector 27, 47, 67 Control information storage

Claims

[Claims]

1. An audio encoding apparatus for encoding input audio data by a high-compression-rate audio encoding process, comprising: Monitoring means for monitoring a frame energy code indicating electric power; and means for controlling the level of the input audio data in the high-compression-rate audio encoding process based on the monitoring result of the monitoring means. Audio coding device.

2. The level control unit according to claim 1, wherein the level control unit controls the level of the input voice data by increasing or decreasing an input voice scaling bit value used in the high compression rate voice coding process based on a monitoring result of the monitoring unit. 2. The speech encoding device according to claim 1, wherein

3. The level control unit according to claim 1, wherein the level control unit controls the level of the input audio data by shifting the input audio data based on a monitoring result of the monitoring unit. A speech encoding device according to claim 1.

4. A means for performing the level control, wherein the input sound data is reflected by a command for instructing a level adjustment at an analog level with respect to the next data of the input sound data on the basis of a monitoring result of the monitoring means. 2. The speech encoding apparatus according to claim 1, wherein level control is performed.

5. The level control unit, wherein the frame energy code is set to be equal to or more than a first predetermined value and equal to or less than a second predetermined value by the monitoring unit. 5. The speech encoding apparatus according to claim 1, wherein a level control of the input speech data is performed when the speech data is continuous.