JPH04367899A

JPH04367899A - Agc control system of voice recognition device

Info

Publication number: JPH04367899A
Application number: JP3170621A
Authority: JP
Inventors: Shuji Kubota; 修司久保田; Harutake Yasuda; 安田　晴剛
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1991-06-14
Filing date: 1991-06-14
Publication date: 1992-12-21

Abstract

PURPOSE:To make the amplitude and the variation point of suppression most suitable by placing an AGC circuit after a feature pickup section and by AGC controlling based upon the output (power spectrum) signals. CONSTITUTION:An AGC control section 24 is located in the next stage of a power spectrum generating section 23. Means 25, 26, and 27, which control gain constants against each power spectrum by the voice power signals which are obtained by low pass filtering the voice power in the total of power spectrum then integrated, a comparing means, which judges decrease and increase of the voice power signals, and a means, which switches time constants of each of the low pass filters for each case, are provided. When the voice signal is small, a certain gain is given to amplify the signal and when the signal is large, it is suppressed.

Description

[Detailed description of the invention]

【０００１】0001

【技術分野】本発明は、音声認識装置のＡＧＣ制御方式
、より詳細には、入力音声を周波数分析し、音声パター
ンを生成する音声認識装置におけるＡＧＣ制御方式に関
する。TECHNICAL FIELD The present invention relates to an AGC control method for a speech recognition device, and more particularly to an AGC control method for a speech recognition device that frequency-analyzes input speech and generates a speech pattern.

【０００２】0002

【従来技術】音声認識装置のＡＧＣ（自動利得制御）は
話者の声の大きさを正規化するために用いられ、例えば
、声の小さい人や、マイクロフォンから口元までの距離
が離れていて信号が小さく入力されていたりする場合に
は信号を引き上げ、逆に、声の大きい人や信号が大きく
入力される場合には抑圧して、その信号レベルを正規化
しようとするものである。図１０に、音声信号のＡＧＣ
（自動利得制御）の基本構成を示す。入力された信号は
、Ｌ．Ｐ．Ｆ（ローパスフィルタ）２で低周波成分（パ
ワー信号）が抽出され、その信号が可変電流素子を用い
た可変利得制御部（ＡＧＣ）１に加えられ、これにより
音声信号のゲインが変えられ、音声信号の出力が制御さ
れる。つまり、信号レベルが小さい時は利得を大きくと
って引き上げ、信号レベルが大きい時は逆に抑圧する。その信号の入出力特性は図１１に示す様に、Ａ点を境に
変化する。つまりＡ点までの小信号に対しては一定利得
で信号増幅を行い、Ａ点以上の信号については利得を１
以下として抑圧する。入力される信号はＡ点に対し適切
なレベルで入ってくることが望ましいが、人間の音声の
レベル変動は大きく、更に、マイクの装着の仕方などで
も大きく異ってくる。従って、その入力信号のレベルの
中心値がほぼＢ点にくる様にしておけば（ＡとＢは約２
０〜３０ｄＢ程度）音声のレベルバランスは良くなる。[Prior Art] AGC (automatic gain control) of a speech recognition device is used to normalize the loudness of a speaker's voice. The system attempts to normalize the signal level by raising the signal when a person's voice is loud or when a large signal is being input, and suppressing it when a person has a loud voice or a large signal is being input. Figure 10 shows the AGC of the audio signal.
The basic configuration of (automatic gain control) is shown. The input signal is L. P. A low-frequency component (power signal) is extracted by an F (low-pass filter) 2, and the signal is applied to an variable gain control section (AGC) 1 using a variable current element, which changes the gain of the audio signal and controls the audio signal. The output of the signal is controlled. In other words, when the signal level is low, the gain is increased to raise it, and when the signal level is high, it is suppressed. The input/output characteristics of the signal change from point A as shown in FIG. In other words, signal amplification is performed at a constant gain for small signals up to point A, and the gain is reduced to 1 for signals above point A.
Suppress as below. It is desirable that the input signal be at a level appropriate for point A, but the level of human voice fluctuates widely, and the way the microphone is attached also varies greatly. Therefore, if you set the center value of the input signal level to be approximately at point B (A and B are approximately 2
(about 0 to 30 dB) improves the audio level balance.

【０００３】図１２は、従来の音声認識装置の一例を説
明するための図で、図中、１１はマイクロフォン、１２
はマイクアンプ、１３はＡＧＣ（自動利得制御回路）、
１４はＡ／Ｄ変換器、１５は特徴抽出部（パワースペク
トル生成部）、１６はパターン生成部、１７は認識部、
１８は認識用辞書、１９は結果出力部で、前記特徴抽出
部１５は、周知のように、Ｂ．Ｐ．Ｆ（バンドパスフィ
ルタ）１５ａ、整流器１５ｂ、Ｌ．Ｐ．Ｆ（ローパスフ
ィルタ）１５ｃ等により構成されている。FIG. 12 is a diagram for explaining an example of a conventional speech recognition device. In the figure, 11 is a microphone, 12 is a
is a microphone amplifier, 13 is an AGC (automatic gain control circuit),
14 is an A/D converter, 15 is a feature extraction unit (power spectrum generation unit), 16 is a pattern generation unit, 17 is a recognition unit,
18 is a recognition dictionary, 19 is a result output unit, and the feature extraction unit 15 is constructed using B.I. P. F (band pass filter) 15a, rectifier 15b, L. P. It is composed of an F (low pass filter) 15c and the like.

【０００４】しかしながら、上記従来の音声認識装置に
おけるＡＧＣでは、マイクアンプ１２の直後に、ＡＧＣ
１３を入れ、その後に、特徴抽出部１５において特徴量
を抽出し、その特徴量でパターン生成部１６において入
力パターンを生成し、予め登録されている認識辞書１８
のパターンとパターンの比較を行い、認識結果を出力し
ていた。この様に構成した場合、一般に、ＡＧＣはアナ
ログ素子で構成され、ＡＧＣの増幅と抑圧の変化点を制
御することが難しかった。又、更に重要なことはＡＧＣ
を行うことにより、音声の波形そのものに歪みが生じ、
特徴抽出そのものに影響を与える場合もあった。However, in the AGC in the conventional speech recognition device, the AGC is performed immediately after the microphone amplifier 12.
13, after that, the feature extraction section 15 extracts the feature amount, the pattern generation section 16 generates an input pattern using the feature amount, and the pre-registered recognition dictionary 18
It compared the patterns and output the recognition results. When configured in this way, the AGC is generally configured with analog elements, and it is difficult to control the changing point of amplification and suppression of the AGC. Also, more importantly, AGC
By doing this, distortion occurs in the audio waveform itself,
In some cases, the feature extraction itself was affected.

【０００５】上記従来のＡＧＣでは、小信号時に一定ゲ
インで増幅し、大信号時はゲインを減少させて制御して
いる。しかしながら、音声に多少のノイズが付加された
場合、それも同様に増幅してしまうため、不要ノイズを
持ち上げてしまい、そのため、ＡＧＣ通過後の音声信号
のＳ／Ｎが悪化し、特徴量の抽出に悪影響を与えていた
。[0005] In the conventional AGC described above, a small signal is amplified with a constant gain, and a large signal is controlled by decreasing the gain. However, if some noise is added to the audio, it will be amplified as well, which will increase the unnecessary noise.As a result, the S/N of the audio signal after passing through the AGC will deteriorate, and the feature value extraction had a negative impact on.

【０００６】また、従来のＡＧＣは、前述のようにマイ
クアンプの後段に位置し、アナログ音声信号をＡＧＣ制
御し、これから特徴抽出を行って特徴量を求めていた。そのため、その特性の変更はアナログ素子を変更するこ
とになり現実的に難しかった。[0006] Furthermore, as described above, the conventional AGC is located after the microphone amplifier, performs AGC control on analog audio signals, and performs feature extraction from the signals to obtain feature quantities. Therefore, changing the characteristics requires changing the analog elements, which is practically difficult.

【０００７】更に、パワースペクトル生成後にＡＧＣ制
御を行う方式は、パワー値が小さい場合には一定利得を
与えて増幅し、大きい場合には逆に抑圧させるように利
得を変化させてパワー値が一定となるようにするもので
ある。しかし、この方式では増幅と抑制の切り替え点の
みがＡＧＣ制御のパラメータになっているため、切り替
え点の設定は増幅と抑制の両方に関係する。たとえば、
増幅レンジを小さく設定した場合には抑制部のダイナミ
ックレンジが保てなくなる。つまり、このＡＧＣ制御方
式では、増幅部のレンジのみを変更することができない
。Furthermore, in the method of performing AGC control after power spectrum generation, when the power value is small, a constant gain is applied and amplified, and when the power value is large, the gain is changed to suppress it, so that the power value remains constant. The purpose is to However, in this method, only the switching point between amplification and suppression is a parameter for AGC control, so the setting of the switching point is related to both amplification and suppression. for example,
If the amplification range is set small, the dynamic range of the suppressor cannot be maintained. In other words, with this AGC control method, only the range of the amplification section cannot be changed.

【０００８】[0008]

【目的】本発明は、上述のごとき実情に鑑みてなされた
もので、特に、ＡＧＣ回路を特徴抽出部の後段におき、
その出力（パワースペクトル）信号をもとにＡＧＣ制御
を行い、その増幅と抑圧の変化点を最適となる様に制御
すること、或いは、ＡＧＣの小信号時のゲイン設定を、
小信号領域で一定にノイズレベルを引き上げずに、音声
信号に対して効果的にＡＧＣ制御すること、或いは、大
きなパワー値に対してＡＧＣ機能を働かせないようにし
て大きなパワー値に対してパワー値を保存可能にするこ
とを目的としてなされたものである。[Purpose] The present invention has been made in view of the above-mentioned circumstances, and in particular, by placing an AGC circuit at a stage subsequent to the feature extraction section,
AGC control is performed based on the output (power spectrum) signal, and the change point of amplification and suppression is controlled to be optimal, or the gain setting when the AGC signal is small is
To perform AGC control effectively on audio signals without constantly increasing the noise level in the small signal region, or to control the power value for large power values by not operating the AGC function for large power values. This was done with the purpose of making it possible to preserve the

【０００９】本発明は、上記目的を達成するために、（
１）マイクロフォンから入力された音声信号の増幅・補
正を行う前処理部と、得られた音声信号を周波数解析し
てパワースペクトルに変換するパワースペクトル生成部
と、これらのパワースペクトルから２値化パターンを生
成する２値パターン生成部と、この２値化された音声信
号の単語パターンを予め登録されている辞書パターンと
比較して最も類似しているものを認識結果とする音声認
識装置において、前記パワースペクトル生成部の次段に
ＡＧＣ制御部が位置し、パワースペクトルの総和におい
て得られる音声パワー信号に対して低域通過フィルタを
とおして得られ、更に積分された音声パワー信号で、各
パワースペクトルに対するゲイン定数を制御してパワー
スペクトルの利得を制御する手段と、上記更に積分され
た音声パワー信号の減少、増加を判断する比較手段と、
各々の場合に前記低域通過フィルタの時定数を切り替え
る手段を有し、音声信号が小さい場合には一定利得を与
えて増幅し、大なる場合には逆に抑圧することを特徴と
したものであり、更には、（２）前記（１）において、
前記増幅と抑圧との切り替え点を変更できることを特徴
とし、更には、（３）前記（２）において、前記増幅と
抑圧との切り替え点の変更を外部のレジスタの設定によ
り変更すること、又は、（４）前記増幅部と抑圧部の切
り変え点を過去の音声の平均パワーにより制御すること
を特徴としたものであり、或いは、（５）マイクロフォ
ンから入力された音声信号の増幅・補正を行う前処理部
と、得られた音声信号を周波数解析してパワースペクト
ルに変換するパワースペクトル生成部と、これらのパワ
ースペクトルから２値化パターンを生成する２値パター
ン生成部と、この２値化された音声信号の単語パターン
を予め登録されている辞書パターンと比較して最も類似
しているものを認識結果とする音声認識装置であって、
前記パワースペクトル生成部の次段にＡＧＣ制御部が位
置し、パワースペクトルの総和において得られる音声パ
ワー信号に対して低域通過フィルタをとおして得られ、
更に積分された音声パワー信号で、各パワースペクトル
に対するゲイン定数を制御してパワースペクトルの利得
を制御する手段と、上記更に積分された音声パワー信号
の減少、増加を判断する比較手段と、各々の場合に前記
低域通過フィルタの時定数を切り替える手段を有し、音
声信号が小さい場合には一定利得を与えて増幅し、大な
る場合には逆に抑圧する機能を有し、増幅と抑圧との切
り替え点を変更できる音声認識装置のＡＧＣ制御方式に
おいて、定常ノイズレベルを検知する手段を有し、定常
ノイズレベルの音声パワー域では、ゲインを一定にして
抑圧することを、又は、（６）マイクロフォンから入力
された音声信号の増幅・補正を行う前処理部と、得られ
た音声信号を周波数解析しパワースペクトルに変換する
パワースペクトル生成部と、これらのパワースペクトル
から２値化パターンを生成する２値パターン生成部と、
この２値化された音声信号の単語パターンを予め登録さ
れている辞書パターンと比較して最も類似しているもの
を認識結果とする音声認識装置であって、前記パワース
ペクトル生成部の次段にＡＧＣ制御部が位置し、パワー
スペクトルの総和において得られる音声パワー信号に対
して低域通過フィルタをとおして得られた、更に積分さ
れた音声パワー信号で、各パワースペクトルに対するゲ
イン定数を制御してパワースペクトルの利得を制御する
手段と、上記更に積分された音声パワー信号の減少、増
加を判断する比較手段と、各々の場合に前記低域通過フ
ィルタの時定数を切り替える手段を有し、音声信号が小
さい場合には一定利得を与えて増幅し、大なる場合には
逆に抑圧する機能を有し、抑圧部の利得が１以下になる
場合、その利得を１とすることにより大きなパワー値に
対してはＡＧＣ制御機能を働かさないことを特徴とし、
更には、（７）前記パワー値が一定値以上になる場合に
抑圧部を設けることを特徴としたものである。以下、本
発明の実施例に基いて説明する。[0009] In order to achieve the above object, the present invention provides (
1) A preprocessing unit that amplifies and corrects the audio signal input from the microphone, a power spectrum generation unit that frequency-analyzes the obtained audio signal and converts it into a power spectrum, and a binarization pattern from these power spectra. and a speech recognition device that compares the word pattern of the binarized audio signal with a dictionary pattern registered in advance and takes the most similar word pattern as a recognition result. An AGC control unit is located at the next stage of the power spectrum generation unit, and the audio power signal obtained from the summation of the power spectra is obtained by passing it through a low-pass filter, and the audio power signal that is further integrated is used to generate each power spectrum. means for controlling the gain of the power spectrum by controlling a gain constant for the voice power signal; and comparison means for determining whether the further integrated audio power signal decreases or increases;
It has a means for switching the time constant of the low-pass filter in each case, and is characterized in that when the audio signal is small, it is amplified by giving a constant gain, and when it is large, it is suppressed. Yes, and furthermore, (2) in (1) above,
The switching point between amplification and suppression can be changed, and (3) in (2) above, the switching point between amplification and suppression can be changed by setting an external register; (4) The switching point between the amplifying section and the suppressing section is controlled by the average power of past sounds, or (5) the audio signal input from the microphone is amplified and corrected. A preprocessing section, a power spectrum generation section that frequency-analyzes the obtained audio signal and converts it into a power spectrum, a binary pattern generation section that generates a binarized pattern from these power spectra, and a binary pattern generation section that generates a binarized pattern from these power spectra. A speech recognition device that compares a word pattern of a voice signal with a pre-registered dictionary pattern and selects the most similar word pattern as a recognition result, the speech recognition device comprising:
An AGC control unit is located at the next stage of the power spectrum generation unit, and the audio power signal obtained from the sum of the power spectra is obtained through a low-pass filter,
means for controlling the gain of the power spectrum by controlling a gain constant for each power spectrum using the further integrated audio power signal; a comparing means for determining whether the further integrated audio power signal decreases or increases; It has a function of switching the time constant of the low-pass filter when the audio signal is small, and has a function of amplifying it by giving a constant gain when the audio signal is small, and suppressing it when it is large, and has a function of amplifying and suppressing the audio signal. (6) In the AGC control method of a speech recognition device that can change the switching point of the voice recognition device, it has a means for detecting a steady noise level, and suppresses the sound by keeping the gain constant in the voice power range of the steady noise level, or (6) A pre-processing section that amplifies and corrects the audio signal input from the microphone, a power spectrum generation section that analyzes the frequency of the obtained audio signal and converts it into a power spectrum, and generates a binarized pattern from these power spectra. a binary pattern generation unit;
A speech recognition device that compares the word pattern of the binarized speech signal with a pre-registered dictionary pattern and selects the most similar word pattern as a recognition result, An AGC control unit is located, and controls the gain constant for each power spectrum using the further integrated audio power signal obtained by passing the low-pass filter to the audio power signal obtained from the summation of the power spectra. means for controlling the gain of the power spectrum; comparison means for determining a decrease or increase in the further integrated audio power signal; and means for switching the time constant of the low-pass filter in each case; It has the function of amplifying by giving a constant gain when is small, and suppressing it when it is large, and when the gain of the suppressor is less than 1, setting the gain to 1 increases the power value. The feature is that the AGC control function is not activated for
Furthermore, (7) a suppressor is provided when the power value exceeds a certain value. Hereinafter, the present invention will be explained based on examples.

【００１０】図１は、本発明による音声認識装置のＡＧ
Ｃ制御方式の一実施例を説明するための構成図で、図中
、２１はマイクロフォン、２２はマイクアンプ、２３は
特徴量抽出部、２４はＡＧＣ制御部、２５は音声パター
ン生成部、２６は累積平均演算部、２７はゲイン設定部
、２８はパターン生成部、２９は認識部、３０は認識用
辞書、３１は結果出力部である。図１において、マイク
ロフォン２１から入力された音声信号は、マイクアンプ
２２で増幅され、特徴量抽出部（パワースペクトル生成
部）２３において、ｎチャネルのＢ．Ｐ．Ｆ（バンドパ
スフィルタ）２３ａで周波数解析され、これが整流器２
３ｂで整流され、次いで、Ｌ．Ｐ．Ｆ（ローパスフィル
タ）２３ｃをとおしてパワースペクトルに変換される。ここで得られたパワースペクトルはＡＧＣ制御部２４に
おいて最適にレベル変換され、パターン生成部２８で入
力パターンを生成し、認識部２９で予め登録されている
認識辞書３０とパターン比較され結果が出力される。Ａ
ＧＣ制御部２４に対し、音声パワー生成部２５では前記
Ｌ．Ｐ．Ｆ２３ｃからのパワースペクトルをもとに音声
パワー生成部２５においてｎチャネルのパワースペクト
ルを合算して音声パワーを算出し、累積平均演算部２６
において過去の平均パワーを累積して行く。この平均パ
ワーによりゲイン設定部２７におけるレジスタを更新し
、このレジスタに設定された値によりＡＧＣ制御部２４
のゲインが決定される。FIG. 1 shows the AG of the speech recognition device according to the present invention.
This is a configuration diagram for explaining one embodiment of the C control method, in which 21 is a microphone, 22 is a microphone amplifier, 23 is a feature extraction section, 24 is an AGC control section, 25 is a voice pattern generation section, and 26 is a 27 is a gain setting section, 28 is a pattern generation section, 29 is a recognition section, 30 is a recognition dictionary, and 31 is a result output section. In FIG. 1, an audio signal input from a microphone 21 is amplified by a microphone amplifier 22, and an n-channel B.I. P. Frequency analysis is performed by F (band pass filter) 23a, and this is applied to rectifier 2.
3b and then L.3b. P. It is converted into a power spectrum through an F (low pass filter) 23c. The power spectrum obtained here is optimally level-converted in the AGC control unit 24, an input pattern is generated in the pattern generation unit 28, and the pattern is compared with a recognition dictionary 30 registered in advance in the recognition unit 29, and the result is output. Ru. A
In contrast to the GC control unit 24, the audio power generation unit 25 generates the L. P. Based on the power spectrum from the F23c, the audio power generation unit 25 calculates the audio power by summing the power spectra of n channels, and the cumulative average calculation unit 26
The past average powers are accumulated. The register in the gain setting unit 27 is updated based on this average power, and the AGC control unit 24 is updated based on the value set in this register.
The gain of is determined.

【００１１】図２は、前記ＡＧＣ制御部２４の詳細図で
、このＡＧＣ制御部２４は、図２に示す様に、入力され
るパワースペクトルが信号比較部４１により信号が増加
か減少かが比較され、増加の場合は、Ｌ．Ｐ．Ｆ１（ア
タック時定数のフィルタ）４２を通して、又、減少の場
合は、Ｌ．Ｐ．Ｆ２（ディケイ時定数のフィルタ）４３
を通して、ここで得られた音声のパワーにより、入力信
号に対するゲインをゲイン演算部４４によりゲイン制御
部４５において決定する。ゲイン制御部４５では、増幅
と抑圧の変化点を前記ゲイン設定レジスタにより変化さ
せ、最適なゲインを決定する。FIG. 2 is a detailed diagram of the AGC control section 24. As shown in FIG. and in case of increase, L. P. Through F1 (attack time constant filter) 42, and in case of decrease, L. P. F2 (decay time constant filter) 43
The gain for the input signal is determined by the gain calculation unit 44 and the gain control unit 45 based on the power of the voice obtained here. The gain control unit 45 changes the change point between amplification and suppression using the gain setting register, and determines the optimum gain.

【００１２】例えば、図１１の入力と出力の関係におけ
るＡＧＣ制御において、図３（ａ）に示す様な入力信号
の音声パワーが得られた場合、図３（ｂ）の様なゲイン
の変化となる。つまり、音声パワーが小さい時は、一定
のゲインが得られ、大信号となった時にはゲインが減少
し、増幅から抑圧へと変化する。又、ゲイン設定レジス
タは図３（ｃ）のＩ，ＩＩの変化を設定する。つまり、
音声のレベルの中心点がＡ近辺にある時にはＩのゲイン
設定の方とし、平均的音声パワーが小さくなってきたら
ゲイン設定レジスタによりＩＩの方に動かす。このよう
にすることにより、同じようにＡＧＣの出力信号を生成
することが可能となる。For example, in AGC control with the input and output relationship shown in FIG. 11, if the audio power of the input signal as shown in FIG. 3(a) is obtained, the change in gain as shown in FIG. 3(b) and Become. In other words, when the audio power is small, a constant gain is obtained, and when the signal becomes large, the gain decreases, changing from amplification to suppression. Further, the gain setting register sets the changes in I and II in FIG. 3(c). In other words,
When the center point of the audio level is near A, the gain is set to I, and when the average audio power becomes small, it is moved to II using the gain setting register. By doing so, it becomes possible to generate the AGC output signal in the same way.

【００１３】図４は、請求項５に記載した発明の動作原
理を説明するための図で、この発明は、図示のように、
ａとＡの２つのゲインの変化点を設けたものである。図
５は、本発明と従来技術とを比較して説明するための図
であるが、従来技術では、前述のようにＡ点のみに変化
点を設定していた。つまり小信号に対しては一定のゲイ
ンで増幅していた。この場合、図５（ａ）に示す音声パ
ワー信号に対して、斜線部に示すようなノイズ（例えば
、マイクラインのノイズや周囲の定常的な騒音）が混入
した場合、一定のゲインを与えるため、図５（ｂ）に示
すように、ノイズレベルを引き上げてしまう。これに対
して、本発明では、図４に示したように、ａ点にも変化
点を設け、ａ点以下の信号に対しては、ゲインを一定に
している。このようにすることによって、図５（ｃ）に
示すように、ノイズレベルは増幅されず、音声信号のみ
が増幅される。FIG. 4 is a diagram for explaining the operating principle of the invention set forth in claim 5.
Two gain change points, a and A, are provided. FIG. 5 is a diagram for comparing and explaining the present invention and the prior art. In the prior art, the change point was set only at point A as described above. In other words, small signals were amplified with a constant gain. In this case, if the noise shown in the shaded area (for example, microphone line noise or ambient steady noise) is mixed into the audio power signal shown in FIG. 5(a), a certain gain is applied. , as shown in FIG. 5(b), increases the noise level. In contrast, in the present invention, as shown in FIG. 4, a change point is also provided at point a, and the gain is kept constant for signals below point a. By doing this, as shown in FIG. 5(c), the noise level is not amplified, but only the audio signal is amplified.

【００１４】図６は、上述のごとき本発明を実施するた
めに使用されるＡＧＣ制御回路の一例を説明するための
電気回路の要部ブロック図で、図中、５１信号比較器、
５２，５３はローパスフィルタ（ＬＰＦ）、５４はノイ
ズレベル検知部、５５はレベル比較制御部、５６は可変
利得制御部である。入力されるパワースペクトルは、信
号比較器５１により信号が増加か減少かが比較され、増
加の場合はＬ．Ｐ．Ｆ１（アタック時定数のフィルタ）
５２を通して、減少の場合はＬ．Ｐ．Ｆ２（ディケイ時
定数のフィルタ）５３を通して、ここで得られた音声の
パワーにより入力信号に対するゲインを可変利得制御部
５６において決定する。可変利得制御部５６では増幅と
抑圧の変化点を入力信号中の定常ノイズレベルをノイズ
レベル検知部５４で検出し、これにより、レベル比較制
御部５５を制御し、可変利得制御部５６の利得を、定常
ノイズレベルの音声パワー域ではゲインを一定にして抑
圧して制御する。これにより、音声信号に混入した不要
ノイズを、ＡＧＣ制御部により引き上げることなく、よ
り正確な音声信号のＡＧＣ制御が可能になる。FIG. 6 is a block diagram of a main part of an electric circuit for explaining an example of an AGC control circuit used to implement the present invention as described above.
52 and 53 are low pass filters (LPF), 54 is a noise level detection section, 55 is a level comparison control section, and 56 is a variable gain control section. The input power spectra are compared by the signal comparator 51 to see if the signal increases or decreases, and if the signal increases, the L. P. F1 (attack time constant filter)
52, in case of decrease L. P. The gain for the input signal is determined by the variable gain control unit 56 based on the power of the audio obtained through the F2 (decay time constant filter) 53. In the variable gain control section 56, the stationary noise level in the input signal is detected at the change point of amplification and suppression by the noise level detection section 54, and thereby the level comparison control section 55 is controlled to adjust the gain of the variable gain control section 56. In the audio power range of steady noise level, the gain is kept constant and suppressed for control. This enables more accurate AGC control of the audio signal without using the AGC control unit to remove unnecessary noise mixed into the audio signal.

【００１５】前述の、パワースペクトル生成後にＡＧＣ
制御を行なう方式は、パワー値が小さい場合には一定利
得を与えて増幅し、大きい場合には逆に抑圧させるよう
に利得を変化させてパワー値が一定となるようにするも
のである。しかし、この方式では増幅と抑制の切り替え
点のみがＡＧＣ制御のパラメータになっているため、切
り替え点の設定は増幅と抑制の両方に関係する。たとえ
ば、増幅レンジを小さく設定した場合には抑制部のダイ
ナミックレンジが保てなくなる。請求項６に記載した発
明は、上述のごとき不具合、つまり、上述のＡＧＣ制御
方式では増幅部のレンジのみを変更することができない
という欠点を解決するものである。After the above-mentioned power spectrum generation, AGC
The control method is such that when the power value is small, a constant gain is applied to amplify it, and when it is large, the gain is changed so as to suppress it, so that the power value remains constant. However, in this method, only the switching point between amplification and suppression is a parameter for AGC control, so the setting of the switching point is related to both amplification and suppression. For example, if the amplification range is set small, the dynamic range of the suppressor cannot be maintained. The invention set forth in claim 6 is intended to solve the above-mentioned problem, that is, the above-described AGC control method cannot change only the range of the amplifier section.

【００１６】図７（ａ）は、前述のＡＧＣ制御方式にお
ける利得特性を、図７（ｂ）は、Ｇａｉｎ（ゲイン）係
数との関係を示す。図８（ａ）は、本発明による制御方
式での利得特性を、図８（ｂ）はＧａｉｎ係数との関係
を示す。図８から明らかな様に、本発明によると、図７の利得特
性に比べて、パワー値が一定値以上、即ち、Ｇａｉｎ係
数が１以下になる場合にＧａｉｎ係数を１に固定するこ
とで大きなパワーに対してパワー値を保存することが可
能となる。図９は、請求項６に記載した発明を説明する
ための特性で、一定以上のパワー値に対して抑制効果を
付加したもので、これにより最大パワーの固定を行うこ
とが可能となる。FIG. 7(a) shows the gain characteristics in the above-mentioned AGC control method, and FIG. 7(b) shows the relationship with the gain coefficient. FIG. 8(a) shows the gain characteristic in the control method according to the present invention, and FIG. 8(b) shows the relationship with the gain coefficient. As is clear from FIG. 8, according to the present invention, compared to the gain characteristics shown in FIG. It becomes possible to save power values for powers. FIG. 9 shows a characteristic for explaining the invention as claimed in claim 6, in which a suppressing effect is added to the power value above a certain level, thereby making it possible to fix the maximum power.

【００１７】而して、音声認識においてＡＧＣ制御を用
いる目的は、パワーの小さい子音を引き上げること及び
音声のレベルを一定に保つことが最大の目的である。従
って、パワーの小さいものを増幅することは必要となる
が、パワーの大きい部分を抑制することは不必要である
。又、ダイナミックレンジを小さくすることとなる。請求項６に記載した発明はこうした抑制部をなくすこと
を可能にする。また、請求項７の発明によると、増幅部
と抑制部の切り替え点の間にＧａｉｎ係数を１とする区
間を設けたこととなり、必要以上のダイナミックレンジ
を抑えることができる。The main purpose of using AGC control in speech recognition is to raise consonants with low power and to maintain a constant speech level. Therefore, it is necessary to amplify the parts with low power, but it is unnecessary to suppress the parts with high power. Furthermore, the dynamic range will be reduced. The invention set forth in claim 6 makes it possible to eliminate such a suppressing portion. Furthermore, according to the seventh aspect of the invention, a section in which the gain coefficient is 1 is provided between the switching points of the amplifying section and the suppressing section, so that an excessive dynamic range can be suppressed.

【００１８】[0018]

【効果】請求項１〜４に記載の発明によると、安定して
ＡＧＣ制御を行うことができ、音声レベルの変動に常に
最適となるように制御することができる。請求項５に記
載の発明によると、音声信号に混入した不要ノイズをＡ
ＧＣ制御によって引き上げないようにしているので、よ
り正確な音声信号のＡＧＣ制御が可能となる。請求項６
に記載の発明によると、パワー値が一定値以上、すなわ
ち、ゲイン係数が１以下になる場合に、ゲイン係数を１
に固定するようにしているので、大きなパワーに対して
パワー値を保存することができる。請求項７に記載の発
明によると、一定値以上のパワー値に対して抑性効果を
付加したので、これにより最大パワーの固定を行うこと
ができる。[Effects] According to the invention as set forth in claims 1 to 4, AGC control can be performed stably, and control can always be optimally performed in response to fluctuations in audio level. According to the invention set forth in claim 5, unnecessary noise mixed into the audio signal is
Since the GC control prevents the signal from being pulled up, more accurate AGC control of the audio signal becomes possible. Claim 6
According to the invention described in , when the power value is a certain value or more, that is, the gain coefficient is 1 or less, the gain coefficient is set to 1.
Since it is fixed to , the power value can be preserved for large powers. According to the seventh aspect of the invention, since a suppressing effect is added to the power value above a certain value, it is possible to fix the maximum power.

[Brief explanation of the drawing]

【図１】　　本発明によるＡＧＣ制御回路を音声認識装
置に適用した時の全体構成を示す図である。FIG. 1 is a diagram showing the overall configuration when an AGC control circuit according to the present invention is applied to a speech recognition device.

【図２】　　請求項１〜４に記載したＡＧＣ制御回路の
詳細を説明するためのブロック図である。FIG. 2 is a block diagram for explaining details of the AGC control circuit according to claims 1 to 4.

【図３】　　図２に示したＡＧＣ制御回路の動作説明を
するための波形図及び特性図である。3 is a waveform diagram and a characteristic diagram for explaining the operation of the AGC control circuit shown in FIG. 2. FIG.

【図４】　　請求項５に記載したＡＧＣ制御回路の動作
説明をするための特性図である。FIG. 4 is a characteristic diagram for explaining the operation of the AGC control circuit according to claim 5;

【図５】　　請求項５に記載したＡＧＣ制御回路の動作
説明をするための波形図である。5 is a waveform diagram for explaining the operation of the AGC control circuit according to claim 5. FIG.

【図６】　　請求項５に記載したＡＧＣ制御回路の詳細
を説明するためのブロック図である。FIG. 6 is a block diagram for explaining details of the AGC control circuit according to claim 5.

【図７】　　従来のＡＧＣ制御回路の特性を説明するた
めの特性図である。FIG. 7 is a characteristic diagram for explaining the characteristics of a conventional AGC control circuit.

【図８】　　請求項６に記載した発明の動作説明を説明
するための特性図である。FIG. 8 is a characteristic diagram for explaining the operation of the invention recited in claim 6;

【図９】　　請求項７に記載した発明の動作説明を説明
するための特性図である。9 is a characteristic diagram for explaining the operation of the invention set forth in claim 7. FIG.

【図１０】　　従来の可変利得制御回路の一例を説明す
るためのブロック図である。FIG. 10 is a block diagram for explaining an example of a conventional variable gain control circuit.

【図１１】　　図１０の回路の特性図である。FIG. 11 is a characteristic diagram of the circuit of FIG. 10.

【図１２】　　従来の音声認識装置の一例を説明するた
めの全体構成図である。FIG. 12 is an overall configuration diagram for explaining an example of a conventional speech recognition device.

[Explanation of symbols]

２１…マイクロフォン、２２…マイクアンプ、２３…特
徴量抽出部、２４…ＡＧＣ制御部、２５…音声パターン
生成部、２６…累積平均演算部、２７…ゲイン設定部、
２８…パターン生成部、２９…認識部、３０…認識用辞
書、３１…結果出力部。21...Microphone, 22...Mic amplifier, 23...Feature amount extraction unit, 24...AGC control unit, 25...Audio pattern generation unit, 26...Cumulative average calculation unit, 27...Gain setting unit,
28...Pattern generation unit, 29...Recognition unit, 30...Recognition dictionary, 31...Result output unit.

Claims

[Claims]

[Claim 1] A preprocessing unit that amplifies and corrects an audio signal input from a microphone, a power spectrum generation unit that frequency-analyzes the obtained audio signal and converts it into a power spectrum, and a A binary pattern generation unit that generates a digitized pattern, and a speech recognition device that compares the word pattern of this binarized audio signal with pre-registered dictionary patterns and selects the most similar one as a recognition result. An AGC control unit is located at the next stage of the power spectrum generation unit, and the audio power signal obtained by passing the audio power signal obtained from the summation of the power spectra through a low-pass filter and further integrating the audio power signal, means for controlling the gain of the power spectrum by controlling a gain constant for each power spectrum; a comparison means for determining whether the audio power signal decreases or increases; and means for switching the time constant of the low-pass filter in each case. An AGC control method for a speech recognition device, characterized in that when a speech signal is small, the speech signal is amplified by giving a constant gain, and when the speech signal is large, it is suppressed.

2. The AGC control method for a speech recognition device according to claim 1, wherein a switching point between said amplification and suppression can be changed.

3. The AGC control method for a speech recognition device according to claim 2, wherein the switching point between amplification and suppression is changed by setting an external register.

4. The AGC control method for a speech recognition device according to claim 2, wherein the switching point between the amplifying section and the suppressing section is controlled based on the average power of past speech.

5. A preprocessing unit that amplifies and corrects an audio signal input from a microphone, a power spectrum generation unit that frequency-analyzes the obtained audio signal and converts it into a power spectrum, and a A binary pattern generation unit that generates a digitized pattern, and a speech recognition device that compares the word pattern of this binarized audio signal with pre-registered dictionary patterns and selects the most similar one as a recognition result. An AGC control unit is located at the next stage of the power spectrum generation unit, and the AGC control unit generates an audio power signal obtained by passing the audio power signal obtained from the summation of the power spectra through a low-pass filter and further integrating the audio power signal. means for controlling the gain of the power spectrum by controlling a gain constant for each power spectrum; a comparison means for determining whether the audio power signal is decreasing or increasing; and switching the time constant of the low-pass filter in each case. AGC of a speech recognition device that has a function of applying a constant gain and amplifying the speech signal when it is small and suppressing it when it is large, and that can change the switching point between amplification and suppression.
A voice recognition device characterized in that the control method includes means for detecting a steady noise level, and suppresses the steady noise level by keeping the gain constant in the voice power range of the steady noise level.
GC control method.

[Claim 6] A preprocessing unit that amplifies and corrects an audio signal input from a microphone, a power spectrum generation unit that frequency-analyzes the obtained audio signal and converts it into a power spectrum, and generates binary values from these power spectra. a binary pattern generation unit that generates a binary pattern, and a speech recognition device that compares the word pattern of this binary audio signal with a pre-registered dictionary pattern and selects the most similar word pattern as a recognition result. An AGC control section is located at the next stage of the power spectrum generation section, and the AGC control section is located at the next stage of the power spectrum generation section, and the audio power signal obtained from the summation of the power spectra is obtained by passing it through a low-pass filter and further integrated. , means for controlling the gain of the power spectrum by controlling a gain constant for each power spectrum, a comparison means for determining whether the audio power signal decreases or increases, and means for switching the time constant of the low-pass filter in each case. It has the function of amplifying the audio signal with a constant gain when it is small, and suppressing it when it is large, and when the gain of the suppression section is 1 or less, the gain is set to 1. An AGC control method for a speech recognition device characterized in that the AGC control function is not activated for particularly large power values.

7. The AGC control method for a speech recognition device according to claim 6, further comprising a suppressor provided when the power value exceeds a certain value.