JPH02101500A

JPH02101500A - Voice recognizing device

Info

Publication number: JPH02101500A
Application number: JP63254491A
Authority: JP
Inventors: Tomofumi Nakatani; 中谷　奉文
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1988-10-07
Filing date: 1988-10-07
Publication date: 1990-04-13

Abstract

PURPOSE:To efficiently instruct a voice producing level to a talker so that the voice of the talker can be recognized at a high recognition rate by comparing an input level with two or more threshold levels and informing the compared results to the talker in the form of short alarming sounds. CONSTITUTION:This voice recognizing means is provided with a means 12 which compares an input level with two or more threshold levels and another means 14 which generates to or more kinds of short alarming sounds and, when a voice input is made, the input is compared with the threshold levels. Then, when the input does not reach the lower threshold level, an alarming sound indicating that the inputted voices are low is produced and, when the input exceeds the higher threshold, another sound indicating that the inputted voices are loud is produced. Therefore, even when a talker cannot see an indicator, etc., due to sound dial, etc., of an automobile telephone set while he drives a car, he can recognize whether or not his voice level is appropriate by the alarming sounds. Thus a high recognition rate can be achieved.

Description

【発明の詳細な説明】妓且豆互本発明は、音声認識装置に係り、例えば、自動車電話の
音声ダイヤリング等種々の機器に適用可能なものである
。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a voice recognition device, and is applicable to various devices such as voice dialing of a car phone, for example.

従米七景第２図は、従来の音声認識方式（特開昭６３−１４２０
０号公報）の−例を説明するための図で、図中、１はマ
イクロフォン、２は入力レベルＫｉｌ＋定・判定部、３
は音声認識部、４は第１のランプ、５は第２のランプで
、マイクロフォン１がら入力された音声は入力レベル１
１１！Ｉ定・判定部２でレベル測定され、音声認識部３
３で認識される。」二記人カレベルの判定は、入力レベ
ル１ｆｌ１１定・判定部２が２つのレベルのスレッショ
ル１くを持っており、これら２つのスレッショルドとの
大小判定を行い、その判定結果に基づき２つのランプ４
，５を制御する。Figure 2 of the Seven Views of Jubei shows the conventional voice recognition method
In the figure, 1 is a microphone, 2 is an input level Kil+ constant/judgment section, and 3 is a diagram for explaining an example of Publication No. 0).
is a voice recognition unit, 4 is a first lamp, 5 is a second lamp, and the voice input from microphone 1 is input level 1.
11! The level is measured by the I-determination/judgment section 2, and the speech recognition section 3
3 is recognized. 2. To judge the power level, the input level 1fl11 constant/judgment unit 2 has two level thresholds, and it makes a judgment on the magnitude of these two thresholds, and based on the judgment result, the two lamps 4
, 5.

すなわち、入力レベルが低い方のスレッショルドより小
である時はランプを点灯せず、２つのスレッショルドの
間であるときは第１のランプ４のみ点灯させ、高い方の
スレッショルドより大である時は２個のランプ４．５を
同時に点灯させる。この入力レベルの判定とランプの制
御は＜ｙ２ｙｌを行なうときだけでなく、認識のための
標準パターン登録時も同様な動作を続けている。That is, when the input level is less than the lower threshold, the lamp is not turned on, when it is between the two thresholds, only the first lamp 4 is turned on, and when it is greater than the higher threshold, the lamp 4 is turned on. 4.5 lamps are lit at the same time. This input level determination and lamp control continues in the same way not only when <y2yl is performed, but also when registering a standard pattern for recognition.

ここで、」〕記２つのスレッショルドは、高い方のスレ
ッショルドは音声認識部３のＡ／Ｄ変換器の最大レベル
、低い方のスレッショルドは高い方のスレッショルドよ
りも約２５ｄＢ低い値とじている。Here, the two thresholds are set such that the higher threshold is the maximum level of the A/D converter of the speech recognition unit 3, and the lower threshold is approximately 25 dB lower than the higher threshold.

このような入力レベルの判定とランプの点滅は音声認識
のアルゴリズムとは直接関係ないが、発声レベルの適正
化を促し、高い認識率を確保するものである。すなわち
、発声レベルが過大であると入力系で音が歪んだりＡ／
Ｄ変換器が飽和する等の障害が生し、過小であると周囲
ノイズとのＳ／Ｎが、咽くなる等、いづれにしても認識
率が低下するので、発声者が発声時にランプを見て適正
レベルにあるか否かを検知しながら発声する必要があっ
た。Although such input level determination and lamp blinking are not directly related to the speech recognition algorithm, they encourage optimization of the speech level and ensure a high recognition rate. In other words, if the vocalization level is too high, the sound may be distorted in the input system or A/
Failures such as saturation of the D converter may occur, and if it is too low, the S/N with the surrounding noise will become sluggish, and the recognition rate will decrease in any case, so the speaker should not look at the lamp while speaking. It was necessary to vocalize while detecting whether or not the level was at an appropriate level.

しかしながら、上記従来の音声認識方法では。However, in the above conventional speech recognition method.

ｒ卜助ｉｔ電話の音声ダイヤリング等に音声認識を適用
した場合に、発声者がランプを注視する必要があり、注
視できない場合には、発声者が適正なレベルを確認でき
ず、良好な認ｊｉｌ率を得ることができなかった。When voice recognition is applied to telephone voice dialing, etc., the person speaking needs to look at the lamp, and if the person is unable to do so, the person speaking cannot confirm the appropriate level and may not be able to get good recognition. It was not possible to obtain the jil rate.

Ｌ−１本発明は、上述のごとき問題を解決するためになされた
もので１発声者の発声レベルの確認を確実にし、高い認
識率を得られる音声認識装置を提供することを目的とし
てなされたものである。L-1 The present invention was made in order to solve the above-mentioned problems, and was made for the purpose of providing a speech recognition device that can ensure confirmation of the speaking level of a single speaker and obtain a high recognition rate. It is something.

３１−一火本発明は、上記目的を達成するために、入力レベルを２
つ以上のスレッショルドと比較する手段と、２種以上の
短音の警告音を発生する手段とを有し、音声入力があっ
たとき、その入力を上記スレッショルドと比較と、その
入力が低い方のスレッショルド以下の時には入力音声が
小さいことを意味する警告音を発生し、その入力が高い
方のスレッショルド以上のときには入力音声が大きいこ
とを意味する警告音を発生するようにしたことを特徴と
したものである。以下、本発明の実施例に」ルづいて説
明する。31-Ikka In order to achieve the above object, the present invention sets the input level to 2.
and a means for generating two or more types of short warning sounds. When the input audio is below a threshold, a warning sound is generated to indicate that the input audio is low, and when the input is above a higher threshold, a warning sound is generated to indicate that the input audio is loud. It is. Embodiments of the present invention will be explained below.

而して、本発明は、入力レベルをｄｉ！Ｉ定して２段ｒ
１テ以上のスレッショルドと比較する手段と、２種以上
の人間が判別できるくらいの短音の警告音を発生する手
段とを持ち、音声入力があったとき、その入力を上記ス
レッショルドと比較し、その入力が低い方のスレッショ
ルド以下の時には入力音声が小さいことを意味する警告
音を発生し、その入力が高い方のスレッショルド以上の
ときには入力音声が大きいことを意味する警告音を発生
するようにしたものである。ここで、警告音を用いる意
味を考えろ。通常、音声認識装置ｎの認識結果の出力が
候補カテゴリーを表示器か音声応答のような文字か音声
情報を用いている。ここで問題としている手段として適
応できるのは音声であるが、ｒｔ声で発声状態を知らせ
るのは、音声認識装置を初めて使用するとき等の時には
有効であるが、常時使用する場合には音声での指示は、
例えば、「大きな声で発声してください」、「声が大き
すぎます」等と一定の時間を要するので、この音声がわ
ずられしくなる。そこで、本発明のごとく発声レベルを
意味する警告音を発生することにより。Therefore, the present invention changes the input level to di! 2 steps with I fixed
It has means for comparing with a threshold of 1 TE or more, and means for generating a short warning sound that can be distinguished by two or more types of humans, and when there is a voice input, it compares the input with the threshold, When the input is below the lower threshold, a warning sound is generated to indicate that the input audio is low, and when the input is above the high threshold, a warning sound is generated to indicate that the input audio is loud. It is something. Now, think about the meaning of using a warning sound. Usually, the output of the recognition result of the speech recognition device n uses text or speech information such as a display or a voice response to indicate candidate categories. Voice can be applied as the means in question here, and while it is effective to notify the vocalization status using rt voice when using the voice recognition device for the first time, when using it constantly, voice The instructions are:
For example, since it takes a certain amount of time to say something like "Please speak loudly" or "Your voice is too loud," this sound becomes annoying. Therefore, by generating a warning sound indicating the vocalization level as in the present invention.

発声者に短時間に知らせることができる。The speaker can be notified in a short time.

従って、本発明によれば、自動車電話の音声ダイヤル等
で、発声者が運転中で表示器等が見えないときでも、警
告音により適正なレベルであるか否かを知ることが可能
で、高い認識率を達成できる効果を有する。Therefore, according to the present invention, it is possible to know whether the level is appropriate or not by the warning sound even when the person making the call is driving and cannot see the display, etc. by voice dialing of a car phone, etc. It has the effect of achieving a high recognition rate.

第１図は、本発明の一実施例を説明するための構成図で
、図中、１１はマイクロフォン、１２は入力レベル測定
・判定部、１３は音声認識部、１４は警告音発生部、１
５はスピーカで、マイクロフォン１１から入力された音
声は入力レベル測定・判定部１２でレベル測定され、音
声認識部１３で認識される。この入力レベルの判定は、
入力レベル測定・判定部１２が２つ以上のレベルのスレ
ッショルドを持っており、これら２つ以上のスレッショ
ルドとの大小判定を行い、その判定結果に基づき警告音
発生部１４に制御信号を送る。警告音発生部１４はその
指令に基づき１人間が判別できる程度（例えば約２００
　ｍ　ｓ程度）の短音を音色を変えてスピーカ１５に出
力する。FIG. 1 is a block diagram for explaining one embodiment of the present invention, in which 11 is a microphone, 12 is an input level measurement/judgment section, 13 is a voice recognition section, 14 is a warning sound generation section, 1
Reference numeral 5 denotes a speaker, and the level of the voice input from the microphone 11 is measured by an input level measuring/judgment section 12, and recognized by a voice recognition section 13. The determination of this input level is
The input level measurement/judgment unit 12 has two or more level thresholds, and determines the magnitude of the two or more thresholds, and sends a control signal to the warning sound generation unit 14 based on the determination result. Based on the command, the warning sound generator 14 generates a sound that can be recognized by one person (for example, approximately 200
A short tone (of the order of ms) is output to the speaker 15 with a different tone.

また、電話等でよく経験する小さい声や音を聞くと大声
になり、逆に大きい声を聞くと小声になる人間の習性を
利用して入力レベルが低い方のスレッショルドより小で
ある時はスピーカ１５からの出力レベルを小さくし、高
い方のスレッショルドより大である時はスピーカ１５か
らの出力レベルを大きくすることにより発声音を制御す
ることもできる。In addition, when the input level is lower than the lower threshold, it uses the human tendency to become louder when hearing a soft voice or sound, such as on a telephone, and to become softer when hearing a loud voice. The vocalization can also be controlled by decreasing the output level from speaker 15 and increasing the output level from speaker 15 when it is greater than a higher threshold.

この入力レベルの判定と警告音の制御は認識を行なうと
きだけでなく、認識のための標準パターン登録時も同様
な動作を行なうことはいうまでもない。It goes without saying that the input level determination and warning sound control are performed not only when performing recognition, but also when registering a standard pattern for recognition.

効　　　果以」二の説明から明らかなように、本発明によると、入
力レベルを２段階以上のスレッショルドと比較して、そ
の結果を短時間の警告音として発声者に知らせることに
より、効率的に発声者に発声レベルの状態を指示するこ
とができ、高い認識率を得ることができる。As is clear from the explanation in ``Effects'' 2, according to the present invention, the input level is compared with two or more thresholds, and the result is notified to the speaker as a short-term warning sound, thereby efficiently achieving the desired effect. It is possible to instruct the speaker about the state of the speech level, and a high recognition rate can be obtained.

[Brief explanation of the drawing]

第１図は、本発明による音声認識装置の一実施例を説明
するための構成図、第２図は、従来の音声認識装置の一
例を説明するための構成図である。１１・・・マイクロフォン、１２・・・入力レベル測定
・判定部、１３・・・音声認識部、１４・・・警告音発
生部。１５・・・スピーカ。FIG. 1 is a block diagram for explaining an embodiment of a speech recognition apparatus according to the present invention, and FIG. 2 is a block diagram for explaining an example of a conventional speech recognition apparatus. DESCRIPTION OF SYMBOLS 11... Microphone, 12... Input level measurement/judgment part, 13... Voice recognition part, 14... Warning sound generation part. 15...Speaker.

Claims

[Claims]

1. It has means for comparing the input level with two or more thresholds, and means for generating two or more types of short warning sounds, and when there is a voice input, the input is compared with the thresholds, When the input is below the lower threshold, a warning sound is generated to indicate that the input audio is low, and when the input is above the high threshold, a warning sound is generated to indicate that the input audio is loud. A speech recognition device characterized by: