JPH02101500A - Voice recognizing device - Google Patents

Voice recognizing device

Info

Publication number
JPH02101500A
JPH02101500A JP63254491A JP25449188A JPH02101500A JP H02101500 A JPH02101500 A JP H02101500A JP 63254491 A JP63254491 A JP 63254491A JP 25449188 A JP25449188 A JP 25449188A JP H02101500 A JPH02101500 A JP H02101500A
Authority
JP
Japan
Prior art keywords
input
level
voice
talker
threshold
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP63254491A
Other languages
Japanese (ja)
Inventor
Tomofumi Nakatani
中谷 奉文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ricoh Co Ltd
Original Assignee
Ricoh Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ricoh Co Ltd filed Critical Ricoh Co Ltd
Priority to JP63254491A priority Critical patent/JPH02101500A/en
Publication of JPH02101500A publication Critical patent/JPH02101500A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To efficiently instruct a voice producing level to a talker so that the voice of the talker can be recognized at a high recognition rate by comparing an input level with two or more threshold levels and informing the compared results to the talker in the form of short alarming sounds. CONSTITUTION:This voice recognizing means is provided with a means 12 which compares an input level with two or more threshold levels and another means 14 which generates to or more kinds of short alarming sounds and, when a voice input is made, the input is compared with the threshold levels. Then, when the input does not reach the lower threshold level, an alarming sound indicating that the inputted voices are low is produced and, when the input exceeds the higher threshold, another sound indicating that the inputted voices are loud is produced. Therefore, even when a talker cannot see an indicator, etc., due to sound dial, etc., of an automobile telephone set while he drives a car, he can recognize whether or not his voice level is appropriate by the alarming sounds. Thus a high recognition rate can be achieved.

Description

【発明の詳細な説明】 妓且豆互 本発明は、音声認識装置に係り、例えば、自動車電話の
音声ダイヤリング等種々の機器に適用可能なものである
DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a voice recognition device, and is applicable to various devices such as voice dialing of a car phone, for example.

従米七景 第2図は、従来の音声認識方式(特開昭63−1420
0号公報)の−例を説明するための図で、図中、1はマ
イクロフォン、2は入力レベルKil+定・判定部、3
は音声認識部、4は第1のランプ、5は第2のランプで
、マイクロフォン1がら入力された音声は入力レベル1
11!I定・判定部2でレベル測定され、音声認識部3
3で認識される。」二記人カレベルの判定は、入力レベ
ル1fl11定・判定部2が2つのレベルのスレッショ
ル1くを持っており、これら2つのスレッショルドとの
大小判定を行い、その判定結果に基づき2つのランプ4
,5を制御する。
Figure 2 of the Seven Views of Jubei shows the conventional voice recognition method
In the figure, 1 is a microphone, 2 is an input level Kil+ constant/judgment section, and 3 is a diagram for explaining an example of Publication No. 0).
is a voice recognition unit, 4 is a first lamp, 5 is a second lamp, and the voice input from microphone 1 is input level 1.
11! The level is measured by the I-determination/judgment section 2, and the speech recognition section 3
3 is recognized. 2. To judge the power level, the input level 1fl11 constant/judgment unit 2 has two level thresholds, and it makes a judgment on the magnitude of these two thresholds, and based on the judgment result, the two lamps 4
, 5.

すなわち、入力レベルが低い方のスレッショルドより小
である時はランプを点灯せず、2つのスレッショルドの
間であるときは第1のランプ4のみ点灯させ、高い方の
スレッショルドより大である時は2個のランプ4.5を
同時に点灯させる。この入力レベルの判定とランプの制
御は<y2ylを行なうときだけでなく、認識のための
標準パターン登録時も同様な動作を続けている。
That is, when the input level is less than the lower threshold, the lamp is not turned on, when it is between the two thresholds, only the first lamp 4 is turned on, and when it is greater than the higher threshold, the lamp 4 is turned on. 4.5 lamps are lit at the same time. This input level determination and lamp control continues in the same way not only when <y2yl is performed, but also when registering a standard pattern for recognition.

ここで、」〕記2つのスレッショルドは、高い方のスレ
ッショルドは音声認識部3のA/D変換器の最大レベル
、低い方のスレッショルドは高い方のスレッショルドよ
りも約25dB低い値とじている。
Here, the two thresholds are set such that the higher threshold is the maximum level of the A/D converter of the speech recognition unit 3, and the lower threshold is approximately 25 dB lower than the higher threshold.

このような入力レベルの判定とランプの点滅は音声認識
のアルゴリズムとは直接関係ないが、発声レベルの適正
化を促し、高い認識率を確保するものである。すなわち
、発声レベルが過大であると入力系で音が歪んだりA/
D変換器が飽和する等の障害が生し、過小であると周囲
ノイズとのS/Nが、咽くなる等、いづれにしても認識
率が低下するので、発声者が発声時にランプを見て適正
レベルにあるか否かを検知しながら発声する必要があっ
た。
Although such input level determination and lamp blinking are not directly related to the speech recognition algorithm, they encourage optimization of the speech level and ensure a high recognition rate. In other words, if the vocalization level is too high, the sound may be distorted in the input system or A/
Failures such as saturation of the D converter may occur, and if it is too low, the S/N with the surrounding noise will become sluggish, and the recognition rate will decrease in any case, so the speaker should not look at the lamp while speaking. It was necessary to vocalize while detecting whether or not the level was at an appropriate level.

しかしながら、上記従来の音声認識方法では。However, in the above conventional speech recognition method.

r卜助it電話の音声ダイヤリング等に音声認識を適用
した場合に、発声者がランプを注視する必要があり、注
視できない場合には、発声者が適正なレベルを確認でき
ず、良好な認jil率を得ることができなかった。
When voice recognition is applied to telephone voice dialing, etc., the person speaking needs to look at the lamp, and if the person is unable to do so, the person speaking cannot confirm the appropriate level and may not be able to get good recognition. It was not possible to obtain the jil rate.

L−1 本発明は、上述のごとき問題を解決するためになされた
もので1発声者の発声レベルの確認を確実にし、高い認
識率を得られる音声認識装置を提供することを目的とし
てなされたものである。
L-1 The present invention was made in order to solve the above-mentioned problems, and was made for the purpose of providing a speech recognition device that can ensure confirmation of the speaking level of a single speaker and obtain a high recognition rate. It is something.

31−一火 本発明は、上記目的を達成するために、入力レベルを2
つ以上のスレッショルドと比較する手段と、2種以上の
短音の警告音を発生する手段とを有し、音声入力があっ
たとき、その入力を上記スレッショルドと比較と、その
入力が低い方のスレッショルド以下の時には入力音声が
小さいことを意味する警告音を発生し、その入力が高い
方のスレッショルド以上のときには入力音声が大きいこ
とを意味する警告音を発生するようにしたことを特徴と
したものである。以下、本発明の実施例に」ルづいて説
明する。
31-Ikka In order to achieve the above object, the present invention sets the input level to 2.
and a means for generating two or more types of short warning sounds. When the input audio is below a threshold, a warning sound is generated to indicate that the input audio is low, and when the input is above a higher threshold, a warning sound is generated to indicate that the input audio is loud. It is. Embodiments of the present invention will be explained below.

而して、本発明は、入力レベルをdi!I定して2段r
1テ以上のスレッショルドと比較する手段と、2種以上
の人間が判別できるくらいの短音の警告音を発生する手
段とを持ち、音声入力があったとき、その入力を上記ス
レッショルドと比較し、その入力が低い方のスレッショ
ルド以下の時には入力音声が小さいことを意味する警告
音を発生し、その入力が高い方のスレッショルド以上の
ときには入力音声が大きいことを意味する警告音を発生
するようにしたものである。ここで、警告音を用いる意
味を考えろ。通常、音声認識装置nの認識結果の出力が
候補カテゴリーを表示器か音声応答のような文字か音声
情報を用いている。ここで問題としている手段として適
応できるのは音声であるが、rt声で発声状態を知らせ
るのは、音声認識装置を初めて使用するとき等の時には
有効であるが、常時使用する場合には音声での指示は、
例えば、「大きな声で発声してください」、「声が大き
すぎます」等と一定の時間を要するので、この音声がわ
ずられしくなる。そこで、本発明のごとく発声レベルを
意味する警告音を発生することにより。
Therefore, the present invention changes the input level to di! 2 steps with I fixed
It has means for comparing with a threshold of 1 TE or more, and means for generating a short warning sound that can be distinguished by two or more types of humans, and when there is a voice input, it compares the input with the threshold, When the input is below the lower threshold, a warning sound is generated to indicate that the input audio is low, and when the input is above the high threshold, a warning sound is generated to indicate that the input audio is loud. It is something. Now, think about the meaning of using a warning sound. Usually, the output of the recognition result of the speech recognition device n uses text or speech information such as a display or a voice response to indicate candidate categories. Voice can be applied as the means in question here, and while it is effective to notify the vocalization status using rt voice when using the voice recognition device for the first time, when using it constantly, voice The instructions are:
For example, since it takes a certain amount of time to say something like "Please speak loudly" or "Your voice is too loud," this sound becomes annoying. Therefore, by generating a warning sound indicating the vocalization level as in the present invention.

発声者に短時間に知らせることができる。The speaker can be notified in a short time.

従って、本発明によれば、自動車電話の音声ダイヤル等
で、発声者が運転中で表示器等が見えないときでも、警
告音により適正なレベルであるか否かを知ることが可能
で、高い認識率を達成できる効果を有する。
Therefore, according to the present invention, it is possible to know whether the level is appropriate or not by the warning sound even when the person making the call is driving and cannot see the display, etc. by voice dialing of a car phone, etc. It has the effect of achieving a high recognition rate.

第1図は、本発明の一実施例を説明するための構成図で
、図中、11はマイクロフォン、12は入力レベル測定
・判定部、13は音声認識部、14は警告音発生部、1
5はスピーカで、マイクロフォン11から入力された音
声は入力レベル測定・判定部12でレベル測定され、音
声認識部13で認識される。この入力レベルの判定は、
入力レベル測定・判定部12が2つ以上のレベルのスレ
ッショルドを持っており、これら2つ以上のスレッショ
ルドとの大小判定を行い、その判定結果に基づき警告音
発生部14に制御信号を送る。警告音発生部14はその
指令に基づき1人間が判別できる程度(例えば約200
 m s程度)の短音を音色を変えてスピーカ15に出
力する。
FIG. 1 is a block diagram for explaining one embodiment of the present invention, in which 11 is a microphone, 12 is an input level measurement/judgment section, 13 is a voice recognition section, 14 is a warning sound generation section, 1
Reference numeral 5 denotes a speaker, and the level of the voice input from the microphone 11 is measured by an input level measuring/judgment section 12, and recognized by a voice recognition section 13. The determination of this input level is
The input level measurement/judgment unit 12 has two or more level thresholds, and determines the magnitude of the two or more thresholds, and sends a control signal to the warning sound generation unit 14 based on the determination result. Based on the command, the warning sound generator 14 generates a sound that can be recognized by one person (for example, approximately 200
A short tone (of the order of ms) is output to the speaker 15 with a different tone.

また、電話等でよく経験する小さい声や音を聞くと大声
になり、逆に大きい声を聞くと小声になる人間の習性を
利用して入力レベルが低い方のスレッショルドより小で
ある時はスピーカ15からの出力レベルを小さくし、高
い方のスレッショルドより大である時はスピーカ15か
らの出力レベルを大きくすることにより発声音を制御す
ることもできる。
In addition, when the input level is lower than the lower threshold, it uses the human tendency to become louder when hearing a soft voice or sound, such as on a telephone, and to become softer when hearing a loud voice. The vocalization can also be controlled by decreasing the output level from speaker 15 and increasing the output level from speaker 15 when it is greater than a higher threshold.

この入力レベルの判定と警告音の制御は認識を行なうと
きだけでなく、認識のための標準パターン登録時も同様
な動作を行なうことはいうまでもない。
It goes without saying that the input level determination and warning sound control are performed not only when performing recognition, but also when registering a standard pattern for recognition.

効   果 以」二の説明から明らかなように、本発明によると、入
力レベルを2段階以上のスレッショルドと比較して、そ
の結果を短時間の警告音として発声者に知らせることに
より、効率的に発声者に発声レベルの状態を指示するこ
とができ、高い認識率を得ることができる。
As is clear from the explanation in ``Effects'' 2, according to the present invention, the input level is compared with two or more thresholds, and the result is notified to the speaker as a short-term warning sound, thereby efficiently achieving the desired effect. It is possible to instruct the speaker about the state of the speech level, and a high recognition rate can be obtained.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は、本発明による音声認識装置の一実施例を説明
するための構成図、第2図は、従来の音声認識装置の一
例を説明するための構成図である。 11・・・マイクロフォン、12・・・入力レベル測定
・判定部、13・・・音声認識部、14・・・警告音発
生部。 15・・・スピーカ。
FIG. 1 is a block diagram for explaining an embodiment of a speech recognition apparatus according to the present invention, and FIG. 2 is a block diagram for explaining an example of a conventional speech recognition apparatus. DESCRIPTION OF SYMBOLS 11... Microphone, 12... Input level measurement/judgment part, 13... Voice recognition part, 14... Warning sound generation part. 15...Speaker.

Claims (1)

【特許請求の範囲】[Claims] 1、入力レベルを2つ以上のスレッショルドと比較する
手段と、2種以上の短音の警告音を発生する手段とを有
し、音声入力があったとき、その入力を上記スレッショ
ルドと比較し、その入力が低い方のスレッショルド以下
の時には入力音声が小さいことを意味する警告音を発生
し、その入力が高い方のスレッショルド以上のときには
入力音声が大きいことを意味する警告音を発生するよう
にしたことを特徴とする音声認識装置。
1. It has means for comparing the input level with two or more thresholds, and means for generating two or more types of short warning sounds, and when there is a voice input, the input is compared with the thresholds, When the input is below the lower threshold, a warning sound is generated to indicate that the input audio is low, and when the input is above the high threshold, a warning sound is generated to indicate that the input audio is loud. A speech recognition device characterized by:
JP63254491A 1988-10-07 1988-10-07 Voice recognizing device Pending JPH02101500A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63254491A JPH02101500A (en) 1988-10-07 1988-10-07 Voice recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63254491A JPH02101500A (en) 1988-10-07 1988-10-07 Voice recognizing device

Publications (1)

Publication Number Publication Date
JPH02101500A true JPH02101500A (en) 1990-04-13

Family

ID=17265791

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63254491A Pending JPH02101500A (en) 1988-10-07 1988-10-07 Voice recognizing device

Country Status (1)

Country Link
JP (1) JPH02101500A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008007616A1 (en) * 2006-07-13 2008-01-17 Nec Corporation Non-audible murmur input alarm device, method, and program
US20140362999A1 (en) * 2013-06-06 2014-12-11 Robert Scheper Sound detection and visual alert system for a workspace

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008007616A1 (en) * 2006-07-13 2008-01-17 Nec Corporation Non-audible murmur input alarm device, method, and program
US20140362999A1 (en) * 2013-06-06 2014-12-11 Robert Scheper Sound detection and visual alert system for a workspace

Similar Documents

Publication Publication Date Title
US8019050B2 (en) Method and apparatus for providing feedback of vocal quality to a user
US9571617B2 (en) Controlling mute function on telephone
US6411927B1 (en) Robust preprocessing signal equalization system and method for normalizing to a target environment
US20020087306A1 (en) Computer-implemented noise normalization method and system
US20060085183A1 (en) System and method for increasing recognition accuracy and modifying the behavior of a device in response to the detection of different levels of speech
US20060161430A1 (en) Voice activation
US11388514B2 (en) Method for operating a hearing device, and hearing device
JPH02101500A (en) Voice recognizing device
JP2004013084A (en) Sound volume controller
US11610596B2 (en) Adjustment method of sound output and electronic device performing the same
JP3085317B2 (en) Telephone equipment
JP2004252085A (en) System and program for voice conversion
JP3079006B2 (en) Voice recognition control device
JPH10312196A (en) Method and device for optimizing response voice volume
KR20200010149A (en) Apparatus for recognizing call sign and method for the same
JPH1021049A (en) Voice synthesizer
CN112399004B (en) Sound output adjusting method and electronic device for executing same
JP6759370B2 (en) Ring tone recognition device and ring tone recognition method
JPS6314200A (en) Voice recognition
TWI748215B (en) Adjustment method of sound output and electronic device performing the same
KR100705039B1 (en) Voice level over warning function
JP2006317556A (en) Voice dialog apparatus
JP2000069126A (en) Portable telephone set with annoying call preventing function
JPS6377097A (en) Voice recognition equipment
JPH10301595A (en) Voice recognition and response device