JPH02101500A - Voice recognizing device
- Publication number
- JPH02101500A (application JP63254491A)
- Authority
- JP
- Japan
- Prior art keywords
- input
- level
- voice
- talker
- threshold
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Description
Detailed Description of the Invention
Technical Field
The present invention relates to a voice recognition device and is applicable to various equipment, for example voice dialing on a car telephone.
Prior Art
FIG. 2 illustrates an example of a conventional voice recognition method (Japanese Unexamined Patent Publication No. 63-14200). In the figure, 1 is a microphone, 2 is an input level measurement/judgment unit, 3 is a voice recognition unit, 4 is a first lamp, and 5 is a second lamp. Voice input from the microphone 1 has its level measured by the input level measurement/judgment unit 2 and is recognized by the voice recognition unit 3. To judge the input level, the input level measurement/judgment unit 2 holds two thresholds, compares the input level against both, and controls the two lamps 4 and 5 according to the result.
That is, when the input level is below the lower threshold, neither lamp is lit; when it lies between the two thresholds, only the first lamp 4 is lit; and when it exceeds the higher threshold, both lamps 4 and 5 are lit at the same time. This input level judgment and lamp control operate in the same way not only during recognition but also when the standard patterns used for recognition are registered.
Here, the higher of the two thresholds is set to the maximum level of the A/D converter of the voice recognition unit 3, and the lower threshold is set about 25 dB below the higher threshold.
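The conventional two-lamp rule can be sketched as follows. This is only an illustration and not code from the cited publication: the 16-bit full-scale value, the function and constant names, and the idea that the level is a peak amplitude measured ahead of the converter are all assumptions. Only the lamp rule and the roughly 25 dB gap come from the text.

```python
# Hypothetical sketch of the prior-art lamp logic. The names and the
# 16-bit full-scale value are assumptions, not taken from the patent.
ADC_FULL_SCALE = 32767   # assumed maximum level of the A/D converter
GAP_DB = 25.0            # lower threshold sits about 25 dB below the upper

HIGH_THRESHOLD = float(ADC_FULL_SCALE)
# A 25 dB drop in amplitude is a factor of 10 ** (-25 / 20), about 0.056.
LOW_THRESHOLD = HIGH_THRESHOLD * 10 ** (-GAP_DB / 20.0)


def lamp_states(level, low=LOW_THRESHOLD, high=HIGH_THRESHOLD):
    """Return (lamp4_on, lamp5_on) for a measured input level."""
    if level < low:
        return (False, False)   # below the lower threshold: no lamp
    if level <= high:
        return (True, False)    # between the thresholds: first lamp only
    return (True, True)         # above the higher threshold: both lamps
```

With these assumed values the lower threshold lands near 1843 counts, so a peak of 5000 lights only the first lamp.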
This input level judgment and lamp lighting are not directly related to the voice recognition algorithm, but they encourage the speaker to use an appropriate level and so secure a high recognition rate. If the speech level is too high, problems arise such as distortion in the input system or saturation of the A/D converter; if it is too low, the S/N ratio against ambient noise deteriorates. In either case the recognition rate falls, so the speaker had to watch the lamps while speaking to check whether the level was appropriate.
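The two failure modes named here, converter saturation at high levels and poor S/N at low levels, can each be checked with a few lines. The sketch below is an assumption-laden illustration: the patent does not say how the level is measured, so the RMS measure, the 16-bit full scale, and the helper names are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square amplitude of a non-empty list of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def snr_db(speech, noise):
    """Signal-to-noise ratio in dB; assumes the noise RMS is non-zero."""
    return 20.0 * math.log10(rms(speech) / rms(noise))


def is_saturated(samples, full_scale=32767):
    """True if any sample reaches the assumed A/D full-scale value."""
    return any(abs(s) >= full_scale for s in samples)
```

A quiet utterance lowers `snr_db` against a fixed noise floor, while an overly loud one makes `is_saturated` return true; both conditions are what the level feedback is meant to prevent.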
However, with the conventional voice recognition method described above, when voice recognition is applied to voice dialing on a car telephone and the like, the speaker must watch the lamps. When the speaker cannot watch them, the appropriate level cannot be confirmed and a good recognition rate cannot be obtained.
L−1
本発明は、上述のごとき問題を解決するためになされた
もので1発声者の発声レベルの確認を確実にし、高い認
識率を得られる音声認識装置を提供することを目的とし
てなされたものである。L-1 The present invention was made in order to solve the above-mentioned problems, and was made for the purpose of providing a speech recognition device that can ensure confirmation of the speaking level of a single speaker and obtain a high recognition rate. It is something.
Constitution
To achieve the above object, the present invention comprises means for comparing the input level with two or more thresholds and means for generating two or more kinds of short warning tones. When voice is input, the input is compared with the thresholds. When the input is at or below the lower threshold, a warning tone meaning that the input voice is too quiet is generated; when the input is at or above the higher threshold, a warning tone meaning that the input voice is too loud is generated. The invention is described below based on an embodiment.
Accordingly, the present invention has means for measuring the input level and comparing it with thresholds of two or more levels, and means for generating two or more kinds of short warning tones that a human can tell apart. When voice is input, the input is compared with the thresholds. When the input is at or below the lower threshold, a warning tone meaning that the input voice is too quiet is generated; when the input is at or above the higher threshold, a warning tone meaning that the input voice is too loud is generated.
Consider why warning tones are used. A voice recognition device usually outputs its recognition result, the candidate category, as text or speech information, for example on a display or as a voice response. Of these, only sound suits the present purpose. Announcing the speaking state with a spoken message is effective when the device is used for the first time, but in regular use spoken instructions such as "Please speak louder" or "Your voice is too loud" take a certain amount of time and become annoying. By generating warning tones that signify the speech level, as in the present invention, the speaker can be informed in a short time.
Therefore, according to the present invention, in applications such as voice dialing on a car telephone, even when the speaker is driving and cannot look at a display, the warning tone tells the speaker whether the level is appropriate, and a high recognition rate can be achieved.
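The comparison rule of the invention can be sketched as below. This is a minimal illustration with exactly two thresholds; the patent allows two or more, and the enum, function name, and the units of `level` are assumptions introduced here. The inclusive comparisons follow the "at or below" and "at or above" wording of the claim.

```python
from enum import Enum


class Feedback(Enum):
    """Hypothetical labels for the three outcomes of the level check."""
    TOO_QUIET = "too_quiet"
    OK = "ok"
    TOO_LOUD = "too_loud"


def classify_level(level, low_threshold, high_threshold):
    """Decide which warning tone, if any, the input level calls for."""
    if low_threshold >= high_threshold:
        raise ValueError("low_threshold must be below high_threshold")
    if level <= low_threshold:
        return Feedback.TOO_QUIET   # warning tone meaning "too quiet"
    if level >= high_threshold:
        return Feedback.TOO_LOUD    # warning tone meaning "too loud"
    return Feedback.OK              # no warning tone needed
```

Unlike the lamp scheme, the result here would drive a sound rather than a light, so the speaker does not have to look at anything.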
FIG. 1 is a block diagram for explaining an embodiment of the present invention. In the figure, 11 is a microphone, 12 is an input level measurement/judgment unit, 13 is a voice recognition unit, 14 is a warning tone generator, and 15 is a speaker. Voice input from the microphone 11 has its level measured by the input level measurement/judgment unit 12 and is recognized by the voice recognition unit 13. To judge the input level, the input level measurement/judgment unit 12 holds thresholds of two or more levels, compares the input level against them, and sends a control signal to the warning tone generator 14 according to the result. Based on that command, the warning tone generator 14 outputs to the speaker 15 a short tone, just long enough for a human to distinguish (for example, about 200 ms), with a different timbre for each case.
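A short warning tone of about 200 ms could be synthesized as in the sketch below. Everything beyond the 200 ms duration is an assumption: the 8 kHz sample rate, the sine waveform, the two frequencies, and the dictionary keys are hypothetical, and pitch is used here only as a simple stand-in for the different timbres the text mentions.

```python
import math

SAMPLE_RATE = 8000   # assumed sample rate in Hz
TONE_MS = 200        # tone length of about 200 ms, as in the description

# Assumed frequencies; the patent only says the tones sound different.
TONE_HZ = {"too_quiet": 440.0, "too_loud": 1760.0}


def warning_tone(kind, amplitude=0.5):
    """Return one short sine tone as a list of float samples."""
    freq = TONE_HZ[kind]
    n = SAMPLE_RATE * TONE_MS // 1000   # number of samples in the tone
    return [
        amplitude * math.sin(2.0 * math.pi * freq * i / SAMPLE_RATE)
        for i in range(n)
    ]
```

The returned samples would then be handed to whatever audio output drives the speaker 15; that playback step is outside this sketch.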
The speaker's voice can also be steered by exploiting a human tendency often experienced on the telephone: people speak louder when they hear a quiet voice or sound and more softly when they hear a loud one. When the input level is below the lower threshold, the output level from the speaker 15 is reduced; when it is above the higher threshold, the output level from the speaker 15 is increased.
Needless to say, this input level judgment and warning tone control are performed not only during recognition but also when the standard patterns for recognition are registered.
Effects
As is clear from the above description, according to the present invention the input level is compared with thresholds of two or more levels and the result is reported to the speaker as a short warning tone. The speaker is thereby informed of the state of the speech level efficiently, and a high recognition rate can be obtained.
FIG. 1 is a block diagram for explaining an embodiment of the voice recognition device according to the present invention, and FIG. 2 is a block diagram for explaining an example of a conventional voice recognition device.
Description of symbols: 11, microphone; 12, input level measurement/judgment unit; 13, voice recognition unit; 14, warning tone generator; 15, speaker.
Claims (1)
1. A voice recognition device comprising means for comparing an input level with two or more thresholds and means for generating two or more kinds of short warning tones, characterized in that, when voice is input, the input is compared with the thresholds, a warning tone meaning that the input voice is too quiet is generated when the input is at or below the lower threshold, and a warning tone meaning that the input voice is too loud is generated when the input is at or above the higher threshold.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP63254491A JPH02101500A (en) | 1988-10-07 | 1988-10-07 | Voice recognizing device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP63254491A JPH02101500A (en) | 1988-10-07 | 1988-10-07 | Voice recognizing device |
Publications (1)
Publication Number | Publication Date |
---|---|
JPH02101500A (en) | 1990-04-13 |
Family
ID=17265791
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP63254491A Pending JPH02101500A (en) | 1988-10-07 | 1988-10-07 | Voice recognizing device |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPH02101500A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008007616A1 (en) * | 2006-07-13 | 2008-01-17 | Nec Corporation | Non-audible murmur input alarm device, method, and program |
US20140362999A1 (en) * | 2013-06-06 | 2014-12-11 | Robert Scheper | Sound detection and visual alert system for a workspace |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8019050B2 (en) | Method and apparatus for providing feedback of vocal quality to a user | |
US9571617B2 (en) | Controlling mute function on telephone | |
US6411927B1 (en) | Robust preprocessing signal equalization system and method for normalizing to a target environment | |
US20020087306A1 (en) | Computer-implemented noise normalization method and system | |
US20060085183A1 (en) | System and method for increasing recognition accuracy and modifying the behavior of a device in response to the detection of different levels of speech | |
US20060161430A1 (en) | Voice activation | |
US11388514B2 (en) | Method for operating a hearing device, and hearing device | |
JPH02101500A (en) | Voice recognizing device | |
JP2004013084A (en) | Sound volume controller | |
US11610596B2 (en) | Adjustment method of sound output and electronic device performing the same | |
JP3085317B2 (en) | Telephone equipment | |
JP2004252085A (en) | System and program for voice conversion | |
JP3079006B2 (en) | Voice recognition control device | |
JPH10312196A (en) | Method and device for optimizing response voice volume | |
KR20200010149A (en) | Apparatus for recognizing call sign and method for the same | |
JPH1021049A (en) | Voice synthesizer | |
CN112399004B (en) | Sound output adjusting method and electronic device for executing same | |
JP6759370B2 (en) | Ring tone recognition device and ring tone recognition method | |
JPS6314200A (en) | Voice recognition | |
TWI748215B (en) | Adjustment method of sound output and electronic device performing the same | |
KR100705039B1 (en) | Voice level over warning function | |
JP2006317556A (en) | Voice dialog apparatus | |
JP2000069126A (en) | Portable telephone set with annoying call preventing function | |
JPS6377097A (en) | Voice recognition equipment | |
JPH10301595A (en) | Voice recognition and response device |