JPH0235988B2

Info

Publication number: JPH0235988B2
Application number: JP57035369A
Authority: JP
Inventors: Masahiro Hibino; Kenji Shima
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1982-03-04
Filing date: 1982-03-04
Publication date: 1990-08-14
Also published as: JPS58152298A

Description

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a voice input control device, and more particularly to an improvement in a voice input control device that controls controlled equipment, such as in-vehicle equipment, in accordance with input voice.

FIG. 1 is a schematic block diagram of a voice input control device that forms the background of the present invention. First, a conventional voice input control device is briefly described with reference to FIG. 1. Voice is input to a microphone 1 and converted into a voice signal, which is input to a voice recognition circuit 100. Registered in the voice recognition circuit 100 are the feature parameters of predetermined voice keywords for selecting the functions of each controlled device and of voices corresponding to device names. The voice recognition circuit 100 analyzes the waveform of the voice input from the microphone 1 to extract feature parameters, and collates them with the pre-registered feature parameters to identify the corresponding keyword. A control circuit 200 generates a control signal corresponding to the keyword obtained as the recognition result of the voice recognition circuit 100 and supplies it to the corresponding drive circuit 300. In response, the drive circuit 300 controls the controlled device 400.

FIG. 2 is a schematic block diagram of the voice recognition circuit 100 shown in FIG. 1. Next, the voice recognition circuit 100 is described with reference to FIG. 2. The voice signal from the microphone 1 shown in FIG. 1 is supplied to a voice analysis circuit 101. The voice analysis circuit 101 comprises a plurality of band-pass filters, smoothing circuits, an A-D converter, and the like, and analyzes the waveform of the input voice signal to extract feature parameters. The feature parameters extracted by the voice analysis circuit 101 are stored in a feature parameter register 102. A registered parameter memory 104 stores the feature parameters of all the voice keywords needed to control the controlled device 400. A similarity calculation circuit 103 calculates the similarity between the feature parameters of the input voice stored in the feature parameter register 102 and any one of the parameters registered in the registered parameter memory 104. A determination circuit 105 determines whether the similarity calculated by the similarity calculation circuit 103 is the highest one, compares that similarity with a predetermined threshold Th, and thereby identifies which registered pattern the input voice corresponds to. The determination circuit 105 then encodes the number of the keyword obtained as the identification result and outputs it as numerical data. A timing signal generation circuit 106 generates timing signals that determine the sequence of the recognition processing described above, and supplies them to the voice analysis circuit 101, the registered parameter memory 104, and the determination circuit 105.
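The matching step performed by circuits 103 and 105 can be sketched in software as follows. This is only an illustration under stated assumptions: the patent does not specify the similarity measure, so cosine similarity is used as a stand-in, and the names `similarity`, `recognize`, and `REGISTERED` as well as the three-element feature vectors are hypothetical.

```python
import math

def similarity(a, b):
    """Cosine similarity between two feature vectors (an assumed measure;
    the patent does not say how circuit 103 computes similarity)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def recognize(features, registered, th):
    """Mimic determination circuit 105: pick the registered pattern with the
    highest similarity, then accept it only if it reaches the threshold Th."""
    best_number, best_score = None, float("-inf")
    for number, template in registered.items():
        score = similarity(features, template)
        if score > best_score:
            best_number, best_score = number, score
    return best_number if best_score >= th else None

# Hypothetical registered parameter memory 104: keyword number -> template.
REGISTERED = {0: [1.0, 0.0, 0.0], 1: [0.0, 1.0, 0.0], 2: [0.0, 0.0, 1.0]}

print(recognize([0.9, 0.1, 0.0], REGISTERED, th=0.8))  # close to template 0
print(recognize([1.0, 1.0, 1.0], REGISTERED, th=0.8))  # nothing is close enough
```

Returning `None` when no pattern reaches Th is a choice made for this sketch; the patent only states that the determination circuit outputs the keyword number of the identified pattern.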

The voice recognition circuit 100 configured as described above can recognize input voice adequately when used in a low-noise environment. However, in places with loud engine noise, such as inside an automobile, the recognition rate for input voice falls, and the noise itself is often misrecognized as a keyword. Consequently, when such a voice recognition circuit 100 was used in a voice input control device, sufficient performance could not be obtained. In particular, in an environment where a radio is playing or people are talking nearby, the waveforms from these sound sources sometimes resemble the voices of registered keywords, so the probability of misrecognition becomes especially high.

Therefore, the main object of the present invention is to provide a voice input control device that reduces misrecognition as far as possible, even in a relatively noisy environment, and thereby prevents malfunction of the controlled device.

In summary, the present invention is configured so that a predetermined voice for controlling a controlled device is uttered twice. If the feature parameters of the first input voice and those of the next input voice match, the controlled device is controlled in accordance with that voice. Even if the feature parameters of the first input voice and those of the next input voice do not match, the controlled device is controlled in accordance with the voice if the voice is of the first type.

The above object and other objects and features of the present invention will become more apparent from the following detailed description given with reference to the drawings.

FIG. 3 is a schematic block diagram of a device that recognizes a voice uttered twice, on which the present invention is premised. In general, when a person utters the same word twice in succession, the two voice waveforms are extremely similar, so if the voices that a user of this device utters with the intention of control are each recognized, the probability that the recognition results disagree is extremely small. Focusing on this point, this example adds a new recognition result register 110, an old recognition result register 111, and a comparison circuit 112 to the voice input control device shown in FIGS. 1 and 2. The new recognition result register 110 stores the recognition result of the voice recognition circuit 100 for the second utterance, and the old recognition result register 111 stores the recognition result of the voice recognition circuit 100 for the first utterance. The comparison circuit 112 determines whether the recognition result of the second utterance held in the new recognition result register 110 matches the recognition result of the first utterance stored in the old recognition result register 111. When the comparison circuit 112 determines that the two match, it enables the control circuit 200.

Next, the operation is described. The speaker utters twice the voice needed, for example, to turn on the power of a radio 401 serving as the controlled device. When the first voice is uttered, the microphone 1 supplies the corresponding voice signal to the voice recognition circuit 100. As described with reference to FIG. 2, the voice recognition circuit 100 analyzes the waveform of the voice to extract feature parameters, determines their similarity to the pre-registered feature parameters of a plurality of voices, and recognizes the voice. The recognition result of the first utterance is stored in the new recognition result register 110. When the speaker then utters the same voice again, the voice recognition circuit 100 recognizes that voice. At this time, the recognition result of the first voice stored in the new recognition result register 110 is transferred to the old recognition result register 111, and the recognition result of the second utterance is stored in the new recognition result register 110. The comparison circuit 112 compares the contents of the new recognition result register 110 with the contents of the old recognition result register 111; if they do not match, it disables the control circuit 200, and if they match, it enables the control circuit 200. Accordingly, the control circuit 200 generates a control signal and supplies it to the drive circuit 300 when the recognition results of the two utterances match. In response, the drive circuit 300 turns on the power of the radio 401.
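The register shuffle described above can be modeled in a few lines. The following is a minimal sketch, not the patented circuit: the class name `RepeatConfirmer` and the use of `None` for "nothing recognized yet" are assumptions made for illustration.

```python
class RepeatConfirmer:
    """Software model of registers 110 and 111 plus comparison circuit 112."""

    def __init__(self):
        self.new_result = None  # stands in for new recognition result register 110
        self.old_result = None  # stands in for old recognition result register 111

    def on_recognized(self, keyword_number):
        """Shift the previous result into the old register, store the new one,
        and report whether the control circuit should be enabled."""
        self.old_result = self.new_result
        self.new_result = keyword_number
        return self.new_result is not None and self.new_result == self.old_result


confirmer = RepeatConfirmer()
print(confirmer.on_recognized(3))  # first utterance: no match yet
print(confirmer.on_recognized(3))  # same keyword again: match
print(confirmer.on_recognized(5))  # different keyword: no match
```

A stray noise burst that happens to be recognized as a keyword enables control only if the very next recognition result is the same keyword, which is the property this example relies on.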

Controlling the radio 401 requires about 20 words of voice commands for turning on the power, selecting a frequency, switching the volume, and so on, so the recognition result encoded by the voice recognition circuit 100 carries only about 5 bits of information. The new recognition result register 110 and the old recognition result register 111 can therefore be built from a few flip-flops, and the comparison circuit 112 from a few exclusive-OR and AND elements, so the configuration is relatively simple.
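As a rough illustration of why so little hardware is needed, the sketch below checks that 20 keywords fit in 5 bits and compares two codes bit by bit, with one exclusive-OR per bit and a final AND over the results. The function name `codes_match` and the constant `CODE_BITS` are hypothetical; the actual gate arrangement is not given in the patent.

```python
CODE_BITS = 5  # about 20 keywords fit in 5 bits, since 2**5 = 32

def codes_match(new_code, old_code):
    """Bitwise equality test: XOR each bit pair, then AND the inverted outputs."""
    bit_equal = []
    for i in range(CODE_BITS):
        new_bit = (new_code >> i) & 1
        old_bit = (old_code >> i) & 1
        bit_equal.append((new_bit ^ old_bit) == 0)  # XOR is 0 when bits agree
    return all(bit_equal)  # AND over the per-bit results

print((20 - 1).bit_length())          # bits needed to number 20 keywords
print(codes_match(0b10011, 0b10011))  # identical codes
print(codes_match(0b10011, 0b10010))  # codes differing in one bit
```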

Thus, in the example shown in FIG. 3, the radio 401 is controlled only when the same voice is uttered twice in succession and the two recognition results are equal. In a relatively noisy environment such as the inside of an automobile, the noise level is high and various kinds of noise are present, but it is generally rare for the voice waveform of the same word to occur twice in succession. The voice input control device of this example can therefore prevent malfunctions caused by noise.

FIG. 4 is a schematic block diagram of one embodiment of the present invention. In the example shown in FIG. 3, the user must utter the same voice twice in succession every time the radio 401 is to be controlled, which is likely to feel somewhat inconvenient. The time from when the user decides to control the radio 401 until the radio 401 actually operates is also somewhat longer. The embodiment shown in FIG. 4 is intended to solve these problems.

Specifically, in this embodiment, the keyword voices are classified into at least two tiers. One is the group of keywords needed at the start of a series of control operations, such as turning on the power of a controlled device; the other is the group of keywords used frequently within a short period, such as tuning or volume adjustment of the radio 401. Correspondingly, the voice recognition circuit 100 is provided with registered parameter memories 104 and 154 that store the feature parameters of the at least two kinds of keywords. In addition, a classification circuit 107 is provided to classify which of the two tiers a keyword recognized by the voice recognition circuit 100 belongs to. The classification circuit 107 demultiplexes the memory address signals for the registered parameter memories 104 and 154 output from the timing signal generation circuit 106, and can be built by combining a few logic gate elements. The classification circuit 107 receives the recognition result that was recognized by the determination circuit 105 and stored in the recognition result register 110. When the determination circuit 105 determines that the input voice belongs to the second keyword group, the classification circuit 107 forces the comparison circuit 112 to output a match signal. Conversely, when the input voice belongs to the first keyword group, it lets the comparison circuit 112 operate as in the example shown in FIG. 3. For this purpose, the comparison circuit 112 needs only a minor change, such as replacing a two-input gate element with a three-input gate element.
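The effect of the classification circuit on the comparison circuit can be expressed as a single Boolean function. This is a sketch under assumptions: `match_signal` and its parameter names are hypothetical, and the real circuit works on encoded register bits rather than Python values.

```python
def match_signal(new_result, old_result, is_second_group):
    """Model of comparison circuit 112 with the extra input from circuit 107.

    For the second keyword group the match output is forced on; otherwise the
    output depends on the two recognition result registers agreeing.
    """
    registers_equal = new_result is not None and new_result == old_result
    return is_second_group or registers_equal

print(match_signal(7, None, is_second_group=True))   # forced match on one utterance
print(match_signal(2, 2, is_second_group=False))     # first group, repeated keyword
print(match_signal(2, 3, is_second_group=False))     # first group, results differ
```

The extra `is_second_group` input corresponds to the additional gate input the text mentions when it says a two-input gate element is replaced with a three-input one.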

With this configuration, when starting an operation such as turning on the power, the control operation is performed only after the same voice has been uttered twice, whereas for the second keyword group the control operation is performed when the voice is uttered just once. The operation of this embodiment is now described using an example voice input procedure. The wake-up keyword "ninshiki" and the keyword "radio" for turning on the power of the radio 401 are assigned to the first keyword group, and keywords corresponding to operations such as tuning the radio 401, for example "1 channel", are assigned to the second keyword group. The user first utters the same keyword twice in succession: "ninshiki", "ninshiki". This keyword is recognized by the voice recognition circuit 100, and the first recognition result is stored in the new recognition result register 110. When the second utterance is recognized, the contents of the new recognition result register 110 are transferred to the old recognition result register 111, and the recognition result of the second utterance is stored in the new recognition result register 110. At this time, the classification circuit 107 classifies the uttered voice as belonging to the first keyword group and does not force the comparison circuit 112 to output a match signal. However, when the comparison circuit 112 determines that the contents of the new recognition result register 110 and the contents of the old recognition result register 111 match, it supplies a match signal to the control circuit 200. In response, the control circuit 200 outputs a control signal for operating the drive circuit 300. The drive circuit 300 is thereby enabled, and control of the radio 401 becomes possible from then on.

Next, when the user utters "radio", "radio", a word of the first keyword group, the classification circuit 107 determines that the voice belongs to the first keyword group, and the comparison circuit 112 outputs a match signal. In response to this match signal, the control circuit 200 supplies the drive circuit 300 with a control signal for turning on the power of the radio 401. In response, the drive circuit 300 turns on the power of the radio 401.

Next, the user utters just once a voice for tuning, such as "1 channel" or "5 channel". The classification circuit 107 classifies the uttered voice as belonging to the second keyword group and forces the comparison circuit 112 to output a match signal. Based on the recognition result stored in the new recognition result register 110, the control circuit 200 generates a control signal, for example, for receiving channel 1 on the radio 401. In response, the drive circuit 301 causes the radio 401 to receive channel 1.
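The whole input procedure walked through above can be replayed with a small simulation. It is a simplified sketch: the class `VoiceController`, the keyword sets, and the use of strings in place of encoded keyword numbers are all illustrative assumptions, and the step in which the wake-up keyword enables the drive circuit is not modeled.

```python
# Keyword groups as assigned in the example procedure.
FIRST_GROUP = {"ninshiki", "radio"}          # must be uttered twice
SECOND_GROUP = {"1 channel", "5 channel"}    # a single utterance suffices

class VoiceController:
    """Toy model of registers 110/111, classification 107 and comparison 112."""

    def __init__(self):
        self.new_result = None  # new recognition result register 110
        self.old_result = None  # old recognition result register 111

    def hear(self, keyword):
        """Process one recognized keyword; return it if control is triggered."""
        self.old_result, self.new_result = self.new_result, keyword
        forced = keyword in SECOND_GROUP
        repeated = keyword in FIRST_GROUP and self.new_result == self.old_result
        return keyword if (forced or repeated) else None

session = ["ninshiki", "ninshiki", "radio", "radio", "1 channel"]
controller = VoiceController()
actions = [controller.hear(word) for word in session]
print(actions)
```

In this replay the first "ninshiki" and the first "radio" trigger nothing, their repetitions trigger control, and "1 channel" triggers control on its single utterance.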

As described above, according to this embodiment, the same voice is uttered twice at the start of a series of control operations, such as turning on the power of the radio 401, while operations that need frequent control after the start, such as tuning, require only a single utterance. The user therefore feels no inconvenience, and the time from when the user decides to control the radio 401 until it actually performs the desired operation is shortened.

The voice recognition circuit 100, the control circuit 200, the recognition result registers 110 and 111, the comparison circuit 112, and the like in the above embodiment can be formed on the same semiconductor chip to make the whole device smaller.

As described above, according to the present invention, a voice for controlling a controlled device is uttered twice and the target controlled device is controlled when the feature parameters of the two voices match, while a voice of the first type controls the controlled device even if the feature parameters of the two voices do not match. This prevents malfunctions caused by voice recognition errors due to ambient noise and the like. Moreover, because the controlled device can be controlled by voice input, no manual operation is needed and a practical function can be provided. Furthermore, the same voice is uttered twice at the start of a control operation, while operations that need frequent control after the start require only a single utterance, which shortens the time until the desired operation is performed after the control operation has started.

[Brief explanation of drawings]

FIG. 1 is a schematic block diagram of a voice input control device that forms the background of the present invention. FIG. 2 is a schematic block diagram of the voice recognition circuit shown in FIG. 1. FIG. 3 is a schematic block diagram of a device that recognizes a voice uttered twice, on which the present invention is premised. FIG. 4 is a schematic block diagram of one embodiment of the present invention.

In the figures, 1 is a microphone, 100 is a voice recognition circuit, 101 is a voice analysis circuit, 102 is a feature parameter register, 103 is a similarity calculation circuit, 104 and 154 are registered parameter memories, 105 is a determination circuit, 106 is a timing signal generation circuit, 107 is a classification circuit, 110 is a new recognition result register, 111 is an old recognition result register, 112 is a comparison circuit, 200 is a control circuit, 300 is a drive circuit, and 401 is a radio.

Claims

[Scope of Claims]
1. A voice input control device that controls a controlled device in response to input of a first type of voice and a second type of voice, comprising:
voice input means for outputting a voice signal corresponding to each voice in response to input of the first and second types of voice;
feature parameter extraction means for outputting feature parameters of the voice;
first feature parameter storage means for storing feature parameters of the first type of voice;
second feature parameter storage means for storing feature parameters of the second type of voice;
recognition means for recognizing whether the feature parameters of the voice output from the feature parameter extraction means are similar to any of the feature parameters stored in the first or second feature parameter storage means;
first recognition result storage means for storing the recognition result of the recognition means;
second recognition result storage means for storing the previous recognition result of the recognition means;
comparison means for comparing the current recognition result stored in the first recognition result storage means with the previous recognition result stored in the second recognition result storage means to determine whether they match;
control means for controlling the controlled device in accordance with the input voice on the basis of the recognition result of the recognition means and a match output from the comparison means; and
means which, in response to the recognition means recognizing that the feature parameters of the input voice are similar to the feature parameters of the first type of voice and the first recognition result storage means storing that result, causes the control means to control the controlled device in accordance with the input voice even if the comparison means outputs a non-match signal.