JPS61246799A

JPS61246799A - Voice response switch

Info

Publication number: JPS61246799A
Application number: JP60089374A
Authority: JP
Inventors: 博昭竹山; 仁深川; 清隆竹原; 安一杵川
Original assignee: Matsushita Electric Works Ltd
Current assignee: Panasonic Electric Works Co Ltd
Priority date: 1985-04-24
Filing date: 1985-04-24
Publication date: 1986-11-04

Abstract

(57) [Summary] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

DETAILED DESCRIPTION OF THE INVENTION [Technical Field] The present invention relates to a voice response switch, and more particularly to a voice response switch that operates by recognizing a human voice.

[Background Art] As shown in FIG. 6, a conventional voice response switch comprises a filter circuit 11 that passes input signals in the frequency band corresponding to voice, a level detection circuit 12 that detects the output level of the filter circuit 11, a control circuit 13 that compares the output of the level detection circuit 12 with a preset reference value and outputs a control signal when that output is at or above the reference value, and a switch element 14 that is opened and closed by the control signal. When the input level to the control circuit 13 is at or above the reference value, the input signal to the filter circuit 11 is judged to be a voice signal.

Because this circuit configuration decides whether the input is voice solely by judging the level in a specific frequency band, any input signal that has frequency components within the passband of the filter circuit 11 and a level higher than the reference value actuates the switch element 14, even if it is noise rather than voice, which causes malfunctions. Moreover, even when voice is input, the switch element 14 is actuated regardless of whether the voice was uttered for the purpose of actuating it, so the switch element 14 may operate when its operation is not wanted.

For this reason, as shown in FIG. 7, an arrangement has been considered in which a voice recognition device 15 compares the input signal with a control voice stored in advance in a storage unit 16 and opens or closes the switch element 3 when the two match. When unspecified speakers are targeted, however, the arithmetic processing for voice recognition takes a long time, which makes it difficult to control the switch element 14 in real time, and at the current level of technology the recognition rate is generally low, so malfunctions are likely. Raising the recognition rate requires more information and more computation, which delays processing still further. When a specific speaker is targeted, on the other hand, the user's own voice must be registered before use, and this preparation is troublesome.

[Object of the Invention] The present invention has been made in view of the above points. Its main object is to provide a voice response switch that operates in real time, has a high recognition rate, and can be used by unspecified speakers, by extracting a plurality of formants, which are the dominant frequency components characterizing the vowels in speech, and controlling a switch element according to the movement of a vector in the vector space (or plane) whose axes are the formants. A further object is to provide a voice response switch that does not confuse on control and off control of the switch element.

[Disclosure of the Invention] FIG. 5 shows an example of a vowel spectrum. The dominant frequency components that characterize a vowel, that is, the frequency components at the peaks of the spectrum, are called formants. A vowel generally has several formants, which are called the first formant F1, the second formant F2, the third formant F3, and so on, in order from the lowest frequency. Of these, the first formant F1 and the second formant F2 contribute the most, and a vowel can be determined with fairly high accuracy from F1 and F2 alone.

Here, when the Japanese vowels /a/, /i/, /u/, /e/ and /o/ are plotted on the F1-F2 vector plane, with the first formant F1 on the horizontal axis and the second formant F2 on the vertical axis, each vowel occupies the range shown by the broken lines in FIG. 4. Formants vary considerably with each individual's vocal tract length and other factors, so each vowel appears on the F1-F2 plane with a certain spread, and the ranges of different vowels overlap to a considerable extent. It is known, however, that the formants of the five vowels uttered by the same person in the same environment generally form an approximate pentagon on the F1-F2 plane, and that when the environment or the speaker changes, the relative positions of the five vowels are preserved: the pentagon translates while keeping its shape. The relative displacement when one vowel changes to another, that is, the change vector, is therefore approximately constant regardless of environment or speaker. For example, if the vector components of the vowel /a/ are (800 Hz, 1800 Hz) and those of the vowel /o/ are (500 Hz, 1000 Hz), the change vector from /a/ to /o/ has the components (-300 Hz, -800 Hz), and these components remain approximately constant even when the environment or the speaker differs. The present invention accordingly discloses a voice response switch in which the control voice is composed of a sequence of vowels, the change vectors between successive vowels are monitored to determine whether the input signal matches the preset control voice, and the switch element is opened or closed when it does. In the following description, voice is recognized using the first formant F1 and the second formant F2, but the third formant F3 may be used as a third vector component to raise the recognition rate further. In general, representing each vowel in the F1-F2-F3 vector space removes the overlap between vowels and so improves detection accuracy further.
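The arithmetic behind the /a/ to /o/ example can be sketched in a few lines of Python. This is only an illustration: the two vowel positions are the ones quoted in the description, while the helper names and the second speaker's offset are assumptions introduced here to show that a pure translation of the vowel pattern leaves the change vector unchanged.

```python
from typing import Tuple

Formants = Tuple[float, float]  # (F1, F2) in Hz


def change_vector(previous: Formants, current: Formants) -> Formants:
    """Return the component-wise difference current - previous, in Hz."""
    return (current[0] - previous[0], current[1] - previous[1])


def shifted(point: Formants, offset: Formants) -> Formants:
    """Translate a point on the F1-F2 plane by a fixed offset."""
    return (point[0] + offset[0], point[1] + offset[1])


# Formant positions quoted in the description.
vowel_a = (800.0, 1800.0)
vowel_o = (500.0, 1000.0)

# Hypothetical second speaker: the whole vowel pattern is translated.
speaker_offset = (60.0, 150.0)  # assumed value, for illustration only

reference = change_vector(vowel_a, vowel_o)
other = change_vector(shifted(vowel_a, speaker_offset),
                      shifted(vowel_o, speaker_offset))
print(reference)  # (-300.0, -800.0)
print(other)      # (-300.0, -800.0)
```

Real speakers differ by more than an exact translation, which is why the description later attaches a tolerance range to each stored change vector.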

(Embodiment) As shown in FIG. 1, a voice signal is input to a formant extraction circuit 1, which extracts the first formant F1 and the second formant F2. The output of the formant extraction circuit 1 is input to a control voice discrimination circuit 2, which outputs a control signal when it judges that the input signal matches a preset control voice. The output of the control voice discrimination circuit 2 is input to a switch element 3, which is opened or closed when it receives the control signal.

FIG. 2 shows an example of the formant extraction circuit 1.

The formant extraction circuit 1 comprises a bank of bandpass filters 11₁ to 11ₙ, each with a bandwidth of 200 Hz and each with a different passband, an analog/digital conversion circuit 12 that converts the output signals of the bandpass filters 11₁ to 11ₙ into digital signals, and an arithmetic circuit 13, consisting of a microprocessor or the like, that detects the formants from the output levels of the bandpass filters 11₁ to 11ₙ.

The bandpass filters 11₁ to 11ₙ have mutually different passbands of 0 to 200 Hz, 200 to 400 Hz, 400 to 600 Hz, ..., 2200 to 2400 Hz, ..., set so that all the filters together pass every frequency in the voice band. The arithmetic circuit 13 detects the first formant F1 and the second formant F2 and also outputs a phoneme change signal indicating whether the input voice has changed. In this embodiment the formants are extracted in hardware by the circuit configuration, but software techniques such as linear prediction may be used instead.
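The description does not say how the arithmetic circuit 13 locates the formants from the band levels. One plausible reading, sketched below in Python, is simple peak picking: treat each 200 Hz band as one sample of the spectrum and take the two lowest local maxima as F1 and F2. The function names, the use of band centre frequencies, and the sample levels are all assumptions made for this illustration.

```python
from typing import List

BAND_WIDTH_HZ = 200.0  # each bandpass filter covers 200 Hz, as in the text


def band_center(index: int) -> float:
    """Centre frequency of band `index` (band 0 covers 0 to 200 Hz)."""
    return (index + 0.5) * BAND_WIDTH_HZ


def pick_formants(levels: List[float], count: int = 2) -> List[float]:
    """Return the centre frequencies of the `count` lowest local maxima.

    A band is a local maximum when its level exceeds its lower neighbour
    and is not below its upper neighbour (assumed rule, not from the text).
    """
    peaks: List[float] = []
    for i, level in enumerate(levels):
        lower = levels[i - 1] if i > 0 else float("-inf")
        upper = levels[i + 1] if i < len(levels) - 1 else float("-inf")
        if level > lower and level >= upper:
            peaks.append(band_center(i))
    return peaks[:count]


# Made-up output levels for 12 bands covering 0 to 2400 Hz.
levels = [1.0, 2.0, 3.0, 6.0, 9.0, 5.0, 3.0, 4.0, 8.0, 4.0, 2.0, 1.0]
print(pick_formants(levels))  # [900.0, 1700.0]
```

With 200 Hz bands the estimate is quantized to the band centres, which is coarse but sufficient for comparing change vectors against a tolerance range.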

FIG. 3 shows an example of the control voice discrimination circuit 2, which comprises the following. A first vector holding circuit 22 stores, when the phoneme change signal is input, the vector whose components are the first formant F1 and the second formant F2. A second vector holding circuit 23 stores, when the phoneme change signal is input, the vector that had been held in the first vector holding circuit 22. A change vector calculation circuit 24 calculates the change vector by subtracting the vector held in the second vector holding circuit 23 from the vector held in the first vector holding circuit 22. A storage unit 25 stores, in a predetermined order, the change vectors between adjacent phonemes of the control voice that is to drive the switch element 3. A first comparison/judgment circuit 26a compares the output of the change vector calculation circuit 24 with the set values stored in the storage unit 25 and outputs a match signal when the change vectors of the input voice signal follow the stored order and fall within the set ranges. An on control signal generation circuit 27 receives the match signal from the first comparison/judgment circuit 26a and outputs an on control signal that turns the switch element 3 on. A second comparison/judgment circuit 26b makes the same comparison and outputs a match signal when the change vectors of the input voice signal follow the reverse of the stored order and fall within the set ranges. An off control signal generation circuit 28 receives the match signal from the second comparison/judgment circuit 26b and outputs an off control signal that turns the switch element 3 off.

The storage unit 25 stores the change vectors between adjacent phonemes of the set control voice in a form that allows a certain amount of error. That is, a permissible error range is set for each change vector to allow for variation due to individual and environmental differences. For example, (300 ± α1 Hz, 800 ± α2 Hz) is set as the range of the change vector from /a/ to /o/, and the sensitivity is adjusted by choosing the values of α1 and α2 appropriately. Each time a phoneme change signal is input, the control voice discrimination circuit 2 judges whether the change vector of the input voice signal is within the permissible error range of the change vector stored in the storage unit 25. When the change vectors between the phonemes of the input voice signal are judged to be within the set ranges of the control voice's change vectors stored in the storage unit 25, a match signal is output from the comparison/judgment circuit 26. The parts of the control voice discrimination circuit 2 other than the storage unit 25 can be implemented with a microprocessor 20.
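The range test applied to each change vector can be sketched as a per-component comparison. The Python below is a minimal illustration under stated assumptions: the stored value is taken as the signed vector (-300 Hz, -800 Hz) from the earlier /a/ to /o/ example, whereas the text quotes the range by magnitude, and the tolerance values standing in for the two sensitivity parameters are invented for the example.

```python
from typing import Sequence


def within_tolerance(observed: Sequence[float],
                     expected: Sequence[float],
                     tolerance: Sequence[float]) -> bool:
    """True when every component of `observed` lies within
    expected +/- tolerance for the matching component."""
    return all(abs(o - e) <= t
               for o, e, t in zip(observed, expected, tolerance))


# Stored change vector for /a/ to /o/ (signed form, an assumption here).
expected_a_to_o = (-300.0, -800.0)
# Hypothetical sensitivity settings in Hz; larger values accept more speakers
# but also more false triggers.
tolerance = (100.0, 200.0)

print(within_tolerance((-350.0, -700.0), expected_a_to_o, tolerance))  # True
print(within_tolerance((-450.0, -800.0), expected_a_to_o, tolerance))  # False
```

Widening the tolerances trades a lower rejection rate for a higher risk of accepting unrelated sounds, which matches the sensitivity adjustment the description mentions.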

(Operation) The operation is described below. When a voice signal is input to the formant extraction circuit 1, the circuit extracts the vector components of each input sound on the F1-F2 plane and generates a phoneme change signal each time the phoneme changes. In the control voice discrimination circuit 2, when the first sound is input, its vector components are first stored in the first vector holding circuit 22. When the second sound is input and a phoneme change signal is obtained, the vector components of the first sound held in the first vector holding circuit 22 are transferred to the second vector holding circuit 23, and the vector components of the second sound are stored in the first vector holding circuit 22. The change vector calculation circuit 24 then calculates the components of the change vector from the difference between the vector components held in the second vector holding circuit 23 and those held in the first vector holding circuit 22. The components of the change vector output by the change vector calculation circuit 24 are compared with the set range stored in the storage unit 25 to judge whether the change vector is within that range. This comparison is made in both the first comparison/judgment circuit 26a and the second comparison/judgment circuit 26b. If the control voice for generating the on control signal is /a, o, e/, the first comparison/judgment circuit 26a judges whether the output of the change vector calculation circuit 24 matches the change vector from /a/ to /o/, while the second comparison/judgment circuit 26b compares the output with the change vector from /e/ to /o/.

When the third sound is then input, the vector components of the second sound held in the first vector holding circuit 22 are transferred to the second vector holding circuit 23, the vector components of the third sound are stored in the first vector holding circuit 22, and the change vector calculation circuit 24 calculates the components of the change vector from the second sound to the third sound. Both comparison/judgment circuits 26a and 26b compare this change vector with the set range of the next change vector stored in the storage unit 25 to judge whether it is within range. The same operation is repeated until the input signal stops. When the change vectors of the input voice signal are confirmed to be within the set ranges stored in the storage unit 25 and in either forward or reverse order, a match signal is output from the corresponding comparison/judgment circuit 26a or 26b. A match signal from the first comparison/judgment circuit 26a causes the on control signal generation circuit 27 to output an on control signal, and a match signal from the second comparison/judgment circuit 26b causes the off control signal generation circuit 28 to output an off control signal. The on and off control signals are input to the switch element 3 and turn it on or off. Needless to say, when the input signal falls outside the set ranges stored in the storage unit 25, the switch element 3 keeps its previous state.
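The sequence of operations above can be modelled in software roughly as follows. This Python sketch is a hedged approximation, not the circuit itself: it processes a whole utterance at once, assumes the stored change vectors are signed, assumes the reverse-order comparison uses the stored vectors negated and read backwards, and uses invented tolerance values and an /e/ position derived from the ranges quoted in the text.

```python
from typing import List, Optional, Tuple

Vec = Tuple[float, float]  # (F1, F2) or a change vector, in Hz


def matches(change: Vec, stored: Vec, tol: Vec) -> bool:
    """Per-component range check, as done by circuits 26a and 26b."""
    return all(abs(c - s) <= t for c, s, t in zip(change, stored, tol))


def classify(utterance: List[Vec], template: List[Vec], tol: Vec) -> Optional[str]:
    """Return "on" for the control voice in forward order, "off" for
    reverse order, and None when neither order matches."""
    if not template or len(utterance) != len(template) + 1:
        return None
    # Reverse order: same path walked backwards, so each vector is negated.
    reverse = [(-dx, -dy) for dx, dy in reversed(template)]
    forward_ok = reverse_ok = True
    first_hold = utterance[0]             # first vector holding circuit 22
    for step, formants in enumerate(utterance[1:]):
        second_hold = first_hold          # circuit 23 takes the old content of 22
        first_hold = formants             # circuit 22 takes the new sound
        change = (first_hold[0] - second_hold[0],
                  first_hold[1] - second_hold[1])   # calculation circuit 24
        forward_ok = forward_ok and matches(change, template[step], tol)
        reverse_ok = reverse_ok and matches(change, reverse[step], tol)
    if forward_ok:
        return "on"
    if reverse_ok:
        return "off"
    return None


# Control voice /a, o, e/: change vectors /a/ to /o/ and /o/ to /e/ (signed).
template = [(-300.0, -800.0), (120.0, 1200.0)]
tol = (100.0, 200.0)  # hypothetical tolerances

a, o, e = (800.0, 1800.0), (500.0, 1000.0), (620.0, 2200.0)
print(classify([a, o, e], template, tol))  # on
print(classify([e, o, a], template, tol))  # off
print(classify([a, a, a], template, tol))  # None
```

Returning None corresponds to the case in which neither comparison circuit outputs a match signal and the switch element keeps its previous state.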

The control voice consists of two or more consecutive vowels, for example /a, o, e/. In this case the storage unit 25 stores (300 ± α1 Hz, 800 ± α2 Hz) and (120 ± α3 Hz, 1200 ± α4 Hz) as the change vectors from /a/ to /o/ and from /o/ to /e/, respectively. The values α1 to α4 are set as appropriate, and they adjust the voice recognition rate. Alternatively, the control voice may be a vowel sequence that starts from any sound in the vowel loop that cycles in the order /i/, /e/, /a/, /o/, /u/, /i/ and goes around the loop at least once. Because the reverse of such a control voice also goes around the vowel loop once, the direction in which the change vectors travel around the loop may be used to decide whether the switch element 3 is turned on or off.
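The text leaves open how the direction of travel around the vowel loop would be detected. One possible approach, sketched here in Python, is the shoelace (signed area) formula: the sign of the area enclosed by the formant trajectory on the F1-F2 plane tells whether the loop was traversed in the same sense as a reference loop. This technique is a suggestion of this sketch rather than something stated in the source, and the /i/ and /u/ positions below are invented placeholder values.

```python
from typing import List, Optional, Tuple

Point = Tuple[float, float]  # (F1, F2) in Hz


def signed_area(points: List[Point]) -> float:
    """Shoelace formula for the closed polygon through `points`.
    The sign depends on the direction of traversal."""
    total = 0.0
    for i, (x0, y0) in enumerate(points):
        x1, y1 = points[(i + 1) % len(points)]
        total += x0 * y1 - x1 * y0
    return total / 2.0


def loop_command(utterance: List[Point], reference: List[Point]) -> Optional[str]:
    """"on" if the utterance circles in the same sense as the reference
    loop, "off" if in the opposite sense, None if it encloses no area."""
    product = signed_area(utterance) * signed_area(reference)
    if product > 0:
        return "on"
    if product < 0:
        return "off"
    return None


# Reference loop /i/ /e/ /a/ /o/ /u/; /i/ and /u/ are assumed positions.
i_, e_, a_, o_, u_ = ((300.0, 2500.0), (620.0, 2200.0), (800.0, 1800.0),
                      (500.0, 1000.0), (350.0, 1300.0))
reference_loop = [i_, e_, a_, o_, u_]

# Same loop started at /a/ by a speaker whose vowels are all shifted.
spoken = [(f1 + 40.0, f2 - 100.0) for f1, f2 in (a_, o_, u_, i_, e_)]
print(loop_command(spoken, reference_loop))        # on
print(loop_command(spoken[::-1], reference_loop))  # off
```

Because the signed area is unaffected by translation and by the choice of starting vowel, this check fits the loop variant of the control voice, although on its own it does not verify that each change vector falls within its tolerance range.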

In the embodiment above, vowels are detected using vectors in a two-dimensional space whose components are the first formant F1 and the second formant F2, but higher-order formants from the third formant F3 upward may also be used as vector components so that vowels are judged using vectors in a space of three or more dimensions. Also, the embodiment above uses a separate microprocessor in each of the formant extraction circuit 1 and the control voice discrimination circuit 2, but the two circuits 1 and 2 may share a single microprocessor.

[Effects of the Invention] As described above, the present invention comprises a formant extraction circuit that extracts at least the first and second formants from an input voice signal, a control voice discrimination circuit that outputs a control signal when the formant changes between the vowels of a control voice composed of consecutive vowels occur in a predetermined order and with amounts of change within predetermined ranges, and a switch element that is opened and closed by the control signal. The control voice discrimination circuit includes an on control signal generation circuit that outputs an on control signal and turns the switch element on when the control voice is input in forward order, and an off control signal generation circuit that outputs an off control signal and turns the switch element off when the control voice is input in reverse order. Formants, the dominant frequency components that characterize the vowels in speech, are extracted, and whether to actuate the switch element is decided from the movement of the voice vector in the vector space formed by several formants. Voice can therefore be recognized simply by detecting the changes in vowel formants, which gives the advantage that, although unspecified speakers are targeted, the amount of computation is small and voice is recognized reliably. Furthermore, because the on control signal is generated for forward-order input and the off control signal for reverse-order input, the change vectors of the on and off control voices never resemble each other in the way that, for example, the change vector from /e/ to /a/ resembles that from /u/ to /o/. The on and off control voices can be completely distinguished, which gives the advantage that on control and off control are not confused.

[Brief Description of the Drawings]

FIG. 1 is a block diagram showing an embodiment of the present invention. FIG. 2 is a block diagram of the formant extraction circuit used in that embodiment. FIG. 3 is a block diagram of the control voice discrimination circuit used in that embodiment. FIG. 4 is an explanatory diagram showing an example of an F1-F2 plot. FIG. 5 is an explanatory diagram showing an example of the frequency characteristics of a vowel. FIG. 6 is a block diagram of a conventional example. FIG. 7 is a block diagram of another conventional example. 1 is the formant extraction circuit, 2 is the control voice discrimination circuit, 3 is the switch element, 27 is the on control signal generation circuit, and 28 is the off control signal generation circuit.

Claims

[Claims]

(1) A voice response switch comprising: a formant extraction circuit that extracts at least a first formant and a second formant from an input voice signal; a control voice discrimination circuit that outputs a control signal when the formant changes between the vowels of a control voice composed of consecutive vowels are in a predetermined order and the amounts of change are within predetermined ranges; and a switch element that is opened and closed by the control signal, characterized in that the control voice discrimination circuit includes an on control signal generation section that outputs an on control signal to turn the switch element on when the control voice is input in forward order, and an off control signal generation circuit that outputs an off control signal to turn the switch element off when the control voice is input in reverse order.