JPS61246796A

JPS61246796A - Voice response switch

Info

Publication number: JPS61246796A
Application number: JP60089371A
Authority: JP
Inventors: 博昭竹山; 仁深川; 清隆竹原; 安一杵川
Original assignee: Matsushita Electric Works Ltd
Current assignee: Panasonic Electric Works Co Ltd
Priority date: 1985-04-24
Filing date: 1985-04-24
Publication date: 1986-11-04
Also published as: JPH0562757B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】［技術分野Ｊ本発明は音声応答スイッチ、さらに詳しくは、人の音声
を認識して作動する音声応答スイッチに関するものであ
る。DETAILED DESCRIPTION OF THE INVENTION [Technical Field J] The present invention relates to a voice response switch, and more particularly to a voice response switch that operates by recognizing human voice.

［背景技術Ｊ従来上り音声応答スイッチとしては、第６図に示すよう
に、音声に相当する周波数帯域の人カイロ号を通過させ
るフィルタ回路１１と、フィルタ回路１１の出力レベル
を検出するレベル検出回路１２と、レベル検出回路１２
の出力を予め設定された参照値と比較しレベル検出回路
１２の出力が参照値以上であるときに制御信号を出力す
る制御回路１３と、制御信号により閉成されるスイッチ
要素１４とから構成されており、制御回路１３への入力
レベルが参照値以上であるときにフィルタ回路１１への
入力信号が音声信号であると判断するようになったもの
が提供されている。[Background Art J] As shown in FIG. 6, a conventional upstream voice response switch includes a filter circuit 11 that passes a human body signal in a frequency band corresponding to voice, and a level detection circuit that detects the output level of the filter circuit 11. 12 and level detection circuit 12
The control circuit 13 compares the output of the level detection circuit 12 with a preset reference value and outputs a control signal when the output of the level detection circuit 12 is equal to or higher than the reference value, and a switch element 14 that is closed by the control signal. A device is provided in which it is determined that the input signal to the filter circuit 11 is an audio signal when the input level to the control circuit 13 is equal to or higher than a reference value.

この回路構成においては、特定の周波数帯域のレベル判
定のみで音声であるかどうかを判別しているものである
から、フィルタ回路１１を通過できる帯域の周波数成分
を持ちかつ参照値よりも高いレベルの入力信号であれば
音声ではない雑音であってもスイッチ要素１４が作動す
ることになり、誤動作を生じるという問題がある。また
音声が入力されている場合でも、それがスイッチ要素１
４を作動させる目的で発せられた音声であるかどうかに
かかわらずスイッチ要素１４が作動するから、スイッチ
要素１４の作動を希望しないときスイッチ要素１４が作
動することがあるという不都合が生じるものである。In this circuit configuration, it is determined whether or not it is a voice only by determining the level of a specific frequency band. If it is an input signal, the switch element 14 will be activated even if it is a noise that is not a voice, resulting in a problem of malfunction. Also, even if audio is input, it will be switched to switch element 1.
Since the switch element 14 is actuated regardless of whether the sound is emitted for the purpose of actuating the switch element 4, there is an inconvenience that the switch element 14 may be actuated when the switch element 14 is not desired to be actuated. .

このため、第７図に示すように、音声認識装置１５を用
い、記憶部１６に記憶された制御音声と入力音声とを比
較し、両者が一致したときにスイッチ要素３を開閉させ
るものが考えられているが、不特定話者を対象とする場
合には、音声認識のための演算処理に長い時間が必要と
なり実時間でスイッチ要素１４を制御することが困難で
あるという問題があり、しかも現在の技術レベルでは一
般に認識率が低く誤動作しやすいという問題がある。For this reason, as shown in FIG. 7, one idea is to use a voice recognition device 15 to compare the control voice stored in the storage unit 16 with the input voice, and to open or close the switch element 3 when the two match. However, when targeting unspecified speakers, there is a problem that a long time is required for arithmetic processing for voice recognition, making it difficult to control the switch element 14 in real time. The problem with the current level of technology is that the recognition rate is generally low and malfunctions are likely to occur.

そして、ｇａ率を高めるには情報量と計算量が多くなる
ものであるから一層処理時間が遅れるという大魚がある
。これに対して特定話者を対象とする場合には、予め使
用者が自分の声を登録する必要があり、使用までの作業
が面倒である。Another big problem is that increasing the GA rate requires an increase in the amount of information and calculations, which further delays processing time. On the other hand, when targeting a specific speaker, the user needs to register his/her own voice in advance, which makes the process of using the system cumbersome.

［発明の目的］本発明は上述の点に鑑みて為されたものであって、その
主な目的とするところは、音声のうちの母音を特徴づけ
ている優勢な周波数成分であるフォルマントを抽出し、
複数の７ｔルマントにより形成されたベクトル空間にお
ける音声ベクトルの移動によりスイッチ要素を作動させ
るかどうかを判別するようにして、実時間で動作可能で
認識率が高（、しかも不特定話者用を対象とした音声応
答スイッチを提供することにある。[Object of the Invention] The present invention has been made in view of the above points, and its main purpose is to extract formants, which are dominant frequency components that characterize vowels in speech. death,
The system determines whether or not to activate a switch element based on the movement of the speech vector in a vector space formed by multiple 7t speech vectors, which can operate in real time and has a high recognition rate (and can be used by any specific speaker). The purpose of the present invention is to provide a voice-responsive switch.

［発明の開示ｌ第５図は母音のスペクトルの一例を示すものであって、
母音を特徴づける優勢な周波数成分、すなわち、スペク
トルのピーク部分の周波数成分がフォルマントと呼ばれ
る。母音には普通複数のフォルマントが存在し、周波数
の低いほうから順に第１７オルマン）Ｆ、、Ｊ２７オル
マン）Ｆ２．１３フォルマントＦ　３　、・・・・・・
と呼ばれる。これらの７オｌレマントのうち第１７オル
マン）　Ｆ、　と＃２７オルマン）Ｆ２との寄与率がも
っとも高く、第１フォルマントＦＩと＠２７オルマン）
Ｆ２とを用いレバかなり高い確度で母音を決定できるも
のである。[Disclosure of the Invention I Figure 5 shows an example of a vowel spectrum,
The dominant frequency component that characterizes a vowel, that is, the frequency component at the peak of the spectrum, is called a formant. Vowels usually have multiple formants, starting with the lowest frequency (17th orman) F,, J27 orman) F2, 13th formant F 3, etc.
It is called. Among these 7 oremans, the contribution rate of the 17th orman) F, and #27 orman) F2 is the highest, and the 1st formant FI and @27 orman)
F2 can be used to determine vowels with a fairly high degree of accuracy.

ここで第１フォルマントＦ１を横軸にとり、第２フォル
マントＦ２を縦軸にとったＰＩＦ２図上に日本語の母音
である／ａ／／　ｉ／／ｕ／／ｅ／１０／をベクトルと
して示すと、各母音は第４図の破線で示す範囲で表わさ
れる。フォルマントは各個人によりかなり変動するもの
であって、各母音を表わす範囲はかなりの部分で重複す
るものであるが、一般に同一環境で同一人物の発した５
母音のフォルマントはＰＩＦ２図上において略５角形と
なり、環境が変化したり、発話者が代わっても５母音の
相対的位置関係、すなわちこの５角形の形状を保持した
ままで平行移動することが知られている。したがって、
母音が変化したときの相対位置、すなわち変化ベクトル
は環境や発話者がかわっても略一定になる。つまり、母
音／ａ／の成分を（８００Ｈｚｅ１８００Ｈｚ）とし、
母音１０／の成分を（５００Ｈｚ。Here, if the Japanese vowel /a//i//u//e/10/ is shown as a vector on the PIF2 diagram with the first formant F1 on the horizontal axis and the second formant F2 on the vertical axis, , each vowel is represented by the range shown by the broken line in FIG. Formants vary considerably depending on each individual, and the ranges representing each vowel overlap to a large extent, but in general, the five vowels uttered by the same person in the same environment
The formants of vowels are approximately pentagonal on the PIF2 diagram, and it is known that even if the environment changes or the speaker changes, the relative positional relationship of the five vowels, that is, the pentagonal shape, will be maintained and will move in parallel. It is being therefore,
The relative position when a vowel changes, that is, the change vector, remains approximately constant even if the environment or speaker changes. In other words, the vowel /a/ component is (800Hz 1800Hz),
The component of vowel 10/ (500Hz.

１０００　Ｈｚ）とすると、／ａ／から１０／への変化
ベクトルの成分は（−３００Ｈｚ、−８００Ｈｚ）とな
り、変化ベクトルの成分は環境や発話者が異なっていて
も略一定になるのである。しかして、本発明においては
、複数の母音を入力して母音の変化ベクトルが検出され
るとスイッチ要素が作動する音声応答スイッチを開示す
る。なお、以下の説明においては、第１フォルマントＦ
１と第２フォルマントＦ２とを使用して音声の認識を行
なっているが、さらに認識率を高めるために、第３フォ
ルマントＦ、を用いてもよい。この場合第３フォルマン
トＦ、を第３輪としてＦ　＋　−Ｆ　ｚ　　Ｆ　３空間
上での各母音のフォルマントを表わすことにより、重複
部分を形成せずに空間上で各母音のフォルマントを表わ
すことができるものである。1000 Hz), the components of the change vector from /a/ to 10/ are (-300Hz, -800Hz), and the components of the change vector remain approximately constant even if the environment or speaker is different. Accordingly, the present invention discloses a voice response switch in which a switch element is activated when a plurality of vowels are input and a vowel change vector is detected. In addition, in the following explanation, the first formant F
1 and the second formant F2 are used to perform voice recognition, but the third formant F may also be used to further increase the recognition rate. In this case, by representing the formant of each vowel on the F + -F z F 3 space using the third formant F as the third wheel, it is possible to represent the formant of each vowel on the space without forming overlapping parts. It is possible.

（実施例）第１図に示すように、入力信号はフォルマント抽出回路
１に入力され第１７オルマン）　Ｆ　ｌと第２フォルマ
ントＦ２とが抽出される。フォルマント抽出回路１の出
力は制御音声判別回路２に入力され、予め設定された制
御音声と一致すると制御信号が出力されるようになって
いる。制御音声判別回路２の出力はスイッチ要素３に入
力され、スイッチ要素３に制御信号が入力されるとスイ
ッチ要素３が開閉される。(Embodiment) As shown in FIG. 1, an input signal is input to a formant extraction circuit 1, and a 17th Orman F1 and a second formant F2 are extracted. The output of the formant extraction circuit 1 is input to a control voice discrimination circuit 2, and when it matches a preset control voice, a control signal is output. The output of the control voice discrimination circuit 2 is input to the switch element 3, and when a control signal is input to the switch element 3, the switch element 3 is opened or closed.

第２図にフォルマント抽出回路１の一例を示す。FIG. 2 shows an example of the formant extraction circuit 1.

フォルマント抽出回路１はそれぞれ２００　Ｈｚの帯域
中を有し通過周波数が互いに異なる多数の帯域フィルタ
群１１．〜１１ｎと、各帯域フィルタ１１、〜ｌｌｎの
出力信号をデジタル信号に変換するアナログ／デジタル
変換回路１２と、各帯域フィルタ１１．〜ｔｉｎの出力
レベル値からフォルマントヲ検出するマイクロプロセッ
サよりなる演算回路１３とから構成される。帯域フィル
タ１１１〜ｆｉｎはそれぞ・れＯ−２００Ｈｚ、　　２
００−４００Ｈｚ、　４００〜６００　Ｈｚ、−１２２
００−２４００Ｈｚ、・・・・・・と通過周波数帯域が
互いに異なるとともに、全帯域フィルタ１１．−ｆｉｎ
によって音声帯域の全周波数が含まれるように設定され
ている。演算回路１３は第１フォルマントＦＩと第２７
オルマン）Ｆ２とを検出するとともに、入力音声が変化
したかどうかを判定する音韻変化信号を出力する。なお
、フォルマントの検出は回路構成によってハード的に行
なっているが、線形予測法などのソフト的な手法を用い
て行なってもよい。The formant extraction circuit 1 includes a large number of band filter groups 11, each having a band of 200 Hz and having different pass frequencies. ~lln, each bandpass filter 11, an analog/digital conversion circuit 12 that converts the output signal of ~lln into a digital signal, and each bandpass filter 11. . . . an arithmetic circuit 13 consisting of a microprocessor that detects formants from the output level values of .about.tin. The bandpass filters 111 to fin each have a frequency of O-200Hz, 2
00-400Hz, 400-600Hz, -122
00-2400Hz, . -fin
It is set to include all frequencies in the audio band. The arithmetic circuit 13 has the first formant FI and the 27th formant FI.
Olman) F2 and outputs a phoneme change signal for determining whether the input voice has changed. Note that formant detection is performed using hardware using a circuit configuration, but it may also be performed using a software method such as a linear prediction method.

第３図は制御音声判別回路２の一例を示すものであって
、制御音声判別回路２は、音韻変化信号が入力されると
フォルマントを記憶する第１ベクトル保持回路２２と、
音韻変化信号が入力されると第１ベクトル保持回路２２
に記憶されていたフォルマントを記憶する第２ベクトル
保持回路２３と、第１ペクシル保持回路２２に記憶され
たフォルマントから第２ベクトル保持回路２３に記憶さ
れた７ｔルマントを減算することにより変化ベクトルを
算出する変化ベクトル算出回路２４と、任意の３母音を
所定の順序で並べたときの各２母音間での変化ベクトル
の範囲が記憶された記憶部２５と、変化ベクトル算出回
路２４の出力値と記憶部２５に記憶された設定値とを比
較して変化ベクトル算出回路２４の出力値が記憶部２５
に格納された設定範囲内であるときに一致信号を出力す
る比較判定回路２６と、一致信号が連続して入力される
と制御信号を出力する制御信号発生回路２７とから構成
される。制御音声判別回路２では音韻変化信号が制御音
声判別回路２に入力されるたびに入力信号の変化ベクト
ルが記憶部２５に記憶された設定範囲に属するかどうか
が判定される。そして入力信号の各音韻間の変化ベクト
ルが記憶部２５に記憶された制御音声の変化ベクトルの
設定範囲内であると判定されると、比較判定回路２６か
ら一致信号が出力されるのである。なお、制御音声判別
回路２の記憶部２５を除く部分に関してはマイクロプロ
セッサ２０を用いて構成される。FIG. 3 shows an example of the control speech discrimination circuit 2, which includes a first vector holding circuit 22 that stores formants when a phoneme change signal is input;
When the phoneme change signal is input, the first vector holding circuit 22
A change vector is calculated by subtracting the 7t formant stored in the second vector holding circuit 23 from the formant stored in the first pexyl holding circuit 22. a change vector calculation circuit 24, a storage unit 25 that stores the range of change vectors between each two vowels when three arbitrary vowels are arranged in a predetermined order, and an output value and storage of the change vector calculation circuit 24. The output value of the change vector calculation circuit 24 is compared with the setting value stored in the storage unit 25.
The comparison determination circuit 26 outputs a match signal when the match signal is within a set range stored in the 1, and the control signal generating circuit 27 outputs a control signal when the match signal is continuously input. The control speech discrimination circuit 2 determines whether the change vector of the input signal belongs to the set range stored in the storage section 25 each time a phoneme change signal is input to the control speech discrimination circuit 2. When it is determined that the change vector between each phoneme of the input signal is within the set range of the change vector of the control voice stored in the storage section 25, a match signal is output from the comparison determination circuit 26. Note that the parts of the control voice discriminating circuit 2 except for the storage section 25 are configured using a microprocessor 20.

（動作）以下、動作を説明する０例えば、制御信号を出力するよ
うに設定された制御音声が３母音／ａ／１０／／ｅ／を
順に並べて構成されているとし、記憶部２５には／、／
から１０／への変化ベクトルの範囲として（３００±α
、Ｈｚ、８００±ＱｍＨｚ）、１０／がら／ｅ／への変
化ベクトルの範囲として（１２０±Ｑ　３　ＨＺ　ｗ１
２００±α、　Ｈｚ）が設定されているものとする。(Operation) The operation will be explained below.0 For example, assume that a control voice set to output a control signal is composed of three vowels /a/10//e/ arranged in order, and the storage unit 25 stores / ,/
As the range of change vector from to 10/(300±α
, Hz, 800±QmHz), as the range of the change vector to 10/gara/e/(120±Q 3 Hz w1
200±α, Hz) is set.

ここでａ１〜ａ、の値を適宜設定することにより感度が
調節される。さて、いま母音／ａ／１０／／ｅ／が第１
音声、Ｉｊｉ２音声、第３音声と゛して連続して入力さ
れたものとすると、フォルマント抽出回路１では各音声
のＦ　＋　−Ｆ　２平面上でのベクトル成分がそれぞれ
検出されるとともに、母音の変化時点でそれぞれ音韻変
化信号が発生する。制御音声判別回路２では、第１音声
が入力された時点でまず第１音声のべりＦル成分を第１
ベクトル保持回路２２に記憶する０次に第２音声が入力
され音韻変化信号が得られると、第１ベクトル保持回路
２２に記憶されていた第１音声のベクトル成分が第２ベ
クトル保持回路２３に入力されるとともに、第１ベクト
ル保持回路２２には第２音声のベクトル成分が記憶され
る。このとき変化ベクトル算出回路２４では第２ベクト
ル保持回路２３に記憶されたベクトル成分と第１ベクト
ル保持回路２２に記憶されたベクトル成分との変化量か
ら変化ベクトルの成分が算出される。記憶部２５には／
ａ／から１０／への変化ベクトルの成分として（３００
±ＣＩ　、　Ｈｚ、　８００±ｆｆ２Ｈｚ）が記憶され
ているから、比較判定回路２６では変化ベクトル算出回
路２４の出力が記憶部２５に記憶されたこの設定範囲内
にあるかどうかが比較され、変化ベクトル算出回路２４
の出力値が記憶部２５の設定範囲内であると判定される
と、入力信号が八／から１０／に変化したものと判断さ
れるのである０次に第３音声が入力されると、第１ベク
トル保持回路２２に記憶されていた第２音声のベクトル
成分が第２ベクトル保持回路２３に入力されるとともに
、第３音声のベクトル成分が＃１ベクトル保持回路２２
に記憶され、変化ベクトル算出回路２４では第２ベクト
ル保持回路２３に記憶された第２音声から第１ベクトル
保持回路２２に記憶された第３音声への変化ベクトルの
成分が算出される。記憶部２５には１０／から／ｅ／へ
の変化ベクトルの成分として（１２０±α、。Here, the sensitivity is adjusted by appropriately setting the values of a1 to a. Now, the vowel /a/10//e/ is the first
Assuming that voice, Iji2 voice, and 3rd voice are input consecutively, the formant extraction circuit 1 detects the vector components of each voice on the F + -F2 plane, and also detects changes in vowels. A phonological change signal is generated at each point in time. At the time when the first voice is input, the control voice discriminating circuit 2 first converts the slip F component of the first voice into the first voice.
When the zero-order second voice stored in the vector holding circuit 22 is input and a phoneme change signal is obtained, the vector component of the first voice stored in the first vector holding circuit 22 is input to the second vector holding circuit 23. At the same time, the vector component of the second voice is stored in the first vector holding circuit 22. At this time, the change vector calculation circuit 24 calculates the component of the change vector from the amount of change between the vector component stored in the second vector holding circuit 23 and the vector component stored in the first vector holding circuit 22. The storage unit 25 has /
As a component of the change vector from a/ to 10/ (300
±CI, Hz, 800±ff2Hz) is stored, so the comparison/judgment circuit 26 compares whether the output of the change vector calculation circuit 24 is within this setting range stored in the storage section 25, and calculates the change vector. Calculation circuit 24
If it is determined that the output value of is within the setting range of the storage unit 25, it is determined that the input signal has changed from 8/ to 10/. The vector component of the second voice stored in the #1 vector holding circuit 22 is input to the second vector holding circuit 23, and the vector component of the third voice is input to the #1 vector holding circuit 22.
The change vector calculation circuit 24 calculates the component of the change vector from the second voice stored in the second vector holding circuit 23 to the third voice stored in the first vector holding circuit 22. The storage unit 25 stores (120±α,) as a component of the change vector from 10/ to /e/.

１２００±ａ４）が記憶されているから、比較判定回路
２６ではこの設定範囲と変化ベクトル算出回路２４の出
力値とが比較され、変化ベクトル算出回路２４の出力値
が記憶部２５の設定範囲内であると判定されると、入力
信号が１０／から／ｅ／に変化したことを認識するので
ある。以上のようにして／ａ／から１０／への変化と１
０／から／ｅ／への変化が連続して検出されると、比較
判定回路２６では一致信号を出力し、制御信号発生回路
２７では一致信号を受けて制御信号を出力するのである
。制御信号はスイッチ要素３に入力されスイッチ要素３
が開閉されるのである。入力信号が記憶部２５に設定さ
れた設定範囲とは異なるときにはスイッチ要素３はそれ
までの状態を保つ。1200±a4) is stored, the comparison/judgment circuit 26 compares this setting range with the output value of the change vector calculation circuit 24, and determines whether the output value of the change vector calculation circuit 24 is within the setting range of the storage section 25. If it is determined that there is, it is recognized that the input signal has changed from 10/ to /e/. As mentioned above, the change from /a/ to 10/ and 1
When a change from 0/ to /e/ is detected continuously, the comparison/judgment circuit 26 outputs a coincidence signal, and the control signal generation circuit 27 receives the coincidence signal and outputs a control signal. The control signal is input to the switch element 3 and the control signal is input to the switch element 3.
is opened and closed. When the input signal differs from the setting range set in the storage section 25, the switch element 3 maintains its previous state.

上述の実施例において連続した３母音を検出したときに
スイッチ要素３を開閉するようになっていたが、３母音
に限定されるものではない。また母音を検出するために
第１フォルマントＦ１と第２フォルマントＦ２とをベク
トル成分として２次元空間でのベクトルを用いたが、第
３７オルマン）Ｆｓ以上の高次フォルマントもベクトル
成分として用いることにより３次元以上の多次元空間で
のベクトルを用いて母音の判定を行なうようにしてもよ
い、さらに、上述の実施例ではフォルマント抽出回路１
と制御音声判別回路２とにそれぞれマイクロプロセッサ
を用いた例を示したが、両回路１，２のマイクロプロセ
ッサを共有化して１つにしてもよい。In the above embodiment, the switch element 3 is opened and closed when three consecutive vowels are detected, but the present invention is not limited to three vowels. In addition, in order to detect a vowel, a vector in a two-dimensional space was used with the first formant F1 and the second formant F2 as vector components. Vowels may be determined using vectors in a multidimensional space of more than 3 dimensions.Furthermore, in the above embodiment, the formant extraction circuit 1
Although an example has been shown in which microprocessors are used for each of the circuits 1 and 2, the microprocessors for both circuits 1 and 2 may be shared and integrated into one.

［発明の効果］本発明は上述のように、入力音声から少なくとも第１フ
ォルマントと第２フォルマントとを抽出するフォルマン
ト抽出回路と、２音以上の連続する母音から構成された
制御音声の各母音間のフォルマントの変化分が所定の設
定範囲内であるときに制御信号を出力する制御音声判別
回路と、制御信号により開閉されるスイッチ要素とから
構成されているので、音声のうちの母音を特徴づけてい
る優勢な周波数成分であるフォルマントを抽出し、複数
のフォルマントにより形成されたベクトル空間における
音声ベクトルの移動によりスイッチ要素を作動させるか
どうかを判別するようにした結果、母音のフォルマント
の変化のみを検出すればよく、計算量が少なくかつ音声
の認識を確実に行なうことができるものであり、実時間
での動作が可能で認識率が高いという利点を有する。ま
た、フォルマントの変化分で音声を認識するから、不特
定話者に対して動作可能であるという利点を有するもの
である。[Effects of the Invention] As described above, the present invention includes a formant extraction circuit that extracts at least a first formant and a second formant from input speech, and a formant extraction circuit that extracts at least a first formant and a second formant from an input speech, and a formant extraction circuit that extracts at least a first formant and a second formant from an input speech, and a It consists of a control speech discrimination circuit that outputs a control signal when the change in formant is within a predetermined setting range, and a switch element that is opened and closed by the control signal, so it can characterize vowels in speech. By extracting the formant, which is the dominant frequency component of the vowel, and determining whether to activate the switch element by moving the speech vector in the vector space formed by multiple formants, we can detect only the change in the formant of the vowel. It only needs to be detected, the amount of calculation is small, and speech recognition can be performed reliably, and it has the advantage of being able to operate in real time and having a high recognition rate. Furthermore, since speech is recognized based on changes in formants, it has the advantage of being operable for unspecified speakers.

[Brief explanation of the drawing]

第１図は本発明の一実施例を示すブロック図、第２図は
同上に使用するフォルマント抽出回路を示すブロック図
、第３図は同上に使用する制御音声判別回路を示すブロ
ック図、第４図はＰＩＦ２図の一例を示す動作説明図、
第５図は母音の周波数特性の一例を示す動作説明図、第
６図は従来例を示すブロック図、第７図は他の従来例を
示すブロック図である。１はフォルマント抽出回路、２は制御音声判別血路、３
はスイッチ要素である。FIG. 1 is a block diagram showing an embodiment of the present invention, FIG. 2 is a block diagram showing a formant extraction circuit used in the above, FIG. 3 is a block diagram showing a control speech discrimination circuit used in the same, and FIG. The figure is an operation explanatory diagram showing an example of PIF2 diagram,
FIG. 5 is an operation explanatory diagram showing an example of vowel frequency characteristics, FIG. 6 is a block diagram showing a conventional example, and FIG. 7 is a block diagram showing another conventional example. 1 is a formant extraction circuit, 2 is a control voice discrimination circuit, and 3 is a formant extraction circuit.
is a switch element.

Claims

[Claims]

(1) At least the first formant and the second formant from the input voice
a formant extraction circuit that extracts a formant;
A control voice discrimination circuit that outputs a control signal when the change in formant between each vowel of a control voice composed of continuous vowels of more than one vowel is within a predetermined setting range, and a switch element that is opened and closed by the control signal. A voice response switch comprising: