JPS5945160B2

JPS5945160B2 - Voice identification method

Info

Publication number: JPS5945160B2
Application number: JP1714377A
Authority: JP
Inventors: 煕康舟久保; 正孝芝
Original assignee: Individual
Current assignee: Individual
Priority date: 1977-02-21
Filing date: 1977-02-21
Publication date: 1984-11-05
Also published as: JPS53102603A

Description

【発明の詳細な説明】この発明は音声信号をコード化することによつて動力義
手やマニピュレータ等の機械装置または電子装置差制御
するための音声識別方法に関するものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a voice identification method for differentially controlling mechanical or electronic devices such as powered prosthetic hands and manipulators by encoding voice signals.

従来におけるこの種の方法としては、発呼者からの種々
なる音声信号を夫々周波数分析し、夫々の音声信号をパ
ターンとして記障し、次回発呼者から発声された音声信
号を同じく周波数分析して上記したパターンと比較し、
如何なる音声信号であるか否かを判別するものであつた
。Conventionally, this type of method analyzes the frequency of various voice signals from the caller, records each voice signal as a pattern, and then analyzes the frequency of the voice signal uttered by the caller the next time. and compared with the above pattern,
The purpose was to determine what kind of audio signal it is.

しかしこの方法によるものにあつて（大発呼者の音声信
号が、印こよつてあるいは時間によつて異なること、す
なわち各音節の間隔が違つたりあるいは発音が違つたり
してパターン比較が非常に困難であつた。また装置も大
型のものとなりコスト的にも高いものとなつた。この発
明は叙上の点に鑑みて成されたもので、その第１の目的
は、気管外壁にマイクを設け、通常の発声あるいはハミ
ングであつても情報を確認することが可能である音声識
別方法を提供するにある。However, with this method (the voice signals of large callers vary depending on the sign or time, i.e., the intervals between each syllable are different or the pronunciation is different), it is difficult to compare patterns. In addition, the device was large and the cost was high.This invention was made in view of the above points, and its first purpose was to attach a microphone to the outer wall of the trachea. An object of the present invention is to provide a voice identification method that allows information to be confirmed even in normal vocalization or humming.

この発明の第２の目的は、音節の数と、この音節の高低
変化パターンを類別することによつて識別するので、高
い認識率を得ることができる音声識別方法を提供するに
ある。A second object of the present invention is to provide a speech identification method that can obtain a high recognition rate because the identification is performed by classifying the number of syllables and the pitch change pattern of the syllables.

次にこの発明の一実施例を図面と共に説明する。Next, one embodiment of the present invention will be described with reference to the drawings.

第１図は全体のブロック図である。Ａは音声検出回路に
して、発呼者よりの音声信号をパルス信号に変換する回
路であつて、１は音声発呼者の気管外壁に設けたマイク
、２は５０〜２５０Ｈｚを通過させるフィルタで、音声
振動の基本周波数であり、発声時の音階を表わすパラメ
ータであるピッチ成分を抽出する。なおこのフィルタ２
は増幅器をも包含している。３はフィルタ２で抽出され
たサイン波を矩形波に変換すると共にヒステリシス特性
により、さらにピッチ成分以外の波を除去するシユミツ
トトリガ回路である。FIG. 1 is an overall block diagram. A is a voice detection circuit that converts the voice signal from the caller into a pulse signal, 1 is a microphone installed on the outer wall of the voice caller's trachea, and 2 is a filter that passes 50 to 250 Hz. , the pitch component, which is the fundamental frequency of voice vibration and is a parameter representing the scale at the time of vocalization, is extracted. Note that this filter 2
also includes amplifiers. Reference numeral 3 denotes a Schmitt trigger circuit that converts the sine wave extracted by the filter 2 into a rectangular wave and further removes waves other than pitch components using hysteresis characteristics.

Ｂは選択されたクロックにより、シユミツトトリガ回路
３より出力される矩形波の周期を測定するためのカウン
ト回路にして、４はシユミツトトリガ回路３よりのパル
ス信号の入力制御と、クロツクの選択制御を行うゲート
回路、５は上記音声検出回路Ａよりの信号の間クロツク
パルス発生回路６よりのクロツクパルスを夫々計数する
カウンタ、６は例えば１０ＫＨｚと３００Ｈｚのクロツ
クパルスを発信するクロツクパルス発生回路である。B is a count circuit for measuring the period of the rectangular wave output from the Schmitt Trigger circuit 3 using a selected clock, and 4 is a gate for controlling the input of the pulse signal from the Schmitt trigger circuit 3 and controlling the selection of the clock. The circuit 5 is a counter that counts the clock pulses from the clock pulse generating circuit 6 during the signal from the audio detection circuit A, and the numeral 6 is a clock pulse generating circuit that generates clock pulses of, for example, 10 KHz and 300 Hz.

Ｃは上記カウント回路Ｂよりの各クロツクパルス数を記
憶し、中間値である代表値を決定する代表値決定回路に
して、７は各クロツクパルス数を記憶するラツチ回路、
８は各ラツチ回路７よりのクロツク数を比較し、中間値
である代表値を決定する比較器である。Ｄは各音節毎の
代表値を記憶し、各音節の高低を検出する変化判定回路
にして、９は各音節毎の代表値を記憶するラツチ回路、
１０は各ラツチ回路９よりのクロツク数を比較し、クロ
ツク数の変イＬすなわち音節の高低を検出する比較器で
ある。C is a representative value determining circuit that stores each clock pulse number from the counting circuit B and determines a representative value that is an intermediate value; 7 is a latch circuit that stores each clock pulse number;
A comparator 8 compares the number of clocks from each latch circuit 7 and determines a representative value which is an intermediate value. D is a change determination circuit that stores the representative value of each syllable and detects the pitch of each syllable; 9 is a latch circuit that stores the representative value of each syllable;
Reference numeral 10 denotes a comparator which compares the number of clocks from each latch circuit 9 and detects a change L in the number of clocks, that is, the height of a syllable.

Ｅは前記した代表値決定回路Ｃのラツチ回路７よりの出
力で音節の区切れを検出する音節区切れ処理回路にして
、１１はラツチ回路７よりの出力と了じめ定められた値
とを比較する比較器である。Ｆは上記Ａ−Ｅの各プロツ
クに対しパルス信号を与えシーケンス制御するシステム
制御部であも次に詳細を第２図以下に基いてタイミング
チヤート図と共に説明する。今第７図における音声入力
包絡線のように、３つの音節（例えば「つ」「か」
「め」）を発呼者が発声したとすると、マイク１におい
てこの情報を気管外壁（喉仏の真下であつて比較的高周
波成分や、声道特性の影響の少ない部分）でキャツチし
、５０〜２５０Ｈｚを通過させるフイルタ２でピツチ成
分を抽出すると共に増幅する。E is a syllable break processing circuit that detects syllable breaks using the output from the latch circuit 7 of the representative value determining circuit C, and 11 is a syllable break processing circuit that detects syllable breaks using the output from the latch circuit 7 of the representative value determination circuit C. It is a comparator for comparison. Reference numeral F denotes a system control section which applies pulse signals to each of the above-mentioned programs A to E to perform sequence control.The details will be explained below with reference to FIG. 2 and timing charts. As shown in the speech input envelope in Figure 7, three syllables (for example, ``tsu'' and ``ka'')
When the caller utters "me"), microphone 1 captures this information on the outer wall of the trachea (the part directly below the Adam's apple and is relatively unaffected by high-frequency components and vocal tract characteristics), and captures this information from 50 to A filter 2 that passes 250 Hz extracts the pitch component and amplifies it.

抽出したサイン波を次段のシユミツトトリガ回路３で矩
形波に変喚する。尚上記フイルタ２は発呼者が女性の場
合、高周波側にずらす必要がある。また上記矩形波はシ
ユミツトトリガ回路３のもつヒステリシス特性により、
さらにピツチ成分以外の波は除去される。この矩形波が
ａ波形である。ここで第７図について全体のシーケンス
を説明する。The extracted sine wave is transformed into a rectangular wave by the Schmitt trigger circuit 3 at the next stage. Note that if the caller is a woman, the filter 2 needs to be shifted to the high frequency side. Also, the above rectangular wave is generated due to the hysteresis characteristic of the Schmitt trigger circuit 3.
Furthermore, waves other than the pitch component are removed. This rectangular wave is the a waveform. The entire sequence will now be explained with reference to FIG.

まずＯ〕は入力待サイクルで、全ての回路をりセツトし
、初期の状態にする。１，Ｃ５１，（９）は各音節の立
上り部分で不安定である部分のパルス信号（本実施例で
は第１番目のパルス）を除去する初期周期除去サイクル
である。First, O] is an input waiting cycle, in which all circuits are reset to their initial states. 1, C51, (9) is an initial period removal cycle that removes the unstable portion of the pulse signal (in this embodiment, the first pulse) at the rising edge of each syllable.

〔匂，（６），〔１０〕は各音節における得ようとする
パルス信号を決定するサイクルで、本実施例では各音節
の２パルス目から４パルス目までの３つのパルス信号を
測定している。３，〔７１，〔１１〕は上記各３つのパ
ルス信号から代表的な１つのパルスを決定するサイクル
で、本実施例では中間値のものを検出しているが、平均
瓢最小二乗値，最大値、最小値、大きい方からｎ番目な
どというように適当に選択できる。[Oi, (6), [10] is a cycle that determines the pulse signal to be obtained in each syllable, and in this example, three pulse signals from the 2nd pulse to the 4th pulse of each syllable are measured. There is. 3, [71, [11] are cycles for determining one representative pulse from each of the above three pulse signals, and in this example, the intermediate value is detected, but the average value, the least squares value, the maximum Value, minimum value, nth largest value, etc. can be selected as appropriate.

すなわち各音節における３つのパルス信号から同じ条件
の下で１つのパルス信号を検出すれば良い。〔４），８
，〔１２〕は各音節の区切れを判定するサイクルである
。〔１３〕は上記３つの代表値パルス信号の高低パター
ンを決定するサイクルで、このパターンによつて発呼者
からの情報を判断し、例えば義手が「つかむ」動作を開
始するものである。なおこのパターンによつて義手を制
御するのみではなく、他の応用例として、工作機械制御
、タイプ制御、帳簿整理、ドアの開閉制御等がある。そ
して上記が１つの情報によつてシーケンス制御するサイ
クルであつて、〔０′〕の入力待サイクルの後、再び情
報判断を行うものである。以上のサイクルをさらに詳細
に説明する。That is, it is sufficient to detect one pulse signal from three pulse signals in each syllable under the same conditions. [4),8
, [12] is a cycle for determining the break between each syllable. [13] is a cycle for determining the high-low pattern of the three representative value pulse signals, and based on this pattern, information from the caller is determined and, for example, a prosthetic hand starts a "grabbing" action. In addition to controlling prosthetic hands using this pattern, other applications include machine tool control, type control, bookkeeping, and door opening/closing control. The above is a cycle in which sequence control is performed using one piece of information, and after the [0'] input waiting cycle, information judgment is performed again. The above cycle will be explained in more detail.

今電源スイツチを投入すると、パワーオンリセツト回路
１２ａよりりセツトパルスが０Ｒ回路１２ｂを介して４
ビツトカウンタ１２ｃに印加さへ該カウンタ１２ｃをり
セツトすると共に回路制御部１２ｄもりセツトする。When the power switch is turned on now, a set pulse is output from the power-on reset circuit 12a through the 0R circuit 12b.
The voltage applied to the bit counter 12c resets the counter 12c and also resets the circuit control section 12d.

すなわち４ビツトカウンタ１２ｃは上記した１３のサイ
クルを順次指示するもので、上記りセツトパルスによつ
て６の状態となる。そして先ずりセツトされた回路制御
部１２ｄは一度に第８図の如くＣ，ｄ，ｅ，ｊ，ｊ′，
ｊ〃，０，ｃ／′，♂，ｌのパルス信号を送出する。上
記パルスｃは第２図におけるゲート回路４のフリツプフ
ロツプ回路４ａのりセツト入力端に入力され出力端Ｑよ
り出力を出している。これは回路全体がセツト状態にな
るまで音声入力を遮断するためである。すなわち出力端
Ｑ側より出力が送出されないためにＡＮＤ回路４ｂは閉
じシユミツトトリガ回路３よりのパルス（以下音声パル
ス信号という）を通過させない。またパルスｄは８ビツ
トカウンタ５をりセツトしたり、フリツプフロツプ回路
４ｃをりセツトし、音声パルス信号が入力されれば出力
端Ｑより出力を出し、後述する１０ＫＨｚまたは３００
ＨｚのクロツクをＡＮＤ回路４ｄを介してカウンタ５に
入力する状態にする。さらにパルスｅはフリツプフロツ
プ回路４ｅをセツトし出力端Ｑより出力を出し１０ＫＨ
ｚのクロツクパルス発生回路６ａよりのクロツクパルス
を通過させるためのＡＮＤ回路４ｆを待機状態とする。
ここで、ＡＮＤ回路４ｄが待機状態であることによりク
ロツクパルス発生回路６ａよりの１０ＫＨｚのクロツク
パルスはＡＮＤ回路４ｆ１０Ｒ回路４ｇを介して８ビツ
トカウンタ５に入力され、該カウンタ５は計数している
。またパルスＪ，ｊ′，Ｊｌ，ｌ及び０，０′，ｃｌ′
は８ビツトラツチ群７ａ，７ｂ，７ｃ，７ｄ及び９ａ，
９ｂ，９ｃをクリアーし待機状態とする。次に回路制御
部１２ｄはパルスｂを送出するので、フリツプフロツプ
回路４ａはセツトされ出力端Ｑより出力が出てＡＮＤ回
路４ｂとフリツプフロツプ回路４ｃｆ）Ｄ端に印加され
る。That is, the 4-bit counter 12c sequentially instructs the above-mentioned 13 cycles, and is brought into state 6 by the above-mentioned set pulse. Then, the circuit control section 12d that has been set first controls C, d, e, j, j',
Send out pulse signals of j〃, 0, c/', ♂, l. The pulse c is inputted to the input terminal of the flip-flop circuit 4a of the gate circuit 4 in FIG. 2, and outputted from the output terminal Q. This is to cut off audio input until the entire circuit is in the set state. That is, since no output is sent from the output terminal Q side, the AND circuit 4b is closed and does not allow the pulse (hereinafter referred to as an audio pulse signal) from the output trigger circuit 3 to pass therethrough. In addition, the pulse d resets the 8-bit counter 5 or the flip-flop circuit 4c, and when an audio pulse signal is input, output is output from the output terminal Q, and the frequency of 10KHz or 300kHz, which will be described later, is output.
The Hz clock is input to the counter 5 via the AND circuit 4d. Furthermore, the pulse e sets the flip-flop circuit 4e and outputs an output from the output terminal Q at 10KH.
The AND circuit 4f for passing the clock pulse from the clock pulse generating circuit 6a of clock z is placed in a standby state.
Here, since the AND circuit 4d is in a standby state, the 10 KHz clock pulse from the clock pulse generating circuit 6a is input to the 8-bit counter 5 via the AND circuit 4f10R circuit 4g, and the counter 5 is counting. Also pulses J, j', Jl, l and 0,0', cl'
are 8 bit latch groups 7a, 7b, 7c, 7d and 9a,
9b and 9c are cleared to enter the standby state. Next, the circuit control section 12d sends out a pulse b, so the flip-flop circuit 4a is set and an output is output from the output terminal Q and applied to the AND circuit 4b and the flip-flop circuit 4cf) and the D terminal.

そしてこの状態で音声パルス信号の第１パルスａがＡＮ
Ｄ回路４ｂに入力されると、該パルスａは通過されフリ
ツプフロツプ回路４ｃ（７）Ｔ端子に印加される。ここ
でパルスの立上り部分において出力端はＱ側に換わるた
めｇなるパルスが送出される（第９図）。またこれと同
時にインクリメントパルスが４ビツトカウンタ１２ｃに
加えられ、Ｏサイクルから１サイクルへと進む。そして
再びパルスｄが回路制御部１２ｄから送出され８ビツト
カウンタ５はりセツトされる。またフリツプフロツプ回
路４ｃもりセツトされ、出力端はＱ側に換わる。従つて
８ビツトカウンタ５は再び計数するが、この第１パルス
は除去するため、第２パルスの立上りでパルスｇが出て
も８ビツトカウンタ５の出力を次段に送出することなく
第２のパルスｄによつてりセツトする。そして第２のパ
ルスが入力されてから後、すなわち立上り時間経過の後
出力端は再びＱとなるので、８ビツトカウンタ５は１０
ＫＨｚのクロツクパルスを計数始めもまた４ビツトカ
ウンタ１２ｃに回路制御部１２ｄよりインクリメントパ
ルスが印加され、１サイクルから２サイクルへと進へ次
に音声の第３パルスが入力され、その立上りで第３パル
スｇが発生すると、回路飢御部１２ｄからｉなるパルス
が送出され、第３図におけるラツチ７ａ（７）ＳｔｒＯ
ｂｅ端に入力される。In this state, the first pulse a of the audio pulse signal is AN
When input to the D circuit 4b, the pulse a is passed through and applied to the T terminal of the flip-flop circuit 4c (7). At this point, at the rising edge of the pulse, the output terminal switches to the Q side, so a pulse of g is sent out (FIG. 9). At the same time, an increment pulse is applied to the 4-bit counter 12c, and the cycle progresses from the O cycle to the 1 cycle. Then, the pulse d is again sent out from the circuit control section 12d and the 8-bit counter 5 is reset. The flip-flop circuit 4c is also reset, and the output terminal is switched to the Q side. Therefore, the 8-bit counter 5 counts again, but since this first pulse is removed, even if pulse g is generated at the rising edge of the second pulse, the output of the 8-bit counter 5 is not sent to the next stage and is counted again. Reset by pulse d. Then, after the second pulse is input, that is, after the rise time has elapsed, the output terminal becomes Q again, so the 8-bit counter 5 becomes 10.
At the beginning of counting KHz clock pulses, an increment pulse is applied to the 4-bit counter 12c from the circuit control unit 12d, and the third pulse of the audio is inputted as the cycle progresses from the 1st cycle to the 2nd cycle. When g occurs, a pulse i is sent from the circuit starvation section 12d, and the latch 7a (7) StrO in FIG.
It is input to the be end.

従つて８ビツトカウンタ５よりの計数信号は該ラツチ７
ａに記憶される。さらにパルスｉなる信号に次いでパル
スｄが送出されるので、再び８ビツトカウンタ５はりセ
ツトされると同時に計数を開始し、次の第４音声パルス
が発生するまで計数を続ける。そして第４音声パルスの
立上りで第４パルスｇが発生し、従つてパルスｉｌが発
生し、以下同様な動作によつて第３音声パルスに対応す
るクロツクパルス数がラツチ７ｂに、第４音声パルスに
対応するクロツクパルス数がラツチ７ｃに夫々記憶され
る。次いで生じる第５のパルスｄによつてインクリメン
トパルスが出て２サイクルから３サイクルへと進む。こ
の第３サイクルにおいて第２サイクルで得られたラツチ
群７ａ，７ｂ，７ｃの出力（今仮にＡ，Ｂ，Ｃとし、そ
の中間値を決定する。Therefore, the count signal from the 8-bit counter 5 is applied to the latch 7.
It is stored in a. Further, since the pulse d is sent out after the pulse i, counting starts at the same time as the 8-bit counter 5 is reset again, and counting continues until the next fourth audio pulse is generated. Then, at the rising edge of the fourth audio pulse, a fourth pulse g is generated, and accordingly a pulse il is generated, and by the same operation, the number of clock pulses corresponding to the third audio pulse is set in the latch 7b, and the number of clock pulses corresponding to the third audio pulse is set to the latch 7b. The corresponding number of clock pulses is stored in each latch 7c. The fifth pulse d which then occurs causes an incremental pulse to proceed from the second cycle to the third cycle. In this third cycle, the outputs (temporarily A, B, and C) of the latch groups 7a, 7b, and 7c obtained in the second cycle are determined, and their intermediate values are determined.

これは８ビツトの比較器群８ａにおいて、別表の１で示
す比較を行う。すなわちＡ，Ｂ，Ｃの何れかの出力が回
路匍御部１２ｄよりのパルスｍによつてゲートが開放さ
れるデータマルチプレクサ８ｂより送出されると共に同
じく回路制御部１２ｄよりのパルスｎによつて８ビツト
ラツチ９ａに記憶される。またパルスｎと同時にパルス
ｆも送出されるので、第２図においてフリツプフロツプ
回路４ｅがりセツトさ八出力端Ｑ側に切換えられるので
、３００Ｈｚクロツクパルス発生器６ｂよりの３００Ｈ
ｚクロツクパルスがＡＮＤ回路４ｈ，０Ｒ回路４ｇ，．
ＡＮＤ回路４ｄを介して８ビツトカウンタ５に印加され
る。This is performed in the 8-bit comparator group 8a as shown in Table 1 of the appendix. That is, any one of the outputs A, B, and C is sent out from the data multiplexer 8b whose gate is opened by the pulse m from the circuit control section 12d, and the output from the data multiplexer 8b is also output by the pulse n from the circuit control section 12d. It is stored in the bit latch 9a. Furthermore, since pulse f is also sent out at the same time as pulse n, the flip-flop circuit 4e in FIG.
The z clock pulse is sent to AND circuit 4h, 0R circuit 4g, .
The signal is applied to an 8-bit counter 5 via an AND circuit 4d.

そしてパルスｍの後に回路宙ｕ御部１２ｄよりインクリ
メントパルスが４ビツトカウンタ１２ｃに印加され３サ
イクルから４サイクルへと進αこの第４サイクルは各音
節間の区切れを検出するものであるが、上記したと同様
にパルスｄ毎に８ビツトカウンタ５は３００Ｈｚのクロ
ツクパルスを計数し、回路制御部１２ｄよりのパルスｋ
毎にラツチ７ｄに記憶すると共に比較回路１１ａの一つ
の入力端に送出される。After the pulse m, an increment pulse is applied from the circuit controller 12d to the 4-bit counter 12c, and the cycle progresses from the 3rd cycle to the 4th cycle α.This fourth cycle is for detecting the break between each syllable. Similarly to the above, the 8-bit counter 5 counts 300 Hz clock pulses for each pulse d, and receives the pulse k from the circuit control section 12d.
Each signal is stored in the latch 7d and sent to one input terminal of the comparator circuit 11a.

この比較回路１１ａの他の入力端には音節の区切れとし
て適当であるところの信号が印加されている。ここでラ
ツチ７ｄからの出力をＡ、比較基準出力をＢとし、比較
回路１１ａはＡ≧Ｂの時、すなわちラツチ７ｄからの出
力が、予じめ音節の区切れとして適当であると定めた出
力より大きい時は音節区切パルスｓを、また逆のＡ＜Ｂ
の時は有声パルスｔを送出する。なお１１ｂ，１１ｃは
回路制御部１２ｄよりのタイミングパルスｒによつて上
記ｓまたはｔの信号を得るためのＡＮＤ回路である。そ
して比較回路１１ａよりの出力が有声パルスｔである場
合に（人音節区切パルスｓが表われるまで、上記動作を
繰返し行い、次の音節の第１パルスを検出し、該パルス
ｓが表われた場合、４ビツトカウンタ１２ｃからパルス
Ｅ，ｌ，ｊ，ｊ′，ｊｌが送出される。従つて第２図の
フリツプフロツプ回路４ｅはセツトされ１０ＫＨｚクロ
ツクパルス発生回路６ａに切換えら八以後８ビツトカウ
ンタ５は１０ＫＨｚのクロツクパルスを計数する。また
第３図のラツチ群７ａ〜７ｄはクリアーされる。そして
第２音節における第１のパルスｄが送出された時にイン
クリメントパルスが４ビツトカウンタ１２ｃに印加され
４サイクルから５サイクルへと進む。以下５サイクルか
ら１２サイクルまでは上記した動作と同様なので、説明
は省略する．なお第４図のラツチ９ｂには第２音節の中
間値が、ラツチ９ｃには第３音節の中間値が記憶される
ものである。A signal suitable for dividing syllables is applied to the other input terminal of this comparison circuit 11a. Here, the output from the latch 7d is set as A, and the comparison reference output is set as B, and when A≧B, the comparison circuit 11a outputs the output from the latch 7d, which is predetermined to be suitable for dividing syllables. When it is larger, the syllable break pulse s is used, and vice versa A<B
When , a voiced pulse t is sent out. Note that 11b and 11c are AND circuits for obtaining the above-mentioned signal s or t using the timing pulse r from the circuit control section 12d. When the output from the comparator circuit 11a is the voiced pulse t, the above operation is repeated until the syllable dividing pulse s appears, the first pulse of the next syllable is detected, and the pulse s is detected. In this case, pulses E, l, j, j', and jl are sent out from the 4-bit counter 12c.Therefore, the flip-flop circuit 4e in FIG. The 10 KHz clock pulses are counted.Also, the latches 7a-7d in FIG. Proceed to the 5th cycle.The operations from the 5th cycle to the 12th cycle are the same as those described above, so the explanation will be omitted.In addition, the latch 9b in Fig. 4 has the intermediate value of the second syllable, and the latch 9c has the middle value of the 3rd syllable. The intermediate value of is stored.

さらに第３音節における音節区切パルスであるｈを検出
した時には、単語の終りとして、回路制御部１２ｄより
パルンｃを送出し、音声信号の入力を遮断する。ここで
各ラツチ群９ａ〜９ｃの出力は上位４ビツトを加算器１
０ａ〜１０ｃに入力し、元の８ビツトと加算し、各音節
の中間値に幅をもたせている。Furthermore, when h, which is a syllable dividing pulse in the third syllable, is detected, the circuit control unit 12d sends a pulse c as the end of the word, and cuts off the input of the audio signal. Here, the output of each latch group 9a to 9c is added to the adder 1 by adding the upper 4 bits.
It is input to 0a to 10c and added to the original 8 bits, giving a range to the intermediate value of each syllable.

すなわち音声には変化があるので、ラツチ９ａと９ｂと
が等音であつたとしても必ずしも一致しないためである
。このため発声において高低がある場合には一音以上の
差をつけることが必要である。そして第１比較器群１０
ｄでラツチ９ａからの信号（１）と加算器１０ａからの
信号（２）及びラツチ９ｂからの信号０〕と加算器１０
ｂからの信号（４）とを比較し、また第２比較器群１０
ｅでラツチ９ｂからの信号１と加算器１０ｂからの信号
（２）及びラツチ９ｃからの信号３と加算器１０ｃから
の信号４とを比較する．その結果は別表の２に示す如き
２ビツトデータの出力を送出する。例えば３音節から成
る単語において高低が第１２Ａ図のように変つた場合１
１，０１の出力を出し、第１２Ｂ図のように変つた場合
１０，１１の出力を出す。この組合せは全部で９通りで
ある。ここで回路制御部１２ｄからのパルスｐによつて
４ビツトラツチ１０ｆに記憶し、制御器、例えば義手に
おける「つかむ」ためのモータを制御する。さらに作業
が終了した後、回路宙ｕ御部１２ｄより４ビツトラツチ
１０ｆをクリアーするパルス信号ｑを送出し、次いでパ
ルスＥ，ｌ，ｊ，ｊ′，Ｊｌ，Ｏ，Ｏ′，ｏｌを送出し
、初期の状態に戻るものである。なお上記実施例では３
音節より成る単語の認識について説明したが、２音節お
よび１音節の認識も容易である。That is, since there are changes in voice, even if the latches 9a and 9b are equal sounds, they do not necessarily match. For this reason, if there is a difference in pitch in vocalization, it is necessary to create a difference of one or more notes. and the first comparator group 10
At d, the signal (1) from the latch 9a, the signal (2) from the adder 10a, and the signal 0 from the latch 9b] and the adder 10
b and the second comparator group 10.
At e, signal 1 from latch 9b is compared with signal (2) from adder 10b, and signal 3 from latch 9c and signal 4 from adder 10c are compared. As a result, 2-bit data output as shown in Table 2 is sent out. For example, if the pitch changes in a word consisting of three syllables as shown in Figure 12A, 1
Outputs of 1 and 01 are output, and when the output changes as shown in FIG. 12B, outputs of 10 and 11 are output. There are a total of nine combinations. Here, the pulse p from the circuit control section 12d is stored in the 4-bit latch 10f, and controls a controller, for example, a motor for "grasping" in a prosthetic hand. Furthermore, after the work is completed, the circuit controller 12d sends out a pulse signal q to clear the 4-bit latch 10f, and then sends out pulses E, l, j, j', Jl, O, O', ol, It returns to the initial state. In the above example, 3
Although recognition of words consisting of syllables has been described, recognition of two and one syllables is also easy.

以下第１３図と共に説明する。今１音節だけ入力される
と、上記したと同様第３サイクルまで進行し、代表値が
決定されると共にラツチ９ａに保持さべまた３００Ｈｚ
クロツクパルス発生回路６ｂに切換えられ、８ビツトカ
ウンタ５は３００Ｈｚのクロツクパルスを計数する。こ
こで第２音節が表われないので、８ビツトカウンタ５は
オーバフローしてパルスｈを送出し、回路制御部１２ｄ
よりパルスＣが送出されフリツプフロツプ回路４ａをり
セツトして入力を閉じる。その後直ちにパルスｑが送出
されて４ビツトラツチ１０ｆをクリアーされると共にパ
ルスＵ，ｖが回路僧ｕ御部１２ｄより送出される。従つ
てインパータ回路１０ｇ，１０ｈで反転されて、比較器
群１０ｄ，１０ｅよりの出力をＡＮＤ回路１０ｉ，１０
ｊでストツプさせる。これによつて４ビツトラツチ１０
ｆよりパルスｐが入力されると終了を示す第１２Ｃ図の
如く００，００の出力を出す。次に２音節が入力されて
、３音節目が入力されない場合は、上記した第７サイク
ルまでが進行する。すなわちラツチ９ａ，９ｂに夫々第
１音節と第２音節との代表値が保持され、８ビツトカウ
ンタ５は３００Ｈｚのクロツクパルスを計数する。ここ
で第３音節が表われないので、８ビツトカウンタ５はオ
ーバフローしてパルスｈが送出され、従つてパルスｃに
よつてフリツプフロツプ回路４ａがりセツトされる。そ
して４ビツトラツチ１０ｆがパルスｑによつてクリアー
されると共に回路制御部１２ｄよりパルスｖのみが送出
される。これによりパルスｐが入力されるとＡＮＤ回路
１０１より比較器群１０ｄの出力のみが４ビツトラツチ
１０ｆに入力される。この入力は例えば第１２Ｄ図に示
す如く１１，００である。なお３音節以上の場合であつ
ても、上記した原理を応用することによつて認識可能と
なる。This will be explained below with reference to FIG. If only one syllable is input now, the process will proceed to the third cycle as described above, and the representative value will be determined and held in the latch 9a.
The clock pulse generation circuit 6b is switched, and the 8-bit counter 5 counts 300 Hz clock pulses. Since the second syllable does not appear here, the 8-bit counter 5 overflows and sends out the pulse h, and the circuit controller 12d
A pulse C is sent out to reset the flip-flop circuit 4a and close the input. Immediately thereafter, pulse q is sent out to clear the 4-bit latch 10f, and pulses U and v are sent out from the circuit controller 12d. Therefore, the inverter circuits 10g and 10h invert the outputs, and the outputs from the comparator groups 10d and 10e are sent to AND circuits 10i and 10.
Press j to stop. This results in a 4-bit latch 10
When a pulse p is input from f, outputs of 00,00 are output as shown in FIG. 12C indicating the end. Next, if the second syllable is input and the third syllable is not input, the process continues up to the seventh cycle described above. That is, the representative values of the first and second syllables are held in the latches 9a and 9b, respectively, and the 8-bit counter 5 counts 300 Hz clock pulses. Since the third syllable does not appear here, the 8-bit counter 5 overflows and pulse h is sent out, so that flip-flop circuit 4a is reset by pulse c. Then, the 4-bit latch 10f is cleared by the pulse q, and only the pulse v is sent out from the circuit control section 12d. As a result, when pulse p is input, only the output of comparator group 10d is input from AND circuit 101 to 4-bit latch 10f. This input is, for example, 11,00 as shown in Figure 12D. Note that even in the case of three or more syllables, recognition is possible by applying the above-described principle.

また上記実施例において判別処理をマイクロコンピユー
タによつて行えることも勿論可能である。さらに変化判
定の分類を３つに限定（実施例で２は前段の音節に対
して上つたか、下つたかあるいは同じか）せず、変化判
定回路Ｄを変更することにより、上下の変化を細分化す
ることもできる。この発明は上記したように、雑音の少
ない気管外壁にマイクを取付け、発声される音声をモー
ラ毎に区切つて検出し、かつ音声として安定していない
初期数周期の波形を除外すると共に残りの数周期をピツ
クアツプしてその中から代表値を決定し、このように得
られた複数の代表値の高低変化よりパターンを類別して
制御機器を制御するものであるから、高い認識率を得る
ことができると共に音声としては通常の発生であつても
ハミングであつても情報確認が可能である等の効果を有
するものである。Furthermore, it is of course possible that the discrimination processing in the above embodiments can be performed by a microcomputer. Furthermore, instead of limiting the classification of change judgment to three (in the example, 2 indicates whether the syllable is above, below, or the same as the previous syllable), by changing the change judgment circuit D, it is possible to detect vertical changes. It can also be subdivided. As described above, this invention attaches a microphone to the outer wall of the trachea where there is little noise, detects the voice uttered by dividing it into moras, and excludes the waveform of the initial few cycles that are not stable as voice, and detects the voice that is not stable as voice. Since the method picks up the period, determines a representative value from among them, and controls the control equipment by classifying patterns based on the height changes of the multiple representative values obtained in this way, it is possible to obtain a high recognition rate. In addition, it has the effect that information can be confirmed even if the sound is normal or humming.

[Brief explanation of drawings]

図はこの発明に係る音声識別方法に使用する装置の一実
施例を示し、第１図は全体のプロツク図、第２図はカウ
ンタ回路部の回路図、第３図は代表値決定回路部の回路
図、第４図は変化判定回路部の回路図、第５図は音節区
切れ処理回路部の回路図、第６図はシステム制御部の回
路図、第７〜１１図はタイミングチヤート図、第１２Ａ
−Ｄ図は音節の変化を示す線図、第１３図は上記第４図
の他の実施例を示す回路図である。The figures show an embodiment of the apparatus used in the voice identification method according to the present invention, in which Fig. 1 is an overall block diagram, Fig. 2 is a circuit diagram of the counter circuit section, and Fig. 3 is a circuit diagram of the representative value determining circuit section. 4 is a circuit diagram of the change determination circuit section, FIG. 5 is a circuit diagram of the syllable break processing circuit section, FIG. 6 is a circuit diagram of the system control section, and FIGS. 7 to 11 are timing chart diagrams. 12th A
-D is a diagram showing changes in syllables, and FIG. 13 is a circuit diagram showing another embodiment of FIG. 4.

Claims

[Claims]

1. A means for detecting the sound generated by dividing each syllable with a microphone on the outer wall of the trachea, a means for excluding the initial few cycles of the sound in each section, and a means for determining the division of the syllable.
means for determining a representative value from the measured values of the remaining several periods separated and excluded; and means for determining whether or not there is a detection signal from the microphone within a predetermined time from the time of the separation. , If there is no detection signal within the discrimination time, the pattern is classified based on the representative value, and if a detection signal is present, the representative value is obtained by the same means as described above, and thereafter, while the detection signal may exist, the pattern is classified using the representative value. A voice identification method comprising means for determining a representative value by the same means as described above, comparing each representative value with the previous representative value to determine the pitch of the sound, and classifying the pattern.