JPS58150997A - Speech feature extractor - Google Patents

Speech feature extractor

Info

Publication number
JPS58150997A
JPS58150997A JP3242682A JP3242682A JPS58150997A JP S58150997 A JPS58150997 A JP S58150997A JP 3242682 A JP3242682 A JP 3242682A JP 3242682 A JP3242682 A JP 3242682A JP S58150997 A JPS58150997 A JP S58150997A
Authority
JP
Japan
Prior art keywords
detector
circuit
vocal
contact
vibration
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP3242682A
Other languages
Japanese (ja)
Other versions
JPH036520B2 (en)
Inventor
杉本 豊三
村田 程夫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National Institute of Advanced Industrial Science and Technology AIST
Original Assignee
Agency of Industrial Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1982-03-03
Filing date
1982-03-03
Publication date
1983-09-07
Application filed by Agency of Industrial Science and Technology
Priority to JP3242682A
Publication of JPS58150997A
Publication of JPH036520B2
Legal status: Granted

Abstract

(57) [Abstract] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

[Detailed Description of the Invention] The present invention relates to a pronunciation feature extraction device that recognizes pronunciation from information other than the speech sound itself.

Speech begins as the expiratory airflow sent out from the lungs. As this airflow passes through the vocal cords in the larynx, the vocal cords vibrate and convert it into voice, which is then modulated as the airway leading to the lips and nasal cavity changes shape. Speech is produced as the result of the combined movement of these vocal organs.

Conventionally, the features of such speech have been extracted by converting the sound wave into an electrical signal with an acoustic microphone, feeding the signal to a large number of filter circuits each having a predetermined frequency band, and characterizing the pronunciation from the outputs of the filter circuits.

However, the speech sound is only the combined result of the movement of the vocal organs, and it is extremely difficult to perform speech recognition by extracting the pronunciation features of every phoneme from the sound wave alone.

Non-stationary consonants in particular are dominated by noise energy. Apart from voiceless fricatives such as /s, f/, whose features can be extracted from the sound wave almost reliably, the voiceless fricative /h/, the voiceless plosives /p, t, k/, the voiced plosives /b, d, g/, the nasals /m, n, ŋ/ and similar sounds are very difficult to detect and separate.

In view of these drawbacks, the present invention provides a pronunciation feature extraction device that extracts pronunciation more accurately than before. Detectors that detect the movement of each part of the vocal organs are mounted or placed near those parts, and a processing device processes the outputs of the detectors.

An embodiment of the present invention is described below with reference to the drawings.

FIG. 1 shows the block configuration of a pronunciation feature extraction device according to an embodiment of the present invention. In the figure, 1 is a vocal cord vibration detector attached near the vocal cords at the larynx to detect vibration of the vocal cords; 2 is a nasal vibration detector attached near the center of the nasal wall to detect vibration of the voice in the nasal cavity; 3 is an oral airflow detector placed in front of the oral cavity to detect the oral airflow; and 4 is a palate contact detector mounted on the palate inside the oral cavity to detect contact between the tongue and the palate. 5 is a processing device that extracts pronunciation features from the outputs of the vocal cord vibration detector 1, the nasal vibration detector 2, the oral airflow detector 3, and the palate contact detector 4. The configuration of the processing device 5 is described in more detail below with reference to FIG. 2.

In FIG. 2, 6 is a threshold circuit that decides, against a specific value, whether vocal cord vibration is present in the vocal cord vibration information from the vocal cord vibration detector 1; 7 is a threshold circuit that decides, against a specific value, whether nasal vibration is present in the nasal vibration information from the nasal vibration detector 2; 8 is a differentiation circuit that differentiates the oral airflow information from the oral airflow detector 3 to obtain the rate of change (acceleration) of the oral airflow; 9 is a threshold circuit that decides, against a specific value, whether a rate of change of the oral airflow is present; 10 is a threshold circuit that decides, against a specific value, whether oral airflow is present in the oral airflow information from the oral airflow detector 3; 11 is a tongue closure detection circuit that, once the measuring circuit 12 has converted the palate contact information from the palate contact detector 4 into tongue-palate contact signals, distinguishes the three states described later (front-tongue closure, back-tongue closure, and no closure); and 13 is a phoneme classification circuit that classifies phonemes from the presence or absence information output by the threshold circuits 6, 7, 9, and 10 and the three kinds of information from the tongue closure detection circuit 11.

The specific use of the pronunciation feature extraction device configured as above is described below with reference to FIG. 3.

As shown in FIG. 3, the vocal cord vibration detector 1 is an acceleration sensor 1′ attached with double-sided medical tape to the body at the vocal cord region of the larynx, where it detects vocal cord vibration. The detected vibration is output to the threshold circuit 6, which outputs a presence (+) signal to the phoneme classification circuit 13 if the vibration value is at or above a specific value, and an absence (-) signal if it is at or below a fixed value.
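The behaviour of threshold circuit 6 can be sketched in software as follows. This is only an illustration under stated assumptions, not the patented circuit: the framing step, the function names `frame_peaks` and `threshold_circuit`, the sample values, and the threshold of 0.1 are all introduced here, because the source gives no numeric values.

```python
from typing import List, Sequence


def frame_peaks(signal: Sequence[float], frame_len: int) -> List[float]:
    """Peak absolute amplitude of each consecutive frame (assumed pre-processing)."""
    return [
        max(abs(x) for x in signal[i:i + frame_len])
        for i in range(0, len(signal), frame_len)
    ]


def threshold_circuit(levels: Sequence[float], threshold: float) -> List[str]:
    """Emit '+' where the level is at or above the threshold, '-' otherwise."""
    return ["+" if level >= threshold else "-" for level in levels]


# Hypothetical accelerometer samples: a quiet frame followed by a vibrating one.
samples = [0.01, -0.02, 0.01, 0.0, 0.5, -0.6, 0.4, -0.5]
voicing = threshold_circuit(frame_peaks(samples, frame_len=4), threshold=0.1)
print(voicing)
```

The same comparison would serve for the nasal vibration signal handled by threshold circuit 7, with its own threshold.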

Similarly, the nasal vibration detector 2 is an acceleration sensor 2′ attached with double-sided medical tape near the center of the nasal wall, where it detects nasal vibration.

The detected nasal vibration is output to the threshold circuit 7, which outputs a presence (+) signal to the phoneme classification circuit 13 if the nasal vibration value is at or above a specific value, and an absence (-) signal if it is at or below a fixed value.

The oral airflow detector 3 is a hot-wire flowmeter sensor 3′ fixed on a desk or the like in front of the speaker's mouth, where it detects the oral airflow. The detected airflow is output to the differentiation circuit 8, which obtains the rate of change of the airflow and outputs it to the threshold circuit 9. The threshold circuit 9 outputs a presence (+) signal to the phoneme classification circuit 13 if the rate of change is at or above a specific value, and an absence (-) signal if it is at or below a fixed value. The airflow detected by the hot-wire flowmeter sensor 3′ is also output to the threshold circuit 10, which outputs a presence (+) signal to the phoneme classification circuit 13 if the airflow value is at or above a specific value, and an absence (-) signal if it is at or below a fixed value.
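The airflow path through the differentiation circuit 8 and the threshold circuits 9 and 10 might be approximated digitally as below. This is a sketch under assumptions: the forward-difference derivative, the sampling interval `dt`, the flow samples, and both threshold values are hypothetical, since the source describes analog circuits and states no numbers.

```python
from typing import List, Sequence


def differentiate(flow: Sequence[float], dt: float) -> List[float]:
    """Forward-difference estimate of the rate of change of the airflow."""
    return [(b - a) / dt for a, b in zip(flow, flow[1:])]


def threshold_circuit(values: Sequence[float], threshold: float) -> List[str]:
    """Emit '+' where the value is at or above the threshold, '-' otherwise."""
    return ["+" if v >= threshold else "-" for v in values]


# Hypothetical airflow samples: silence, a sudden burst, then steady flow.
flow = [0.0, 0.0, 2.0, 2.5, 2.5]

rate = differentiate(flow, dt=0.5)                    # role of circuit 8
rate_flags = threshold_circuit(rate, threshold=3.0)   # role of circuit 9
flow_flags = threshold_circuit(flow, threshold=1.0)   # role of circuit 10

print(rate_flags)
print(flow_flags)
```

In this toy input only the burst raises the rate-of-change flag, while the flow flag stays raised for as long as air keeps flowing, which is why the two thresholds carry different information.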

The palate contact detector 4 is a contact sensor 4′ such as the one shown in FIG. 4. The contact sensor 4′ has a large number of electrodes 4′a on the part that touches the tongue, is fitted to the palate inside the oral cavity by a retaining part 4′b, and detects the state of contact with the tongue through the electrodes 4′a.

The detected contact state between the electrodes 4′a and the tongue is fed in turn to the measuring circuit 12 and the tongue closure detection circuit 11. When the contact state forms a pattern like that of FIG. 5(a), front-tongue closure information is output to the phoneme classification circuit 13; when it forms a pattern like that of FIG. 5(b), back-tongue closure information is output; and when there is no contact with the tongue, no-closure information is output.
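One way to picture the three-state decision of the tongue closure detection circuit 11 is the sketch below. The actual contact patterns are defined only in FIG. 5, which is not available in this text, so the electrode grid layout, the function name `tongue_closure`, and the rule of comparing contact counts in the front and back halves are all assumptions made for illustration.

```python
from typing import Sequence


def tongue_closure(contacts: Sequence[Sequence[bool]]) -> str:
    """Classify a palate contact pattern as 'front', 'back', or 'none'.

    `contacts` lists electrode rows from the front of the palate to the back;
    True means the tongue touches that electrode. The counting rule is a
    hypothetical stand-in for the patterns of FIG. 5.
    """
    half = len(contacts) // 2
    front = sum(sum(row) for row in contacts[:half])
    back = sum(sum(row) for row in contacts[half:])
    if front == 0 and back == 0:
        return "none"
    return "front" if front >= back else "back"


# Hypothetical 4 x 3 electrode grids.
front_pattern = [
    [True, True, True],
    [True, False, True],
    [False, False, False],
    [False, False, False],
]
back_pattern = list(reversed(front_pattern))
no_contact = [[False] * 3 for _ in range(4)]

print(tongue_closure(front_pattern), tongue_closure(back_pattern), tongue_closure(no_contact))
```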

Finally, the phoneme classification circuit 13 can determine the speech sound from the information received from the threshold circuits 6, 7, 9, and 10 and the tongue closure detection circuit 11 by consulting an internal memory table such as the one shown in the table below.

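For example, when an utterance containing the phonemes "h" and "n" (apparently "hana") with a speech waveform like that of FIG. 6(a) is spoken, the acceleration sensor 1′ outputs a waveform like that of FIG. 6(b) to the threshold circuit 6. Judging against its specific threshold, the threshold circuit 6 outputs an absence (-) signal for the "h" portion and a presence (+) signal for the "n" portion to the phoneme classification circuit 13.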
Now, for example, if we have a phoneme wave as shown in FIG. 6 (a) [h? When the entire voice Ln2L-1 is uttered, the acceleration sensor 1' outputs a waveform as shown in FIG. 6(b) to the threshold circuit 6. The threshold circuit 6 then outputs a null (→ signal) for the "h" part and a present (10) signal for the "n" part to the phoneme classification circuit 13 based on the judgment based on the specific threshold value.

The acceleration sensor 2′ outputs a waveform like that of FIG. 6(c) to the threshold circuit 7. Judging against its specific threshold, the threshold circuit 7 outputs an absence (-) signal for the "h" portion and a presence (+) signal for the "n" portion to the phoneme classification circuit 13.
At the part "n", a presence (+) signal is output to the phoneme classification circuit 13.

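The hot-wire flowmeter sensor 3′ outputs a waveform like that of FIG. 6(d) to the differentiation circuit 8 and the threshold circuit 10. The threshold circuit 9 judges the derivative from the differentiation circuit 8 against its specific threshold and outputs an absence (-) signal to the phoneme classification circuit 13 for both the "h" and "n" portions.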
Furthermore, the hot wire flow meter sensor 3' outputs a waveform as shown in FIG. 6 to the differentiating circuit 8 and the threshold circuit 10. Then, the threshold circuit 9 judges the differential value from the differentiator circuit 8 based on a specific threshold value and outputs a null (-) signal to the phoneme classification circuit 13 at the portions of "rh" and rnJ.

The threshold circuit 10 likewise judges against its specific threshold and outputs a presence (+) signal for the "h" portion and an absence (-) signal for the "n" portion to the phoneme classification circuit 13.

Meanwhile, the contact sensor 4′ detects the contact state between the electrodes 4′a and the tongue and outputs it to the tongue closure detection circuit 11 through the measuring circuit 12. From the contact pattern, the tongue closure detection circuit 11 outputs "no closure" information for the "h" portion and "front-tongue closure" information for the "n" portion to the phoneme classification circuit 13.

Based on this information, the phoneme classification circuit 13 can recognize both "h" and "n" from an internal memory table such as the one shown in the table.
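The table lookup performed by the phoneme classification circuit 13 can be sketched as a dictionary keyed by the five detector outputs. Only the two rows spelled out in the example above ("h" and "n") are included; the full table is not available in this text, and the names `PHONEME_TABLE` and `classify`, the tuple ordering, and the closure labels "none" and "front" are assumptions made for illustration.

```python
from typing import Dict, Optional, Tuple

# Key order (assumed): vocal cord vibration (circuit 6), nasal vibration
# (circuit 7), airflow rate of change (circuit 9), airflow (circuit 10),
# tongue closure state (circuit 11).
Features = Tuple[str, str, str, str, str]

# Only the rows stated in the worked example; the rest of the table is unknown.
PHONEME_TABLE: Dict[Features, str] = {
    ("-", "-", "-", "+", "none"): "h",
    ("+", "+", "-", "-", "front"): "n",
}


def classify(features: Features) -> Optional[str]:
    """Return the phoneme stored for this feature combination, or None."""
    return PHONEME_TABLE.get(features)


print(classify(("-", "-", "-", "+", "none")))
print(classify(("+", "+", "-", "-", "front")))
```

A combination with no stored row returns `None` here; how the original circuit handles such a case is not stated in the source.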

As described above, the vocal cord vibration detector 1, the nasal vibration detector 2, the oral airflow detector 3, and the palate contact detector 4 detect the movement of each of the vocal organs, and the processing device 5 determines a specific phoneme from a table stored in advance based on the information detected by each detector. This makes it possible to accurately recognize speech that has been difficult to recognize in the past.

As described above, the present invention provides a processing device that selects a specific phoneme from its own stored information based on the vocal cord vibration information detected by the vocal cord vibration detector, the nasal cavity vibration information detected by the nasal vibration detector, the oral airflow information detected by the oral airflow detector, and the tongue-palate contact information detected by the palate contact detector. Pronunciation can therefore be extracted from the vocal organs more accurately than before, and the practical benefit is considerable.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of a pronunciation feature extraction device according to an embodiment of the present invention; FIG. 2 is a block diagram of the processing device in that device; FIG. 3 shows an example of the device in use; FIG. 4 is a plan view of the contact sensor; FIG. 5 shows contact patterns between the tongue and the palate; and FIG. 6 is a waveform diagram for each detector.

1: vocal cord vibration detector; 2: nasal vibration detector; 3: oral airflow detector; 4: palate contact detector; 5: processing device.

Patent applicant: Seiichi Ishizaka, Director-General of the Agency of Industrial Science and Technology

FIG. 3, FIG. 4, FIG. 5

Claims (1)

[Claims] A pronunciation feature extraction device comprising: a vocal cord vibration detector attached to the larynx to detect vibration of the vocal cords; [...] a palate contact detector; and a processing device that selects a specific phoneme from its own stored information based on the outputs of said vocal cord vibration detector, nasal vibration detector, oral airflow detector, and palate contact detector.
JP3242682A 1982-03-03 1982-03-03 Speech feature extractor Granted JPS58150997A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP3242682A JPS58150997A (en) 1982-03-03 1982-03-03 Speech feature extractor

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP3242682A JPS58150997A (en) 1982-03-03 1982-03-03 Speech feature extractor

Publications (2)

Publication Number Publication Date
JPS58150997A true JPS58150997A (en) 1983-09-07
JPH036520B2 JPH036520B2 (en) 1991-01-30

Family

ID=12358621

Family Applications (1)

Application Number Title Priority Date Filing Date
JP3242682A Granted JPS58150997A (en) 1982-03-03 1982-03-03 Speech feature extractor

Country Status (1)

Country Link
JP (1) JPS58150997A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6111021A (en) * 1984-06-26 1986-01-18 工業技術院長 Speaking exercise apparatus
US6971993B2 (en) 2000-11-15 2005-12-06 Logometrix Corporation Method for utilizing oral movement and related events
US6974424B2 (en) 2000-09-19 2005-12-13 Logometrix Corporation Palatometer and nasometer apparatus
JP2016031534A (en) * 2014-07-28 2016-03-07 リウ チン フォンChing−Feng LIU Speech production recognition system, speech production recognition device, and speech production recognition method

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS501846A (en) * 1973-05-14 1975-01-09

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS501846A (en) * 1973-05-14 1975-01-09

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6111021A (en) * 1984-06-26 1986-01-18 工業技術院長 Speaking exercise apparatus
JPH0357777B2 (en) * 1984-06-26 1991-09-03 Kogyo Gijutsuin
US6974424B2 (en) 2000-09-19 2005-12-13 Logometrix Corporation Palatometer and nasometer apparatus
US6971993B2 (en) 2000-11-15 2005-12-06 Logometrix Corporation Method for utilizing oral movement and related events
JP2016031534A (en) * 2014-07-28 2016-03-07 リウ チン フォンChing−Feng LIU Speech production recognition system, speech production recognition device, and speech production recognition method

Also Published As

Publication number Publication date
JPH036520B2 (en) 1991-01-30
