JPS6329759B2 - - Google Patents

Info

Publication number
JPS6329759B2
JPS6329759B2 JP56039344A JP3934481A JPS6329759B2 JP S6329759 B2 JPS6329759 B2 JP S6329759B2 JP 56039344 A JP56039344 A JP 56039344A JP 3934481 A JP3934481 A JP 3934481A JP S6329759 B2 JPS6329759 B2 JP S6329759B2
Authority
JP
Japan
Prior art keywords
sound
plosive
fricative
signal
detection means
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
JP56039344A
Other languages
Japanese (ja)
Other versions
JPS57155596A (en
Inventor
Takeo Murata
Nobuhisa Kadowaki
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National Institute of Advanced Industrial Science and Technology AIST
Original Assignee
Agency of Industrial Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Agency of Industrial Science and Technology filed Critical Agency of Industrial Science and Technology
Priority to JP3934481A priority Critical patent/JPS57155596A/en
Publication of JPS57155596A publication Critical patent/JPS57155596A/en
Publication of JPS6329759B2 publication Critical patent/JPS6329759B2/ja
Granted legal-status Critical Current

Links

Description

【発明の詳細な説明】 本発明は破裂音を検出する破裂音検出装置に関
するものである。
DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a plosive sound detection device for detecting plosive sounds.

従来、人の音声のうちから破裂音だけを確実、
簡便、かつリアルタイムで検出する方法は知られ
ていない。
Previously, only plosive sounds could be reliably detected from human speech.
There is no known method for simple and real-time detection.

本発明は発話時の口気流の流速及び音声の情報
から破裂音のみを確実、簡便、かつリアルタイム
で検出する破裂音検出装置を提供することを目的
とする。
An object of the present invention is to provide a plosive sound detection device that reliably, simply, and in real time detects only plosive sounds based on the flow velocity of oral airflow during speech and voice information.

実験によれば、発話時の口気流の流速波形は破
裂音に対して特異的であり、この流速波形に簡単
な信号処理を行なうことにより、破裂音を有効に
検出しうることが確かめられた。しかし、この方
法では発話の仕方により、ときに、破裂音以外の
音、多くは無声摩擦音をも破裂音として検出する
ことが明らかになつた。
Experiments have shown that the velocity waveform of oral airflow during speech is specific to plosive sounds, and that plosive sounds can be detected effectively by performing simple signal processing on this velocity waveform. . However, it has become clear that this method sometimes detects sounds other than plosives, often voiceless fricatives, as plosives, depending on the way the utterance is made.

本発明は口気流情報から破裂音の候補を検出し
一方、音声情報から摩擦音を抽出して、上記破裂
音の候補の中から摩擦音を除去することにより、
破裂音のみを正しく検出するものである。
The present invention detects plosive candidates from oral airflow information, extracts fricatives from voice information, and removes fricatives from the plosive candidates.
This method correctly detects only plosive sounds.

以下、本発明の実施例について図面と共に説明
する。
Embodiments of the present invention will be described below with reference to the drawings.

第1図は本発明にかかる破裂音検出装置の一実
施例を示すブロツク図である。図において、1は
口気流速を検出する流速検出手段、2はその流速
検出手段1の出力信号理し破裂音の候補を検出す
る破裂音検出手段、3は音声を検出するマイク、
4はマイク3の出力信号より摩擦音を抽出する摩
擦音抽出手段、5は摩擦音抽出手段4の出力信号
により破裂音検出手段2の出力信号を抑制する判
定手段である。流速検出手段1は発話によつて生
じる口の前の気流の速度を測定し、電気信号を出
力するものであり、例えば熱線式の流速計等で構
成される。破裂音検出手段2は流速検出手段1の
出力信号、いわゆる口気流波形を処理して破裂音
の候補を検出するものであり、例えば抵抗、コン
デンサで構成する微分回路等で構成される。マイ
ク3は音声を検出するものであり、例えば、低雑
音接話型マイクロホン等である。摩擦音抽出手段
4はマイク3の出力信号、いわゆる音声信号から
摩擦音を抽出するものであり、例えば、ローパス
フイルター回路とハイパスフイルター回路と除算
回路とで構成し、それぞれのフイルター回路で音
声を分離し、除算回路によりハイパスフイルター
回路の出力をローパスフイルター回路の出力で除
算し、その出力の大小によつて摩擦音を抽出す
る。判定手段5は摩擦音抽出手段4の出力信号に
より破裂音検出手段2の出力信号を抑制するもの
であり、例えばNAND回路、あるいは可変利得
増幅回路等で構成される。
FIG. 1 is a block diagram showing one embodiment of a plosive sound detection device according to the present invention. In the figure, 1 is a flow velocity detection means for detecting the oral air flow velocity, 2 is a plosive sound detection means for detecting plosive candidates based on the output signal of the flow velocity detection means 1, and 3 is a microphone for detecting sound.
4 is a fricative sound extracting means for extracting a fricative sound from the output signal of the microphone 3; and 5 is a determining means for suppressing the output signal of the plosive sound detecting means 2 by the output signal of the fricative sound extracting means 4. The flow velocity detection means 1 measures the velocity of the airflow in front of the mouth caused by speech and outputs an electrical signal, and is composed of, for example, a hot wire type current meter. The plosive sound detection means 2 processes the output signal of the flow rate detection means 1, ie, the so-called oral airflow waveform, to detect plosive sound candidates, and is composed of, for example, a differentiating circuit made up of resistors and capacitors. The microphone 3 detects voice, and is, for example, a low-noise close-talk type microphone. The fricative sound extraction means 4 extracts fricative sounds from the output signal of the microphone 3, which is a so-called audio signal, and is composed of, for example, a low-pass filter circuit, a high-pass filter circuit, and a division circuit, and each filter circuit separates the sound. A division circuit divides the output of the high-pass filter circuit by the output of the low-pass filter circuit, and fricative sounds are extracted depending on the magnitude of the output. The determining means 5 suppresses the output signal of the plosive sound detecting means 2 using the output signal of the fricative sound extracting means 4, and is composed of, for example, a NAND circuit or a variable gain amplifier circuit.

第2図a〜e及び第3図a〜eは第1図に示し
た破裂音検出装置の各部の信号波形を示すもので
あり、第2図は破裂音、第3図は無声摩擦音を発
話した場合をそれそれ示している。波形イ、ロは
発話によつてマイク3が検出した破裂音と無声摩
擦音の音声信号の代表例を表わし、波形ハ、ニは
その時の流速検出手段1が検出した口気流波形を
示すものである。波形ホ、ヘは破裂音検出手段2
によつて口気流波形ハ、ニから破裂音の候補を検
出した波形を示すものである。波形ト、チは摩擦
音抽出手段4により音声信号イ、ロから摩擦音を
抽出した信号すなわち摩擦音信号波形を示すもの
であり、また波形リ、ヌは判定手段5の出力信号
を示すものである。
Figures 2 a to e and Figures 3 a to e show signal waveforms of each part of the plosive sound detection device shown in Figure 1. Figure 2 shows a plosive sound, and Figure 3 shows a voiceless fricative sound. If you do it it shows it. Waveforms A and B represent representative examples of audio signals of plosives and voiceless fricatives detected by the microphone 3 during speech, and waveforms C and D represent oral airflow waveforms detected by the flow velocity detection means 1 at that time. . Waveforms E and F are plosive sound detection means 2
This shows waveforms in which candidates for plosive sounds were detected from the oral airflow waveforms c and d. Waveforms ① and ① indicate signals obtained by extracting fricatives from audio signals ① and ② by the fricative sound extracting means 4, that is, fricative sound signal waveforms, and waveforms ① and ① indicate output signals of the determining means 5.

以上のような構成において、破裂音例えば/
P/を含む音/Pa/を発声した場合、流速検出
手段1は口気流波形ハを検出し、マイク3は音声
信号イを検出する。破裂音検出手段2は口気流波
形ハを微分することにより破裂音の候補である微
分信号ホを検出する。一方、摩擦音抽出手段4は
音声信号イより摩擦音を抽出する。音声信号イに
おける高周波成分は少なく、従つてハイパスフイ
ルター回路での出力はほとんどない。この為、波
形トに示すように摩擦音抽出手段4の出力はな
い。したがつて、摩擦音信号トが大きい時に微分
信号ホを小さくするように抑制する判定手段5の
出力は、摩擦音信号トがないため破裂音を表わす
破裂音パルス信号リを発生する。
In the above configuration, plosive sounds such as /
When the sound /Pa/ containing P/ is uttered, the flow velocity detection means 1 detects the oral airflow waveform C, and the microphone 3 detects the audio signal A. The plosive sound detection means 2 detects a differential signal E, which is a candidate for a plosive sound, by differentiating the oral airflow waveform C. On the other hand, the fricative sound extracting means 4 extracts a fricative sound from the audio signal A. There are few high frequency components in the audio signal A, so there is almost no output from the high pass filter circuit. Therefore, as shown in waveform G, there is no output from the fricative sound extraction means 4. Therefore, the output of the determining means 5, which suppresses the differential signal E to a small value when the fricative signal G is large, generates a plosive pulse signal R representing a plosive sound since there is no fricative signal G.

次に、例えば無声摩擦音/S/を含む音/
Sa/を特に/S/を強調して発声した場合、流
速検出手段1の出力信号は異常に大きな口気流波
形ニを検出する。この口気流波形ニは破裂音検出
手段2により破裂音の微分信号ホに近い微分信号
へを出力することがある。したがつて、無声摩擦
音/S/は破裂音の候補となる。一方、摩擦音抽
出手段4は音声信号ロより摩擦音を抽出する。音
声信号ロにおける高周波成分は多く、従つてハイ
パスフイルター回路での出力が大きい。この為、
摩擦音抽出手段4の出力は波形チに示すような摩
擦音信号を発生する。従つて、摩擦音信号チが大
きい時に微分信号へを小さくするように抑制する
判定手段5の出力は、波形ヌに示すように破裂音
を表わす破裂音パルス信号を発生しない。
Next, for example, a sound containing the unvoiced fricative /S/ /
When Sa/ is uttered with particular emphasis on /S/, the output signal of the flow velocity detection means 1 detects an abnormally large oral airflow waveform D. This oral airflow waveform D may be outputted by the plosive sound detection means 2 as a differential signal close to the differential signal E of the plosive sound. Therefore, the voiceless fricative /S/ is a candidate for a plosive. On the other hand, the fricative sound extracting means 4 extracts a fricative sound from the audio signal B. There are many high frequency components in the audio signal B, and therefore the output from the high pass filter circuit is large. For this reason,
The output of the fricative sound extraction means 4 generates a fricative sound signal as shown in waveform H. Therefore, the output of the determining means 5, which suppresses the differential signal so as to be small when the fricative signal Q is large, does not generate a plosive pulse signal representing a plosive as shown in waveform N.

このように本実施例によれば、発話の仕方にか
かわらず、無声摩擦音を破裂音と検出せず、破裂
音のみを正しく検出することができる。また、複
雑な信号処理手段等を用いず、流速計、簡単な微
分回路、マイク、2つのフイルター回路、及び、
NAND回路等で構成できる。さらに原理上リア
ルタイムで検出できることは明らかである。
As described above, according to this embodiment, regardless of the manner of utterance, voiceless fricatives are not detected as plosives, and only plosives can be correctly detected. In addition, without using complicated signal processing means, a current meter, a simple differentiation circuit, a microphone, two filter circuits, and
It can be configured with NAND circuits, etc. Furthermore, it is clear that detection can be performed in real time in principle.

以上説明したように本発明は口気流情報から破
裂音の候補を検出し、一方、音声情報から摩擦音
を抽出して上記破裂音の候補の中から摩擦音を除
去するようにしたので破裂音のみを正しく検出す
ることができる。
As explained above, the present invention detects plosive candidates from oral airflow information, and on the other hand, extracts fricatives from voice information and removes fricatives from the plosive candidates. Can be detected correctly.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明にかかる破裂音検出装置の一実
施例を示すブロツク図、第2図a〜e及び第3図
a〜eはそれぞれ破裂音、無声摩擦音を発話した
場合の同装置各部の信号波形を示す図である。 1……流速検出手段、2……破裂音検出手段、
3……マイク、4……摩擦音抽出手段、5……判
定手段。
Fig. 1 is a block diagram showing one embodiment of the plosive sound detection device according to the present invention, and Figs. 2 a to e and 3 a to e show the various parts of the device when a plosive sound and a voiceless fricative sound are uttered, respectively. FIG. 3 is a diagram showing signal waveforms. 1...Flow velocity detection means, 2...Plosive sound detection means,
3...Microphone, 4...Fricative sound extraction means, 5...Determination means.

Claims (1)

【特許請求の範囲】[Claims] 1 口気流を検出する流速検出手段と、前記流速
検出手段の出力信号を処理し、破裂音の候補を検
出する破裂音検出手段と、音声を検出するマイク
と、前記マイクの出力信号より摩擦音を抽出する
摩擦音抽出手段と、前記摩擦音抽出手段の出力信
号により前記破裂音検出手段の出力信号を抑制す
る手段とを備えたことを特徴とする破裂音検出装
置。
1. A flow velocity detection means for detecting oral airflow, a plosive detection means for processing the output signal of the flow velocity detection means and detecting plosive sound candidates, a microphone for detecting sound, and detecting fricative sounds from the output signal of the microphone. A plosive sound detection device comprising: a fricative sound extraction means for extracting a fricative sound; and a means for suppressing an output signal of the plosive sound detection means by an output signal of the fricative sound extraction means.
JP3934481A 1981-03-20 1981-03-20 Plosive detector Granted JPS57155596A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP3934481A JPS57155596A (en) 1981-03-20 1981-03-20 Plosive detector

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP3934481A JPS57155596A (en) 1981-03-20 1981-03-20 Plosive detector

Publications (2)

Publication Number Publication Date
JPS57155596A JPS57155596A (en) 1982-09-25
JPS6329759B2 true JPS6329759B2 (en) 1988-06-15

Family

ID=12550462

Family Applications (1)

Application Number Title Priority Date Filing Date
JP3934481A Granted JPS57155596A (en) 1981-03-20 1981-03-20 Plosive detector

Country Status (1)

Country Link
JP (1) JPS57155596A (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS59149399A (en) * 1983-02-16 1984-08-27 工業技術院長 Consonant sorter

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS501846A (en) * 1973-05-14 1975-01-09

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS501846A (en) * 1973-05-14 1975-01-09

Also Published As

Publication number Publication date
JPS57155596A (en) 1982-09-25

Similar Documents

Publication Publication Date Title
JPS6329759B2 (en)
JPS5949742A (en) Apparatus for detecting exhalation force
Niederjohn et al. Computer recognition of the continuant phonemes in connected English speech
JPS58150997A (en) Speech feature extractor
JPS58150995A (en) Speech feature extractor
JPH01112300A (en) Plosive sound detector
JPS6258519B2 (en)
JPS6260720B2 (en)
JPS6260718B2 (en)
JPS63163494A (en) Intensity detector
JPH0567039B2 (en)
JPS58120299A (en) H series sound detector
JPH0398098A (en) Voice recognition device
JPS6329760B2 (en)
JPS58120300A (en) Plosive extractor
JPS63226691A (en) Reference pattern generation system
KR970067093A (en) An epoch detection method in a voiced part of a voice signal
JPS6230640B2 (en)
JPS63226692A (en) Pattern comparison system
JPS61116400A (en) Voice information processor
JPS61190398A (en) Plosive consonant recognition system
JPS59149399A (en) Consonant sorter
JPS5926796A (en) Voice recognition equipment
Oh et al. Endpoint detection of isolated Korean utterances for bimodal speech recognition in acoustic noisy environments
JPS6331794B2 (en)