JPS6329759B2

JPS6329759B2 -

Info

Publication number: JPS6329759B2
Application number: JP56039344A
Authority: JP
Inventors: Takeo Murata; Nobuhisa Kadowaki
Original assignee: Agency of Industrial Science and Technology
Current assignee: National Institute of Advanced Industrial Science and Technology AIST
Priority date: 1981-03-20
Filing date: 1981-03-20
Publication date: 1988-06-15
Also published as: JPS57155596A

Description

【発明の詳細な説明】本発明は破裂音を検出する破裂音検出装置に関
するものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a plosive sound detection device for detecting plosive sounds.

従来、人の音声のうちから破裂音だけを確実、
簡便、かつリアルタイムで検出する方法は知られ
ていない。 Previously, only plosive sounds could be reliably detected from human speech.
There is no known method for simple and real-time detection.

本発明は発話時の口気流の流速及び音声の情報
から破裂音のみを確実、簡便、かつリアルタイム
で検出する破裂音検出装置を提供することを目的
とする。 An object of the present invention is to provide a plosive sound detection device that reliably, simply, and in real time detects only plosive sounds based on the flow velocity of oral airflow during speech and voice information.

実験によれば、発話時の口気流の流速波形は破
裂音に対して特異的であり、この流速波形に簡単
な信号処理を行なうことにより、破裂音を有効に
検出しうることが確かめられた。しかし、この方
法では発話の仕方により、ときに、破裂音以外の
音、多くは無声摩擦音をも破裂音として検出する
ことが明らかになつた。 Experiments have shown that the velocity waveform of oral airflow during speech is specific to plosive sounds, and that plosive sounds can be detected effectively by performing simple signal processing on this velocity waveform. . However, it has become clear that this method sometimes detects sounds other than plosives, often voiceless fricatives, as plosives, depending on the way the utterance is made.

本発明は口気流情報から破裂音の候補を検出し
一方、音声情報から摩擦音を抽出して、上記破裂
音の候補の中から摩擦音を除去することにより、
破裂音のみを正しく検出するものである。 The present invention detects plosive candidates from oral airflow information, extracts fricatives from voice information, and removes fricatives from the plosive candidates.
This method correctly detects only plosive sounds.

以下、本発明の実施例について図面と共に説明
する。 Embodiments of the present invention will be described below with reference to the drawings.

第１図は本発明にかかる破裂音検出装置の一実
施例を示すブロツク図である。図において、１は
口気流速を検出する流速検出手段、２はその流速
検出手段１の出力信号理し破裂音の候補を検出す
る破裂音検出手段、３は音声を検出するマイク、
４はマイク３の出力信号より摩擦音を抽出する摩
擦音抽出手段、５は摩擦音抽出手段４の出力信号
により破裂音検出手段２の出力信号を抑制する判
定手段である。流速検出手段１は発話によつて生
じる口の前の気流の速度を測定し、電気信号を出
力するものであり、例えば熱線式の流速計等で構
成される。破裂音検出手段２は流速検出手段１の
出力信号、いわゆる口気流波形を処理して破裂音
の候補を検出するものであり、例えば抵抗、コン
デンサで構成する微分回路等で構成される。マイ
ク３は音声を検出するものであり、例えば、低雑
音接話型マイクロホン等である。摩擦音抽出手段
４はマイク３の出力信号、いわゆる音声信号から
摩擦音を抽出するものであり、例えば、ローパス
フイルター回路とハイパスフイルター回路と除算
回路とで構成し、それぞれのフイルター回路で音
声を分離し、除算回路によりハイパスフイルター
回路の出力をローパスフイルター回路の出力で除
算し、その出力の大小によつて摩擦音を抽出す
る。判定手段５は摩擦音抽出手段４の出力信号に
より破裂音検出手段２の出力信号を抑制するもの
であり、例えばNAND回路、あるいは可変利得
増幅回路等で構成される。 FIG. 1 is a block diagram showing one embodiment of a plosive sound detection device according to the present invention. In the figure, 1 is a flow velocity detection means for detecting the oral air flow velocity, 2 is a plosive sound detection means for detecting plosive candidates based on the output signal of the flow velocity detection means 1, and 3 is a microphone for detecting sound.
4 is a fricative sound extracting means for extracting a fricative sound from the output signal of the microphone 3; and 5 is a determining means for suppressing the output signal of the plosive sound detecting means 2 by the output signal of the fricative sound extracting means 4. The flow velocity detection means 1 measures the velocity of the airflow in front of the mouth caused by speech and outputs an electrical signal, and is composed of, for example, a hot wire type current meter. The plosive sound detection means 2 processes the output signal of the flow rate detection means 1, ie, the so-called oral airflow waveform, to detect plosive sound candidates, and is composed of, for example, a differentiating circuit made up of resistors and capacitors. The microphone 3 detects voice, and is, for example, a low-noise close-talk type microphone. The fricative sound extraction means 4 extracts fricative sounds from the output signal of the microphone 3, which is a so-called audio signal, and is composed of, for example, a low-pass filter circuit, a high-pass filter circuit, and a division circuit, and each filter circuit separates the sound. A division circuit divides the output of the high-pass filter circuit by the output of the low-pass filter circuit, and fricative sounds are extracted depending on the magnitude of the output. The determining means 5 suppresses the output signal of the plosive sound detecting means 2 using the output signal of the fricative sound extracting means 4, and is composed of, for example, a NAND circuit or a variable gain amplifier circuit.

第２図ａ〜ｅ及び第３図ａ〜ｅは第１図に示し
た破裂音検出装置の各部の信号波形を示すもので
あり、第２図は破裂音、第３図は無声摩擦音を発
話した場合をそれそれ示している。波形イ、ロは
発話によつてマイク３が検出した破裂音と無声摩
擦音の音声信号の代表例を表わし、波形ハ、ニは
その時の流速検出手段１が検出した口気流波形を
示すものである。波形ホ、ヘは破裂音検出手段２
によつて口気流波形ハ、ニから破裂音の候補を検
出した波形を示すものである。波形ト、チは摩擦
音抽出手段４により音声信号イ、ロから摩擦音を
抽出した信号すなわち摩擦音信号波形を示すもの
であり、また波形リ、ヌは判定手段５の出力信号
を示すものである。 Figures 2 a to e and Figures 3 a to e show signal waveforms of each part of the plosive sound detection device shown in Figure 1. Figure 2 shows a plosive sound, and Figure 3 shows a voiceless fricative sound. If you do it it shows it. Waveforms A and B represent representative examples of audio signals of plosives and voiceless fricatives detected by the microphone 3 during speech, and waveforms C and D represent oral airflow waveforms detected by the flow velocity detection means 1 at that time. . Waveforms E and F are plosive sound detection means 2
This shows waveforms in which candidates for plosive sounds were detected from the oral airflow waveforms c and d. Waveforms ① and ① indicate signals obtained by extracting fricatives from audio signals ① and ② by the fricative sound extracting means 4, that is, fricative sound signal waveforms, and waveforms ① and ① indicate output signals of the determining means 5.

以上のような構成において、破裂音例えば／
Ｐ／を含む音／Pa／を発声した場合、流速検出
手段１は口気流波形ハを検出し、マイク３は音声
信号イを検出する。破裂音検出手段２は口気流波
形ハを微分することにより破裂音の候補である微
分信号ホを検出する。一方、摩擦音抽出手段４は
音声信号イより摩擦音を抽出する。音声信号イに
おける高周波成分は少なく、従つてハイパスフイ
ルター回路での出力はほとんどない。この為、波
形トに示すように摩擦音抽出手段４の出力はな
い。したがつて、摩擦音信号トが大きい時に微分
信号ホを小さくするように抑制する判定手段５の
出力は、摩擦音信号トがないため破裂音を表わす
破裂音パルス信号リを発生する。 In the above configuration, plosive sounds such as /
When the sound /Pa/ containing P/ is uttered, the flow velocity detection means 1 detects the oral airflow waveform C, and the microphone 3 detects the audio signal A. The plosive sound detection means 2 detects a differential signal E, which is a candidate for a plosive sound, by differentiating the oral airflow waveform C. On the other hand, the fricative sound extracting means 4 extracts a fricative sound from the audio signal A. There are few high frequency components in the audio signal A, so there is almost no output from the high pass filter circuit. Therefore, as shown in waveform G, there is no output from the fricative sound extraction means 4. Therefore, the output of the determining means 5, which suppresses the differential signal E to a small value when the fricative signal G is large, generates a plosive pulse signal R representing a plosive sound since there is no fricative signal G.

次に、例えば無声摩擦音／Ｓ／を含む音／
Sa／を特に／Ｓ／を強調して発声した場合、流
速検出手段１の出力信号は異常に大きな口気流波
形ニを検出する。この口気流波形ニは破裂音検出
手段２により破裂音の微分信号ホに近い微分信号
へを出力することがある。したがつて、無声摩擦
音／Ｓ／は破裂音の候補となる。一方、摩擦音抽
出手段４は音声信号ロより摩擦音を抽出する。音
声信号ロにおける高周波成分は多く、従つてハイ
パスフイルター回路での出力が大きい。この為、
摩擦音抽出手段４の出力は波形チに示すような摩
擦音信号を発生する。従つて、摩擦音信号チが大
きい時に微分信号へを小さくするように抑制する
判定手段５の出力は、波形ヌに示すように破裂音
を表わす破裂音パルス信号を発生しない。 Next, for example, a sound containing the unvoiced fricative /S/ /
When Sa/ is uttered with particular emphasis on /S/, the output signal of the flow velocity detection means 1 detects an abnormally large oral airflow waveform D. This oral airflow waveform D may be outputted by the plosive sound detection means 2 as a differential signal close to the differential signal E of the plosive sound. Therefore, the voiceless fricative /S/ is a candidate for a plosive. On the other hand, the fricative sound extracting means 4 extracts a fricative sound from the audio signal B. There are many high frequency components in the audio signal B, and therefore the output from the high pass filter circuit is large. For this reason,
The output of the fricative sound extraction means 4 generates a fricative sound signal as shown in waveform H. Therefore, the output of the determining means 5, which suppresses the differential signal so as to be small when the fricative signal Q is large, does not generate a plosive pulse signal representing a plosive as shown in waveform N.

このように本実施例によれば、発話の仕方にか
かわらず、無声摩擦音を破裂音と検出せず、破裂
音のみを正しく検出することができる。また、複
雑な信号処理手段等を用いず、流速計、簡単な微
分回路、マイク、２つのフイルター回路、及び、
NAND回路等で構成できる。さらに原理上リア
ルタイムで検出できることは明らかである。 As described above, according to this embodiment, regardless of the manner of utterance, voiceless fricatives are not detected as plosives, and only plosives can be correctly detected. In addition, without using complicated signal processing means, a current meter, a simple differentiation circuit, a microphone, two filter circuits, and
It can be configured with NAND circuits, etc. Furthermore, it is clear that detection can be performed in real time in principle.

以上説明したように本発明は口気流情報から破
裂音の候補を検出し、一方、音声情報から摩擦音
を抽出して上記破裂音の候補の中から摩擦音を除
去するようにしたので破裂音のみを正しく検出す
ることができる。 As explained above, the present invention detects plosive candidates from oral airflow information, and on the other hand, extracts fricatives from voice information and removes fricatives from the plosive candidates. Can be detected correctly.

[Brief explanation of the drawing]

第１図は本発明にかかる破裂音検出装置の一実
施例を示すブロツク図、第２図ａ〜ｅ及び第３図
ａ〜ｅはそれぞれ破裂音、無声摩擦音を発話した
場合の同装置各部の信号波形を示す図である。１……流速検出手段、２……破裂音検出手段、
３……マイク、４……摩擦音抽出手段、５……判
定手段。 Fig. 1 is a block diagram showing one embodiment of the plosive sound detection device according to the present invention, and Figs. 2 a to e and 3 a to e show the various parts of the device when a plosive sound and a voiceless fricative sound are uttered, respectively. FIG. 3 is a diagram showing signal waveforms. 1...Flow velocity detection means, 2...Plosive sound detection means,
3...Microphone, 4...Fricative sound extraction means, 5...Determination means.

Claims

[Claims]

1. A flow velocity detection means for detecting oral airflow, a plosive detection means for processing the output signal of the flow velocity detection means and detecting plosive sound candidates, a microphone for detecting sound, and detecting fricative sounds from the output signal of the microphone. A plosive sound detection device comprising: a fricative sound extraction means for extracting a fricative sound; and a means for suppressing an output signal of the plosive sound detection means by an output signal of the fricative sound extraction means.