JPS6217800A - Voice section decision system - Google Patents

Voice section decision system

Info

Publication number
JPS6217800A
JPS6217800A JP60159149A JP15914985A JPS6217800A JP S6217800 A JPS6217800 A JP S6217800A JP 60159149 A JP60159149 A JP 60159149A JP 15914985 A JP15914985 A JP 15914985A JP S6217800 A JPS6217800 A JP S6217800A
Authority
JP
Japan
Prior art keywords
section
speech
noise
vowel
input sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP60159149A
Other languages
Japanese (ja)
Other versions
JPH0456999B2 (en
Inventor
伸 神谷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sharp Corp
Original Assignee
Sharp Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sharp Corp filed Critical Sharp Corp
Priority to JP60159149A priority Critical patent/JPS6217800A/en
Publication of JPS6217800A publication Critical patent/JPS6217800A/en
Priority to US07/256,151 priority patent/US4920568A/en
Publication of JPH0456999B2 publication Critical patent/JPH0456999B2/ja
Granted legal-status Critical Current

Links

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。
(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】 く技術分野〉 本発明は入力音の中から音声と雑音とを分離するだめの
音声区間判定方式に関するものである。
DETAILED DESCRIPTION OF THE INVENTION Technical Field The present invention relates to a speech segment determination method for separating speech and noise from input sound.

〈従来技術〉 音声と雑音とを分離する際に、今までは白色雑音やパル
ス性雑音等ある特定の雑音のみを検出し、それを抑制す
ることにより雑音の軽減をはかってきた。
<Prior Art> When separating speech and noise, conventional techniques have been to detect only certain types of noise, such as white noise or pulsed noise, and to suppress the noise in order to reduce the noise.

しかし、雑音の種類は無限にあり、したがって雑音ごと
に抑制方法を変えていくこれまでの方式では全ての雑音
に対処しきれない。
However, there are an infinite number of types of noise, and conventional methods that change the suppression method for each type of noise cannot deal with all types of noise.

く目 的〉 本発明はかかる従来の問題点に鑑みて成されたもので、
特定の雑音を検出して抑制するのではなく、入力音から
音声と雑音とを分離することにより、非常に多くの種類
の雑音をきわめて容易に取り除くことができる音声区間
判定方式を提供せんとするものである。さらに言えば、
母音の有無にもとづいて音声区間の判定を行ない、得ら
れた音声区間を雑音区間から分離し得るようにした音声
区間判定方式を提供せんとするものである。
Purpose> The present invention has been made in view of such conventional problems, and
Rather than detecting and suppressing specific noise, the present invention aims to provide a speech interval determination method that can very easily remove a large number of types of noise by separating speech and noise from input sound. It is something. Furthermore,
It is an object of the present invention to provide a speech interval determination method that determines speech intervals based on the presence or absence of vowels and is capable of separating the obtained speech intervals from noise intervals.

〈実施例〉 子音と母音の組を基本構造とする日本語音声において、
母音らしい区間の条件としては以下の4つが挙げられる
<Example> In Japanese speech whose basic structure is a pair of consonants and vowels,
The following four conditions are listed as conditions for a vowel-like interval.

■ パワーが大きい区間。■ Sections with high power.

Φ スペクトル変化が小さい区間(音声定常部)。Φ A section with small spectral changes (voice stationary part).

■ 母音の標準パターンとのマツチング距離が小さい区
間。
■ An area where the matching distance with the standard vowel pattern is small.

■ ケグストラム係数の絶対値和か大きい区間。■ Sum of absolute values of kegstrum coefficients or larger interval.

本発明はこれら4つの条件の中から特に■と■の条件に
もとづいて母音区間を検出して、雑音区間との分離を行
なうもので、これによって、母音の標準パターンとのマ
ツチングを省略し、より簡単なハードウェア構成により
音声区間の判定を行なえるようにしたところKvf、徴
がある。
The present invention detects vowel intervals based on the conditions ■ and ■ out of these four conditions and separates them from noise intervals, thereby omitting matching with the standard pattern of vowels. When the voice section can be determined using a simpler hardware configuration, Kvf is found.

次に図にもとづいて本発明方式を詳細に説明する。Next, the system of the present invention will be explained in detail based on the drawings.

第1図は本発明方式を実施した音声区間判定装置のブロ
ック構成図である。
FIG. 1 is a block diagram of a speech segment determination device implementing the method of the present invention.

図において!は音声分析部、2はケプストラム和計算部
、3は判定部である。
In the figure! 2 is a speech analysis section, 2 is a cepstrum sum calculation section, and 3 is a judgment section.

前記音声分析部1は第2図にそのブロック構成図を示す
通り、自己相関係数計算部4、線形予測係数計算部5、
ケプストラム係数計算部6及びパワー計算部7から構成
されている。
As shown in the block diagram of FIG. 2, the speech analysis section 1 includes an autocorrelation coefficient calculation section 4, a linear prediction coefficient calculation section 5,
It consists of a cepstrum coefficient calculation section 6 and a power calculation section 7.

自己相関係数計算部4ではサンプリング値5(i)(た
だし、1≦l≦256)、分析次数np=24として、
第3図の処理フローにもとづいて自己相関係数(Ri)
(ただし、1≦i≦np+Iンを求めている。
The autocorrelation coefficient calculation unit 4 calculates the sampling value 5(i) (where 1≦l≦256) and the analysis order np=24,
Based on the processing flow in Figure 3, the autocorrelation coefficient (Ri)
(However, 1≦i≦np+In is required.

一方、線形予測係数計算部5では前記計算部4からの自
己相関係数(Ri)を入力として、第4図の処理フロー
に従って線形予測係数A(i) (ただし、■≦i≦n
p  八個自己相関係数P(i)並びに残差パワーE 
(i)を算出する。又、ケプストラム係数計算部6では
前段で求められた線形予測係数A(i)(l≦i≦np
  )をもとに次式によりケグストラム係数c(i)(
1≦i≦np  )を求める。
On the other hand, the linear prediction coefficient calculation unit 5 inputs the autocorrelation coefficient (Ri) from the calculation unit 4, and follows the processing flow of FIG. 4 to calculate the linear prediction coefficient A(i) (where ■≦i≦n
p Eight autocorrelation coefficients P(i) and residual power E
Calculate (i). In addition, the cepstral coefficient calculation unit 6 calculates the linear prediction coefficient A(i) (l≦i≦np
), the kegstrum coefficient c(i)(
1≦i≦np).

さらに、パワー計算部7ではサンプリング値5(i)(
+≦i≦256)から次式にもとづいてパワーPを求め
る。
Furthermore, in the power calculation unit 7, the sampling value 5(i)(
+≦i≦256), the power P is determined based on the following equation.

次に動作を説明する。Next, the operation will be explained.

まず音声分析部1にて音声信号を16KH2でサンプリ
ングしくただし、時刻tのサンプリング値を5(t)と
する)、16m秒のハユング窓をかけて、フレーム周期
8m秒毎にパワーPとLPCケプストラムCを求める。
First, the audio signal is sampled at 16KH2 in the audio analysis unit 1, and the sampling value at time t is 5(t)), and a Hayung window of 16 ms is applied, and the power P and LPC cepstrum are calculated every 8 ms frame period. Find C.

(なお、1番目のフレームのパワー及びケプストラムを
それぞれP (t) 、 c (t)であられす)。求
められたLPCケグストラムc (t)は次段のケプス
トラム和計算部2に入力され、ここで低次(24次まで
)のケグストラム係数の絶(りとして出力される。
(Note that the power and cepstrum of the first frame are P (t) and c (t), respectively). The obtained LPC kegstrum c (t) is input to the cepstral sum calculation section 2 at the next stage, where it is output as the end of low-order (up to the 24th order) kegstrum coefficients.

こうして求められたパワーP(tJとケプストラム和C
(t)は判定部3(/c送られ、そこで次のような判定
が成される。
The power P (tJ and cepstral sum C
(t) is sent to the determination unit 3 (/c), where the following determination is made.

すなわち、パワーP (t)がいき値alより大きく(
第5図参照)、かつケプストラム和C(υがいき値a2
より大きければ(第6図参照)、そのフレームが母音区
間内Vc6ると判定する。そして、区間tI<t<t2
において(ただし、t 2− t +>84フレームと
する)、有音区間が21フレ一ム以上あシ、かつ母音区
間と判定されたフレーム数が有音区間長の174以上な
らば区間tI<t<t2は音声区間であると判定し、ま
た1/4未満ならば雑音区間であると判定する。
That is, the power P (t) is larger than the threshold al (
(see Figure 5), and the cepstral sum C (υ is the threshold a2
If it is larger (see FIG. 6), it is determined that the frame is within the vowel interval Vc6. Then, the interval tI<t<t2
(however, t 2 - t + > 84 frames), if the voiced section is 21 frames or more, and the number of frames determined to be vowel sections is 174 or more, which is the length of the voiced section, then the section tI < If t<t2, it is determined that it is a voice section, and if it is less than 1/4, it is determined that it is a noise section.

このようにして、入力音区間長に対する母音図5長の比
といき値との関係にもとづいて入力音中の音声区間と雑
音区間を判定し分離することができる。特にこの方式の
特徴は、母音区間の検出にあたって母音の標準パターン
とのマツチング処理を行なわないので、非常に簡単なハ
ードウェア構成でもって音声区間の判定を行なうことが
できることである。
In this way, it is possible to determine and separate the speech section and the noise section in the input sound based on the relationship between the threshold and the ratio of the vowel diagram 5 length to the input sound section length. A particular feature of this method is that it does not perform matching processing with a standard vowel pattern when detecting vowel sections, so it is possible to determine speech sections with a very simple hardware configuration.

なお、入力音区間長に対する母音区間長の比は実施例に
限らず適宜定めることが出来る。
Note that the ratio of the vowel section length to the input sound section length is not limited to the embodiment and can be determined as appropriate.

く効 果〉 以上詳細に説明した様に、本発明に係る音声区間判定方
式は入力音中の母音区間を検出し、入力音区間長に対す
る前記検出した母音区間長の比を求め、その比がいき値
より大きいとき入力音が音声であると判定するようにし
たから、入力音中から音声のみ検出することができ、し
かも母音区間の検出の際に母音の標準パターンとのマツ
チング処aを行なわないので、ノ・−ドウエアの構成に
あたって、その構成を著しく簡略化することができると
いう大きな効果がある。
Effect> As explained in detail above, the speech interval determination method according to the present invention detects a vowel interval in an input sound, calculates the ratio of the detected vowel interval length to the input sound interval length, and calculates the ratio of the detected vowel interval length to the input sound interval length. Since the input sound is determined to be speech when it is larger than the threshold, it is possible to detect only speech from the input sound, and when detecting vowel sections, matching process a with the standard vowel pattern is performed. Therefore, there is a great effect that the configuration of the software can be significantly simplified.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明方式を採用した音声区間判定装置のブロ
ック図、第2図は第1図の音声分析部のブロック図、第
3図は自己相関係数計算部における処理フロー図、第4
図は線形予測係数計算部における処理フロー図、第5図
はパワーのいき値と雑音及び音声との関係を示す図、第
6図はケグヌトラム和のいき値と雑音及び音声との関係
を示す図である。 lは音声分析部、2はケグヌトラム和計算部、3は判定
部、5(t)はサンプリング値、c(t)はLPGケグ
ヌトラム、P(t)はパワー、C(t)はケプヌトラム
和。 代理人 弁理士 福 士 愛 彦(他2名)第1図 91色 副      第4図 第6図
Fig. 1 is a block diagram of a speech segment determination device adopting the method of the present invention, Fig. 2 is a block diagram of the speech analysis section of Fig. 1, Fig. 3 is a processing flow diagram of the autocorrelation coefficient calculation section, and Fig. 4
Figure 5 is a diagram showing the processing flow in the linear prediction coefficient calculation unit, Figure 5 is a diagram showing the relationship between the power threshold, noise and speech, and Figure 6 is a diagram showing the relationship between the Kegnutrum sum threshold, noise and audio. It is. 1 is a speech analysis unit, 2 is a kegnutrum sum calculation unit, 3 is a determination unit, 5(t) is a sampling value, c(t) is an LPG kegnutrum, P(t) is power, and C(t) is a kegnutrum sum. Agent Patent attorney Aihiko Fuku (and 2 others) Figure 1 91 color subtitles Figure 4 Figure 6

Claims (1)

【特許請求の範囲】[Claims] 1、入力音中の母音区間を検出し、入力音区間長に対す
る前記検出した母音区間長の比を求め、その比がいき値
より大きいとき入力音が音声であると判定する音声区間
判定方式。
1. A voice section determination method that detects a vowel section in an input sound, calculates the ratio of the detected vowel section length to the input sound section length, and determines that the input sound is speech when the ratio is greater than a threshold value.
JP60159149A 1985-07-16 1985-07-16 Voice section decision system Granted JPS6217800A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP60159149A JPS6217800A (en) 1985-07-16 1985-07-16 Voice section decision system
US07/256,151 US4920568A (en) 1985-07-16 1988-10-11 Method of distinguishing voice from noise

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP60159149A JPS6217800A (en) 1985-07-16 1985-07-16 Voice section decision system

Publications (2)

Publication Number Publication Date
JPS6217800A true JPS6217800A (en) 1987-01-26
JPH0456999B2 JPH0456999B2 (en) 1992-09-10

Family

ID=15687327

Family Applications (1)

Application Number Title Priority Date Filing Date
JP60159149A Granted JPS6217800A (en) 1985-07-16 1985-07-16 Voice section decision system

Country Status (1)

Country Link
JP (1) JPS6217800A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007046267A1 (en) * 2005-10-20 2007-04-26 Nec Corporation Voice judging system, voice judging method, and program for voice judgment
WO2010089976A1 (en) 2009-02-09 2010-08-12 パナソニック株式会社 Hearing aid
US8121321B2 (en) 2008-12-26 2012-02-21 Panasonic Corporation Hearing aids

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007046267A1 (en) * 2005-10-20 2007-04-26 Nec Corporation Voice judging system, voice judging method, and program for voice judgment
JP4911034B2 (en) * 2005-10-20 2012-04-04 日本電気株式会社 Voice discrimination system, voice discrimination method, and voice discrimination program
US8175868B2 (en) 2005-10-20 2012-05-08 Nec Corporation Voice judging system, voice judging method and program for voice judgment
US8121321B2 (en) 2008-12-26 2012-02-21 Panasonic Corporation Hearing aids
WO2010089976A1 (en) 2009-02-09 2010-08-12 パナソニック株式会社 Hearing aid
US8126176B2 (en) 2009-02-09 2012-02-28 Panasonic Corporation Hearing aid

Also Published As

Publication number Publication date
JPH0456999B2 (en) 1992-09-10

Similar Documents

Publication Publication Date Title
JPH0431898A (en) Voice/noise separating device
JPS6217800A (en) Voice section decision system
JP2992324B2 (en) Voice section detection method
JPH03114100A (en) Voice section detecting device
JPS63281200A (en) Voice section detecting system
JPH04238399A (en) Voice recognition device
JPH04115299A (en) Method and device for voiced/voiceless sound decision making
KR100345402B1 (en) An apparatus and method for real - time speech detection using pitch information
JP2737109B2 (en) Voice section detection method
JPH0457000B2 (en)
JPS5925237B2 (en) Speech segment determination method using speech analysis and synthesis method
JPH0567039B2 (en)
JPS61233791A (en) Voice section detection system for voice recognition equipment
JPS62238599A (en) Voice section detecting system
JP2772598B2 (en) Audio coding device
JPH04251299A (en) Speech section detecting means
JP3233543B2 (en) Method and apparatus for extracting impulse drive point and pitch waveform
JPH0567040B2 (en)
JPS59228300A (en) Voice section detecting system
JP2891259B2 (en) Voice section detection device
JPS63155196A (en) Voiceless sound detection
JPH0259480B2 (en)
JPS5925240B2 (en) Word beginning detection method for speech sections
JPH0394300A (en) Voice detector
JPS63237100A (en) Voice detector

Legal Events

Date Code Title Description
LAPS Cancellation because of no payment of annual fees