JP2000010577A - Voiced sound/voiceless sound judging device - Google Patents

Voiced sound/voiceless sound judging device

Info

Publication number
JP2000010577A
JP2000010577A JP17338198A JP17338198A JP2000010577A JP 2000010577 A JP2000010577 A JP 2000010577A JP 17338198 A JP17338198 A JP 17338198A JP 17338198 A JP17338198 A JP 17338198A JP 2000010577 A JP2000010577 A JP 2000010577A
Authority
JP
Japan
Prior art keywords
sound
voiced
signal
unvoiced
amplitude
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP17338198A
Other languages
Japanese (ja)
Inventor
Kazuki Sakai
和樹 酒井
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Priority to JP17338198A priority Critical patent/JP2000010577A/en
Publication of JP2000010577A publication Critical patent/JP2000010577A/en
Pending legal-status Critical Current

Links

Abstract

PROBLEM TO BE SOLVED: To provide a voiced sound/voiceless sound judging device capable of rapidly and correctly judging a voiced sound or a voiceless sound and being constituted with a simple circuit. SOLUTION: This voiced sound/voiceless sound judging device is constituted of a microphone 11 for inputting a voice, an amplifier 12 for amplifying the signal from the microphone 11 to prescribed amplitude, an A/D converter 13 for converting the signal from the amplifier 12 to a digital signal, a low-pass filter 14 for extracting the signal of a low band from the digitized signal, an amplitude average measuring instrument 15 for measuring the average of the amplitude in the prescribed time interval of the extracted low band signal, a number of zero-crossing counter 16 for measuring the number of zero-crossing of the digitized signal and a voiced sound/voiceless sound discriminator 17 for discriminating the voiced sound from the voiceless sound based on the outputs of the amplitude mean measuring instrument 15 and the number of zero-cross counter 16 and deciding the ratio of the voiceless sound to the input voice.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【0001】[0001]

【発明の属する技術分野】本発明は音声入力に含まれる
有声音と無声音との割合いを正確、且つ簡便に判定する
ことを目的とした有声音/無声音判定装置に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voiced / unvoiced sound judging apparatus for accurately and easily judging the ratio between voiced sound and unvoiced sound contained in a voice input.

【0002】[0002]

【従来の技術】従来より音声の有声音/無声音判定装置
としては次のようなものが知られている。
2. Description of the Related Art The following is known as a voiced / unvoiced sound discriminating apparatus.

【0003】その第1はマイクより入力された音声を、
A/D変換装置を経てディジタル信号に変換し、一定時
間間隔でこの信号の振幅レベルが零レベルと交差する回
数、即ち零交差数を測定する。そして、この零交差数が
装置内部に設定した第1の閾値と第2の閾値(第1の閾
値<第2の閾値)に関して、第1の閾値を下回った場合
は有声音と判別し、第2の閾値を上回った場合は無声音
と判別し、第1の閾値と第2の閾値との間にある場合は
有声音と無声音とが混在すると判断する有声音/無声音
判定装置である。
[0003] The first is that a voice input from a microphone is
The signal is converted into a digital signal via an A / D converter, and the number of times the amplitude level of the signal crosses the zero level at a fixed time interval, that is, the number of zero crossings is measured. If the number of zero crossings falls below the first threshold with respect to the first threshold and the second threshold (the first threshold <the second threshold) set inside the apparatus, it is determined that the voiced sound is present. The voiced / unvoiced sound determination device determines that the voiced sound is unvoiced when the value exceeds the second threshold value, and that the voiced sound and the unvoiced sound are mixed when the value is between the first threshold value and the second threshold value.

【0004】しかしながら、この装置は比較的単純な回
路で構成できる利点はあるが、周期性のある有声音にノ
イズ成分が付加されたような音声、例えば有声子音等の
音声の場合、零振幅レベル付近でのノイズ成分に多くの
影響を受けて零交差数の値が大きくなり、このためこの
音声を無声音と誤判定することがあった。
However, this apparatus has an advantage that it can be constituted by a relatively simple circuit. However, in the case of a voice in which a noise component is added to a periodic voiced sound, for example, a voice such as a voiced consonant, a zero amplitude level is obtained. The value of the number of zero crossings is increased due to the influence of noise components in the vicinity, and this voice may be erroneously determined to be unvoiced.

【0005】また、その第2はマイクより入力された音
声を、A/D変換装置を経てディジタル信号に変換し、
一定時間間隔で自己相関関数を算出するものである。こ
の自己相関関数は音声の周期性の度合いを示しており、
この自己相関関数値に明確なピークが存在すれば周期性
の強い有声音と判断し、一方、明確なピークが存在しな
ければ周期性の乏しい無声音と判断する。
[0005] The second is to convert the voice input from the microphone into a digital signal via an A / D converter,
The autocorrelation function is calculated at regular time intervals. This autocorrelation function indicates the degree of periodicity of the voice,
If there is a clear peak in the autocorrelation function value, it is determined that the voiced sound has a strong periodicity. On the other hand, if there is no clear peak, it is determined that the voiced sound has a poor periodicity.

【0006】しかしながら、この装置では音声の周期性
を判定の基本とするため、有声音/無声音の判定は正確
に行えるが、自己相関関数の算出のためには多数回の積
和演算が必要とされ、そのため回路構成が複雑となり、
また演算にも時間を必要とするものであった。
However, in this apparatus, voiced sound / unvoiced sound can be accurately determined because the periodicity of speech is used as a basis for determination. However, a large number of multiply-accumulate operations are required for calculating an autocorrelation function. Therefore, the circuit configuration becomes complicated,
Also, the calculation requires time.

【0007】[0007]

【発明が解決しようとする課題】従って本発明は、有声
音と無声音の判定を早く正確に行え、しかも簡便な回路
で構成できる有声音/無声音判定装置の提供を目的とす
る。
SUMMARY OF THE INVENTION Accordingly, an object of the present invention is to provide a voiced / unvoiced sound discriminating apparatus which can quickly and accurately determine a voiced sound and an unvoiced sound, and which can be constituted by a simple circuit.

【0008】[0008]

【課題を解決するための手段】本発明は上記課題に鑑み
なされたものであって、入力された音声信号の零交差数
を計数する計数手段と、入力された音声信号を低域通過
フィルタを通過させた後、通過した音声信号の振幅の平
均値を、所定時間間隔で測定する測定手段と、振幅の平
均値と零交差数とにより有声音と無声音とを判定する判
定手段とを具備した有声音/無声音判定装置を構成す
る。
SUMMARY OF THE INVENTION The present invention has been made in view of the above problems, and has a counting means for counting the number of zero crossings of an input audio signal, and a low-pass filter for converting the input audio signal. After passing, an average value of the amplitude of the passed audio signal is measured at predetermined time intervals, and a determination unit that determines voiced sound and unvoiced sound based on the average value of the amplitude and the number of zero crossings is provided. A voiced / unvoiced sound determination device is configured.

【0009】また、前記有声音/無声音判定装置は判定
手段内に設定される有声音と無声音との判定基準は、平
均値によって自動的に所定の値に設定される構成にし
て、上記課題を解決する。
Further, the voiced / unvoiced sound judging device has a structure in which a judgment standard for voiced sound and unvoiced sound set in the judging means is automatically set to a predetermined value by an average value. Resolve.

【0010】入力音声信号の振幅に応じて無声音の比率
を判別する基準を定め、入力音声の零交差数とその基準
に基づいて入力音声中の無声音の比率がもとまる。
A criterion for determining the ratio of unvoiced sounds is determined according to the amplitude of the input voice signal, and the ratio of unvoiced sounds in the input voice is determined based on the number of zero crossings of the input voice and the criterion.

【0011】[0011]

【発明の実施の形態】本発明は入力音声信号に含まれる
有声音と無声音の度合いの判定を正確、且つ簡便に行う
ことができる有声音/無声音判定装置に関するものであ
って、入力音声信号の振幅の大きさによって入力音声中
に含まれる無声音の比率を判定する基準を変え、その基
準に従って入力音声中の無声音の比率を判定することを
特徴とするものである。
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention relates to a voiced / unvoiced sound determination device capable of accurately and easily determining the degree of voiced sound and unvoiced sound contained in an input voice signal. A criterion for determining the ratio of unvoiced sounds included in the input voice is changed according to the magnitude of the amplitude, and the ratio of unvoiced sounds in the input voice is determined according to the criterion.

【0012】つぎに実施形態例について図1ないし図3
を参照して説明する。ここで図1は本発明に係わる有声
音/無声音判定装置のブロック図であり、図2は有声音
/無声音判別器への入力である零交差数と、その出力で
ある無声音/入力音声比率UR(Unvoiced Sound Rati
o)との関係を示す図である。また、図3は振幅平均測
定器からの平均振幅値により有声音/無声音判別器内の
閾値が変更された場合の零交差数と無声音/入力音声比
率URとの関係を示す図である。
Next, an embodiment will be described with reference to FIGS.
This will be described with reference to FIG. Here, FIG. 1 is a block diagram of a voiced / unvoiced sound discriminating apparatus according to the present invention, and FIG. 2 is a diagram showing an input to a voiced / unvoiced discriminator and the number of zero crossings, and an output of the voiced / input voice ratio UR. (Unvoiced Sound Rati
It is a figure which shows the relationship with o). FIG. 3 is a diagram showing the relationship between the number of zero crossings and the unvoiced sound / input voice ratio UR when the threshold value in the voiced / unvoiced sound discriminator is changed based on the average amplitude value from the amplitude average measuring device.

【0013】まず、有声音/無声音判定装置1は図1に
示すように、音声を入力するマイク11と、マイク11
からの信号を所定の振幅に増幅する増幅器12と、増幅
器12からの信号をデジタル信号に変換するA/D変換
器13と、デジタル化された信号から低域の信号を抽出
する低域通過フィルタ14と、抽出された低域信号の所
定の時間間隔における振幅の平均を測定する振幅平均測
定器15と、デジタル化された信号の零レベルとの交差
数、即ち零交差数を測定する零交差数計数器16と、振
幅平均測定器15と零交差数計数器16との出力に基づ
いて有声音と無声音とを判別し、入力音声に対する無声
音の比率を定める有声音/無声音判別器17とから構成
されている。
First, as shown in FIG. 1, a voiced / unvoiced sound judging device 1 includes a microphone 11 for inputting a voice, a microphone 11
12, an A / D converter 13 for converting a signal from the amplifier 12 into a digital signal, and a low-pass filter for extracting a low-frequency signal from the digitized signal. 14; an amplitude average measuring device 15 for measuring the average of the amplitude of the extracted low-frequency signal in a predetermined time interval; and a zero crossing for measuring the number of crossings of the digitized signal with the zero level, that is, the number of zero crossings. A voiced / unvoiced sound discriminator 17 that discriminates between voiced sounds and unvoiced sounds based on the outputs of the number counter 16 and the amplitude average measuring device 15 and the zero-crossing number counter 16, and determines the ratio of unvoiced sound to input sound. It is configured.

【0014】つぎに上述した構成の有声音/無声音判定
装置1の動作について説明する。マイク11から入力さ
れた有声音と無声音とを含んだ音声は増幅器12で増幅
された後、A/D変換器13を経てデジタル信号に変換
される。A/D変換器13から出力された信号は2つに
分岐され、その一方は低域通過フィルタ14に送られ、
他の一方は零交差数計数器16に送られる。
Next, the operation of the voiced / unvoiced sound discriminating apparatus 1 having the above configuration will be described. The voice including the voiced sound and the unvoiced sound input from the microphone 11 is amplified by the amplifier 12 and then converted to a digital signal via the A / D converter 13. The signal output from the A / D converter 13 is split into two, one of which is sent to a low-pass filter 14,
The other is sent to the zero-crossing counter 16.

【0015】低域通過フィルタ14に送られた信号は、
その信号中に含まれる無声音成分である高周波数成分が
除去され、その結果、音声中の有声音成分である低周波
数成分のみの信号が出力されることになる。つぎにこの
低域通過フィルタ14からの信号は振幅平均測定器15
に入力され、所定時間当たりの平均振幅値が測定され
る。平均振幅値は入力音声の性質上、それが有声音に近
い場合は大きな値となり、一方、無声音や無音に近い場
合は小さな値となるものである。
The signal sent to the low-pass filter 14 is
The high-frequency component that is the unvoiced sound component included in the signal is removed, and as a result, only the low-frequency component that is the voiced sound component in the voice is output. Next, the signal from the low-pass filter 14 is applied to an amplitude average
And an average amplitude value per predetermined time is measured. Due to the nature of the input voice, the average amplitude value is a large value when the input voice is close to a voiced sound, and is a small value when it is unvoiced or close to no voice.

【0016】また、A/D変換器13から零交差数計数
器16に送られた信号は、ここでその信号に含まれる所
定時間当たりの零交差数が計数される。尚、零交差とは
信号が正から負へ、および負から正へと零レベルを交差
することであって、所定時間内におけるこれらの交差回
数を計数して、零交差数としている。従って、入力音声
中に無声音成分が多い場合、即ち高周波のノイズ成分が
多い場合、信号が零レベルと交差する回数は多くなり、
それらの計数である零交差数は大きな値をとり、一方、
入力音声中に有声音成分が多い場合、即ち低周波成分が
多い場合、信号が零レベルと交差する回数は少なくな
り、零交差数は小さな値となる。
In the signal sent from the A / D converter 13 to the zero-crossing number counter 16, the number of zero-crossings per predetermined time included in the signal is counted. The zero crossing means that a signal crosses a zero level from positive to negative and from negative to positive, and the number of these crossings within a predetermined time is counted and defined as the number of zero crossings. Therefore, when there are many unvoiced sound components in the input voice, that is, when there are many high-frequency noise components, the number of times the signal crosses the zero level increases,
Their count, the number of zero crossings, takes a large value, while
When there are many voiced sound components in the input voice, that is, when there are many low-frequency components, the number of times the signal crosses the zero level decreases, and the number of zero crossings becomes a small value.

【0017】求められた零交差数は有声音/無声音判別
器17に入力される。有声音/無声音判別器17の内部
には零交差数の閾値n1 、n2 (n1 <n2 )が設定さ
れていて、この閾値n1 、n2 と入力された零交差数と
が比較され、有声音または無声音の判別が行われる。そ
の判別結果として入力音声中における無声音の比率(U
R)が出力される。尚、比率(UR)は0.0〜1.0
の値をとる。また、閾値n1 、n2 は入力音声信号の平
均振幅値に応じて適宜変更され、後段で詳しく説明する
ように本発明の特徴を形成している。
The calculated number of zero crossings is input to a voiced / unvoiced sound discriminator 17. Thresholds n1 and n2 (n1 <n2) of the number of zero-crossings are set in the voiced / unvoiced sound discriminator 17, and the thresholds n1 and n2 are compared with the input number of zero-crossings to determine whether the voiced sound or unvoiced sound is present. An unvoiced sound is determined. As a result of the determination, the ratio of unvoiced sound (U
R) is output. The ratio (UR) is 0.0 to 1.0.
Take the value of Further, the threshold values n1 and n2 are appropriately changed according to the average amplitude value of the input audio signal, and form a feature of the present invention as will be described later in detail.

【0018】つぎに、有声音と無声音の判別について説
明する。図2に示す横軸を零交差数、縦軸を比率(U
R)としたグラフを有声音と無声音の判別に用いる。実
際にこのグラフは有声音/無声音判別器17内にテーブ
ルとして格納されている。
Next, the discrimination between a voiced sound and an unvoiced sound will be described. The horizontal axis shown in FIG. 2 is the number of zero crossings, and the vertical axis is the ratio (U
The graph R) is used for discriminating voiced sounds and unvoiced sounds. Actually, this graph is stored in the voiced / unvoiced sound discriminator 17 as a table.

【0019】図2に示すグラフにおいて、入力音声が完
全な有声音から零交差数=n1 までを有声音領域、零交
差数=n1 からn2 までを中間領域、零交差数=n2 か
ら完全な無声音までを無声音領域とする。
In the graph shown in FIG. 2, the input voice is a voiced sound region from a completely voiced sound to the number of zero crossings = n1, an intermediate region from the number of zero crossings = n1 to n2, a complete unvoiced sound from the number of zero crossings = n2. Up to the unvoiced sound area.

【0020】さて、入力音声の零交差数が有声音領域に
ある場合、比率(UR)=0.0が有声音/無声音判別
器17から出力され、入力音声は有声音のみであると判
断される。また、入力音声の零交差数が無声音領域にあ
る場合、比率(UR)=1.0が出力され、入力音声は
無声音のみであると判断される。また、入力音声の零交
差数が中間領域にある場合、比率(UR)は0.0から
1.0の間の値をとり、入力音声は有声音と無声音とを
含んでいると判断される。このとき比率(UR)は入力
音声とこれに含まれる無声音との比で与えられ、例えば
図2において零交差数=ni のとき比率(UR)=0.
45が有声音/無声音判別器17から出力されたとする
と、入力音声のうち45%が無声音であり、残りの55
%が有声音であると判断される。
When the number of zero crossings of the input voice is in the voiced sound area, the ratio (UR) = 0.0 is output from the voiced / unvoiced sound discriminator 17, and it is determined that the input voice is only the voiced sound. You. When the number of zero crossings of the input voice is in the unvoiced sound area, the ratio (UR) = 1.0 is output, and it is determined that the input voice is only the unvoiced sound. When the number of zero crossings of the input voice is in the intermediate region, the ratio (UR) takes a value between 0.0 and 1.0, and it is determined that the input voice includes voiced and unvoiced sounds. . At this time, the ratio (UR) is given by the ratio between the input voice and the unvoiced sound included in the input voice. For example, in FIG. 2, when the number of zero crossings = ni, the ratio (UR) = 0.
Assuming that 45 is output from the voiced / unvoiced sound discriminator 17, 45% of the input voice is unvoiced and the remaining 55
% Is determined to be voiced.

【0021】また、本発明の有声音/無声音判定装置1
には振幅平均測定器15が設けられていて、入力される
音声の所定時間間隔の振幅の平均が測定される。この振
幅平均測定器15から出力される入力音声信号の平均振
幅値は有声音/無声音判別器17に入力され、上述した
閾値n1 、n2 を平均振幅値の大きさに応じて適宜変更
するものである。これは平均振幅値が大きくなった場
合、入力音声には有声音が多くなり、一方、小さくなっ
た場合は無声音が多くなるからである。
The voiced / unvoiced sound determination device 1 of the present invention
Is provided with an amplitude average measuring device 15 for measuring an average of amplitudes of input voices at predetermined time intervals. The average amplitude value of the input voice signal output from the amplitude average measuring device 15 is input to the voiced / unvoiced sound discriminator 17, and the above-mentioned threshold values n1 and n2 are appropriately changed according to the magnitude of the average amplitude value. is there. This is because when the average amplitude value increases, voiced sounds increase in the input voice, and when the average amplitude value decreases, unvoiced sounds increase.

【0022】図3は平均振幅値が大きくなって、閾値n
1 、n2 をn1a、n2aに零交差数が増大する方向に変更
した場合のグラフである。この場合、入力音声が完全な
有声音から零交差数=n1aまでが有声音領域、零交差数
=n1aからn2aまでが中間領域、零交差数=n2aから完
全な無声音までが無声音領域となる。従って、入力音声
が完全な有声音から零交差数=n1aの場合、比率(U
R)=0.0が出力されて入力音声は有声音のみである
と判断し、また、入力音声が零交差数=n2aから完全な
無声音までは比率(UR)=1.0が出力されて入力音
声は無声音のみであると判断される。
FIG. 3 shows that the average amplitude value increases and the threshold value n
1 is a graph when n2 is changed to n1a and n2a in a direction in which the number of zero crossings increases. In this case, the input voice is a voiced sound area from a completely voiced sound to the number of zero crossings = n1a, an intermediate area from the number of zero crossings = n1a to n2a, and an unvoiced sound area from the number of zero crossings = n2a to a completely unvoiced sound. Therefore, if the input voice is a completely voiced sound and the number of zero crossings = n1a, the ratio (U
R) = 0.0 is output and it is determined that the input voice is only a voiced sound, and the ratio (UR) = 1.0 is output from the input voice from the number of zero crossings = n2a to the complete unvoiced sound. It is determined that the input voice is only unvoiced sound.

【0023】さて、ここで零交差数=ni の場合につい
て見てみると、図2ではn1 <ni<n2 であって、こ
のときの入力音声は有声音と無声音が所定の割合で存在
すると判別されるが、一方、図3ではni <n1a<n2a
であって、同じni であってもこの場合は入力音声は有
声音のみと判別される。逆に閾値がn1a<n2a<niで
あれば入力音声は無声音のみと判別されることになる。
Now, looking at the case where the number of zero crossings = ni, FIG. 2 shows that n1 <ni <n2, and the input speech at this time is determined to have a voiced sound and an unvoiced sound at a predetermined ratio. On the other hand, in FIG. 3, ni <n1a <n2a
In this case, even in the case of the same ni, the input voice is determined to be only a voiced sound. Conversely, if the threshold value is n1a <n2a <ni, the input voice is determined to be only unvoiced sound.

【0024】上述したように平均振幅値に応じて閾値n
1 、n2 を変更することで、零交差数だけでは無声音と
も判断されてしまうような音声、例えば低周波の周期成
分に高周波のノイズ成分が載った有声子音等の音声に対
しても精度よく有声音と判断することが可能となる。
As described above, the threshold value n depends on the average amplitude value.
By changing 1 and n2, voices that can be judged to be unvoiced only by the number of zero crossings, such as voiced consonants with high-frequency noise components added to low-frequency periodic components, are accurately obtained. It is possible to judge the voice sound.

【0025】上述したようにして判断された比率(U
R)は、入力された音声信号の各種処理に供される。例
えば入力音声のうち有声音を意味あるデータとして扱う
場合、無声音は不要であり、この無声音部分を得られた
比率(UR)に基づいて処理から除外することによっ
て、処理時間の短縮化、効率化がなされ、また、装置の
負担を軽減し、装置の軽量化に貢献するものである。
The ratio (U) determined as described above
R) is used for various processes of the input audio signal. For example, when voiced sound is treated as meaningful data in the input voice, unvoiced sound is unnecessary, and the unvoiced sound portion is excluded from processing based on the obtained ratio (UR), thereby shortening processing time and increasing efficiency. It also reduces the load on the device and contributes to the weight reduction of the device.

【0026】尚、零交差数によるURの判別のためのテ
ーブルは上述した2つに限ることなく、それ以上のテー
ブルを用いてもよい。さらに、入力音声の振幅に応じて
テーブルの閾値を自動的に連続して変化させ、それに基
づき零交差数との関係から比率(UR)を決定してもよ
い。
The table for determining the UR based on the number of zero-crossings is not limited to the above-mentioned two tables, but may be a larger table. Further, the threshold of the table may be automatically and continuously changed according to the amplitude of the input voice, and the ratio (UR) may be determined based on the threshold based on the relationship with the number of zero crossings.

【0027】また、本発明は上述した実施形態例の構成
に限ることなく、本発明の技術的思想を具現化するいず
れの構成であってもよいことは当然である。
The present invention is not limited to the configuration of the above-described embodiment, but may be any configuration that embodies the technical idea of the present invention.

【0028】[0028]

【発明の効果】以上詳細に説明したように、本発明の有
声音/無声音判定装置によれば、周期性の有る有声音に
ノイズ成分が付加されたような音声、例えば有声子音な
どの音声を無声音と誤判定することなく、精度よく有声
音/無声音の判定を簡便な回路で短時間に行うことがで
きる。
As described above in detail, according to the voiced / unvoiced sound judging device of the present invention, a voice having a noise component added to a voiced sound having a periodicity, for example, a voice such as a voiced consonant. The voiced / unvoiced sound can be accurately determined in a short time with a simple circuit without erroneously determining the unvoiced sound.

【図面の簡単な説明】[Brief description of the drawings]

【図1】 本発明の実施形態例における有声音/無声音
判定装置のブロック図である。
FIG. 1 is a block diagram of a voiced / unvoiced sound determination device according to an embodiment of the present invention.

【図2】 有声音/無声音判別器への入力である零交差
数と出力である無声音/入力音声比率URとの関係を示
す図である。
FIG. 2 is a diagram illustrating a relationship between the number of zero crossings as an input to a voiced / unvoiced sound discriminator and an unvoiced sound / input voice ratio UR as an output.

【図3】 振幅平均測定器からの平均振幅値により有声
音/無声音判別器内の閾値が変更された場合の零交差数
と無声音/入力音声比率URとの関係を示す図である。
FIG. 3 is a diagram showing the relationship between the number of zero-crossings and the unvoiced sound / input voice ratio UR when the threshold in the voiced / unvoiced sound discriminator is changed based on the average amplitude value from the amplitude average measuring device.

【符号の説明】[Explanation of symbols]

1…有声音/無声音判定装置、11…マイク、12…増
幅器、13…A/D変換器、14…低域通過フィルタ、
15…振幅平均測定器、16…零交差数計数器、17…
有声音/無声音判別器
DESCRIPTION OF SYMBOLS 1 ... Voiced / unvoiced sound determination apparatus, 11 ... Microphone, 12 ... Amplifier, 13 ... A / D converter, 14 ... Low-pass filter,
15: Amplitude average measuring instrument, 16: Zero-crossing counter, 17 ...
Voiced / unvoiced sound discriminator

Claims (2)

【特許請求の範囲】[Claims] 【請求項1】 入力された音声信号の零交差数を計数す
る計数手段と、 前記入力された音声信号を低域通過フィルタを通過させ
た後、通過した音声信号の振幅の平均値を、所定時間間
隔で測定する測定手段と、 前記振幅の平均値と零交差数とにより有声音と無声音と
を判定する判定手段とを具備したことを特徴とする有声
音/無声音判定装置。
1. A counting means for counting the number of zero crossings of an input audio signal, and after passing the input audio signal through a low-pass filter, an average value of the amplitude of the passed audio signal is determined by a predetermined value. A voiced / unvoiced sound determination device, comprising: a measuring means for measuring at time intervals; and a determining means for determining a voiced sound or an unvoiced sound based on the average value of the amplitude and the number of zero crossings.
【請求項2】 前記判定手段内に設定される有声音と無
声音との判定基準は、前記平均値によって自動的に所定
の値に設定される構成であることを特徴とする、請求項
1に記載の有声音/無声音判定装置。
2. The apparatus according to claim 1, wherein a criterion for determining whether a voiced sound or an unvoiced sound is set in said determining means is automatically set to a predetermined value based on said average value. A voiced / unvoiced sound determination device as described in the above.
JP17338198A 1998-06-19 1998-06-19 Voiced sound/voiceless sound judging device Pending JP2000010577A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP17338198A JP2000010577A (en) 1998-06-19 1998-06-19 Voiced sound/voiceless sound judging device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP17338198A JP2000010577A (en) 1998-06-19 1998-06-19 Voiced sound/voiceless sound judging device

Publications (1)

Publication Number Publication Date
JP2000010577A true JP2000010577A (en) 2000-01-14

Family

ID=15959352

Family Applications (1)

Application Number Title Priority Date Filing Date
JP17338198A Pending JP2000010577A (en) 1998-06-19 1998-06-19 Voiced sound/voiceless sound judging device

Country Status (1)

Country Link
JP (1) JP2000010577A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7472059B2 (en) 2000-12-08 2008-12-30 Qualcomm Incorporated Method and apparatus for robust speech classification
JP2014500676A (en) * 2010-12-08 2014-01-09 ヴェーデクス・アクティーセルスカプ Hearing aid and sound reproduction enhancement method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7472059B2 (en) 2000-12-08 2008-12-30 Qualcomm Incorporated Method and apparatus for robust speech classification
JP2010176145A (en) * 2000-12-08 2010-08-12 Qualcomm Inc Method and device for robust voice classification
JP2014500676A (en) * 2010-12-08 2014-01-09 ヴェーデクス・アクティーセルスカプ Hearing aid and sound reproduction enhancement method

Similar Documents

Publication Publication Date Title
US5197113A (en) Method of and arrangement for distinguishing between voiced and unvoiced speech elements
US9454976B2 (en) Efficient discrimination of voiced and unvoiced sounds
JPH0121519B2 (en)
JPH0251303B2 (en)
JP2000010577A (en) Voiced sound/voiceless sound judging device
US8242836B2 (en) Acoustic characteristic control apparatus
JP3114757B2 (en) Voice recognition device
KR100345402B1 (en) An apparatus and method for real - time speech detection using pitch information
JP2666296B2 (en) Voice recognition device
JPH05100661A (en) Measure border time extraction device
JP2557497B2 (en) How to identify male and female voices
KR100539176B1 (en) Device and method of extracting musical feature
JP4360527B2 (en) Pitch detection method
KR20040082756A (en) Method for Speech Detection Using Removing Noise
JP2009175473A (en) Sound processing device and program
JP2951333B2 (en) Audio signal section discrimination method
JPS6028698A (en) Sound-soundless detector
JP5169297B2 (en) Sound processing apparatus and program
JPS637400B2 (en)
JPH0573035B2 (en)
JPH06110492A (en) Speech recognition device
JP2599974B2 (en) Voice detection method
Kim et al. A study on pitch detection using the local peak and valley for Korean speech recognition
KR100322203B1 (en) Device and method for recognizing sound in car
JPH0383100A (en) Detector for voice section