JP2000010577A

JP2000010577A - Voiced sound/voiceless sound judging device

Info

Publication number: JP2000010577A
Application number: JP17338198A
Authority: JP
Inventors: Kazuki Sakai; 和樹酒井
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1998-06-19
Filing date: 1998-06-19
Publication date: 2000-01-14

Abstract

PROBLEM TO BE SOLVED: To provide a voiced sound/voiceless sound judging device that can quickly and correctly judge whether a sound is voiced or voiceless and that can be built with a simple circuit. SOLUTION: This voiced sound/voiceless sound judging device comprises a microphone 11 for inputting a voice, an amplifier 12 for amplifying the signal from the microphone 11 to a prescribed amplitude, an A/D converter 13 for converting the signal from the amplifier 12 to a digital signal, a low-pass filter 14 for extracting the low-band signal from the digitized signal, an amplitude average measuring instrument 15 for measuring the average amplitude of the extracted low-band signal over a prescribed time interval, a zero-crossing counter 16 for counting the number of zero crossings of the digitized signal, and a voiced sound/voiceless sound discriminator 17 for discriminating voiced sound from voiceless sound based on the outputs of the amplitude average measuring instrument 15 and the zero-crossing counter 16 and for deciding the ratio of voiceless sound in the input voice.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voiced/unvoiced sound determination device intended to determine, accurately and simply, the ratio of voiced sound to unvoiced sound contained in a voice input.

[0002]

2. Description of the Related Art The following voiced/unvoiced sound determination devices are conventionally known.

[0003] The first is a voiced/unvoiced sound determination device that converts voice input from a microphone into a digital signal via an A/D converter and measures, at fixed time intervals, the number of times the amplitude level of the signal crosses the zero level, that is, the number of zero crossings. Given a first threshold and a second threshold set inside the device (first threshold < second threshold), the device determines that the input is voiced sound when the number of zero crossings falls below the first threshold, that it is unvoiced sound when the number exceeds the second threshold, and that voiced and unvoiced sounds are mixed when the number lies between the two thresholds.
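As an illustration of this first conventional scheme, the three-way decision could be sketched in Python as follows. The function name and the threshold values used in the checks are hypothetical; the source gives no concrete numbers, and the handling of a count exactly equal to a threshold is an assumption.

```python
def classify_by_zero_crossings(zero_crossings, first_threshold, second_threshold):
    """Three-way decision of the first conventional device (illustrative sketch).

    The text requires first_threshold < second_threshold. A count that is
    exactly equal to either threshold is treated as "mixed" here, which is
    an assumption: the text only covers "below" and "above".
    """
    if not first_threshold < second_threshold:
        raise ValueError("first_threshold must be smaller than second_threshold")
    if zero_crossings < first_threshold:
        return "voiced"
    if zero_crossings > second_threshold:
        return "unvoiced"
    return "mixed"
```

Because the decision depends on the zero-crossing count alone, any noise that inflates the count pushes the result toward "unvoiced", which is the weakness discussed in the next paragraph.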

[0004] This device has the advantage that it can be built from a relatively simple circuit. However, for a voice in which a noise component is superimposed on a periodic voiced sound, such as a voiced consonant, the noise component near the zero amplitude level strongly affects the count and inflates the number of zero crossings, so the voice is sometimes erroneously determined to be unvoiced sound.

[0005] The second converts voice input from a microphone into a digital signal via an A/D converter and calculates an autocorrelation function at fixed time intervals. The autocorrelation function indicates the degree of periodicity of the voice: if a clear peak exists in the autocorrelation function value, the input is determined to be voiced sound with strong periodicity, whereas if no clear peak exists, it is determined to be unvoiced sound with poor periodicity.
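A minimal Python sketch of such an autocorrelation-based decision is given below. It is not taken from the source: the lag range, the `peak_ratio` value of 0.5, and the rule "peak relative to the lag-0 energy" are all illustrative assumptions about what "a clear peak" might mean.

```python
def autocorrelation(frame, max_lag):
    """Raw autocorrelation r[lag] for lag = 0..max_lag.

    Each lag needs roughly len(frame) multiply-accumulate operations,
    so the total cost grows with len(frame) * max_lag.
    """
    r = []
    for lag in range(max_lag + 1):
        acc = 0.0
        for i in range(len(frame) - lag):
            acc += frame[i] * frame[i + lag]
        r.append(acc)
    return r


def has_clear_peak(frame, min_lag, max_lag, peak_ratio=0.5):
    """Return True when some lag in [min_lag, max_lag] has an autocorrelation
    value of at least peak_ratio times the lag-0 value (hypothetical criterion)."""
    r = autocorrelation(frame, max_lag)
    if r[0] <= 0.0:
        return False  # silent frame: no energy, no periodicity
    return max(r[min_lag:max_lag + 1]) / r[0] >= peak_ratio
```

The nested loop makes the cost visible: every frame requires a multiply-accumulate for each sample at each lag, which is the burden the following paragraph refers to.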

[0006] Because this device bases its determination on the periodicity of the voice, it can determine voiced/unvoiced sound accurately. However, calculating the autocorrelation function requires a large number of multiply-accumulate operations, which complicates the circuit configuration and also makes the computation time-consuming.

[0007]

SUMMARY OF THE INVENTION Accordingly, an object of the present invention is to provide a voiced/unvoiced sound determination device that can determine voiced and unvoiced sounds quickly and accurately and that can be built from a simple circuit.

[0008]

SUMMARY OF THE INVENTION The present invention has been made in view of the above problem and provides a voiced/unvoiced sound determination device comprising: counting means for counting the number of zero crossings of an input audio signal; measuring means for passing the input audio signal through a low-pass filter and then measuring, at predetermined time intervals, the average value of the amplitude of the passed audio signal; and determining means for determining voiced sound and unvoiced sound based on the average value of the amplitude and the number of zero crossings.

[0009] The above problem is further solved by configuring the voiced/unvoiced sound determination device so that the criterion for determining voiced and unvoiced sounds set in the determining means is automatically set to a predetermined value according to the average value.

[0010] A criterion for determining the ratio of unvoiced sound is set according to the amplitude of the input audio signal, and the ratio of unvoiced sound in the input voice is obtained from the number of zero crossings of the input voice and that criterion.

[0011]

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention relates to a voiced/unvoiced sound determination device capable of accurately and simply determining the degree of voiced and unvoiced sound contained in an input audio signal. It is characterized in that the criterion for determining the ratio of unvoiced sound contained in the input voice is changed according to the magnitude of the amplitude of the input audio signal, and the ratio of unvoiced sound in the input voice is determined according to that criterion.

[0012] Next, an embodiment will be described with reference to FIGS. 1 to 3. FIG. 1 is a block diagram of the voiced/unvoiced sound determination device according to the present invention. FIG. 2 is a diagram showing the relationship between the number of zero crossings, which is the input to the voiced/unvoiced sound discriminator, and the unvoiced sound/input voice ratio UR (Unvoiced Sound Ratio), which is its output. FIG. 3 is a diagram showing the relationship between the number of zero crossings and the unvoiced sound/input voice ratio UR when the thresholds in the voiced/unvoiced sound discriminator are changed according to the average amplitude value from the amplitude average measuring device.

[0013] As shown in FIG. 1, the voiced/unvoiced sound determination device 1 comprises: a microphone 11 for inputting voice; an amplifier 12 for amplifying the signal from the microphone 11 to a predetermined amplitude; an A/D converter 13 for converting the signal from the amplifier 12 into a digital signal; a low-pass filter 14 for extracting a low-frequency signal from the digitized signal; an amplitude average measuring device 15 for measuring the average amplitude of the extracted low-frequency signal over a predetermined time interval; a zero-crossing counter 16 for measuring the number of times the digitized signal crosses the zero level, that is, the number of zero crossings; and a voiced/unvoiced sound discriminator 17 that discriminates between voiced and unvoiced sounds based on the outputs of the amplitude average measuring device 15 and the zero-crossing counter 16 and determines the ratio of unvoiced sound to the input voice.

[0014] Next, the operation of the voiced/unvoiced sound determination device 1 configured as above will be described. Voice containing voiced and unvoiced sounds input from the microphone 11 is amplified by the amplifier 12 and then converted into a digital signal by the A/D converter 13. The signal output from the A/D converter 13 is split into two branches, one of which is sent to the low-pass filter 14 and the other to the zero-crossing counter 16.

[0015] In the signal sent to the low-pass filter 14, the high-frequency components, which are the unvoiced sound components contained in the signal, are removed, so that a signal consisting only of the low-frequency components, which are the voiced sound components of the voice, is output. The signal from the low-pass filter 14 is then input to the amplitude average measuring device 15, where the average amplitude value per predetermined time is measured. Owing to the nature of the input voice, the average amplitude value is large when the voice is close to voiced sound and small when it is unvoiced sound or close to silence.
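The low-pass and averaging stages described above could be sketched in Python as follows. The source does not specify the filter type or the averaging formula, so the moving-average filter, its `taps` parameter, and the use of the mean absolute value are assumptions chosen for simplicity.

```python
def moving_average_lowpass(frame, taps=4):
    """Crude low-pass filter: running mean over the last `taps` samples.

    A moving average stands in for low-pass filter 14 here (assumption);
    the first taps-1 outputs are computed as if earlier samples were zero.
    """
    out = []
    acc = 0.0
    for i, x in enumerate(frame):
        acc += x
        if i >= taps:
            acc -= frame[i - taps]  # drop the sample that left the window
        out.append(acc / taps)
    return out


def mean_amplitude(frame):
    """Average absolute amplitude over one frame (the 'predetermined time')."""
    if not frame:
        return 0.0
    return sum(abs(x) for x in frame) / len(frame)
```

With these definitions, a slowly varying frame keeps most of its amplitude after filtering, while a rapidly alternating frame is almost cancelled, which matches the large/small behaviour of the average amplitude value described in the paragraph.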

[0016] Meanwhile, for the signal sent from the A/D converter 13 to the zero-crossing counter 16, the number of zero crossings per predetermined time contained in the signal is counted. A zero crossing is the signal crossing the zero level from positive to negative or from negative to positive, and the number of such crossings within the predetermined time is taken as the number of zero crossings. Accordingly, when the input voice contains many unvoiced sound components, that is, many high-frequency noise components, the signal crosses the zero level more often and the number of zero crossings takes a large value. Conversely, when the input voice contains many voiced sound components, that is, many low-frequency components, the signal crosses the zero level less often and the number of zero crossings takes a small value.
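The counting rule described here (sign changes in either direction within one frame) can be sketched in Python as below. How samples that are exactly zero should be handled is not stated in the source; skipping them is an assumption of this sketch.

```python
def count_zero_crossings(frame):
    """Count positive->negative and negative->positive transitions in a frame.

    Samples equal to zero are skipped (assumption), so the sequence
    1, 0, -1 counts as a single crossing.
    """
    count = 0
    prev_sign = 0
    for sample in frame:
        sign = (sample > 0) - (sample < 0)  # +1, 0 or -1
        if sign == 0:
            continue
        if prev_sign != 0 and sign != prev_sign:
            count += 1
        prev_sign = sign
    return count
```

A frame dominated by a low-frequency component changes sign rarely and yields a small count, whereas a noisy, high-frequency frame yields a large one.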

[0017] The number of zero crossings thus obtained is input to the voiced/unvoiced sound discriminator 17. Thresholds n1 and n2 (n1 < n2) for the number of zero crossings are set inside the voiced/unvoiced sound discriminator 17, and the input number of zero crossings is compared with these thresholds to discriminate between voiced and unvoiced sound. As the result of the discrimination, the ratio of unvoiced sound in the input voice (UR) is output. The ratio (UR) takes a value from 0.0 to 1.0. The thresholds n1 and n2 are changed as appropriate according to the average amplitude value of the input audio signal, which, as described in detail later, constitutes the feature of the present invention.

[0018] Next, the discrimination between voiced and unvoiced sound will be described. The graph shown in FIG. 2, whose horizontal axis is the number of zero crossings and whose vertical axis is the ratio (UR), is used for this discrimination. In practice, this graph is stored as a table in the voiced/unvoiced sound discriminator 17.

[0019] In the graph shown in FIG. 2, the range from a completely voiced input up to a zero-crossing count of n1 is the voiced sound region, the range from n1 to n2 is the intermediate region, and the range from n2 up to a completely unvoiced input is the unvoiced sound region.

[0020] When the number of zero crossings of the input voice is in the voiced sound region, the voiced/unvoiced sound discriminator 17 outputs a ratio (UR) of 0.0, and the input voice is determined to consist of voiced sound only. When the number of zero crossings is in the unvoiced sound region, a ratio (UR) of 1.0 is output, and the input voice is determined to consist of unvoiced sound only. When the number of zero crossings is in the intermediate region, the ratio (UR) takes a value between 0.0 and 1.0, and the input voice is determined to contain both voiced and unvoiced sounds. In this case the ratio (UR) is given as the ratio of the unvoiced sound contained in the input voice to the input voice. For example, if the voiced/unvoiced sound discriminator 17 outputs a ratio (UR) of 0.45 when the number of zero crossings is ni in FIG. 2, then 45% of the input voice is determined to be unvoiced sound and the remaining 55% voiced sound.
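The mapping from zero-crossing count to UR described for FIG. 2 could be sketched in Python as follows. The source states only that the relationship is stored as a table and that UR lies between 0.0 and 1.0 in the intermediate region; the straight-line interpolation between n1 and n2, and the example threshold values, are assumptions of this sketch.

```python
def unvoiced_ratio(zero_crossings, n1, n2):
    """Map a zero-crossing count to the unvoiced sound ratio UR in [0.0, 1.0].

    Voiced region  (count <= n1): UR = 0.0
    Unvoiced region (count >= n2): UR = 1.0
    Intermediate region: linear interpolation (assumed shape of the curve).
    """
    if not n1 < n2:
        raise ValueError("n1 must be smaller than n2")
    if zero_crossings <= n1:
        return 0.0
    if zero_crossings >= n2:
        return 1.0
    return (zero_crossings - n1) / (n2 - n1)
```

With hypothetical thresholds n1 = 20 and n2 = 40, a count of 29 falls in the intermediate region and this sketch returns a UR of 0.45, the same kind of value as in the example for ni above.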

[0021] The voiced/unvoiced sound determination device 1 of the present invention is also provided with the amplitude average measuring device 15, which measures the average amplitude of the input voice over a predetermined time interval. The average amplitude value of the input audio signal output from the amplitude average measuring device 15 is input to the voiced/unvoiced sound discriminator 17, and the thresholds n1 and n2 described above are changed as appropriate according to the magnitude of the average amplitude value. This is because the input voice contains more voiced sound when the average amplitude value is large and more unvoiced sound when it is small.

[0022] FIG. 3 is the graph obtained when the average amplitude value has increased and the thresholds n1 and n2 have been changed to n1a and n2a, in the direction of a larger number of zero crossings. In this case, the range from a completely voiced input up to a zero-crossing count of n1a is the voiced sound region, the range from n1a to n2a is the intermediate region, and the range from n2a up to a completely unvoiced input is the unvoiced sound region. Accordingly, from a completely voiced input up to a zero-crossing count of n1a, a ratio (UR) of 0.0 is output and the input voice is determined to consist of voiced sound only, and from a zero-crossing count of n2a up to a completely unvoiced input, a ratio (UR) of 1.0 is output and the input voice is determined to consist of unvoiced sound only.

[0023] Consider now the case where the number of zero crossings is ni. In FIG. 2, n1 < ni < n2, so the input voice is determined to contain voiced and unvoiced sounds in a certain proportion. In FIG. 3, on the other hand, ni < n1a < n2a, so for the same ni the input voice is determined to consist of voiced sound only. Conversely, if the thresholds satisfy n1a < n2a < ni, the input voice is determined to consist of unvoiced sound only.
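The effect of switching between the FIG. 2 and FIG. 3 thresholds can be illustrated with the Python sketch below. The numeric thresholds, the amplitude level at which the switch occurs, and the linear shape of the intermediate region are all hypothetical; the source gives no numbers.

```python
NORMAL_THRESHOLDS = (20, 40)  # hypothetical (n1, n2) of FIG. 2
RAISED_THRESHOLDS = (35, 55)  # hypothetical (n1a, n2a) of FIG. 3


def select_thresholds(mean_amplitude, amplitude_level,
                      normal=NORMAL_THRESHOLDS, raised=RAISED_THRESHOLDS):
    """Pick the raised thresholds when the low-band mean amplitude is large."""
    return raised if mean_amplitude >= amplitude_level else normal


def unvoiced_ratio(zero_crossings, thresholds):
    """UR for a zero-crossing count; linear in the intermediate region (assumed)."""
    n1, n2 = thresholds
    if zero_crossings <= n1:
        return 0.0
    if zero_crossings >= n2:
        return 1.0
    return (zero_crossings - n1) / (n2 - n1)


ni = 30  # the same zero-crossing count evaluated under both threshold pairs
ur_small_amplitude = unvoiced_ratio(ni, select_thresholds(0.1, 0.5))
ur_large_amplitude = unvoiced_ratio(ni, select_thresholds(0.8, 0.5))
```

Under these assumed values, the same count ni lies in the intermediate region for the small-amplitude frame and in the voiced region for the large-amplitude frame, which is the behaviour the paragraph describes.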

[0024] By changing the thresholds n1 and n2 according to the average amplitude value as described above, even a voice that would be judged to be unvoiced sound from the number of zero crossings alone, for example a voiced consonant in which a high-frequency noise component rides on a low-frequency periodic component, can be accurately determined to be voiced sound.

[0025] The ratio (UR) determined as described above is used in various kinds of processing of the input audio signal. For example, when the voiced sound in the input voice is treated as the meaningful data, the unvoiced sound is unnecessary. Excluding the unvoiced portions from processing on the basis of the obtained ratio (UR) shortens the processing time and improves efficiency, and also reduces the load on the device and contributes to making it lighter.
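One way the ratio (UR) might be used to exclude unvoiced portions is sketched below in Python. The per-frame structure and the cutoff value `max_unvoiced_ratio` are hypothetical; the source does not say how the exclusion is performed.

```python
def drop_unvoiced_frames(frames, ratios, max_unvoiced_ratio=0.5):
    """Keep only frames whose UR does not exceed the cutoff (hypothetical rule).

    `frames` and `ratios` are parallel sequences: ratios[i] is the UR that
    the discriminator produced for frames[i].
    """
    if len(frames) != len(ratios):
        raise ValueError("frames and ratios must have the same length")
    return [frame for frame, ur in zip(frames, ratios) if ur <= max_unvoiced_ratio]
```

Downstream processing then runs only on the retained frames, which is where the saving in processing time would come from.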

[0026] The tables for determining UR from the number of zero crossings are not limited to the two described above; more tables may be used. Furthermore, the thresholds of the table may be varied automatically and continuously according to the amplitude of the input voice, and the ratio (UR) may be determined from the relationship between those thresholds and the number of zero crossings.
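The continuous variant mentioned here could look like the following Python sketch, in which both thresholds slide upward in proportion to the mean amplitude. The base thresholds, the proportionality constant `shift_per_unit`, and the clamp `max_shift` are hypothetical; the source does not define the functional form.

```python
def shifted_thresholds(mean_amplitude, base=(20.0, 40.0),
                       shift_per_unit=10.0, max_shift=30.0):
    """Return (n1, n2) shifted upward continuously with the mean amplitude.

    The shift is proportional to the amplitude and clamped to
    [0, max_shift]; all constants are illustrative assumptions.
    """
    shift = min(max(mean_amplitude, 0.0) * shift_per_unit, max_shift)
    n1, n2 = base
    return (n1 + shift, n2 + shift)
```

The returned pair can then be used in place of fixed thresholds when mapping a zero-crossing count to the ratio (UR).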

[0027] The present invention is, of course, not limited to the configuration of the embodiment described above and may take any configuration that embodies its technical idea.

[0028]

EFFECT OF THE INVENTION As described above in detail, the voiced/unvoiced sound determination device of the present invention can determine voiced/unvoiced sound accurately, with a simple circuit and in a short time, without erroneously determining that a voice in which a noise component is superimposed on a periodic voiced sound, such as a voiced consonant, is unvoiced sound.

[Brief description of the drawings]

FIG. 1 is a block diagram of a voiced/unvoiced sound determination device according to an embodiment of the present invention.

FIG. 2 is a diagram showing the relationship between the number of zero crossings, which is the input to the voiced/unvoiced sound discriminator, and the unvoiced sound/input voice ratio UR, which is its output.

FIG. 3 is a diagram showing the relationship between the number of zero crossings and the unvoiced sound/input voice ratio UR when the thresholds in the voiced/unvoiced sound discriminator are changed according to the average amplitude value from the amplitude average measuring device.

[Explanation of symbols]

1: voiced/unvoiced sound determination device, 11: microphone, 12: amplifier, 13: A/D converter, 14: low-pass filter, 15: amplitude average measuring device, 16: zero-crossing counter, 17: voiced/unvoiced sound discriminator

Claims

1. A voiced/unvoiced sound determination device comprising: counting means for counting the number of zero crossings of an input audio signal; measuring means for passing the input audio signal through a low-pass filter and then measuring, at predetermined time intervals, the average value of the amplitude of the passed audio signal; and determining means for determining voiced sound and unvoiced sound based on the average value of the amplitude and the number of zero crossings.

2. The voiced/unvoiced sound determination device according to claim 1, wherein the criterion for determining voiced sound and unvoiced sound set in the determining means is automatically set to a predetermined value according to the average value.