JPH1020886A - System for detecting harmonic waveform component existing in waveform data - Google Patents

System for detecting harmonic waveform component existing in waveform data

Info

Publication number
JPH1020886A
JPH1020886A JP8204031A JP20403196A JPH1020886A JP H1020886 A JPH1020886 A JP H1020886A JP 8204031 A JP8204031 A JP 8204031A JP 20403196 A JP20403196 A JP 20403196A JP H1020886 A JPH1020886 A JP H1020886A
Authority
JP
Japan
Prior art keywords
waveform
harmonic
period
component
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP8204031A
Other languages
Japanese (ja)
Inventor
Takayoshi Hirata
能睦 平田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to JP8204031A priority Critical patent/JPH1020886A/en
Publication of JPH1020886A publication Critical patent/JPH1020886A/en
Pending legal-status Critical Current

Links

Abstract

PROBLEM TO BE SOLVED: To precisely detect a pitch period of a harmonic waveform by using short waveform data and to make possible specifying the harmonic waveform by obtaining the amplitude of a sine waveform component and a cosine waveform component related to plural values from the waveform data and detecting the period which gives a maximum for the sum of their squares as the pitch period. SOLUTION: The values obtained by multiplying respectively the waveform data by a sine waveform and a cosine waveform of a period T/n and adding them ranging over a sectional length rT are used as the waveform data, and the amplitude of the sine waveform component and cosine waveform component are determined for plural values of an integer n. Then, the period that the sum of the squares of the amplitude becomes a maximum when the period T is made a variable is detected as the pitch period. Here n, r are arbitrary integers. Then, by transmitting a waveform parameter (the pitch period and amplitude information of respective harmonics) obtained from an output terminal (b) and a residual waveform (low bit waveform information) obtained from an output terminal (c), reduction (band compression) of a bit rate in transmission of a voice signal becomes possible.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【0001】[0001]

【産業上の利用分野】音声信号や音楽信号などに含まれ
る調和波形の検出方法に関するもので、ピッチ周期の検
出とオーディオ信号の帯域圧縮に利用することができ
る。
BACKGROUND OF THE INVENTION The present invention relates to a method for detecting a harmonic waveform contained in a voice signal, a music signal, or the like, and can be used for detecting a pitch period and compressing a band of an audio signal.

【0002】[0002]

【従来の技術】ピッチ周期の変動が少なく、ピッチ周期
に比べて十分長い波形データが与えられている場合、フ
ーリエ解析(DFT、FFT)や自己相関関数の測定か
らピッチ周期を検出できることは周知の事柄である。し
かしながらピッチ周期の変動が大きく、ピッチ周期と同
程度の長さの波形データを用いてその周期を検出するこ
とが必要となるような場合は、従来法は誤差が大きく実
用にならない。
2. Description of the Related Art It is well known that when a fluctuation of a pitch period is small and waveform data sufficiently longer than the pitch period is given, the pitch period can be detected from Fourier analysis (DFT, FFT) or measurement of an autocorrelation function. It is a matter. However, when the pitch period fluctuates greatly and it is necessary to detect the period using waveform data of the same length as the pitch period, the conventional method has a large error and is not practical.

【0003】[0003]

【発明が解決しようとする課題】短い波形データを用い
て高い精度で調和波形のピッチ周期を検出すること。検
出されたピッチ周期を用いて調和波形を特定すること。
An object of the present invention is to detect a pitch period of a harmonic waveform with high accuracy using short waveform data. Specifying a harmonic waveform using the detected pitch period.

【0004】[0004]

【課題を解決するための手段】上記課題を解決するため
に、本発明は、任意の整数をnおよびr、周期をTとし
て、波形データに周期T/nなるサイン波形およびコサ
イン波形をそれぞれ掛けて区間長rTにわたって加算し
て得られる値を少なくとも用いて、前記波形データから
サイン波形成分とコサイン波形成分の振幅を前記整数n
の複数の値について求め、前記振幅の二乗の和が前記周
期Tを変数としたときに極大となる周期をピッチ周期と
して検出すること、もしくは前記ピッチ周期と前記ピッ
チ周期の1/nの周期のサイン波形成分とコサイン波形
成分の振幅を用いて前記波形データに含まれた調和波形
を特定することを特徴とした、波形データに存在する調
和波形成分の検出方式をその手段とする。
In order to solve the above problems, the present invention multiplies waveform data by a sine waveform and a cosine waveform having a period T / n, where n and r are arbitrary integers and T is a period. Using at least a value obtained by adding over the section length rT, the amplitudes of the sine waveform component and the cosine waveform component are converted from the waveform data to the integer n.
, And a cycle in which the sum of the squares of the amplitude becomes a maximum when the cycle T is a variable is detected as a pitch cycle, or the pitch cycle and 1 / n of the pitch cycle are detected. A means for detecting a harmonic waveform component present in the waveform data, wherein a harmonic waveform included in the waveform data is specified using the amplitudes of the sine waveform component and the cosine waveform component.

【0005】[0005]

【作用】以下、数式を用いて本発明の作用を説明する。
波形データをW(m)(m=1,2,…,M)、任意の
整数をn(>0)、周期をTで表わすと、波形データに
周期T/nなるサイン波形およびコサイン波形をそれぞ
れ掛けて区間長rT(≦M)にわたって加算した値X
(T/n)およびY(T/n)は、
The operation of the present invention will be described below using mathematical expressions.
When the waveform data is represented by W (m) (m = 1, 2,..., M), an arbitrary integer is represented by n (> 0), and the cycle is represented by T, the waveform data has a sine waveform and a cosine waveform having a cycle T / n. The value X multiplied by each and added over the section length rT (≦ M)
(T / n) and Y (T / n)

【数1】 と表わされる。これを用いれば、サイン波形成分とコサ
イン波形成分の振幅A(T/n)とB(T/n)は、そ
れぞれ
(Equation 1) It is expressed as Using this, the amplitudes A (T / n) and B (T / n) of the sine and cosine waveform components are respectively

【数2】 で与えられる。次にnの複数の値、例えばn=1,2,
…,Kについて上記の振幅を求め、それら振幅の二乗の
和をP(T)とすれば、
(Equation 2) Given by Next, a plurality of values of n, for example, n = 1, 2, 2,
.., K, and the sum of the squares of the amplitudes is defined as P (T).

【数3】 となる。Tを変数としてP(T)が極大となるときのT
の値をTとすれば、
(Equation 3) Becomes T when P (T) is maximized using T as a variable
Is T 1 ,

【数4】 と表わされる。Tを波形データW(m)に含まれる調
和波形のピッチ周期、A(T/n)を第n高調波のサ
イン波形成分の振幅、B(T/n)を第n高調波のコ
サイン波形成分の振幅とすれば、調和波形H(m,
)は
(Equation 4) It is expressed as Pitch period of the harmonic wave included the T 1 to the waveform data W (m), A (T 1 / n) of the sine wave component of the n harmonic amplitude, B (T 1 / n) of the n-th harmonic Assuming that the amplitude of the cosine waveform component is the harmonic waveform H (m,
T 1 )

【数5】 で表わされる。(Equation 5) Is represented by

【0006】ここでW(m)が一般に次式Here, W (m) is generally expressed by the following equation.

【数6】 によって表わされる場合について、本発明の作用を説明
する。上記(数1)ないし(数3)において、T=T
+d(d<<T)とすれば、サイン波形成分とコサイ
ン波形成分の振幅は、それぞれ
(Equation 6) The operation of the present invention will be described for the case represented by In the above (Equation 1) to (Equation 3), T = T 1
+ D (d << T 1 ), the amplitudes of the sine and cosine waveform components are respectively

【数7】 となる。従って、d=0においてP(T)は極大となる
ので、T=Tがピッチ周期として求まる。このとき第
k高調波のサイン波形成分およびコサイン波形成分の振
幅は、それぞれ
(Equation 7) Becomes Therefore, P (T) at d = 0 because the maximum, T = T 1 is obtained as the pitch period. At this time, the amplitudes of the sine and cosine waveform components of the k-th harmonic are respectively

【数8】 となり、検出された調和波形H(m,T)は(Equation 8) And the detected harmonic waveform H (m, T 1 ) is

【数9】 となる。(Equation 9) Becomes

【0007】[0007]

【実施例】図1(a)は、T=120を基本周期(ピ
ッチ周期)とする5個の正弦波から成る調和波形W
(m)を示したもので、(d)はそのスペクトルであ
る。図1(b)はW(m)のm=1から512の区間を
用いて本方式による調和波形成分の検出を行ない、波形
合成し、外挿(m>512)した場合の波形を示したも
ので、(e)はそのスペクトルである。図1(c)は、
同じくW(m)のm=1から512の区間をFFTで分
析し、IFFT(逆フーリエ変換)した波形を示したも
ので、(f)はそのスペクトルである。これらの結果か
ら、FFTは調和波形を実際に構成している正弦波を検
出するものでないことがわかる。一方、本発明による方
式では、高い精度で正弦波を検出できることが示されて
いる。
FIG. 1 (a) shows a harmonic waveform W composed of five sine waves having a basic period (pitch period) of T 1 = 120.
(M) is shown, and (d) is its spectrum. FIG. 1B shows a waveform in a case where a harmonic waveform component is detected by the present method using the section from m = 1 to 512 of W (m), the waveform is synthesized, and extrapolated (m> 512). (E) is the spectrum. FIG. 1 (c)
Similarly, a section of W (m) from m = 1 to 512 is analyzed by FFT, and shows a waveform subjected to IFFT (inverse Fourier transform), and (f) shows its spectrum. From these results, it can be seen that FFT does not detect the sine wave that actually constitutes the harmonic waveform. On the other hand, it is shown that the method according to the present invention can detect a sine wave with high accuracy.

【0008】なお、図1で用いた本発明の調和波形成分
の検出方式では、(数1)で与えられている和(サムメ
ーション)の範囲をm=513−4Tから512、Tを
10から128とした。このように波形データの任意の
区間を分析に用いることができるので、振幅A(T/
n)、B(T/n)は複数の区間から求めた値の平均値
を用いてもよい。
In the method of detecting a harmonic waveform component of the present invention used in FIG. 1, the sum (summation) range given by (Equation 1) is set to m = 513-4T to 512, and T is set to 10 128. As described above, an arbitrary section of the waveform data can be used for analysis, so that the amplitude A (T /
For n) and B (T / n), an average value of values obtained from a plurality of sections may be used.

【0009】図2は、本発明を音声信号の帯域圧縮に用
いた実施例のブロック図である。図2において、1は入
力波形データの1フレーム分を一時的に記憶する記憶
部、2は本発明によるところの調和波形成分の検出にお
ける波形データ分析部、3はピッチ周期および高調波成
分の振幅から調和波形を合成する波形合成部、4は入力
データから調和波形成分を除去して残差波形を出力する
波形減算部を表わし、aは波形データ入力端子、bは波
形パラメータ出力端子、cは残差波形出力端子を表わ
す。
FIG. 2 is a block diagram of an embodiment in which the present invention is used for band compression of an audio signal. In FIG. 2, 1 is a storage unit for temporarily storing one frame of input waveform data, 2 is a waveform data analysis unit for detecting a harmonic waveform component according to the present invention, 3 is a pitch period and amplitude of a harmonic component. Represents a waveform subtraction unit that removes a harmonic waveform component from input data and outputs a residual waveform, a represents a waveform data input terminal, b represents a waveform parameter output terminal, and c represents a waveform parameter output terminal. Indicates a residual waveform output terminal.

【0010】ここで入力波形データを、振幅を8ビット
で量子化、サンプリング周波数8kHzて標本化した音
声波形とすると、この波形を伝送(もしくは記録)する
のに64000ビット/秒を要する。一方、音声波形の
調和波形成分(母音成分)を、1フレーム16ms(1
28データ)ごとにピッチ周期情報8ビット、第8高調
波までのサイン波形成分およびコサイン波形成分の振幅
情報おのおの6ビットで表わすとすれば、調和波形の伝
送は6500ビット/秒で済むことになる。音声波形に
おいて調和波形成分は主要なものであるので、調和波形
成分を除去した残差波形は低ビットレートで伝送するこ
とができる。従って、図2の出力端子bから得られた波
形パラメータ(ピッチ周期および各高調波の振幅情報)
と出力端子cから得られた残差波形(低ビット波形情
報)を伝送することにより、音声信号の伝送におけるビ
ットレートの低減(帯域圧縮)が可能になる。
If the input waveform data is an audio waveform quantized with an 8-bit amplitude and sampled at a sampling frequency of 8 kHz, it takes 64000 bits / sec to transmit (or record) this waveform. On the other hand, the harmonic waveform component (vowel component) of the speech waveform is converted into 16 ms (1
If the pitch period information is represented by 8 bits and the amplitude information of the sine waveform component and the cosine waveform component up to the eighth harmonic is represented by 6 bits for every 28 data), the transmission of the harmonic waveform can be performed at 6500 bits / sec. . Since the harmonic waveform component is the main component in the audio waveform, the residual waveform from which the harmonic waveform component has been removed can be transmitted at a low bit rate. Therefore, waveform parameters (pitch period and amplitude information of each harmonic) obtained from the output terminal b in FIG.
And transmitting the residual waveform (low bit waveform information) obtained from the output terminal c, it is possible to reduce the bit rate (band compression) in the transmission of the audio signal.

【0011】なお、図2の波形合成部3では、分析で得
られたピッチ周期をT1、第n高調波のサイン波形成分
およびコサイン波形成分の振幅をそれぞれA(T
n)、B(T/n)として、(数5)の演算により合
成波形H(m,T)を得るものとする。また、図2の
波形減算部4では、入力波形データをW(m)とする
と、残差波形R(m,T)を次式で得るものとする。
In the waveform synthesizing unit 3 shown in FIG. 2, the pitch period obtained by the analysis is T1, and the amplitudes of the sine waveform component and the cosine waveform component of the n-th harmonic are A (T 1 /
n) and B (T 1 / n), a composite waveform H (m, T 1 ) is obtained by the operation of (Equation 5). Further, in the waveform subtracting section 4 in FIG. 2, when the input waveform data is W (m), the residual waveform R (m, T 1 ) is obtained by the following equation.

【数10】 (Equation 10)

【0012】図3は、本発明を音楽信号の帯域圧縮に用
いた実施例のブロック図である。この実施例は、図2の
1から4までの調和波形成分の検出動作をくり返すこと
により、波形データから一般に複数の調和波形成分の検
出を行なうものであり、残差波形から再び調和波形成分
を検出する構成になっている。すなわち図3の5の比較
制御部では、入力した残差波形を1の記憶部に記憶させ
て、再び調和波形成分を検出し除去して得た残差波形
を、前の残差波形データと比較し、変化が小さい場合
(残差波形のパワー比が1に近い場合)、調和波形成分
はすべて検出されたものとして、前の残差波形を出力端
子cに出力し、1の記憶部に新たな波形データを入力す
る命令を出す。音楽信号は一般に複数の音程の音が含ま
れているので、複数の調和波形成分を検出し、その波形
パラメータと残差波形を伝送することにより帯域圧縮
(低ビットレート伝送)が可能となる。
FIG. 3 is a block diagram of an embodiment in which the present invention is used for band compression of a music signal. In this embodiment, a plurality of harmonic waveform components are generally detected from waveform data by repeating the operation of detecting the harmonic waveform components 1 to 4 in FIG. 2, and the harmonic waveform components are again detected from the residual waveform. Is detected. That is, the comparison control unit 5 in FIG. 3 stores the input residual waveform in the storage unit 1 and detects and removes the harmonic waveform component again, and compares the residual waveform with the previous residual waveform data. If the change is small (when the power ratio of the residual waveform is close to 1), it is determined that all the harmonic waveform components have been detected, the previous residual waveform is output to the output terminal c, and the 1 storage unit is stored. Issue a command to input new waveform data. Since a music signal generally contains sounds of a plurality of pitches, band compression (low bit rate transmission) becomes possible by detecting a plurality of harmonic waveform components and transmitting the waveform parameters and the residual waveform.

【0013】[0013]

【発明の効果】本発明によれば、短い波形データを用い
て波形データに含まれた調和波形のピッチ周期を検出
し、調和波形成分を特定することができるので、音声信
号や音楽信号のピッチ周期検出や調和波形成分の検出に
利用でき、調和波形を波形パラメータで表わすことによ
り、音声信号や音楽信号の低ビットレート伝送(帯域圧
縮)が可能となるという効果がある。
According to the present invention, the pitch period of the harmonic waveform included in the waveform data can be detected using the short waveform data, and the harmonic waveform component can be specified. It can be used for period detection and detection of harmonic waveform components. By expressing a harmonic waveform by waveform parameters, there is an effect that a low bit rate transmission (band compression) of an audio signal or a music signal becomes possible.

【図面の簡単な説明】[Brief description of the drawings]

【図1】本発明による調和波形の分析結果を示す図であ
る。(a)は調和波形W(m)、(b)はW(m)の分
析合成波形(m≦512)と外挿波形(m>512)、
(c)はW(m)のFFTによる分析結果を逆フーリエ
変換(IFFT)して得た波形、(d)(e)(f)は
それぞれ波形(a)(b)(c)のスペクトルを示した
ものである。
FIG. 1 is a diagram showing an analysis result of a harmonic waveform according to the present invention. (A) is a harmonic waveform W (m), (b) is an analytically synthesized waveform of W (m) (m ≦ 512) and extrapolated waveform (m> 512),
(C) is a waveform obtained by performing an inverse Fourier transform (IFFT) on the analysis result of W (m) by FFT, and (d), (e), and (f) are spectra of waveforms (a), (b), and (c), respectively. It is shown.

【図2】本発明を音声信号の帯域圧縮に用いた実施例の
ブロック図である。
FIG. 2 is a block diagram of an embodiment in which the present invention is used for band compression of an audio signal.

【図3】本発明を音楽信号の帯域圧縮に用いた実施例の
ブロック図である。
FIG. 3 is a block diagram of an embodiment in which the present invention is used for band compression of a music signal.

【符号の説明】[Explanation of symbols]

1 記憶部 2 波形分析部 3 波形合成部 4 波形減算部 5 比較制御部 a 波形データ入力端子 b 波形パラメータ出力端子 c 残差波形出力端子 Reference Signs List 1 storage unit 2 waveform analysis unit 3 waveform synthesis unit 4 waveform subtraction unit 5 comparison control unit a waveform data input terminal b waveform parameter output terminal c residual waveform output terminal

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】任意の整数をnおよびr、周期をTとし
て、波形データに周期T/nなるサイン波形およびコサ
イン波形をそれぞれ掛けて区間長rTにわたって加算し
て得られる値を少なくとも用いて、前記波形データから
サイン波形成分とコサイン波形成分の振幅を前記整数n
の複数の値について求め、前記振幅の二乗の和が前記周
期Tを変数としたときに極大となる周期をピッチ周期と
して検出すること、もしくは前記ピッチ周期と前記ピッ
チ周期の1/nの周期のサイン波形成分とコサイン波形
成分の振幅を用いて前記波形データに含まれた調和波形
を特定することを特徴とした、波形データに存在する調
和波形成分の検出方式。
1. Using at least values obtained by multiplying waveform data by a sine waveform and a cosine waveform having a period T / n and adding them over a section length rT, where n and r are arbitrary integers and T is a period, From the waveform data, the amplitudes of the sine waveform component and the cosine waveform component are converted to the integer n
, And a cycle in which the sum of the squares of the amplitude becomes a maximum when the cycle T is a variable is detected as a pitch cycle, or the pitch cycle and 1 / n of the pitch cycle are detected. A method for detecting a harmonic waveform component present in waveform data, wherein a harmonic waveform included in the waveform data is specified using amplitudes of a sine waveform component and a cosine waveform component.
JP8204031A 1996-07-01 1996-07-01 System for detecting harmonic waveform component existing in waveform data Pending JPH1020886A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP8204031A JPH1020886A (en) 1996-07-01 1996-07-01 System for detecting harmonic waveform component existing in waveform data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP8204031A JPH1020886A (en) 1996-07-01 1996-07-01 System for detecting harmonic waveform component existing in waveform data

Publications (1)

Publication Number Publication Date
JPH1020886A true JPH1020886A (en) 1998-01-23

Family

ID=16483617

Family Applications (1)

Application Number Title Priority Date Filing Date
JP8204031A Pending JPH1020886A (en) 1996-07-01 1996-07-01 System for detecting harmonic waveform component existing in waveform data

Country Status (1)

Country Link
JP (1) JPH1020886A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007041593A (en) * 2005-08-01 2007-02-15 Samsung Electronics Co Ltd Method and apparatus for extracting voiced/unvoiced classification information using harmonic component of voice signal
WO2008081920A1 (en) * 2007-01-05 2008-07-10 Kyushu University, National University Corporation Voice enhancement processing device
JP2008186010A (en) * 2007-01-05 2008-08-14 Kyushu Univ Voice enhancement processing device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007041593A (en) * 2005-08-01 2007-02-15 Samsung Electronics Co Ltd Method and apparatus for extracting voiced/unvoiced classification information using harmonic component of voice signal
WO2008081920A1 (en) * 2007-01-05 2008-07-10 Kyushu University, National University Corporation Voice enhancement processing device
JP2008186010A (en) * 2007-01-05 2008-08-14 Kyushu Univ Voice enhancement processing device

Similar Documents

Publication Publication Date Title
CN1838239B (en) Apparatus for enhancing audio source decoder and method thereof
US5485543A (en) Method and apparatus for speech analysis and synthesis by sampling a power spectrum of input speech
US5029509A (en) Musical synthesizer combining deterministic and stochastic waveforms
US8412526B2 (en) Restoration of high-order Mel frequency cepstral coefficients
EP0657873B1 (en) Speech signal bandwidth compression and expansion apparatus, and bandwidth compressing speech signal transmission method, and reproducing method
JP6005510B2 (en) Selection of sound components in the audio spectrum for articulation and key analysis
EP3121808B1 (en) System for modeling characteristics of an electronic musical instrument
EP3121608B1 (en) Method of modeling characteristics of a non linear system.
US5452398A (en) Speech analysis method and device for suppyling data to synthesize speech with diminished spectral distortion at the time of pitch change
US7305339B2 (en) Restoration of high-order Mel Frequency Cepstral Coefficients
US6629049B2 (en) Method for non-harmonic analysis of waveforms for synthesis, interpolation and extrapolation
JP2779325B2 (en) Pitch search time reduction method using pre-processing correlation equation in vocoder
JPH1020886A (en) System for detecting harmonic waveform component existing in waveform data
JPH0777979A (en) Speech-operated acoustic modulating device
US5799271A (en) Method for reducing pitch search time for vocoder
KR0128851B1 (en) Pitch detecting method by spectrum harmonics matching of variable length dual impulse having different polarity
WO1994018573A1 (en) Non-harmonic analysis of waveform data and synthesizing processing system
JP3223564B2 (en) Pitch extraction method
JP3297750B2 (en) Encoding method
Thakuria et al. Musical Instrument Tuner
Miller Removal of noise from a voice signal by synthesis
US5899974A (en) Compressing speech into a digital format
JPH05281995A (en) Speech encoding method
JP2714880B2 (en) How to analyze physical waveforms into nonharmonic frequency components
Every et al. Separation of overlapping impulsive sounds by bandwise noise interpolation