WO2005031702A1 - Method for estimating resonance frequencies - Google Patents
Method for estimating resonance frequencies Download PDFInfo
- Publication number
- WO2005031702A1 WO2005031702A1 PCT/BE2004/000116 BE2004000116W WO2005031702A1 WO 2005031702 A1 WO2005031702 A1 WO 2005031702A1 BE 2004000116 W BE2004000116 W BE 2004000116W WO 2005031702 A1 WO2005031702 A1 WO 2005031702A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- estimating
- resonance frequencies
- signal
- differential
- input signal
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/09—Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
Definitions
- the present invention is related to an analysis technique for recorded speech signals that can be used in various fields of speech processing technology.
- the basic source-filter speech model is very frequently used. It mainly assumes that the speech signal is produced by exciting a filter (corresponding to vocal tract), i.e. by an excitation produced by the lung pressure and larynx (source signal or the glottal flow signal) .
- a filter corresponding to vocal tract
- source signal or the glottal flow signal source signal or the glottal flow signal
- LP analysis estimates poles of a system, which correspond to resonances of a signal . Once the resonances are estimated with LP analysis, the problem is reduced to relating source and tract resonances respectively, a difficult and important problem in speech processing technology. There are many difficulties and inefficiencies of LP estimation due to various problems like non-linear source-tract interaction, dependency on degree of linear prediction and separating source resonances from vocal tract . [0005] Despite the disadvantages of LP analysis, various methods have been proposed for source-tract separation using LP analysis.
- the present invention aims to provide a method for estimating the formant frequencies for vocal tract and glottal flow, directly from speech signals.
- the invention further aims to provide a computer program that implements such a method.
- the present invention relates to a method for estimating from an input signal the resonance frequencies of a system modelled as a source and a filter, comprising the steps of determining the Z-transform of the input signal, calculating the differential-phase spectrum of the Z- transformed input signal (without using the amplitude spectrum) , whereby the Z-transform is evaluated on a circle centered around the origin of the Z-plane,
- the circle on which the Z-transform is evaluated is different from the unit circle in the Z-plane.
- the Z-transform of the input signal is evaluated on more than one circle.
- the input signal is windowed.
- the input signal is a speech signal .
- the source is a glottal flow signal and the filter is a vocal tract system.
- the step of attributing the peaks is performed based on the sign of said peaks.
- the method for estimating the resonance frequencies further comprises the step of removing zeros of the input signal's Z-transform before performing the step of calculating the differential- phase spectrum.
- the invention also relates to a program, executable on a programmable device containing instructions, which, when executed, perform the method as described above .
- Fig. 1 represents the source-filter speech model.
- Fig. 2 shows the anti-causal character of the glottal flow signal, a) a causal filter response, b) an anti- causal filter response, c)a typical glottal flow signal.
- Fig. 3 represents a causal and an anti-causal single pole filter response plots: a)causal impulse response, b) log-amplitude spectrum of a) , c)group delay spectrum of a), d) a. ⁇ t ⁇ -causal impulse response, ejlog- amplitude spectrum of d) , f)group delay spectrum of d) .
- Fig. 4 represents a mixed phase all-pole signal with causal resonances at 1000 Hz and 2000 Hz and anti-causal resonances at 500 Hz and 1500 Hz.
- Fig. 5 shows the effect of zeros on the group delay function, a) Zeros of Z-Transform (ZZT) plotted in polar coordinates (region of zeros close to the unit circle indicated by dashed lines) , b) group delay function with ZZT close to unit circle superimposed.
- ZZT Z-Transform
- Fig. 6 represents an example of differential- phase spectrum analysis of synthetic speech.
- Fig. 7 represents a flowchart of the method according to the invention.
- the invention targets the estimation of resonance frequencies (formant frequencies) of the source and the vocal tract contributions directly from the speech signal itself.
- the source-tract separation problem needs to be handled with tools, which can detect anti-causal resonances.
- the technique according to the invention is more effective than current state of art methods, mainly because it is capable of detecting causal and anti-casual resonances without utilisation of a particular model of analysis, but only with spectral peak analysis. Additionally, the technique has no dependency on analysis degrees as in LP analysis systems.
- the source-filter model (see Fig.l) is usually accompanied by the assumption that a speech signal is a physical system output and therefore it is the output of a stable filter system.
- all the resonances of the signal shall correspond to poles inside the unit circle in z-plane.
- the system is all-pole (i.e., the system can be defined by only poles and a gain factor)
- a minimum phase system the systems having all zeros and poles inside the unit circle are classified as minimum phase systems) .
- Speech signals have been assumed to be minimum-phase signals for long years in many studies.
- an anti-causal signal x (-n) is obtained.
- the version of x (-n) time shifted to positive time indexes is also referred to as anti-causal, because the filter characteristics are time-reversed. Shifting the signal in time only introduces a linear phase component to the signal (a DC component is added to the group delay spectrum) and the amplitude spectrum is unaffected.
- the anti-causality assumption for the source is based on the characteristics of glottal flow models (as explained in detail in ⁇ Spectral correlates of glottal waveform models : an analytic study' , Doval and d 'Alessandro, Proc. ICASSP 97, Kunststoff, pp. 446-452) .
- One easy explanation is through visual inspection of signal waveforms.
- Fig.2 an example glottal flow signal is presented together with a causal and an anti-causal filter response .
- the glottal flow signal has the same characteristics as the anti-causal response, namely a slowly increasing function with a rather sharper decay.
- the glottal flow signals can be modelled by an all-pole system where the poles are anti-causal. For stability of an anti- causal all-pole system, all of the poles have to be out of the unit circle and therefore the system is maximum phase.
- the mixed-phase model assumes speech signals have two types of resonances: anti-causal resonances of the source (glottal flow) signal and causal resonances of the vocal tract filter. The invention aims to estimate these resonances from the speech signal. The estimation method is based on analysis of 'differential-phase spectra' .
- the closest concept to differential-phase spectra is the group delay, so the differential-phase spectra will be introduced as a more general form of group delay.
- the source-tract separation is based on spectral analysis of causal and anti-causal parts of the speech signal.
- the frequently used amplitude (or power) spectra offer very little help (if any) .
- the phase spectra have to be studied, since causality can only be observed in phase spectra.
- One of the main difficulties of phase analysis is its automatically wrapped nature .
- the phase spectra derivative however does not have the same property and various other advantages exist over both phase spectra and amplitude spectra.
- the group delay function GD( ) is defined as the negative of derivative of the argument ⁇ ( ⁇ ) of X ( ⁇ ) , being the discrete Fourier transform of a signal x (n) .
- Fig. 3 the effects of time reversal on the amplitude spectrum and group delay function are presented on an example.
- the signal in Fig.3a is time reversed to obtain the signal in Fig. 3d.
- Comparison of Fig. 3b with Fig. 3e and Fig. 3c with Fig. 3f shows that the only change in frequency characteristics is horizontal inversion of the group delay function.
- Fig.4 a mixed phase signal (synthesised with all-pole model) and its group delay spectrum are presented.
- X(e j ⁇ ) -Z m ) (equation 4)
- X (e ⁇ ) denotes the z-transform of a discrete time sequence x (n)
- the Z m represent the roots of the z- transform
- G is the gain factor.
- Each factor in (eq.4) corresponds, in the z-plane, to a vector starting at Z m and ending at e ⁇ .
- e J * gets very close to one of these zeros
- one of the factors in (eq.4) gets very small in amplitude, and undergoes an important argument modification which corresponds to spiky change in the group delay function.
- a simple observation on group delay spectrums does not provide the desired information, the plots are usually too noisy due to the zeros close to unit circle.
- a group delay function for a speech frame is presented together with zeros of z-transform of the same signal closely located to the unit circle. Each zero creates a spike in the group delay function hiding resonance peaks to appear as in Fig . 4.
- the problem is first redefined in a more general framework of x differential-phase spectrum' .
- the differential-phase spectrum is defined as the negative derivative of the phase spectrum calculated from the signal's z-transform, evaluated on a circle with any radius centered at the origin of the z-plane.
- the invention advantageously makes use of the insight that signal resonances can be tracked from differential phase spectra calculated on circles with radius different from 1 (the unit circle), i.e. on circles with a radius either larger or smaller than 1.
- the analysis of more than one differential-phase spectrum is advantageous for the estimation of source and tract characteristics due to the poles existing inside and outside the unit circle (though a single differential-phase spectrum can also reveal all causal and anti-causal resonances) . Therefore the method preferably includes the step of processing more than one differential-phase spectrum calculated at circles with different radius, as this yields an improved robustness.
- the resulting differential-phase spectra are much less noisy than group delay functions, but still zeros may exist anywhere in the z-plane.
- a single unexpected zero causes the same type of spiky effect for the frequency regions, where the zero is close to the analysis circle. In order to get rid of this effect, a zero-removal technique is proposed that effectively calculates noise-free differential-phase spectra.
- the procedure comprises the steps of: • estimating zeros (roots of z-transform polynomial of the speech signal) with a numerical method, • removing or displacing zeros from z-plane regions, where the differential-phase spectrum is to be calculated, and • calculating the differential-phase spectrum at this region from the remaining zeros.
- the roots (zeros) of a z-transform polynomial can be determined by a numerical method.
- the obtained set of roots of z-transform polynomial can be divided into two sets of roots (which corresponds to dividing the z- transform polynomial into two polynomials) .
- the obtained two sets of roots correspond to the spectral representation of glottal flow and vocal tract contributions of speech signal : when classifying the roots according to their distance to the origin of the z-plane (i.e. their radius), roots outside the unit circle are classified as glottal flow roots and roots inside the unit circle as vocal tract roots.
- glottal flow roots which are out of the unit circle are removed from the complete set of zeros and then the differential-phase spectrum calculation is performed.
- Fig.6 An example on synthetic speech analysis is presented in Fig.6 for the zero-removal technique and its effect to differential-phase spectrum.
- the first row of plots include the actual amplitude spectrum of glottal flow (Fig.6a) and the amplitude spectrum vocal tract (Fig.6b) used in synthesis.
- the aim is to estimate the resonance peak (formant) locations of these two systems directly from the speech signal, which is constructed by convolution of these two systems and an impulse train to obtain several cycles of speech signal.
- An all-pole vocal tract filter (of a typical vowel " " with normalised resonance frequencies at 0.075, 0.15, 0.275, 0.4 for 16000 Hz) is used for synthesis.
- the ZZT of windowed speech signal is presented in Fig. 6c and Fig. 6d with the analysis circles indicated on top.
- the differential-phase spectra obtained on the indicated analysis circles are presented in Fig. 6e and Fig. 6f respectively. Since zeros close to analysis circles exist, the resulting differential-phase spectra are noisy.
- Fig.7 summarises the method according to the invention in a flowchart. The various steps are as described previously.
Abstract
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2006522850A JP2007501957A (en) | 2003-08-11 | 2004-08-11 | Method for estimating resonant frequency |
EP04761477A EP1665228A1 (en) | 2003-08-11 | 2004-08-11 | Method for estimating resonance frequencies |
US10/568,150 US7333931B2 (en) | 2003-08-11 | 2004-08-11 | Method for estimating resonance frequencies |
AU2004276847A AU2004276847B2 (en) | 2003-08-11 | 2004-08-11 | Method for estimating resonance frequencies |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US49437503P | 2003-08-11 | 2003-08-11 | |
US60/494,375 | 2003-08-11 | ||
US56405404P | 2004-04-21 | 2004-04-21 | |
US60/564,054 | 2004-04-21 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2005031702A1 true WO2005031702A1 (en) | 2005-04-07 |
Family
ID=34396150
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/BE2004/000116 WO2005031702A1 (en) | 2003-08-11 | 2004-08-11 | Method for estimating resonance frequencies |
Country Status (5)
Country | Link |
---|---|
US (1) | US7333931B2 (en) |
EP (1) | EP1665228A1 (en) |
JP (1) | JP2007501957A (en) |
AU (1) | AU2004276847B2 (en) |
WO (1) | WO2005031702A1 (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2002542B1 (en) * | 2006-02-21 | 2022-01-05 | Cirrus Logic International Semiconductor Limited | Method and device for low delay processing |
AT507844B1 (en) * | 2009-02-04 | 2010-11-15 | Univ Graz Tech | METHOD FOR SEPARATING SIGNALING PATH AND APPLICATION FOR IMPROVING LANGUAGE WITH ELECTRO-LARYNX |
US20120089392A1 (en) * | 2010-10-07 | 2012-04-12 | Microsoft Corporation | Speech recognition user interface |
EP2870738B1 (en) * | 2012-07-09 | 2016-03-23 | Telefonaktiebolaget LM Ericsson (publ) | Device for carrier phase recovery |
US9865247B2 (en) * | 2014-07-03 | 2018-01-09 | Google Inc. | Devices and methods for use of phase information in speech synthesis systems |
GB2537802A (en) * | 2015-02-13 | 2016-11-02 | Univ Sheffield | Parameter estimation and control method and apparatus |
US11610597B2 (en) | 2020-05-29 | 2023-03-21 | Shure Acquisition Holdings, Inc. | Anti-causal filter for audio signal processing |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0670750B2 (en) * | 1987-08-19 | 1994-09-07 | 日本電気株式会社 | Pole zero analyzer with estimation of phase characteristics |
JPH01232224A (en) * | 1988-03-11 | 1989-09-18 | Matsushita Electric Ind Co Ltd | Resonance frequency extracting device |
US6069857A (en) * | 1991-02-15 | 2000-05-30 | Discovision Associates | Optical disc system having improved circuitry for performing blank sector check on readable disc |
JP3150277B2 (en) * | 1995-10-30 | 2001-03-26 | 松下電器産業株式会社 | Linear prediction coefficient calculator |
US6195632B1 (en) * | 1998-11-25 | 2001-02-27 | Matsushita Electric Industrial Co., Ltd. | Extracting formant-based source-filter data for coding and synthesis employing cost function and inverse filtering |
US6704711B2 (en) * | 2000-01-28 | 2004-03-09 | Telefonaktiebolaget Lm Ericsson (Publ) | System and method for modifying speech signals |
JP2002169579A (en) * | 2000-12-01 | 2002-06-14 | Takayuki Arai | Device for embedding additional data in audio signal and device for reproducing additional data from audio signal |
SE0101175D0 (en) * | 2001-04-02 | 2001-04-02 | Coding Technologies Sweden Ab | Aliasing reduction using complex-exponential-modulated filter banks |
JP2003157100A (en) * | 2001-11-22 | 2003-05-30 | Nippon Telegr & Teleph Corp <Ntt> | Voice communication method and equipment, and voice communication program |
-
2004
- 2004-08-11 JP JP2006522850A patent/JP2007501957A/en active Pending
- 2004-08-11 AU AU2004276847A patent/AU2004276847B2/en not_active Ceased
- 2004-08-11 EP EP04761477A patent/EP1665228A1/en not_active Withdrawn
- 2004-08-11 WO PCT/BE2004/000116 patent/WO2005031702A1/en active Search and Examination
- 2004-08-11 US US10/568,150 patent/US7333931B2/en not_active Expired - Fee Related
Non-Patent Citations (11)
Title |
---|
B.BOZKURT AND T.DUTOIT: "Mixed-Phase Speech Modeling and Formant Estimation, Using Differential Phase Spectrums", PROC. ISCA ITRW VOQUAL 2003, 27 August 2003 (2003-08-27), GENEVA, SWITZERLAND, pages 21 - 24, XP002312214, Retrieved from the Internet <URL:http://tcts.fpms.ac.be/publications/papers/2003/voqual03_bbtd.pdf> [retrieved on 20041202] * |
B.DOVAL AND C.D'ALESSANDRO: "Spectral correlates of glottal waveform models: an analytic study", 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING 21-24 APRIL 1997 MUNICH, GERMANY, vol. 2, 21 April 1997 (1997-04-21), 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (Cat. No.97CB36052) IEEE Comput. Soc. Press Los Alamitos, CA, USA, pages 1295 - 1298 vol., ISBN: 0-8186-7919-0 * |
B.DOVAL, C.D'ALESSANDRO, NATHALIE HENRICH: "The Voice Source as a causal/anticausal linear filter", PROC. ISCA ITRW VOQUAL 2003, 27 August 2003 (2003-08-27), GENEVA, SWITZERLAND, pages 15 - 20, XP002312215, Retrieved from the Internet <URL:http://www.isca-speech.org/archive/archive_papers/voqual03/voq3_015.pdf> [retrieved on 20041202] * |
DATABASE INSPEC [online] THE INSTITUTION OF ELECTRICAL ENGINEERS, STEVENAGE, GB; 1985, REDDY N S ET AL: "High-resolution formant extraction from linear-prediction phase spectra", XP002312216, Database accession no. 2427409 * |
DATABASE INSPEC [online] THE INSTITUTION OF ELECTRICAL ENGINEERS, STEVENAGE, GB; 1989, DUNCAN G ET AL: "A nonparametric method of formant estimation using group delay spectra", XP002312217, Database accession no. 3472036 * |
DATABASE INSPEC [online] THE INSTITUTION OF ELECTRICAL ENGINEERS, STEVENAGE, GB; 1990, JACKSON L B: "Noncausal ARMA modeling of voiced speech", XP002312218, Database accession no. 3517863 * |
DATABASE INSPEC [online] THE INSTITUTION OF ELECTRICAL ENGINEERS, STEVENAGE, GB; 1997, DOVAL B ET AL: "Spectral correlates of glottal waveform models: an analytic study", XP002312219, Database accession no. 5730449 * |
ICASSP-89: 1989 INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (IEEE CAT. NO.89CH2673-2) 23-26 MAY 1989 GLASGOW, UK, 23 May 1989 (1989-05-23) - 26 May 1989 (1989-05-26), ICASSP-89: 1989 International Conference on Acoustics, Speech and Signal Processing (IEEE Cat. No.89CH2673-2) IEEE New York, NY, USA, pages 572 - 575 vol.1 * |
JACKSON LELAND B: "Noncausal ARMA modeling of voiced speech", IEEE TRANSACTIONS ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, vol. 37, no. 10, October 1989 (1989-10-01), IEEE Transactions on Acoustics, Speech and Signal Processing USA, pages 1606 - 1608, ISSN: 0096-3518 * |
N. SRIDHAR REDDY AND M.N.S. SWAMY: "High-Resolution Formant Extraction from Linear-Prediction Phase Spectra", IEEE TRANSACTIONS ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, vol. ASSP-32, no. 6, 6 December 1984 (1984-12-06), USA, pages 1136 - 1144, ISSN: 0096-3518 * |
See also references of EP1665228A1 * |
Also Published As
Publication number | Publication date |
---|---|
EP1665228A1 (en) | 2006-06-07 |
JP2007501957A (en) | 2007-02-01 |
US7333931B2 (en) | 2008-02-19 |
AU2004276847B2 (en) | 2009-10-08 |
AU2004276847A1 (en) | 2005-04-07 |
US20060229868A1 (en) | 2006-10-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Drugman et al. | Causal–anticausal decomposition of speech using complex cepstrum for glottal source estimation | |
Bozkurt et al. | Chirp group delay analysis of speech signals | |
Athineos et al. | Autoregressive modeling of temporal envelopes | |
Bayya et al. | Spectro-temporal analysis of speech signals using zero-time windowing and group delay function | |
Bozkurt et al. | Mixed-phase speech modeling and formant estimation, using differential phase spectrums | |
US6047254A (en) | System and method for determining a first formant analysis filter and prefiltering a speech signal for improved pitch estimation | |
Resch et al. | Estimation of the instantaneous pitch of speech | |
AU2004276847B2 (en) | Method for estimating resonance frequencies | |
CA2947957C (en) | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system | |
Bozkurt et al. | Improved differential phase spectrum processing for formant tracking | |
Bozkurt | Zeros of the z-transform (ZZT) representation and chirp group delay processing for the analysis of source and filter characteristics of speech signals | |
Bozkurt et al. | A method for glottal formant frequency estimation | |
Vijayan et al. | Epoch extraction from allpass residual of speech signals | |
JPH0573093A (en) | Extracting method for signal feature point | |
Schafer | Homomorphic systems and cepstrum analysis of speech | |
Yegnanarayana | Speech analysis by pole-zero decomposition of short-time spectra | |
Walker et al. | Advanced methods for glottal wave extraction | |
Chowdhury et al. | Formant estimation from speech signal using the magnitude spectrum modified with group delay spectrum | |
Giri et al. | Block sparse excitation based all-pole modeling of speech | |
US5911170A (en) | Synthesis of acoustic waveforms based on parametric modeling | |
Funaki et al. | WLP-based TV-CAR speech analysis and its evaluation for F0 estimation | |
Wiriyarattanakul et al. | Pitch segmentation of speech signals based on short-time energy waveform | |
Roebel et al. | Real time signal transposition with envelope preservation in the phase vocoder | |
Drugman et al. | Causal-Anticausal Decomposition of Speech using Complex Cepstrum for Glottal Source Estimation | |
Sagayama et al. | A composite sinusoidal model applied to spectral analysis of speech |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2006522850 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2004276847 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006229868 Country of ref document: US Ref document number: 10568150 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: 2004276847 Country of ref document: AU Date of ref document: 20040811 Kind code of ref document: A |
|
WWP | Wipo information: published in national office |
Ref document number: 2004276847 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2004761477 Country of ref document: EP |
|
DPEN | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed from 20040101) | ||
WWP | Wipo information: published in national office |
Ref document number: 2004761477 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 10568150 Country of ref document: US |