ATE378672T1 - METHOD FOR EXTRACTING FORMANTS - Google Patents

METHOD FOR EXTRACTING FORMANTS

Info

Publication number
ATE378672T1
ATE378672T1 AT04023155T AT04023155T ATE378672T1 AT E378672 T1 ATE378672 T1 AT E378672T1 AT 04023155 T AT04023155 T AT 04023155T AT 04023155 T AT04023155 T AT 04023155T AT E378672 T1 ATE378672 T1 AT E378672T1
Authority
AT
Austria
Prior art keywords
formants
judged
cauchy
maximum value
integral formula
Prior art date
Application number
AT04023155T
Other languages
German (de)
Inventor
Chan-Woo Kim
Original Assignee
Lg Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lg Electronics Inc filed Critical Lg Electronics Inc
Application granted granted Critical
Publication of ATE378672T1 publication Critical patent/ATE378672T1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Saccharide Compounds (AREA)
  • Fats And Perfumes (AREA)
  • Seasonings (AREA)
  • Apparatuses For Generation Of Mechanical Vibrations (AREA)
  • Testing Of Balance (AREA)

Abstract

In a formants extracting method capabie of precisely obtaining formants as resonance frequencies of voice with less computational complexity, the method includes searching a maximum value by a spectral peak-picking method (510), judging whether the number of formants corresponding to a zero at the obtained maximum point are two (520), and analyzing a pertinent root by roots polishing when the number of the formants are judged as two (530). The number of the formants are judged by applying Cauchy's integral formula, wherein Cauchy's integral formula is not applied repeatedly but only once at a surrounding portion of the maximum value in a z-domain. <IMAGE>
AT04023155T 2003-10-06 2004-09-29 METHOD FOR EXTRACTING FORMANTS ATE378672T1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR10-2003-0069175A KR100511316B1 (en) 2003-10-06 2003-10-06 Formant frequency detecting method of voice signal

Publications (1)

Publication Number Publication Date
ATE378672T1 true ATE378672T1 (en) 2007-11-15

Family

ID=34386745

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04023155T ATE378672T1 (en) 2003-10-06 2004-09-29 METHOD FOR EXTRACTING FORMANTS

Country Status (6)

Country Link
US (1) US8000959B2 (en)
EP (1) EP1530199B1 (en)
KR (1) KR100511316B1 (en)
CN (1) CN1331111C (en)
AT (1) ATE378672T1 (en)
DE (1) DE602004010035T2 (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2232700B1 (en) 2007-12-21 2014-08-13 Dts Llc System for adjusting perceived loudness of audio signals
US8538042B2 (en) * 2009-08-11 2013-09-17 Dts Llc System for increasing perceived loudness of speakers
US8204742B2 (en) * 2009-09-14 2012-06-19 Srs Labs, Inc. System for processing an audio signal to enhance speech intelligibility
WO2013019562A2 (en) 2011-07-29 2013-02-07 Dts Llc. Adaptive voice intelligibility processor
US9312829B2 (en) 2012-04-12 2016-04-12 Dts Llc System for adjusting loudness of audio signals in real time
DE112012006876B4 (en) * 2012-09-04 2021-06-10 Cerence Operating Company Method and speech signal processing system for formant-dependent speech signal amplification
US9899039B2 (en) * 2014-01-24 2018-02-20 Foundation Of Soongsil University-Industry Cooperation Method for determining alcohol consumption, and recording medium and terminal for carrying out same
KR101621774B1 (en) * 2014-01-24 2016-05-19 숭실대학교산학협력단 Alcohol Analyzing Method, Recording Medium and Apparatus For Using the Same
WO2015115677A1 (en) * 2014-01-28 2015-08-06 숭실대학교산학협력단 Method for determining alcohol consumption, and recording medium and terminal for carrying out same
KR101569343B1 (en) 2014-03-28 2015-11-30 숭실대학교산학협력단 Mmethod for judgment of drinking using differential high-frequency energy, recording medium and device for performing the method
KR101621797B1 (en) 2014-03-28 2016-05-17 숭실대학교산학협력단 Method for judgment of drinking using differential energy in time domain, recording medium and device for performing the method
KR101621780B1 (en) 2014-03-28 2016-05-17 숭실대학교산학협력단 Method fomethod for judgment of drinking using differential frequency energy, recording medium and device for performing the method
US11244818B2 (en) 2018-02-19 2022-02-08 Agilent Technologies, Inc. Method for finding species peaks in mass spectrometry

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5146539A (en) * 1984-11-30 1992-09-08 Texas Instruments Incorporated Method for utilizing formant frequencies in speech recognition
CA1250368A (en) * 1985-05-28 1989-02-21 Tetsu Taguchi Formant extractor
NL8603163A (en) * 1986-12-12 1988-07-01 Philips Nv METHOD AND APPARATUS FOR DERIVING FORMANT FREQUENCIES FROM A PART OF A VOICE SIGNAL
WO1993018505A1 (en) * 1992-03-02 1993-09-16 The Walt Disney Company Voice transformation system
JP3199338B2 (en) 1993-10-01 2001-08-20 日本電信電話株式会社 Formant extraction method
KR100211965B1 (en) 1996-12-20 1999-08-02 정선종 Method for extracting pitch synchronous formant of voiced speech
US6195632B1 (en) * 1998-11-25 2001-02-27 Matsushita Electric Industrial Co., Ltd. Extracting formant-based source-filter data for coding and synthesis employing cost function and inverse filtering
US6587816B1 (en) * 2000-07-14 2003-07-01 International Business Machines Corporation Fast frequency-domain pitch estimation

Also Published As

Publication number Publication date
EP1530199B1 (en) 2007-11-14
EP1530199A3 (en) 2005-05-18
CN1331111C (en) 2007-08-08
DE602004010035D1 (en) 2007-12-27
US8000959B2 (en) 2011-08-16
DE602004010035T2 (en) 2008-09-18
KR100511316B1 (en) 2005-08-31
CN1606062A (en) 2005-04-13
US20050075864A1 (en) 2005-04-07
KR20050033206A (en) 2005-04-12
EP1530199A2 (en) 2005-05-11

Similar Documents

Publication Publication Date Title
ATE378672T1 (en) METHOD FOR EXTRACTING FORMANTS
CN102054480B (en) Method for separating monaural overlapping speeches based on fractional Fourier transform (FrFT)
KR100348899B1 (en) The Harmonic-Noise Speech Coding Algorhthm Using Cepstrum Analysis Method
DE60325482D1 (en) METHOD FOR CHARACTERIZING BIOMOLECULES BY RESULT-CONTROLLED STRATEGY
WO2005077024A3 (en) Methods and apparatus for data analysis
CN106653056A (en) Fundamental frequency extraction model based on LSTM recurrent neural network and training method thereof
CN110136730B (en) Deep learning-based piano and acoustic automatic configuration system and method
EP1675102A3 (en) Method for extracting feature vectors for speech recognition
DK1328927T3 (en) Method and system for estimating artificial high-band signal in speech codec
DE602004004519D1 (en) EMBODIMENT, METHOD FOR PRODUCING PATTERNS AND METHOD FOR PRODUCING AN ELECTRONIC DEVICE USING THIS SOLUTION, AND ELECTRONIC DEVICE
CN102201240A (en) Harmonic noise excitation model vocoder based on inverse filtering
RU2005128572A (en) METHOD FOR CARRYING OUT THE MACHINE ASSESSMENT OF QUALITY OF AUDIO SIGNALS
DE602004002312D1 (en) Method and apparatus for determining formants using a residual signal model
ATE533146T1 (en) METHOD AND DEVICE FOR SEARCHING A BASE FREQUENCY
DE602004013747D1 (en) METHOD FOR ANALYZING THE BASIC FREQUENCY, METHOD AND DEVICE FOR LANGUAGE CONVERSION USING THEREOF
ATE371923T1 (en) TRACKING VOCAL TRACT RESONANCES USING A NONLINEAR PREDICTOR
ATE333695T1 (en) TOOL FOR QUALITY CAPTURE
KR100827097B1 (en) Method for determining variable length of frame for preprocessing of a speech signal and method and apparatus for preprocessing a speech signal using the same
DE602004023024D1 (en) Antialiasing of high quality
EP1260967A2 (en) Prediction parameter analysis apparatus and a prediction parameter analysis method
WO2007076279A3 (en) Method for classifying speech data
CN104599682A (en) Method for extracting pitch period of telephone wire quality voice
RU2005133519A (en) METHOD FOR PREDICTING THE COURSE OF ESSENTIAL ARTERIAL HYPERTENSION IN TEENAGERS
CN105448297A (en) Method and device for acquiring pitch period
Siafarikas et al. Overlapping wavelet packet features for speaker verification.

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties