ATE421137T1 - Sprachmerkmalextraktionssystem - Google Patents

Sprachmerkmalextraktionssystem

Info

Publication number
ATE421137T1
ATE421137T1 AT02744395T AT02744395T ATE421137T1 AT E421137 T1 ATE421137 T1 AT E421137T1 AT 02744395 T AT02744395 T AT 02744395T AT 02744395 T AT02744395 T AT 02744395T AT E421137 T1 ATE421137 T1 AT E421137T1
Authority
AT
Austria
Prior art keywords
feature extraction
band pass
pass filters
extraction system
language feature
Prior art date
Application number
AT02744395T
Other languages
German (de)
English (en)
Inventor
Yigal Brandman
Original Assignee
Yigal Brandman
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yigal Brandman filed Critical Yigal Brandman
Application granted granted Critical
Publication of ATE421137T1 publication Critical patent/ATE421137T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Alarm Systems (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Machine Translation (AREA)
  • Telephonic Communication Services (AREA)
  • Sorting Of Articles (AREA)
AT02744395T 2001-06-15 2002-06-14 Sprachmerkmalextraktionssystem ATE421137T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US09/882,744 US6493668B1 (en) 2001-06-15 2001-06-15 Speech feature extraction system

Publications (1)

Publication Number Publication Date
ATE421137T1 true ATE421137T1 (de) 2009-01-15

Family

ID=25381249

Family Applications (1)

Application Number Title Priority Date Filing Date
AT02744395T ATE421137T1 (de) 2001-06-15 2002-06-14 Sprachmerkmalextraktionssystem

Country Status (7)

Country Link
US (2) US6493668B1 (https=)
EP (1) EP1402517B1 (https=)
JP (1) JP4177755B2 (https=)
AT (1) ATE421137T1 (https=)
CA (1) CA2450230A1 (https=)
DE (1) DE60230871D1 (https=)
WO (1) WO2002103676A1 (https=)

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3673507B2 (ja) * 2002-05-16 2005-07-20 独立行政法人科学技術振興機構 音声波形の特徴を高い信頼性で示す部分を決定するための装置およびプログラム、音声信号の特徴を高い信頼性で示す部分を決定するための装置およびプログラム、ならびに擬似音節核抽出装置およびプログラム
JP4265908B2 (ja) * 2002-12-12 2009-05-20 アルパイン株式会社 音声認識装置及び音声認識性能改善方法
DE102004008225B4 (de) * 2004-02-19 2006-02-16 Infineon Technologies Ag Verfahren und Einrichtung zum Ermitteln von Merkmalsvektoren aus einem Signal zur Mustererkennung, Verfahren und Einrichtung zur Mustererkennung sowie computerlesbare Speichermedien
US20070041517A1 (en) * 2005-06-30 2007-02-22 Pika Technologies Inc. Call transfer detection method using voice identification techniques
US20070118364A1 (en) * 2005-11-23 2007-05-24 Wise Gerald B System for generating closed captions
US20070118372A1 (en) * 2005-11-23 2007-05-24 General Electric Company System and method for generating closed captions
US8345890B2 (en) 2006-01-05 2013-01-01 Audience, Inc. System and method for utilizing inter-microphone level differences for speech enhancement
US8204252B1 (en) 2006-10-10 2012-06-19 Audience, Inc. System and method for providing close microphone adaptive array processing
US8194880B2 (en) 2006-01-30 2012-06-05 Audience, Inc. System and method for utilizing omni-directional microphones for speech enhancement
US9185487B2 (en) 2006-01-30 2015-11-10 Audience, Inc. System and method for providing noise suppression utilizing null processing noise subtraction
US8744844B2 (en) 2007-07-06 2014-06-03 Audience, Inc. System and method for adaptive intelligent noise suppression
US7778831B2 (en) * 2006-02-21 2010-08-17 Sony Computer Entertainment Inc. Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
US8204253B1 (en) 2008-06-30 2012-06-19 Audience, Inc. Self calibration of audio device
US20080010067A1 (en) * 2006-07-07 2008-01-10 Chaudhari Upendra V Target specific data filter to speed processing
US8259926B1 (en) 2007-02-23 2012-09-04 Audience, Inc. System and method for 2-channel and 3-channel acoustic echo cancellation
US8189766B1 (en) 2007-07-26 2012-05-29 Audience, Inc. System and method for blind subband acoustic echo cancellation postfiltering
PT2571024E (pt) 2007-08-27 2014-12-23 Ericsson Telefon Ab L M Frequência de transição adaptativa entre preenchimento de ruído e extensão da largura de banda
US20090150164A1 (en) * 2007-12-06 2009-06-11 Hu Wei Tri-model audio segmentation
US8180064B1 (en) 2007-12-21 2012-05-15 Audience, Inc. System and method for providing voice equalization
US8194882B2 (en) 2008-02-29 2012-06-05 Audience, Inc. System and method for providing single microphone noise suppression fallback
US8355511B2 (en) 2008-03-18 2013-01-15 Audience, Inc. System and method for envelope-based acoustic echo cancellation
US8521530B1 (en) 2008-06-30 2013-08-27 Audience, Inc. System and method for enhancing a monaural audio signal
US8626516B2 (en) * 2009-02-09 2014-01-07 Broadcom Corporation Method and system for dynamic range control in an audio processing system
US9838784B2 (en) 2009-12-02 2017-12-05 Knowles Electronics, Llc Directional audio capture
US9008329B1 (en) 2010-01-26 2015-04-14 Audience, Inc. Noise reduction using multi-feature cluster tracker
US9142220B2 (en) 2011-03-25 2015-09-22 The Intellisis Corporation Systems and methods for reconstructing an audio signal from transformed audio information
US9183850B2 (en) 2011-08-08 2015-11-10 The Intellisis Corporation System and method for tracking sound pitch across an audio signal
US8620646B2 (en) 2011-08-08 2013-12-31 The Intellisis Corporation System and method for tracking sound pitch across an audio signal using harmonic envelope
US8548803B2 (en) * 2011-08-08 2013-10-01 The Intellisis Corporation System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain
WO2013184667A1 (en) 2012-06-05 2013-12-12 Rank Miner, Inc. System, method and apparatus for voice analytics of recorded audio
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
US9280968B2 (en) * 2013-10-04 2016-03-08 At&T Intellectual Property I, L.P. System and method of using neural transforms of robust audio features for speech processing
DE112015004185T5 (de) 2014-09-12 2017-06-01 Knowles Electronics, Llc Systeme und Verfahren zur Wiederherstellung von Sprachkomponenten
US9870785B2 (en) 2015-02-06 2018-01-16 Knuedge Incorporated Determining features of harmonic signals
US9842611B2 (en) 2015-02-06 2017-12-12 Knuedge Incorporated Estimating pitch using peak-to-peak distances
US9922668B2 (en) 2015-02-06 2018-03-20 Knuedge Incorporated Estimating fractional chirp rate with multiple frequency representations
US9820042B1 (en) 2016-05-02 2017-11-14 Knowles Electronics, Llc Stereo separation and directional suppression with omni-directional microphones

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4300229A (en) * 1979-02-21 1981-11-10 Nippon Electric Co., Ltd. Transmitter and receiver for an othogonally multiplexed QAM signal of a sampling rate N times that of PAM signals, comprising an N/2-point offset fourier transform processor
US4221934A (en) * 1979-05-11 1980-09-09 Rca Corporation Compandor for group of FDM signals
GB8307702D0 (en) * 1983-03-21 1983-04-27 British Telecomm Digital band-split filter means
NL8400677A (nl) * 1984-03-02 1985-10-01 Philips Nv Transmissiesysteem voor de overdracht van data signalen in een modulaatband.

Also Published As

Publication number Publication date
EP1402517A1 (en) 2004-03-31
US20020198711A1 (en) 2002-12-26
JP4177755B2 (ja) 2008-11-05
EP1402517B1 (en) 2009-01-14
US7013274B2 (en) 2006-03-14
US20030014245A1 (en) 2003-01-16
JP2004531767A (ja) 2004-10-14
US6493668B1 (en) 2002-12-10
DE60230871D1 (de) 2009-03-05
EP1402517A4 (en) 2007-04-25
WO2002103676A1 (en) 2002-12-27
CA2450230A1 (en) 2002-12-27

Similar Documents

Publication Publication Date Title
ATE421137T1 (de) Sprachmerkmalextraktionssystem
Zhao et al. Analyzing noise robustness of MFCC and GFCC features in speaker identification
DK1423988T3 (da) Retningsbestemt audiosignalbehandling ved brug af en oversamplet filterbank
DK2808868T3 (en) Method of Processing a Voice Segment and Hearing Aid
DE602004022787D1 (de) Verteiltes spracherkennungsverfahren
MXPA03009357A (es) Escalamiento en el tiempo y escalamiento en el tono de alta calidad de senales de audio.
ATE288666T1 (de) Verfahren zur rauschunterdrückung in einem adaptiven strahlformer
DE60131639D1 (de) Vorrichtungen und Verfahren zur Bestimmung von Leistungswerten für die Geräuschunterdrückung für ein Sprachkommunikationssystem
DE69836785D1 (de) Audiosignalkompression, Sprachsignalkompression und Spracherkennung
ATE347162T1 (de) Rauschunterdrückung zur robusten spracherkennung
DE60101094D1 (de) Vorrichtung zum trennen des frequenzbands eines eingangssignals
Dhameliya et al. Notice of Removal: Feature extraction and classification techniques for speaker recognition: A review
US9179225B2 (en) Hearing aid device
DE69726490D1 (de) Vorrichtung zur Unterdrückung von Sprache und Rauschen, sowie zur Spracherkennung
DE50210983D1 (de) Filterschaltung und verfahren zur verarbeitung eines audiosignals
DK0646300T3 (da) Fremgangsmåde til konstatering af fejl ved digitaliserede, datareducerede lyd- og datasignaler
VH et al. A study on speech recognition technology
Zhu et al. Analysis of hybrid feature research based on extraction LPCC and MFCC
GB2014406B (en) Analog speech enconder and decoder
Palo et al. Notice of Removal: Novel feature extraction technique for child emotion recognition
JP6435133B2 (ja) 音素分割装置、音声処理システム、音素分割方法、および音素分割プログラム
ATE422696T1 (de) Verfahren zur analyse von impulsen enthaltenden signalen
RU2002129029A (ru) Способ дикторонезависимого распознавания звуков речи
Sujatha et al. Notice of Removal: Biometric Identity Verification using Automatic Speaker Recognition
ATE431653T1 (de) Ultrabreitband-hochfrequenzsender und - impulsgenerator

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties