JP4177755B2 - 発話特徴抽出システム - Google Patents

発話特徴抽出システム Download PDF

Info

Publication number
JP4177755B2
JP4177755B2 JP2003505912A JP2003505912A JP4177755B2 JP 4177755 B2 JP4177755 B2 JP 4177755B2 JP 2003505912 A JP2003505912 A JP 2003505912A JP 2003505912 A JP2003505912 A JP 2003505912A JP 4177755 B2 JP4177755 B2 JP 4177755B2
Authority
JP
Japan
Prior art keywords
frequency
signal
filter
circuit
bandpass filter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2003505912A
Other languages
English (en)
Japanese (ja)
Other versions
JP2004531767A5 (enExample
JP2004531767A (ja
Inventor
イーガル ブランドマン,
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of JP2004531767A publication Critical patent/JP2004531767A/ja
Publication of JP2004531767A5 publication Critical patent/JP2004531767A5/ja
Application granted granted Critical
Publication of JP4177755B2 publication Critical patent/JP4177755B2/ja
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Alarm Systems (AREA)
  • Machine Translation (AREA)
  • Telephonic Communication Services (AREA)
  • Sorting Of Articles (AREA)
JP2003505912A 2001-06-15 2002-06-14 発話特徴抽出システム Expired - Fee Related JP4177755B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/882,744 US6493668B1 (en) 2001-06-15 2001-06-15 Speech feature extraction system
PCT/US2002/019182 WO2002103676A1 (en) 2001-06-15 2002-06-14 Speech feature extraction system

Publications (3)

Publication Number Publication Date
JP2004531767A JP2004531767A (ja) 2004-10-14
JP2004531767A5 JP2004531767A5 (enExample) 2008-04-17
JP4177755B2 true JP4177755B2 (ja) 2008-11-05

Family

ID=25381249

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2003505912A Expired - Fee Related JP4177755B2 (ja) 2001-06-15 2002-06-14 発話特徴抽出システム

Country Status (7)

Country Link
US (2) US6493668B1 (enExample)
EP (1) EP1402517B1 (enExample)
JP (1) JP4177755B2 (enExample)
AT (1) ATE421137T1 (enExample)
CA (1) CA2450230A1 (enExample)
DE (1) DE60230871D1 (enExample)
WO (1) WO2002103676A1 (enExample)

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3673507B2 (ja) * 2002-05-16 2005-07-20 独立行政法人科学技術振興機構 音声波形の特徴を高い信頼性で示す部分を決定するための装置およびプログラム、音声信号の特徴を高い信頼性で示す部分を決定するための装置およびプログラム、ならびに擬似音節核抽出装置およびプログラム
JP4265908B2 (ja) * 2002-12-12 2009-05-20 アルパイン株式会社 音声認識装置及び音声認識性能改善方法
DE102004008225B4 (de) * 2004-02-19 2006-02-16 Infineon Technologies Ag Verfahren und Einrichtung zum Ermitteln von Merkmalsvektoren aus einem Signal zur Mustererkennung, Verfahren und Einrichtung zur Mustererkennung sowie computerlesbare Speichermedien
US20070041517A1 (en) * 2005-06-30 2007-02-22 Pika Technologies Inc. Call transfer detection method using voice identification techniques
US20070118364A1 (en) * 2005-11-23 2007-05-24 Wise Gerald B System for generating closed captions
US20070118372A1 (en) * 2005-11-23 2007-05-24 General Electric Company System and method for generating closed captions
US8345890B2 (en) 2006-01-05 2013-01-01 Audience, Inc. System and method for utilizing inter-microphone level differences for speech enhancement
US8204252B1 (en) 2006-10-10 2012-06-19 Audience, Inc. System and method for providing close microphone adaptive array processing
US8744844B2 (en) 2007-07-06 2014-06-03 Audience, Inc. System and method for adaptive intelligent noise suppression
US9185487B2 (en) 2006-01-30 2015-11-10 Audience, Inc. System and method for providing noise suppression utilizing null processing noise subtraction
US8194880B2 (en) 2006-01-30 2012-06-05 Audience, Inc. System and method for utilizing omni-directional microphones for speech enhancement
US7778831B2 (en) * 2006-02-21 2010-08-17 Sony Computer Entertainment Inc. Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
US8204253B1 (en) 2008-06-30 2012-06-19 Audience, Inc. Self calibration of audio device
US20080010067A1 (en) * 2006-07-07 2008-01-10 Chaudhari Upendra V Target specific data filter to speed processing
US8259926B1 (en) 2007-02-23 2012-09-04 Audience, Inc. System and method for 2-channel and 3-channel acoustic echo cancellation
US8189766B1 (en) 2007-07-26 2012-05-29 Audience, Inc. System and method for blind subband acoustic echo cancellation postfiltering
WO2009029037A1 (en) * 2007-08-27 2009-03-05 Telefonaktiebolaget Lm Ericsson (Publ) Adaptive transition frequency between noise fill and bandwidth extension
US20090150164A1 (en) * 2007-12-06 2009-06-11 Hu Wei Tri-model audio segmentation
US8180064B1 (en) 2007-12-21 2012-05-15 Audience, Inc. System and method for providing voice equalization
US8194882B2 (en) 2008-02-29 2012-06-05 Audience, Inc. System and method for providing single microphone noise suppression fallback
US8355511B2 (en) 2008-03-18 2013-01-15 Audience, Inc. System and method for envelope-based acoustic echo cancellation
US8521530B1 (en) 2008-06-30 2013-08-27 Audience, Inc. System and method for enhancing a monaural audio signal
US8626516B2 (en) * 2009-02-09 2014-01-07 Broadcom Corporation Method and system for dynamic range control in an audio processing system
US9838784B2 (en) 2009-12-02 2017-12-05 Knowles Electronics, Llc Directional audio capture
US9008329B1 (en) 2010-01-26 2015-04-14 Audience, Inc. Noise reduction using multi-feature cluster tracker
US9142220B2 (en) 2011-03-25 2015-09-22 The Intellisis Corporation Systems and methods for reconstructing an audio signal from transformed audio information
US8548803B2 (en) * 2011-08-08 2013-10-01 The Intellisis Corporation System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain
US8620646B2 (en) 2011-08-08 2013-12-31 The Intellisis Corporation System and method for tracking sound pitch across an audio signal using harmonic envelope
US9183850B2 (en) 2011-08-08 2015-11-10 The Intellisis Corporation System and method for tracking sound pitch across an audio signal
US8781880B2 (en) 2012-06-05 2014-07-15 Rank Miner, Inc. System, method and apparatus for voice analytics of recorded audio
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
US9280968B2 (en) * 2013-10-04 2016-03-08 At&T Intellectual Property I, L.P. System and method of using neural transforms of robust audio features for speech processing
DE112015004185T5 (de) 2014-09-12 2017-06-01 Knowles Electronics, Llc Systeme und Verfahren zur Wiederherstellung von Sprachkomponenten
US9842611B2 (en) 2015-02-06 2017-12-12 Knuedge Incorporated Estimating pitch using peak-to-peak distances
US9870785B2 (en) 2015-02-06 2018-01-16 Knuedge Incorporated Determining features of harmonic signals
US9922668B2 (en) 2015-02-06 2018-03-20 Knuedge Incorporated Estimating fractional chirp rate with multiple frequency representations
US9820042B1 (en) 2016-05-02 2017-11-14 Knowles Electronics, Llc Stereo separation and directional suppression with omni-directional microphones

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4300229A (en) * 1979-02-21 1981-11-10 Nippon Electric Co., Ltd. Transmitter and receiver for an othogonally multiplexed QAM signal of a sampling rate N times that of PAM signals, comprising an N/2-point offset fourier transform processor
US4221934A (en) * 1979-05-11 1980-09-09 Rca Corporation Compandor for group of FDM signals
GB8307702D0 (en) * 1983-03-21 1983-04-27 British Telecomm Digital band-split filter means
NL8400677A (nl) * 1984-03-02 1985-10-01 Philips Nv Transmissiesysteem voor de overdracht van data signalen in een modulaatband.

Also Published As

Publication number Publication date
US20020198711A1 (en) 2002-12-26
US20030014245A1 (en) 2003-01-16
EP1402517B1 (en) 2009-01-14
CA2450230A1 (en) 2002-12-27
ATE421137T1 (de) 2009-01-15
EP1402517A4 (en) 2007-04-25
WO2002103676A1 (en) 2002-12-27
US7013274B2 (en) 2006-03-14
JP2004531767A (ja) 2004-10-14
US6493668B1 (en) 2002-12-10
EP1402517A1 (en) 2004-03-31
DE60230871D1 (de) 2009-03-05

Similar Documents

Publication Publication Date Title
JP4177755B2 (ja) 発話特徴抽出システム
JP2004531767A5 (enExample)
US6804643B1 (en) Speech recognition
Sailor et al. Auditory Filterbank Learning for Temporal Modulation Features in Replay Spoof Speech Detection.
CN112382300A (zh) 声纹鉴定方法、模型训练方法、装置、设备及存储介质
JP7184236B2 (ja) 声紋を認識する方法、装置、設備、および記憶媒体
US5806022A (en) Method and system for performing speech recognition
Kim et al. Nonlinear enhancement of onset for robust speech recognition.
CN110767238B (zh) 基于地址信息的黑名单识别方法、装置、设备及存储介质
KR100571427B1 (ko) 잡음 환경에서의 음성 인식을 위한 특징 벡터 추출 장치및 역상관 필터링 방법
Maazouzi et al. MFCC and similarity measurements for speaker identification systems
JP3916834B2 (ja) 雑音が付加された周期波形の基本周期あるいは基本周波数の抽出方法
Rosell An introduction to front-end processing and acoustic features for automatic speech recognition
JPS6229799B2 (enExample)
Niyozmatova et al. Development Software for Preprocessing Voice Signals
Nikhil et al. Impact of ERB and bark scales on perceptual distortion based near-end speech enhancement
Lalitha et al. An encapsulation of vital non-linear frequency features for various speech applications
KR100381372B1 (ko) 음성특징 추출장치
JPH03122699A (ja) 雑音除去装置及び該装置を用いた音声認識装置
KR100563316B1 (ko) 보완적 특징벡터를 이용한 화자특징벡터 생성방법 및 장치
CN117079666A (zh) 歌曲打分方法、装置、终端设备以及存储介质
JP4014374B2 (ja) 音声分析方法
Lakshmi on Speech Enhancement Using Neural
JP2006084659A (ja) オーディオ信号分析方法、その方法を用いた音声認識方法、それらの装置、プログラムおよびその記録媒体
Kalamani et al. Comparison of cepstral and mel frequency cepstral coefficients for various clean and noisy speech signals

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20050613

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20071031

A601 Written request for extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A601

Effective date: 20080130

A602 Written permission of extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A602

Effective date: 20080206

A524 Written submission of copy of amendment under article 19 pct

Free format text: JAPANESE INTERMEDIATE CODE: A524

Effective date: 20080227

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20080402

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20080603

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20080801

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20080822

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20110829

Year of fee payment: 3

R150 Certificate of patent or registration of utility model

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20110829

Year of fee payment: 3

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120829

Year of fee payment: 4

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20130829

Year of fee payment: 5

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees