JP4177755B2 - Speech feature extraction system - Google Patents
Speech feature extraction system
- Publication number
- JP4177755B2 (application JP2003505912A)
- Authority
- JP
- Japan
- Prior art keywords
- frequency
- signal
- filter
- circuit
- bandpass filter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Alarm Systems (AREA)
- Machine Translation (AREA)
- Telephonic Communication Services (AREA)
- Sorting Of Articles (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/882,744 US6493668B1 (en) | 2001-06-15 | 2001-06-15 | Speech feature extraction system |
| PCT/US2002/019182 WO2002103676A1 (en) | 2001-06-15 | 2002-06-14 | Speech feature extraction system |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2004531767A (ja) | 2004-10-14 |
| JP2004531767A5 | 2008-04-17 |
| JP4177755B2 (ja) | 2008-11-05 |
Family
ID=25381249
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2003505912A (JP4177755B2 (ja), Expired - Fee Related) | 2001-06-15 | 2002-06-14 | Speech feature extraction system |
Country Status (7)
| Country | Link |
|---|---|
| US (2) | US6493668B1 |
| EP (1) | EP1402517B1 |
| JP (1) | JP4177755B2 |
| AT (1) | ATE421137T1 |
| CA (1) | CA2450230A1 |
| DE (1) | DE60230871D1 |
| WO (1) | WO2002103676A1 |
Families Citing this family (37)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3673507B2 (ja) * | 2002-05-16 | 2005-07-20 | 独立行政法人科学技術振興機構 | Apparatus and program for determining a portion that indicates the features of a speech waveform with high reliability, apparatus and program for determining a portion that indicates the features of a speech signal with high reliability, and pseudo-syllable nucleus extraction apparatus and program |
| JP4265908B2 (ja) * | 2002-12-12 | 2009-05-20 | アルパイン株式会社 | Speech recognition device and method for improving speech recognition performance |
| DE102004008225B4 (de) * | 2004-02-19 | 2006-02-16 | Infineon Technologies Ag | Method and device for determining feature vectors from a signal for pattern recognition, method and device for pattern recognition, and computer-readable storage media |
| US20070041517A1 (en) * | 2005-06-30 | 2007-02-22 | Pika Technologies Inc. | Call transfer detection method using voice identification techniques |
| US20070118364A1 (en) * | 2005-11-23 | 2007-05-24 | Wise Gerald B | System for generating closed captions |
| US20070118372A1 (en) * | 2005-11-23 | 2007-05-24 | General Electric Company | System and method for generating closed captions |
| US8345890B2 (en) | 2006-01-05 | 2013-01-01 | Audience, Inc. | System and method for utilizing inter-microphone level differences for speech enhancement |
| US8204252B1 (en) | 2006-10-10 | 2012-06-19 | Audience, Inc. | System and method for providing close microphone adaptive array processing |
| US8744844B2 (en) | 2007-07-06 | 2014-06-03 | Audience, Inc. | System and method for adaptive intelligent noise suppression |
| US9185487B2 (en) | 2006-01-30 | 2015-11-10 | Audience, Inc. | System and method for providing noise suppression utilizing null processing noise subtraction |
| US8194880B2 (en) | 2006-01-30 | 2012-06-05 | Audience, Inc. | System and method for utilizing omni-directional microphones for speech enhancement |
| US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
| US8204253B1 (en) | 2008-06-30 | 2012-06-19 | Audience, Inc. | Self calibration of audio device |
| US20080010067A1 (en) * | 2006-07-07 | 2008-01-10 | Chaudhari Upendra V | Target specific data filter to speed processing |
| US8259926B1 (en) | 2007-02-23 | 2012-09-04 | Audience, Inc. | System and method for 2-channel and 3-channel acoustic echo cancellation |
| US8189766B1 (en) | 2007-07-26 | 2012-05-29 | Audience, Inc. | System and method for blind subband acoustic echo cancellation postfiltering |
| WO2009029037A1 (en) * | 2007-08-27 | 2009-03-05 | Telefonaktiebolaget Lm Ericsson (Publ) | Adaptive transition frequency between noise fill and bandwidth extension |
| US20090150164A1 (en) * | 2007-12-06 | 2009-06-11 | Hu Wei | Tri-model audio segmentation |
| US8180064B1 (en) | 2007-12-21 | 2012-05-15 | Audience, Inc. | System and method for providing voice equalization |
| US8194882B2 (en) | 2008-02-29 | 2012-06-05 | Audience, Inc. | System and method for providing single microphone noise suppression fallback |
| US8355511B2 (en) | 2008-03-18 | 2013-01-15 | Audience, Inc. | System and method for envelope-based acoustic echo cancellation |
| US8521530B1 (en) | 2008-06-30 | 2013-08-27 | Audience, Inc. | System and method for enhancing a monaural audio signal |
| US8626516B2 (en) * | 2009-02-09 | 2014-01-07 | Broadcom Corporation | Method and system for dynamic range control in an audio processing system |
| US9838784B2 (en) | 2009-12-02 | 2017-12-05 | Knowles Electronics, Llc | Directional audio capture |
| US9008329B1 (en) | 2010-01-26 | 2015-04-14 | Audience, Inc. | Noise reduction using multi-feature cluster tracker |
| US9142220B2 (en) | 2011-03-25 | 2015-09-22 | The Intellisis Corporation | Systems and methods for reconstructing an audio signal from transformed audio information |
| US8548803B2 (en) * | 2011-08-08 | 2013-10-01 | The Intellisis Corporation | System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain |
| US8620646B2 (en) | 2011-08-08 | 2013-12-31 | The Intellisis Corporation | System and method for tracking sound pitch across an audio signal using harmonic envelope |
| US9183850B2 (en) | 2011-08-08 | 2015-11-10 | The Intellisis Corporation | System and method for tracking sound pitch across an audio signal |
| US8781880B2 (en) | 2012-06-05 | 2014-07-15 | Rank Miner, Inc. | System, method and apparatus for voice analytics of recorded audio |
| US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
| US9280968B2 (en) * | 2013-10-04 | 2016-03-08 | At&T Intellectual Property I, L.P. | System and method of using neural transforms of robust audio features for speech processing |
| DE112015004185T5 (de) | 2014-09-12 | 2017-06-01 | Knowles Electronics, Llc | Systems and methods for restoring speech components |
| US9842611B2 (en) | 2015-02-06 | 2017-12-12 | Knuedge Incorporated | Estimating pitch using peak-to-peak distances |
| US9870785B2 (en) | 2015-02-06 | 2018-01-16 | Knuedge Incorporated | Determining features of harmonic signals |
| US9922668B2 (en) | 2015-02-06 | 2018-03-20 | Knuedge Incorporated | Estimating fractional chirp rate with multiple frequency representations |
| US9820042B1 (en) | 2016-05-02 | 2017-11-14 | Knowles Electronics, Llc | Stereo separation and directional suppression with omni-directional microphones |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4300229A (en) * | 1979-02-21 | 1981-11-10 | Nippon Electric Co., Ltd. | Transmitter and receiver for an othogonally multiplexed QAM signal of a sampling rate N times that of PAM signals, comprising an N/2-point offset fourier transform processor |
| US4221934A (en) * | 1979-05-11 | 1980-09-09 | Rca Corporation | Compandor for group of FDM signals |
| GB8307702D0 (en) * | 1983-03-21 | 1983-04-27 | British Telecomm | Digital band-split filter means |
| NL8400677A (nl) * | 1984-03-02 | 1985-10-01 | Philips Nv | Transmission system for the transmission of data signals in a modulation band |
-
2001
- 2001-06-15 US US09/882,744 patent/US6493668B1/en not_active Expired - Lifetime
-
2002
- 2002-06-14 WO PCT/US2002/019182 patent/WO2002103676A1/en not_active Ceased
- 2002-06-14 AT AT02744395T patent/ATE421137T1/de not_active IP Right Cessation
- 2002-06-14 JP JP2003505912A patent/JP4177755B2/ja not_active Expired - Fee Related
- 2002-06-14 CA CA002450230A patent/CA2450230A1/en not_active Abandoned
- 2002-06-14 DE DE60230871T patent/DE60230871D1/de not_active Expired - Lifetime
- 2002-06-14 US US10/173,247 patent/US7013274B2/en not_active Expired - Lifetime
- 2002-06-14 EP EP02744395A patent/EP1402517B1/en not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| US20020198711A1 (en) | 2002-12-26 |
| US20030014245A1 (en) | 2003-01-16 |
| EP1402517B1 (en) | 2009-01-14 |
| CA2450230A1 (en) | 2002-12-27 |
| ATE421137T1 (de) | 2009-01-15 |
| EP1402517A4 (en) | 2007-04-25 |
| WO2002103676A1 (en) | 2002-12-27 |
| US7013274B2 (en) | 2006-03-14 |
| JP2004531767A (ja) | 2004-10-14 |
| US6493668B1 (en) | 2002-12-10 |
| EP1402517A1 (en) | 2004-03-31 |
| DE60230871D1 (de) | 2009-03-05 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| JP4177755B2 (ja) | Speech feature extraction system | |
| JP2004531767A5 | | |
| US6804643B1 (en) | Speech recognition | |
| Sailor et al. | Auditory Filterbank Learning for Temporal Modulation Features in Replay Spoof Speech Detection. | |
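| CN112382300A (zh) | Voiceprint identification method, model training method, apparatus, device, and storage medium | |
| JP7184236B2 (ja) | Voiceprint recognition method, apparatus, equipment, and storage medium | |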
| US5806022A (en) | Method and system for performing speech recognition | |
| Kim et al. | Nonlinear enhancement of onset for robust speech recognition. | |
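| CN110767238B (zh) | Blacklist identification method, apparatus, device, and storage medium based on address information | |
| KR100571427B1 (ko) | Feature vector extraction apparatus and decorrelation filtering method for speech recognition in noisy environments | |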
| Maazouzi et al. | MFCC and similarity measurements for speaker identification systems | |
| JP3916834B2 (ja) | Method for extracting the fundamental period or fundamental frequency of a periodic waveform with added noise | |
| Rosell | An introduction to front-end processing and acoustic features for automatic speech recognition | |
| JPS6229799B2 | | |
| Niyozmatova et al. | Development Software for Preprocessing Voice Signals | |
| Nikhil et al. | Impact of ERB and bark scales on perceptual distortion based near-end speech enhancement | |
| Lalitha et al. | An encapsulation of vital non-linear frequency features for various speech applications | |
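| KR100381372B1 (ko) | Speech feature extraction apparatus | |
| JPH03122699A (ja) | Noise removal device and speech recognition device using the same | |
| KR100563316B1 (ko) | Method and apparatus for generating speaker feature vectors using complementary feature vectors | |
| CN117079666A (zh) | Song scoring method, apparatus, terminal device, and storage medium | |
| JP4014374B2 (ja) | Speech analysis method | |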
| Lakshmi | on Speech Enhancement Using Neural | |
| JP2006084659A (ja) | Audio signal analysis method, speech recognition method using that method, apparatus therefor, program, and recording medium therefor | |
| Kalamani et al. | Comparison of cepstral and mel frequency cepstral coefficients for various clean and noisy speech signals |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 2005-06-13 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
| 2007-10-31 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
| 2008-01-30 | A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 |
| 2008-02-06 | A602 | Written permission of extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A602 |
| 2008-02-27 | A524 | Written submission of copy of amendment under article 19 pct | Free format text: JAPANESE INTERMEDIATE CODE: A524 |
| 2008-04-02 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
| 2008-06-03 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
| | TRDD | Decision of grant or rejection written | |
| 2008-08-01 | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
| | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
| 2008-08-22 | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 |
| | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20110829; Year of fee payment: 3 |
| | R150 | Certificate of patent or registration of utility model | Free format text: JAPANESE INTERMEDIATE CODE: R150 |
| | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20110829; Year of fee payment: 3 |
| | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20120829; Year of fee payment: 4 |
| | FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20130829; Year of fee payment: 5 |
| | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
| | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
| | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
| | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
| | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
| | LAPS | Cancellation because of no payment of annual fees | |