AU2001277647A1 - Method for noise robust classification in speech coding - Google Patents

Method for noise robust classification in speech coding

Info

Publication number
AU2001277647A1
AU2001277647A1 AU2001277647A AU7764701A AU2001277647A1 AU 2001277647 A1 AU2001277647 A1 AU 2001277647A1 AU 2001277647 A AU2001277647 A AU 2001277647A AU 7764701 A AU7764701 A AU 7764701A AU 2001277647 A1 AU2001277647 A1 AU 2001277647A1
Authority
AU
Australia
Prior art keywords
speech
noise
parameters
classification
frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU2001277647A
Inventor
Jens Thyssen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Conexant Systems LLC
Original Assignee
Conexant Systems LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Conexant Systems LLC filed Critical Conexant Systems LLC
Publication of AU2001277647A1 publication Critical patent/AU2001277647A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02168Noise filtering characterised by the method used for estimating noise the estimation exclusively taking place during speech pauses
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision

Abstract

A method for robust speech classification in speech coding and, in particular, for robust classification in the presence of background noise is herein provided. A noise-free set of parameters is derived, thereby reducing the adverse effects of background noise on the classification process. The speech signal is identified as speech or non-speech. A set of basic parameters is derived for the speech frame, then the noise component of the parameters is estimated and removed. If the frame is non-speech, the noise estimations are updated. All the parameters are then compared against a predetermined set of thresholds. Because the background noise has been removed from the parameters, the set of thresholds is largely unaffected by any changes in the noise. The frame is classified into any number of classes, thereby emphasizing the perceptually important features by performing perceptual matching rather than waveform matching.
AU2001277647A 2000-08-21 2001-08-17 Method for noise robust classification in speech coding Abandoned AU2001277647A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US09/643,017 US6983242B1 (en) 2000-08-21 2000-08-21 Method for robust classification in speech coding
US09643017 2000-08-21
PCT/IB2001/001490 WO2002017299A1 (en) 2000-08-21 2001-08-17 Method for noise robust classification in speech coding

Publications (1)

Publication Number Publication Date
AU2001277647A1 true AU2001277647A1 (en) 2002-03-04

Family

ID=24579015

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2001277647A Abandoned AU2001277647A1 (en) 2000-08-21 2001-08-17 Method for noise robust classification in speech coding

Country Status (8)

Country Link
US (1) US6983242B1 (en)
EP (1) EP1312075B1 (en)
JP (2) JP2004511003A (en)
CN (2) CN1302460C (en)
AT (1) ATE319160T1 (en)
AU (1) AU2001277647A1 (en)
DE (1) DE60117558T2 (en)
WO (1) WO2002017299A1 (en)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4178319B2 (en) * 2002-09-13 2008-11-12 インターナショナル・ビジネス・マシーンズ・コーポレーション Phase alignment in speech processing
US7698132B2 (en) * 2002-12-17 2010-04-13 Qualcomm Incorporated Sub-sampled excitation waveform codebooks
GB0321093D0 (en) * 2003-09-09 2003-10-08 Nokia Corp Multi-rate coding
KR101008022B1 (en) * 2004-02-10 2011-01-14 삼성전자주식회사 Voiced sound and unvoiced sound detection method and apparatus
KR100735246B1 (en) * 2005-09-12 2007-07-03 삼성전자주식회사 Apparatus and method for transmitting audio signal
CN100483509C (en) * 2006-12-05 2009-04-29 华为技术有限公司 Aural signal classification method and device
CN101197130B (en) * 2006-12-07 2011-05-18 华为技术有限公司 Sound activity detecting method and detector thereof
EP2118892B1 (en) * 2007-02-12 2010-07-14 Dolby Laboratories Licensing Corporation Improved ratio of speech to non-speech audio such as for elderly or hearing-impaired listeners
KR100930584B1 (en) * 2007-09-19 2009-12-09 한국전자통신연구원 Speech discrimination method and apparatus using voiced sound features of human speech
JP5377167B2 (en) * 2009-09-03 2013-12-25 株式会社レイトロン Scream detection device and scream detection method
ES2371619B1 (en) * 2009-10-08 2012-08-08 Telefónica, S.A. VOICE SEGMENT DETECTION PROCEDURE.
CN102714034B (en) * 2009-10-15 2014-06-04 华为技术有限公司 Signal processing method, device and system
CN102467669B (en) * 2010-11-17 2015-11-25 北京北大千方科技有限公司 Method and equipment for improving matching precision in laser detection
EP2702585B1 (en) 2011-04-28 2014-12-31 Telefonaktiebolaget LM Ericsson (PUBL) Frame based audio signal classification
US8990074B2 (en) * 2011-05-24 2015-03-24 Qualcomm Incorporated Noise-robust speech coding mode classification
CN102314884B (en) * 2011-08-16 2013-01-02 捷思锐科技(北京)有限公司 Voice-activation detecting method and device
CN103177728B (en) * 2011-12-21 2015-07-29 中国移动通信集团广西有限公司 Voice signal denoise processing method and device
KR20150032390A (en) * 2013-09-16 2015-03-26 삼성전자주식회사 Speech signal process apparatus and method for enhancing speech intelligibility
US9886963B2 (en) * 2015-04-05 2018-02-06 Qualcomm Incorporated Encoder selection
CN113571036B (en) * 2021-06-18 2023-08-18 上海淇玥信息技术有限公司 Automatic synthesis method and device for low-quality data and electronic equipment

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB8911153D0 (en) * 1989-05-16 1989-09-20 Smiths Industries Plc Speech recognition apparatus and methods
US5491771A (en) * 1993-03-26 1996-02-13 Hughes Aircraft Company Real-time implementation of a 8Kbps CELP coder on a DSP pair
US5459814A (en) * 1993-03-26 1995-10-17 Hughes Aircraft Company Voice activity detector for speech signals in variable background noise
CA2136891A1 (en) * 1993-12-20 1995-06-21 Kalyan Ganesan Removal of swirl artifacts from celp based speech coders
JP2897628B2 (en) * 1993-12-24 1999-05-31 三菱電機株式会社 Voice detector
JPH11514453A (en) * 1995-09-14 1999-12-07 エリクソン インコーポレイテッド A system for adaptively filtering audio signals to enhance speech intelligibility in noisy environmental conditions
JPH09152894A (en) * 1995-11-30 1997-06-10 Denso Corp Sound and silence discriminator
SE506034C2 (en) * 1996-02-01 1997-11-03 Ericsson Telefon Ab L M Method and apparatus for improving parameters representing noise speech
JPH1020891A (en) * 1996-07-09 1998-01-23 Sony Corp Method for encoding speech and device therefor
JPH10124097A (en) * 1996-10-21 1998-05-15 Olympus Optical Co Ltd Voice recording and reproducing device
US6233550B1 (en) * 1997-08-29 2001-05-15 The Regents Of The University Of California Method and apparatus for hybrid coding of speech at 4kbps
WO1999012155A1 (en) * 1997-09-30 1999-03-11 Qualcomm Incorporated Channel gain modification system and method for noise reduction in voice communication
US6453289B1 (en) * 1998-07-24 2002-09-17 Hughes Electronics Corporation Method of noise reduction for speech codecs
US6240386B1 (en) * 1998-08-24 2001-05-29 Conexant Systems, Inc. Speech codec employing noise classification for noise compensation
US6636829B1 (en) * 1999-09-22 2003-10-21 Mindspeed Technologies, Inc. Speech communication system and method for handling lost frames

Also Published As

Publication number Publication date
WO2002017299A1 (en) 2002-02-28
DE60117558D1 (en) 2006-04-27
DE60117558T2 (en) 2006-08-10
ATE319160T1 (en) 2006-03-15
CN1447963A (en) 2003-10-08
JP2008058983A (en) 2008-03-13
CN1302460C (en) 2007-02-28
CN1624766A (en) 2005-06-08
JP2004511003A (en) 2004-04-08
EP1312075B1 (en) 2006-03-01
CN1210685C (en) 2005-07-13
EP1312075A1 (en) 2003-05-21
US6983242B1 (en) 2006-01-03

Similar Documents

Publication Publication Date Title
AU2001277647A1 (en) Method for noise robust classification in speech coding
US6889186B1 (en) Method and apparatus for improving the intelligibility of digitally compressed speech
CA2382175A1 (en) Noisy acoustic signal enhancement
WO2005055197A3 (en) Noise suppressor for speech coding and speech recognition
DE602004022862D1 (en) METHOD AND DEVICE FOR LANGUAGE IMPROVEMENT IN THE PRESENCE OF BACKGROUND NOISE
ATE267443T1 (en) DEVICE FOR VOICE DETECTION IN AMBIENT NOISE
WO2006019556A3 (en) Low-complexity music detection algorithm and system
EP1635329A3 (en) System, method and program for sound source classification
DE69804310D1 (en) METHOD AND DEVICE FOR IMPROVING LANGUAGE IN A LANGUAGE TRANSMISSION SYSTEM
WO2002103695A3 (en) Device and method for embedding a watermark in an audio signal
EP1854095A1 (en) Speech analyzing system with adaptive noise codebook
WO2002073601B1 (en) Method and device for determining the quality of a speech signal
DE60034772T2 (en) REJECTION PROCEDURE IN LANGUAGE IDENTIFICATION
EP0780828A3 (en) Method and system for performing speech recognition
EP1533791A3 (en) Voice/unvoice determination and dialogue enhancement
Ishizuka et al. Study of noise robust voice activity detection based on periodic component to aperiodic component ratio.
SE470577B (en) Method and apparatus for encoding and / or decoding background noise
US20010044714A1 (en) Method of estimating the pitch of a speech signal using an average distance between peaks, use of the method, and a device adapted therefor
CA2483607A1 (en) Syllabic nuclei extracting apparatus and program product thereof
EP1343146A3 (en) Audio signal processing based on a perceptual model
Taboada et al. Explicit estimation of speech boundaries
KR100434538B1 (en) Detection apparatus and method for transitional region of speech and speech synthesis method for transitional region
AU2003247079A8 (en) Obtaining configuration data for a data processing apparatus
Babu et al. Transient detection for transform domain coders
Yoon et al. Speech enhancement based on speech/noise-dominant decision