EP1930881A3 - Speech decoder employing noise compensation - Google Patents

Speech decoder employing noise compensation Download PDF

Info

Publication number
EP1930881A3
EP1930881A3 EP08152711A EP08152711A EP1930881A3 EP 1930881 A3 EP1930881 A3 EP 1930881A3 EP 08152711 A EP08152711 A EP 08152711A EP 08152711 A EP08152711 A EP 08152711A EP 1930881 A3 EP1930881 A3 EP 1930881A3
Authority
EP
European Patent Office
Prior art keywords
speech
voice
encoder
bit rate
decoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP08152711A
Other languages
German (de)
French (fr)
Other versions
EP1930881A2 (en
Inventor
Jes Thyssen
Huan-Yu Su
Yang Gao
Adil Benyassine
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics Co Ltd
Original Assignee
Mindspeed Technologies LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/154,662 external-priority patent/US6493665B1/en
Priority claimed from US09/156,832 external-priority patent/US6823303B1/en
Priority claimed from US09/198,414 external-priority patent/US6240386B1/en
Priority to EP09152359A priority Critical patent/EP2088587A1/en
Priority to EP10180379A priority patent/EP2259255A1/en
Priority to EP09152360A priority patent/EP2085966A1/en
Priority to EP09152354A priority patent/EP2088584A1/en
Application filed by Mindspeed Technologies LLC filed Critical Mindspeed Technologies LLC
Priority to EP09152357A priority patent/EP2088586A1/en
Priority to EP09152356A priority patent/EP2088585A1/en
Publication of EP1930881A2 publication Critical patent/EP1930881A2/en
Publication of EP1930881A3 publication Critical patent/EP1930881A3/en
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

A multi-rate speech codec supports a plurality of encoding bit rate modes by adaptively selecting encoding bit rate modes to match communication channel restrictions. In higher bit rate encoding modes, an accurate representation of speech through CELP (code excited linear prediction) and other associated modeling parameters are generated for higher quality decoding and reproduction. For each bit rate mode selected, pluralities of fixed or innovation subcodebooks are selected for use in generating innovation vectors. The speech coder distinguishes various voice signals as a function of their voice content. For example, a Voice Activity Detection (VAD) algorithm selects an appropriate coding scheme depending on whether the speech signal comprises active or inactive speech. The encoder may consider varying characteristics of the speech signal including sharpness, a delay correlation, a zero-crossing rate, and a residual energy. In another embodiment of the present invention, code excited linear prediction is used for voice active signals whereas random excitation is used for voice inactive signals; the energy level and spectral content of the voice inactive signal may also be used for noise coding. The multi-rate speech codec may employ distributed detection and compensation processing the speech signal. For high quality perceptual speech reproduction, the speech codec may perform noise detection in both an encoder and decoder. The noise detection may be coordinated between the encoder and decoder. Similarly, noise compensation may be performed in a distributed manner among both the decoder and the encoder.
EP08152711A 1998-08-24 1999-08-24 Speech decoder employing noise compensation Ceased EP1930881A3 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
EP09152356A EP2088585A1 (en) 1998-08-24 1999-08-24 Gain smoothing for speech coding
EP09152357A EP2088586A1 (en) 1998-08-24 1999-08-24 Adaptive codebook gain control for speech coding
EP09152359A EP2088587A1 (en) 1998-08-24 1999-08-24 Open-loop pitch processing for speech coding
EP10180379A EP2259255A1 (en) 1998-08-24 1999-08-24 Speech encoding method and system
EP09152360A EP2085966A1 (en) 1998-08-24 1999-08-24 Selection of scalar quantization(SQ) and vector quantization (VQ) for speech coding
EP09152354A EP2088584A1 (en) 1998-08-24 1999-08-24 Codebook sharing for LSF quantization

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US9756998P 1998-08-24 1998-08-24
US15465798A 1998-09-18 1998-09-18
US09/154,662 US6493665B1 (en) 1998-08-24 1998-09-18 Speech classification and parameter weighting used in codebook search
US09/156,832 US6823303B1 (en) 1998-08-24 1998-09-18 Speech encoder using voice activity detection in coding noise
US09/198,414 US6240386B1 (en) 1998-08-24 1998-11-24 Speech codec employing noise classification for noise compensation
EP99946655A EP1110209B1 (en) 1998-08-24 1999-08-24 Spectrum smoothing for speech coding

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
EP99946655A Division EP1110209B1 (en) 1998-08-24 1999-08-24 Spectrum smoothing for speech coding

Related Child Applications (5)

Application Number Title Priority Date Filing Date
EP09152359A Division EP2088587A1 (en) 1998-08-24 1999-08-24 Open-loop pitch processing for speech coding
EP09152357A Division EP2088586A1 (en) 1998-08-24 1999-08-24 Adaptive codebook gain control for speech coding
EP09152354A Division EP2088584A1 (en) 1998-08-24 1999-08-24 Codebook sharing for LSF quantization
EP09152356A Division EP2088585A1 (en) 1998-08-24 1999-08-24 Gain smoothing for speech coding
EP09152360A Division EP2085966A1 (en) 1998-08-24 1999-08-24 Selection of scalar quantization(SQ) and vector quantization (VQ) for speech coding

Publications (2)

Publication Number Publication Date
EP1930881A2 EP1930881A2 (en) 2008-06-11
EP1930881A3 true EP1930881A3 (en) 2008-09-17

Family

ID=39362821

Family Applications (1)

Application Number Title Priority Date Filing Date
EP08152711A Ceased EP1930881A3 (en) 1998-08-24 1999-08-24 Speech decoder employing noise compensation

Country Status (1)

Country Link
EP (1) EP1930881A3 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113630120B (en) * 2021-03-31 2024-08-09 中山大学 Zero delay communication method combined with 1-bit analog-to-digital converter and application thereof
CN113598734A (en) * 2021-07-28 2021-11-05 厦门大学 Cuff-free blood pressure prediction method based on deep neural network model
CN117423348B (en) * 2023-12-19 2024-04-02 山东省计算中心(国家超级计算济南中心) Speech compression method and system based on deep learning and vector prediction

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
TANIGUCHI T ET AL: "Enhancement of VSELP Coded Speech under Background Noise", 19950920; 19950920 - 19950922, 20 September 1995 (1995-09-20), pages 67 - 68, XP010269480 *

Also Published As

Publication number Publication date
EP1930881A2 (en) 2008-06-11

Similar Documents

Publication Publication Date Title
CA2341712A1 (en) Speech codec employing speech classification for noise compensation
KR100711280B1 (en) Methods and devices for source controlled variable bit-rate wideband speech coding
JP4851578B2 (en) Method and apparatus for performing reduced rate, variable rate speech analysis synthesis
JP4518714B2 (en) Speech code conversion method
EP0848374B1 (en) A method and a device for speech encoding
EP0785541B1 (en) Usage of voice activity detection for efficient coding of speech
US20020035470A1 (en) Speech coding system with time-domain noise attenuation
KR100488080B1 (en) Multimode speech encoder
JP2009134303A (en) Voice decoding method and device
KR20030046451A (en) Codebook structure and search for speech coding
KR20010024935A (en) Speech coding
JP2002518694A (en) Audio encoding device and audio decoding device
KR100421648B1 (en) An adaptive criterion for speech coding
EP0865027B1 (en) Method for coding the random component vector in an ACELP coder
WO2000025301A1 (en) Method and arrangement for providing comfort noise in communications systems
KR100561018B1 (en) Sound encoding apparatus and method, and sound decoding apparatus and method
US6980948B2 (en) System of dynamic pulse position tracks for pulse-like excitation in speech coding
Jayant et al. Speech coding with time-varying bit allocations to excitation and LPC parameters
EP1930881A3 (en) Speech decoder employing noise compensation
US20030055633A1 (en) Method and device for coding speech in analysis-by-synthesis speech coders
JP4985743B2 (en) Speech code conversion method
EP1808852A1 (en) Method of interoperation between adaptive multi-rate wideband (AMR-WB) and multi-mode variable bit-rate wideband (VMR-WB) codecs
Sluijter et al. State of the art and trends in speech coding
Woodard et al. A Range of Low and High Delay CELP Speech Codecs between 8 and 4 kbits/s
Xinfu et al. AMR vocoder and its multi-channel implementation based on a single DSP chip

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AC Divisional application: reference to earlier application

Ref document number: 1110209

Country of ref document: EP

Kind code of ref document: P

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): DE FI FR GB SE

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): DE FI FR GB SE

REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1113431

Country of ref document: HK

17P Request for examination filed

Effective date: 20090216

17Q First examination report despatched

Effective date: 20090320

AKX Designation fees paid

Designated state(s): DE FI FR GB SE

APBK Appeal reference recorded

Free format text: ORIGINAL CODE: EPIDOSNREFNE

APBN Date of receipt of notice of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNNOA2E

APAK Date of receipt of statement of grounds of an appeal modified

Free format text: ORIGINAL CODE: EPIDOSCNOA3E

APBR Date of receipt of statement of grounds of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNNOA3E

APAF Appeal reference modified

Free format text: ORIGINAL CODE: EPIDOSCREFNE

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: WIAV SOLUTIONS LLC

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: SAMSUNG ELECTRONICS CO., LTD.

APBT Appeal procedure closed

Free format text: ORIGINAL CODE: EPIDOSNNOA9E

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20160923

REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1113431

Country of ref document: HK