EP1383112A3 - Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames - Google Patents

Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames Download PDF

Info

Publication number
EP1383112A3
EP1383112A3 EP03291748A EP03291748A EP1383112A3 EP 1383112 A3 EP1383112 A3 EP 1383112A3 EP 03291748 A EP03291748 A EP 03291748A EP 03291748 A EP03291748 A EP 03291748A EP 1383112 A3 EP1383112 A3 EP 1383112A3
Authority
EP
European Patent Office
Prior art keywords
term
excitation
long
short
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP03291748A
Other languages
German (de)
French (fr)
Other versions
EP1383112A2 (en
Inventor
Michael Ansorge
Giuseppina Biunedo Lotito
Benito Carnero
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
STMicroelectronics NV
Original Assignee
STMicroelectronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from EP02015920A external-priority patent/EP1383110A1/en
Application filed by STMicroelectronics NV filed Critical STMicroelectronics NV
Priority to EP03291748A priority Critical patent/EP1383112A3/en
Publication of EP1383112A2 publication Critical patent/EP1383112A2/en
Publication of EP1383112A3 publication Critical patent/EP1383112A3/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/083Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being an excitation gain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

On échantillonne la parole de façon à obtenir des trames vocales successives comportant chacune un nombre prédéterminé d'échantillons, et à chaque trame vocale on détermine des paramètres d'un modèle de prédiction linéaire à excitation par code, ces paramètres comportant un mot numérique d'excitation à long terme (vi) extrait d'un répertoire codé adaptatif (DLT) et un gain à long terme associé (Ga), ainsi qu'un mot d'excitation à court terme (cj) extrait d'un répertoire codé fixe (DCT) en utilisant un filtrage numérique de prédiction linéaire (FP), et un gain à court terme associé (Gc). On met à jour le répertoire codé adaptatif à partir du mot d'excitation à long terme extrait et du mot d'excitation à court terme extrait, et on met à jour l'état du filtre de prédiction linéaire (FP) avec le mot d'excitation à court terme filtré par un filtre (FLT1) d'ordre supérieur ou égal à 1 dont les coefficients dépendent de la valeur du gain à long terme, de façon à affaiblir la contribution de l'excitation à court terme lorsque le gain de l'excitation à long terme est supérieur à un seuil prédéterminé.

Figure imgaf001
The speech is sampled so as to obtain successive speech frames each comprising a predetermined number of samples, and at each speech frame, parameters of a code-excited linear prediction model are determined, these parameters comprising a digital word of long-term excitation (v i ) extract from an adaptive codebook (DLT) and associated long-term gain (Ga), as well as a short-term excitation word (cj) extracted from a fixed codebook (DCT) using linear prediction (FP) filtering, and associated short-term gain (Gc). The adaptive codebook is updated from the extracted long term excitation word and the extracted short term excitation word, and the state of the linear prediction (FP) filter is updated with the word d short-term excitation filtered by a filter (FLT1) of order greater than or equal to 1 whose coefficients depend on the value of the long-term gain, so as to weaken the contribution of the excitation in the short term when the gain of the long-term excitation is greater than a predetermined threshold.
Figure imgaf001

EP03291748A 2002-07-17 2003-07-15 Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames Withdrawn EP1383112A3 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP03291748A EP1383112A3 (en) 2002-07-17 2003-07-15 Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP02015920 2002-07-17
EP02015920A EP1383110A1 (en) 2002-07-17 2002-07-17 Method and device for wide band speech coding, particularly allowing for an improved quality of voised speech frames
EP03291748A EP1383112A3 (en) 2002-07-17 2003-07-15 Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames

Publications (2)

Publication Number Publication Date
EP1383112A2 EP1383112A2 (en) 2004-01-21
EP1383112A3 true EP1383112A3 (en) 2008-08-20

Family

ID=29781470

Family Applications (1)

Application Number Title Priority Date Filing Date
EP03291748A Withdrawn EP1383112A3 (en) 2002-07-17 2003-07-15 Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames

Country Status (1)

Country Link
EP (1) EP1383112A3 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0593255A1 (en) * 1992-10-12 1994-04-20 Nec Corporation An arrangement for demodulating speech signals discontinuously transmitted from a mobile unit
US6148282A (en) * 1997-01-02 2000-11-14 Texas Instruments Incorporated Multimodal code-excited linear prediction (CELP) coder and method using peakiness measure
WO2002023534A2 (en) * 2000-09-15 2002-03-21 Conexant Systems, Inc. Selection of coding parameters based on spectral content of a speech signal
US6385573B1 (en) * 1998-08-24 2002-05-07 Conexant Systems, Inc. Adaptive tilt compensation for synthesized speech residual

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0593255A1 (en) * 1992-10-12 1994-04-20 Nec Corporation An arrangement for demodulating speech signals discontinuously transmitted from a mobile unit
US6148282A (en) * 1997-01-02 2000-11-14 Texas Instruments Incorporated Multimodal code-excited linear prediction (CELP) coder and method using peakiness measure
US6385573B1 (en) * 1998-08-24 2002-05-07 Conexant Systems, Inc. Adaptive tilt compensation for synthesized speech residual
WO2002023534A2 (en) * 2000-09-15 2002-03-21 Conexant Systems, Inc. Selection of coding parameters based on spectral content of a speech signal

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
REDWAN SALAMI ET AL: "Design and Description of CS-ACELP: A Toll Quality 8 kb/s Speech Coder", IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, IEEE SERVICE CENTER, NEW YORK, NY, US, vol. 6, no. 2, 1 March 1998 (1998-03-01), XP011054298, ISSN: 1063-6676 *

Also Published As

Publication number Publication date
EP1383112A2 (en) 2004-01-21

Similar Documents

Publication Publication Date Title
DE60121405T2 (en) Transcoder to avoid cascade coding of speech signals
DE69900786T2 (en) VOICE CODING
EP2535893B1 (en) Device and method for lost frame concealment
DE60006271T2 (en) CELP VOICE ENCODING WITH VARIABLE BITRATE BY MEANS OF PHONETIC CLASSIFICATION
US7472059B2 (en) Method and apparatus for robust speech classification
EP1886306B1 (en) Redundant audio bit stream and audio bit stream processing methods
DE60011051T2 (en) CELP TRANS CODING
CA2343661C (en) Method and apparatus for improving the intelligibility of digitally compressed speech
US5018200A (en) Communication system capable of improving a speech quality by classifying speech signals
DE602004007786D1 (en) METHOD AND DEVICE FOR QUANTIZING THE GAIN FACTOR IN A VARIABLE BITRATE BROADBAND LANGUAGE CODIER
ES2380962T3 (en) Procedure and apparatus for coding low transmission rate of high performance deaf speech bits
DE60219351D1 (en) SIGNAL MODIFICATION METHOD FOR EFFICIENT CODING OF LANGUAGE SIGNALS
JP2002530705A (en) Low bit rate coding of unvoiced segments of speech.
EP1420391A1 (en) Generalized analysis-by-synthesis speech coding method, and coder implementing such method
FR2784218A1 (en) LOW-SPEED SPEECH CODING METHOD
Wang et al. Suppression by selecting wavelets for feature compression in distributed speech recognition
CN1184548A (en) Predictive split-matrix quantization of spectral parameters for efficient coding of speech
EP1383112A3 (en) Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames
Tsau et al. Environmental sound recognition with CELP-based features
Park et al. Analysis of confidence and control through voice of Kim Jung-un
AU687193B2 (en) A pitch post-filter
DE68917552T2 (en) Method and device for coding and decoding speech signals using multipulse excitation.
KR100550003B1 (en) Open-loop pitch estimation method in transcoder and apparatus thereof
US20050075867A1 (en) Method and device for encoding wideband speech
DE60025471T2 (en) METHOD AND DEVICE FOR FOLLOWING THE PHASE OF A FAST PERIODIC SIGNAL

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL LT LV MK

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL LT LV MK

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 19/12 20060101ALN20080715BHEP

Ipc: G10L 19/06 20060101AFI20080715BHEP

AKX Designation fees paid
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20090203

REG Reference to a national code

Ref country code: DE

Ref legal event code: 8566