EP1383112A3 - Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames - Google Patents
Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames Download PDFInfo
- Publication number
- EP1383112A3 EP1383112A3 EP03291748A EP03291748A EP1383112A3 EP 1383112 A3 EP1383112 A3 EP 1383112A3 EP 03291748 A EP03291748 A EP 03291748A EP 03291748 A EP03291748 A EP 03291748A EP 1383112 A3 EP1383112 A3 EP 1383112A3
- Authority
- EP
- European Patent Office
- Prior art keywords
- term
- excitation
- long
- short
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000005284 excitation Effects 0.000 abstract 7
- 230000007774 longterm Effects 0.000 abstract 5
- 230000003044 adaptive effect Effects 0.000 abstract 2
- 238000001914 filtration Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/083—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being an excitation gain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
On échantillonne la parole de façon à obtenir des trames vocales successives comportant chacune un nombre prédéterminé d'échantillons, et à chaque trame vocale on détermine des paramètres d'un modèle de prédiction linéaire à excitation par code, ces paramètres comportant un mot numérique d'excitation à long terme (vi) extrait d'un répertoire codé adaptatif (DLT) et un gain à long terme associé (Ga), ainsi qu'un mot d'excitation à court terme (cj) extrait d'un répertoire codé fixe (DCT) en utilisant un filtrage numérique de prédiction linéaire (FP), et un gain à court terme associé (Gc). On met à jour le répertoire codé adaptatif à partir du mot d'excitation à long terme extrait et du mot d'excitation à court terme extrait, et on met à jour l'état du filtre de prédiction linéaire (FP) avec le mot d'excitation à court terme filtré par un filtre (FLT1) d'ordre supérieur ou égal à 1 dont les coefficients dépendent de la valeur du gain à long terme, de façon à affaiblir la contribution de l'excitation à court terme lorsque le gain de l'excitation à long terme est supérieur à un seuil prédéterminé. The speech is sampled so as to obtain successive speech frames each comprising a predetermined number of samples, and at each speech frame, parameters of a code-excited linear prediction model are determined, these parameters comprising a digital word of long-term excitation (v i ) extract from an adaptive codebook (DLT) and associated long-term gain (Ga), as well as a short-term excitation word (cj) extracted from a fixed codebook (DCT) using linear prediction (FP) filtering, and associated short-term gain (Gc). The adaptive codebook is updated from the extracted long term excitation word and the extracted short term excitation word, and the state of the linear prediction (FP) filter is updated with the word d short-term excitation filtered by a filter (FLT1) of order greater than or equal to 1 whose coefficients depend on the value of the long-term gain, so as to weaken the contribution of the excitation in the short term when the gain of the long-term excitation is greater than a predetermined threshold.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP03291748A EP1383112A3 (en) | 2002-07-17 | 2003-07-15 | Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP02015920 | 2002-07-17 | ||
EP02015920A EP1383110A1 (en) | 2002-07-17 | 2002-07-17 | Method and device for wide band speech coding, particularly allowing for an improved quality of voised speech frames |
EP03291748A EP1383112A3 (en) | 2002-07-17 | 2003-07-15 | Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1383112A2 EP1383112A2 (en) | 2004-01-21 |
EP1383112A3 true EP1383112A3 (en) | 2008-08-20 |
Family
ID=29781470
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP03291748A Withdrawn EP1383112A3 (en) | 2002-07-17 | 2003-07-15 | Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames |
Country Status (1)
Country | Link |
---|---|
EP (1) | EP1383112A3 (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0593255A1 (en) * | 1992-10-12 | 1994-04-20 | Nec Corporation | An arrangement for demodulating speech signals discontinuously transmitted from a mobile unit |
US6148282A (en) * | 1997-01-02 | 2000-11-14 | Texas Instruments Incorporated | Multimodal code-excited linear prediction (CELP) coder and method using peakiness measure |
WO2002023534A2 (en) * | 2000-09-15 | 2002-03-21 | Conexant Systems, Inc. | Selection of coding parameters based on spectral content of a speech signal |
US6385573B1 (en) * | 1998-08-24 | 2002-05-07 | Conexant Systems, Inc. | Adaptive tilt compensation for synthesized speech residual |
-
2003
- 2003-07-15 EP EP03291748A patent/EP1383112A3/en not_active Withdrawn
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0593255A1 (en) * | 1992-10-12 | 1994-04-20 | Nec Corporation | An arrangement for demodulating speech signals discontinuously transmitted from a mobile unit |
US6148282A (en) * | 1997-01-02 | 2000-11-14 | Texas Instruments Incorporated | Multimodal code-excited linear prediction (CELP) coder and method using peakiness measure |
US6385573B1 (en) * | 1998-08-24 | 2002-05-07 | Conexant Systems, Inc. | Adaptive tilt compensation for synthesized speech residual |
WO2002023534A2 (en) * | 2000-09-15 | 2002-03-21 | Conexant Systems, Inc. | Selection of coding parameters based on spectral content of a speech signal |
Non-Patent Citations (1)
Title |
---|
REDWAN SALAMI ET AL: "Design and Description of CS-ACELP: A Toll Quality 8 kb/s Speech Coder", IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, IEEE SERVICE CENTER, NEW YORK, NY, US, vol. 6, no. 2, 1 March 1998 (1998-03-01), XP011054298, ISSN: 1063-6676 * |
Also Published As
Publication number | Publication date |
---|---|
EP1383112A2 (en) | 2004-01-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60121405T2 (en) | Transcoder to avoid cascade coding of speech signals | |
DE69900786T2 (en) | VOICE CODING | |
EP2535893B1 (en) | Device and method for lost frame concealment | |
DE60006271T2 (en) | CELP VOICE ENCODING WITH VARIABLE BITRATE BY MEANS OF PHONETIC CLASSIFICATION | |
US7472059B2 (en) | Method and apparatus for robust speech classification | |
EP1886306B1 (en) | Redundant audio bit stream and audio bit stream processing methods | |
DE60011051T2 (en) | CELP TRANS CODING | |
CA2343661C (en) | Method and apparatus for improving the intelligibility of digitally compressed speech | |
US5018200A (en) | Communication system capable of improving a speech quality by classifying speech signals | |
DE602004007786D1 (en) | METHOD AND DEVICE FOR QUANTIZING THE GAIN FACTOR IN A VARIABLE BITRATE BROADBAND LANGUAGE CODIER | |
ES2380962T3 (en) | Procedure and apparatus for coding low transmission rate of high performance deaf speech bits | |
DE60219351D1 (en) | SIGNAL MODIFICATION METHOD FOR EFFICIENT CODING OF LANGUAGE SIGNALS | |
JP2002530705A (en) | Low bit rate coding of unvoiced segments of speech. | |
EP1420391A1 (en) | Generalized analysis-by-synthesis speech coding method, and coder implementing such method | |
FR2784218A1 (en) | LOW-SPEED SPEECH CODING METHOD | |
Wang et al. | Suppression by selecting wavelets for feature compression in distributed speech recognition | |
CN1184548A (en) | Predictive split-matrix quantization of spectral parameters for efficient coding of speech | |
EP1383112A3 (en) | Method and device for enlarged bandwidth speech coding, allowing in particular an improved quality of voiced frames | |
Tsau et al. | Environmental sound recognition with CELP-based features | |
Park et al. | Analysis of confidence and control through voice of Kim Jung-un | |
AU687193B2 (en) | A pitch post-filter | |
DE68917552T2 (en) | Method and device for coding and decoding speech signals using multipulse excitation. | |
KR100550003B1 (en) | Open-loop pitch estimation method in transcoder and apparatus thereof | |
US20050075867A1 (en) | Method and device for encoding wideband speech | |
DE60025471T2 (en) | METHOD AND DEVICE FOR FOLLOWING THE PHASE OF A FAST PERIODIC SIGNAL |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10L 19/12 20060101ALN20080715BHEP Ipc: G10L 19/06 20060101AFI20080715BHEP |
|
AKX | Designation fees paid | ||
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20090203 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: 8566 |