FR2869151A1 - Procede de quantification d'un codeur de parole a tres bas debit - Google Patents

Procede de quantification d'un codeur de parole a tres bas debit

Info

Publication number
FR2869151A1
FR2869151A1 FR0404105A FR0404105A FR2869151A1 FR 2869151 A1 FR2869151 A1 FR 2869151A1 FR 0404105 A FR0404105 A FR 0404105A FR 0404105 A FR0404105 A FR 0404105A FR 2869151 A1 FR2869151 A1 FR 2869151A1
Authority
FR
France
Prior art keywords
voicing
parameters
superframe
coding
quantifying
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
FR0404105A
Other languages
English (en)
Other versions
FR2869151B1 (fr
Inventor
Francois Capman
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Thales SA
Original Assignee
Thales SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to FR0404105A priority Critical patent/FR2869151B1/fr
Application filed by Thales SA filed Critical Thales SA
Priority to AT05733605T priority patent/ATE453909T1/de
Priority to EP05733605A priority patent/EP1756806B1/fr
Priority to PL05733605T priority patent/PL1756806T3/pl
Priority to PCT/EP2005/051661 priority patent/WO2005114653A1/fr
Priority to DE602005018637T priority patent/DE602005018637D1/de
Priority to CA2567162A priority patent/CA2567162C/fr
Priority to ES05733605T priority patent/ES2338801T3/es
Priority to US11/578,663 priority patent/US7716045B2/en
Publication of FR2869151A1 publication Critical patent/FR2869151A1/fr
Application granted granted Critical
Publication of FR2869151B1 publication Critical patent/FR2869151B1/fr
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/087Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters using mixed excitation models, e.g. MELP, MBE, split band LPC or HVXC
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Magnetic Resonance Imaging Apparatus (AREA)

Abstract

Procédé de codage et de décodage de la parole pour les communications vocales utilisant un vocodeur à très bas débit comportant une partie analyse pour le codage et la transmission des paramètres du signal de parole, tels que l'information de voisement par sous-bande, le pitch, les gains, les paramètres spectraux LSF et une partie synthèse pour la réception et le décodage des paramètres transmis et la reconstruction du signal de parole comportant au moins les étapes suivantes :• regrouper les paramètres voisement, pitch, gains, coefficients LSF sur N trames consécutives pour former une super-trame,• effectuer une quantification vectorielle de l'information de voisement au cours de chaque super-trame en élaborant une classification utilisant les informations sur l'enchaînement en termes de voisement existant sur 2 trames élémentaires consécutives, l'information de voisement permet en effet d'identifier des classes de sons pour lesquels l'allocation du débit et les dictionnaires associés seront optimisés,• coder le pitch, les gains et les coefficients LSF en utilisant la classification obtenue précédemment.
FR0404105A 2004-04-19 2004-04-19 Procede de quantification d'un codeur de parole a tres bas debit Expired - Fee Related FR2869151B1 (fr)

Priority Applications (9)

Application Number Priority Date Filing Date Title
FR0404105A FR2869151B1 (fr) 2004-04-19 2004-04-19 Procede de quantification d'un codeur de parole a tres bas debit
EP05733605A EP1756806B1 (fr) 2004-04-19 2005-04-14 Procede de quantification d'un codeur de parole a tres bas debit
PL05733605T PL1756806T3 (pl) 2004-04-19 2005-04-14 Sposób kwantyzacji kodera mowy o bardzo małej przepływności
PCT/EP2005/051661 WO2005114653A1 (fr) 2004-04-19 2005-04-14 Procede de quantification d'un codeur de parole a tres bas debit
AT05733605T ATE453909T1 (de) 2004-04-19 2005-04-14 Verfahren zum quantifizieren eines sprachcodierers mit ultraniedriger rate
DE602005018637T DE602005018637D1 (de) 2004-04-19 2005-04-14 Verfahren zum quantifizieren eines sprachcodierers mit ultraniedriger rate
CA2567162A CA2567162C (fr) 2004-04-19 2005-04-14 Procede de quantification d'un codeur de parole a tres bas debit
ES05733605T ES2338801T3 (es) 2004-04-19 2005-04-14 Procedimiento de cuantificacion de un codificador de palabra de flujo muy bajo.
US11/578,663 US7716045B2 (en) 2004-04-19 2005-04-14 Method for quantifying an ultra low-rate speech coder

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
FR0404105A FR2869151B1 (fr) 2004-04-19 2004-04-19 Procede de quantification d'un codeur de parole a tres bas debit

Publications (2)

Publication Number Publication Date
FR2869151A1 true FR2869151A1 (fr) 2005-10-21
FR2869151B1 FR2869151B1 (fr) 2007-01-26

Family

ID=34945858

Family Applications (1)

Application Number Title Priority Date Filing Date
FR0404105A Expired - Fee Related FR2869151B1 (fr) 2004-04-19 2004-04-19 Procede de quantification d'un codeur de parole a tres bas debit

Country Status (9)

Country Link
US (1) US7716045B2 (fr)
EP (1) EP1756806B1 (fr)
AT (1) ATE453909T1 (fr)
CA (1) CA2567162C (fr)
DE (1) DE602005018637D1 (fr)
ES (1) ES2338801T3 (fr)
FR (1) FR2869151B1 (fr)
PL (1) PL1756806T3 (fr)
WO (1) WO2005114653A1 (fr)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008092473A1 (fr) * 2007-01-31 2008-08-07 Telecom Italia S.P.A. Procédé et système personnalisables de reconnaissance d'émotions
PT2313887T (pt) * 2008-07-10 2017-11-14 Voiceage Corp Dispositivo e método de quantificação de filtro de lpc de taxa de bits variável e quantificação inversa
CN114333862B (zh) * 2021-11-10 2024-05-03 腾讯科技(深圳)有限公司 音频编码方法、解码方法、装置、设备、存储介质及产品

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1995010760A2 (fr) * 1993-10-08 1995-04-20 Comsat Corporation Codeurs vocaux a bas debit binaire ameliores et procedes pour leur utilisation
US6263307B1 (en) * 1995-04-19 2001-07-17 Texas Instruments Incorporated Adaptive weiner filtering using line spectral frequencies
US5774837A (en) * 1995-09-13 1998-06-30 Voxware, Inc. Speech coding system and method using voicing probability determination
US5806027A (en) * 1996-09-19 1998-09-08 Texas Instruments Incorporated Variable framerate parameter encoding
US6081776A (en) * 1998-07-13 2000-06-27 Lockheed Martin Corp. Speech coding system and method including adaptive finite impulse response filter
US6377915B1 (en) * 1999-03-17 2002-04-23 Yrp Advanced Mobile Communication Systems Research Laboratories Co., Ltd. Speech decoding using mix ratio table
US7315815B1 (en) * 1999-09-22 2008-01-01 Microsoft Corporation LPC-harmonic vocoder with superframe structure
US6475145B1 (en) * 2000-05-17 2002-11-05 Baymar, Inc. Method and apparatus for detection of acid reflux

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
NANDKUMAR S ET AL: "Robust speech mode based LSF vector quantization for low bit rate coders", ACOUSTICS, SPEECH AND SIGNAL PROCESSING, 1998. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON SEATTLE, WA, USA 12-15 MAY 1998, NEW YORK, NY, USA,IEEE, US, 12 May 1998 (1998-05-12), pages 41 - 44, XP010279049, ISBN: 0-7803-4428-6 *
PADELLINI M ET AL: "Codage de la parole a très bas débit par indexation d'unités de taille variable", RENCONTRES JEUNES CHERCHEURS EN PAROLE, XX, XX, 23 September 2003 (2003-09-23), pages 1 - 3, XP002285303 *
STACHURSKI J ET AL: "High quality MELP coding at bit-rates around 4 kb/s", ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 1999. PROCEEDINGS., 1999 IEEE INTERNATIONAL CONFERENCE ON PHOENIX, AZ, USA 15-19 MARCH 1999, PISCATAWAY, NJ, USA,IEEE, US, 15 March 1999 (1999-03-15), pages 485 - 488, XP010327975, ISBN: 0-7803-5041-3 *
ULPU SINERVO1 ET AL: "Multi-Mode Matrix Quantizer for Low Bit Rate LSF Quantization", EUROSSPEECH, September 2003 (2003-09-01), GENEVA, CH, pages 1073 - 1076, XP007006802 *

Also Published As

Publication number Publication date
DE602005018637D1 (de) 2010-02-11
WO2005114653A1 (fr) 2005-12-01
US7716045B2 (en) 2010-05-11
FR2869151B1 (fr) 2007-01-26
EP1756806A1 (fr) 2007-02-28
PL1756806T3 (pl) 2010-06-30
CA2567162A1 (fr) 2005-12-01
CA2567162C (fr) 2013-07-23
EP1756806B1 (fr) 2009-12-30
US20070219789A1 (en) 2007-09-20
ATE453909T1 (de) 2010-01-15
ES2338801T3 (es) 2010-05-12

Similar Documents

Publication Publication Date Title
CN1266674C (zh) 闭环多模混合域线性预测语音编解码器和处理帧的方法
CN100350453C (zh) 强壮语音分类方法和装置
CN100362568C (zh) 用于预测量化有声语音的方法和设备
CN1154086C (zh) Celp转发
HK1082315A1 (en) Method and device for gain quantization in variable bit rate wideband speech coding
CN102150024B (zh) 编码和解码统合的语音与音频信号的设备与方法
DK1879179T3 (da) Fremgangsmåde og anordning til kodning af audiodata baseret på vektorkvantisering
EP1807826A4 (fr) Procede et dispositif de codage de paroles a faible debit binaire
DK1222659T3 (da) LPC-harmonisk talekoder med superramme-struktur
EP2037451A1 (fr) Procédé pour améliorer l'efficacité de codage d'un signal audio
US20110119054A1 (en) Apparatus for encoding and decoding of integrated speech and audio
TW200703240A (en) Systems, methods, and apparatus for quantization of spectral envelope representation
CN1131994A (zh) 进行降低速率的可变速率声码合成的方法和装置
DE60124274D1 (de) Codebuchstruktur und suchverfahren für die sprachkodierung
CN1815558A (zh) 语音中非话音部分的低数据位速率编码
ATE272885T1 (de) Multimodaler sprachkodierer
CN108231083A (zh) 一种基于silk的语音编码器编码效率提高方法
CN104254886B (zh) 自适应编码浊音语音的基音周期
CN105765653A (zh) 自适应高通后滤波器
ATE453909T1 (de) Verfahren zum quantifizieren eines sprachcodierers mit ultraniedriger rate
CN100489966C (zh) 合成分析语音编码器中用于进行语音编码的方法和装置
CN101572090A (zh) 一种自适应多速率窄带编码方法及编码器
CN101266798B (zh) 一种在语音解码器中进行增益平滑的方法及装置
CN101211561A (zh) 音乐信号质量增强方法和装置
CN1437746A (zh) 跟踪准周期性信号的相位的方法和设备

Legal Events

Date Code Title Description
ST Notification of lapse

Effective date: 20121228