CN103518122B - Transform-domain codebook in a CELP coder and decoder - Google Patents

Transform-domain codebook in a CELP coder and decoder

Info

Publication number
CN103518122B
Authority
CN
China
Prior art keywords
codebook
domain
transform
adaptive
stage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201280022757.XA
Other languages
English (en)
Chinese (zh)
Other versions
CN103518122A (zh)
Inventor
V. Eksler
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shengdai Evs Ltd
Original Assignee
VoiceAge Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed: https://patents.darts-ip.com/?family=47138606&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=CN103518122(B) ("Global patent litigation dataset" by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.)
Application filed by VoiceAge Corp
Publication of CN103518122A
Application granted
Publication of CN103518122B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/22 Mode decision, i.e. based on audio signal content versus external parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G10L19/038 Vector quantisation, e.g. TwinVQ audio
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/10 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
    • G10L19/107 Sparse pulse excitation, e.g. by using algebraic codebook
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks
    • G10L2019/0004 Design or structure of the codebook
    • G10L2019/0005 Multi-stage vector quantisation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
CN201280022757.XA 2011-05-11 2012-05-09 Transform-domain codebook in a CELP coder and decoder Active CN103518122B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201161484968P 2011-05-11 2011-05-11
US61/484,968 2011-05-11
PCT/CA2012/000441 WO2012151676A1 (en) 2011-05-11 2012-05-09 Transform-domain codebook in a CELP coder and decoder

Publications (2)

Publication Number Publication Date
CN103518122A (zh) 2014-01-15
CN103518122B (zh) 2016-04-20

Family

ID=47138606

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280022757.XA Active CN103518122B (zh) 2011-05-11 2012-05-09 Transform-domain codebook in a CELP coder and decoder

Country Status (10)

Country Link
US (1) US8825475B2 (en)
EP (1) EP2707687B1 (en)
JP (1) JP6173304B2 (en)
CN (1) CN103518122B (en)
CA (1) CA2830105C (en)
DK (1) DK2707687T3 (en)
ES (1) ES2668920T3 (en)
NO (1) NO2669468T3 (en)
PT (1) PT2707687T (en)
WO (1) WO2012151676A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9263053B2 (en) * 2012-04-04 2016-02-16 Google Technology Holdings LLC Method and apparatus for generating a candidate code-vector to code an informational signal
US9070356B2 (en) * 2012-04-04 2015-06-30 Google Technology Holdings LLC Method and apparatus for generating a candidate code-vector to code an informational signal
MX2019006535A (es) * 2016-12-16 2019-08-21 Ericsson Telefon Ab L M Methods, encoder and decoder for handling envelope representation coefficients
KR20250016479A (ko) * 2017-09-20 2025-02-03 VoiceAge Corporation Method and device for efficiently distributing a bit-budget in a CELP codec
US12424227B2 (en) * 2020-11-05 2025-09-23 Nippon Telegraph And Telephone Corporation Sound signal refinement method, sound signal decode method, apparatus thereof, program, and storage medium
CN112767956B (zh) * 2021-04-09 2021-07-16 Tencent Technology (Shenzhen) Co., Ltd. Audio encoding method, apparatus, computer device and medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6108626A (en) * 1995-10-27 2000-08-22 Cselt-Centro Studi E Laboratori Telecomunicazioni S.P.A. Object oriented audio coding
US6134518A (en) * 1997-03-04 2000-10-17 International Business Machines Corporation Digital audio signal coding using a CELP coder and a transform coder
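CN1735928A (zh) * 2003-01-08 2006-02-15 France Telecom Method for variable-rate audio encoding and decoding
CN1957398A (zh) * 2004-02-18 2007-05-02 VoiceAge Corporation Method and device for low-frequency emphasis during audio compression based on ACELP/TCX
CN101842833A (zh) * 2007-09-11 2010-09-22 VoiceAge Corporation Method and device for fast algebraic codebook search in speech and audio coding
CN101849258A (zh) * 2007-11-04 2010-09-29 Qualcomm Incorporated Technique for encoding/decoding of codebook indices for quantized MDCT spectrum in scalable speech and audio codecs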
WO2011048094A1 (en) * 2009-10-20 2011-04-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Multi-mode audio codec and celp coding adapted therefore

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ES2247741T3 (es) * 1998-01-22 2006-03-01 Deutsche Telekom Ag Method for signal-controlled switching between audio coding schemes
US6453289B1 (en) * 1998-07-24 2002-09-17 Hughes Electronics Corporation Method of noise reduction for speech codecs
US7072832B1 (en) * 1998-08-24 2006-07-04 Mindspeed Technologies, Inc. System for speech encoding having an adaptive encoding arrangement
SE519985C2 (sv) * 2000-09-15 2003-05-06 Ericsson Telefon Ab L M Encoding and decoding of signals from multiple channels
US20030135374A1 (en) * 2002-01-16 2003-07-17 Hardwick John C. Speech synthesizer
CA2388358A1 (en) 2002-05-31 2003-11-30 Voiceage Corporation A method and device for multi-rate lattice vector quantization
CN100583241C (zh) * 2003-04-30 2010-01-20 Matsushita Electric Industrial Co., Ltd. Audio encoding device, audio decoding device, audio encoding method, and audio decoding method
US7177804B2 (en) * 2005-05-31 2007-02-13 Microsoft Corporation Sub-band voice codec with multi-stage codebooks and redundant coding
EP1907812B1 (fr) * 2005-07-22 2010-12-01 France Telecom Method for switching rate in bit-rate- and bandwidth-scalable audio decoding
US7877253B2 (en) * 2006-10-06 2011-01-25 Qualcomm Incorporated Systems, methods, and apparatus for frame erasure recovery
DK2102619T3 (en) * 2006-10-24 2017-05-15 Voiceage Corp METHOD AND DEVICE FOR CODING TRANSITION FRAMEWORK IN SPEECH SIGNALS
JP2011518345A (ja) * 2008-03-14 2011-06-23 Dolby Laboratories Licensing Corporation Multimode coding of speech-like and non-speech-like signals
EP2345027B1 (en) * 2008-10-10 2018-04-18 Telefonaktiebolaget LM Ericsson (publ) Energy-conserving multi-channel audio coding and decoding
FR2947945A1 (fr) * 2009-07-07 2011-01-14 France Telecom Bit allocation in an enhancement coding/decoding of a hierarchical coding/decoding of digital audio signals
CN102844810B (zh) 2010-04-14 2017-05-03 VoiceAge Corporation Flexible and scalable combined innovation codebook for use in CELP coder and decoder

Also Published As

Publication number Publication date
PT2707687T (pt) 2018-05-21
CN103518122A (zh) 2014-01-15
JP2014517933A (ja) 2014-07-24
CA2830105C (en) 2018-06-05
ES2668920T3 (es) 2018-05-23
EP2707687A1 (en) 2014-03-19
CA2830105A1 (en) 2012-11-15
US20120290295A1 (en) 2012-11-15
WO2012151676A1 (en) 2012-11-15
EP2707687B1 (en) 2018-03-28
HK1191395A1 (zh) 2014-07-25
JP6173304B2 (ja) 2017-08-02
EP2707687A4 (en) 2014-11-19
DK2707687T3 (en) 2018-05-28
US8825475B2 (en) 2014-09-02
NO2669468T3 (en) 2018-06-02

Similar Documents

Publication Publication Date Title
EP0942411B1 (en) Audio signal coding and decoding apparatus
US11881228B2 (en) Concept for encoding an audio signal and decoding an audio signal using speech related spectral shaping information
CN103518122B (zh) Transform-domain codebook in a CELP coder and decoder
US11798570B2 (en) Concept for encoding an audio signal and decoding an audio signal using deterministic and noise like information
JP6456412B2 (ja) Flexible and scalable combined innovation codebook for use in CELP coder and decoder
US6098037A (en) Formant weighted vector quantization of LPC excitation harmonic spectral amplitudes
CN105122358A (zh) Apparatus and method for processing an encoded signal, and encoder and method for generating an encoded signal
HK1191395B (en) Transform-domain codebook in a CELP coder and decoder
HK1175581B (en) Flexible and scalable combined innovation codebook for use in CELP coder and decoder
HK1175581A (en) Flexible and scalable combined innovation codebook for use in CELP coder and decoder
WO2009097763A1 (zh) Gain quantization method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1191395

Country of ref document: HK

C14 Grant of patent or utility model
GR01 Patent grant
REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1191395

Country of ref document: HK

TR01 Transfer of patent right

Effective date of registration: 20200908

Address after: California, USA

Patentee after: Shengdai EVs Ltd.

Address before: Kaisan ohokkatsu

Patentee before: VOICEAGE Corp.

IP01 Partial invalidation of patent right

Commission number: 4W115762

Conclusion of examination: Invention patent No. 201280022757.X is declared partially invalid; the patent is maintained as valid on the basis of claims 1-16 submitted by the patentee on May 22, 2024

Decision date of declaring invalidation: 20240617

Decision number of declaring invalidation: 569064

Denomination of invention: Transform domain codebooks in code excited linear predictive encoders and decoders

Granted publication date: 20160420

Patentee: Shengdai EVs Ltd.
