CN103518122B - 码激励线性预测编码器和解码器中的变换域码本 - Google Patents
码激励线性预测编码器和解码器中的变换域码本 Download PDFInfo
- Publication number
- CN103518122B CN103518122B CN201280022757.XA CN201280022757A CN103518122B CN 103518122 B CN103518122 B CN 103518122B CN 201280022757 A CN201280022757 A CN 201280022757A CN 103518122 B CN103518122 B CN 103518122B
- Authority
- CN
- China
- Prior art keywords
- codebook
- domain
- transform
- adaptive
- stage
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
- G10L19/107—Sparse pulse excitation, e.g. by using algebraic codebook
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0004—Design or structure of the codebook
- G10L2019/0005—Multi-stage vector quantisation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201161484968P | 2011-05-11 | 2011-05-11 | |
| US61/484,968 | 2011-05-11 | ||
| PCT/CA2012/000441 WO2012151676A1 (en) | 2011-05-11 | 2012-05-09 | Transform-domain codebook in a celp coder and decoder |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN103518122A CN103518122A (zh) | 2014-01-15 |
| CN103518122B true CN103518122B (zh) | 2016-04-20 |
Family
ID=47138606
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201280022757.XA Active CN103518122B (zh) | 2011-05-11 | 2012-05-09 | 码激励线性预测编码器和解码器中的变换域码本 |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US8825475B2 (enExample) |
| EP (1) | EP2707687B1 (enExample) |
| JP (1) | JP6173304B2 (enExample) |
| CN (1) | CN103518122B (enExample) |
| CA (1) | CA2830105C (enExample) |
| DK (1) | DK2707687T3 (enExample) |
| ES (1) | ES2668920T3 (enExample) |
| NO (1) | NO2669468T3 (enExample) |
| PT (1) | PT2707687T (enExample) |
| WO (1) | WO2012151676A1 (enExample) |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9263053B2 (en) * | 2012-04-04 | 2016-02-16 | Google Technology Holdings LLC | Method and apparatus for generating a candidate code-vector to code an informational signal |
| US9070356B2 (en) * | 2012-04-04 | 2015-06-30 | Google Technology Holdings LLC | Method and apparatus for generating a candidate code-vector to code an informational signal |
| MX2019006535A (es) * | 2016-12-16 | 2019-08-21 | Ericsson Telefon Ab L M | Metodos, codificador y decodificador para manejar coeficientes de representacion de envolvente. |
| KR20250016479A (ko) * | 2017-09-20 | 2025-02-03 | 보이세지 코포레이션 | 씨이엘피 코덱에 있어서 비트-예산을 효율적으로 분배하는 방법 및 디바이스 |
| US12424227B2 (en) * | 2020-11-05 | 2025-09-23 | Nippon Telegraph And Telephone Corporation | Sound signal refinement method, sound signal decode method, apparatus thereof, program, and storage medium |
| CN112767956B (zh) * | 2021-04-09 | 2021-07-16 | 腾讯科技(深圳)有限公司 | 音频编码方法、装置、计算机设备及介质 |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6108626A (en) * | 1995-10-27 | 2000-08-22 | Cselt-Centro Studi E Laboratori Telecomunicazioni S.P.A. | Object oriented audio coding |
| US6134518A (en) * | 1997-03-04 | 2000-10-17 | International Business Machines Corporation | Digital audio signal coding using a CELP coder and a transform coder |
| CN1735928A (zh) * | 2003-01-08 | 2006-02-15 | 法国电信公司 | 用于可变速率音频编解码的方法 |
| CN1957398A (zh) * | 2004-02-18 | 2007-05-02 | 沃伊斯亚吉公司 | 在基于代数码激励线性预测/变换编码激励的音频压缩期间低频加重的方法和设备 |
| CN101842833A (zh) * | 2007-09-11 | 2010-09-22 | 沃伊斯亚吉公司 | 语音和音频编码中快速代数码本搜索的方法和设备 |
| CN101849258A (zh) * | 2007-11-04 | 2010-09-29 | 高通股份有限公司 | 在可缩放语音和音频编解码器中的用于经量化的mdct频谱的码簿索引的编码/解码的技术 |
| WO2011048094A1 (en) * | 2009-10-20 | 2011-04-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Multi-mode audio codec and celp coding adapted therefore |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| ES2247741T3 (es) * | 1998-01-22 | 2006-03-01 | Deutsche Telekom Ag | Metodo para conmutacion controlada por señales entre esquemas de codificacion de audio. |
| US6453289B1 (en) * | 1998-07-24 | 2002-09-17 | Hughes Electronics Corporation | Method of noise reduction for speech codecs |
| US7072832B1 (en) * | 1998-08-24 | 2006-07-04 | Mindspeed Technologies, Inc. | System for speech encoding having an adaptive encoding arrangement |
| SE519985C2 (sv) * | 2000-09-15 | 2003-05-06 | Ericsson Telefon Ab L M | Kodning och avkodning av signaler från flera kanaler |
| US20030135374A1 (en) * | 2002-01-16 | 2003-07-17 | Hardwick John C. | Speech synthesizer |
| CA2388358A1 (en) | 2002-05-31 | 2003-11-30 | Voiceage Corporation | A method and device for multi-rate lattice vector quantization |
| CN100583241C (zh) * | 2003-04-30 | 2010-01-20 | 松下电器产业株式会社 | 音频编码设备、音频解码设备、音频编码方法和音频解码方法 |
| US7177804B2 (en) * | 2005-05-31 | 2007-02-13 | Microsoft Corporation | Sub-band voice codec with multi-stage codebooks and redundant coding |
| EP1907812B1 (fr) * | 2005-07-22 | 2010-12-01 | France Telecom | Procede de commutation de debit en decodage audio scalable en debit et largeur de bande |
| US7877253B2 (en) * | 2006-10-06 | 2011-01-25 | Qualcomm Incorporated | Systems, methods, and apparatus for frame erasure recovery |
| DK2102619T3 (en) * | 2006-10-24 | 2017-05-15 | Voiceage Corp | METHOD AND DEVICE FOR CODING TRANSITION FRAMEWORK IN SPEECH SIGNALS |
| JP2011518345A (ja) * | 2008-03-14 | 2011-06-23 | ドルビー・ラボラトリーズ・ライセンシング・コーポレーション | スピーチライク信号及びノンスピーチライク信号のマルチモードコーディング |
| EP2345027B1 (en) * | 2008-10-10 | 2018-04-18 | Telefonaktiebolaget LM Ericsson (publ) | Energy-conserving multi-channel audio coding and decoding |
| FR2947945A1 (fr) * | 2009-07-07 | 2011-01-14 | France Telecom | Allocation de bits dans un codage/decodage d'amelioration d'un codage/decodage hierarchique de signaux audionumeriques |
| CN102844810B (zh) | 2010-04-14 | 2017-05-03 | 沃伊斯亚吉公司 | 用于在码激励线性预测编码器和解码器中使用的灵活和可缩放的组合式创新代码本 |
-
2008
- 2008-10-17 NO NO13180475A patent/NO2669468T3/no unknown
-
2012
- 2012-05-09 EP EP12782641.0A patent/EP2707687B1/en active Active
- 2012-05-09 DK DK12782641.0T patent/DK2707687T3/en active
- 2012-05-09 JP JP2014509572A patent/JP6173304B2/ja active Active
- 2012-05-09 ES ES12782641.0T patent/ES2668920T3/es active Active
- 2012-05-09 WO PCT/CA2012/000441 patent/WO2012151676A1/en not_active Ceased
- 2012-05-09 PT PT127826410T patent/PT2707687T/pt unknown
- 2012-05-09 CN CN201280022757.XA patent/CN103518122B/zh active Active
- 2012-05-09 CA CA2830105A patent/CA2830105C/en active Active
- 2012-05-11 US US13/469,744 patent/US8825475B2/en active Active
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6108626A (en) * | 1995-10-27 | 2000-08-22 | Cselt-Centro Studi E Laboratori Telecomunicazioni S.P.A. | Object oriented audio coding |
| US6134518A (en) * | 1997-03-04 | 2000-10-17 | International Business Machines Corporation | Digital audio signal coding using a CELP coder and a transform coder |
| CN1735928A (zh) * | 2003-01-08 | 2006-02-15 | 法国电信公司 | 用于可变速率音频编解码的方法 |
| CN1957398A (zh) * | 2004-02-18 | 2007-05-02 | 沃伊斯亚吉公司 | 在基于代数码激励线性预测/变换编码激励的音频压缩期间低频加重的方法和设备 |
| CN101842833A (zh) * | 2007-09-11 | 2010-09-22 | 沃伊斯亚吉公司 | 语音和音频编码中快速代数码本搜索的方法和设备 |
| CN101849258A (zh) * | 2007-11-04 | 2010-09-29 | 高通股份有限公司 | 在可缩放语音和音频编解码器中的用于经量化的mdct频谱的码簿索引的编码/解码的技术 |
| WO2011048094A1 (en) * | 2009-10-20 | 2011-04-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Multi-mode audio codec and celp coding adapted therefore |
Also Published As
| Publication number | Publication date |
|---|---|
| PT2707687T (pt) | 2018-05-21 |
| CN103518122A (zh) | 2014-01-15 |
| JP2014517933A (ja) | 2014-07-24 |
| CA2830105C (en) | 2018-06-05 |
| ES2668920T3 (es) | 2018-05-23 |
| EP2707687A1 (en) | 2014-03-19 |
| CA2830105A1 (en) | 2012-11-15 |
| US20120290295A1 (en) | 2012-11-15 |
| WO2012151676A1 (en) | 2012-11-15 |
| EP2707687B1 (en) | 2018-03-28 |
| HK1191395A1 (zh) | 2014-07-25 |
| JP6173304B2 (ja) | 2017-08-02 |
| EP2707687A4 (en) | 2014-11-19 |
| DK2707687T3 (en) | 2018-05-28 |
| US8825475B2 (en) | 2014-09-02 |
| NO2669468T3 (enExample) | 2018-06-02 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP0942411B1 (en) | Audio signal coding and decoding apparatus | |
| US11881228B2 (en) | Concept for encoding an audio signal and decoding an audio signal using speech related spectral shaping information | |
| CN103518122B (zh) | 码激励线性预测编码器和解码器中的变换域码本 | |
| US11798570B2 (en) | Concept for encoding an audio signal and decoding an audio signal using deterministic and noise like information | |
| JP6456412B2 (ja) | Celp符号器および復号器で使用するための柔軟で拡張性のある複合革新コードブック | |
| US6098037A (en) | Formant weighted vector quantization of LPC excitation harmonic spectral amplitudes | |
| CN105122358A (zh) | 用于处理编码信号的装置和方法与用于产生编码信号的编码器和方法 | |
| HK1191395B (en) | Transform-domain codebook in a celp coder and decoder | |
| HK1175581B (en) | Flexible and scalable combined innovation codebook for use in celp coder and decoder | |
| HK1175581A (en) | Flexible and scalable combined innovation codebook for use in celp coder and decoder | |
| WO2009097763A1 (zh) | 一种增益量化方法及装置 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1191395 Country of ref document: HK |
|
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1191395 Country of ref document: HK |
|
| TR01 | Transfer of patent right |
Effective date of registration: 20200908 Address after: California, USA Patentee after: Shengdai EVs Ltd. Address before: Kaisan ohokkatsu Patentee before: VOICEAGE Corp. |
|
| TR01 | Transfer of patent right | ||
| IP01 | Partial invalidation of patent right |
Commission number: 4W115762 Conclusion of examination: Declare the invention patent No. 201280022757. X partially invalid, and maintain the validity of the patent on the basis of claims 1-16 submitted by the patentee on May 22, 2024 Decision date of declaring invalidation: 20240617 Decision number of declaring invalidation: 569064 Denomination of invention: Transform domain codebooks in code excited linear predictive encoders and decoders Granted publication date: 20160420 Patentee: Shengdai EVs Ltd. |
|
| IP01 | Partial invalidation of patent right |