CN104221287B - 矢量量化器 - Google Patents

矢量量化器 Download PDF

Info

Publication number
CN104221287B
CN104221287B CN201280072059.0A CN201280072059A CN104221287B CN 104221287 B CN104221287 B CN 104221287B CN 201280072059 A CN201280072059 A CN 201280072059A CN 104221287 B CN104221287 B CN 104221287B
Authority
CN
China
Prior art keywords
vector
codebook
flip
class
codevector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201280072059.0A
Other languages
English (en)
Chinese (zh)
Other versions
CN104221287A (zh
Inventor
沃洛佳·格兰恰诺夫
托马斯·詹森·托夫特戈德
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Telefonaktiebolaget LM Ericsson AB
Original Assignee
Telefonaktiebolaget LM Ericsson AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonaktiebolaget LM Ericsson AB filed Critical Telefonaktiebolaget LM Ericsson AB
Priority to CN201710451005.XA priority Critical patent/CN107170459B/zh
Publication of CN104221287A publication Critical patent/CN104221287A/zh
Application granted granted Critical
Publication of CN104221287B publication Critical patent/CN104221287B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/038Vector quantisation, e.g. TwinVQ audio
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/94Vector quantisation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0013Codebook search algorithms
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3082Vector coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
CN201280072059.0A 2012-03-29 2012-12-12 矢量量化器 Active CN104221287B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710451005.XA CN107170459B (zh) 2012-03-29 2012-12-12 矢量量化器

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201261617151P 2012-03-29 2012-03-29
US61/617,151 2012-03-29
PCT/SE2012/051381 WO2013147667A1 (en) 2012-03-29 2012-12-12 Vector quantizer

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN201710451005.XA Division CN107170459B (zh) 2012-03-29 2012-12-12 矢量量化器

Publications (2)

Publication Number Publication Date
CN104221287A CN104221287A (zh) 2014-12-17
CN104221287B true CN104221287B (zh) 2017-05-31

Family

ID=47631684

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201280072059.0A Active CN104221287B (zh) 2012-03-29 2012-12-12 矢量量化器
CN201710451005.XA Active CN107170459B (zh) 2012-03-29 2012-12-12 矢量量化器

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201710451005.XA Active CN107170459B (zh) 2012-03-29 2012-12-12 矢量量化器

Country Status (12)

Country Link
US (5) US9401155B2 (https=)
EP (4) EP4521350A1 (https=)
CN (2) CN104221287B (https=)
BR (1) BR112014022848B1 (https=)
DK (1) DK2831757T3 (https=)
ES (3) ES2745143T3 (https=)
FI (1) FI3547261T3 (https=)
IN (1) IN2014DN07726A (https=)
PL (1) PL2831757T3 (https=)
RU (3) RU2726158C2 (https=)
TR (1) TR201911121T4 (https=)
WO (1) WO2013147667A1 (https=)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IN2014DN07726A (https=) * 2012-03-29 2015-05-15 Ericsson Telefon Ab L M
TR201901612T4 (tr) * 2014-07-28 2019-02-21 Ericsson Telefon Ab L M Piramit vektör niceleyici şekil araması.
US11710492B2 (en) * 2019-10-02 2023-07-25 Qualcomm Incorporated Speech encoding using a pre-encoded database
CN111798532B (zh) * 2020-08-03 2021-03-16 广州市宝绅科技应用有限公司 一种基于质心重合的网屏编码方法及系统
US12438554B1 (en) * 2024-03-31 2025-10-07 AtomBeam Technologies Inc. System and method for federated two-stage compression within a persistent cognitive machine

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101335558A (zh) * 2007-06-29 2008-12-31 华为技术有限公司 多输入多输出信道的码本生成方法及装置
US20110316732A1 (en) * 2009-02-13 2011-12-29 Panasonic Corporation Vector quantization device, vector inverse-quantization device, and methods of same
CN102436815A (zh) * 2011-09-13 2012-05-02 东南大学 一种应用于英语口语网络机考系统的语音识别装置

Family Cites Families (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63217878A (ja) 1987-03-06 1988-09-09 Nippon Telegr & Teleph Corp <Ntt> 予測木探索ベクトル量子化方式
US5195168A (en) * 1991-03-15 1993-03-16 Codex Corporation Speech coder and method having spectral interpolation and fast codebook search
CA2135629C (en) * 1993-03-26 2000-02-08 Ira A. Gerson Multi-segment vector quantizer for a speech coder suitable for use in a radiotelephone
US5664055A (en) * 1995-06-07 1997-09-02 Lucent Technologies Inc. CS-ACELP speech compression system with adaptive pitch prediction filter gain based on a measure of periodicity
EP0788091A3 (en) * 1996-01-31 1999-02-24 Kabushiki Kaisha Toshiba Speech encoding and decoding method and apparatus therefor
JP3335841B2 (ja) * 1996-05-27 2002-10-21 日本電気株式会社 信号符号化装置
DE69710505T2 (de) * 1996-11-07 2002-06-27 Matsushita Electric Industrial Co., Ltd. Verfahren und Vorrichtung zur Erzeugung eines Vektorquantisierungs-Codebuchs
WO1999021174A1 (en) * 1997-10-22 1999-04-29 Matsushita Electric Industrial Co., Ltd. Sound encoder and sound decoder
US6148283A (en) * 1998-09-23 2000-11-14 Qualcomm Inc. Method and apparatus using multi-path multi-stage vector quantizer
JP2002531979A (ja) * 1998-12-01 2002-09-24 ザ リージェンツ オブ ザ ユニバーシティ オブ カリフォルニア 改良波形補間型符号器
US6782360B1 (en) * 1999-09-22 2004-08-24 Mindspeed Technologies, Inc. Gain quantization for a CELP speech coder
US7167828B2 (en) * 2000-01-11 2007-01-23 Matsushita Electric Industrial Co., Ltd. Multimode speech coding apparatus and decoding apparatus
US7171355B1 (en) * 2000-10-25 2007-01-30 Broadcom Corporation Method and apparatus for one-stage and two-stage noise feedback coding of speech and audio signals
WO2002045077A1 (en) * 2000-11-30 2002-06-06 Matsushita Electric Industrial Co., Ltd. Vector quantizing device for lpc parameters
US7610198B2 (en) * 2001-08-16 2009-10-27 Broadcom Corporation Robust quantization with efficient WMSE search of a sign-shape codebook using illegal space
CA2388358A1 (en) * 2002-05-31 2003-11-30 Voiceage Corporation A method and device for multi-rate lattice vector quantization
US7337110B2 (en) * 2002-08-26 2008-02-26 Motorola, Inc. Structured VSELP codebook for low complexity search
CA2415105A1 (en) * 2002-12-24 2004-06-24 Voiceage Corporation A method and device for robust predictive vector quantization of linear prediction parameters in variable bit rate speech coding
BRPI0608270A2 (pt) * 2005-04-01 2009-10-06 Qualcomm Inc sistemas, métodos e equipamento para filtragem anti-dispersão
US7587314B2 (en) * 2005-08-29 2009-09-08 Nokia Corporation Single-codebook vector quantization for multiple-rate applications
EP1946447B1 (en) * 2005-09-23 2014-06-04 Telefonaktiebolaget LM Ericsson (publ) Successively refinable lattice vector quantization
US20070129946A1 (en) * 2005-12-06 2007-06-07 Ma Changxue C High quality speech reconstruction for a dialog method and system
US8285544B2 (en) * 2006-03-21 2012-10-09 France Telecom Restrained vector quantisation
CN101198041B (zh) * 2006-12-05 2010-12-08 华为技术有限公司 矢量量化方法及装置
US8050919B2 (en) * 2007-06-29 2011-11-01 Microsoft Corporation Speaker recognition via voice sample based on multiple nearest neighbor classifiers
KR101390051B1 (ko) * 2007-10-12 2014-04-29 파나소닉 주식회사 벡터 양자화 장치, 벡터 역양자화 장치, 및 이러한 방법
ES2821432T3 (es) * 2008-02-15 2021-04-26 Nokia Technologies Oy Cuantificación de audio mediante indexación de vectores de complejidad reducida
EP2304722B1 (en) * 2008-07-17 2018-03-14 Nokia Technologies Oy Method and apparatus for fast nearest-neighbor search for vector quantizers
US20100174539A1 (en) * 2009-01-06 2010-07-08 Qualcomm Incorporated Method and apparatus for vector quantization codebook search
US8581757B2 (en) * 2009-07-02 2013-11-12 Siemens Enterprise Communications Gmbh & Co. Kg Method for vector quantization of a feature vector
RU2435214C2 (ru) 2010-02-01 2011-11-27 Государственное образовательное учреждение высшего профессионального образования Академия Федеральной службы охраны Российской Федерации (Академия ФСО России) Способ быстрого поиска в кодовой книге при векторном квантовании
IN2014DN07726A (https=) * 2012-03-29 2015-05-15 Ericsson Telefon Ab L M
ES2561603T3 (es) * 2012-03-29 2016-02-29 Telefonaktiebolaget Lm Ericsson (Publ) Extensión del ancho de banda de una señal de audio armónica
EP2831874B1 (en) * 2012-03-29 2017-05-03 Telefonaktiebolaget LM Ericsson (publ) Transform encoding/decoding of harmonic audio signals
BR112016007515B1 (pt) * 2013-10-18 2021-11-16 Telefonaktiebolaget Lm Ericsson (Publ) Método de codificação de segmento de sinal de áudio, codificador de segmento de sinal de áudio, e, terminal de usuário.
CN110649925B (zh) * 2013-11-12 2023-04-07 瑞典爱立信有限公司 划分的增益形状向量编码

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101335558A (zh) * 2007-06-29 2008-12-31 华为技术有限公司 多输入多输出信道的码本生成方法及装置
US20110316732A1 (en) * 2009-02-13 2011-12-29 Panasonic Corporation Vector quantization device, vector inverse-quantization device, and methods of same
CN102436815A (zh) * 2011-09-13 2012-05-02 东南大学 一种应用于英语口语网络机考系统的语音识别装置

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
《APPLICATION OF SORTED CODEBOOK VECTOR QUANTIZATION TO SPECTRAL, CODING OF SPEECH》;H.R. Sadegh Mohammadi et al;《IEEE Global Telecommunications Conference》;19951231;第3卷;第1595-1598页 *
《Fast codebook search algorithm for unconstrained vector quantisation》;Chen C Q et al;《 IEEE Proceedings-Vision Image and Signal Processing》;19980430;第145卷(第2期);第97-102页 *

Also Published As

Publication number Publication date
EP2831757A1 (en) 2015-02-04
CN107170459B (zh) 2020-08-04
EP4274235C0 (en) 2024-11-20
EP4274235A2 (en) 2023-11-08
EP3547261A1 (en) 2019-10-02
RU2624586C2 (ru) 2017-07-04
RU2020115683A3 (https=) 2022-02-21
ES2745143T3 (es) 2020-02-27
PL2831757T3 (pl) 2019-11-29
US11017786B2 (en) 2021-05-25
US20190378526A1 (en) 2019-12-12
BR112014022848A2 (pt) 2017-06-20
US11741977B2 (en) 2023-08-29
TR201911121T4 (tr) 2019-08-21
RU2017121373A (ru) 2019-01-29
US20210241779A1 (en) 2021-08-05
US20150051907A1 (en) 2015-02-19
BR112014022848A8 (pt) 2021-05-25
US10468044B2 (en) 2019-11-05
EP4274235A3 (en) 2024-01-10
EP3547261B1 (en) 2023-08-09
RU2014143442A (ru) 2016-05-20
WO2013147667A1 (en) 2013-10-03
CN104221287A (zh) 2014-12-17
BR112014022848B1 (pt) 2021-07-20
RU2017121373A3 (https=) 2020-03-12
US9401155B2 (en) 2016-07-26
CN107170459A (zh) 2017-09-15
ES2960582T3 (es) 2024-03-05
RU2020115683A (ru) 2021-11-12
DK2831757T3 (da) 2019-08-19
EP2831757B1 (en) 2019-06-19
EP4274235B1 (en) 2024-11-20
US9842601B2 (en) 2017-12-12
RU2726158C2 (ru) 2020-07-09
EP4521350A1 (en) 2025-03-12
US20160300581A1 (en) 2016-10-13
FI3547261T3 (fi) 2023-09-26
US20180061429A1 (en) 2018-03-01
ES3000058T3 (en) 2025-02-27
IN2014DN07726A (https=) 2015-05-15

Similar Documents

Publication Publication Date Title
US11741977B2 (en) Vector quantizer
US9269366B2 (en) Hybrid instantaneous/differential pitch period coding
CN103052984A (zh) 用于动态位分配的系统、方法、设备和计算机可读媒体
US10891758B2 (en) Geometry encoder
CN106415715A (zh) 编码装置、解码装置、及其方法、程序
KR101821532B1 (ko) 벡터 양자화
US10318891B1 (en) Geometry encoder
CN115376532B (zh) 一种音频编码、解码方法、装置、设备及存储介质
CN114841325B (zh) 神经网络模型的数据处理方法、介质及电子设备
KR20180026528A (ko) 오디오 신호 디코더를 위한 비트 에러 검출기
EP3803794B1 (en) Geometry encoder
JP6475273B2 (ja) ベクトル量子化
EP4634912A1 (en) Improved transitions in a multi-mode audio decoder
CN110709928A (zh) 多个结构化码本的有效存储
WO2009056047A1 (fr) Procédé de quantification vectorielle et quantificateur vectoriel

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant