FI3547261T3 - Vector quantizer (Vektorikvantisoija) - Google Patents

Vector quantizer (Vektorikvantisoija)

Info

Publication number
FI3547261T3
Authority
FI
Finland
Prior art keywords
vector
code
centroid
class
vectors
Prior art date
Application number
FIEP19167463.9T
Other languages
English (en)
Finnish (fi)
Inventor
Volodya Grancharov
Toftgård Tomas Jansson
Original Assignee
Ericsson Telefon Ab L M
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ericsson Telefon Ab L M
Application granted
Publication of FI3547261T3

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032: Quantisation or dequantisation of spectral components
    • G10L19/038: Vector quantisation, e.g. TwinVQ audio
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16: Vocoder architecture
    • G10L19/18: Vocoders using multiple modes
    • H: ELECTRICITY
    • H03: ELECTRONIC CIRCUITRY
    • H03M: CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00: Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30: Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3082: Vector coding
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/94: Vector quantisation
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001: Codebooks
    • G10L2019/0013: Codebook search algorithms

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
FIEP19167463.9T 2012-03-29 2012-12-12 Vector quantizer (Vektorikvantisoija) FI3547261T3 (fi)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US201261617151P 2012-03-29 2012-03-29

Publications (1)

Publication Number Publication Date
FI3547261T3 (fi) 2023-09-26

Family

ID=47631684

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
FIEP19167463.9T FI3547261T3 (fi) 2012-03-29 2012-12-12 Vector quantizer (Vektorikvantisoija)

Country Status (12)

Country Link
US (5) US9401155B2
EP (4) EP4521350A1
CN (2) CN104221287B
BR (1) BR112014022848B1
DK (1) DK2831757T3
ES (3) ES2745143T3
FI (1) FI3547261T3
IN (1) IN2014DN07726A
PL (1) PL2831757T3
RU (3) RU2726158C2
TR (1) TR201911121T4
WO (1) WO2013147667A1

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IN2014DN07726A * 2012-03-29 2015-05-15 Ericsson Telefon Ab L M
TR201901612T4 (tr) * 2014-07-28 2019-02-21 Ericsson Telefon Ab L M Pyramid vector quantizer shape search
US11710492B2 (en) * 2019-10-02 2023-07-25 Qualcomm Incorporated Speech encoding using a pre-encoded database
CN111798532B (zh) * 2020-08-03 2021-03-16 广州市宝绅科技应用有限公司 Screen coding method and system based on centroid coincidence
US12438554B1 (en) * 2024-03-31 2025-10-07 AtomBeam Technologies Inc. System and method for federated two-stage compression within a persistent cognitive machine

Family Cites Families (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63217878A (ja) 1987-03-06 1988-09-09 Nippon Telegr & Teleph Corp <Ntt> Predictive tree-search vector quantization method
US5195168A (en) * 1991-03-15 1993-03-16 Codex Corporation Speech coder and method having spectral interpolation and fast codebook search
CA2135629C (en) * 1993-03-26 2000-02-08 Ira A. Gerson Multi-segment vector quantizer for a speech coder suitable for use in a radiotelephone
US5664055A (en) * 1995-06-07 1997-09-02 Lucent Technologies Inc. CS-ACELP speech compression system with adaptive pitch prediction filter gain based on a measure of periodicity
EP0788091A3 (en) * 1996-01-31 1999-02-24 Kabushiki Kaisha Toshiba Speech encoding and decoding method and apparatus therefor
JP3335841B2 (ja) * 1996-05-27 2002-10-21 日本電気株式会社 Signal encoding device
DE69710505T2 (de) * 1996-11-07 2002-06-27 Matsushita Electric Industrial Co., Ltd. Method and apparatus for generating a vector quantization codebook
WO1999021174A1 (en) * 1997-10-22 1999-04-29 Matsushita Electric Industrial Co., Ltd. Sound encoder and sound decoder
US6148283A (en) * 1998-09-23 2000-11-14 Qualcomm Inc. Method and apparatus using multi-path multi-stage vector quantizer
JP2002531979A (ja) * 1998-12-01 2002-09-24 ザ リージェンツ オブ ザ ユニバーシティ オブ カリフォルニア Improved waveform interpolation coder
US6782360B1 (en) * 1999-09-22 2004-08-24 Mindspeed Technologies, Inc. Gain quantization for a CELP speech coder
US7167828B2 (en) * 2000-01-11 2007-01-23 Matsushita Electric Industrial Co., Ltd. Multimode speech coding apparatus and decoding apparatus
US7171355B1 (en) * 2000-10-25 2007-01-30 Broadcom Corporation Method and apparatus for one-stage and two-stage noise feedback coding of speech and audio signals
WO2002045077A1 (en) * 2000-11-30 2002-06-06 Matsushita Electric Industrial Co., Ltd. Vector quantizing device for lpc parameters
US7610198B2 (en) * 2001-08-16 2009-10-27 Broadcom Corporation Robust quantization with efficient WMSE search of a sign-shape codebook using illegal space
CA2388358A1 (en) * 2002-05-31 2003-11-30 Voiceage Corporation A method and device for multi-rate lattice vector quantization
US7337110B2 (en) * 2002-08-26 2008-02-26 Motorola, Inc. Structured VSELP codebook for low complexity search
CA2415105A1 (en) * 2002-12-24 2004-06-24 Voiceage Corporation A method and device for robust predictive vector quantization of linear prediction parameters in variable bit rate speech coding
BRPI0608270A2 (pt) * 2005-04-01 2009-10-06 Qualcomm Inc Systems, methods, and apparatus for anti-sparseness filtering
US7587314B2 (en) * 2005-08-29 2009-09-08 Nokia Corporation Single-codebook vector quantization for multiple-rate applications
EP1946447B1 (en) * 2005-09-23 2014-06-04 Telefonaktiebolaget LM Ericsson (publ) Successively refinable lattice vector quantization
US20070129946A1 (en) * 2005-12-06 2007-06-07 Ma Changxue C High quality speech reconstruction for a dialog method and system
US8285544B2 (en) * 2006-03-21 2012-10-09 France Telecom Restrained vector quantisation
CN101198041B (zh) * 2006-12-05 2010-12-08 华为技术有限公司 Vector quantization method and apparatus
CN101335558B (zh) * 2007-06-29 2012-07-04 华为技术有限公司 Codebook generation method and apparatus for multiple-input multiple-output channels
US8050919B2 (en) * 2007-06-29 2011-11-01 Microsoft Corporation Speaker recognition via voice sample based on multiple nearest neighbor classifiers
KR101390051B1 (ko) * 2007-10-12 2014-04-29 파나소닉 주식회사 Vector quantization device, vector inverse quantization device, and methods thereof
ES2821432T3 (es) * 2008-02-15 2021-04-26 Nokia Technologies Oy Audio quantization by reduced-complexity vector indexing
EP2304722B1 (en) * 2008-07-17 2018-03-14 Nokia Technologies Oy Method and apparatus for fast nearest-neighbor search for vector quantizers
US20100174539A1 (en) * 2009-01-06 2010-07-08 Qualcomm Incorporated Method and apparatus for vector quantization codebook search
RU2519027C2 (ru) * 2009-02-13 2014-06-10 Панасоник Корпорэйшн Vector quantization device, vector inverse quantization device, and methods therefor
US8581757B2 (en) * 2009-07-02 2013-11-12 Siemens Enterprise Communications Gmbh & Co. Kg Method for vector quantization of a feature vector
RU2435214C2 (ru) 2010-02-01 2011-11-27 Государственное образовательное учреждение высшего профессионального образования Академия Федеральной службы охраны Российской Федерации (Академия ФСО России) Method for fast codebook search in vector quantization
CN102436815B (zh) * 2011-09-13 2012-12-19 东南大学 Speech recognition device for a network-based computer test system for spoken English
IN2014DN07726A * 2012-03-29 2015-05-15 Ericsson Telefon Ab L M
ES2561603T3 (es) * 2012-03-29 2016-02-29 Telefonaktiebolaget Lm Ericsson (Publ) Bandwidth extension of a harmonic audio signal
EP2831874B1 (en) * 2012-03-29 2017-05-03 Telefonaktiebolaget LM Ericsson (publ) Transform encoding/decoding of harmonic audio signals
BR112016007515B1 (pt) * 2013-10-18 2021-11-16 Telefonaktiebolaget Lm Ericsson (Publ) Audio signal segment encoding method, audio signal segment encoder, and user terminal
CN110649925B (zh) * 2013-11-12 2023-04-07 瑞典爱立信有限公司 Split gain-shape vector coding

Also Published As

Publication number Publication date
EP2831757A1 (en) 2015-02-04
CN107170459B (zh) 2020-08-04
EP4274235C0 (en) 2024-11-20
EP4274235A2 (en) 2023-11-08
EP3547261A1 (en) 2019-10-02
RU2624586C2 (ru) 2017-07-04
RU2020115683A3 2022-02-21
ES2745143T3 (es) 2020-02-27
PL2831757T3 (pl) 2019-11-29
US11017786B2 (en) 2021-05-25
US20190378526A1 (en) 2019-12-12
BR112014022848A2 (pt) 2017-06-20
US11741977B2 (en) 2023-08-29
TR201911121T4 (tr) 2019-08-21
RU2017121373A (ru) 2019-01-29
US20210241779A1 (en) 2021-08-05
US20150051907A1 (en) 2015-02-19
CN104221287B (zh) 2017-05-31
BR112014022848A8 (pt) 2021-05-25
US10468044B2 (en) 2019-11-05
EP4274235A3 (en) 2024-01-10
EP3547261B1 (en) 2023-08-09
RU2014143442A (ru) 2016-05-20
WO2013147667A1 (en) 2013-10-03
CN104221287A (zh) 2014-12-17
BR112014022848B1 (pt) 2021-07-20
RU2017121373A3 2020-03-12
US9401155B2 (en) 2016-07-26
CN107170459A (zh) 2017-09-15
ES2960582T3 (es) 2024-03-05
RU2020115683A (ru) 2021-11-12
DK2831757T3 (da) 2019-08-19
EP2831757B1 (en) 2019-06-19
EP4274235B1 (en) 2024-11-20
US9842601B2 (en) 2017-12-12
RU2726158C2 (ru) 2020-07-09
EP4521350A1 (en) 2025-03-12
US20160300581A1 (en) 2016-10-13
US20180061429A1 (en) 2018-03-01
ES3000058T3 (en) 2025-02-27
IN2014DN07726A 2015-05-15

Similar Documents

Publication Publication Date Title
FI3547261T3 (fi) Vector quantizer (Vektorikvantisoija)
TWI669705B Apparatus and method for encoding or decoding a multichannel signal using a side gain and a residual gain
CA2732723A1 (en) Apparatus and method for processing an audio signal for speech enhancement using a feature extraction
WO2018201112A1 (en) Audio coder window sizes and time-frequency transformations
RU2009132936A Encoding device and encoding method
JP2016539554A5
KR102615903B1 Audio coder window and transform implementations
BRPI0510400A Encoding device, decoding device, and method thereof
MX2009004639A Device and method for post-processing spectral values, and encoder and decoder for audio signals
JP2015184470A5
WO2008108078A1 Encoding device and encoding method
KR20140085415A Delay-optimized overlap transform, coding/decoding weighting windows
WO2006046587A1 Scalable encoding device, scalable decoding device, and methods thereof
JP5687706B2 Quantization device and quantization method
WO2012095700A1 (en) An audio encoder/decoder apparatus
ES3047123T3 (en) Method and encoder for handling envelope representation coefficients
KR20070090217A Scalable encoding device and scalable encoding method
RU2009132935A Encoding device, decoding device, and method
WO2011071335A3 Speech signal encoding method and apparatus
JP2018526669A Bit error detector for an audio signal decoder
US20140214412A1 (en) Apparatus and method for processing voice signal
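CN103098128B Pulse position search device, codebook search device, and methods thereof
KR20100115849A Audio fingerprinting system based on multiple hashing
CN121153079A Audio signal processing apparatus, method, and computer program based on feature splitting and feature combination
CN109716431B Sample sequence transformation device, sample sequence transformation method, and recording medium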