CN102496368A - 一种改进的矢量量化方法 - Google Patents

一种改进的矢量量化方法 Download PDF

Info

Publication number
CN102496368A
CN102496368A CN2011104277452A CN201110427745A CN102496368A CN 102496368 A CN102496368 A CN 102496368A CN 2011104277452 A CN2011104277452 A CN 2011104277452A CN 201110427745 A CN201110427745 A CN 201110427745A CN 102496368 A CN102496368 A CN 102496368A
Authority
CN
China
Prior art keywords
vector quantization
code book
quantization method
improved
improved vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2011104277452A
Other languages
English (en)
Other versions
CN102496368B (zh
Inventor
张小恒
肖宏
于进强
廖红云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing Jinmei Communication Co Ltd
Original Assignee
Chongqing Jinmei Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing Jinmei Communication Co Ltd filed Critical Chongqing Jinmei Communication Co Ltd
Priority to CN201110427745.2A priority Critical patent/CN102496368B/zh
Publication of CN102496368A publication Critical patent/CN102496368A/zh
Application granted granted Critical
Publication of CN102496368B publication Critical patent/CN102496368B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

本发明公开了一种矢量量化码本生成方法,该方法根据单一维度对训练序列进行最佳划分生成初始码书,然后用GLA算法得到优化后的码书,重复进行最终得到具有2N个码本的码书。本发明的有益技术效果是:利用单一维度可以最大程度简化计算复杂度而不失精度,从而很大程度上提高了整体矢量量化器的性能。

Description

一种改进的矢量量化方法
技术领域
本发明涉及一种语音参数数据的压缩处理技术,在医学图像,遥感图像压缩,视频处理,图片修复和文献检索等领域也有广泛应用。
背景技术
现有矢量量化技术一般在生成初始码本上存在较大的随机性,而初始码本的优良与否会导致最终码本生成的质量优劣。就随机初始码本生成算法而言,容易掉进局部最优值,而非全局最优。而现有的分裂法在初始分裂技术选择上一般存在精度和算法的时间复杂度不可兼得的缺点,导致最终生成的码本质量下降或者整个生成过程耗时过长,不利于对处理时间有特殊要求的工程实现。
发明内容
本发明提出一种改进的矢量量化技术,该方法包括根据向量数据的某一维度进行数据的初始分裂,然后基于GLA算法进行迭代优化帅选,将数据一分为二。然后再将已经一分为二的数据集合A和B,再分别进行先单维度初始分裂后GLA优化的过程,分别得到                                                
Figure 2011104277452100002DEST_PATH_IMAGE001
4个数据集合,并对这4个数据集合进行GLA优化,以此类推,可以得到个数据集合,即数据元胞,从而得到
Figure 664959DEST_PATH_IMAGE002
个码本。
对于单维度初始分裂法即通过计算搜索数据跨度最大的一维,并找到这一维的重心,再以这一维的重心作为数据的分界点,从而完成初始分裂。
对于多级码本生成而言,在第一级码本完成后,数据元胞内数据与码本之间的距离形成了生成第二级码本的样本空间,以此类推,可以生成多级码本。
本发明的有益技术效果是:通过用单维度分裂初始码本可以迅速将数据一分为二,在算法时间复杂度上很低,也能保证一定的分裂精度,从而提升了整个矢量量化系统的性能。
附图说明
图1 单维度最佳划分。
图2 GLA算法流程。
图3 本发明的系统流程图。 
具体实施方式
步骤1:计算所有训练矢量的质心
Figure 2011104277452100002DEST_PATH_IMAGE003
步骤2:依靠单维度最优划分法将所有训练矢量划分为两个集合A和B。并分别求取两个集合的质心,得到对应的码字
Figure 2011104277452100002DEST_PATH_IMAGE004
Figure 2011104277452100002DEST_PATH_IMAGE005
步骤3:以
Figure 107704DEST_PATH_IMAGE004
Figure 782399DEST_PATH_IMAGE005
为初始码字,用GLA算法设计仅含2个码字的码书
Figure 2011104277452100002DEST_PATH_IMAGE006
步骤4:通过以上的GLA算法最终将训练矢量划分为集合
Figure 2011104277452100002DEST_PATH_IMAGE007
Figure 2011104277452100002DEST_PATH_IMAGE008
,用单维度划分法将
Figure 492735DEST_PATH_IMAGE007
Figure 860262DEST_PATH_IMAGE008
划分为集合
Figure 836309DEST_PATH_IMAGE001
。并分别计算其质心,形成4个码字
Figure 2011104277452100002DEST_PATH_IMAGE009
步骤5:以这4个码字为初始码书,用GLA算法设计尽含4个码字的码书
Figure 2011104277452100002DEST_PATH_IMAGE010
,通过GLA算法将训练集合分成4个子集合
Figure 2011104277452100002DEST_PATH_IMAGE011
,将子集合用单维度划分法将其二分并计算初始码字。如此反复,经过
Figure 2011104277452100002DEST_PATH_IMAGE012
次设计,就得到所要求的含N 个码字的初始码书。
单维度最优划分法:对于M 维训练序列而言,计算不同维度的跨度
Figure 2011104277452100002DEST_PATH_IMAGE013
,找出使得
Figure 2011104277452100002DEST_PATH_IMAGE014
最大的一维,令其为L,计算其质心。根据质心将训练序列划分为A和B两个子训练序列,
Figure 2011104277452100002DEST_PATH_IMAGE016
Figure 2011104277452100002DEST_PATH_IMAGE017
  
然后分别计算A集合和B集合的质心
Figure 2011104277452100002DEST_PATH_IMAGE018
,初始码书
Figure 2011104277452100002DEST_PATH_IMAGE020

Claims (2)

1.一种改进的矢量量化方法,其特征在于:利用训练样本的某一维度来对训练样本进行初始划分。
2.根据权利要求1所述的矢量量化方法,其特征在于:利用单一维度的质心来对训练样本进行最佳划分。
CN201110427745.2A 2011-12-20 2011-12-20 一种改进的矢量量化方法 Active CN102496368B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110427745.2A CN102496368B (zh) 2011-12-20 2011-12-20 一种改进的矢量量化方法

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110427745.2A CN102496368B (zh) 2011-12-20 2011-12-20 一种改进的矢量量化方法

Publications (2)

Publication Number Publication Date
CN102496368A true CN102496368A (zh) 2012-06-13
CN102496368B CN102496368B (zh) 2014-03-12

Family

ID=46188185

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110427745.2A Active CN102496368B (zh) 2011-12-20 2011-12-20 一种改进的矢量量化方法

Country Status (1)

Country Link
CN (1) CN102496368B (zh)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997036376A1 (en) * 1996-03-28 1997-10-02 Vxtreme, Inc. Table-based compression with embedded coding
US6968092B1 (en) * 2001-08-21 2005-11-22 Cisco Systems Canada Co. System and method for reduced codebook vector quantization
CN101414365A (zh) * 2008-11-20 2009-04-22 山东大学威海分校 一种基于粒子群的矢量码书量化器
CN101420230A (zh) * 2008-12-01 2009-04-29 中国人民解放军理工大学 一种选择预测矢量量化的迭代优化设计方法
CN101740029A (zh) * 2009-12-16 2010-06-16 深圳大学 应用于基于矢量量化的说话人识别的三粒子协同优化方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997036376A1 (en) * 1996-03-28 1997-10-02 Vxtreme, Inc. Table-based compression with embedded coding
US6968092B1 (en) * 2001-08-21 2005-11-22 Cisco Systems Canada Co. System and method for reduced codebook vector quantization
CN101414365A (zh) * 2008-11-20 2009-04-22 山东大学威海分校 一种基于粒子群的矢量码书量化器
CN101420230A (zh) * 2008-12-01 2009-04-29 中国人民解放军理工大学 一种选择预测矢量量化的迭代优化设计方法
CN101740029A (zh) * 2009-12-16 2010-06-16 深圳大学 应用于基于矢量量化的说话人识别的三粒子协同优化方法

Also Published As

Publication number Publication date
CN102496368B (zh) 2014-03-12

Similar Documents

Publication Publication Date Title
Han et al. Ghostnet: More features from cheap operations
US20220101082A1 (en) Generating representations of input sequences using neural networks
Kriman et al. Quartznet: Deep automatic speech recognition with 1d time-channel separable convolutions
US20180121796A1 (en) Flexible neural network accelerator and methods therefor
US10255911B2 (en) System and method of automatic speech recognition using parallel processing for weighted finite state transducer-based speech decoding
TWI610295B (zh) 解壓縮及壓縮用於語音辨識之轉換器資料的電腦實施方法及電腦實施之語音辨識系統
GB201212783D0 (en) A speech processing system
CN113378784B (zh) 视频标签推荐模型的训练方法和确定视频标签的方法
JP2014002723A5 (zh)
CN111341299B (zh) 一种语音处理方法及装置
MX2020012042A (es) Dispositivo de codificacion, dispositivo de decodificacion, metodo de codificacion y metodo de decodificacion.
CN113450802A (zh) 具有高效解码的自动语音识别方法及系统
WO2016075274A1 (en) Methods, systems and apparatus for image recognition based on recursively determined exemplar-support vector machines (e-svm) features
US10511695B2 (en) Packet-level clustering for memory-assisted compression of network traffic
KR20130124704A (ko) 분산환경 리스코어링 방법 및 장치
CN110633798B (zh) 一种分布式训练中参数更新方法及装置
CN102496368A (zh) 一种改进的矢量量化方法
CN110135465B (zh) 模型参数表示空间大小估计方法及装置、推荐方法
EP2801952B1 (en) Method and device for compression of vertex data in three-dimensional image data
CN112818098B (zh) 基于知识库的对话生成方法、装置、终端及存储介质
Xu et al. Optimized pointwise convolution operation by Ghost blocks
CN103400348B (zh) 基于压缩感知的图像还原方法及系统
CN113628103B (zh) 基于多级损失的高细粒度卡通人脸生成方法及其相关组件
Aich et al. Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Պետրոսյան KNOWLEDGE BASED ECONOMY. THE PERSPECTIVES OF THE ECONOMY OF ARMENIA AND ITS ECONOMETRIC MODELLING

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant