CN109545234A - Speech line spectral frequency coding and adaptive fast reconstruction method based on compressed sensing - Google Patents
- Publication number
- CN109545234A (application CN201811268384.XA)
- Authority
- CN
- China
- Prior art keywords
- superframe
- line spectral
- type
- subframe
- spectral frequencies
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/0017—Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
- G10L19/0018—Speech coding using phonetic or linguistical decoding of the source; Reconstruction using text-to-speech synthesis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (11)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811268384.XA CN109545234B (zh) | 2018-10-29 | 2018-10-29 | Speech line spectral frequency coding and adaptive fast reconstruction method based on compressed sensing |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811268384.XA CN109545234B (zh) | 2018-10-29 | 2018-10-29 | Speech line spectral frequency coding and adaptive fast reconstruction method based on compressed sensing |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109545234A (zh) | 2019-03-29 |
CN109545234B (zh) | 2023-09-26 |
Family
ID=65845821
Family Applications (1)
Application Number | Status | Priority Date | Filing Date | Title |
---|---|---|---|---|
CN201811268384.XA CN109545234B (zh) | Active | 2018-10-29 | 2018-10-29 | Speech line spectral frequency coding and adaptive fast reconstruction method based on compressed sensing |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109545234B (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113990335A (zh) * | 2021-10-28 | 2022-01-28 | 南京南大电子智慧型服务机器人研究院有限公司 | Audio encoding and decoding method based on compressed sensing |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
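CN103347268A (zh) * | 2013-06-05 | 2013-10-09 | Hangzhou Dianzi University | Adaptive compression and reconstruction method based on energy-efficient observation in cognitive sensor networks |
WO2013188957A1 (en) * | 2012-06-18 | 2013-12-27 | University Health Network | Method and system for compressed sensing image reconstruction |
WO2016049544A1 (en) * | 2014-09-25 | 2016-03-31 | Northwestern University | Devices, methods, and systems relating to super resolution imaging |
US20160124903A1 (en) * | 2013-11-03 | 2016-05-05 | Brian G. Agee | Subspace-constrained partial update method for high-dimensional adaptive processing systems |
CN106500735A (zh) * | 2016-11-03 | 2017-03-15 | Chongqing University of Posts and Telecommunications | Adaptive repair method for FBG signals based on compressed sensing |
CN106685428A (zh) * | 2016-12-30 | 2017-05-17 | Chongqing University of Posts and Telecommunications | Construction of a chaos-based structured compressed sensing circulant measurement matrix |
US20170146787A1 (en) * | 2015-11-20 | 2017-05-25 | Integrated Dynamic Electron Solutions, Inc. | Temporal compressive sensing systems |
CN106911622A (zh) * | 2017-01-12 | 2017-06-30 | Chongqing University of Posts and Telecommunications | Channel estimation method for ACO-OFDM systems based on compressed sensing |
CN108418769A (zh) * | 2018-01-17 | 2018-08-17 | Nanjing University of Posts and Telecommunications | Sparsity-adaptive reconstruction method for distributed compressed sensing |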
Non-Patent Citations (11)
Title |
---|
Andrianiaina Ravelomanantsoa et al.: "Compressed Sensing: A Simple Deterministic Measurement Matrix and a Fast Recovery Algorithm", IEEE Transactions on Instrumentation and Measurement, vol. 64, no. 12, pages 3405-3413, XP011589161, DOI: 10.1109/TIM.2015.2459471 * |
Hassan Rabah et al.: "Compressed Sensing: A Simple Deterministic Measurement Matrix and a Fast Recovery Algorithm", IEEE * |
Hassan Rabah et al.: "Compressed Sensing: A Simple Deterministic Measurement Matrix and a Fast Recovery Algorithm", IEEE, 31 December 2015 (2015-12-31), pages 3405-3413, XP011589161, DOI: 10.1109/TIM.2015.2459471 * |
Yulong Gao et al.: "Eigenvalue-Based Spectrum Sensing for Multiple Received Signals Under the Non-Reconstruction Framework of Compressed Sensing", IEEE * |
Yulong Gao et al.: "Eigenvalue-Based Spectrum Sensing for Multiple Received Signals Under the Non-Reconstruction Framework of Compressed Sensing", IEEE, 31 December 2016 (2016-12-31), pages 4891-4901 * |
He Weijun (何伟俊): "Research on perception-based low-rate speech coding algorithms", China Doctoral Dissertations Full-text Database, Information Science and Technology, no. 02, pages 136-85 * |
Xu Qian (徐倩): "Research on optimal projection and reconstruction of sparsely represented speech signals", China Master's Theses Full-text Database, Information Science and Technology, no. 07, pages 136-294 * |
Li Qiang (李强) et al.: "A 4 kbit/s vocoder based on hybrid MELP/CELP", Journal of Chongqing University of Posts and Telecommunications * |
Li Qiang (李强) et al.: "A 4 kbit/s vocoder based on hybrid MELP/CELP", Journal of Chongqing University of Posts and Telecommunications, 30 April 2017 (2017-04-30), pages 143-148 * |
Gao Yue (高悦) et al.: "Compressed sensing of speech signals based on linear prediction analysis and difference transformation", Journal of Electronics & Information Technology * |
Gao Yue (高悦) et al.: "Compressed sensing of speech signals based on linear prediction analysis and difference transformation", Journal of Electronics & Information Technology, 30 June 2012 (2012-06-30), pages 1408-1418 * |
Also Published As
Publication number | Publication date |
---|---|
CN109545234B (zh) | 2023-09-26 |
Similar Documents
Publication | Title |
---|---|
Paliwal et al. | Efficient vector quantization of LPC parameters at 24 bits/frame |
US4868867A (en) | Vector excitation speech or audio coder for transmission or storage |
ES2433043T3 (es) | Switching from ACELP to TCX coding mode |
US9311926B2 (en) | Apparatus and method for determining weighting function having for associating linear predictive coding (LPC) coefficients with line spectral frequency coefficients and immittance spectral frequency coefficients |
US5732188A (en) | Method for the modification of LPC coefficients of acoustic signals |
US20070147518A1 (en) | Methods and devices for low-frequency emphasis during audio compression based on ACELP/TCX |
KR102461280B1 (ko) | Apparatus and method for determining a weighting function for quantizing linear predictive coding coefficients |
CN102623014A (zh) | Transform coding device and transform coding method |
EP1576585A1 (en) | Method and device for robust predictive vector quantization of linear prediction parameters in variable bit rate speech coding |
CN101140759A (zh) | Bandwidth extension method and system for speech or audio signals |
CN104981981B (zh) | Effective attenuation of pre-echo in digital audio signals |
US20050114123A1 (en) | Speech processing system and method |
CN109545234A (zh) | Speech line spectral frequency coding and adaptive fast reconstruction method based on compressed sensing |
JP3194930B2 (ja) | Speech coding device |
Vafin et al. | Towards optimal quantization in multistage audio coding |
CA2511516C (en) | Method and device for robust predictive vector quantization of linear prediction parameters in variable bit rate speech coding |
Dang et al. | DCT_M model for excitation parameter in low bit rate vocoder |
Ramabadran et al. | Speech Coding using Least-Squares Estimation |
Jin et al. | Effective complexity reduction in codebook search for ACELP? |
Gamliel et al. | Perceptual time varying linear prediction model for speech applications |
Vaalgamaa | Moving average vector quantization in speech coding |
Wang | An Efficient Dimension Reduction Quantization Scheme for Speech Vocal Parameters |
Bao | Harmonic excitation LPC (HE-LPC) speech coding at 2.3 kb/s |
Hedlund | Speech Coding Using Orthonormal Basis Functions |
Jamal et al. | Multi-channel implementation of G.729 A/B on FPGA |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2023-12-01. Address after: B21109-2, Eurasian Grand View, No. 67 Yangguang New Road, Shizhong District, Jinan City, Shandong Province, 250000. Patentee after: Jinan Lianken Information Technology Co.,Ltd. Address before: 518000 1104, Building A, Zhiyun Industrial Park, No. 13, Huaxing Road, Henglang Community, Longhua District, Shenzhen, Guangdong Province. Patentee before: Shenzhen Hongyue Enterprise Management Consulting Co.,Ltd. |
TR01 | Transfer of patent right | Effective date of registration: 2023-12-01. Address after: 518000 1104, Building A, Zhiyun Industrial Park, No. 13, Huaxing Road, Henglang Community, Longhua District, Shenzhen, Guangdong Province. Patentee after: Shenzhen Hongyue Enterprise Management Consulting Co.,Ltd. Address before: 400065 No. 2, Chongwen Road, Nan'an District, Chongqing. Patentee before: CHONGQING University OF POSTS AND TELECOMMUNICATIONS |