CN109545234A - 一种基于压缩感知的语音线谱频率编码及自适应快速重构方法 - Google Patents
一种基于压缩感知的语音线谱频率编码及自适应快速重构方法 Download PDFInfo
- Publication number
- CN109545234A CN109545234A CN201811268384.XA CN201811268384A CN109545234A CN 109545234 A CN109545234 A CN 109545234A CN 201811268384 A CN201811268384 A CN 201811268384A CN 109545234 A CN109545234 A CN 109545234A
- Authority
- CN
- China
- Prior art keywords
- superframe
- line spectral
- type
- spectral frequencies
- subframe
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000003595 spectral effect Effects 0.000 title claims abstract description 59
- 238000000034 method Methods 0.000 title claims abstract description 43
- 230000003044 adaptive effect Effects 0.000 title claims abstract description 13
- 239000011159 matrix material Substances 0.000 claims abstract description 45
- 238000013139 quantization Methods 0.000 claims abstract description 17
- 230000008447 perception Effects 0.000 claims abstract description 16
- 230000009466 transformation Effects 0.000 claims abstract description 10
- 230000000694 effects Effects 0.000 claims abstract description 5
- 230000006835 compression Effects 0.000 claims abstract description 3
- 238000007906 compression Methods 0.000 claims abstract description 3
- 238000001514 detection method Methods 0.000 claims abstract description 3
- 238000004364 calculation method Methods 0.000 claims description 4
- 239000000284 extract Substances 0.000 claims description 4
- 230000008569 process Effects 0.000 claims description 3
- 235000013399 edible fruits Nutrition 0.000 claims 1
- 238000010187 selection method Methods 0.000 claims 1
- 230000008901 benefit Effects 0.000 abstract description 3
- 238000012545 processing Methods 0.000 description 8
- 238000005070 sampling Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 230000005236 sound signal Effects 0.000 description 3
- 238000012938 design process Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000009432 framing Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/0017—Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/0018—Speech coding using phonetic or linguistical decoding of the source; Reconstruction using text-to-speech synthesis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (11)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811268384.XA CN109545234B (zh) | 2018-10-29 | 2018-10-29 | 一种基于压缩感知的语音线谱频率编码及自适应快速重构方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811268384.XA CN109545234B (zh) | 2018-10-29 | 2018-10-29 | 一种基于压缩感知的语音线谱频率编码及自适应快速重构方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109545234A true CN109545234A (zh) | 2019-03-29 |
CN109545234B CN109545234B (zh) | 2023-09-26 |
Family
ID=65845821
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811268384.XA Active CN109545234B (zh) | 2018-10-29 | 2018-10-29 | 一种基于压缩感知的语音线谱频率编码及自适应快速重构方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109545234B (zh) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103347268A (zh) * | 2013-06-05 | 2013-10-09 | 杭州电子科技大学 | 认知传感器网络中基于能量有效性观测的自适应压缩重构方法 |
WO2013188957A1 (en) * | 2012-06-18 | 2013-12-27 | University Health Network | Method and system for compressed sensing image reconstruction |
WO2016049544A1 (en) * | 2014-09-25 | 2016-03-31 | Northwestern University | Devices, methods, and systems relating to super resolution imaging |
US20160124903A1 (en) * | 2013-11-03 | 2016-05-05 | Brian G. Agee | Subspace-constrained partial update method for high-dimensional adaptive processing systems |
CN106500735A (zh) * | 2016-11-03 | 2017-03-15 | 重庆邮电大学 | 一种基于压缩感知的fbg信号自适应修复方法 |
CN106685428A (zh) * | 2016-12-30 | 2017-05-17 | 重庆邮电大学 | 基于混沌的结构化压缩感知循环观测矩阵的构造 |
US20170146787A1 (en) * | 2015-11-20 | 2017-05-25 | Integrated Dynamic Electron Solutions, Inc. | Temporal compressive sensing systems |
CN106911622A (zh) * | 2017-01-12 | 2017-06-30 | 重庆邮电大学 | 基于压缩感知的aco‑ofdm系统信道估计方法 |
CN108418769A (zh) * | 2018-01-17 | 2018-08-17 | 南京邮电大学 | 一种分布式压缩感知稀疏度自适应重建方法 |
-
2018
- 2018-10-29 CN CN201811268384.XA patent/CN109545234B/zh active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013188957A1 (en) * | 2012-06-18 | 2013-12-27 | University Health Network | Method and system for compressed sensing image reconstruction |
CN103347268A (zh) * | 2013-06-05 | 2013-10-09 | 杭州电子科技大学 | 认知传感器网络中基于能量有效性观测的自适应压缩重构方法 |
US20160124903A1 (en) * | 2013-11-03 | 2016-05-05 | Brian G. Agee | Subspace-constrained partial update method for high-dimensional adaptive processing systems |
WO2016049544A1 (en) * | 2014-09-25 | 2016-03-31 | Northwestern University | Devices, methods, and systems relating to super resolution imaging |
US20170146787A1 (en) * | 2015-11-20 | 2017-05-25 | Integrated Dynamic Electron Solutions, Inc. | Temporal compressive sensing systems |
CN106500735A (zh) * | 2016-11-03 | 2017-03-15 | 重庆邮电大学 | 一种基于压缩感知的fbg信号自适应修复方法 |
CN106685428A (zh) * | 2016-12-30 | 2017-05-17 | 重庆邮电大学 | 基于混沌的结构化压缩感知循环观测矩阵的构造 |
CN106911622A (zh) * | 2017-01-12 | 2017-06-30 | 重庆邮电大学 | 基于压缩感知的aco‑ofdm系统信道估计方法 |
CN108418769A (zh) * | 2018-01-17 | 2018-08-17 | 南京邮电大学 | 一种分布式压缩感知稀疏度自适应重建方法 |
Non-Patent Citations (11)
Title |
---|
ANDRIANIAINA RAVELOMANANTSOA等: "Compressed Sensing: A Simple Deterministic Measurement Matrix and a Fast Recovery Algorithm", IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, vol. 64, no. 12, pages 3405 - 3413, XP011589161, DOI: 10.1109/TIM.2015.2459471 * |
HASSAN RABAH,等: "Compressed Sensing: A Simple Deterministic Measurement Matrix and a Fast Recovery Algorithm", 《IEEE》 * |
HASSAN RABAH,等: "Compressed Sensing: A Simple Deterministic Measurement Matrix and a Fast Recovery Algorithm", 《IEEE》, 31 December 2015 (2015-12-31), pages 3405 - 3413, XP011589161, DOI: 10.1109/TIM.2015.2459471 * |
YULONG GAO,等: "Eigenvalue-Based Spectrum Sensing for Multiple Received Signals Under the Non-Reconstruction Framework of Compressed Sensing", 《IEEE》 * |
YULONG GAO,等: "Eigenvalue-Based Spectrum Sensing for Multiple Received Signals Under the Non-Reconstruction Framework of Compressed Sensing", 《IEEE》, 31 December 2016 (2016-12-31), pages 4891 - 4901 * |
何伟俊: "基于感知的低速率语音编码算法研究", 中国博士学位论文全文数据库信息科技辑, no. 02, pages 136 - 85 * |
徐倩: "稀疏表示的语音信号的最佳投影与其重构技术的研究", 中国优秀硕士学位论文全文数据库信息科技辑, no. 07, pages 136 - 294 * |
李强,等: "一种基于混合 MELP/CELP 的 4 kbit/s 声码器", 《重庆邮电大学学报》 * |
李强,等: "一种基于混合 MELP/CELP 的 4 kbit/s 声码器", 《重庆邮电大学学报》, 30 April 2017 (2017-04-30), pages 143 - 148 * |
高悦,等: "基于线性预测分析和差分变换的语音信号压缩感知", 《通电子与信息学报》 * |
高悦,等: "基于线性预测分析和差分变换的语音信号压缩感知", 《通电子与信息学报》, 30 June 2012 (2012-06-30), pages 1408 - 1418 * |
Also Published As
Publication number | Publication date |
---|---|
CN109545234B (zh) | 2023-09-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US4868867A (en) | Vector excitation speech or audio coder for transmission or storage | |
CN103778919B (zh) | 基于压缩感知和稀疏表示的语音编码方法 | |
ES2433043T3 (es) | Conmutación del modo de codificación ACELP a TCX | |
US20170358309A1 (en) | Apparatus and method for determining weighting function having for associating linear predictive coding (lpc) coefficients with line spectral frequency coefficients and immittance spectral frequency coefficients | |
KR102461280B1 (ko) | 선형 예측 부호화 계수를 양자화하기 위한 가중치 함수 결정 장치 및 방법 | |
CN102623014A (zh) | 变换编码装置和变换编码方法 | |
EP3125241B1 (en) | Method and device for quantization of linear prediction coefficient and method and device for inverse quantization | |
EP1576585A1 (en) | Method and device for robust predictive vector quantization of linear prediction parameters in variable bit rate speech coding | |
CN111656445B (zh) | 解码器处的噪声衰减 | |
EP1513137A1 (en) | Speech processing system and method with multi-pulse excitation | |
CN103999153B (zh) | 用于以带选择的方式量化语音信号的方法和设备 | |
CN109545234A (zh) | 一种基于压缩感知的语音线谱频率编码及自适应快速重构方法 | |
JP3194930B2 (ja) | 音声符号化装置 | |
Vafin et al. | Towards optimal quantization in multistage audio coding | |
Arney et al. | Use case demonstration: X-ray/ventilator | |
Dang et al. | DCT_M model for excitation parameter in low bit rate vocoder | |
Li et al. | Quantization of SEW and REW magnitude for 2 kb/s waveform interpolation speech coding | |
Vaalgamaa | Moving average vector quantization in speech coding | |
Bao | Harmonic excitation LPC (HE-LPC) speech coding at 2.3 kb/s | |
Mohammadi | Combined scalar-vector quantization: a new spectral coding method for low rate speech coding | |
Li et al. | An improved 1.2 kb/s speech coder based on MELP | |
Wang | An Efficient Dimension Reduction Quantization Scheme for Speech Vocal Parameters | |
Jamal et al. | Multi-channel implementation of G. 729 A/B on FPGA | |
Hedlund | Speech Coding Using Orthonormal Basis Functions | |
Khare et al. | Generation of Excitation Signal in Voice Excited Linear Predictive Coding using Discrete Cosine Transform |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20231201 Address after: B21109-2, Eurasian Grand View, No. 67 Yangguang New Road, Shizhong District, Jinan City, Shandong Province, 250000 Patentee after: Jinan Lianken Information Technology Co.,Ltd. Address before: 518000 1104, Building A, Zhiyun Industrial Park, No. 13, Huaxing Road, Henglang Community, Longhua District, Shenzhen, Guangdong Province Patentee before: Shenzhen Hongyue Enterprise Management Consulting Co.,Ltd. Effective date of registration: 20231201 Address after: 518000 1104, Building A, Zhiyun Industrial Park, No. 13, Huaxing Road, Henglang Community, Longhua District, Shenzhen, Guangdong Province Patentee after: Shenzhen Hongyue Enterprise Management Consulting Co.,Ltd. Address before: 400065 No. 2, Chongwen Road, Nan'an District, Chongqing Patentee before: CHONGQING University OF POSTS AND TELECOMMUNICATIONS |
|
TR01 | Transfer of patent right |