WO2011071335A3 - Method and apparatus for encoding a speech signal - Google Patents

Method and apparatus for encoding a speech signal

Info

Publication number
WO2011071335A3
Authority
WO
WIPO (PCT)
Prior art keywords
current frame
linear prediction
quantized spectrum
acquired
encoding
Prior art date
Application number
PCT/KR2010/008848
Other languages
English (en)
French (fr)
Other versions
WO2011071335A2 (ko)
Inventor
전혜정
김대환
정규혁
이민기
강홍구
이병석
김락용
Original Assignee
엘지전자 주식회사
연세대학교 산학협력단
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2009-12-10
Filing date
2010-12-10
Publication date
2011-11-03
Application filed by 엘지전자 주식회사, 연세대학교 산학협력단
Priority to CN201080056249.4A (CN102656629B)
Priority to EP10836230.2A (EP2511904A4)
Priority to KR1020127017163A (KR101789632B1)
Priority to US13/514,613 (US9076442B2)
Publication of WO2011071335A2
Publication of WO2011071335A3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G10L19/07 Line spectrum pair [LSP] vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09 Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/10 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
    • G10L19/107 Sparse pulse excitation, e.g. by using algebraic codebook
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks
    • G10L2019/0007 Codebook element generation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks
    • G10L2019/0007 Codebook element generation
    • G10L2019/001 Interpolation of codebook vectors
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks
    • G10L2019/0013 Codebook search algorithms
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks
    • G10L2019/0016 Codebook for LPC parameters

Abstract

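According to the present invention, a linear prediction filter coefficient of a current frame is acquired from an input signal using linear prediction; a quantized spectrum candidate vector of the current frame, corresponding to the linear prediction filter coefficient of the current frame, is acquired on the basis of first best information; and the quantized spectrum candidate vector of the current frame is interpolated with a quantized spectrum vector of a previous frame. Compared with conventional step-by-step optimization techniques, this makes it possible to find an optimal parameter that minimizes the quantization error.
PCT/KR2010/008848 2009-12-10 2010-12-10 Method and apparatus for encoding a speech signal WO2011071335A2 (ko)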
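
The two steps named in the abstract, acquiring quantized spectrum candidate vectors and interpolating a candidate with the previous frame's quantized spectrum vector, can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the patent text: the function names, the squared-error distance, the toy codebook, the reading of "first best information" as the number of candidates to keep, and the linear per-subframe weights. The conversion from linear prediction filter coefficients to a spectrum vector (for example LSP) is omitted, and the later stage that would choose among the candidates is not shown.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def squared_error(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of squared differences between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def n_best_candidates(
    target: Sequence[float],
    codebook: Sequence[Sequence[float]],
    n_best: int,
) -> List[Tuple[int, Vector]]:
    """Return the n_best codebook entries closest to the target vector.

    Assumption: "first best information" is read here as the number of
    candidates to keep (n_best). Each result is (codebook index, vector).
    """
    if n_best < 1:
        raise ValueError("n_best must be at least 1")
    ranked = sorted(
        range(len(codebook)),
        key=lambda i: squared_error(target, codebook[i]),
    )
    return [(i, list(codebook[i])) for i in ranked[:n_best]]


def interpolate_subframes(
    prev_quantized: Sequence[float],
    candidate: Sequence[float],
    num_subframes: int,
) -> List[Vector]:
    """Linearly interpolate from the previous frame's quantized vector
    to a candidate vector of the current frame, one vector per subframe.

    Assumption: weights k / num_subframes for k = 1..num_subframes, so the
    last subframe uses the candidate itself. A real codec may use other weights.
    """
    if num_subframes < 1:
        raise ValueError("num_subframes must be at least 1")
    result = []
    for k in range(1, num_subframes + 1):
        w = k / num_subframes
        result.append(
            [(1.0 - w) * p + w * c for p, c in zip(prev_quantized, candidate)]
        )
    return result


if __name__ == "__main__":
    # Hypothetical two-dimensional codebook and target spectrum vector.
    codebook = [[0.1, 0.5], [0.2, 0.6], [0.9, 1.2]]
    target = [0.19, 0.58]
    prev_quantized = [0.0, 0.4]

    for index, candidate in n_best_candidates(target, codebook, n_best=2):
        subframes = interpolate_subframes(prev_quantized, candidate, 4)
        print(index, subframes)
```

Keeping several candidates instead of only the nearest one lets a later stage evaluate each interpolated set before committing to a single vector, which is one way to read the abstract's contrast with step-by-step optimization.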

Priority Applications (4)

Application Number Priority Date Filing Date Title
CN201080056249.4A CN102656629B (zh) 2009-12-10 2010-12-10 Method and device for encoding a speech signal
EP10836230.2A EP2511904A4 (en) 2009-12-10 2010-12-10 METHOD AND APPARATUS FOR ENCODING A SPEECH SIGNAL
KR1020127017163A KR101789632B1 (ko) 2009-12-10 2010-12-10 Method and apparatus for encoding a speech signal
US13/514,613 US9076442B2 (en) 2009-12-10 2010-12-10 Method and apparatus for encoding a speech signal

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
US28518409P 2009-12-10 2009-12-10
US61/285,184 2009-12-10
US29516510P 2010-01-15 2010-01-15
US61/295,165 2010-01-15
US32188310P 2010-04-08 2010-04-08
US61/321,883 2010-04-08
US34822510P 2010-05-25 2010-05-25
US61/348,225 2010-05-25

Publications (2)

Publication Number Publication Date
WO2011071335A2 WO2011071335A2 (ko) 2011-06-16
WO2011071335A3 WO2011071335A3 (ko) 2011-11-03

Family

ID=44146063

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2010/008848 WO2011071335A2 (ko) 2009-12-10 2010-12-10 Method and apparatus for encoding a speech signal

Country Status (5)

Country Link
US (1) US9076442B2 (ko)
EP (1) EP2511904A4 (ko)
KR (1) KR101789632B1 (ko)
CN (1) CN102656629B (ko)
WO (1) WO2011071335A2 (ko)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9728200B2 (en) * 2013-01-29 2017-08-08 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for adaptive formant sharpening in linear prediction coding
PL3594946T3 (pl) * 2014-05-01 2021-03-08 Nippon Telegraph And Telephone Corporation Decoding of a sound signal

Citations (3)

Publication number Priority date Publication date Assignee Title
KR960015861B1 (ko) * 1993-12-18 1996-11-22 휴우즈 에어크라프트 캄파니 선 스펙트럼 주파수 벡터의 양자화 방법 및 양자화기
KR20010084468A (ko) * 2000-02-25 2001-09-06 대표이사 서승모 음성 부호화기의 lsp 양자화기를 위한 고속 탐색 방법
KR20090117877A (ko) * 2007-03-02 2009-11-13 파나소닉 주식회사 부호화 장치 및 부호화 방법

Family Cites Families (6)

Publication number Priority date Publication date Assignee Title
US6108624A (en) 1997-09-10 2000-08-22 Samsung Electronics Co., Ltd. Method for improving performance of a voice coder
US7072832B1 (en) * 1998-08-24 2006-07-04 Mindspeed Technologies, Inc. System for speech encoding having an adaptive encoding arrangement
US6574593B1 (en) * 1999-09-22 2003-06-03 Conexant Systems, Inc. Codebook tables for encoding and decoding
US7389227B2 (en) 2000-01-14 2008-06-17 C & S Technology Co., Ltd. High-speed search method for LSP quantizer using split VQ and fixed codebook of G.729 speech encoder
US7003454B2 (en) 2001-05-16 2006-02-21 Nokia Corporation Method and system for line spectral frequency vector quantization in speech codec
CN1975861B (zh) * 2006-12-15 2011-06-29 清华大学 声码器基音周期参数抗信道误码方法

Non-Patent Citations (1)

Title
See also references of EP2511904A4 *

Also Published As

Publication number Publication date
EP2511904A4 (en) 2013-08-21
KR101789632B1 (ko) 2017-10-25
CN102656629A (zh) 2012-09-05
KR20120109539A (ko) 2012-10-08
EP2511904A2 (en) 2012-10-17
WO2011071335A2 (ko) 2011-06-16
CN102656629B (zh) 2014-11-26
US20120245930A1 (en) 2012-09-27
US9076442B2 (en) 2015-07-07

Similar Documents

Publication Publication Date Title
WO2011013982A3 (en) A method and an apparatus for processing an audio signal
MY191376A (en) Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
MX355346B (es) Method for encoding an image, method for decoding an image, image encoder and image decoder.
SG10201401664XA (en) Apparatus and method for determining weighting function having low complexity for linear predictive coding (lpc) coefficients quantization
WO2012144877A3 (en) Apparatus for quantizing linear predictive coding coefficients, sound encoding apparatus, apparatus for de-quantizing linear predictive coding coefficients, sound decoding apparatus, and electronic device therefor
WO2009126915A8 (en) Rate-distortion defined interpolation for video coding based on fixed filter or adaptive filter
WO2011101442A3 (en) Data compression for video
EP4246511A3 (en) Method and apparatus for compressing and decompressing a higher order ambisonics signal representation
RU2012147587A (ru) Audio encoder, audio decoder and related methods for processing multichannel audio signals using complex prediction
WO2011053021A3 (en) Method and apparatus for encoding and decoding image by using rotational transform
WO2010087589A3 (ko) Method and apparatus for processing a video signal using boundary intra coding
WO2012057583A3 (ko) Method for encoding and method for decoding image information
CA2998689C (en) Encoder and method for encoding an audio signal with reduced background noise using linear predictive coding
MX363348B (es) Encoder, decoder, and method for encoding and decoding.
WO2012144830A3 (en) Methods and apparatuses for encoding and decoding image using adaptive filtering
WO2014055826A3 (en) Improved architecture for hybrid video codec
MX2016003902A (es) Resampling of an audio signal for low-delay encoding/decoding.
EP4274101A3 (en) Method and device for arithmetic encoding or arithmetic decoding
MX355091B (es) Concept for encoding an audio signal and decoding an audio signal using speech-related spectral shaping information.
TW201129967A (en) Method and apparatus for compression or decompression of digital signals
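WO2009104914A3 (ko) Method and apparatus for encoding and decoding an image
WO2013048171A3 (ko) Method for encoding a speech signal, method for decoding a speech signal, and apparatus using the same
WO2011126340A3 (ko) Method and apparatus for processing an audio signal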
WO2009131406A3 (en) Decoding image
WO2012070866A3 (ko) Method for encoding and method for decoding a speech signal

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201080056249.4

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10836230

Country of ref document: EP

Kind code of ref document: A1

REEP Request for entry into the european phase

Ref document number: 2010836230

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2010836230

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 13514613

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20127017163

Country of ref document: KR

Kind code of ref document: A