WO2009057327A1 - Encoding device and decoding device - Google Patents

Encoding device and decoding device

Info

Publication number
WO2009057327A1
Authority
WO
WIPO (PCT)
Prior art keywords
icp
reference signal
band portion
frequency coefficient
candidates
Prior art date
Application number
PCT/JP2008/003151
Other languages
English (en)
French (fr)
Inventor
Haishan Zhong
Zongxian Liu
Kok Seng Chong
Koji Yoshida
Original Assignee
Panasonic Corporation
Priority date
Filing date
Publication date
Application filed by Panasonic Corporation
Priority to US12/740,020 (US8374883B2)
Priority to CN2008801137288A (CN101842832B)
Priority to EP08845514.2A (EP2209114B1)
Priority to JP2009538954A (JP5413839B2)
Publication of WO2009057327A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/24 Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques

Abstract

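 An encoding device that improves the prediction performance of inter-channel prediction (ICP) in scalable stereo speech coding that uses ICP. In this encoding device, ICP analysis units (113, 114, 115) respectively take the frequency coefficients sL'(f) of the low-band portion of the side residual signal, the frequency coefficients mM,i(f) of each sub-band portion of the monaural residual signal, and the frequency coefficients mL(f) of the low-band portion of the monaural residual signal as reference signal candidates, perform ICP analysis between each candidate and the frequency coefficients sM,i(f) of each sub-band portion of the side residual signal, and generate first, second, and third ICP coefficients. A selection unit (116) selects the optimal reference signal from among the reference signal candidates by checking the relationship between each candidate and the frequency coefficients sM,i(f) of each sub-band portion of the side residual signal, and outputs a reference signal ID indicating the selected reference signal, together with the ICP coefficients corresponding to that reference signal, to an ICP parameter quantization unit (117).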
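The candidate-and-select flow described in the abstract can be illustrated with a minimal Python sketch. Several points here are assumptions made for illustration and are not stated in the abstract: ICP is modelled as a single least-squares gain per sub-band, the "relationship" checked by the selection unit is taken to be the squared prediction error, every candidate is assumed to have already been brought to the same length as the target sub-band, and the names icp_gain, prediction_error, and select_reference are hypothetical.

```python
from typing import Dict, Sequence, Tuple


def icp_gain(reference: Sequence[float], target: Sequence[float]) -> float:
    """Least-squares gain g that minimizes sum((target - g * reference)^2).

    Assumption: a first-order (single-coefficient) predictor stands in for
    the ICP analysis; the abstract does not state the predictor order.
    """
    energy = sum(r * r for r in reference)
    if energy == 0.0:
        return 0.0  # silent reference: nothing to predict from
    return sum(r * t for r, t in zip(reference, target)) / energy


def prediction_error(reference: Sequence[float],
                     target: Sequence[float],
                     gain: float) -> float:
    """Squared error left after predicting target as gain * reference."""
    return sum((t - gain * r) ** 2 for r, t in zip(reference, target))


def select_reference(candidates: Dict[int, Sequence[float]],
                     target: Sequence[float]) -> Tuple[int, float]:
    """Return (reference signal ID, ICP gain) of the best candidate.

    Assumption: "best" means smallest squared prediction error.
    """
    best_id, best_gain, best_err = -1, 0.0, float("inf")
    for ref_id, ref in candidates.items():
        gain = icp_gain(ref, target)
        err = prediction_error(ref, target, gain)
        if err < best_err:
            best_id, best_gain, best_err = ref_id, gain, err
    return best_id, best_gain


# Toy coefficients for one sub-band i of the side residual, sM,i(f).
s_m_i = [1.0, 2.0, 3.0, 4.0]
candidates = {
    0: [1.0, 0.0, -1.0, 0.0],  # stands in for sL'(f), side residual low band
    1: [2.0, 4.0, 6.0, 8.0],   # stands in for mM,i(f), mono residual sub-band
    2: [1.0, 1.0, 1.0, 1.0],   # stands in for mL(f), mono residual low band
}
ref_id, gain = select_reference(candidates, s_m_i)
print(ref_id, gain)
```

In this toy data candidate 1 is an exact scaled copy of the target, so it is the one selected. In the device described above, the pair (reference signal ID, ICP coefficients) is what the selection unit passes on to the ICP parameter quantization unit.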
PCT/JP2008/003151 2007-10-31 2008-10-31 Encoding device and decoding device WO2009057327A1 (ja)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US12/740,020 US8374883B2 (en) 2007-10-31 2008-10-31 Encoder and decoder using inter channel prediction based on optimally determined signals
CN2008801137288A CN101842832B (zh) 2007-10-31 2008-10-31 Encoding device and decoding device
EP08845514.2A EP2209114B1 (en) 2007-10-31 2008-10-31 Speech coding/decoding apparatus/method
JP2009538954A JP5413839B2 (ja) 2007-10-31 2008-10-31 Encoding device and decoding device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2007-284622 2007-10-31
JP2007284622 2007-10-31

Publications (1)

Publication Number Publication Date
WO2009057327A1 (ja) 2009-05-07

Family

ID=40590731

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2008/003151 WO2009057327A1 (ja) 2007-10-31 2008-10-31 Encoding device and decoding device

Country Status (5)

Country Link
US (1) US8374883B2 (ja)
EP (1) EP2209114B1 (ja)
JP (1) JP5413839B2 (ja)
CN (1) CN101842832B (ja)
WO (1) WO2009057327A1 (ja)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2427881A4 (en) * 2009-05-08 2016-04-20 Nokia Technologies Oy MULTICANAL AUDIO PROCESSING
US10885922B2 (en) 2017-07-03 2021-01-05 Qualcomm Incorporated Time-domain inter-channel prediction

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8359196B2 (en) * 2007-12-28 2013-01-22 Panasonic Corporation Stereo sound decoding apparatus, stereo sound encoding apparatus and lost-frame compensating method
US8140723B2 (en) * 2008-11-04 2012-03-20 Renesas Electronics America Inc. Digital I/O signal scheduler
JP5525540B2 (ja) 2009-10-30 2014-06-18 Panasonic Corporation Encoding device and encoding method
WO2012004998A1 (ja) 2010-07-06 2012-01-12 Panasonic Corporation Device and method for efficiently encoding quantization parameters of spectral coefficient coding
ES2526320T3 (es) * 2010-08-24 2015-01-09 Dolby International Ab Concealment of intermittent mono reception of FM stereo radio receivers
JP5841147B2 (ja) * 2011-07-01 2016-01-13 Panasonic Intellectual Property Corporation of America Receiving device, transmitting device, setting method, and specifying method
US9779731B1 (en) * 2012-08-20 2017-10-03 Amazon Technologies, Inc. Echo cancellation based on shared reference signals
WO2014126688A1 (en) 2013-02-14 2014-08-21 Dolby Laboratories Licensing Corporation Methods for audio signal transient detection and decorrelation control
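TWI618051B (zh) * 2013-02-14 2018-03-11 Dolby Laboratories Licensing Corporation Audio signal processing method and apparatus for audio signal enhancement using estimated spatial parameters
JP6046274B2 (ja) 2013-02-14 2016-12-14 Dolby Laboratories Licensing Corporation Method for controlling inter-channel coherence of upmixed audio signals
TWI618050B (zh) 2013-02-14 2018-03-11 Dolby Laboratories Licensing Corporation Method and apparatus for signal decorrelation in an audio processing system
ES2641538T3 (es) 2013-09-12 2017-11-10 Dolby International Ab Coding of multichannel audio content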
US10147441B1 (en) 2013-12-19 2018-12-04 Amazon Technologies, Inc. Voice controlled system
US10734001B2 (en) * 2017-10-05 2020-08-04 Qualcomm Incorporated Encoding or decoding of audio signals
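CN114708874A (zh) * 2018-05-31 2022-07-05 Huawei Technologies Co., Ltd. Method and apparatus for encoding stereo signals
CN110719564B (zh) * 2018-07-13 2021-06-08 Hisense Visual Technology Co., Ltd. Sound effect processing method and apparatus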

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
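JP2004151433A (ja) * 2002-10-31 2004-05-27 Nippon Telegr & Teleph Corp <Ntt> Encoding method, decoding method, encoding device, decoding device, encoding program, decoding program
JP2006350361A (ja) * 1998-10-13 2006-12-28 Victor Co Of Japan Ltd Audio signal transmission method and audio signal decoding method
JP2007017982A (ja) * 2006-07-07 2007-01-25 Victor Co Of Japan Ltd Audio encoding method, audio decoding method, audio receiving device, and audio signal transmission method
JP2007279385A (ja) * 2006-04-06 2007-10-25 Nippon Telegr & Teleph Corp <Ntt> Multichannel encoding method, device therefor, program therefor, and recording medium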

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5434948A (en) * 1989-06-15 1995-07-18 British Telecommunications Public Limited Company Polyphonic coding
JP3343962B2 (ja) * 1992-11-11 2002-11-11 Sony Corporation High-efficiency encoding method and apparatus
DE4320990B4 (de) 1993-06-05 2004-04-29 Robert Bosch Gmbh Method for redundancy reduction
DE19526366A1 (de) 1995-07-20 1997-01-23 Bosch Gmbh Robert Method for redundancy reduction in the coding of multichannel signals and device for decoding redundancy-reduced multichannel signals
US5812971A (en) * 1996-03-22 1998-09-22 Lucent Technologies Inc. Enhanced joint stereo coding method using temporal envelope shaping
SE512719C2 (sv) * 1997-06-10 2000-05-02 Lars Gustaf Liljeryd A method and device for data rate reduction based on harmonic bandwidth expansion
SE519552C2 (sv) * 1998-09-30 2003-03-11 Ericsson Telefon Ab L M Multichannel signal encoding and decoding
US6463410B1 (en) * 1998-10-13 2002-10-08 Victor Company Of Japan, Ltd. Audio signal processing apparatus
US7240001B2 (en) * 2001-12-14 2007-07-03 Microsoft Corporation Quality improvement techniques in an audio encoder
US7191136B2 (en) * 2002-10-01 2007-03-13 Ibiquity Digital Corporation Efficient coding of high frequency signal information in a signal using a linear/non-linear prediction model based on a low pass baseband
WO2004098105A1 (en) * 2003-04-30 2004-11-11 Nokia Corporation Support of a multichannel audio extension
DE602004028171D1 (de) * 2004-05-28 2010-08-26 Nokia Corp Multichannel audio extension
ATE442644T1 (de) * 2004-08-26 2009-09-15 Panasonic Corp Multichannel signal decoding
SE0402652D0 (sv) * 2004-11-02 2004-11-02 Coding Tech Ab Methods for improved performance of prediction based multi- channel reconstruction
EP1798724B1 (en) * 2004-11-05 2014-06-18 Panasonic Corporation Encoder, decoder, encoding method, and decoding method
WO2006070760A1 (ja) * 2004-12-28 2006-07-06 Matsushita Electric Industrial Co., Ltd. Scalable encoding device and scalable encoding method
US7903824B2 (en) * 2005-01-10 2011-03-08 Agere Systems Inc. Compact side information for parametric coding of spatial audio
WO2006091139A1 (en) * 2005-02-23 2006-08-31 Telefonaktiebolaget Lm Ericsson (Publ) Adaptive bit allocation for multi-channel audio encoding
US8433581B2 (en) * 2005-04-28 2013-04-30 Panasonic Corporation Audio encoding device and audio encoding method
US8271275B2 (en) * 2005-05-31 2012-09-18 Panasonic Corporation Scalable encoding device, and scalable encoding method
KR101340233B1 (ko) * 2005-08-31 2013-12-10 Panasonic Corporation Stereo encoding device, stereo decoding device, and stereo encoding method
EP1953736A4 (en) * 2005-10-31 2009-08-05 Panasonic Corp STEREO CODING DEVICE AND METHOD FOR PREDICTING STEREO SIGNAL
WO2007116809A1 (ja) * 2006-03-31 2007-10-18 Matsushita Electric Industrial Co., Ltd. Stereo speech encoding device, stereo speech decoding device, and methods thereof
DE102006055737A1 (de) * 2006-11-25 2008-05-29 Deutsche Telekom Ag Method for scalable coding of stereo signals

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
S. MINAMI; O. OKADA: "Stereophonic ADPCM voice coding method", PROC. ICASSP'90, April 1990 (1990-04-01)
SEAN A. RAMPRASHAD: "The multimode transform predictive coding paradigm", IEEE TRAN. SPEECH AND AUDIO PROCESSING, vol. 11, March 2003 (2003-03-01), pages 117 - 129
See also references of EP2209114A4
WAI C. CHU, SPEECH CODING ALGORITHMS: FOUNDATION AND EVOLUTION OF STANDARDIZED CODERS, 2003
YE WANG; MIIKKA VILERMO: "The modified discrete cosine transform: its implications for audio coding and error concealment", AES 22ND INTERNATIONAL CONFERENCE ON VIRTUAL, SYNTHETIC AND ENTERTAINMENT, 2002

Also Published As

Publication number Publication date
JP5413839B2 (ja) 2014-02-12
JPWO2009057327A1 (ja) 2011-03-10
EP2209114A1 (en) 2010-07-21
US20100250244A1 (en) 2010-09-30
CN101842832B (zh) 2012-11-07
EP2209114B1 (en) 2014-05-14
EP2209114A4 (en) 2011-09-28
CN101842832A (zh) 2010-09-22
US8374883B2 (en) 2013-02-12

Similar Documents

Publication Publication Date Title
WO2009057327A1 (ja) Encoding device and decoding device
USRE49549E1 (en) Audio or video encoder, audio or video decoder and related methods for processing multi-channel audio or video signals using a variable prediction direction
KR101945309B1 (ko) Apparatus and method for encoding/decoding using phase information and residual signal
KR100954179B1 (ko) Near-transparent or transparent multi-channel encoder/decoder scheme
CA2804907C (en) Audio encoder, audio decoder and related methods for processing multi-channel audio signals using complex prediction
DE602006015294D1 (de) Multichannel audio coding
CA2645911A1 (en) Method for encoding and decoding object-based audio signal and apparatus thereof
MX2010004220A (es) Audio coding using downmix.
TW201120874A (en) Audio signal decoder, audio signal encoder, method for providing an upmix signal representation, method for providing a downmix signal representation, computer program and bitstream using a common inter-object-correlation parameter value
JP5977434B2 (ja) Method for parametric spatial audio encoding and decoding, parametric spatial audio encoder, and parametric spatial audio decoder
MX2011011399A (es) Apparatus for providing one or more adjusted parameters for a provision of an upmix signal representation on the basis of a downmix signal representation, audio signal decoder, audio signal transcoder, audio signal encoder, audio bitstream, method and computer program using object-related parametric information.
CN102884570A (zh) MDCT-based complex prediction stereo coding
AU2014267408B2 (en) Audio object separation from mixture signal using object-specific time/frequency resolutions
WO2008126382A1 (ja) Encoding device and encoding method
WO2009048239A2 (en) Encoding and decoding method using variable subband analysis and apparatus thereof
KR20060109299A (ko) Method for encoding and decoding subband-wise spatial information for multi-channel audio signals
RU2804032C1 (ru) Audio signal processing device for encoding a stereo signal into a bitstream signal, and method for decoding a bitstream signal into a stereo signal carried out using the audio signal processing device

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880113728.8

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08845514

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2009538954

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 12740020

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2008845514

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE