SG11201502611TA - Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding - Google Patents

Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding

Info

Publication number
SG11201502611TA
SG11201502611TA SG11201502611TA SG11201502611TA SG11201502611TA SG 11201502611T A SG11201502611T A SG 11201502611TA SG 11201502611T A SG11201502611T A SG 11201502611TA SG 11201502611T A SG11201502611T A SG 11201502611TA SG 11201502611T A SG11201502611T A SG 11201502611TA
Authority
SG
Singapore
Prior art keywords
decoder
encoder
transform
methods
signal
Prior art date
Application number
SG11201502611TA
Other languages
English (en)
Inventor
Sascha Disch
Jouni Paulus
Bernd Edler
Oliver Hellmuth
Jürgen Herre
Thorsten Kastner
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of SG11201502611TA publication Critical patent/SG11201502611TA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • G10L19/0208Subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • G10L19/025Detection of transients or attacks for time/frequency resolution switching
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mathematical Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereophonic System (AREA)
SG11201502611TA 2012-10-05 2013-10-02 Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding SG11201502611TA (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201261710133P 2012-10-05 2012-10-05
EP13167487.1A EP2717262A1 (en) 2012-10-05 2013-05-13 Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding
PCT/EP2013/070550 WO2014053547A1 (en) 2012-10-05 2013-10-02 Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding

Publications (1)

Publication Number Publication Date
SG11201502611TA true SG11201502611TA (en) 2015-05-28

Family

ID=48325509

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201502611TA SG11201502611TA (en) 2012-10-05 2013-10-02 Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding

Country Status (17)

Country Link
US (2) US10152978B2 (zh)
EP (4) EP2717262A1 (zh)
JP (2) JP6268180B2 (zh)
KR (2) KR101685860B1 (zh)
CN (2) CN105190747B (zh)
AR (2) AR092929A1 (zh)
AU (1) AU2013326526B2 (zh)
BR (2) BR112015007649B1 (zh)
CA (2) CA2887028C (zh)
ES (2) ES2880883T3 (zh)
HK (1) HK1213361A1 (zh)
MX (2) MX351359B (zh)
MY (1) MY178697A (zh)
RU (2) RU2639658C2 (zh)
SG (1) SG11201502611TA (zh)
TW (2) TWI541795B (zh)
WO (2) WO2014053548A1 (zh)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2717262A1 (en) 2012-10-05 2014-04-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding
EP2804176A1 (en) * 2013-05-13 2014-11-19 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio object separation from mixture signal using object-specific time/frequency resolutions
EP3005353B1 (en) * 2013-05-24 2017-08-16 Dolby International AB Efficient coding of audio scenes comprising audio objects
KR102243395B1 (ko) * 2013-09-05 2021-04-22 한국전자통신연구원 오디오 부호화 장치 및 방법, 오디오 복호화 장치 및 방법, 오디오 재생 장치
US20150100324A1 (en) * 2013-10-04 2015-04-09 Nvidia Corporation Audio encoder performance for miracast
CN106409303B (zh) 2014-04-29 2019-09-20 华为技术有限公司 处理信号的方法及设备
CN105336335B (zh) 2014-07-25 2020-12-08 杜比实验室特许公司 利用子带对象概率估计的音频对象提取
SG11201706101RA (en) * 2015-02-02 2017-08-30 Fraunhofer Ges Forschung Apparatus and method for processing an encoded audio signal
EP3067885A1 (en) 2015-03-09 2016-09-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for encoding or decoding a multi-channel signal
WO2017064264A1 (en) * 2015-10-15 2017-04-20 Huawei Technologies Co., Ltd. Method and appratus for sinusoidal encoding and decoding
GB2544083B (en) * 2015-11-05 2020-05-20 Advanced Risc Mach Ltd Data stream assembly control
US9711121B1 (en) * 2015-12-28 2017-07-18 Berggram Development Oy Latency enhanced note recognition method in gaming
US9640157B1 (en) * 2015-12-28 2017-05-02 Berggram Development Oy Latency enhanced note recognition method
US10269360B2 (en) * 2016-02-03 2019-04-23 Dolby International Ab Efficient format conversion in audio coding
US10210874B2 (en) * 2017-02-03 2019-02-19 Qualcomm Incorporated Multi channel coding
CN113242508B (zh) 2017-03-06 2022-12-06 杜比国际公司 基于音频数据流渲染音频输出的方法、解码器系统和介质
CN108694955B (zh) * 2017-04-12 2020-11-17 华为技术有限公司 多声道信号的编解码方法和编解码器
WO2018201112A1 (en) 2017-04-28 2018-11-01 Goodwin Michael M Audio coder window sizes and time-frequency transformations
CN109427337B (zh) * 2017-08-23 2021-03-30 华为技术有限公司 立体声信号编码时重建信号的方法和装置
US10856755B2 (en) * 2018-03-06 2020-12-08 Ricoh Company, Ltd. Intelligent parameterization of time-frequency analysis of encephalography signals
TWI658458B (zh) * 2018-05-17 2019-05-01 張智星 歌聲分離效能提升之方法、非暫態電腦可讀取媒體及電腦程式產品
GB2577885A (en) 2018-10-08 2020-04-15 Nokia Technologies Oy Spatial audio augmentation and reproduction
BR112021025265A2 (pt) * 2019-06-14 2022-03-15 Fraunhofer Ges Forschung Sintetizador de áudio, codificador de áudio, sistema, método e unidade de armazenamento não transitória
EP4229631A2 (en) * 2020-10-13 2023-08-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for encoding a plurality of audio objects and apparatus and method for decoding using two or more relevant audio objects
CN113453114B (zh) * 2021-06-30 2023-04-07 Oppo广东移动通信有限公司 编码控制方法、装置、无线耳机及存储介质
CN114127844A (zh) * 2021-10-21 2022-03-01 北京小米移动软件有限公司 一种信号编解码方法、装置、编码设备、解码设备及存储介质

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3175446B2 (ja) * 1993-11-29 2001-06-11 ソニー株式会社 情報圧縮方法及び装置、圧縮情報伸張方法及び装置、圧縮情報記録/伝送装置、圧縮情報再生装置、圧縮情報受信装置、並びに記録媒体
DE60326782D1 (de) * 2002-04-22 2009-04-30 Koninkl Philips Electronics Nv Dekodiervorrichtung mit Dekorreliereinheit
US7272567B2 (en) * 2004-03-25 2007-09-18 Zoran Fejzo Scalable lossless audio codec and authoring tool
KR100608062B1 (ko) * 2004-08-04 2006-08-02 삼성전자주식회사 오디오 데이터의 고주파수 복원 방법 및 그 장치
CN101312041B (zh) * 2004-09-17 2011-05-11 广州广晟数码技术有限公司 多声道数字音频编码设备及其方法
US7630902B2 (en) * 2004-09-17 2009-12-08 Digital Rise Technology Co., Ltd. Apparatus and methods for digital audio coding using codebook application ranges
US8081764B2 (en) * 2005-07-15 2011-12-20 Panasonic Corporation Audio decoder
US7917358B2 (en) 2005-09-30 2011-03-29 Apple Inc. Transient detection by power weighted average
TWI329462B (en) * 2006-01-19 2010-08-21 Lg Electronics Inc Method and apparatus for processing a media signal
EP1999747B1 (en) * 2006-03-29 2016-10-12 Koninklijke Philips N.V. Audio decoding
DE602007013415D1 (de) * 2006-10-16 2011-05-05 Dolby Sweden Ab Erweiterte codierung und parameterrepräsentation einer mehrkanaligen heruntergemischten objektcodierung
EP3288027B1 (en) 2006-10-25 2021-04-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for generating complex-valued audio subband values
KR101100213B1 (ko) * 2007-03-16 2011-12-28 엘지전자 주식회사 오디오 신호 처리 방법 및 장치
EP3712888B1 (en) * 2007-03-30 2024-05-08 Electronics and Telecommunications Research Institute Apparatus and method for coding and decoding multi object audio signal with multi channel
EP2278582B1 (en) * 2007-06-08 2016-08-10 LG Electronics Inc. A method and an apparatus for processing an audio signal
EP2144229A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Efficient use of phase information in audio encoding and decoding
WO2010105695A1 (en) * 2009-03-20 2010-09-23 Nokia Corporation Multi channel audio coding
KR101387808B1 (ko) * 2009-04-15 2014-04-21 한국전자통신연구원 가변 비트율을 갖는 잔차 신호 부호화를 이용한 고품질 다객체 오디오 부호화 및 복호화 장치
EP2249334A1 (en) * 2009-05-08 2010-11-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio format transcoder
JP5678048B2 (ja) * 2009-06-24 2015-02-25 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ カスケード化されたオーディオオブジェクト処理ステージを用いたオーディオ信号デコーダ、オーディオ信号を復号化する方法、およびコンピュータプログラム
ES2793958T3 (es) * 2009-08-14 2020-11-17 Dts Llc Sistema para trasmitir adaptativamente objetos de audio
KR20110018107A (ko) * 2009-08-17 2011-02-23 삼성전자주식회사 레지듀얼 신호 인코딩 및 디코딩 방법 및 장치
PL2491551T3 (pl) * 2009-10-20 2015-06-30 Fraunhofer Ges Forschung Urządzenie do dostarczania reprezentacji sygnału upmixu w oparciu o reprezentację sygnału downmixu, urządzenie do dostarczania strumienia bitów reprezentującego wielokanałowy sygnał audio, sposoby, program komputerowy i strumień bitów wykorzystujący sygnalizację sterowania zniekształceniami
AU2010321013B2 (en) * 2009-11-20 2014-05-29 Dolby International Ab Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-channel audio signal using a linear combination parameter
US9332346B2 (en) * 2010-02-17 2016-05-03 Nokia Technologies Oy Processing of multi-device audio capture
CN102222505B (zh) * 2010-04-13 2012-12-19 中兴通讯股份有限公司 可分层音频编解码方法系统及瞬态信号可分层编解码方法
EP2717262A1 (en) 2012-10-05 2014-04-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding

Also Published As

Publication number Publication date
HK1213361A1 (zh) 2016-06-30
TWI539444B (zh) 2016-06-21
CA2887028C (en) 2018-08-28
AU2013326526A1 (en) 2015-05-28
WO2014053548A1 (en) 2014-04-10
US10152978B2 (en) 2018-12-11
ES2880883T3 (es) 2021-11-25
CN104798131B (zh) 2018-09-25
RU2015116287A (ru) 2016-11-27
JP6268180B2 (ja) 2018-01-24
JP6185592B2 (ja) 2017-08-23
CN105190747A (zh) 2015-12-23
EP2717265A1 (en) 2014-04-09
AU2013326526B2 (en) 2017-03-02
MY178697A (en) 2020-10-20
ES2873977T3 (es) 2021-11-04
AR092928A1 (es) 2015-05-06
AR092929A1 (es) 2015-05-06
EP2717262A1 (en) 2014-04-09
US9734833B2 (en) 2017-08-15
WO2014053547A1 (en) 2014-04-10
CN104798131A (zh) 2015-07-22
JP2015535960A (ja) 2015-12-17
RU2625939C2 (ru) 2017-07-19
MX351359B (es) 2017-10-11
EP2904610B1 (en) 2021-05-05
KR20150056875A (ko) 2015-05-27
KR101689489B1 (ko) 2016-12-23
BR112015007650A2 (pt) 2019-11-12
EP2904611B1 (en) 2021-06-23
BR112015007650B1 (pt) 2022-05-17
CA2887028A1 (en) 2014-04-10
RU2015116645A (ru) 2016-11-27
MX350691B (es) 2017-09-13
TW201419266A (zh) 2014-05-16
US20150279377A1 (en) 2015-10-01
BR112015007649B1 (pt) 2023-04-25
TW201423729A (zh) 2014-06-16
CA2886999C (en) 2018-10-23
TWI541795B (zh) 2016-07-11
KR101685860B1 (ko) 2016-12-12
KR20150065852A (ko) 2015-06-15
CA2886999A1 (en) 2014-04-10
CN105190747B (zh) 2019-01-04
US20150221314A1 (en) 2015-08-06
MX2015004018A (es) 2015-07-06
EP2904610A1 (en) 2015-08-12
RU2639658C2 (ru) 2017-12-21
EP2904611A1 (en) 2015-08-12
MX2015004019A (es) 2015-07-06
BR112015007649A2 (pt) 2022-07-19
JP2015535959A (ja) 2015-12-17

Similar Documents

Publication Publication Date Title
HK1213361A1 (zh) 編碼器、解碼器以及用於空間音頻對象編碼中的信號相關的縮放變換的方法
HK1207229A1 (zh) 視頻譯碼中的假想參考解碼器參數
PL2866439T3 (pl) Sposób dekodowania wideo i sposób kodowania wideo
SG11201406086UA (en) Decoder and decoding method, as well as encoder and encoding method
EP2824920A4 (en) VIDEO CODING METHOD, VIDEO CODING METHOD, VIDEO COORDING DEVICE, VIDEO CODING DEVICE AND VIDEO CODING / DECODING DEVICE
EP2838204A4 (en) DECODER PROCESSING METHOD AND DECODER
EP2843658A4 (en) AUDIO DECODING DEVICE, AUDIO ENCODING DEVICE, AUDIO DECODING METHOD, AUDIO ENCODING METHOD, AUDIO DECODING PROGRAM, AND AUDIO CODING PROGRAM
HK1213360A1 (zh) 編碼器、解碼器以及用於後向兼容的多分辨率空間音頻對象編碼的方法
PL3416387T3 (pl) Sposób kodowania wideo i koder wideo
EP2807824A4 (en) VIDEO CODING METHOD AND VIDEO CODING METHOD
EP2849180A4 (en) HYBRID AUDIO SIGNAL ENCODER, HYBRID AUDIO SIGNAL DECODER, AUDIO SIGNAL ENCODING METHOD, AND AUDIO SIGNAL DECODING METHOD
EP2605240A4 (en) AUDIO DECODING DEVICE, AUDIO DECODING METHOD, AUDIO DECODING PROGRAM, AUDIO ENCODING DEVICE, AUDIO ENCODING METHOD, AND AUDIO CODING PROGRAM
HK1211734A1 (zh) 對參數音頻對象編碼運用殘差概念的編碼器、解碼器、系統及方法
EP2645366A4 (en) AUDIO CODING DEVICE, METHOD AND PROGRAM, AND AUDIO CODING DEVICE, METHOD AND PROGRAM
GB2507127B (en) Encoder, decoder and method
HK1209229A1 (zh) 音頻編碼裝置、音頻編碼方法和音頻編碼程序以及音頻解碼裝置、音頻解碼方法和音頻解碼程序
EP2772914A4 (en) DECODER FOR HYBRID SOUND SIGNALS, COORDINATORS FOR HYBRID SOUND SIGNALS, DECODING PROCEDURE FOR SOUND SIGNALS AND CODING SIGNALING PROCESSES
PL2916318T3 (pl) Urządzenie do kodowania dźwięku mowy, urządzenie do dekodowania dźwięku mowy, sposób kodowania dźwięku mowy oraz sposób dekodowania dźwięku mowy
EP2879296A4 (en) ENCODING METHOD AND DECODING METHOD
EP2863629A4 (en) VIDEO ENCODING DEVICE, VIDEO DECODING DEVICE, VIDEO ENCODING METHOD, VIDEO DECODING METHOD, AND PROGRAM
EP2581904A4 (en) DECODER, ENCODER AND METHODS THEREOF