TWI569259B - 用於基於物件之音訊編碼系統中的通知響度估計之解碼器、編碼器及方法 - Google Patents

用於基於物件之音訊編碼系統中的通知響度估計之解碼器、編碼器及方法 Download PDF

Info

Publication number
TWI569259B
TWI569259B TW103141223A TW103141223A TWI569259B TW I569259 B TWI569259 B TW I569259B TW 103141223 A TW103141223 A TW 103141223A TW 103141223 A TW103141223 A TW 103141223A TW I569259 B TWI569259 B TW I569259B
Authority
TW
Taiwan
Prior art keywords
loudness
audio
signal
object signals
information
Prior art date
Application number
TW103141223A
Other languages
English (en)
Chinese (zh)
Other versions
TW201525990A (zh
Inventor
喬尼 帕露斯
薩斯洽 迪斯曲
哈拉德 福契斯
柏哈德 吉瑞爾
奧利薇 賀穆斯
愛德瑞恩 摩塔札
法科 萊德布奇
黎恩 泰倫堤夫
Original Assignee
弗勞恩霍夫爾協會
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 弗勞恩霍夫爾協會 filed Critical 弗勞恩霍夫爾協會
Publication of TW201525990A publication Critical patent/TW201525990A/zh
Application granted granted Critical
Publication of TWI569259B publication Critical patent/TWI569259B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/0017Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • G10L19/265Pre-filtering, e.g. high frequency emphasis prior to encoding
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G3/00Gain control in amplifiers or frequency changers
    • H03G3/20Automatic control

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
TW103141223A 2013-11-27 2014-11-27 用於基於物件之音訊編碼系統中的通知響度估計之解碼器、編碼器及方法 TWI569259B (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP13194664.2A EP2879131A1 (en) 2013-11-27 2013-11-27 Decoder, encoder and method for informed loudness estimation in object-based audio coding systems

Publications (2)

Publication Number Publication Date
TW201525990A TW201525990A (zh) 2015-07-01
TWI569259B true TWI569259B (zh) 2017-02-01

Family

ID=49683543

Family Applications (2)

Application Number Title Priority Date Filing Date
TW103141223A TWI569259B (zh) 2013-11-27 2014-11-27 用於基於物件之音訊編碼系統中的通知響度估計之解碼器、編碼器及方法
TW103141222A TWI569260B (zh) 2013-11-27 2014-11-27 用於在基於物件之音訊編碼系統中利用旁通音訊物件信號的通知響度估計之解碼器、編碼器及方法

Family Applications After (1)

Application Number Title Priority Date Filing Date
TW103141222A TWI569260B (zh) 2013-11-27 2014-11-27 用於在基於物件之音訊編碼系統中利用旁通音訊物件信號的通知響度估計之解碼器、編碼器及方法

Country Status (18)

Country Link
US (8) US9947325B2 (https=)
EP (3) EP2879131A1 (https=)
JP (2) JP6218928B2 (https=)
KR (2) KR101852950B1 (https=)
CN (4) CN111312266B (https=)
AR (2) AR098558A1 (https=)
AU (2) AU2014356475B2 (https=)
BR (2) BR112015019958B1 (https=)
CA (2) CA2900473C (https=)
ES (2) ES2629527T3 (https=)
MX (2) MX350247B (https=)
MY (2) MY189823A (https=)
PL (2) PL2941771T3 (https=)
PT (2) PT2941771T (https=)
RU (2) RU2651211C2 (https=)
TW (2) TWI569259B (https=)
WO (2) WO2015078956A1 (https=)
ZA (1) ZA201604205B (https=)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2879131A1 (en) 2013-11-27 2015-06-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder, encoder and method for informed loudness estimation in object-based audio coding systems
KR102482162B1 (ko) * 2014-10-01 2022-12-29 돌비 인터네셔널 에이비 오디오 인코더 및 디코더
JP6564068B2 (ja) * 2015-02-02 2019-08-21 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ 符号化されたオーディオ信号を処理するための装置および方法
FI3311379T3 (fi) * 2015-06-17 2023-02-28 Äänenvoimakkuuden ohjaus käyttäjän interaktiivisuuta varten audio-koodausjärjestelmissä
CA3281204A1 (en) * 2015-06-17 2025-10-31 Sony Corporation Transmitting device, transmitting method, receiving device, and receiving method
US9590580B1 (en) 2015-09-13 2017-03-07 Guoguang Electric Company Limited Loudness-based audio-signal compensation
EP3409029B1 (en) 2016-01-29 2024-10-30 Dolby Laboratories Licensing Corporation Binaural dialogue enhancement
CN105741835B (zh) * 2016-03-18 2019-04-16 腾讯科技(深圳)有限公司 一种音频信息处理方法及终端
CN114466279B (zh) * 2016-11-25 2025-10-14 索尼公司 再现方法、装置及介质、信息处理方法及装置
US11200882B2 (en) * 2017-07-03 2021-12-14 Nec Corporation Signal processing device, signal processing method, and storage medium for storing program
JP7123134B2 (ja) * 2017-10-27 2022-08-22 フラウンホファー ゲセルシャフト ツール フェールデルンク ダー アンゲヴァンテン フォルシュンク エー.ファオ. デコーダにおけるノイズ減衰
CN111713016B (zh) 2018-02-15 2023-11-28 杜比实验室特许公司 响度控制方法和装置
EP3550561A1 (en) * 2018-04-06 2019-10-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Downmixer, audio encoder, method and computer program applying a phase value to a magnitude value
EP3588988B1 (en) 2018-06-26 2021-02-17 Nokia Technologies Oy Selective presentation of ambient audio content for spatial audio presentation
US11544032B2 (en) * 2019-01-24 2023-01-03 Dolby Laboratories Licensing Corporation Audio connection and transmission device
CN113366865B (zh) * 2019-02-13 2023-03-21 杜比实验室特许公司 用于音频对象聚类的自适应响度规范化
CA3193359A1 (en) * 2019-06-14 2020-12-17 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Parameter encoding and decoding
US12165657B2 (en) * 2019-08-30 2024-12-10 Dolby Laboratories Licensing Corporation Channel identification of multi-channel audio signals
KR102390643B1 (ko) * 2019-10-10 2022-04-27 가우디오랩 주식회사 오디오 라우드니스 메타데이터 생성 방법 및 이를 위한 장치
CN115668765A (zh) * 2020-04-13 2023-01-31 杜比实验室特许公司 音频描述的自动混合
US12531077B2 (en) 2021-02-22 2026-01-20 Tencent America LLC Method and apparatus in audio processing
WO2022188999A1 (en) * 2021-03-12 2022-09-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for clean dialogue loudness estimates based on deep neural networks
CN113596502B (zh) * 2021-08-03 2025-08-22 广州繁星互娱信息科技有限公司 一种直播间音效调节方法、装置、电子设备及介质
CN117837173B (zh) * 2021-08-27 2025-06-13 北京字跳网络技术有限公司 用于音频渲染的信号处理方法、装置和电子设备
CN115966216A (zh) * 2022-12-21 2023-04-14 上海哔哩哔哩科技有限公司 音频流处理方法及装置

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008035275A2 (en) * 2006-09-18 2008-03-27 Koninklijke Philips Electronics N.V. Encoding and decoding of audio objects
EP2442303A2 (en) * 2009-06-10 2012-04-18 Electronics and Telecommunications Research Institute Encoding method and encoding device, decoding method and decoding device and transcoding method and transcoder for multi-object audio signals
WO2012146757A1 (en) * 2011-04-28 2012-11-01 Dolby International Ab Efficient content classification and loudness estimation

Family Cites Families (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ZA921988B (en) * 1991-03-29 1993-02-24 Sony Corp High efficiency digital data encoding and decoding apparatus
US5699479A (en) * 1995-02-06 1997-12-16 Lucent Technologies Inc. Tonality for perceptual audio compression based on loudness uncertainty
CN1691841B (zh) * 1997-09-05 2010-09-29 雷克西康公司 5-2-5矩阵编码器和解码器系统
US7415120B1 (en) * 1998-04-14 2008-08-19 Akiba Electronics Institute Llc User adjustable volume control that accommodates hearing
US6462264B1 (en) * 1999-07-26 2002-10-08 Carl Elam Method and apparatus for audio broadcast of enhanced musical instrument digital interface (MIDI) data formats for control of a sound generator to create music, lyrics, and speech
EP1254513A4 (en) * 1999-11-29 2009-11-04 Syfx SYSTEMS AND METHODS FOR SIGNAL PROCESSING
EP1360798B1 (en) * 2001-02-06 2014-10-01 Polycom Israel Ltd. Control unit for multipoint multimedia/audio conference
US6852151B2 (en) * 2002-06-03 2005-02-08 Siemens Vdo Automotive Inc. Air cleaner and resonator assembly
US7631483B2 (en) * 2003-09-22 2009-12-15 General Electric Company Method and system for reduction of jet engine noise
US7617109B2 (en) * 2004-07-01 2009-11-10 Dolby Laboratories Licensing Corporation Method for correcting metadata affecting the playback loudness and dynamic range of audio information
WO2007120453A1 (en) * 2006-04-04 2007-10-25 Dolby Laboratories Licensing Corporation Calculating and adjusting the perceived loudness and/or the perceived spectral balance of an audio signal
EP1817767B1 (en) * 2004-11-30 2015-11-11 Agere Systems Inc. Parametric coding of spatial audio with object-based side information
EP1691348A1 (en) * 2005-02-14 2006-08-16 Ecole Polytechnique Federale De Lausanne Parametric joint-coding of audio sources
JP4728031B2 (ja) * 2005-04-15 2011-07-20 株式会社日立製作所 リモートコピーペアの移行を行うシステム
US8239209B2 (en) * 2006-01-19 2012-08-07 Lg Electronics Inc. Method and apparatus for decoding an audio signal using a rendering parameter
RU2407072C1 (ru) * 2006-09-29 2010-12-20 ЭлДжи ЭЛЕКТРОНИКС ИНК. Способы и устройства кодирования и декодирования объектно-ориентированных аудиосигналов
AU2007312598B2 (en) * 2006-10-16 2011-01-20 Dolby International Ab Enhanced coding and parameter representation of multichannel downmixed object coding
EP2102856A4 (en) 2006-12-07 2010-01-13 Lg Electronics Inc METHOD AND DEVICE FOR PROCESSING AN AUDIO SIGNAL
CA2645915C (en) 2007-02-14 2012-10-23 Lg Electronics Inc. Methods and apparatuses for encoding and decoding object-based audio signals
RU2406166C2 (ru) * 2007-02-14 2010-12-10 ЭлДжи ЭЛЕКТРОНИКС ИНК. Способы и устройства кодирования и декодирования основывающихся на объектах ориентированных аудиосигналов
WO2008120933A1 (en) * 2007-03-30 2008-10-09 Electronics And Telecommunications Research Institute Apparatus and method for coding and decoding multi object audio signal with multi channel
US7825322B1 (en) 2007-08-17 2010-11-02 Adobe Systems Incorporated Method and apparatus for audio mixing
US8543231B2 (en) * 2007-12-09 2013-09-24 Lg Electronics Inc. Method and an apparatus for processing a signal
KR101596504B1 (ko) * 2008-04-23 2016-02-23 한국전자통신연구원 객체기반 오디오 컨텐츠의 생성/재생 방법 및 객체기반 오디오 서비스를 위한 파일 포맷 구조를 가진 데이터를 기록한 컴퓨터 판독 가능 기록 매체
US8315396B2 (en) 2008-07-17 2012-11-20 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio output signals using object based metadata
MX2011011399A (es) * 2008-10-17 2012-06-27 Univ Friedrich Alexander Er Aparato para suministrar uno o más parámetros ajustados para un suministro de una representación de señal de mezcla ascendente sobre la base de una representación de señal de mezcla descendete, decodificador de señal de audio, transcodificador de señal de audio, codificador de señal de audio, flujo de bits de audio, método y programa de computación que utiliza información paramétrica relacionada con el objeto.
WO2010075377A1 (en) * 2008-12-24 2010-07-01 Dolby Laboratories Licensing Corporation Audio signal loudness determination and modification in the frequency domain
JP5340296B2 (ja) * 2009-03-26 2013-11-13 パナソニック株式会社 復号化装置、符号化復号化装置および復号化方法
US20100324915A1 (en) 2009-06-23 2010-12-23 Electronic And Telecommunications Research Institute Encoding and decoding apparatuses for high quality multi-channel audio codec
EP2489038B1 (en) * 2009-11-20 2016-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-channel audio signal using a linear combination parameter
WO2011069205A1 (en) * 2009-12-10 2011-06-16 Reality Ip Pty Ltd Improved matrix decoder for surround sound
CA2992917C (en) 2010-04-09 2020-05-26 Dolby International Ab Mdct-based complex prediction stereo coding
KR101615776B1 (ko) * 2010-05-28 2016-04-28 한국전자통신연구원 상이한 분석 단계를 사용하는 다객체 오디오 신호의 부호화 및 복호화 장치 및 방법
CN103649706B (zh) * 2011-03-16 2015-11-25 Dts(英属维尔京群岛)有限公司 三维音频音轨的编码及再现
US8965774B2 (en) * 2011-08-23 2015-02-24 Apple Inc. Automatic detection of audio compression parameters
US9952576B2 (en) 2012-10-16 2018-04-24 Sonos, Inc. Methods and apparatus to learn and share remote commands
KR102037418B1 (ko) * 2012-12-04 2019-10-28 삼성전자주식회사 오디오 제공 장치 및 오디오 제공 방법
US9805725B2 (en) * 2012-12-21 2017-10-31 Dolby Laboratories Licensing Corporation Object clustering for rendering object-based audio content based on perceptual criteria
RU2719690C2 (ru) * 2013-01-21 2020-04-21 Долби Лабораторис Лайсэнзин Корпорейшн Аудиокодер и аудиодекодер с метаданными громкости и границы программы
CN112652316B (zh) * 2013-01-21 2023-09-15 杜比实验室特许公司 利用响度处理状态元数据的音频编码器和解码器
US9559651B2 (en) * 2013-03-29 2017-01-31 Apple Inc. Metadata for loudness and dynamic range control
US9852735B2 (en) * 2013-05-24 2017-12-26 Dolby International Ab Efficient coding of audio scenes comprising audio objects
EP3044786B1 (en) * 2013-09-12 2024-04-24 Dolby Laboratories Licensing Corporation Loudness adjustment for downmixed audio content
EP2879131A1 (en) * 2013-11-27 2015-06-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder, encoder and method for informed loudness estimation in object-based audio coding systems
WO2015150384A1 (en) * 2014-04-01 2015-10-08 Dolby International Ab Efficient coding of audio scenes comprising audio objects
WO2020084342A1 (en) * 2018-10-26 2020-04-30 Cochlear Limited Systems and methods for customizing auditory devices
US11405544B2 (en) * 2020-10-09 2022-08-02 Sony Group Corporation Programmable rig control for three-dimensional (3D) reconstruction

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008035275A2 (en) * 2006-09-18 2008-03-27 Koninklijke Philips Electronics N.V. Encoding and decoding of audio objects
EP2442303A2 (en) * 2009-06-10 2012-04-18 Electronics and Telecommunications Research Institute Encoding method and encoding device, decoding method and decoding device and transcoding method and transcoder for multi-object audio signals
WO2012146757A1 (en) * 2011-04-28 2012-11-01 Dolby International Ab Efficient content classification and loudness estimation

Also Published As

Publication number Publication date
US20230306973A1 (en) 2023-09-28
CN111312266A (zh) 2020-06-19
EP2941771B1 (en) 2017-03-29
EP3074971A1 (en) 2016-10-05
US10497376B2 (en) 2019-12-03
EP2941771A1 (en) 2015-11-11
EP2879131A1 (en) 2015-06-03
PL2941771T3 (pl) 2017-10-31
MY196533A (en) 2023-04-19
ES2666127T3 (es) 2018-05-03
MX358306B (es) 2018-08-14
MY189823A (en) 2022-03-10
JP6218928B2 (ja) 2017-10-25
CA2900473C (en) 2018-01-30
MX350247B (es) 2017-08-31
AU2014356467A1 (en) 2016-06-09
AU2014356475B2 (en) 2016-08-18
US11688407B2 (en) 2023-06-27
ZA201604205B (en) 2017-11-29
US10891963B2 (en) 2021-01-12
US20200058313A1 (en) 2020-02-20
KR101852950B1 (ko) 2018-06-07
EP3074971B1 (en) 2018-02-21
PT3074971T (pt) 2018-05-25
BR112015019958A2 (pt) 2017-07-18
ES2629527T3 (es) 2017-08-10
AR099360A1 (es) 2016-07-20
WO2015078964A1 (en) 2015-06-04
TWI569260B (zh) 2017-02-01
MX2016006880A (es) 2016-08-19
US20220351736A1 (en) 2022-11-03
KR101742137B1 (ko) 2017-05-31
RU2015135181A (ru) 2017-02-27
US11423914B2 (en) 2022-08-23
CA2931558C (en) 2018-11-13
CN112151049A (zh) 2020-12-29
CN111312266B (zh) 2023-11-10
RU2016125242A (ru) 2018-01-09
WO2015078956A1 (en) 2015-06-04
MX2015013580A (es) 2016-02-05
TW201525990A (zh) 2015-07-01
RU2672174C2 (ru) 2018-11-12
AU2014356475A1 (en) 2015-09-03
JP6346282B2 (ja) 2018-06-20
CN112151049B (zh) 2024-05-10
CA2931558A1 (en) 2015-06-04
US10699722B2 (en) 2020-06-30
US20210118454A1 (en) 2021-04-22
US9947325B2 (en) 2018-04-17
AU2014356467B2 (en) 2016-12-15
TW201535353A (zh) 2015-09-16
US11875804B2 (en) 2024-01-16
PL3074971T3 (pl) 2018-07-31
PT2941771T (pt) 2017-06-30
US12573410B2 (en) 2026-03-10
US20180197554A1 (en) 2018-07-12
AR098558A1 (es) 2016-06-01
CA2900473A1 (en) 2015-06-04
KR20160075756A (ko) 2016-06-29
CN105874532B (zh) 2020-03-17
JP2016520865A (ja) 2016-07-14
RU2651211C2 (ru) 2018-04-18
BR112015019958B1 (pt) 2021-12-14
BR112016011988B1 (pt) 2022-09-13
JP2017502324A (ja) 2017-01-19
CN105144287A (zh) 2015-12-09
BR112016011988A2 (https=) 2017-08-08
CN105144287B (zh) 2020-09-25
US20150348564A1 (en) 2015-12-03
CN105874532A (zh) 2016-08-17
KR20150123799A (ko) 2015-11-04
US20200286496A1 (en) 2020-09-10
US20160254001A1 (en) 2016-09-01
HK1217245A1 (en) 2016-12-30

Similar Documents

Publication Publication Date Title
TWI569259B (zh) 用於基於物件之音訊編碼系統中的通知響度估計之解碼器、編碼器及方法
HK1217245B (en) Decoder, encoder and method for informed loudness estimation employing by-pass audio object signals in object-based audio coding systems
HK1229055B (en) Decoder, encoder and method for informed loudness estimation in object-based audio coding systems
HK1229055A1 (en) Decoder, encoder and method for informed loudness estimation in object-based audio coding systems