TWI571865B - 音訊編碼器裝置、音訊解碼器裝置、及其操作方法 - Google Patents

音訊編碼器裝置、音訊解碼器裝置、及其操作方法 Download PDF

Info

Publication number
TWI571865B
TWI571865B TW103136286A TW103136286A TWI571865B TW I571865 B TWI571865 B TW I571865B TW 103136286 A TW103136286 A TW 103136286A TW 103136286 A TW103136286 A TW 103136286A TW I571865 B TWI571865 B TW I571865B
Authority
TW
Taiwan
Prior art keywords
audio
dynamic range
range control
metadata
decoder
Prior art date
Application number
TW103136286A
Other languages
English (en)
Chinese (zh)
Other versions
TW201521012A (zh
Inventor
法比恩 庫奇
克里斯汀 伍雷
麥可 克拉屈瑪
柏哈德 紐吉包爾
麥可 梅爾
阿恩 波桑
Original Assignee
弗勞恩霍夫爾協會
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 弗勞恩霍夫爾協會 filed Critical 弗勞恩霍夫爾協會
Publication of TW201521012A publication Critical patent/TW201521012A/zh
Application granted granted Critical
Publication of TWI571865B publication Critical patent/TWI571865B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0324Details of processing therefor
    • G10L21/034Automatic adjustment
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/165Management of the audio stream, e.g. setting of volume, audio stream path
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G11/00Limiting amplitude; Limiting rate of change of amplitude
    • H03G11/008Limiting amplitude; Limiting rate of change of amplitude of digital or coded signals
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G9/00Combinations of two or more types of control, e.g. gain control and tone control
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G9/00Combinations of two or more types of control, e.g. gain control and tone control
    • H03G9/005Combinations of two or more types of control, e.g. gain control and tone control of digital or coded signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
TW103136286A 2013-10-22 2014-10-21 音訊編碼器裝置、音訊解碼器裝置、及其操作方法 TWI571865B (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP13189754 2013-10-22

Publications (2)

Publication Number Publication Date
TW201521012A TW201521012A (zh) 2015-06-01
TWI571865B true TWI571865B (zh) 2017-02-21

Family

ID=49447470

Family Applications (1)

Application Number Title Priority Date Filing Date
TW103136286A TWI571865B (zh) 2013-10-22 2014-10-21 音訊編碼器裝置、音訊解碼器裝置、及其操作方法

Country Status (20)

Country Link
US (6) US11170795B2 (https=)
EP (6) EP3951778B1 (https=)
JP (2) JP6588899B2 (https=)
KR (1) KR101882898B1 (https=)
CN (2) CN111580772B (https=)
AR (2) AR098153A1 (https=)
AU (1) AU2014339086B2 (https=)
BR (1) BR112016008933B1 (https=)
CA (6) CA3262089A1 (https=)
ES (3) ES2900065T3 (https=)
MX (2) MX358483B (https=)
MY (1) MY181977A (https=)
PL (3) PL3061090T3 (https=)
PT (2) PT3061090T (https=)
RU (1) RU2659490C2 (https=)
SG (1) SG11201603116XA (https=)
TR (1) TR201908748T4 (https=)
TW (1) TWI571865B (https=)
WO (1) WO2015059087A1 (https=)
ZA (1) ZA201603299B (https=)

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101261212B1 (ko) 2004-10-26 2013-05-07 돌비 레버러토리즈 라이쎈싱 코오포레이션 오디오 신호 처리 방법 및 장치
TWI447709B (zh) 2010-02-11 2014-08-01 杜比實驗室特許公司 用以非破壞地正常化可攜式裝置中音訊訊號響度之系統及方法
TWI581250B (zh) 2010-12-03 2017-05-01 杜比實驗室特許公司 利用多媒體處理節點之適應性處理技術
CN103325380B (zh) 2012-03-23 2017-09-12 杜比实验室特许公司 用于信号增强的增益后处理
US10844689B1 (en) 2019-12-19 2020-11-24 Saudi Arabian Oil Company Downhole ultrasonic actuator system for mitigating lost circulation
CN119252266A (zh) 2012-05-18 2025-01-03 杜比实验室特许公司 用于维持与参数音频编码器相关联的可逆动态范围控制信息的系统
WO2014113471A1 (en) 2013-01-21 2014-07-24 Dolby Laboratories Licensing Corporation System and method for optimizing loudness and dynamic range across different playback devices
RU2719690C2 (ru) 2013-01-21 2020-04-21 Долби Лабораторис Лайсэнзин Корпорейшн Аудиокодер и аудиодекодер с метаданными громкости и границы программы
US9715880B2 (en) 2013-02-21 2017-07-25 Dolby International Ab Methods for parametric multi-channel encoding
CN104080024B (zh) 2013-03-26 2019-02-19 杜比实验室特许公司 音量校平器控制器和控制方法以及音频分类器
US9607624B2 (en) * 2013-03-29 2017-03-28 Apple Inc. Metadata driven dynamic range control
WO2014165304A1 (en) 2013-04-05 2014-10-09 Dolby Laboratories Licensing Corporation Acquisition, recovery, and matching of unique information from file-based media for automated file detection
TWM487509U (zh) 2013-06-19 2014-10-01 杜比實驗室特許公司 音訊處理設備及電子裝置
EP3044786B1 (en) 2013-09-12 2024-04-24 Dolby Laboratories Licensing Corporation Loudness adjustment for downmixed audio content
EP3544181B1 (en) 2013-09-12 2025-12-03 Dolby Laboratories Licensing Corporation Dynamic range control for a wide variety of playback environments
CN110808723B (zh) 2014-05-26 2024-09-17 杜比实验室特许公司 音频信号响度控制
EP3518236B8 (en) 2014-10-10 2022-05-25 Dolby Laboratories Licensing Corporation Transmission-agnostic presentation-based program loudness
CA3281204A1 (en) * 2015-06-17 2025-10-31 Sony Corporation Transmitting device, transmitting method, receiving device, and receiving method
US9837086B2 (en) 2015-07-31 2017-12-05 Apple Inc. Encoded audio extended metadata-based dynamic range control
US9934790B2 (en) 2015-07-31 2018-04-03 Apple Inc. Encoded audio metadata-based equalization
US10341770B2 (en) * 2015-09-30 2019-07-02 Apple Inc. Encoded audio metadata-based loudness equalization and dynamic equalization during DRC
FR3044814A1 (fr) * 2016-04-21 2017-06-09 Continental Automotive France Systeme et procede de controle du volume sonore dans un systeme multimedia
JP6902049B2 (ja) * 2016-07-04 2021-07-14 ハーマン ベッカー オートモーティブ システムズ ゲーエムベーハー 発話信号を含むオーディオ信号のラウドネスレベル自動修正
CN106504766B (zh) * 2016-11-28 2019-11-26 湖南国科微电子股份有限公司 一种数字音频信号的动态范围压缩方法
ES2985934T3 (es) 2018-11-13 2024-11-07 Dolby Laboratories Licensing Corp Representar audio espacial por medio de una señal de audio y metadatos asociados
ES2974219T3 (es) 2018-11-13 2024-06-26 Dolby Laboratories Licensing Corp Procesamiento de audio en servicios de audio inversivos
CN109889170B (zh) * 2019-02-25 2021-06-04 珠海格力电器股份有限公司 音频信号的控制方法和装置
JP7266916B2 (ja) 2019-03-14 2023-05-01 ガウディオ・ラボ・インコーポレイテッド ラウドネスレベルを制御するオーディオ信号処理方法及び装置
US11545166B2 (en) * 2019-07-02 2023-01-03 Dolby International Ab Using metadata to aggregate signal processing operations
WO2021021750A1 (en) 2019-07-30 2021-02-04 Dolby Laboratories Licensing Corporation Dynamics processing across devices with differing playback capabilities
US12177646B2 (en) 2020-05-26 2024-12-24 Dolby International Ab Main-associated audio experience with efficient ducking gain application
KR102773326B1 (ko) * 2020-11-24 2025-02-27 가우디오랩 주식회사 오디오 신호의 정규화를 수행하는 방법 및 이를 위한 장치
CN116615781A (zh) * 2020-12-17 2023-08-18 杜比国际公司 用于使用预先配置的生成器处理音频数据的方法和装置
US11837254B2 (en) 2021-08-03 2023-12-05 Zoom Video Communications, Inc. Frontend capture with input stage, suppression module, and output stage
EP4381501A1 (en) * 2021-08-03 2024-06-12 Zoom Video Communications, Inc. Frontend capture
KR20240056387A (ko) 2022-10-21 2024-04-30 한국전자통신연구원 객체 기반 오디오의 클리핑 방지 렌더링 방법 및 이를 수행하는 장치
EP4697327A4 (en) * 2023-04-11 2026-03-25 Beijing Xiaomi Mobile Software Co Ltd METHOD AND APPARATUS FOR PROCESSING AUDIO CODE STREAM SIGNAL, ELECTRONIC DEVICE AND STORAGE MEDIA

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200921642A (en) * 2007-02-14 2009-05-16 Lg Electronics Inc Methods and apparatuses for encoding and decoding object-based audio signals
TW201010450A (en) * 2008-07-17 2010-03-01 Fraunhofer Ges Forschung Apparatus and method for generating audio output signals using object based metadata
US20110208528A1 (en) * 2008-10-29 2011-08-25 Dolby International Ab Signal clipping protection using pre-existing audio gain metadata

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1910929A (zh) * 2004-01-16 2007-02-07 皇家飞利浦电子股份有限公司 比特流处理的方法
US7392195B2 (en) 2004-03-25 2008-06-24 Dts, Inc. Lossless multi-channel audio codec
TW200638335A (en) * 2005-04-13 2006-11-01 Dolby Lab Licensing Corp Audio metadata verification
CN102237094B (zh) * 2005-10-12 2013-02-20 三星电子株式会社 处理/发送比特流以及接收/处理比特流的方法和设备
CN101098201A (zh) * 2006-06-29 2008-01-02 乐金电子(昆山)电脑有限公司 广播接收用移动装置的音频输出装置及其控制方法
US8195454B2 (en) * 2007-02-26 2012-06-05 Dolby Laboratories Licensing Corporation Speech enhancement in entertainment audio
CN101221766B (zh) * 2008-01-23 2011-01-05 清华大学 音频编码器切换的方法
ES2988414T3 (es) * 2008-07-11 2024-11-20 Fraunhofer Ges Zur Foerderungder Angewandten Forschung E V Decodificador de audio
US8798776B2 (en) * 2008-09-30 2014-08-05 Dolby International Ab Transcoding of audio metadata
CN101605111B (zh) * 2009-06-25 2012-07-04 华为技术有限公司 一种削波控制的方法和装置
US8600076B2 (en) * 2009-11-09 2013-12-03 Neofidelity, Inc. Multiband DRC system and method for controlling the same
TWI447709B (zh) * 2010-02-11 2014-08-01 杜比實驗室特許公司 用以非破壞地正常化可攜式裝置中音訊訊號響度之系統及方法
CN101944362B (zh) * 2010-09-14 2012-05-30 北京大学 一种基于整形小波变换的音频无损压缩编码、解码方法
JP5821431B2 (ja) * 2011-09-02 2015-11-24 株式会社Jvcケンウッド 音声信号加工装置、音声信号加工方法及びプログラム
US9064497B2 (en) * 2012-02-22 2015-06-23 Htc Corporation Method and apparatus for audio intelligibility enhancement and computing apparatus
CN102768834B (zh) * 2012-03-21 2018-06-26 新奥特(北京)视频技术有限公司 一种实现音频帧解码的方法
CN119252266A (zh) * 2012-05-18 2025-01-03 杜比实验室特许公司 用于维持与参数音频编码器相关联的可逆动态范围控制信息的系统
US9805725B2 (en) * 2012-12-21 2017-10-31 Dolby Laboratories Licensing Corporation Object clustering for rendering object-based audio content based on perceptual criteria
US9715880B2 (en) * 2013-02-21 2017-07-25 Dolby International Ab Methods for parametric multi-channel encoding
US9173021B2 (en) * 2013-03-12 2015-10-27 Google Technology Holdings LLC Method and device for adjusting an audio beam orientation based on device location
US9559651B2 (en) * 2013-03-29 2017-01-31 Apple Inc. Metadata for loudness and dynamic range control
CN103280221B (zh) * 2013-05-09 2015-07-29 北京大学 一种基于基追踪的音频无损压缩编码、解码方法及系统
FR3006622B1 (fr) 2013-06-07 2015-07-17 Essilor Int Procede de fabrication d'une lentille ophtalmique
EP3044786B1 (en) 2013-09-12 2024-04-24 Dolby Laboratories Licensing Corporation Loudness adjustment for downmixed audio content

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200921642A (en) * 2007-02-14 2009-05-16 Lg Electronics Inc Methods and apparatuses for encoding and decoding object-based audio signals
TW201010450A (en) * 2008-07-17 2010-03-01 Fraunhofer Ges Forschung Apparatus and method for generating audio output signals using object based metadata
US20110208528A1 (en) * 2008-10-29 2011-08-25 Dolby International Ab Signal clipping protection using pre-existing audio gain metadata

Also Published As

Publication number Publication date
EP4618078A2 (en) 2025-09-17
ZA201603299B (en) 2017-11-29
SG11201603116XA (en) 2016-05-30
MX358483B (es) 2018-08-22
JP2018151639A (ja) 2018-09-27
JP2016538587A (ja) 2016-12-08
EP3522157B1 (en) 2021-09-22
US20210166707A1 (en) 2021-06-03
JP6588899B2 (ja) 2019-10-09
EP4625411A3 (en) 2025-12-10
CN111580772A (zh) 2020-08-25
MX2016004921A (es) 2016-07-11
CN105814630B (zh) 2020-04-28
EP4625411A2 (en) 2025-10-01
JP6768735B2 (ja) 2020-10-14
TW201521012A (zh) 2015-06-01
RU2659490C2 (ru) 2018-07-02
KR20160072255A (ko) 2016-06-22
ES3044157T3 (en) 2025-11-26
CN111580772B (zh) 2023-09-26
ES2900065T3 (es) 2022-03-15
EP3061090B1 (en) 2019-04-17
US12051432B2 (en) 2024-07-30
US20240363129A1 (en) 2024-10-31
MX383498B (es) 2025-03-14
RU2016119525A (ru) 2017-11-28
US11551703B2 (en) 2023-01-10
US20230134916A1 (en) 2023-05-04
CA3262095A1 (en) 2025-04-02
EP3951778A1 (en) 2022-02-09
CA3262089A1 (en) 2025-03-25
BR112016008933A2 (https=) 2017-08-01
PL3951778T3 (pl) 2026-01-19
AR098153A1 (es) 2016-05-04
US20160240204A1 (en) 2016-08-18
AR115941A2 (es) 2021-03-17
ES2732304T3 (es) 2019-11-21
CN105814630A (zh) 2016-07-27
EP3061090A1 (en) 2016-08-31
EP4629236A2 (en) 2025-10-08
TR201908748T4 (tr) 2019-07-22
PT3522157T (pt) 2021-12-03
CA3262077A1 (en) 2025-02-28
EP3951778C0 (en) 2025-08-27
CA3262112A1 (en) 2025-02-28
CA3262102A1 (en) 2025-03-25
AU2014339086A1 (en) 2016-06-02
PT3061090T (pt) 2019-07-11
MY181977A (en) 2021-01-18
WO2015059087A1 (en) 2015-04-30
EP3522157A1 (en) 2019-08-07
EP3951778B1 (en) 2025-08-27
EP4618078A3 (en) 2025-12-10
CA2927664A1 (en) 2015-04-30
US20240363128A1 (en) 2024-10-31
AU2014339086B2 (en) 2017-12-21
BR112016008933B1 (pt) 2023-01-31
PL3522157T3 (pl) 2022-02-07
US11170795B2 (en) 2021-11-09
KR101882898B1 (ko) 2018-07-27
US20240363130A1 (en) 2024-10-31
EP4629236A3 (en) 2025-12-17
PL3061090T3 (pl) 2019-09-30

Similar Documents

Publication Publication Date Title
TWI571865B (zh) 音訊編碼器裝置、音訊解碼器裝置、及其操作方法
HK40068515B (en) Concept for combined dynamic range compression and guided clipping prevention for audio devices
HK40130781A (en) Concept for combined dynamic range compression and guided clipping prevention for audio devices
HK40130475A (en) Concept for combined dynamic range compression and guided clipping prevention for audio devices
HK40068515A (en) Concept for combined dynamic range compression and guided clipping prevention for audio devices
HK40011395A (en) Concept for combined dynamic range compression and guided clipping prevention for audio devices
HK40011395B (en) Concept for combined dynamic range compression and guided clipping prevention for audio devices
HK1227539B (en) Concept for combined dynamic range compression and guided clipping prevention for audio devices
HK1227539A1 (en) Concept for combined dynamic range compression and guided clipping prevention for audio devices
BR122022018605B1 (pt) Conceito para compactação de faixa dinâmica combinada e prevenção de recorte guiado para dispositivos de áudio