AU2022332970A1 - Method and apparatus for metadata-based dynamic processing of audio data - Google Patents

Method and apparatus for metadata-based dynamic processing of audio data Download PDF

Info

Publication number
AU2022332970A1
AU2022332970A1 AU2022332970A AU2022332970A AU2022332970A1 AU 2022332970 A1 AU2022332970 A1 AU 2022332970A1 AU 2022332970 A AU2022332970 A AU 2022332970A AU 2022332970 A AU2022332970 A AU 2022332970A AU 2022332970 A1 AU2022332970 A1 AU 2022332970A1
Authority
AU
Australia
Prior art keywords
metadata
audio data
loudness
dynamic
decoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
AU2022332970A
Other languages
English (en)
Inventor
Christof FERSCH
Scott Gregory NORCROSS
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby International AB
Dolby Laboratories Licensing Corp
Original Assignee
Dolby International AB
Dolby Laboratories Licensing Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby International AB, Dolby Laboratories Licensing Corp filed Critical Dolby International AB
Publication of AU2022332970A1 publication Critical patent/AU2022332970A1/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G11/00Limiting amplitude; Limiting rate of change of amplitude
    • H03G11/008Limiting amplitude; Limiting rate of change of amplitude of digital or coded signals
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G3/00Gain control in amplifiers or frequency changers
    • H03G3/002Control of digital or coded signals
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G7/00Volume compression or expansion in amplifiers
    • H03G7/007Volume compression or expansion in amplifiers of digital or coded signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
  • Stereophonic System (AREA)
  • Control Of Amplification And Gain Control (AREA)
AU2022332970A 2021-08-26 2022-08-24 Method and apparatus for metadata-based dynamic processing of audio data Pending AU2022332970A1 (en)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US202163237231P 2021-08-26 2021-08-26
EP21193209.0 2021-08-26
EP21193209 2021-08-26
US63/237,231 2021-08-26
US202163251307P 2021-10-01 2021-10-01
US63/251,307 2021-10-01
PCT/US2022/041388 WO2023028154A1 (en) 2021-08-26 2022-08-24 Method and apparatus for metadata-based dynamic processing of audio data

Publications (1)

Publication Number Publication Date
AU2022332970A1 true AU2022332970A1 (en) 2024-02-29

Family

ID=83283433

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2022332970A Pending AU2022332970A1 (en) 2021-08-26 2022-08-24 Method and apparatus for metadata-based dynamic processing of audio data

Country Status (11)

Country Link
US (1) US20240355338A1 (https=)
EP (2) EP4683218A3 (https=)
JP (1) JP2024531963A (https=)
KR (1) KR20240043809A (https=)
AU (1) AU2022332970A1 (https=)
CA (1) CA3230363A1 (https=)
CL (1) CL2024000531A1 (https=)
ES (1) ES3059022T3 (https=)
IL (1) IL310650A (https=)
MX (1) MX2024002300A (https=)
WO (1) WO2023028154A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240412748A1 (en) * 2023-06-07 2024-12-12 The Nielsen Company (Us), Llc Communication of Payload Data Through Altered Sequence of Metadata Defining Audio-Rendering Directives

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014113471A1 (en) * 2013-01-21 2014-07-24 Dolby Laboratories Licensing Corporation System and method for optimizing loudness and dynamic range across different playback devices
EP3044786B1 (en) * 2013-09-12 2024-04-24 Dolby Laboratories Licensing Corporation Loudness adjustment for downmixed audio content
US9837086B2 (en) * 2015-07-31 2017-12-05 Apple Inc. Encoded audio extended metadata-based dynamic range control
US9934790B2 (en) * 2015-07-31 2018-04-03 Apple Inc. Encoded audio metadata-based equalization
US10356545B2 (en) * 2016-09-23 2019-07-16 Gaudio Lab, Inc. Method and device for processing audio signal by using metadata
JP7266916B2 (ja) * 2019-03-14 2023-05-01 ガウディオ・ラボ・インコーポレイテッド ラウドネスレベルを制御するオーディオ信号処理方法及び装置

Also Published As

Publication number Publication date
US20240355338A1 (en) 2024-10-24
JP2024531963A (ja) 2024-09-03
EP4683218A2 (en) 2026-01-21
CL2024000531A1 (es) 2024-12-27
IL310650A (en) 2024-04-01
EP4392970B1 (en) 2025-12-10
CA3230363A1 (en) 2023-03-02
EP4392970A1 (en) 2024-07-03
ES3059022T3 (en) 2026-03-16
WO2023028154A1 (en) 2023-03-02
EP4683218A3 (en) 2026-02-25
MX2024002300A (es) 2024-03-07
KR20240043809A (ko) 2024-04-03

Similar Documents

Publication Publication Date Title
JP6851523B2 (ja) 異なる再生装置を横断するラウドネスおよびダイナミックレンジの最適化
KR102686742B1 (ko) 객체 기반 오디오 신호 균형화
EP3111677B1 (en) Object-based audio loudness management
CN115668372B (zh) 用于在回放音频数据期间提高对话可理解性的方法和设备
US20240355338A1 (en) Method and apparatus for metadata-based dynamic processing of audio data
US12101070B2 (en) Method and system for processing audio signal
US20250046318A1 (en) Method and apparatus for processing of audio data
HK40130834A (en) Method and apparatus for metadata-based dynamic processing of audio data
EP4136753A1 (en) Automated mixing of audio description
US20250342841A1 (en) Method and apparatus for processing of audio data
CN117882133A (zh) 用于音频数据的基于元数据的动态处理的方法和装置
RU2858248C2 (ru) Способ и оборудование для динамической обработки на основе метаданных для аудиоданных
HK40104139A (zh) 用於音频数据的基於元数据的动态处理的方法和装置
CN118451498A (zh) 用于处理音频数据的方法和装置
JP7314398B2 (ja) 変更オーディオビットストリームの生成及び処理のための方法及び装置
HK40111646A (zh) 用於处理音频数据的方法和装置
HK40116098A (zh) 用於处理音频数据的方法和装置
JP7631313B2 (ja) 変更されたビットストリームを生成および処理する方法およびデバイス
HK1226580B (zh) 基於对象的音频响度管理