JP2022551535A - オーディオ符号化のための装置及び方法 - Google Patents

オーディオ符号化のための装置及び方法 Download PDF

Info

Publication number
JP2022551535A
JP2022551535A JP2022521735A JP2022521735A JP2022551535A JP 2022551535 A JP2022551535 A JP 2022551535A JP 2022521735 A JP2022521735 A JP 2022521735A JP 2022521735 A JP2022521735 A JP 2022521735A JP 2022551535 A JP2022551535 A JP 2022551535A
Authority
JP
Japan
Prior art keywords
audio
item
presentation metadata
metadata
items
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2022521735A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022551535A5 (https=
Inventor
パウルス ヘンリクス アントニウス ディレン
ボン フランシスカス マリヌス ヨセフス デ
イェルーン ジェラルドゥス ヘンリクス コッペンス
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips NV filed Critical Koninklijke Philips NV
Publication of JP2022551535A publication Critical patent/JP2022551535A/ja
Publication of JP2022551535A5 publication Critical patent/JP2022551535A5/ja
Priority to JP2025146640A priority Critical patent/JP2025179172A/ja
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • H04N21/2335Processing of audio elementary streams involving reformatting operations of audio signals, e.g. by converting from one coding standard to another
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H04N21/2353Processing of additional data, e.g. scrambling of additional data or processing content descriptors specifically adapted to content descriptors, e.g. coding, compressing or processing of metadata
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Library & Information Science (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
JP2022521735A 2019-10-14 2020-10-08 オーディオ符号化のための装置及び方法 Pending JP2022551535A (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2025146640A JP2025179172A (ja) 2019-10-14 2025-09-04 オーディオ符号化のための装置及び方法

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP19202935.3 2019-10-14
EP19202935.3A EP3809709A1 (en) 2019-10-14 2019-10-14 Apparatus and method for audio encoding
PCT/EP2020/078297 WO2021074007A1 (en) 2019-10-14 2020-10-08 Apparatus and method for audio encoding

Related Child Applications (1)

Application Number Title Priority Date Filing Date
JP2025146640A Division JP2025179172A (ja) 2019-10-14 2025-09-04 オーディオ符号化のための装置及び方法

Publications (2)

Publication Number Publication Date
JP2022551535A true JP2022551535A (ja) 2022-12-09
JP2022551535A5 JP2022551535A5 (https=) 2023-10-16

Family

ID=68280951

Family Applications (2)

Application Number Title Priority Date Filing Date
JP2022521735A Pending JP2022551535A (ja) 2019-10-14 2020-10-08 オーディオ符号化のための装置及び方法
JP2025146640A Pending JP2025179172A (ja) 2019-10-14 2025-09-04 オーディオ符号化のための装置及び方法

Family Applications After (1)

Application Number Title Priority Date Filing Date
JP2025146640A Pending JP2025179172A (ja) 2019-10-14 2025-09-04 オーディオ符号化のための装置及び方法

Country Status (8)

Country Link
US (1) US12431152B2 (https=)
EP (2) EP3809709A1 (https=)
JP (2) JP2022551535A (https=)
KR (1) KR20220084113A (https=)
CN (1) CN114600188B (https=)
BR (1) BR112022006905A2 (https=)
MX (1) MX2022004393A (https=)
WO (1) WO2021074007A1 (https=)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12531077B2 (en) * 2021-02-22 2026-01-20 Tencent America LLC Method and apparatus in audio processing
US11622221B2 (en) * 2021-05-05 2023-04-04 Tencent America LLC Method and apparatus for representing space of interest of audio scene
CN117501362B (zh) * 2021-06-15 2025-05-09 北京字跳网络技术有限公司 音频渲染系统、方法和电子设备
WO2022262758A1 (zh) * 2021-06-15 2022-12-22 北京字跳网络技术有限公司 音频渲染系统、方法和电子设备
GB2608406A (en) * 2021-06-30 2023-01-04 Nokia Technologies Oy Creating spatial audio stream from audio objects with spatial extent
GB2611800A (en) * 2021-10-15 2023-04-19 Nokia Technologies Oy A method and apparatus for efficient delivery of edge based rendering of 6DOF MPEG-I immersive audio
CN121312155A (zh) * 2023-05-31 2026-01-09 抖音视界有限公司 音频渲染方法、装置和非易失性计算机可读存储介质
CN119296553A (zh) * 2023-07-10 2025-01-10 华为技术有限公司 编码方法及电子设备
US12518772B2 (en) 2023-08-01 2026-01-06 Samsung Electronics Co., Ltd. Codec bitrate selection in audio object coding
GB2634524A (en) * 2023-10-11 2025-04-16 Nokia Technologies Oy Parametric spatial audio decoding with pass-through mode
CN118116397A (zh) * 2024-02-22 2024-05-31 中央广播电视总台 音频元数据编解码方法、传输方法、编码器终端及系统
WO2025232857A1 (en) * 2024-05-10 2025-11-13 Douyin Vision Co., Ltd. Audio processing method and apparatus

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120310654A1 (en) * 2010-02-11 2012-12-06 Dolby Laboratories Licensing Corporation System and Method for Non-destructively Normalizing Loudness of Audio Signals Within Portable Devices
US20140294200A1 (en) * 2013-03-29 2014-10-02 Apple Inc. Metadata for loudness and dynamic range control
JP2014532901A (ja) * 2011-11-01 2014-12-08 コーニンクレッカ フィリップス エヌ ヴェ オーディオオブジェクトのエンコーディング及びデコーディング
JP2015522183A (ja) * 2012-07-15 2015-08-03 クゥアルコム・インコーポレイテッドQualcomm Incorporated 基底関数係数を使用した3次元オーディオコード化のためのシステム、方法、装置、およびコンピュータ可読媒体
US20150332680A1 (en) * 2012-12-21 2015-11-19 Dolby Laboratories Licensing Corporation Object Clustering for Rendering Object-Based Audio Content Based on Perceptual Criteria
WO2018047667A1 (ja) * 2016-09-12 2018-03-15 ソニー株式会社 音声処理装置および方法
JP2018067931A (ja) * 2013-03-28 2018-04-26 ドルビー ラボラトリーズ ライセンシング コーポレイション 見かけのサイズをもつオーディオ・オブジェクトの任意のラウドスピーカー・レイアウトへのレンダリング
WO2018180531A1 (ja) * 2017-03-28 2018-10-04 ソニー株式会社 情報処理装置、情報処理方法、およびプログラム
US20190132674A1 (en) * 2016-04-22 2019-05-02 Nokia Technologies Oy Merging Audio Signals with Spatial Metadata

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2154911A1 (en) * 2008-08-13 2010-02-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. An apparatus for determining a spatial output multi-channel audio signal
US8908874B2 (en) * 2010-09-08 2014-12-09 Dts, Inc. Spatial audio encoding and reproduction
TWI896112B (zh) * 2010-12-03 2025-09-01 美商杜比實驗室特許公司 音頻解碼裝置、音頻解碼方法及音頻編碼方法
EP2830336A3 (en) * 2013-07-22 2015-03-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Renderer controlled spatial upmix
EP2830045A1 (en) * 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Concept for audio encoding and decoding for audio channels and audio objects
EP2830332A3 (en) * 2013-07-22 2015-03-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Method, signal processing unit, and computer program for mapping a plurality of input channels of an input channel configuration to output channels of an output channel configuration
SG11201600466PA (en) * 2013-07-22 2016-02-26 Fraunhofer Ges Forschung Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals
EP3059732B1 (en) * 2013-10-17 2018-10-10 Socionext Inc. Audio decoding device
EP2866227A1 (en) * 2013-10-22 2015-04-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Method for decoding and encoding a downmix matrix, method for presenting audio content, encoder and decoder for a downmix matrix, audio encoder and audio decoder
US20180357038A1 (en) * 2017-06-09 2018-12-13 Qualcomm Incorporated Audio metadata modification at rendering device

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120310654A1 (en) * 2010-02-11 2012-12-06 Dolby Laboratories Licensing Corporation System and Method for Non-destructively Normalizing Loudness of Audio Signals Within Portable Devices
JP2014532901A (ja) * 2011-11-01 2014-12-08 コーニンクレッカ フィリップス エヌ ヴェ オーディオオブジェクトのエンコーディング及びデコーディング
JP2015522183A (ja) * 2012-07-15 2015-08-03 クゥアルコム・インコーポレイテッドQualcomm Incorporated 基底関数係数を使用した3次元オーディオコード化のためのシステム、方法、装置、およびコンピュータ可読媒体
US20150332680A1 (en) * 2012-12-21 2015-11-19 Dolby Laboratories Licensing Corporation Object Clustering for Rendering Object-Based Audio Content Based on Perceptual Criteria
JP2016509249A (ja) * 2012-12-21 2016-03-24 ドルビー ラボラトリーズ ライセンシング コーポレイション 知覚的基準に基づいてオブジェクト・ベースのオーディオ・コンテンツをレンダリングするためのオブジェクト・クラスタリング
JP2018067931A (ja) * 2013-03-28 2018-04-26 ドルビー ラボラトリーズ ライセンシング コーポレイション 見かけのサイズをもつオーディオ・オブジェクトの任意のラウドスピーカー・レイアウトへのレンダリング
US20140294200A1 (en) * 2013-03-29 2014-10-02 Apple Inc. Metadata for loudness and dynamic range control
US20190132674A1 (en) * 2016-04-22 2019-05-02 Nokia Technologies Oy Merging Audio Signals with Spatial Metadata
WO2018047667A1 (ja) * 2016-09-12 2018-03-15 ソニー株式会社 音声処理装置および方法
WO2018180531A1 (ja) * 2017-03-28 2018-10-04 ソニー株式会社 情報処理装置、情報処理方法、およびプログラム

Also Published As

Publication number Publication date
EP4046385A1 (en) 2022-08-24
EP3809709A1 (en) 2021-04-21
MX2022004393A (es) 2022-05-18
US20220383885A1 (en) 2022-12-01
CN114600188A (zh) 2022-06-07
EP4046385B1 (en) 2026-03-11
BR112022006905A2 (pt) 2022-07-05
CN114600188B (zh) 2025-07-08
JP2025179172A (ja) 2025-12-09
WO2021074007A1 (en) 2021-04-22
EP4046385C0 (en) 2026-03-11
KR20220084113A (ko) 2022-06-21
US12431152B2 (en) 2025-09-30

Similar Documents

Publication Publication Date Title
US12431152B2 (en) Apparatus and method for audio encoding
JP5281575B2 (ja) オーディオオブジェクトのエンコード及びデコード
CN101490743B (zh) 对立体声音频信号的动态解码
KR101790641B1 (ko) 하이브리드 파형-코딩 및 파라미터-코딩된 스피치 인핸스
US20150248889A1 (en) Layered approach to spatial audio coding
JP5319704B2 (ja) オーディオ信号の処理方法及び装置
CN112673649B (zh) 空间音频增强
US11545166B2 (en) Using metadata to aggregate signal processing operations
GB2580899A (en) Audio representation and associated rendering
EP3923280A1 (en) Adapting multi-source inputs for constant rate encoding
KR20240012519A (ko) 3차원 오디오 신호를 처리하기 위한 방법 및 장치
US12380904B2 (en) Seamless scalable decoding of channels, objects, and HOA audio content
WO2025136874A1 (en) Pose correction metadata for interactive headtracking
RU2823537C1 (ru) Устройство и способ кодирования аудио
CN120226077A (zh) 用于音频比特流编码和解码的方法、设备和介质
JP7703692B2 (ja) 3次元オーディオ信号符号化方法および装置、ならびにエンコーダ
EP4535831A1 (en) Modification of spatial audio scenes
Fug et al. An Introduction to MPEG-H 3D Audio
HK40128667A (zh) 用於音频比特流编码和解码的方法、设备和介质
CN121464479A (zh) 用于对空间音频内容进行编码的装置、方法和计算机程序
CN120266202A (zh) 用于对音频比特流和相关联返回声道信息进行编码和解码的方法、装置和介质
CN120835168A (zh) 用于提供个性化音频流式传输和渲染的系统和方法
HK1222470B (zh) 混合波形编码和参数编码语音增强

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20231005

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20231005

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20241127

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20241209

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20250304

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20250523

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20250904