JP2022551535A - オーディオ符号化のための装置及び方法 - Google Patents
オーディオ符号化のための装置及び方法 Download PDFInfo
- Publication number
- JP2022551535A JP2022551535A JP2022521735A JP2022521735A JP2022551535A JP 2022551535 A JP2022551535 A JP 2022551535A JP 2022521735 A JP2022521735 A JP 2022521735A JP 2022521735 A JP2022521735 A JP 2022521735A JP 2022551535 A JP2022551535 A JP 2022551535A
- Authority
- JP
- Japan
- Prior art keywords
- audio
- item
- presentation metadata
- metadata
- items
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/233—Processing of audio elementary streams
- H04N21/2335—Processing of audio elementary streams involving reformatting operations of audio signals, e.g. by converting from one coding standard to another
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/233—Processing of audio elementary streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
- H04N21/2353—Processing of additional data, e.g. scrambling of additional data or processing content descriptors specifically adapted to content descriptors, e.g. coding, compressing or processing of metadata
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4394—Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Library & Information Science (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2025146640A JP2025179172A (ja) | 2019-10-14 | 2025-09-04 | オーディオ符号化のための装置及び方法 |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP19202935.3 | 2019-10-14 | ||
| EP19202935.3A EP3809709A1 (en) | 2019-10-14 | 2019-10-14 | Apparatus and method for audio encoding |
| PCT/EP2020/078297 WO2021074007A1 (en) | 2019-10-14 | 2020-10-08 | Apparatus and method for audio encoding |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2025146640A Division JP2025179172A (ja) | 2019-10-14 | 2025-09-04 | オーディオ符号化のための装置及び方法 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2022551535A true JP2022551535A (ja) | 2022-12-09 |
| JP2022551535A5 JP2022551535A5 (https=) | 2023-10-16 |
Family
ID=68280951
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2022521735A Pending JP2022551535A (ja) | 2019-10-14 | 2020-10-08 | オーディオ符号化のための装置及び方法 |
| JP2025146640A Pending JP2025179172A (ja) | 2019-10-14 | 2025-09-04 | オーディオ符号化のための装置及び方法 |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2025146640A Pending JP2025179172A (ja) | 2019-10-14 | 2025-09-04 | オーディオ符号化のための装置及び方法 |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US12431152B2 (https=) |
| EP (2) | EP3809709A1 (https=) |
| JP (2) | JP2022551535A (https=) |
| KR (1) | KR20220084113A (https=) |
| CN (1) | CN114600188B (https=) |
| BR (1) | BR112022006905A2 (https=) |
| MX (1) | MX2022004393A (https=) |
| WO (1) | WO2021074007A1 (https=) |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12531077B2 (en) * | 2021-02-22 | 2026-01-20 | Tencent America LLC | Method and apparatus in audio processing |
| US11622221B2 (en) * | 2021-05-05 | 2023-04-04 | Tencent America LLC | Method and apparatus for representing space of interest of audio scene |
| CN117501362B (zh) * | 2021-06-15 | 2025-05-09 | 北京字跳网络技术有限公司 | 音频渲染系统、方法和电子设备 |
| WO2022262758A1 (zh) * | 2021-06-15 | 2022-12-22 | 北京字跳网络技术有限公司 | 音频渲染系统、方法和电子设备 |
| GB2608406A (en) * | 2021-06-30 | 2023-01-04 | Nokia Technologies Oy | Creating spatial audio stream from audio objects with spatial extent |
| GB2611800A (en) * | 2021-10-15 | 2023-04-19 | Nokia Technologies Oy | A method and apparatus for efficient delivery of edge based rendering of 6DOF MPEG-I immersive audio |
| CN121312155A (zh) * | 2023-05-31 | 2026-01-09 | 抖音视界有限公司 | 音频渲染方法、装置和非易失性计算机可读存储介质 |
| CN119296553A (zh) * | 2023-07-10 | 2025-01-10 | 华为技术有限公司 | 编码方法及电子设备 |
| US12518772B2 (en) | 2023-08-01 | 2026-01-06 | Samsung Electronics Co., Ltd. | Codec bitrate selection in audio object coding |
| GB2634524A (en) * | 2023-10-11 | 2025-04-16 | Nokia Technologies Oy | Parametric spatial audio decoding with pass-through mode |
| CN118116397A (zh) * | 2024-02-22 | 2024-05-31 | 中央广播电视总台 | 音频元数据编解码方法、传输方法、编码器终端及系统 |
| WO2025232857A1 (en) * | 2024-05-10 | 2025-11-13 | Douyin Vision Co., Ltd. | Audio processing method and apparatus |
Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120310654A1 (en) * | 2010-02-11 | 2012-12-06 | Dolby Laboratories Licensing Corporation | System and Method for Non-destructively Normalizing Loudness of Audio Signals Within Portable Devices |
| US20140294200A1 (en) * | 2013-03-29 | 2014-10-02 | Apple Inc. | Metadata for loudness and dynamic range control |
| JP2014532901A (ja) * | 2011-11-01 | 2014-12-08 | コーニンクレッカ フィリップス エヌ ヴェ | オーディオオブジェクトのエンコーディング及びデコーディング |
| JP2015522183A (ja) * | 2012-07-15 | 2015-08-03 | クゥアルコム・インコーポレイテッドQualcomm Incorporated | 基底関数係数を使用した3次元オーディオコード化のためのシステム、方法、装置、およびコンピュータ可読媒体 |
| US20150332680A1 (en) * | 2012-12-21 | 2015-11-19 | Dolby Laboratories Licensing Corporation | Object Clustering for Rendering Object-Based Audio Content Based on Perceptual Criteria |
| WO2018047667A1 (ja) * | 2016-09-12 | 2018-03-15 | ソニー株式会社 | 音声処理装置および方法 |
| JP2018067931A (ja) * | 2013-03-28 | 2018-04-26 | ドルビー ラボラトリーズ ライセンシング コーポレイション | 見かけのサイズをもつオーディオ・オブジェクトの任意のラウドスピーカー・レイアウトへのレンダリング |
| WO2018180531A1 (ja) * | 2017-03-28 | 2018-10-04 | ソニー株式会社 | 情報処理装置、情報処理方法、およびプログラム |
| US20190132674A1 (en) * | 2016-04-22 | 2019-05-02 | Nokia Technologies Oy | Merging Audio Signals with Spatial Metadata |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2154911A1 (en) * | 2008-08-13 | 2010-02-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | An apparatus for determining a spatial output multi-channel audio signal |
| US8908874B2 (en) * | 2010-09-08 | 2014-12-09 | Dts, Inc. | Spatial audio encoding and reproduction |
| TWI896112B (zh) * | 2010-12-03 | 2025-09-01 | 美商杜比實驗室特許公司 | 音頻解碼裝置、音頻解碼方法及音頻編碼方法 |
| EP2830336A3 (en) * | 2013-07-22 | 2015-03-04 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Renderer controlled spatial upmix |
| EP2830045A1 (en) * | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Concept for audio encoding and decoding for audio channels and audio objects |
| EP2830332A3 (en) * | 2013-07-22 | 2015-03-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method, signal processing unit, and computer program for mapping a plurality of input channels of an input channel configuration to output channels of an output channel configuration |
| SG11201600466PA (en) * | 2013-07-22 | 2016-02-26 | Fraunhofer Ges Forschung | Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals |
| EP3059732B1 (en) * | 2013-10-17 | 2018-10-10 | Socionext Inc. | Audio decoding device |
| EP2866227A1 (en) * | 2013-10-22 | 2015-04-29 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method for decoding and encoding a downmix matrix, method for presenting audio content, encoder and decoder for a downmix matrix, audio encoder and audio decoder |
| US20180357038A1 (en) * | 2017-06-09 | 2018-12-13 | Qualcomm Incorporated | Audio metadata modification at rendering device |
-
2019
- 2019-10-14 EP EP19202935.3A patent/EP3809709A1/en not_active Withdrawn
-
2020
- 2020-10-08 US US17/765,002 patent/US12431152B2/en active Active
- 2020-10-08 KR KR1020227016218A patent/KR20220084113A/ko active Pending
- 2020-10-08 MX MX2022004393A patent/MX2022004393A/es unknown
- 2020-10-08 EP EP20785538.8A patent/EP4046385B1/en active Active
- 2020-10-08 CN CN202080072214.3A patent/CN114600188B/zh active Active
- 2020-10-08 WO PCT/EP2020/078297 patent/WO2021074007A1/en not_active Ceased
- 2020-10-08 JP JP2022521735A patent/JP2022551535A/ja active Pending
- 2020-10-08 BR BR112022006905A patent/BR112022006905A2/pt unknown
-
2025
- 2025-09-04 JP JP2025146640A patent/JP2025179172A/ja active Pending
Patent Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120310654A1 (en) * | 2010-02-11 | 2012-12-06 | Dolby Laboratories Licensing Corporation | System and Method for Non-destructively Normalizing Loudness of Audio Signals Within Portable Devices |
| JP2014532901A (ja) * | 2011-11-01 | 2014-12-08 | コーニンクレッカ フィリップス エヌ ヴェ | オーディオオブジェクトのエンコーディング及びデコーディング |
| JP2015522183A (ja) * | 2012-07-15 | 2015-08-03 | クゥアルコム・インコーポレイテッドQualcomm Incorporated | 基底関数係数を使用した3次元オーディオコード化のためのシステム、方法、装置、およびコンピュータ可読媒体 |
| US20150332680A1 (en) * | 2012-12-21 | 2015-11-19 | Dolby Laboratories Licensing Corporation | Object Clustering for Rendering Object-Based Audio Content Based on Perceptual Criteria |
| JP2016509249A (ja) * | 2012-12-21 | 2016-03-24 | ドルビー ラボラトリーズ ライセンシング コーポレイション | 知覚的基準に基づいてオブジェクト・ベースのオーディオ・コンテンツをレンダリングするためのオブジェクト・クラスタリング |
| JP2018067931A (ja) * | 2013-03-28 | 2018-04-26 | ドルビー ラボラトリーズ ライセンシング コーポレイション | 見かけのサイズをもつオーディオ・オブジェクトの任意のラウドスピーカー・レイアウトへのレンダリング |
| US20140294200A1 (en) * | 2013-03-29 | 2014-10-02 | Apple Inc. | Metadata for loudness and dynamic range control |
| US20190132674A1 (en) * | 2016-04-22 | 2019-05-02 | Nokia Technologies Oy | Merging Audio Signals with Spatial Metadata |
| WO2018047667A1 (ja) * | 2016-09-12 | 2018-03-15 | ソニー株式会社 | 音声処理装置および方法 |
| WO2018180531A1 (ja) * | 2017-03-28 | 2018-10-04 | ソニー株式会社 | 情報処理装置、情報処理方法、およびプログラム |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4046385A1 (en) | 2022-08-24 |
| EP3809709A1 (en) | 2021-04-21 |
| MX2022004393A (es) | 2022-05-18 |
| US20220383885A1 (en) | 2022-12-01 |
| CN114600188A (zh) | 2022-06-07 |
| EP4046385B1 (en) | 2026-03-11 |
| BR112022006905A2 (pt) | 2022-07-05 |
| CN114600188B (zh) | 2025-07-08 |
| JP2025179172A (ja) | 2025-12-09 |
| WO2021074007A1 (en) | 2021-04-22 |
| EP4046385C0 (en) | 2026-03-11 |
| KR20220084113A (ko) | 2022-06-21 |
| US12431152B2 (en) | 2025-09-30 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12431152B2 (en) | Apparatus and method for audio encoding | |
| JP5281575B2 (ja) | オーディオオブジェクトのエンコード及びデコード | |
| CN101490743B (zh) | 对立体声音频信号的动态解码 | |
| KR101790641B1 (ko) | 하이브리드 파형-코딩 및 파라미터-코딩된 스피치 인핸스 | |
| US20150248889A1 (en) | Layered approach to spatial audio coding | |
| JP5319704B2 (ja) | オーディオ信号の処理方法及び装置 | |
| CN112673649B (zh) | 空间音频增强 | |
| US11545166B2 (en) | Using metadata to aggregate signal processing operations | |
| GB2580899A (en) | Audio representation and associated rendering | |
| EP3923280A1 (en) | Adapting multi-source inputs for constant rate encoding | |
| KR20240012519A (ko) | 3차원 오디오 신호를 처리하기 위한 방법 및 장치 | |
| US12380904B2 (en) | Seamless scalable decoding of channels, objects, and HOA audio content | |
| WO2025136874A1 (en) | Pose correction metadata for interactive headtracking | |
| RU2823537C1 (ru) | Устройство и способ кодирования аудио | |
| CN120226077A (zh) | 用于音频比特流编码和解码的方法、设备和介质 | |
| JP7703692B2 (ja) | 3次元オーディオ信号符号化方法および装置、ならびにエンコーダ | |
| EP4535831A1 (en) | Modification of spatial audio scenes | |
| Fug et al. | An Introduction to MPEG-H 3D Audio | |
| HK40128667A (zh) | 用於音频比特流编码和解码的方法、设备和介质 | |
| CN121464479A (zh) | 用于对空间音频内容进行编码的装置、方法和计算机程序 | |
| CN120266202A (zh) | 用于对音频比特流和相关联返回声道信息进行编码和解码的方法、装置和介质 | |
| CN120835168A (zh) | 用于提供个性化音频流式传输和渲染的系统和方法 | |
| HK1222470B (zh) | 混合波形编码和参数编码语音增强 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20231005 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20231005 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20241127 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20241209 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20250304 |
|
| A02 | Decision of refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A02 Effective date: 20250523 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20250904 |