KR20220084113A - 오디오 인코딩을 위한 장치 및 방법 - Google Patents
오디오 인코딩을 위한 장치 및 방법 Download PDFInfo
- Publication number
- KR20220084113A KR20220084113A KR1020227016218A KR20227016218A KR20220084113A KR 20220084113 A KR20220084113 A KR 20220084113A KR 1020227016218 A KR1020227016218 A KR 1020227016218A KR 20227016218 A KR20227016218 A KR 20227016218A KR 20220084113 A KR20220084113 A KR 20220084113A
- Authority
- KR
- South Korea
- Prior art keywords
- audio
- item
- presentation metadata
- items
- metadata
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/233—Processing of audio elementary streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/233—Processing of audio elementary streams
- H04N21/2335—Processing of audio elementary streams involving reformatting operations of audio signals, e.g. by converting from one coding standard to another
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
- H04N21/2353—Processing of additional data, e.g. scrambling of additional data or processing content descriptors specifically adapted to content descriptors, e.g. coding, compressing or processing of metadata
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4394—Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Library & Information Science (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP19202935.3 | 2019-10-14 | ||
| EP19202935.3A EP3809709A1 (en) | 2019-10-14 | 2019-10-14 | Apparatus and method for audio encoding |
| PCT/EP2020/078297 WO2021074007A1 (en) | 2019-10-14 | 2020-10-08 | Apparatus and method for audio encoding |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| KR20220084113A true KR20220084113A (ko) | 2022-06-21 |
Family
ID=68280951
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020227016218A Pending KR20220084113A (ko) | 2019-10-14 | 2020-10-08 | 오디오 인코딩을 위한 장치 및 방법 |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US12431152B2 (https=) |
| EP (2) | EP3809709A1 (https=) |
| JP (2) | JP2022551535A (https=) |
| KR (1) | KR20220084113A (https=) |
| CN (1) | CN114600188B (https=) |
| BR (1) | BR112022006905A2 (https=) |
| MX (1) | MX2022004393A (https=) |
| WO (1) | WO2021074007A1 (https=) |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12531077B2 (en) * | 2021-02-22 | 2026-01-20 | Tencent America LLC | Method and apparatus in audio processing |
| US11622221B2 (en) * | 2021-05-05 | 2023-04-04 | Tencent America LLC | Method and apparatus for representing space of interest of audio scene |
| CN117501362B (zh) * | 2021-06-15 | 2025-05-09 | 北京字跳网络技术有限公司 | 音频渲染系统、方法和电子设备 |
| WO2022262758A1 (zh) * | 2021-06-15 | 2022-12-22 | 北京字跳网络技术有限公司 | 音频渲染系统、方法和电子设备 |
| GB2608406A (en) * | 2021-06-30 | 2023-01-04 | Nokia Technologies Oy | Creating spatial audio stream from audio objects with spatial extent |
| GB2611800A (en) * | 2021-10-15 | 2023-04-19 | Nokia Technologies Oy | A method and apparatus for efficient delivery of edge based rendering of 6DOF MPEG-I immersive audio |
| CN121312155A (zh) * | 2023-05-31 | 2026-01-09 | 抖音视界有限公司 | 音频渲染方法、装置和非易失性计算机可读存储介质 |
| CN119296553A (zh) * | 2023-07-10 | 2025-01-10 | 华为技术有限公司 | 编码方法及电子设备 |
| US12518772B2 (en) | 2023-08-01 | 2026-01-06 | Samsung Electronics Co., Ltd. | Codec bitrate selection in audio object coding |
| GB2634524A (en) * | 2023-10-11 | 2025-04-16 | Nokia Technologies Oy | Parametric spatial audio decoding with pass-through mode |
| CN118116397A (zh) * | 2024-02-22 | 2024-05-31 | 中央广播电视总台 | 音频元数据编解码方法、传输方法、编码器终端及系统 |
| WO2025232857A1 (en) * | 2024-05-10 | 2025-11-13 | Douyin Vision Co., Ltd. | Audio processing method and apparatus |
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2154911A1 (en) * | 2008-08-13 | 2010-02-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | An apparatus for determining a spatial output multi-channel audio signal |
| TWI529703B (zh) * | 2010-02-11 | 2016-04-11 | 杜比實驗室特許公司 | 用以非破壞地正常化可攜式裝置中音訊訊號響度之系統及方法 |
| US8908874B2 (en) * | 2010-09-08 | 2014-12-09 | Dts, Inc. | Spatial audio encoding and reproduction |
| TWI896112B (zh) * | 2010-12-03 | 2025-09-01 | 美商杜比實驗室特許公司 | 音頻解碼裝置、音頻解碼方法及音頻編碼方法 |
| JP6096789B2 (ja) * | 2011-11-01 | 2017-03-15 | コーニンクレッカ フィリップス エヌ ヴェKoninklijke Philips N.V. | オーディオオブジェクトのエンコーディング及びデコーディング |
| US9190065B2 (en) * | 2012-07-15 | 2015-11-17 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for three-dimensional audio coding using basis function coefficients |
| US9805725B2 (en) * | 2012-12-21 | 2017-10-31 | Dolby Laboratories Licensing Corporation | Object clustering for rendering object-based audio content based on perceptual criteria |
| CN105075292B (zh) * | 2013-03-28 | 2017-07-25 | 杜比实验室特许公司 | 用于创作和渲染音频再现数据的方法和设备 |
| US9559651B2 (en) * | 2013-03-29 | 2017-01-31 | Apple Inc. | Metadata for loudness and dynamic range control |
| EP2830336A3 (en) * | 2013-07-22 | 2015-03-04 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Renderer controlled spatial upmix |
| EP2830045A1 (en) * | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Concept for audio encoding and decoding for audio channels and audio objects |
| EP2830332A3 (en) * | 2013-07-22 | 2015-03-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method, signal processing unit, and computer program for mapping a plurality of input channels of an input channel configuration to output channels of an output channel configuration |
| SG11201600466PA (en) * | 2013-07-22 | 2016-02-26 | Fraunhofer Ges Forschung | Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals |
| EP3059732B1 (en) * | 2013-10-17 | 2018-10-10 | Socionext Inc. | Audio decoding device |
| EP2866227A1 (en) * | 2013-10-22 | 2015-04-29 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method for decoding and encoding a downmix matrix, method for presenting audio content, encoder and decoder for a downmix matrix, audio encoder and audio decoder |
| GB2549532A (en) * | 2016-04-22 | 2017-10-25 | Nokia Technologies Oy | Merging audio signals with spatial metadata |
| WO2018047667A1 (ja) * | 2016-09-12 | 2018-03-15 | ソニー株式会社 | 音声処理装置および方法 |
| US11074921B2 (en) * | 2017-03-28 | 2021-07-27 | Sony Corporation | Information processing device and information processing method |
| US20180357038A1 (en) * | 2017-06-09 | 2018-12-13 | Qualcomm Incorporated | Audio metadata modification at rendering device |
-
2019
- 2019-10-14 EP EP19202935.3A patent/EP3809709A1/en not_active Withdrawn
-
2020
- 2020-10-08 US US17/765,002 patent/US12431152B2/en active Active
- 2020-10-08 KR KR1020227016218A patent/KR20220084113A/ko active Pending
- 2020-10-08 MX MX2022004393A patent/MX2022004393A/es unknown
- 2020-10-08 EP EP20785538.8A patent/EP4046385B1/en active Active
- 2020-10-08 CN CN202080072214.3A patent/CN114600188B/zh active Active
- 2020-10-08 WO PCT/EP2020/078297 patent/WO2021074007A1/en not_active Ceased
- 2020-10-08 JP JP2022521735A patent/JP2022551535A/ja active Pending
- 2020-10-08 BR BR112022006905A patent/BR112022006905A2/pt unknown
-
2025
- 2025-09-04 JP JP2025146640A patent/JP2025179172A/ja active Pending
Also Published As
| Publication number | Publication date |
|---|---|
| EP4046385A1 (en) | 2022-08-24 |
| EP3809709A1 (en) | 2021-04-21 |
| MX2022004393A (es) | 2022-05-18 |
| US20220383885A1 (en) | 2022-12-01 |
| CN114600188A (zh) | 2022-06-07 |
| EP4046385B1 (en) | 2026-03-11 |
| BR112022006905A2 (pt) | 2022-07-05 |
| CN114600188B (zh) | 2025-07-08 |
| JP2022551535A (ja) | 2022-12-09 |
| JP2025179172A (ja) | 2025-12-09 |
| WO2021074007A1 (en) | 2021-04-22 |
| EP4046385C0 (en) | 2026-03-11 |
| US12431152B2 (en) | 2025-09-30 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP4046385B1 (en) | Apparatus and method for audio encoding | |
| JP5281575B2 (ja) | オーディオオブジェクトのエンコード及びデコード | |
| JP6045696B2 (ja) | オーディオ信号処理方法および装置 | |
| Quackenbush et al. | MPEG standards for compressed representation of immersive audio | |
| CN112673649B (zh) | 空间音频增强 | |
| KR102148217B1 (ko) | 위치기반 오디오 신호처리 방법 | |
| CN113678198A (zh) | 音频编解码器扩展 | |
| CN112567765B (zh) | 空间音频捕获、传输和再现 | |
| GB2580899A (en) | Audio representation and associated rendering | |
| EP3923280A1 (en) | Adapting multi-source inputs for constant rate encoding | |
| US11950080B2 (en) | Method and device for processing audio signal, using metadata | |
| KR102059846B1 (ko) | 오디오 신호 처리 방법 및 장치 | |
| US20240105196A1 (en) | Method and System for Encoding Loudness Metadata of Audio Components | |
| RU2823537C1 (ru) | Устройство и способ кодирования аудио | |
| EP4636762A1 (en) | System and method to provide personalized audio streaming and rendering | |
| Fug et al. | An Introduction to MPEG-H 3D Audio | |
| KR20240004869A (ko) | 3차원 오디오 신호 인코딩 방법 및 장치, 및 인코더 | |
| CN120266202A (zh) | 用于对音频比特流和相关联返回声道信息进行编码和解码的方法、装置和介质 | |
| CN119998873A (zh) | 用灵活的基于块的语法对音频比特流进行编码和解码的方法、装置和介质 | |
| CN119998871A (zh) | 用参数灵活渲染配置数据对音频比特流进行编码和解码的方法、装置和介质 | |
| CN120077434A (zh) | 用于音频比特流和关联回声参考信号的编码和解码的方法、装置和介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PA0105 | International application |
St.27 status event code: A-0-1-A10-A15-nap-PA0105 |
|
| E13-X000 | Pre-grant limitation requested |
St.27 status event code: A-2-3-E10-E13-lim-X000 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| PG1501 | Laying open of application |
St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| PA0201 | Request for examination |
St.27 status event code: A-1-2-D10-D11-exm-PA0201 |
|
| R18-X000 | Changes to party contact information recorded |
St.27 status event code: A-3-3-R10-R18-oth-X000 |
|
| R18-X000 | Changes to party contact information recorded |
St.27 status event code: A-3-3-R10-R18-oth-X000 |
|
| R18 | Changes to party contact information recorded |
Free format text: ST27 STATUS EVENT CODE: A-3-3-R10-R18-OTH-X000 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| R18-X000 | Changes to party contact information recorded |
St.27 status event code: A-3-3-R10-R18-oth-X000 |