KR102721752B1 - Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering - Google Patents

Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering

Info

Publication number
KR102721752B1
Authority
KR
South Korea
Prior art keywords
audio
bitstream
3dof
rendering
6dof
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
KR1020207024701A
Other languages
English (en)
Korean (ko)
Other versions
KR20200141438A (ko)
Inventor
Leon Terentiv
Christof Fersch
Daniel Fischer
Original Assignee
Dolby International AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby International AB
Priority to KR1020247035074A (KR20240155983A)
Publication of KR20200141438A
Application granted
Publication of KR102721752B1
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/167 Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/24 Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008 Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H04S7/302 Electronic adaptation of stereophonic sound system to listener position or orientation
    • H04S7/303 Tracking of listener position or orientation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01 Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11 Positioning of individual sound objects, e.g. moving airplane, within a sound field

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
KR1020207024701A 2018-04-11 2019-04-09 Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering Active KR102721752B1 (ko)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020247035074A KR20240155983A (ko) 2018-04-11 2019-04-09 Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201862655990P 2018-04-11 2018-04-11
US62/655,990 2018-04-11
PCT/EP2019/058955 WO2019197404A1 (en) 2018-04-11 2019-04-09 Methods, apparatus and systems for 6dof audio rendering and data representations and bitstream structures for 6dof audio rendering

Related Child Applications (1)

Application Number Title Priority Date Filing Date
KR1020247035074A Division KR20240155983A (ko) 2018-04-11 2019-04-09 Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering

Publications (2)

Publication Number Publication Date
KR20200141438A (ko) 2020-12-18
KR102721752B1 (ko) 2024-10-25

Family

ID=66165970

Family Applications (2)

Application Number Title Priority Date Filing Date
KR1020207024701A Active KR102721752B1 (ko) 2018-04-11 2019-04-09 Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering
KR1020247035074A Pending KR20240155983A (ko) 2018-04-11 2019-04-09 Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering

Family Applications After (1)

Application Number Title Priority Date Filing Date
KR1020247035074A Pending KR20240155983A (ko) 2018-04-11 2019-04-09 Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering

Country Status (7)

Country Link
US (3) US11432099B2
EP (3) EP3776543B1
JP (3) JP7093841B2
KR (2) KR102721752B1
CN (4) CN118824260A
BR (1) BR112020015835A2
WO (1) WO2019197404A1

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2563635A (en) * 2017-06-21 2018-12-26 Nokia Technologies Oy Recording and rendering audio signals
US10405126B2 (en) 2017-06-30 2019-09-03 Qualcomm Incorporated Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems
BR112020015835A2 (pt) 2018-04-11 2020-12-15 Dolby International Ab Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering
WO2019204214A2 (en) * 2018-04-16 2019-10-24 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for encoding and decoding of directional sound sources
US11356793B2 (en) * 2019-10-01 2022-06-07 Qualcomm Incorporated Controlling rendering of audio data
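KR102741553B1 (ko) * 2019-12-04 2024-12-12 Electronics and Telecommunications Research Institute Audio data transmitting method, audio data reproducing method, audio data transmitting device and audio data reproducing device for optimization of rendering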
EP4089673B1 (en) * 2020-01-10 2026-02-25 Sony Group Corporation Encoding device and decoding device
US11967329B2 (en) * 2020-02-20 2024-04-23 Qualcomm Incorporated Signaling for rendering tools
CN114067810B (zh) * 2020-07-31 2025-12-12 Huawei Technologies Co., Ltd. Audio signal rendering method and apparatus
US11750998B2 (en) 2020-09-30 2023-09-05 Qualcomm Incorporated Controlling rendering of audio data
US11750745B2 (en) 2020-11-18 2023-09-05 Kelly Properties, Llc Processing and distribution of audio signals in a multi-party conferencing environment
US11743670B2 (en) 2020-12-18 2023-08-29 Qualcomm Incorporated Correlation-based rendering with multiple distributed streams accounting for an occlusion for six degree of freedom applications
KR20240012569A (ko) * 2021-05-27 2024-01-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoding and decoding of an acoustic environment
US11956409B2 (en) * 2021-08-23 2024-04-09 Tencent America LLC Immersive media interoperability
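WO2023025143A1 (zh) * 2021-08-24 2023-03-02 Beijing Zitiao Network Technology Co., Ltd. Audio signal processing method and apparatus
JP2024542412A (ja) * 2021-11-09 2024-11-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder, audio encoder, decoding method, encoding method, and bitstream using a plurality of packets, the packets including one or more scene configuration packets defining a temporal evolution of a rendering scenario and including timestamp information
CN118511546A (zh) * 2021-11-09 2024-08-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Late reverberation distance attenuation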
GB202118094D0 (en) * 2021-12-14 2022-01-26 Nokia Technologies Oy A method and apparatus for AR scene modification
US20260046585A1 (en) * 2022-07-11 2026-02-12 Electronics And Telecommunications Research Institute Audio rendering method based on recording distance parameter and apparatus for performing same
US12604152B2 2022-12-07 2026-04-14 Dolby Laboratories Licensing Corporation Binaural rendering
EP4697327A4 (en) * 2023-04-11 2026-03-25 Beijing Xiaomi Mobile Software Co Ltd METHOD AND APPARATUS FOR PROCESSING AUDIO CODE STREAM SIGNAL, ELECTRONIC DEVICE AND STORAGE MEDIA
KR102837322B1 (ko) * 2023-04-19 2025-07-23 Electronics and Telecommunications Research Institute Method and apparatus for reconstructing a bitstream for spatial audio rendering
WO2025054331A1 (en) * 2023-09-05 2025-03-13 Virtuel Works Llc Spatial audio scene description and rendering

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140023196A1 (en) 2012-07-20 2014-01-23 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec
US20150264484A1 (en) 2013-02-08 2015-09-17 Qualcomm Incorporated Obtaining sparseness information for higher order ambisonic audio renderers

Family Cites Families (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9009057B2 (en) 2006-02-21 2015-04-14 Koninklijke Philips N.V. Audio encoding and decoding to generate binaural virtual spatial signals
US20100324915A1 (en) * 2009-06-23 2010-12-23 Electronic And Telecommunications Research Institute Encoding and decoding apparatuses for high quality multi-channel audio codec
EP2489038B1 (en) * 2009-11-20 2016-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-channel audio signal using a linear combination parameter
WO2014124377A2 (en) * 2013-02-11 2014-08-14 Dolby Laboratories Licensing Corporation Audio bitstreams with supplementary data and encoding and decoding of such bitstreams
WO2014020181A1 (en) 2012-08-03 2014-02-06 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases
US9477307B2 (en) * 2013-01-24 2016-10-25 The University Of Washington Methods and systems for six degree-of-freedom haptic interaction with streaming point data
US10178489B2 (en) * 2013-02-08 2019-01-08 Qualcomm Incorporated Signaling audio rendering information in a bitstream
WO2014184706A1 (en) 2013-05-16 2014-11-20 Koninklijke Philips N.V. An audio apparatus and method therefor
WO2014184353A1 (en) * 2013-05-16 2014-11-20 Koninklijke Philips N.V. An audio processing apparatus and method therefor
US9495968B2 (en) 2013-05-29 2016-11-15 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
US9384741B2 (en) * 2013-05-29 2016-07-05 Qualcomm Incorporated Binauralization of rotated higher order ambisonics
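CN105612766B (zh) * 2013-07-22 2018-07-27 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Multi-channel audio decoder, multi-channel audio encoder, methods, and computer-readable medium using decorrelation of rendered audio signals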
WO2015145782A1 (en) 2014-03-26 2015-10-01 Panasonic Corporation Apparatus and method for surround audio signal processing
US10068577B2 (en) * 2014-04-25 2018-09-04 Dolby Laboratories Licensing Corporation Audio segmentation based on spatial metadata
US9847088B2 (en) 2014-08-29 2017-12-19 Qualcomm Incorporated Intermediate compression for higher order ambisonic audio data
US9875745B2 (en) 2014-10-07 2018-01-23 Qualcomm Incorporated Normalization of ambient higher order ambisonic audio data
US9984693B2 (en) 2014-10-10 2018-05-29 Qualcomm Incorporated Signaling channels for scalable coding of higher order ambisonic audio data
CA2963771A1 (en) 2014-10-16 2016-04-21 Sony Corporation Transmission device, transmission method, reception device, and reception method
KR102856247B1 (ko) 2015-06-17 2025-09-04 Samsung Electronics Co., Ltd. Method and apparatus for processing internal channels for low-complexity format conversion
US9959880B2 (en) 2015-10-14 2018-05-01 Qualcomm Incorporated Coding higher-order ambisonic coefficients during multiple transitions
CN108701463B (zh) 2016-02-03 2020-03-10 Dolby International AB Efficient format conversion in audio coding
US20170339469A1 (en) * 2016-05-23 2017-11-23 Arjun Trikannad Efficient distribution of real-time and live streaming 360 spherical video
EP3472832A4 (en) 2016-06-17 2020-03-11 DTS, Inc. DISTANCE-BASED PANORAMIC USING NEAR / FAR FIELD RENDERING
US10262665B2 (en) * 2016-08-30 2019-04-16 Gaudio Lab, Inc. Method and apparatus for processing audio signals using ambisonic signals
US10650590B1 (en) * 2016-09-07 2020-05-12 Fastvdo Llc Method and system for fully immersive virtual reality
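KR102257181B1 (ko) * 2016-09-13 2021-05-27 Magic Leap, Inc. Sensory eyewear
CN117319917A (zh) * 2017-07-14 2023-12-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for generating a modified sound field description using a multi-point sound field description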
GB2567172A (en) * 2017-10-04 2019-04-10 Nokia Technologies Oy Grouping and transport of audio objects
US10469968B2 (en) * 2017-10-12 2019-11-05 Qualcomm Incorporated Rendering for computer-mediated reality systems
KR102390208B1 (ko) * 2017-10-17 2022-04-25 Samsung Electronics Co., Ltd. Method and apparatus for transmitting multimedia data
US10540941B2 (en) * 2018-01-30 2020-01-21 Magic Leap, Inc. Eclipse cursor for mixed reality displays
US11567627B2 (en) * 2018-01-30 2023-01-31 Magic Leap, Inc. Eclipse cursor for virtual content in mixed reality displays
BR112020015835A2 (pt) * 2018-04-11 2020-12-15 Dolby International Ab Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering
WO2019199046A1 (ko) * 2018-04-11 2019-10-17 LG Electronics Inc. Method and apparatus for transmitting and receiving metadata for audio in a wireless communication system
US11128976B2 (en) * 2018-10-02 2021-09-21 Qualcomm Incorporated Representing occlusion when rendering for computer-mediated reality systems
US11232643B1 (en) * 2020-12-22 2022-01-25 Facebook Technologies, Llc Collapsing of 3D objects to 2D images in an artificial reality environment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Michael Wozniewski. A framework for interactive three-dimensional sound and spatial audio processing in a virtual environment. 2007.

Also Published As

Publication number Publication date
BR112020015835A2 (pt) 2020-12-15
US11432099B2 (en) 2022-08-30
US20210168550A1 (en) 2021-06-03
EP3776543B1 (en) 2022-08-31
CN118824258A (zh) 2024-10-22
EP4123644A1 (en) 2023-01-25
US12126985B2 (en) 2024-10-22
JP2022120190A (ja) 2022-08-17
RU2020127372A (ru) 2022-02-17
KR20200141438A (ko) 2020-12-18
US20250063318A1 (en) 2025-02-20
JP7418500B2 (ja) 2024-01-19
JP7704330B2 (ja) 2025-07-08
JP7093841B2 (ja) 2022-06-30
EP4513483A1 (en) 2025-02-26
JP2021517987A (ja) 2021-07-29
JP2024024085A (ja) 2024-02-21
CN118824260A (zh) 2024-10-22
CN118824259A (zh) 2024-10-22
CN111712875B (zh) 2024-09-06
EP4123644B1 (en) 2024-08-21
CN111712875A (zh) 2020-09-25
WO2019197404A1 (en) 2019-10-17
US20230065644A1 (en) 2023-03-02
KR20240155983A (ko) 2024-10-29
EP3776543A1 (en) 2021-02-17

Similar Documents

Publication Publication Date Title
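KR102721752B1 (ko) Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering
US20250203316A1 (en) Methods, apparatus, and systems for processing audio scenes for audio rendering
JP3908462B2 (ja) Method and system for controlling multimedia streams using dynamic prototypes
CN119278627A (zh) Efficient mapping coordinate creation and transmission
RU2782344C2 (ru) Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering
HK40031045A (en) Methods, apparatus and systems for 6dof audio rendering and data representations and bitstream structures for 6dof audio rendering
HK40110894A (zh) Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering
HK40110902A (zh) Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering
HK40110893A (zh) Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering
HK40031045B (zh) Methods, apparatus and systems for 6DoF audio rendering and data representations and bitstream structures for 6DoF audio rendering
KR20260023046A (ko) Methods, apparatuses, and systems for processing audio scene information
WO2025056788A1 (en) Methods and apparatus for processing voxel-based scene representations
JP2026505661A (ja) Methods, apparatus, and systems for processing audio scenes for audio rendering
EP4639532A1 (en) Apparatus and method for predicting voxel coordinates for ar/vr systems
CN119547138A (zh) Apparatus and method for encoding or decoding AR/VR metadata using a generic codebook
CN110959292A (zh) Video encoding and decoding methods and apparatus, and storage medium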

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20200826

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
A201 Request for examination
PA0201 Request for examination

Patent event code: PA02012R01D

Patent event date: 20220405

Comment text: Request for Examination of Application

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

Patent event code: PE07011S01D

Comment text: Decision to Grant Registration

Patent event date: 20240719

A107 Divisional application of patent
GRNT Written decision to grant
PA0104 Divisional application for international application

Comment text: Divisional Application for International Patent

Patent event code: PA01041R01D

Patent event date: 20241021

PR0701 Registration of establishment

Comment text: Registration of Establishment

Patent event date: 20241021

Patent event code: PR07011E01D

PR1002 Payment of registration fee

Payment date: 20241022

End annual number: 3

Start annual number: 1

PG1601 Publication of registration