CN118824260A - 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构 - Google Patents

用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构 Download PDF

Info

Publication number
CN118824260A
CN118824260A CN202411189985.7A CN202411189985A CN118824260A CN 118824260 A CN118824260 A CN 118824260A CN 202411189985 A CN202411189985 A CN 202411189985A CN 118824260 A CN118824260 A CN 118824260A
Authority
CN
China
Prior art keywords
audio
3dof
bitstream
6dof
rendering
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202411189985.7A
Other languages
English (en)
Chinese (zh)
Inventor
利昂·特连蒂夫
克里斯托弗·费尔施
丹尼尔·费希尔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby International AB
Original Assignee
Dolby International AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby International AB filed Critical Dolby International AB
Publication of CN118824260A publication Critical patent/CN118824260A/zh
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • H04S7/303Tracking of listener position or orientation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11Positioning of individual sound objects, e.g. moving airplane, within a sound field

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
CN202411189985.7A 2018-04-11 2019-04-09 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构 Pending CN118824260A (zh)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201862655990P 2018-04-11 2018-04-11
US62/655,990 2018-04-11
PCT/EP2019/058955 WO2019197404A1 (en) 2018-04-11 2019-04-09 Methods, apparatus and systems for 6dof audio rendering and data representations and bitstream structures for 6dof audio rendering
CN201980013440.1A CN111712875B (zh) 2018-04-11 2019-04-09 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201980013440.1A Division CN111712875B (zh) 2018-04-11 2019-04-09 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构

Publications (1)

Publication Number Publication Date
CN118824260A true CN118824260A (zh) 2024-10-22

Family

ID=66165970

Family Applications (4)

Application Number Title Priority Date Filing Date
CN202411189985.7A Pending CN118824260A (zh) 2018-04-11 2019-04-09 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构
CN202411189983.8A Pending CN118824259A (zh) 2018-04-11 2019-04-09 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构
CN202411189981.9A Pending CN118824258A (zh) 2018-04-11 2019-04-09 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构
CN201980013440.1A Active CN111712875B (zh) 2018-04-11 2019-04-09 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构

Family Applications After (3)

Application Number Title Priority Date Filing Date
CN202411189983.8A Pending CN118824259A (zh) 2018-04-11 2019-04-09 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构
CN202411189981.9A Pending CN118824258A (zh) 2018-04-11 2019-04-09 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构
CN201980013440.1A Active CN111712875B (zh) 2018-04-11 2019-04-09 用于6dof音频渲染的方法、设备和系统及用于6dof音频渲染的数据表示和位流结构

Country Status (7)

Country Link
US (3) US11432099B2 (https=)
EP (3) EP3776543B1 (https=)
JP (3) JP7093841B2 (https=)
KR (2) KR102721752B1 (https=)
CN (4) CN118824260A (https=)
BR (1) BR112020015835A2 (https=)
WO (1) WO2019197404A1 (https=)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2563635A (en) * 2017-06-21 2018-12-26 Nokia Technologies Oy Recording and rendering audio signals
US10405126B2 (en) 2017-06-30 2019-09-03 Qualcomm Incorporated Mixed-order ambisonics (MOA) audio data for computer-mediated reality systems
BR112020015835A2 (pt) 2018-04-11 2020-12-15 Dolby International Ab Métodos, aparelho e sistemas para renderização de áudio 6dof e representações de dados e estruturas de fluxo de bits para renderização de áudio 6dof
WO2019204214A2 (en) * 2018-04-16 2019-10-24 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for encoding and decoding of directional sound sources
US11356793B2 (en) * 2019-10-01 2022-06-07 Qualcomm Incorporated Controlling rendering of audio data
KR102741553B1 (ko) * 2019-12-04 2024-12-12 한국전자통신연구원 렌더링 최적화를 위한 오디오 데이터 전송 방법 및 오디오 데이터 재생 방법, 오디오 데이터 전송 장치 및 오디오 데이터 재생 장치
EP4089673B1 (en) * 2020-01-10 2026-02-25 Sony Group Corporation Encoding device and decoding device
US11967329B2 (en) * 2020-02-20 2024-04-23 Qualcomm Incorporated Signaling for rendering tools
CN114067810B (zh) * 2020-07-31 2025-12-12 华为技术有限公司 音频信号渲染方法和装置
US11750998B2 (en) 2020-09-30 2023-09-05 Qualcomm Incorporated Controlling rendering of audio data
US11750745B2 (en) 2020-11-18 2023-09-05 Kelly Properties, Llc Processing and distribution of audio signals in a multi-party conferencing environment
US11743670B2 (en) 2020-12-18 2023-08-29 Qualcomm Incorporated Correlation-based rendering with multiple distributed streams accounting for an occlusion for six degree of freedom applications
KR20240012569A (ko) * 2021-05-27 2024-01-29 프라운호퍼-게젤샤프트 추르 푀르데룽 데어 안제반텐 포르슝 에 파우 음향 환경의 인코딩 및 디코딩
US11956409B2 (en) * 2021-08-23 2024-04-09 Tencent America LLC Immersive media interoperability
WO2023025143A1 (zh) * 2021-08-24 2023-03-02 北京字跳网络技术有限公司 音频信号的处理方法和装置
JP2024542412A (ja) * 2021-11-09 2024-11-15 フラウンホーファー-ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン パケットが、レンダリングシナリオの時間的展開を定義する1つ以上のシーン構成パケットを含み、タイムスタンプ情報を含む複数のパケットを用いる、オーディオデコーダ、オーディオエンコーダ、復号方法、符号化方法、及びビットストリーム
CN118511546A (zh) * 2021-11-09 2024-08-16 弗劳恩霍夫应用研究促进协会 后期混响距离衰减
GB202118094D0 (en) * 2021-12-14 2022-01-26 Nokia Technologies Oy A method and apparatus for AR scene modification
US20260046585A1 (en) * 2022-07-11 2026-02-12 Electronics And Telecommunications Research Institute Audio rendering method based on recording distance parameter and apparatus for performing same
US12604152B2 (en) 2022-12-07 2026-04-14 Dolby Laboratories Licensing Corporation Binarual rendering
EP4697327A4 (en) * 2023-04-11 2026-03-25 Beijing Xiaomi Mobile Software Co Ltd METHOD AND APPARATUS FOR PROCESSING AUDIO CODE STREAM SIGNAL, ELECTRONIC DEVICE AND STORAGE MEDIA
KR102837322B1 (ko) * 2023-04-19 2025-07-23 한국전자통신연구원 공간음향 렌더링을 위한 비트스트림 재구성 방법 및 장치
WO2025054331A1 (en) * 2023-09-05 2025-03-13 Virtuel Works Llc Spatial audio scene description and rendering

Family Cites Families (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9009057B2 (en) 2006-02-21 2015-04-14 Koninklijke Philips N.V. Audio encoding and decoding to generate binaural virtual spatial signals
US20100324915A1 (en) * 2009-06-23 2010-12-23 Electronic And Telecommunications Research Institute Encoding and decoding apparatuses for high quality multi-channel audio codec
EP2489038B1 (en) * 2009-11-20 2016-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-channel audio signal using a linear combination parameter
WO2014124377A2 (en) * 2013-02-11 2014-08-14 Dolby Laboratories Licensing Corporation Audio bitstreams with supplementary data and encoding and decoding of such bitstreams
US9479886B2 (en) 2012-07-20 2016-10-25 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec
WO2014020181A1 (en) 2012-08-03 2014-02-06 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases
US9477307B2 (en) * 2013-01-24 2016-10-25 The University Of Washington Methods and systems for six degree-of-freedom haptic interaction with streaming point data
US9609452B2 (en) 2013-02-08 2017-03-28 Qualcomm Incorporated Obtaining sparseness information for higher order ambisonic audio renderers
US10178489B2 (en) * 2013-02-08 2019-01-08 Qualcomm Incorporated Signaling audio rendering information in a bitstream
WO2014184706A1 (en) 2013-05-16 2014-11-20 Koninklijke Philips N.V. An audio apparatus and method therefor
WO2014184353A1 (en) * 2013-05-16 2014-11-20 Koninklijke Philips N.V. An audio processing apparatus and method therefor
US9495968B2 (en) 2013-05-29 2016-11-15 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
US9384741B2 (en) * 2013-05-29 2016-07-05 Qualcomm Incorporated Binauralization of rotated higher order ambisonics
CN105612766B (zh) * 2013-07-22 2018-07-27 弗劳恩霍夫应用研究促进协会 使用渲染音频信号的解相关的多声道音频解码器、多声道音频编码器、方法、以及计算机可读介质
WO2015145782A1 (en) 2014-03-26 2015-10-01 Panasonic Corporation Apparatus and method for surround audio signal processing
US10068577B2 (en) * 2014-04-25 2018-09-04 Dolby Laboratories Licensing Corporation Audio segmentation based on spatial metadata
US9847088B2 (en) 2014-08-29 2017-12-19 Qualcomm Incorporated Intermediate compression for higher order ambisonic audio data
US9875745B2 (en) 2014-10-07 2018-01-23 Qualcomm Incorporated Normalization of ambient higher order ambisonic audio data
US9984693B2 (en) 2014-10-10 2018-05-29 Qualcomm Incorporated Signaling channels for scalable coding of higher order ambisonic audio data
CA2963771A1 (en) 2014-10-16 2016-04-21 Sony Corporation Transmission device, transmission method, reception device, and reception method
KR102856247B1 (ko) 2015-06-17 2025-09-04 삼성전자주식회사 저연산 포맷 변환을 위한 인터널 채널 처리 방법 및 장치
US9959880B2 (en) 2015-10-14 2018-05-01 Qualcomm Incorporated Coding higher-order ambisonic coefficients during multiple transitions
CN108701463B (zh) 2016-02-03 2020-03-10 杜比国际公司 音频译码中的高效格式转换
US20170339469A1 (en) * 2016-05-23 2017-11-23 Arjun Trikannad Efficient distribution of real-time and live streaming 360 spherical video
EP3472832A4 (en) 2016-06-17 2020-03-11 DTS, Inc. DISTANCE-BASED PANORAMIC USING NEAR / FAR FIELD RENDERING
US10262665B2 (en) * 2016-08-30 2019-04-16 Gaudio Lab, Inc. Method and apparatus for processing audio signals using ambisonic signals
US10650590B1 (en) * 2016-09-07 2020-05-12 Fastvdo Llc Method and system for fully immersive virtual reality
KR102257181B1 (ko) * 2016-09-13 2021-05-27 매직 립, 인코포레이티드 감각 안경류
CN117319917A (zh) * 2017-07-14 2023-12-29 弗劳恩霍夫应用研究促进协会 使用多点声场描述生成经修改的声场描述的装置及方法
GB2567172A (en) * 2017-10-04 2019-04-10 Nokia Technologies Oy Grouping and transport of audio objects
US10469968B2 (en) * 2017-10-12 2019-11-05 Qualcomm Incorporated Rendering for computer-mediated reality systems
KR102390208B1 (ko) * 2017-10-17 2022-04-25 삼성전자주식회사 멀티미디어 데이터를 전송하는 방법 및 장치
US10540941B2 (en) * 2018-01-30 2020-01-21 Magic Leap, Inc. Eclipse cursor for mixed reality displays
US11567627B2 (en) * 2018-01-30 2023-01-31 Magic Leap, Inc. Eclipse cursor for virtual content in mixed reality displays
BR112020015835A2 (pt) * 2018-04-11 2020-12-15 Dolby International Ab Métodos, aparelho e sistemas para renderização de áudio 6dof e representações de dados e estruturas de fluxo de bits para renderização de áudio 6dof
WO2019199046A1 (ko) * 2018-04-11 2019-10-17 엘지전자 주식회사 무선 통신 시스템에서 오디오에 대한 메타데이터를 송수신하는 방법 및 장치
US11128976B2 (en) * 2018-10-02 2021-09-21 Qualcomm Incorporated Representing occlusion when rendering for computer-mediated reality systems
US11232643B1 (en) * 2020-12-22 2022-01-25 Facebook Technologies, Llc Collapsing of 3D objects to 2D images in an artificial reality environment

Also Published As

Publication number Publication date
BR112020015835A2 (pt) 2020-12-15
US11432099B2 (en) 2022-08-30
US20210168550A1 (en) 2021-06-03
EP3776543B1 (en) 2022-08-31
CN118824258A (zh) 2024-10-22
EP4123644A1 (en) 2023-01-25
US12126985B2 (en) 2024-10-22
JP2022120190A (ja) 2022-08-17
RU2020127372A (ru) 2022-02-17
KR20200141438A (ko) 2020-12-18
US20250063318A1 (en) 2025-02-20
JP7418500B2 (ja) 2024-01-19
JP7704330B2 (ja) 2025-07-08
JP7093841B2 (ja) 2022-06-30
EP4513483A1 (en) 2025-02-26
JP2021517987A (ja) 2021-07-29
JP2024024085A (ja) 2024-02-21
CN118824259A (zh) 2024-10-22
CN111712875B (zh) 2024-09-06
EP4123644B1 (en) 2024-08-21
CN111712875A (zh) 2020-09-25
WO2019197404A1 (en) 2019-10-17
US20230065644A1 (en) 2023-03-02
KR102721752B1 (ko) 2024-10-25
KR20240155983A (ko) 2024-10-29
EP3776543A1 (en) 2021-02-17

Similar Documents

Publication Publication Date Title
JP7704330B2 (ja) 6dofオーディオ・レンダリングのための方法、装置およびシステムならびに6dofオーディオ・レンダリングのためのデータ表現およびビットストリーム構造
CN111955020B (zh) 用于音频渲染的预渲染信号的方法、设备和系统
JP2021530143A (ja) 点群のハイブリッド幾何学的コーディング
JP2025509234A (ja) オーディオレンダリングのためにオーディオシーンを処理するための方法、装置及びシステム
CN119278627A (zh) 高效的映射坐标创建和传输
HK40110902A (zh) 用於6dof音频渲染的方法、设备和系统及用於6dof音频渲染的数据表示和位流结构
HK40110893A (zh) 用於6dof音频渲染的方法、设备和系统及用於6dof音频渲染的数据表示和位流结构
HK40110894A (zh) 用於6dof音频渲染的方法、设备和系统及用於6dof音频渲染的数据表示和位流结构
HK40031045B (zh) 用於6dof音频渲染的方法、设备和系统及用於6dof音频渲染的数据表示和位流结构
RU2782344C2 (ru) Способы, устройство и системы формирования звука 6dof, и представление данных, и структуры битовых потоков для формирования звука 6dof
HK40031045A (en) Methods, apparatus and systems for 6dof audio rendering and data representations and bitstream structures for 6dof audio rendering
CN115733576A (zh) 点云媒体文件的封装与解封装方法、装置及存储介质
JP2026505661A (ja) オーディオ・レンダリングのためにオーディオ・シーンを処理するための方法、装置、およびシステム
TW202539252A (zh) 點雲編解碼器中之基於區塊之區域適應性階層變換(raht)之傳訊
WO2025056788A1 (en) Methods and apparatus for processing voxel-based scene representations
HK40118970A (zh) 用於处理音频场景以进行音频渲染的方法、装置和系统
HK40064955B (zh) 点云媒体的数据处理方法、装置、设备及可读存储介质
KR20260023046A (ko) 오디오 장면 정보를 처리하기 위한 방법들, 장치들, 및 시스템들
CN117897732A (zh) 网格面元句法

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40110902

Country of ref document: HK