CN105027199B - Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams - Google Patents

Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams

Info

Publication number
CN105027199B
Authority
CN
China
Prior art keywords
bitstream
spherical harmonic
harmonic coefficients
sound field
bits
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201480011198.1A
Other languages
English (en)
Chinese (zh)
Other versions
CN105027199A (zh)
Inventor
D. Sen
M. J. Morrell
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc
Publication of CN105027199A
Application granted
Publication of CN105027199B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/167 Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018 Audio watermarking, i.e. embedding inaudible data in the audio signal
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/20 Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used in stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11 Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
CN201480011198.1A 2013-03-01 2014-02-28 Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams Active CN105027199B (zh)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US201361771677P 2013-03-01 2013-03-01
US61/771,677 2013-03-01
US201361860201P 2013-07-30 2013-07-30
US61/860,201 2013-07-30
US14/192,819 2014-02-27
US14/192,819 US9959875B2 (en) 2013-03-01 2014-02-27 Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams
PCT/US2014/019446 WO2014134462A2 (en) 2013-03-01 2014-02-28 Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams

Publications (2)

Publication Number Publication Date
CN105027199A CN105027199A (zh) 2015-11-04
CN105027199B true CN105027199B (zh) 2018-05-29

Family

ID=51420957

Family Applications (2)

Application Number Title Priority Date Filing Date
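CN201480011198.1A Active CN105027199B (zh) 2013-03-01 2014-02-28 Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams
CN201480011287.6A Active CN105027200B (zh) 2013-03-01 2014-02-28 Transforming spherical harmonic coefficients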

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201480011287.6A Active CN105027200B (zh) 2013-03-01 2014-02-28 Transforming spherical harmonic coefficients

Country Status (10)

Country Link
US (2) US9959875B2 (en)
EP (2) EP2962297B1 (en)
JP (2) JP2016510905A (en)
KR (2) KR20150123310A (en)
CN (2) CN105027199B (en)
BR (1) BR112015020892A2 (en)
ES (1) ES2738490T3 (en)
HU (1) HUE045446T2 (en)
TW (2) TWI603631B (en)
WO (2) WO2014134472A2 (en)

Families Citing this family (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2665208A1 (en) 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
US9959875B2 (en) 2013-03-01 2018-05-01 Qualcomm Incorporated Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams
US9412385B2 (en) * 2013-05-28 2016-08-09 Qualcomm Incorporated Performing spatial masking with respect to spherical harmonic coefficients
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9769586B2 (en) 2013-05-29 2017-09-19 Qualcomm Incorporated Performing order reduction with respect to higher order ambisonic coefficients
US9384741B2 (en) * 2013-05-29 2016-07-05 Qualcomm Incorporated Binauralization of rotated higher order ambisonics
EP3005354B1 (en) * 2013-06-05 2019-07-03 Dolby International AB Method for encoding audio signals, apparatus for encoding audio signals, method for decoding audio signals and apparatus for decoding audio signals
EP2879408A1 (en) * 2013-11-28 2015-06-03 Thomson Licensing Method and apparatus for higher order ambisonics encoding and decoding using singular value decomposition
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US10547701B2 (en) * 2014-09-12 2020-01-28 Sony Corporation Transmission device, transmission method, reception device, and a reception method
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
WO2016062869A1 (en) * 2014-10-24 2016-04-28 Dolby International Ab Encoding and decoding of audio signals
US10452651B1 (en) 2014-12-23 2019-10-22 Palantir Technologies Inc. Searching charts
CN104795064B (zh) * 2015-03-30 2018-04-13 Fuzhou University Method for recognizing sound events in low signal-to-noise-ratio sound scenes
MX392462B (es) 2015-10-08 2025-03-24 Dolby Int Ab Layered coding for compressed sound or sound field representations
FR3050601B1 (fr) * 2016-04-26 2018-06-22 Arkamys Method and system for broadcasting a 360° audio signal
MC200186B1 (fr) * 2016-09-30 2017-10-18 Coronal Encoding Method for conversion, stereophonic encoding, decoding and transcoding of a three-dimensional audio signal
JP7115477B2 (ja) * 2017-07-05 2022-08-09 Sony Group Corporation Signal processing device and method, and program
RU2736274C1 (ru) 2017-07-14 2020-11-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Concept for generating an enhanced sound field description or a modified sound field description using a depth-extended DirAC technique or other techniques
KR102652670B1 (ko) 2017-07-14 2024-04-01 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Concept for generating an enhanced sound field description or a modified sound field description using a multi-layer description
AU2018298874C1 (en) 2017-07-14 2023-10-19 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Concept for generating an enhanced sound field description or a modified sound field description using a multi-point sound field description
US10075802B1 (en) 2017-08-08 2018-09-11 Qualcomm Incorporated Bitrate allocation for higher order ambisonic audio data
US11281726B2 (en) * 2017-12-01 2022-03-22 Palantir Technologies Inc. System and methods for faster processor comparisons of visual graph features
US10419138B2 (en) * 2017-12-22 2019-09-17 At&T Intellectual Property I, L.P. Radio-based channel sounding using phased array antennas
GB2572650A (en) * 2018-04-06 2019-10-09 Nokia Technologies Oy Spatial audio parameters and associated spatial audio playback
US11315578B2 (en) 2018-04-16 2022-04-26 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for encoding and decoding of directional sound sources
CN111819627B (zh) * 2018-07-02 2025-04-11 Dolby Laboratories Licensing Corporation Method and apparatus for encoding and/or decoding immersive audio signals
WO2020008112A1 (en) * 2018-07-03 2020-01-09 Nokia Technologies Oy Energy-ratio signalling and synthesis
US12308034B2 (en) * 2019-06-24 2025-05-20 Qualcomm Incorporated Performing psychoacoustic audio coding based on operating conditions
US12142285B2 (en) 2019-06-24 2024-11-12 Qualcomm Incorporated Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding
US11043742B2 (en) 2019-07-31 2021-06-22 At&T Intellectual Property I, L.P. Phased array mobile channel sounding system
WO2021091769A1 (en) * 2019-11-04 2021-05-14 Qualcomm Incorporated Signalling of audio effect metadata in a bitstream
GB2615236A (en) * 2020-09-25 2023-08-02 Apple Inc Higher order ambisonics encoding and decoding
CN116868588A (zh) * 2020-11-03 2023-10-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for audio signal transformation
EP4174637A1 (en) * 2021-10-26 2023-05-03 Koninklijke Philips N.V. Bitstream representing audio in an environment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
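CN1942931A (zh) * 2004-04-21 2007-04-04 Dolby Laboratories Licensing Corporation Audio bitstream format in which the bitstream syntax is described by an ordered traversal of a tree hierarchy data structure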
EP2450880A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9103207D0 (en) 1991-02-15 1991-04-03 Gerzon Michael A Stereophonic sound reproduction system
US5594800A (en) 1991-02-15 1997-01-14 Trifield Productions Limited Sound reproduction system having a matrix converter
AUPO099696A0 (en) 1996-07-12 1996-08-08 Lake Dsp Pty Limited Methods and apparatus for processing spatialised audio
US6021206A (en) 1996-10-02 2000-02-01 Lake Dsp Pty Ltd Methods and apparatus for processing spatialised audio
JPH1118199A (ja) 1997-06-26 1999-01-22 Nippon Columbia Co Ltd Sound processing device
CA2406926A1 (en) 2000-04-19 2001-11-01 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions
FR2847376B1 (fr) * 2002-11-19 2005-02-04 France Telecom Method for processing sound data and sound acquisition device implementing this method
US7167176B2 (en) 2003-08-15 2007-01-23 Microsoft Corporation Clustered principal components for precomputed radiance transfer
US20060247918A1 (en) 2005-04-29 2006-11-02 Microsoft Corporation Systems and methods for 3D audio programming and processing
FR2898725A1 (fr) 2006-03-15 2007-09-21 France Telecom Device and method for scalable coding of a multi-channel audio signal based on principal component analysis
US7589725B2 (en) 2006-06-30 2009-09-15 Microsoft Corporation Soft shadows in dynamic scenes
FR2916079A1 (fr) * 2007-05-10 2008-11-14 France Telecom Audio encoding and decoding method, audio encoder, audio decoder and associated computer programs
SG177277A1 (en) * 2009-06-24 2012-02-28 Fraunhofer Ges Forschung Audio signal decoder, method for decoding an audio signal and computer program using cascaded audio object processing stages
WO2011012672A1 (en) * 2009-07-29 2011-02-03 Pharnext New diagnostic tools for alzheimer disease
EP2539892B1 (fr) * 2010-02-26 2014-04-02 Orange Multichannel audio stream compression
US9552840B2 (en) * 2010-10-25 2017-01-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones
EP2469741A1 (en) 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
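CN102333265B (zh) 2011-05-20 2014-02-19 Nanjing University Method for reproducing a sound field in a three-dimensional local space based on the continuous sound source concept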
EP2541547A1 (en) 2011-06-30 2013-01-02 Thomson Licensing Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation
WO2013006322A1 (en) * 2011-07-01 2013-01-10 Dolby Laboratories Licensing Corporation Sample rate scalable lossless audio coding
TWI853425B (zh) * 2011-07-01 2024-08-21 Dolby Laboratories Licensing Corporation System and method for adaptive audio signal generation, coding and rendering
EP2898506B1 (en) 2012-09-21 2018-01-17 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
EP2743922A1 (en) 2012-12-12 2014-06-18 Thomson Licensing Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
US9959875B2 (en) 2013-03-01 2018-05-01 Qualcomm Incorporated Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
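CN1942931A (zh) * 2004-04-21 2007-04-04 Dolby Laboratories Licensing Corporation Audio bitstream format in which the bitstream syntax is described by an ordered traversal of a tree hierarchy data structure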
EP2450880A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data
CN103250207A (zh) * 2010-11-05 2013-08-14 Thomson Licensing Data structure for higher order ambisonics audio data

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Multichannel Audio Coding Based on Minimum Audible Angles"; ADRIEN DANIEL ET AL; PROCEEDINGS OF 40TH INTERNATIONAL CONFERENCE: SPATIAL AUDIO: SENSE THE SOUND OF SPACE; 20100101; page 9, sections 12, 14 *

Also Published As

Publication number Publication date
EP2962297B1 (en) 2019-06-05
KR20150123311A (ko) 2015-11-03
HUE045446T2 (hu) 2019-12-30
US9685163B2 (en) 2017-06-20
CN105027200A (zh) 2015-11-04
TW201503712A (zh) 2015-01-16
EP2962298A2 (en) 2016-01-06
WO2014134472A3 (en) 2015-03-19
ES2738490T3 (es) 2020-01-23
BR112015020892A2 (pt) 2017-07-18
KR101854964B1 (ko) 2018-05-04
TW201446016A (zh) 2014-12-01
TWI603631B (zh) 2017-10-21
WO2014134462A3 (en) 2014-11-13
JP2016513811A (ja) 2016-05-16
US9959875B2 (en) 2018-05-01
JP2016510905A (ja) 2016-04-11
WO2014134472A2 (en) 2014-09-04
KR20150123310A (ko) 2015-11-03
EP2962298B1 (en) 2019-04-24
EP2962297A2 (en) 2016-01-06
WO2014134462A2 (en) 2014-09-04
CN105027200B (zh) 2019-04-09
US20140249827A1 (en) 2014-09-04
TWI583210B (zh) 2017-05-11
US20140247946A1 (en) 2014-09-04
CN105027199A (zh) 2015-11-04

Similar Documents

Publication Publication Date Title
CN105027199B (zh) Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams
US20220030372A1 (en) Reordering Of Audio Objects In The Ambisonics Domain
US9384741B2 (en) Binauralization of rotated higher order ambisonics
EP3165001A1 (en) Reducing correlation between higher order ambisonic (hoa) background channels
WO2016033480A2 (en) Intermediate compression for higher order ambisonic audio data
WO2015175998A1 (en) Spatial relation coding for higher order ambisonic coefficients
HK1215752B (zh) Compression of decomposed representations of a sound field

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant