TWI603631B - 產生及處理表示音訊內容之位元串流之方法、器件及非暫時性電腦可讀儲存媒體 - Google Patents

產生及處理表示音訊內容之位元串流之方法、器件及非暫時性電腦可讀儲存媒體 Download PDF

Info

Publication number
TWI603631B
TWI603631B TW103107128A TW103107128A TWI603631B TW I603631 B TWI603631 B TW I603631B TW 103107128 A TW103107128 A TW 103107128A TW 103107128 A TW103107128 A TW 103107128A TW I603631 B TWI603631 B TW I603631B
Authority
TW
Taiwan
Prior art keywords
bit stream
spherical harmonic
harmonic coefficients
sound field
bits
Prior art date
Application number
TW103107128A
Other languages
English (en)
Chinese (zh)
Other versions
TW201446016A (zh
Inventor
迪潘強 森
馬汀 詹姆士 摩瑞爾
Original Assignee
高通公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 高通公司 filed Critical 高通公司
Publication of TW201446016A publication Critical patent/TW201446016A/zh
Application granted granted Critical
Publication of TWI603631B publication Critical patent/TWI603631B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
TW103107128A 2013-03-01 2014-03-03 產生及處理表示音訊內容之位元串流之方法、器件及非暫時性電腦可讀儲存媒體 TWI603631B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201361771677P 2013-03-01 2013-03-01
US201361860201P 2013-07-30 2013-07-30
US14/192,819 US9959875B2 (en) 2013-03-01 2014-02-27 Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams

Publications (2)

Publication Number Publication Date
TW201446016A TW201446016A (zh) 2014-12-01
TWI603631B true TWI603631B (zh) 2017-10-21

Family

ID=51420957

Family Applications (2)

Application Number Title Priority Date Filing Date
TW103107128A TWI603631B (zh) 2013-03-01 2014-03-03 產生及處理表示音訊內容之位元串流之方法、器件及非暫時性電腦可讀儲存媒體
TW103107142A TWI583210B (zh) 2013-03-01 2014-03-03 變換球諧係數

Family Applications After (1)

Application Number Title Priority Date Filing Date
TW103107142A TWI583210B (zh) 2013-03-01 2014-03-03 變換球諧係數

Country Status (10)

Country Link
US (2) US9959875B2 (enExample)
EP (2) EP2962297B1 (enExample)
JP (2) JP2016510905A (enExample)
KR (2) KR20150123310A (enExample)
CN (2) CN105027199B (enExample)
BR (1) BR112015020892A2 (enExample)
ES (1) ES2738490T3 (enExample)
HU (1) HUE045446T2 (enExample)
TW (2) TWI603631B (enExample)
WO (2) WO2014134472A2 (enExample)

Families Citing this family (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2665208A1 (en) 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
US9959875B2 (en) 2013-03-01 2018-05-01 Qualcomm Incorporated Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams
US9412385B2 (en) * 2013-05-28 2016-08-09 Qualcomm Incorporated Performing spatial masking with respect to spherical harmonic coefficients
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9769586B2 (en) 2013-05-29 2017-09-19 Qualcomm Incorporated Performing order reduction with respect to higher order ambisonic coefficients
US9384741B2 (en) * 2013-05-29 2016-07-05 Qualcomm Incorporated Binauralization of rotated higher order ambisonics
EP3005354B1 (en) * 2013-06-05 2019-07-03 Dolby International AB Method for encoding audio signals, apparatus for encoding audio signals, method for decoding audio signals and apparatus for decoding audio signals
EP2879408A1 (en) * 2013-11-28 2015-06-03 Thomson Licensing Method and apparatus for higher order ambisonics encoding and decoding using singular value decomposition
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US10547701B2 (en) * 2014-09-12 2020-01-28 Sony Corporation Transmission device, transmission method, reception device, and a reception method
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
WO2016062869A1 (en) * 2014-10-24 2016-04-28 Dolby International Ab Encoding and decoding of audio signals
US10452651B1 (en) 2014-12-23 2019-10-22 Palantir Technologies Inc. Searching charts
CN104795064B (zh) * 2015-03-30 2018-04-13 福州大学 低信噪比声场景下声音事件的识别方法
MX392462B (es) 2015-10-08 2025-03-24 Dolby Int Ab Codificacion en capas para representaciones de sonido o campo de sonido comprimidas
FR3050601B1 (fr) * 2016-04-26 2018-06-22 Arkamys Procede et systeme de diffusion d'un signal audio a 360°
MC200186B1 (fr) * 2016-09-30 2017-10-18 Coronal Encoding Procédé de conversion, d'encodage stéréophonique, de décodage et de transcodage d'un signal audio tridimensionnel
JP7115477B2 (ja) * 2017-07-05 2022-08-09 ソニーグループ株式会社 信号処理装置および方法、並びにプログラム
RU2736274C1 (ru) 2017-07-14 2020-11-13 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Принцип формирования улучшенного описания звукового поля или модифицированного описания звукового поля с использованием dirac-технологии с расширением глубины или других технологий
KR102652670B1 (ko) 2017-07-14 2024-04-01 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 다중-층 묘사를 이용하여 증강된 음장 묘사 또는 수정된 음장 묘사를 생성하기 위한 개념
AU2018298874C1 (en) 2017-07-14 2023-10-19 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Concept for generating an enhanced sound field description or a modified sound field description using a multi-point sound field description
US10075802B1 (en) 2017-08-08 2018-09-11 Qualcomm Incorporated Bitrate allocation for higher order ambisonic audio data
US11281726B2 (en) * 2017-12-01 2022-03-22 Palantir Technologies Inc. System and methods for faster processor comparisons of visual graph features
US10419138B2 (en) * 2017-12-22 2019-09-17 At&T Intellectual Property I, L.P. Radio-based channel sounding using phased array antennas
GB2572650A (en) * 2018-04-06 2019-10-09 Nokia Technologies Oy Spatial audio parameters and associated spatial audio playback
US11315578B2 (en) 2018-04-16 2022-04-26 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for encoding and decoding of directional sound sources
CN111819627B (zh) * 2018-07-02 2025-04-11 杜比实验室特许公司 用于对沉浸式音频信号进行编码及/或解码的方法及装置
WO2020008112A1 (en) * 2018-07-03 2020-01-09 Nokia Technologies Oy Energy-ratio signalling and synthesis
US12308034B2 (en) * 2019-06-24 2025-05-20 Qualcomm Incorporated Performing psychoacoustic audio coding based on operating conditions
US12142285B2 (en) 2019-06-24 2024-11-12 Qualcomm Incorporated Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding
US11043742B2 (en) 2019-07-31 2021-06-22 At&T Intellectual Property I, L.P. Phased array mobile channel sounding system
WO2021091769A1 (en) * 2019-11-04 2021-05-14 Qualcomm Incorporated Signalling of audio effect metadata in a bitstream
GB2615236A (en) * 2020-09-25 2023-08-02 Apple Inc Higher order ambisonics encoding and decoding
CN116868588A (zh) * 2020-11-03 2023-10-10 弗劳恩霍夫应用研究促进协会 用于音频信号变换的装置和方法
EP4174637A1 (en) * 2021-10-26 2023-05-03 Koninklijke Philips N.V. Bitstream representing audio in an environment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2450880A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data
US20120128160A1 (en) * 2010-10-25 2012-05-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9103207D0 (en) 1991-02-15 1991-04-03 Gerzon Michael A Stereophonic sound reproduction system
US5594800A (en) 1991-02-15 1997-01-14 Trifield Productions Limited Sound reproduction system having a matrix converter
AUPO099696A0 (en) 1996-07-12 1996-08-08 Lake Dsp Pty Limited Methods and apparatus for processing spatialised audio
US6021206A (en) 1996-10-02 2000-02-01 Lake Dsp Pty Ltd Methods and apparatus for processing spatialised audio
JPH1118199A (ja) 1997-06-26 1999-01-22 Nippon Columbia Co Ltd 音響処理装置
CA2406926A1 (en) 2000-04-19 2001-11-01 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions
FR2847376B1 (fr) * 2002-11-19 2005-02-04 France Telecom Procede de traitement de donnees sonores et dispositif d'acquisition sonore mettant en oeuvre ce procede
US7167176B2 (en) 2003-08-15 2007-01-23 Microsoft Corporation Clustered principal components for precomputed radiance transfer
BRPI0509985A (pt) 2004-04-21 2007-10-16 Dolby Lab Licensing Corp formato de fluxo de bits de áudio no qual a sintaxe do fluxo de bits é descrita por uma travessia ordenada de uma estrutura de dados de hierarquia em árvore
US20060247918A1 (en) 2005-04-29 2006-11-02 Microsoft Corporation Systems and methods for 3D audio programming and processing
FR2898725A1 (fr) 2006-03-15 2007-09-21 France Telecom Dispositif et procede de codage gradue d'un signal audio multi-canal selon une analyse en composante principale
US7589725B2 (en) 2006-06-30 2009-09-15 Microsoft Corporation Soft shadows in dynamic scenes
FR2916079A1 (fr) * 2007-05-10 2008-11-14 France Telecom Procede de codage et decodage audio, codeur audio, decodeur audio et programmes d'ordinateur associes
SG177277A1 (en) * 2009-06-24 2012-02-28 Fraunhofer Ges Forschung Audio signal decoder, method for decoding an audio signal and computer program using cascaded audio object processing stages
WO2011012672A1 (en) * 2009-07-29 2011-02-03 Pharnext New diagnostic tools for alzheimer disease
EP2539892B1 (fr) * 2010-02-26 2014-04-02 Orange Compression de flux audio multicanal
EP2469741A1 (en) 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
CN102333265B (zh) 2011-05-20 2014-02-19 南京大学 一种基于连续声源概念的三维局部空间声场重放方法
EP2541547A1 (en) 2011-06-30 2013-01-02 Thomson Licensing Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation
WO2013006322A1 (en) * 2011-07-01 2013-01-10 Dolby Laboratories Licensing Corporation Sample rate scalable lossless audio coding
TWI853425B (zh) * 2011-07-01 2024-08-21 美商杜比實驗室特許公司 用於適應性音頻信號的產生、譯碼與呈現之系統與方法
EP2898506B1 (en) 2012-09-21 2018-01-17 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
EP2743922A1 (en) 2012-12-12 2014-06-18 Thomson Licensing Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
US9959875B2 (en) 2013-03-01 2018-05-01 Qualcomm Incorporated Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120128160A1 (en) * 2010-10-25 2012-05-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones
EP2450880A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data

Also Published As

Publication number Publication date
EP2962297B1 (en) 2019-06-05
CN105027199B (zh) 2018-05-29
KR20150123311A (ko) 2015-11-03
HUE045446T2 (hu) 2019-12-30
US9685163B2 (en) 2017-06-20
CN105027200A (zh) 2015-11-04
TW201503712A (zh) 2015-01-16
EP2962298A2 (en) 2016-01-06
WO2014134472A3 (en) 2015-03-19
ES2738490T3 (es) 2020-01-23
BR112015020892A2 (pt) 2017-07-18
KR101854964B1 (ko) 2018-05-04
TW201446016A (zh) 2014-12-01
WO2014134462A3 (en) 2014-11-13
JP2016513811A (ja) 2016-05-16
US9959875B2 (en) 2018-05-01
JP2016510905A (ja) 2016-04-11
WO2014134472A2 (en) 2014-09-04
KR20150123310A (ko) 2015-11-03
EP2962298B1 (en) 2019-04-24
EP2962297A2 (en) 2016-01-06
WO2014134462A2 (en) 2014-09-04
CN105027200B (zh) 2019-04-09
US20140249827A1 (en) 2014-09-04
TWI583210B (zh) 2017-05-11
US20140247946A1 (en) 2014-09-04
CN105027199A (zh) 2015-11-04

Similar Documents

Publication Publication Date Title
TWI603631B (zh) 產生及處理表示音訊內容之位元串流之方法、器件及非暫時性電腦可讀儲存媒體
US20220030372A1 (en) Reordering Of Audio Objects In The Ambisonics Domain
US9384741B2 (en) Binauralization of rotated higher order ambisonics
US20150127354A1 (en) Near field compensation for decomposed representations of a sound field
US20150332682A1 (en) Spatial relation coding for higher order ambisonic coefficients
JP2016524726A (ja) 球面調和係数に対して空間マスキングを実行すること
EP3165001A1 (en) Reducing correlation between higher order ambisonic (hoa) background channels
TW201714169A (zh) 自以通道為基礎之音訊至高階立體混響之轉換

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees