TWI583210B - 變換球諧係數 - Google Patents

變換球諧係數 Download PDF

Info

Publication number
TWI583210B
TWI583210B TW103107142A TW103107142A TWI583210B TW I583210 B TWI583210 B TW I583210B TW 103107142 A TW103107142 A TW 103107142A TW 103107142 A TW103107142 A TW 103107142A TW I583210 B TWI583210 B TW I583210B
Authority
TW
Taiwan
Prior art keywords
sound field
hierarchical elements
information
bit stream
transformed
Prior art date
Application number
TW103107142A
Other languages
English (en)
Chinese (zh)
Other versions
TW201503712A (zh
Inventor
迪潘強 森
馬汀 詹姆士 摩瑞爾
尼爾斯 古恩瑟 彼得斯
Original Assignee
高通公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 高通公司 filed Critical 高通公司
Publication of TW201503712A publication Critical patent/TW201503712A/zh
Application granted granted Critical
Publication of TWI583210B publication Critical patent/TWI583210B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
TW103107142A 2013-03-01 2014-03-03 變換球諧係數 TWI583210B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201361771677P 2013-03-01 2013-03-01
US201361860201P 2013-07-30 2013-07-30
US14/192,829 US9685163B2 (en) 2013-03-01 2014-02-27 Transforming spherical harmonic coefficients

Publications (2)

Publication Number Publication Date
TW201503712A TW201503712A (zh) 2015-01-16
TWI583210B true TWI583210B (zh) 2017-05-11

Family

ID=51420957

Family Applications (2)

Application Number Title Priority Date Filing Date
TW103107142A TWI583210B (zh) 2013-03-01 2014-03-03 變換球諧係數
TW103107128A TWI603631B (zh) 2013-03-01 2014-03-03 產生及處理表示音訊內容之位元串流之方法、器件及非暫時性電腦可讀儲存媒體

Family Applications After (1)

Application Number Title Priority Date Filing Date
TW103107128A TWI603631B (zh) 2013-03-01 2014-03-03 產生及處理表示音訊內容之位元串流之方法、器件及非暫時性電腦可讀儲存媒體

Country Status (10)

Country Link
US (2) US9685163B2 (https=)
EP (2) EP2962298B1 (https=)
JP (2) JP2016510905A (https=)
KR (2) KR101854964B1 (https=)
CN (2) CN105027200B (https=)
BR (1) BR112015020892A2 (https=)
ES (1) ES2738490T3 (https=)
HU (1) HUE045446T2 (https=)
TW (2) TWI583210B (https=)
WO (2) WO2014134462A2 (https=)

Families Citing this family (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2665208A1 (en) 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
US9685163B2 (en) 2013-03-01 2017-06-20 Qualcomm Incorporated Transforming spherical harmonic coefficients
US9412385B2 (en) * 2013-05-28 2016-08-09 Qualcomm Incorporated Performing spatial masking with respect to spherical harmonic coefficients
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9495968B2 (en) 2013-05-29 2016-11-15 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
US9384741B2 (en) * 2013-05-29 2016-07-05 Qualcomm Incorporated Binauralization of rotated higher order ambisonics
WO2014195190A1 (en) * 2013-06-05 2014-12-11 Thomson Licensing Method for encoding audio signals, apparatus for encoding audio signals, method for decoding audio signals and apparatus for decoding audio signals
EP2879408A1 (en) * 2013-11-28 2015-06-03 Thomson Licensing Method and apparatus for higher order ambisonics encoding and decoding using singular value decomposition
US9502045B2 (en) 2014-01-30 2016-11-22 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
KR102643537B1 (ko) * 2014-09-12 2024-03-06 소니그룹주식회사 송신 장치, 송신 방법, 수신 장치 및 수신 방법
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
US10304471B2 (en) * 2014-10-24 2019-05-28 Dolby International Ab Encoding and decoding of audio signals
US10452651B1 (en) 2014-12-23 2019-10-22 Palantir Technologies Inc. Searching charts
CN104795064B (zh) * 2015-03-30 2018-04-13 福州大学 低信噪比声场景下声音事件的识别方法
US10706860B2 (en) 2015-10-08 2020-07-07 Dolby International Ab Layered coding for compressed sound or sound field representations
FR3050601B1 (fr) * 2016-04-26 2018-06-22 Arkamys Procede et systeme de diffusion d'un signal audio a 360°
MC200186B1 (fr) * 2016-09-30 2017-10-18 Coronal Encoding Procédé de conversion, d'encodage stéréophonique, de décodage et de transcodage d'un signal audio tridimensionnel
JP7115477B2 (ja) * 2017-07-05 2022-08-09 ソニーグループ株式会社 信号処理装置および方法、並びにプログラム
CN117319917A (zh) 2017-07-14 2023-12-29 弗劳恩霍夫应用研究促进协会 使用多点声场描述生成经修改的声场描述的装置及方法
KR102540642B1 (ko) 2017-07-14 2023-06-08 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 다중-층 묘사를 이용하여 증강된 음장 묘사 또는 수정된 음장 묘사를 생성하기 위한 개념
RU2736274C1 (ru) 2017-07-14 2020-11-13 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Принцип формирования улучшенного описания звукового поля или модифицированного описания звукового поля с использованием dirac-технологии с расширением глубины или других технологий
US10075802B1 (en) 2017-08-08 2018-09-11 Qualcomm Incorporated Bitrate allocation for higher order ambisonic audio data
US11281726B2 (en) 2017-12-01 2022-03-22 Palantir Technologies Inc. System and methods for faster processor comparisons of visual graph features
US10419138B2 (en) 2017-12-22 2019-09-17 At&T Intellectual Property I, L.P. Radio-based channel sounding using phased array antennas
GB2572650A (en) * 2018-04-06 2019-10-09 Nokia Technologies Oy Spatial audio parameters and associated spatial audio playback
WO2019204214A2 (en) 2018-04-16 2019-10-24 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for encoding and decoding of directional sound sources
EP3818524B1 (en) * 2018-07-02 2023-12-13 Dolby Laboratories Licensing Corporation Methods and devices for generating or decoding a bitstream comprising immersive audio signals
WO2020008112A1 (en) * 2018-07-03 2020-01-09 Nokia Technologies Oy Energy-ratio signalling and synthesis
US12308034B2 (en) * 2019-06-24 2025-05-20 Qualcomm Incorporated Performing psychoacoustic audio coding based on operating conditions
US12142285B2 (en) 2019-06-24 2024-11-12 Qualcomm Incorporated Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding
US11043742B2 (en) 2019-07-31 2021-06-22 At&T Intellectual Property I, L.P. Phased array mobile channel sounding system
EP4055840A1 (en) 2019-11-04 2022-09-14 Qualcomm Incorporated Signalling of audio effect metadata in a bitstream
CN116391365A (zh) * 2020-09-25 2023-07-04 苹果公司 高阶环境立体声编码和解码
CN116868588A (zh) * 2020-11-03 2023-10-10 弗劳恩霍夫应用研究促进协会 用于音频信号变换的装置和方法
EP4174637A1 (en) * 2021-10-26 2023-05-03 Koninklijke Philips N.V. Bitstream representing audio in an environment
US20250078845A1 (en) * 2023-08-29 2025-03-06 Samsung Electronics Co., Ltd. Lossless audio coding for multichannel hierarchical reconstruction

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1992015180A1 (en) * 1991-02-15 1992-09-03 Trifield Productions Ltd. Sound reproduction system
US5594800A (en) * 1991-02-15 1997-01-14 Trifield Productions Limited Sound reproduction system having a matrix converter
JPH1118199A (ja) * 1997-06-26 1999-01-22 Nippon Columbia Co Ltd 音響処理装置
WO2001082651A1 (en) * 2000-04-19 2001-11-01 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions
US20050035965A1 (en) * 2003-08-15 2005-02-17 Peter-Pike Sloan Clustered principal components for precomputed radiance transfer
TW200638338A (en) * 2005-04-29 2006-11-01 Microsoft Corp Systems and methods for 3D audio programming and processing
CN102333265A (zh) * 2011-05-20 2012-01-25 南京大学 一种基于连续声源概念的三维局部空间声场重放方法
EP2459742A1 (en) * 2009-07-29 2012-06-06 Pharnext New diagnostic tools for alzheimer disease
US20120314878A1 (en) * 2010-02-26 2012-12-13 France Telecom Multichannel audio stream compression

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AUPO099696A0 (en) 1996-07-12 1996-08-08 Lake Dsp Pty Limited Methods and apparatus for processing spatialised audio
US6021206A (en) 1996-10-02 2000-02-01 Lake Dsp Pty Ltd Methods and apparatus for processing spatialised audio
FR2847376B1 (fr) * 2002-11-19 2005-02-04 France Telecom Procede de traitement de donnees sonores et dispositif d'acquisition sonore mettant en oeuvre ce procede
AU2005241905A1 (en) * 2004-04-21 2005-11-17 Dolby Laboratories Licensing Corporation Audio bitstream format in which the bitstream syntax is described by an ordered transversal of a tree hierarchy data structure
FR2898725A1 (fr) 2006-03-15 2007-09-21 France Telecom Dispositif et procede de codage gradue d'un signal audio multi-canal selon une analyse en composante principale
US7589725B2 (en) 2006-06-30 2009-09-15 Microsoft Corporation Soft shadows in dynamic scenes
FR2916079A1 (fr) * 2007-05-10 2008-11-14 France Telecom Procede de codage et decodage audio, codeur audio, decodeur audio et programmes d'ordinateur associes
MX2011013829A (es) * 2009-06-24 2012-03-07 Fraunhofer Ges Forschung Decodificador de señales de audio, metodo para decodificar una señal de audio y programa de computacion que utiliza etapas en cascada de procesamiento de objetos de audio.
US9552840B2 (en) * 2010-10-25 2017-01-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones
EP2450880A1 (en) 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data
EP2469741A1 (en) * 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
EP2541547A1 (en) 2011-06-30 2013-01-02 Thomson Licensing Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation
KR102185941B1 (ko) * 2011-07-01 2020-12-03 돌비 레버러토리즈 라이쎈싱 코오포레이션 적응형 오디오 신호 생성, 코딩 및 렌더링을 위한 시스템 및 방법
US20140214431A1 (en) * 2011-07-01 2014-07-31 Dolby Laboratories Licensing Corporation Sample rate scalable lossless audio coding
EP2898506B1 (en) 2012-09-21 2018-01-17 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
EP2743922A1 (en) 2012-12-12 2014-06-18 Thomson Licensing Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
US9685163B2 (en) 2013-03-01 2017-06-20 Qualcomm Incorporated Transforming spherical harmonic coefficients

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1992015180A1 (en) * 1991-02-15 1992-09-03 Trifield Productions Ltd. Sound reproduction system
US5594800A (en) * 1991-02-15 1997-01-14 Trifield Productions Limited Sound reproduction system having a matrix converter
JPH1118199A (ja) * 1997-06-26 1999-01-22 Nippon Columbia Co Ltd 音響処理装置
WO2001082651A1 (en) * 2000-04-19 2001-11-01 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions
US20050035965A1 (en) * 2003-08-15 2005-02-17 Peter-Pike Sloan Clustered principal components for precomputed radiance transfer
TW200638338A (en) * 2005-04-29 2006-11-01 Microsoft Corp Systems and methods for 3D audio programming and processing
EP2459742A1 (en) * 2009-07-29 2012-06-06 Pharnext New diagnostic tools for alzheimer disease
US20120314878A1 (en) * 2010-02-26 2012-12-13 France Telecom Multichannel audio stream compression
CN102333265A (zh) * 2011-05-20 2012-01-25 南京大学 一种基于连续声源概念的三维局部空间声场重放方法

Also Published As

Publication number Publication date
JP2016510905A (ja) 2016-04-11
EP2962297A2 (en) 2016-01-06
WO2014134472A2 (en) 2014-09-04
BR112015020892A2 (pt) 2017-07-18
KR20150123311A (ko) 2015-11-03
EP2962298B1 (en) 2019-04-24
WO2014134462A2 (en) 2014-09-04
TWI603631B (zh) 2017-10-21
CN105027200B (zh) 2019-04-09
ES2738490T3 (es) 2020-01-23
US9685163B2 (en) 2017-06-20
TW201446016A (zh) 2014-12-01
WO2014134472A3 (en) 2015-03-19
CN105027199B (zh) 2018-05-29
US9959875B2 (en) 2018-05-01
US20140247946A1 (en) 2014-09-04
US20140249827A1 (en) 2014-09-04
JP2016513811A (ja) 2016-05-16
KR20150123310A (ko) 2015-11-03
HUE045446T2 (hu) 2019-12-30
TW201503712A (zh) 2015-01-16
EP2962297B1 (en) 2019-06-05
CN105027199A (zh) 2015-11-04
KR101854964B1 (ko) 2018-05-04
WO2014134462A3 (en) 2014-11-13
EP2962298A2 (en) 2016-01-06
CN105027200A (zh) 2015-11-04

Similar Documents

Publication Publication Date Title
TWI583210B (zh) 變換球諧係數
US9384741B2 (en) Binauralization of rotated higher order ambisonics
JP6199519B2 (ja) 音場の分解された表現の圧縮
US20150127354A1 (en) Near field compensation for decomposed representations of a sound field
US20150332682A1 (en) Spatial relation coding for higher order ambisonic coefficients
JP2016524726A (ja) 球面調和係数に対して空間マスキングを実行すること
WO2016004277A1 (en) Reducing correlation between higher order ambisonic (hoa) background channels
TW201714169A (zh) 自以通道為基礎之音訊至高階立體混響之轉換
TW201517022A (zh) 球面諧波係數之寫碼

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees