TWI583210B - 變換球諧係數 - Google Patents

變換球諧係數 Download PDF

Info

Publication number
TWI583210B
TWI583210B TW103107142A TW103107142A TWI583210B TW I583210 B TWI583210 B TW I583210B TW 103107142 A TW103107142 A TW 103107142A TW 103107142 A TW103107142 A TW 103107142A TW I583210 B TWI583210 B TW I583210B
Authority
TW
Taiwan
Prior art keywords
sound field
hierarchical elements
information
bit stream
transformed
Prior art date
Application number
TW103107142A
Other languages
English (en)
Chinese (zh)
Other versions
TW201503712A (zh
Inventor
迪潘強 森
馬汀 詹姆士 摩瑞爾
尼爾斯 古恩瑟 彼得斯
Original Assignee
高通公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 高通公司 filed Critical 高通公司
Publication of TW201503712A publication Critical patent/TW201503712A/zh
Application granted granted Critical
Publication of TWI583210B publication Critical patent/TWI583210B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
TW103107142A 2013-03-01 2014-03-03 變換球諧係數 TWI583210B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201361771677P 2013-03-01 2013-03-01
US201361860201P 2013-07-30 2013-07-30
US14/192,829 US9685163B2 (en) 2013-03-01 2014-02-27 Transforming spherical harmonic coefficients

Publications (2)

Publication Number Publication Date
TW201503712A TW201503712A (zh) 2015-01-16
TWI583210B true TWI583210B (zh) 2017-05-11

Family

ID=51420957

Family Applications (2)

Application Number Title Priority Date Filing Date
TW103107142A TWI583210B (zh) 2013-03-01 2014-03-03 變換球諧係數
TW103107128A TWI603631B (zh) 2013-03-01 2014-03-03 產生及處理表示音訊內容之位元串流之方法、器件及非暫時性電腦可讀儲存媒體

Family Applications After (1)

Application Number Title Priority Date Filing Date
TW103107128A TWI603631B (zh) 2013-03-01 2014-03-03 產生及處理表示音訊內容之位元串流之方法、器件及非暫時性電腦可讀儲存媒體

Country Status (10)

Country Link
US (2) US9685163B2 (OSRAM)
EP (2) EP2962297B1 (OSRAM)
JP (2) JP2016510905A (OSRAM)
KR (2) KR101854964B1 (OSRAM)
CN (2) CN105027199B (OSRAM)
BR (1) BR112015020892A2 (OSRAM)
ES (1) ES2738490T3 (OSRAM)
HU (1) HUE045446T2 (OSRAM)
TW (2) TWI583210B (OSRAM)
WO (2) WO2014134462A2 (OSRAM)

Families Citing this family (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2665208A1 (en) 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
US9685163B2 (en) 2013-03-01 2017-06-20 Qualcomm Incorporated Transforming spherical harmonic coefficients
US9412385B2 (en) * 2013-05-28 2016-08-09 Qualcomm Incorporated Performing spatial masking with respect to spherical harmonic coefficients
US9384741B2 (en) * 2013-05-29 2016-07-05 Qualcomm Incorporated Binauralization of rotated higher order ambisonics
US9502044B2 (en) 2013-05-29 2016-11-22 Qualcomm Incorporated Compression of decomposed representations of a sound field
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
EP3005354B1 (en) * 2013-06-05 2019-07-03 Dolby International AB Method for encoding audio signals, apparatus for encoding audio signals, method for decoding audio signals and apparatus for decoding audio signals
EP2879408A1 (en) * 2013-11-28 2015-06-03 Thomson Licensing Method and apparatus for higher order ambisonics encoding and decoding using singular value decomposition
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
EP4481575A3 (en) * 2014-09-12 2025-03-05 Sony Group Corporation Transmission device, transmission method, reception device, and reception method
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
RU2708942C2 (ru) * 2014-10-24 2019-12-12 Долби Интернешнл Аб Кодирование и декодирование аудиосигналов
US10452651B1 (en) 2014-12-23 2019-10-22 Palantir Technologies Inc. Searching charts
CN104795064B (zh) * 2015-03-30 2018-04-13 福州大学 低信噪比声场景下声音事件的识别方法
CN108140391B (zh) 2015-10-08 2022-12-16 杜比国际公司 用于压缩声音或声场表示的分层编解码
FR3050601B1 (fr) * 2016-04-26 2018-06-22 Arkamys Procede et systeme de diffusion d'un signal audio a 360°
MC200186B1 (fr) * 2016-09-30 2017-10-18 Coronal Encoding Procédé de conversion, d'encodage stéréophonique, de décodage et de transcodage d'un signal audio tridimensionnel
US11252524B2 (en) * 2017-07-05 2022-02-15 Sony Corporation Synthesizing a headphone signal using a rotating head-related transfer function
RU2736274C1 (ru) 2017-07-14 2020-11-13 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Принцип формирования улучшенного описания звукового поля или модифицированного описания звукового поля с использованием dirac-технологии с расширением глубины или других технологий
KR102652670B1 (ko) 2017-07-14 2024-04-01 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 다중-층 묘사를 이용하여 증강된 음장 묘사 또는 수정된 음장 묘사를 생성하기 위한 개념
CN117319917A (zh) 2017-07-14 2023-12-29 弗劳恩霍夫应用研究促进协会 使用多点声场描述生成经修改的声场描述的装置及方法
US10075802B1 (en) 2017-08-08 2018-09-11 Qualcomm Incorporated Bitrate allocation for higher order ambisonic audio data
US11281726B2 (en) * 2017-12-01 2022-03-22 Palantir Technologies Inc. System and methods for faster processor comparisons of visual graph features
US10419138B2 (en) 2017-12-22 2019-09-17 At&T Intellectual Property I, L.P. Radio-based channel sounding using phased array antennas
GB2572650A (en) * 2018-04-06 2019-10-09 Nokia Technologies Oy Spatial audio parameters and associated spatial audio playback
WO2019204214A2 (en) 2018-04-16 2019-10-24 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for encoding and decoding of directional sound sources
BR112020016948A2 (pt) * 2018-07-02 2020-12-15 Dolby Laboratories Licensing Corporation Métodos e dispositivos para gerar ou decodificar um fluxo de bits compreendendo sinais de áudio imersivos
EP3818730A4 (en) * 2018-07-03 2022-08-31 Nokia Technologies Oy Energy-ratio signalling and synthesis
US12308034B2 (en) * 2019-06-24 2025-05-20 Qualcomm Incorporated Performing psychoacoustic audio coding based on operating conditions
US12142285B2 (en) 2019-06-24 2024-11-12 Qualcomm Incorporated Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding
US11043742B2 (en) 2019-07-31 2021-06-22 At&T Intellectual Property I, L.P. Phased array mobile channel sounding system
CN114631332B (zh) 2019-11-04 2025-10-10 高通股份有限公司 比特流中音频效果元数据的信令
CN116391365A (zh) * 2020-09-25 2023-07-04 苹果公司 高阶环境立体声编码和解码
WO2022096376A2 (en) * 2020-11-03 2022-05-12 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for audio signal transformation
EP4174637A1 (en) * 2021-10-26 2023-05-03 Koninklijke Philips N.V. Bitstream representing audio in an environment

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1992015180A1 (en) * 1991-02-15 1992-09-03 Trifield Productions Ltd. Sound reproduction system
US5594800A (en) * 1991-02-15 1997-01-14 Trifield Productions Limited Sound reproduction system having a matrix converter
JPH1118199A (ja) * 1997-06-26 1999-01-22 Nippon Columbia Co Ltd 音響処理装置
WO2001082651A1 (en) * 2000-04-19 2001-11-01 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions
US20050035965A1 (en) * 2003-08-15 2005-02-17 Peter-Pike Sloan Clustered principal components for precomputed radiance transfer
TW200638338A (en) * 2005-04-29 2006-11-01 Microsoft Corp Systems and methods for 3D audio programming and processing
CN102333265A (zh) * 2011-05-20 2012-01-25 南京大学 一种基于连续声源概念的三维局部空间声场重放方法
EP2459742A1 (en) * 2009-07-29 2012-06-06 Pharnext New diagnostic tools for alzheimer disease
US20120314878A1 (en) * 2010-02-26 2012-12-13 France Telecom Multichannel audio stream compression

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AUPO099696A0 (en) 1996-07-12 1996-08-08 Lake Dsp Pty Limited Methods and apparatus for processing spatialised audio
US6021206A (en) 1996-10-02 2000-02-01 Lake Dsp Pty Ltd Methods and apparatus for processing spatialised audio
FR2847376B1 (fr) * 2002-11-19 2005-02-04 France Telecom Procede de traitement de donnees sonores et dispositif d'acquisition sonore mettant en oeuvre ce procede
CN1942931A (zh) * 2004-04-21 2007-04-04 杜比实验室特许公司 通过树型分层数据结构的有序横向结构描述比特流语法的音频比特流格式
FR2898725A1 (fr) 2006-03-15 2007-09-21 France Telecom Dispositif et procede de codage gradue d'un signal audio multi-canal selon une analyse en composante principale
US7589725B2 (en) 2006-06-30 2009-09-15 Microsoft Corporation Soft shadows in dynamic scenes
FR2916079A1 (fr) * 2007-05-10 2008-11-14 France Telecom Procede de codage et decodage audio, codeur audio, decodeur audio et programmes d'ordinateur associes
PL2446435T3 (pl) * 2009-06-24 2013-11-29 Fraunhofer Ges Forschung Dekoder sygnału audio, sposób dekodowania sygnału audio i program komputerowy wykorzystujący kaskadowe etapy przetwarzania obiektów audio
US9552840B2 (en) 2010-10-25 2017-01-24 Qualcomm Incorporated Three-dimensional sound capturing and reproducing with multi-microphones
EP2450880A1 (en) 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data
EP2469741A1 (en) 2010-12-21 2012-06-27 Thomson Licensing Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field
EP2541547A1 (en) 2011-06-30 2013-01-02 Thomson Licensing Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation
CN103650037B (zh) * 2011-07-01 2015-12-09 杜比实验室特许公司 采样率可分级的无损音频编码
TWI853425B (zh) * 2011-07-01 2024-08-21 美商杜比實驗室特許公司 用於適應性音頻信號的產生、譯碼與呈現之系統與方法
US9460729B2 (en) 2012-09-21 2016-10-04 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
EP2743922A1 (en) 2012-12-12 2014-06-18 Thomson Licensing Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
US9685163B2 (en) 2013-03-01 2017-06-20 Qualcomm Incorporated Transforming spherical harmonic coefficients

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1992015180A1 (en) * 1991-02-15 1992-09-03 Trifield Productions Ltd. Sound reproduction system
US5594800A (en) * 1991-02-15 1997-01-14 Trifield Productions Limited Sound reproduction system having a matrix converter
JPH1118199A (ja) * 1997-06-26 1999-01-22 Nippon Columbia Co Ltd 音響処理装置
WO2001082651A1 (en) * 2000-04-19 2001-11-01 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions
US20050035965A1 (en) * 2003-08-15 2005-02-17 Peter-Pike Sloan Clustered principal components for precomputed radiance transfer
TW200638338A (en) * 2005-04-29 2006-11-01 Microsoft Corp Systems and methods for 3D audio programming and processing
EP2459742A1 (en) * 2009-07-29 2012-06-06 Pharnext New diagnostic tools for alzheimer disease
US20120314878A1 (en) * 2010-02-26 2012-12-13 France Telecom Multichannel audio stream compression
CN102333265A (zh) * 2011-05-20 2012-01-25 南京大学 一种基于连续声源概念的三维局部空间声场重放方法

Also Published As

Publication number Publication date
EP2962298A2 (en) 2016-01-06
US20140247946A1 (en) 2014-09-04
JP2016513811A (ja) 2016-05-16
US9959875B2 (en) 2018-05-01
TW201503712A (zh) 2015-01-16
WO2014134462A3 (en) 2014-11-13
JP2016510905A (ja) 2016-04-11
KR101854964B1 (ko) 2018-05-04
CN105027200A (zh) 2015-11-04
TWI603631B (zh) 2017-10-21
WO2014134472A2 (en) 2014-09-04
WO2014134462A2 (en) 2014-09-04
WO2014134472A3 (en) 2015-03-19
EP2962297A2 (en) 2016-01-06
HUE045446T2 (hu) 2019-12-30
ES2738490T3 (es) 2020-01-23
US9685163B2 (en) 2017-06-20
CN105027199B (zh) 2018-05-29
EP2962298B1 (en) 2019-04-24
CN105027200B (zh) 2019-04-09
TW201446016A (zh) 2014-12-01
BR112015020892A2 (pt) 2017-07-18
US20140249827A1 (en) 2014-09-04
CN105027199A (zh) 2015-11-04
KR20150123310A (ko) 2015-11-03
KR20150123311A (ko) 2015-11-03
EP2962297B1 (en) 2019-06-05

Similar Documents

Publication Publication Date Title
TWI583210B (zh) 變換球諧係數
US9384741B2 (en) Binauralization of rotated higher order ambisonics
JP6199519B2 (ja) 音場の分解された表現の圧縮
US20150127354A1 (en) Near field compensation for decomposed representations of a sound field
US20150332682A1 (en) Spatial relation coding for higher order ambisonic coefficients
JP2016524726A (ja) 球面調和係数に対して空間マスキングを実行すること
EP3165001A1 (en) Reducing correlation between higher order ambisonic (hoa) background channels
TW201714169A (zh) 自以通道為基礎之音訊至高階立體混響之轉換
TW201517022A (zh) 球面諧波係數之寫碼

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees