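CN105027199B - Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams - Google Patents
Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams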
- Publication number
- CN105027199B (application CN201480011198.1A)
- Authority
- CN
- China
- Prior art keywords
- bitstream
- spherical harmonic
- harmonic coefficients
- sound field
- bits
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
- Circuit For Audible Band Transducer (AREA)
Applications Claiming Priority (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201361771677P | 2013-03-01 | 2013-03-01 | |
| US61/771,677 | 2013-03-01 | | |
| US201361860201P | 2013-07-30 | 2013-07-30 | |
| US61/860,201 | 2013-07-30 | | |
| US14/192,819 | 2014-02-27 | | |
| US14/192,819 US9959875B2 (en) | 2013-03-01 | 2014-02-27 | Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams |
| PCT/US2014/019446 WO2014134462A2 (en) | 2013-03-01 | 2014-02-28 | Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN105027199A (zh) | 2015-11-04 |
| CN105027199B (zh) | 2018-05-29 |
Family
ID=51420957
Family Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201480011198.1A (CN105027199B, Active) | 2013-03-01 | 2014-02-28 | Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams |
| CN201480011287.6A (CN105027200B, Active) | 2013-03-01 | 2014-02-28 | Transforming spherical harmonic coefficients |
Family Applications After (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201480011287.6A (CN105027200B, Active) | 2013-03-01 | 2014-02-28 | Transforming spherical harmonic coefficients |
Country Status (10)
| Country | Link |
|---|---|
| US (2) | US9959875B2 |
| EP (2) | EP2962297B1 |
| JP (2) | JP2016510905A |
| KR (2) | KR20150123310A |
| CN (2) | CN105027199B |
| BR (1) | BR112015020892A2 |
| ES (1) | ES2738490T3 |
| HU (1) | HUE045446T2 |
| TW (2) | TWI603631B |
| WO (2) | WO2014134472A2 |
Families Citing this family (39)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2665208A1 (en) | 2012-05-14 | 2013-11-20 | Thomson Licensing | Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation |
| US9959875B2 (en) | 2013-03-01 | 2018-05-01 | Qualcomm Incorporated | Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams |
| US9412385B2 (en) * | 2013-05-28 | 2016-08-09 | Qualcomm Incorporated | Performing spatial masking with respect to spherical harmonic coefficients |
| US9466305B2 (en) | 2013-05-29 | 2016-10-11 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
| US9769586B2 (en) | 2013-05-29 | 2017-09-19 | Qualcomm Incorporated | Performing order reduction with respect to higher order ambisonic coefficients |
| US9384741B2 (en) * | 2013-05-29 | 2016-07-05 | Qualcomm Incorporated | Binauralization of rotated higher order ambisonics |
| EP3005354B1 (en) * | 2013-06-05 | 2019-07-03 | Dolby International AB | Method for encoding audio signals, apparatus for encoding audio signals, method for decoding audio signals and apparatus for decoding audio signals |
| EP2879408A1 (en) * | 2013-11-28 | 2015-06-03 | Thomson Licensing | Method and apparatus for higher order ambisonics encoding and decoding using singular value decomposition |
| US9489955B2 (en) | 2014-01-30 | 2016-11-08 | Qualcomm Incorporated | Indicating frame parameter reusability for coding vectors |
| US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
| US9620137B2 (en) | 2014-05-16 | 2017-04-11 | Qualcomm Incorporated | Determining between scalar and vector quantization in higher order ambisonic coefficients |
| US9852737B2 (en) | 2014-05-16 | 2017-12-26 | Qualcomm Incorporated | Coding vectors decomposed from higher-order ambisonics audio signals |
| US10770087B2 (en) | 2014-05-16 | 2020-09-08 | Qualcomm Incorporated | Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals |
| US10547701B2 (en) * | 2014-09-12 | 2020-01-28 | Sony Corporation | Transmission device, transmission method, reception device, and a reception method |
| US9747910B2 (en) | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
| WO2016062869A1 (en) * | 2014-10-24 | 2016-04-28 | Dolby International Ab | Encoding and decoding of audio signals |
| US10452651B1 (en) | 2014-12-23 | 2019-10-22 | Palantir Technologies Inc. | Searching charts |
| CN104795064B (zh) * | 2015-03-30 | 2018-04-13 | Fuzhou University | Method for recognizing sound events in low signal-to-noise ratio sound scenes |
| MX392462B (es) | 2015-10-08 | 2025-03-24 | Dolby Int Ab | Layered coding for compressed sound or sound field representations |
| FR3050601B1 (fr) * | 2016-04-26 | 2018-06-22 | Arkamys | Method and system for broadcasting a 360° audio signal |
| MC200186B1 (fr) * | 2016-09-30 | 2017-10-18 | Coronal Encoding | Method for conversion, stereophonic encoding, decoding and transcoding of a three-dimensional audio signal |
| JP7115477B2 (ja) * | 2017-07-05 | 2022-08-09 | Sony Group Corporation | Signal processing device and method, and program |
| RU2736274C1 (ru) | 2017-07-14 | 2020-11-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Concept for generating an enhanced sound field description or a modified sound field description using a depth-extended DirAC technique or other techniques |
| KR102652670B1 (ko) | 2017-07-14 | 2024-04-01 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Concept for generating an enhanced sound field description or a modified sound field description using a multi-layer description |
| AU2018298874C1 (en) | 2017-07-14 | 2023-10-19 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Concept for generating an enhanced sound field description or a modified sound field description using a multi-point sound field description |
| US10075802B1 (en) | 2017-08-08 | 2018-09-11 | Qualcomm Incorporated | Bitrate allocation for higher order ambisonic audio data |
| US11281726B2 (en) * | 2017-12-01 | 2022-03-22 | Palantir Technologies Inc. | System and methods for faster processor comparisons of visual graph features |
| US10419138B2 (en) * | 2017-12-22 | 2019-09-17 | At&T Intellectual Property I, L.P. | Radio-based channel sounding using phased array antennas |
| GB2572650A (en) * | 2018-04-06 | 2019-10-09 | Nokia Technologies Oy | Spatial audio parameters and associated spatial audio playback |
| US11315578B2 (en) | 2018-04-16 | 2022-04-26 | Dolby Laboratories Licensing Corporation | Methods, apparatus and systems for encoding and decoding of directional sound sources |
| CN111819627B (zh) * | 2018-07-02 | 2025-04-11 | Dolby Laboratories Licensing Corporation | Methods and devices for encoding and/or decoding immersive audio signals |
| WO2020008112A1 (en) * | 2018-07-03 | 2020-01-09 | Nokia Technologies Oy | Energy-ratio signalling and synthesis |
| US12308034B2 (en) * | 2019-06-24 | 2025-05-20 | Qualcomm Incorporated | Performing psychoacoustic audio coding based on operating conditions |
| US12142285B2 (en) | 2019-06-24 | 2024-11-12 | Qualcomm Incorporated | Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding |
| US11043742B2 (en) | 2019-07-31 | 2021-06-22 | At&T Intellectual Property I, L.P. | Phased array mobile channel sounding system |
| WO2021091769A1 (en) * | 2019-11-04 | 2021-05-14 | Qualcomm Incorporated | Signalling of audio effect metadata in a bitstream |
| GB2615236A (en) * | 2020-09-25 | 2023-08-02 | Apple Inc | Higher order ambisonics encoding and decoding |
| CN116868588A (zh) * | 2020-11-03 | 2023-10-10 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for audio signal transformation |
| EP4174637A1 (en) * | 2021-10-26 | 2023-05-03 | Koninklijke Philips N.V. | Bitstream representing audio in an environment |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
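| CN1942931A (zh) * | 2004-04-21 | 2007-04-04 | Dolby Laboratories Licensing Corporation | Audio bitstream format in which the bitstream syntax is described by an ordered traversal of a tree hierarchy data structure |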
| EP2450880A1 (en) * | 2010-11-05 | 2012-05-09 | Thomson Licensing | Data structure for Higher Order Ambisonics audio data |
Family Cites Families (24)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB9103207D0 (en) | 1991-02-15 | 1991-04-03 | Gerzon Michael A | Stereophonic sound reproduction system |
| US5594800A (en) | 1991-02-15 | 1997-01-14 | Trifield Productions Limited | Sound reproduction system having a matrix converter |
| AUPO099696A0 (en) | 1996-07-12 | 1996-08-08 | Lake Dsp Pty Limited | Methods and apparatus for processing spatialised audio |
| US6021206A (en) | 1996-10-02 | 2000-02-01 | Lake Dsp Pty Ltd | Methods and apparatus for processing spatialised audio |
| JPH1118199A (ja) | 1997-06-26 | 1999-01-22 | Nippon Columbia Co Ltd | 音響処理装置 |
| CA2406926A1 (en) | 2000-04-19 | 2001-11-01 | Sonic Solutions | Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions |
| FR2847376B1 (fr) * | 2002-11-19 | 2005-02-04 | France Telecom | Method for processing sound data and sound acquisition device implementing this method |
| US7167176B2 (en) | 2003-08-15 | 2007-01-23 | Microsoft Corporation | Clustered principal components for precomputed radiance transfer |
| US20060247918A1 (en) | 2005-04-29 | 2006-11-02 | Microsoft Corporation | Systems and methods for 3D audio programming and processing |
| FR2898725A1 (fr) | 2006-03-15 | 2007-09-21 | France Telecom | Device and method for scalable coding of a multi-channel audio signal based on principal component analysis |
| US7589725B2 (en) | 2006-06-30 | 2009-09-15 | Microsoft Corporation | Soft shadows in dynamic scenes |
| FR2916079A1 (fr) * | 2007-05-10 | 2008-11-14 | France Telecom | Audio encoding and decoding method, audio encoder, audio decoder and associated computer programs |
| SG177277A1 (en) * | 2009-06-24 | 2012-02-28 | Fraunhofer Ges Forschung | Audio signal decoder, method for decoding an audio signal and computer program using cascaded audio object processing stages |
| WO2011012672A1 (en) * | 2009-07-29 | 2011-02-03 | Pharnext | New diagnostic tools for alzheimer disease |
| EP2539892B1 (fr) * | 2010-02-26 | 2014-04-02 | Orange | Multichannel audio stream compression |
| US9552840B2 (en) * | 2010-10-25 | 2017-01-24 | Qualcomm Incorporated | Three-dimensional sound capturing and reproducing with multi-microphones |
| EP2469741A1 (en) | 2010-12-21 | 2012-06-27 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
| CN102333265B (zh) | 2011-05-20 | 2014-02-19 | 南京大学 | 一种基于连续声源概念的三维局部空间声场重放方法 |
| EP2541547A1 (en) | 2011-06-30 | 2013-01-02 | Thomson Licensing | Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation |
| WO2013006322A1 (en) * | 2011-07-01 | 2013-01-10 | Dolby Laboratories Licensing Corporation | Sample rate scalable lossless audio coding |
| TWI853425B (zh) * | 2011-07-01 | 2024-08-21 | Dolby Laboratories Licensing Corporation | System and method for adaptive audio signal generation, coding and rendering |
| EP2898506B1 (en) | 2012-09-21 | 2018-01-17 | Dolby Laboratories Licensing Corporation | Layered approach to spatial audio coding |
| EP2743922A1 (en) | 2012-12-12 | 2014-06-18 | Thomson Licensing | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field |
| US9959875B2 (en) | 2013-03-01 | 2018-05-01 | Qualcomm Incorporated | Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams |
-
2014
- 2014-02-27 US US14/192,819 patent/US9959875B2/en active Active
- 2014-02-27 US US14/192,829 patent/US9685163B2/en not_active Expired - Fee Related
- 2014-02-28 BR BR112015020892A patent/BR112015020892A2/pt not_active IP Right Cessation
- 2014-02-28 HU HUE14713289A patent/HUE045446T2/hu unknown
- 2014-02-28 ES ES14713289T patent/ES2738490T3/es active Active
- 2014-02-28 EP EP14711375.7A patent/EP2962297B1/en active Active
- 2014-02-28 JP JP2015560352A patent/JP2016510905A/ja not_active Ceased
- 2014-02-28 EP EP14713289.8A patent/EP2962298B1/en active Active
- 2014-02-28 KR KR1020157026859A patent/KR20150123310A/ko not_active Ceased
- 2014-02-28 JP JP2015560355A patent/JP2016513811A/ja active Pending
- 2014-02-28 WO PCT/US2014/019468 patent/WO2014134472A2/en not_active Ceased
- 2014-02-28 WO PCT/US2014/019446 patent/WO2014134462A2/en not_active Ceased
- 2014-02-28 CN CN201480011198.1A patent/CN105027199B/zh active Active
- 2014-02-28 KR KR1020157026860A patent/KR101854964B1/ko not_active Expired - Fee Related
- 2014-02-28 CN CN201480011287.6A patent/CN105027200B/zh active Active
- 2014-03-03 TW TW103107128A patent/TWI603631B/zh not_active IP Right Cessation
- 2014-03-03 TW TW103107142A patent/TWI583210B/zh not_active IP Right Cessation
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
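| CN1942931A (zh) * | 2004-04-21 | 2007-04-04 | Dolby Laboratories Licensing Corporation | Audio bitstream format in which the bitstream syntax is described by an ordered traversal of a tree hierarchy data structure |
| EP2450880A1 (en) * | 2010-11-05 | 2012-05-09 | Thomson Licensing | Data structure for Higher Order Ambisonics audio data |
| CN103250207A (zh) * | 2010-11-05 | 2013-08-14 | Thomson Licensing | Data structure for Higher Order Ambisonics audio data |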
Non-Patent Citations (1)
| Title |
|---|
| "Multichannel Audio Coding Based on Minimum Audible Angles"; ADRIEN DANIEL ET AL; PROCEEDINGS OF 40TH INTERNATIONAL CONFERENCE: SPATIAL AUDIO: SENSE THE SOUND OF SPACE; 2010-01-01; page 9, sections 12, 14 * |
Also Published As
| Publication number | Publication date |
|---|---|
| EP2962297B1 (en) | 2019-06-05 |
| KR20150123311A (ko) | 2015-11-03 |
| HUE045446T2 (hu) | 2019-12-30 |
| US9685163B2 (en) | 2017-06-20 |
| CN105027200A (zh) | 2015-11-04 |
| TW201503712A (zh) | 2015-01-16 |
| EP2962298A2 (en) | 2016-01-06 |
| WO2014134472A3 (en) | 2015-03-19 |
| ES2738490T3 (es) | 2020-01-23 |
| BR112015020892A2 (pt) | 2017-07-18 |
| KR101854964B1 (ko) | 2018-05-04 |
| TW201446016A (zh) | 2014-12-01 |
| TWI603631B (zh) | 2017-10-21 |
| WO2014134462A3 (en) | 2014-11-13 |
| JP2016513811A (ja) | 2016-05-16 |
| US9959875B2 (en) | 2018-05-01 |
| JP2016510905A (ja) | 2016-04-11 |
| WO2014134472A2 (en) | 2014-09-04 |
| KR20150123310A (ko) | 2015-11-03 |
| EP2962298B1 (en) | 2019-04-24 |
| EP2962297A2 (en) | 2016-01-06 |
| WO2014134462A2 (en) | 2014-09-04 |
| CN105027200B (zh) | 2019-04-09 |
| US20140249827A1 (en) | 2014-09-04 |
| TWI583210B (zh) | 2017-05-11 |
| US20140247946A1 (en) | 2014-09-04 |
| CN105027199A (zh) | 2015-11-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN105027199B (zh) | | Specifying spherical harmonic and/or higher order ambisonics coefficients in bitstreams |
| US20220030372A1 (en) | | Reordering Of Audio Objects In The Ambisonics Domain |
| US9384741B2 (en) | | Binauralization of rotated higher order ambisonics |
| EP3165001A1 (en) | | Reducing correlation between higher order ambisonic (hoa) background channels |
| WO2016033480A2 (en) | | Intermediate compression for higher order ambisonic audio data |
| WO2015175998A1 (en) | | Spatial relation coding for higher order ambisonic coefficients |
| HK1215752B (zh) | | Compression of decomposed representations of a sound field |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | C06 | Publication | |
| | PB01 | Publication | |
| | C10 | Entry into substantive examination | |
| | SE01 | Entry into force of request for substantive examination | |
| | GR01 | Patent grant | |