CN106471576B - 高阶立体混响系数的闭环量化 - Google Patents
高阶立体混响系数的闭环量化 Download PDFInfo
- Publication number
- CN106471576B CN106471576B CN201580025054.6A CN201580025054A CN106471576B CN 106471576 B CN106471576 B CN 106471576B CN 201580025054 A CN201580025054 A CN 201580025054A CN 106471576 B CN106471576 B CN 106471576B
- Authority
- CN
- China
- Prior art keywords
- audio object
- quantization
- audio
- directional information
- vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/02—Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S5/00—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
- H04S5/005—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation of the pseudo five- or more-channel type, e.g. virtual surround
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Algebra (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Stereophonic System (AREA)
- Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)
- Circuit For Audible Band Transducer (AREA)
Applications Claiming Priority (9)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201461994788P | 2014-05-16 | 2014-05-16 | |
| US201461994493P | 2014-05-16 | 2014-05-16 | |
| US61/994,788 | 2014-05-16 | ||
| US61/994,493 | 2014-05-16 | ||
| US201462004082P | 2014-05-28 | 2014-05-28 | |
| US62/004,082 | 2014-05-28 | ||
| US14/712,638 | 2015-05-14 | ||
| US14/712,638 US9959876B2 (en) | 2014-05-16 | 2015-05-14 | Closed loop quantization of higher order ambisonic coefficients |
| PCT/US2015/031107 WO2015175953A1 (en) | 2014-05-16 | 2015-05-15 | Closed loop quantization of higher order ambisonic coefficients |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN106471576A CN106471576A (zh) | 2017-03-01 |
| CN106471576B true CN106471576B (zh) | 2019-08-27 |
Family
ID=53298601
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201580025054.6A Expired - Fee Related CN106471576B (zh) | 2014-05-16 | 2015-05-15 | 高阶立体混响系数的闭环量化 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US9959876B2 (cg-RX-API-DMAC7.html) |
| EP (1) | EP3143618B1 (cg-RX-API-DMAC7.html) |
| JP (1) | JP2017520785A (cg-RX-API-DMAC7.html) |
| KR (1) | KR20170010792A (cg-RX-API-DMAC7.html) |
| CN (1) | CN106471576B (cg-RX-API-DMAC7.html) |
| WO (1) | WO2015175953A1 (cg-RX-API-DMAC7.html) |
Families Citing this family (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9854377B2 (en) * | 2013-05-29 | 2017-12-26 | Qualcomm Incorporated | Interpolation for decomposed representations of a sound field |
| US9820073B1 (en) | 2017-05-10 | 2017-11-14 | Tls Corp. | Extracting a common signal from multiple audio signals |
| CN110019719B (zh) * | 2017-12-15 | 2023-04-25 | 微软技术许可有限责任公司 | 基于断言的问答 |
| US12056594B2 (en) * | 2018-06-27 | 2024-08-06 | International Business Machines Corporation | Low precision deep neural network enabled by compensation instructions |
| US11361776B2 (en) * | 2019-06-24 | 2022-06-14 | Qualcomm Incorporated | Coding scaled spatial components |
| US11538489B2 (en) | 2019-06-24 | 2022-12-27 | Qualcomm Incorporated | Correlating scene-based audio data for psychoacoustic audio coding |
| US12142285B2 (en) | 2019-06-24 | 2024-11-12 | Qualcomm Incorporated | Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding |
| US12308034B2 (en) | 2019-06-24 | 2025-05-20 | Qualcomm Incorporated | Performing psychoacoustic audio coding based on operating conditions |
| GB2615236A (en) * | 2020-09-25 | 2023-08-02 | Apple Inc | Higher order ambisonics encoding and decoding |
| CN115410585A (zh) * | 2021-05-29 | 2022-11-29 | 华为技术有限公司 | 音频数据编解码方法和相关装置及计算机可读存储介质 |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7502743B2 (en) | 2002-09-04 | 2009-03-10 | Microsoft Corporation | Multi-channel audio encoding and decoding with multi-channel transform selection |
| US7299190B2 (en) * | 2002-09-04 | 2007-11-20 | Microsoft Corporation | Quantization and inverse quantization for audio |
| US8781842B2 (en) * | 2006-03-07 | 2014-07-15 | Telefonaktiebolaget Lm Ericsson (Publ) | Scalable coding with non-casual predictive information in an enhancement layer |
| US7933770B2 (en) * | 2006-07-14 | 2011-04-26 | Siemens Audiologische Technik Gmbh | Method and device for coding audio data based on vector quantisation |
| CA2673624C (en) * | 2006-10-16 | 2014-08-12 | Johannes Hilpert | Apparatus and method for multi-channel parameter transformation |
| US20080232601A1 (en) * | 2007-03-21 | 2008-09-25 | Ville Pulkki | Method and apparatus for enhancement of audio reconstruction |
| RU2439721C2 (ru) | 2007-06-11 | 2012-01-10 | Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен | Аудиокодер для кодирования аудиосигнала, имеющего импульсоподобную и стационарную составляющие, способы кодирования, декодер, способ декодирования и кодированный аудиосигнал |
| CN102576533B (zh) * | 2009-08-14 | 2014-09-17 | Dts有限责任公司 | 面向对象的音频流系统 |
| EP2469741A1 (en) * | 2010-12-21 | 2012-06-27 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
| EP2673771B1 (en) | 2011-02-09 | 2016-06-01 | Telefonaktiebolaget LM Ericsson (publ) | Efficient encoding/decoding of audio signals |
| TWI672691B (zh) * | 2011-04-21 | 2019-09-21 | 南韓商三星電子股份有限公司 | 解碼方法 |
| US9269364B2 (en) * | 2011-11-02 | 2016-02-23 | Telefonaktiebolaget L M Ericsson (Publ) | Audio encoding/decoding based on an efficient representation of auto-regressive coefficients |
| US9761229B2 (en) * | 2012-07-20 | 2017-09-12 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for audio object clustering |
| US9854377B2 (en) | 2013-05-29 | 2017-12-26 | Qualcomm Incorporated | Interpolation for decomposed representations of a sound field |
| US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
-
2015
- 2015-05-14 US US14/712,638 patent/US9959876B2/en active Active
- 2015-05-15 CN CN201580025054.6A patent/CN106471576B/zh not_active Expired - Fee Related
- 2015-05-15 JP JP2016567848A patent/JP2017520785A/ja active Pending
- 2015-05-15 KR KR1020167034841A patent/KR20170010792A/ko not_active Withdrawn
- 2015-05-15 EP EP15727503.3A patent/EP3143618B1/en active Active
- 2015-05-15 WO PCT/US2015/031107 patent/WO2015175953A1/en not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| EP3143618A1 (en) | 2017-03-22 |
| US20150332681A1 (en) | 2015-11-19 |
| KR20170010792A (ko) | 2017-02-01 |
| WO2015175953A1 (en) | 2015-11-19 |
| US9959876B2 (en) | 2018-05-01 |
| CN106471576A (zh) | 2017-03-01 |
| EP3143618B1 (en) | 2019-11-13 |
| JP2017520785A (ja) | 2017-07-27 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN106663433B (zh) | 用于处理音频数据的方法和装置 | |
| CN105917408B (zh) | 指示用于译码向量的帧参数可重用性 | |
| CN106104680B (zh) | 将音频信道插入到声场的描述中 | |
| CN106471576B (zh) | 高阶立体混响系数的闭环量化 | |
| CN106575506B (zh) | 用于执行高阶立体混响音频数据的中间压缩的装置和方法 | |
| CN106471577B (zh) | 在高阶立体混响系数中的标量与向量之间进行确定 | |
| US10134403B2 (en) | Crossfading between higher order ambisonic signals | |
| CN106463121A (zh) | 较高阶立体混响信号压缩 | |
| CN106463129A (zh) | 选择码簿以用于译码从高阶立体混响音频信号分解的向量 | |
| EP3363213B1 (en) | Coding higher-order ambisonic coefficients during multiple transitions | |
| HK1232013B (zh) | 用於处理音频数据的方法和装置 | |
| HK1224073B (zh) | 指示用於译码向量的帧参数可重用性 | |
| HK1232013A1 (en) | Reducing correlation between higher order ambisonic (hoa) background channels |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20190827 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |