CN106471576B - 高阶立体混响系数的闭环量化 - Google Patents
高阶立体混响系数的闭环量化 Download PDFInfo
- Publication number
- CN106471576B CN106471576B CN201580025054.6A CN201580025054A CN106471576B CN 106471576 B CN106471576 B CN 106471576B CN 201580025054 A CN201580025054 A CN 201580025054A CN 106471576 B CN106471576 B CN 106471576B
- Authority
- CN
- China
- Prior art keywords
- audio object
- quantization
- audio
- directional information
- vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/02—Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S5/00—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
- H04S5/005—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation of the pseudo five- or more-channel type, e.g. virtual surround
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- General Physics & Mathematics (AREA)
- Algebra (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Stereophonic System (AREA)
- Circuit For Audible Band Transducer (AREA)
- Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)
Applications Claiming Priority (9)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201461994493P | 2014-05-16 | 2014-05-16 | |
| US201461994788P | 2014-05-16 | 2014-05-16 | |
| US61/994,493 | 2014-05-16 | ||
| US61/994,788 | 2014-05-16 | ||
| US201462004082P | 2014-05-28 | 2014-05-28 | |
| US62/004,082 | 2014-05-28 | ||
| US14/712,638 | 2015-05-14 | ||
| US14/712,638 US9959876B2 (en) | 2014-05-16 | 2015-05-14 | Closed loop quantization of higher order ambisonic coefficients |
| PCT/US2015/031107 WO2015175953A1 (en) | 2014-05-16 | 2015-05-15 | Closed loop quantization of higher order ambisonic coefficients |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN106471576A CN106471576A (zh) | 2017-03-01 |
| CN106471576B true CN106471576B (zh) | 2019-08-27 |
Family
ID=53298601
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201580025054.6A Expired - Fee Related CN106471576B (zh) | 2014-05-16 | 2015-05-15 | 高阶立体混响系数的闭环量化 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US9959876B2 (enExample) |
| EP (1) | EP3143618B1 (enExample) |
| JP (1) | JP2017520785A (enExample) |
| KR (1) | KR20170010792A (enExample) |
| CN (1) | CN106471576B (enExample) |
| WO (1) | WO2015175953A1 (enExample) |
Families Citing this family (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9769586B2 (en) * | 2013-05-29 | 2017-09-19 | Qualcomm Incorporated | Performing order reduction with respect to higher order ambisonic coefficients |
| US9820073B1 (en) | 2017-05-10 | 2017-11-14 | Tls Corp. | Extracting a common signal from multiple audio signals |
| CN110019719B (zh) * | 2017-12-15 | 2023-04-25 | 微软技术许可有限责任公司 | 基于断言的问答 |
| US12056594B2 (en) * | 2018-06-27 | 2024-08-06 | International Business Machines Corporation | Low precision deep neural network enabled by compensation instructions |
| US11361776B2 (en) * | 2019-06-24 | 2022-06-14 | Qualcomm Incorporated | Coding scaled spatial components |
| US12308034B2 (en) | 2019-06-24 | 2025-05-20 | Qualcomm Incorporated | Performing psychoacoustic audio coding based on operating conditions |
| US11538489B2 (en) | 2019-06-24 | 2022-12-27 | Qualcomm Incorporated | Correlating scene-based audio data for psychoacoustic audio coding |
| US12142285B2 (en) | 2019-06-24 | 2024-11-12 | Qualcomm Incorporated | Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding |
| GB2615236A (en) * | 2020-09-25 | 2023-08-02 | Apple Inc | Higher order ambisonics encoding and decoding |
| CN115410585A (zh) * | 2021-05-29 | 2022-11-29 | 华为技术有限公司 | 音频数据编解码方法和相关装置及计算机可读存储介质 |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7502743B2 (en) | 2002-09-04 | 2009-03-10 | Microsoft Corporation | Multi-channel audio encoding and decoding with multi-channel transform selection |
| US7299190B2 (en) * | 2002-09-04 | 2007-11-20 | Microsoft Corporation | Quantization and inverse quantization for audio |
| WO2007102782A2 (en) * | 2006-03-07 | 2007-09-13 | Telefonaktiebolaget Lm Ericsson (Publ) | Methods and arrangements for audio coding and decoding |
| US7933770B2 (en) * | 2006-07-14 | 2011-04-26 | Siemens Audiologische Technik Gmbh | Method and device for coding audio data based on vector quantisation |
| EP2082397B1 (en) * | 2006-10-16 | 2011-12-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for multi -channel parameter transformation |
| US20080232601A1 (en) * | 2007-03-21 | 2008-09-25 | Ville Pulkki | Method and apparatus for enhancement of audio reconstruction |
| JP5686369B2 (ja) | 2007-06-11 | 2015-03-18 | フラウンホッファー−ゲゼルシャフト ツァー フェーデルング デア アンゲバンテン フォルシュング エー ファー | インパルス状の部分と定常部分とを有するオーディオ信号を符号化するオーディオ符号器、符号化方法、復号器、および復号化方法 |
| CN102549655B (zh) * | 2009-08-14 | 2014-09-24 | Dts有限责任公司 | 自适应成流音频对象的系统 |
| EP2469741A1 (en) | 2010-12-21 | 2012-06-27 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
| US9280980B2 (en) | 2011-02-09 | 2016-03-08 | Telefonaktiebolaget L M Ericsson (Publ) | Efficient encoding/decoding of audio signals |
| MY202459A (en) * | 2011-04-21 | 2024-04-30 | Samsung Electronics Co Ltd | Method of quantizing linear predictive coding coefficients, sound encoding method, method of de-quantizing linear predictive coding coefficients, sound decoding method, and recording medium and electronic device therefor |
| US9269364B2 (en) * | 2011-11-02 | 2016-02-23 | Telefonaktiebolaget L M Ericsson (Publ) | Audio encoding/decoding based on an efficient representation of auto-regressive coefficients |
| US9761229B2 (en) * | 2012-07-20 | 2017-09-12 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for audio object clustering |
| US9769586B2 (en) | 2013-05-29 | 2017-09-19 | Qualcomm Incorporated | Performing order reduction with respect to higher order ambisonic coefficients |
| US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
-
2015
- 2015-05-14 US US14/712,638 patent/US9959876B2/en active Active
- 2015-05-15 EP EP15727503.3A patent/EP3143618B1/en active Active
- 2015-05-15 WO PCT/US2015/031107 patent/WO2015175953A1/en not_active Ceased
- 2015-05-15 JP JP2016567848A patent/JP2017520785A/ja active Pending
- 2015-05-15 KR KR1020167034841A patent/KR20170010792A/ko not_active Withdrawn
- 2015-05-15 CN CN201580025054.6A patent/CN106471576B/zh not_active Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| JP2017520785A (ja) | 2017-07-27 |
| WO2015175953A1 (en) | 2015-11-19 |
| US20150332681A1 (en) | 2015-11-19 |
| US9959876B2 (en) | 2018-05-01 |
| KR20170010792A (ko) | 2017-02-01 |
| EP3143618A1 (en) | 2017-03-22 |
| EP3143618B1 (en) | 2019-11-13 |
| CN106471576A (zh) | 2017-03-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN106663433B (zh) | 用于处理音频数据的方法和装置 | |
| CN105917408B (zh) | 指示用于译码向量的帧参数可重用性 | |
| CN106104680B (zh) | 将音频信道插入到声场的描述中 | |
| CN106471576B (zh) | 高阶立体混响系数的闭环量化 | |
| CN106575506B (zh) | 用于执行高阶立体混响音频数据的中间压缩的装置和方法 | |
| CN106463121A (zh) | 较高阶立体混响信号压缩 | |
| US10134403B2 (en) | Crossfading between higher order ambisonic signals | |
| EP3363213B1 (en) | Coding higher-order ambisonic coefficients during multiple transitions | |
| HK1232013B (zh) | 用於处理音频数据的方法和装置 | |
| HK1224073B (zh) | 指示用於译码向量的帧参数可重用性 | |
| HK1232013A1 (en) | Reducing correlation between higher order ambisonic (hoa) background channels |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20190827 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |