KR20170010792A - Closed loop quantization of higher order ambisonic coefficients - Google Patents
Closed loop quantization of higher order ambisonic coefficients
- Publication number
- KR20170010792A
- Authority
- KR
- South Korea
- Prior art keywords
- audio object
- audio
- quantization
- direction information
- information associated
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000013139 quantization Methods 0.000 title claims abstract description 175
- 238000000034 method Methods 0.000 claims abstract description 101
- 239000013598 vector Substances 0.000 claims description 257
- 239000011159 matrix material Substances 0.000 claims description 52
- 230000005236 sound signal Effects 0.000 claims description 15
- 230000006870 function Effects 0.000 description 20
- 230000008859 change Effects 0.000 description 17
- 238000003860 storage Methods 0.000 description 17
- 238000010586 diagram Methods 0.000 description 15
- 230000009467 reduction Effects 0.000 description 15
- 238000004458 analytical method Methods 0.000 description 14
- 238000000605 extraction Methods 0.000 description 14
- 238000009472 formulation Methods 0.000 description 14
- 239000000203 mixture Substances 0.000 description 14
- 238000000354 decomposition reaction Methods 0.000 description 13
- 230000003111 delayed effect Effects 0.000 description 12
- 230000002093 peripheral effect Effects 0.000 description 11
- 238000009877 rendering Methods 0.000 description 10
- 230000002123 temporal effect Effects 0.000 description 9
- 230000005540 biological transmission Effects 0.000 description 8
- 238000004364 calculation method Methods 0.000 description 7
- 230000007704 transition Effects 0.000 description 7
- 238000003491 array Methods 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 238000007906 compression Methods 0.000 description 4
- 230000006835 compression Effects 0.000 description 4
- 238000013500 data storage Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 239000000284 extract Substances 0.000 description 3
- 230000004075 alteration Effects 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 239000002131 composite material Substances 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000003032 molecular docking Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000008707 rearrangement Effects 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000005562 fading Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000004091 panning Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000011524 similarity measure Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/02—Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S5/00—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
- H04S5/005—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation of the pseudo five- or more-channel type, e.g. virtual surround
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Algebra (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Stereophonic System (AREA)
- Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)
- Circuit For Audible Band Transducer (AREA)
Applications Claiming Priority (9)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201461994493P | 2014-05-16 | 2014-05-16 | |
| US201461994788P | 2014-05-16 | 2014-05-16 | |
| US61/994,788 | 2014-05-16 | | |
| US61/994,493 | 2014-05-16 | | |
| US201462004082P | 2014-05-28 | 2014-05-28 | |
| US62/004,082 | 2014-05-28 | | |
| US14/712,638 US9959876B2 (en) | 2014-05-16 | 2015-05-14 | Closed loop quantization of higher order ambisonic coefficients |
| US14/712,638 | 2015-05-14 | | |
| PCT/US2015/031107 WO2015175953A1 (en) | 2014-05-16 | 2015-05-15 | Closed loop quantization of higher order ambisonic coefficients |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| KR20170010792A (ko) | 2017-02-01 |
Family
ID=53298601
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020167034841A (Withdrawn), published as KR20170010792A (ko) | 2014-05-16 | 2015-05-15 | Closed loop quantization of higher order ambisonic coefficients |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US9959876B2 (en) |
| EP (1) | EP3143618B1 (en) |
| JP (1) | JP2017520785A (en) |
| KR (1) | KR20170010792A (en) |
| CN (1) | CN106471576B (en) |
| WO (1) | WO2015175953A1 (en) |
Families Citing this family (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9716959B2 (en) * | 2013-05-29 | 2017-07-25 | Qualcomm Incorporated | Compensating for error in decomposed representations of sound fields |
| US9820073B1 (en) | 2017-05-10 | 2017-11-14 | Tls Corp. | Extracting a common signal from multiple audio signals |
| CN110019719B (zh) * | 2017-12-15 | 2023-04-25 | Microsoft Technology Licensing, LLC | Assertion-based question answering |
| US12056594B2 (en) * | 2018-06-27 | 2024-08-06 | International Business Machines Corporation | Low precision deep neural network enabled by compensation instructions |
| US12308034B2 (en) | 2019-06-24 | 2025-05-20 | Qualcomm Incorporated | Performing psychoacoustic audio coding based on operating conditions |
| US11538489B2 (en) | 2019-06-24 | 2022-12-27 | Qualcomm Incorporated | Correlating scene-based audio data for psychoacoustic audio coding |
| US11361776B2 (en) * | 2019-06-24 | 2022-06-14 | Qualcomm Incorporated | Coding scaled spatial components |
| US12142285B2 (en) | 2019-06-24 | 2024-11-12 | Qualcomm Incorporated | Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding |
| GB2615236A (en) * | 2020-09-25 | 2023-08-02 | Apple Inc | Higher order ambisonics encoding and decoding |
| CN115410585A (zh) * | 2021-05-29 | 2022-11-29 | Huawei Technologies Co., Ltd. | Audio data encoding and decoding method, related apparatus, and computer-readable storage medium |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7299190B2 (en) * | 2002-09-04 | 2007-11-20 | Microsoft Corporation | Quantization and inverse quantization for audio |
| US7502743B2 (en) | 2002-09-04 | 2009-03-10 | Microsoft Corporation | Multi-channel audio encoding and decoding with multi-channel transform selection |
| WO2007102782A2 (en) * | 2006-03-07 | 2007-09-13 | Telefonaktiebolaget Lm Ericsson (Publ) | Methods and arrangements for audio coding and decoding |
| US7933770B2 (en) * | 2006-07-14 | 2011-04-26 | Siemens Audiologische Technik Gmbh | Method and device for coding audio data based on vector quantisation |
| MY144273A (en) * | 2006-10-16 | 2011-08-29 | Fraunhofer Ges Forschung | Apparatus and method for multi-channel parameter transformation |
| US20080232601A1 (en) * | 2007-03-21 | 2008-09-25 | Ville Pulkki | Method and apparatus for enhancement of audio reconstruction |
| CA2691993C (en) | 2007-06-11 | 2015-01-27 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder for encoding an audio signal having an impulse-like portion and stationary portion, encoding methods, decoder, decoding method, and encoded audio signal |
| JP5726874B2 (ja) * | 2009-08-14 | 2015-06-03 | DTS LLC | Object-oriented audio streaming system |
| EP2469741A1 (en) | 2010-12-21 | 2012-06-27 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
| EP2673771B1 (en) | 2011-02-09 | 2016-06-01 | Telefonaktiebolaget LM Ericsson (publ) | Efficient encoding/decoding of audio signals |
| BR122020023350B1 (pt) * | 2011-04-21 | 2021-04-20 | Samsung Electronics Co., Ltd | Quantization method |
| ES2657802T3 (es) * | 2011-11-02 | 2018-03-06 | Telefonaktiebolaget Lm Ericsson (Publ) | Audio decoding based on an efficient representation of autoregressive coefficients |
| US9761229B2 (en) * | 2012-07-20 | 2017-09-12 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for audio object clustering |
| US9716959B2 (en) | 2013-05-29 | 2017-07-25 | Qualcomm Incorporated | Compensating for error in decomposed representations of sound fields |
| US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
2015
- 2015-05-14: US application US14/712,638, publication US9959876B2 (en), status: Active
- 2015-05-15: CN application CN201580025054.6A, publication CN106471576B (zh), status: Expired - Fee Related
- 2015-05-15: EP application EP15727503.3A, publication EP3143618B1 (en), status: Active
- 2015-05-15: JP application JP2016567848A, publication JP2017520785A (ja), status: Pending
- 2015-05-15: WO application PCT/US2015/031107, publication WO2015175953A1 (en), status: Ceased
- 2015-05-15: KR application KR1020167034841A, publication KR20170010792A (ko), status: Withdrawn
Also Published As
| Publication number | Publication date |
|---|---|
| JP2017520785A (ja) | 2017-07-27 |
| CN106471576A (zh) | 2017-03-01 |
| CN106471576B (zh) | 2019-08-27 |
| EP3143618A1 (en) | 2017-03-22 |
| US9959876B2 (en) | 2018-05-01 |
| WO2015175953A1 (en) | 2015-11-19 |
| EP3143618B1 (en) | 2019-11-13 |
| US20150332681A1 (en) | 2015-11-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR101756612B1 (ko) | | Indicating frame parameter reusability for coding vectors |
| KR102032021B1 (ko) | | Coding vectors decomposed from higher-order ambisonics audio signals |
| KR101962000B1 (ko) | | Reducing correlation between higher order ambisonic (HOA) background channels |
| CN106104680B (zh) | | Inserting audio channels into descriptions of soundfields |
| US9847088B2 (en) | | Intermediate compression for higher order ambisonic audio data |
| KR102329373B1 (ko) | | Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals |
| KR101825317B1 (ko) | | Determining between scalar and vector quantization in higher order ambisonic coefficients |
| EP3143618B1 (en) | | Closed loop quantization of higher order ambisonic coefficients |
| EP3143617B1 (en) | | Crossfading between higher order ambisonic signals |
| KR20170066400A (ko) | | Screen related adaptation of HOA content |
| US20150243292A1 (en) | | Order format signaling for higher-order ambisonic audio data |
| EP3363213B1 (en) | | Coding higher-order ambisonic coefficients during multiple transitions |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PA0105 | International application | Patent event date: 20161213; Patent event code: PA01051R01D; Comment text: International Patent Application |
| | PG1501 | Laying open of application | |
| | PC1203 | Withdrawal of no request for examination | |