CN111489758B - 解码装置、解码方法及存储介质 - Google Patents
解码装置、解码方法及存储介质 Download PDFInfo
- Publication number
- CN111489758B CN111489758B CN202010176142.9A CN202010176142A CN111489758B CN 111489758 B CN111489758 B CN 111489758B CN 202010176142 A CN202010176142 A CN 202010176142A CN 111489758 B CN111489758 B CN 111489758B
- Authority
- CN
- China
- Prior art keywords
- priority information
- unit
- audio signal
- decoding
- channel
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- Mathematical Physics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010176142.9A CN111489758B (zh) | 2014-03-24 | 2015-03-16 | 解码装置、解码方法及存储介质 |
Applications Claiming Priority (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2014-060486 | 2014-03-24 | ||
| JP2014060486 | 2014-03-24 | ||
| JP2014-136633 | 2014-07-02 | ||
| JP2014136633A JP6439296B2 (ja) | 2014-03-24 | 2014-07-02 | 復号装置および方法、並びにプログラム |
| PCT/JP2015/001432 WO2015146057A1 (en) | 2014-03-24 | 2015-03-16 | Encoding device and encoding method, decoding device and decoding method, and program |
| CN202010176142.9A CN111489758B (zh) | 2014-03-24 | 2015-03-16 | 解码装置、解码方法及存储介质 |
| CN201580014248.6A CN106133828B (zh) | 2014-03-24 | 2015-03-16 | 编码装置和编码方法、解码装置和解码方法及存储介质 |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201580014248.6A Division CN106133828B (zh) | 2014-03-24 | 2015-03-16 | 编码装置和编码方法、解码装置和解码方法及存储介质 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN111489758A CN111489758A (zh) | 2020-08-04 |
| CN111489758B true CN111489758B (zh) | 2023-12-01 |
Family
ID=53039543
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010176142.9A Active CN111489758B (zh) | 2014-03-24 | 2015-03-16 | 解码装置、解码方法及存储介质 |
| CN201580014248.6A Active CN106133828B (zh) | 2014-03-24 | 2015-03-16 | 编码装置和编码方法、解码装置和解码方法及存储介质 |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201580014248.6A Active CN106133828B (zh) | 2014-03-24 | 2015-03-16 | 编码装置和编码方法、解码装置和解码方法及存储介质 |
Country Status (8)
| Country | Link |
|---|---|
| US (4) | US20180033440A1 (cg-RX-API-DMAC7.html) |
| EP (3) | EP4243016A3 (cg-RX-API-DMAC7.html) |
| JP (1) | JP6439296B2 (cg-RX-API-DMAC7.html) |
| KR (4) | KR20210111897A (cg-RX-API-DMAC7.html) |
| CN (2) | CN111489758B (cg-RX-API-DMAC7.html) |
| BR (1) | BR112016021407B1 (cg-RX-API-DMAC7.html) |
| RU (2) | RU2019112504A (cg-RX-API-DMAC7.html) |
| WO (1) | WO2015146057A1 (cg-RX-API-DMAC7.html) |
Families Citing this family (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP3059732B1 (en) * | 2013-10-17 | 2018-10-10 | Socionext Inc. | Audio decoding device |
| JP6904250B2 (ja) * | 2015-04-08 | 2021-07-14 | ソニーグループ株式会社 | 送信装置、送信方法、受信装置および受信方法 |
| WO2016163329A1 (ja) * | 2015-04-08 | 2016-10-13 | ソニー株式会社 | 送信装置、送信方法、受信装置および受信方法 |
| US10424307B2 (en) * | 2017-01-03 | 2019-09-24 | Nokia Technologies Oy | Adapting a distributed audio recording for end user free viewpoint monitoring |
| EP4054213A1 (en) * | 2017-03-06 | 2022-09-07 | Dolby International AB | Rendering in dependence on the number of loudspeaker channels |
| EP3618067B1 (en) * | 2017-04-26 | 2024-04-10 | Sony Group Corporation | Signal processing device, method, and program |
| US10885921B2 (en) * | 2017-07-07 | 2021-01-05 | Qualcomm Incorporated | Multi-stream audio coding |
| US10657974B2 (en) * | 2017-12-21 | 2020-05-19 | Qualcomm Incorporated | Priority information for higher order ambisonic audio data |
| US11270711B2 (en) | 2017-12-21 | 2022-03-08 | Qualcomm Incorproated | Higher order ambisonic audio data |
| GB2578715A (en) * | 2018-07-20 | 2020-05-27 | Nokia Technologies Oy | Controlling audio focus for spatial audio processing |
| JP7447798B2 (ja) * | 2018-10-16 | 2024-03-12 | ソニーグループ株式会社 | 信号処理装置および方法、並びにプログラム |
| CN111081226B (zh) * | 2018-10-18 | 2024-02-13 | 北京搜狗科技发展有限公司 | 语音识别解码优化方法及装置 |
| CN113016032B (zh) * | 2018-11-20 | 2024-08-20 | 索尼集团公司 | 信息处理装置和方法以及程序 |
| WO2021200260A1 (ja) * | 2020-04-01 | 2021-10-07 | ソニーグループ株式会社 | 信号処理装置および方法、並びにプログラム |
| MX2023002255A (es) * | 2020-09-03 | 2023-05-16 | Sony Group Corp | Dispositivo y método de procesamiento de señales, dispositivo y método de aprendizaje y programa. |
| DE112021005027T5 (de) * | 2020-09-25 | 2023-08-10 | Apple Inc. | Nahtloses skalierbares decodieren von kanälen, objekten und hoa-audioinhalt |
| CN112634914B (zh) * | 2020-12-15 | 2024-03-29 | 中国科学技术大学 | 基于短时谱一致性的神经网络声码器训练方法 |
| US11710491B2 (en) * | 2021-04-20 | 2023-07-25 | Tencent America LLC | Method and apparatus for space of interest of audio scene |
| CN114974273B (zh) * | 2021-08-10 | 2023-08-15 | 中移互联网有限公司 | 一种会议音频混音方法和装置 |
| CN114550732B (zh) * | 2022-04-15 | 2022-07-08 | 腾讯科技(深圳)有限公司 | 一种高频音频信号的编解码方法和相关装置 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1272259A (zh) * | 1997-06-10 | 2000-11-01 | 拉斯·古斯塔夫·里杰利德 | 采用频带复现增强源编码 |
| CN101529504A (zh) * | 2006-10-16 | 2009-09-09 | 弗劳恩霍夫应用研究促进协会 | 多通道参数转换的装置和方法 |
| CN102549655A (zh) * | 2009-08-14 | 2012-07-04 | Srs实验室有限公司 | 自适应成流音频对象的系统 |
| WO2013181272A2 (en) * | 2012-05-31 | 2013-12-05 | Dts Llc | Object-based audio system using vector base amplitude panning |
| CN103649706A (zh) * | 2011-03-16 | 2014-03-19 | Dts(英属维尔京群岛)有限公司 | 三维音频音轨的编码及再现 |
Family Cites Families (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6330644B1 (en) * | 1994-10-27 | 2001-12-11 | Canon Kabushiki Kaisha | Signal processor with a plurality of kinds of processors and a shared memory accessed through a versatile control means |
| JP3519722B2 (ja) * | 1997-03-17 | 2004-04-19 | 松下電器産業株式会社 | データ処理方法及びデータ処理装置 |
| US6230130B1 (en) * | 1998-05-18 | 2001-05-08 | U.S. Philips Corporation | Scalable mixing for speech streaming |
| JP2005292702A (ja) * | 2004-04-05 | 2005-10-20 | Kddi Corp | オーディオフレームに対するフェードイン/フェードアウト処理装置及びプログラム |
| US8787594B1 (en) * | 2005-01-28 | 2014-07-22 | Texas Instruments Incorporated | Multi-stream audio level controller |
| RU2383941C2 (ru) * | 2005-06-30 | 2010-03-10 | ЭлДжи ЭЛЕКТРОНИКС ИНК. | Способ и устройство для кодирования и декодирования аудиосигналов |
| US7974422B1 (en) * | 2005-08-25 | 2011-07-05 | Tp Lab, Inc. | System and method of adjusting the sound of multiple audio objects directed toward an audio output device |
| JP4396683B2 (ja) * | 2006-10-02 | 2010-01-13 | カシオ計算機株式会社 | 音声符号化装置、音声符号化方法、及び、プログラム |
| US8085786B2 (en) * | 2007-03-16 | 2011-12-27 | Qualcomm Incorporated | H-ARQ throughput optimization by prioritized decoding |
| FR2929466A1 (fr) * | 2008-03-28 | 2009-10-02 | France Telecom | Dissimulation d'erreur de transmission dans un signal numerique dans une structure de decodage hierarchique |
| CN102714038B (zh) * | 2009-11-20 | 2014-11-05 | 弗兰霍菲尔运输应用研究公司 | 用以基于下混信号表示型态而提供上混信号表示型态的装置、用以提供表示多声道音频信号的位流的装置、方法 |
| US9531761B2 (en) * | 2010-07-01 | 2016-12-27 | Broadcom Corporation | Method and system for prioritizing and scheduling services in an IP multimedia network |
| JP2012108451A (ja) * | 2010-10-18 | 2012-06-07 | Sony Corp | 音声処理装置および方法、並びにプログラム |
| US9025458B2 (en) * | 2012-10-23 | 2015-05-05 | Verizon Patent And Licensing Inc. | Reducing congestion of media delivery over a content delivery network |
| US9805725B2 (en) * | 2012-12-21 | 2017-10-31 | Dolby Laboratories Licensing Corporation | Object clustering for rendering object-based audio content based on perceptual criteria |
| US9860663B2 (en) * | 2013-01-15 | 2018-01-02 | Koninklijke Philips N.V. | Binaural audio processing |
| EP3059732B1 (en) * | 2013-10-17 | 2018-10-10 | Socionext Inc. | Audio decoding device |
| KR102160254B1 (ko) * | 2014-01-10 | 2020-09-25 | 삼성전자주식회사 | 액티브다운 믹스 방식을 이용한 입체 음향 재생 방법 및 장치 |
-
2014
- 2014-07-02 JP JP2014136633A patent/JP6439296B2/ja active Active
-
2015
- 2015-03-16 KR KR1020217028231A patent/KR20210111897A/ko not_active Ceased
- 2015-03-16 CN CN202010176142.9A patent/CN111489758B/zh active Active
- 2015-03-16 BR BR112016021407-2A patent/BR112016021407B1/pt active IP Right Grant
- 2015-03-16 EP EP23168474.7A patent/EP4243016A3/en active Pending
- 2015-03-16 US US15/127,182 patent/US20180033440A1/en not_active Abandoned
- 2015-03-16 EP EP20183981.8A patent/EP3745397B1/en active Active
- 2015-03-16 EP EP15719835.9A patent/EP3123470B1/en active Active
- 2015-03-16 WO PCT/JP2015/001432 patent/WO2015146057A1/en not_active Ceased
- 2015-03-16 CN CN201580014248.6A patent/CN106133828B/zh active Active
- 2015-03-16 RU RU2019112504A patent/RU2019112504A/ru unknown
- 2015-03-16 KR KR1020167021269A patent/KR102300062B1/ko active Active
- 2015-03-16 RU RU2016137197A patent/RU2689438C2/ru active
- 2015-03-16 KR KR1020237005472A patent/KR102741508B1/ko active Active
- 2015-03-16 KR KR1020247040609A patent/KR20250002792A/ko active Pending
-
2019
- 2019-12-24 US US16/726,755 patent/US20200135216A1/en not_active Abandoned
-
2021
- 2021-09-01 US US17/464,594 patent/US20210398546A1/en not_active Abandoned
-
2023
- 2023-10-24 US US18/493,363 patent/US20240055007A1/en active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1272259A (zh) * | 1997-06-10 | 2000-11-01 | 拉斯·古斯塔夫·里杰利德 | 采用频带复现增强源编码 |
| CN101529504A (zh) * | 2006-10-16 | 2009-09-09 | 弗劳恩霍夫应用研究促进协会 | 多通道参数转换的装置和方法 |
| CN102549655A (zh) * | 2009-08-14 | 2012-07-04 | Srs实验室有限公司 | 自适应成流音频对象的系统 |
| CN103649706A (zh) * | 2011-03-16 | 2014-03-19 | Dts(英属维尔京群岛)有限公司 | 三维音频音轨的编码及再现 |
| WO2013181272A2 (en) * | 2012-05-31 | 2013-12-05 | Dts Llc | Object-based audio system using vector base amplitude panning |
Also Published As
| Publication number | Publication date |
|---|---|
| KR102300062B1 (ko) | 2021-09-09 |
| EP3123470A1 (en) | 2017-02-01 |
| CN111489758A (zh) | 2020-08-04 |
| JP2015194666A (ja) | 2015-11-05 |
| KR20160136278A (ko) | 2016-11-29 |
| WO2015146057A1 (en) | 2015-10-01 |
| BR112016021407B1 (pt) | 2022-09-27 |
| EP4243016A3 (en) | 2023-11-08 |
| US20200135216A1 (en) | 2020-04-30 |
| EP3123470B1 (en) | 2020-08-12 |
| KR20210111897A (ko) | 2021-09-13 |
| CN106133828B (zh) | 2020-04-10 |
| EP3745397A1 (en) | 2020-12-02 |
| US20240055007A1 (en) | 2024-02-15 |
| RU2016137197A (ru) | 2018-03-21 |
| BR112016021407A2 (pt) | 2022-07-19 |
| KR20250002792A (ko) | 2025-01-07 |
| US20210398546A1 (en) | 2021-12-23 |
| CN106133828A (zh) | 2016-11-16 |
| RU2689438C2 (ru) | 2019-05-28 |
| KR102741508B1 (ko) | 2024-12-12 |
| EP4243016A2 (en) | 2023-09-13 |
| KR20230027329A (ko) | 2023-02-27 |
| JP6439296B2 (ja) | 2018-12-19 |
| RU2016137197A3 (cg-RX-API-DMAC7.html) | 2018-10-22 |
| EP3745397B1 (en) | 2023-06-07 |
| US20180033440A1 (en) | 2018-02-01 |
| RU2019112504A (ru) | 2019-05-06 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN111489758B (zh) | 解码装置、解码方法及存储介质 | |
| US8046214B2 (en) | Low complexity decoder for complex transform coding of multi-channel sound | |
| RU2555221C2 (ru) | Канальное кодирование на основе комплексного преобразования с частотным кодированием с расширенной полосой | |
| US8817991B2 (en) | Advanced encoding of multi-channel digital audio signals | |
| US7885819B2 (en) | Bitstream syntax for multi-process audio decoding | |
| RU2625444C2 (ru) | Система обработки аудио | |
| TWI657434B (zh) | 解碼壓縮高階保真立體音響表示之方法及裝置,及編碼壓縮高階保真立體音響表示之方法及裝置 | |
| CN114008705B (zh) | 基于操作条件执行心理声学音频编解码 | |
| US9230551B2 (en) | Audio encoder or decoder apparatus | |
| JP2025061919A (ja) | 情報処理装置および方法、並びにプログラム | |
| TW201606751A (zh) | 將高階保真立體音響信號表示之次頻帶內主導方向信號之方向編碼/解碼之方法及裝置 | |
| CN114008704A (zh) | 编码已缩放空间分量 | |
| JP2025540764A (ja) | パラメトリック空間オーディオ符号化 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |