RU2643644C2 - Encoding and decoding of audio signals - Google Patents
Encoding and decoding of audio signals
- Publication number
- RU2643644C2 (application RU2015104074A)
- Authority
- RU
- Russia
- Prior art keywords
- time
- frequency
- segments
- encoded
- audio
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title claims description 144
- 238000009877 rendering Methods 0.000 claims description 60
- 239000011159 matrix material Substances 0.000 claims description 25
- 238000000034 method Methods 0.000 claims description 24
- 230000004044 response Effects 0.000 claims description 15
- 230000002123 temporal effect Effects 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 3
- 239000000126 substance Substances 0.000 abstract 1
- 239000000203 mixture Substances 0.000 description 113
- 238000013459 approach Methods 0.000 description 50
- 238000005457 optimization Methods 0.000 description 10
- 230000008569 process Effects 0.000 description 10
- 238000004891 communication Methods 0.000 description 9
- 230000000875 corresponding effect Effects 0.000 description 7
- 230000008901 benefit Effects 0.000 description 6
- 230000008447 perception Effects 0.000 description 6
- 238000012545 processing Methods 0.000 description 6
- 238000013507 mapping Methods 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 230000035807 sensation Effects 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 230000006978 adaptation Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000007423 decrease Effects 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 238000013139 quantization Methods 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000012937 correction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
- G10L19/265—Pre-filtering, e.g. high frequency emphasis prior to encoding
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201261669197P | 2012-07-09 | 2012-07-09 | |
| US61/669,197 | 2012-07-09 | | |
| PCT/IB2013/055628 WO2014009878A2 (en) | 2012-07-09 | 2013-07-09 | Encoding and decoding of audio signals |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| RU2015104074A (ru) | 2016-08-27 |
| RU2643644C2 (ru) | 2018-02-02 |
Family
ID=49170767
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| RU2015104074A RU2643644C2 (ru) | 2012-07-09 | 2013-07-09 | Encoding and decoding of audio signals |
Country Status (9)
| Country | Link |
|---|---|
| US (1) | US9478228B2 |
| EP (2) | EP3748632A1 |
| JP (1) | JP6231093B2 |
| CN (1) | CN104428835B |
| BR (1) | BR112015000247B1 |
| MX (1) | MX342150B |
| RU (1) | RU2643644C2 |
| WO (1) | WO2014009878A2 |
| ZA (1) | ZA201500888B |
Families Citing this family (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9674587B2 (en) * | 2012-06-26 | 2017-06-06 | Sonos, Inc. | Systems and methods for networked music playback including remote add to queue |
| US9489954B2 (en) * | 2012-08-07 | 2016-11-08 | Dolby Laboratories Licensing Corporation | Encoding and rendering of object based audio indicative of game audio content |
| CA3251568A1 (en) | 2013-05-24 | 2025-02-24 | Dolby International Ab | Audio encoder and decoder |
| US9774974B2 (en) * | 2014-09-24 | 2017-09-26 | Electronics And Telecommunications Research Institute | Audio metadata providing apparatus and method, and multichannel audio data playback apparatus and method to support dynamic format conversion |
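| TWI587286B (zh) | 2014-10-31 | 2017-06-11 | 杜比國際公司 | Method and system for decoding and encoding audio signals, computer program product, and computer-readable medium |
| CN113242448B (zh) | 2015-06-02 | 2023-07-14 | 索尼公司 | Transmitting device and method, media processing device and method, and receiving device |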
| US10693936B2 (en) | 2015-08-25 | 2020-06-23 | Qualcomm Incorporated | Transporting coded audio data |
| US9961467B2 (en) * | 2015-10-08 | 2018-05-01 | Qualcomm Incorporated | Conversion from channel-based audio to HOA |
| US9854375B2 (en) * | 2015-12-01 | 2017-12-26 | Qualcomm Incorporated | Selection of coded next generation audio data for transport |
| KR102261905B1 (ko) | 2016-03-15 | 2021-06-08 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | Apparatus, method, or computer program for generating a sound field description |
| EP3566473B8 (en) | 2017-03-06 | 2022-06-15 | Dolby International AB | Integrated reconstruction and rendering of audio signals |
| US9820073B1 (en) | 2017-05-10 | 2017-11-14 | Tls Corp. | Extracting a common signal from multiple audio signals |
| CN113016032B (zh) | 2018-11-20 | 2024-08-20 | 索尼集团公司 | Information processing device and method, and program |
| GB2587614A (en) * | 2019-09-26 | 2021-04-07 | Nokia Technologies Oy | Audio encoding and audio decoding |
| US11930348B2 (en) * | 2020-11-24 | 2024-03-12 | Naver Corporation | Computer system for realizing customized being-there in association with audio and method thereof |
| US11930349B2 (en) * | 2020-11-24 | 2024-03-12 | Naver Corporation | Computer system for producing audio content for realizing customized being-there and method thereof |
| KR102505249B1 (ko) * | 2020-11-24 | 2023-03-03 | 네이버 주식회사 | Computer system for transmitting audio content for realizing customized being-there and method thereof |
| WO2022214730A1 (en) * | 2021-04-08 | 2022-10-13 | Nokia Technologies Oy | Separating spatial audio objects |
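| CN115497485B (zh) * | 2021-06-18 | 2024-10-18 | 华为技术有限公司 | Three-dimensional audio signal encoding method, apparatus, encoder, and system |
| JP7745100B2 (ja) * | 2021-11-02 | 2025-09-26 | 北京小米移動軟件有限公司 | Signal encoding and decoding method, apparatus, user equipment, network-side device, and storage medium |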
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050058304A1 (en) * | 2001-05-04 | 2005-03-17 | Frank Baumgarte | Cue-based audio coding/decoding |
| WO2005098821A2 (en) * | 2004-04-05 | 2005-10-20 | Koninklijke Philips Electronics N.V. | Multi-channel encoder |
| US20070174062A1 (en) * | 2006-01-20 | 2007-07-26 | Microsoft Corporation | Complex-transform channel coding with extended-band frequency coding |
| US20110038423A1 (en) * | 2009-08-12 | 2011-02-17 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding/decoding multi-channel audio signal by using semantic information |
Family Cites Families (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9014377B2 (en) * | 2006-05-17 | 2015-04-21 | Creative Technology Ltd | Multichannel surround format conversion and generalized upmix |
| US8345899B2 (en) * | 2006-05-17 | 2013-01-01 | Creative Technology Ltd | Phase-amplitude matrixed surround decoder |
| EP2068307B1 (en) * | 2006-10-16 | 2011-12-07 | Dolby International AB | Enhanced coding and parameter representation of multichannel downmixed object coding |
| JP5394931B2 (ja) * | 2006-11-24 | 2014-01-22 | エルジー エレクトロニクス インコーポレイティド | Method and apparatus for decoding an object-based audio signal |
| CN101490745B (zh) * | 2006-11-24 | 2013-02-27 | Lg电子株式会社 | Method and apparatus for encoding and decoding object-based audio signals |
| JP2008252834A (ja) * | 2007-03-30 | 2008-10-16 | Toshiba Corp | Audio playback device |
| US8612237B2 (en) * | 2007-04-04 | 2013-12-17 | Apple Inc. | Method and apparatus for determining audio spatial quality |
| KR101290394B1 (ko) * | 2007-10-17 | 2013-07-26 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | Audio coding using downmix |
| CA2710560C (en) * | 2008-01-01 | 2015-10-27 | Lg Electronics Inc. | A method and an apparatus for processing an audio signal |
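| KR101596504B1 (ko) * | 2008-04-23 | 2016-02-23 | 한국전자통신연구원 | Method for generating and playing object-based audio content, and computer-readable recording medium storing data with a file format structure for an object-based audio service |
| JPWO2010005050A1 (ja) * | 2008-07-11 | 2012-01-05 | 日本電気株式会社 | Signal analysis device, signal control device, method thereof, and program |
| JP5377505B2 (ja) * | 2009-02-04 | 2013-12-25 | パナソニック株式会社 | Combining device, telecommunication system, and combining method |
| KR101387902B1 (ko) * | 2009-06-10 | 2014-04-22 | 한국전자통신연구원 | Encoding method and encoding apparatus for a multi-object audio signal, decoding method and decoding apparatus, and transcoding method and transcoder |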
| SG177277A1 (en) * | 2009-06-24 | 2012-02-28 | Fraunhofer Ges Forschung | Audio signal decoder, method for decoding an audio signal and computer program using cascaded audio object processing stages |
| PL2483887T3 (pl) * | 2009-09-29 | 2018-02-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | MPEG-SAOC audio signal decoder, method for providing an upmix signal representation using MPEG-SAOC decoding, and computer program using a time/frequency-dependent common inter-object correlation parameter value |
| KR101666465B1 (ko) * | 2010-07-22 | 2016-10-17 | 삼성전자주식회사 | Apparatus and method for encoding/decoding multi-channel audio signals |
| KR20140027954A (ko) * | 2011-03-16 | 2014-03-07 | 디티에스, 인코포레이티드 | Encoding and reproduction of three-dimensional audio soundtracks |
| KR20130093798A (ko) * | 2012-01-02 | 2013-08-23 | 한국전자통신연구원 | Apparatus and method for encoding and decoding multi-channel signals |
-
2013
- 2013-07-09 JP JP2015521121A patent/JP6231093B2/ja active Active
- 2013-07-09 WO PCT/IB2013/055628 patent/WO2014009878A2/en not_active Ceased
- 2013-07-09 US US14/413,234 patent/US9478228B2/en active Active
- 2013-07-09 BR BR112015000247-1A patent/BR112015000247B1/pt active IP Right Grant
- 2013-07-09 EP EP20182398.6A patent/EP3748632A1/en not_active Withdrawn
- 2013-07-09 MX MX2015000113A patent/MX342150B/es active IP Right Grant
- 2013-07-09 EP EP13762579.4A patent/EP2870603B1/en active Active
- 2013-07-09 RU RU2015104074A patent/RU2643644C2/ru active
- 2013-07-09 CN CN201380036886.9A patent/CN104428835B/zh active Active
-
2015
- 2015-02-06 ZA ZA2015/00888A patent/ZA201500888B/en unknown
Also Published As
| Publication number | Publication date |
|---|---|
| EP2870603B1 (en) | 2020-09-30 |
| CN104428835A (zh) | 2015-03-18 |
| RU2015104074A (ru) | 2016-08-27 |
| ZA201500888B (en) | 2017-01-25 |
| US9478228B2 (en) | 2016-10-25 |
| EP3748632A1 (en) | 2020-12-09 |
| US20150142453A1 (en) | 2015-05-21 |
| WO2014009878A2 (en) | 2014-01-16 |
| BR112015000247A2 (pt) | 2017-06-27 |
| MX2015000113A (es) | 2015-08-10 |
| WO2014009878A3 (en) | 2014-03-13 |
| BR112015000247B1 (pt) | 2021-08-03 |
| MX342150B (es) | 2016-09-15 |
| EP2870603A2 (en) | 2015-05-13 |
| JP6231093B2 (ja) | 2017-11-15 |
| JP2015527609A (ja) | 2015-09-17 |
| CN104428835B (zh) | 2017-10-31 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| RU2643644C2 (ru) | Encoding and decoding of audio signals | |
| RU2618383C2 (ru) | Encoding and decoding of audio objects | |
| JP6328662B2 (ja) | Binaural audio processing | |
| US9761229B2 (en) | Systems, methods, apparatus, and computer-readable media for audio object clustering | |
| CN101479786B (zh) | Method and apparatus for encoding and decoding object-based audio signals | |
| JP4966981B2 (ja) | Method and apparatus for controlling rendering of multi-object or multi-channel audio signals using spatial cues | |
| RU2608847C1 (ru) | Encoding of audio scenes | |
| RU2659497C2 (ru) | Renderer-controlled spatial upmix | |
| JP6888172B2 (ja) | Method and device for encoding a sound field representation signal | |
| JP2016530788A (ja) | Audio decoder, audio encoder, and method for providing at least four audio channel signals based on an encoded representation; method and computer program for providing an encoded representation based on at least four audio channel signals using bandwidth extension | |
| WO2007083958A1 (en) | Method and apparatus for decoding a signal | |
| CN112823534A (zh) | Signal processing device and method, and program | |
| KR20070081735A (ko) | Method and apparatus for encoding/decoding an audio signal | |