RU2643644C2 - Кодирование и декодирование аудиосигналов - Google Patents
Кодирование и декодирование аудиосигналов Download PDFInfo
- Publication number
- RU2643644C2 RU2643644C2 RU2015104074A RU2015104074A RU2643644C2 RU 2643644 C2 RU2643644 C2 RU 2643644C2 RU 2015104074 A RU2015104074 A RU 2015104074A RU 2015104074 A RU2015104074 A RU 2015104074A RU 2643644 C2 RU2643644 C2 RU 2643644C2
- Authority
- RU
- Russia
- Prior art keywords
- time
- frequency
- segments
- encoded
- audio
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title claims description 144
- 238000009877 rendering Methods 0.000 claims description 60
- 239000011159 matrix material Substances 0.000 claims description 25
- 238000000034 method Methods 0.000 claims description 24
- 230000004044 response Effects 0.000 claims description 15
- 230000002123 temporal effect Effects 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 3
- 239000000126 substance Substances 0.000 abstract 1
- 239000000203 mixture Substances 0.000 description 113
- 238000013459 approach Methods 0.000 description 50
- 238000005457 optimization Methods 0.000 description 10
- 230000008569 process Effects 0.000 description 10
- 238000004891 communication Methods 0.000 description 9
- 230000000875 corresponding effect Effects 0.000 description 7
- 230000008901 benefit Effects 0.000 description 6
- 230000008447 perception Effects 0.000 description 6
- 238000012545 processing Methods 0.000 description 6
- 238000013507 mapping Methods 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 230000035807 sensation Effects 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 230000006978 adaptation Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000007423 decrease Effects 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 238000013139 quantization Methods 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000012937 correction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
- G10L19/265—Pre-filtering, e.g. high frequency emphasis prior to encoding
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201261669197P | 2012-07-09 | 2012-07-09 | |
| US61/669,197 | 2012-07-09 | ||
| PCT/IB2013/055628 WO2014009878A2 (en) | 2012-07-09 | 2013-07-09 | Encoding and decoding of audio signals |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| RU2015104074A RU2015104074A (ru) | 2016-08-27 |
| RU2643644C2 true RU2643644C2 (ru) | 2018-02-02 |
Family
ID=49170767
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| RU2015104074A RU2643644C2 (ru) | 2012-07-09 | 2013-07-09 | Кодирование и декодирование аудиосигналов |
Country Status (9)
| Country | Link |
|---|---|
| US (1) | US9478228B2 (enExample) |
| EP (2) | EP3748632A1 (enExample) |
| JP (1) | JP6231093B2 (enExample) |
| CN (1) | CN104428835B (enExample) |
| BR (1) | BR112015000247B1 (enExample) |
| MX (1) | MX342150B (enExample) |
| RU (1) | RU2643644C2 (enExample) |
| WO (1) | WO2014009878A2 (enExample) |
| ZA (1) | ZA201500888B (enExample) |
Families Citing this family (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9674587B2 (en) * | 2012-06-26 | 2017-06-06 | Sonos, Inc. | Systems and methods for networked music playback including remote add to queue |
| US9489954B2 (en) * | 2012-08-07 | 2016-11-08 | Dolby Laboratories Licensing Corporation | Encoding and rendering of object based audio indicative of game audio content |
| BR112015029031B1 (pt) | 2013-05-24 | 2021-02-23 | Dolby International Ab | Método e codificador para codificar um vetor de parâmetros em umsistema de codificação de áudio, método e decodificador para decodificar umvetor de símbolos codificados por entropia em um sistema de decodificação deáudio, e meio de armazenamento legível por computador |
| US9774974B2 (en) | 2014-09-24 | 2017-09-26 | Electronics And Telecommunications Research Institute | Audio metadata providing apparatus and method, and multichannel audio data playback apparatus and method to support dynamic format conversion |
| TWI587286B (zh) | 2014-10-31 | 2017-06-11 | 杜比國際公司 | 音頻訊號之解碼和編碼的方法及系統、電腦程式產品、與電腦可讀取媒體 |
| AU2016269886B2 (en) * | 2015-06-02 | 2020-11-12 | Sony Corporation | Transmission device, transmission method, media processing device, media processing method, and reception device |
| US10693936B2 (en) * | 2015-08-25 | 2020-06-23 | Qualcomm Incorporated | Transporting coded audio data |
| US9961467B2 (en) * | 2015-10-08 | 2018-05-01 | Qualcomm Incorporated | Conversion from channel-based audio to HOA |
| US9854375B2 (en) * | 2015-12-01 | 2017-12-26 | Qualcomm Incorporated | Selection of coded next generation audio data for transport |
| KR102261905B1 (ko) * | 2016-03-15 | 2021-06-08 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | 음장 기술을 생성하기 위한 장치, 방법, 또는 컴퓨터 프로그램 |
| CN113242508B (zh) | 2017-03-06 | 2022-12-06 | 杜比国际公司 | 基于音频数据流渲染音频输出的方法、解码器系统和介质 |
| US9820073B1 (en) | 2017-05-10 | 2017-11-14 | Tls Corp. | Extracting a common signal from multiple audio signals |
| BR112021009306A2 (pt) | 2018-11-20 | 2021-08-10 | Sony Group Corporation | dispositivo e método de processamento de informações, e, programa. |
| GB2587614A (en) * | 2019-09-26 | 2021-04-07 | Nokia Technologies Oy | Audio encoding and audio decoding |
| US11930349B2 (en) * | 2020-11-24 | 2024-03-12 | Naver Corporation | Computer system for producing audio content for realizing customized being-there and method thereof |
| KR102500694B1 (ko) * | 2020-11-24 | 2023-02-16 | 네이버 주식회사 | 사용자 맞춤형 현장감 실현을 위한 오디오 콘텐츠를 제작하는 컴퓨터 시스템 및 그의 방법 |
| JP7536733B2 (ja) * | 2020-11-24 | 2024-08-20 | ネイバー コーポレーション | オーディオと関連してユーザカスタム型臨場感を実現するためのコンピュータシステムおよびその方法 |
| EP4320876A4 (en) * | 2021-04-08 | 2024-11-06 | Nokia Technologies Oy | SEPARATION OF SPATIAL AUDIO OBJECTS |
| CN115497485B (zh) * | 2021-06-18 | 2024-10-18 | 华为技术有限公司 | 三维音频信号编码方法、装置、编码器和系统 |
| JP7745100B2 (ja) * | 2021-11-02 | 2025-09-26 | 北京小米移動軟件有限公司 | 信号の符号化および復号化方法、装置、ユーザイクイップメント、ネットワーク側デバイス並びに記憶媒体 |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050058304A1 (en) * | 2001-05-04 | 2005-03-17 | Frank Baumgarte | Cue-based audio coding/decoding |
| WO2005098821A2 (en) * | 2004-04-05 | 2005-10-20 | Koninklijke Philips Electronics N.V. | Multi-channel encoder |
| US20070174062A1 (en) * | 2006-01-20 | 2007-07-26 | Microsoft Corporation | Complex-transform channel coding with extended-band frequency coding |
| US20110038423A1 (en) * | 2009-08-12 | 2011-02-17 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding/decoding multi-channel audio signal by using semantic information |
Family Cites Families (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9014377B2 (en) * | 2006-05-17 | 2015-04-21 | Creative Technology Ltd | Multichannel surround format conversion and generalized upmix |
| US8345899B2 (en) * | 2006-05-17 | 2013-01-01 | Creative Technology Ltd | Phase-amplitude matrixed surround decoder |
| UA94117C2 (ru) * | 2006-10-16 | 2011-04-11 | Долби Свиден Ав | Усовершенстованное кодирование и отображение параметров многоканального кодирования микшированных объектов |
| KR101102401B1 (ko) * | 2006-11-24 | 2012-01-05 | 엘지전자 주식회사 | 오브젝트 기반 오디오 신호의 부호화 및 복호화 방법과 그 장치 |
| CN101490745B (zh) * | 2006-11-24 | 2013-02-27 | Lg电子株式会社 | 用于编码和解码基于对象的音频信号的方法和装置 |
| JP2008252834A (ja) * | 2007-03-30 | 2008-10-16 | Toshiba Corp | 音声再生装置 |
| US8612237B2 (en) * | 2007-04-04 | 2013-12-17 | Apple Inc. | Method and apparatus for determining audio spatial quality |
| CA2702986C (en) * | 2007-10-17 | 2016-08-16 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio coding using downmix |
| CA2710560C (en) * | 2008-01-01 | 2015-10-27 | Lg Electronics Inc. | A method and an apparatus for processing an audio signal |
| KR101596504B1 (ko) * | 2008-04-23 | 2016-02-23 | 한국전자통신연구원 | 객체기반 오디오 컨텐츠의 생성/재생 방법 및 객체기반 오디오 서비스를 위한 파일 포맷 구조를 가진 데이터를 기록한 컴퓨터 판독 가능 기록 매체 |
| JPWO2010005050A1 (ja) * | 2008-07-11 | 2012-01-05 | 日本電気株式会社 | 信号分析装置、信号制御装置及びその方法と、プログラム |
| US8504184B2 (en) * | 2009-02-04 | 2013-08-06 | Panasonic Corporation | Combination device, telecommunication system, and combining method |
| KR101387902B1 (ko) * | 2009-06-10 | 2014-04-22 | 한국전자통신연구원 | 다객체 오디오 신호를 부호화하는 방법 및 부호화 장치, 복호화 방법 및 복호화 장치, 그리고 트랜스코딩 방법 및 트랜스코더 |
| SG177277A1 (en) * | 2009-06-24 | 2012-02-28 | Fraunhofer Ges Forschung | Audio signal decoder, method for decoding an audio signal and computer program using cascaded audio object processing stages |
| BR112012007138B1 (pt) * | 2009-09-29 | 2021-11-30 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Decodificador de sinal de áudio, codificador de sinal de áudio, método para prover uma representação de mescla ascendente de sinal, método para prover uma representação de mescla descendente de sinal e fluxo de bits usando um valor de parâmetro comum de correlação intra- objetos |
| KR101666465B1 (ko) * | 2010-07-22 | 2016-10-17 | 삼성전자주식회사 | 다채널 오디오 신호 부호화/복호화 장치 및 방법 |
| EP2686654A4 (en) * | 2011-03-16 | 2015-03-11 | Dts Inc | CODING AND PLAYING THREE-DIMENSIONAL AUDIOSPURES |
| KR20130093798A (ko) * | 2012-01-02 | 2013-08-23 | 한국전자통신연구원 | 다채널 신호 부호화 및 복호화 장치 및 방법 |
-
2013
- 2013-07-09 CN CN201380036886.9A patent/CN104428835B/zh active Active
- 2013-07-09 JP JP2015521121A patent/JP6231093B2/ja active Active
- 2013-07-09 MX MX2015000113A patent/MX342150B/es active IP Right Grant
- 2013-07-09 EP EP20182398.6A patent/EP3748632A1/en not_active Withdrawn
- 2013-07-09 RU RU2015104074A patent/RU2643644C2/ru active
- 2013-07-09 WO PCT/IB2013/055628 patent/WO2014009878A2/en not_active Ceased
- 2013-07-09 BR BR112015000247-1A patent/BR112015000247B1/pt active IP Right Grant
- 2013-07-09 US US14/413,234 patent/US9478228B2/en active Active
- 2013-07-09 EP EP13762579.4A patent/EP2870603B1/en active Active
-
2015
- 2015-02-06 ZA ZA2015/00888A patent/ZA201500888B/en unknown
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050058304A1 (en) * | 2001-05-04 | 2005-03-17 | Frank Baumgarte | Cue-based audio coding/decoding |
| WO2005098821A2 (en) * | 2004-04-05 | 2005-10-20 | Koninklijke Philips Electronics N.V. | Multi-channel encoder |
| US20070174062A1 (en) * | 2006-01-20 | 2007-07-26 | Microsoft Corporation | Complex-transform channel coding with extended-band frequency coding |
| US20110038423A1 (en) * | 2009-08-12 | 2011-02-17 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding/decoding multi-channel audio signal by using semantic information |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2014009878A3 (en) | 2014-03-13 |
| BR112015000247A2 (pt) | 2017-06-27 |
| BR112015000247B1 (pt) | 2021-08-03 |
| MX2015000113A (es) | 2015-08-10 |
| RU2015104074A (ru) | 2016-08-27 |
| JP6231093B2 (ja) | 2017-11-15 |
| EP2870603B1 (en) | 2020-09-30 |
| CN104428835B (zh) | 2017-10-31 |
| US9478228B2 (en) | 2016-10-25 |
| EP2870603A2 (en) | 2015-05-13 |
| EP3748632A1 (en) | 2020-12-09 |
| US20150142453A1 (en) | 2015-05-21 |
| CN104428835A (zh) | 2015-03-18 |
| WO2014009878A2 (en) | 2014-01-16 |
| MX342150B (es) | 2016-09-15 |
| ZA201500888B (en) | 2017-01-25 |
| JP2015527609A (ja) | 2015-09-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| RU2643644C2 (ru) | Кодирование и декодирование аудиосигналов | |
| RU2618383C2 (ru) | Кодирование и декодирование аудиообъектов | |
| JP6328662B2 (ja) | バイノーラルのオーディオ処理 | |
| US9761229B2 (en) | Systems, methods, apparatus, and computer-readable media for audio object clustering | |
| CN101479786B (zh) | 用于编码和解码基于对象的音频信号的方法和装置 | |
| JP4966981B2 (ja) | 空間キューを用いたマルチオブジェクト又はマルチチャネルオーディオ信号のレンダリング制御方法及びその装置 | |
| JP5081838B2 (ja) | オーディオ符号化及び復号 | |
| RU2608847C1 (ru) | Кодирование звуковых сцен | |
| RU2659497C2 (ru) | Управляемое модулем рендеринга пространственное повышающее микширование | |
| JP6888172B2 (ja) | 音場表現信号を符号化する方法及びデバイス | |
| JP5171622B2 (ja) | マルチチャンネルオーディオ信号の生成 | |
| JP2016530788A (ja) | 符号化表現に基づいて少なくとも4つのオーディオチャネル信号を提供するためのオーディオデコーダ、オーディオエンコーダ、方法、帯域幅拡張を用いた少なくとも4つのオーディオチャネル信号に基づいて符号化表現を提供するための方法およびコンピュータプログラム | |
| WO2007083958A1 (en) | Method and apparatus for decoding a signal | |
| JP2017535153A (ja) | オーディオ・エンコーダおよびデコーダ | |
| CN112823534A (zh) | 信号处理设备和方法以及程序 | |
| KR20070081735A (ko) | 오디오 신호의 인코딩/디코딩 방법 및 장치 |