JP7174081B2 - マルチチャンネル音声符号化 - Google Patents
マルチチャンネル音声符号化 Download PDFInfo
- Publication number
- JP7174081B2 JP7174081B2 JP2020571588A JP2020571588A JP7174081B2 JP 7174081 B2 JP7174081 B2 JP 7174081B2 JP 2020571588 A JP2020571588 A JP 2020571588A JP 2020571588 A JP2020571588 A JP 2020571588A JP 7174081 B2 JP7174081 B2 JP 7174081B2
- Authority
- JP
- Japan
- Prior art keywords
- itd
- parameter
- comparison
- channel
- stereo
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000005236 sound signal Effects 0.000 claims description 13
- 230000009466 transformation Effects 0.000 claims description 3
- 238000005311 autocorrelation function Methods 0.000 claims 1
- 238000004364 calculation method Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 230000002411 adverse Effects 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 238000000034 method Methods 0.000 description 6
- 238000003786 synthesis reaction Methods 0.000 description 6
- 238000001514 detection method Methods 0.000 description 5
- 238000013459 approach Methods 0.000 description 4
- 238000012937 correction Methods 0.000 description 4
- 125000004122 cyclic group Chemical group 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 238000005314 correlation function Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 230000001427 coherent effect Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000016507 interphase Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Description
1.ウインドウにおけるウインドウ処理されたDFTとDFTブロック11、12、21、22とを使用する、入力信号の時間-周波数変換
2.ITD検出及び補償ブロック20の周波数領域内のITD推定及び補償
3.比較及び空間パラメータ計算ブロック30のステレオパラメータ抽出及び比較パラメータ計算
4.ダウンミックスブロック40のダウンミキシング
5.IDFTブロック50における周波数-時間変換に続くウインドウ処理及びオーバーラップの追加
1.DFTブロック80のウインドウ処理されたDFT(複数)を用いる時間周波数変換
2.アップミキシング及び空間復元ブロック90における周波数領域の消失残差の予測
3.アップミキシング及び空間復元ブロック90における周波数領域でのアップミキシング
4.ITD合成ブロック100での周波数領域のITD合成
5.IDFTブロック112、122、及びウインドウブロック111、121での周波数-時間領域変換、ウインドウ処理及び重複の追加
[1] MPEG-4 High Efficiency Advanced Audio Coding (HE-AAC) v2
[2] Juergen Herre, FROM JOINT STEREO TO SPATIAL AUDIO CODING - RECENT PROGRESS AND STANDARDIZATION, Proc. of the 7th Int. Conference on digital Audio Effects (DAFX-04), Naples, Italy, October 5-8, 2004
[3] Christoph Tourney and Christof Faller, Improved Time Delay Analysis/Synthesis for Parametric Stereo Audio Coding, AES Convention Paper 6753, 2006
[4] Christof Faller and Frank Baumgarte, Binaural Cue Coding Part II: Schemes and Applications, IEEE Transactions on Speech and Audio Processing, Vol. 11, No. 6, November 2003
Claims (15)
- 前記少なくとも1つのITDパラメータ(ITDt)を抽出するために、前記分析ウインドウ(w(τ))内の前記少なくとも一対の前記チャンネルの前記音声信号の周波数変換(Lt,k;Rt,k)を用いるようにさらに構成される、請求項1に記載の比較装置。
- ルックアップテーブルに記憶された前記分析ウインドウの前記自己相関関数の前記正規化バージョンの補間によって前記関数を得るようにさらに構成される、請求項4に記載の比較装置。
- 前記少なくとも1つのサイドゲイン及び前記少なくとも1つの残差ゲインを、前記エネルギーと前記少なくとも一対のITD補償された周波数変換 (Lt,k,comp;Rt,k,comp)の内積とを用いて計算するようにさらに構成される、請求項7に記載の比較装置。
- 前記少なくとも1つの前記ダウンミックス信号を、少なくとも一対のITD補償された周波数変換に基づいて生成するようにさらに構成される、請求項1ないし11のいずれか1項に記載の比較装置。
- 前記少なくとも1つのダウンミックス信号、前記少なくとも1つのITDパラメータ、及び前記少なくとも1つの比較パラメータを符号化して、デコーダに送信するようにさらに構成される請求項11または請求項12に記載の前記比較装置を備える、マルチチャンネルエンコーダ。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022177073A JP2023017913A (ja) | 2018-06-22 | 2022-11-04 | マルチチャンネル音声符号化 |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP18179373.8A EP3588495A1 (en) | 2018-06-22 | 2018-06-22 | Multichannel audio coding |
EP18179373.8 | 2018-06-22 | ||
PCT/EP2019/066228 WO2019243434A1 (en) | 2018-06-22 | 2019-06-19 | Multichannel audio coding |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2022177073A Division JP2023017913A (ja) | 2018-06-22 | 2022-11-04 | マルチチャンネル音声符号化 |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2021528693A JP2021528693A (ja) | 2021-10-21 |
JP7174081B2 true JP7174081B2 (ja) | 2022-11-17 |
Family
ID=62750879
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2020571588A Active JP7174081B2 (ja) | 2018-06-22 | 2019-06-19 | マルチチャンネル音声符号化 |
JP2022177073A Pending JP2023017913A (ja) | 2018-06-22 | 2022-11-04 | マルチチャンネル音声符号化 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2022177073A Pending JP2023017913A (ja) | 2018-06-22 | 2022-11-04 | マルチチャンネル音声符号化 |
Country Status (13)
Country | Link |
---|---|
US (2) | US11978459B2 (ja) |
EP (2) | EP3588495A1 (ja) |
JP (2) | JP7174081B2 (ja) |
CN (1) | CN112424861B (ja) |
AR (1) | AR115600A1 (ja) |
AU (1) | AU2019291054B2 (ja) |
BR (1) | BR112020025552A2 (ja) |
CA (1) | CA3103875C (ja) |
MX (1) | MX2020013856A (ja) |
SG (1) | SG11202012655QA (ja) |
TW (1) | TWI726337B (ja) |
WO (1) | WO2019243434A1 (ja) |
ZA (1) | ZA202100230B (ja) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3588495A1 (en) | 2018-06-22 | 2020-01-01 | FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. | Multichannel audio coding |
JP7380838B2 (ja) * | 2020-03-09 | 2023-11-15 | 日本電信電話株式会社 | 音信号符号化方法、音信号復号方法、音信号符号化装置、音信号復号装置、プログラム及び記録媒体 |
BR112023006291A2 (pt) * | 2020-10-09 | 2023-05-09 | Fraunhofer Ges Forschung | Dispositivo, método ou programa de computador para processar uma cena de áudio codificada usando uma conversão de parâmetro |
US11818353B2 (en) * | 2021-05-13 | 2023-11-14 | Qualcomm Incorporated | Reduced complexity transforms for high bit-depth video coding |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017125562A1 (en) | 2016-01-22 | 2017-07-27 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatuses and methods for encoding or decoding a multi-channel audio signal using frame control synchronization |
WO2017153466A1 (en) | 2016-03-09 | 2017-09-14 | Telefonaktiebolaget Lm Ericsson (Publ) | A method and apparatus for increasing stability of an inter-channel time difference parameter |
WO2018086947A1 (en) | 2016-11-08 | 2018-05-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding or decoding a multichannel signal using a side gain and a residual gain |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5789689A (en) * | 1997-01-17 | 1998-08-04 | Doidic; Michel | Tube modeling programmable digital guitar amplification system |
AU2003281128A1 (en) * | 2002-07-16 | 2004-02-02 | Koninklijke Philips Electronics N.V. | Audio coding |
US7809579B2 (en) * | 2003-12-19 | 2010-10-05 | Telefonaktiebolaget Lm Ericsson (Publ) | Fidelity-optimized variable frame length encoding |
SE0402650D0 (sv) | 2004-11-02 | 2004-11-02 | Coding Tech Ab | Improved parametric stereo compatible coding of spatial audio |
EP1866911B1 (en) | 2005-03-30 | 2010-06-09 | Koninklijke Philips Electronics N.V. | Scalable multi-channel audio coding |
WO2007080211A1 (en) * | 2006-01-09 | 2007-07-19 | Nokia Corporation | Decoding of binaural audio signals |
CN101556799B (zh) * | 2009-05-14 | 2013-08-28 | 华为技术有限公司 | 一种音频解码方法和音频解码器 |
US9424852B2 (en) * | 2011-02-02 | 2016-08-23 | Telefonaktiebolaget Lm Ericsson (Publ) | Determining the inter-channel time difference of a multi-channel audio signal |
US10002614B2 (en) * | 2011-02-03 | 2018-06-19 | Telefonaktiebolaget Lm Ericsson (Publ) | Determining the inter-channel time difference of a multi-channel audio signal |
KR101580240B1 (ko) * | 2012-02-17 | 2016-01-04 | 후아웨이 테크놀러지 컴퍼니 리미티드 | 다채널 오디오 신호를 인코딩하는 파라메트릭 인코더 |
EP2834813B1 (en) * | 2012-04-05 | 2015-09-30 | Huawei Technologies Co., Ltd. | Multi-channel audio encoder and method for encoding a multi-channel audio signal |
TWI546799B (zh) * | 2013-04-05 | 2016-08-21 | 杜比國際公司 | 音頻編碼器及解碼器 |
MY195412A (en) * | 2013-07-22 | 2023-01-19 | Fraunhofer Ges Forschung | Multi-Channel Audio Decoder, Multi-Channel Audio Encoder, Methods, Computer Program and Encoded Audio Representation Using a Decorrelation of Rendered Audio Signals |
US9319819B2 (en) * | 2013-07-25 | 2016-04-19 | Etri | Binaural rendering method and apparatus for decoding multi channel audio |
CN117037810A (zh) * | 2013-09-12 | 2023-11-10 | 杜比国际公司 | 多声道音频内容的编码 |
EP3067889A1 (en) | 2015-03-09 | 2016-09-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method and apparatus for signal-adaptive transform kernel switching in audio coding |
EP3067886A1 (en) * | 2015-03-09 | 2016-09-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
EP3208800A1 (en) * | 2016-02-17 | 2017-08-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for stereo filing in multichannel coding |
EP3588495A1 (en) | 2018-06-22 | 2020-01-01 | FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. | Multichannel audio coding |
-
2018
- 2018-06-22 EP EP18179373.8A patent/EP3588495A1/en not_active Withdrawn
-
2019
- 2019-06-19 WO PCT/EP2019/066228 patent/WO2019243434A1/en active Application Filing
- 2019-06-19 AU AU2019291054A patent/AU2019291054B2/en active Active
- 2019-06-19 BR BR112020025552-1A patent/BR112020025552A2/pt unknown
- 2019-06-19 MX MX2020013856A patent/MX2020013856A/es unknown
- 2019-06-19 EP EP19732348.8A patent/EP3811357A1/en active Pending
- 2019-06-19 CN CN201980041829.7A patent/CN112424861B/zh active Active
- 2019-06-19 SG SG11202012655QA patent/SG11202012655QA/en unknown
- 2019-06-19 CA CA3103875A patent/CA3103875C/en active Active
- 2019-06-19 JP JP2020571588A patent/JP7174081B2/ja active Active
- 2019-06-21 AR ARP190101722A patent/AR115600A1/es active IP Right Grant
- 2019-06-21 TW TW108121651A patent/TWI726337B/zh active
-
2020
- 2020-12-15 US US17/122,403 patent/US11978459B2/en active Active
-
2021
- 2021-01-13 ZA ZA2021/00230A patent/ZA202100230B/en unknown
-
2022
- 2022-11-04 JP JP2022177073A patent/JP2023017913A/ja active Pending
-
2023
- 2023-09-08 US US18/464,030 patent/US20240112685A1/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017125562A1 (en) | 2016-01-22 | 2017-07-27 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatuses and methods for encoding or decoding a multi-channel audio signal using frame control synchronization |
WO2017153466A1 (en) | 2016-03-09 | 2017-09-14 | Telefonaktiebolaget Lm Ericsson (Publ) | A method and apparatus for increasing stability of an inter-channel time difference parameter |
WO2018086947A1 (en) | 2016-11-08 | 2018-05-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding or decoding a multichannel signal using a side gain and a residual gain |
Also Published As
Publication number | Publication date |
---|---|
WO2019243434A1 (en) | 2019-12-26 |
CN112424861B (zh) | 2024-04-16 |
CN112424861A (zh) | 2021-02-26 |
US20210098007A1 (en) | 2021-04-01 |
CA3103875C (en) | 2023-09-05 |
MX2020013856A (es) | 2021-03-25 |
JP2021528693A (ja) | 2021-10-21 |
AU2019291054A1 (en) | 2021-02-18 |
BR112020025552A2 (pt) | 2021-03-16 |
ZA202100230B (en) | 2022-07-27 |
TW202016923A (zh) | 2020-05-01 |
AR115600A1 (es) | 2021-02-03 |
KR20210021554A (ko) | 2021-02-26 |
TWI726337B (zh) | 2021-05-01 |
SG11202012655QA (en) | 2021-01-28 |
EP3588495A1 (en) | 2020-01-01 |
AU2019291054B2 (en) | 2022-04-07 |
JP2023017913A (ja) | 2023-02-07 |
CA3103875A1 (en) | 2019-12-26 |
EP3811357A1 (en) | 2021-04-28 |
US11978459B2 (en) | 2024-05-07 |
US20240112685A1 (en) | 2024-04-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11871205B2 (en) | Parametric stereo upmix apparatus, a parametric stereo decoder, a parametric stereo downmix apparatus, a parametric stereo encoder | |
JP7174081B2 (ja) | マルチチャンネル音声符号化 | |
JP7270096B2 (ja) | フレーム制御同期化を使用して多チャネル信号を符号化又は復号化する装置及び方法 | |
JP2023017913A5 (ja) | ||
EP2904609B1 (en) | Encoder, decoder and methods for backward compatible multi-resolution spatial-audio-object-coding | |
JP5604933B2 (ja) | ダウンミクス装置およびダウンミクス方法 | |
WO2010097748A1 (en) | Parametric stereo encoding and decoding | |
MX2014010098A (es) | Control de coherencia de fase para señales armonicas en codecs de audio perceptual. | |
KR20190085988A (ko) | 상관해제 필터들의 적응적 제어를 위한 방법 및 장치 | |
Lang et al. | Novel low complexity coherence estimation and synthesis algorithms for parametric stereo coding | |
KR102670634B1 (ko) | 멀티 채널 오디오 코딩 | |
RU2778832C2 (ru) | Многоканальное кодирование аудио |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20210222 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20220315 |
|
A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20220316 |
|
A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20220609 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20220907 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20221004 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20221104 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 7174081 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |