TWI703558B - 解碼聲音或音場的壓縮高階環境立體聲聲音表徵的方法及設備 - Google Patents

解碼聲音或音場的壓縮高階環境立體聲聲音表徵的方法及設備 Download PDF

Info

Publication number
TWI703558B
TWI703558B TW105132570A TW105132570A TWI703558B TW I703558 B TWI703558 B TW I703558B TW 105132570 A TW105132570 A TW 105132570A TW 105132570 A TW105132570 A TW 105132570A TW I703558 B TWI703558 B TW I703558B
Authority
TW
Taiwan
Prior art keywords
sound
side information
basic
layer
compressed
Prior art date
Application number
TW105132570A
Other languages
English (en)
Chinese (zh)
Other versions
TW201727622A (zh
Inventor
斯凡 科登
亞歷山德 克魯格
Original Assignee
瑞典商杜比國際公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 瑞典商杜比國際公司 filed Critical 瑞典商杜比國際公司
Publication of TW201727622A publication Critical patent/TW201727622A/zh
Application granted granted Critical
Publication of TWI703558B publication Critical patent/TWI703558B/zh

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereophonic System (AREA)
  • Compositions Of Oxide Ceramics (AREA)
  • Laminated Bodies (AREA)
TW105132570A 2015-10-08 2016-10-07 解碼聲音或音場的壓縮高階環境立體聲聲音表徵的方法及設備 TWI703558B (zh)

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
EP15306589 2015-10-08
EP15306589.1 2015-10-08
EP15306653 2015-10-15
EP15306653.5 2015-10-15
US201662361461P 2016-07-12 2016-07-12
US201662361416P 2016-07-12 2016-07-12
US62/361,461 2016-07-12
US62/361,416 2016-07-12

Publications (2)

Publication Number Publication Date
TW201727622A TW201727622A (zh) 2017-08-01
TWI703558B true TWI703558B (zh) 2020-09-01

Family

ID=58487849

Family Applications (2)

Application Number Title Priority Date Filing Date
TW105132570A TWI703558B (zh) 2015-10-08 2016-10-07 解碼聲音或音場的壓縮高階環境立體聲聲音表徵的方法及設備
TW113100047A TWI887948B (zh) 2015-10-08 2016-10-07 解碼聲音或音場的壓縮高階環境立體聲(hoa)聲音表徵的方法、設備及非暫態電腦可讀儲存媒體

Family Applications After (1)

Application Number Title Priority Date Filing Date
TW113100047A TWI887948B (zh) 2015-10-08 2016-10-07 解碼聲音或音場的壓縮高階環境立體聲(hoa)聲音表徵的方法、設備及非暫態電腦可讀儲存媒體

Country Status (19)

Country Link
US (7) US10529343B2 (cg-RX-API-DMAC7.html)
EP (3) EP4571737A3 (cg-RX-API-DMAC7.html)
JP (3) JP6797198B2 (cg-RX-API-DMAC7.html)
KR (2) KR102715677B1 (cg-RX-API-DMAC7.html)
CN (6) CN116259323A (cg-RX-API-DMAC7.html)
AU (3) AU2016336258B2 (cg-RX-API-DMAC7.html)
BR (5) BR112018007172B1 (cg-RX-API-DMAC7.html)
CA (3) CA3217921A1 (cg-RX-API-DMAC7.html)
CL (1) CL2018000889A1 (cg-RX-API-DMAC7.html)
EA (1) EA033756B1 (cg-RX-API-DMAC7.html)
ES (1) ES2918523T3 (cg-RX-API-DMAC7.html)
IL (5) IL300036B2 (cg-RX-API-DMAC7.html)
MX (2) MX374441B (cg-RX-API-DMAC7.html)
MY (1) MY193124A (cg-RX-API-DMAC7.html)
PH (1) PH12018500702B1 (cg-RX-API-DMAC7.html)
SA (3) SA520412522B1 (cg-RX-API-DMAC7.html)
TW (2) TWI703558B (cg-RX-API-DMAC7.html)
WO (1) WO2017060410A1 (cg-RX-API-DMAC7.html)
ZA (4) ZA202001983B (cg-RX-API-DMAC7.html)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6243060B2 (ja) * 2014-03-21 2017-12-06 ドルビー・インターナショナル・アーベー 高次アンビソニックス(hoa)信号を圧縮する方法、圧縮されたhoa信号を圧縮解除する方法、hoa信号を圧縮する装置および圧縮されたhoa信号を圧縮解除する装置
EP2922057A1 (en) 2014-03-21 2015-09-23 Thomson Licensing Method for compressing a Higher Order Ambisonics (HOA) signal, method for decompressing a compressed HOA signal, apparatus for compressing a HOA signal, and apparatus for decompressing a compressed HOA signal
CN116259323A (zh) * 2015-10-08 2023-06-13 杜比国际公司 用于压缩声音或声场表示的分层编解码
US10264386B1 (en) * 2018-02-09 2019-04-16 Google Llc Directional emphasis in ambisonics
WO2021226511A1 (en) 2020-05-08 2021-11-11 Nuance Communications, Inc. System and method for data augmentation for multi-microphone signal processing

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150213809A1 (en) * 2014-01-30 2015-07-30 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
WO2015140293A1 (en) * 2014-03-21 2015-09-24 Thomson Licensing Method for compressing a higher order ambisonics (hoa) signal, method for decompressing a compressed hoa signal, apparatus for compressing a hoa signal, and apparatus for decompressing a compressed hoa signal

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4771674B2 (ja) * 2004-09-02 2011-09-14 パナソニック株式会社 音声符号化装置、音声復号化装置及びこれらの方法
US7903824B2 (en) 2005-01-10 2011-03-08 Agere Systems Inc. Compact side information for parametric coding of spatial audio
US8321230B2 (en) 2006-02-06 2012-11-27 France Telecom Method and device for the hierarchical coding of a source audio signal and corresponding decoding method and device, programs and signals
ES2339888T3 (es) 2006-02-21 2010-05-26 Koninklijke Philips Electronics N.V. Codificacion y decodificacion de audio.
GB2467668B (en) 2007-10-03 2011-12-07 Creative Tech Ltd Spatial audio analysis and synthesis for binaural reproduction and format conversion
WO2010103854A2 (ja) * 2009-03-13 2010-09-16 パナソニック株式会社 音声符号化装置、音声復号装置、音声符号化方法及び音声復号方法
EP2395505A1 (en) * 2010-06-11 2011-12-14 Thomson Licensing Method and apparatus for searching in a layered hierarchical bit stream followed by replay, said bit stream including a base layer and at least one enhancement layer
EP2665208A1 (en) 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
US9288603B2 (en) * 2012-07-15 2016-03-15 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for backward-compatible audio coding
TWI590234B (zh) * 2012-07-19 2017-07-01 杜比國際公司 編碼聲訊資料之方法和裝置,以及解碼已編碼聲訊資料之方法和裝置
US9479886B2 (en) * 2012-07-20 2016-10-25 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec
US9460729B2 (en) 2012-09-21 2016-10-04 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
US9613660B2 (en) 2013-04-05 2017-04-04 Dts, Inc. Layered audio reconstruction system
JP6377730B2 (ja) 2013-06-05 2018-08-22 ドルビー・インターナショナル・アーベー オーディオ信号を符号化する方法及び装置並びにオーディオ信号を復号する方法及び装置
EP2922057A1 (en) 2014-03-21 2015-09-23 Thomson Licensing Method for compressing a Higher Order Ambisonics (HOA) signal, method for decompressing a compressed HOA signal, apparatus for compressing a HOA signal, and apparatus for decompressing a compressed HOA signal
US10140996B2 (en) * 2014-10-10 2018-11-27 Qualcomm Incorporated Signaling layers for scalable coding of higher order ambisonic audio data
CN116259323A (zh) * 2015-10-08 2023-06-13 杜比国际公司 用于压缩声音或声场表示的分层编解码

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150213809A1 (en) * 2014-01-30 2015-07-30 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
WO2015140293A1 (en) * 2014-03-21 2015-09-24 Thomson Licensing Method for compressing a higher order ambisonics (hoa) signal, method for decompressing a compressed hoa signal, apparatus for compressing a hoa signal, and apparatus for decompressing a compressed hoa signal

Also Published As

Publication number Publication date
EP3360133B8 (en) 2022-06-15
IL292854A (en) 2022-07-01
US20230215446A1 (en) 2023-07-06
JP2018535447A (ja) 2018-11-29
CL2018000889A1 (es) 2018-07-06
AU2023237179A1 (en) 2023-10-19
AU2023237179B2 (en) 2025-11-27
JP7582624B2 (ja) 2024-11-13
US20200098377A1 (en) 2020-03-26
ES2918523T3 (es) 2022-07-18
JP2022160602A (ja) 2022-10-19
KR102715677B1 (ko) 2024-10-11
IL292854B2 (en) 2023-07-01
TW202443558A (zh) 2024-11-01
US20220180877A1 (en) 2022-06-09
US11626119B2 (en) 2023-04-11
IL300036B1 (en) 2023-12-01
ZA202402611B (en) 2025-07-30
CA3000905A1 (en) 2017-04-13
CA3217926C (en) 2025-05-13
CN116259324A (zh) 2023-06-13
IL292854B1 (en) 2023-03-01
JP6797198B2 (ja) 2020-12-09
WO2017060410A1 (en) 2017-04-13
CN108140392A (zh) 2018-06-08
US20240296850A1 (en) 2024-09-05
SA520412522B1 (ar) 2025-01-09
EP4571737A2 (en) 2025-06-18
US20180308496A1 (en) 2018-10-25
CN116259326A (zh) 2023-06-13
EP4068283B1 (en) 2025-02-12
EP4571737A3 (en) 2025-08-06
BR122022025393B1 (pt) 2023-04-18
US10529343B2 (en) 2020-01-07
AU2016336258A1 (en) 2018-05-10
EP3360133A1 (en) 2018-08-15
CN116206617A (zh) 2023-06-02
SA521430003B1 (ar) 2025-01-09
EP4068283A1 (en) 2022-10-05
IL308605B2 (en) 2025-09-01
AU2016336258B2 (en) 2021-05-27
IL320151A (en) 2025-06-01
ZA202204176B (en) 2024-01-31
PH12018500702A1 (en) 2018-10-15
US11948587B2 (en) 2024-04-02
SA518391259B1 (ar) 2021-10-11
AU2021221861A1 (en) 2021-09-23
BR112018007172B1 (pt) 2023-05-16
IL258360A (en) 2018-05-31
CA3217921A1 (en) 2017-04-13
IL258360B (en) 2021-03-25
KR20240152407A (ko) 2024-10-21
IL300036A (en) 2023-03-01
CA3217926A1 (en) 2017-04-13
EA033756B1 (ru) 2019-11-22
AU2021221861B2 (en) 2023-06-29
CN108140392B (zh) 2023-04-18
IL308605A (en) 2024-01-01
KR20180066136A (ko) 2018-06-18
MX2018004163A (es) 2018-08-01
US20210082440A1 (en) 2021-03-18
BR122021007299B1 (pt) 2023-04-18
PH12018500702B1 (en) 2021-09-22
US12236963B2 (en) 2025-02-25
BR122019020650A8 (pt) 2022-09-13
CN116189692A (zh) 2023-05-30
BR122022025396B1 (pt) 2023-04-18
HK1253682A1 (zh) 2019-06-28
US20250239265A1 (en) 2025-07-24
CN116259323A (zh) 2023-06-13
MX2020008983A (es) 2020-09-28
TWI887948B (zh) 2025-06-21
MX374441B (es) 2025-03-06
BR122019020650A2 (cg-RX-API-DMAC7.html) 2018-10-16
BR122019020650B1 (pt) 2023-05-02
ZA202304207B (en) 2024-08-28
IL308605B1 (en) 2025-05-01
BR112018007172A2 (pt) 2018-10-16
EA201890843A1 (ru) 2018-10-31
EP3360133B1 (en) 2022-04-27
TW201727622A (zh) 2017-08-01
IL300036B2 (en) 2024-04-01
US11232801B2 (en) 2022-01-25
CA3000905C (en) 2024-01-09
ZA202001983B (en) 2022-12-21
JP2025016548A (ja) 2025-02-04
MY193124A (en) 2022-09-26

Similar Documents

Publication Publication Date Title
JP7346676B2 (ja) 圧縮された音または音場表現のための層構成の符号化
TWI703558B (zh) 解碼聲音或音場的壓縮高階環境立體聲聲音表徵的方法及設備
TWI829956B (zh) 解碼聲音或音場的壓縮高階環境立體聲(hoa)聲音表徵的方法、設備及非暫態電腦可讀儲存媒體
HK40086424A (zh) 用於压缩声音或声场表示的分层编解码
HK40086683A (zh) 用於压缩声音或声场表示的分层编解码
HK40086437A (zh) 用於压缩声音或声场表示的分层编解码
HK40086789A (zh) 用於压缩声音或声场表示的分层编解码
HK40086142A (zh) 用於压缩声音或声场表示的分层编解码
HK40086453A (zh) 用於压缩声音或声场表示的分层编解码
HK40086729A (zh) 用於压缩声音或声场表示的分层编解码
HK40086144A (zh) 用於压缩声音或声场表示的分层编解码
HK40086409A (zh) 用於压缩声音或声场表示的分层编解码
HK40090154A (en) Layered coding for compressed sound or sound field represententations
HK40084194A (en) Layered coding for compressed sound or sound field representations
HK40063973A (en) Layered coding for compressed sound or sound field representations
HK1249800B (en) Layered coding for compressed sound or sound field representations
HK1249799B (en) Layered coding for compressed sound or sound field representations
HK1253682B (en) Layered hoa coding for compressed sound or sound field representations