TW202336739A - Spatial coding of higher order ambisonics for a low latency immersive audio codec - Google Patents

Spatial coding of higher order ambisonics for a low latency immersive audio codec

Info

Publication number
TW202336739A
TW202336739A TW112102544A TW112102544A TW202336739A TW 202336739 A TW202336739 A TW 202336739A TW 112102544 A TW112102544 A TW 112102544A TW 112102544 A TW112102544 A TW 112102544A TW 202336739 A TW202336739 A TW 202336739A
Authority
TW
Taiwan
Prior art keywords
channel
spar
channels
hoa
audio signal
Prior art date
Application number
TW112102544A
Other languages
English (en)
Chinese (zh)
Inventor
史蒂芬妮 伯朗
史蒂芬 布魯恩
里沙普 塔吉
Original Assignee
美商杜拜研究特許公司
Priority date
2022-01-20
Filing date
2023-01-19
Publication date
2023-09-16
Application filed by 美商杜拜研究特許公司
Publication of TW202336739A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002 Dynamic bit allocation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022 Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • G10L19/025 Detection of transients or attacks for time/frequency resolution switching
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008 Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11 Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
TW112102544A 2022-01-20 2023-01-19 Spatial coding of higher order ambisonics for a low latency immersive audio codec TW202336739A (zh)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US202263301152P 2022-01-20 2022-01-20
US63/301,152 2022-01-20
US202263394586P 2022-08-02 2022-08-02
US63/394,586 2022-08-02
US202263476518P 2022-12-21 2022-12-21
US63/476,518 2022-12-21

Publications (1)

Publication Number Publication Date
TW202336739A (zh) 2023-09-16

Family

ID=85199285

Family Applications (1)

Application Number Title Priority Date Filing Date
TW112102544A TW202336739A (zh) Spatial coding of higher order ambisonics for a low latency immersive audio codec 2022-01-20 2023-01-19

Country Status (7)

Country Link
US (1) US20250095660A1
EP (2) EP4716258A3
JP (1) JP2025504862A
KR (1) KR20240137613A
ES (1) ES3059272T3
TW (1) TW202336739A
WO (1) WO2023141034A1

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20250078845A1 (en) * 2023-08-29 2025-03-06 Samsung Electronics Co., Ltd. Lossless audio coding for multichannel hierarchical reconstruction
WO2025081393A1 (zh) * 2023-10-18 2025-04-24 北京小米移动软件有限公司 Audio signal processing method and apparatus, audio device, and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL319278A (en) * 2018-07-02 2025-04-01 Dolby Laboratories Licensing Corp Methods and devices for generating or decoding a bit sequence comprising embedded audio signals
MX2022005146A (es) * 2019-10-30 2022-05-30 Dolby Laboratories Licensing Corp Bitrate distribution in immersive voice and audio services.
EP4738346A1 (en) * 2020-12-02 2026-05-06 Dolby International AB Immersive voice and audio services (ivas) with adaptive downmix strategies

Also Published As

Publication number Publication date
JP2025504862A (ja) 2025-02-19
US20250095660A1 (en) 2025-03-20
ES3059272T3 (en) 2026-03-19
EP4466697B1 (en) 2025-12-03
EP4716258A3 (en) 2026-04-01
EP4466697A1 (en) 2024-11-27
KR20240137613A (ko) 2024-09-20
WO2023141034A1 (en) 2023-07-27
EP4716258A2 (en) 2026-03-25

Similar Documents

Publication Publication Date Title
JP7842798B2 (ja) Packet loss concealment device, packet loss concealment method, and audio processing system
US8046214B2 (en) Low complexity decoder for complex transform coding of multi-channel sound
AU2010249173B2 (en) Complex-transform channel coding with extended-band frequency coding
US7953604B2 (en) Shape and scale parameters for extended-band frequency coding
US8190425B2 (en) Complex cross-correlation parameters for multi-channel audio
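JP5542306B2 (ja) Scalable encoding and decoding of audio signals
JP7831938B2 (ja) Quantization and entropy coding of parameters for a low latency audio codec
KR102492119B1 (ko) Method for determining audio coding/decoding mode and related product
TW202336739A (zh) Spatial coding of higher order ambisonics for a low latency immersive audio codec
HK40115398A (zh) Spatial coding of higher order ambisonics for a low latency immersive audio codec
CN118871986A (zh) Spatial coding of higher order ambisonics for a low latency immersive audio codec
RU2838373C1 (ru) Quantization and entropy coding of parameters for a low latency audio codec
TWI897027B (zh) Decoder and decoding method for discontinuous transmission of parametrically coded independent streams with metadata
TWI897026B (zh) Encoder and encoding method for discontinuous transmission of parametrically coded independent streams with metadata
WO2025239172A1 (ja) Encoding device and method, decoding device and method, program, and information processing system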