ES3059272T3 - Spatial coding of higher order ambisonics for a low latency immersive audio codec

Info

Publication number
ES3059272T3
Authority
ES
Spain
Prior art keywords
channels
hoa
spar
prediction
metadata
Prior art date
Legal status
Active
Application number
ES23703973T
Other languages
English (en)
Spanish (es)
Inventor
Stefanie Brown
Stefan Bruhn
Rishabh Tyagi
Current Assignee
Dolby International AB
Dolby Laboratories Licensing Corp
Original Assignee
Dolby International AB
Dolby Laboratories Licensing Corp
Priority date
2022-01-20
Filing date
2023-01-09
Publication date
2026-03-19
Application filed by Dolby International AB, Dolby Laboratories Licensing Corp

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002 Dynamic bit allocation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022 Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • G10L19/025 Detection of transients or attacks for time/frequency resolution switching
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008 Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used in stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11 Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
Application ES23703973T (priority date 2022-01-20, filing date 2023-01-09): Spatial coding of higher order ambisonics for a low latency immersive audio codec. Status: Active. Publication: ES3059272T3 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US202263301152P 2022-01-20 2022-01-20
US202263394586P 2022-08-02 2022-08-02
US202263476518P 2022-12-21 2022-12-21
PCT/US2023/010415 WO2023141034A1 (en) 2022-01-20 2023-01-09 Spatial coding of higher order ambisonics for a low latency immersive audio codec

Publications (1)

Publication Number Publication Date
ES3059272T3 (en) 2026-03-19

Family

ID=85199285

Family Applications (1)

Application Number Status Publication Number Priority Date Filing Date Title
ES23703973T Active ES3059272T3 (en) 2022-01-20 2023-01-09 Spatial coding of higher order ambisonics for a low latency immersive audio codec

Country Status (7)

Country Link
US (1) US20250095660A1
EP (2) EP4716258A3
JP (1) JP2025504862A
KR (1) KR20240137613A
ES (1) ES3059272T3
TW (1) TW202336739A
WO (1) WO2023141034A1

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20250078845A1 (en) * 2023-08-29 2025-03-06 Samsung Electronics Co., Ltd. Lossless audio coding for multichannel hierarchical reconstruction
WO2025081393A1 (zh) * 2023-10-18 2025-04-24 Beijing Xiaomi Mobile Software Co., Ltd. Audio signal processing method and apparatus, audio device, and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL319278A (en) * 2018-07-02 2025-04-01 Dolby Laboratories Licensing Corp Methods and devices for generating or decoding a bit sequence comprising embedded audio signals
MX2022005146A (es) * 2019-10-30 2022-05-30 Dolby Laboratories Licensing Corp Bitrate distribution in immersive voice and audio services
EP4738346A1 (en) * 2020-12-02 2026-05-06 Dolby International AB Immersive voice and audio services (ivas) with adaptive downmix strategies

Also Published As

Publication number Publication date
JP2025504862A (ja) 2025-02-19
US20250095660A1 (en) 2025-03-20
EP4466697B1 (en) 2025-12-03
EP4716258A3 (en) 2026-04-01
EP4466697A1 (en) 2024-11-27
KR20240137613A (ko) 2024-09-20
WO2023141034A1 (en) 2023-07-27
TW202336739A (zh) 2023-09-16
EP4716258A2 (en) 2026-03-25

Similar Documents

Publication Publication Date Title
JP7842798B2 (ja) Packet loss concealment device, packet loss concealment method, and audio processing system
JP7695320B2 (ja) Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder
ES3059272T3 (en) Spatial coding of higher order ambisonics for a low latency immersive audio codec
ES2934646T3 (es) Audio processing system
ES3058595T3 (en) Method and system for encoding a stereo sound signal using coding parameters of a primary channel to encode a secondary channel
ES2798137T3 (es) Multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a residual-signal-based adjustment of a contribution of a decorrelated signal
ES2982183T3 (es) Audio encoder and decoder
JP7831938B2 (ja) Quantization and entropy coding of parameters for a low latency audio codec
ES2693051T3 (es) Apparatus and method for generating an enhanced signal using independent noise filling
ES3032871T3 (en) Methods and devices for encoding decoding spatial background noise within a multi-channel input signal
JP6974927B2 (ja) Time-domain stereo encoding and decoding method and related product
BR112015025080B1 (pt) Decoding method and decoder for decoding two audio signals, encoding method and encoder for encoding two audio signals, and non-transitory readable medium
ES3017425T3 (en) Method and apparatus for controlling multichannel audio frame loss concealment
KR20230018533A (ko) Method for determining an audio coding/decoding mode and related product
US9734836B2 (en) Method and apparatus for decoding speech/audio bitstream
WO2025113123A1 (zh) Audio encoding method, audio decoding method, apparatus, and readable storage medium
JP2025504862A5
EP2695301A1 (en) Method and decoder for reconstructing a source signal
KR20200035306A (ko) Time-domain stereo encoding and decoding method and related product
HK40115398A (zh) Spatial coding of higher order ambisonics for a low latency immersive audio codec
KR20200038297A (ko) Method and device for signal reconstruction in stereo signal encoding
CN118871986A (zh) Spatial coding of higher order ambisonics for a low latency immersive audio codec
TWI897026B (zh) Encoder and encoding method for discontinuous transmission of parametrically coded independent streams with metadata
TWI897027B (zh) Decoder and decoding method for discontinuous transmission of parametrically coded independent streams with metadata
RU2838373C1 (ru) Quantization and entropy coding of parameters for a low latency audio codec