ES3059272T3 - Spatial coding of higher order ambisonics for a low latency immersive audio codec - Google Patents
Spatial coding of higher order ambisonics for a low latency immersive audio codecInfo
- Publication number
- ES3059272T3 ES3059272T3 ES23703973T ES23703973T ES3059272T3 ES 3059272 T3 ES3059272 T3 ES 3059272T3 ES 23703973 T ES23703973 T ES 23703973T ES 23703973 T ES23703973 T ES 23703973T ES 3059272 T3 ES3059272 T3 ES 3059272T3
- Authority
- ES
- Spain
- Prior art keywords
- channels
- hoa
- spar
- prediction
- metadata
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
- G10L19/025—Detection of transients or attacks for time/frequency resolution switching
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202263301152P | 2022-01-20 | 2022-01-20 | |
| US202263394586P | 2022-08-02 | 2022-08-02 | |
| US202263476518P | 2022-12-21 | 2022-12-21 | |
| PCT/US2023/010415 WO2023141034A1 (en) | 2022-01-20 | 2023-01-09 | Spatial coding of higher order ambisonics for a low latency immersive audio codec |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ES3059272T3 true ES3059272T3 (en) | 2026-03-19 |
Family
ID=85199285
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| ES23703973T Active ES3059272T3 (en) | 2022-01-20 | 2023-01-09 | Spatial coding of higher order ambisonics for a low latency immersive audio codec |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US20250095660A1 (https=) |
| EP (2) | EP4716258A3 (https=) |
| JP (1) | JP2025504862A (https=) |
| KR (1) | KR20240137613A (https=) |
| ES (1) | ES3059272T3 (https=) |
| TW (1) | TW202336739A (https=) |
| WO (1) | WO2023141034A1 (https=) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20250078845A1 (en) * | 2023-08-29 | 2025-03-06 | Samsung Electronics Co., Ltd. | Lossless audio coding for multichannel hierarchical reconstruction |
| WO2025081393A1 (zh) * | 2023-10-18 | 2025-04-24 | 北京小米移动软件有限公司 | 音频信号的处理方法、装置、音频设备及存储介质 |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| IL319278A (en) * | 2018-07-02 | 2025-04-01 | Dolby Laboratories Licensing Corp | Methods and devices for generating or decoding a bit sequence comprising embedded audio signals |
| MX2022005146A (es) * | 2019-10-30 | 2022-05-30 | Dolby Laboratories Licensing Corp | Distribucion de tasa de bits en servicios inmersivos de voz y audio. |
| EP4738346A1 (en) * | 2020-12-02 | 2026-05-06 | Dolby International AB | Immersive voice and audio services (ivas) with adaptive downmix strategies |
-
2023
- 2023-01-09 WO PCT/US2023/010415 patent/WO2023141034A1/en not_active Ceased
- 2023-01-09 KR KR1020247027359A patent/KR20240137613A/ko active Pending
- 2023-01-09 EP EP25219245.5A patent/EP4716258A3/en active Pending
- 2023-01-09 EP EP23703973.0A patent/EP4466697B1/en active Active
- 2023-01-09 US US18/729,248 patent/US20250095660A1/en active Pending
- 2023-01-09 ES ES23703973T patent/ES3059272T3/es active Active
- 2023-01-09 JP JP2024543106A patent/JP2025504862A/ja active Pending
- 2023-01-19 TW TW112102544A patent/TW202336739A/zh unknown
Also Published As
| Publication number | Publication date |
|---|---|
| JP2025504862A (ja) | 2025-02-19 |
| US20250095660A1 (en) | 2025-03-20 |
| EP4466697B1 (en) | 2025-12-03 |
| EP4716258A3 (en) | 2026-04-01 |
| EP4466697A1 (en) | 2024-11-27 |
| KR20240137613A (ko) | 2024-09-20 |
| WO2023141034A1 (en) | 2023-07-27 |
| TW202336739A (zh) | 2023-09-16 |
| EP4716258A2 (en) | 2026-03-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7842798B2 (ja) | パケット損失補償装置およびパケット損失補償方法、ならびに音声処理システム | |
| JP7695320B2 (ja) | マルチチャネル信号符号化方法、マルチチャネル信号復号方法、エンコーダ、およびデコーダ | |
| ES3059272T3 (en) | Spatial coding of higher order ambisonics for a low latency immersive audio codec | |
| ES2934646T3 (es) | Sistema de procesamiento de audio | |
| ES3058595T3 (en) | Method and system for encoding a stereo sound signal using coding parameters of a primary channel to encode a secondary channel | |
| ES2798137T3 (es) | Decodificador de audio multicanal, codificador de audio multicanal, procedimientos y programa informático que utilizan un ajuste basado en señal residual de una contribución de una señal decorrelacionada | |
| ES2982183T3 (es) | Codificador y descodificador de audio | |
| JP7831938B2 (ja) | 低遅延オーディオ・コーデックのためのパラメータの量子化およびエントロピー符号化 | |
| ES2693051T3 (es) | Aparato y procedimiento para generar una señal mejorada mediante el uso de relleno de ruido independiente | |
| ES3032871T3 (en) | Methods and devices for encoding decoding spatial background noise within a multi-channel input signal | |
| JP6974927B2 (ja) | 時間領域ステレオエンコーディング及びデコーディング方法並びに関連製品 | |
| BR112015025080B1 (pt) | Método de decodificação e decodificador para decodificar dois sinais de áudio, método de codificação e codificador para codificar dois sinais de áudio, e meio legível não transitório | |
| ES3017425T3 (en) | Method and apparatus for controlling multichannel audio frame loss concealment | |
| KR20230018533A (ko) | 오디오 코딩/디코딩 모드를 결정하는 방법 및 관련 제품 | |
| US9734836B2 (en) | Method and apparatus for decoding speech/audio bitstream | |
| WO2025113123A1 (zh) | 音频编码方法、音频解码方法、装置、可读存储介质 | |
| JP2025504862A5 (https=) | ||
| EP2695301A1 (en) | Method and decoder for reconstructing a source signal | |
| KR20200035306A (ko) | 시간-도메인 스테레오 인코딩 및 디코딩 방법 및 관련 제품 | |
| HK40115398A (zh) | 用於低延迟沉浸式音频编解码器的高阶高保真度立体声响复制的空间编码 | |
| KR20200038297A (ko) | 스테레오 신호 인코딩에서의 신호 재구성 방법 및 디바이스 | |
| CN118871986A (zh) | 用于低延迟沉浸式音频编解码器的高阶高保真度立体声响复制的空间编码 | |
| TWI897026B (zh) | 用於具有元資料之參數化經寫碼獨立串流之不連續傳輸的編碼器及編碼方法 | |
| TWI897027B (zh) | 用於具有元資料之參數化經寫碼獨立串流之不連續傳輸的解碼器及解碼方法 | |
| RU2838373C1 (ru) | Квантование и энтропийное кодирование параметров для аудиокодека с низкой задержкой |