JP2025504862A - Spatial coding of higher order ambisonics for a low latency immersive audio codec - Google Patents
Spatial coding of higher order ambisonics for a low latency immersive audio codec
- Publication number
- JP2025504862A
- Authority
- JP
- Japan
- Prior art keywords
- channel
- ambisonics
- spar
- channels
- hoa
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
- G10L19/025—Detection of transients or attacks for time/frequency resolution switching
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used in stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202263301152P | 2022-01-20 | 2022-01-20 | |
| US63/301,152 | 2022-01-20 | ||
| US202263394586P | 2022-08-02 | 2022-08-02 | |
| US63/394,586 | 2022-08-02 | ||
| US202263476518P | 2022-12-21 | 2022-12-21 | |
| US63/476,518 | 2022-12-21 | ||
| PCT/US2023/010415 WO2023141034A1 (en) | 2022-01-20 | 2023-01-09 | Spatial coding of higher order ambisonics for a low latency immersive audio codec |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2025504862A (ja) | 2025-02-19 |
| JP2025504862A5 | 2026-01-19 |
Family
ID=85199285
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2024543106A Pending JP2025504862A (ja) | 2022-01-20 | 2023-01-09 | Spatial coding of higher order ambisonics for a low latency immersive audio codec |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US20250095660A1 |
| EP (2) | EP4716258A3 |
| JP (1) | JP2025504862A |
| KR (1) | KR20240137613A |
| ES (1) | ES3059272T3 |
| TW (1) | TW202336739A |
| WO (1) | WO2023141034A1 |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20250078845A1 (en) * | 2023-08-29 | 2025-03-06 | Samsung Electronics Co., Ltd. | Lossless audio coding for multichannel hierarchical reconstruction |
| WO2025081393A1 (zh) * | 2023-10-18 | 2025-04-24 | 北京小米移动软件有限公司 (Beijing Xiaomi Mobile Software Co., Ltd.) | Audio signal processing method and apparatus, audio device, and storage medium |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| IL319278A (en) * | 2018-07-02 | 2025-04-01 | Dolby Laboratories Licensing Corp | Methods and devices for generating or decoding a bit sequence comprising embedded audio signals |
| MX2022005146A (es) * | 2019-10-30 | 2022-05-30 | Dolby Laboratories Licensing Corp | Bitrate distribution in immersive voice and audio services |
| EP4738346A1 (en) * | 2020-12-02 | 2026-05-06 | Dolby International AB | Immersive voice and audio services (ivas) with adaptive downmix strategies |
-
2023
- 2023-01-09 WO PCT/US2023/010415 patent/WO2023141034A1/en not_active Ceased
- 2023-01-09 KR KR1020247027359A patent/KR20240137613A/ko active Pending
- 2023-01-09 EP EP25219245.5A patent/EP4716258A3/en active Pending
- 2023-01-09 EP EP23703973.0A patent/EP4466697B1/en active Active
- 2023-01-09 US US18/729,248 patent/US20250095660A1/en active Pending
- 2023-01-09 ES ES23703973T patent/ES3059272T3/es active Active
- 2023-01-09 JP JP2024543106A patent/JP2025504862A/ja active Pending
- 2023-01-19 TW TW112102544A patent/TW202336739A/zh unknown
Also Published As
| Publication number | Publication date |
|---|---|
| US20250095660A1 (en) | 2025-03-20 |
| ES3059272T3 (en) | 2026-03-19 |
| EP4466697B1 (en) | 2025-12-03 |
| EP4716258A3 (en) | 2026-04-01 |
| EP4466697A1 (en) | 2024-11-27 |
| KR20240137613A (ko) | 2024-09-20 |
| WO2023141034A1 (en) | 2023-07-27 |
| TW202336739A (zh) | 2023-09-16 |
| EP4716258A2 (en) | 2026-03-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7842798B2 (ja) | | Packet loss compensation device, packet loss compensation method, and audio processing system |
| US20260038523A1 (en) | | Truncateable predictive coding |
| US8190425B2 (en) | | Complex cross-correlation parameters for multi-channel audio |
| AU2007208482B2 (en) | | Complex-transform channel coding with extended-band frequency coding |
| US7953604B2 (en) | | Shape and scale parameters for extended-band frequency coding |
| JP7789811B2 (ja) | | Spatialized audio coding with interpolation and quantization of rotations |
| US8046214B2 (en) | | Low complexity decoder for complex transform coding of multi-channel sound |
| US8249883B2 (en) | | Channel extension coding for multi-channel source |
| JP6974927B2 (ja) | | Time-domain stereo encoding and decoding method and related product |
| JP7831938B2 (ja) | | Quantization and entropy coding of parameters for a low-latency audio codec |
| JP2025504862A (ja) | | Spatial coding of higher order ambisonics for a low latency immersive audio codec |
| KR20230018533A (ko) | | Method for determining audio coding/decoding mode and related product |
| KR102377434B1 (ko) | | Coding method for time-domain stereo parameter, and related product |
| CN118871986A (zh) | | Spatial coding of higher order ambisonics for a low latency immersive audio codec |
| HK40115398A (zh) | | Spatial coding of higher order ambisonics for a low latency immersive audio codec |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 2026-01-08 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20260108 |
| 2026-01-08 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 20260108 |