TW202336739A - 用於低延時沉浸式音頻編解碼器之較高階立體混響聲之空間寫碼 - Google Patents
用於低延時沉浸式音頻編解碼器之較高階立體混響聲之空間寫碼 Download PDFInfo
- Publication number
- TW202336739A TW202336739A TW112102544A TW112102544A TW202336739A TW 202336739 A TW202336739 A TW 202336739A TW 112102544 A TW112102544 A TW 112102544A TW 112102544 A TW112102544 A TW 112102544A TW 202336739 A TW202336739 A TW 202336739A
- Authority
- TW
- Taiwan
- Prior art keywords
- channel
- spar
- channels
- hoa
- audio signal
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
- G10L19/025—Detection of transients or attacks for time/frequency resolution switching
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202263301152P | 2022-01-20 | 2022-01-20 | |
| US63/301,152 | 2022-01-20 | ||
| US202263394586P | 2022-08-02 | 2022-08-02 | |
| US63/394,586 | 2022-08-02 | ||
| US202263476518P | 2022-12-21 | 2022-12-21 | |
| US63/476,518 | 2022-12-21 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| TW202336739A true TW202336739A (zh) | 2023-09-16 |
Family
ID=85199285
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW112102544A TW202336739A (zh) | 2022-01-20 | 2023-01-19 | 用於低延時沉浸式音頻編解碼器之較高階立體混響聲之空間寫碼 |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US20250095660A1 (https=) |
| EP (2) | EP4716258A3 (https=) |
| JP (1) | JP2025504862A (https=) |
| KR (1) | KR20240137613A (https=) |
| ES (1) | ES3059272T3 (https=) |
| TW (1) | TW202336739A (https=) |
| WO (1) | WO2023141034A1 (https=) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20250078845A1 (en) * | 2023-08-29 | 2025-03-06 | Samsung Electronics Co., Ltd. | Lossless audio coding for multichannel hierarchical reconstruction |
| WO2025081393A1 (zh) * | 2023-10-18 | 2025-04-24 | 北京小米移动软件有限公司 | 音频信号的处理方法、装置、音频设备及存储介质 |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| IL319278A (en) * | 2018-07-02 | 2025-04-01 | Dolby Laboratories Licensing Corp | Methods and devices for generating or decoding a bit sequence comprising embedded audio signals |
| MX2022005146A (es) * | 2019-10-30 | 2022-05-30 | Dolby Laboratories Licensing Corp | Distribucion de tasa de bits en servicios inmersivos de voz y audio. |
| EP4738346A1 (en) * | 2020-12-02 | 2026-05-06 | Dolby International AB | Immersive voice and audio services (ivas) with adaptive downmix strategies |
-
2023
- 2023-01-09 WO PCT/US2023/010415 patent/WO2023141034A1/en not_active Ceased
- 2023-01-09 KR KR1020247027359A patent/KR20240137613A/ko active Pending
- 2023-01-09 EP EP25219245.5A patent/EP4716258A3/en active Pending
- 2023-01-09 EP EP23703973.0A patent/EP4466697B1/en active Active
- 2023-01-09 US US18/729,248 patent/US20250095660A1/en active Pending
- 2023-01-09 ES ES23703973T patent/ES3059272T3/es active Active
- 2023-01-09 JP JP2024543106A patent/JP2025504862A/ja active Pending
- 2023-01-19 TW TW112102544A patent/TW202336739A/zh unknown
Also Published As
| Publication number | Publication date |
|---|---|
| JP2025504862A (ja) | 2025-02-19 |
| US20250095660A1 (en) | 2025-03-20 |
| ES3059272T3 (en) | 2026-03-19 |
| EP4466697B1 (en) | 2025-12-03 |
| EP4716258A3 (en) | 2026-04-01 |
| EP4466697A1 (en) | 2024-11-27 |
| KR20240137613A (ko) | 2024-09-20 |
| WO2023141034A1 (en) | 2023-07-27 |
| EP4716258A2 (en) | 2026-03-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7842798B2 (ja) | パケット損失補償装置およびパケット損失補償方法、ならびに音声処理システム | |
| US8046214B2 (en) | Low complexity decoder for complex transform coding of multi-channel sound | |
| AU2010249173B2 (en) | Complex-transform channel coding with extended-band frequency coding | |
| US7953604B2 (en) | Shape and scale parameters for extended-band frequency coding | |
| US8190425B2 (en) | Complex cross-correlation parameters for multi-channel audio | |
| JP5542306B2 (ja) | オーディオ信号のスケーラブル符号化及び復号 | |
| JP7831938B2 (ja) | 低遅延オーディオ・コーデックのためのパラメータの量子化およびエントロピー符号化 | |
| KR102492119B1 (ko) | 오디오 코딩/디코딩 모드를 결정하는 방법 및 관련 제품 | |
| TW202336739A (zh) | 用於低延時沉浸式音頻編解碼器之較高階立體混響聲之空間寫碼 | |
| HK40115398A (zh) | 用於低延迟沉浸式音频编解码器的高阶高保真度立体声响复制的空间编码 | |
| CN118871986A (zh) | 用于低延迟沉浸式音频编解码器的高阶高保真度立体声响复制的空间编码 | |
| RU2838373C1 (ru) | Квантование и энтропийное кодирование параметров для аудиокодека с низкой задержкой | |
| TWI897027B (zh) | 用於具有元資料之參數化經寫碼獨立串流之不連續傳輸的解碼器及解碼方法 | |
| TWI897026B (zh) | 用於具有元資料之參數化經寫碼獨立串流之不連續傳輸的編碼器及編碼方法 | |
| WO2025239172A1 (ja) | 符号化装置および方法、復号装置および方法、プログラム、並びに情報処理システム |