TWI794911B - Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene - Google Patents
- Publication number
- TWI794911B (application number TW110127932A)
- Authority
- TW
- Taiwan
- Prior art keywords
- frame
- signal
- audio signal
- audio
- sound field
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
  - G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    - G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
      - G10L19/032—Quantisation or dequantisation of spectral components
    - G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
      - G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
        - G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
      - G10L19/16—Vocoder architecture
        - G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
        - G10L19/173—Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
    - G10L19/012—Comfort noise or silence coding
    - G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  - G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    - G10L25/78—Detection of presence or absence of voice signals
    - G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP20188707.2 | 2020-07-30 | ||
EP20188707 | 2020-07-30 | ||
WOPCT/EP2021/064576 | 2021-05-31 | ||
PCT/EP2021/064576 WO2022022876A1 (en) | 2020-07-30 | 2021-05-31 | Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene |
Publications (2)
Publication Number | Publication Date |
---|---|
TW202230333A (zh) | 2022-08-01 |
TWI794911B (zh) | 2023-03-01 |
Family
ID=71894727
Family Applications (2)
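Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW112106853A TW202347316A (zh) | 2020-07-30 | 2021-07-29 | Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene |
TW110127932A TWI794911B (zh) | 2020-07-30 | 2021-07-29 | Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene |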
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW112106853A TW202347316A (zh) | 2020-07-30 | 2021-07-29 | Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene |
Country Status (12)
Country | Link |
---|---|
US (1) | US20230306975A1 |
EP (1) | EP4189674A1 |
JP (1) | JP2023536156A |
KR (1) | KR20230049660A |
CN (1) | CN116348951A |
AU (2) | AU2021317755B2 |
BR (1) | BR112023001616A2 |
CA (1) | CA3187342A1 |
MX (1) | MX2023001152A |
TW (2) | TW202347316A |
WO (1) | WO2022022876A1 |
ZA (1) | ZA202301024B |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3719799A1 (en) * | 2019-04-04 | 2020-10-07 | FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. | A multi-channel audio encoder, decoder, methods and computer program for switching between a parametric multi-channel operation and an individual channel operation |
CN115150718A (zh) * | 2022-06-30 | 2022-10-04 | 雷欧尼斯(北京)信息技术有限公司 | Playback method and production method for in-vehicle immersive audio |
WO2024051955A1 (en) | 2022-09-09 | 2024-03-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Decoder and decoding method for discontinuous transmission of parametrically coded independent streams with metadata |
WO2024051954A1 (en) | 2022-09-09 | 2024-03-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Encoder and encoding method for discontinuous transmission of parametrically coded independent streams with metadata |
WO2024056702A1 (en) * | 2022-09-13 | 2024-03-21 | Telefonaktiebolaget Lm Ericsson (Publ) | Adaptive inter-channel time difference estimation |
CN116368460A (zh) * | 2023-02-14 | 2023-06-30 | 北京小米移动软件有限公司 | Audio processing method and apparatus |
WO2024175587A1 (en) * | 2023-02-23 | 2024-08-29 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio signal representation decoding unit and audio signal representation encoding unit |
WO2024208964A1 (en) * | 2023-04-06 | 2024-10-10 | Telefonaktiebolaget Lm Ericsson (Publ) | Stabilization of rendering with varying detail |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150213809A1 (en) * | 2014-01-30 | 2015-07-30 | Qualcomm Incorporated | Coding independent frames of ambient higher-order ambisonic coefficients |
JP5933965B2 (ja) * | 2000-11-15 | 2016-06-15 | ドルビー・インターナショナル・アクチボラゲットDolby International Ab | Method for enhancing the performance of coding systems that use high-frequency reconstruction methods |
US9514757B2 (en) * | 2010-11-17 | 2016-12-06 | Panasonic Intellectual Property Corporation Of America | Stereo signal encoding device, stereo signal decoding device, stereo signal encoding method, and stereo signal decoding method |
CN107742521A (zh) * | 2016-08-10 | 2018-02-27 | 华为技术有限公司 | Multi-channel signal encoding method and encoder |
CN108885879A (zh) * | 2016-01-22 | 2018-11-23 | 弗劳恩霍夫应用研究促进协会 | Apparatus and method for encoding or decoding a multi-channel audio signal using frame control synchronization |
TW201909658A (zh) * | 2011-07-01 | 2019-03-01 | 美商杜比實驗室特許公司 | System and method for adaptive audio signal generation, coding and rendering |
CN109448741A (zh) * | 2018-11-22 | 2019-03-08 | 广州广晟数码技术有限公司 | 3D audio encoding and decoding method and apparatus |
CN110556120A (zh) * | 2014-06-27 | 2019-12-10 | 杜比国际公司 | Method for decoding a higher-order ambisonics (HOA) representation of a sound or sound field |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5793636B2 (ja) * | 2012-09-11 | 2015-10-14 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | Generation of comfort noise |
JP6790251B2 (ja) * | 2016-09-28 | 2020-11-25 | 華為技術有限公司Huawei Technologies Co.,Ltd. | Multi-channel audio signal processing method, apparatus, and system |
CN112334980B (zh) * | 2018-06-28 | 2024-05-14 | 瑞典爱立信有限公司 | Adaptive comfort noise parameter determination |
2021
- 2021-05-31 CN CN202180067397.4A patent/CN116348951A/zh active Pending
- 2021-05-31 AU AU2021317755A patent/AU2021317755B2/en active Active
- 2021-05-31 JP JP2023506177A patent/JP2023536156A/ja active Pending
- 2021-05-31 CA CA3187342A patent/CA3187342A1/en active Pending
- 2021-05-31 MX MX2023001152A patent/MX2023001152A/es unknown
- 2021-05-31 WO PCT/EP2021/064576 patent/WO2022022876A1/en active Application Filing
- 2021-05-31 KR KR1020237006968A patent/KR20230049660A/ko active Search and Examination
- 2021-05-31 EP EP21729320.8A patent/EP4189674A1/en active Pending
- 2021-05-31 BR BR112023001616A patent/BR112023001616A2/pt unknown
- 2021-07-29 TW TW112106853A patent/TW202347316A/zh unknown
- 2021-07-29 TW TW110127932A patent/TWI794911B/zh active
2023
- 2023-01-24 ZA ZA2023/01024A patent/ZA202301024B/en unknown
- 2023-01-27 US US18/160,894 patent/US20230306975A1/en active Pending
- 2023-12-27 AU AU2023286009A patent/AU2023286009A1/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5933965B2 (ja) * | 2000-11-15 | 2016-06-15 | ドルビー・インターナショナル・アクチボラゲットDolby International Ab | Method for enhancing the performance of coding systems that use high-frequency reconstruction methods |
US9514757B2 (en) * | 2010-11-17 | 2016-12-06 | Panasonic Intellectual Property Corporation Of America | Stereo signal encoding device, stereo signal decoding device, stereo signal encoding method, and stereo signal decoding method |
TW201909658A (zh) * | 2011-07-01 | 2019-03-01 | 美商杜比實驗室特許公司 | System and method for adaptive audio signal generation, coding and rendering |
US20150213809A1 (en) * | 2014-01-30 | 2015-07-30 | Qualcomm Incorporated | Coding independent frames of ambient higher-order ambisonic coefficients |
US20170032798A1 (en) * | 2014-01-30 | 2017-02-02 | Qualcomm Incorporated | Coding numbers of code vectors for independent frames of higher-order ambisonic coefficients |
CN110556120A (zh) * | 2014-06-27 | 2019-12-10 | 杜比国际公司 | Method for decoding a higher-order ambisonics (HOA) representation of a sound or sound field |
CN108885879A (zh) * | 2016-01-22 | 2018-11-23 | 弗劳恩霍夫应用研究促进协会 | Apparatus and method for encoding or decoding a multi-channel audio signal using frame control synchronization |
CN107742521A (zh) * | 2016-08-10 | 2018-02-27 | 华为技术有限公司 | Multi-channel signal encoding method and encoder |
CN109448741A (zh) * | 2018-11-22 | 2019-03-08 | 广州广晟数码技术有限公司 | 3D audio encoding and decoding method and apparatus |
Also Published As
Publication number | Publication date |
---|---|
TW202347316A (zh) | 2023-12-01 |
AU2021317755B2 (en) | 2023-11-09 |
CN116348951A (zh) | 2023-06-27 |
AU2021317755A1 (en) | 2023-03-02 |
US20230306975A1 (en) | 2023-09-28 |
EP4189674A1 (en) | 2023-06-07 |
AU2023286009A1 (en) | 2024-01-25 |
ZA202301024B (en) | 2024-04-24 |
CA3187342A1 (en) | 2022-02-03 |
TW202230333A (zh) | 2022-08-01 |
BR112023001616A2 (pt) | 2023-02-23 |
JP2023536156A (ja) | 2023-08-23 |
KR20230049660A (ko) | 2023-04-13 |
MX2023001152A (es) | 2023-04-05 |
WO2022022876A1 (en) | 2022-02-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
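TWI794911B (zh) | | Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene |
CN103474077B (zh) | | Audio signal decoder, method for providing an upmix signal representation |
US11361778B2 (en) | | Audio scene encoder, audio scene decoder and related methods using hybrid encoder-decoder spatial analysis |
TWI804004B (zh) | | Apparatus and method for encoding a plurality of audio objects using direction information during downmixing, and computer program |
JP2023546851A (ja) | | Apparatus and method for encoding a plurality of audio objects, or apparatus and method for decoding using two or more relevant audio objects |
Multrus et al. | | Immersive Voice and Audio Services (IVAS) codec-The new 3GPP standard for immersive communication |
JP2023549038A (ja) | | Apparatus, method, or computer program for processing an encoded audio scene using a parameter conversion |
RU2809587C1 (ru) | | Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene |
JP2023548650A (ja) | | Apparatus, method, or computer program for processing an encoded audio scene using a bandwidth extension |
JP2023549033A (ja) | | Apparatus, method, or computer program for processing an encoded audio scene using a parameter smoothing |
TW202341128A (zh) | | Apparatus and method for converting an audio stream |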