TWI794911B - 用以編碼音訊信號或用以解碼經編碼音訊場景之設備、方法及電腦程式 - Google Patents

用以編碼音訊信號或用以解碼經編碼音訊場景之設備、方法及電腦程式 Download PDF

Info

Publication number
TWI794911B
TWI794911B TW110127932A TW110127932A TWI794911B TW I794911 B TWI794911 B TW I794911B TW 110127932 A TW110127932 A TW 110127932A TW 110127932 A TW110127932 A TW 110127932A TW I794911 B TWI794911 B TW I794911B
Authority
TW
Taiwan
Prior art keywords
frame
signal
audio signal
audio
sound field
Prior art date
Application number
TW110127932A
Other languages
English (en)
Chinese (zh)
Other versions
TW202230333A (zh
Inventor
古拉米 福契斯
亞齊特 塔瑪拉普
安德利亞 尹申瑟
斯里坎特 寇斯
史蒂芬 多希拉
馬庫斯 穆爾特斯
Original Assignee
弗勞恩霍夫爾協會
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 弗勞恩霍夫爾協會 filed Critical 弗勞恩霍夫爾協會
Publication of TW202230333A publication Critical patent/TW202230333A/zh
Application granted granted Critical
Publication of TWI794911B publication Critical patent/TWI794911B/zh

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012Comfort noise or silence coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/173Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
TW110127932A 2020-07-30 2021-07-29 用以編碼音訊信號或用以解碼經編碼音訊場景之設備、方法及電腦程式 TWI794911B (zh)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
EP20188707.2 2020-07-30
EP20188707 2020-07-30
WOPCT/EP2021/064576 2021-05-31
PCT/EP2021/064576 WO2022022876A1 (en) 2020-07-30 2021-05-31 Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene

Publications (2)

Publication Number Publication Date
TW202230333A TW202230333A (zh) 2022-08-01
TWI794911B true TWI794911B (zh) 2023-03-01

Family

ID=71894727

Family Applications (2)

Application Number Title Priority Date Filing Date
TW112106853A TW202347316A (zh) 2020-07-30 2021-07-29 用以編碼音訊信號或用以解碼經編碼音訊場景之設備、方法及電腦程式
TW110127932A TWI794911B (zh) 2020-07-30 2021-07-29 用以編碼音訊信號或用以解碼經編碼音訊場景之設備、方法及電腦程式

Family Applications Before (1)

Application Number Title Priority Date Filing Date
TW112106853A TW202347316A (zh) 2020-07-30 2021-07-29 用以編碼音訊信號或用以解碼經編碼音訊場景之設備、方法及電腦程式

Country Status (12)

Country Link
US (1) US20230306975A1 (ja)
EP (1) EP4189674A1 (ja)
JP (1) JP2023536156A (ja)
KR (1) KR20230049660A (ja)
CN (1) CN116348951A (ja)
AU (2) AU2021317755B2 (ja)
BR (1) BR112023001616A2 (ja)
CA (1) CA3187342A1 (ja)
MX (1) MX2023001152A (ja)
TW (2) TW202347316A (ja)
WO (1) WO2022022876A1 (ja)
ZA (1) ZA202301024B (ja)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3719799A1 (en) * 2019-04-04 2020-10-07 FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. A multi-channel audio encoder, decoder, methods and computer program for switching between a parametric multi-channel operation and an individual channel operation
CN115150718A (zh) * 2022-06-30 2022-10-04 雷欧尼斯(北京)信息技术有限公司 一种车载沉浸式音频的播放方法和制作方法
WO2024051955A1 (en) 2022-09-09 2024-03-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder and decoding method for discontinuous transmission of parametrically coded independent streams with metadata
WO2024051954A1 (en) 2022-09-09 2024-03-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder and encoding method for discontinuous transmission of parametrically coded independent streams with metadata
WO2024056702A1 (en) * 2022-09-13 2024-03-21 Telefonaktiebolaget Lm Ericsson (Publ) Adaptive inter-channel time difference estimation
CN116368460A (zh) * 2023-02-14 2023-06-30 北京小米移动软件有限公司 音频处理方法、装置
WO2024175587A1 (en) * 2023-02-23 2024-08-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio signal representation decoding unit and audio signal representation encoding unit
WO2024208964A1 (en) * 2023-04-06 2024-10-10 Telefonaktiebolaget Lm Ericsson (Publ) Stabilization of rendering with varying detail

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150213809A1 (en) * 2014-01-30 2015-07-30 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
JP5933965B2 (ja) * 2000-11-15 2016-06-15 ドルビー・インターナショナル・アクチボラゲットDolby International Ab 高周波数の再構成方法を使用するコーディング・システムの性能拡大方法
US9514757B2 (en) * 2010-11-17 2016-12-06 Panasonic Intellectual Property Corporation Of America Stereo signal encoding device, stereo signal decoding device, stereo signal encoding method, and stereo signal decoding method
CN107742521A (zh) * 2016-08-10 2018-02-27 华为技术有限公司 多声道信号的编码方法和编码器
CN108885879A (zh) * 2016-01-22 2018-11-23 弗劳恩霍夫应用研究促进协会 使用帧控制同步来编码或解码多声道音频信号的装置和方法
TW201909658A (zh) * 2011-07-01 2019-03-01 美商杜比實驗室特許公司 用於適應性音頻信號的產生、譯碼與呈現之系統與方法
CN109448741A (zh) * 2018-11-22 2019-03-08 广州广晟数码技术有限公司 一种3d音频编码、解码方法及装置
CN110556120A (zh) * 2014-06-27 2019-12-10 杜比国际公司 用于解码声音或声场的高阶高保真度立体声响复制(hoa)表示的方法

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5793636B2 (ja) * 2012-09-11 2015-10-14 テレフオンアクチーボラゲット エル エム エリクソン(パブル) コンフォート・ノイズの生成
JP6790251B2 (ja) * 2016-09-28 2020-11-25 華為技術有限公司Huawei Technologies Co.,Ltd. マルチチャネルオーディオ信号処理方法、装置、およびシステム
CN112334980B (zh) * 2018-06-28 2024-05-14 瑞典爱立信有限公司 自适应舒适噪声参数确定

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5933965B2 (ja) * 2000-11-15 2016-06-15 ドルビー・インターナショナル・アクチボラゲットDolby International Ab 高周波数の再構成方法を使用するコーディング・システムの性能拡大方法
US9514757B2 (en) * 2010-11-17 2016-12-06 Panasonic Intellectual Property Corporation Of America Stereo signal encoding device, stereo signal decoding device, stereo signal encoding method, and stereo signal decoding method
TW201909658A (zh) * 2011-07-01 2019-03-01 美商杜比實驗室特許公司 用於適應性音頻信號的產生、譯碼與呈現之系統與方法
US20150213809A1 (en) * 2014-01-30 2015-07-30 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US20170032798A1 (en) * 2014-01-30 2017-02-02 Qualcomm Incorporated Coding numbers of code vectors for independent frames of higher-order ambisonic coefficients
CN110556120A (zh) * 2014-06-27 2019-12-10 杜比国际公司 用于解码声音或声场的高阶高保真度立体声响复制(hoa)表示的方法
CN108885879A (zh) * 2016-01-22 2018-11-23 弗劳恩霍夫应用研究促进协会 使用帧控制同步来编码或解码多声道音频信号的装置和方法
CN107742521A (zh) * 2016-08-10 2018-02-27 华为技术有限公司 多声道信号的编码方法和编码器
CN109448741A (zh) * 2018-11-22 2019-03-08 广州广晟数码技术有限公司 一种3d音频编码、解码方法及装置

Also Published As

Publication number Publication date
TW202347316A (zh) 2023-12-01
AU2021317755B2 (en) 2023-11-09
CN116348951A (zh) 2023-06-27
AU2021317755A1 (en) 2023-03-02
US20230306975A1 (en) 2023-09-28
EP4189674A1 (en) 2023-06-07
AU2023286009A1 (en) 2024-01-25
ZA202301024B (en) 2024-04-24
CA3187342A1 (en) 2022-02-03
TW202230333A (zh) 2022-08-01
BR112023001616A2 (pt) 2023-02-23
JP2023536156A (ja) 2023-08-23
KR20230049660A (ko) 2023-04-13
MX2023001152A (es) 2023-04-05
WO2022022876A1 (en) 2022-02-03

Similar Documents

Publication Publication Date Title
TWI794911B (zh) 用以編碼音訊信號或用以解碼經編碼音訊場景之設備、方法及電腦程式
CN103474077B (zh) 音频信号译码器、提供上混信号表示型态的方法
US11361778B2 (en) Audio scene encoder, audio scene decoder and related methods using hybrid encoder-decoder spatial analysis
TWI804004B (zh) 在降混過程中使用方向資訊對多個音頻對象進行編碼的設備和方法、及電腦程式
JP2023546851A (ja) 複数の音声オブジェクトをエンコードする装置および方法、または2つ以上の関連する音声オブジェクトを使用してデコードする装置および方法
Multrus et al. Immersive Voice and Audio Services (IVAS) codec-The new 3GPP standard for immersive communication
JP2023549038A (ja) パラメータ変換を用いて符号化されたオーディオシーンを処理するための装置、方法、またはコンピュータプログラム
RU2809587C1 (ru) Устройство, способ и компьютерная программа для кодирования звукового сигнала или для декодирования кодированной аудиосцены
JP2023548650A (ja) 帯域幅拡張を用いて符号化されたオーディオシーンを処理するための装置、方法、またはコンピュータプログラム
JP2023549033A (ja) パラメータ平滑化を用いて符号化されたオーディオシーンを処理するための装置、方法、またはコンピュータプログラム
TW202341128A (zh) 轉換音訊串流之設備及方法