JPWO2019207036A5 - - Google Patents

Download PDF

Info

Publication number
JPWO2019207036A5
JPWO2019207036A5 JP2020559398A JP2020559398A JPWO2019207036A5 JP WO2019207036 A5 JPWO2019207036 A5 JP WO2019207036A5 JP 2020559398 A JP2020559398 A JP 2020559398A JP 2020559398 A JP2020559398 A JP 2020559398A JP WO2019207036 A5 JPWO2019207036 A5 JP WO2019207036A5
Authority
JP
Japan
Prior art keywords
audio
audio signal
mode parameter
high frequency
frequency reconstruction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2020559398A
Other languages
Japanese (ja)
Other versions
JP2021522543A (en
JP7252976B2 (en
Publication date
Application filed filed Critical
Priority claimed from PCT/EP2019/060600 external-priority patent/WO2019207036A1/en
Publication of JP2021522543A publication Critical patent/JP2021522543A/en
Publication of JPWO2019207036A5 publication Critical patent/JPWO2019207036A5/ja
Priority to JP2023048111A priority Critical patent/JP7493076B2/en
Application granted granted Critical
Publication of JP7252976B2 publication Critical patent/JP7252976B2/en
Priority to JP2024081437A priority patent/JP2024099068A/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Claims (6)

オーディオ信号の高周波再構成を実行する方法であって、当該方法は、
符号化されたオーディオビットストリームを受信し、該符号化されたオーディオビットストリームは、前記オーディオ信号の低帯域部分を表すオーディオデータと、高周波再構成メタデータとを含み、
前記オーディオデータを復号して、復号された低帯域オーディオ信号を生成し、
前記符号化されたオーディオビットストリームから前記高周波再構成メタデータを抽出し、前記高周波再構成メタデータは、高周波再構成プロセスのための動作パラメータを含み、該動作パラメータは、前記符号化されたオーディオビットストリームの後方互換拡張コンテナ内に置かれたパッチングモードパラメータを含み、該パッチングモードパラメータの第1の値は、スペクトル変換を指し示し、該パッチングモードパラメータの第2の値は、位相ボコーダ周波数拡散による高調波トランスポジションを指し示し、
前記復号された低帯域オーディオ信号をフィルタリングして、フィルタリングされた低帯域オーディオ信号を生成し、
前記フィルタリングされた低帯域オーディオ信号及び前記高周波再構成メタデータを用いて、前記オーディオ信号の高帯域部分を再生成し、当該再生成することは、前記パッチングモードパラメータが前記第1の値である場合にスペクトル変換を含み、当該再生成することは、前記パッチングモードパラメータが前記第2の値である場合に位相ボコーダ周波数拡散による高調波トランスポジションを含み、
前記フィルタリングされた低帯域オーディオ信号を前記再生成された高帯域部分と組み合わせて、広帯域オーディオ信号を形成する、
ことを有し、
前記フィルタリングすること、前記再生成すること、及び前記組み合わせることは、オーディオチャンネル当たり3010サンプルの遅延で後処理動作として実行され、その結果、合成時間が合成ユニット内の3011番目のオーディオサンプルに適用される
方法。
A method of performing high frequency reconstruction of an audio signal, the method comprising:
receiving an encoded audio bitstream, the encoded audio bitstream including audio data representing a lowband portion of the audio signal and high frequency reconstruction metadata;
decoding the audio data to produce a decoded low-band audio signal;
extracting the high-frequency reconstruction metadata from the encoded audio bitstream, the high-frequency reconstruction metadata including operating parameters for a high-frequency reconstruction process, the operating parameters corresponding to the encoded audio; a patching mode parameter placed in a backwards compatible extension container of the bitstream, a first value of the patching mode parameter indicating spectral transform, and a second value of the patching mode parameter by phase vocoder frequency spreading; pointing to the harmonic transposition,
filtering the decoded low-band audio signal to produce a filtered low-band audio signal;
regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata, the regenerating wherein the patching mode parameter is the first value; wherein said regenerating comprises harmonic transposition by phase vocoder frequency spreading when said patching mode parameter is said second value;
combining the filtered lowband audio signal with the regenerated highband portion to form a wideband audio signal;
have the
Said filtering, said regenerating and said combining are performed as post-processing operations with a delay of 3010 samples per audio channel , so that synthesis time is applied to the 3011th audio sample in the synthesis unit. Ru
Method.
前記符号化されたオーディオビットストリームは更に充填要素を含み、該充填要素は、該充填要素の始まりを指し示す識別子と、該識別子の後の充填データとを有し、該充填データが前記後方互換拡張コンテナを含む、請求項1に記載の方法。 The encoded audio bitstream further includes a filler element, the filler element having an identifier pointing to the beginning of the filler element and filler data following the identifier, the filler data being the backward compatible extension. 2. The method of claim 1, comprising a container. 位相ボコーダ周波数拡散による前記高調波トランスポジションは、毎秒450万演算以下及び3kワード以下のメモリの推定複雑度で実行される、請求項1又は2に記載の方法。 3. The method of claim 1 or 2 , wherein the harmonic transposition by phase vocoder frequency spreading is performed with an estimated memory complexity of no more than 4.5 million operations per second and no more than 3 kwords. 命令を有するコンピュータプログラムであって、前記命令は、コンピューティング装置又はシステムによって実行されるときに、該コンピューティング装置又はシステムに、請求項1乃至のいずれかに記載の方法を実行させる、コンピュータプログラム。 A computer program comprising instructions which, when executed by a computing device or system, cause the computing device or system to perform the method of any of claims 1-3 . program. オーディオ信号の高周波再構成を実行するオーディオ処理ユニットであって、当該オーディオ処理ユニットは、
符号化されたオーディオビットストリームを受信する入力インタフェースであり、前記符号化されたオーディオビットストリームは、前記オーディオ信号の低帯域部分を表すオーディオデータと、高周波再構成メタデータとを含む、入力インタフェースと、
前記オーディオデータを復号して、復号された低帯域オーディオ信号を生成するコアオーディオデコーダと、
前記符号化されたオーディオビットストリームから前記高周波再構成メタデータを抽出するデフォーマッタであり、前記高周波再構成メタデータは、高周波再構成プロセスのための動作パラメータを含み、該動作パラメータは、前記符号化されたオーディオビットストリームの後方互換拡張コンテナ内に置かれたパッチングモードパラメータを含み、該パッチングモードパラメータの第1の値は、スペクトル変換を指し示し、該パッチングモードパラメータの第2の値は、位相ボコーダ周波数拡散による高調波トランスポジションを指し示す、デフォーマッタと、
前記復号された低帯域オーディオ信号をフィルタリングして、フィルタリングされた低帯域オーディオ信号を生成する分析フィルタバンクと、
前記フィルタリングされた低帯域オーディオ信号及び前記高周波再構成メタデータを用いて、前記オーディオ信号の高帯域部分を再構成する高周波リジェネレータであり、前記再構成することは、前記パッチングモードパラメータが前記第1の値である場合にスペクトル変換を含み、前記再構成することは、前記パッチングモードパラメータが前記第2の値である場合に位相ボコーダ周波数拡散による高調波トランスポジションを含む、高周波リジェネレータと、
前記フィルタリングされた低帯域オーディオ信号を前記再構成された高帯域部分と組み合わせて、広帯域オーディオ信号を形成する合成フィルタバンクと、
を有し、
前記分析フィルタバンク、前記高周波リジェネレータ、及び前記合成フィルタバンクは、オーディオチャンネル当たり3010サンプルの遅延でポストプロセッサにて実行され、その結果、合成時間が合成ユニット内の3011番目のオーディオサンプルに適用される
オーディオ処理ユニット。
An audio processing unit for performing high frequency reconstruction of an audio signal, the audio processing unit comprising:
an input interface for receiving an encoded audio bitstream, the encoded audio bitstream including audio data representing a lowband portion of the audio signal and high frequency reconstruction metadata; and ,
a core audio decoder that decodes the audio data to produce a decoded lowband audio signal;
A deformatter for extracting the high frequency reconstruction metadata from the encoded audio bitstream, the high frequency reconstruction metadata including operating parameters for a high frequency reconstruction process, the operating parameters being associated with the encoding. a patching mode parameter placed in a backward compatible extension container of the encoded audio bitstream, a first value of the patching mode parameter indicating a spectral transform and a second value of the patching mode parameter indicating a phase a deformatter indicating harmonic transposition by vocoder frequency spreading;
an analysis filterbank for filtering the decoded lowband audio signal to produce a filtered lowband audio signal;
A high-frequency regenerator for reconstructing a high-band portion of the audio signal using the filtered low-band audio signal and the high-frequency reconstruction metadata, wherein the reconstructing comprises: a high frequency regenerator comprising spectral conversion when it is a value of 1 and said reconstructing comprises harmonic transposition by phase vocoder frequency spreading when said patching mode parameter is said second value;
a synthesis filterbank that combines the filtered lowband audio signal with the reconstructed highband portion to form a wideband audio signal;
has
The analysis filterbank, the high frequency regenerator and the synthesis filterbank are executed in a post-processor with a delay of 3010 samples per audio channel so that the synthesis time is applied to the 3011th audio sample in the synthesis unit. Ru
audio processing unit.
位相ボコーダ周波数拡散による前記高調波トランスポジションは、毎秒450万演算以下及び3kワード以下のメモリの推定複雑度で実行される、請求項に記載のオーディオ処理ユニット。 6. The audio processing unit of claim 5 , wherein the harmonic transposition by phase vocoder frequency spreading is performed with an estimated memory complexity of less than or equal to 4.5 million operations per second and less than or equal to 3 kwords.
JP2020559398A 2018-04-25 2019-04-25 Integration of high-frequency reconstruction techniques with post-processing delay reduction Active JP7252976B2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2023048111A JP7493076B2 (en) 2018-04-25 2023-03-24 Integration of high frequency reconstruction techniques with post-processing delay reduction
JP2024081437A JP2024099068A (en) 2018-04-25 2024-05-20 Integration of high frequency reconstruction techniques with reduced post-processing delay

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP18169156 2018-04-25
EP18169156.9 2018-04-25
PCT/EP2019/060600 WO2019207036A1 (en) 2018-04-25 2019-04-25 Integration of high frequency audio reconstruction techniques

Related Child Applications (1)

Application Number Title Priority Date Filing Date
JP2023048111A Division JP7493076B2 (en) 2018-04-25 2023-03-24 Integration of high frequency reconstruction techniques with post-processing delay reduction

Publications (3)

Publication Number Publication Date
JP2021522543A JP2021522543A (en) 2021-08-30
JPWO2019207036A5 true JPWO2019207036A5 (en) 2023-01-25
JP7252976B2 JP7252976B2 (en) 2023-04-05

Family

ID=62063367

Family Applications (3)

Application Number Title Priority Date Filing Date
JP2020559398A Active JP7252976B2 (en) 2018-04-25 2019-04-25 Integration of high-frequency reconstruction techniques with post-processing delay reduction
JP2023048111A Active JP7493076B2 (en) 2018-04-25 2023-03-24 Integration of high frequency reconstruction techniques with post-processing delay reduction
JP2024081437A Pending JP2024099068A (en) 2018-04-25 2024-05-20 Integration of high frequency reconstruction techniques with reduced post-processing delay

Family Applications After (2)

Application Number Title Priority Date Filing Date
JP2023048111A Active JP7493076B2 (en) 2018-04-25 2023-03-24 Integration of high frequency reconstruction techniques with post-processing delay reduction
JP2024081437A Pending JP2024099068A (en) 2018-04-25 2024-05-20 Integration of high frequency reconstruction techniques with reduced post-processing delay

Country Status (15)

Country Link
US (7) US11527256B2 (en)
EP (1) EP3785260A1 (en)
JP (3) JP7252976B2 (en)
KR (1) KR20210005164A (en)
CN (9) CN118800273A (en)
AU (3) AU2019258524B2 (en)
BR (1) BR112020021832A2 (en)
CA (1) CA3098064A1 (en)
CL (1) CL2020002745A1 (en)
IL (4) IL278223B2 (en)
MA (1) MA52530A (en)
MX (10) MX2020011206A (en)
SG (1) SG11202010374VA (en)
WO (1) WO2019207036A1 (en)
ZA (2) ZA202006518B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113436603B (en) * 2021-06-28 2023-05-02 北京达佳互联信息技术有限公司 Method and device for training vocoder and method and vocoder for synthesizing audio signals
CN114519121A (en) * 2021-12-30 2022-05-20 赛因芯微(北京)电子科技有限公司 Audio serial metadata block generation method, device, equipment and storage medium

Family Cites Families (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE512719C2 (en) * 1997-06-10 2000-05-02 Lars Gustaf Liljeryd A method and apparatus for reducing data flow based on harmonic bandwidth expansion
SE9903553D0 (en) 1999-01-27 1999-10-01 Lars Liljeryd Enhancing conceptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL)
SE0001926D0 (en) 2000-05-23 2000-05-23 Lars Liljeryd Improved spectral translation / folding in the subband domain
EP1423847B1 (en) 2001-11-29 2005-02-02 Coding Technologies AB Reconstruction of high frequency components
US7447631B2 (en) 2002-06-17 2008-11-04 Dolby Laboratories Licensing Corporation Audio coding system using spectral hole filling
WO2003104924A2 (en) * 2002-06-05 2003-12-18 Sonic Focus, Inc. Acoustical virtual reality engine and advanced techniques for enhancing delivered sound
US7933945B2 (en) 2002-06-27 2011-04-26 Openpeak Inc. Method, system, and computer program product for managing controlled residential or non-residential environments
US6792057B2 (en) * 2002-08-29 2004-09-14 Bae Systems Information And Electronic Systems Integration Inc Partial band reconstruction of frequency channelized filters
CN1989546B (en) 2004-07-20 2011-07-13 松下电器产业株式会社 Sound encoder and sound encoding method
US8260609B2 (en) 2006-07-31 2012-09-04 Qualcomm Incorporated Systems, methods, and apparatus for wideband encoding and decoding of inactive frames
CN101140759B (en) * 2006-09-08 2010-05-12 华为技术有限公司 Band-width spreading method and system for voice or audio signal
US20100250260A1 (en) 2007-11-06 2010-09-30 Lasse Laaksonen Encoder
CN101458930B (en) 2007-12-12 2011-09-14 华为技术有限公司 Excitation signal generation in bandwidth spreading and signal reconstruction method and apparatus
DE102008015702B4 (en) 2008-01-31 2010-03-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for bandwidth expansion of an audio signal
KR101230479B1 (en) * 2008-03-10 2013-02-06 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. Device and method for manipulating an audio signal having a transient event
ES2796552T3 (en) 2008-07-11 2020-11-27 Fraunhofer Ges Forschung Audio signal synthesizer and audio signal encoder
JP5203077B2 (en) 2008-07-14 2013-06-05 株式会社エヌ・ティ・ティ・ドコモ Speech coding apparatus and method, speech decoding apparatus and method, and speech bandwidth extension apparatus and method
UA99878C2 (en) * 2009-01-16 2012-10-10 Долби Интернешнл Аб Cross product enhanced harmonic transposition
TWI662788B (en) 2009-02-18 2019-06-11 瑞典商杜比國際公司 Complex exponential modulated filter bank for high frequency reconstruction or parametric stereo
KR101433701B1 (en) 2009-03-17 2014-08-28 돌비 인터네셔널 에이비 Advanced stereo coding based on a combination of adaptively selectable left/right or mid/side stereo coding and of parametric stereo coding
CO6440537A2 (en) * 2009-04-09 2012-05-15 Fraunhofer Ges Forschung APPARATUS AND METHOD TO GENERATE A SYNTHESIS AUDIO SIGNAL AND TO CODIFY AN AUDIO SIGNAL
TWI556227B (en) * 2009-05-27 2016-11-01 杜比國際公司 Systems and methods for generating a high frequency component of a signal from a low frequency component of the signal, a set-top box, a computer program product and storage medium thereof
US8971551B2 (en) * 2009-09-18 2015-03-03 Dolby International Ab Virtual bass synthesis using harmonic transposition
ES2400661T3 (en) 2009-06-29 2013-04-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoding and decoding bandwidth extension
US8515768B2 (en) * 2009-08-31 2013-08-20 Apple Inc. Enhanced audio decoder
JP5433022B2 (en) * 2009-09-18 2014-03-05 ドルビー インターナショナル アーベー Harmonic conversion
ES2805349T3 (en) * 2009-10-21 2021-02-11 Dolby Int Ab Oversampling in a Combined Re-emitter Filter Bank
KR102020334B1 (en) 2010-01-19 2019-09-10 돌비 인터네셔널 에이비 Improved subband block based harmonic transposition
ES2522171T3 (en) 2010-03-09 2014-11-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for processing an audio signal using patching edge alignment
PL2545551T3 (en) 2010-03-09 2018-03-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Improved magnitude response and temporal alignment in phase vocoder based bandwidth extension for audio signals
EP4398249A3 (en) 2010-04-13 2024-07-24 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoding sample-accurate representation of an audio signal
CN103155033B (en) * 2010-07-19 2014-10-22 杜比国际公司 Processing of audio signals during high frequency reconstruction
US9047875B2 (en) * 2010-07-19 2015-06-02 Futurewei Technologies, Inc. Spectrum flatness control for bandwidth extension
US8996976B2 (en) 2011-09-06 2015-03-31 Microsoft Technology Licensing, Llc Hyperlink destination visibility
JP2013068587A (en) 2011-09-22 2013-04-18 Nippon Water Solution:Kk Portable time integration water leakage detection device
EP3544006A1 (en) * 2011-11-11 2019-09-25 Dolby International AB Upsampling using oversampled sbr
US9424849B2 (en) 2011-12-14 2016-08-23 Cirrus Logic, Inc. Data transfer
EP2631906A1 (en) 2012-02-27 2013-08-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Phase coherence control for harmonic signals in perceptual audio codecs
CN102842337A (en) * 2012-06-05 2012-12-26 国光电器股份有限公司 High-fidelity audio transmission method based on WIFI (Wireless Fidelity)
RU2665281C2 (en) * 2013-09-12 2018-08-28 Долби Интернэшнл Аб Quadrature mirror filter based processing data time matching
EP2881943A1 (en) * 2013-12-09 2015-06-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for decoding an encoded audio signal with low computational resources
WO2015150384A1 (en) * 2014-04-01 2015-10-08 Dolby International Ab Efficient coding of audio scenes comprising audio objects
TWI693594B (en) * 2015-03-13 2020-05-11 瑞典商杜比國際公司 Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
GR1008810B (en) * 2015-03-19 2016-07-07 Νικολαος Ευστρατιου Καβουνης Natural sparkling wine enriched with organic kozani's crocus (greek saffron)
EP3208800A1 (en) 2016-02-17 2017-08-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for stereo filing in multichannel coding
EP3382700A1 (en) 2017-03-31 2018-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for post-processing an audio signal using a transient location detection
TWI702594B (en) 2018-01-26 2020-08-21 瑞典商都比國際公司 Backward-compatible integration of high frequency reconstruction techniques for audio signals

Similar Documents

Publication Publication Date Title
JP7228607B2 (en) Audio encoder and decoder using frequency domain processor and time domain processor with full-band gap filling
IL278573B2 (en) Backward-compatible integration of high frequency reconstruction techniques for audio signals
JP2021157202A5 (en)
US10373623B2 (en) Apparatus and method for processing an audio signal to obtain a processed audio signal using a target time-domain envelope
JP6568566B2 (en) Encoded audio signal decoding apparatus, method and computer program
JP6838091B2 (en) Audio coders and decoders that use frequency domain processors, time domain processors and cross-processors for continuous initialization
IL313348A (en) Integration of high frequency reconstruction techniques with reduced post-processing delay
TWI469136B (en) Apparatus and method for processing a decoded audio signal in a spectral domain
KR101852749B1 (en) Audio bandwidth extension by insertion of temporal pre-shaped noise in frequency domain
JP6775063B2 (en) Improved frequency band expansion in audio signal decoders
CN103477388A (en) Hybrid sound-signal decoder, hybrid sound-signal encoder, sound-signal decoding method, and sound-signal encoding method
EA038268B1 (en) Backward-compatible integration of harmonic transposer for high frequency reconstruction of audio signals
IL313391A (en) Integration of high frequency audio reconstruction techniques
KR20160053999A (en) Time-alignment of qmf based processing data
JPWO2019207036A5 (en)
RU2020138094A (en) INTEGRATION OF HIGH FREQUENCY SOUND RECONSTRUCTION TECHNIQUES
RU2023105629A (en) INTEGRATION OF HIGH FREQUENCY SOUND RECONSTRUCTION TECHNIQUES
RU2021128983A (en) BACKWARDS COMPATIBLE INTEGRATION OF HIGH-FREQUENCY RECOVERY METHODS FOR AUDIO SIGNALS
EA044947B1 (en) BACKWARD COMPATIBLE HARMONIC TRANSPOSER LAYOUT FOR RECONSTRUCTION OF HIGH FREQUENCY AUDIO SIGNALS
KR20090043352A (en) Method and system to encode/decode audio/speech signal for supporting interoperability