US10657979B2 - Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information - Google Patents
Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information Download PDFInfo
- Publication number
- US10657979B2 US10657979B2 US14/811,722 US201514811722A US10657979B2 US 10657979 B2 US10657979 B2 US 10657979B2 US 201514811722 A US201514811722 A US 201514811722A US 10657979 B2 US10657979 B2 US 10657979B2
- Authority
- US
- United States
- Prior art keywords
- signal
- side information
- parametric representation
- selection side
- generating
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 79
- 238000000034 method Methods 0.000 title claims description 59
- 230000003595 spectral effect Effects 0.000 claims abstract description 36
- 230000004044 response Effects 0.000 claims abstract description 28
- 238000013179 statistical model Methods 0.000 claims description 53
- 238000004590 computer program Methods 0.000 claims description 14
- 230000005284 excitation Effects 0.000 claims description 11
- 230000000694 effects Effects 0.000 claims description 10
- 238000004458 analytical method Methods 0.000 claims description 8
- 238000003860 storage Methods 0.000 claims description 8
- 238000005457 optimization Methods 0.000 claims description 6
- 238000003786 synthesis reaction Methods 0.000 claims description 6
- 230000015572 biosynthetic process Effects 0.000 claims description 5
- 238000012545 processing Methods 0.000 claims description 5
- 238000001914 filtration Methods 0.000 claims description 4
- 238000005516 engineering process Methods 0.000 description 11
- 230000005540 biological transmission Effects 0.000 description 7
- 238000000605 extraction Methods 0.000 description 7
- 238000013459 approach Methods 0.000 description 5
- 230000008569 process Effects 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 3
- 230000004075 alteration Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
- G10L19/265—Pre-filtering, e.g. high frequency emphasis prior to encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
- G10L21/0388—Details of processing therefor
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/69—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
Definitions
- FIG. 5 illustrates a advantageous implementation of the signal estimator controlled by a voice activity detector or a speech/non-speech detector
- the feature extractor can also operate or extract a feature from the encoded core signal.
- the encoded core signal comprises a representation of scale factors for frequency bands or any other representation of audio information.
- the encoded representation of the audio signal is representative for the decoded core signal and, therefore features can be extracted.
- a feature can be extracted not only from a fully decoded core signal but also from a partly decoded core signal.
- the encoded signal is representing a frequency domain representation comprising a sequence of spectral frames. The encoded core signal can, therefore, be only partly decoded to obtain a decoded representation of a sequence of spectral frames, before actually performing a spectrum-time conversion.
- the feature extractor 104 can extract features either from the encoded core signal or a partly decoded core signal or a fully decoded core signal.
- the feature extractor 104 can be implemented, with respect to its extracted features as known in the art and the feature extractor may, for example, be implemented as in audio fingerprinting or audio ID technologies.
- FIGS. 9 to 11 Reference is made to FIGS. 9 to 11 .
- FIG. 8 illustrates an exemplary representation of the encoded input signal.
- the encoded input signal consists of subsequent frames 800 , 806 , 812 .
- Each frame has the encoded core signal.
- frame 800 has speech as the encoded core signal.
- Frame 806 has music as the encoded core signal and frame 812 again has speech as the encoded core signal.
- Frame 800 has, exemplarily, as the side information only the selection side information but no SBR side information.
- frame 800 corresponds to FIG. 9 or FIG. 10 .
- frame 806 comprises SBR information but does not contain any selection side information.
- frame 812 comprises an encoded speech signal and, in contrast to frame 800 , frame 812 does not contain any selection side information. This is due to the fact that the selection side information are not necessary, since any ambiguities in the feature extraction/statistical model process have not been found on the encoder-side.
- FIG. 12 illustrates an encoder for generating an encoded signal 1212 .
- the encoder comprises a core encoder 1200 for encoding an original signal 1206 to obtain an encoded core audio signal 1208 having information on a smaller number of frequency bands compared to the original signal 1206 .
- a selection side information generator 1202 for generating selection side information 1210 (SSI—selection side information) is provided.
- the selection side information 1210 indicate a defined parametric representation alternative provided by a statistical model in response to a feature extracted from the original signal 1206 or from the encoded audio signal 1208 or from a decoded version of the encoded audio signal.
- the encoder comprises an output interface 1204 for outputting the encoded signal 1212 .
- the encoded signal 1212 comprises the encoded audio signal 1208 and the selection side information 1210 .
- the selection side information generator 1202 is implemented as illustrated in FIG. 13 .
- the selection side information generator 1202 comprises a core decoder 1300 .
- the feature extractor 1302 is provided which operates on the decoded core signal output by block 1300 .
- the feature is input into a statistical model processor 1304 for generating a number of parametric representation alternatives for estimating a spectral range of a frequency enhanced signal not defined by the decoded core signal output by block 1300 .
- These parametric representation alternatives 1305 are all input into a signal estimator 1306 for estimating a frequency enhanced audio signal 1307 .
- the metadata extracted by the metadata extractor 1400 is discarded in the encoder and is not transmitted in the encoded signal 1212 . Instead, the selection side information 1210 is transmitted in the encoded signal together with the encoded audio signal 1208 generated by the core encoder which has a different frequency content and, typically, a smaller frequency content compared to the finally generated decoded signal or compared to the original signal 1206 .
- a further embodiment according to the invention comprises an apparatus or a system configured to transfer (for example, electronically or optically) a computer program for performing one of the methods described herein to a receiver.
- the receiver may, for example, be a computer, a mobile device, a memory device or the like.
- the apparatus or system may, for example, comprise a file server for transferring the computer program to the receiver.
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Stereophonic System (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/811,722 US10657979B2 (en) | 2013-01-29 | 2015-07-28 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
US15/668,473 US10186274B2 (en) | 2013-01-29 | 2017-08-03 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
US15/668,375 US10062390B2 (en) | 2013-01-29 | 2017-08-03 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201361758092P | 2013-01-29 | 2013-01-29 | |
PCT/EP2014/051591 WO2014118155A1 (en) | 2013-01-29 | 2014-01-28 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
US14/811,722 US10657979B2 (en) | 2013-01-29 | 2015-07-28 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2014/051591 Continuation WO2014118155A1 (en) | 2013-01-29 | 2014-01-28 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/668,375 Continuation US10062390B2 (en) | 2013-01-29 | 2017-08-03 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
US15/668,473 Continuation US10186274B2 (en) | 2013-01-29 | 2017-08-03 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
Publications (2)
Publication Number | Publication Date |
---|---|
US20150332701A1 US20150332701A1 (en) | 2015-11-19 |
US10657979B2 true US10657979B2 (en) | 2020-05-19 |
Family
ID=50023570
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/811,722 Active 2036-01-20 US10657979B2 (en) | 2013-01-29 | 2015-07-28 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
US15/668,473 Active US10186274B2 (en) | 2013-01-29 | 2017-08-03 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
US15/668,375 Active US10062390B2 (en) | 2013-01-29 | 2017-08-03 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/668,473 Active US10186274B2 (en) | 2013-01-29 | 2017-08-03 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
US15/668,375 Active US10062390B2 (en) | 2013-01-29 | 2017-08-03 | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
Country Status (19)
Country | Link |
---|---|
US (3) | US10657979B2 (ko) |
EP (3) | EP2951828B1 (ko) |
JP (3) | JP6096934B2 (ko) |
KR (3) | KR101775084B1 (ko) |
CN (3) | CN105103229B (ko) |
AR (1) | AR094673A1 (ko) |
AU (3) | AU2014211523B2 (ko) |
BR (1) | BR112015018017B1 (ko) |
CA (4) | CA3013766C (ko) |
ES (3) | ES2725358T3 (ko) |
HK (1) | HK1218460A1 (ko) |
MX (1) | MX345622B (ko) |
MY (1) | MY172752A (ko) |
RU (3) | RU2627102C2 (ko) |
SG (3) | SG11201505925SA (ko) |
TR (1) | TR201906190T4 (ko) |
TW (3) | TWI585755B (ko) |
WO (1) | WO2014118155A1 (ko) |
ZA (1) | ZA201506313B (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20230016637A1 (en) * | 2021-07-07 | 2023-01-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and Method for End-to-End Adversarial Blind Bandwidth Extension with one or more Convolutional and/or Recurrent Networks |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR3008533A1 (fr) * | 2013-07-12 | 2015-01-16 | Orange | Facteur d'echelle optimise pour l'extension de bande de frequence dans un decodeur de signaux audiofrequences |
TWI693594B (zh) | 2015-03-13 | 2020-05-11 | 瑞典商杜比國際公司 | 解碼具有增強頻譜帶複製元資料在至少一填充元素中的音訊位元流 |
US10008214B2 (en) * | 2015-09-11 | 2018-06-26 | Electronics And Telecommunications Research Institute | USAC audio signal encoding/decoding apparatus and method for digital radio services |
WO2019081070A1 (en) * | 2017-10-27 | 2019-05-02 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | APPARATUS, METHOD, OR COMPUTER PROGRAM PRODUCT FOR GENERATING ENHANCED BANDWIDTH AUDIO SIGNAL USING NEURAL NETWORK PROCESSOR |
KR102556098B1 (ko) * | 2017-11-24 | 2023-07-18 | 한국전자통신연구원 | 심리음향 기반 가중된 오류 함수를 이용한 오디오 신호 부호화 방법 및 장치, 그리고 오디오 신호 복호화 방법 및 장치 |
CN108399913B (zh) * | 2018-02-12 | 2021-10-15 | 北京容联易通信息技术有限公司 | 高鲁棒性音频指纹识别方法及系统 |
US11929085B2 (en) | 2018-08-30 | 2024-03-12 | Dolby International Ab | Method and apparatus for controlling enhancement of low-bitrate coded audio |
WO2021158531A1 (en) * | 2020-02-03 | 2021-08-12 | Pindrop Security, Inc. | Cross-channel enrollment and authentication of voice biometrics |
CN113808596A (zh) * | 2020-05-30 | 2021-12-17 | 华为技术有限公司 | 一种音频编码方法和音频编码装置 |
CN112233685B (zh) * | 2020-09-08 | 2024-04-19 | 厦门亿联网络技术股份有限公司 | 基于深度学习注意力机制的频带扩展方法及装置 |
KR20220151953A (ko) | 2021-05-07 | 2022-11-15 | 한국전자통신연구원 | 부가 정보를 이용한 오디오 신호의 부호화 및 복호화 방법과 그 방법을 수행하는 부호화기 및 복호화기 |
CN114443891B (zh) * | 2022-01-14 | 2022-12-06 | 北京有竹居网络技术有限公司 | 编码器的生成方法、指纹提取方法、介质及电子设备 |
Citations (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0720148A1 (en) | 1994-12-30 | 1996-07-03 | AT&T Corp. | Method for noise weighting filtering |
US20060140412A1 (en) * | 2004-11-02 | 2006-06-29 | Lars Villemoes | Multi parametrisation based multi-channel reconstruction |
US20070019813A1 (en) * | 2005-07-19 | 2007-01-25 | Johannes Hilpert | Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding |
US20070094027A1 (en) | 2005-10-21 | 2007-04-26 | Nokia Corporation | Methods and apparatus for implementing embedded scalable encoding and decoding of companded and vector quantized audio data |
US20070208557A1 (en) * | 2006-03-03 | 2007-09-06 | Microsoft Corporation | Perceptual, scalable audio compression |
US20070255572A1 (en) | 2004-08-27 | 2007-11-01 | Shuji Miyasaka | Audio Decoder, Method and Program |
JP2007328268A (ja) | 2006-06-09 | 2007-12-20 | Kddi Corp | 音楽信号の帯域拡張方式 |
US20080154583A1 (en) * | 2004-08-31 | 2008-06-26 | Matsushita Electric Industrial Co., Ltd. | Stereo Signal Generating Apparatus and Stereo Signal Generating Method |
US20090282298A1 (en) * | 2008-05-08 | 2009-11-12 | Broadcom Corporation | Bit error management methods for wireless audio communication channels |
US20100046762A1 (en) * | 2001-07-10 | 2010-02-25 | Coding Technologies Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
TW201009808A (en) | 2008-07-11 | 2010-03-01 | Fraunhofer Ges Forschung | An apparatus and a method for decoding an encoded audio signal |
US20100080397A1 (en) | 2008-09-26 | 2010-04-01 | Fujitsu Limted | Audio decoding method and apparatus |
WO2010058518A1 (ja) | 2008-11-21 | 2010-05-27 | パナソニック株式会社 | オーディオ再生装置及びオーディオ再生方法 |
US7751572B2 (en) * | 2005-04-15 | 2010-07-06 | Dolby International Ab | Adaptive residual audio coding |
EP2239732A1 (en) | 2009-04-09 | 2010-10-13 | Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. | Apparatus and method for generating a synthesis audio signal and for encoding an audio signal |
US20110004479A1 (en) * | 2009-01-28 | 2011-01-06 | Dolby International Ab | Harmonic transposition |
TW201104674A (en) | 2009-04-28 | 2011-02-01 | Fraunhofer Ges Forschung | Apparatus for providing one or more adjusted parameters for a provision of an upmix signal representation on the basis of a downmix signal representation, audio signal decoder, audio signal transcoder, audio signal encoder, audio bitstream, method and co |
US20110054885A1 (en) | 2008-01-31 | 2011-03-03 | Frederik Nagel | Device and Method for a Bandwidth Extension of an Audio Signal |
WO2011047886A1 (en) | 2009-10-21 | 2011-04-28 | Dolby International Ab | Apparatus and method for generating a high frequency audio signal using adaptive oversampling |
US20110173006A1 (en) | 2008-07-11 | 2011-07-14 | Frederik Nagel | Audio Signal Synthesizer and Audio Signal Encoder |
TW201140563A (en) | 2009-10-23 | 2011-11-16 | Qualcomm Inc | Determining an upperband signal from a narrowband signal |
US20110295598A1 (en) * | 2010-06-01 | 2011-12-01 | Qualcomm Incorporated | Systems, methods, apparatus, and computer program products for wideband speech coding |
US20120002818A1 (en) * | 2009-03-17 | 2012-01-05 | Dolby International Ab | Advanced Stereo Coding Based on a Combination of Adaptively Selectable Left/Right or Mid/Side Stereo Coding and of Parametric Stereo Coding |
CN102714035A (zh) | 2009-10-16 | 2012-10-03 | 弗兰霍菲尔运输应用研究公司 | 用以利用平均值而基于下混信号表示形态和与下混信号表示形态相关联的参数侧边信息来提供用于提供上混信号表示形态的一或多个经调整参数的装置、方法与计算机程序 |
US20130101032A1 (en) * | 2010-04-26 | 2013-04-25 | Panasonic Corporation | Filtering mode for intra prediction inferred from statistics of surrounding blocks |
US20130121411A1 (en) * | 2010-04-13 | 2013-05-16 | Fraunhofer-Gesellschaft Zur Foerderug der angewandten Forschung e.V. | Audio or video encoder, audio or video decoder and related methods for processing multi-channel audio or video signals using a variable prediction direction |
US20130170391A1 (en) * | 2010-09-16 | 2013-07-04 | Deutsche Telekom Ag | Method of and system for measuring quality of audio and video bit stream transmissions over a transmission chain |
US8929558B2 (en) * | 2009-09-10 | 2015-01-06 | Dolby International Ab | Audio signal of an FM stereo radio receiver by using parametric stereo |
US9094754B2 (en) * | 2010-08-24 | 2015-07-28 | Dolby International Ab | Reduction of spurious uncorrelation in FM radio noise |
US9191045B2 (en) * | 2011-09-29 | 2015-11-17 | Dolby International Ab | Prediction-based FM stereo radio noise reduction |
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6226616B1 (en) * | 1999-06-21 | 2001-05-01 | Digital Theater Systems, Inc. | Sound quality of established low bit-rate audio coding systems without loss of decoder compatibility |
US7603267B2 (en) * | 2003-05-01 | 2009-10-13 | Microsoft Corporation | Rules-based grammar for slots and statistical model for preterminals in natural language understanding system |
US7447317B2 (en) * | 2003-10-02 | 2008-11-04 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V | Compatible multi-channel coding/decoding by weighting the downmix channel |
CA2457988A1 (en) * | 2004-02-18 | 2005-08-18 | Voiceage Corporation | Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization |
JP4459267B2 (ja) * | 2005-02-28 | 2010-04-28 | パイオニア株式会社 | 辞書データ生成装置及び電子機器 |
KR20070003574A (ko) * | 2005-06-30 | 2007-01-05 | 엘지전자 주식회사 | 오디오 신호 인코딩 및 디코딩 방법 및 장치 |
DE102005032724B4 (de) * | 2005-07-13 | 2009-10-08 | Siemens Ag | Verfahren und Vorrichtung zur künstlichen Erweiterung der Bandbreite von Sprachsignalen |
US7546237B2 (en) * | 2005-12-23 | 2009-06-09 | Qnx Software Systems (Wavemakers), Inc. | Bandwidth extension of narrowband speech |
RU2393646C1 (ru) * | 2006-03-28 | 2010-06-27 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Усовершенствованный способ для формирования сигнала при восстановлении многоканального аудио |
EP1883067A1 (en) * | 2006-07-24 | 2008-01-30 | Deutsche Thomson-Brandt Gmbh | Method and apparatus for lossless encoding of a source signal, using a lossy encoded data stream and a lossless extension data stream |
CN101140759B (zh) * | 2006-09-08 | 2010-05-12 | 华为技术有限公司 | 语音或音频信号的带宽扩展方法及系统 |
CN101484935B (zh) * | 2006-09-29 | 2013-07-17 | Lg电子株式会社 | 用于编码和解码基于对象的音频信号的方法和装置 |
JP5026092B2 (ja) * | 2007-01-12 | 2012-09-12 | 三菱電機株式会社 | 動画像復号装置および動画像復号方法 |
ATE500588T1 (de) * | 2008-01-04 | 2011-03-15 | Dolby Sweden Ab | Audiokodierer und -dekodierer |
US8442836B2 (en) * | 2008-01-31 | 2013-05-14 | Agency For Science, Technology And Research | Method and device of bitrate distribution/truncation for scalable audio coding |
DE102008009719A1 (de) * | 2008-02-19 | 2009-08-20 | Siemens Enterprise Communications Gmbh & Co. Kg | Verfahren und Mittel zur Enkodierung von Hintergrundrauschinformationen |
WO2009110751A2 (ko) * | 2008-03-04 | 2009-09-11 | Lg Electronics Inc. | 오디오 신호 처리 방법 및 장치 |
CA2871268C (en) * | 2008-07-11 | 2015-11-03 | Nikolaus Rettelbach | Audio encoder, audio decoder, methods for encoding and decoding an audio signal, audio stream and computer program |
EP2410521B1 (en) * | 2008-07-11 | 2017-10-04 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio signal encoder, method for generating an audio signal and computer program |
EP2146344B1 (en) * | 2008-07-17 | 2016-07-06 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoding/decoding scheme having a switchable bypass |
EP2380172B1 (en) * | 2009-01-16 | 2013-07-24 | Dolby International AB | Cross product enhanced harmonic transposition |
ES2400661T3 (es) * | 2009-06-29 | 2013-04-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codificación y decodificación de extensión de ancho de banda |
EP2497272A1 (en) * | 2009-11-04 | 2012-09-12 | Koninklijke Philips Electronics N.V. | Methods and systems for providing a combination of media data and metadata |
CN102081927B (zh) * | 2009-11-27 | 2012-07-18 | 中兴通讯股份有限公司 | 一种可分层音频编码、解码方法及系统 |
WO2011106925A1 (en) * | 2010-03-01 | 2011-09-09 | Nokia Corporation | Method and apparatus for estimating user characteristics based on user interaction data |
CN101959068B (zh) * | 2010-10-12 | 2012-12-19 | 华中科技大学 | 一种视频流解码计算复杂度估计方法 |
-
2014
- 2014-01-28 CN CN201480006567.8A patent/CN105103229B/zh active Active
- 2014-01-28 WO PCT/EP2014/051591 patent/WO2014118155A1/en active Application Filing
- 2014-01-28 CA CA3013766A patent/CA3013766C/en active Active
- 2014-01-28 ES ES14701550T patent/ES2725358T3/es active Active
- 2014-01-28 ES ES17158737T patent/ES2943588T3/es active Active
- 2014-01-28 CA CA3013756A patent/CA3013756C/en active Active
- 2014-01-28 CA CA3013744A patent/CA3013744C/en active Active
- 2014-01-28 BR BR112015018017-5A patent/BR112015018017B1/pt active IP Right Grant
- 2014-01-28 KR KR1020167021785A patent/KR101775084B1/ko active IP Right Grant
- 2014-01-28 MY MYPI2015001889A patent/MY172752A/en unknown
- 2014-01-28 SG SG11201505925SA patent/SG11201505925SA/en unknown
- 2014-01-28 RU RU2015136789A patent/RU2627102C2/ru active
- 2014-01-28 EP EP14701550.7A patent/EP2951828B1/en active Active
- 2014-01-28 JP JP2015554193A patent/JP6096934B2/ja active Active
- 2014-01-28 SG SG10201608643PA patent/SG10201608643PA/en unknown
- 2014-01-28 RU RU2017109526A patent/RU2676870C1/ru active
- 2014-01-28 KR KR1020167021784A patent/KR101775086B1/ko active IP Right Grant
- 2014-01-28 CN CN201811139722.XA patent/CN109346101B/zh active Active
- 2014-01-28 MX MX2015009747A patent/MX345622B/es active IP Right Grant
- 2014-01-28 KR KR1020157022901A patent/KR101798126B1/ko active IP Right Grant
- 2014-01-28 AU AU2014211523A patent/AU2014211523B2/en active Active
- 2014-01-28 EP EP17158737.1A patent/EP3203471B1/en active Active
- 2014-01-28 SG SG10201608613QA patent/SG10201608613QA/en unknown
- 2014-01-28 TR TR2019/06190T patent/TR201906190T4/tr unknown
- 2014-01-28 CA CA2899134A patent/CA2899134C/en active Active
- 2014-01-28 EP EP17158862.7A patent/EP3196878B1/en active Active
- 2014-01-28 RU RU2017109527A patent/RU2676242C1/ru active
- 2014-01-28 ES ES17158862T patent/ES2924427T3/es active Active
- 2014-01-28 CN CN201811139723.4A patent/CN109509483B/zh active Active
- 2014-01-29 TW TW104132428A patent/TWI585755B/zh active
- 2014-01-29 TW TW103103520A patent/TWI524333B/zh active
- 2014-01-29 TW TW104132427A patent/TWI585754B/zh active
- 2014-01-29 AR ARP140100289A patent/AR094673A1/es active IP Right Grant
-
2015
- 2015-07-28 US US14/811,722 patent/US10657979B2/en active Active
- 2015-08-28 ZA ZA2015/06313A patent/ZA201506313B/en unknown
-
2016
- 2016-06-06 HK HK16106404.9A patent/HK1218460A1/zh unknown
- 2016-11-21 AU AU2016262638A patent/AU2016262638B2/en active Active
- 2016-11-21 AU AU2016262636A patent/AU2016262636B2/en active Active
- 2016-12-20 JP JP2016246648A patent/JP6511428B2/ja active Active
- 2016-12-20 JP JP2016246647A patent/JP6513066B2/ja active Active
-
2017
- 2017-08-03 US US15/668,473 patent/US10186274B2/en active Active
- 2017-08-03 US US15/668,375 patent/US10062390B2/en active Active
Patent Citations (43)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0720148A1 (en) | 1994-12-30 | 1996-07-03 | AT&T Corp. | Method for noise weighting filtering |
US20100046762A1 (en) * | 2001-07-10 | 2010-02-25 | Coding Technologies Ab | Efficient and scalable parametric stereo coding for low bitrate audio coding applications |
US20070255572A1 (en) | 2004-08-27 | 2007-11-01 | Shuji Miyasaka | Audio Decoder, Method and Program |
US20080154583A1 (en) * | 2004-08-31 | 2008-06-26 | Matsushita Electric Industrial Co., Ltd. | Stereo Signal Generating Apparatus and Stereo Signal Generating Method |
US20060140412A1 (en) * | 2004-11-02 | 2006-06-29 | Lars Villemoes | Multi parametrisation based multi-channel reconstruction |
US7751572B2 (en) * | 2005-04-15 | 2010-07-06 | Dolby International Ab | Adaptive residual audio coding |
US20070019813A1 (en) * | 2005-07-19 | 2007-01-25 | Johannes Hilpert | Concept for bridging the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding |
US20070094027A1 (en) | 2005-10-21 | 2007-04-26 | Nokia Corporation | Methods and apparatus for implementing embedded scalable encoding and decoding of companded and vector quantized audio data |
US20070208557A1 (en) * | 2006-03-03 | 2007-09-06 | Microsoft Corporation | Perceptual, scalable audio compression |
JP2007328268A (ja) | 2006-06-09 | 2007-12-20 | Kddi Corp | 音楽信号の帯域拡張方式 |
RU2455710C2 (ru) | 2008-01-31 | 2012-07-10 | Фраунхофер-Гезелльшафт цур Фердерунг дер ангевандтен | Устройство и способ расширения полосы пропускания аудио сигнала |
US20110054885A1 (en) | 2008-01-31 | 2011-03-03 | Frederik Nagel | Device and Method for a Bandwidth Extension of an Audio Signal |
US20090282298A1 (en) * | 2008-05-08 | 2009-11-12 | Broadcom Corporation | Bit error management methods for wireless audio communication channels |
TW201009808A (en) | 2008-07-11 | 2010-03-01 | Fraunhofer Ges Forschung | An apparatus and a method for decoding an encoded audio signal |
US8275626B2 (en) | 2008-07-11 | 2012-09-25 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and a method for decoding an encoded audio signal |
RU2011101616A (ru) | 2008-07-11 | 2012-07-27 | Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен Форшунг Е.Ф. (DE) | Синтезатор аудиосигнала и кодирующее устройство аудиосигнала |
JP2011527449A (ja) | 2008-07-11 | 2011-10-27 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | 符号化されたオーディオ信号を復号化するための装置および方法 |
US20110202353A1 (en) * | 2008-07-11 | 2011-08-18 | Max Neuendorf | Apparatus and a Method for Decoding an Encoded Audio Signal |
US20110173006A1 (en) | 2008-07-11 | 2011-07-14 | Frederik Nagel | Audio Signal Synthesizer and Audio Signal Encoder |
CN102089814A (zh) | 2008-07-11 | 2011-06-08 | 弗劳恩霍夫应用研究促进协会 | 对编码的音频信号进行解码的设备和方法 |
US20100080397A1 (en) | 2008-09-26 | 2010-04-01 | Fujitsu Limted | Audio decoding method and apparatus |
WO2010058518A1 (ja) | 2008-11-21 | 2010-05-27 | パナソニック株式会社 | オーディオ再生装置及びオーディオ再生方法 |
JP2010122640A (ja) | 2008-11-21 | 2010-06-03 | Panasonic Corp | オーディオ再生装置及びオーディオ再生方法 |
US20110004479A1 (en) * | 2009-01-28 | 2011-01-06 | Dolby International Ab | Harmonic transposition |
US20120002818A1 (en) * | 2009-03-17 | 2012-01-05 | Dolby International Ab | Advanced Stereo Coding Based on a Combination of Adaptively Selectable Left/Right or Mid/Side Stereo Coding and of Parametric Stereo Coding |
CN102027537A (zh) | 2009-04-02 | 2011-04-20 | 弗劳恩霍夫应用研究促进协会 | 利用谐波带宽扩充及非谐波带宽扩充的组合、基于输入信号表示型态产生扩充带宽信号的表示型态的装置、方法及计算机程序 |
EP2239732A1 (en) | 2009-04-09 | 2010-10-13 | Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. | Apparatus and method for generating a synthesis audio signal and for encoding an audio signal |
CN102177545A (zh) | 2009-04-09 | 2011-09-07 | 弗兰霍菲尔运输应用研究公司 | 用以产生合成音频信号及将音频信号编码的装置与方法 |
WO2010115845A1 (en) | 2009-04-09 | 2010-10-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating a synthesis audio signal and for encoding an audio signal |
US8731950B2 (en) | 2009-04-28 | 2014-05-20 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus for providing one or more adjusted parameters for a provision of an upmix signal representation on the basis of a downmix signal representation, audio signal decoder, audio signal transcoder, audio signal encoder, audio bitstream, method and computer program using an object-related parametric information |
TW201104674A (en) | 2009-04-28 | 2011-02-01 | Fraunhofer Ges Forschung | Apparatus for providing one or more adjusted parameters for a provision of an upmix signal representation on the basis of a downmix signal representation, audio signal decoder, audio signal transcoder, audio signal encoder, audio bitstream, method and co |
US8929558B2 (en) * | 2009-09-10 | 2015-01-06 | Dolby International Ab | Audio signal of an FM stereo radio receiver by using parametric stereo |
CN102714035A (zh) | 2009-10-16 | 2012-10-03 | 弗兰霍菲尔运输应用研究公司 | 用以利用平均值而基于下混信号表示形态和与下混信号表示形态相关联的参数侧边信息来提供用于提供上混信号表示形态的一或多个经调整参数的装置、方法与计算机程序 |
US20120263308A1 (en) | 2009-10-16 | 2012-10-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus, method and computer program for providing one or more adjusted parameters for provision of an upmix signal representation on the basis of a downmix signal representation and a parametric side information associated with the downmix signal representation, using an average value |
WO2011047886A1 (en) | 2009-10-21 | 2011-04-28 | Dolby International Ab | Apparatus and method for generating a high frequency audio signal using adaptive oversampling |
US8484020B2 (en) | 2009-10-23 | 2013-07-09 | Qualcomm Incorporated | Determining an upperband signal from a narrowband signal |
TW201140563A (en) | 2009-10-23 | 2011-11-16 | Qualcomm Inc | Determining an upperband signal from a narrowband signal |
US20130121411A1 (en) * | 2010-04-13 | 2013-05-16 | Fraunhofer-Gesellschaft Zur Foerderug der angewandten Forschung e.V. | Audio or video encoder, audio or video decoder and related methods for processing multi-channel audio or video signals using a variable prediction direction |
US20130101032A1 (en) * | 2010-04-26 | 2013-04-25 | Panasonic Corporation | Filtering mode for intra prediction inferred from statistics of surrounding blocks |
US20110295598A1 (en) * | 2010-06-01 | 2011-12-01 | Qualcomm Incorporated | Systems, methods, apparatus, and computer program products for wideband speech coding |
US9094754B2 (en) * | 2010-08-24 | 2015-07-28 | Dolby International Ab | Reduction of spurious uncorrelation in FM radio noise |
US20130170391A1 (en) * | 2010-09-16 | 2013-07-04 | Deutsche Telekom Ag | Method of and system for measuring quality of audio and video bit stream transmissions over a transmission chain |
US9191045B2 (en) * | 2011-09-29 | 2015-11-17 | Dolby International Ab | Prediction-based FM stereo radio noise reduction |
Non-Patent Citations (16)
Title |
---|
Bauer, P. et al., "A Statistical Framework for Artificial Bandwidth Extension Exploiting Speech Waveform and Phonetic Transcription", retrieved online on Apr. 2, 2014 from url: http://www.researchgate.net/publication/228336475_A_Statistical_Framework_for_Artificail_Bandwidth_Extension_Exploiting_Speech_Waveform_and_Phonetic_Transcription/file/e0b495225068409423.pdf, Jan. 2009, 6 pages. |
Bessette, Bruno et al., "The Adaptive Multirate Wideband Speech Codec (AMR-WB)", IEEE Transactions on Speech and Audio Processing, vol. 10, No. 8, Nov. 8, 2002, pp. 620-636. |
Geiser, B et al., "Bandwidth Extension for Hierarchical Speech and Audio Coding in ITU-T Rec. G.729.1", IEEE Transactions on Audio, Speech and Language Processing, IEEE Service Center, vol. 15, No. 8, Nov. 2007, pp. 2496-2509. |
Geiser, B. et al., "Robust Wideband Enhancement of Speech by Combined Coding and Artificial Bandwidth Extension", Proceedings of IWAENC; Eindhoven, Netherlands, Sep. 15, 2005, pp. 21-24. |
Iser, Bernd et al., "Bandwidth Extension of Speech Signals", Springer Science + Business Media, LLC, 2008, pp. 53-66. |
Jari, Makinen et al., "A MR-WB+: A New Audio Coding Standard for 3RD Generation Mobile Audio Services", Multimedia Technologies Lasboratory Nokia Research Center, Finland. VoiceAge Corp., Montreal, Qc, Canada. University of Sherbrooke, Qc, Canada. Multimedia Technologies, Ericsson Research, Sweden., Mar. 2005, pp. II-1109-1112. |
Jelinek, Milan , "Wideband Speech Coding Advances in VMR-WB Standard", IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, No. 4,, May 4, 2007, pp. 1167-1179. |
Katsir, I et al., "Speech Bandwidth Extension Based on Speech Phonetic Content and Speaker Vocal Tract Shape Estimation", in Proc. EUSIPCO 2011, Barcelona, Spain, Aug. 29-Sep. 2, 2011, pp. 461-465. |
Larsen, Erik et al., "Audio Bandwidth Extension", Application of Psychoacoustics, Signal Processing and Loudspeaker Design, Joh Wiley& Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex PO19 8SQ, 2004, pp. 171-236. |
Miao, Lei , "G711.1 Annex D and G.722 Annex B-New ITU-T Superwideband Codecs", In the proceedings of ICASSP; Prague, Czech Republic, May 2011, pp. 5232-5235. |
Miao, Lei , "G711.1 Annex D and G.722 Annex B—New ITU-T Superwideband Codecs", In the proceedings of ICASSP; Prague, Czech Republic, May 2011, pp. 5232-5235. |
Neuendorf, M et al., "MPEG Unified Speech and Audio Coding-The ISO/MPEG Standard for High-Efficiency Audio Coding of all Content Types", Audio Engineering Society Convention Paper 8654, Presented at the 132nd Convention, Apr. 26-29, 2012, pp. 1-22. |
Neuendorf, M et al., "MPEG Unified Speech and Audio Coding—The ISO/MPEG Standard for High-Efficiency Audio Coding of all Content Types", Audio Engineering Society Convention Paper 8654, Presented at the 132nd Convention, Apr. 26-29, 2012, pp. 1-22. |
Pulakka, Hannu et al., "Bandwith Extension of Telephone Speech Using a Neutral Network and a Filter Bank Implementation for Highband Mel Spectrum", IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, No. 7, Sep. 7, 2011, pp. 2170-2183. |
Sanna, M. et al., "A codebook design method for fricative enhancement in Artificial Bandwidth Extension", Proceedings of the 5th Int'l Mobile Multimedia Communications Conference, London, UK, Sep. 9, 2009, 7 pages. |
Vaillancourt, T et al., "ITU-T EV-VBR: A Robust 8-32 kbit/s Scalable Coder for Error Prone Telecommunications Channels", in Proc. EUSIPCO 2008, Lausanne, Switzerland, Aug. 2008, 5 pages. |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20230016637A1 (en) * | 2021-07-07 | 2023-01-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and Method for End-to-End Adversarial Blind Bandwidth Extension with one or more Convolutional and/or Recurrent Networks |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10062390B2 (en) | Decoder for generating a frequency enhanced audio signal, method of decoding, encoder for generating an encoded signal and method of encoding using compact selection side information |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V., GERMANY Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NAGEL, FREDERIK;DISCH, SASCHA;NIEDERMEIER, ANDREAS;SIGNING DATES FROM 20150907 TO 20150928;REEL/FRAME:039426/0768 Owner name: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NAGEL, FREDERIK;DISCH, SASCHA;NIEDERMEIER, ANDREAS;SIGNING DATES FROM 20150907 TO 20150928;REEL/FRAME:039426/0768 |
|
STCV | Information on status: appeal procedure |
Free format text: ON APPEAL -- AWAITING DECISION BY THE BOARD OF APPEALS |
|
STCV | Information on status: appeal procedure |
Free format text: BOARD OF APPEALS DECISION RENDERED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |