TW531986B - Quantization in perceptual audio coders with compensation for synthesis filter noise spreading - Google Patents

Quantization in perceptual audio coders with compensation for synthesis filter noise spreading Download PDF

Info

Publication number
TW531986B
TW531986B TW089106700A TW89106700A TW531986B TW 531986 B TW531986 B TW 531986B TW 089106700 A TW089106700 A TW 089106700A TW 89106700 A TW89106700 A TW 89106700A TW 531986 B TW531986 B TW 531986B
Authority
TW
Taiwan
Prior art keywords
noise
sub
band
signal
quantized
Prior art date
Application number
TW089106700A
Other languages
English (en)
Chinese (zh)
Inventor
Anil Wamanrao Ubale
Grant Allen Davidson
Original Assignee
Dolby Lab Licensing Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby Lab Licensing Corp filed Critical Dolby Lab Licensing Corp
Application granted granted Critical
Publication of TW531986B publication Critical patent/TW531986B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002Dynamic bit allocation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
TW089106700A 1999-04-12 2000-04-11 Quantization in perceptual audio coders with compensation for synthesis filter noise spreading TW531986B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US09/289,865 US6363338B1 (en) 1999-04-12 1999-04-12 Quantization in perceptual audio coders with compensation for synthesis filter noise spreading

Publications (1)

Publication Number Publication Date
TW531986B true TW531986B (en) 2003-05-11

Family

ID=23113455

Family Applications (1)

Application Number Title Priority Date Filing Date
TW089106700A TW531986B (en) 1999-04-12 2000-04-11 Quantization in perceptual audio coders with compensation for synthesis filter noise spreading

Country Status (13)

Country Link
US (1) US6363338B1 (https=)
EP (1) EP1177639B1 (https=)
JP (1) JP4643019B2 (https=)
KR (1) KR100758215B1 (https=)
AR (1) AR024858A1 (https=)
AT (1) ATE248463T1 (https=)
AU (1) AU771869B2 (https=)
CA (1) CA2366560C (https=)
DE (1) DE60004814T2 (https=)
HK (1) HK1044235B (https=)
MY (1) MY120387A (https=)
TW (1) TW531986B (https=)
WO (1) WO2000062434A1 (https=)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7751572B2 (en) 2005-04-15 2010-07-06 Dolby International Ab Adaptive residual audio coding

Families Citing this family (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19947877C2 (de) * 1999-10-05 2001-09-13 Fraunhofer Ges Forschung Verfahren und Vorrichtung zum Einbringen von Informationen in einen Datenstrom sowie Verfahren und Vorrichtung zum Codieren eines Audiosignals
US7720651B2 (en) * 2000-09-29 2010-05-18 Canning Francis X Compression of interaction data using directional sources and/or testers
US7734448B2 (en) * 2000-01-10 2010-06-08 Canning Francis X Sparse and efficient block factorization for interaction data
TW499672B (en) * 2000-02-18 2002-08-21 Intervideo Inc Fast convergence method for bit allocation stage of MPEG audio layer 3 encoders
US7050924B2 (en) * 2000-06-12 2006-05-23 British Telecommunications Public Limited Company Test signalling
US7945430B2 (en) 2000-09-29 2011-05-17 Canning Francis X Compression and compressed inversion of interaction data
US7031955B1 (en) * 2001-04-27 2006-04-18 I2 Technologies Us, Inc. Optimization using a multi-dimensional data model
SE0202159D0 (sv) 2001-07-10 2002-07-09 Coding Technologies Sweden Ab Efficientand scalable parametric stereo coding for low bitrate applications
US6987889B1 (en) 2001-08-10 2006-01-17 Polycom, Inc. System and method for dynamic perceptual coding of macroblocks in a video frame
US6732071B2 (en) * 2001-09-27 2004-05-04 Intel Corporation Method, apparatus, and system for efficient rate control in audio encoding
PT1423847E (pt) 2001-11-29 2005-05-31 Coding Tech Ab Reconstrucao de componentes de frequencia elevada
US7447631B2 (en) * 2002-06-17 2008-11-04 Dolby Laboratories Licensing Corporation Audio coding system using spectral hole filling
US20040030555A1 (en) * 2002-08-12 2004-02-12 Oregon Health & Science University System and method for concatenating acoustic contours for speech synthesis
SE0202770D0 (sv) 2002-09-18 2002-09-18 Coding Technologies Sweden Ab Method for reduction of aliasing introduces by spectral envelope adjustment in real-valued filterbanks
US7376553B2 (en) * 2003-07-08 2008-05-20 Robert Patel Quinn Fractal harmonic overtone mapping of speech and musical sounds
CN1839426A (zh) * 2003-09-17 2006-09-27 北京阜国数字技术有限公司 多分辨率矢量量化的音频编解码方法及装置
US7539614B2 (en) * 2003-11-14 2009-05-26 Nxp B.V. System and method for audio signal processing using different gain factors for voiced and unvoiced phonemes
JP2007520748A (ja) * 2004-01-28 2007-07-26 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 複素値データを用いたオーディオ信号の復号
DE102004009955B3 (de) * 2004-03-01 2005-08-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Ermitteln einer Quantisierer-Schrittweite
US7512536B2 (en) * 2004-05-14 2009-03-31 Texas Instruments Incorporated Efficient filter bank computation for audio coding
US7903137B2 (en) * 2004-10-15 2011-03-08 Lifesize Communications, Inc. Videoconferencing echo cancellers
US20060132595A1 (en) * 2004-10-15 2006-06-22 Kenoyer Michael L Speakerphone supporting video and audio features
US7970151B2 (en) * 2004-10-15 2011-06-28 Lifesize Communications, Inc. Hybrid beamforming
US7760887B2 (en) * 2004-10-15 2010-07-20 Lifesize Communications, Inc. Updating modeling information based on online data gathering
US7720236B2 (en) * 2004-10-15 2010-05-18 Lifesize Communications, Inc. Updating modeling information based on offline calibration experiments
US7720232B2 (en) * 2004-10-15 2010-05-18 Lifesize Communications, Inc. Speakerphone
US8116500B2 (en) * 2004-10-15 2012-02-14 Lifesize Communications, Inc. Microphone orientation and size in a speakerphone
US7826624B2 (en) * 2004-10-15 2010-11-02 Lifesize Communications, Inc. Speakerphone self calibration and beam forming
US7593539B2 (en) * 2005-04-29 2009-09-22 Lifesize Communications, Inc. Microphone and speaker arrangement in speakerphone
US7970150B2 (en) * 2005-04-29 2011-06-28 Lifesize Communications, Inc. Tracking talkers using virtual broadside scan and directed beams
US7991167B2 (en) * 2005-04-29 2011-08-02 Lifesize Communications, Inc. Forming beams with nulls directed at noise sources
US7974713B2 (en) * 2005-10-12 2011-07-05 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Temporal and spatial shaping of multi-channel audio signals
US7835904B2 (en) * 2006-03-03 2010-11-16 Microsoft Corp. Perceptual, scalable audio compression
KR101393298B1 (ko) * 2006-07-08 2014-05-12 삼성전자주식회사 적응적 부호화/복호화 방법 및 장치
WO2008021247A2 (en) * 2006-08-15 2008-02-21 Dolby Laboratories Licensing Corporation Arbitrary shaping of temporal noise envelope without side-information
FR2912249A1 (fr) * 2007-02-02 2008-08-08 France Telecom Codage/decodage perfectionnes de signaux audionumeriques.
EP2274833B1 (en) * 2008-04-16 2016-08-10 Huawei Technologies Co., Ltd. Vector quantisation method
US20100106269A1 (en) * 2008-09-26 2010-04-29 Qualcomm Incorporated Method and apparatus for signal processing using transform-domain log-companding
TWI591625B (zh) 2009-05-27 2017-07-11 杜比國際公司 從訊號的低頻成份產生該訊號之高頻成份的系統與方法,及其機上盒、電腦程式產品、軟體程式及儲存媒體
US11657788B2 (en) 2009-05-27 2023-05-23 Dolby International Ab Efficient combined harmonic transposition
KR101599884B1 (ko) * 2009-08-18 2016-03-04 삼성전자주식회사 멀티 채널 오디오 디코딩 방법 및 장치
WO2011044700A1 (en) * 2009-10-15 2011-04-21 Voiceage Corporation Simultaneous time-domain and frequency-domain noise shaping for tdac transforms
ES3051141T3 (en) 2009-10-21 2025-12-26 Dolby Int Ab Oversampling in a combined transposer filter bank
US8958510B1 (en) * 2010-06-10 2015-02-17 Fredric J. Harris Selectable bandwidth filter
US8924222B2 (en) * 2010-07-30 2014-12-30 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for coding of harmonic signals
US9208792B2 (en) 2010-08-17 2015-12-08 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for noise injection
US9225310B1 (en) * 2012-11-08 2015-12-29 iZotope, Inc. Audio limiter system and method
US10325584B2 (en) 2014-12-10 2019-06-18 Stmicroelectronics S.R.L. Active noise cancelling device and method of actively cancelling acoustic noise
KR102632136B1 (ko) 2017-04-28 2024-01-31 디티에스, 인코포레이티드 오디오 코더 윈도우 사이즈 및 시간-주파수 변환
US10886943B2 (en) * 2019-03-18 2021-01-05 Samsung Electronics Co., Ltd Method and apparatus for variable rate compression with a conditional autoencoder
CN115171709B (zh) * 2022-09-05 2022-11-18 腾讯科技(深圳)有限公司 语音编码、解码方法、装置、计算机设备和存储介质

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4956871A (en) * 1988-09-30 1990-09-11 At&T Bell Laboratories Improving sub-band coding of speech at low bit rates by adding residual speech energy signals to sub-bands
US5222189A (en) * 1989-01-27 1993-06-22 Dolby Laboratories Licensing Corporation Low time-delay transform coder, decoder, and encoder/decoder for high-quality audio
JP2906646B2 (ja) * 1990-11-09 1999-06-21 松下電器産業株式会社 音声帯域分割符号化装置
EP0559348A3 (en) * 1992-03-02 1993-11-03 AT&T Corp. Rate control loop processor for perceptual encoder/decoder
JP3297050B2 (ja) * 1993-07-16 2002-07-02 ドルビー・ラボラトリーズ・ライセンシング・コーポレーション デコーダスペクトル歪み対応電算式適応ビット配分符号化方法及び装置
US5623577A (en) * 1993-07-16 1997-04-22 Dolby Laboratories Licensing Corporation Computationally efficient adaptive bit allocation for encoding method and apparatus with allowance for decoder spectral distortions
EP0722225A3 (de) * 1994-11-17 2000-06-07 Deutsche Thomson-Brandt Gmbh Audiosignalkodierung mittels Kurzzeitspektren und einem psychoakustischen Modell
JP2820117B2 (ja) * 1996-05-29 1998-11-05 日本電気株式会社 音声符号化装置
US5913191A (en) * 1997-10-17 1999-06-15 Dolby Laboratories Licensing Corporation Frame-based audio coding with additional filterbank to suppress aliasing artifacts at frame boundaries

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7751572B2 (en) 2005-04-15 2010-07-06 Dolby International Ab Adaptive residual audio coding

Also Published As

Publication number Publication date
HK1044235A1 (en) 2002-10-11
EP1177639A1 (en) 2002-02-06
KR100758215B1 (ko) 2007-09-12
CA2366560C (en) 2008-07-29
JP4643019B2 (ja) 2011-03-02
HK1044235B (en) 2003-12-24
EP1177639B1 (en) 2003-08-27
AU4338200A (en) 2000-11-14
MY120387A (en) 2005-10-31
US6363338B1 (en) 2002-03-26
AR024858A1 (es) 2002-10-30
ATE248463T1 (de) 2003-09-15
JP2002542648A (ja) 2002-12-10
AU771869B2 (en) 2004-04-01
DE60004814D1 (de) 2003-10-02
CA2366560A1 (en) 2000-10-19
DE60004814T2 (de) 2004-07-01
KR20010112423A (ko) 2001-12-20
WO2000062434A1 (en) 2000-10-19

Similar Documents

Publication Publication Date Title
TW531986B (en) Quantization in perceptual audio coders with compensation for synthesis filter noise spreading
Shlien Guide to MPEG-1 audio standard
CN103069484B (zh) 时/频二维后处理
CN1328707C (zh) 音频解码设备以及解码方法
TWI352969B (en) Method and apparatus for generating audio informat
TWI321315B (en) Methods of generating a highband excitation signal and apparatus for anti-sparseness filtering
EP1850327B1 (en) Adaptive rate control algorithm for low complexity AAC encoding
ES2278338T3 (es) Dispositivo y procedimiento para procesar una señal.
US8359194B2 (en) Device and method for graduated encoding of a multichannel audio signal based on a principal component analysis
CN101120615B (zh) 多声道编码器和解码器以及相应的编码和解码方法
TWI352973B (en) Conversion of synthesized spectral components for
CN102084418B (zh) 用于调整多通道音频信号的空间线索信息的设备和方法
CN101521014B (zh) 音频带宽扩展编解码装置
CN101494054B (zh) 一种音频码率控制方法及系统
US9443534B2 (en) Bandwidth extension system and approach
US20160140973A1 (en) Apparatus and method for decoding and encoding an audio signal using adaptive spectral tile selection
CN105225669B (zh) 音频编码中的后量化增益校正
US20200068330A1 (en) Method and device for applying dynamic range compression to a higher order ambisonics signal
US11741974B2 (en) Encoding and decoding methods, and encoding and decoding apparatuses for stereo signal
ES2273268T3 (es) Dispositivo y procedimiento para convertir en una representacion transformada o para convertir de manera inversa la representacion transformada.
CN116741188A (zh) 立体声音频编码器和解码器
WO2007111646A2 (en) Speech post-processing using mdct coefficients
HK1002743B (en) Hybrid perceptual audio coding
CN101847413B (zh) 一种使用新型心理声学模型和快速比特分配实现数字音频编码的方法
TR201907767T4 (tr) Uyarlamalı kazanç-form hızı paylaşımı.

Legal Events

Date Code Title Description
GD4A Issue of patent certificate for granted invention patent
MK4A Expiration of patent term of an invention patent