WO2004061823A1 - Reducing scale factor transmission cost for mpeg-2 aac using a lattice - Google Patents

Reducing scale factor transmission cost for mpeg-2 aac using a lattice Download PDF

Info

Publication number
WO2004061823A1
WO2004061823A1 PCT/US2003/040173 US0340173W WO2004061823A1 WO 2004061823 A1 WO2004061823 A1 WO 2004061823A1 US 0340173 W US0340173 W US 0340173W WO 2004061823 A1 WO2004061823 A1 WO 2004061823A1
Authority
WO
WIPO (PCT)
Prior art keywords
scale factor
band
frequency bands
scale
values
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2003/040173
Other languages
English (en)
French (fr)
Inventor
Mark Stuart Vinton
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby Laboratories Licensing Corp
Original Assignee
Dolby Laboratories Licensing Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to DK03808458T priority Critical patent/DK1581928T3/da
Priority to AU2003303495A priority patent/AU2003303495B2/en
Priority to HK05111135.8A priority patent/HK1079327B/en
Priority to CA2507535A priority patent/CA2507535C/en
Priority to JP2004565543A priority patent/JP4425148B2/ja
Priority to MXPA05007183A priority patent/MXPA05007183A/es
Priority to DE60324465T priority patent/DE60324465D1/de
Priority to EP03808458A priority patent/EP1581928B1/en
Priority to CN2003801081720A priority patent/CN1735925B/zh
Application filed by Dolby Laboratories Licensing Corp filed Critical Dolby Laboratories Licensing Corp
Publication of WO2004061823A1 publication Critical patent/WO2004061823A1/en
Priority to IL168636A priority patent/IL168636A/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/035Scalar quantisation

Definitions

  • the present invention is directed to a method for reducing the total bit cost of a perceptual audio encoder employing adaptive bit allocation in which a time domain representation of an audio signal is divided into successive time blocks, each time block is divided into frequency bands, and a scale factor is assigned to each of ones of the frequency bands, wherein the number of bits required to represent each block increases with increases in the scale factor values and with increases in band-to-band variations in scale factor values.
  • a ⁇ parameter which is a function of the value of the scale factor, is optional (if omitted, it is replaced by a constant value equal to 1) but greatly improves the performance of the algorithm if it is estimated accurately.
  • a i is assumed to be constant if the scale factors are only modified slightly from their preliminary value. For simplicity, this may be achieved by counting the number of MDCT coefficients in a band that has an absolute value greater than some predefined threshold.
  • Block 144 identifies the new, adjusted scale factor value for each backwardly successive scale factor band as i is decremented from N-1 to zero.
  • FIG. 3 shows the effect of applying the scale factor optimization of the present invention to the preliminary scale factors derived by means of the direct estimation technique for a single AAC audio frame.
  • the circles plotted in FIG. 3 represent the unadjusted scale factors; while the plus plotted points represent the adjusted scale factors according to an application of the present invention.
  • the scale factor optimization technique according to the present invention greatly reduces the variation in the scale factors. Also the adjusted scale factors are always increased, not just saving bits overall but decreasing the quantization noise not only in the bands in which the scale factors are increased, but also in other bands as a result of overall bit savings (thus allowing more bits to be allocated to other bands). The bit savings achieved by this technique are shown in FIG.

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Compression Of Band Width Or Redundancy In Fax (AREA)
  • Analysing Materials By The Use Of Radiation (AREA)
  • Crystals, And After-Treatments Of Crystals (AREA)
  • Control Of Indicators Other Than Cathode Ray Tubes (AREA)
  • Peptides Or Proteins (AREA)
PCT/US2003/040173 2003-01-02 2003-12-16 Reducing scale factor transmission cost for mpeg-2 aac using a lattice Ceased WO2004061823A1 (en)

Priority Applications (10)

Application Number Priority Date Filing Date Title
EP03808458A EP1581928B1 (en) 2003-01-02 2003-12-16 Reducing scale factor transmission cost for mpeg-2 aac using a lattice
AU2003303495A AU2003303495B2 (en) 2003-01-02 2003-12-16 Reducing scale factor transmission cost for MPEG-2 AAC using a lattice
HK05111135.8A HK1079327B (en) 2003-01-02 2003-12-16 Reducing scale factor transmission cost for mpeg-2 aac using a lattice
CA2507535A CA2507535C (en) 2003-01-02 2003-12-16 Reducing scale factor transmission cost for mpeg-2 advanced audio coding (aac) using a lattice based post processing technique
JP2004565543A JP4425148B2 (ja) 2003-01-02 2003-12-16 格子基ポスト処理技術を用いるmpeg−2アドバンスドオーディオコーディング(aac)のためのスケール因子伝達コスト低減
DK03808458T DK1581928T3 (da) 2003-01-02 2003-12-16 Reduktion af skalafaktor transmissionsomkostninger for en MPEG-2 AAC under anvendelse af et gitter
DE60324465T DE60324465D1 (de) 2003-01-02 2003-12-16 Verringerung von skalierungsfaktor-übertragungskosten für mpeg-2-aac unter verwendung eines gitters
MXPA05007183A MXPA05007183A (es) 2003-01-02 2003-12-16 Reduccion del costo de transmision de factores de escala para codificacion de audio avanzada mpeg-2 usando una celosia.
CN2003801081720A CN1735925B (zh) 2003-01-02 2003-12-16 使用网格降低mpeg-2高级音频编码的比例因子传输成本
IL168636A IL168636A (en) 2003-01-02 2005-05-17 Reducing scale factor transmission cost for mpeg-2 aac using a lattice

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/336,637 2003-01-02
US10/336,637 US7272566B2 (en) 2003-01-02 2003-01-02 Reducing scale factor transmission cost for MPEG-2 advanced audio coding (AAC) using a lattice based post processing technique

Publications (1)

Publication Number Publication Date
WO2004061823A1 true WO2004061823A1 (en) 2004-07-22

Family

ID=32681060

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/040173 Ceased WO2004061823A1 (en) 2003-01-02 2003-12-16 Reducing scale factor transmission cost for mpeg-2 aac using a lattice

Country Status (17)

Country Link
US (1) US7272566B2 (https=)
EP (1) EP1581928B1 (https=)
JP (1) JP4425148B2 (https=)
KR (1) KR101045520B1 (https=)
CN (1) CN1735925B (https=)
AT (1) ATE412960T1 (https=)
AU (1) AU2003303495B2 (https=)
CA (1) CA2507535C (https=)
DE (1) DE60324465D1 (https=)
DK (1) DK1581928T3 (https=)
ES (1) ES2312852T3 (https=)
IL (1) IL168636A (https=)
MX (1) MXPA05007183A (https=)
MY (1) MY138588A (https=)
PL (1) PL208346B1 (https=)
TW (1) TWI335145B (https=)
WO (1) WO2004061823A1 (https=)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8255212B2 (en) 2006-07-04 2012-08-28 Dolby International Ab Filter compressor and method for manufacturing compressed subband filter impulse responses

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005292702A (ja) * 2004-04-05 2005-10-20 Kddi Corp オーディオフレームに対するフェードイン/フェードアウト処理装置及びプログラム
US8543390B2 (en) 2004-10-26 2013-09-24 Qnx Software Systems Limited Multi-channel periodic signal enhancement system
US7716046B2 (en) * 2004-10-26 2010-05-11 Qnx Software Systems (Wavemakers), Inc. Advanced periodic signal enhancement
US8306821B2 (en) 2004-10-26 2012-11-06 Qnx Software Systems Limited Sub-band periodic signal enhancement system
US8170879B2 (en) * 2004-10-26 2012-05-01 Qnx Software Systems Limited Periodic signal enhancement system
US7610196B2 (en) * 2004-10-26 2009-10-27 Qnx Software Systems (Wavemakers), Inc. Periodic signal enhancement system
US7949520B2 (en) * 2004-10-26 2011-05-24 QNX Software Sytems Co. Adaptive filter pitch extraction
US7680652B2 (en) * 2004-10-26 2010-03-16 Qnx Software Systems (Wavemakers), Inc. Periodic signal enhancement system
KR100707173B1 (ko) * 2004-12-21 2007-04-13 삼성전자주식회사 저비트율 부호화/복호화방법 및 장치
US7590523B2 (en) * 2006-03-20 2009-09-15 Mindspeed Technologies, Inc. Speech post-processing using MDCT coefficients
US8032371B2 (en) * 2006-07-28 2011-10-04 Apple Inc. Determining scale factor values in encoding audio data with AAC
US8010370B2 (en) * 2006-07-28 2011-08-30 Apple Inc. Bitrate control for perceptual coding
CN101308659B (zh) * 2007-05-16 2011-11-30 中兴通讯股份有限公司 一种基于先进音频编码器的心理声学模型的处理方法
US8788264B2 (en) * 2007-06-27 2014-07-22 Nec Corporation Audio encoding method, audio decoding method, audio encoding device, audio decoding device, program, and audio encoding/decoding system
EP2186087B1 (en) * 2007-08-27 2011-11-30 Telefonaktiebolaget L M Ericsson (PUBL) Improved transform coding of speech and audio signals
US8850154B2 (en) 2007-09-11 2014-09-30 2236008 Ontario Inc. Processing system having memory partitioning
US8904400B2 (en) 2007-09-11 2014-12-02 2236008 Ontario Inc. Processing system having a partitioning component for resource partitioning
US8694310B2 (en) 2007-09-17 2014-04-08 Qnx Software Systems Limited Remote control server protocol system
CN101854175B (zh) * 2007-10-12 2013-04-17 联咏科技股份有限公司 可降低信号功率频谱密度的编码方法
GB2454190A (en) * 2007-10-30 2009-05-06 Cambridge Silicon Radio Ltd Minimising a cost function in encoding data using spectral partitioning
US8209514B2 (en) 2008-02-04 2012-06-26 Qnx Software Systems Limited Media processing system having resource partitioning
EP2274833B1 (en) * 2008-04-16 2016-08-10 Huawei Technologies Co., Ltd. Vector quantisation method
US8290782B2 (en) * 2008-07-24 2012-10-16 Dts, Inc. Compression of audio scale-factors by two-dimensional transformation
JP5304504B2 (ja) * 2009-07-17 2013-10-02 ソニー株式会社 信号符号化装置、信号復号装置、信号処理システム、これらにおける処理方法およびプログラム
EP2346031B1 (en) * 2009-11-26 2015-09-30 BlackBerry Limited Rate-distortion optimization for advanced audio coding
US8380524B2 (en) * 2009-11-26 2013-02-19 Research In Motion Limited Rate-distortion optimization for advanced audio coding
MX2013013261A (es) * 2011-05-13 2014-02-20 Samsung Electronics Co Ltd Asignacion de bits, codificacion y decodificacion de audio.
US9293146B2 (en) * 2012-09-04 2016-03-22 Apple Inc. Intensity stereo coding in advanced audio coding
US20140344159A1 (en) * 2013-05-20 2014-11-20 Dell Products, Lp License Key Generation
EP2830058A1 (en) 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Frequency-domain audio coding supporting transform length switching
TWI557726B (zh) * 2013-08-29 2016-11-11 杜比國際公司 用於決定音頻信號的高頻帶信號的主比例因子頻帶表之系統和方法
US10354668B2 (en) 2017-03-22 2019-07-16 Immersion Networks, Inc. System and method for processing audio data
CN110426569B (zh) * 2019-07-12 2021-09-21 国网上海市电力公司 一种变压器声信号降噪处理方法
US20220156982A1 (en) * 2020-11-19 2022-05-19 Nvidia Corporation Calculating data compression parameters

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5581653A (en) * 1993-08-31 1996-12-03 Dolby Laboratories Licensing Corporation Low bit-rate high-resolution spectral envelope coding for audio encoder and decoder

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5956674A (en) * 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
US5845249A (en) * 1996-05-03 1998-12-01 Lsi Logic Corporation Microarchitecture of audio core for an MPEG-2 and AC-3 decoder
US6430533B1 (en) * 1996-05-03 2002-08-06 Lsi Logic Corporation Audio decoder core MPEG-1/MPEG-2/AC-3 functional algorithm partitioning and implementation
US6226616B1 (en) * 1999-06-21 2001-05-01 Digital Theater Systems, Inc. Sound quality of established low bit-rate audio coding systems without loss of decoder compatibility
FR2822122B1 (fr) * 2001-03-14 2003-05-23 Nacam Assemblage d'un etrier de colonne de direction avec un pignon de direction d'un vehicule automobile
US6934677B2 (en) * 2001-12-14 2005-08-23 Microsoft Corporation Quantization matrices based on critical band pattern information for digital audio wherein quantization bands differ from critical bands
US7027982B2 (en) * 2001-12-14 2006-04-11 Microsoft Corporation Quality and rate control strategy for digital audio

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5581653A (en) * 1993-08-31 1996-12-03 Dolby Laboratories Licensing Corporation Low bit-rate high-resolution spectral envelope coding for audio encoder and decoder

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
AGGARWAL A ET AL: "Trellis-based optimization of MPEG-4 advanced audio coding", IEEE WORKSHOP ON SPEECH CODING. PROCEEDINGS. MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 17 September 2000 (2000-09-17), pages 142 - 144, XP010520069 *
BOSI M ET AL: "ISO/IEC MPEG-2 ADVANCED AUDIO CODING", JOURNAL OF THE AUDIO ENGINEERING SOCIETY, AUDIO ENGINEERING SOCIETY. NEW YORK, US, vol. 45, no. 10, 1 October 1997 (1997-10-01), pages 789 - 812, XP000730161, ISSN: 0004-7554 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8255212B2 (en) 2006-07-04 2012-08-28 Dolby International Ab Filter compressor and method for manufacturing compressed subband filter impulse responses

Also Published As

Publication number Publication date
AU2003303495A1 (en) 2004-07-29
JP2006512617A (ja) 2006-04-13
ES2312852T3 (es) 2009-03-01
KR101045520B1 (ko) 2011-06-30
EP1581928A1 (en) 2005-10-05
ATE412960T1 (de) 2008-11-15
MXPA05007183A (es) 2005-09-12
CN1735925B (zh) 2010-04-28
CA2507535A1 (en) 2004-07-22
CA2507535C (en) 2013-02-12
CN1735925A (zh) 2006-02-15
DE60324465D1 (de) 2008-12-11
JP4425148B2 (ja) 2010-03-03
EP1581928B1 (en) 2008-10-29
TWI335145B (en) 2010-12-21
US7272566B2 (en) 2007-09-18
PL208346B1 (pl) 2011-04-29
PL377709A1 (pl) 2006-02-06
KR20050089870A (ko) 2005-09-08
AU2003303495B2 (en) 2009-02-19
DK1581928T3 (da) 2009-01-19
US20040131204A1 (en) 2004-07-08
TW200419929A (en) 2004-10-01
IL168636A (en) 2011-01-31
HK1079327A1 (en) 2006-03-31
MY138588A (en) 2009-07-31

Similar Documents

Publication Publication Date Title
AU2003303495B2 (en) Reducing scale factor transmission cost for MPEG-2 AAC using a lattice
CN109313908B (zh) 用于对音频信号进行编码的音频编码器以及方法
US7383180B2 (en) Constant bitrate media encoding techniques
US5226084A (en) Methods for speech quantization and error correction
CA2327405C (en) Perceptual audio coder bit allocation scheme providing improved perceptual quality consistency
EP1072036B1 (en) Fast frame optimisation in an audio encoder
KR102017892B1 (ko) 향상된 계층적 코딩
US7650277B2 (en) System, method, and apparatus for fast quantization in perceptual audio coders
JP4903130B2 (ja) 知覚コーディングのビット割り当てにおける複雑さを軽減した計算方法
US9159330B2 (en) Rate controller, rate control method, and rate control program
Oh et al. Low power MPEG/audio encoders using simplified psychoacoustic model and fast bit allocation
HK1079327B (en) Reducing scale factor transmission cost for mpeg-2 aac using a lattice
JP6224827B2 (ja) 分配量子化及び符号化を使用した累積和表現のモデル化によるオーディオ信号包絡符号化、処理及び復号化の装置と方法
KR101789085B1 (ko) 분포 양자화 및 코딩을 사용하는 오디오 신호 엔벨로프의 분할에 의한 오디오 신호 엔벨로프 인코딩, 처리 및 디코딩을 위한 장치 및 방법
JP2000137497A (ja) デジタル音響信号符号化装置、デジタル音響信号符号化方法及びデジタル音響信号符号化プログラムを記録した媒体
WO2011000434A1 (en) An apparatus
JPH06291679A (ja) オーディオ信号のためのしきい値制御量子化決定法

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 168636

Country of ref document: IL

WWE Wipo information: entry into national phase

Ref document number: 921/KOLNP/2005

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2507535

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 2003303495

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 377709

Country of ref document: PL

WWE Wipo information: entry into national phase

Ref document number: PA/a/2005/007183

Country of ref document: MX

WWE Wipo information: entry into national phase

Ref document number: 1020057012534

Country of ref document: KR

Ref document number: 20038A81720

Country of ref document: CN

WWE Wipo information: entry into national phase

Ref document number: 2004565543

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2003808458

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1020057012534

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2003808458

Country of ref document: EP