WO2004061823A1 - Reducing scale factor transmission cost for mpeg-2 aac using a lattice - Google Patents
Reducing scale factor transmission cost for mpeg-2 aac using a lattice Download PDFInfo
- Publication number
- WO2004061823A1 WO2004061823A1 PCT/US2003/040173 US0340173W WO2004061823A1 WO 2004061823 A1 WO2004061823 A1 WO 2004061823A1 US 0340173 W US0340173 W US 0340173W WO 2004061823 A1 WO2004061823 A1 WO 2004061823A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- scale factor
- band
- frequency bands
- scale
- values
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/035—Scalar quantisation
Definitions
- the present invention is directed to a method for reducing the total bit cost of a perceptual audio encoder employing adaptive bit allocation in which a time domain representation of an audio signal is divided into successive time blocks, each time block is divided into frequency bands, and a scale factor is assigned to each of ones of the frequency bands, wherein the number of bits required to represent each block increases with increases in the scale factor values and with increases in band-to-band variations in scale factor values.
- a ⁇ parameter which is a function of the value of the scale factor, is optional (if omitted, it is replaced by a constant value equal to 1) but greatly improves the performance of the algorithm if it is estimated accurately.
- a i is assumed to be constant if the scale factors are only modified slightly from their preliminary value. For simplicity, this may be achieved by counting the number of MDCT coefficients in a band that has an absolute value greater than some predefined threshold.
- Block 144 identifies the new, adjusted scale factor value for each backwardly successive scale factor band as i is decremented from N-1 to zero.
- FIG. 3 shows the effect of applying the scale factor optimization of the present invention to the preliminary scale factors derived by means of the direct estimation technique for a single AAC audio frame.
- the circles plotted in FIG. 3 represent the unadjusted scale factors; while the plus plotted points represent the adjusted scale factors according to an application of the present invention.
- the scale factor optimization technique according to the present invention greatly reduces the variation in the scale factors. Also the adjusted scale factors are always increased, not just saving bits overall but decreasing the quantization noise not only in the bands in which the scale factors are increased, but also in other bands as a result of overall bit savings (thus allowing more bits to be allocated to other bands). The bit savings achieved by this technique are shown in FIG.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Enzymes And Modification Thereof (AREA)
- Compression Of Band Width Or Redundancy In Fax (AREA)
- Analysing Materials By The Use Of Radiation (AREA)
- Crystals, And After-Treatments Of Crystals (AREA)
- Control Of Indicators Other Than Cathode Ray Tubes (AREA)
- Peptides Or Proteins (AREA)
Priority Applications (10)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP03808458A EP1581928B1 (en) | 2003-01-02 | 2003-12-16 | Reducing scale factor transmission cost for mpeg-2 aac using a lattice |
| AU2003303495A AU2003303495B2 (en) | 2003-01-02 | 2003-12-16 | Reducing scale factor transmission cost for MPEG-2 AAC using a lattice |
| HK05111135.8A HK1079327B (en) | 2003-01-02 | 2003-12-16 | Reducing scale factor transmission cost for mpeg-2 aac using a lattice |
| CA2507535A CA2507535C (en) | 2003-01-02 | 2003-12-16 | Reducing scale factor transmission cost for mpeg-2 advanced audio coding (aac) using a lattice based post processing technique |
| JP2004565543A JP4425148B2 (ja) | 2003-01-02 | 2003-12-16 | 格子基ポスト処理技術を用いるmpeg−2アドバンスドオーディオコーディング(aac)のためのスケール因子伝達コスト低減 |
| DK03808458T DK1581928T3 (da) | 2003-01-02 | 2003-12-16 | Reduktion af skalafaktor transmissionsomkostninger for en MPEG-2 AAC under anvendelse af et gitter |
| DE60324465T DE60324465D1 (de) | 2003-01-02 | 2003-12-16 | Verringerung von skalierungsfaktor-übertragungskosten für mpeg-2-aac unter verwendung eines gitters |
| MXPA05007183A MXPA05007183A (es) | 2003-01-02 | 2003-12-16 | Reduccion del costo de transmision de factores de escala para codificacion de audio avanzada mpeg-2 usando una celosia. |
| CN2003801081720A CN1735925B (zh) | 2003-01-02 | 2003-12-16 | 使用网格降低mpeg-2高级音频编码的比例因子传输成本 |
| IL168636A IL168636A (en) | 2003-01-02 | 2005-05-17 | Reducing scale factor transmission cost for mpeg-2 aac using a lattice |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/336,637 | 2003-01-02 | ||
| US10/336,637 US7272566B2 (en) | 2003-01-02 | 2003-01-02 | Reducing scale factor transmission cost for MPEG-2 advanced audio coding (AAC) using a lattice based post processing technique |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2004061823A1 true WO2004061823A1 (en) | 2004-07-22 |
Family
ID=32681060
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2003/040173 Ceased WO2004061823A1 (en) | 2003-01-02 | 2003-12-16 | Reducing scale factor transmission cost for mpeg-2 aac using a lattice |
Country Status (17)
| Country | Link |
|---|---|
| US (1) | US7272566B2 (https=) |
| EP (1) | EP1581928B1 (https=) |
| JP (1) | JP4425148B2 (https=) |
| KR (1) | KR101045520B1 (https=) |
| CN (1) | CN1735925B (https=) |
| AT (1) | ATE412960T1 (https=) |
| AU (1) | AU2003303495B2 (https=) |
| CA (1) | CA2507535C (https=) |
| DE (1) | DE60324465D1 (https=) |
| DK (1) | DK1581928T3 (https=) |
| ES (1) | ES2312852T3 (https=) |
| IL (1) | IL168636A (https=) |
| MX (1) | MXPA05007183A (https=) |
| MY (1) | MY138588A (https=) |
| PL (1) | PL208346B1 (https=) |
| TW (1) | TWI335145B (https=) |
| WO (1) | WO2004061823A1 (https=) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8255212B2 (en) | 2006-07-04 | 2012-08-28 | Dolby International Ab | Filter compressor and method for manufacturing compressed subband filter impulse responses |
Families Citing this family (34)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2005292702A (ja) * | 2004-04-05 | 2005-10-20 | Kddi Corp | オーディオフレームに対するフェードイン/フェードアウト処理装置及びプログラム |
| US8543390B2 (en) | 2004-10-26 | 2013-09-24 | Qnx Software Systems Limited | Multi-channel periodic signal enhancement system |
| US7716046B2 (en) * | 2004-10-26 | 2010-05-11 | Qnx Software Systems (Wavemakers), Inc. | Advanced periodic signal enhancement |
| US8306821B2 (en) | 2004-10-26 | 2012-11-06 | Qnx Software Systems Limited | Sub-band periodic signal enhancement system |
| US8170879B2 (en) * | 2004-10-26 | 2012-05-01 | Qnx Software Systems Limited | Periodic signal enhancement system |
| US7610196B2 (en) * | 2004-10-26 | 2009-10-27 | Qnx Software Systems (Wavemakers), Inc. | Periodic signal enhancement system |
| US7949520B2 (en) * | 2004-10-26 | 2011-05-24 | QNX Software Sytems Co. | Adaptive filter pitch extraction |
| US7680652B2 (en) * | 2004-10-26 | 2010-03-16 | Qnx Software Systems (Wavemakers), Inc. | Periodic signal enhancement system |
| KR100707173B1 (ko) * | 2004-12-21 | 2007-04-13 | 삼성전자주식회사 | 저비트율 부호화/복호화방법 및 장치 |
| US7590523B2 (en) * | 2006-03-20 | 2009-09-15 | Mindspeed Technologies, Inc. | Speech post-processing using MDCT coefficients |
| US8032371B2 (en) * | 2006-07-28 | 2011-10-04 | Apple Inc. | Determining scale factor values in encoding audio data with AAC |
| US8010370B2 (en) * | 2006-07-28 | 2011-08-30 | Apple Inc. | Bitrate control for perceptual coding |
| CN101308659B (zh) * | 2007-05-16 | 2011-11-30 | 中兴通讯股份有限公司 | 一种基于先进音频编码器的心理声学模型的处理方法 |
| US8788264B2 (en) * | 2007-06-27 | 2014-07-22 | Nec Corporation | Audio encoding method, audio decoding method, audio encoding device, audio decoding device, program, and audio encoding/decoding system |
| EP2186087B1 (en) * | 2007-08-27 | 2011-11-30 | Telefonaktiebolaget L M Ericsson (PUBL) | Improved transform coding of speech and audio signals |
| US8850154B2 (en) | 2007-09-11 | 2014-09-30 | 2236008 Ontario Inc. | Processing system having memory partitioning |
| US8904400B2 (en) | 2007-09-11 | 2014-12-02 | 2236008 Ontario Inc. | Processing system having a partitioning component for resource partitioning |
| US8694310B2 (en) | 2007-09-17 | 2014-04-08 | Qnx Software Systems Limited | Remote control server protocol system |
| CN101854175B (zh) * | 2007-10-12 | 2013-04-17 | 联咏科技股份有限公司 | 可降低信号功率频谱密度的编码方法 |
| GB2454190A (en) * | 2007-10-30 | 2009-05-06 | Cambridge Silicon Radio Ltd | Minimising a cost function in encoding data using spectral partitioning |
| US8209514B2 (en) | 2008-02-04 | 2012-06-26 | Qnx Software Systems Limited | Media processing system having resource partitioning |
| EP2274833B1 (en) * | 2008-04-16 | 2016-08-10 | Huawei Technologies Co., Ltd. | Vector quantisation method |
| US8290782B2 (en) * | 2008-07-24 | 2012-10-16 | Dts, Inc. | Compression of audio scale-factors by two-dimensional transformation |
| JP5304504B2 (ja) * | 2009-07-17 | 2013-10-02 | ソニー株式会社 | 信号符号化装置、信号復号装置、信号処理システム、これらにおける処理方法およびプログラム |
| EP2346031B1 (en) * | 2009-11-26 | 2015-09-30 | BlackBerry Limited | Rate-distortion optimization for advanced audio coding |
| US8380524B2 (en) * | 2009-11-26 | 2013-02-19 | Research In Motion Limited | Rate-distortion optimization for advanced audio coding |
| MX2013013261A (es) * | 2011-05-13 | 2014-02-20 | Samsung Electronics Co Ltd | Asignacion de bits, codificacion y decodificacion de audio. |
| US9293146B2 (en) * | 2012-09-04 | 2016-03-22 | Apple Inc. | Intensity stereo coding in advanced audio coding |
| US20140344159A1 (en) * | 2013-05-20 | 2014-11-20 | Dell Products, Lp | License Key Generation |
| EP2830058A1 (en) | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Frequency-domain audio coding supporting transform length switching |
| TWI557726B (zh) * | 2013-08-29 | 2016-11-11 | 杜比國際公司 | 用於決定音頻信號的高頻帶信號的主比例因子頻帶表之系統和方法 |
| US10354668B2 (en) | 2017-03-22 | 2019-07-16 | Immersion Networks, Inc. | System and method for processing audio data |
| CN110426569B (zh) * | 2019-07-12 | 2021-09-21 | 国网上海市电力公司 | 一种变压器声信号降噪处理方法 |
| US20220156982A1 (en) * | 2020-11-19 | 2022-05-19 | Nvidia Corporation | Calculating data compression parameters |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5581653A (en) * | 1993-08-31 | 1996-12-03 | Dolby Laboratories Licensing Corporation | Low bit-rate high-resolution spectral envelope coding for audio encoder and decoder |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5956674A (en) * | 1995-12-01 | 1999-09-21 | Digital Theater Systems, Inc. | Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels |
| US5845249A (en) * | 1996-05-03 | 1998-12-01 | Lsi Logic Corporation | Microarchitecture of audio core for an MPEG-2 and AC-3 decoder |
| US6430533B1 (en) * | 1996-05-03 | 2002-08-06 | Lsi Logic Corporation | Audio decoder core MPEG-1/MPEG-2/AC-3 functional algorithm partitioning and implementation |
| US6226616B1 (en) * | 1999-06-21 | 2001-05-01 | Digital Theater Systems, Inc. | Sound quality of established low bit-rate audio coding systems without loss of decoder compatibility |
| FR2822122B1 (fr) * | 2001-03-14 | 2003-05-23 | Nacam | Assemblage d'un etrier de colonne de direction avec un pignon de direction d'un vehicule automobile |
| US6934677B2 (en) * | 2001-12-14 | 2005-08-23 | Microsoft Corporation | Quantization matrices based on critical band pattern information for digital audio wherein quantization bands differ from critical bands |
| US7027982B2 (en) * | 2001-12-14 | 2006-04-11 | Microsoft Corporation | Quality and rate control strategy for digital audio |
-
2003
- 2003-01-02 US US10/336,637 patent/US7272566B2/en not_active Expired - Fee Related
- 2003-12-12 TW TW092135218A patent/TWI335145B/zh not_active IP Right Cessation
- 2003-12-16 DE DE60324465T patent/DE60324465D1/de not_active Expired - Lifetime
- 2003-12-16 WO PCT/US2003/040173 patent/WO2004061823A1/en not_active Ceased
- 2003-12-16 KR KR1020057012534A patent/KR101045520B1/ko not_active Expired - Fee Related
- 2003-12-16 EP EP03808458A patent/EP1581928B1/en not_active Expired - Lifetime
- 2003-12-16 DK DK03808458T patent/DK1581928T3/da active
- 2003-12-16 CN CN2003801081720A patent/CN1735925B/zh not_active Expired - Fee Related
- 2003-12-16 ES ES03808458T patent/ES2312852T3/es not_active Expired - Lifetime
- 2003-12-16 JP JP2004565543A patent/JP4425148B2/ja not_active Expired - Fee Related
- 2003-12-16 AT AT03808458T patent/ATE412960T1/de active
- 2003-12-16 MX MXPA05007183A patent/MXPA05007183A/es active IP Right Grant
- 2003-12-16 CA CA2507535A patent/CA2507535C/en not_active Expired - Fee Related
- 2003-12-16 PL PL377709A patent/PL208346B1/pl unknown
- 2003-12-16 AU AU2003303495A patent/AU2003303495B2/en not_active Ceased
- 2003-12-31 MY MYPI20035050A patent/MY138588A/en unknown
-
2005
- 2005-05-17 IL IL168636A patent/IL168636A/en active IP Right Grant
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5581653A (en) * | 1993-08-31 | 1996-12-03 | Dolby Laboratories Licensing Corporation | Low bit-rate high-resolution spectral envelope coding for audio encoder and decoder |
Non-Patent Citations (2)
| Title |
|---|
| AGGARWAL A ET AL: "Trellis-based optimization of MPEG-4 advanced audio coding", IEEE WORKSHOP ON SPEECH CODING. PROCEEDINGS. MEETING THE CHALLENGES OF THE NEW MILLENNIUM, 17 September 2000 (2000-09-17), pages 142 - 144, XP010520069 * |
| BOSI M ET AL: "ISO/IEC MPEG-2 ADVANCED AUDIO CODING", JOURNAL OF THE AUDIO ENGINEERING SOCIETY, AUDIO ENGINEERING SOCIETY. NEW YORK, US, vol. 45, no. 10, 1 October 1997 (1997-10-01), pages 789 - 812, XP000730161, ISSN: 0004-7554 * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8255212B2 (en) | 2006-07-04 | 2012-08-28 | Dolby International Ab | Filter compressor and method for manufacturing compressed subband filter impulse responses |
Also Published As
| Publication number | Publication date |
|---|---|
| AU2003303495A1 (en) | 2004-07-29 |
| JP2006512617A (ja) | 2006-04-13 |
| ES2312852T3 (es) | 2009-03-01 |
| KR101045520B1 (ko) | 2011-06-30 |
| EP1581928A1 (en) | 2005-10-05 |
| ATE412960T1 (de) | 2008-11-15 |
| MXPA05007183A (es) | 2005-09-12 |
| CN1735925B (zh) | 2010-04-28 |
| CA2507535A1 (en) | 2004-07-22 |
| CA2507535C (en) | 2013-02-12 |
| CN1735925A (zh) | 2006-02-15 |
| DE60324465D1 (de) | 2008-12-11 |
| JP4425148B2 (ja) | 2010-03-03 |
| EP1581928B1 (en) | 2008-10-29 |
| TWI335145B (en) | 2010-12-21 |
| US7272566B2 (en) | 2007-09-18 |
| PL208346B1 (pl) | 2011-04-29 |
| PL377709A1 (pl) | 2006-02-06 |
| KR20050089870A (ko) | 2005-09-08 |
| AU2003303495B2 (en) | 2009-02-19 |
| DK1581928T3 (da) | 2009-01-19 |
| US20040131204A1 (en) | 2004-07-08 |
| TW200419929A (en) | 2004-10-01 |
| IL168636A (en) | 2011-01-31 |
| HK1079327A1 (en) | 2006-03-31 |
| MY138588A (en) | 2009-07-31 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| AU2003303495B2 (en) | Reducing scale factor transmission cost for MPEG-2 AAC using a lattice | |
| CN109313908B (zh) | 用于对音频信号进行编码的音频编码器以及方法 | |
| US7383180B2 (en) | Constant bitrate media encoding techniques | |
| US5226084A (en) | Methods for speech quantization and error correction | |
| CA2327405C (en) | Perceptual audio coder bit allocation scheme providing improved perceptual quality consistency | |
| EP1072036B1 (en) | Fast frame optimisation in an audio encoder | |
| KR102017892B1 (ko) | 향상된 계층적 코딩 | |
| US7650277B2 (en) | System, method, and apparatus for fast quantization in perceptual audio coders | |
| JP4903130B2 (ja) | 知覚コーディングのビット割り当てにおける複雑さを軽減した計算方法 | |
| US9159330B2 (en) | Rate controller, rate control method, and rate control program | |
| Oh et al. | Low power MPEG/audio encoders using simplified psychoacoustic model and fast bit allocation | |
| HK1079327B (en) | Reducing scale factor transmission cost for mpeg-2 aac using a lattice | |
| JP6224827B2 (ja) | 分配量子化及び符号化を使用した累積和表現のモデル化によるオーディオ信号包絡符号化、処理及び復号化の装置と方法 | |
| KR101789085B1 (ko) | 분포 양자화 및 코딩을 사용하는 오디오 신호 엔벨로프의 분할에 의한 오디오 신호 엔벨로프 인코딩, 처리 및 디코딩을 위한 장치 및 방법 | |
| JP2000137497A (ja) | デジタル音響信号符号化装置、デジタル音響信号符号化方法及びデジタル音響信号符号化プログラムを記録した媒体 | |
| WO2011000434A1 (en) | An apparatus | |
| JPH06291679A (ja) | オーディオ信号のためのしきい値制御量子化決定法 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| WWE | Wipo information: entry into national phase |
Ref document number: 168636 Country of ref document: IL |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 921/KOLNP/2005 Country of ref document: IN |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2507535 Country of ref document: CA |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2003303495 Country of ref document: AU |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 377709 Country of ref document: PL |
|
| WWE | Wipo information: entry into national phase |
Ref document number: PA/a/2005/007183 Country of ref document: MX |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 1020057012534 Country of ref document: KR Ref document number: 20038A81720 Country of ref document: CN |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2004565543 Country of ref document: JP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2003808458 Country of ref document: EP |
|
| WWP | Wipo information: published in national office |
Ref document number: 1020057012534 Country of ref document: KR |
|
| WWP | Wipo information: published in national office |
Ref document number: 2003808458 Country of ref document: EP |