CA2366560C - Quantization in perceptual audio coders with compensation for synthesis filter noise spreading - Google Patents
Quantization in perceptual audio coders with compensation for synthesis filter noise spreading Download PDFInfo
- Publication number
- CA2366560C CA2366560C CA002366560A CA2366560A CA2366560C CA 2366560 C CA2366560 C CA 2366560C CA 002366560 A CA002366560 A CA 002366560A CA 2366560 A CA2366560 A CA 2366560A CA 2366560 C CA2366560 C CA 2366560C
- Authority
- CA
- Canada
- Prior art keywords
- noise
- synthesis
- quantization
- filter
- subband
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000013139 quantization Methods 0.000 title claims abstract description 132
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 101
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 101
- 238000003892 spreading Methods 0.000 title claims abstract description 81
- 230000007480 spreading Effects 0.000 title abstract description 32
- 238000000034 method Methods 0.000 claims abstract description 108
- 230000008569 process Effects 0.000 claims abstract description 76
- 238000004458 analytical method Methods 0.000 claims abstract description 43
- 238000001228 spectrum Methods 0.000 claims description 54
- 230000004044 response Effects 0.000 claims description 24
- 238000012545 processing Methods 0.000 claims description 17
- 238000004590 computer program Methods 0.000 claims 2
- 238000005457 optimization Methods 0.000 abstract description 13
- 230000003595 spectral effect Effects 0.000 description 35
- 230000006870 function Effects 0.000 description 31
- 230000005236 sound signal Effects 0.000 description 26
- 230000014509 gene expression Effects 0.000 description 23
- 230000000873 masking effect Effects 0.000 description 21
- 239000011159 matrix material Substances 0.000 description 14
- 230000002829 reductive effect Effects 0.000 description 11
- 238000012937 correction Methods 0.000 description 8
- 238000003860 storage Methods 0.000 description 8
- 239000000284 extract Substances 0.000 description 6
- 230000000295 complement effect Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 230000008859 change Effects 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000004891 communication Methods 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 230000003466 anti-cipated effect Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000007812 deficiency Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 101100445834 Drosophila melanogaster E(z) gene Proteins 0.000 description 1
- 101000800807 Homo sapiens Tumor necrosis factor alpha-induced protein 8 Proteins 0.000 description 1
- 102100033649 Tumor necrosis factor alpha-induced protein 8 Human genes 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000009432 framing Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000002085 persistent effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09/289,865 | 1999-04-12 | ||
| US09/289,865 US6363338B1 (en) | 1999-04-12 | 1999-04-12 | Quantization in perceptual audio coders with compensation for synthesis filter noise spreading |
| PCT/US2000/009557 WO2000062434A1 (en) | 1999-04-12 | 2000-04-10 | Quantization in perceptual audio coders with compensation for synthesis filter noise spreading |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CA2366560A1 CA2366560A1 (en) | 2000-10-19 |
| CA2366560C true CA2366560C (en) | 2008-07-29 |
Family
ID=23113455
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA002366560A Expired - Lifetime CA2366560C (en) | 1999-04-12 | 2000-04-10 | Quantization in perceptual audio coders with compensation for synthesis filter noise spreading |
Country Status (13)
| Country | Link |
|---|---|
| US (1) | US6363338B1 (enExample) |
| EP (1) | EP1177639B1 (enExample) |
| JP (1) | JP4643019B2 (enExample) |
| KR (1) | KR100758215B1 (enExample) |
| AR (1) | AR024858A1 (enExample) |
| AT (1) | ATE248463T1 (enExample) |
| AU (1) | AU771869B2 (enExample) |
| CA (1) | CA2366560C (enExample) |
| DE (1) | DE60004814T2 (enExample) |
| HK (1) | HK1044235B (enExample) |
| MY (1) | MY120387A (enExample) |
| TW (1) | TW531986B (enExample) |
| WO (1) | WO2000062434A1 (enExample) |
Families Citing this family (51)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE19947877C2 (de) * | 1999-10-05 | 2001-09-13 | Fraunhofer Ges Forschung | Verfahren und Vorrichtung zum Einbringen von Informationen in einen Datenstrom sowie Verfahren und Vorrichtung zum Codieren eines Audiosignals |
| US7734448B2 (en) * | 2000-01-10 | 2010-06-08 | Canning Francis X | Sparse and efficient block factorization for interaction data |
| US7720651B2 (en) * | 2000-09-29 | 2010-05-18 | Canning Francis X | Compression of interaction data using directional sources and/or testers |
| TW499672B (en) * | 2000-02-18 | 2002-08-21 | Intervideo Inc | Fast convergence method for bit allocation stage of MPEG audio layer 3 encoders |
| US7050924B2 (en) * | 2000-06-12 | 2006-05-23 | British Telecommunications Public Limited Company | Test signalling |
| US7945430B2 (en) | 2000-09-29 | 2011-05-17 | Canning Francis X | Compression and compressed inversion of interaction data |
| US7031955B1 (en) * | 2001-04-27 | 2006-04-18 | I2 Technologies Us, Inc. | Optimization using a multi-dimensional data model |
| SE0202159D0 (sv) | 2001-07-10 | 2002-07-09 | Coding Technologies Sweden Ab | Efficientand scalable parametric stereo coding for low bitrate applications |
| US6987889B1 (en) * | 2001-08-10 | 2006-01-17 | Polycom, Inc. | System and method for dynamic perceptual coding of macroblocks in a video frame |
| US6732071B2 (en) * | 2001-09-27 | 2004-05-04 | Intel Corporation | Method, apparatus, and system for efficient rate control in audio encoding |
| US7469206B2 (en) | 2001-11-29 | 2008-12-23 | Coding Technologies Ab | Methods for improving high frequency reconstruction |
| US7447631B2 (en) * | 2002-06-17 | 2008-11-04 | Dolby Laboratories Licensing Corporation | Audio coding system using spectral hole filling |
| US20040030555A1 (en) * | 2002-08-12 | 2004-02-12 | Oregon Health & Science University | System and method for concatenating acoustic contours for speech synthesis |
| SE0202770D0 (sv) * | 2002-09-18 | 2002-09-18 | Coding Technologies Sweden Ab | Method for reduction of aliasing introduces by spectral envelope adjustment in real-valued filterbanks |
| US7376553B2 (en) * | 2003-07-08 | 2008-05-20 | Robert Patel Quinn | Fractal harmonic overtone mapping of speech and musical sounds |
| AU2003264322A1 (en) * | 2003-09-17 | 2005-04-06 | Beijing E-World Technology Co., Ltd. | Method and device of multi-resolution vector quantilization for audio encoding and decoding |
| US7539614B2 (en) * | 2003-11-14 | 2009-05-26 | Nxp B.V. | System and method for audio signal processing using different gain factors for voiced and unvoiced phonemes |
| KR20070001115A (ko) * | 2004-01-28 | 2007-01-03 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | 복소수 값 데이터를 이용하는 오디오 신호 디코딩 |
| DE102004009955B3 (de) * | 2004-03-01 | 2005-08-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Ermitteln einer Quantisierer-Schrittweite |
| US7512536B2 (en) * | 2004-05-14 | 2009-03-31 | Texas Instruments Incorporated | Efficient filter bank computation for audio coding |
| US7720236B2 (en) * | 2004-10-15 | 2010-05-18 | Lifesize Communications, Inc. | Updating modeling information based on offline calibration experiments |
| US7903137B2 (en) * | 2004-10-15 | 2011-03-08 | Lifesize Communications, Inc. | Videoconferencing echo cancellers |
| US8116500B2 (en) * | 2004-10-15 | 2012-02-14 | Lifesize Communications, Inc. | Microphone orientation and size in a speakerphone |
| US7720232B2 (en) * | 2004-10-15 | 2010-05-18 | Lifesize Communications, Inc. | Speakerphone |
| US7970151B2 (en) * | 2004-10-15 | 2011-06-28 | Lifesize Communications, Inc. | Hybrid beamforming |
| US20060132595A1 (en) * | 2004-10-15 | 2006-06-22 | Kenoyer Michael L | Speakerphone supporting video and audio features |
| US7826624B2 (en) * | 2004-10-15 | 2010-11-02 | Lifesize Communications, Inc. | Speakerphone self calibration and beam forming |
| US7760887B2 (en) * | 2004-10-15 | 2010-07-20 | Lifesize Communications, Inc. | Updating modeling information based on online data gathering |
| US7751572B2 (en) * | 2005-04-15 | 2010-07-06 | Dolby International Ab | Adaptive residual audio coding |
| US7970150B2 (en) * | 2005-04-29 | 2011-06-28 | Lifesize Communications, Inc. | Tracking talkers using virtual broadside scan and directed beams |
| US7593539B2 (en) * | 2005-04-29 | 2009-09-22 | Lifesize Communications, Inc. | Microphone and speaker arrangement in speakerphone |
| US7991167B2 (en) * | 2005-04-29 | 2011-08-02 | Lifesize Communications, Inc. | Forming beams with nulls directed at noise sources |
| US7974713B2 (en) * | 2005-10-12 | 2011-07-05 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Temporal and spatial shaping of multi-channel audio signals |
| US7835904B2 (en) * | 2006-03-03 | 2010-11-16 | Microsoft Corp. | Perceptual, scalable audio compression |
| KR101393298B1 (ko) * | 2006-07-08 | 2014-05-12 | 삼성전자주식회사 | 적응적 부호화/복호화 방법 및 장치 |
| JP5096468B2 (ja) * | 2006-08-15 | 2012-12-12 | ドルビー ラボラトリーズ ライセンシング コーポレイション | サイド情報なしの時間的ノイズエンベロープの自由な整形 |
| FR2912249A1 (fr) * | 2007-02-02 | 2008-08-08 | France Telecom | Codage/decodage perfectionnes de signaux audionumeriques. |
| EP2274833B1 (en) * | 2008-04-16 | 2016-08-10 | Huawei Technologies Co., Ltd. | Vector quantisation method |
| US20100106269A1 (en) * | 2008-09-26 | 2010-04-29 | Qualcomm Incorporated | Method and apparatus for signal processing using transform-domain log-companding |
| US11657788B2 (en) | 2009-05-27 | 2023-05-23 | Dolby International Ab | Efficient combined harmonic transposition |
| TWI591625B (zh) | 2009-05-27 | 2017-07-11 | 杜比國際公司 | 從訊號的低頻成份產生該訊號之高頻成份的系統與方法,及其機上盒、電腦程式產品、軟體程式及儲存媒體 |
| KR101599884B1 (ko) * | 2009-08-18 | 2016-03-04 | 삼성전자주식회사 | 멀티 채널 오디오 디코딩 방법 및 장치 |
| EP3693963B1 (en) * | 2009-10-15 | 2021-07-21 | VoiceAge Corporation | Simultaneous time-domain and frequency-domain noise shaping for tdac transforms |
| PL4542546T3 (pl) | 2009-10-21 | 2025-12-08 | Dolby International Ab | Nadpróbkowanie w banku filtrów połączonym z modułem transpozycji |
| US8958510B1 (en) * | 2010-06-10 | 2015-02-17 | Fredric J. Harris | Selectable bandwidth filter |
| US20120029926A1 (en) * | 2010-07-30 | 2012-02-02 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for dependent-mode coding of audio signals |
| US9208792B2 (en) | 2010-08-17 | 2015-12-08 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for noise injection |
| US9225310B1 (en) * | 2012-11-08 | 2015-12-29 | iZotope, Inc. | Audio limiter system and method |
| US10325584B2 (en) | 2014-12-10 | 2019-06-18 | Stmicroelectronics S.R.L. | Active noise cancelling device and method of actively cancelling acoustic noise |
| EP3616197B1 (en) | 2017-04-28 | 2025-06-18 | DTS, Inc. | Audio coder window sizes and time-frequency transformations |
| US10886943B2 (en) * | 2019-03-18 | 2021-01-05 | Samsung Electronics Co., Ltd | Method and apparatus for variable rate compression with a conditional autoencoder |
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4956871A (en) * | 1988-09-30 | 1990-09-11 | At&T Bell Laboratories | Improving sub-band coding of speech at low bit rates by adding residual speech energy signals to sub-bands |
| US5222189A (en) * | 1989-01-27 | 1993-06-22 | Dolby Laboratories Licensing Corporation | Low time-delay transform coder, decoder, and encoder/decoder for high-quality audio |
| JP2906646B2 (ja) * | 1990-11-09 | 1999-06-21 | 松下電器産業株式会社 | 音声帯域分割符号化装置 |
| EP0559348A3 (en) * | 1992-03-02 | 1993-11-03 | AT&T Corp. | Rate control loop processor for perceptual encoder/decoder |
| EP0709006B1 (en) * | 1993-07-16 | 1997-03-05 | Dolby Laboratories Licensing Corporation | Computationally efficient adaptive bit allocation for encoding method and apparatus with allowance for decoder spectral distortions |
| US5623577A (en) * | 1993-07-16 | 1997-04-22 | Dolby Laboratories Licensing Corporation | Computationally efficient adaptive bit allocation for encoding method and apparatus with allowance for decoder spectral distortions |
| EP0722225A3 (de) * | 1994-11-17 | 2000-06-07 | Deutsche Thomson-Brandt Gmbh | Audiosignalkodierung mittels Kurzzeitspektren und einem psychoakustischen Modell |
| JP2820117B2 (ja) * | 1996-05-29 | 1998-11-05 | 日本電気株式会社 | 音声符号化装置 |
| US5913191A (en) * | 1997-10-17 | 1999-06-15 | Dolby Laboratories Licensing Corporation | Frame-based audio coding with additional filterbank to suppress aliasing artifacts at frame boundaries |
-
1999
- 1999-04-12 US US09/289,865 patent/US6363338B1/en not_active Expired - Lifetime
-
2000
- 2000-04-10 EP EP00923218A patent/EP1177639B1/en not_active Expired - Lifetime
- 2000-04-10 HK HK02105731.1A patent/HK1044235B/en unknown
- 2000-04-10 KR KR1020017013052A patent/KR100758215B1/ko not_active Expired - Lifetime
- 2000-04-10 JP JP2000611392A patent/JP4643019B2/ja not_active Expired - Lifetime
- 2000-04-10 AU AU43382/00A patent/AU771869B2/en not_active Expired
- 2000-04-10 AR ARP000101633A patent/AR024858A1/es active IP Right Grant
- 2000-04-10 WO PCT/US2000/009557 patent/WO2000062434A1/en not_active Ceased
- 2000-04-10 CA CA002366560A patent/CA2366560C/en not_active Expired - Lifetime
- 2000-04-10 DE DE60004814T patent/DE60004814T2/de not_active Expired - Lifetime
- 2000-04-10 AT AT00923218T patent/ATE248463T1/de not_active IP Right Cessation
- 2000-04-11 MY MYPI20001499A patent/MY120387A/en unknown
- 2000-04-11 TW TW089106700A patent/TW531986B/zh not_active IP Right Cessation
Also Published As
| Publication number | Publication date |
|---|---|
| AU771869B2 (en) | 2004-04-01 |
| EP1177639A1 (en) | 2002-02-06 |
| KR20010112423A (ko) | 2001-12-20 |
| DE60004814T2 (de) | 2004-07-01 |
| EP1177639B1 (en) | 2003-08-27 |
| MY120387A (en) | 2005-10-31 |
| ATE248463T1 (de) | 2003-09-15 |
| JP2002542648A (ja) | 2002-12-10 |
| TW531986B (en) | 2003-05-11 |
| KR100758215B1 (ko) | 2007-09-12 |
| AR024858A1 (es) | 2002-10-30 |
| HK1044235A1 (en) | 2002-10-11 |
| JP4643019B2 (ja) | 2011-03-02 |
| DE60004814D1 (de) | 2003-10-02 |
| CA2366560A1 (en) | 2000-10-19 |
| AU4338200A (en) | 2000-11-14 |
| WO2000062434A1 (en) | 2000-10-19 |
| HK1044235B (en) | 2003-12-24 |
| US6363338B1 (en) | 2002-03-26 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CA2366560C (en) | Quantization in perceptual audio coders with compensation for synthesis filter noise spreading | |
| EP2216777B1 (en) | Audio coding system using spectral hole filling | |
| US6058362A (en) | System and method for masking quantization noise of audio signals | |
| JP3297051B2 (ja) | 適応ビット配分符号化装置及び方法 | |
| EP1080542B1 (en) | System and method for masking quantization noise of audio signals | |
| US6029126A (en) | Scalable audio coder and decoder | |
| US7627469B2 (en) | Audio signal encoding apparatus and audio signal encoding method | |
| US8032371B2 (en) | Determining scale factor values in encoding audio data with AAC | |
| MXPA01010447A (es) | Utilizacion de cuantificacion adaptativa de ganancia y longitudes de simbolos no uniformes para codificacion de audio. | |
| AU2003237295B2 (en) | Audio coding system using spectral hole filling | |
| Trinkaus et al. | An algorithm for compression of wideband diverse speech and audio signals | |
| Bhaskar | Low rate coding of audio by a predictive transform coder for efficient satellite transmission | |
| HK1141624B (en) | Audio coding system using spectral hole filling | |
| HK1070729B (en) | Audio coding system using spectral hole filling | |
| HK1141623B (en) | Audio decoding system using spectral hole filling |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| EEER | Examination request | ||
| MKEX | Expiry |
Effective date: 20200410 |