JP4643019B2 - Quantization in perceptual audio coders with compensation for synthesis filter noise spreading - Google Patents

Quantization in perceptual audio coders with compensation for synthesis filter noise spreading

Info

Publication number
JP4643019B2
Authority
JP
Japan
Prior art keywords
noise
synthesis filter
quantization
subband signal
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
JP2000611392A
Other languages
English (en)
Japanese (ja)
Other versions
JP2002542648A (ja)
JP2002542648A5
Inventor
Ubale, Anil Wamanrao
Davidson, Grant Allen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby Laboratories Licensing Corp
Original Assignee
Dolby Laboratories Licensing Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby Laboratories Licensing Corp
Publication of JP2002542648A
Publication of JP2002542648A5
Application granted
Publication of JP4643019B2
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002 Dynamic bit allocation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
JP2000611392A 1999-04-12 2000-04-10 Quantization in perceptual audio coders with compensation for synthesis filter noise spreading Expired - Lifetime JP4643019B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US09/289,865 1999-04-12
US09/289,865 US6363338B1 (en) 1999-04-12 1999-04-12 Quantization in perceptual audio coders with compensation for synthesis filter noise spreading
PCT/US2000/009557 WO2000062434A1 (en) 1999-04-12 2000-04-10 Quantization in perceptual audio coders with compensation for synthesis filter noise spreading

Publications (3)

Publication Number Publication Date
JP2002542648A (ja) 2002-12-10
JP2002542648A5 2007-06-07
JP4643019B2 (ja) 2011-03-02

Family

ID=23113455

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2000611392A Expired - Lifetime JP4643019B2 (ja) 1999-04-12 2000-04-10 Quantization in perceptual audio coders with compensation for synthesis filter noise spreading

Country Status (13)

Country Link
US (1) US6363338B1
EP (1) EP1177639B1
JP (1) JP4643019B2
KR (1) KR100758215B1
AR (1) AR024858A1
AT (1) ATE248463T1
AU (1) AU771869B2
CA (1) CA2366560C
DE (1) DE60004814T2
HK (1) HK1044235B
MY (1) MY120387A
TW (1) TW531986B
WO (1) WO2000062434A1

Families Citing this family (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19947877C2 (de) * 1999-10-05 2001-09-13 Fraunhofer Ges Forschung Method and device for introducing information into a data stream and method and device for encoding an audio signal
US7734448B2 (en) * 2000-01-10 2010-06-08 Canning Francis X Sparse and efficient block factorization for interaction data
US7720651B2 (en) * 2000-09-29 2010-05-18 Canning Francis X Compression of interaction data using directional sources and/or testers
TW499672B (en) * 2000-02-18 2002-08-21 Intervideo Inc Fast convergence method for bit allocation stage of MPEG audio layer 3 encoders
US7050924B2 (en) * 2000-06-12 2006-05-23 British Telecommunications Public Limited Company Test signalling
US7945430B2 (en) 2000-09-29 2011-05-17 Canning Francis X Compression and compressed inversion of interaction data
US7031955B1 (en) * 2001-04-27 2006-04-18 I2 Technologies Us, Inc. Optimization using a multi-dimensional data model
SE0202159D0 (sv) 2001-07-10 2002-07-09 Coding Technologies Sweden Ab Efficient and scalable parametric stereo coding for low bitrate applications
US6987889B1 (en) * 2001-08-10 2006-01-17 Polycom, Inc. System and method for dynamic perceptual coding of macroblocks in a video frame
US6732071B2 (en) * 2001-09-27 2004-05-04 Intel Corporation Method, apparatus, and system for efficient rate control in audio encoding
US7469206B2 (en) 2001-11-29 2008-12-23 Coding Technologies Ab Methods for improving high frequency reconstruction
US7447631B2 (en) * 2002-06-17 2008-11-04 Dolby Laboratories Licensing Corporation Audio coding system using spectral hole filling
US20040030555A1 (en) * 2002-08-12 2004-02-12 Oregon Health & Science University System and method for concatenating acoustic contours for speech synthesis
SE0202770D0 (sv) * 2002-09-18 2002-09-18 Coding Technologies Sweden Ab Method for reduction of aliasing introduces by spectral envelope adjustment in real-valued filterbanks
US7376553B2 (en) * 2003-07-08 2008-05-20 Robert Patel Quinn Fractal harmonic overtone mapping of speech and musical sounds
AU2003264322A1 (en) * 2003-09-17 2005-04-06 Beijing E-World Technology Co., Ltd. Method and device of multi-resolution vector quantilization for audio encoding and decoding
US7539614B2 (en) * 2003-11-14 2009-05-26 Nxp B.V. System and method for audio signal processing using different gain factors for voiced and unvoiced phonemes
KR20070001115A (ko) * 2004-01-28 2007-01-03 Koninklijke Philips Electronics N.V. Audio signal decoding using complex-valued data
DE102004009955B3 (de) * 2004-03-01 2005-08-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Device and method for determining a quantizer step size
US7512536B2 (en) * 2004-05-14 2009-03-31 Texas Instruments Incorporated Efficient filter bank computation for audio coding
US7720236B2 (en) * 2004-10-15 2010-05-18 Lifesize Communications, Inc. Updating modeling information based on offline calibration experiments
US7903137B2 (en) * 2004-10-15 2011-03-08 Lifesize Communications, Inc. Videoconferencing echo cancellers
US8116500B2 (en) * 2004-10-15 2012-02-14 Lifesize Communications, Inc. Microphone orientation and size in a speakerphone
US7720232B2 (en) * 2004-10-15 2010-05-18 Lifesize Communications, Inc. Speakerphone
US7970151B2 (en) * 2004-10-15 2011-06-28 Lifesize Communications, Inc. Hybrid beamforming
US20060132595A1 (en) * 2004-10-15 2006-06-22 Kenoyer Michael L Speakerphone supporting video and audio features
US7826624B2 (en) * 2004-10-15 2010-11-02 Lifesize Communications, Inc. Speakerphone self calibration and beam forming
US7760887B2 (en) * 2004-10-15 2010-07-20 Lifesize Communications, Inc. Updating modeling information based on online data gathering
US7751572B2 (en) * 2005-04-15 2010-07-06 Dolby International Ab Adaptive residual audio coding
US7970150B2 (en) * 2005-04-29 2011-06-28 Lifesize Communications, Inc. Tracking talkers using virtual broadside scan and directed beams
US7593539B2 (en) * 2005-04-29 2009-09-22 Lifesize Communications, Inc. Microphone and speaker arrangement in speakerphone
US7991167B2 (en) * 2005-04-29 2011-08-02 Lifesize Communications, Inc. Forming beams with nulls directed at noise sources
US7974713B2 (en) * 2005-10-12 2011-07-05 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Temporal and spatial shaping of multi-channel audio signals
US7835904B2 (en) * 2006-03-03 2010-11-16 Microsoft Corp. Perceptual, scalable audio compression
KR101393298B1 (ko) * 2006-07-08 2014-05-12 Samsung Electronics Co., Ltd. Adaptive encoding/decoding method and apparatus
JP5096468B2 (ja) * 2006-08-15 2012-12-12 Dolby Laboratories Licensing Corporation Arbitrary shaping of the temporal noise envelope without side information
FR2912249A1 (fr) * 2007-02-02 2008-08-08 France Telecom Improved coding/decoding of digital audio signals
EP2274833B1 (en) * 2008-04-16 2016-08-10 Huawei Technologies Co., Ltd. Vector quantisation method
US20100106269A1 (en) * 2008-09-26 2010-04-29 Qualcomm Incorporated Method and apparatus for signal processing using transform-domain log-companding
US11657788B2 (en) 2009-05-27 2023-05-23 Dolby International Ab Efficient combined harmonic transposition
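TWI591625B (zh) 2009-05-27 2017-07-11 Dolby International AB System and method for generating a high-frequency component of a signal from a low-frequency component of the signal, and set-top box, computer program product, software program and storage medium therefor
KR101599884B1 (ko) * 2009-08-18 2016-03-04 Samsung Electronics Co., Ltd. Multi-channel audio decoding method and apparatus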
EP3693963B1 (en) * 2009-10-15 2021-07-21 VoiceAge Corporation Simultaneous time-domain and frequency-domain noise shaping for tdac transforms
PL4542546T3 (pl) 2009-10-21 2025-12-08 Dolby International Ab Oversampling in a filter bank combined with a transposition module
US8958510B1 (en) * 2010-06-10 2015-02-17 Fredric J. Harris Selectable bandwidth filter
US20120029926A1 (en) * 2010-07-30 2012-02-02 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for dependent-mode coding of audio signals
US9208792B2 (en) 2010-08-17 2015-12-08 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for noise injection
US9225310B1 (en) * 2012-11-08 2015-12-29 iZotope, Inc. Audio limiter system and method
US10325584B2 (en) 2014-12-10 2019-06-18 Stmicroelectronics S.R.L. Active noise cancelling device and method of actively cancelling acoustic noise
EP3616197B1 (en) 2017-04-28 2025-06-18 DTS, Inc. Audio coder window sizes and time-frequency transformations
US10886943B2 (en) * 2019-03-18 2021-01-05 Samsung Electronics Co., Ltd Method and apparatus for variable rate compression with a conditional autoencoder

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4956871A (en) * 1988-09-30 1990-09-11 At&T Bell Laboratories Improving sub-band coding of speech at low bit rates by adding residual speech energy signals to sub-bands
US5222189A (en) * 1989-01-27 1993-06-22 Dolby Laboratories Licensing Corporation Low time-delay transform coder, decoder, and encoder/decoder for high-quality audio
JP2906646B2 (ja) * 1990-11-09 1999-06-21 Matsushita Electric Industrial Co., Ltd. Speech band-division coding apparatus
EP0559348A3 (en) * 1992-03-02 1993-11-03 AT&T Corp. Rate control loop processor for perceptual encoder/decoder
EP0709006B1 (en) * 1993-07-16 1997-03-05 Dolby Laboratories Licensing Corporation Computationally efficient adaptive bit allocation for encoding method and apparatus with allowance for decoder spectral distortions
US5623577A (en) * 1993-07-16 1997-04-22 Dolby Laboratories Licensing Corporation Computationally efficient adaptive bit allocation for encoding method and apparatus with allowance for decoder spectral distortions
EP0722225A3 (de) * 1994-11-17 2000-06-07 Deutsche Thomson-Brandt Gmbh Audio signal coding using short-time spectra and a psychoacoustic model
JP2820117B2 (ja) * 1996-05-29 1998-11-05 NEC Corporation Speech coding apparatus
US5913191A (en) * 1997-10-17 1999-06-15 Dolby Laboratories Licensing Corporation Frame-based audio coding with additional filterbank to suppress aliasing artifacts at frame boundaries

Also Published As

Publication number Publication date
AU771869B2 (en) 2004-04-01
EP1177639A1 (en) 2002-02-06
KR20010112423A (ko) 2001-12-20
DE60004814T2 (de) 2004-07-01
EP1177639B1 (en) 2003-08-27
MY120387A (en) 2005-10-31
ATE248463T1 (de) 2003-09-15
JP2002542648A (ja) 2002-12-10
CA2366560C (en) 2008-07-29
TW531986B (en) 2003-05-11
KR100758215B1 (ko) 2007-09-12
AR024858A1 (es) 2002-10-30
HK1044235A1 (en) 2002-10-11
DE60004814D1 (de) 2003-10-02
CA2366560A1 (en) 2000-10-19
AU4338200A (en) 2000-11-14
WO2000062434A1 (en) 2000-10-19
HK1044235B (en) 2003-12-24
US6363338B1 (en) 2002-03-26

Similar Documents

Publication Publication Date Title
JP4643019B2 (ja) Quantization in perceptual audio coders with compensation for synthesis filter noise spreading
US9305558B2 (en) Multi-channel audio encoding/decoding with parametric compression/decompression and weight factors
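KR100991448B1 (ko) Audio coding system using spectral hole filling
CN101425294B (zh) Sound encoding/decoding and transmitting/receiving apparatus and encoding method, communication terminal and base station
KR101343267B1 (ko) Method and apparatus for audio coding and decoding using frequency segmentation
JP3297051B2 (ja) Adaptive bit allocation encoding apparatus and method
JP3093179B2 (ja) Low time-delay transform encoder and decoder for high-quality audio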
US7752052B2 (en) Scalable coder and decoder performing amplitude flattening for error spectrum estimation
US6704705B1 (en) Perceptual audio coding
US20090198500A1 (en) Temporal masking in audio coding based on spectral dynamics in frequency sub-bands
JP4843142B2 (ja) Use of gain-adaptive quantization and non-uniform symbol lengths for audio coding
US5924060A (en) Digital coding process for transmission or storage of acoustical signals by transforming of scanning values into spectral coefficients
US9691398B2 (en) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
WO2009081315A1 (en) Encoding and decoding audio or speech
Spanias et al. Analysis of the MPEG-1 Layer III (MP3) Algorithm using MATLAB
Ordentlich Low delay-code excited linear predictive (LD-CELP) coding of wide band speech at 32kbits/sec
Bhaskar Low rate coding of audio by a predictive transform coder for efficient satellite transmission

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20070405

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20070405

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20100104

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20100119

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20100416

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20100525

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20100903

A911 Transfer to examiner for re-examination before appeal (zenchi)

Free format text: JAPANESE INTERMEDIATE CODE: A911

Effective date: 20101004

RD02 Notification of acceptance of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7422

Effective date: 20101018

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20101019

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20101020

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20101116

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20101202

R150 Certificate of patent or registration of utility model

Ref document number: 4643019

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20131210

Year of fee payment: 3

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

EXPY Cancellation because of completion of term