ATE341074T1 - MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER - Google Patents

MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER

Info

Publication number
ATE341074T1
ATE341074T1 AT00912053T AT00912053T ATE341074T1 AT E341074 T1 ATE341074 T1 AT E341074T1 AT 00912053 T AT00912053 T AT 00912053T AT 00912053 T AT00912053 T AT 00912053T AT E341074 T1 ATE341074 T1 AT E341074T1
Authority
AT
Austria
Prior art keywords
coding mode
encoded
speech
domain coding
speech frame
Prior art date
Application number
AT00912053T
Other languages
German (de)
Inventor
Amitava Das
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Application granted granted Critical
Publication of ATE341074T1 publication Critical patent/ATE341074T1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

Abstract

A closed-loop, multimode, mixed-domain linear prediction (MDLP) speech coder includes a high-rate, time-domain coding mode, a low-rate, frequency-domain coding mode, and a closed-loop mode-selection mechanism for selecting a coding mode for the coder based upon the speech content of frames input to the coder. Transition speech (i.e., from unvoiced speech to voiced speech, or vice versa) frames are encoded with the high-rate, time-domain coding mode, which may be a CELP coding mode. Voiced speech frames are encoded with the low-rate, frequency-domain coding mode, which may be a harmonic coding mode. Phase parameters are not encoded by the frequency-domain coding mode, and are instead modeled in accordance with, e.g., a quadratic phase model. For each speech frame encoded with the frequency-domain coding mode, the initial phase value is taken to be the initial phase value of the immediately preceding speech frame encoded with the frequency-domain coding mode. If the immediately preceding speech frame was encoded with the time-domain coding mode, the initial phase value of the current speech frame is computed from the decoded speech frame information of the immediately preceding, time-domain-encoded speech frame. Each speech frame encoded with the frequency-domain coding mode may be compared with the corresponding input speech frame to obtain a performance measure. If the performance measure falls below a predefined threshold value, the input speech frame is encoded with the time-domain coding mode.
AT00912053T 2000-02-29 2000-02-29 MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER ATE341074T1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2000/005140 WO2001065544A1 (en) 2000-02-29 2000-02-29 Closed-loop multimode mixed-domain linear prediction speech coder

Publications (1)

Publication Number Publication Date
ATE341074T1 true ATE341074T1 (en) 2006-10-15

Family

ID=21741098

Family Applications (1)

Application Number Title Priority Date Filing Date
AT00912053T ATE341074T1 (en) 2000-02-29 2000-02-29 MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER

Country Status (10)

Country Link
EP (1) EP1259957B1 (en)
JP (1) JP4907826B2 (en)
KR (1) KR100711047B1 (en)
CN (1) CN1266674C (en)
AT (1) ATE341074T1 (en)
AU (1) AU2000233851A1 (en)
DE (1) DE60031002T2 (en)
ES (1) ES2269112T3 (en)
HK (1) HK1055833A1 (en)
WO (1) WO2001065544A1 (en)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6438518B1 (en) * 1999-10-28 2002-08-20 Qualcomm Incorporated Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions
CA2392640A1 (en) * 2002-07-05 2004-01-05 Voiceage Corporation A method and device for efficient in-based dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems
JP4568732B2 (en) * 2003-12-19 2010-10-27 クリエイティブ テクノロジー リミテッド Method and system for processing digital images
US7739120B2 (en) 2004-05-17 2010-06-15 Nokia Corporation Selection of coding models for encoding an audio signal
WO2007040365A1 (en) * 2005-10-05 2007-04-12 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
CN101283406B (en) * 2005-10-05 2013-06-19 Lg电子株式会社 Method and apparatus for signal processing and encoding and decoding method, and apparatus thereof
KR100647336B1 (en) * 2005-11-08 2006-11-23 삼성전자주식회사 Apparatus and method for adaptive time/frequency-based encoding/decoding
US8010352B2 (en) 2006-06-21 2011-08-30 Samsung Electronics Co., Ltd. Method and apparatus for adaptively encoding and decoding high frequency band
KR101390188B1 (en) * 2006-06-21 2014-04-30 삼성전자주식회사 Method and apparatus for encoding and decoding adaptive high frequency band
US9159333B2 (en) 2006-06-21 2015-10-13 Samsung Electronics Co., Ltd. Method and apparatus for adaptively encoding and decoding high frequency band
CN101145345B (en) * 2006-09-13 2011-02-09 华为技术有限公司 Audio frequency classification method
KR101131880B1 (en) * 2007-03-23 2012-04-03 삼성전자주식회사 Method and apparatus for encoding audio signal, and method and apparatus for decoding audio signal
DE112007003567A5 (en) * 2007-04-26 2010-04-08 Siemens Aktiengesellschaft Module with automatic extension of a monitoring circuit
KR101756834B1 (en) 2008-07-14 2017-07-12 삼성전자주식회사 Method and apparatus for encoding and decoding of speech and audio signal
US8990094B2 (en) * 2010-09-13 2015-03-24 Qualcomm Incorporated Coding and decoding a transient frame
KR101525185B1 (en) 2011-02-14 2015-06-02 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. Apparatus and method for coding a portion of an audio signal using a transient detection and a quality result
BR112012029132B1 (en) 2011-02-14 2021-10-05 Fraunhofer - Gesellschaft Zur Förderung Der Angewandten Forschung E.V REPRESENTATION OF INFORMATION SIGNAL USING OVERLAY TRANSFORMED
PL3239978T3 (en) 2011-02-14 2019-07-31 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoding and decoding of pulse positions of tracks of an audio signal
JP5849106B2 (en) 2011-02-14 2016-01-27 フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン Apparatus and method for error concealment in low delay integrated speech and audio coding
CA2827249C (en) 2011-02-14 2016-08-23 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for processing a decoded audio signal in a spectral domain
JP5625126B2 (en) 2011-02-14 2014-11-12 フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン Linear prediction based coding scheme using spectral domain noise shaping
TWI488176B (en) * 2011-02-14 2015-06-11 Fraunhofer Ges Forschung Encoding and decoding of pulse positions of tracks of an audio signal
CN103503062B (en) 2011-02-14 2016-08-10 弗劳恩霍夫应用研究促进协会 For using the prediction part of alignment by audio-frequency signal coding and the apparatus and method of decoding
CN103534754B (en) 2011-02-14 2015-09-30 弗兰霍菲尔运输应用研究公司 The audio codec utilizing noise to synthesize during the inertia stage
EP2757558A1 (en) * 2013-01-18 2014-07-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Time domain level adjustment for audio signal decoding or encoding
US9685166B2 (en) 2014-07-26 2017-06-20 Huawei Technologies Co., Ltd. Classification between time-domain coding and frequency domain coding
EP3067886A1 (en) 2015-03-09 2016-09-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal
US10957331B2 (en) * 2018-12-17 2021-03-23 Microsoft Technology Licensing, Llc Phase reconstruction in a speech decoder

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0215915A4 (en) * 1985-03-18 1987-11-25 Massachusetts Inst Technology Processing of acoustic waveforms.
US5023910A (en) * 1988-04-08 1991-06-11 At&T Bell Laboratories Vector quantization in a harmonic speech coding arrangement
JPH02288739A (en) * 1989-04-28 1990-11-28 Fujitsu Ltd Voice coding and decoding transmission system
JP3680374B2 (en) * 1995-09-28 2005-08-10 ソニー株式会社 Speech synthesis method
JPH10214100A (en) * 1997-01-31 1998-08-11 Sony Corp Voice synthesizing method
US6233550B1 (en) * 1997-08-29 2001-05-15 The Regents Of The University Of California Method and apparatus for hybrid coding of speech at 4kbps
ATE302991T1 (en) * 1998-01-22 2005-09-15 Deutsche Telekom Ag METHOD FOR SIGNAL-CONTROLLED SWITCHING BETWEEN DIFFERENT AUDIO CODING SYSTEMS
JPH11224099A (en) * 1998-02-06 1999-08-17 Sony Corp Device and method for phase quantization

Also Published As

Publication number Publication date
KR100711047B1 (en) 2007-04-24
EP1259957B1 (en) 2006-09-27
WO2001065544A1 (en) 2001-09-07
AU2000233851A1 (en) 2001-09-12
JP2003525473A (en) 2003-08-26
EP1259957A1 (en) 2002-11-27
CN1437747A (en) 2003-08-20
DE60031002D1 (en) 2006-11-09
JP4907826B2 (en) 2012-04-04
DE60031002T2 (en) 2007-05-10
CN1266674C (en) 2006-07-26
ES2269112T3 (en) 2007-04-01
HK1055833A1 (en) 2004-01-21
KR20020081374A (en) 2002-10-26

Similar Documents

Publication Publication Date Title
ATE341074T1 (en) MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER
US20230402045A1 (en) Low bitrate audio encoding/decoding scheme having cascaded switches
ES2693229T3 (en) Coding of generic audio signals at low bit rates and low delay
CN106663441B (en) Improve the classification between time domain coding and Frequency Domain Coding
KR101853352B1 (en) Apparatus and method for encoding and decoding an audio signal using an aligned look-ahead portion
EP3301674B1 (en) Adaptive bandwidth extension and apparatus for the same
CN105359211B (en) The voiceless sound of speech processes/voiced sound decision method and device
BR122020025776B1 (en) AUDIO ENCODING/DECODING SCHEME WITH LOW BITS RATE WITH COMMON PRE-PROCESSING
KR101525185B1 (en) Apparatus and method for coding a portion of an audio signal using a transient detection and a quality result
KR101261677B1 (en) Apparatus for encoding and decoding of integrated voice and music
DE60024123D1 (en) LPC HARMONIOUS LANGUAGE CODIER WITH OVERRIDE FORMAT
ATE368278T1 (en) COMPENSATION METHOD FOR FRAME EXTENSION IN A VARIABLE DATA RATE VOICE ENCODER
KR20110043592A (en) Audio encoder and decoder for encoding and decoding frames of a sampled audio signal
EP0770990A3 (en) Speech encoding method and apparatus and speech decoding method and apparatus
DE602004003610D1 (en) Half-breed vocoder
KR101706123B1 (en) User-customizable voice revision method of converting voice by parameter modification and voice revision device implementing the same
CN106575505A (en) Frame loss management in an fd/lpd transition context
US9418671B2 (en) Adaptive high-pass post-filter
Budagavi et al. Speech coding in mobile radio communications
Hagen et al. An 8 kbit/s ACELP coder with improved background noise performance
Wang Variable rate multi-mode excitation coding of speech at 2.4 kbps
Shikui et al. Speech transcoding from AMR to G. 729 in excitation domain
Ragot et al. Noise feedback coding revisited: refurbished legacy codecs and new coding models
Song et al. Research on Open Source Encoding Technology for MPEG Unified Speech and Audio Coding
JPH07135490A (en) Voice detector and vocoder having voice detector

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties