ATE341074T1 - MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER - Google Patents

MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER

Info

Publication number
ATE341074T1
ATE341074T1 AT00912053T AT00912053T ATE341074T1 AT E341074 T1 ATE341074 T1 AT E341074T1 AT 00912053 T AT00912053 T AT 00912053T AT 00912053 T AT00912053 T AT 00912053T AT E341074 T1 ATE341074 T1 AT E341074T1
Authority
AT
Austria
Prior art keywords
coding mode
encoded
speech
domain coding
speech frame
Prior art date
Application number
AT00912053T
Other languages
German (de)
Inventor
Amitava Das
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Application granted granted Critical
Publication of ATE341074T1 publication Critical patent/ATE341074T1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Physical Or Chemical Processes And Apparatus (AREA)

Abstract

A closed-loop, multimode, mixed-domain linear prediction (MDLP) speech coder includes a high-rate, time-domain coding mode, a low-rate, frequency-domain coding mode, and a closed-loop mode-selection mechanism for selecting a coding mode for the coder based upon the speech content of frames input to the coder. Transition speech (i.e., from unvoiced speech to voiced speech, or vice versa) frames are encoded with the high-rate, time-domain coding mode, which may be a CELP coding mode. Voiced speech frames are encoded with the low-rate, frequency-domain coding mode, which may be a harmonic coding mode. Phase parameters are not encoded by the frequency-domain coding mode, and are instead modeled in accordance with, e.g., a quadratic phase model. For each speech frame encoded with the frequency-domain coding mode, the initial phase value is taken to be the initial phase value of the immediately preceding speech frame encoded with the frequency-domain coding mode. If the immediately preceding speech frame was encoded with the time-domain coding mode, the initial phase value of the current speech frame is computed from the decoded speech frame information of the immediately preceding, time-domain-encoded speech frame. Each speech frame encoded with the frequency-domain coding mode may be compared with the corresponding input speech frame to obtain a performance measure. If the performance measure falls below a predefined threshold value, the input speech frame is encoded with the time-domain coding mode.
AT00912053T 2000-02-29 2000-02-29 MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER ATE341074T1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2000/005140 WO2001065544A1 (en) 2000-02-29 2000-02-29 Closed-loop multimode mixed-domain linear prediction speech coder

Publications (1)

Publication Number Publication Date
ATE341074T1 true ATE341074T1 (en) 2006-10-15

Family

ID=21741098

Family Applications (1)

Application Number Title Priority Date Filing Date
AT00912053T ATE341074T1 (en) 2000-02-29 2000-02-29 MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER

Country Status (10)

Country Link
EP (1) EP1259957B1 (en)
JP (1) JP4907826B2 (en)
KR (1) KR100711047B1 (en)
CN (1) CN1266674C (en)
AT (1) ATE341074T1 (en)
AU (1) AU2000233851A1 (en)
DE (1) DE60031002T2 (en)
ES (1) ES2269112T3 (en)
HK (1) HK1055833A1 (en)
WO (1) WO2001065544A1 (en)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6438518B1 (en) * 1999-10-28 2002-08-20 Qualcomm Incorporated Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions
CA2392640A1 (en) * 2002-07-05 2004-01-05 Voiceage Corporation A method and device for efficient in-based dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems
TWI463806B (en) * 2003-12-19 2014-12-01 Creative Tech Ltd Method and system to process a digital image
US7739120B2 (en) 2004-05-17 2010-06-15 Nokia Corporation Selection of coding models for encoding an audio signal
CN101283406B (en) * 2005-10-05 2013-06-19 Lg电子株式会社 Method and apparatus for signal processing and encoding and decoding method, and apparatus thereof
EP1946062A4 (en) * 2005-10-05 2009-09-09 Lg Electronics Inc Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
KR100647336B1 (en) * 2005-11-08 2006-11-23 삼성전자주식회사 Apparatus and method for adaptive time/frequency-based encoding/decoding
KR101390188B1 (en) * 2006-06-21 2014-04-30 삼성전자주식회사 Method and apparatus for encoding and decoding adaptive high frequency band
US8010352B2 (en) 2006-06-21 2011-08-30 Samsung Electronics Co., Ltd. Method and apparatus for adaptively encoding and decoding high frequency band
US9159333B2 (en) 2006-06-21 2015-10-13 Samsung Electronics Co., Ltd. Method and apparatus for adaptively encoding and decoding high frequency band
CN101145345B (en) * 2006-09-13 2011-02-09 华为技术有限公司 Audio frequency classification method
KR101131880B1 (en) * 2007-03-23 2012-04-03 삼성전자주식회사 Method and apparatus for encoding audio signal, and method and apparatus for decoding audio signal
KR101297120B1 (en) * 2007-04-26 2013-08-21 지멘스 악티엔게젤샤프트 Module with automatic extension of a monitoring circuit
KR101756834B1 (en) * 2008-07-14 2017-07-12 삼성전자주식회사 Method and apparatus for encoding and decoding of speech and audio signal
US8990094B2 (en) 2010-09-13 2015-03-24 Qualcomm Incorporated Coding and decoding a transient frame
JP5969513B2 (en) 2011-02-14 2016-08-17 フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン Audio codec using noise synthesis between inert phases
MY160265A (en) 2011-02-14 2017-02-28 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E V Apparatus and Method for Encoding and Decoding an Audio Signal Using an Aligned Look-Ahead Portion
AR085794A1 (en) 2011-02-14 2013-10-30 Fraunhofer Ges Forschung LINEAR PREDICTION BASED ON CODING SCHEME USING SPECTRAL DOMAIN NOISE CONFORMATION
KR101424372B1 (en) 2011-02-14 2014-08-01 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. Information signal representation using lapped transform
PL2676268T3 (en) 2011-02-14 2015-05-29 Fraunhofer Ges Forschung Apparatus and method for processing a decoded audio signal in a spectral domain
PT3239978T (en) 2011-02-14 2019-04-02 Fraunhofer Ges Forschung Encoding and decoding of pulse positions of tracks of an audio signal
TWI488176B (en) * 2011-02-14 2015-06-11 Fraunhofer Ges Forschung Encoding and decoding of pulse positions of tracks of an audio signal
BR112013020324B8 (en) 2011-02-14 2022-02-08 Fraunhofer Ges Forschung Apparatus and method for error suppression in low delay unified speech and audio coding
PT2676270T (en) 2011-02-14 2017-05-02 Fraunhofer Ges Forschung Coding a portion of an audio signal using a transient detection and a quality result
EP2757558A1 (en) * 2013-01-18 2014-07-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Time domain level adjustment for audio signal decoding or encoding
US9685166B2 (en) 2014-07-26 2017-06-20 Huawei Technologies Co., Ltd. Classification between time-domain coding and frequency domain coding
EP3067886A1 (en) 2015-03-09 2016-09-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal
US10957331B2 (en) * 2018-12-17 2021-03-23 Microsoft Technology Licensing, Llc Phase reconstruction in a speech decoder

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1986005617A1 (en) * 1985-03-18 1986-09-25 Massachusetts Institute Of Technology Processing of acoustic waveforms
US5023910A (en) * 1988-04-08 1991-06-11 At&T Bell Laboratories Vector quantization in a harmonic speech coding arrangement
JPH02288739A (en) * 1989-04-28 1990-11-28 Fujitsu Ltd Voice coding and decoding transmission system
JP3680374B2 (en) * 1995-09-28 2005-08-10 ソニー株式会社 Speech synthesis method
JPH10214100A (en) * 1997-01-31 1998-08-11 Sony Corp Voice synthesizing method
WO1999010719A1 (en) * 1997-08-29 1999-03-04 The Regents Of The University Of California Method and apparatus for hybrid coding of speech at 4kbps
ATE302991T1 (en) * 1998-01-22 2005-09-15 Deutsche Telekom Ag METHOD FOR SIGNAL-CONTROLLED SWITCHING BETWEEN DIFFERENT AUDIO CODING SYSTEMS
JPH11224099A (en) * 1998-02-06 1999-08-17 Sony Corp Device and method for phase quantization

Also Published As

Publication number Publication date
ES2269112T3 (en) 2007-04-01
EP1259957A1 (en) 2002-11-27
KR20020081374A (en) 2002-10-26
CN1266674C (en) 2006-07-26
EP1259957B1 (en) 2006-09-27
CN1437747A (en) 2003-08-20
DE60031002T2 (en) 2007-05-10
JP4907826B2 (en) 2012-04-04
HK1055833A1 (en) 2004-01-21
AU2000233851A1 (en) 2001-09-12
DE60031002D1 (en) 2006-11-09
JP2003525473A (en) 2003-08-26
WO2001065544A1 (en) 2001-09-07
KR100711047B1 (en) 2007-04-24

Similar Documents

Publication Publication Date Title
ATE341074T1 (en) MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER
ES2693229T3 (en) Coding of generic audio signals at low bit rates and low delay
KR101853352B1 (en) Apparatus and method for encoding and decoding an audio signal using an aligned look-ahead portion
CN106663441B (en) Improve the classification between time domain coding and Frequency Domain Coding
CN105359211B (en) The voiceless sound of speech processes/voiced sound decision method and device
TW519616B (en) Method and apparatus for predictively quantizing voiced speech
EP3301674B1 (en) Adaptive bandwidth extension and apparatus for the same
TW201011738A (en) Low bitrate audio encoding/decoding scheme having cascaded switches
BR122020025776B1 (en) AUDIO ENCODING/DECODING SCHEME WITH LOW BITS RATE WITH COMMON PRE-PROCESSING
MX2012010439A (en) Audio signal decoder, audio signal encoder, method for decoding an audio signal, method for encoding an audio signal and computer program using a pitch-dependent adaptation of a coding context.
DK1222659T3 (en) LPC harmonic speech codes with superframe structure
KR101261677B1 (en) Apparatus for encoding and decoding of integrated voice and music
WO2004084180A3 (en) Voicing index controls for celp speech coding
BRPI0914056B1 (en) MULTI-RESOLUTION SWITCHED AUDIO CODING / DECODING SCHEME
ATE368278T1 (en) COMPENSATION METHOD FOR FRAME EXTENSION IN A VARIABLE DATA RATE VOICE ENCODER
KR20110043592A (en) Audio encoder and decoder for encoding and decoding frames of a sampled audio signal
EP0770990A3 (en) Speech encoding method and apparatus and speech decoding method and apparatus
PT2676270T (en) Coding a portion of an audio signal using a transient detection and a quality result
KR101706123B1 (en) User-customizable voice revision method of converting voice by parameter modification and voice revision device implementing the same
US9418671B2 (en) Adaptive high-pass post-filter
Budagavi et al. Speech coding in mobile radio communications
Moriya et al. Harmonic model for MDCT based audio coding with LPC envelope
Wang Variable rate multi-mode excitation coding of speech at 2.4 kbps
Shikui et al. Speech transcoding from AMR to G. 729 in excitation domain
Ragot et al. Noise feedback coding revisited: refurbished legacy codecs and new coding models

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties