ATE341074T1 - MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER - Google Patents
MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODERInfo
- Publication number
- ATE341074T1 ATE341074T1 AT00912053T AT00912053T ATE341074T1 AT E341074 T1 ATE341074 T1 AT E341074T1 AT 00912053 T AT00912053 T AT 00912053T AT 00912053 T AT00912053 T AT 00912053T AT E341074 T1 ATE341074 T1 AT E341074T1
- Authority
- AT
- Austria
- Prior art keywords
- coding mode
- encoded
- speech
- domain coding
- speech frame
- Prior art date
Links
- 230000007704 transition Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Physical Or Chemical Processes And Apparatus (AREA)
Abstract
A closed-loop, multimode, mixed-domain linear prediction (MDLP) speech coder includes a high-rate, time-domain coding mode, a low-rate, frequency-domain coding mode, and a closed-loop mode-selection mechanism for selecting a coding mode for the coder based upon the speech content of frames input to the coder. Transition speech (i.e., from unvoiced speech to voiced speech, or vice versa) frames are encoded with the high-rate, time-domain coding mode, which may be a CELP coding mode. Voiced speech frames are encoded with the low-rate, frequency-domain coding mode, which may be a harmonic coding mode. Phase parameters are not encoded by the frequency-domain coding mode, and are instead modeled in accordance with, e.g., a quadratic phase model. For each speech frame encoded with the frequency-domain coding mode, the initial phase value is taken to be the initial phase value of the immediately preceding speech frame encoded with the frequency-domain coding mode. If the immediately preceding speech frame was encoded with the time-domain coding mode, the initial phase value of the current speech frame is computed from the decoded speech frame information of the immediately preceding, time-domain-encoded speech frame. Each speech frame encoded with the frequency-domain coding mode may be compared with the corresponding input speech frame to obtain a performance measure. If the performance measure falls below a predefined threshold value, the input speech frame is encoded with the time-domain coding mode.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2000/005140 WO2001065544A1 (en) | 2000-02-29 | 2000-02-29 | Closed-loop multimode mixed-domain linear prediction speech coder |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE341074T1 true ATE341074T1 (en) | 2006-10-15 |
Family
ID=21741098
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT00912053T ATE341074T1 (en) | 2000-02-29 | 2000-02-29 | MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER |
Country Status (10)
Country | Link |
---|---|
EP (1) | EP1259957B1 (en) |
JP (1) | JP4907826B2 (en) |
KR (1) | KR100711047B1 (en) |
CN (1) | CN1266674C (en) |
AT (1) | ATE341074T1 (en) |
AU (1) | AU2000233851A1 (en) |
DE (1) | DE60031002T2 (en) |
ES (1) | ES2269112T3 (en) |
HK (1) | HK1055833A1 (en) |
WO (1) | WO2001065544A1 (en) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6438518B1 (en) * | 1999-10-28 | 2002-08-20 | Qualcomm Incorporated | Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions |
CA2392640A1 (en) * | 2002-07-05 | 2004-01-05 | Voiceage Corporation | A method and device for efficient in-based dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems |
TWI463806B (en) * | 2003-12-19 | 2014-12-01 | Creative Tech Ltd | Method and system to process a digital image |
US7739120B2 (en) | 2004-05-17 | 2010-06-15 | Nokia Corporation | Selection of coding models for encoding an audio signal |
CN101283406B (en) * | 2005-10-05 | 2013-06-19 | Lg电子株式会社 | Method and apparatus for signal processing and encoding and decoding method, and apparatus thereof |
EP1946062A4 (en) * | 2005-10-05 | 2009-09-09 | Lg Electronics Inc | Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor |
KR100647336B1 (en) * | 2005-11-08 | 2006-11-23 | 삼성전자주식회사 | Apparatus and method for adaptive time/frequency-based encoding/decoding |
KR101390188B1 (en) * | 2006-06-21 | 2014-04-30 | 삼성전자주식회사 | Method and apparatus for encoding and decoding adaptive high frequency band |
US8010352B2 (en) | 2006-06-21 | 2011-08-30 | Samsung Electronics Co., Ltd. | Method and apparatus for adaptively encoding and decoding high frequency band |
US9159333B2 (en) | 2006-06-21 | 2015-10-13 | Samsung Electronics Co., Ltd. | Method and apparatus for adaptively encoding and decoding high frequency band |
CN101145345B (en) * | 2006-09-13 | 2011-02-09 | 华为技术有限公司 | Audio frequency classification method |
KR101131880B1 (en) * | 2007-03-23 | 2012-04-03 | 삼성전자주식회사 | Method and apparatus for encoding audio signal, and method and apparatus for decoding audio signal |
KR101297120B1 (en) * | 2007-04-26 | 2013-08-21 | 지멘스 악티엔게젤샤프트 | Module with automatic extension of a monitoring circuit |
KR101756834B1 (en) * | 2008-07-14 | 2017-07-12 | 삼성전자주식회사 | Method and apparatus for encoding and decoding of speech and audio signal |
US8990094B2 (en) | 2010-09-13 | 2015-03-24 | Qualcomm Incorporated | Coding and decoding a transient frame |
JP5969513B2 (en) | 2011-02-14 | 2016-08-17 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | Audio codec using noise synthesis between inert phases |
MY160265A (en) | 2011-02-14 | 2017-02-28 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E V | Apparatus and Method for Encoding and Decoding an Audio Signal Using an Aligned Look-Ahead Portion |
AR085794A1 (en) | 2011-02-14 | 2013-10-30 | Fraunhofer Ges Forschung | LINEAR PREDICTION BASED ON CODING SCHEME USING SPECTRAL DOMAIN NOISE CONFORMATION |
KR101424372B1 (en) | 2011-02-14 | 2014-08-01 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | Information signal representation using lapped transform |
PL2676268T3 (en) | 2011-02-14 | 2015-05-29 | Fraunhofer Ges Forschung | Apparatus and method for processing a decoded audio signal in a spectral domain |
PT3239978T (en) | 2011-02-14 | 2019-04-02 | Fraunhofer Ges Forschung | Encoding and decoding of pulse positions of tracks of an audio signal |
TWI488176B (en) * | 2011-02-14 | 2015-06-11 | Fraunhofer Ges Forschung | Encoding and decoding of pulse positions of tracks of an audio signal |
BR112013020324B8 (en) | 2011-02-14 | 2022-02-08 | Fraunhofer Ges Forschung | Apparatus and method for error suppression in low delay unified speech and audio coding |
PT2676270T (en) | 2011-02-14 | 2017-05-02 | Fraunhofer Ges Forschung | Coding a portion of an audio signal using a transient detection and a quality result |
EP2757558A1 (en) * | 2013-01-18 | 2014-07-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Time domain level adjustment for audio signal decoding or encoding |
US9685166B2 (en) | 2014-07-26 | 2017-06-20 | Huawei Technologies Co., Ltd. | Classification between time-domain coding and frequency domain coding |
EP3067886A1 (en) | 2015-03-09 | 2016-09-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
US10957331B2 (en) * | 2018-12-17 | 2021-03-23 | Microsoft Technology Licensing, Llc | Phase reconstruction in a speech decoder |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1986005617A1 (en) * | 1985-03-18 | 1986-09-25 | Massachusetts Institute Of Technology | Processing of acoustic waveforms |
US5023910A (en) * | 1988-04-08 | 1991-06-11 | At&T Bell Laboratories | Vector quantization in a harmonic speech coding arrangement |
JPH02288739A (en) * | 1989-04-28 | 1990-11-28 | Fujitsu Ltd | Voice coding and decoding transmission system |
JP3680374B2 (en) * | 1995-09-28 | 2005-08-10 | ソニー株式会社 | Speech synthesis method |
JPH10214100A (en) * | 1997-01-31 | 1998-08-11 | Sony Corp | Voice synthesizing method |
WO1999010719A1 (en) * | 1997-08-29 | 1999-03-04 | The Regents Of The University Of California | Method and apparatus for hybrid coding of speech at 4kbps |
ATE302991T1 (en) * | 1998-01-22 | 2005-09-15 | Deutsche Telekom Ag | METHOD FOR SIGNAL-CONTROLLED SWITCHING BETWEEN DIFFERENT AUDIO CODING SYSTEMS |
JPH11224099A (en) * | 1998-02-06 | 1999-08-17 | Sony Corp | Device and method for phase quantization |
-
2000
- 2000-02-29 ES ES00912053T patent/ES2269112T3/en not_active Expired - Lifetime
- 2000-02-29 AT AT00912053T patent/ATE341074T1/en not_active IP Right Cessation
- 2000-02-29 EP EP00912053A patent/EP1259957B1/en not_active Expired - Lifetime
- 2000-02-29 AU AU2000233851A patent/AU2000233851A1/en not_active Abandoned
- 2000-02-29 JP JP2001564148A patent/JP4907826B2/en not_active Expired - Lifetime
- 2000-02-29 DE DE60031002T patent/DE60031002T2/en not_active Expired - Lifetime
- 2000-02-29 WO PCT/US2000/005140 patent/WO2001065544A1/en active IP Right Grant
- 2000-02-29 CN CNB008192219A patent/CN1266674C/en not_active Expired - Lifetime
- 2000-02-29 KR KR1020027011306A patent/KR100711047B1/en active IP Right Grant
-
2003
- 2003-11-07 HK HK03108074A patent/HK1055833A1/en not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
ES2269112T3 (en) | 2007-04-01 |
EP1259957A1 (en) | 2002-11-27 |
KR20020081374A (en) | 2002-10-26 |
CN1266674C (en) | 2006-07-26 |
EP1259957B1 (en) | 2006-09-27 |
CN1437747A (en) | 2003-08-20 |
DE60031002T2 (en) | 2007-05-10 |
JP4907826B2 (en) | 2012-04-04 |
HK1055833A1 (en) | 2004-01-21 |
AU2000233851A1 (en) | 2001-09-12 |
DE60031002D1 (en) | 2006-11-09 |
JP2003525473A (en) | 2003-08-26 |
WO2001065544A1 (en) | 2001-09-07 |
KR100711047B1 (en) | 2007-04-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE341074T1 (en) | MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER | |
ES2693229T3 (en) | Coding of generic audio signals at low bit rates and low delay | |
KR101853352B1 (en) | Apparatus and method for encoding and decoding an audio signal using an aligned look-ahead portion | |
CN106663441B (en) | Improve the classification between time domain coding and Frequency Domain Coding | |
CN105359211B (en) | The voiceless sound of speech processes/voiced sound decision method and device | |
TW519616B (en) | Method and apparatus for predictively quantizing voiced speech | |
EP3301674B1 (en) | Adaptive bandwidth extension and apparatus for the same | |
TW201011738A (en) | Low bitrate audio encoding/decoding scheme having cascaded switches | |
BR122020025776B1 (en) | AUDIO ENCODING/DECODING SCHEME WITH LOW BITS RATE WITH COMMON PRE-PROCESSING | |
MX2012010439A (en) | Audio signal decoder, audio signal encoder, method for decoding an audio signal, method for encoding an audio signal and computer program using a pitch-dependent adaptation of a coding context. | |
DK1222659T3 (en) | LPC harmonic speech codes with superframe structure | |
KR101261677B1 (en) | Apparatus for encoding and decoding of integrated voice and music | |
WO2004084180A3 (en) | Voicing index controls for celp speech coding | |
BRPI0914056B1 (en) | MULTI-RESOLUTION SWITCHED AUDIO CODING / DECODING SCHEME | |
ATE368278T1 (en) | COMPENSATION METHOD FOR FRAME EXTENSION IN A VARIABLE DATA RATE VOICE ENCODER | |
KR20110043592A (en) | Audio encoder and decoder for encoding and decoding frames of a sampled audio signal | |
EP0770990A3 (en) | Speech encoding method and apparatus and speech decoding method and apparatus | |
PT2676270T (en) | Coding a portion of an audio signal using a transient detection and a quality result | |
KR101706123B1 (en) | User-customizable voice revision method of converting voice by parameter modification and voice revision device implementing the same | |
US9418671B2 (en) | Adaptive high-pass post-filter | |
Budagavi et al. | Speech coding in mobile radio communications | |
Moriya et al. | Harmonic model for MDCT based audio coding with LPC envelope | |
Wang | Variable rate multi-mode excitation coding of speech at 2.4 kbps | |
Shikui et al. | Speech transcoding from AMR to G. 729 in excitation domain | |
Ragot et al. | Noise feedback coding revisited: refurbished legacy codecs and new coding models |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |