ATE341074T1 - MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER
- Publication number
- ATE341074T1
- Authority
- AT
- Austria
- Prior art keywords
- coding mode
- encoded
- speech
- domain coding
- speech frame
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
Abstract
A closed-loop, multimode, mixed-domain linear prediction (MDLP) speech coder includes a high-rate, time-domain coding mode, a low-rate, frequency-domain coding mode, and a closed-loop mode-selection mechanism for selecting a coding mode for the coder based upon the speech content of frames input to the coder. Transition speech (i.e., from unvoiced speech to voiced speech, or vice versa) frames are encoded with the high-rate, time-domain coding mode, which may be a CELP coding mode. Voiced speech frames are encoded with the low-rate, frequency-domain coding mode, which may be a harmonic coding mode. Phase parameters are not encoded by the frequency-domain coding mode, and are instead modeled in accordance with, e.g., a quadratic phase model. For each speech frame encoded with the frequency-domain coding mode, the initial phase value is taken to be the initial phase value of the immediately preceding speech frame encoded with the frequency-domain coding mode. If the immediately preceding speech frame was encoded with the time-domain coding mode, the initial phase value of the current speech frame is computed from the decoded speech frame information of the immediately preceding, time-domain-encoded speech frame. Each speech frame encoded with the frequency-domain coding mode may be compared with the corresponding input speech frame to obtain a performance measure. If the performance measure falls below a predefined threshold value, the input speech frame is encoded with the time-domain coding mode.
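The closed-loop decision and the modeled phase described in the abstract can be sketched as follows. This is a minimal illustration and not the patented implementation: the function names, the use of SNR as the performance measure, the 10 dB threshold, and the exact quadratic form (instantaneous frequency moving linearly across the frame) are all assumptions made for the example.

```python
import numpy as np


def quadratic_phase(phi0, w_start, w_end, n):
    """Phase track over n samples (assumed model).

    The instantaneous frequency moves linearly from w_start to w_end
    (radians per sample), which makes the phase quadratic in time.
    """
    t = np.arange(n)
    return phi0 + w_start * t + (w_end - w_start) * t ** 2 / (2.0 * n)


def end_phase(phi0, w_start, w_end, n):
    """Phase reached at sample n, reused as the next frame's initial phase."""
    return (phi0 + w_start * n + (w_end - w_start) * n / 2.0) % (2.0 * np.pi)


def snr_db(reference, decoded):
    """Hypothetical performance measure: signal-to-noise ratio in dB."""
    noise = reference - decoded
    return 10.0 * np.log10(np.sum(reference ** 2) / (np.sum(noise ** 2) + 1e-12))


def select_mode(input_frame, fd_decoded, threshold_db=10.0):
    """Closed-loop check: keep the frequency-domain result only if it
    matches the input well enough, otherwise fall back to time-domain."""
    if snr_db(input_frame, fd_decoded) >= threshold_db:
        return "frequency-domain"
    return "time-domain"


if __name__ == "__main__":
    n = 160                          # 20 ms at 8 kHz (assumed frame size)
    w = 2.0 * np.pi * 100.0 / 8000.0  # 100 Hz pitch (assumed)
    voiced = np.cos(quadratic_phase(0.3, w, w, n))
    print(select_mode(voiced, voiced.copy()))
    noisy = np.random.default_rng(0).standard_normal(n)
    print(select_mode(noisy, voiced))
```

Because the phase is regenerated from the previous frame's end value rather than transmitted, the frequency-domain mode spends no bits on phase; the closed-loop comparison guards against frames where that modeled phase drifts too far from the input.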
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2000/005140 WO2001065544A1 (en) | 2000-02-29 | 2000-02-29 | Closed-loop multimode mixed-domain linear prediction speech coder |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE341074T1 (en) | 2006-10-15 |
Family
ID=21741098
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AT00912053T ATE341074T1 (en) | 2000-02-29 | 2000-02-29 | MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER |
Country Status (10)
Country | Link |
---|---|
EP (1) | EP1259957B1 (en) |
JP (1) | JP4907826B2 (en) |
KR (1) | KR100711047B1 (en) |
CN (1) | CN1266674C (en) |
AT (1) | ATE341074T1 (en) |
AU (1) | AU2000233851A1 (en) |
DE (1) | DE60031002T2 (en) |
ES (1) | ES2269112T3 (en) |
HK (1) | HK1055833A1 (en) |
WO (1) | WO2001065544A1 (en) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6438518B1 (en) * | 1999-10-28 | 2002-08-20 | Qualcomm Incorporated | Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions |
CA2392640A1 (en) * | 2002-07-05 | 2004-01-05 | Voiceage Corporation | A method and device for efficient in-band dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for CDMA wireless systems |
JP4568732B2 (en) * | 2003-12-19 | 2010-10-27 | クリエイティブ テクノロジー リミテッド | Method and system for processing digital images |
US7739120B2 (en) | 2004-05-17 | 2010-06-15 | Nokia Corporation | Selection of coding models for encoding an audio signal |
WO2007040365A1 (en) * | 2005-10-05 | 2007-04-12 | Lg Electronics Inc. | Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor |
CN101283406B (en) * | 2005-10-05 | 2013-06-19 | Lg电子株式会社 | Method and apparatus for signal processing and encoding and decoding method, and apparatus thereof |
KR100647336B1 (en) * | 2005-11-08 | 2006-11-23 | 삼성전자주식회사 | Apparatus and method for adaptive time/frequency-based encoding/decoding |
US8010352B2 (en) | 2006-06-21 | 2011-08-30 | Samsung Electronics Co., Ltd. | Method and apparatus for adaptively encoding and decoding high frequency band |
KR101390188B1 (en) * | 2006-06-21 | 2014-04-30 | 삼성전자주식회사 | Method and apparatus for encoding and decoding adaptive high frequency band |
US9159333B2 (en) | 2006-06-21 | 2015-10-13 | Samsung Electronics Co., Ltd. | Method and apparatus for adaptively encoding and decoding high frequency band |
CN101145345B (en) * | 2006-09-13 | 2011-02-09 | 华为技术有限公司 | Audio frequency classification method |
KR101131880B1 (en) * | 2007-03-23 | 2012-04-03 | 삼성전자주식회사 | Method and apparatus for encoding audio signal, and method and apparatus for decoding audio signal |
DE112007003567A5 (en) * | 2007-04-26 | 2010-04-08 | Siemens Aktiengesellschaft | Module with automatic extension of a monitoring circuit |
KR101756834B1 (en) | 2008-07-14 | 2017-07-12 | 삼성전자주식회사 | Method and apparatus for encoding and decoding of speech and audio signal |
US8990094B2 (en) * | 2010-09-13 | 2015-03-24 | Qualcomm Incorporated | Coding and decoding a transient frame |
KR101525185B1 (en) | 2011-02-14 | 2015-06-02 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | Apparatus and method for coding a portion of an audio signal using a transient detection and a quality result |
BR112012029132B1 (en) | 2011-02-14 | 2021-10-05 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Information signal representation using lapped transform |
PL3239978T3 (en) | 2011-02-14 | 2019-07-31 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Encoding and decoding of pulse positions of tracks of an audio signal |
JP5849106B2 (en) | 2011-02-14 | 2016-01-27 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | Apparatus and method for error concealment in low delay integrated speech and audio coding |
CA2827249C (en) | 2011-02-14 | 2016-08-23 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for processing a decoded audio signal in a spectral domain |
JP5625126B2 (en) | 2011-02-14 | 2014-11-12 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | Linear prediction based coding scheme using spectral domain noise shaping |
TWI488176B (en) * | 2011-02-14 | 2015-06-11 | Fraunhofer Ges Forschung | Encoding and decoding of pulse positions of tracks of an audio signal |
CN103503062B (en) | 2011-02-14 | 2016-08-10 | 弗劳恩霍夫应用研究促进协会 | Apparatus and method for encoding and decoding an audio signal using an aligned look-ahead portion |
CN103534754B (en) | 2011-02-14 | 2015-09-30 | 弗兰霍菲尔运输应用研究公司 | Audio codec using noise synthesis during inactive phases |
EP2757558A1 (en) * | 2013-01-18 | 2014-07-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Time domain level adjustment for audio signal decoding or encoding |
US9685166B2 (en) | 2014-07-26 | 2017-06-20 | Huawei Technologies Co., Ltd. | Classification between time-domain coding and frequency domain coding |
EP3067886A1 (en) | 2015-03-09 | 2016-09-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
US10957331B2 (en) * | 2018-12-17 | 2021-03-23 | Microsoft Technology Licensing, Llc | Phase reconstruction in a speech decoder |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0215915A4 (en) * | 1985-03-18 | 1987-11-25 | Massachusetts Inst Technology | Processing of acoustic waveforms. |
US5023910A (en) * | 1988-04-08 | 1991-06-11 | At&T Bell Laboratories | Vector quantization in a harmonic speech coding arrangement |
JPH02288739A (en) * | 1989-04-28 | 1990-11-28 | Fujitsu Ltd | Voice coding and decoding transmission system |
JP3680374B2 (en) * | 1995-09-28 | 2005-08-10 | ソニー株式会社 | Speech synthesis method |
JPH10214100A (en) * | 1997-01-31 | 1998-08-11 | Sony Corp | Voice synthesizing method |
US6233550B1 (en) * | 1997-08-29 | 2001-05-15 | The Regents Of The University Of California | Method and apparatus for hybrid coding of speech at 4kbps |
ATE302991T1 (en) * | 1998-01-22 | 2005-09-15 | Deutsche Telekom Ag | METHOD FOR SIGNAL-CONTROLLED SWITCHING BETWEEN DIFFERENT AUDIO CODING SYSTEMS |
JPH11224099A (en) * | 1998-02-06 | 1999-08-17 | Sony Corp | Device and method for phase quantization |
2000
- 2000-02-29 CN CNB008192219A patent/CN1266674C/en not_active Expired - Lifetime
- 2000-02-29 WO PCT/US2000/005140 patent/WO2001065544A1/en active IP Right Grant
- 2000-02-29 KR KR1020027011306A patent/KR100711047B1/en active IP Right Grant
- 2000-02-29 ES ES00912053T patent/ES2269112T3/en not_active Expired - Lifetime
- 2000-02-29 AU AU2000233851A patent/AU2000233851A1/en not_active Abandoned
- 2000-02-29 EP EP00912053A patent/EP1259957B1/en not_active Expired - Lifetime
- 2000-02-29 AT AT00912053T patent/ATE341074T1/en not_active IP Right Cessation
- 2000-02-29 DE DE60031002T patent/DE60031002T2/en not_active Expired - Lifetime
- 2000-02-29 JP JP2001564148A patent/JP4907826B2/en not_active Expired - Lifetime
2003
- 2003-11-07 HK HK03108074A patent/HK1055833A1/en not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
KR100711047B1 (en) | 2007-04-24 |
EP1259957B1 (en) | 2006-09-27 |
WO2001065544A1 (en) | 2001-09-07 |
AU2000233851A1 (en) | 2001-09-12 |
JP2003525473A (en) | 2003-08-26 |
EP1259957A1 (en) | 2002-11-27 |
CN1437747A (en) | 2003-08-20 |
DE60031002D1 (en) | 2006-11-09 |
JP4907826B2 (en) | 2012-04-04 |
DE60031002T2 (en) | 2007-05-10 |
CN1266674C (en) | 2006-07-26 |
ES2269112T3 (en) | 2007-04-01 |
HK1055833A1 (en) | 2004-01-21 |
KR20020081374A (en) | 2002-10-26 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
ATE341074T1 (en) | MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER | |
US20230402045A1 (en) | Low bitrate audio encoding/decoding scheme having cascaded switches | |
ES2693229T3 (en) | Coding of generic audio signals at low bit rates and low delay | |
CN106663441B (en) | Improving classification between time-domain coding and frequency-domain coding | |
KR101853352B1 (en) | Apparatus and method for encoding and decoding an audio signal using an aligned look-ahead portion | |
EP3301674B1 (en) | Adaptive bandwidth extension and apparatus for the same | |
CN105359211B (en) | Unvoiced/voiced decision method and device for speech processing | |
BR122020025776B1 (en) | LOW BITRATE AUDIO ENCODING/DECODING SCHEME WITH COMMON PREPROCESSING | |
KR101525185B1 (en) | Apparatus and method for coding a portion of an audio signal using a transient detection and a quality result | |
KR101261677B1 (en) | Apparatus for encoding and decoding of integrated voice and music | |
DE60024123D1 (en) | LPC HARMONIC SPEECH CODER WITH SUPERFRAME FORMAT | |
ATE368278T1 (en) | FRAME ERASURE COMPENSATION METHOD IN A VARIABLE RATE SPEECH CODER | |
KR20110043592A (en) | Audio encoder and decoder for encoding and decoding frames of a sampled audio signal | |
EP0770990A3 (en) | Speech encoding method and apparatus and speech decoding method and apparatus | |
DE602004003610D1 (en) | Half-rate vocoder | |
KR101706123B1 (en) | User-customizable voice revision method of converting voice by parameter modification and voice revision device implementing the same | |
CN106575505A (en) | Frame loss management in an fd/lpd transition context | |
US9418671B2 (en) | Adaptive high-pass post-filter | |
Budagavi et al. | Speech coding in mobile radio communications | |
Hagen et al. | An 8 kbit/s ACELP coder with improved background noise performance | |
Wang | Variable rate multi-mode excitation coding of speech at 2.4 kbps | |
Shikui et al. | Speech transcoding from AMR to G. 729 in excitation domain | |
Ragot et al. | Noise feedback coding revisited: refurbished legacy codecs and new coding models | |
Song et al. | Research on Open Source Encoding Technology for MPEG Unified Speech and Audio Coding | |
JPH07135490A (en) | Voice detector and vocoder having voice detector |
Legal Events
Code | Title |
---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |