HK1055833A1 - Closed-loop multimode mixed-domain linear prediction speech coder and method of processing frames - Google Patents
Closed-loop multimode mixed-domain linear prediction speech coder and method of processing framesInfo
- Publication number
- HK1055833A1 HK1055833A1 HK03108074A HK03108074A HK1055833A1 HK 1055833 A1 HK1055833 A1 HK 1055833A1 HK 03108074 A HK03108074 A HK 03108074A HK 03108074 A HK03108074 A HK 03108074A HK 1055833 A1 HK1055833 A1 HK 1055833A1
- Authority
- HK
- Hong Kong
- Prior art keywords
- coding mode
- encoded
- speech
- domain coding
- speech frame
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
Abstract
A closed-loop, multimode, mixed-domain linear prediction (MDLP) speech coder includes a high-rate, time-domain coding mode, a low-rate, frequency-domain coding mode, and a closed-loop mode-selection mechanism for selecting a coding mode for the coder based upon the speech content of frames input to the coder. Transition speech (i.e., from unvoiced speech to voiced speech, or vice versa) frames are encoded with the high-rate, time-domain coding mode, which may be a CELP coding mode. Voiced speech frames are encoded with the low-rate, frequency-domain coding mode, which may be a harmonic coding mode. Phase parameters are not encoded by the frequency-domain coding mode, and are instead modeled in accordance with, e.g., a quadratic phase model. For each speech frame encoded with the frequency-domain coding mode, the initial phase value is taken to be the initial phase value of the immediately preceding speech frame encoded with the frequency-domain coding mode. If the immediately preceding speech frame was encoded with the time-domain coding mode, the initial phase value of the current speech frame is computed from the decoded speech frame information of the immediately preceding, time-domain-encoded speech frame. Each speech frame encoded with the frequency-domain coding mode may be compared with the corresponding input speech frame to obtain a performance measure. If the performance measure falls below a predefined threshold value, the input speech frame is encoded with the time-domain coding mode.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2000/005140 WO2001065544A1 (en) | 2000-02-29 | 2000-02-29 | Closed-loop multimode mixed-domain linear prediction speech coder |
Publications (1)
Publication Number | Publication Date |
---|---|
HK1055833A1 true HK1055833A1 (en) | 2004-01-21 |
Family
ID=21741098
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
HK03108074A HK1055833A1 (en) | 2000-02-29 | 2003-11-07 | Closed-loop multimode mixed-domain linear prediction speech coder and method of processing frames |
Country Status (10)
Country | Link |
---|---|
EP (1) | EP1259957B1 (en) |
JP (1) | JP4907826B2 (en) |
KR (1) | KR100711047B1 (en) |
CN (1) | CN1266674C (en) |
AT (1) | ATE341074T1 (en) |
AU (1) | AU2000233851A1 (en) |
DE (1) | DE60031002T2 (en) |
ES (1) | ES2269112T3 (en) |
HK (1) | HK1055833A1 (en) |
WO (1) | WO2001065544A1 (en) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6438518B1 (en) * | 1999-10-28 | 2002-08-20 | Qualcomm Incorporated | Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions |
CA2392640A1 (en) * | 2002-07-05 | 2004-01-05 | Voiceage Corporation | A method and device for efficient in-based dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems |
JP4568732B2 (en) * | 2003-12-19 | 2010-10-27 | クリエイティブ テクノロジー リミテッド | Method and system for processing digital images |
US7739120B2 (en) | 2004-05-17 | 2010-06-15 | Nokia Corporation | Selection of coding models for encoding an audio signal |
WO2007040365A1 (en) * | 2005-10-05 | 2007-04-12 | Lg Electronics Inc. | Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor |
CN101283406B (en) * | 2005-10-05 | 2013-06-19 | Lg电子株式会社 | Method and apparatus for signal processing and encoding and decoding method, and apparatus thereof |
KR100647336B1 (en) * | 2005-11-08 | 2006-11-23 | 삼성전자주식회사 | Apparatus and method for adaptive time/frequency-based encoding/decoding |
US8010352B2 (en) | 2006-06-21 | 2011-08-30 | Samsung Electronics Co., Ltd. | Method and apparatus for adaptively encoding and decoding high frequency band |
KR101390188B1 (en) * | 2006-06-21 | 2014-04-30 | 삼성전자주식회사 | Method and apparatus for encoding and decoding adaptive high frequency band |
US9159333B2 (en) | 2006-06-21 | 2015-10-13 | Samsung Electronics Co., Ltd. | Method and apparatus for adaptively encoding and decoding high frequency band |
CN101145345B (en) * | 2006-09-13 | 2011-02-09 | 华为技术有限公司 | Audio frequency classification method |
KR101131880B1 (en) * | 2007-03-23 | 2012-04-03 | 삼성전자주식회사 | Method and apparatus for encoding audio signal, and method and apparatus for decoding audio signal |
DE112007003567A5 (en) * | 2007-04-26 | 2010-04-08 | Siemens Aktiengesellschaft | Module with automatic extension of a monitoring circuit |
KR101756834B1 (en) | 2008-07-14 | 2017-07-12 | 삼성전자주식회사 | Method and apparatus for encoding and decoding of speech and audio signal |
US8990094B2 (en) * | 2010-09-13 | 2015-03-24 | Qualcomm Incorporated | Coding and decoding a transient frame |
KR101525185B1 (en) | 2011-02-14 | 2015-06-02 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | Apparatus and method for coding a portion of an audio signal using a transient detection and a quality result |
BR112012029132B1 (en) | 2011-02-14 | 2021-10-05 | Fraunhofer - Gesellschaft Zur Förderung Der Angewandten Forschung E.V | REPRESENTATION OF INFORMATION SIGNAL USING OVERLAY TRANSFORMED |
PL3239978T3 (en) | 2011-02-14 | 2019-07-31 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Encoding and decoding of pulse positions of tracks of an audio signal |
JP5849106B2 (en) | 2011-02-14 | 2016-01-27 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | Apparatus and method for error concealment in low delay integrated speech and audio coding |
CA2827249C (en) | 2011-02-14 | 2016-08-23 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for processing a decoded audio signal in a spectral domain |
JP5625126B2 (en) | 2011-02-14 | 2014-11-12 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | Linear prediction based coding scheme using spectral domain noise shaping |
TWI488176B (en) * | 2011-02-14 | 2015-06-11 | Fraunhofer Ges Forschung | Encoding and decoding of pulse positions of tracks of an audio signal |
CN103503062B (en) | 2011-02-14 | 2016-08-10 | 弗劳恩霍夫应用研究促进协会 | For using the prediction part of alignment by audio-frequency signal coding and the apparatus and method of decoding |
CN103534754B (en) | 2011-02-14 | 2015-09-30 | 弗兰霍菲尔运输应用研究公司 | The audio codec utilizing noise to synthesize during the inertia stage |
EP2757558A1 (en) * | 2013-01-18 | 2014-07-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Time domain level adjustment for audio signal decoding or encoding |
US9685166B2 (en) | 2014-07-26 | 2017-06-20 | Huawei Technologies Co., Ltd. | Classification between time-domain coding and frequency domain coding |
EP3067886A1 (en) | 2015-03-09 | 2016-09-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
US10957331B2 (en) * | 2018-12-17 | 2021-03-23 | Microsoft Technology Licensing, Llc | Phase reconstruction in a speech decoder |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0215915A4 (en) * | 1985-03-18 | 1987-11-25 | Massachusetts Inst Technology | Processing of acoustic waveforms. |
US5023910A (en) * | 1988-04-08 | 1991-06-11 | At&T Bell Laboratories | Vector quantization in a harmonic speech coding arrangement |
JPH02288739A (en) * | 1989-04-28 | 1990-11-28 | Fujitsu Ltd | Voice coding and decoding transmission system |
JP3680374B2 (en) * | 1995-09-28 | 2005-08-10 | ソニー株式会社 | Speech synthesis method |
JPH10214100A (en) * | 1997-01-31 | 1998-08-11 | Sony Corp | Voice synthesizing method |
US6233550B1 (en) * | 1997-08-29 | 2001-05-15 | The Regents Of The University Of California | Method and apparatus for hybrid coding of speech at 4kbps |
ATE302991T1 (en) * | 1998-01-22 | 2005-09-15 | Deutsche Telekom Ag | METHOD FOR SIGNAL-CONTROLLED SWITCHING BETWEEN DIFFERENT AUDIO CODING SYSTEMS |
JPH11224099A (en) * | 1998-02-06 | 1999-08-17 | Sony Corp | Device and method for phase quantization |
-
2000
- 2000-02-29 CN CNB008192219A patent/CN1266674C/en not_active Expired - Lifetime
- 2000-02-29 WO PCT/US2000/005140 patent/WO2001065544A1/en active IP Right Grant
- 2000-02-29 KR KR1020027011306A patent/KR100711047B1/en active IP Right Grant
- 2000-02-29 ES ES00912053T patent/ES2269112T3/en not_active Expired - Lifetime
- 2000-02-29 AU AU2000233851A patent/AU2000233851A1/en not_active Abandoned
- 2000-02-29 EP EP00912053A patent/EP1259957B1/en not_active Expired - Lifetime
- 2000-02-29 AT AT00912053T patent/ATE341074T1/en not_active IP Right Cessation
- 2000-02-29 DE DE60031002T patent/DE60031002T2/en not_active Expired - Lifetime
- 2000-02-29 JP JP2001564148A patent/JP4907826B2/en not_active Expired - Lifetime
-
2003
- 2003-11-07 HK HK03108074A patent/HK1055833A1/en not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
KR100711047B1 (en) | 2007-04-24 |
EP1259957B1 (en) | 2006-09-27 |
WO2001065544A1 (en) | 2001-09-07 |
AU2000233851A1 (en) | 2001-09-12 |
JP2003525473A (en) | 2003-08-26 |
EP1259957A1 (en) | 2002-11-27 |
CN1437747A (en) | 2003-08-20 |
DE60031002D1 (en) | 2006-11-09 |
JP4907826B2 (en) | 2012-04-04 |
DE60031002T2 (en) | 2007-05-10 |
CN1266674C (en) | 2006-07-26 |
ATE341074T1 (en) | 2006-10-15 |
ES2269112T3 (en) | 2007-04-01 |
KR20020081374A (en) | 2002-10-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
HK1055833A1 (en) | Closed-loop multimode mixed-domain linear prediction speech coder and method of processing frames | |
US11682404B2 (en) | Audio decoding device and method with decoding branches for decoding audio signal encoded in a plurality of domains | |
EP1747554B1 (en) | Audio encoding with different coding frame lengths | |
KR101853352B1 (en) | Apparatus and method for encoding and decoding an audio signal using an aligned look-ahead portion | |
CN105359211B (en) | The voiceless sound of speech processes/voiced sound decision method and device | |
MX2012010439A (en) | Audio signal decoder, audio signal encoder, method for decoding an audio signal, method for encoding an audio signal and computer program using a pitch-dependent adaptation of a coding context. | |
EP0770990A3 (en) | Speech encoding method and apparatus and speech decoding method and apparatus | |
CN105451842B (en) | Selection first encodes the apparatus and method of one of algorithm and second coding algorithm | |
CN105637583A (en) | Adaptive bandwidth extension and apparatus for the same | |
EP2676270B1 (en) | Coding a portion of an audio signal using a transient detection and a quality result | |
CN101286319B (en) | Speech coding system to improve packet loss repairing quality | |
KR20160128871A (en) | User-customizable voice revision method of converting voice by parameter modification and voice revision device implementing the same | |
CN106575505A (en) | Frame loss management in an fd/lpd transition context | |
US9418671B2 (en) | Adaptive high-pass post-filter | |
US20080154586A1 (en) | Dual-Pulse Excited Linear Prediction For Speech Coding | |
Stachurski et al. | A 4 kb/s hybrid MELP/CELP coder with alignment phase encoding and zero-phase equalization | |
US20080154587A1 (en) | Gain Quantization System for Speech Coding to Improve Packet Loss Concealment | |
Wang | Variable rate multi-mode excitation coding of speech at 2.4 kbps | |
Tang et al. | Fixed bit-rate PWI speech coding with variable frame length | |
Bäckström et al. | Relaxed Code-Excited Linear Prediction (RCELP) | |
Chong-White et al. | Improved signal analysis and time-synchronous reconstruction in waveform interpolation coding | |
Ku et al. | DEPENDENT SPECTRAL QUANTIZATION | |
JPH07135490A (en) | Voice detector and vocoder having voice detector | |
Balint | Efficient vector quantization of speech spectral parameters [articol] |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PE | Patent expired |
Effective date: 20200228 |