DE69309557D1 - Verfahren und Vorrichtung zur Sprachkodierung - Google Patents

Verfahren und Vorrichtung zur Sprachkodierung

Info

Publication number
DE69309557D1
DE69309557D1 DE69309557T DE69309557T DE69309557D1 DE 69309557 D1 DE69309557 D1 DE 69309557D1 DE 69309557 T DE69309557 T DE 69309557T DE 69309557 T DE69309557 T DE 69309557T DE 69309557 D1 DE69309557 D1 DE 69309557D1
Authority
DE
Germany
Prior art keywords
speech coding
speech
coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69309557T
Other languages
English (en)
Other versions
DE69309557T2 (de
Inventor
Moriya Takehiro
Kataoka Akitoshi
Mano Kazunori
Miki Satoshi
Omuro Hitoshi
Hayashi Shinji
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nippon Telegraph and Telephone Corp
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=27465260&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=DE69309557(D1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Priority claimed from JP04170895A external-priority patent/JP3087796B2/ja
Priority claimed from JP4265194A external-priority patent/JP2853824B2/ja
Priority claimed from JP26519592A external-priority patent/JP2776474B2/ja
Priority claimed from JP07053493A external-priority patent/JP3148778B2/ja
Application filed by Nippon Telegraph and Telephone Corp filed Critical Nippon Telegraph and Telephone Corp
Publication of DE69309557D1 publication Critical patent/DE69309557D1/de
Application granted granted Critical
Publication of DE69309557T2 publication Critical patent/DE69309557T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G10L19/07Line spectrum pair [LSP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/083Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being an excitation gain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • G10L19/135Vector sum excited linear prediction [VSELP]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0002Codebook adaptations
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0003Backward prediction of gain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
DE69309557T 1992-06-29 1993-06-28 Verfahren und Vorrichtung zur Sprachkodierung Expired - Lifetime DE69309557T2 (de)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP04170895A JP3087796B2 (ja) 1992-06-29 1992-06-29 音声の予測符号化装置
JP4265194A JP2853824B2 (ja) 1992-10-02 1992-10-02 音声のパラメータ情報符号化法
JP26519592A JP2776474B2 (ja) 1992-10-02 1992-10-02 多段ベクトル量子化法
JP07053493A JP3148778B2 (ja) 1993-03-29 1993-03-29 音声の符号化方法

Publications (2)

Publication Number Publication Date
DE69309557D1 true DE69309557D1 (de) 1997-05-15
DE69309557T2 DE69309557T2 (de) 1997-10-09

Family

ID=27465260

Family Applications (2)

Application Number Title Priority Date Filing Date
DE69309557T Expired - Lifetime DE69309557T2 (de) 1992-06-29 1993-06-28 Verfahren und Vorrichtung zur Sprachkodierung
DE69328450T Expired - Lifetime DE69328450T2 (de) 1992-06-29 1993-06-28 Verfahren und Vorrichtung zur Sprachkodierung

Family Applications After (1)

Application Number Title Priority Date Filing Date
DE69328450T Expired - Lifetime DE69328450T2 (de) 1992-06-29 1993-06-28 Verfahren und Vorrichtung zur Sprachkodierung

Country Status (3)

Country Link
US (1) US5787391A (de)
EP (2) EP0577488B9 (de)
DE (2) DE69309557T2 (de)

Families Citing this family (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3559588B2 (ja) * 1994-05-30 2004-09-02 キヤノン株式会社 音声合成方法及び装置
US5751903A (en) * 1994-12-19 1998-05-12 Hughes Electronics Low rate multi-mode CELP codec that encodes line SPECTRAL frequencies utilizing an offset
US5648989A (en) * 1994-12-21 1997-07-15 Paradyne Corporation Linear prediction filter coefficient quantizer and filter set
SE504397C2 (sv) * 1995-05-03 1997-01-27 Ericsson Telefon Ab L M Metod för förstärkningskvantisering vid linjärprediktiv talkodning med kodboksexcitering
US5664055A (en) * 1995-06-07 1997-09-02 Lucent Technologies Inc. CS-ACELP speech compression system with adaptive pitch prediction filter gain based on a measure of periodicity
TW317051B (de) * 1996-02-15 1997-10-01 Philips Electronics Nv
JP3259759B2 (ja) * 1996-07-22 2002-02-25 日本電気株式会社 音声信号伝送方法及び音声符号復号化システム
CA2213909C (en) * 1996-08-26 2002-01-22 Nec Corporation High quality speech coder at low bit rates
JPH1091194A (ja) * 1996-09-18 1998-04-10 Sony Corp 音声復号化方法及び装置
US7024355B2 (en) * 1997-01-27 2006-04-04 Nec Corporation Speech coder/decoder
JPH10247098A (ja) * 1997-03-04 1998-09-14 Mitsubishi Electric Corp 可変レート音声符号化方法、可変レート音声復号化方法
US6021325A (en) * 1997-03-10 2000-02-01 Ericsson Inc. Mobile telephone having continuous recording capability
US6161089A (en) * 1997-03-14 2000-12-12 Digital Voice Systems, Inc. Multi-subframe quantization of spectral parameters
US6131084A (en) * 1997-03-14 2000-10-10 Digital Voice Systems, Inc. Dual subframe quantization of spectral magnitudes
US6073092A (en) * 1997-06-26 2000-06-06 Telogy Networks, Inc. Method for speech coding based on a code excited linear prediction (CELP) model
EP1734512B1 (de) 1997-10-22 2015-09-09 Godo Kaisha IP Bridge 1 CELP Kodierer und Verfahren für die CELP Kodierung
JP3842432B2 (ja) 1998-04-20 2006-11-08 株式会社東芝 ベクトル量子化方法
JP3273599B2 (ja) * 1998-06-19 2002-04-08 沖電気工業株式会社 音声符号化レート選択器と音声符号化装置
JP2000056799A (ja) * 1998-08-06 2000-02-25 Matsushita Electric Ind Co Ltd 励振信号生成装置並びに音声符号化装置及び音声復号化装置
US6493665B1 (en) 1998-08-24 2002-12-10 Conexant Systems, Inc. Speech classification and parameter weighting used in codebook search
US6173257B1 (en) * 1998-08-24 2001-01-09 Conexant Systems, Inc Completed fixed codebook for speech encoder
US6714907B2 (en) 1998-08-24 2004-03-30 Mindspeed Technologies, Inc. Codebook structure and search for speech coding
US6556966B1 (en) 1998-08-24 2003-04-29 Conexant Systems, Inc. Codebook structure for changeable pulse multimode speech coding
US7117146B2 (en) * 1998-08-24 2006-10-03 Mindspeed Technologies, Inc. System for improved use of pitch enhancement with subcodebooks
SE519563C2 (sv) 1998-09-16 2003-03-11 Ericsson Telefon Ab L M Förfarande och kodare för linjär prediktiv analys-genom- synteskodning
CA2259094A1 (en) * 1999-01-15 2000-07-15 Universite De Sherbrooke A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders
US6138089A (en) * 1999-03-10 2000-10-24 Infolio, Inc. Apparatus system and method for speech compression and decompression
EP1221694B1 (de) * 1999-09-14 2006-07-19 Fujitsu Limited Sprachkodierer/dekodierer
EP1221162B1 (de) * 1999-09-30 2005-06-29 STMicroelectronics Asia Pacific Pte Ltd. G.723.1 audiokodierer
AU2001253752A1 (en) * 2000-04-24 2001-11-07 Qualcomm Incorporated Method and apparatus for predictively quantizing voiced speech
KR100355033B1 (ko) * 2000-12-30 2002-10-19 주식회사 실트로닉 테크놀로지 선형예측 분석을 이용한 워터마크 삽입/추출 장치 및 그방법
US7617096B2 (en) * 2001-08-16 2009-11-10 Broadcom Corporation Robust quantization and inverse quantization using illegal space
US7647223B2 (en) * 2001-08-16 2010-01-12 Broadcom Corporation Robust composite quantization with sub-quantizers and inverse sub-quantizers using illegal space
US7054807B2 (en) * 2002-11-08 2006-05-30 Motorola, Inc. Optimizing encoder for efficiently determining analysis-by-synthesis codebook-related parameters
JP2005202262A (ja) * 2004-01-19 2005-07-28 Matsushita Electric Ind Co Ltd 音声信号符号化方法、音声信号復号化方法、送信機、受信機、及びワイヤレスマイクシステム
JP4599558B2 (ja) * 2005-04-22 2010-12-15 国立大学法人九州工業大学 ピッチ周期等化装置及びピッチ周期等化方法、並びに音声符号化装置、音声復号装置及び音声符号化方法
US7831420B2 (en) * 2006-04-04 2010-11-09 Qualcomm Incorporated Voice modifier for speech processing systems
WO2007132750A1 (ja) * 2006-05-12 2007-11-22 Panasonic Corporation Lspベクトル量子化装置、lspベクトル逆量子化装置、およびこれらの方法
US8189621B2 (en) * 2006-05-12 2012-05-29 Microsoft Corporation Stack signaling to application with lack of requested bandwidth
JPWO2008047795A1 (ja) * 2006-10-17 2010-02-25 パナソニック株式会社 ベクトル量子化装置、ベクトル逆量子化装置、およびこれらの方法
US8144793B2 (en) 2006-12-12 2012-03-27 Microsoft Corporation Cognitive multi-user OFDMA
EP2099025A4 (de) * 2006-12-14 2010-12-22 Panasonic Corp Audiocodierungseinrichtung und audiocodierungsverfahren
US20080162150A1 (en) * 2006-12-28 2008-07-03 Vianix Delaware, Llc System and Method for a High Performance Audio Codec
FR2911228A1 (fr) * 2007-01-05 2008-07-11 France Telecom Codage par transformee, utilisant des fenetres de ponderation et a faible retard.
US8195452B2 (en) * 2008-06-12 2012-06-05 Nokia Corporation High-quality encoding at low-bit rates
WO2010003253A1 (en) * 2008-07-10 2010-01-14 Voiceage Corporation Variable bit rate lpc filter quantizing and inverse quantizing device and method
CN102144256B (zh) 2008-07-17 2013-08-28 诺基亚公司 用于针对矢量量化器的快速最近邻搜索的方法和设备
US9842598B2 (en) * 2013-02-21 2017-12-12 Qualcomm Incorporated Systems and methods for mitigating potential frame instability
CN104751850B (zh) * 2013-12-25 2021-04-02 北京天籁传音数字技术有限公司 一种用于音频信号的矢量量化编解码方法及装置

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3779351D1 (de) * 1986-03-28 1992-07-02 American Telephone And Telegraph Co., New York, N.Y., Us
IT1195350B (it) * 1986-10-21 1988-10-12 Cselt Centro Studi Lab Telecom Procedimento e dispositivo per la codifica e decodifica del segnale vocale mediante estrazione di para metri e tecniche di quantizzazione vettoriale
US4899385A (en) * 1987-06-26 1990-02-06 American Telephone And Telegraph Company Code excited linear predictive vocoder
CA1337217C (en) * 1987-08-28 1995-10-03 Daniel Kenneth Freeman Speech coding
US5010574A (en) * 1989-06-13 1991-04-23 At&T Bell Laboratories Vector quantizer search arrangement
US4975956A (en) * 1989-07-26 1990-12-04 Itt Corporation Low-bit-rate speech coder using LPC data reduction processing
JP3102015B2 (ja) * 1990-05-28 2000-10-23 日本電気株式会社 音声復号化方法
DE69022237T2 (de) * 1990-10-16 1996-05-02 Ibm Sprachsyntheseeinrichtung nach dem phonetischen Hidden-Markov-Modell.
US5396576A (en) * 1991-05-22 1995-03-07 Nippon Telegraph And Telephone Corporation Speech coding and decoding methods using adaptive and random code books
US5432883A (en) * 1992-04-24 1995-07-11 Olympus Optical Co., Ltd. Voice coding apparatus with synthesized speech LPC code book
IT1257065B (it) * 1992-07-31 1996-01-05 Sip Codificatore a basso ritardo per segnali audio, utilizzante tecniche di analisi per sintesi.
US5426460A (en) * 1993-12-17 1995-06-20 At&T Corp. Virtual multimedia service for mass market connectivity

Also Published As

Publication number Publication date
EP0577488A1 (de) 1994-01-05
EP0751496A3 (de) 1997-01-22
EP0751496B1 (de) 2000-04-19
EP0751496A2 (de) 1997-01-02
US5787391A (en) 1998-07-28
EP0577488B9 (de) 2007-10-03
DE69309557T2 (de) 1997-10-09
EP0577488B1 (de) 1997-04-09
DE69328450T2 (de) 2001-01-18
DE69328450D1 (de) 2000-05-25

Similar Documents

Publication Publication Date Title
DE69309557D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69631728D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69324629T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69625875D1 (de) Verfahren und Vorrichtung zur Sprachkodierung und -dekodierung
DE69727895D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69518705D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69524829T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69421324T2 (de) Verfahren und Vorrichtung zur Sprachkommunikation
DE69432943T2 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE69031284T2 (de) Verfahren und Einrichtung zur Spracherkennung
DE69332459D1 (de) Verfahren und Vorrichtung zur Zeichenerkennung
DE69431445T2 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69532029D1 (de) Verfahren zur Kodierung und Vorrichtung dafür
DE69618408T2 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69506449T2 (de) Verfahren und vorrichtung zur zwischenbildkodierung
DE69329439T2 (de) Verfahren und Einrichtung zur Kodierung
DE69523032T2 (de) Vorrichtung und Verfahren zur Kodierung der Sprachgrundfrequenz
DE69517829D1 (de) Vorrichtung und Verfahren zur Spracherkennung
DE69030548D1 (de) Verfahren und Einrichtung zur Spracherkennung
DE69614761T2 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69315638D1 (de) Vorrichtung zur Sprachdekodierung und Verfahren zur Dekodierung
DE69821068D1 (de) Vorrichtung und Verfahren zur Sprachkodierung
DE69229974T2 (de) Verfahren und Vorrichtung zur Kodierung von Sprachparametern

Legal Events

Date Code Title Description
8364 No opposition during term of opposition