DE69529672T2 - System zur sprachkodierung - Google Patents

System zur sprachkodierung

Info

Publication number
DE69529672T2
DE69529672T2 DE69529672T DE69529672T DE69529672T2 DE 69529672 T2 DE69529672 T2 DE 69529672T2 DE 69529672 T DE69529672 T DE 69529672T DE 69529672 T DE69529672 T DE 69529672T DE 69529672 T2 DE69529672 T2 DE 69529672T2
Authority
DE
Germany
Prior art keywords
parameters
alpha
lsp
codebook
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
DE69529672T
Other languages
English (en)
Other versions
DE69529672D1 (de
Inventor
Masayuki Nishiguchi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Publication of DE69529672D1 publication Critical patent/DE69529672D1/de
Application granted granted Critical
Publication of DE69529672T2 publication Critical patent/DE69529672T2/de
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G10L19/07Line spectrum pair [LSP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Communication Control (AREA)
  • Golf Clubs (AREA)
  • Tires In General (AREA)
DE69529672T 1994-12-21 1995-12-19 System zur sprachkodierung Expired - Fee Related DE69529672T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP6318689A JPH08179796A (ja) 1994-12-21 1994-12-21 音声符号化方法
PCT/JP1995/002607 WO1996019798A1 (fr) 1994-12-21 1995-12-19 Systeme de codage du son

Publications (2)

Publication Number Publication Date
DE69529672D1 DE69529672D1 (de) 2003-03-27
DE69529672T2 true DE69529672T2 (de) 2003-12-18

Family

ID=18101922

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69529672T Expired - Fee Related DE69529672T2 (de) 1994-12-21 1995-12-19 System zur sprachkodierung

Country Status (16)

Country Link
US (1) US5950155A (de)
EP (1) EP0751494B1 (de)
JP (1) JPH08179796A (de)
KR (1) KR970701410A (de)
CN (1) CN1141684A (de)
AT (1) ATE233008T1 (de)
AU (1) AU703046B2 (de)
BR (1) BR9506841A (de)
CA (1) CA2182790A1 (de)
DE (1) DE69529672T2 (de)
ES (1) ES2188679T3 (de)
MY (1) MY112314A (de)
PL (1) PL316008A1 (de)
TR (1) TR199501637A2 (de)
TW (1) TW367484B (de)
WO (1) WO1996019798A1 (de)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3273455B2 (ja) * 1994-10-07 2002-04-08 日本電信電話株式会社 ベクトル量子化方法及びその復号化器
CN1163870C (zh) * 1996-08-02 2004-08-25 松下电器产业株式会社 声音编码装置和方法,声音译码装置,以及声音译码方法
JP3707153B2 (ja) * 1996-09-24 2005-10-19 ソニー株式会社 ベクトル量子化方法、音声符号化方法及び装置
US7788092B2 (en) 1996-09-25 2010-08-31 Qualcomm Incorporated Method and apparatus for detecting bad data packets received by a mobile telephone using decoded speech parameters
KR20000048609A (ko) 1996-09-25 2000-07-25 러셀 비. 밀러 디코딩된 음성 파라미터를 이용하여 이동전화에 의해 수신된 불량 데이터 패킷을 검출하는 방법 및 장치
US6205130B1 (en) 1996-09-25 2001-03-20 Qualcomm Incorporated Method and apparatus for detecting bad data packets received by a mobile telephone using decoded speech parameters
DE19654079A1 (de) * 1996-12-23 1998-06-25 Bayer Ag Endo-ekto-parasitizide Mittel
JP3523649B2 (ja) 1997-03-12 2004-04-26 三菱電機株式会社 音声符号化装置、音声復号装置及び音声符号化復号装置、及び、音声符号化方法、音声復号方法及び音声符号化復号方法
IL120788A (en) * 1997-05-06 2000-07-16 Audiocodes Ltd Systems and methods for encoding and decoding speech for lossy transmission networks
TW408298B (en) * 1997-08-28 2000-10-11 Texas Instruments Inc Improved method for switched-predictive quantization
JP3235543B2 (ja) * 1997-10-22 2001-12-04 松下電器産業株式会社 音声符号化/復号化装置
EP2154679B1 (de) * 1997-12-24 2016-09-14 BlackBerry Limited Verfahren und Vorrichtung zur Sprachkodierung
JP4308345B2 (ja) 1998-08-21 2009-08-05 パナソニック株式会社 マルチモード音声符号化装置及び復号化装置
SE521225C2 (sv) * 1998-09-16 2003-10-14 Ericsson Telefon Ab L M Förfarande och anordning för CELP-kodning/avkodning
JP2000305597A (ja) * 1999-03-12 2000-11-02 Texas Instr Inc <Ti> 音声圧縮のコード化
JP2000308167A (ja) * 1999-04-20 2000-11-02 Mitsubishi Electric Corp 音声符号化装置
US6449313B1 (en) * 1999-04-28 2002-09-10 Lucent Technologies Inc. Shaped fixed codebook search for celp speech coding
GB2352949A (en) * 1999-08-02 2001-02-07 Motorola Ltd Speech coder for communications unit
US6721701B1 (en) * 1999-09-20 2004-04-13 Lucent Technologies Inc. Method and apparatus for sound discrimination
US6510407B1 (en) * 1999-10-19 2003-01-21 Atmel Corporation Method and apparatus for variable rate coding of speech
JP3462464B2 (ja) * 2000-10-20 2003-11-05 株式会社東芝 音声符号化方法、音声復号化方法及び電子装置
KR100446630B1 (ko) * 2002-05-08 2004-09-04 삼성전자주식회사 음성신호에 대한 벡터 양자화 및 역 벡터 양자화 장치와그 방법
EP1383109A1 (de) 2002-07-17 2004-01-21 STMicroelectronics N.V. Verfahren und Vorrichtung für breitbandige Sprachkodierung
JP4816115B2 (ja) * 2006-02-08 2011-11-16 カシオ計算機株式会社 音声符号化装置及び音声符号化方法
BRPI0818062A2 (pt) * 2007-10-12 2015-03-31 Panasonic Corp Quantizador vetorial, quantizador vetorial inverso, e métodos
CN100578619C (zh) 2007-11-05 2010-01-06 华为技术有限公司 编码方法和编码器
GB2466675B (en) * 2009-01-06 2013-03-06 Skype Speech coding
GB2466671B (en) 2009-01-06 2013-03-27 Skype Speech encoding
GB2466673B (en) 2009-01-06 2012-11-07 Skype Quantization
JP2011090031A (ja) * 2009-10-20 2011-05-06 Oki Electric Industry Co Ltd 音声帯域拡張装置及びプログラム、並びに、拡張用パラメータ学習装置及びプログラム
US8280726B2 (en) * 2009-12-23 2012-10-02 Qualcomm Incorporated Gender detection in mobile phones
WO2012091464A1 (ko) 2010-12-29 2012-07-05 삼성전자 주식회사 고주파수 대역폭 확장을 위한 부호화/복호화 장치 및 방법
US9972325B2 (en) 2012-02-17 2018-05-15 Huawei Technologies Co., Ltd. System and method for mixed codebook excitation for speech coding
CN107452391B (zh) 2014-04-29 2020-08-25 华为技术有限公司 音频编码方法及相关装置
US10878831B2 (en) * 2017-01-12 2020-12-29 Qualcomm Incorporated Characteristic-based speech codebook selection

Family Cites Families (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS56111899A (en) * 1980-02-08 1981-09-03 Matsushita Electric Ind Co Ltd Voice synthetizing system and apparatus
JPS5912499A (ja) * 1982-07-12 1984-01-23 松下電器産業株式会社 音声符号化装置
JPS60116000A (ja) * 1983-11-28 1985-06-22 ケイディディ株式会社 音声符号化装置
IT1180126B (it) * 1984-11-13 1987-09-23 Cselt Centro Studi Lab Telecom Procedimento e dispositivo per la codifica e decodifica del segnale vocale mediante tecniche di quantizzazione vettoriale
IT1195350B (it) * 1986-10-21 1988-10-12 Cselt Centro Studi Lab Telecom Procedimento e dispositivo per la codifica e decodifica del segnale vocale mediante estrazione di para metri e tecniche di quantizzazione vettoriale
US4817157A (en) * 1988-01-07 1989-03-28 Motorola, Inc. Digital speech coder having improved vector excitation source
DE3853161T2 (de) * 1988-10-19 1995-08-17 Ibm Vektorquantisierungscodierer.
US5012518A (en) * 1989-07-26 1991-04-30 Itt Corporation Low-bit-rate speech coder using LPC data reduction processing
DE4009033A1 (de) * 1990-03-21 1991-09-26 Bosch Gmbh Robert Vorrichtung zur unterdrueckung einzelner zuendvorgaenge in einer zuendanlage
EP0475759B1 (de) * 1990-09-13 1998-01-07 Oki Electric Industry Co., Ltd. Methode zur Phonemunterscheidung
JP3151874B2 (ja) * 1991-02-26 2001-04-03 日本電気株式会社 音声パラメータ符号化方式および装置
JP3296363B2 (ja) * 1991-04-30 2002-06-24 日本電信電話株式会社 音声の線形予測パラメータ符号化方法
ES2240252T3 (es) * 1991-06-11 2005-10-16 Qualcomm Incorporated Vocodificador de velocidad variable.
US5487086A (en) * 1991-09-13 1996-01-23 Comsat Corporation Transform vector quantization for adaptive predictive coding
US5371853A (en) * 1991-10-28 1994-12-06 University Of Maryland At College Park Method and system for CELP speech coding and codebook for use therewith
JPH05232996A (ja) * 1992-02-20 1993-09-10 Olympus Optical Co Ltd 音声符号化装置
US5651026A (en) * 1992-06-01 1997-07-22 Hughes Electronics Robust vector quantization of line spectral frequencies
JP2746039B2 (ja) * 1993-01-22 1998-04-28 日本電気株式会社 音声符号化方式
US5491771A (en) * 1993-03-26 1996-02-13 Hughes Aircraft Company Real-time implementation of a 8Kbps CELP coder on a DSP pair
IT1270439B (it) * 1993-06-10 1997-05-05 Sip Procedimento e dispositivo per la quantizzazione dei parametri spettrali in codificatori numerici della voce
US5533052A (en) * 1993-10-15 1996-07-02 Comsat Corporation Adaptive predictive coding with transform domain quantization based on block size adaptation, backward adaptive power gain control, split bit-allocation and zero input response compensation
US5602961A (en) * 1994-05-31 1997-02-11 Alaris, Inc. Method and apparatus for speech compression using multi-mode code excited linear predictive coding
FR2720850B1 (fr) * 1994-06-03 1996-08-14 Matra Communication Procédé de codage de parole à prédiction linéaire.
JP3557662B2 (ja) * 1994-08-30 2004-08-25 ソニー株式会社 音声符号化方法及び音声復号化方法、並びに音声符号化装置及び音声復号化装置
US5602959A (en) * 1994-12-05 1997-02-11 Motorola, Inc. Method and apparatus for characterization and reconstruction of speech excitation waveforms
US5699481A (en) * 1995-05-18 1997-12-16 Rockwell International Corporation Timing recovery scheme for packet speech in multiplexing environment of voice with data applications
US5699485A (en) * 1995-06-07 1997-12-16 Lucent Technologies Inc. Pitch delay modification during frame erasures
US5732389A (en) * 1995-06-07 1998-03-24 Lucent Technologies Inc. Voiced/unvoiced classification of speech for excitation codebook selection in celp speech decoding during frame erasures
US5710863A (en) * 1995-09-19 1998-01-20 Chen; Juin-Hwey Speech signal quantization using human auditory models in predictive coding systems

Also Published As

Publication number Publication date
AU4190196A (en) 1996-07-10
AU703046B2 (en) 1999-03-11
CN1141684A (zh) 1997-01-29
EP0751494A1 (de) 1997-01-02
CA2182790A1 (en) 1996-06-27
JPH08179796A (ja) 1996-07-12
ES2188679T3 (es) 2003-07-01
PL316008A1 (en) 1996-12-23
MY112314A (en) 2001-05-31
EP0751494B1 (de) 2003-02-19
US5950155A (en) 1999-09-07
TR199501637A2 (tr) 1996-07-21
DE69529672D1 (de) 2003-03-27
BR9506841A (pt) 1997-10-14
KR970701410A (ko) 1997-03-17
TW367484B (en) 1999-08-21
WO1996019798A1 (fr) 1996-06-27
MX9603416A (es) 1997-12-31
ATE233008T1 (de) 2003-03-15
EP0751494A4 (de) 1998-12-30

Similar Documents

Publication Publication Date Title
DE69529672T2 (de) System zur sprachkodierung
EP0542628B1 (de) Vorrichtung zur Sprachsynthese
DE69125775T2 (de) Sprachkodierungs- und Dekodierungssystem
KR970022701A (ko) 음성부호화방법 및 장치
EP0770990A3 (de) Verfahren und Vorrichtung zur Sprachkodierung und -dekodierung
KR970024627A (ko) 음성부호화방법 및 장치
ATE309601T1 (de) Kodierung periodischer sprache
EP0501420A3 (en) Speech coding method and system
DE69126062T2 (de) System zur Sprachcodierung und -decodierung
FI934424A0 (fi) Tid-frekvens -interpolering med tillaempning pao laoghastighets talkodning
US5598504A (en) Speech coding system to reduce distortion through signal overlap
DE69941947D1 (de) Celp sprachkodierer
DE68913691D1 (de) System zur Sprachcodierung und -decodierung.
FI935423A0 (fi) Foerfarande samt anordning foer kvantisering av excitationsfoerstaerkning i talkodare, som baserade pao syntesanalysteknik
JPH1011094A (ja) 広帯域音声符号化装置及び音声復号装置
DE68915057D1 (de) Kodierungsverfahren und linearer Prädiktionssprachkodierer.
DE69624207D1 (de) Sprachkodierer mit Vorrichtung zur Abschätzung der Abweichung des Leistungsverlaufs eines synthetischen Signals von einem Eingangssignal
CA2118986C (en) Speech coding system
TH22247A (th) วิธีเข้ารหัสเสียงพูด
TH22247B (th) วิธีเข้ารหัสเสียงพูด
DK0469997T3 (da) Fremgangsmåde til kodning og talekoder til analyse ved syntese
JPH0683393A (ja) 音声符号化装置

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8339 Ceased/non-payment of the annual fee