KR970701410A - Speech Encoding Method (Sound Encoding System) - Google Patents

Speech Encoding Method (Sound Encoding System)

Info

Publication number
KR970701410A
Authority
KR
South Korea
Prior art keywords
term prediction
short
prediction value
signal
speech
Application number
KR1019960704546A
Inventor
Masayuki Nishiguchi
Original Assignee
Nobuyuki Idei
Sony Corporation
Priority date
1994-12-21
Filing date
1995-12-19
Publication date
1997-03-17
Application filed by Nobuyuki Idei, Sony Corporation

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G10L19/07 Line spectrum pair [LSP] vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09 Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks
    • G10L2019/0004 Design or structure of the codebook
    • G10L2019/0005 Multi-stage vector quantisation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Abstract

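In the present invention, when performing, for example, code excited linear prediction (CELP) coding, α parameters are extracted from the input speech signal by a linear predictive coding (LPC) analysis circuit (12) and converted into line spectral pair (LSP) parameters by an α→LSP conversion circuit (13), and the resulting LSP parameter vector is vector-quantized by an LSP vector quantizer (14). At this point, a changeover switch (16) is controlled according to the pitch value detected by a pitch detection circuit (22), so that either the male-voice codebook (15M) or the female-voice codebook (15F) is selected and used. This improves the quantization characteristics without increasing the transmission bit rate.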
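
The codebook switching described in the abstract can be sketched as follows. This is a minimal illustration, not the circuitry of the patent: the threshold value `PITCH_LAG_THRESHOLD`, the function names, and the squared Euclidean distance used for the nearest-codevector search are assumptions made for the example. Pitch is represented here as a lag in samples, so a longer lag means a lower-pitched voice.

```python
from math import inf

# Hypothetical pitch-lag threshold (in samples) separating the two codebooks.
# The abstract only states that the switch follows the detected pitch value.
PITCH_LAG_THRESHOLD = 45


def select_codebook(pitch_lag, male_codebook, female_codebook,
                    threshold=PITCH_LAG_THRESHOLD):
    """Pick a codebook from the detected pitch lag (longer lag = lower voice)."""
    return male_codebook if pitch_lag >= threshold else female_codebook


def quantize(vector, codebook):
    """Return (index, codevector) of the nearest entry by squared Euclidean distance."""
    best_index, best_dist = -1, inf
    for index, code in enumerate(codebook):
        dist = sum((v - c) ** 2 for v, c in zip(vector, code))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index, codebook[best_index]


def encode_frame(lsp_vector, pitch_lag, male_codebook, female_codebook):
    """Vector-quantize one LSP vector with the codebook chosen by pitch."""
    codebook = select_codebook(pitch_lag, male_codebook, female_codebook)
    index, _ = quantize(lsp_vector, codebook)
    return index
```

Because the codebook is chosen from the pitch alone, the transmitted index needs no extra bits to say which codebook was used, provided the decoder has access to the same pitch value. That last point is an assumption here; the abstract does not spell out the decoder side.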

Description

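Speech Encoding Method (Sound Encoding System)
This publication discloses only the essential parts, so the full text is not included.
Fig. 1 is a block diagram showing the schematic configuration of a speech signal encoding apparatus, as a concrete example of an apparatus to which the speech encoding method according to the present invention is applied.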

Claims (7)

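  1. A speech encoding method comprising the steps of: generating a short-term prediction value based on an input speech signal; creating first and second codebooks formed by classifying parameters representing the short-term prediction value with respect to a reference parameter, the reference parameter being one or a combination of a plurality of characteristic parameters of a speech signal; selecting one of the first and second codebooks with respect to the reference parameter of the input speech signal; and encoding the input speech signal by quantizing the short-term prediction value with reference to the selected codebook.
  2. The speech encoding method according to claim 1, wherein the short-term prediction value is a short-term prediction coefficient.
  3. The speech encoding method according to claim 1, wherein the short-term prediction value is a short-term prediction error.
  4. The speech encoding method according to claim 1, wherein the plurality of characteristic parameters are the pitch value, the pitch strength, the frame power, the voiced/unvoiced discrimination flag, and the gradient of the signal spectrum of the speech signal.
  5. The speech encoding method according to claim 1, wherein the input speech signal is encoded by vector-quantizing the short-term prediction value.
  6. The speech encoding method according to claim 1, wherein the input speech signal is encoded by matrix-quantizing the short-term prediction value.
  7. The speech encoding method according to claim 1, wherein the reference parameter is the pitch value of the speech signal, and one of the first and second codebooks is selected according to the magnitude relationship between the pitch value of the input speech signal and a predetermined pitch value.
    ※ Note: Published according to the content of the original application.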
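
The codebook creation step of claim 1, specialized to the pitch criterion of claim 7, might look like the sketch below. The claims do not say how each codebook is trained; plain k-means (Lloyd) clustering is used here purely as a stand-in. The names `build_codebooks` and `train_codebook`, the threshold argument, and the `(pitch, vector)` layout of the training data are all hypothetical.

```python
def nearest(vector, codebook):
    """Index of the codevector closest to `vector` (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((v - c) ** 2 for v, c in zip(vector, codebook[i])),
    )


def train_codebook(vectors, size, iterations=10):
    """Plain k-means training; the first `size` vectors seed the codebook."""
    codebook = [list(v) for v in vectors[:size]]
    for _ in range(iterations):
        cells = [[] for _ in codebook]
        for v in vectors:
            cells[nearest(v, codebook)].append(v)
        for i, cell in enumerate(cells):
            if cell:  # an empty cell keeps its previous codevector
                codebook[i] = [sum(col) / len(cell) for col in zip(*cell)]
    return codebook


def build_codebooks(training_frames, pitch_threshold, size):
    """Classify (pitch, vector) pairs by the reference parameter, then train per class."""
    first = [vec for pitch, vec in training_frames if pitch >= pitch_threshold]
    second = [vec for pitch, vec in training_frames if pitch < pitch_threshold]
    return train_codebook(first, size), train_codebook(second, size)
```

Training each codebook only on frames from its own class is what lets two codebooks of a given size cover their respective parameter distributions more tightly than a single codebook of the same size trained on all frames.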
KR1019960704546A 1994-12-21 1995-12-19 Speech Encoding Method (Sound Encoding System) KR970701410A (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP6318689A JPH08179796A (ja) 1994-12-21 Speech encoding method
JP94-318689 1994-12-21
PCT/JP1995/002607 WO1996019798A1 (fr) 1994-12-21 1995-12-19 Sound coding system

Publications (1)

Publication Number Publication Date
KR970701410A (ko) 1997-03-17

Family

ID=18101922

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1019960704546A KR970701410A (ko) Speech Encoding Method (Sound Encoding System) 1994-12-21 1995-12-19

Country Status (16)

Country Link
US (1) US5950155A (ko)
EP (1) EP0751494B1 (ko)
JP (1) JPH08179796A (ko)
KR (1) KR970701410A (ko)
CN (1) CN1141684A (ko)
AT (1) ATE233008T1 (ko)
AU (1) AU703046B2 (ko)
BR (1) BR9506841A (ko)
CA (1) CA2182790A1 (ko)
DE (1) DE69529672T2 (ko)
ES (1) ES2188679T3 (ko)
MY (1) MY112314A (ko)
PL (1) PL316008A1 (ko)
TR (1) TR199501637A2 (ko)
TW (1) TW367484B (ko)
WO (1) WO1996019798A1 (ko)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100373614B1 (ko) * 1997-12-24 2003-02-26 Mitsubishi Electric Corporation Speech encoding method and speech decoding method, and speech encoding apparatus and speech decoding apparatus

Also Published As

Publication number Publication date
AU4190196A (en) 1996-07-10
AU703046B2 (en) 1999-03-11
CN1141684A (zh) 1997-01-29
EP0751494A1 (en) 1997-01-02
CA2182790A1 (en) 1996-06-27
JPH08179796A (ja) 1996-07-12
ES2188679T3 (es) 2003-07-01
PL316008A1 (en) 1996-12-23
MY112314A (en) 2001-05-31
EP0751494B1 (en) 2003-02-19
US5950155A (en) 1999-09-07
TR199501637A2 (tr) 1996-07-21
DE69529672D1 (de) 2003-03-27
BR9506841A (pt) 1997-10-14
TW367484B (en) 1999-08-21
WO1996019798A1 (fr) 1996-06-27
DE69529672T2 (de) 2003-12-18
MX9603416A (es) 1997-12-31
ATE233008T1 (de) 2003-03-15
EP0751494A4 (en) 1998-12-30

Legal Events

Date Code Title Description
WITN Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid