MX9603416A - Speech encoding method - Google Patents

Speech encoding method

Info

Publication number
MX9603416A
Authority
MX
Mexico
Prior art keywords
lsp
parameter
alpha
code
line spectrum
Prior art date
Application number
MX9603416A
Other languages
English (en)
Other versions
MXPA96003416A (es)
Inventor
Masayuki Nishiguchi
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp
Publication of MX9603416A
Publication of MXPA96003416A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G10L19/07 Line spectrum pair [LSP] vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09 Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks
    • G10L2019/0004 Design or structure of the codebook
    • G10L2019/0005 Multi-stage vector quantisation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Communication Control (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Golf Clubs (AREA)
  • Tires In General (AREA)

Abstract

To carry out, for example, code excited linear prediction (CELP) coding, alpha-parameters are extracted from the input speech signal by a linear prediction coding (LPC) analysis circuit 12. The alpha-parameters are then converted by an alpha-parameter-to-LSP conversion circuit 13 into line spectrum pair (LSP) parameters, and a vector of these LSP parameters is vector-quantized by a quantizer 14. A changeover switch 16 is controlled according to the pitch value detected by a pitch detection circuit 22, so that either the codebook 15M for male voice or the codebook 15F for female voice is selected and used, improving the quantization characteristics without increasing the transmission bit rate.
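The codebook switching described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the two toy codebooks, the 165 Hz threshold, and the function names select_codebook, quantize_lsp and dequantize_lsp are all hypothetical, chosen only to show how the value measured by detection circuit 22 (the pitch) could steer switch 16 between codebooks 15M and 15F before a nearest-neighbour vector quantization of the LSP vector.

```python
from typing import List, Sequence, Tuple

LspVector = Tuple[float, float]

# Hypothetical toy codebooks of 2-dimensional LSP vectors (radians).
# Real codebooks would be trained separately on male and female speech
# and would have a higher dimension (typically the LPC order).
CODEBOOK_MALE: List[LspVector] = [(0.30, 0.90), (0.45, 1.20), (0.60, 1.50)]
CODEBOOK_FEMALE: List[LspVector] = [(0.40, 1.10), (0.55, 1.40), (0.75, 1.80)]

# Assumed pitch threshold in Hz; the abstract does not give a value.
PITCH_THRESHOLD_HZ = 165.0


def select_codebook(pitch_hz: float) -> List[LspVector]:
    """Stand-in for changeover switch 16: choose a codebook from the pitch."""
    return CODEBOOK_MALE if pitch_hz < PITCH_THRESHOLD_HZ else CODEBOOK_FEMALE


def quantize_lsp(lsp: Sequence[float], pitch_hz: float) -> int:
    """Return the index of the nearest code vector in the selected codebook."""
    codebook = select_codebook(pitch_hz)

    def squared_distance(code: LspVector) -> float:
        return sum((a - b) ** 2 for a, b in zip(lsp, code))

    return min(range(len(codebook)), key=lambda i: squared_distance(codebook[i]))


def dequantize_lsp(index: int, pitch_hz: float) -> LspVector:
    """Decoder side: the same pitch value selects the same codebook."""
    return select_codebook(pitch_hz)[index]


if __name__ == "__main__":
    lsp = (0.44, 1.22)
    for pitch in (110.0, 220.0):
        idx = quantize_lsp(lsp, pitch)
        print(pitch, idx, dequantize_lsp(idx, pitch))
```

In this sketch only the codebook index is sent for the LSP vector. If the decoder has access to the same pitch value, it can repeat the codebook choice without any extra selector bit, which is one plausible reading of the claim that quantization improves without increasing the transmission bit rate.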
MXPA/A/1996/003416A 1994-12-21 1996-08-15 Speech encoding method MXPA96003416A (es)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP6-318689 1994-12-21
JP6318689A JPH08179796A (ja) 1994-12-21 1994-12-21 Speech encoding method
JP318,689 1994-12-21
PCT/JP1995/002607 WO1996019798A1 (fr) 1994-12-21 1995-12-19 Sound coding system

Publications (2)

Publication Number Publication Date
MX9603416A MX9603416A (es) 1997-12-31
MXPA96003416A MXPA96003416A (es) 1998-09-18

Also Published As

Publication number Publication date
AU703046B2 (en) 1999-03-11
DE69529672D1 (de) 2003-03-27
EP0751494A1 (en) 1997-01-02
EP0751494B1 (en) 2003-02-19
MY112314A (en) 2001-05-31
AU4190196A (en) 1996-07-10
PL316008A1 (en) 1996-12-23
CN1141684A (zh) 1997-01-29
BR9506841A (pt) 1997-10-14
JPH08179796A (ja) 1996-07-12
KR970701410A (ko) 1997-03-17
TW367484B (en) 1999-08-21
CA2182790A1 (en) 1996-06-27
WO1996019798A1 (fr) 1996-06-27
TR199501637A2 (tr) 1996-07-21
ES2188679T3 (es) 2003-07-01
DE69529672T2 (de) 2003-12-18
ATE233008T1 (de) 2003-03-15
EP0751494A4 (en) 1998-12-30
US5950155A (en) 1999-09-07

Similar Documents

Publication Publication Date Title
MY112314A (en) Speech encoding method
FI119085B (fi) Method and apparatus for selecting an encoding rate in a variable-rate vocoder
KR100798668B1 (ko) Method and apparatus for coding unvoiced speech
US5689615A (en) Usage of voice activity detection for efficient coding of speech
KR970022701A (ko) Speech encoding method and apparatus
EP2154681A3 (en) Method and apparatus for speech decoding
CN102150202A (zh) Method and apparatus for encoding and decoding audio/speech signals
BR0208635A (pt) Method and apparatus for quantizing spectral parameter values in a speech coder, speech coder for providing a bit stream to a decoder, and mobile station capable of receiving and pre-processing an input speech signal
EP0714186A3 (en) ATM transmission system
SE9500452D0 (sv) Method and apparatus in coding digital information
EP1129450A1 (en) Low bit-rate coding of unvoiced segments of speech
SE9501640L (sv) Method for gain quantization in linear predictive speech coding with codebook excitation
DE68913691D1 (de) System for speech coding and decoding.
KR20010093324A (ko) Method and apparatus for eighth-rate random number generation for speech coders
AU5263396A (en) Predictive split-matrix quantization of spectral parameters for efficient coding of speech
EP1310943A3 (en) Speech coding apparatus, speech decoding apparatus and speech coding/decoding method
CA1321025C (en) Speech signal coding/decoding system
WO1996036041A3 (en) Transmission system and method for encoding speech with improved pitch detection
EP0871158A3 (en) System for speech coding using a multipulse excitation
EP0745972A3 (en) Method of and apparatus for coding speech signal
JPH0748696B2 (ja) Speech coding system
JPH05323996A (ja) Speech/silence determination method
Wang Speech coding
EP1355298A3 (en) Code Excitation linear prediction encoder and decoder
JPS611129A (ja) Voice/data multiplexed transmission system

Legal Events

Date Code Title Description
FG Grant or registration
MM Annulment or lapse due to non-payment of fees