MX9603416A - Speech encoding method - Google Patents
- Publication number
- MX9603416A
- Authority
- MX
- Mexico
- Prior art keywords
- lsp
- parameter
- alpha
- code
- line spectrum
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
- G10L19/07—Line spectrum pair [LSP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/09—Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0004—Design or structure of the codebook
- G10L2019/0005—Multi-stage vector quantisation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Communication Control (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
- Golf Clubs (AREA)
- Tires In General (AREA)
Abstract
To carry out, for example, code excited linear prediction (CELP) coding, alpha-parameters are extracted from the input speech signal by a linear prediction coding (LPC) analysis circuit 12. An alpha-parameter-to-LSP conversion circuit 13 then converts the alpha-parameters into line spectral pair (LSP) parameters, and a quantizer 14 vector-quantizes a vector of these LSP parameters. A changeover switch 16 is controlled according to the pitch value detected by a pitch detection circuit 22 so as to select and use either the codebook 15M for male voice or the codebook 15F for female voice, improving quantization characteristics without increasing the transmission bit rate.
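The codebook switching described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the 165 Hz threshold, the function names, and the two-dimensional toy codebooks are all assumptions made for illustration, and a nearest-neighbour search under squared Euclidean distance stands in for whatever distortion measure quantizer 14 actually uses.

```python
# Hypothetical pitch threshold (Hz) separating the two codebooks.
# The abstract does not give a value; 165 Hz is an illustrative assumption.
PITCH_THRESHOLD_HZ = 165.0


def select_codebook(pitch_hz, codebook_male, codebook_female,
                    threshold_hz=PITCH_THRESHOLD_HZ):
    """Play the role of changeover switch 16: pick a codebook from the pitch."""
    if pitch_hz < threshold_hz:
        return "M", codebook_male
    return "F", codebook_female


def vector_quantize(lsp, codebook):
    """Return the index of the codevector nearest to the LSP vector.

    Nearest means smallest squared Euclidean distance (an assumption).
    """
    def distortion(index):
        return sum((a - b) ** 2 for a, b in zip(lsp, codebook[index]))

    return min(range(len(codebook)), key=distortion)


def encode_frame(lsp, pitch_hz, codebook_male, codebook_female):
    """Quantize one frame's LSP vector with the pitch-selected codebook."""
    label, codebook = select_codebook(pitch_hz, codebook_male, codebook_female)
    return label, vector_quantize(lsp, codebook)


# Toy codebooks with two 2-dimensional entries each (made-up values).
CODEBOOK_15M = [[0.10, 0.30], [0.20, 0.50]]
CODEBOOK_15F = [[0.15, 0.40], [0.30, 0.60]]

low_pitch_result = encode_frame([0.19, 0.48], 110.0, CODEBOOK_15M, CODEBOOK_15F)
high_pitch_result = encode_frame([0.19, 0.48], 220.0, CODEBOOK_15M, CODEBOOK_15F)
```

Only the codebook index needs to be sent for the LSP vector. The sketch returns the label purely for inspection; it assumes the decoder can repeat the same selection from pitch information it already receives, which the abstract implies by stating that the bit rate does not increase but does not spell out.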
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP6-318689 | 1994-12-21 | | |
JP6318689A JPH08179796A (ja) | 1994-12-21 | 1994-12-21 | Speech encoding method |
JP318,689 | 1994-12-21 | | |
PCT/JP1995/002607 WO1996019798A1 (fr) | 1994-12-21 | 1995-12-19 | Sound coding system |
Publications (2)
Publication Number | Publication Date |
---|---|
MX9603416A (es) | 1997-12-31 |
MXPA96003416A (es) | 1998-09-18 |
Also Published As
Publication number | Publication date |
---|---|
AU703046B2 (en) | 1999-03-11 |
DE69529672D1 (de) | 2003-03-27 |
EP0751494A1 (en) | 1997-01-02 |
EP0751494B1 (en) | 2003-02-19 |
MY112314A (en) | 2001-05-31 |
AU4190196A (en) | 1996-07-10 |
PL316008A1 (en) | 1996-12-23 |
CN1141684A (zh) | 1997-01-29 |
BR9506841A (pt) | 1997-10-14 |
JPH08179796A (ja) | 1996-07-12 |
KR970701410A (ko) | 1997-03-17 |
TW367484B (en) | 1999-08-21 |
CA2182790A1 (en) | 1996-06-27 |
WO1996019798A1 (fr) | 1996-06-27 |
TR199501637A2 (tr) | 1996-07-21 |
ES2188679T3 (es) | 2003-07-01 |
DE69529672T2 (de) | 2003-12-18 |
ATE233008T1 (de) | 2003-03-15 |
EP0751494A4 (en) | 1998-12-30 |
US5950155A (en) | 1999-09-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MY112314A (en) | | Speech encoding method |
FI119085B (fi) | | Method and apparatus for selecting an encoding rate in a variable-rate vocoder |
KR100798668B1 (ko) | | Method and apparatus for coding unvoiced speech |
US5689615A (en) | | Usage of voice activity detection for efficient coding of speech |
KR970022701A (ko) | | Speech encoding method and apparatus |
EP2154681A3 (en) | | Method and apparatus for speech decoding |
CN102150202A (zh) | | Method and apparatus for encoding and decoding an audio/speech signal |
BR0208635A (pt) | | Method and apparatus for quantizing spectral parameter values in a speech encoder, speech encoder for providing a bitstream to a decoder, and mobile station capable of receiving and preprocessing an input speech signal |
EP0714186A3 (en) | | ATM transmission system |
SE9500452D0 (sv) | | Method and apparatus in coding digital information |
EP1129450A1 (en) | | Low bit-rate coding of unvoiced segments of speech |
SE9501640L (sv) | | Method for gain quantization in linear predictive speech coding with codebook excitation |
DE68913691D1 (de) | | System for speech coding and decoding |
KR20010093324A (ko) | | Method and apparatus for 1/8 random number generation for speech coders |
AU5263396A (en) | | Predictive split-matrix quantization of spectral parameters for efficient coding of speech |
EP1310943A3 (en) | | Speech coding apparatus, speech decoding apparatus and speech coding/decoding method |
CA1321025C (en) | | Speech signal coding/decoding system |
WO1996036041A3 (en) | | Transmission system and method for encoding speech with improved pitch detection |
EP0871158A3 (en) | | System for speech coding using a multipulse excitation |
EP0745972A3 (en) | | Method of and apparatus for coding speech signal |
JPH0748696B2 (ja) | | Speech coding system |
JPH05323996A (ja) | | Sound/silence determination method |
Wang | | Speech coding |
EP1355298A3 (en) | | Code excitation linear prediction encoder and decoder |
JPS611129A (ja) | | Voice/data multiplexed transmission system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration | ||
MM | Annulment or lapse due to non-payment of fees |