MX9708203A - Cuantificacion de señales vocales usando modelos de publico humano en sistemas de codificacion predictivas. - Google Patents

Cuantificacion de señales vocales usando modelos de publico humano en sistemas de codificacion predictivas.

Info

Publication number
MX9708203A
MX9708203A MX9708203A MX9708203A MX9708203A MX 9708203 A MX9708203 A MX 9708203A MX 9708203 A MX9708203 A MX 9708203A MX 9708203 A MX9708203 A MX 9708203A MX 9708203 A MX9708203 A MX 9708203A
Authority
MX
Mexico
Prior art keywords
prediction residual
quantization
residual signals
transform coding
speech coder
Prior art date
Application number
MX9708203A
Other languages
English (en)
Inventor
Juin-Hwey Chen
Original Assignee
At & T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by At & T Corp filed Critical At & T Corp
Publication of MX9708203A publication Critical patent/MX9708203A/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002Dynamic bit allocation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

Un sistema de compresion de señales vocales llamado "Codificacion Predictiva de la Transformada" o TPC, proporciona la codificacion de la señal vocal de banda extendida de 7 kHz (muestreo de 16 kHz) a un intervalo de velocidad de transmision de datos de blanco u objetivo de 16 a 32 kb/s (1 a 2 bitios/muestra). El sistema utiliza la prediccion a corto plazo y a largo plazo para remover la redundancia en la señal vocal. Un subproducto o residuo de la prediccion es transformado y codificado en el dominio de la frecuencia para tomar ventaja del conocimiento en la percepcion auditiva humana. El codificador de TPC utiliza solamente la cuantificacion de circuito abierto y por lo tanto tiene una complejidad bastante baja. La calidad de la señal vocal del TPC es esencialmente transparente a 32 kb/s, muy buena a 24 kb/s, y aceptable a 16kb/s.
MX9708203A 1996-02-26 1997-02-26 Cuantificacion de señales vocales usando modelos de publico humano en sistemas de codificacion predictivas. MX9708203A (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US1229696P 1996-02-26 1996-02-26
PCT/US1997/002898 WO1997031367A1 (en) 1996-02-26 1997-02-26 Multi-stage speech coder with transform coding of prediction residual signals with quantization by auditory models

Publications (1)

Publication Number Publication Date
MX9708203A true MX9708203A (es) 1997-12-31

Family

ID=21754300

Family Applications (1)

Application Number Title Priority Date Filing Date
MX9708203A MX9708203A (es) 1996-02-26 1997-02-26 Cuantificacion de señales vocales usando modelos de publico humano en sistemas de codificacion predictivas.

Country Status (5)

Country Link
EP (1) EP0954851A1 (es)
JP (1) JPH11504733A (es)
CA (1) CA2219358A1 (es)
MX (1) MX9708203A (es)
WO (1) WO1997031367A1 (es)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6397178B1 (en) 1998-09-18 2002-05-28 Conexant Systems, Inc. Data organizational scheme for enhanced selection of gain parameters for speech coding
US6778953B1 (en) * 2000-06-02 2004-08-17 Agere Systems Inc. Method and apparatus for representing masked thresholds in a perceptual audio coder
CN1244904C (zh) * 2001-05-08 2006-03-08 皇家菲利浦电子有限公司 声频信号编码方法和设备
EP1672618B1 (en) * 2003-10-07 2010-12-15 Panasonic Corporation Method for deciding time boundary for encoding spectrum envelope and frequency resolution
DE102006022346B4 (de) * 2006-05-12 2008-02-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Informationssignalcodierung
WO2012000882A1 (en) 2010-07-02 2012-01-05 Dolby International Ab Selective bass post filter
WO2012161675A1 (en) * 2011-05-20 2012-11-29 Google Inc. Redundant coding unit for audio codec
EP2772911B1 (en) * 2011-10-24 2017-12-20 LG Electronics Inc. Method and device for quantizing voice signals in a band-selective manner
CN111862995A (zh) * 2020-06-22 2020-10-30 北京达佳互联信息技术有限公司 一种码率确定模型训练方法、码率确定方法及装置

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5012517A (en) * 1989-04-18 1991-04-30 Pacific Communication Science, Inc. Adaptive transform coder having long term predictor
FR2700632B1 (fr) * 1993-01-21 1995-03-24 France Telecom Système de codage-décodage prédictif d'un signal numérique de parole par transformée adaptative à codes imbriqués.

Also Published As

Publication number Publication date
WO1997031367A1 (en) 1997-08-28
EP0954851A4 (es) 1999-11-10
JPH11504733A (ja) 1999-04-27
EP0954851A1 (en) 1999-11-10
CA2219358A1 (en) 1997-08-28

Similar Documents

Publication Publication Date Title
MX9604161A (es) Cuantificacion de señales del habla que utiliza modelos auiditivos humanos en sistemas de codificacion predictiva.
MX9604160A (es) Sintesis de señales del habla en ausencia de parametros codificados.
MX9604159A (es) Medicion de enmascaramiento de ruido perceptual basado en la respuesta de frecuencia del filtro de sintesis.
CA2194419C (en) Perceptual noise shaping in the time domain via lpc prediction in the frequency domain
US6502069B1 (en) Method and a device for coding audio signals and a method and a device for decoding a bit stream
CA2197128A1 (en) Enhanced Joint Stereo Coding Method Using Temporal Envelope Shaping
US20010038643A1 (en) Method for inserting auxiliary data in an audio data stream
AU2377600A (en) Periodic speech coding
EP0785541B1 (en) Usage of voice activity detection for efficient coding of speech
MY112314A (en) Speech encoding method
GB2030428B (en) Speech signal transform coding
KR970022701A (ko) 음성부호화방법 및 장치
KR20000076297A (ko) 오디오신호 코딩방법
GB2323759A (en) Audio coding and decoding with compression
EP1262956A3 (en) Signal encoding method and apparatus
WO1995010760A3 (en) Improved low bit rate vocoders and methods of operation therefor
SE9500452D0 (sv) Method and apparatus in coding digital information
MX9708203A (es) Cuantificacion de señales vocales usando modelos de publico humano en sistemas de codificacion predictivas.
CA2267219A1 (en) Differential coding for scalable audio coders
Mahieux et al. Transform coding of audio signals using correlation between successive transform blocks
AU5263396A (en) Predictive split-matrix quantization of spectral parameters for efficient coding of speech
TW260846B (en) Speech-coding parameter sequence reconstruction by classification and contour inventory
CA2239294A1 (en) Methods and apparatus for efficient quantization of gain parameters in glpas speech coders
CA2025455A1 (en) Speech coding system with generation of linear predictive coding parameters and control codes from a digital speech signal
CA2213020A1 (en) Wide-band speech spectral quantizer