MX9604159A - Medicion de enmascaramiento de ruido perceptual basado en la respuesta de frecuencia del filtro de sintesis. - Google Patents

Medicion de enmascaramiento de ruido perceptual basado en la respuesta de frecuencia del filtro de sintesis.

Info

Publication number
MX9604159A
MX9604159A MX9604159A MX9604159A MX9604159A MX 9604159 A MX9604159 A MX 9604159A MX 9604159 A MX9604159 A MX 9604159A MX 9604159 A MX9604159 A MX 9604159A MX 9604159 A MX9604159 A MX 9604159A
Authority
MX
Mexico
Prior art keywords
speech
tpc
frequency response
measured based
synthesis filter
Prior art date
Application number
MX9604159A
Other languages
English (en)
Inventor
Juin-Hwey Chen
Original Assignee
At & T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by At & T Corp filed Critical At & T Corp
Publication of MX9604159A publication Critical patent/MX9604159A/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

Se proporciona un sistema de comprension de habla denominado "Transformacion de Codificacion Predictiva" o TPC, que proporciona codificacion de habla de ancho de banda de 7 kHz (muestreo de 16 kHz) en un alcance de velocidad de bitios objetivo de 16 a 32 kb/s (1 a 2 bitios/muestra). El sistema utiliza la prediccion a corto plazo y largo plazo para eliminar redundancia en el habla. Una residual de prediccion se transforma y se codifica en el dominio de frecuencia para tomar ventaja del conocimiento en la percepcion auditiva humana. El codificador de TPC utiliza unicamente la cuantificacion de circuito abierto y por lo tanto tiene una complejidlmente transparente a 32 kb/s, muy buena a 24 kb/s y aceptable a 16 kb/s.
MX9604159A 1995-09-19 1996-09-18 Medicion de enmascaramiento de ruido perceptual basado en la respuesta de frecuencia del filtro de sintesis. MX9604159A (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US08/530,981 US5790759A (en) 1995-09-19 1995-09-19 Perceptual noise masking measure based on synthesis filter frequency response

Publications (1)

Publication Number Publication Date
MX9604159A true MX9604159A (es) 1997-03-29

Family

ID=24115777

Family Applications (1)

Application Number Title Priority Date Filing Date
MX9604159A MX9604159A (es) 1995-09-19 1996-09-18 Medicion de enmascaramiento de ruido perceptual basado en la respuesta de frecuencia del filtro de sintesis.

Country Status (7)

Country Link
US (1) US5790759A (es)
EP (1) EP0764938B1 (es)
JP (1) JPH09152895A (es)
CA (1) CA2185746C (es)
DE (1) DE69615302T2 (es)
ES (1) ES2160772T3 (es)
MX (1) MX9604159A (es)

Families Citing this family (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2729246A1 (fr) * 1995-01-06 1996-07-12 Matra Communication Procede de codage de parole a analyse par synthese
JP3266819B2 (ja) * 1996-07-30 2002-03-18 株式会社エイ・ティ・アール人間情報通信研究所 周期信号変換方法、音変換方法および信号分析方法
DE19730130C2 (de) * 1997-07-14 2002-02-28 Fraunhofer Ges Forschung Verfahren zum Codieren eines Audiosignals
AU3372199A (en) * 1998-03-30 1999-10-18 Voxware, Inc. Low-complexity, low-delay, scalable and embedded speech and audio coding with adaptive frame loss concealment
US6115689A (en) * 1998-05-27 2000-09-05 Microsoft Corporation Scalable audio coder and decoder
US6253165B1 (en) * 1998-06-30 2001-06-26 Microsoft Corporation System and method for modeling probability distribution functions of transform coefficients of encoded signal
US6256607B1 (en) * 1998-09-08 2001-07-03 Sri International Method and apparatus for automatic recognition using features encoded with product-space vector quantization
US6073093A (en) * 1998-10-14 2000-06-06 Lockheed Martin Corp. Combined residual and analysis-by-synthesis pitch-dependent gain estimation for linear predictive coders
US7058572B1 (en) * 2000-01-28 2006-06-06 Nortel Networks Limited Reducing acoustic noise in wireless and landline based telephony
US6778953B1 (en) * 2000-06-02 2004-08-17 Agere Systems Inc. Method and apparatus for representing masked thresholds in a perceptual audio coder
US6754618B1 (en) * 2000-06-07 2004-06-22 Cirrus Logic, Inc. Fast implementation of MPEG audio coding
US7171355B1 (en) * 2000-10-25 2007-01-30 Broadcom Corporation Method and apparatus for one-stage and two-stage noise feedback coding of speech and audio signals
KR100871999B1 (ko) * 2001-05-08 2008-12-05 코닌클리케 필립스 일렉트로닉스 엔.브이. 오디오 코딩
US7110942B2 (en) * 2001-08-14 2006-09-19 Broadcom Corporation Efficient excitation quantization in a noise feedback coding system using correlation techniques
US7240001B2 (en) * 2001-12-14 2007-07-03 Microsoft Corporation Quality improvement techniques in an audio encoder
US7206740B2 (en) * 2002-01-04 2007-04-17 Broadcom Corporation Efficient excitation quantization in noise feedback coding with general noise shaping
US7236927B2 (en) * 2002-02-06 2007-06-26 Broadcom Corporation Pitch extraction methods and systems for speech coding using interpolation techniques
US7529661B2 (en) * 2002-02-06 2009-05-05 Broadcom Corporation Pitch extraction methods and systems for speech coding using quadratically-interpolated and filtered peaks for multiple time lag extraction
US7752037B2 (en) * 2002-02-06 2010-07-06 Broadcom Corporation Pitch extraction methods and systems for speech coding using sub-multiple time lag extraction
US7398204B2 (en) * 2002-08-27 2008-07-08 Her Majesty In Right Of Canada As Represented By The Minister Of Industry Bit rate reduction in audio encoders by exploiting inharmonicity effects and auditory temporal masking
US7502743B2 (en) 2002-09-04 2009-03-10 Microsoft Corporation Multi-channel audio encoding and decoding with multi-channel transform selection
EP1513137A1 (en) * 2003-08-22 2005-03-09 MicronasNIT LCC, Novi Sad Institute of Information Technologies Speech processing system and method with multi-pulse excitation
FR2859566B1 (fr) * 2003-09-05 2010-11-05 Eads Telecom Procede de transmission d'un flux d'information par insertion a l'interieur d'un flux de donnees de parole, et codec parametrique pour sa mise en oeuvre
US7460990B2 (en) 2004-01-23 2008-12-02 Microsoft Corporation Efficient coding of digital media spectral data using wide-sense perceptual similarity
US8473286B2 (en) * 2004-02-26 2013-06-25 Broadcom Corporation Noise feedback coding system and method for providing generalized noise shaping within a simple filter structure
KR100851970B1 (ko) * 2005-07-15 2008-08-12 삼성전자주식회사 오디오 신호의 중요주파수 성분 추출방법 및 장치와 이를이용한 저비트율 오디오 신호 부호화/복호화 방법 및 장치
US7831434B2 (en) 2006-01-20 2010-11-09 Microsoft Corporation Complex-transform channel coding with extended-band frequency coding
US8190425B2 (en) * 2006-01-20 2012-05-29 Microsoft Corporation Complex cross-correlation parameters for multi-channel audio
US20070239295A1 (en) * 2006-02-24 2007-10-11 Thompson Jeffrey K Codec conditioning system and method
DE602007003023D1 (de) * 2006-05-30 2009-12-10 Koninkl Philips Electronics Nv Linear-prädiktive codierung eines audiosignals
US9159333B2 (en) 2006-06-21 2015-10-13 Samsung Electronics Co., Ltd. Method and apparatus for adaptively encoding and decoding high frequency band
FR2912249A1 (fr) * 2007-02-02 2008-08-08 France Telecom Codage/decodage perfectionnes de signaux audionumeriques.
US7885819B2 (en) 2007-06-29 2011-02-08 Microsoft Corporation Bitstream syntax for multi-process audio decoding
EP2077550B8 (en) * 2008-01-04 2012-03-14 Dolby International AB Audio encoder and decoder
US9117458B2 (en) * 2009-11-12 2015-08-25 Lg Electronics Inc. Apparatus for processing an audio signal and method thereof
US9883312B2 (en) 2013-05-29 2018-01-30 Qualcomm Incorporated Transformed higher order ambisonics audio data
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
EP3079151A1 (en) * 2015-04-09 2016-10-12 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder and method for encoding an audio signal
KR20220005379A (ko) * 2020-07-06 2022-01-13 한국전자통신연구원 천이구간 부호화 왜곡에 강인한 오디오 부호화/복호화 장치 및 방법

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3679821A (en) * 1970-04-30 1972-07-25 Bell Telephone Labor Inc Transform coding of image difference signals
JPS60116000A (ja) * 1983-11-28 1985-06-22 ケイディディ株式会社 音声符号化装置
US4969192A (en) * 1987-04-06 1990-11-06 Voicecraft, Inc. Vector adaptive predictive coder for speech and audio
NL8700985A (nl) * 1987-04-27 1988-11-16 Philips Nv Systeem voor sub-band codering van een digitaal audiosignaal.
US5012517A (en) * 1989-04-18 1991-04-30 Pacific Communication Science, Inc. Adaptive transform coder having long term predictor
US5206884A (en) * 1990-10-25 1993-04-27 Comsat Transform domain quantization technique for adaptive predictive coding
US5285498A (en) * 1992-03-02 1994-02-08 At&T Bell Laboratories Method and apparatus for coding audio signals based on perceptual model

Also Published As

Publication number Publication date
DE69615302D1 (de) 2001-10-25
EP0764938A2 (en) 1997-03-26
EP0764938A3 (en) 1998-06-10
ES2160772T3 (es) 2001-11-16
EP0764938B1 (en) 2001-09-19
JPH09152895A (ja) 1997-06-10
US5790759A (en) 1998-08-04
DE69615302T2 (de) 2002-07-04
CA2185746C (en) 2001-06-05
CA2185746A1 (en) 1997-03-20

Similar Documents

Publication Publication Date Title
MX9604159A (es) Medicion de enmascaramiento de ruido perceptual basado en la respuesta de frecuencia del filtro de sintesis.
MX9604161A (es) Cuantificacion de señales del habla que utiliza modelos auiditivos humanos en sistemas de codificacion predictiva.
MX9604160A (es) Sintesis de señales del habla en ausencia de parametros codificados.
Pan Digital audio compression
KR100346066B1 (ko) 오디오신호 코딩방법
US5781888A (en) Perceptual noise shaping in the time domain via LPC prediction in the frequency domain
AU726762B2 (en) A method and a device for coding audio signals and a method and a device for decoding a bit stream
EP0725494A1 (en) Perceptual audio compression based on loudness uncertainty
Ambikairajah et al. Auditory masking and MPEG-1 audio compression
CA2090160A1 (en) Rate loop processor for perceptual encoder/decoder
BR9914889B1 (pt) dispositivo e mÉtodo de ponderaÇço de percepÇço para codificaÇço eficiente de sinais em banda larga
WO1999060561A3 (en) Split band linear prediction vocoder
ATE85481T1 (de) System zur teilbandkodierung eines digitalen audiosignals.
CA2176665A1 (en) Method of adapting the noise masking level in an analysis-by-synthesis speech coder employing a short-term perceptual weighting filter
Mahieux et al. Transform coding of audio signals using correlation between successive transform blocks
AU5263396A (en) Predictive split-matrix quantization of spectral parameters for efficient coding of speech
MX9708203A (es) Cuantificacion de señales vocales usando modelos de publico humano en sistemas de codificacion predictivas.
Sen et al. Use of an auditory model to improve speech coders
US7050967B2 (en) Speech coding system
Boland et al. New results in low bitrate audio coding using a combined harmonic-wavelet representation
Yu et al. Efficient multiband excitation linear predictive coding of speech at 1.6 kbps
Najafzadeh-Azghandi et al. Perceptual coding of narrowband audio signals at 8 kbit/s
Brandenburg et al. Extending MPEG-Audio layer III to wideband speech coding
Murgia et al. Very low delay and high quality coding of 20 hz-15 khz speech at 64 kbit/s
Heute Speech and audio coding—aiming at high quality and low data rates

Legal Events

Date Code Title Description
FG Grant or registration
MM Annulment or lapse due to non-payment of fees