CA2219358A1 - Speech signal quantization using human auditory models in predictive coding systems - Google Patents

Speech signal quantization using human auditory models in predictive coding systems Download PDF

Info

Publication number
CA2219358A1
CA2219358A1 CA 2219358 CA2219358A CA2219358A1 CA 2219358 A1 CA2219358 A1 CA 2219358A1 CA 2219358 CA2219358 CA 2219358 CA 2219358 A CA2219358 A CA 2219358A CA 2219358 A1 CA2219358 A1 CA 2219358A1
Authority
CA
Canada
Prior art keywords
signal
speech
pitch
lpc
processor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA 2219358
Other languages
English (en)
French (fr)
Inventor
Juin-Hwey Chen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of CA2219358A1 publication Critical patent/CA2219358A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002Dynamic bit allocation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
CA 2219358 1996-02-26 1997-02-26 Speech signal quantization using human auditory models in predictive coding systems Abandoned CA2219358A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US1229696P 1996-02-26 1996-02-26
US60/012,296 1996-02-26

Publications (1)

Publication Number Publication Date
CA2219358A1 true CA2219358A1 (en) 1997-08-28

Family

ID=21754300

Family Applications (1)

Application Number Title Priority Date Filing Date
CA 2219358 Abandoned CA2219358A1 (en) 1996-02-26 1997-02-26 Speech signal quantization using human auditory models in predictive coding systems

Country Status (5)

Country Link
EP (1) EP0954851A1 (enrdf_load_stackoverflow)
JP (1) JPH11504733A (enrdf_load_stackoverflow)
CA (1) CA2219358A1 (enrdf_load_stackoverflow)
MX (1) MX9708203A (enrdf_load_stackoverflow)
WO (1) WO1997031367A1 (enrdf_load_stackoverflow)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6397178B1 (en) * 1998-09-18 2002-05-28 Conexant Systems, Inc. Data organizational scheme for enhanced selection of gain parameters for speech coding
US6778953B1 (en) * 2000-06-02 2004-08-17 Agere Systems Inc. Method and apparatus for representing masked thresholds in a perceptual audio coder
KR100871999B1 (ko) * 2001-05-08 2008-12-05 코닌클리케 필립스 일렉트로닉스 엔.브이. 오디오 코딩
US7451091B2 (en) 2003-10-07 2008-11-11 Matsushita Electric Industrial Co., Ltd. Method for determining time borders and frequency resolutions for spectral envelope coding
DE102006022346B4 (de) * 2006-05-12 2008-02-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Informationssignalcodierung
CA2976485C (en) 2010-07-02 2018-07-24 Dolby International Ab Audio decoder
WO2012161675A1 (en) * 2011-05-20 2012-11-29 Google Inc. Redundant coding unit for audio codec
KR102052144B1 (ko) * 2011-10-24 2019-12-05 엘지전자 주식회사 음성 신호의 대역 선택적 양자화 방법 및 장치
CN111862995A (zh) * 2020-06-22 2020-10-30 北京达佳互联信息技术有限公司 一种码率确定模型训练方法、码率确定方法及装置
KR20230116503A (ko) * 2022-01-28 2023-08-04 한국전자통신연구원 스칼라 양자화와 벡터 양자화를 이용한 부호화 방법 및 부호화 장치, 그리고 복호화 방법 및 복호화 장치
CN116052695A (zh) * 2022-10-28 2023-05-02 陕西师范大学 一种基于波运算的音频听觉密码方法、系统及设备

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5012517A (en) * 1989-04-18 1991-04-30 Pacific Communication Science, Inc. Adaptive transform coder having long term predictor
FR2700632B1 (fr) * 1993-01-21 1995-03-24 France Telecom Système de codage-décodage prédictif d'un signal numérique de parole par transformée adaptative à codes imbriqués.

Also Published As

Publication number Publication date
EP0954851A1 (en) 1999-11-10
MX9708203A (es) 1997-12-31
JPH11504733A (ja) 1999-04-27
EP0954851A4 (enrdf_load_stackoverflow) 1999-11-10
WO1997031367A1 (en) 1997-08-28

Similar Documents

Publication Publication Date Title
CA2185746C (en) Perceptual noise masking measure based on synthesis filter frequency response
CA2185731C (en) Speech signal quantization using human auditory models in predictive coding systems
US6014621A (en) Synthesis of speech signals in the absence of coded parameters
Gersho Advances in speech and audio compression
US6735567B2 (en) Encoding and decoding speech signals variably based on signal classification
US6574593B1 (en) Codebook tables for encoding and decoding
Paliwal et al. Vector quantization of LPC parameters in the presence of channel errors
EP0503684B1 (en) Adaptive filtering method for speech and audio
US6961698B1 (en) Multi-mode bitstream transmission protocol of encoded voice signals with embeded characteristics
JP3490685B2 (ja) 広帯域信号の符号化における適応帯域ピッチ探索のための方法および装置
US6098036A (en) Speech coding system and method including spectral formant enhancer
CA2140329C (en) Decomposition in noise and periodic signal waveforms in waveform interpolation
KR100304092B1 (ko) 오디오 신호 부호화 장치, 오디오 신호 복호화 장치 및 오디오 신호 부호화/복호화 장치
US5699382A (en) Method for noise weighting filtering
US6119082A (en) Speech coding system and method including harmonic generator having an adaptive phase off-setter
US6081776A (en) Speech coding system and method including adaptive finite impulse response filter
JP4176349B2 (ja) マルチモードの音声符号器
US6138092A (en) CELP speech synthesizer with epoch-adaptive harmonic generator for pitch harmonics below voicing cutoff frequency
MXPA96004161A (en) Quantification of speech signals using human auiditive models in predict encoding systems
KR20030046451A (ko) 음성 코딩을 위한 코드북 구조 및 탐색 방법
Ordentlich et al. Low-delay code-excited linear-predictive coding of wideband speech at 32 kbps
CA2219358A1 (en) Speech signal quantization using human auditory models in predictive coding systems
JPH01261930A (ja) 音声復号器のポスト雑音整形フィルタ
CA2303711C (en) Method for noise weighting filtering
Aarskog et al. Predictive coding of speech using microphone/speaker adaptation and vector quantization

Legal Events

Date Code Title Description
EEER Examination request
FZDE Dead