CA2219358A1 - Quantification de signaux vocaux a l'aide de modeles auditifs humains dans des systemes de codage predictif - Google Patents

Quantification de signaux vocaux a l'aide de modeles auditifs humains dans des systemes de codage predictif Download PDF

Info

Publication number
CA2219358A1
CA2219358A1 CA 2219358 CA2219358A CA2219358A1 CA 2219358 A1 CA2219358 A1 CA 2219358A1 CA 2219358 CA2219358 CA 2219358 CA 2219358 A CA2219358 A CA 2219358A CA 2219358 A1 CA2219358 A1 CA 2219358A1
Authority
CA
Canada
Prior art keywords
signal
speech
pitch
lpc
processor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA 2219358
Other languages
English (en)
Inventor
Juin-Hwey Chen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of CA2219358A1 publication Critical patent/CA2219358A1/fr
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002Dynamic bit allocation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

La présente invention concerne un système de compression de la parole dénommé "Codage Prédictif par Transformée" ou TPC (pour "Transform Predictive Coding") qui permet de coder la parole de la bande des 7 Khz (échantillonnée à 16 Khz) en atteignant un débit binaire de 16 ou 32 k-octets/s, à raison de 1 à 2 bits par échantillon. Pour annuler les redondances, le système utilise un dispositif prédictif à court terme et à long terme. Le résiduel de prédiction subit une transformation et un codage dans le domaine de fréquences représenté dans la figure, et ce, au niveau du processeur de transformée (110) après prise en compte des données du domaine temporel de l'additionneur (60) et l'entrée des paramètres depuis le processeur de réponse d'amplitude à filtre de mise en forme (100), ce qui corrige le spectre en vue de la perception auditive. Le vocodeur TPC n'utilise qu'une quantification en boucle ouverte comme le démontre la présence d'un extracteur/interpolateur de hauteur de son (70), ce qui fait que le vocodeur TPC n'est que faiblement complexe. La parole est de qualité transparente à 32 k-octets/s, de très bonne qualité à 24 k-octets/s, et acceptable à 16 k-octets/s.
CA 2219358 1996-02-26 1997-02-26 Quantification de signaux vocaux a l'aide de modeles auditifs humains dans des systemes de codage predictif Abandoned CA2219358A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US1229696P 1996-02-26 1996-02-26
US60/012,296 1996-02-26

Publications (1)

Publication Number Publication Date
CA2219358A1 true CA2219358A1 (fr) 1997-08-28

Family

ID=21754300

Family Applications (1)

Application Number Title Priority Date Filing Date
CA 2219358 Abandoned CA2219358A1 (fr) 1996-02-26 1997-02-26 Quantification de signaux vocaux a l'aide de modeles auditifs humains dans des systemes de codage predictif

Country Status (5)

Country Link
EP (1) EP0954851A1 (fr)
JP (1) JPH11504733A (fr)
CA (1) CA2219358A1 (fr)
MX (1) MX9708203A (fr)
WO (1) WO1997031367A1 (fr)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6397178B1 (en) 1998-09-18 2002-05-28 Conexant Systems, Inc. Data organizational scheme for enhanced selection of gain parameters for speech coding
US6778953B1 (en) * 2000-06-02 2004-08-17 Agere Systems Inc. Method and apparatus for representing masked thresholds in a perceptual audio coder
WO2002091363A1 (fr) * 2001-05-08 2002-11-14 Koninklijke Philips Electronics N.V. Codage audio
US7451091B2 (en) 2003-10-07 2008-11-11 Matsushita Electric Industrial Co., Ltd. Method for determining time borders and frequency resolutions for spectral envelope coding
DE102006022346B4 (de) * 2006-05-12 2008-02-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Informationssignalcodierung
KR101696632B1 (ko) 2010-07-02 2017-01-16 돌비 인터네셔널 에이비 선택적인 베이스 포스트 필터
WO2012161675A1 (fr) * 2011-05-20 2012-11-29 Google Inc. Unité de codage redondant pour codec audio
WO2013062201A1 (fr) * 2011-10-24 2013-05-02 엘지전자 주식회사 Procédé et dispositif de quantification de signaux vocaux par sélection de bande
CN111862995A (zh) * 2020-06-22 2020-10-30 北京达佳互联信息技术有限公司 一种码率确定模型训练方法、码率确定方法及装置

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5012517A (en) * 1989-04-18 1991-04-30 Pacific Communication Science, Inc. Adaptive transform coder having long term predictor
FR2700632B1 (fr) * 1993-01-21 1995-03-24 France Telecom Système de codage-décodage prédictif d'un signal numérique de parole par transformée adaptative à codes imbriqués.

Also Published As

Publication number Publication date
JPH11504733A (ja) 1999-04-27
EP0954851A4 (fr) 1999-11-10
EP0954851A1 (fr) 1999-11-10
MX9708203A (es) 1997-12-31
WO1997031367A1 (fr) 1997-08-28

Similar Documents

Publication Publication Date Title
CA2185746C (fr) Methode perceptive de masquage du bruit basee sur la reponse frequentielle d'un filtre de synthese
CA2185731C (fr) Quantification des signaux vocaux au moyen de modeles de l'audition humaine dans les systemes de codage predictif
US6014621A (en) Synthesis of speech signals in the absence of coded parameters
Gersho Advances in speech and audio compression
US6735567B2 (en) Encoding and decoding speech signals variably based on signal classification
US6574593B1 (en) Codebook tables for encoding and decoding
Paliwal et al. Vector quantization of LPC parameters in the presence of channel errors
Chen et al. Real-time vector APC speech coding at 4800 bps with adaptive postfiltering
US6961698B1 (en) Multi-mode bitstream transmission protocol of encoded voice signals with embeded characteristics
JP3490685B2 (ja) 広帯域信号の符号化における適応帯域ピッチ探索のための方法および装置
US4969192A (en) Vector adaptive predictive coder for speech and audio
US6098036A (en) Speech coding system and method including spectral formant enhancer
CA2140329C (fr) Decomposition en bruit et en signaux periodiques dans l'interpolation des formes d'onde
KR100304092B1 (ko) 오디오 신호 부호화 장치, 오디오 신호 복호화 장치 및 오디오 신호 부호화/복호화 장치
US5699382A (en) Method for noise weighting filtering
US6119082A (en) Speech coding system and method including harmonic generator having an adaptive phase off-setter
US6081776A (en) Speech coding system and method including adaptive finite impulse response filter
JP4176349B2 (ja) マルチモードの音声符号器
US6138092A (en) CELP speech synthesizer with epoch-adaptive harmonic generator for pitch harmonics below voicing cutoff frequency
MXPA96004161A (en) Quantification of speech signals using human auiditive models in predict encoding systems
KR20030046451A (ko) 음성 코딩을 위한 코드북 구조 및 탐색 방법
Ordentlich et al. Low-delay code-excited linear-predictive coding of wideband speech at 32 kbps
CA2219358A1 (fr) Quantification de signaux vocaux a l'aide de modeles auditifs humains dans des systemes de codage predictif
JPH01261930A (ja) 音声復号器のポスト雑音整形フィルタ
CA2303711C (fr) Methode de filtrage pour la ponderation du bruit

Legal Events

Date Code Title Description
EEER Examination request
FZDE Dead