BR0012540A - Método e equipamento para intercalar métodos de quantificação de informação de linha espectral em um codificador de fala - Google Patents

Método e equipamento para intercalar métodos de quantificação de informação de linha espectral em um codificador de fala

Info

Publication number
BR0012540A
BR0012540A BR0012540-7A BR0012540A BR0012540A BR 0012540 A BR0012540 A BR 0012540A BR 0012540 A BR0012540 A BR 0012540A BR 0012540 A BR0012540 A BR 0012540A
Authority
BR
Brazil
Prior art keywords
technique
vector
quantized
speech coder
moving average
Prior art date
Application number
BR0012540-7A
Other languages
English (en)
Other versions
BRPI0012540B1 (pt
Inventor
Arasanipala Ananthapadmanabhan
Sharath Manjunath
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of BR0012540A publication Critical patent/BR0012540A/pt
Publication of BRPI0012540B1 publication Critical patent/BRPI0012540B1/pt

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G10L19/07Line spectrum pair [LSP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/038Vector quantisation, e.g. TwinVQ audio
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/12Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Analogue/Digital Conversion (AREA)
  • Processing Of Color Television Signals (AREA)
  • Image Processing (AREA)

Abstract

"MéTODO E EQUIPAMENTO PARA INTERCALAR MéTODOS DE QUANTIFICAçãO, DE INFORMAçãO DE LINHA ESPECTRAL EM UM CODIFICADOR DE FALA". Um método e um equipamento para intercalar métodos de quantificação de informação de linha espectral em um codificador de fala incluem quantificar as informação de linha espectral com duas técnicas de quantificação de vetores, a primeira técnica sendo uma técnica baseada em previsão de média não móvel e a segunda técnica sendo uma técnica baseada em previsão de média móvel. Um vetor de informação de linha espectral é um vetor quantificado com a primeira técnica. São computados vetores código de média móvel equivalente para a primeira técnica. Uma memória de um livro código de média móvel de vetores código é atualizada com os vetores código de média móvel equivalente para um número predefinido de frames que foram anteriormente processados pelo codificador de fala. é calculado um vetor de quantificação meta para a segunda técnica com base na memória de livro código de média móvel atualizada. O vetor de quantificação meta é um vetor quantificado com a segunda técnica para gerar um vetor código meta quantificado. A memória do livro código de média móvel é atualizada com o vetor código meta quantificado. Vetores de informação de linha espectral quantificados são derivados a partir do vetor código meta quantificado.
BRPI0012540A 1999-07-19 2000-07-19 codificador de fala, e método para quantização vetorial de um vetor de informações de linhas espectrais de um quadro BRPI0012540B1 (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/356,755 US6393394B1 (en) 1999-07-19 1999-07-19 Method and apparatus for interleaving line spectral information quantization methods in a speech coder
PCT/US2000/019672 WO2001006495A1 (en) 1999-07-19 2000-07-19 Method and apparatus for interleaving line spectral information quantization methods in a speech coder

Publications (2)

Publication Number Publication Date
BR0012540A true BR0012540A (pt) 2004-06-29
BRPI0012540B1 BRPI0012540B1 (pt) 2015-12-01

Family

ID=23402819

Family Applications (1)

Application Number Title Priority Date Filing Date
BRPI0012540A BRPI0012540B1 (pt) 1999-07-19 2000-07-19 codificador de fala, e método para quantização vetorial de um vetor de informações de linhas espectrais de um quadro

Country Status (12)

Country Link
US (1) US6393394B1 (pt)
EP (1) EP1212749B1 (pt)
JP (1) JP4511094B2 (pt)
KR (1) KR100752797B1 (pt)
CN (1) CN1145930C (pt)
AT (1) ATE322068T1 (pt)
AU (1) AU6354600A (pt)
BR (1) BRPI0012540B1 (pt)
DE (1) DE60027012T2 (pt)
ES (1) ES2264420T3 (pt)
HK (1) HK1045396B (pt)
WO (1) WO2001006495A1 (pt)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6735253B1 (en) 1997-05-16 2004-05-11 The Trustees Of Columbia University In The City Of New York Methods and architecture for indexing and editing compressed video over the world wide web
US7143434B1 (en) 1998-11-06 2006-11-28 Seungyup Paek Video description system and method
EP1796083B1 (en) * 2000-04-24 2009-01-07 Qualcomm Incorporated Method and apparatus for predictively quantizing voiced speech
US6937979B2 (en) * 2000-09-15 2005-08-30 Mindspeed Technologies, Inc. Coding based on spectral content of a speech signal
US20040128511A1 (en) * 2000-12-20 2004-07-01 Qibin Sun Methods and systems for generating multimedia signature
US20040204935A1 (en) * 2001-02-21 2004-10-14 Krishnasamy Anandakumar Adaptive voice playout in VOP
WO2002097796A1 (en) * 2001-05-28 2002-12-05 Intel Corporation Providing shorter uniform frame lengths in dynamic time warping for voice conversion
WO2003051031A2 (en) * 2001-12-06 2003-06-19 The Trustees Of Columbia University In The City Of New York Method and apparatus for planarization of a material by growing and removing a sacrificial film
US7289459B2 (en) * 2002-08-07 2007-10-30 Motorola Inc. Radio communication system with adaptive interleaver
WO2006096612A2 (en) 2005-03-04 2006-09-14 The Trustees Of Columbia University In The City Of New York System and method for motion estimation and mode decision for low-complexity h.264 decoder
CN101185127B (zh) * 2005-04-01 2014-04-23 高通股份有限公司 用于编码和解码语音信号的高频带部分的方法和设备
EP2005756A2 (fr) * 2006-03-21 2008-12-24 France Télécom Quantification vectorielle contrainte
US7463170B2 (en) * 2006-11-30 2008-12-09 Broadcom Corporation Method and system for processing multi-rate audio from a plurality of audio processing sources
US7465241B2 (en) * 2007-03-23 2008-12-16 Acushnet Company Functionalized, crosslinked, rubber nanoparticles for use in golf ball castable thermoset layers
WO2009126785A2 (en) 2008-04-10 2009-10-15 The Trustees Of Columbia University In The City Of New York Systems and methods for image archaeology
WO2009155281A1 (en) * 2008-06-17 2009-12-23 The Trustees Of Columbia University In The City Of New York System and method for dynamically and interactively searching media data
US20100017196A1 (en) * 2008-07-18 2010-01-21 Qualcomm Incorporated Method, system, and apparatus for compression or decompression of digital signals
US8671069B2 (en) 2008-12-22 2014-03-11 The Trustees Of Columbia University, In The City Of New York Rapid image annotation via brain state decoding and visual pattern mining
CN102982807B (zh) * 2012-07-17 2016-02-03 深圳广晟信源技术有限公司 用于对语音信号lpc系数进行多级矢量量化的方法和系统

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4901307A (en) 1986-10-17 1990-02-13 Qualcomm, Inc. Spread spectrum multiple access communication system using satellite or terrestrial repeaters
US5103459B1 (en) 1990-06-25 1999-07-06 Qualcomm Inc System and method for generating signal waveforms in a cdma cellular telephone system
ATE477571T1 (de) 1991-06-11 2010-08-15 Qualcomm Inc Vocoder mit veränderlicher bitrate
US5784532A (en) 1994-02-16 1998-07-21 Qualcomm Incorporated Application specific integrated circuit (ASIC) for performing rapid speech compression in a mobile telephone system
TW271524B (pt) 1994-08-05 1996-03-01 Qualcomm Inc
US5732389A (en) * 1995-06-07 1998-03-24 Lucent Technologies Inc. Voiced/unvoiced classification of speech for excitation codebook selection in celp speech decoding during frame erasures
US5664055A (en) * 1995-06-07 1997-09-02 Lucent Technologies Inc. CS-ACELP speech compression system with adaptive pitch prediction filter gain based on a measure of periodicity
US5699485A (en) * 1995-06-07 1997-12-16 Lucent Technologies Inc. Pitch delay modification during frame erasures
JP3680380B2 (ja) * 1995-10-26 2005-08-10 ソニー株式会社 音声符号化方法及び装置
DE19845888A1 (de) * 1998-10-06 2000-05-11 Bosch Gmbh Robert Verfahren zur Codierung oder Decodierung von Sprachsignalabtastwerten sowie Coder bzw. Decoder

Also Published As

Publication number Publication date
EP1212749B1 (en) 2006-03-29
CN1361913A (zh) 2002-07-31
AU6354600A (en) 2001-02-05
JP2003524796A (ja) 2003-08-19
CN1145930C (zh) 2004-04-14
KR20020033737A (ko) 2002-05-07
BRPI0012540B1 (pt) 2015-12-01
DE60027012D1 (de) 2006-05-18
EP1212749A1 (en) 2002-06-12
HK1045396B (zh) 2005-02-18
ES2264420T3 (es) 2007-01-01
ATE322068T1 (de) 2006-04-15
WO2001006495A1 (en) 2001-01-25
HK1045396A1 (en) 2002-11-22
JP4511094B2 (ja) 2010-07-28
KR100752797B1 (ko) 2007-08-29
DE60027012T2 (de) 2007-01-11
US6393394B1 (en) 2002-05-21

Similar Documents

Publication Publication Date Title
BR0012540A (pt) Método e equipamento para intercalar métodos de quantificação de informação de linha espectral em um codificador de fala
USRE49363E1 (en) Variable bit rate LPC filter quantizing and inverse quantizing device and method
JP3680380B2 (ja) 音声符号化方法及び装置
JP4005154B2 (ja) 音声復号化方法及び装置
US8428957B2 (en) Spectral noise shaping in audio coding based on spectral dynamics in frequency sub-bands
TW200703240A (en) Systems, methods, and apparatus for quantization of spectral envelope representation
KR20010080258A (ko) 음성 부호화 장치, 기록 매체, 음성 복호화 장치, 신호 처리용 프로세서, 음성 부호화 복호화 시스템, 통신용 기지국, 통신용 단말 및 무선 통신 시스템
EP1279167A1 (en) Method and apparatus for predictively quantizing voiced speech
WO2010079169A1 (en) Pyramid vector audio coding
SE9501640L (sv) Metod för förstärkningskvantisering vid linjärprediktiv talkodning med kodboksexcitering
CN103854655A (zh) 一种低码率语音编码器以及解码器
CN101256773A (zh) 导抗谱频率参数的矢量量化方法及装置
CA2155583C (en) Speech coder using a non-uniform pulse type sparse excitation codebook
KR20170098278A (ko) 부호화 장치, 복호 장치, 이들의 방법, 프로그램 및 기록 매체
Gerson et al. A 5600 bps VSELP speech coder candidate for half-rate GSM
CN104025191A (zh) 用于自适应多速率编解码器的改进方法和设备
Kuo et al. New LSP encoding method based on two-dimensional linear prediction
López-Soler et al. Linear inter-frame dependencies for very low bit-rate speech coding
JPH0786952A (ja) 音声の予測符号化方法
Kohata et al. Bit rate reduction of the MELP coder using Lempel-Ziv segment quantization
CA2118986C (en) Speech coding system
Han et al. An efficient differential LSFs quantization for Chinese mandarin speech
Ali et al. A very low bit rate codec for wide band speech based on a long-term perceptual harmonic plus noise model
JPH09120300A (ja) ベクトル量子化装置
Tadić et al. Gaussian mixture model-based quantization of line spectral frequencies for adaptive multirate speech codec

Legal Events

Date Code Title Description
B15K Others concerning applications: alteration of classification

Free format text: A CLASSIFICACAO ANTERIOR ERA: G10L 19/06

Ipc: G10L 19/07 (2013.01), G10L 19/038 (2013.01), G10L

B06A Patent application procedure suspended [chapter 6.1 patent gazette]
B09A Decision: intention to grant [chapter 9.1 patent gazette]
B16A Patent or certificate of addition of invention granted [chapter 16.1 patent gazette]

Free format text: PRAZO DE VALIDADE: 10 (DEZ) ANOS CONTADOS A PARTIR DE 01/12/2015, OBSERVADAS AS CONDICOES LEGAIS.