WO2004090864A3 - Method and apparatus for the encoding and decoding of speech - Google Patents

Method and apparatus for the encoding and decoding of speech Download PDF

Info

Publication number
WO2004090864A3
WO2004090864A3 PCT/IN2004/000060 IN2004000060W WO2004090864A3 WO 2004090864 A3 WO2004090864 A3 WO 2004090864A3 IN 2004000060 W IN2004000060 W IN 2004000060W WO 2004090864 A3 WO2004090864 A3 WO 2004090864A3
Authority
WO
WIPO (PCT)
Prior art keywords
speech
quantised
pvq
quantisation
vector
Prior art date
Application number
PCT/IN2004/000060
Other languages
French (fr)
Other versions
WO2004090864A2 (en
WO2004090864B1 (en
Inventor
Preeti Rao
Original Assignee
Indian Inst Technology Bombay
Preeti Rao
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Indian Inst Technology Bombay, Preeti Rao filed Critical Indian Inst Technology Bombay
Publication of WO2004090864A2 publication Critical patent/WO2004090864A2/en
Publication of WO2004090864A3 publication Critical patent/WO2004090864A3/en
Publication of WO2004090864B1 publication Critical patent/WO2004090864B1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/038Vector quantisation, e.g. TwinVQ audio
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G10L19/07Line spectrum pair [LSP] vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)

Abstract

Methods and apparatus for encoding speech for communication to a decoder for reproduction of the speech signal where the speech signal is represented by the parameters of a speech model, and a specific quantisation' scheme is used for each parameter, with novel quantisation schemes for the spectral amplitudes. The spectral amplitudes are represented by line spectral frequencies (LSFs) and gain. The LSF vector is split into sub-vectors for quantisation by SNPVQ and frame-fill interpolation. The low-frequency split vector is quantised by an SN-PVQ scheme, and the high frequency split vector by SN-PVQ in the even-numbered frames and frame-fill interpolation in the odd-numbered frames. Optionally all LSF sub-vectors can be quantised by SN-PVQ. Further, the gain parameters of two frames are jointly quantised. These result in a system of encoder and decoder for speech coding with communication quality output speech at bit rates below 2 kbps.
PCT/IN2004/000060 2003-03-12 2004-03-12 Method and apparatus for the encoding and decoding of speech WO2004090864A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IN273/MUM/2003 2003-03-12
IN273MU2003 2003-03-12

Publications (3)

Publication Number Publication Date
WO2004090864A2 WO2004090864A2 (en) 2004-10-21
WO2004090864A3 true WO2004090864A3 (en) 2005-03-24
WO2004090864B1 WO2004090864B1 (en) 2005-05-19

Family

ID=33156203

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IN2004/000060 WO2004090864A2 (en) 2003-03-12 2004-03-12 Method and apparatus for the encoding and decoding of speech

Country Status (1)

Country Link
WO (1) WO2004090864A2 (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100857115B1 (en) 2005-10-05 2008-09-05 엘지전자 주식회사 Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US8068569B2 (en) 2005-10-05 2011-11-29 Lg Electronics, Inc. Method and apparatus for signal processing and encoding and decoding
WO2007040360A1 (en) * 2005-10-05 2007-04-12 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US8199828B2 (en) 2005-10-13 2012-06-12 Lg Electronics Inc. Method of processing a signal and apparatus for processing a signal
EP1949698A4 (en) * 2005-10-13 2009-12-30 Lg Electronics Inc Method and apparatus for signal processing
WO2007043844A1 (en) 2005-10-13 2007-04-19 Lg Electronics Inc. Method and apparatus for processing a signal
CA2729752C (en) 2008-07-10 2018-06-05 Voiceage Corporation Multi-reference lpc filter quantization and inverse quantization device and method
US8762136B2 (en) 2011-05-03 2014-06-24 Lsi Corporation System and method of speech compression using an inter frame parameter correlation
WO2016018185A1 (en) 2014-07-28 2016-02-04 Telefonaktiebolaget L M Ericsson (Publ) Pyramid vector quantizer shape search
JP7167335B2 (en) * 2018-10-29 2022-11-08 ドルビー・インターナショナル・アーベー Method and Apparatus for Rate-Quality Scalable Coding Using Generative Models
CN113808601B (en) * 2021-11-19 2022-02-22 信瑞递(北京)科技有限公司 Method, device and electronic equipment for generating RDSS short message channel voice code
CN115050378B (en) * 2022-05-19 2024-06-07 腾讯科技(深圳)有限公司 Audio encoding and decoding method and related products

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001022403A1 (en) * 1999-09-22 2001-03-29 Microsoft Corporation Lpc-harmonic vocoder with superframe structure
WO2002025638A2 (en) * 2000-09-15 2002-03-28 Conexant Systems, Inc. Codebook structure and search for speech coding

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001022403A1 (en) * 1999-09-22 2001-03-29 Microsoft Corporation Lpc-harmonic vocoder with superframe structure
WO2002025638A2 (en) * 2000-09-15 2002-03-28 Conexant Systems, Inc. Codebook structure and search for speech coding

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
CHAMBERLAIN M.W. ET AL.: "A 6000 bps MELP vocoder for use on HF channels", MILITARY COMMUNICATIONS CONFERENCE, vol. 1, 28 October 2001 (2001-10-28) - 31 October 2001 (2001-10-31), pages 447 - 453 *
MOUY B. ET AL.: "NATO SATANAG 4479: a standard for an 800 bps vocoder and channel coding in HF-ECCM system", ACOUSTIC, SPEECH, AND SIGNAL PROCESSING, 9 May 1995 (1995-05-09) *
WANG T. ET AL.: "A 1200 BPS speech coder based on MELP", 5 June 2000 (2000-06-05) *

Also Published As

Publication number Publication date
WO2004090864A2 (en) 2004-10-21
WO2004090864B1 (en) 2005-05-19

Similar Documents

Publication Publication Date Title
CN1815558B (en) Low bit-rate coding of unvoiced segments of speech
AU7830300A (en) Lpc-harmonic vocoder with superframe structure
CN1266674C (en) Closed-loop multimode mixed-domain linear prediction (MDLP) speech coder
CN1302459C (en) A low-bit-rate coding method and apparatus for unvoiced speed
CA2179228A1 (en) Method and apparatus for reproducing speech signals and method for transmitting same
WO2004090864A3 (en) Method and apparatus for the encoding and decoding of speech
ATE368279T1 (en) METHOD AND APPARATUS FOR QUANTIZING THE GAIN FACTOR IN A VARIABLE BIT RATE WIDEBAND VOICE ENCODER
KR20070038041A (en) Method and apparatus for voice trans-rating in multi-rate voice coders for telecommunications
US6687667B1 (en) Method for quantizing speech coder parameters
JPH0850500A (en) Voice encoder and voice decoder as well as voice coding method and voice encoding method
EP1597721B1 (en) 600 bps mixed excitation linear prediction transcoding
CN103236262B (en) A kind of code-transferring method of speech coder code stream
JP2002544551A (en) Multipulse interpolation coding of transition speech frames
JP3537008B2 (en) Speech coding communication system and its transmission / reception device.
EP1397655A1 (en) Method and device for coding speech in analysis-by-synthesis speech coders
US20050102136A1 (en) Speech codecs
US8352248B2 (en) Speech compression method and apparatus
JP2001051699A (en) Device and method for coding/decoding voice containing silence voice coding and storage medium recording program
PL1756806T3 (en) Method for quantifying an ultra low-rate speech encoder
JPH0411040B2 (en)
JP2001265390A (en) Voice coding and decoding device and method including silent voice coding operating with plural rates
Choi et al. Improvement issues on transcoding algorithms: for the flexible usage to the various pairs of speech codec
CN104658539A (en) Transcoding method for code stream of voice coder
CN112133318A (en) Digital voice coding device
KANG et al. Voice Communication Processing System(Patent)

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
B Later publication of amended claims

Effective date: 20041216

122 Ep: pct application non-entry in european phase