CA2259094A1 - A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders - Google Patents

A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders Download PDF

Info

Publication number
CA2259094A1
CA2259094A1 CA002259094A CA2259094A CA2259094A1 CA 2259094 A1 CA2259094 A1 CA 2259094A1 CA 002259094 A CA002259094 A CA 002259094A CA 2259094 A CA2259094 A CA 2259094A CA 2259094 A1 CA2259094 A1 CA 2259094A1
Authority
CA
Canada
Prior art keywords
codebook
vectors
collection
designing
small
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA002259094A
Other languages
French (fr)
Inventor
Claude Laflamme
Roch Lefebvre
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
VoiceAge Corp
Original Assignee
VoiceAge Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by VoiceAge Corp filed Critical VoiceAge Corp
Priority to CA002259094A priority Critical patent/CA2259094A1/en
Priority to PCT/CA2000/000036 priority patent/WO2000042601A1/en
Priority to AU30286/00A priority patent/AU3028600A/en
Publication of CA2259094A1 publication Critical patent/CA2259094A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0013Codebook search algorithms

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

A codebook is designed and searched in view of encoding a sound signal. This codebook consists of a set of codevectors each of dimension N, with a memory-efficient structure whereby a huge stochastic codebook is built from a collection of a small set of random vectors. The codebook is designed such that each codevector is obtained by the addition of several signed vectors from a small collection (for example 64) of random (e.g. Gaussian) vectors. For example a codebook which consists of the addition of two signed vectors from a collection of 64 Gaussian vectors gives rise to a 13-bit (8192-entry) codebook (6 bits for each of the two vector and 1 bit for the signs). Similarly, adding 3 vectors from a collection of 64 vectors gives rise to a 19-bit codebook. Besides the memory efficient structure of the codebook, a fast search procedure is used whereby only a small subset of the codebook is searched. In this fast search procedure, a small number of vectors from the collection of random vectors are predetermined, and the search is confined to the subset of codebook consisting of these pre-determined vectors.

Claims

CA002259094A 1999-01-15 1999-01-15 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders Abandoned CA2259094A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CA002259094A CA2259094A1 (en) 1999-01-15 1999-01-15 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders
PCT/CA2000/000036 WO2000042601A1 (en) 1999-01-15 2000-01-14 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders
AU30286/00A AU3028600A (en) 1999-01-15 2000-01-14 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CA002259094A CA2259094A1 (en) 1999-01-15 1999-01-15 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders

Publications (1)

Publication Number Publication Date
CA2259094A1 true CA2259094A1 (en) 2000-07-15

Family

ID=4163194

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002259094A Abandoned CA2259094A1 (en) 1999-01-15 1999-01-15 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders

Country Status (3)

Country Link
AU (1) AU3028600A (en)
CA (1) CA2259094A1 (en)
WO (1) WO2000042601A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100707174B1 (en) * 2004-12-31 2007-04-13 삼성전자주식회사 Apparatus and method for highband speech encoding and decoding in wideband speech encoding and decoding system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5396576A (en) * 1991-05-22 1995-03-07 Nippon Telegraph And Telephone Corporation Speech coding and decoding methods using adaptive and random code books
DE69328450T2 (en) * 1992-06-29 2001-01-18 Nippon Telegraph And Telephone Corp., Tokio/Tokyo Method and device for speech coding

Also Published As

Publication number Publication date
AU3028600A (en) 2000-08-01
WO2000042601A1 (en) 2000-07-20

Similar Documents

Publication Publication Date Title
US7363220B2 (en) Method for speech coding, method for speech decoding and their apparatuses
CN101154379B (en) Method and device for locating keywords in voice and voice recognition system
CA2666546A1 (en) Method and device for coding transition frames in speech signals
CA2163017A1 (en) Speech recognition method using a two-pass search
SE506379C2 (en) LPC speech encoder with combined excitation
CA2300077A1 (en) Speech coding apparatus and speech decoding apparatus
CN101317218B (en) Systems, methods, and apparatus for frequency-domain waveform alignment
US7096181B2 (en) Method for searching codebook
CA2084338A1 (en) Method for Speech Coding and Voice-Coder
CA2192143A1 (en) Speech coding device
CA2259094A1 (en) A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders
CA2090205A1 (en) Speech coding system
Kuo et al. Low bit-rate quantization of LSP parameters using two-dimensional differential coding
US20060149540A1 (en) System and method for supporting multiple speech codecs
Chen et al. Maximum-take-precedence ACELP: a low complexity search method
Hsu et al. Efficient and robust distributed speech recognition (DSR) over wireless fading channels: 2D-DCT compression, iterative bit allocation, short BCH code and interleaving
Ireton et al. On improving vector excitation coders through the use of spherical lattice codebooks (SLCs)
Ozawa et al. 4 kb/s improved CELP coder with efficient vector quantization
Shore et al. Discrete utterance speech recognition without time normalization
Ahmed et al. Fast methods for code search in CELP
Paul New developments in the Lincoln stack-decoder based large-vocabulary CSR system
Moulsley et al. Fast vector quantisation using orthogonal codebooks
Chen et al. A Study on Using Word-Level HMMs to Improve ASR Performance over State-of-the-Art Phone-Level Acoustic Modeling for LVCSR.
Sooraj et al. Performance analysis of CELP codec for Gaussian and fixed codebooks
Vasilache et al. Indexing and entropy coding of lattice codevectors

Legal Events

Date Code Title Description
FZDE Dead