CA2259094A1 - A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders - Google Patents

A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders Download PDF

Info

Publication number
CA2259094A1
CA2259094A1 CA002259094A CA2259094A CA2259094A1 CA 2259094 A1 CA2259094 A1 CA 2259094A1 CA 002259094 A CA002259094 A CA 002259094A CA 2259094 A CA2259094 A CA 2259094A CA 2259094 A1 CA2259094 A1 CA 2259094A1
Authority
CA
Canada
Prior art keywords
codebook
vectors
collection
designing
small
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA002259094A
Other languages
French (fr)
Inventor
Claude Laflamme
Roch Lefebvre
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
VoiceAge Corp
Original Assignee
VoiceAge Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by VoiceAge Corp filed Critical VoiceAge Corp
Priority to CA002259094A priority Critical patent/CA2259094A1/en
Priority to PCT/CA2000/000036 priority patent/WO2000042601A1/en
Priority to AU30286/00A priority patent/AU3028600A/en
Publication of CA2259094A1 publication Critical patent/CA2259094A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0013Codebook search algorithms

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

A codebook is designed and searched in view of encoding a sound signal. This codebook consists of a set of codevectors each of dimension N, with a memory-efficient structure whereby a huge stochastic codebook is built from a collection of a small set of random vectors. The codebook is designed such that each codevector is obtained by the addition of several signed vectors from a small collection (for example 64) of random (e.g. Gaussian) vectors. For example a codebook which consists of the addition of two signed vectors from a collection of 64 Gaussian vectors gives rise to a 13-bit (8192-entry) codebook (6 bits for each of the two vector and 1 bit for the signs). Similarly, adding 3 vectors from a collection of 64 vectors gives rise to a 19-bit codebook. Besides the memory efficient structure of the codebook, a fast search procedure is used whereby only a small subset of the codebook is searched. In this fast search procedure, a small number of vectors from the collection of random vectors are predetermined, and the search is confined to the subset of codebook consisting of these pre-determined vectors.

Claims

CA002259094A 1999-01-15 1999-01-15 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders Abandoned CA2259094A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CA002259094A CA2259094A1 (en) 1999-01-15 1999-01-15 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders
PCT/CA2000/000036 WO2000042601A1 (en) 1999-01-15 2000-01-14 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders
AU30286/00A AU3028600A (en) 1999-01-15 2000-01-14 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CA002259094A CA2259094A1 (en) 1999-01-15 1999-01-15 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders

Publications (1)

Publication Number Publication Date
CA2259094A1 true CA2259094A1 (en) 2000-07-15

Family

ID=4163194

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002259094A Abandoned CA2259094A1 (en) 1999-01-15 1999-01-15 A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders

Country Status (3)

Country Link
AU (1) AU3028600A (en)
CA (1) CA2259094A1 (en)
WO (1) WO2000042601A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100707174B1 (en) * 2004-12-31 2007-04-13 삼성전자주식회사 High band Speech coding and decoding apparatus in the wide-band speech coding/decoding system, and method thereof

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5396576A (en) * 1991-05-22 1995-03-07 Nippon Telegraph And Telephone Corporation Speech coding and decoding methods using adaptive and random code books
EP0751496B1 (en) * 1992-06-29 2000-04-19 Nippon Telegraph And Telephone Corporation Speech coding method and apparatus for the same

Also Published As

Publication number Publication date
WO2000042601A1 (en) 2000-07-20
AU3028600A (en) 2000-08-01

Similar Documents

Publication Publication Date Title
US8190428B2 (en) Method for speech coding, method for speech decoding and their apparatuses
CN101154379B (en) Method and device for locating keywords in voice and voice recognition system
CA2163017A1 (en) Speech recognition method using a two-pass search
CA2300077A1 (en) Speech coding apparatus and speech decoding apparatus
SE506379C3 (en) Lpc speech encoder with combined excitation
ATE410770T1 (en) REDUCING THE MEMORY REQUIREMENTS OF A CODEBOOK VECTOR SEARCH
Svendsen et al. An improved sub-word based speech recognizer
CA2084338A1 (en) Method for Speech Coding and Voice-Coder
CA2192143A1 (en) Speech coding device
CA2259094A1 (en) A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders
CA2090205A1 (en) Speech coding system
Kuo et al. Low bit-rate quantization of LSP parameters using two-dimensional differential coding
Zheng et al. The distance measure for line spectrum pairs applied to speech recognition.
Chen et al. Maximum-take-precedence ACELP: a low complexity search method
Ozawa et al. 4 kb/s improved CELP coder with efficient vector quantization
Sooraj et al. Performance analysis of CELP codec for Gaussian and fixed codebooks
Vasilache et al. Indexing and entropy coding of lattice codevectors
Paul New developments in the Lincoln stack-decoder based large-vocabulary CSR system
Hwang et al. Subphonetic modeling for speech recognition
Moulsley et al. Fast vector quantisation using orthogonal codebooks
JP3144194B2 (en) Audio coding device
JP2001134298A (en) Speech encoding device and speech decoding device, and speech encoding/decoding system
Petrinović et al. Sparse vector linear prediction with optimal structures
Chen et al. An investigation of phonological feature systems used in detection-based ASR
KR100464310B1 (en) Method for pattern matching using LSP

Legal Events

Date Code Title Description
FZDE Dead