CA2259094A1 - A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders - Google Patents
A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders Download PDFInfo
- Publication number
- CA2259094A1 CA2259094A1 CA002259094A CA2259094A CA2259094A1 CA 2259094 A1 CA2259094 A1 CA 2259094A1 CA 002259094 A CA002259094 A CA 002259094A CA 2259094 A CA2259094 A CA 2259094A CA 2259094 A1 CA2259094 A1 CA 2259094A1
- Authority
- CA
- Canada
- Prior art keywords
- codebook
- vectors
- collection
- designing
- small
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 239000013598 vector Substances 0.000 abstract 11
- 230000005236 sound signal Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0004—Design or structure of the codebook
- G10L2019/0005—Multi-stage vector quantisation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0013—Codebook search algorithms
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
A codebook is designed and searched in view of encoding a sound signal. This codebook consists of a set of codevectors each of dimension N, with a memory-efficient structure whereby a huge stochastic codebook is built from a collection of a small set of random vectors. The codebook is designed such that each codevector is obtained by the addition of several signed vectors from a small collection (for example 64) of random (e.g. Gaussian) vectors. For example a codebook which consists of the addition of two signed vectors from a collection of 64 Gaussian vectors gives rise to a 13-bit (8192-entry) codebook (6 bits for each of the two vector and 1 bit for the signs). Similarly, adding 3 vectors from a collection of 64 vectors gives rise to a 19-bit codebook. Besides the memory efficient structure of the codebook, a fast search procedure is used whereby only a small subset of the codebook is searched. In this fast search procedure, a small number of vectors from the collection of random vectors are predetermined, and the search is confined to the subset of codebook consisting of these pre-determined vectors.
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002259094A CA2259094A1 (en) | 1999-01-15 | 1999-01-15 | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
PCT/CA2000/000036 WO2000042601A1 (en) | 1999-01-15 | 2000-01-14 | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
AU30286/00A AU3028600A (en) | 1999-01-15 | 2000-01-14 | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002259094A CA2259094A1 (en) | 1999-01-15 | 1999-01-15 | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2259094A1 true CA2259094A1 (en) | 2000-07-15 |
Family
ID=4163194
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002259094A Abandoned CA2259094A1 (en) | 1999-01-15 | 1999-01-15 | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU3028600A (en) |
CA (1) | CA2259094A1 (en) |
WO (1) | WO2000042601A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100707174B1 (en) * | 2004-12-31 | 2007-04-13 | 삼성전자주식회사 | Apparatus and method for highband speech encoding and decoding in wideband speech encoding and decoding system |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5396576A (en) * | 1991-05-22 | 1995-03-07 | Nippon Telegraph And Telephone Corporation | Speech coding and decoding methods using adaptive and random code books |
DE69328450T2 (en) * | 1992-06-29 | 2001-01-18 | Nippon Telegraph And Telephone Corp., Tokio/Tokyo | Method and device for speech coding |
-
1999
- 1999-01-15 CA CA002259094A patent/CA2259094A1/en not_active Abandoned
-
2000
- 2000-01-14 AU AU30286/00A patent/AU3028600A/en not_active Withdrawn
- 2000-01-14 WO PCT/CA2000/000036 patent/WO2000042601A1/en not_active Application Discontinuation
Also Published As
Publication number | Publication date |
---|---|
AU3028600A (en) | 2000-08-01 |
WO2000042601A1 (en) | 2000-07-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7363220B2 (en) | Method for speech coding, method for speech decoding and their apparatuses | |
CN101154379B (en) | Method and device for locating keywords in voice and voice recognition system | |
CA2666546A1 (en) | Method and device for coding transition frames in speech signals | |
CA2163017A1 (en) | Speech recognition method using a two-pass search | |
SE506379C2 (en) | LPC speech encoder with combined excitation | |
CA2300077A1 (en) | Speech coding apparatus and speech decoding apparatus | |
CN101317218B (en) | Systems, methods, and apparatus for frequency-domain waveform alignment | |
US7096181B2 (en) | Method for searching codebook | |
CA2084338A1 (en) | Method for Speech Coding and Voice-Coder | |
CA2192143A1 (en) | Speech coding device | |
CA2259094A1 (en) | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders | |
CA2090205A1 (en) | Speech coding system | |
Kuo et al. | Low bit-rate quantization of LSP parameters using two-dimensional differential coding | |
US20060149540A1 (en) | System and method for supporting multiple speech codecs | |
Chen et al. | Maximum-take-precedence ACELP: a low complexity search method | |
Hsu et al. | Efficient and robust distributed speech recognition (DSR) over wireless fading channels: 2D-DCT compression, iterative bit allocation, short BCH code and interleaving | |
Ireton et al. | On improving vector excitation coders through the use of spherical lattice codebooks (SLCs) | |
Ozawa et al. | 4 kb/s improved CELP coder with efficient vector quantization | |
Shore et al. | Discrete utterance speech recognition without time normalization | |
Ahmed et al. | Fast methods for code search in CELP | |
Paul | New developments in the Lincoln stack-decoder based large-vocabulary CSR system | |
Moulsley et al. | Fast vector quantisation using orthogonal codebooks | |
Chen et al. | A Study on Using Word-Level HMMs to Improve ASR Performance over State-of-the-Art Phone-Level Acoustic Modeling for LVCSR. | |
Sooraj et al. | Performance analysis of CELP codec for Gaussian and fixed codebooks | |
Vasilache et al. | Indexing and entropy coding of lattice codevectors |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |