CA2259094A1 - A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders - Google Patents
- Publication number
- CA2259094A1
- Authority
- CA
- Canada
- Prior art keywords
- codebook
- vectors
- collection
- designing
- small
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0004—Design or structure of the codebook
- G10L2019/0005—Multi-stage vector quantisation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0013—Codebook search algorithms
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
A codebook is designed and searched for the purpose of encoding a sound signal. The codebook consists of a set of codevectors, each of dimension N, and has a memory-efficient structure in which a very large stochastic codebook is built from a small collection of random vectors. Each codevector is obtained by adding several signed vectors drawn from a small collection (for example, 64) of random (e.g. Gaussian) vectors. For example, a codebook formed by adding two signed vectors from a collection of 64 Gaussian vectors gives a 13-bit (8192-entry) codebook: 6 bits for each of the two vectors and 1 bit for the signs. Similarly, adding three vectors from a collection of 64 vectors gives a 19-bit codebook (three 6-bit indices plus 1 sign bit, by the same count). In addition to the memory-efficient structure, a fast search procedure is used in which only a small subset of the codebook is searched: a small number of vectors from the collection of random vectors are predetermined, and the search is confined to the subset of the codebook built from these predetermined vectors.
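The bit count and the confined search described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method, only one plausible reading under stated assumptions: the abstract does not say how the single sign bit is applied, so the sketch assumes it flips the second vector relative to the first; it does not say how the vectors are predetermined, so the sketch assumes they are the `keep` vectors most correlated with the target; and it matches the target directly with a gain-normalised correlation criterion, ignoring any synthesis or weighting filter. The names `make_collection`, `codebook_bits`, `encode`, `decode`, `fast_search`, the dimension `N = 40` and the parameter `keep` are all hypothetical.

```python
import itertools
import numpy as np

M = 64   # collection size, from the abstract's example (assumed a power of two)
N = 40   # codevector dimension; assumed, the abstract only calls it N


def make_collection(m=M, n=N, seed=0):
    """Return m Gaussian random vectors of dimension n, one per row."""
    return np.random.default_rng(seed).standard_normal((m, n))


def codebook_bits(m, k):
    """Bits for k indices into an m-vector collection plus one sign bit."""
    return k * (m - 1).bit_length() + 1


def encode(i, j, sign_bit, m=M):
    """Pack two collection indices and one sign bit into a single index."""
    b = (m - 1).bit_length()
    return (int(i) << (b + 1)) | (int(j) << 1) | int(sign_bit)


def decode(index, collection):
    """Rebuild a codevector from its packed index.

    ASSUMPTION: the sign bit flips the second vector relative to the first.
    """
    m = len(collection)
    b = (m - 1).bit_length()
    i = index >> (b + 1)
    j = (index >> 1) & (m - 1)
    sign = -1.0 if index & 1 else 1.0
    return collection[i] + sign * collection[j]


def fast_search(target, collection, keep=8):
    """Search only codevectors built from a shortlist of `keep` vectors.

    ASSUMPTION: the shortlist holds the vectors with the largest absolute
    correlation with the target; the source does not specify the rule.
    """
    corr = collection @ target
    shortlist = np.argsort(-np.abs(corr))[:keep]
    best_index, best_score = None, -1.0
    for i, j in itertools.combinations(shortlist, 2):
        for sign_bit in (0, 1):
            sign = -1.0 if sign_bit else 1.0
            c = collection[i] + sign * collection[j]
            # Gain-normalised match: (x . c)^2 / (c . c)
            score = float((target @ c) ** 2 / (c @ c))
            if score > best_score:
                best_index = encode(i, j, sign_bit, len(collection))
                best_score = score
    return best_index, best_score
```

With `keep=8` the loop evaluates 2 × C(8, 2) = 56 candidates instead of all 8192 entries, and only the 64 collection vectors are stored rather than the full codebook. Setting `keep=64` in this sketch searches every pair of distinct vectors.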
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002259094A CA2259094A1 (en) | 1999-01-15 | 1999-01-15 | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
PCT/CA2000/000036 WO2000042601A1 (en) | 1999-01-15 | 2000-01-14 | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
AU30286/00A AU3028600A (en) | 1999-01-15 | 2000-01-14 | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002259094A CA2259094A1 (en) | 1999-01-15 | 1999-01-15 | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2259094A1 (en) | 2000-07-15 |
Family
ID=4163194
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002259094A Abandoned CA2259094A1 (en) | 1999-01-15 | 1999-01-15 | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU3028600A (en) |
CA (1) | CA2259094A1 (en) |
WO (1) | WO2000042601A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100707174B1 (en) * | 2004-12-31 | 2007-04-13 | 삼성전자주식회사 | High band Speech coding and decoding apparatus in the wide-band speech coding/decoding system, and method thereof |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5396576A (en) * | 1991-05-22 | 1995-03-07 | Nippon Telegraph And Telephone Corporation | Speech coding and decoding methods using adaptive and random code books |
EP0751496B1 (en) * | 1992-06-29 | 2000-04-19 | Nippon Telegraph And Telephone Corporation | Speech coding method and apparatus for the same |
1999
- 1999-01-15 CA CA002259094A (CA2259094A1): abandoned
2000
- 2000-01-14 AU AU30286/00A (AU3028600A): withdrawn
- 2000-01-14 WO PCT/CA2000/000036 (WO2000042601A1): application discontinued
Also Published As
Publication number | Publication date |
---|---|
WO2000042601A1 (en) | 2000-07-20 |
AU3028600A (en) | 2000-08-01 |
Similar Documents
Publication | Title |
---|---|
US8190428B2 (en) | Method for speech coding, method for speech decoding and their apparatuses |
CN101154379B (en) | Method and device for locating keywords in voice and voice recognition system |
CA2163017A1 (en) | Speech recognition method using a two-pass search |
CA2300077A1 (en) | Speech coding apparatus and speech decoding apparatus |
SE506379C3 (en) | Lpc speech encoder with combined excitation |
ATE410770T1 (en) | REDUCING THE MEMORY REQUIREMENTS OF A CODEBOOK VECTOR SEARCH |
Svendsen et al. | An improved sub-word based speech recognizer |
CA2084338A1 (en) | Method for Speech Coding and Voice-Coder |
CA2192143A1 (en) | Speech coding device |
CA2259094A1 (en) | A method and device for designing and searching large stochastic codebooks in low bit rate speech encoders |
CA2090205A1 (en) | Speech coding system |
Kuo et al. | Low bit-rate quantization of LSP parameters using two-dimensional differential coding |
Zheng et al. | The distance measure for line spectrum pairs applied to speech recognition. |
Chen et al. | Maximum-take-precedence ACELP: a low complexity search method |
Ozawa et al. | 4 kb/s improved CELP coder with efficient vector quantization |
Sooraj et al. | Performance analysis of CELP codec for Gaussian and fixed codebooks |
Vasilache et al. | Indexing and entropy coding of lattice codevectors |
Paul | New developments in the Lincoln stack-decoder based large-vocabulary CSR system |
Hwang et al. | Subphonetic modeling for speech recognition |
Moulsley et al. | Fast vector quantisation using orthogonal codebooks |
JP3144194B2 (en) | Audio coding device |
JP2001134298A (en) | Speech encoding device and speech decoding device, and speech encoding/decoding system |
Petrinović et al. | Sparse vector linear prediction with optimal structures |
Chen et al. | An investigation of phonological feature systems used in detection-based ASR |
KR100464310B1 (en) | Method for pattern matching using LSP |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |