WO1993006590A1 - A speech coding device - Google Patents
A speech coding device Download PDFInfo
- Publication number
- WO1993006590A1 WO1993006590A1 PCT/BE1991/000067 BE9100067W WO9306590A1 WO 1993006590 A1 WO1993006590 A1 WO 1993006590A1 BE 9100067 W BE9100067 W BE 9100067W WO 9306590 A1 WO9306590 A1 WO 9306590A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- code
- input
- word
- code word
- words
- Prior art date
Links
- 230000000737 periodic effect Effects 0.000 claims abstract description 11
- 230000005284 excitation Effects 0.000 claims abstract description 5
- 230000007774 longterm Effects 0.000 description 2
- 238000000034 method Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 235000009508 confectionery Nutrition 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0011—Long term prediction filters, i.e. pitch estimation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0013—Codebook search algorithms
- G10L2019/0014—Selection criteria for distances
Definitions
- the invention relates to a speech coding device having a first input for receiving a first error word determined upon an ideal and an estimated excitation word themselves determined from a speech ' signal, said first input being connected to a second input of a first unit, a third input of which being connected to a code word generator, said first unit being provided for determining a second error word upon inputted words, an output of said first unit being connected to a filter unit which is close looped with said code word generator, said code word generator being provided for generating a series of code words, said coding device comprises processing means provided for determining upon words output by said filter unit an energy value and for selecting among said code words the one having produced the lowest energy value.
- Such a speech coding device is known from the article of H. Hassanein, A Brind' Amour and K. Bryden entitled "A 4800 bps
- the first error word input into the speech coding device originates for example from a weighting filter unit.
- the code word generator comprises a code book having a first respectively a second part wherein periodic code words for voiced speech signals respectively non-periodic code words for unvoiced speech signals are stored. The choice between a first or a second code word being realized upon an analysis of the input speech signal in order to determine whether the input speech signal is voice or unvoiced. Such an analysis is done by using the
- LTP Long Term Prediction
- word be it an error word or a code word means a sequence of binary samples.
- a drawback of the known speech coding device is that the generated code word is fully dependent of the choice made between a voiced or unvoiced speech signal. Even if the choice is correctly made, it does not necessarily imply that the best code word is found in the selected code book. That best code word could as well be found in the non-chosen code book. When the choice voiced- unvoiced is not correctly done, this could also lead to a wrong choice of code book.
- the basic problem in chosing between a voiced and unvoiced speech signal is that a lot of speech signals are in fact most of the time built up by a mixture of voiced and unvoied signals.
- a speech coding device is therefore characterized in that said code word generator being provided for generating a first series of first code words having a substantially periodic character with a period associated with an input pitch period , said code word generator being also provided for genera ⁇ ting a second series of second code word having a substantially non- periodic character. Since the code word generator generates now as well first as second code words it is no longer necessary to make a choice between voiced and unvoiced speech so that errors caused by an erroneous choice are avoided.
- a first preferred embodiment of a speech coding device is characterized in that said code word generator is provided for generating a third series of third code words having a substantially periodic character with a period correspon- ding to a fraction m (m £ R) or a multiple r (r ⁇ . R ) of said input pitch value.
- the pitch value is generally determined by an LTP analysis.
- Figure 1 shows an embodiment of a coding device according to the invention.
- Figure 2 respectively 3 shows a first respectively a second series of code words.
- the illustrated speech coding device comprises a first input 1 for receiving a first error word.
- This first error word is for example determined by computing the difference between an ideal and an estimated excitation word obtained after processing a speech signal which originates for example from a human voice.
- the first input 1 is connected to a second input of a first unit 2.
- the first unit is for example formed by a subtracting unit.
- An output of said first unit 2 is connected to a filter unit 3, which is close looped via a unit provided for determining the energy value of the output signal of said filter unit 3, and a minimization unit 5 with a code word generator 6.
- An output of the code word generator 6 is connected via a gain multiplying element 7 with a third input of the first unit 2.
- An input first error word is supplied to the first unit 2 which also receives an estimated word. Said estimated word being formed by multiplying a code word output by the code word generator 6 with a gain value. The latter operation being realized by element 7.
- the first unit deducts the estimated word from the first word and thus forms a second error word which is supplied to the filter unit 3 in order to form a third error word.
- the unit compu ⁇ tes the energy value of input third error word. This energy value is then input into the minimization unit 5 which controls the code word generator 6.
- the code word generator 6 is provided for generating a first series of first code words as well as a second series of second code words. For the sake of clarity, the first code word will first be considered.
- Figure 2 shows an example of a first series of first code words.
- L 50 samples
- P 20 samples.
- the distance between successive * non-zero bits in the first code words is each time 20 samples, i.e. the pitch length P.
- Each subsequent (b, c,..i) first code word of said first series is then each time obtained by time directional shifting the non-zero bits over one sample period.
- the first code word illustrated under b in figure 2 has non-zero bits at sample position 1, 21 and 41 while the last first code word (i in figure 2) has only one non-zero bit at sample position 49.
- P * L however if P> L there is of course only one non-zero bit in each first code word.
- each first code word of said first series is successively supplied to said third input of the first unit in order to determine the second and third error word.
- the code word generator is further provided for selecting among the first code words, the one having produced the third error word with the lowest energy value. This is for example realized each time when comparing the energy value obtained by the considered code word with a temporarily memorized lowest energy obtained by a preceding first code word during a same operation and by overruling the memorized lowest energy value if the latter is larger than the one obtained by the considered first code word.
- the code word generator is also provided for generating a second series of second code words upon a received predetermined code value Q indicating the number of bits quantified as non-zero bits that have to be considered in said second code words.
- the signal e_ illustrates an example of an analogous form of a first error word
- the distance between the successive non-zero bits being determined by L/Q, where L is again the subframe length.
- the code word generator comprises processing means for determining upon said second code word f_ and said first error word a first resp. a second candidate word f_ resp. f . of which an example is shown in figure 3. That first candidate word f, is for example obtained by multiplying f_ with a binary form of e_, while
- the second candidate word f is obtained by inversing the first non-zero bit value of the first candidate word.
- the first and the second candidate words are now successively input into the first device and the filter unit in order to obtain each time a third error word e, . and e, _.
- the processing 1-5 means of the code word generator being also provided for comparing e. . with e, _ and for selecting among both third error words, the one having the lowest energy value.
- the candidate word having produced the third error word with the lowest energy value is then considered for further processing.
- the first candidate word is chosen for further processing because its related lower energy value indicates that this candidate word is closer to the first error word.
- a third candidate word f now generated by taking
- the energy value e, . of the remaining candidate word is compared
- the code word generator of the speech coding device is preferably provided for generating a third series of third code words.
- the third code words are built up in an analogous manner as the first code words but the distance between successive bits quantified as non-zero bits now equals a fraction m (m lR) of a multiple r .r _£ ⁇ R) of the input pitch value P.
- m 2 and thus the distance between successive non-zero bits is P/2.
- the third code words are considered in an analogous manner as the one of the first series.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP91915344A EP0558492A1 (en) | 1991-09-20 | 1991-09-20 | A speech coding device |
JP3514292A JPH06502928A (en) | 1991-09-20 | 1991-09-20 | audio coding element |
PCT/BE1991/000067 WO1993006590A1 (en) | 1991-09-20 | 1991-09-20 | A speech coding device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/BE1991/000067 WO1993006590A1 (en) | 1991-09-20 | 1991-09-20 | A speech coding device |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1993006590A1 true WO1993006590A1 (en) | 1993-04-01 |
Family
ID=3885301
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/BE1991/000067 WO1993006590A1 (en) | 1991-09-20 | 1991-09-20 | A speech coding device |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP0558492A1 (en) |
JP (1) | JPH06502928A (en) |
WO (1) | WO1993006590A1 (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0137532A2 (en) * | 1983-08-26 | 1985-04-17 | Koninklijke Philips Electronics N.V. | Multi-pulse excited linear predictive speech coder |
EP0279451A2 (en) * | 1987-02-20 | 1988-08-24 | Fujitsu Limited | Speech coding transmission equipment |
-
1991
- 1991-09-20 WO PCT/BE1991/000067 patent/WO1993006590A1/en not_active Application Discontinuation
- 1991-09-20 EP EP91915344A patent/EP0558492A1/en not_active Withdrawn
- 1991-09-20 JP JP3514292A patent/JPH06502928A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0137532A2 (en) * | 1983-08-26 | 1985-04-17 | Koninklijke Philips Electronics N.V. | Multi-pulse excited linear predictive speech coder |
EP0279451A2 (en) * | 1987-02-20 | 1988-08-24 | Fujitsu Limited | Speech coding transmission equipment |
Also Published As
Publication number | Publication date |
---|---|
JPH06502928A (en) | 1994-03-31 |
EP0558492A1 (en) | 1993-09-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8364473B2 (en) | Method and apparatus for receiving an encoded speech signal based on codebooks | |
AU683126B2 (en) | Linear prediction coefficient generation during frame erasure or packet loss | |
EP0696026B1 (en) | Speech coding device | |
US5138661A (en) | Linear predictive codeword excited speech synthesizer | |
CA1337217C (en) | Speech coding | |
US6249758B1 (en) | Apparatus and method for coding speech signals by making use of voice/unvoiced characteristics of the speech signals | |
MY119252A (en) | Depth-first algebraic-codebook search for fast coding of speech | |
EP0232456A1 (en) | Digital speech processor using arbitrary excitation coding | |
US5488704A (en) | Speech codec | |
EP0654909A1 (en) | Code excitation linear prediction encoder and decoder | |
US7302387B2 (en) | Modification of fixed codebook search in G.729 Annex E audio coding | |
US6629070B1 (en) | Voice activity detection using the degree of energy variation among multiple adjacent pairs of subframes | |
EP0784846B1 (en) | A multi-pulse analysis speech processing system and method | |
US5642368A (en) | Error protection for multimode speech coders | |
US5875423A (en) | Method for selecting noise codebook vectors in a variable rate speech coder and decoder | |
EP0578436B1 (en) | Selective application of speech coding techniques | |
JPH1097294A (en) | Voice coding device | |
WO1993006590A1 (en) | A speech coding device | |
RU2667462C1 (en) | Method of recognizing low-speed speech coding protocols | |
EP0745972B1 (en) | Method of and apparatus for coding speech signal | |
EP0903729B1 (en) | Speech coding apparatus and pitch prediction method of input speech signal | |
RU2610285C1 (en) | Method of detecting low-rate encoding protocols | |
AU682505B2 (en) | Vector coding process, especially for voice signals | |
JP2700974B2 (en) | Audio coding method | |
US6289307B1 (en) | Codebook preliminary selection device and method, and storage medium storing codebook preliminary selection program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AU CA JP US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FR GB GR IT LU NL SE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1991915344 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 1991915344 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: CA |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 1991915344 Country of ref document: EP |