EP0545386A2 - Method for speech coding and voice-coder - Google Patents

Method for speech coding and voice-coder Download PDF

Info

Publication number
EP0545386A2
EP0545386A2 EP92120573A EP92120573A EP0545386A2 EP 0545386 A2 EP0545386 A2 EP 0545386A2 EP 92120573 A EP92120573 A EP 92120573A EP 92120573 A EP92120573 A EP 92120573A EP 0545386 A2 EP0545386 A2 EP 0545386A2
Authority
EP
European Patent Office
Prior art keywords
excitation
codevector
codebook
adaptive
candidate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP92120573A
Other languages
German (de)
French (fr)
Other versions
EP0545386A3 (en
EP0545386B1 (en
Inventor
Toshiki Miyano
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Publication of EP0545386A2 publication Critical patent/EP0545386A2/en
Publication of EP0545386A3 publication Critical patent/EP0545386A3/en
Application granted granted Critical
Publication of EP0545386B1 publication Critical patent/EP0545386B1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0003Backward prediction of gain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation

Definitions

  • the present invention relates to a method for speech coding and to a voice-coder, particularly to a method for speech coding and to a voice-coder which can achieve high coding quality with a relatively small operation at bit rates not greater than 8 kbit/s.
  • CELP system using an adaptive codebook described in a paper (hereinafter referred to as literature 2) titled "IMPROVED SPEECH QUALITY AND EFFICIENT VECTOR QUANTIZATION IN SELP" (Proc. ICASSP, pp. 155-158, 1988) by W. B. Kleijin, D. J. Krasinski and R. H. Ketchum.
  • the CELP system using the adaptive codebook receives speech signals divided into frames spaced with a constant interval.
  • the CELP utilizes a linear predictive analyzer for obtaining spectral parameters of input speech signals, the adaptive codebook having excitation signals determined in the past, the excitation codebook comprising random numbers to be used for vector quantization of the excitation signals of said input speech signals.
  • the CELP selects an adaptive codevector by using the input speech signal and the synthesized signal of the adaptive codevector for every subframe made by equally dividing the frame. Subsequently, CELP performs selection of excitation codevectors by using the input signals, the synthesized signal of the selected adaptive codevector and said excitation codevector.
  • the CELP systems have the following disadvantage, in that a quite large operation is required for searching the excitation codebook. Moreover since the adaptive codebook is determined independently of the excitation codebook, it is impossible to get a high SN (signal to noise) ratio. Further in the above CELP system, although the adaptive codebook and the excitation codebook are each searched by using gains not quantized, it becomes possible to obtain a higher SN ratio when the adaptive codebook and the excitation codebook are searched for all the quantization value of gains. Furthermore, it is impossible to obtain sufficiently good speech quality with low bit rates such as 8 kbit/s or less because of the too small size of the excitation codebook.
  • An object of the present invention is to provide a method for speech coding which can solve the above problem of the conventional method and achieve high quality speech by a relatively small operation even at the low bit rates such as less than 8 kbit/s.
  • Another object of the present invention is to provide a voice-coder which can solve the above problem of the conventional method and achieve high quality speech by a relatively small operation even at low bit rates such as less than 8 kbit/s.
  • the object of the present invention can be achieved by a method for speech coding for coding speech signals divided into frames spaced with a constant interval, wherein an adaptive codebook storing excitation signals determined in the past and a plurality of excitation codebooks for multi-stage vector quantization of an excitation signal of the input speech signal are prepared; a spectral parameter of said input speech signal is obtained; said frame is divided into subframes; a candidate of a first fixed number of adaptive codevectors is selected for every said subframe from said adaptive codebook by using said input speech signal and said spectral parameter; candidates of a second fixed number of excitation codevectors are selected for every said subframe from said excitation codebooks, respectively, by using said input speech signal, said spectral parameter and the candidate of said adaptive codevector; and a combination of the adaptive vector and each of the excitation codevectors forming an excitation signal of said subframe is selected from the candidates of said adaptive codevector and each of said excitation codevector by using said input speech signal and said spectral parameter.
  • a voice-coder for coding speech signals divided into frames spaced with a constant interval, comprising: linear prediction analysis means for outputting spectral parameters of input speech signals; an adaptive codebook for storing excitation signals determined in the past; a plurality of excitation codebooks provided for multi-stage vector quantization of the excitation signal of said input speech signals; wherein, in case of searching for a combination of the adaptive codevector and each of the excitation codevectors for every subframe prepared by further division of said frame, from said adaptive codebook and each of said excitation codebooks, respectively, said combination of the adaptive codevector and each of the excitation codevectors forming an excitation signal of said subframe; a candidate of a first predetermined number of adaptive codevectors is selected from said adaptive codebook by using said input speech signal and said spectral parameter; candidates of each predetermined number of excitation codevectors are selected from a plurality of said excitation codebooks respectively by using said input speech signal, said spectral parameter and the candidate
  • a voice-coder for coding speech signals divided into frames spaced with a constant interval, comprising: linear prediction analysis means for outputting spectral parameters of input speech signals; an adaptive codebook storing excitation signals determined in the past; a plurality of excitation codebooks provided for multi-stage vector quantization of an excitation signal of said input speech signals; subframe division means for generating subframe signals by dividing said frame into subframes; first selection means for selecting a candidate of a first fixed number of adaptive codevectors from said adaptive codebook in accordance with said subframe signal and said spectral parameter; second selection means provided for every said excitation codebook for selecting the candidate of the excitation codevectors of the number predetermined for every excitation codebook, from the corresponding excitation codebook in accordance with said subframe signal, said spectral parameter and the candidate of said adaptive codevector; and means for searching the candidate of the adaptive vector and each of the excitation codevectors which forms the excitation signal of said subframe, from the candidate of
  • Fig. 1 is a block diagram showing the structure of a voice-coder of a first embodiment of the present invention.
  • Fig. 2 is a block diagram showing the structure of a voice-coder of a second embodiment of the present invention.
  • Fig. 3 is a block diagram showing the structure of a voice-coder of a third embodiment of the present invention.
  • Fig. 4 is a block diagram showing the structure of a voice-coder of a fourth embodiment of the present invention.
  • FIG. 1 A first preferable embodiment of the present invention will be described with reference to Fig. 1.
  • voice-coder shown in Fig.1 there are provided adaptive codebook 175, gain codebook 220 and two kinds of excitation codebooks 180, 190.
  • Speech input circuit 100 is provided for receiving speech signals divided into frames spaced with a constant interval.
  • Subframe division circuit 120 and linear prediction analysis circuit 110 are provided on the output side of speech input circuit 100.
  • Subframe division circuit 120 outputs subframes by equally dividing the frame, and linear prediction analysis circuit 110 performs linear prediction analyses of speech signals composing frames for obtaining spectral parameters of the speech signals.
  • weighting filter 130 is provided for performing perceptual weighting by receiving subframes and spectral parameters.
  • influence signal subtracter 140 is provided for subtracting weighted influence signal from the preceding subframe and outputting the results thereof.
  • Adaptive codebook 175 stores excitation signals decided in the past as adaptive codevectors.
  • adaptive codebook candidate selection circuit 150 is provided for selecting the previously fixed number of adaptive codevectors and for outputting thereof as candidates of adaptive codevectors.
  • Adaptive codebook candidate selection circuit 150 performs selection of the candidate according to the spectral parameter and the output signal of influence signal subtracter 140.
  • First and second excitation codebooks 180, 190 operate for multi-stage vector quantization of the excitation signal, and store the first and second excitation codevectors, respectively.
  • candidate selection circuits 160, 170 for the first and second excitation codebooks are provided respectively.
  • Candidate selection circuits 160, 170 select the previously fixed number of excitation codevectors from corresponding respective excitation codebooks 180, 190 and output thereof as the candidates of the excitation codevectors.
  • Spectral parameters, output signals of the influence signal subtracter and candidates of adaptive codevectors are inputted into each of candidate selection circuits 160, 170 for the excitation codebook.
  • Optimum combination search circuit 200 is provided for candidates selected by candidate selection circuits 150, 160, 170 for the corresponding codebooks in order to search the optimum combination of candidates. Further, gain codebook search circuit 210 and multiplexer 230 are provided. Optimum combination search circuit 200 is structured so as to output to multiplexer 230 the delay (to the adaptive codevector) or index (to each excitation codevector) with reference to each of the respective optimum adaptive codevectors, to the first and second excitation codevectors according to the results of the search, and to the output weighted synthetic signals of the above vectors to gain codebook search circuit 210, respectively.
  • Gain codebook search circuit 210 searches for the optimum gain codevector from gain codebook 220 which stores gain codevectors, and outputs the index of thus searched optimum gain codevector.
  • Multiplexer 230 is structured so as to receive delay or indices from optimum combination search circuit 200 or gain codebook search circuit 210, and output codes which correspond to input speech signals according to delay or indices.
  • the excitation signal is processed by two-stage vector quantization by using two kinds of excitation codebooks 180, 190.
  • the predetermined number L0 of the adaptive codevectors is selected, in order, from the one with smaller error E0 expressed by equation (1):
  • E0 ⁇ z - ⁇ 0sa d ⁇ 2 (1)
  • z is a signal obtained by subtracting an influence signal from a perceptually weighted input signal, sa d a perceptually weighted synthetic signal of adaptive codevector a d with delay d, ⁇ 0 a sequential optimum gain of an adaptive codevector, ⁇ ⁇ Euclid norm.
  • the sequential optimum gain ⁇ 0 of the adaptive codevector is given by:
  • candidates of predetermined number L1 of the first excitation codevectors are selected for each L0 piece of the adaptive codevectors selected by candidate selection circuit 150 for the adaptive codebook, in order, from the one with smaller error E1 expressed by equation (4):
  • E1 ⁇ za - ⁇ 0se i 1 ⁇ 2 (4)
  • se i 1 is a perceptually weighted synthesized signal of first excitation codevector e i 1 with index i, ⁇ 0 a sequential optimum gain of the first excitation codevector
  • za z - ⁇ 0 sa d .
  • equation (6) By substituting above equation (5) into equation (4), equation (6) below is obtained:
  • equation (9) is obtained.
  • error E is calculated by the following equation for all the combinations of candidates of the selected adaptive codevectors, and the first and second excitation codevectors, and then the combination of the candidates with minimum E is searched.
  • E ⁇ z- ⁇ sa d - ⁇ se11- ⁇ se j 2 ⁇ 2 (10) where ⁇ , ⁇ , ⁇ are simultaneous optimum gains of an adaptive codevector, the first and second excitation codevectors, respectively.
  • Speech input circuit 100 receives speech signals divided into each frame (e.g., 40 ms in width), which signals are outputted to linear prediction analysis circuit 110 and subframe division circuit 120.
  • linear prediction analysis circuit 110 linear prediction analysis of the inputted speech signal is performed for calculating the spectral parameter.
  • This spectral parameter is outputted to weighting filter 130, to influence signal subtracter 140, to candidate selection circuit 150 for the adaptive codebook, to candidate selection circuit 160 for the first excitation codebook, to candidate selection circuit 170 for the second excitation codebook, and to multiplexer 230.
  • a frame is divided into subframes (e.g., 8 ms in width) by subframe division circuit 120. Speech signals divided into subframes are inputted into weighting filter 130.
  • Weighting filter 130 performs perceptual weighting of inputted speech signals, and outputs the results to influence signal subtracter 140.
  • Influence signal subtracter 140 subtracts the weighted influence signal from the preceding subframe, and outputs the result to candidate selection circuit 150 for the adaptive codebook, to candidate selection circuit 160 for the first excitation codebook, to candidate selection circuit 170 for the second excitation codebook, and to gain codebook search circuit 210.
  • Candidate selection circuit 160 for the first excitation codebook selects the candidate of L1 pieces of the first excitation codevector from first excitation codebook 180, according to the output of the influence signal subtracter, the spectral parameter and the candidate of the adaptive codevector by using equation (6).
  • Candidate selection circuit 160 for the first excitation codebook outputs the weighted synthetic signal and index of the candidate of the selected first excitation codevector to optimum combination search circuit 200.
  • candidate selection circuit 170 for the second excitation codebook selects the candidate of the second excitation codevector from the second excitation codebook according to equation (9), and outputs the weighted synthetic signal and index of the selected second excitation codevector to optimum combination search circuit 200.
  • Optimum combination search circuit 200 searches for the combination of the optimum candidates according to equation (14), and outputs the delay of the adaptive codevector and the indices of the first and second excitation codevectors to multiplexer 230, and weighted synthetic signals of each codevector to gain codebook search circuit 210.
  • Gain codebook search circuit 210 searches for the optimum gain codevector from gain codebook 220 according to each of the inputted weighted synthetic signals, and outputs the index of thus obtained gain codevector to multiplexer 230.
  • multiplexer 230 assembles and outputs the code for the speech signal divided into subframes according to the delay and index outputted from optimum combination search circuit 200 and to the index outputted from gain codebook search circuit 210.
  • the candidates are selected first from the adaptive codebook and each of excitation codebooks, and then the optimum combination is selected from the combination of each of thus selected candidates, so that a sufficiently good speech quality can be obtained with a relatively small operation.
  • the gain codebook which stores the quantized gain vectors is used for selecting the optimum combination from combinations of the candidates, SN ratio is further improved.
  • each block attached with the same reference numeral as that in Fig. 1 has the same function as that in Fig. 1.
  • the voice-coder in Fig. 2 when compared with the voice-coder in Fig. 1, differs in that it has no gain codebook search circuit and optimum combination search circuit, but has instead gain-including optimum combination search circuit 300.
  • Gain-including optimum combination search circuit 300 receives candidates of the adaptive codevectors, candidates of the first and second excitation codevectors, and outputs of influence signal subtracter 140, and selects the optimum combination from all of the combinations of the candidates and gain codevectors by searching for gain codebook 220.
  • Gain-including optimum combination search circuit 300 is structured so as to output the delay or index of each codevector composing the selected combination to multiplexer 230 according to the selected combination.
  • this voice-coder will be described. It is the same as that of the voice-coder shown in Fig. 1 except that the candidates of vectors are outputted from each of candidate selection circuits 150, 160 and 170. These candidates of codevectors are inputted into gain-including optimum combination search circuit 300, whereby the optimum combination of candidates is searched according to equation (16). Then consulting the searched combination, the delay of the adaptive codevector and indices of the first and second excitation codevectors and gain codevectors are inputted into multiplexer 230, from which speech signal codes are outputted.
  • each block attached with the same reference numeral as that in Fig. 1 has the same function as that in Fig. 1.
  • This voice-coder differs from the one shown in Fig. 1 in that the second excitation codebook is composed of excitation super codebook 390.
  • a super codebook means a codebook which stores codevectors with the number of bits larger than the number of bits to be transmitted.
  • Index i of the candidate of the first excitation codevector is outputted from first excitation codebook selection circuit 160 to second excitation super codebook 390.
  • the selection of the candidate of the second excitation codevectors from second excitation super codebook 390 is carried out by searching codevectors from a portion of second excitation super codebook 390, the portion being expressed by set F2(i) of indices to be determined according to index i of the first excitation codevector.
  • optimum combination search circuit 200 When searching of the candidates of the first and second codevectors is finished, then the optimum combination of candidates is searched in optimum combination search circuit 200 according to equation (14) as searched in the first embodiment.
  • optimum combination search circuit 200 can search the optimum combination from the combination of the candidate of the adaptive codevectors, the candidate of the first excitation codevectors, and all of the second excitation codevectors corresponding to set F2(i).
  • each block attached with the same reference numeral as that in Fig. 2 has the same function as that in Fig. 2.
  • This voice-coder uses second excitation super codebook 390 instead of the second excitation codebook, differently from the voice-coder in Fig. 2.
  • Super codebook 390 is similar to the super codebook in the voice-coder shown in Fig. 3.
  • the candidate of the second excitation codevector to be selected from second excitation super codebook 390 is also selected in the same way as in the third embodiment, and other operations are conducted in the same manner as in the second embodiment.
  • each embodiment of the present invention has been described above, the operation of each embodiment can be modified in such a way that auto-correlation ⁇ se i , se i ⁇ of weighted synthetic signal se i of the excitation codevector is obtained according to the following equation for the purpose of reducing the operation: where hh is an auto-correlation function of the impulse response of a weighting synthesis filter, ee i an auto-correlation function of the excitation code vector with index i, and im a length of the impulse response.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)

Abstract

A method for speech coding and a voice-coder for coding speech signals divided into frames spaced with a constant interval are disclosed. An adaptive codebook storing excitation signal determined in advance and a plurality of excitation codebooks for multi-stage vector quantization are provided. Each frame is divided into subframes. For each subframe, a candidate of a first predetermined number of adaptive codevectors is selected, and then candidates of each predetermined number of excitation codevectors are selected from each excitation codebook, respectively, by using the candidate of the adaptive codevector. Finally, a combination of the adaptive codevector and each of the excitation codevector is selected from the candidates of the adaptive codevector and each of the sound codevectors.

Description

  • The present invention relates to a method for speech coding and to a voice-coder, particularly to a method for speech coding and to a voice-coder which can achieve high coding quality with a relatively small operation at bit rates not greater than 8 kbit/s.
  • As a speech coding system to be applied to vector quantization of excitation signals at low bit rates by using a excitation codebook comprising random numbers, a CELP system described in a paper (hereinafter referred to as literature 1) titled "CODE-EXCITED LINEAR PREDICTION (CELP): HIGH-QUALITY SPEECH AT VERY LOW BIT RATES" (Proc. ICASSP, pp. 937-940, 1985) by Manfred R. Shroeder and Bishnu S. Atal is known. There is also a CELP system using an adaptive codebook described in a paper (hereinafter referred to as literature 2) titled "IMPROVED SPEECH QUALITY AND EFFICIENT VECTOR QUANTIZATION IN SELP" (Proc. ICASSP, pp. 155-158, 1988) by W. B. Kleijin, D. J. Krasinski and R. H. Ketchum. The CELP system using the adaptive codebook receives speech signals divided into frames spaced with a constant interval. The CELP utilizes a linear predictive analyzer for obtaining spectral parameters of input speech signals, the adaptive codebook having excitation signals determined in the past, the excitation codebook comprising random numbers to be used for vector quantization of the excitation signals of said input speech signals. The CELP selects an adaptive codevector by using the input speech signal and the synthesized signal of the adaptive codevector for every subframe made by equally dividing the frame. Subsequently, CELP performs selection of excitation codevectors by using the input signals, the synthesized signal of the selected adaptive codevector and said excitation codevector.
  • However, the CELP systems have the following disadvantage, in that a quite large operation is required for searching the excitation codebook. Moreover since the adaptive codebook is determined independently of the excitation codebook, it is impossible to get a high SN (signal to noise) ratio. Further in the above CELP system, although the adaptive codebook and the excitation codebook are each searched by using gains not quantized, it becomes possible to obtain a higher SN ratio when the adaptive codebook and the excitation codebook are searched for all the quantization value of gains. Furthermore, it is impossible to obtain sufficiently good speech quality with low bit rates such as 8 kbit/s or less because of the too small size of the excitation codebook.
  • An object of the present invention is to provide a method for speech coding which can solve the above problem of the conventional method and achieve high quality speech by a relatively small operation even at the low bit rates such as less than 8 kbit/s.
  • Another object of the present invention is to provide a voice-coder which can solve the above problem of the conventional method and achieve high quality speech by a relatively small operation even at low bit rates such as less than 8 kbit/s.
  • The object of the present invention can be achieved by a method for speech coding for coding speech signals divided into frames spaced with a constant interval, wherein an adaptive codebook storing excitation signals determined in the past and a plurality of excitation codebooks for multi-stage vector quantization of an excitation signal of the input speech signal are prepared; a spectral parameter of said input speech signal is obtained; said frame is divided into subframes; a candidate of a first fixed number of adaptive codevectors is selected for every said subframe from said adaptive codebook by using said input speech signal and said spectral parameter; candidates of a second fixed number of excitation codevectors are selected for every said subframe from said excitation codebooks, respectively, by using said input speech signal, said spectral parameter and the candidate of said adaptive codevector; and a combination of the adaptive vector and each of the excitation codevectors forming an excitation signal of said subframe is selected from the candidates of said adaptive codevector and each of said excitation codevector by using said input speech signal and said spectral parameter.
  • Another object of the present invention is achieved by a voice-coder for coding speech signals divided into frames spaced with a constant interval, comprising: linear prediction analysis means for outputting spectral parameters of input speech signals; an adaptive codebook for storing excitation signals determined in the past; a plurality of excitation codebooks provided for multi-stage vector quantization of the excitation signal of said input speech signals; wherein, in case of searching for a combination of the adaptive codevector and each of the excitation codevectors for every subframe prepared by further division of said frame, from said adaptive codebook and each of said excitation codebooks, respectively, said combination of the adaptive codevector and each of the excitation codevectors forming an excitation signal of said subframe; a candidate of a first predetermined number of adaptive codevectors is selected from said adaptive codebook by using said input speech signal and said spectral parameter; candidates of each predetermined number of excitation codevectors are selected from a plurality of said excitation codebooks respectively by using said input speech signal, said spectral parameter and the candidate of said adaptive codevector; and a candidate of the adaptive codevector and each of excitation codevectors forming the excitation signal of said subframe is selected from the candidate of said adaptive codevector and each of said excitation codevectors by using said input speech signal and said spectral parameter.
  • Another object of the present invention is also achieved by a voice-coder for coding speech signals divided into frames spaced with a constant interval, comprising: linear prediction analysis means for outputting spectral parameters of input speech signals; an adaptive codebook storing excitation signals determined in the past; a plurality of excitation codebooks provided for multi-stage vector quantization of an excitation signal of said input speech signals; subframe division means for generating subframe signals by dividing said frame into subframes; first selection means for selecting a candidate of a first fixed number of adaptive codevectors from said adaptive codebook in accordance with said subframe signal and said spectral parameter; second selection means provided for every said excitation codebook for selecting the candidate of the excitation codevectors of the number predetermined for every excitation codebook, from the corresponding excitation codebook in accordance with said subframe signal, said spectral parameter and the candidate of said adaptive codevector; and means for searching the candidate of the adaptive vector and each of the excitation codevectors which forms the excitation signal of said subframe, from the candidate of said adaptive codevector and the candidate of each of said excitation codevectors in accordance with said input aural signal and said spectral parameter.
  • The above and other objects, features and advantages of the present invention will be apparent from the following description referring to the accompanying drawings which illustrate examples of preferred embodiments of the present invention.
  • BRIEF DESCRIPTION OF THE DRAWINGS:
  • Fig. 1 is a block diagram showing the structure of a voice-coder of a first embodiment of the present invention.
  • Fig. 2 is a block diagram showing the structure of a voice-coder of a second embodiment of the present invention.
  • Fig. 3 is a block diagram showing the structure of a voice-coder of a third embodiment of the present invention.
  • Fig. 4 is a block diagram showing the structure of a voice-coder of a fourth embodiment of the present invention.
  • DESCRIPTION OF THE PREFERRED EMBODIMENTS:
  • A first preferable embodiment of the present invention will be described with reference to Fig. 1. In the voice-coder shown in Fig.1, there are provided adaptive codebook 175, gain codebook 220 and two kinds of excitation codebooks 180, 190.
  • Speech input circuit 100 is provided for receiving speech signals divided into frames spaced with a constant interval. Subframe division circuit 120 and linear prediction analysis circuit 110 are provided on the output side of speech input circuit 100. Subframe division circuit 120 outputs subframes by equally dividing the frame, and linear prediction analysis circuit 110 performs linear prediction analyses of speech signals composing frames for obtaining spectral parameters of the speech signals. On the output side of subframe division circuit 120, weighting filter 130 is provided for performing perceptual weighting by receiving subframes and spectral parameters. On the output side of weighting filter 130, influence signal subtracter 140 is provided for subtracting weighted influence signal from the preceding subframe and outputting the results thereof.
  • Adaptive codebook 175 stores excitation signals decided in the past as adaptive codevectors. Corresponding to adaptive codebook 175, adaptive codebook candidate selection circuit 150 is provided for selecting the previously fixed number of adaptive codevectors and for outputting thereof as candidates of adaptive codevectors. Adaptive codebook candidate selection circuit 150 performs selection of the candidate according to the spectral parameter and the output signal of influence signal subtracter 140.
  • First and second excitation codebooks 180, 190 operate for multi-stage vector quantization of the excitation signal, and store the first and second excitation codevectors, respectively. Corresponding to first and second excitation codebooks 180, 190, candidate selection circuits 160, 170 for the first and second excitation codebooks are provided respectively. Candidate selection circuits 160, 170 select the previously fixed number of excitation codevectors from corresponding respective excitation codebooks 180, 190 and output thereof as the candidates of the excitation codevectors. Spectral parameters, output signals of the influence signal subtracter and candidates of adaptive codevectors are inputted into each of candidate selection circuits 160, 170 for the excitation codebook.
  • Optimum combination search circuit 200 is provided for candidates selected by candidate selection circuits 150, 160, 170 for the corresponding codebooks in order to search the optimum combination of candidates. Further, gain codebook search circuit 210 and multiplexer 230 are provided. Optimum combination search circuit 200 is structured so as to output to multiplexer 230 the delay (to the adaptive codevector) or index (to each excitation codevector) with reference to each of the respective optimum adaptive codevectors, to the first and second excitation codevectors according to the results of the search, and to the output weighted synthetic signals of the above vectors to gain codebook search circuit 210, respectively. Gain codebook search circuit 210 searches for the optimum gain codevector from gain codebook 220 which stores gain codevectors, and outputs the index of thus searched optimum gain codevector. Multiplexer 230 is structured so as to receive delay or indices from optimum combination search circuit 200 or gain codebook search circuit 210, and output codes which correspond to input speech signals according to delay or indices.
  • Next, description will be made with reference to selection or search algorithm of each candidate selection circuit 150, 160, 170 or optimum combination search circuit 200 of the present embodiment. Under these algorithms, the excitation signal is processed by two-stage vector quantization by using two kinds of excitation codebooks 180, 190.
  • First, in adaptive codebook candidate selection circuit 150, the predetermined number L₀ of the adaptive codevectors is selected, in order, from the one with smaller error E₀ expressed by equation (1):

    E₀ = ∥z - β₀sa d ∥²   (1)
    Figure imgb0001


       where, z is a signal obtained by subtracting an influence signal from a perceptually weighted input signal, sad a perceptually weighted synthetic signal of adaptive codevector ad with delay d, β₀ a sequential optimum gain of an adaptive codevector, ∥ ∥ Euclid norm. The sequential optimum gain β₀ of the adaptive codevector is given by:
    Figure imgb0002

       By substituting above equation (2) into equation (1), the next equation is obtained.
    Figure imgb0003

       where, 〈, 〉 represents an inner product.
  • In candidate selection circuit 160 for the first excitation codebook, candidates of predetermined number L₁ of the first excitation codevectors are selected for each L₀ piece of the adaptive codevectors selected by candidate selection circuit 150 for the adaptive codebook, in order, from the one with smaller error E₁ expressed by equation (4):

    E₁ = ∥za - γ₀se i ¹∥²   (4)
    Figure imgb0004


       where sei¹ is a perceptually weighted synthesized signal of first excitation codevector ei¹ with index i, γ₀ a sequential optimum gain of the first excitation codevector, and za = z - β₀ sad.
  • Therefore:
    Figure imgb0005

       By substituting above equation (5) into equation (4), equation (6) below is obtained:
    Figure imgb0006

       In the same way as described above, in candidate selection circuit 170 for the second excitation codebook, the candidates of predetermined number L₂ of the second excitation codevectors are selected for each L₀ piece of adaptive codevectors selected by candidate selection circuit 150 for the adaptive codebook, in order, from the one with smaller error E₂ expressed by the next equation:

    E₂ = ∥za - δ₀se j ²∥²   (7)
    Figure imgb0007


       where sej² is a perceptually weighted synthesized signal of second excitation codevector ej² with index j, and δ₀ a sequential optimum gain of the second excitation codevector. Therefore:
    Figure imgb0008

       By substituting equation (8) into equation (7), following equation (9) is obtained.
    Figure imgb0009

       In optimum combination search circuit 200, error E is calculated by the following equation for all the combinations of candidates of the selected adaptive codevectors, and the first and second excitation codevectors, and then the combination of the candidates with minimum E is searched.

    E = ∥z-βsa d -γse₁¹-δse j ²∥²   (10)
    Figure imgb0010


       where β, γ, δ are simultaneous optimum gains of an adaptive codevector, the first and second excitation codevectors, respectively. Therefore:
    Figure imgb0011

       However, R is to satisfy the following equation:
    Figure imgb0012

       By substituting equation (11) into equation (10), thus obtained,
    Figure imgb0013

       When above error E is calculated, it is acceptable to assign a particular limitation to simultaneous optimum gains γ, δ of each excitation codevector. For example, with the limitation that γ and δ are equal, error E is given by,
    Figure imgb0014

       where,
    Figure imgb0015

       Next, description will be made with reference to operation of the voice-coder of the present embodiment.
  • Speech input circuit 100 receives speech signals divided into each frame (e.g., 40 ms in width), which signals are outputted to linear prediction analysis circuit 110 and subframe division circuit 120. In linear prediction analysis circuit 110, linear prediction analysis of the inputted speech signal is performed for calculating the spectral parameter. This spectral parameter is outputted to weighting filter 130, to influence signal subtracter 140, to candidate selection circuit 150 for the adaptive codebook, to candidate selection circuit 160 for the first excitation codebook, to candidate selection circuit 170 for the second excitation codebook, and to multiplexer 230. Separately, a frame is divided into subframes (e.g., 8 ms in width) by subframe division circuit 120. Speech signals divided into subframes are inputted into weighting filter 130. Weighting filter 130 performs perceptual weighting of inputted speech signals, and outputs the results to influence signal subtracter 140. Influence signal subtracter 140 subtracts the weighted influence signal from the preceding subframe, and outputs the result to candidate selection circuit 150 for the adaptive codebook, to candidate selection circuit 160 for the first excitation codebook, to candidate selection circuit 170 for the second excitation codebook, and to gain codebook search circuit 210.
  • Subsequently, candidate selection circuit 150 for the adaptive codebook selects the candidate of L₀ pieces of adaptive codevectors from adaptive codebook 175 according to equation (3). Candidate selection circuit 150 for the adaptive codebook outputs the weighted synthetic signal of the candidate of the selected adaptive codevectors and delay d which constitutes the index of the candidate of adaptive codevectors, to candidate selection circuits 160, 170 for the first and second excitation codebooks and to optimum combination search circuit 200.
  • Candidate selection circuit 160 for the first excitation codebook selects the candidate of L₁ pieces of the first excitation codevector from first excitation codebook 180, according to the output of the influence signal subtracter, the spectral parameter and the candidate of the adaptive codevector by using equation (6). Candidate selection circuit 160 for the first excitation codebook outputs the weighted synthetic signal and index of the candidate of the selected first excitation codevector to optimum combination search circuit 200. In the same manner, candidate selection circuit 170 for the second excitation codebook selects the candidate of the second excitation codevector from the second excitation codebook according to equation (9), and outputs the weighted synthetic signal and index of the selected second excitation codevector to optimum combination search circuit 200.
  • Optimum combination search circuit 200 searches for the combination of the optimum candidates according to equation (14), and outputs the delay of the adaptive codevector and the indices of the first and second excitation codevectors to multiplexer 230, and weighted synthetic signals of each codevector to gain codebook search circuit 210. Gain codebook search circuit 210 searches for the optimum gain codevector from gain codebook 220 according to each of the inputted weighted synthetic signals, and outputs the index of thus obtained gain codevector to multiplexer 230.
  • Finally, multiplexer 230 assembles and outputs the code for the speech signal divided into subframes according to the delay and index outputted from optimum combination search circuit 200 and to the index outputted from gain codebook search circuit 210. By carrying out the above process, speech coding of every subframe is completed.
  • According to the present embodiment, the candidates are selected first from the adaptive codebook and each of excitation codebooks, and then the optimum combination is selected from the combination of each of thus selected candidates, so that a sufficiently good speech quality can be obtained with a relatively small operation. In addition, since the gain codebook which stores the quantized gain vectors is used for selecting the optimum combination from combinations of the candidates, SN ratio is further improved.
  • The second embodiment of the present invention will be described with reference to Fig. 2. In the voice-coder shown in Fig. 2, each block attached with the same reference numeral as that in Fig. 1 has the same function as that in Fig. 1.
  • The voice-coder in Fig. 2, when compared with the voice-coder in Fig. 1, differs in that it has no gain codebook search circuit and optimum combination search circuit, but has instead gain-including optimum combination search circuit 300. Gain-including optimum combination search circuit 300 receives candidates of the adaptive codevectors, candidates of the first and second excitation codevectors, and outputs of influence signal subtracter 140, and selects the optimum combination from all of the combinations of the candidates and gain codevectors by searching for gain codebook 220. Gain-including optimum combination search circuit 300 is structured so as to output the delay or index of each codevector composing the selected combination to multiplexer 230 according to the selected combination.
  • The search algorithm which controls gain-including optimum combination search circuit 300 will next be described.
  • Gain-including optimum combination search circuit 300 searches for the combination of candidates which has the minimum value of error E by calculating E for all of the combinations of candidates of the selected adaptive codevectors, the selected first and second excitation codevectors, and all of the gain codevectors, where E is calculated by the following equation:

    E = ∥z - Qβ k sa d - Qγ k se i ¹ - Qδ k se j ²∥²   (16)
    Figure imgb0016


       where Qβk, Qγk, Qδk are each gain codevector.
  • It is acceptable to use, in place of above Qβk, Qγk, Qδk, not the gain codevector itself, but gain codevectors converted by the matrix to be calculated from the quantized power of the weighted input signal, the weighted synthetic signal of the adaptive codevector and the weighted synthetic signals of the first and second excitation codevectors. Since it requires large operation to search for the minimum value of E by calculating it against all the gain codevectors, it is also possible to perform a preliminary selection of the gain codebook to reduce the operation. The preliminary selection of the gain codebook is performed, for example, by selecting the predetermined fixed number of gain codevectors whose first components are close to the sequential optimum gain of the adaptive codevector.
  • The operation of this voice-coder will be described. It is the same as that of the voice-coder shown in Fig. 1 except that the candidates of vectors are outputted from each of candidate selection circuits 150, 160 and 170. These candidates of codevectors are inputted into gain-including optimum combination search circuit 300, whereby the optimum combination of candidates is searched according to equation (16). Then consulting the searched combination, the delay of the adaptive codevector and indices of the first and second excitation codevectors and gain codevectors are inputted into multiplexer 230, from which speech signal codes are outputted.
  • Next, the third embodiment of the present invention will be described with reference to Fig. 3. In the voice-coder shown in Fig. 3, each block attached with the same reference numeral as that in Fig. 1 has the same function as that in Fig. 1.
  • This voice-coder differs from the one shown in Fig. 1 in that the second excitation codebook is composed of excitation super codebook 390. A super codebook means a codebook which stores codevectors with the number of bits larger than the number of bits to be transmitted. Index i of the candidate of the first excitation codevector is outputted from first excitation codebook selection circuit 160 to second excitation super codebook 390. The selection of the candidate of the second excitation codevectors from second excitation super codebook 390 is carried out by searching codevectors from a portion of second excitation super codebook 390, the portion being expressed by set F₂(i) of indices to be determined according to index i of the first excitation codevector.
  • When searching of the candidates of the first and second codevectors is finished, then the optimum combination of candidates is searched in optimum combination search circuit 200 according to equation (14) as searched in the first embodiment. In the present embodiment, it is possible to modify so as to output all the second excitation codevectors which correspond to set of indices F₂(i) without performing selection of candidates of the second excitation codevectors in candidate selection circuit 170 of the second excitation codebook. In this case, optimum combination search circuit 200 can search the optimum combination from the combination of the candidate of the adaptive codevectors, the candidate of the first excitation codevectors, and all of the second excitation codevectors corresponding to set F₂(i).
  • As described above in the third embodiment of the present invention, by applying the super codebook in the embodiment, it becomes possible to obtain speech quality as substantially good as the case with a excitation codebook of an increased codebook size without increasing the bit rates.
  • The fourth embodiment of the present invention will nest be described with reference to Fig. 4. In the voice-coder shown in Fig. 4, each block attached with the same reference numeral as that in Fig. 2 has the same function as that in Fig. 2.
  • This voice-coder uses second excitation super codebook 390 instead of the second excitation codebook, differently from the voice-coder in Fig. 2. Super codebook 390 is similar to the super codebook in the voice-coder shown in Fig. 3. The candidate of the second excitation codevector to be selected from second excitation super codebook 390 is also selected in the same way as in the third embodiment, and other operations are conducted in the same manner as in the second embodiment. In this case, it is also possible to modify candidate selection circuit 170 for the second excitation codevectors so as to output all of the second excitation codevectors which correspond to set of indices F₂(i) without selecting the candidate of the second excitation codevectors.
  • Although each embodiment of the present invention has been described above, the operation of each embodiment can be modified in such a way that auto-correlation 〈sei, sei〉 of weighted synthetic signal sei of the excitation codevector is obtained according to the following equation for the purpose of reducing the operation:
    Figure imgb0017

       where hh is an auto-correlation function of the impulse response of a weighting synthesis filter, eei an auto-correlation function of the excitation code vector with index i, and im a length of the impulse response.
  • As well, cross-correlation between weighted synthetic signal sei of the excitation codevector and arbitrary vector v can be calculated according to the following equation to reduce the operation:

    〈v, se i 〉 = 〈H T v, e j 〉   (18)
    Figure imgb0018


       where H is an impulse response matrix of the weighting synthesis filter.
  • Cross-correlation between weighted synthetic signal sad of the adaptive codevector and arbitrary vector v can be obtained according to the following equation in the same way:

    〈v, sa d 〉 = 〈H T v, a d 〉   (19)
    Figure imgb0019


       Further, in the case of searching for the optimum combination in the optimum combination search circuit of the first and third embodiments, although a particular limitation (γ = δ ) is now assigned to gains γ, δ of the first and second excitation codevectors as described above, it is possible to provide limitations other than γ = δ or to provide no limitation.
  • Further, it is also possible to apply a delayed decision system in each embodiment in such a way that the combination of candidates is selected so as to have the minimum cumulative error for the whole frames without uniquely determining the adaptive codevector, the first and second excitation codevectors and the gain codevector for each subframe while leaving the candidates undetermined.
  • It is to be understood that variations and modifications of the method for speech coding and of the voice-coder disclosed herein will be evident to those skilled in the art. It is intended that all such modifications and variations be included within the scope of the appended claims.

Claims (14)

  1. A method for speech coding for coding speech signals divided into frames spaced with a constant interval, wherein,
       an adaptive codebook storing excitation signals determined in the past and a plurality of excitation codebooks for multi-stage vector quantization of an excitation signal of the input speech signal are prepared;
       a spectral parameter of said input speech signal is obtained;
       said frame is divided into subframes;
       a candidate of a first fixed number of adaptive codevectors is selected for every said subframe from said adaptive codebook by using said input speech signal and said spectral parameter;
       candidates of a second fixed number of excitation codevectors are selected for every said subframe from said excitation codebooks, respectively, by using said input speech signal, said spectral parameter and the candidate of said adaptive codevector; and
       a combination of the adaptive codevector and each of the excitation codevectors forming an excitation signal of said subframe is selected from the candidates of said adaptive codevector and each of said excitation codevectors by using said input speech signal and said spectral parameter.
  2. A method for speech coding according to Claim 1, wherein selection of the candidates of the adaptive codevector and each of the excitation codevectors are performed, respectively, in order, from the selection of the candidate with a smaller error.
  3. A method for speech coding according to Claim 1 or 2, wherein,
       a gain codebook is used for performing quantization of gains of said adaptive codebook and each of said excitation codebooks, respectively; and
       a gain codevector is determined by using said gain codebook when selection of a combination of the adaptive codevector and each of the excitation codevectors forming the excitation signal of said subframe from the candidates of said adaptive codevector and said excitation codevector is performed.
  4. A method for speech coding according to any of Claims 1 to 3, wherein,
       at least one or more of excitation super codebooks is included in said plurality of excitation codebooks, said super codebook comprising bits with the number of bits larger than the number of bits to be transmitted; and
       selection of the candidate of the excitation codevector from said excitation super codebook is performed corresponding to the candidate of the excitation codevector already selected.
  5. A method for speech coding according to Claim 1, wherein the step of selecting the combination of the adaptive codevector and each of the excitation codevectors forming the excitation signal of said subframe from the candidates of said adaptive codevector and said excitation codevector, further comprising the steps of:
       determining the optimum gain codevector from said gain codebook; and
       reflecting said gain codevector on said adaptive codevector and each of said excitation codevectors forming said excitation signal.
  6. A method for speech coding according to Claim 5, wherein,
       at least one or more of excitation super codebooks is included in said plurality of excitation codebooks, said super codebook comprising bits with the number of bits larger than the number of bits to be transmitted; and
       selection of the candidate of the excitation codevector from said excitation super codebook is performed corresponding to the candidate of the excitation codevector already selected.
  7. A voice-coder for coding speech signals divided into frames spaced with a constant interval, comprising:
       linear prediction analysis means for outputting spectral parameters of input speech signals;
       an adaptive codebook for storing excitation signals determined in the past;
       a plurality of excitation codebooks provided for multi-stage vector quantization of the excitation signal of said input speech signals;
       wherein, in case of searching for a combination of the adaptive codevector and each of the excitation codevectors for every subframe prepared by further division of said frame from said adaptive codebook and each of said excitation codebooks, respectively, said combination of the adaptive codevector and each of the excitation codevectors forming an excitation signal of said subframe:
       a candidate of a first predetermined number of adaptive codevectors is selected from said adaptive codebook by using said input speech signal and said spectral parameter;
       candidates of each predetermined number of excitation codevectors are selected from a plurality of said excitation codebooks, respectively, by using said input speech signal, said spectral parameter and the candidate of said adaptive codevector; and
       a candidate of said adaptive codevector and each of said excitation codevectors forming the excitation signal of said subframe is selected from the candidate of said adaptive codevector and each of said excitation codevectors by using said input speech signal and said spectral parameter.
  8. A voice-coder according to Claim 7, further comprising:
       a gain codebook for quantization of each gain of said adaptive codebook and each of said excitation codebooks; wherein
       said input speech signal, said spectral parameter and said gain codebook are used for searching a combination of the adaptive codevector and each of the excitation codevectors which forms the excitation signal of said subframe, from the candidates of said adaptive codevector and said excitation codevectors.
  9. A voice-coder according to Claim 7 or 8, wherein,
       at least one or more of excitation super codebooks is included in said plurality of excitation codebooks, said super codebook comprising bits with the number of bits larger than the number of bits to be transmitted; and
       selection of the candidate of the excitation codevector from said excitation super codebook is performed corresponding to the candidate of the excitation codevector already selected.
  10. A voice-coder for coding speech signals divided into frames spaced with a constant interval, comprising:
       linear prediction analysis means for outputting spectral parameters of input speech signals;
       an adaptive codebook storing excitation signals determined in the past;
       a plurality of excitation codebooks provided for multi-stage vector quantization of an excitation signal of said input speech signals;
       subframe division means for generating subframe signals by dividing said frame into subframes;
       first selection means for selecting a candidate of a first fixed number of adaptive codevectors from said adaptive codebook in accordance with said subframe signal and said spectral parameter;
       second selection means provided for every said excitation codebook for selecting the candidate of the excitation codevectors of the number predetermined for every excitation codebook, from the corresponding excitation codebook in accordance with said subframe signal, said spectral parameter and the candidate of said adaptive codevector; and
       means for searching the candidate of said adaptive codevector and each of said excitation codevectors which forms the excitation signal of said subframe, from the candidate of said adaptive codevector and the candidate of each of said excitation codevectors in accordance with said input speech signal and said spectral parameter.
  11. A voice-coder according to Claim 10, wherein,
       first and second selecting means select each corresponding candidate, in order, from the candidate with a smaller error;
       said search means searches the candidate of said codevector on the condition of whose error is lowest.
  12. A voice-coder according to Claim 10, further comprising:
       a gain codebook for quantization of each gain of said adaptive codebook and each of said excitation codebooks; wherein
       said search means searches the candidate of said codevector by further consulting said gain codebook.
  13. A voice-coder according to Claim 11, further comprising:
       a gain codebook for quantization of each gain of said adaptive codebook and each of said excitation codebooks; wherein
       said search means further determines the optimum gain codevector from said gain codebook by consulting said gain codebook, and reflects said gain codevector on the adaptive codevector and each of the excitation codevector which forms said excitation signal.
  14. A voice-coder according to any of Claims 10 to 13, wherein, at least one or more of excitation super codebooks is included in said plurality of excitation codebooks, said super codebook comprising bits with the number of bits larger than the number of bits to be transmitted; and
       said second selection means corresponding to said excitation super codebook performs selection of the candidate of the excitation codevector from said excitation super codebook according to the candidate of the excitation codevector already selected.
EP92120573A 1991-12-03 1992-12-02 Method for speech coding and voice-coder Expired - Lifetime EP0545386B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP319314/91 1991-12-03
JP03319314A JP3089769B2 (en) 1991-12-03 1991-12-03 Audio coding device

Publications (3)

Publication Number Publication Date
EP0545386A2 true EP0545386A2 (en) 1993-06-09
EP0545386A3 EP0545386A3 (en) 1993-08-18
EP0545386B1 EP0545386B1 (en) 1999-04-07

Family

ID=18108816

Family Applications (1)

Application Number Title Priority Date Filing Date
EP92120573A Expired - Lifetime EP0545386B1 (en) 1991-12-03 1992-12-02 Method for speech coding and voice-coder

Country Status (4)

Country Link
EP (1) EP0545386B1 (en)
JP (1) JP3089769B2 (en)
CA (1) CA2084338C (en)
DE (1) DE69228858T2 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0610906A1 (en) * 1993-02-09 1994-08-17 Nec Corporation Device for encoding speech spectrum parameters with a smallest possible number of bits
WO1994027286A1 (en) * 1993-05-07 1994-11-24 Ant Nachrichtentechnik Gmbh Process for vector quantization, especially of voice signals
EP0632429A2 (en) * 1993-06-30 1995-01-04 Nec Corporation Vector quantizer
WO1996028810A1 (en) * 1995-03-10 1996-09-19 Universite De Sherbrooke Depth-first algebraic-codebook search for fast coding of speech
EP0734014A1 (en) * 1995-03-23 1996-09-25 Kabushiki Kaisha Toshiba Coding apparatus
EP0733257A1 (en) * 1993-12-07 1996-09-25 Pacific Communication Sciences, Inc. Adaptive speech coder having code excited linear prediction with multiple codebook searches
WO1998004046A2 (en) * 1996-07-17 1998-01-29 Universite De Sherbrooke Enhanced encoding of dtmf and other signalling tones
US5754976A (en) * 1990-02-23 1998-05-19 Universite De Sherbrooke Algebraic codebook with signal-selected pulse amplitude/position combinations for fast coding of speech
EP0756268A3 (en) * 1995-07-27 1998-05-27 Nec Corporation Speech encoder capable of substantially increasing a codebook size without increasing the number of transmitted bits
US7151802B1 (en) 1998-10-27 2006-12-19 Voiceage Corporation High frequency content recovering method and device for over-sampled synthesized wideband signal
US7269552B1 (en) 1998-10-06 2007-09-11 Robert Bosch Gmbh Quantizing speech signal codewords to reduce memory requirements
EP2378517A1 (en) * 1998-06-09 2011-10-19 Panasonic Corporation Speech coding apparatus and speech decoding apparatus
KR101290997B1 (en) 2012-03-26 2013-07-30 세종대학교산학협력단 A codebook-based speech enhancement method using adaptive codevector and apparatus thereof
EP2234104A4 (en) * 2008-01-16 2015-09-23 Panasonic Ip Corp America Vector quantizer, vector inverse quantizer, and methods therefor

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3541680B2 (en) 1998-06-15 2004-07-14 日本電気株式会社 Audio music signal encoding device and decoding device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0443548A2 (en) * 1990-02-22 1991-08-28 Nec Corporation Speech coder
EP0477960A2 (en) * 1990-09-26 1992-04-01 Nec Corporation Linear prediction speech coding with high-frequency preemphasis

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0443548A2 (en) * 1990-02-22 1991-08-28 Nec Corporation Speech coder
EP0477960A2 (en) * 1990-09-26 1992-04-01 Nec Corporation Linear prediction speech coding with high-frequency preemphasis

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS vol. 37, no. 4, November 1991, NEW YORK US pages 772 - 782 SUNWOO , PARK 'Real time implementation of the VSELP on a 16-bit DSP chip' *
INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING vol. 1, 23 May 1989, GLASGOW SCOTLAND U.K. pages 65 - 68 JAYANT, CHEN 'Speech coding with time varying bit allocations to excitation and LPC parameters' *
INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING vol. 1, 3 April 1990, ALBUQUERQUE NEW MEXICO USA pages 461 - 464 GERSON, JASIUK 'Vector sum excited linear prediction speech coding at 8Kbps' *
NEC RESEARCH AND DEVELOPMENT vol. 32, no. 4, October 1991, TOKYO JAPAN pages 549 - 555 UNNO ET AL 'Learned codebook excited linear predictive speech codec for digital cellular system' *

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5701392A (en) * 1990-02-23 1997-12-23 Universite De Sherbrooke Depth-first algebraic-codebook search for fast coding of speech
US5754976A (en) * 1990-02-23 1998-05-19 Universite De Sherbrooke Algebraic codebook with signal-selected pulse amplitude/position combinations for fast coding of speech
EP0610906A1 (en) * 1993-02-09 1994-08-17 Nec Corporation Device for encoding speech spectrum parameters with a smallest possible number of bits
US5625744A (en) * 1993-02-09 1997-04-29 Nec Corporation Speech parameter encoding device which includes a dividing circuit for dividing a frame signal of an input speech signal into subframe signals and for outputting a low rate output code signal
WO1994027286A1 (en) * 1993-05-07 1994-11-24 Ant Nachrichtentechnik Gmbh Process for vector quantization, especially of voice signals
AU681137B2 (en) * 1993-05-07 1997-08-21 Bosch Telecom Gmbh Process for vector quantization, especially of voice signals
EP0632429A2 (en) * 1993-06-30 1995-01-04 Nec Corporation Vector quantizer
US5761632A (en) * 1993-06-30 1998-06-02 Nec Corporation Vector quantinizer with distance measure calculated by using correlations
EP0632429A3 (en) * 1993-06-30 1997-01-22 Nec Corp Vector quantizer.
EP0733257A4 (en) * 1993-12-07 1999-12-08 Pacific Comm Sciences Inc Adaptive speech coder having code excited linear prediction with multiple codebook searches
EP0733257A1 (en) * 1993-12-07 1996-09-25 Pacific Communication Sciences, Inc. Adaptive speech coder having code excited linear prediction with multiple codebook searches
ES2112808A1 (en) * 1995-03-10 1998-04-01 Univ Sherbrooke Depth-first algebraic-codebook search for fast coding of speech
WO1996028810A1 (en) * 1995-03-10 1996-09-19 Universite De Sherbrooke Depth-first algebraic-codebook search for fast coding of speech
EP0734014A1 (en) * 1995-03-23 1996-09-25 Kabushiki Kaisha Toshiba Coding apparatus
US5878387A (en) * 1995-03-23 1999-03-02 Kabushiki Kaisha Toshiba Coding apparatus having adaptive coding at different bit rates and pitch emphasis
US6006178A (en) * 1995-07-27 1999-12-21 Nec Corporation Speech encoder capable of substantially increasing a codebook size without increasing the number of transmitted bits
EP0756268A3 (en) * 1995-07-27 1998-05-27 Nec Corporation Speech encoder capable of substantially increasing a codebook size without increasing the number of transmitted bits
WO1998004046A2 (en) * 1996-07-17 1998-01-29 Universite De Sherbrooke Enhanced encoding of dtmf and other signalling tones
WO1998004046A3 (en) * 1996-07-17 1998-03-26 Univ Sherbrooke Enhanced encoding of dtmf and other signalling tones
EP2378517A1 (en) * 1998-06-09 2011-10-19 Panasonic Corporation Speech coding apparatus and speech decoding apparatus
US7269552B1 (en) 1998-10-06 2007-09-11 Robert Bosch Gmbh Quantizing speech signal codewords to reduce memory requirements
US7151802B1 (en) 1998-10-27 2006-12-19 Voiceage Corporation High frequency content recovering method and device for over-sampled synthesized wideband signal
EP2234104A4 (en) * 2008-01-16 2015-09-23 Panasonic Ip Corp America Vector quantizer, vector inverse quantizer, and methods therefor
EP3288029A1 (en) * 2008-01-16 2018-02-28 III Holdings 12, LLC Vector quantizer, vector inverse quantizer, and methods therefor
KR101290997B1 (en) 2012-03-26 2013-07-30 세종대학교산학협력단 A codebook-based speech enhancement method using adaptive codevector and apparatus thereof

Also Published As

Publication number Publication date
CA2084338C (en) 1997-03-04
JP3089769B2 (en) 2000-09-18
EP0545386A3 (en) 1993-08-18
EP0545386B1 (en) 1999-04-07
DE69228858T2 (en) 1999-08-19
JPH05216500A (en) 1993-08-27
CA2084338A1 (en) 1993-06-04
DE69228858D1 (en) 1999-05-12

Similar Documents

Publication Publication Date Title
US5208862A (en) Speech coder
JP3151874B2 (en) Voice parameter coding method and apparatus
US7925501B2 (en) Speech coder using an orthogonal search and an orthogonal search method
US5778334A (en) Speech coders with speech-mode dependent pitch lag code allocation patterns minimizing pitch predictive distortion
US5142584A (en) Speech coding/decoding method having an excitation signal
JP3114197B2 (en) Voice parameter coding method
EP0514912A2 (en) Speech coding and decoding methods
US5826226A (en) Speech coding apparatus having amplitude information set to correspond with position information
EP0545386A2 (en) Method for speech coding and voice-coder
US5677986A (en) Vector quantizing apparatus
USRE43190E1 (en) Speech coding apparatus and speech decoding apparatus
JP2800618B2 (en) Voice parameter coding method
US5598504A (en) Speech coding system to reduce distortion through signal overlap
US6094630A (en) Sequential searching speech coding device
US7680669B2 (en) Sound encoding apparatus and method, and sound decoding apparatus and method
US5797119A (en) Comb filter speech coding with preselected excitation code vectors
EP0557940B1 (en) Speech coding system
US6704703B2 (en) Recursively excited linear prediction speech coder
JPH1097294A (en) Voice coding device
US5666464A (en) Speech pitch coding system
US5884252A (en) Method of and apparatus for coding speech signal
US6751585B2 (en) Speech coder for high quality at low bit rates
US5774840A (en) Speech coder using a non-uniform pulse type sparse excitation codebook
JP3192051B2 (en) Audio coding device
GB2199215A (en) A stochastic coder

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): DE FR GB

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): DE FR GB

17P Request for examination filed

Effective date: 19930707

17Q First examination report despatched

Effective date: 19960912

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): DE FR GB

REF Corresponds to:

Ref document number: 69228858

Country of ref document: DE

Date of ref document: 19990512

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed
REG Reference to a national code

Ref country code: GB

Ref legal event code: IF02

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20081212

Year of fee payment: 17

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20081127

Year of fee payment: 17

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20081126

Year of fee payment: 17

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20091202

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20100831

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20091231

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100701

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20091202