EP1355298A2 - CELP Kodierer und Dekodierer - Google Patents

CELP Kodierer und Dekodierer Download PDF

Info

Publication number
EP1355298A2
EP1355298A2 EP03013629A EP03013629A EP1355298A2 EP 1355298 A2 EP1355298 A2 EP 1355298A2 EP 03013629 A EP03013629 A EP 03013629A EP 03013629 A EP03013629 A EP 03013629A EP 1355298 A2 EP1355298 A2 EP 1355298A2
Authority
EP
European Patent Office
Prior art keywords
excitation
signal
codebook
code
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP03013629A
Other languages
English (en)
French (fr)
Other versions
EP1355298B1 (de
EP1355298A3 (de
Inventor
Kenichiro Hosoda
Hiromi Aoyaki
Hiroshi Katsuragawa
Yoshihiro Ariyama
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oki Electric Industry Co Ltd
Original Assignee
Oki Electric Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oki Electric Industry Co Ltd filed Critical Oki Electric Industry Co Ltd
Priority claimed from PCT/JP1993/000776 external-priority patent/WO1994029965A1/ja
Priority claimed from EP93913500A external-priority patent/EP0654909A4/de
Publication of EP1355298A2 publication Critical patent/EP1355298A2/de
Publication of EP1355298A3 publication Critical patent/EP1355298A3/de
Application granted granted Critical
Publication of EP1355298B1 publication Critical patent/EP1355298B1/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation

Definitions

  • This invention relates to an encoder and a decoder based on the code excitation linear predictive coding (CELP) system.
  • CELP code excitation linear predictive coding
  • a code excitation linear predictive coding and its modification that is, a vector sum excitation linear predictive coding system (VSELP) have been used.
  • VSELP vector sum excitation linear predictive coding system
  • CELP code excitation linear predictive coding
  • a fundamental construction of the coding system relative to the speech signal is to obtain vocal tract parameters representing vocal tract properties and excitation source parameters representing excitation source information.
  • an excited signal as a excitation source information is encoded by means of both an adaptive excitation codevectors, which contribute to stochastically stronger periodic excitation signal and stochastic excitation codevectors which contribute to stochastic less periodic random excitation signal, and then the coded excitation signals are stored in a codebook, and an optimum adaptive excitation codevectors and stochastic excitation codevectors are found out in each codebook so that weighted error power sum between an input speech vector and synthetic speech vector becomes minimum.
  • the excitation source parameters that is, adaptive excitation code and stochastic excitation code information are transmitted.
  • CELP code excitation linear predictive
  • some communication systems require lower coding rate, for example 4kbit/s or less.
  • the number of coded bits which are assigned to the excitation source parameters is smaller and the number of adaptive excitation codevectors stored in the adaptive excitation codebook and the number of stochastic excitation codevectors stored in the stochastic excited codebook become smaller. Consequently, the quality of the regenerated speech signal inevitably degrades at the lower coding rate as described above.
  • the adaptive excited codebook are adaptively renewed by synthetic codevectors of optimum adaptive excitation codevectors and stochastic excitation codevectors and, accordingly, it can be determined that the adaptive excitation codevectors are formed on the basis of the stochastic excitation codevectors. Therefore, the current CELP coding has a poor tracking capability for a voice signal having a nature of strong periodicity. Consequently, generated speech signal lacks clearness.
  • a speech coding and decoding system that attempts to realise a higher compression of speech information is described in EP 476614.
  • a sparse adaptive codebook is used in association with a time-reversed perceptual weighting filter.
  • the present invention is based upon the foregoing problems and an object of the present invention is to provide code excitation linear predictive coding encoder and decoder which can provide a high quality regenerated speech signal even when pulse-like noise components are contained in the input speech vectors.
  • Another object of the present invention is to provide code excitation linear predictive coding encoder and decoder which can provide high-quality regenerated speech signal even when a lower coding rate is employed.
  • a code excitation linear predictive coding apparatus which uses, as a speech excitation source information, excitation signals in the form of excitation codebook, wherein the apparatus is provided with a codevectors conversion circuit which converts the frequency characteristics of fixed codevectors such as stochastic excitation codevectors transmitted from the excitation codebook into the predetermined frequency characteristics at the time of output of the excitation codevectors.
  • a codevectors conversion circuit which converts the frequency characteristics of fixed codevectors such as stochastic excitation codevectors transmitted from the excitation codebook into the predetermined frequency characteristics at the time of output of the excitation codevectors.
  • the nearer the fixed codevectors frequency characteristics is set to the frequency characteristics of the input speech vectors the higher the quality of the synthetic speech vector is obtained and, moreover, an effective frequency component of the excitation codevectors becomes much larger than a quantization error vectors so that a masking effect of the quantization error vector can be obtained.
  • parameters of LPC linear predictive coefficient
  • optimum adaptive excitation code information which means pitch predictive information (which includes VQ gains) are used.
  • the codevectors conversion circuit controls the frequency characteristics of the stochastic excitation codevectors and so forth, in accordance with these information.
  • a code excitation linear predictive decoding apparatus which has codevectors conversion circuit which forces the fixed codevector frequency characteristics near to the input speech vector frequency characteristics in accordance with the respective code excitation linear predictive coding system.
  • A, B and E are constants which are determined in the range of 0 ⁇ A ⁇ 1, 0 ⁇ B ⁇ 1 and 0 ⁇ 1, respectively, and L represents a pitch lag.
  • the present invention provides a code excitation linear predictive coding or decoding apparatus which is provided, as an excitation codebook, with a adaptive excitation codebook and stochastic excitation codebook, in which pulse-like excitation codebook storing a pulse-like excitation codevector which consists of isolated impulse in addition to the adaptive excitation codebook and stochastic excitation codebook is provided so that the current CELP coding has a good tracking capability for a speech signal having a nature of strong periodicity. Thus, clear regenerated speech signal can be obtained.
  • excitation codevectors from the stochastic excitation codebook or pulse-like excitation codebook are selectively used, and this selected information is transmitted to the code excitation linear predictive decoder apparatus.
  • the excitation codevectors from the stochastic excitation codebook or pulse-like excitation codebook are selected in accordance with the information transmitted from the code excitation linear predictive coding apparatus.
  • the output of vocal tract parameters are assigned to be LSP (linear spectral pair) parameters and this linear spectral pair parameters are utilized for the speech regeneration in the code excitation linear predictive decoder so that the regeneration speech quality at the lower coding rate can be improved from a viewpoint of vocal tract parameters.
  • LSP linear spectral pair
  • the reasons for using LSP parameters as the vocal tract parameters reside in that an interpolation characteristics relative to the frequency characteristics of the vocal tract are improved, that the LSP parameters provides less distortion to the vocal tract spectral than LPC parameters even when the LSP parameters are coded by smaller number of code bits, and that an effective coding can be obtained by combination with vector quantization.
  • Fig. 1 which shows a code excitation linear predictive encoder (coding apparatus) according the a first embodiment of the present invention
  • an input speech vector S which has been inputted in each frame from an input terminal 101 is first transmitted to a vocal tract analysis circuit 102 to obtain a vocal tract parameter aj (linear predictive coefficient).
  • An LPC (linear predictive coefficient) quantization circuit 103 quantizes vocal tract predictive parameter aj and transmits its code Ic (quantized LPC code) to an LPC inverse-quantization circuit 104 and a multiplex circuit 106.
  • the LPC inverse-quantization circuit 104 serves to convert the LPC code Ic into vocal tract predictive parameter aqj and transmits the same to a synthesis filter 105.
  • a codevector conversion circuit 109 which has an impulse response of filter transfer function H(Z) represented by the following formula (3), performs convolutional computation with stochastic excitation codevector e sl from a stochastic excitation codebook 108, and transmits a converted stochastic excitation codevector e scl.
  • aqj represents an output of LPC inverse quantization circuit 104 and p represents vocal tract analysis order.
  • the adaptive excitation codevector e ai is multiplied by the gain ⁇ k by means of a multiplier 113 to produce a vector e aik and, on the other hand, the converted stochastic excitation codevector e scl is multiplied by the gain ⁇ k by means of a multiplier 114 to produce a vector e sclk.
  • An adder 115 adds the components of vector e alk and vector e sclk and produces an excitation codevector e.
  • the synthesis filter 105 calculates synthetic speech vector Sw corresponding to the excitation codevector e and transmits it to a subtracter 116.
  • the subtracter 116 performs the subtraction between the synthesized speech vector Sw and the input speech vector S, and the obtained error vector between Sw and S is transmitted to a perceptual weighting filter 111.
  • the perceptual weighting filter 111 transmits a perceptual weighting error vector ew corresponding to the error vector er to a perceptual weighting error calculation circuit 112.
  • the perceptual weighting error calculation circuit 112 calculates a mean square value of each component of the perceptual weighting error vector ew, and determines the excitation codevector (i.e., combination of i, l and k) to minimize the mean square error power of ew for the input speech vector at the present time. Indexes Ia, Is and Ig of each codebook at this moment are transmitted to each of the adaptive excitation codebook 107, stochastic excitation codebook 108, VQ gain codebook 110 and multiplex circuit 106.
  • the adaptive excitation codebook 107 outputs an optimum adaptive excitation codevector ea0 assigned by index Ia
  • the stochastic excitation codebook 108 outputs an optimum stochastic excitation codevector es0 assigned by index Is
  • the VQ gain codebook 110 transmits optimum VC gain ⁇ 0 and ⁇ 0 assigned by index Ig.
  • a codevector conversion circuit 109 converts the stochastic codevector es0 which has been transmitted from the stochastic excitation codebook in accordance with the index Is into an optimum converted stochastic excitation codevector e sc0 and then outputs it to the multiplier 114.
  • the optimum excitation codevector e 0 pt composed by the ea 0 , esc 0 , ⁇ 0 and ⁇ 0 is transmitted to the adaptive excitation codebook 107 and updates the content of the adaptive excitation codebook 107.
  • the multiplex circuit 106 multiplexes Ic, Ia, Is and Ig, as a total code C, and transmits it to the receiver through an output terminal 117.
  • Fig. 2 is a block diagram of a code excitation linear predictive decoder corresponding to the code excitation linear predictive encoder.
  • the total code C from an input terminal 201 is separated by a demultiplex circuit 212 into LPC code Ic, adaptive excitation code index Ia, stochastic excitation code index Is, and VQ gain code index Ig and they are transmitted, respectively, to LPC inverse quantization circuit 202, adaptive excitation codebook 204, stochastic excitation codebook 205 and VQ gain codebook 207.
  • the LPC inverse quantization circuit 202 converts the LPC code Ic into vocal tract predictive parameter aj and transmits to a synthesis filter 203.
  • the adaptive excitation codebook 204 outputs adaptive excitation codevector ea assigned by the index Ia
  • the stochastic excitation codebook 205 outputs a stochastic excitation codevector es assigned by the index Is
  • a VQ gain codebook 207 outputs excitation gains ⁇ and ⁇ , assigned by index Ig.
  • a codevector conversion circuit 206 converts the vector es into vector e sc and outputs it as similar as the aforementioned code excitation linear predictive coding apparatus (encoder).
  • the adaptive excitation codevector ea is multiplied by gain ⁇ by means of multiplier 208, and the vector e sc is multiplied by gain ⁇ by means of multiplier 209. These multiplied vector components are added by adder 210, and final excitation codevector e for synthesis filter is obtained.
  • a synthesis filter 203 calculates a synthesized speech vector S corresponding to the excitation codevector e and outputs to an output terminal 211. At the same time, the content of the adaptive excitation codebook 204 is updated by vector e.
  • This code excitation linear predictive encoder according the a second embodiment has the similar construction as that of the first embodiment except the codevector conversion circuit 109 and, therefore, an operational mode of the codevector conversion circuit 109 will be explained presently.
  • the codevector conversion circuit 109 which has an impulse response of filter transfer function H(Z) shown by the following formula (4) performs convolutional computation with the vector e sl and results in vector e scl.
  • H(Z) 1/(1- ⁇ Z -L )
  • is ⁇ ⁇ 1.0
  • L is a pitchlag obtained from index of the adaptive excitation code.
  • the index of the adaptive excitation code corresponds with the pitch lag index as below.
  • the convolutional processing of the aforementioned code excitation linear predictive coding apparatus are represented by the following formula (5), provided that the e sl is an output stochastic excitation codevector of the stochastic excitation codebook, e scl is a stochastic excitation codevector after the conversion, and h is an impulse response of conversion circuit.
  • e scl e sl X h wherein:
  • a transfer function composed of a vocal tract parameter, or a transfer function composed of the pitch lag can be used for the impulse response of code conversion circuit, alternatively, said two transfer functions can be cascaded to form the impulse response.
  • Fig. 3 is a block diagram of a code excitation linear predictive encoder according to the third embodiment of the invention.
  • this code excitation linear predictive encoder is primarily composed of a input speech process portion 301, optimum synthesized speech search portion 302 and multiplex circuit 303.
  • the input speech process 301 has LSP parameter analysis circuit 311, LSP parameter coding circuit 312, LSP parameter decoding circuit 313, LPC conversion circuit 314, perceptual weighting filter 315, synthesis filter zero input response generation circuit 316, perceptual weighting filter zero input response generation circuit 317, and subtracters 318 and 319.
  • digitalized discrete input speech vector series are stored as much as the time which corresponds to an analysis frame length for obtaining a vocal tract parameter and, this analysis frame length is separated into several subframes and processed by input speech processing portion 301.
  • the input speech vector is given to the LSP parameter analysis circuit 311, analyzed by the LSP analysis circuit 311, and converted to LSP parameter as vocal tract parameter.
  • This LSP parameter is coded (for example, to be vector quantized) by LSP parameter coding circuit 312 and given to the multiplex circuit 303 and transmitted to the code excitation linear decoder.
  • the coded LSP parameter is decoded (vector quantized) by LSP parameter decoding circuit 313 and converted to LPC by the LPC conversion circuit 314.
  • the thus converted LPC is used as a tap coefficient for perceptual weighting filter 315, synthesis filter zero input response generation circuit 316, perceptual weighting filter zero input generation circuit 317 and a synthesis filter 329 which will be described presently, and given also to a code vector conversion circuit 328.
  • the quantized LSP parameter is converted into LPC.
  • the input speech vector described above is given to the perceptual weighting filter 315 and after the weighing processing in consideration of human perceptual characteristics, the input speech vector is given to a subtracter 318 to be subtracted. Further, a zero input response vector in relation to a synthesis filter 329, is given for input of subtracter 318. Thus, a speech vector, from which an influence of the synthesis filter 329 in the immediately before analysis frame is excluded, is given to subtracter 319. Further, a zero input response vector in relation to a perceptual weighting filter 315, is given for input of subtracter 139. Thus, a speech vector, from which an influence of the weighted filter 315 in the immediately before analysis frame is obtained, is given to subtracter 330.
  • the optimum synthesizedtic speech search portion 302 serves to search a excitation source parameter in which the synthesis speech vector in the local reproduction is most similar to the target speech vector, and is composed of adaptive excitation codebook 320, stochastic excitation codebook 321, pulse-like excitation codebook 322, VQ gain codebook 323, VQ gain controllers 324 and 327, adder 325, fixed codebook selection switch 326, codevector conversion circuit 328, synthesis filter 329, subtracter 330, error power sum computing circuit 331 and code selection circuit 332.
  • Each of the adaptive excitation codebook 320, stochastic excitation codebook 321 and pulse-like excitation codebook 322 stores adaptive excitation codevector, which is a waveform code in relation to an excitation signal, stochastic excitation codevector and pulse-like excitation codevector, respectively, and VQ gain codebook 323 stores VQ gain code which is related to adaptive excitation codevector and fixed codevector (which generally represents stochastic excitation codevector and pulse-like excitation codevector).
  • the adaptive excitation code vector contributes to the voiced speech signal having stochastically periodicity, while the stochastic excitation codevector contributes to the unvoiced speech signal having stochastically less periodicity.
  • the adaptive excitation codevector of the adaptive excitation codebook 320 is adaptively updated as described presently.
  • the pulse-like excitation codevector is a waveform excitation codevector consisting of an unit impulse and is considered to contribute to the steady portion of the voiced speech signal having a strong periodicity.
  • the VQ gain code is vector-quantized, for example, and one component of the vector relates to VQ gain for adaptive excitation code vector and the other component relates to VQ gain for the fixed code vector.
  • Pulse-like excitation code vector is a periodic simple signal which can be generated by means of a pulse signal generating circuit but, it can preferably be generated by coding and reading out from the codebook 322 as this code excitation linear predictive encoder, the reason of which will be explained presently. Namely, it is easy to synchronize the excitation vector with an output from the adaptive excitation codebook 320.
  • the same processing for selecting the stochastic excitation codebook can be pulse-like excitation codevector search by constituting the excitation code vector to have the same codebook construction with the codebook 321.
  • the searching is carried out with respect to the adaptive excitation code, stochastic excitation code, pulse-like excitation code and VQ gain code, in turn, in this code excitation linear predictive encoder.
  • an output from the stochastic excitation codebook 321 and the pulse-like excitation codebook 322 are assigned to be zero (0), and the VQ gain controller 324 multiply a suitable value of VQ coefficient ("1", for example).
  • the adaptive excitation codebook 320 outputs all of the stored adaptive excitation code vector sequentially or in parallel, and gives it as an excitation code vector to the synthesis filter 329 through the VQ gain controller 324 and the adder 325.
  • the synthesis filter 329 carries out a convolutional computing relative to the excitation code vector, by utilizing, as a tap coefficient, the LPC which is given from the LPC conversion circuit 314, and a synthesized speech vectors, which are synthesized only by the content of the adaptive excitation code vector as the excitation source signal, are obtained with respect to all the adaptive excitation code vector.
  • the subtracter 330 obtains, with respect to all of the adaptive excitation code vector, an error vector between the synthesized speech vector on which only the content of the adaptive excitation code vector is effected and the target speech vector, and then gives it to an error power sum calculation circuit 331.
  • the error power sum calculation circuit 331 obtains square sum (error power sum) of the error vector, with respect to all the adaptive code vector, and gives it to a code selection circuit 332.
  • the code selection circuit 332 determines the the adaptive excitation code vector to minimize the error power sum.
  • a fixed codebook selection switch 326 is driven to the side of the stochastic excitation codebook 321 the output from adaptive excitation codebook is set to zero (0) or to the previously obtained optimum adaptive excitation code vector.
  • the stochastic excitation codebook 321 outputs sequentially or in parallel, all the stored stochastic excitation code vectors,and inputs them into the code vector conversion circuit 328 through the fixed codebook selection switch 326 and VQ controller 324.
  • the code vector conversion circuit 328 proceeds the conversion of the frequency characteristics of inputted stochastic excitation code vector so that it is moved to close to frequency characteristics of an input speech vector in correspondence with time-length of the stochastic excitation code vector. As described above, all the stochastic exited code vector with its frequency characteristics being conversion-processed is given, as an excitation code vector, to a synthetic filter 329. Thereafter, it is processed as similar as the searching of the optimum adaptive excitation code vector, and the code selection circuit 332 determines an optimum stochastic excitation code vector.
  • a searching of an optimum pulse-like excitation code vector is carried out.
  • the fixed codebook selection switch 326 is driven to the side of the pulse-like excitation codebook 322 the output from adaptive excitation codebook 326 is set to zero (0) or to the previously obtained optimum adaptive excitation code vector.
  • the pulse-like excitation codebook 322 outputs sequentially or in parallel, all the stored pulse-like excitation code vectors. Processings thereafter will be substantially similar with those of the moment when an optimum stochastic excitation code vector is searched and, accordingly, more detailed explanation will not be necessary.
  • the code selection circuit 332 compares the error power sum of the selected code vector in the stochastic excitation code vector search with the error power sum of the selected code vector in the pulse-like excitation code vector search to obtain smallest error power sum, and determin a fixed code to be transmitted to the code excitation linear predictive decoder.
  • VQ gain codebook 323 is composed of VQ gain for an adaptive excitation code vector and VQ gain for the fixed code vector.
  • the VQ gain for the adaptive excitation code vector is given to a VQ gain controller 324 and the VQ gain for the fixed code vector is given to a VQ gain controller 327.
  • both the VQ gain-controlled optimum adaptive excitation code vector and the optimum fixed code vector which have been processed with respect to a frequency characteristic operation and VQ gain control, are added by an adder 325 and then given to a synthesis filter as an excitation code vector.
  • This processing is carried out sequentially or in parallel, relative to all the VQ gain codes in the VQ gain codebook 323.
  • the code selection circuit 332 gives the indexes of these codes to a multiplex circuit 303 and, a fixed codebook selection switching information which one of the stochastic excitation code vector and the pulse-like excitation code vector is selected actually, is given to the multiplex circuit 303.
  • the multiplex circuit 303 multiplexes said indexes with LSP parameter given from the LSP parameter coding circuit 312 and transmits it to the code excitation linear predictive decoder.
  • the transmitted index is vector number.
  • the coding processings described above is repeated with respect of each subframe, and the coded speech information is transmitted in turn to the code excitation linear predictive decoder.
  • Fig. 5 shows in detail the specific structure of the code vector conversion circuit 328.
  • the code vector conversion circuit 328 has two cascaded filters 328a and 328b, and a pitch lag decision circuit 328c.
  • the fixed code vector is given to a first filter 328a.
  • An impulse response H1(Z) of the first filter 328a is set as shown by formula (6), by which the frequency conversion processing is carried out relative to the fixed vector.
  • H1(Z) (1- ⁇ A j ajZ -j ) / (1- ⁇ B j ajZ -j ) wherein aj(j is 1 to p) is a tap coefficient relative to a synthesis filter 329 which is supplied from the LPC conversion circuit 324, and p is vocal tract analysis order.
  • a and B are constants which are determined in the ranges of 0 ⁇ A ⁇ 1, and 0 ⁇ B ⁇ 1.
  • the code vector which was processed in its frequency characteristics by the first filter 328a is transmitted to the second filter 328b.
  • the pitch lag decision circuit 328c obtains a pitch lag L from the index of the optimum adaptive excitation code relative to the adaptive excitation codebook 320 and then gives the pitch lag L to the second filter 328b.
  • An impulse response H2(Z) of the second filter 328b is determined as shown by formula (7), by which a frequency ' conversion is carried out relative to the inputted fixed code vector.
  • H2(Z) 1/(1- ⁇ Z -L ) wherein ⁇ is a constant determined in the range of 0 ⁇ 1.
  • An output of the second filter 328b is given to VQ gain controller 327 shown in Fig. 3.
  • the frequency characteristics of inputted fixed code vector can be made closer to the frequency characteristics of the input speech vector, in accordance with a time length of the fixed code vector.
  • the code excited linear predictive coding apparatus (encoder) can provide a high quality regenerated speech signal.
  • Fig. 4 is a block diagram of code excitation linear predictive decoder which corresponds to the code excitation linear predictive coding apparatus (encoder) shown in Fig. 3.
  • the code excitation linear predictive decoder has demultiplex circuit 440, LSP parameter decoding circuit 441, LPC conversion circuit 442, adaptive excitation codebook 443, stochastic excitation codebook 444, pulse-like excitation codebook 445, VQ gain codebook 446, VQ gain controller 447, VQ gain controller 449, fixed codebook selection switch 448, code vector conversion circuit 450, adder 451 and synthesis filter 452.
  • the coded speech information given from the code excitation linear predictive encoder is inputted to the demultiplex circuit 440.
  • the demultiplex circuit 440 separates the coded speech information into LSP parameter code, index of the optimum adaptive excitation code, index of the optimum fixed code, index of the optimum VQ gain codebook and fixed code selection switch information.
  • LSP parameter code is given to the LSP parameter decoding circuit 441 and the index of the optimum adaptive excitation code is given to the adaptive excitation codebook 443. Further, the index of optimum VQ gain code is given to the VQ gain codebook 446 and the fixed codebook selection switch information is given to the fixed codebook selection switch 448.
  • the index of the optimum fixed code 443 is given to a pulse-like excitation codebook 445 or a stochastic excitation codebook 444 which are determined by the fixed code selection switching information.
  • the adaptive excitation codebook outputs an adaptive excitation code vector which is determined by a given index, and this adaptive excitation code vector is VQ gain-controlled through VQ gain controller 447 and given to an adder 451. Further, the adaptive excitation codebook 443 gives adaptive excitation code vector to a code vector conversion circuit 450.
  • the stochastic excitation codebook 444 or pulse-like excitation codebook 445 gives a stochastic excitation code vector or pulse-like excitation code vector, which corresponds to the given index, to a code vector conversion circuit 450 through'a fixed codebook selection switch 448.
  • the code vector conversion circuit 450 operates so that the frequency characteristics become closer to a frequency characteristics of the input speech vector in accordance with the index of the LPC and adaptive excitation code vector.
  • a specific structure of the code vector conversion circuit 450 will be the same as that of the structure shown in Fig. 5.
  • the frequency-processed fixed code vector is VQ gain-controlled by a VQ gain controller and then given to an adder 451.
  • the adder 451 adds the given adaptive excitation code vector and the fixed code vector together, and the added vector is assigned to be an excitation code vector, which is then given to a synthesis filter 452.
  • the synthesis filter 452 outputs a synthesized speech vector.
  • the code excitation linear predictive decoder conducts the above-described processes every time when a decoded speech vector is given or, in other words, for each subframe.
  • the LSP parameter is used and transmitted as a vocal tract parameter; pulse-like excitation codebook is provided for giving an excitation source parameter; and a frequency characteristic of fixed code vector is controlled.
  • the coding apparatus and decoding apparatus described above are related primarily to the forward-type code excitation linear predictive encoder and decoder, respectively, but the present invention is not limited thereto but applicable to backward-type code excitation linear predictive encoder and decoder, respectively.
  • the above-described encoder and decoder were intentionally designed under the technological basis for seeking to solve the problems induced from the low rate coding of 4-bit/s or less. However, more favorable sound reproduction can be realized if they are adapted to encoders and decoders of high rate coding. If the higher coding rate is allowable, both of the stochastic excitation codebook and pulse-like excitation codebook can be co-operated effectively rather than selectively operating either the stochastic excitation codebook or the pulse-like excitation codebook.
  • a frequency characteristic of actual excitation code vector is relatively close to that of an input speech vector and, in order to make it closer the frequency of the excitation code vector to a frequency of the input speech vector, the stochastic excitation code vector is convolutionaly computed with utilizing a specific impulse response. Thereafter, an adaptive excitation code vector is added to produce excitation code vector and, therefore, an excitation code vector which is well adaptive to an input speech vector by a small number of vector can be obtained and, at the same time, quantization error can be masked with conversion operation of an excitation code vector, thereby improving a reproduction quality.
  • pulse-like excitation codebook is disposed which stores therein pulse-like excitation code vector composed of unit impulse and, accordingly, a rapid tracking to a speech signal having periodicity can be realized, and a clear pulse-like excitation code vector can be formed at a steady portion of the speech signal.
  • the apparatus of the present invention can be adapted to low rate coding, and a favorably reproduced speech can be realized at the time , for example of a transitional period of the speech in which there are random signals and pulse-like signals together.
  • an excitation code vector is selected and used from either stochastic excitation codebook or pulse-like excitation codebook and, therefore, a favorable reproduction speech sound can be realized with the condition that the number of coded bit of the excitation source parameter is small.
  • the vocal tract parameter for sound synthecization is used as lSP parameter which gives less distortion to the vocal tract vector than LPC when it is coded with a smaller number of code bit and, therefore, reproduction quality at a lower coding rate can be improved from a vocal tract parameter viewpoint.
EP03013629A 1993-06-10 1993-06-10 CELP Kodierer und Dekodierer Expired - Lifetime EP1355298B1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
PCT/JP1993/000776 WO1994029965A1 (fr) 1993-06-10 1993-06-10 Codeur-decodeur predictif lineaire a excitation par codes
EP93913500A EP0654909A4 (de) 1993-06-10 1993-06-10 Celp kodierer und dekodierer.

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
EP93913500A Division EP0654909A4 (de) 1993-06-10 1993-06-10 Celp kodierer und dekodierer.

Publications (3)

Publication Number Publication Date
EP1355298A2 true EP1355298A2 (de) 2003-10-22
EP1355298A3 EP1355298A3 (de) 2004-02-04
EP1355298B1 EP1355298B1 (de) 2007-02-21

Family

ID=28459643

Family Applications (1)

Application Number Title Priority Date Filing Date
EP03013629A Expired - Lifetime EP1355298B1 (de) 1993-06-10 1993-06-10 CELP Kodierer und Dekodierer

Country Status (1)

Country Link
EP (1) EP1355298B1 (de)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009015944A1 (en) * 2007-07-30 2009-02-05 Global Ip Solutions (Gips) Ab A low-delay audio coder
RU2462769C2 (ru) * 2006-10-24 2012-09-27 Войсэйдж Корпорейшн Способ и устройство кодирования кадров перехода в речевых сигналах
US8463615B2 (en) 2007-07-30 2013-06-11 Google Inc. Low-delay audio coder
CN111818519A (zh) * 2020-07-16 2020-10-23 郑州信大捷安信息技术股份有限公司 一种端到端语音加密、解密方法及系统

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0462559A2 (de) * 1990-06-18 1991-12-27 Fujitsu Limited System zur Sprachcodierung und -decodierung
EP0476614A2 (de) * 1990-09-18 1992-03-25 Fujitsu Limited Sprachkodierungs- und Dekodierungssystem

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0462559A2 (de) * 1990-06-18 1991-12-27 Fujitsu Limited System zur Sprachcodierung und -decodierung
EP0476614A2 (de) * 1990-09-18 1992-03-25 Fujitsu Limited Sprachkodierungs- und Dekodierungssystem

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2462769C2 (ru) * 2006-10-24 2012-09-27 Войсэйдж Корпорейшн Способ и устройство кодирования кадров перехода в речевых сигналах
US8401843B2 (en) 2006-10-24 2013-03-19 Voiceage Corporation Method and device for coding transition frames in speech signals
WO2009015944A1 (en) * 2007-07-30 2009-02-05 Global Ip Solutions (Gips) Ab A low-delay audio coder
EP2023339A1 (de) * 2007-07-30 2009-02-11 Global IP Solutions (GIPS) AB Audiodekoder mit geringer Verzögerung
US8463615B2 (en) 2007-07-30 2013-06-11 Google Inc. Low-delay audio coder
CN111818519A (zh) * 2020-07-16 2020-10-23 郑州信大捷安信息技术股份有限公司 一种端到端语音加密、解密方法及系统
CN111818519B (zh) * 2020-07-16 2022-02-11 郑州信大捷安信息技术股份有限公司 一种端到端语音加密、解密方法及系统

Also Published As

Publication number Publication date
EP1355298B1 (de) 2007-02-21
EP1355298A3 (de) 2004-02-04

Similar Documents

Publication Publication Date Title
US5727122A (en) Code excitation linear predictive (CELP) encoder and decoder and code excitation linear predictive coding method
US8364473B2 (en) Method and apparatus for receiving an encoded speech signal based on codebooks
US5729655A (en) Method and apparatus for speech compression using multi-mode code excited linear predictive coding
US5142584A (en) Speech coding/decoding method having an excitation signal
US5778334A (en) Speech coders with speech-mode dependent pitch lag code allocation patterns minimizing pitch predictive distortion
US5140638A (en) Speech coding system and a method of encoding speech
US6594626B2 (en) Voice encoding and voice decoding using an adaptive codebook and an algebraic codebook
US6023672A (en) Speech coder
KR20010024935A (ko) 음성 코딩
US5659659A (en) Speech compressor using trellis encoding and linear prediction
EP1162604B1 (de) Sprachkodierer hoher Qualität mit niedriger Bitrate
EP1005022B1 (de) Verfahren und Vorrichtung zur Sprachkodierung
CA2090205C (en) Speech coding system
US5797119A (en) Comb filter speech coding with preselected excitation code vectors
EP1355298B1 (de) CELP Kodierer und Dekodierer
US5884252A (en) Method of and apparatus for coding speech signal
EP0855699B1 (de) Mehrimpuls-angeregter Sprachkodierer/-dekodierer
US7076424B2 (en) Speech coder/decoder
WO1994029965A1 (fr) Codeur-decodeur predictif lineaire a excitation par codes

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20030707

AC Divisional application: reference to earlier application

Ref document number: 0654909

Country of ref document: EP

Kind code of ref document: P

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): DE FR GB SE

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): DE FR GB SE

AKX Designation fees paid

Designated state(s): DE FR GB SE

17Q First examination report despatched

Effective date: 20050502

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AC Divisional application: reference to earlier application

Ref document number: 0654909

Country of ref document: EP

Kind code of ref document: P

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): DE FR GB SE

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REF Corresponds to:

Ref document number: 69334115

Country of ref document: DE

Date of ref document: 20070405

Kind code of ref document: P

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20070521

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20071122

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20120607

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20120606

Year of fee payment: 20

Ref country code: FR

Payment date: 20120619

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 69334115

Country of ref document: DE

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20130609

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20130609

Ref country code: DE

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20130611