EP0481895A2 - Verfahren und Einrichtung zur Übertragung mit niedriger Bitrate einer Sprachsignals mittels CELP-Codierung - Google Patents

Verfahren und Einrichtung zur Übertragung mit niedriger Bitrate einer Sprachsignals mittels CELP-Codierung Download PDF

Info

Publication number
EP0481895A2
EP0481895A2 EP91402774A EP91402774A EP0481895A2 EP 0481895 A2 EP0481895 A2 EP 0481895A2 EP 91402774 A EP91402774 A EP 91402774A EP 91402774 A EP91402774 A EP 91402774A EP 0481895 A2 EP0481895 A2 EP 0481895A2
Authority
EP
European Patent Office
Prior art keywords
vector
vectors
dictionary
values
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP91402774A
Other languages
English (en)
French (fr)
Other versions
EP0481895B1 (de
EP0481895A3 (en
Inventor
Renaud Di Francesco
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Orange SA
Original Assignee
France Telecom SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom SA filed Critical France Telecom SA
Publication of EP0481895A2 publication Critical patent/EP0481895A2/de
Publication of EP0481895A3 publication Critical patent/EP0481895A3/fr
Application granted granted Critical
Publication of EP0481895B1 publication Critical patent/EP0481895B1/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/083Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being an excitation gain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0007Codebook element generation

Definitions

  • the invention relates to a method of transmission, at low speed, by CELP coding of a speech signal and to the corresponding system.
  • CELP Code Excited Linear Prediction
  • This technique for coding digital samples representing the speech signal is a hydride coding technique in which the speech signal is modeled by linear prediction filters and residues of this prediction.
  • CELP elbows as shown schematically in FIGS. 1a and 1b, exhaustively test all the elements of a list of waveforms. The waveform providing the best synthesis of the signal is retained, and its index, or characteristic address, is transmitted to the decoder. This method is called synthetic analysis.
  • the list of waveforms stored in the encoder and the decoder is called a dictionary.
  • CELP coder The quality of a CELP coder depends closely on the dictionary chosen, on the method of determination-modeling of the linear prediction filters used, these two parameters constituting two degrees of freedom, not independent, making it possible to adapt a particular CELP coder to the needs of 'a specific application.
  • Such a CELP coding technique is suitable for low bit rate coding applications (between 4 to 24 kbit / s).
  • bit rate coding applications between 4 to 24 kbit / s.
  • each block comprising L values is considered to be a vector of a vector space of dimension L.
  • the current excitation signal constituted by a vector v, read in the dictionary of waveforms, must minimize a criterion of perceptual distortion of the form: min ⁇ x - Hv ⁇ 2, in which x denotes a target signal originating from the original signal O to be transmitted after perceptual weighting and H denotes a matrix of dimension LxL of impulse response derived from the product of the transfer functions of the synthesis filter and the perceptual weighting.
  • each reference vector vi is associated with an adaptive gain value gk taken from a dictionary of gain values G, which allows, following application of the gain gk to the vector vi to form a vector vk, i, to satisfy the aforementioned minimum distortion criterion.
  • Such an operating mode therefore does not make it possible to take into account, as a reference vector, the totality of the possibilities of combinations of the ternary values of the components of the reference vectors, the minimization of the distortion criterion not being able in all cases to be optimal.
  • the object of the present invention is to remedy the aforementioned drawbacks, in order in particular to bring about a simplification of the calculations by the introduction as reference vector, in the dictionary of reference vectors, or directions, of almost all the combinations of n-ary values of the components of the vectors, n being an odd number.
  • Another object of the present invention is the implementation, prior to the conventional process of applying an adaptive gain to each of the reference vectors, of a correction process by application of a scale factor, introducing the division of the energy of the excitation signal as a function of the frequency spectrum thereof, in order to take into account the non-uniformity of the energy distribution of the signal in the frequency domain.
  • Another object of the present invention is finally the implementation of a low-speed transmission method of a speech signal in which each reference vector, constituting the excitation signal, can be regenerated at the level of a decoder based solely on index or address values of the optimal reference vector satisfying the minimum distortion criterion at the level of the coder, which has the effect of considerably simplifying and reducing the manufacturing costs of the aforementioned decoders.
  • the method for transmitting a low bit rate speech signal comprises a process for coding digital speech samples by code-excited linear prediction to generate a code signal, a process for transmitting the code signal and a process of decoding the received code signal.
  • the coding process corresponds to a process in which a waveform represented by a block of samples comprising L sample values and constituting an initial vector (o) of dimension L is represented, from a synthesis filter , by a reference waveform selected from a dictionary of reference waveforms each forming a reference vector (v) on criterion of minimum quadratic deviation of the initial vector (o) with respect to the form wave or reference vector (v), min ⁇ x - Hv ⁇ 2, where x represents a target vector obtained by perceptual weighting of the initial vector (o) and H a matrix of dimension LxL of impulse response resulting from the product of the filter synthesis and linear perceptual weighting.
  • the value n / 2 corresponds to the integer division of n by 2.
  • H. ⁇ i.yi> and of all the perceptual energies ⁇ Hy ⁇ 2, which makes it possible to attribute to the initial vector (o) the corresponding optimal reference vector vk *, i *, with vk *, i * gk * . ⁇ i * .yi *, this optimal reference vector being represented by the only values of index k *, i * satisfying the criterion min ⁇ x - gk.H. ⁇ i.yi ⁇ 2.
  • the process of transmission at low speed of a speech signal consists in transmitting, as code signal, the only values of the indices k *, i * representative of each optimal reference vector vk *, i * .
  • the process of decoding a coded speech signal transmitted at low bit rate according to a code signal is remarkable in that, in order to ensure the decoding of the code signal, this process consists in discriminating the values of the indices k *, i * constituting the code signal, in decomposing the value of the index i *, representative of the optimal reference vector, in base n to regenerate the corresponding base vector yi *, carry out, from the value of the index i *, the scale factor ⁇ i * and the corresponding adaptive gain gk *, a correction of the corresponding regenerated base vector to constitute the regenerated reference vector vk *, i *.
  • a synthesis filtering operation is performed on the regenerated reference vector vk *, i * to generate the reconstructed speech signal.
  • the method which is the subject of the invention comprises a process for coding digital speech samples by linear prediction excited by codes. This process generates a code signal.
  • the method further includes a process for transmitting the code signal and a process for decoding the received code signal.
  • the process of coding corresponds to a process in which a waveform represented by a block of samples comprising L values of samples, or frames, constitutes an initial vector denoted o of dimension L this vector being represented, and the corresponding waveform , from a synthesis filter by a reference waveform, denoted v, selected from a dictionary of reference waveforms each forming an aforementioned reference vector.
  • the selection is made on the criterion of minimum quadratic deviation of the initial vector o vis-à-vis the waveform or reference vector v, this criterion written: min ⁇ x -Hv ⁇ 2
  • x represents a target vector obtained by perceptual weighting of the initial vector o and H represents a matrix of dimension LxL of impulse response resulting from the product of the synthesis filter and from the aforementioned linear perceptual weighting.
  • the coding process is such that the selection criterion consists in establishing a dictionary factored into the product of a first dictionary Y of basic vectors denoted yi.
  • Each basic vector is a basic vector of n-ary form, that is to say that the components aj of these basic vectors, with j ⁇ [0, L-1], can take n different discrete values.
  • each value of the components aj can take a value included in the group [-n / 2, ... 0, ... n / 2] in increments of 1, n being odd, n / 2 representing the integer division of n by 2.
  • each basic vector yi is corrected by a scale factor ⁇ i taking into account the distribution of the excitation energy in the frequency domain of the signal.
  • the scale factors ⁇ i are determined from a database, experimentally, the database being constituted by recording significant speech samples over several hours for example and for several speakers of the same language of expression or of several languages distinct, experience showing that the diversity of languages of expression only intervenes at the second level in the determination of the aforementioned ⁇ i scale factors.
  • the scale factors ⁇ i are determined for each corresponding basic vector yi by a process of identification of each basic vector ⁇ i on a delocalized sequence of L successive recursive speech samples from the database , sorting the weakest adaptation coefficients and averaging a number u of identification or adaptation coefficients to obtain the corresponding scale factor ⁇ i associated with the aforementioned basic vector yi.
  • the factorized dictionary previously mentioned also consists of a second dictionary constituting the aforementioned product, this second dictionary being denoted G (y) and being formed by a gain dictionary gk.
  • this optimal reference vector is represented by the only values of the parameters of indices k *, i * satisfying the aforementioned criterion: min ⁇ x-gk.H. ⁇ i. yi ⁇ 2.
  • the minimum value of the quadratic difference min ⁇ x-gk.H. ⁇ i. yi ⁇ 2 is evaluated by selecting the corresponding gain element gk from the second dictionary G (y) making it possible to minimize the difference
  • where g checks the relation: g ⁇ x
  • the dictionary Y of basic vectors yi of n-ary form [-n / 2, ..., 0, ... n / 2] of dimension L includes all the basic vectors whose L components have for value the aforementioned n-areas values, with the exception of the null vector.
  • the index i of the basic vectors is taken equal to the value in base n of each base vector after transcoding the values ⁇ -n / 2 ..., 0 ... n / 2 ⁇ into corresponding values (0,1,2 ... n). It will thus be understood that the basic vectors yi of n-ary form are arranged as a function of their index i, this index i having for value the value in base n of each vector.
  • the set of basic vectors yi constituting the dictionary Y is defined from n / 2 ⁇ L impulse vectors of which only one component aj of order j, with j ⁇ [0, L-1], is equal to -1, -2, ... -n / 2.
  • FIGS. 3a and 3b operator cells have been shown respectively making it possible to generate, from the previously defined pulse vectors and from sub-dictionaries constituted by the pulse vector considered and the related vectors corresponding to each pulse vector, the complete dictionary comprising the collection of all of the sub-dictionaries.
  • Each operator as represented in FIG. 3a comprises an operator called delay operator R whose transfer function is denoted Z+1, according to the classical notation of transform into Z, a symmetrizer operator denoted Sy which has the function of multiplying the components of all vectors presented at its entry by the value +1, by the value 0 then by the value -1 and a summator, noted S, receiving the output of the delay operator R and of the symmetrizer Sy.
  • the summator S receives the output of the delay operator R via a switch 1, in position F, or the zero vector [0,0,0,0,0] of dimension L in position O.
  • the operators represented in FIG. 3a are constituted by a single operator represented in 1), 2) and 3) at different stages of the processing process to generate the basic vectors yi of the abovementioned dictionary Y.
  • the initial pulse or pulse vector ⁇ L-1 is present at the input of the delay operator R.
  • Le symmetrizer Sy is then supplied by a noted sub-dictionary DO ⁇ which is initially formed by the aforementioned pulse vector ⁇ L-1.
  • the pulse vector ⁇ L-2 is associated with the sub-dictionary D1 formed by the related vectors y1, y2, y3 with the pulse vector ⁇ L-2 and by the pulse vector initial L-1 initial forming the basic vector y0, as well as the null vector.
  • the pulse vector initial L-1 initial forming the basic vector y0, as well as the null vector.
  • the operator making it possible to generate the basic vectors yi is such that the latter receives the pulse vector at the level of the delay operator R ⁇ Lm, at the level of the symmetrizor Sy, the dictionary denoted D m-1 formed recursively like the dictionary D1, the summator S as represented in point 2) of the same figure 3a then delivering from the impulse vector ⁇ Lm-1 above delivered by the delay operator R or the zero vector and by the sub-dictionary D m-1 ⁇ the sub-dictionary D m.
  • the * represented at the level of the aj components for the level m processing process correspond to values 0, -1 or +1 when the vectors are ternary vectors.
  • the * represent values between -n / 2 and + n / 2, under the conditions previously mentioned.
  • the total ternary dictionary, sum or union of all the sub-dictionaries of intermediate level m, up to L can be obtained for the only positive or negative values of the components aj, the total dictionary then being able to be obtained by symmetrization by through a symmetrization operator such as Sy.
  • this operator is such that the impulse responses of the system H at the relative time 0, 1, 2, L-1, ie the values h0, h1, hL -2, hL-1 are applied to the above operator.
  • the symmetrization operator Sy multiplies the elements of S L-1 (Dm-1) by +1, 0, -1 and realizes, as described above, the union of the distinct elements obtained.
  • the elementary detripling cell is represented in FIG. 5b from the pulse vectors denoted ⁇ -1, ⁇ 0 and ⁇ 1. It will be noted that the summation of the pulse vectors ⁇ 1, ⁇ 0, ⁇ -1 amounts to replacing the last coordinate of the incident base vector by the component values +1, 0 or -1.
  • FIGS. 5a and 5b the architecture as represented in FIGS. 5a and 5b is that of a linear structure of ternary graphs. For an n-ary structure we get an n-ary graph.
  • the global graph for obtaining the energies is traversed from right to left, the initial energy E (0) being at SL-1 (0) 2.
  • the elementary cell constituting the graph represented in FIG. 5c is represented in FIG. 5d.
  • each reference vector vk *, i * can advantageously be weighted by a predicted level factor, noted ⁇ .
  • This predicted level factor ⁇ is representative of the average energy of the excitation signal estimated on at least three successive previous excitation vectors.
  • the previous expression is then calculated by filtering the expression 2 x ⁇ by the matrix transposed from the matrix H or t H.
  • the calculation process as represented by the operator in FIG. 6 allows, in a manner analogous to the calculation of the partial responses S L-1 (yi) previously described, to obtain the quantities x′0, x′Lm-1, x′L-2 and therefore the abovementioned scalar products, the zero vector being replaced by the zero value.
  • each scale factor ⁇ i can be determined from a plurality N of frames, from a speech signal database, the scale factor ⁇ i for each base vector yi being chosen so as to make the filter residue of the aforementioned frames minimum for the frame considered. It will be recalled that several processes for determining each scale factor ⁇ i can be considered.
  • the transmission of speech at low speed is carried out by the sole transmission, as a code signal, of the values of the indices k * and i * representative of each reference vector vk *, i *.
  • the transmission can be carried out using conventional transmission protocols in which a redundancy of the information transmitted is introduced in order to ensure transmission at a rate substantially zero error.
  • the value i * can be transmitted either in direct numbering either in retrograde numbering, or according to a translated numbering whose translation table is known to both the coder and the decoder.
  • the decoding process consists in discriminating in 1000 the values of the indices k * and i * constituting the code signal and then in decomposing into 1001 the value of the index i * representative of the optimal reference vector in base n in order to regenerate the corresponding base vector yi *.
  • the decoding process consists in carrying out a synthesis filtering operation 1003 of the reference vector to generate the reconstructed speech signal.
  • each reference vector vk *, i * before the synthesis filtering is weighted by a level factor predicts ⁇ which is estimated on at least three successive previous excitation vectors.
  • the determination of the predicted level a will not be described in detail since it corresponds to the level of the decoding process with operations normally known to those skilled in the art.
  • the coding circuit comprises a generator 1 of a first dictionary Y of basic vectors yi of n-ary shape of dimension L, the components of these vectors, as mentioned previously, being able to take the values between -n / 2 to n / 2.
  • the generator of the dictionary Y can advantageously be constituted by calculating means comprising the operators as described in FIGS. 3a, 3b for example and / or a storage circuit which can be constituted by a random access memory associated with this circuit computer or by ROM.
  • the read only memory is associated with a fast sequencer which makes it possible to carry out a successive reading of the basic vectors yi according to the indices in direct or retrograde numbering as described previously.
  • the coding circuit as represented in FIG. 8 comprises a circuit 2 correcting the basic vectors yi by a scale factor ⁇ i.
  • a fast multiplexer denoted MUX makes it possible successively to read the corresponding values of the corrected base vector yi ⁇ and to deliver this corresponding value to a circuit 3 generator of a second adaptive gain dictionary gk.
  • the generator circuit 3 of the second dictionary G (y) may advantageously include an amplifier circuit, denoted 30, connected to a table of values gk constituting the aforementioned second dictionary.
  • the coding circuit object of the present invention also comprises an amplifier circuit 4 which makes it possible to apply to each reference vector vk, i the level prediction coefficient ⁇ as it has been defined previously in the description.
  • the coding circuit which is the subject of the present invention then comprises in cascade the synthesis filter denoted 5 and the perceptual weighting filter denoted 6 by transmission H as described previously in the description.
  • a summator 7 makes it possible to receive on the one hand the original signal via the same perceptual weighting filter 6 after inversion the difference of the signals delivered by the algebraic summator 7, allowing the application to the signal thus obtained from the minimum distortion criterion.
  • the coding circuit which is the subject of the present invention comprises a circuit for calculating the minimum distortion 8 which comprises a first circuit 80 for calculating the product.
  • a circuit for calculating the minimum distortion 8 which comprises a first circuit 80 for calculating the product.
  • the first calculator circuit 80 delivers a first calculation result r1.
  • a second calculating circuit 81 makes it possible to calculate the energy of the reconstructed vector and perceptually weighted this energy being of the form gk2 ⁇ H. ⁇ i.yi ⁇ 2.
  • the computer circuits 80 and 81 can be constituted by program modules whose calculation graphs have been explained respectively in FIGS. 4 and 5 a) to d) respectively.
  • the second calculation circuit 81 delivers a second result of calculation noted r2.
  • a comparator 83 makes it possible to compare the value of the calculation results r1 and r2 which makes it possible to determine by discrimination of the values of the indices i and k, the indices i * and k * for which the criterion of minimum of the quadratic difference is satisfied .
  • the discrimination of the indices i * and k * is carried out for example by a sorting program noted 84 in FIG. 8.
  • the values of the indices k * and i * are then delivered, these indices being representative of the corresponding reference vector vk *, i *.
  • FIG. 8 also shows the transmission circuit according to the subject of the present invention, this transmission circuit making it possible to deliver as a code signal representative of the speech signal the only values of the indices k * and i * .
  • This transmission circuit does not have any particular characteristic insofar as it can in fact be constituted by a transmission system of conventional type used in the devices for transmitting speech signals by coding of CELP type of the prior art.
  • FIG. 9 A more detailed description of a decoding circuit allowing the implementation of the method which is the subject of the invention is shown in FIG. 9.
  • the decoding circuit comprises a module 10 for discriminating the values of the indices i *, k * of the code signal received, the code signal being of course transmitted according to a particular protocol which does not enter into the object of the present invention.
  • the discrimination circuit 10 thus performing a parallel series transformation of the information relating to the indices i *, k *
  • the decoding circuit comprises a decomposition circuit in base n of the value of the index i *.
  • the decoding circuit as shown in FIG. 9 includes a table of adaptive gain values gk denoted 11 which on reception of the value of the index k * makes it possible to deliver the corresponding adaptive gain value gk *.
  • This circuit 11 can advantageously consist of a read-only memory in which the adaptive gain values gk are stored.
  • a generator circuit 12 of the scale factor ⁇ i * is provided.
  • This circuit can consist of a read only memory forming a look-up table, which with the value i * matches the value ⁇ i *.
  • the decoding circuit comprises a circuit 13 generating the regenerated base vector by decomposition into base n of the value of the index i *.
  • a circuit 14 corresponds to the value i * by transcoding the components in base n of the value of index i *, the value ⁇ -n / 2, ..., 0, ... n / 2 ⁇ , which makes it possible to generate a regenerated reference vector vk *, i * of the product of the vector regenerated base and product A.
  • a synthesis filter 15 allows starting from the regenerated reference vector to generate the reconstructed speech signal.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
EP91402774A 1990-10-19 1991-10-17 Verfahren und Einrichtung zur Übertragung mit niedriger Bitrate eines Sprachsignals mittels CELP-Codierung Expired - Lifetime EP0481895B1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR9012980A FR2668288B1 (fr) 1990-10-19 1990-10-19 Procede de transmission, a bas debit, par codage celp d'un signal de parole et systeme correspondant.
FR9012980 1990-10-19

Publications (3)

Publication Number Publication Date
EP0481895A2 true EP0481895A2 (de) 1992-04-22
EP0481895A3 EP0481895A3 (en) 1992-08-12
EP0481895B1 EP0481895B1 (de) 1997-12-10

Family

ID=9401407

Family Applications (1)

Application Number Title Priority Date Filing Date
EP91402774A Expired - Lifetime EP0481895B1 (de) 1990-10-19 1991-10-17 Verfahren und Einrichtung zur Übertragung mit niedriger Bitrate eines Sprachsignals mittels CELP-Codierung

Country Status (5)

Country Link
US (1) US5226085A (de)
EP (1) EP0481895B1 (de)
JP (1) JP3130348B2 (de)
DE (1) DE69128407T2 (de)
FR (1) FR2668288B1 (de)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2658794B2 (ja) * 1993-01-22 1997-09-30 日本電気株式会社 音声符号化方式
AU675322B2 (en) * 1993-04-29 1997-01-30 Unisearch Limited Use of an auditory model to improve quality or lower the bit rate of speech synthesis systems
FR2722631B1 (fr) * 1994-07-13 1996-09-20 France Telecom Etablissement P Procede et systeme de filtrage adaptatif par egalisation aveugle d'un signal telephonique numerique et leurs applications
JP3058028B2 (ja) * 1994-10-31 2000-07-04 三菱電機株式会社 画像符号化データ再符号化装置
SE504010C2 (sv) * 1995-02-08 1996-10-14 Ericsson Telefon Ab L M Förfarande och anordning för prediktiv kodning av tal- och datasignaler
DE19516106C2 (de) * 1995-05-05 2003-04-03 Philips Corp Intellectual Pty Verfahren zum Bestimmen von Referenzwerten
US6477496B1 (en) 1996-12-20 2002-11-05 Eliot M. Case Signal synthesis by decoding subband scale factors from one audio signal and subband samples from different one
US6782365B1 (en) 1996-12-20 2004-08-24 Qwest Communications International Inc. Graphic interface system and product for editing encoded audio data
US5864813A (en) * 1996-12-20 1999-01-26 U S West, Inc. Method, system and product for harmonic enhancement of encoded audio signals
US6463405B1 (en) 1996-12-20 2002-10-08 Eliot M. Case Audiophile encoding of digital audio data using 2-bit polarity/magnitude indicator and 8-bit scale factor for each subband
US6516299B1 (en) 1996-12-20 2003-02-04 Qwest Communication International, Inc. Method, system and product for modifying the dynamic range of encoded audio signals
US5845251A (en) * 1996-12-20 1998-12-01 U S West, Inc. Method, system and product for modifying the bandwidth of subband encoded audio data
US5864820A (en) * 1996-12-20 1999-01-26 U S West, Inc. Method, system and product for mixing of encoded audio signals
FI113571B (fi) 1998-03-09 2004-05-14 Nokia Corp Puheenkoodaus
US7536298B2 (en) * 2004-03-15 2009-05-19 Intel Corporation Method of comfort noise generation for speech communication
JP4993676B2 (ja) * 2006-09-01 2012-08-08 キヤノン株式会社 画像符号化装置及び画像符号化方法
CN101430879B (zh) * 2007-11-05 2011-08-10 华为技术有限公司 一种多速率语音频编码的方法
WO2011074233A1 (ja) * 2009-12-14 2011-06-23 パナソニック株式会社 ベクトル量子化装置、音声符号化装置、ベクトル量子化方法、及び音声符号化方法

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0379296A2 (de) * 1989-01-17 1990-07-25 AT&T Corp. Linearer Prädiktivkodierer mit Code-Anregung für Sprach- oder Audiosignale mit niedriger Verzögerung
WO1991013432A1 (en) * 1990-02-23 1991-09-05 Universite De Sherbrooke Dynamic codebook for efficient speech coding based on algebraic codes

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL8302985A (nl) * 1983-08-26 1985-03-18 Philips Nv Multipulse excitatie lineair predictieve spraakcodeerder.
NL8500843A (nl) * 1985-03-22 1986-10-16 Koninkl Philips Electronics Nv Multipuls-excitatie lineair-predictieve spraakcoder.
US4944013A (en) * 1985-04-03 1990-07-24 British Telecommunications Public Limited Company Multi-pulse speech coder
IT1195350B (it) * 1986-10-21 1988-10-12 Cselt Centro Studi Lab Telecom Procedimento e dispositivo per la codifica e decodifica del segnale vocale mediante estrazione di para metri e tecniche di quantizzazione vettoriale
US4868867A (en) * 1987-04-06 1989-09-19 Voicecraft Inc. Vector excitation speech or audio coder for transmission or storage
US4910781A (en) * 1987-06-26 1990-03-20 At&T Bell Laboratories Code excited linear predictive vocoder using virtual searching
US4899385A (en) * 1987-06-26 1990-02-06 American Telephone And Telegraph Company Code excited linear predictive vocoder
US5091946A (en) * 1988-12-23 1992-02-25 Nec Corporation Communication system capable of improving a speech quality by effectively calculating excitation multipulses
US4980916A (en) * 1989-10-26 1990-12-25 General Electric Company Method for improving speech quality in code excited linear predictive speech coding

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0379296A2 (de) * 1989-01-17 1990-07-25 AT&T Corp. Linearer Prädiktivkodierer mit Code-Anregung für Sprach- oder Audiosignale mit niedriger Verzögerung
WO1991013432A1 (en) * 1990-02-23 1991-09-05 Universite De Sherbrooke Dynamic codebook for efficient speech coding based on algebraic codes

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Academic Publishers, Dordrecht, NL; R.A. SALAMI: "Binary pulse excitation: a novel approach to low complexity CELP coding" *
ADVANCED IN SPEECH CODING (IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, Vancouver, 5 - 8 septembre 1989), pages 145-156, Kluwer Academic Publishers, Dordrecht, NL; R.A. SALAMI: "Binary pulse excitation: a novel approach to low complexity CELP coding" *

Also Published As

Publication number Publication date
EP0481895B1 (de) 1997-12-10
FR2668288B1 (fr) 1993-01-15
DE69128407T2 (de) 1998-06-04
JPH04264500A (ja) 1992-09-21
JP3130348B2 (ja) 2001-01-31
FR2668288A1 (fr) 1992-04-24
US5226085A (en) 1993-07-06
DE69128407D1 (de) 1998-01-22
EP0481895A3 (en) 1992-08-12

Similar Documents

Publication Publication Date Title
EP0481895B1 (de) Verfahren und Einrichtung zur Übertragung mit niedriger Bitrate eines Sprachsignals mittels CELP-Codierung
EP0608174B1 (de) System zur prädiktiven Kodierung/Dekodierung eines digitalen Sprachsignals mittels einer adaptiven Transformation mit eingebetteten Kodes
CA2266654C (fr) Procede et dispositif d'egalisation aveugle des effets d'un canal de transmission sur un signal de parole numerique
EP0511095B1 (de) Verfahren und Vorrichtung zur Kodierung und Dekodierung eines numerischen Signals
EP1994531A2 (de) Verbesserte verschlüsselung/entschlüsselung eines digitalen tonsignals im celp-verfahren
EP0428445B1 (de) Verfahren und Einrichtung zur Codierung von Prädiktionsfiltern in Vocodern mit sehr niedriger Datenrate
FR2690551A1 (fr) Procédé de quantification d'un filtre prédicteur pour vocodeur à très faible débit.
EP0195441B1 (de) Verfahren zur Sprachcodierung mit niedriger Bitrate unter Verwendung eines Mehrimpulsanregungssignals
FR2702075A1 (fr) Procédé de génération d'un filtre de pondération spectrale du bruit dans un codeur de la parole.
EP0692883A1 (de) Verfahren zur blinden Entzerrung, und dessen Anwendung zur Spracherkennung
EP0673113A1 (de) Signalquellencharakterisiersystem
CA2108663C (fr) Procede et dispositif de filtrage pour la reduction des preechos d'un signal audio-numerique
EP0574288B1 (de) Verfahren und Einrichtung zur Übertragungsfehlerverdeckung von mittels einer Frequenztransformation kodierten digitalen Audiosignalen
EP0341129A1 (de) Verfahren und Einrichtung zur Kodierung von Sprachsignalenergie in Vokodern mit sehr niedriger Bitrate
EP0347307B1 (de) Kodierungsverfahren und linearer Prädiktionssprachkodierer
EP1192619B1 (de) Audio-kodierung, dekodierung zur interpolation
EP1194923B1 (de) Verfahren und system für audio analyse und synthese
FR2751776A1 (fr) Procede d'extraction de la frequence fondamentale d'un signal de parole
EP3934282A1 (de) Verfahren zur umwandlung eines ersten satzes repräsentativer signale eines schallfelds in einen zweiten satz von signalen und entsprechende elektronische vorrichtung
EP1192618B1 (de) Audiokodierung mit adaptiver lifterung
WO2010076412A2 (fr) Procede et un dispositif d'estimation de signaux de source issus d'un signal de melange
EP1605440A1 (de) Verfahren zur Quellentrennung eines Signalgemisches
EP0812070A1 (de) Verfahren und Gerät zur Kompressionskodierung eines digitalen Signals
WO2002029786A1 (fr) Procede et dispositif de codage segmental d'un signal audio
EP0696110A1 (de) Verfahren und Gerät zur Tonkodierung und Tonkodierung durch Frequenzkompression, insbesondere zur Anwendung in einem Massentonspeicher

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): DE GB

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): DE GB

17P Request for examination filed

Effective date: 19920910

17Q First examination report despatched

Effective date: 19950606

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): DE GB

REF Corresponds to:

Ref document number: 69128407

Country of ref document: DE

Date of ref document: 19980122

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 19980209

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed
REG Reference to a national code

Ref country code: GB

Ref legal event code: IF02

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20080926

Year of fee payment: 18

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20081030

Year of fee payment: 18

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100501

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20091017