EP0801788A1 - Verfahren zur sprachkodierung mittels analyse durch synthese - Google Patents

Verfahren zur sprachkodierung mittels analyse durch synthese

Info

Publication number
EP0801788A1
EP0801788A1 EP96901008A EP96901008A EP0801788A1 EP 0801788 A1 EP0801788 A1 EP 0801788A1 EP 96901008 A EP96901008 A EP 96901008A EP 96901008 A EP96901008 A EP 96901008A EP 0801788 A1 EP0801788 A1 EP 0801788A1
Authority
EP
European Patent Office
Prior art keywords
frame
delays
delay
estimate
long
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP96901008A
Other languages
English (en)
French (fr)
Other versions
EP0801788B1 (de
Inventor
William Navarro
Michel Mauc
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nortel Networks France SAS
Original Assignee
Matra Communication SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matra Communication SA filed Critical Matra Communication SA
Publication of EP0801788A1 publication Critical patent/EP0801788A1/de
Application granted granted Critical
Publication of EP0801788B1 publication Critical patent/EP0801788B1/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals

Definitions

  • the present invention relates to speech coding using synthesis analysis.
  • a linear prediction of the speech signal is carried out in order to obtain the coefficients of a short-term synthesis filter modeling the transfer function of the vocal tract. These coefficients are transmitted to the decoder, as well as parameters characterizing an excitation to be applied to the short-term synthesis filter.
  • further research is carried out on the longer-term correlations of the speech signal in order to characterize a long-term synthesis filter accounting for the pitch of the speech.
  • the excitation indeed has a predictable component which can be represented by the past excitation, delayed by TP samples of the speech signal and affected by a gain g p .
  • the remaining, unpredictable part of the excitation is called stochastic excitation.
  • stochastic excitation consists of a vector searched for in a predetermined dictionary.
  • MPLPC Multi-Pulse Linear Prediction Coding
  • the stochastic excitation comprises a certain number of pulses whose positions are sought by the coder.
  • CELP coders are preferred for low transmission rates, but they are more complex to implement than MPLPC coders.
  • an open loop analysis a closed loop analysis or a combination of the two is used.
  • Open-loop analysis is not very demanding in terms of volume of calculations, but its precision is limited.
  • closed loop analysis requires a lot of calculations, but it is more reliable because it directly contributes to minimizing the perceptually weighted difference between the speech signal and the synthetic signal.
  • an open loop analysis is first performed to limit the interval in which the closed loop analyzer will look for the prediction delay. This search interval must nevertheless remain relatively wide since it must be taken into account that the delay can vary rapidly.
  • the invention aims in particular to find a compromise between the quality of the modeling of the long-term part of the excitation and the complexity of finding the corresponding delay in a speech coder.
  • the invention thus proposes a coding method with analysis by synthesis of a speech signal digitized in successive frames divided into nst sub-frames, comprising the following steps: analysis by linear prediction of the speech signal to determine parameters of a short-term synthesis filter, open-loop analysis of the speech signal to detect voiced frames on the signal and to determine, for each voiced frame, a degree of signal voicing and a long prediction delay search interval term; closed loop predictive analysis of the speech signal to select, for at least some of the subframes of the voiced frames, a long-term prediction delay contained in the search interval and constituting a parameter of a long syntnesis filter term; and determining a stochastic excitation for each subframe, so as to minimize a perceptually weighted difference between the speech signal and the stochastic excitation filtered by the long-term and short-term synthesis filters.
  • the open loop analysis step we determines the search interval relative to each voiced frame so that it contains a number of delays depending on the degree of voicing of
  • the number of delays which are to be tested in closed loop is adaptable to the mode of voicing of the frame.
  • the width of the search interval will be narrower for the most voiced frames in order to take account of their greater harmonic stability. For these very closely spaced frames, it is possible to gain one or more bits on the differential quantification of the delay in the search interval, and to reassign this or these won bits to perceptually important parameters such as long-term prediction gain, which improves the quality of speech reproduction.
  • FIG. 1 is a block diagram of a radio station incorporating a speech encoder implementing the invention
  • FIG. 1 is a block diagram of a radio station able to receive a signal produced by that of Figure 1;
  • FIGS. 3 to 6 are flowcharts illustrating an open loop LTP analysis process applied in the speech coder of Figure 1;
  • FIG. 7 is a flowchart illustrating a process for determining the impulse response of the weighted synthesis filter applied in the speech coder of Figure 1;
  • FIGS. 8 to 11 are flowcharts illustrating a process for finding the stochastic excitation applied in the speech coder of Figure 1.
  • a speech coder implementing the invention is applicable in various types of transmission systems and / or speech storage using a digital compression technique.
  • the speech coder 16 is part of a mobile radio station.
  • the speech signal S is a digital signal sampled at a frequency typically equal to 8 kHz.
  • the signal S comes from an analog-digital converter 18 receiving the amplified and filtered output signal from a microphone 20.
  • the converter 18 puts the speech signal S in the form of successive frames themselves subdivided into nsc sub-frames of I st samples.
  • the speech signal S can also be subjected to conventional shaping treatments such as Hamming filtering.
  • the speech coder 16 delivers a binary sequence with a significantly lower bit rate than that of the speech signal S, and addresses this sequence to a channel coder 22 whose function is to introduce redundancy bits into the signal in order to allow detection and / or a correction of any transmission errors.
  • the output signal from the channel encoder 22 is then modulated on a carrier frequency by the modulator 24, and the modulated signal is transmitted on the air interface.
  • the speech coder 16 is a synthesis analysis coder.
  • the coder 16 determines on the one hand parameters characterizing a short-term synthesis filter modeling the speaker's vocal tract, and on the other hand an excitation sequence which, applied to the short-term synthesis filter, provides a synthetic signal constituting an estimate of the speech signal S according to a perceptual weighting criterion.
  • the short-term synthesis filter has a transfer function of the form 1 / A (z), with:
  • the coefficients a i are determined by a module 26 of analysis by short-term linear prediction of the speech signal S.
  • the a. are the linear prediction coefficients of the speech signal S.
  • the order q of the linear prediction is typically of the order of 10.
  • the methods applicable by the module 26 for short-term linear prediction are well known in the field of speech coding.
  • Module 26, for example, implements the Durbm-Levinson algorithm (see J. Makhoul: "Linear Prediction: A tutorial review", Proc. IEEE, Vol. 63, No. 4, April 1975, p.
  • the coefficients a i obtained are supplied to a module 28 which converts them into spectral line parameters
  • LSP The representation of the prediction coefficients a. by parameters LSP is frequently used in synthesis analysis speech coders.
  • the LSP parameters can be obtained by the conversion module 28 by the classical method of Chebyshev polynomials (see P. Kabal and RP Ramachandran: "The computation of line spectral frequencies using Chebysnev polymnials", IEEE Trans. ASSP, Vol. 34, No. 6, 1986, pages 1419-1426
  • These are quantization values of the LSF parameters, obtained by a quantization module 30, which are transmitted to the decoder so that the latter finds the coefficients a i of the filter of short-term synthesis.
  • the coefficients a i can be found simply, given that:
  • LST t (nst-1 ) LSP t for sub-frames 0, 1, 2, ..., nst-1 of frame t.
  • the coefficients a i of the filter 1 / A (z) are then determined, sub-frame by sub-frame from the interpolated LSP parameters.
  • the non-quantified LSP parameters are supplied by the module 28 to a module 32 for calculating the coefficients of a perceptual weighting filter 34.
  • the coefficients of the perceptual weighting filter are calculated by the module 32 for each subframe after interpolation of the LSP parameters received from the module 28.
  • the perceptual weighting filter 34 receives the speech signal S and delivers a perceptually weighted signal SW which is analyzed by modules 36, 38, 40 to determine the excitation sequence.
  • the excitation sequence of the short-term filter consists of an excitation predictable by a long-term synthesis filter modeling the pitch of the speech, and an excitation unpredictable stochastic, or innovation sequence.
  • the module 36 performs an open loop long-term prediction (LTP), that is to say that it does not contribute directly to the minimization of the weighted error.
  • LTP long-term prediction
  • the weighting filter 34 intervenes upstream of the open-loop analysis module, but it could be otherwise: the module 36 could operate directly on the speech signal S or even on the signal S cleared of its short-term correlations by a transfer function filter A (z).
  • the modules 38 and 40 operate in closed loop, that is to say that they contribute directly to the minimization of the perceptually weighted error.
  • the long-term prediction delay is determined in two stages.
  • the open loop LTP analysis module 36 detects the voiced frames of the speech signal and determines, for each voiced frame, a degree of voicing MV and a search interval for the long-term prediction delay.
  • the search interval is defined by a central value represented by its quantization index ZP and by a width in the domain of the quantization indexes, depending on the degree of voicing MV.
  • the module 30 operates the quantization of the LSP parameters which have previously been determined for this frame.
  • This quantization is for example vectorial, that is to say it consists in selecting, from one or more predetermined quantization tables, a set of quantized parameters LSP Q which has a minimum distance from the set of LSP parameters provided. by the module 28.
  • the quantization tables differ according to the degree of voicing MV provided to the quantization module 30 by the open loop analyzer 36.
  • a set of quantization tables for a degree ⁇ e voicing MV is determined, during preliminary tests, so as to be statistically representative of frames having this degree MV. These sets are stored both in the coders and in the decoders implementing the invention.
  • the module 30 delivers the ieu of quantized parameters LSP Q as well as its index Q in the applicable quantification tables.
  • the speech coder 15 further comprises a module
  • This compound filter has the transfer function W (z) / A (z).
  • the module 42 takes for the weighting filter perceptual W (z) that corresponding to the LSP parameters interpolated but not quantified, that is to say the one whose coefficients were calculated by module 32, and for the synthesis filter 1 / A (z) that corresponding to the parameters Quantified and interpolated LSP, that is to say the one that will be effectively reconstructed by the decoder.
  • the TP delay index is ZP + DP.
  • closed-loop LTP analysis consists in determining, in the search interval for long-term prediction delays T, the delay TP which maximizes, for each sub-frame of a voiced frame, the normalized correlation :
  • x (i) denotes the weighted speech signal SW of the subframe from which the memory of the weighted synthesis filter has been subtracted (i.e. the response to a zero signal, due to its initial states, of the filter whose the impulse response h has been calculated by module 42), and y T (i) denotes the convolution product:
  • u (jT) designating the predictable component of the delayed excitation sequence of T samples, estimated by the well-known technique of the adaptive codebook. For delays T less than the length of a subframe, missing values of u (jT) can be extrapolated from previous values. Fractional delays are taken into account by oversampling the signal u (jT) in the adaptive repertoire. An oversampling of a factor m is obtained by means of polyphase interpolating filters.
  • the gain g P of long-term prediction could be determined by the module 38 for each sub-frame, by applying the known formula:
  • the gain g P is calculated by the stochastic analysis module 40.
  • the stochastic excitation determined for each subframe by the module 40 is of the multi-pulse type.
  • the positions and gains calculated by the module 40 of stochastic analysis are quantified by a module 44.
  • a bit scheduling module 46 receives the various parameters which will be useful to the decoder, and constitutes the binary sequence transmitted to the channel coder 22. These parameters are:
  • a module 48 is thus provided in the coder which receives the various parameters and which adds to some of them redundancy bits making it possible to detect and / or correct any transmission errors. For example, the degree of voicing MV coded on two bits being a critical parameter, it is desired that it reaches the decoder with as few errors as possible. For this reason, redundancy bits are added to this parameter by the module 48. One can for example add a parity bit to the two bits coding MV and repeat once the three bits thus obtained. This example of redundancy makes it possible to detect all the single or double errors and to correct all the simple errors and 75% of the double errors.
  • the allocation of the bit rate per frame of 20 ⁇ s is for example that indicated in table I.
  • the channel coder 22 is that used in the pan-European system of radiocommunication with mobiles (GSM).
  • GSM pan-European system of radiocommunication with mobiles
  • This channel coder described in detail in Recommendation GSM 05.03, was developed for a 13 kbit / s speech coder of RPE-LTP type which also produces 260 bits per 20 ms frame. The sensitivity of each of the 260 bits was determined from listening tests.
  • the bits from the source encoder have been grouped into three categories. The first of these categories IA groups 50 bits which are coded convolutionally on the basis of a generator polynomial giving a redundancy of one half with a constraint length equal to 5. Three bits parity values are calculated and added to the 50 bits of category IA before convolutional coding.
  • the second category (IB) has 132 bits which are protected at a rate of a half by the same polynomial as the previous category.
  • the third category (II) contains 78 unprotected bits. After application of the convolutional code, the bits (456 per frame) are subjected to interleaving.
  • the scheduling module 46 of the new source coder implementing the invention distributes the bits in the three categories according to the subjective importance of these bits.
  • a mobile radio station capable of receiving the speech signal processed by the source encoder 16 is shown diagrammatically in FIG. 2.
  • the received radio signal is first processed by a demodulator 50 then by a channel decoder 52 which performs the dual operations of those of the modulator 24 and of the channel coder 22.
  • the channel decoder 52 supplies the speech decoder 54 with a binary sequence which, in the absence of transmission errors or when the possible errors have been corrected by the channel decoder 52, corresponds to the binary sequence delivered by the scheduling module 46 at the level of the coder 16.
  • the decoder 54 comprises a module 56 which receives this binary sequence and which identifies the parameters relating to the different frames and sub-frames.
  • the module 56 also performs some checks on the parameters received. In particular, the module 56 examines the redundancy bits introduced by the module 48 of the coder, to detect and / or correct the errors affecting the parameters associated with these redundancy bits.
  • a pulse generator 62 receives the positions p (n) of the np pulses of 1 stochastic excitation.
  • the generator 62 delivers pulses of unit amplitude which are each multiplied by 64 by the associated gain g (n).
  • the output of amplifier 64 is addressed to the long-term synthesis filter 66.
  • This filter 66 has an adaptive directory structure.
  • the output samples u of the filter 65 are stored in the adaptive directory 68 so as to be available for subsequent subframes.
  • the delay TP relative to a subframe, calculated from the quantization indices ZP and DP, is supplied to the adaptive repertoire 68 to produce the signal u suitably delayed.
  • the amplifier 70 multiplies the signal thus delayed by the gain g P of long-term prediction.
  • the long-term filter 66 finally includes an adder 72 which adds the outputs of amplifiers 64 and 70 to provide the excitation sequence u.
  • the excitation sequence is addressed to the short-term synthesis filter 60, and the resulting signal can also, in known manner, be subjected to a post-filter 74 whose coefficients depend on the synthesis parameters received, to form the signal of synthetic speech S '.
  • the output signal S 'of the decoder 54 is then converted into analog by the converter 76 before being amplified to control a loudspeaker 78.
  • the module 36 also determines, for each sub-frame st, the entire delay K st which maximizes the open-loop estimation P st (k) of the long-term prediction gain on the sub-frame st, excluding delays k for which the autocorrelation C st (k) is negative or smaller than a small fraction ⁇ of the energy RO st of the subframe.
  • the estimate P st (k) expressed in decibels is written
  • step 94 the degree of voicing MV of the current frame is taken equal to 0 in step 94, which in this case ends the operations performed by the module 36 on this frame. If on the contrary the threshold S0 is exceeded in step 92, the current frame is detected as voiced and the degree MV will be equal to 1, 2 or 3. The module 36 then calculates, for each subframe st, a list I st containing candidate delays to constitute the ZP center of the search interval for long-term prediction delays.
  • the module 36 determines the basic delay rbf in full resolution for the rest of the processing. This basic delay could be taken equal to the integer K st obtained in step 90. The fact of finding the basic delay in fractional resolution around K st however makes it possible to gain in precision.
  • Step 100 thus consists in seeking, around the integer delay K st obtained in step 90, the fractional delay which maximizes the expression C st 2 / Gst .
  • This search can be carried out at the maximum resolution of the fractional delays (1/6 in the example described here) even if the entire delay K st is not in the domain where this maximum resolution applies.
  • the autocorrelations C st (T) and the delayed energies G st (T) are obtained by interpolation from the values stored in step 90 for the whole delays.
  • the Dase delay relative to a sub-frame could also be determined in fractional resolution from step 90 and taken into account in the first estimate of the overall prediction gain on the frame.
  • step 102 the address j in the list I st and the index m of the submultiple are initialized to 0 and 1, respectively.
  • a comparison 104 is made between the submultiple rbf / m and the minimum delay rmm. The sub-multiple rbf / m is examine it if it is greater than rmm.
  • P st (r i ) 20. log 1 0 [RO st / [RO st - C st 2 (r i ) / G st (r i )]] with, for fractional delays an interpolation of the values C st and G st calculated in step 90 for entire delays. If P st (r i ) ⁇ SE st , the delay r i is not taken into account, and we go directly to step 110 of incrementing the index m before carrying out the comparison 104 again for the next submultiple.
  • step 112 is executed before incrementing the index m in step 110.
  • step 112 we stores the index i at the address j in the list I st , we give the value m to the integer m0 intended to be equal to the index of the smallest submultiple retained, then we increment by one unit l 'address j.
  • the examination of the sub-multiples of the basic delay is finished when the comparison 104 shows rbf / m ⁇ rmm.
  • We then examine the multiple delays of the smallest rbf / m0 of the submultiples previously selected according to the process illustrated in FIG. 5. This examination begins with a mitialization 114 of the index n of the multiple: n 2.
  • a comparison 116 is made between the multiple n.rbf / m0 and the maximum delay rmax. If n.rbf / m0> rmax, test 118 is carried out to determine whether the index m0 of the smallest sub-multiple is an integer multiple of n.
  • step 120 the delay n.rbf / m0 has already been examined when examining the sub-multiples of rbf, and we go directly to step 120 of incrementing the index n before carrying out again comparison 116 for the next multiple. If test 118 shows that m0 is not an integer multiple of n, the multiple n.rbf / m0 is to be examined. We then take for the integer i the value of the index of the quantized delay r i the most n.rbf / m0 (step 122), then we compare, at 124, the estimated value of the prediction gain P st (r i ) to the selection threshold SE st .
  • step 120 If P st (r i ) ⁇ SE st , the delay r i is not taken into account, and we go directly to step 120 of incrementing the index n. If test 124 shows that Pst (r i ) ⁇ SE s the delay r i is retained and step 126 is executed before incrementing the index n in step 120. In step 126, l is stored 'index i at address j in the list I st , then we increment the unit j by one.
  • the list I st contains j candidate delay index. If we wish to limit the maximum length of the list I st to jmax for the following steps, we can take the length j st of this list equal to mm (j, jmax) (step 128) then, at step 130, order the list I st in the order of gains C st 2 (r Is t (j) ) / G st 2 (r Ist (j) ) decreasing for 0 ⁇ j ⁇ j st so as to keep only the j st delays providing the greatest gain values.
  • the value of jmax is chosen according to the compromise sought between the efficiency of the search for LTP delays and the complexity of this search. Typical values of jmax range from 3 to 5.
  • the analysis module 36 calculates a quantity Ymax determining a second estimate in open loop of the prediction gain at long term over the entire frame, as well as ZP, ZP0 and ZP1 indexes in a phase 132, the progress of which is detailed in FIG. 6.
  • This phase 132 consists in testing search intervals of length NI to determine which one maximizes a second estimate of the overall prediction gain on the frame The intervals tested are those whose centers are the candidate delays contained in the list I st calculated during phase 101.
  • Phase 132 begins with a step 136 where the address j in the list I st is initialized to 0.
  • step 138 it is checked whether the index I st (j) has already been encountered by testing a previous interval centered on I st ⁇ (j ') with st' ⁇ st and 0 ⁇ j ' ⁇ j st' , in order to avoid testing the same interval twice. If test 138 reveals that I st (j) already appeared in a list I st , with st ' ⁇ st, we directly increment the address j in step 140, then we compare it to the length j st of the list l st . If the comparison 142 shows that j ⁇ j st , we return to step 138 for the new value of the address j.
  • Ymax represents the value to be maximized.
  • This Ymax value is for example mitialized to
  • step 152 the index ZP is taken equal to I st (j) and the indexes ZP0 and ZP1 are respectively taken equal to the smallest and to the largest of the i sc indexes. determined in step 148.
  • the index st is incremented by one unit (step 154) then compared, in step 156, to the number nst of subframes per frame. If st ⁇ nst, we return to step 98 to carry out the operations relating to the next sub-frame.
  • the index ZP designates the center of the search interval which will be supplied to the module 38 of LTP analysis in closed loop, and ZP0 and ZP1 are indices whose deviation is representative the dispersion of the optimal delays per subframe in the center interval on ZP.
  • Gp 20. log 10 (RO / RO-Ymax).
  • Two other thresholds S1 and S2 are used. If Gp ⁇ S1, the degree of voicing MV is taken equal to 1 for the current frame.
  • Gp> S2 the dispersion of the optimal delays for the different sub-frames of the current frame is examined. If ZP1-ZP ⁇ N3 / 2 and ZP-ZP0 ⁇ N3 / 2, an interval of length N3 centered on ZP is sufficient to take into account all the optimal delays and the degree of voicing is taken equal to 3 (if Gp> S2 ' Otherwise, if ZP1-ZP ⁇ N3 / 2 or ZP-ZPO> N3 / 2, the degree of voicing is taken equal to 2 (if Gp> S2).
  • ZP + DP index of TP delay finally determined can therefore in some cases be smaller than 0 or greater than 255.
  • the first optimizations carried out in step 90 relative to the different sub-frames are replaced by a single optimization relating to the entire frame.
  • the parameters C st (k) and G st (k) calculated for each sub-frame st are also calculated:
  • nz basic delays K 1 ', ..., K nz ' in full resolution.
  • the voiced-unvoiced decision (step 92) is taken on the basis of that of the basic delays K i 'which provides the greatest value for the first open-loop estimate of the long-term prediction gain.
  • the basic delays in fractional resolution are determined by the same process as in step 100, but only allowing quantized delay values. Examination 101 of the submultiples and multiples is not carried out.
  • the nz oase delays previously determined are taken as candidate delays. This second variant makes it possible to dispense with the systematic examination of the submultiples and of the multiples which are generally taken into account by virtue of the subdivision of the domain of possible delays.
  • phase 132 is modified in that, in the optimization steps 148, the index i st is determined on the one hand , which maximizes C st ' 2 (r i ) / G st' (r i ) for I st (j) -N1 / 2 ⁇ i ⁇ I st (J) + N1 / 2 and 0 ⁇ i ⁇ N, and on the other hand, during the same maximization loop, the index k st , which maximizes this same quantity over a reduced interval I st (j) -N3 / 2 ⁇ i ⁇ I st (j) + N3 / 2 and 0 ⁇ i ⁇ N.
  • Step 152 is also modified, the indexes ZP0 and ZP1 are no longer stored, but a quantity Ymax 'defined in the same way as Ymax but with reference to the reduced length interval:
  • Gp' 20. log 10 [R0 / (R0-Ymax ')].
  • the sub-frames for which the prediction gain is negative or negligible can be identified by consulting the nst pointers. If necessary, the module 38 is deactivated for the corresponding sub-frames. This does not affect the quality of the LTP analysis since the prediction gain corresponding to these subframes will in any case be almost zero.
  • Another aspect of the invention relates to the module 42 for calculating the impulse response of the weighted synthesis filter.
  • the closed-loop LTP analysis module 38 needs this impulse response on the duration of a subframe to calculate the convoy ions y T (i) according to formula (1).
  • the module 40 stocnasti ⁇ ue analysis also has Desoin to calculate convolutions as we will see later.
  • the operations performed by the module 42 are for example in accordance with the flowchart of FIG. 7.
  • step 160 also calculates the truncated energies of the impulse response:
  • the components h (i) of the impulse response and the truncated energies Eh (i) can be obtained by filtering a unitary pulse by means of a transfer function filter W (z) / A (z) of zero initial states , or by recurrence:
  • the coefficients a k are those used in the perceptual weighting filter, i.e. the linear prediction coefficients interpolated but not quantified, while in expression (3), the coefficients a k are those applied to the synthesis filter, i.e. the quantized and interpolated linear prediction coefficients.
  • the module 42 determines the shortest length La such that the energy Eh (L ⁇ -1) of the truncated impulse response to La samples is at least equal to a proportion ⁇ of its total energy Eh (pst-1) estimated on pst samples.
  • a typical value of ⁇ is 98%.
  • the name La is imtialized at pst in step 162 and decremented by one unit in 166 as Eh (L ⁇ -2)> ⁇ .
  • Eh (pst-1) (test 164).
  • the length La sought is obtained when test 164 shows that Eh (L ⁇ -2) ⁇ .Eh (pst-1).
  • a correcting term ⁇ (MV) is added to the value of La which has been obtained (step 168).
  • This corrective term is preferably an increasing function of the degree of voicing.
  • the truncation length Lh of the impulse response is taken equal to La if L ⁇ nst and a nst otherwise.
  • the remaining samples of the impulse response (h (i) 0 with i ⁇ Lh, can be canceled.
  • a third aspect of the invention relates to the stocnastic analysis module 40 used to model the unpredictable part of the excitation.
  • the stochastic excitation considered here is of the multi-pulse type.
  • the stochastic excitation relating to a subframe is represented by np pulses of positions p (n) and of amplitudes, or gains, g (n) (l ⁇ n ⁇ np).
  • the gain g p of long-term prediction can also be calculated during the same process.
  • the excitation sequence relating to a sub-frame comprises ne contributions associated respectively with ne gains.
  • the contributions are sample Ist vectors which, weighted by the associated gains and summed, correspond to the excitation sequence of the short-term synthesis filter.
  • np vectors comprising only 0 except an impulse of amplitude 1.
  • the line vectors F p (n) (0 ⁇ n ⁇ nc) are weighted contributions having as components i (0 ⁇ i ⁇ lst) the convection products between the contribution n to the excitation sequence and the impulse response h the weighted synthesis filter;
  • the vectors F p (n) are simply constituted by the vector of the impulse response h shifted by p (n) samples. Truncating the impulse response as described above therefore makes it possible to significantly reduce the number of operations useful for calculating scalar products involving these vectors F p (n) .
  • the multi-pulse approach generally applies a sub-optimal procedure consisting of successively calculating the gains and / or the pulse positions for each contribution. For each contribution n (0 ⁇ n ⁇ nc), we first determine the position p (n) which maximizes the normalized correlation (F p .
  • the gains g nc-1 (i) are the selected gains and the minimized quadratic error E is equal to the energy of the target vector e nc-1 .
  • L n R n. K n, R n , K n and L n -1 are each constructed by simply adding a line to the corresponding matrices of the previous iteration:
  • the stochastic analysis relating to a subframe of a voiced frame can therefore take place as indicated in FIGS. 8 to 11.
  • the contribution index n is initialized to 0 in step 180 and the vector F p (0) is taken equal to the long-term contribution Y TP provided by the module 38. If n> 0, the iteration n begins with the determination 152 of the position p (n) of the pulse n which maximizes the quantity:
  • e (e (0), ..., e (lst-1)) is a target vector calculated during the previous iteration.
  • Different constraints can be brought to the domain of maximization of the quantity above included in the interval [0,1st [.
  • the maximization of (F p .e T ) 2 / (F p .F p T ) is carried out on the set of possible positions p in the subframe.
  • the maximization is carried out in step 182 on the set of possible positions excluding the segments in which the positions p (1), ..., p (n have been found respectively) -1) pulses during previous iterations.
  • the module 40 calculates 184 of the line n of the matrices L, R and K involved in the decomposition of the matrix E, which makes it possible to complete the matrices L n , R and K n defined above.
  • the decomposition of the matrix B makes it possible to write:
  • the column index j is first mitiiized to 0, in step 186.
  • the variable tmp is first mitialized at the value of component B (n, j), that is:
  • step 188 the integer k is further mitialized to 0.
  • a comparison 190 is then made between the integers k and j. If k ⁇ j, we add the term L (n, k). R (j, k) to the variable tmp, then we increment the whole k by one
  • step 192 before re-performing the comparison 190.
  • a comparison 194 is made between the integers j and n. If j ⁇ n, the component R (n, j) is taken equal to tmp and the component L (n, j) to tmp.K (j) in step 196, then the column index j is increased by d 'a unit before returning to step 188 to calculate the following components.
  • K (n) is taken equal to 1 / tmp if tmp ⁇ 0 (step 198) and to 0 otherwise.
  • the calculation 184 requires at most one division 198, to obtain K (n).
  • any singularity of the matrix B n does not cause instabilities since we avoid divisions by 0.
  • Linv (j ') L -1 (n, j').
  • the inversion 200 then begins with an initialization 202 of the column index j 'at n-1.
  • the term Linv (j ') is initialized to -L (n, j') and the integer k 'to j' + 1.
  • a comparison 206 is then carried out between the integers k ′ and n. If k ' ⁇ n, we subtract the term L (k', j '). Linv (k') to Linv (j '), then we increment the whole k' by one unit (step 208 ⁇ before re-executing comparison 206.
  • the inversion 200 is followed by calculating 214 the reoptimized gains and the target vector E for the following iteration.
  • the computation of the reoptimized gains is also very simplified by the decomposition retained for the matrix E.
  • One can indeed compute the vector g n (g n (0), ..., g n (n)) solution of g n .
  • B n b n according to:
  • b (n) serves as the initialization value for the variable tmq.
  • the index i is also initialized to 0.
  • the comparison 218 is then carried out between the integers i and n. If i ⁇ n, we add the term b (i). Linv (i) to the variable tmq and we increment i by a unit (step 220) before returning to the comparison 218.
  • Segmental pulse search significantly decreases the number of pulse positions to be evaluated during steps 182 of the search for stochastic excitation. It also allows efficient quantification of the positions found.
  • ns> np also has the advantage that good robustness to transmission errors can be obtained with regard to the positions of the pulses, by virtue of a separate quantification of the sequence numbers of the occupied segments and of the relative positions pulses in each busy segment.
  • sequence numbers of the occupied segments are identified by a binary word of ns ⁇ lC bits each worth 1 for the occupied segments and 0 for the segments in which the stochastic excitation has no pulse.
  • the possible binary words are stored in a quantization table in which the read addresses are the received quantization indexes.
  • the order in this table determined once and for all, can be optimized so that a transmission error affecting a bit of the index (the most frequent error case, especially when an interleaving is implemented in the channel coder 22) has, on average, minimal consequences according to a neighborhood criterion.
  • the neighborhood criterion is for example that a word of ns bits can only be replaced by "neighboring" words, distant from a Hamming distance at most equal to a threshold np-2 ⁇ , so as to keep all the pulses except ⁇ of them at valid positions in the event of an error in transmission of the index relating to a single bit.
  • Other criteria would be usable in substitution or in addition, for example that two words are considered as neighbors if the replacement of one by the other does not modify the order of allocation of the gains associated with the pulses.
  • the order in the word quantification table can be determined from arithmetic considerations or, if this is insufficient, by simulating the error scenarios on a computer (exhaustively or by statistical sampling of the type Monte-Carlo according to the number of possible error cases).
  • the scheduling module 46 can thus put in the minimum protection category, or in the unprotected category, a certain number nx of the bits of the index which, if affected by a transmission error, give rise to a wrong word but checking the neighborhood criterion with a probability deemed satisfactory, and putting the other bits of the index in a more protected category.
  • This procedure calls for a different ordering of the words in the quantification table.
  • This scheduling can also be optimized by means of simulations if it is desired to maximize the number nx of the bits of the index assigned to the least protected category.
  • One possibility is to start by constituting a list of words of ns bits by counting in Gray code from 0 to 2 ns -1, and to obtain the ordered quantification table by deleting from this list the words having no weight of Hamming of np.
  • the table thus obtained is such that two consecutive words have a Hamming distance of np-2. If the indexes in this table have a binary representation in Gray code, any error on the least significant bit causes the index to vary by ⁇ 1 and therefore causes the replacement of the actual occupancy word by a neighboring word in the sense of the np-2 threshold on the Hamming distance, and an error on the i-th least significant bit also varies the index by ⁇ 1 with a prooability of around 2 1-i .
  • nx By placing the nx least significant bits of the index in Gray code in an unprotected category, a possible transmission error affecting one of these bits leads to the replacement of the busy word by a neighboring word with a probability at least equal. to (1 + 1/2 + ... +1/2 nx-1 ) / nx. This minimum probability decreases from 1 to (2 / nb) (1-1 / 2 nb ) for nx increasing from 1 to nb.
  • the errors affecting the nb-nx most significant bits of the index will most often be corrected thanks to the protection applied to them by the channel coder.
  • the value of nx is in this case chosen according to a compromise between robustness to errors (small values) and a reduced number of protected categories (large values).
  • the possible binary words to represent the occupation of the segments are arranged in ascending order in a search table.
  • An indexing table associates with each address the serial number, in the quantification table stored at the decoder, of the binary word having this address in the search table.
  • the content of the search table and the mdexing table is given in table III i in decimal values).
  • the quantification of the occupancy word of the segments deduced from the np positions provided by the analysis module stochastic 40 is carried out in two stages by the quantization module 44.
  • a dichotomous search is first carried out in the search table to determine the address in this table of the word to be quantified.
  • the quantization index is then obtained at the address determined in the indexing table and then supplied to the bit scheduling module 46.
  • the module 44 also performs the quantification of the gains calculated by the module 40.
  • the quantization bits of Gs are placed in a category protected by the channel coder 22, as are the most significant bits of the quantization indexes of the relative gains.
  • the relative gain quantization bits are ordered so as to allow their assignment to the associated impulses belonging to the segments located by the occupancy word.
  • the segmental search according to the invention also makes it possible to effectively protect the relative positions of the pulses associated with the greatest gain values.
  • the decoder 54 To reconstruct the impulse contributions of the excitation, the decoder 54 first locates the segments by means of the busy word received; he then attributes the associated winnings; then it assigns the positions relative to the impulses on the basis of the order of importance of the gains.
  • the 13 kbit / s speech coder requires around 15 million instructions per second (Mips; fixed-point. This is therefore typically done by programming a processor digital signal (DSP, commercial, as well as the decoder which requires only about 5 Mips.
  • DSP processor digital signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Investigating Or Analysing Materials By The Use Of Chemical Reactions (AREA)
EP96901008A 1995-01-06 1996-01-03 Verfahren zur sprachkodierung mittels analyse durch synthese Expired - Lifetime EP0801788B1 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
FR9500134A FR2729246A1 (fr) 1995-01-06 1995-01-06 Procede de codage de parole a analyse par synthese
FR9500134 1995-01-06
PCT/FR1996/000004 WO1996021218A1 (fr) 1995-01-06 1996-01-03 Procede de codage de parole a analyse par synthese

Publications (2)

Publication Number Publication Date
EP0801788A1 true EP0801788A1 (de) 1997-10-22
EP0801788B1 EP0801788B1 (de) 1999-06-09

Family

ID=9474931

Family Applications (1)

Application Number Title Priority Date Filing Date
EP96901008A Expired - Lifetime EP0801788B1 (de) 1995-01-06 1996-01-03 Verfahren zur sprachkodierung mittels analyse durch synthese

Country Status (9)

Country Link
US (1) US5974377A (de)
EP (1) EP0801788B1 (de)
CN (1) CN1145143C (de)
AT (1) ATE181170T1 (de)
AU (1) AU704229B2 (de)
CA (1) CA2209384C (de)
DE (1) DE69602822T2 (de)
FR (1) FR2729246A1 (de)
WO (1) WO1996021218A1 (de)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8329884B2 (en) 2004-12-17 2012-12-11 Roche Molecular Systems, Inc. Reagents and methods for detecting Neisseria gonorrhoeae

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998006091A1 (fr) * 1996-08-02 1998-02-12 Matsushita Electric Industrial Co., Ltd. Codec vocal, support sur lequel est enregistre un programme codec vocal, et appareil mobile de telecommunications
JP3166697B2 (ja) * 1998-01-14 2001-05-14 日本電気株式会社 音声符号化・復号装置及びシステム
US6192335B1 (en) * 1998-09-01 2001-02-20 Telefonaktieboiaget Lm Ericsson (Publ) Adaptive combining of multi-mode coding for voiced speech and noise-like signals
FI116992B (fi) * 1999-07-05 2006-04-28 Nokia Corp Menetelmät, järjestelmä ja laitteet audiosignaalin koodauksen ja siirron tehostamiseksi
US7272553B1 (en) * 1999-09-08 2007-09-18 8X8, Inc. Varying pulse amplitude multi-pulse analysis speech processor and method
JP3372908B2 (ja) * 1999-09-17 2003-02-04 エヌイーシーマイクロシステム株式会社 マルチパルス探索処理方法と音声符号化装置
KR100324204B1 (ko) * 1999-12-24 2002-02-16 오길록 예측분할벡터양자화 및 예측분할행렬양자화 방식에 의한선스펙트럼쌍 양자화기의 고속탐색방법
US6957240B2 (en) * 2001-08-08 2005-10-18 Octasic Inc. Method and apparatus for providing an error characterization estimate of an impulse response derived using least squares
US6965640B2 (en) * 2001-08-08 2005-11-15 Octasic Inc. Method and apparatus for generating a set of filter coefficients providing adaptive noise reduction
US6999509B2 (en) * 2001-08-08 2006-02-14 Octasic Inc. Method and apparatus for generating a set of filter coefficients for a time updated adaptive filter
US6970896B2 (en) 2001-08-08 2005-11-29 Octasic Inc. Method and apparatus for generating a set of filter coefficients
CA2365203A1 (en) * 2001-12-14 2003-06-14 Voiceage Corporation A signal modification method for efficient coding of speech signals
CN1860526B (zh) * 2003-09-29 2010-06-16 皇家飞利浦电子股份有限公司 音频信号编码
US7792670B2 (en) * 2003-12-19 2010-09-07 Motorola, Inc. Method and apparatus for speech coding
CN101320565B (zh) * 2007-06-08 2011-05-11 华为技术有限公司 感知加权滤波方法及感知加权滤波器
US9626982B2 (en) * 2011-02-15 2017-04-18 Voiceage Corporation Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec
FR2987931A1 (fr) * 2012-03-12 2013-09-13 France Telecom Modification des caracteristiques spectrales d'un filtre de prediction lineaire d'un signal audionumerique represente par ses coefficients lsf ou isf.
WO2014202784A1 (en) 2013-06-21 2014-12-24 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for improved signal fade out for switched audio coding systems during error concealment
CN107452391B (zh) * 2014-04-29 2020-08-25 华为技术有限公司 音频编码方法及相关装置

Family Cites Families (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL8302985A (nl) * 1983-08-26 1985-03-18 Philips Nv Multipulse excitatie lineair predictieve spraakcodeerder.
CA1223365A (en) * 1984-02-02 1987-06-23 Shigeru Ono Method and apparatus for speech coding
NL8500843A (nl) * 1985-03-22 1986-10-16 Koninkl Philips Electronics Nv Multipuls-excitatie lineair-predictieve spraakcoder.
US4868867A (en) * 1987-04-06 1989-09-19 Voicecraft Inc. Vector excitation speech or audio coder for transmission or storage
US4802171A (en) * 1987-06-04 1989-01-31 Motorola, Inc. Method for error correction in digitally encoded speech
US4831624A (en) * 1987-06-04 1989-05-16 Motorola, Inc. Error detection method for sub-band coding
CA1337217C (en) * 1987-08-28 1995-10-03 Daniel Kenneth Freeman Speech coding
US5359696A (en) * 1988-06-28 1994-10-25 Motorola Inc. Digital speech coder having improved sub-sample resolution long-term predictor
SE463691B (sv) * 1989-05-11 1991-01-07 Ericsson Telefon Ab L M Foerfarande att utplacera excitationspulser foer en lineaerprediktiv kodare (lpc) som arbetar enligt multipulsprincipen
US5060269A (en) * 1989-05-18 1991-10-22 General Electric Company Hybrid switched multi-pulse/stochastic speech coding technique
US5097508A (en) * 1989-08-31 1992-03-17 Codex Corporation Digital speech coder having improved long term lag parameter determination
JP3268360B2 (ja) * 1989-09-01 2002-03-25 モトローラ・インコーポレイテッド 改良されたロングターム予測器を有するデジタル音声コーダ
DE69033011T2 (de) * 1989-10-17 2001-10-04 Motorola Inc Digitaler sprachdekodierer unter verwendung einer nachfilterung mit einer reduzierten spektralverzerrung
US5073940A (en) * 1989-11-24 1991-12-17 General Electric Company Method for protecting multi-pulse coders from fading and random pattern bit errors
US5307441A (en) * 1989-11-29 1994-04-26 Comsat Corporation Wear-toll quality 4.8 kbps speech codec
US5097507A (en) * 1989-12-22 1992-03-17 General Electric Company Fading bit error protection for digital cellular multi-pulse speech coder
US5265219A (en) * 1990-06-07 1993-11-23 Motorola, Inc. Speech encoder using a soft interpolation decision for spectral parameters
JPH04264597A (ja) * 1991-02-20 1992-09-21 Fujitsu Ltd 音声符号化装置および音声復号装置
FI98104C (fi) * 1991-05-20 1997-04-10 Nokia Mobile Phones Ltd Menetelmä herätevektorin generoimiseksi ja digitaalinen puhekooderi
DK0588932T3 (da) * 1991-06-11 2002-03-11 Qualcomm Inc Vokoder med variabel hastighed
US5253269A (en) * 1991-09-05 1993-10-12 Motorola, Inc. Delta-coded lag information for use in a speech coder
DE69232166T2 (de) * 1991-09-05 2002-06-06 Motorola Inc Fehlerschutz für vielfachmodensprachkodierer
TW224191B (de) * 1992-01-28 1994-05-21 Qualcomm Inc
US5495555A (en) * 1992-06-01 1996-02-27 Hughes Aircraft Company High quality low bit rate celp-based speech codec
US5317595A (en) * 1992-06-30 1994-05-31 Nokia Mobile Phones Ltd. Rapidly adaptable channel equalizer
US5717824A (en) * 1992-08-07 1998-02-10 Pacific Communication Sciences, Inc. Adaptive speech coder having code excited linear predictor with multiple codebook searches
FI95086C (fi) * 1992-11-26 1995-12-11 Nokia Mobile Phones Ltd Menetelmä puhesignaalin tehokkaaksi koodaamiseksi
FR2702590B1 (fr) * 1993-03-12 1995-04-28 Dominique Massaloux Dispositif de codage et de décodage numériques de la parole, procédé d'exploration d'un dictionnaire pseudo-logarithmique de délais LTP, et procédé d'analyse LTP.
IT1264766B1 (it) * 1993-04-09 1996-10-04 Sip Codificatore della voce utilizzante tecniche di analisi con un'eccitazione a impulsi.
IT1270438B (it) * 1993-06-10 1997-05-05 Sip Procedimento e dispositivo per la determinazione del periodo del tono fondamentale e la classificazione del segnale vocale in codificatori numerici della voce
US5784532A (en) * 1994-02-16 1998-07-21 Qualcomm Incorporated Application specific integrated circuit (ASIC) for performing rapid speech compression in a mobile telephone system
US5751903A (en) * 1994-12-19 1998-05-12 Hughes Electronics Low rate multi-mode CELP codec that encodes line SPECTRAL frequencies utilizing an offset
FR2729245B1 (fr) * 1995-01-06 1997-04-11 Lamblin Claude Procede de codage de parole a prediction lineaire et excitation par codes algebriques
FR2734389B1 (fr) * 1995-05-17 1997-07-18 Proust Stephane Procede d'adaptation du niveau de masquage du bruit dans un codeur de parole a analyse par synthese utilisant un filtre de ponderation perceptuelle a court terme
US5732389A (en) * 1995-06-07 1998-03-24 Lucent Technologies Inc. Voiced/unvoiced classification of speech for excitation codebook selection in celp speech decoding during frame erasures
US5699485A (en) * 1995-06-07 1997-12-16 Lucent Technologies Inc. Pitch delay modification during frame erasures
US5664055A (en) * 1995-06-07 1997-09-02 Lucent Technologies Inc. CS-ACELP speech compression system with adaptive pitch prediction filter gain based on a measure of periodicity
US5790759A (en) * 1995-09-19 1998-08-04 Lucent Technologies Inc. Perceptual noise masking measure based on synthesis filter frequency response
US5710863A (en) * 1995-09-19 1998-01-20 Chen; Juin-Hwey Speech signal quantization using human auditory models in predictive coding systems
JP4005154B2 (ja) * 1995-10-26 2007-11-07 ソニー株式会社 音声復号化方法及び装置
JP3680380B2 (ja) * 1995-10-26 2005-08-10 ソニー株式会社 音声符号化方法及び装置
FR2742568B1 (fr) * 1995-12-15 1998-02-13 Catherine Quinquis Procede d'analyse par prediction lineaire d'un signal audiofrequence, et procedes de codage et de decodage d'un signal audiofrequence en comportant application
US5729694A (en) * 1996-02-06 1998-03-17 The Regents Of The University Of California Speech coding, reconstruction and recognition using acoustics and electromagnetic waves
US5708757A (en) * 1996-04-22 1998-01-13 France Telecom Method of determining parameters of a pitch synthesis filter in a speech coder, and speech coder implementing such method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO9621218A1 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8329884B2 (en) 2004-12-17 2012-12-11 Roche Molecular Systems, Inc. Reagents and methods for detecting Neisseria gonorrhoeae

Also Published As

Publication number Publication date
US5974377A (en) 1999-10-26
DE69602822D1 (de) 1999-07-15
CN1145143C (zh) 2004-04-07
AU704229B2 (en) 1999-04-15
FR2729246B1 (de) 1997-03-07
ATE181170T1 (de) 1999-06-15
DE69602822T2 (de) 1999-12-23
AU4490196A (en) 1996-07-24
WO1996021218A1 (fr) 1996-07-11
CA2209384C (en) 2001-05-29
CN1173939A (zh) 1998-02-18
FR2729246A1 (fr) 1996-07-12
EP0801788B1 (de) 1999-06-09
CA2209384A1 (en) 1996-07-11

Similar Documents

Publication Publication Date Title
EP0801790B1 (de) Verfahren zur sprachkodierung mittels analyse durch synthese
EP0801788B1 (de) Verfahren zur sprachkodierung mittels analyse durch synthese
EP0721180B1 (de) Sprachkodierung mittels Analyse durch Synthese
EP0782128B1 (de) Verfahren zur Analyse eines Audiofrequenzsignals durch lineare Prädiktion, und Anwendung auf ein Verfahren zur Kodierung und Dekodierung eines Audiofrequenzsignals
EP0749626B1 (de) Verfahren zur sprachkodierung mittels linearer prädiktion und anregung durch algebraische kodes
EP1994531B1 (de) Verbesserte celp kodierung oder dekodierung eines digitalen audiosignals
EP1692689B1 (de) Optimiertes mehrfach-codierungsverfahren
FR2734389A1 (fr) Procede d'adaptation du niveau de masquage du bruit dans un codeur de parole a analyse par synthese utilisant un filtre de ponderation perceptuelle a court terme
EP2080194B1 (de) Dämpfung von stimmüberlagerung, im besonderen zur erregungserzeugung bei einem decoder in abwesenheit von informationen
FR2690551A1 (fr) Procédé de quantification d'un filtre prédicteur pour vocodeur à très faible débit.
EP0490740A1 (de) Verfahren und Einrichtung zum Bestimmen der Sprachgrundfrequenz in Vocodern mit sehr niedriger Datenrate
EP1836699B1 (de) Verfahren und Vorrichtung zur Ausführung einer optimalizierten Audiokodierung zwischen zwei Langzeitvorhersagemodellen
EP0616315A1 (de) Vorrichtung zur digitalen Sprachkodierung und -dekodierung, Verfahren zum Durchsuchen eines pseudologarithmischen LTP-Verzögerungskodebuchs und Verfahren zur LTP-Analyse
EP1192619B1 (de) Audio-kodierung, dekodierung zur interpolation
WO2002029786A1 (fr) Procede et dispositif de codage segmental d'un signal audio
JP4007730B2 (ja) 音声符号化装置、音声符号化方法および音声符号化アルゴリズムを記録したコンピュータ読み取り可能な記録媒体
EP1192618B1 (de) Audiokodierung mit adaptiver lifterung
EP1192621B1 (de) Audiokodierung mit harmonischen komponenten
EP1194923B1 (de) Verfahren und system für audio analyse und synthese
FR2980620A1 (fr) Traitement d'amelioration de la qualite des signaux audiofrequences decodes
WO2001003119A1 (fr) Codage et decodage audio incluant des composantes non harmoniques du signal

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 19970725

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH DE GB IT LI LU NL SE

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: MATRA NORTEL COMMUNICATIONS

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

17Q First examination report despatched

Effective date: 19981029

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH DE GB IT LI LU NL SE

REF Corresponds to:

Ref document number: 181170

Country of ref document: AT

Date of ref document: 19990615

Kind code of ref document: T

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: CH

Ref legal event code: NV

Representative=s name: KELLER & PARTNER PATENTANWAELTE AG

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 19990622

REF Corresponds to:

Ref document number: 69602822

Country of ref document: DE

Date of ref document: 19990715

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20000103

Ref country code: AT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20000103

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20000131

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20000131

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20000131

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed
BERE Be: lapsed

Owner name: MATRA NORTEL COMMUNICATIONS

Effective date: 20000131

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20000801

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

NLV4 Nl: lapsed or anulled due to non-payment of the annual fee

Effective date: 20000801

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: SE

Payment date: 20001218

Year of fee payment: 6

REG Reference to a national code

Ref country code: GB

Ref legal event code: IF02

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20020104

EUG Se: european patent has lapsed

Ref document number: 96901008.1

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20040130

Year of fee payment: 9

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20041210

Year of fee payment: 10

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.

Effective date: 20050103

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20050802

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060103

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20060103