EP0801789B1 - Procede de codage de parole a analyse par synthese - Google Patents
Procede de codage de parole a analyse par synthese Download PDFInfo
- Publication number
- EP0801789B1 EP0801789B1 EP96901009A EP96901009A EP0801789B1 EP 0801789 B1 EP0801789 B1 EP 0801789B1 EP 96901009 A EP96901009 A EP 96901009A EP 96901009 A EP96901009 A EP 96901009A EP 0801789 B1 EP0801789 B1 EP 0801789B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- bits
- sub
- frame
- pulses
- excitation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 39
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 38
- 238000000034 method Methods 0.000 title claims abstract description 35
- 238000004458 analytical method Methods 0.000 title claims description 46
- 230000005284 excitation Effects 0.000 claims abstract description 59
- 239000013598 vector Substances 0.000 claims description 46
- 239000011159 matrix material Substances 0.000 claims description 35
- 230000007774 longterm Effects 0.000 claims description 33
- 238000011002 quantification Methods 0.000 claims description 30
- 230000004044 response Effects 0.000 claims description 30
- 230000005540 biological transmission Effects 0.000 claims description 17
- 230000003111 delayed effect Effects 0.000 claims description 10
- 230000003247 decreasing effect Effects 0.000 claims description 2
- 238000012804 iterative process Methods 0.000 claims description 2
- 239000002131 composite material Substances 0.000 claims 2
- 238000001914 filtration Methods 0.000 abstract description 3
- 238000012937 correction Methods 0.000 abstract description 2
- 238000001514 detection method Methods 0.000 abstract description 2
- 230000001934 delay Effects 0.000 description 27
- 238000004364 calculation method Methods 0.000 description 19
- 238000013139 quantization Methods 0.000 description 16
- 238000012360 testing method Methods 0.000 description 16
- 230000008569 process Effects 0.000 description 13
- 230000006870 function Effects 0.000 description 12
- 238000012546 transfer Methods 0.000 description 10
- 238000000354 decomposition reaction Methods 0.000 description 8
- 238000005457 optimization Methods 0.000 description 8
- 230000008901 benefit Effects 0.000 description 7
- 230000003044 adaptive effect Effects 0.000 description 6
- 101100176198 Caenorhabditis elegans nst-1 gene Proteins 0.000 description 4
- 230000000717 retained effect Effects 0.000 description 4
- 230000001755 vocal effect Effects 0.000 description 4
- 101100148606 Caenorhabditis elegans pst-1 gene Proteins 0.000 description 3
- 241000135309 Processus Species 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000003595 spectral effect Effects 0.000 description 3
- 241000897276 Termes Species 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 239000006185 dispersion Substances 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000017105 transposition Effects 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- UWNXGZKSIKQKAH-UHFFFAOYSA-N Cc1cc(CNC(CO)C(O)=O)c(OCc2cccc(c2)C#N)cc1OCc1cccc(c1C)-c1ccc2OCCOc2c1 Chemical compound Cc1cc(CNC(CO)C(O)=O)c(OCc2cccc(c2)C#N)cc1OCc1cccc(c1C)-c1ccc2OCCOc2c1 UWNXGZKSIKQKAH-UHFFFAOYSA-N 0.000 description 1
- 101000822695 Clostridium perfringens (strain 13 / Type A) Small, acid-soluble spore protein C1 Proteins 0.000 description 1
- 101000655262 Clostridium perfringens (strain 13 / Type A) Small, acid-soluble spore protein C2 Proteins 0.000 description 1
- 101000952234 Homo sapiens Sphingolipid delta(4)-desaturase DES1 Proteins 0.000 description 1
- 101000655256 Paraclostridium bifermentans Small, acid-soluble spore protein alpha Proteins 0.000 description 1
- 101000655264 Paraclostridium bifermentans Small, acid-soluble spore protein beta Proteins 0.000 description 1
- 102100037416 Sphingolipid delta(4)-desaturase DES1 Human genes 0.000 description 1
- 241001080024 Telles Species 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
Definitions
- the present invention relates to speech coding using synthetic analysis.
- a linear prediction of the speech signal is carried out to obtain the coefficients of a short-term synthesis filter modeling the transfer function of the vocal tract. These coefficients are transmitted to the decoder, as well as parameters characterizing an excitation to be applied to the short-term synthesis filter.
- further research is carried out on the longer-term correlations of the speech signal in order to characterize a long-term synthesis filter accounting for the pitch of the speech.
- the excitation indeed has a predictable component which can be represented by the past excitation, delayed by TP samples of the speech signal and affected by a gain g P.
- the remaining, unpredictable part of the excitation is called stochastic excitation.
- CELP Code Excited Linear Prediction
- MPLPC Multi-Pulse Linear Prediction Coding
- the stochastic excitation comprises a certain number of pulses whose positions are sought by the coder.
- CELP coders are preferred for low transmission rates, but they are more complex to implement than MPLPC coders.
- An object of the present invention is to provide a speech coding method in which the search for the stochastic excitation is simplified.
- This mode of seeking excitement limits the complexity of the calculations required to determine the sequence of excitement, making it possible to achieve at most one division or inversion by iteration.
- contributions can be contributions impulse.
- This mode of seeking excitement is not however not exclusively applicable to MPLPC encoders. he is applicable for example to so-called VSELP coders where the contributions to stochastic excitation are vectors chosen from a predetermined dictionary (see I. Gerson and Mr. Jasiuk: "Vector Sum Excited Linear Prediction (VSELP) Speech Coding at 8 kb / s ", Proc. Int. Conf. On Acoustics, Speech and Signal Processing, Albuquerque 1990, Vol. 1, pages 461-464).
- nc contributions may understand the contribution corresponding to the excitement delayed delayed TP samples, including the associated gain gp is recalculated during successive iterations, or several contributions of this nature if multiple LTP delays are determined.
- a speech coder implementing the invention is applicable in various types of speech transmission and / or storage systems using a digital compression technique.
- the speech coder 16 is part of a mobile radio station.
- the speech signal S is a digital signal sampled at a frequency typically equal to 8 kHz.
- the signal S comes from an analog-digital converter 18 receiving the amplified and filtered output signal from a microphone 20.
- the converter 18 puts the speech signal S in the form of successive frames themselves subdivided into nst sub-frames of 1st samples.
- the speech signal S can also be subjected to conventional shaping treatments such as Hamming filtering.
- the speech coder 16 delivers a binary sequence with a significantly lower bit rate than that of the speech signal S, and addresses this sequence to a channel coder 22 whose function is to introduce redundancy bits into the signal in order to allow detection and / or a correction of any transmission errors.
- the output signal from the channel encoder 22 is then modulated on a carrier frequency by the modulator 24, and the modulated signal is transmitted on the air interface.
- the speech coder 16 is a coder with analysis by synthesis.
- the encoder 16 determines on the one hand parameters characterizing a short-term synthesis filter modeling the vocal tract of the speaker, and on the other hand a sequence excitation which, applied to the short synthesis filter term, provides a synthetic signal constituting a estimation of the speech signal S according to a criterion of perceptual weighting.
- the short-term synthesis filter has a transfer function of the form 1 / A (z), with:
- the coefficients a i are determined by a module 26 for short-term linear prediction analysis of the speech signal S.
- the a i are the linear prediction coefficients of the speech signal S.
- the order q of the linear prediction is typically of the order of 10.
- the methods applicable by module 26 for short-term linear prediction are well known in the field of speech coding.
- Module 26, for example, implements the Durbin-Levinson algorithm (see J. Makhoul: "Linear Prediction: A tutorial review", Proc. IEEE, Vol.63, N ° 4, April 1975, p. 561-580 ).
- the coefficients a i obtained are supplied to a module 28 which converts them into spectral line parameters (LSP).
- the representation of the prediction coefficients a i by LSP parameters is frequently used in speech coders with analysis by synthesis.
- LST t (nst-1) LSP t for sub -frames 0,1,2, ..., nst-1 of the frame t.
- the coefficients a i of the filter 1 / A (z) are then determined, sub-frame by sub-frame from the interpolated LSP parameters.
- the non-quantified LSP parameters are supplied by the module 28 to a module 32 for calculating the coefficients of a perceptual weighting filter 34.
- the coefficients of the perceptual weighting filter are calculated by the module 32 for each subframe after interpolation of the LSP parameters received from the module 28.
- the perceptual weighting filter 34 receives the speech signal S and delivers a perceptually weighted SW signal which is analyzed by modules 36, 38, 40 for determine the excitation sequence.
- the excitation sequence of the short-term filter consists of an excitation predictable by a long-term synthetic filter modeling the pitch of the speech, and an excitement unpredictable stochastic, or innovation sequence.
- Module 36 performs long-term prediction (LTP) in open loop, i.e. it does not contribute directly to the minimization of the weighted error.
- LTP long-term prediction
- the weighting filter 34 intervenes in upstream of the open loop analysis module, but it could otherwise: module 36 could operate directly on the speech signal S or on the signal S cleared of its short-term correlations by a filter transfer function A (z).
- modules 38 and 40 operate in a closed loop, i.e. they directly contribute to minimizing the error perceptually weighted.
- Long-term prediction lag is determined in two steps.
- the analysis module 36 Open loop LTP detects voiced frames from the speech signal and determines, for each voiced frame, a degree of voicing MV and a delay search interval long-term prediction.
- the search interval is defined by a central value represented by its quantification index ZP and by a width in the field of quantification indexes, depending on the degree of voicing MV.
- the module 30 operates the quantization of the LSP parameters which have previously been determined for this frame.
- This quantification is for example vectorial, that is to say it consists in selecting, from one or more predetermined quantification tables, a set of quantized parameters LSPQ which has a minimum distance from the set of parameters LSP provided by the module 28.
- the quantization tables differ according to the degree of voicing MV provided to the quantization module 30 by the open loop analyzer 36.
- a set of quantization tables for a degree of voicing MV is determined, during 'preliminary tests, so as to be statistically representative of frames having this degree MV. These sets are stored both in the coders and in the decoders implementing the invention.
- the module 30 delivers the set of quantized parameters LSP Q as well as its index Q in the applicable quantification tables.
- the speech coder 16 further comprises a module 42 for calculating the impulse response of the compound filter short-term synthesis filter and perceptual weighting.
- This compound filter has the function transfer W (z) / A (z).
- the module 42 takes for the filter of perceptual weighting W (z) that corresponding to LSP parameters interpolated but not quantified, i.e. the one whose coefficients were calculated by the module 32, and for the synthesis filter 1 / A (z) the corresponding one quantized and interpolated LSP parameters, i.e. the one that will be effectively reconstructed by the decoder.
- the TP delay index is ZP + DP.
- closed-loop LTP analysis consists in determining, in the search interval for long-term prediction delays T, the delay TP which maximizes, for each sub-frame of a voiced frame, the normalized correlation : where x (i) denotes the weighted speech signal SW of the subframe from which the memory of the weighted synthesis filter has been subtracted (i.e. the response to a zero signal, due to its initial states, of the filter whose impulse response has been calculated by module 42), and y T (i) denotes the convolution product: u (jT) designating the predictable component of the delayed excitation sequence of T samples, estimated by the well-known technique of the adaptive codebook.
- the missing values of u (jT) can be extrapolated from the previous values.
- Fractional delays are taken into account by oversampling the signal u (jT) in the adaptive repertoire.
- An oversampling of a factor m is obtained by means of polyphase interpolating filters.
- the gain gp of long-term prediction could be determined by the module 38 for each sub-frame, by applying the known formula: However, in a preferred version of the invention, the gain g P is calculated by the stochastic analysis module 40.
- the stochastic excitation determined for each subframe by the module 40 is of the multi-pulse type.
- the positions and gains calculated by the analysis module 40 stochastics are quantified by a module 44.
- a module 48 is thus provided in the encoder which receives the different parameters and which adds to some of them redundancy bits to detect and / or correct any transmission errors.
- the degree of voicing MV coded on two bits being a critical parameter, we want it to reach the decoder with as few errors as possible. For this reason, redundancy bits are added to this parameter by module 48.
- the channel encoder 22 is that used in the pan-European radiocommunication system with mobiles (GSM).
- GSM pan-European radiocommunication system with mobiles
- This channel encoder described in detail in GSM Recommendation 05.03, has been finalized for an RPE-LTP type 13 kbit / s speech coder which also produces 260 bits per 20 ms frame. The sensibility of each of the 260 bits was determined from tests listening.
- the bits from the source encoder have been grouped into three categories. The first of these AI categories groups 50 bits which are convolutionally coded on the base of a generator polynomial giving a redundancy of a half with a constraint length equal to 5. Three bits parity are calculated and added to the 50 bits of the category IA before convolutional coding.
- the second category (IB) has 132 bits which are protected at a rate a half by the same polynomial as the previous category.
- the third category (II) contains 78 unprotected bits. After applying the convolutional code, the bits (456 by frame) are subject to interleaving.
- the scheduling module 46 of the new source encoder implementing the invention distributes the bits in the three categories in depending on the subjective importance of these bits.
- a mobile radio station capable of receiving the speech signal processed by the source encoder 16 is shown schematically in Figure 2.
- the radio signal received is first processed by a demodulator 50 then by a channel 52 decoder which performs dual operations of those of modulator 24 and channel encoder 22.
- the decoder channel 52 provides the speech decoder 54 with a binary sequence which, in the absence of transmission errors or when any errors have been corrected by the channel decoder 52, corresponds to the binary sequence delivered by the module scheduling 46 at the coder 16.
- the decoder 54 includes a module 56 which receives this binary sequence and which identifies the parameters relating to the different frames and subframes. Module 56 also performs some checks on the parameters received. In particular, the module 56 examines the redundancy bits introduced by the module 48 of the encoder, to detect and / or correct errors affecting the parameters associated with these redundancy bits.
- a module 58 of the decoder receives the degree of voicing MV and the index of Q for quantizing the LSP parameters.
- the module 58 finds the quantized LSP parameters in the tables corresponding to the value of MV, and, after interpolation, converts them into coefficients a i for the short-term synthesis filter 60.
- a pulse generator 62 receives the positions p (n) of the np pulses of the stochastic excitation.
- the generator 62 delivers pulses of unit amplitude which are each multiplied by 64 by the associated gain g (n).
- the output of amplifier 64 is addressed to the long-term synthesis filter 66.
- This filter 66 has an adaptive directory structure.
- the output samples u of the filter 66 are stored in the adaptive directory 68 so as to be available for the subsequent subframes.
- the delay TP relative to a sub-frame, calculated from the quantization indices ZP and DP, is supplied to the adaptive repertoire 68 to produce the signal u suitably delayed.
- the amplifier 70 multiplies the signal thus delayed by the gain g P of long-term prediction.
- the long-term filter 66 finally comprises an adder 72 which adds the outputs of amplifiers 64 and 70 to provide the excitation sequence u.
- the excitation sequence is addressed to the short-term synthesis filter 60, and the resulting signal can also, in known manner, be subjected to a post-filter 74 whose coefficients depend on the synthesis parameters received, to form the signal of synthetic speech S '.
- the output signal S 'of the decoder 54 is then converted into analog by the converter 76 before being amplified to control a loudspeaker 78.
- the module 36 also determines, for each sub-frame st, the entire delay K st which maximizes the open-loop estimation P st (k) of the long-term prediction gain on the sub-frame st, excluding the delays k for which the autocorrelation C st (k) is negative or smaller than a small fraction £ of the energy R0 st of the subframe.
- step 94 the degree of voicing MV of the current frame is taken equal to 0 in step 94, which in this case ends the operations performed by the module 36 on this frame. If on the contrary the threshold S0 is exceeded in step 92, the current frame is detected as voiced and the degree MV will be equal to 1, 2 or 3. The module 36 then calculates, for each subframe st, a list I st containing candidate delays to constitute the ZP center of the search interval for long-term prediction delays.
- the module 36 determines the basic delay rbf in full resolution for the rest of the processing. This basic delay could be taken equal to the integer K st obtained in step 90. The fact of finding the basic delay in fractional resolution around K st however makes it possible to gain in precision.
- Step 100 thus consists in finding, around the integer delay K st obtained in step 90, the fractional delay which maximizes the expression C st 2 / G st .
- This search can be carried out at the maximum resolution of the fractional delays (1/6 in the example described here) even if the entire delay K st is not in the domain where this maximum resolution applies.
- the autocorrelations C st (T) and the delayed energies G st (T) are obtained by interpolation from the values stored in step 90 for the whole delays.
- the basic delay relating to a sub-frame could also be determined in fractional resolution from step 90 and taken into account in the first estimation of the overall prediction gain on the frame.
- step 102 the address j in the list I st and the index m of the submultiple are initialized to 0 and 1, respectively.
- a comparison 104 is made between the submultiple rbf / m and the minimum delay rmin. The submultiple rbf / m is to be examined if it is greater than rmin.
- step 110 If P st (r i ) ⁇ SE st , the delay r i is not taken into account, and we go directly to step 110 of incrementing the index m before carrying out the comparison 104 again for the next submultiple. If test 108 shows that P st (r i ) ⁇ SE st , the delay r i is retained and step 112 is executed before incrementing the index m in step 110. In step 112, we stores the index i at the address j in the list I st , we give the value m to the integer m0 intended to be equal to the index of the smallest submultiple retained, then we increment by one unit l 'address j.
- the examination of the sub-multiples of the basic delay is finished when the comparison 104 shows rbf / m ⁇ rmin.
- We then examine the multiple delays of the smallest rbf / m0 of the submultiples previously selected according to the process illustrated in FIG. 5. This examination begins with an initialization 114 of the index n of the multiple: n 2.
- a comparison 116 is made between the multiple n.rbf / m0 and the maximum delay rmax. If n.rbf / m0> rmax, test 118 is carried out to determine whether the index m0 of the smallest sub-multiple is an integer multiple of n.
- step 120 the delay n.rbf / m0 has already been examined when examining the sub-multiples of rbf, and we go directly to step 120 of incrementing the index n before carrying out again comparison 116 for the next multiple. If test 118 shows that m0 is not an integer multiple of n, the multiple n.rbf / m0 is to be examined. We then take for the integer i the value of the index of the quantized delay r i closest to n.rbf / m0 (step 122), then we compare, at 124, the estimated value of the prediction gain P st ( r i ) at the selection threshold SE st .
- step 120 If P st (r i ) ⁇ SE st , the delay r i is not taken into account, and we go directly to step 120 of incrementing the index n. If test 124 shows that P st (r i ) ⁇ SE st , the delay r i is retained and step 126 is executed before incrementing the index n in step 120. In step 126, we stores the index i at address j in the list I st , then the address j is incremented by one.
- the list I st contains j candidate delay index. If we wish to limit the maximum length of the list I st to jmax for the following steps, we can take the length j st of this list equal to min (j, jmax) (step 128) and then, in step 130, order the list I st in the order of gains C st 2 (r Ist (j) ) / G st 2 (r Ist (j) ) decreasing for 0 ⁇ j ⁇ j st so as to keep only the j st delays providing the largest gain values.
- the value of jmax is chosen according to the compromise sought between the efficiency of the search for LTP delays and the complexity of this search. Typical values of jmax range from 3 to 5.
- the analysis module 36 calculates a quantity Ymax determining a second open-loop estimate of the prediction gain at long term over the entire frame, as well as indexes ZP, ZP0 and ZP1 in a phase 132, the progress of which is detailed in FIG. 6.
- This phase 132 consists in testing search intervals of length N1 to determine which one maximizes a second estimate of the overall prediction gain on the frame. The intervals tested are those whose centers are the candidate delays contained in the list I st calculated during phase 101.
- Phase 132 begins with a step 136 where the address j in the list I st is initialized to 0.
- step 138 it is checked whether the index I st (j) has already been encountered by testing a previous interval centered on I st , (j') with st ' ⁇ st and 0 ⁇ j' ⁇ j st ' , in order d '' Avoid testing the same interval twice. If test 138 reveals that I st (j) already appeared in a list I st , with st ' ⁇ st, we directly increment the address j in step 140, then we compare it to the length j st of the list I st . If the comparison 142 shows that j ⁇ j st , we return to step 138 for the new value of the address j.
- the quantity Y determining the second estimate of the overall prediction gain for the interval centered on I st (j) is calculated according to: then compared to Ymax, where Ymax represents the value to be maximized.
- This value Ymax is for example initialized to 0 at the same time as the index st in step 96. If Y ⁇ Ymax, we go directly to step 140 for incrementing the index j. If the comparison 150 shows that Y> Ymax, step 152 is executed before incrementing the address j in step 140. At this step 152, the index ZP is taken equal to I st (j) and the indices ZP0 and ZP1 are respectively taken equal to the smallest and the largest of the indices i st ' determined in step 148.
- the index st is incremented by one (step 154) then compared, in step 156, to the number nst of subframes per frame. If st ⁇ nst, we return to step 98 to perform the operations relating to the following sub-frame.
- the index ZP denotes the center of the search interval that will be provided to the module 38 closed loop LTP analysis
- ZP0 and ZP1 are index whose difference is representative of the dispersion of optimal delays per subframe in the interval centered on ZP.
- Gp 20.log 10 (RO / RO-Ymax).
- Two other thresholds S1 and S2 are used. If Gp ⁇ S1, the degree of voicing MV is taken equal to 1 for the current frame.
- Gp> S2 the dispersion of the optimal delays for the different sub-frames of the current frame is examined. If ZP1-ZP ⁇ N3 / 2 and ZP-ZP0 ⁇ N3 / 2, an interval of length N3 centered on ZP is sufficient to take into account all the optimal delays and the degree of voicing is taken equal to 3 (if Gp> S2) . Otherwise, if ZP1-ZP ⁇ N3 / 2 or ZP-ZPO> N3 / 2, the degree of voicing is taken equal to 2 (if Gp> S2).
- ZP + DP index of TP delay ultimately determined may therefore in some cases be more small than 0 or larger than 255.
- This allows analysis LTP in closed loop to also carry on some delays TP smaller than rmin or larger than rmax.
- Reducing the delay search interval for very closely spaced frames reduces the complexity of the closed loop LTP analysis performed by the module 38 by reducing the number of convolutions y T (i) to be calculated according to formula (1).
- Another possibility is to provide a parity bit for the delay TP and / or the gain g P , making it possible to detect possible errors affecting these parameters.
- the first optimizations carried out in step 90 relative to the different sub-frames are replaced by a single optimization relating to the entire frame.
- the autocorrelations C (k) and the delayed energies G (k) for the entire frame are also calculated:
- nz basic delays K 1 ' , ..., K nz ' in full resolution.
- the voiced / unvoiced decision (step 92) is taken on the basis of that of the basic delays K i ' which provides the greatest value for the first open-loop estimate of the long-term prediction gain.
- the basic delays in fractional resolution are determined by the same process as in step 100, but only allowing the quantized delay values. Examination 101 of the submultiples and multiples is not carried out. For the phase 132 of calculating the second estimate of the prediction gain, the nz basic delays previously determined are taken as candidate delays. This second variant makes it possible to dispense with the systematic examination of the submultiples and of the multiples which are generally taken into account by virtue of the subdivision of the domain of possible delays.
- phase 132 is modified in that, in the optimization steps 148, the index i st , which maximizes C st ' 2 (r i ) / G st ' (r i ) for I st (j) -N1 / 2 ⁇ i ⁇ I st (j) + N1 / 2 and 0 ⁇ i ⁇ N, and on the other hand, during the same maximization loop, the index k st ' which maximizes this same quantity over a reduced interval I st (j) -N3 / 2 ⁇ i ⁇ I st ( j ) + N3 / 2 and 0 ⁇ i ⁇ N.
- Step 152 is also modified: the indexes ZP0 and ZP1 are no longer stored, but a quantity Ymax 'defined in the same way as Ymax but with reference to the reduced length interval:
- Gp' 20.log 10 [R0 / (R0-Ymax ')].
- the sub-frames for which the prediction gain is negative or negligible can be identified by consulting the nst pointers. If necessary, the module 38 is deactivated for the corresponding subframes. This does not affect the quality of the LTP analysis since the prediction gain corresponding to these subframes will be almost zero anyway.
- Another aspect of the invention relates to the module 42 for calculating the impulse response of the weighted synthesis filter.
- the closed loop LTP analysis module 38 needs this impulse response h over the duration of a subframe to calculate the convolutions y T (i) according to formula (1).
- the stochastic analysis module 40 also needs it to calculate convolutions as will be seen below.
- the operations performed by the module 42 are for example in accordance with the flowchart of FIG. 7.
- the truncated energies of the impulse response are also calculated:
- the coefficients a k are those involved in the perceptual weighting filter, i.e. the linear prediction coefficients interpolated but not quantified
- the coefficients a k are those applied to the synthesis filter, i.e. the quantized and interpolated linear prediction coefficients.
- the module 42 determines the smallest length L ⁇ such that the energy Eh (L ⁇ -1) of the response impulse truncated at L ⁇ samples or at least equal to a proportion ⁇ of its total energy Eh (pst-1) estimated on pst samples.
- a typical value of ⁇ is 98%.
- the number L ⁇ is initialized to pst in step 162 and decremented by one as 166 as Eh (L ⁇ -2)> ⁇ .Eh (pst-1) (test 164).
- the length L ⁇ sought is obtained when the test 164 shows that Eh (L ⁇ -2) ⁇ .Eh (pst-1).
- a term corrector ⁇ (MV) is added to the value of L ⁇ which has been obtained (step 168).
- This corrector term is preferably an increasing function of the degree of voicing.
- ⁇ (0) - 5
- ⁇ (3) + 7.
- the truncation length Lh of the response impulse is taken equal to L ⁇ if L ⁇ nst and to nst if not.
- a third aspect of the invention relates to the module 40 of stochastic analysis used to model the unpredictable part of the excitement.
- the stochastic excitation considered here is of the type multi-pulse.
- Stochastic excitement relating to a subframe is represented by np position pulses p (n) and amplitudes, or gains, g (n) 1 ⁇ n ⁇ np).
- the gp gain of long-term prediction can also be calculated during of the same process.
- the excitation sequence relating to a subframe comprises nc contributions associated respectively with nc gains.
- the contributions are 1st sample vectors which, weighted by the associated and summed earnings correspond to the excitation sequence of the short-term synthesis filter.
- One of the contributions can be predictable, or several in the case of a long-term synthesis filter with several taken ("multi-tap pitch synthesis filter"). Others contributions are in this case np vectors do with only 0 except an amplitude pulse 1.
- the vectors F p (n) are simply constituted by the vector of the impulse response h shifted by p (n) samples. Truncating the impulse response as described above therefore makes it possible to significantly reduce the number of operations useful for calculating the scalar products involving these vectors F p (n) .
- the gains g nc-1 (i) are the selected gains and the minimized quadratic error E is equal to the energy of the target vector e nc-1 .
- the decomposition of Cholesky and the inversion of the matrix M n however require to carry out divisions and calculations of square roots which are operations demanding in terms of computation complexity.
- Different constraints can be brought to the domain of maximization of the quantity above included in the interval [0, lst [.
- the maximization is carried out in step 182 on the set of possible positions excluding the segments in which the positions p (1), ..., p (n have been found respectively) -1) pulses during previous iterations.
- the module 40 proceeds to the calculation 184 of the line n of the matrices L, R and K involved in the decomposition of the matrix B, which makes it possible to complete the matrices L n , R n and K n defined above.
- the column index j is first initialized at 0, in step 186.
- the variable tmp is first initialized at the value of component B (n, j), that is:
- step 188 the integer k is also initialized to 0.
- a comparison 190 is then made between the integers k and j. If k ⁇ j, we add the term L (n, k). R (j, k) to the variable tmp, then we increment the whole k by one unit (step 192) before re-performing the comparison 190.
- step 196 If j ⁇ n, the component R (n, j) is taken equal to tmp and the component L (n, j) to tmp.K (j) in step 196, then the column index j is incremented d 'a unit before returning to step 188 to calculate the following components.
- K (n) is taken equal to 1 / tmp if tmp ⁇ 0 (step 198) and to 0 otherwise.
- the calculation 184 requires at most one division 198, to obtain K (n).
- any singularity of the matrix B n does not cause instabilities since we avoid divisions by 0.
- the inversion 200 then begins with an initialization 202 of the column index j 'at n-1.
- the term Linv (j ') is initialized to -L (n, j') and the integer k 'to j' + 1.
- a comparison 206 is then carried out between the integers k ′ and n.
- the inversion 200 is followed by the calculation 214 of the reoptimized gains and of the target vector E for the following iteration.
- the computation of the reoptimized gains is also very simplified by the decomposition retained for the matrix B.
- One can indeed compute the vector g n (g n (0), ..., g n (n)) solution of g n .
- B n b n according to: and
- g n (i ') g n-1 (i') + L -1 (n, i ').
- the calculation 214 is detailed in FIG. 11.
- b (n) serves as the initialization value for the variable tmq.
- index i is also initialized to 0.
- the comparison 218 is then carried out between the integers i and n. If i ⁇ n, we add the term b (i). Linv (i) to the variable tmq and we increment i by one unit (step 220) before returning to the comparison 218.
- Step 226 also includes the incrementation of the index i 'before returning to the comparison 224.
- Segmental pulse search significantly decreases the number of pulse positions to be evaluated during steps 182 of the search for stochastic excitation. It also allows efficient quantification of the positions found.
- ns> np also has the advantage that good robustness to transmission errors can be obtained with regard to the positions of the pulses, by virtue of a separate quantification of the sequence numbers of the occupied segments and of the relative positions pulses in each occupied segment.
- the possible binary words are stored in a quantification table in which the reading addresses are the quantization indexes received.
- the order in this table, determined once for all, can be optimized so that an error of transmission affecting a bit of the index (the error case the more frequent, especially when interlacing is used work in the channel encoder 22) has, on average, minimal consequences according to a neighborhood criterion.
- the neighborhood criterion is for example that a word of ns bits does not can be replaced only by words "neighbors", distant a Hamming distance at most equal to an np-2 ⁇ threshold, so as to keep all the pulses except ⁇ of them at valid positions in case of transmission error the single-bit index.
- Other criteria would be usable in substitution or in addition, for example that two words are considered neighbors if the replacement of one by the other does not change the order of assignment of gains associated with pulses.
- the order in the table word quantification can be determined from arithmetic considerations or, if this is insufficient, in simulating error scenarios on a computer (so exhaustive or by statistical sampling of the type Monte-Carlo according to the number of possible error cases).
- the module scheduling 46 can put in the category of minimum protection, or in the unprotected category, a nx number of index bits which, if they are affected by a transmission error, give rise to a wrong word but checking the neighborhood criterion with a probability considered satisfactory, and put in a category more protected the other bits of the index. This way of proceed uses another word order in the quantification table.
- This scheduling can also be optimized using simulations if you want maximize the number nx of the index bits assigned to the least protected category.
- One possibility is to start by constituting a list of words of ns bits by counting in Gray code from 0 to 2 ns -1, and to obtain the ordered quantification table by deleting from this list the words having no weight of Hamming of np.
- the table thus obtained is such that two consecutive words have a Hamming distance of np-2. If the indexes in this table have a binary representation in Gray code, any error on the least significant bits makes vary the index of ⁇ 1 and thus involves the replacement of the words of effective occupation by a neighbor word in the sense of the np-2 threshold on the Hamming distance, and an error on the i-th least significant bit also varies the index by ⁇ 1 with a probability of approximately 2 1-i .
- nx By placing the nx least significant bits of the index in Gray code in an unprotected category, a possible transmission error affecting one of these bits leads to the replacement of the busy word by a neighboring word with a probability at least equal. to (1 + 1/2 + ... + 1/2 nx-1 ) / nx. This minimum probability decreases from 1 to (2 / nb) (1-1 / 2 nb ) for nx increasing from 1 to nb.
- the errors affecting the nb-nx most significant bits of the index will most often be corrected thanks to the protection applied to them by the channel coder.
- the value of nx is in this case chosen according to a compromise between robustness to errors (small values) and a reduced size of the protected categories (large values).
- the possible binary words for represent the occupation of the segments are arranged in order growing in a search table.
- An indexing table associates with each address the serial number, in the table of quantization stored at the decoder, of the binary word having this address in the lookup table.
- the content of the table search and index table is given in the table III (in decimal values).
- the quantification of the occupation word of the segments deduced from the np positions provided by the stochastic analysis module 40 is carried out in two stages by the quantization module 44.
- a dichotomous search is first carried out in the search table to determine the address in this table of the word to be quantified.
- the quantization index is then obtained at the address determined in the indexing table and then supplied to the bit scheduling module 46.
- Address Search table Indexing table 0 3 0 1 5 1 2 6 5 3 9 2 4 10 4 5 12 3
- the module 44 also performs the quantification of the gains calculated by the module 40.
- the quantization bits of Gs are placed in a category protected by the channel 22 encoder, as well as most significant bits of the gain quantification indexes relative.
- the relative gain quantization bits are ordered to allow assignment to impulses associated belonging to the segments localized by the word of occupation. Segmental research according to the invention also allows effective protection of positions relative pulses associated with the largest values gain.
- the decoder 54 To reconstruct impulse contributions of excitation, the decoder 54 first locates the segments by means of the occupation word received; he then assigns the associated earnings; then he assigns the positions relative to pulses based on the order of importance of the gains.
- the 13 kbit / s speech coder requires order 15 million comma instructions per second (Mips) fixed. So we typically do this by programming a commercial digital signal processor (DSP) as well as the decoder which requires only about 5 Mips.
- DSP digital signal processor
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Investigating Or Analysing Materials By Optical Means (AREA)
- Analysing Materials By The Use Of Radiation (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR9500124 | 1995-01-06 | ||
FR9500124A FR2729244B1 (fr) | 1995-01-06 | 1995-01-06 | Procede de codage de parole a analyse par synthese |
PCT/FR1996/000005 WO1996021219A1 (fr) | 1995-01-06 | 1996-01-03 | Procede de codage de parole a analyse par synthese |
Publications (2)
Publication Number | Publication Date |
---|---|
EP0801789A1 EP0801789A1 (fr) | 1997-10-22 |
EP0801789B1 true EP0801789B1 (fr) | 1998-12-02 |
Family
ID=9474923
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP96901009A Expired - Lifetime EP0801789B1 (fr) | 1995-01-06 | 1996-01-03 | Procede de codage de parole a analyse par synthese |
EP96400028A Expired - Lifetime EP0721180B1 (fr) | 1995-01-06 | 1996-01-05 | Procédé de codage de parole à analyse par synthèse |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP96400028A Expired - Lifetime EP0721180B1 (fr) | 1995-01-06 | 1996-01-05 | Procédé de codage de parole à analyse par synthèse |
Country Status (8)
Country | Link |
---|---|
US (1) | US5899968A (zh) |
EP (2) | EP0801789B1 (zh) |
CN (1) | CN1134761C (zh) |
AT (2) | ATE174147T1 (zh) |
AU (1) | AU4490296A (zh) |
DE (2) | DE69601068T2 (zh) |
FR (1) | FR2729244B1 (zh) |
WO (1) | WO1996021219A1 (zh) |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FI955266A (fi) * | 1995-11-02 | 1997-05-03 | Nokia Telecommunications Oy | Menetelmä ja laitteisto viestien välittämiseksi tietoliikennejärjestelmässä |
JP3134817B2 (ja) * | 1997-07-11 | 2001-02-13 | 日本電気株式会社 | 音声符号化復号装置 |
US6807527B1 (en) * | 1998-02-17 | 2004-10-19 | Motorola, Inc. | Method and apparatus for determination of an optimum fixed codebook vector |
US6810377B1 (en) * | 1998-06-19 | 2004-10-26 | Comsat Corporation | Lost frame recovery techniques for parametric, LPC-based speech coding systems |
US6453289B1 (en) | 1998-07-24 | 2002-09-17 | Hughes Electronics Corporation | Method of noise reduction for speech codecs |
US6493665B1 (en) * | 1998-08-24 | 2002-12-10 | Conexant Systems, Inc. | Speech classification and parameter weighting used in codebook search |
US6480822B2 (en) * | 1998-08-24 | 2002-11-12 | Conexant Systems, Inc. | Low complexity random codebook structure |
US6823303B1 (en) * | 1998-08-24 | 2004-11-23 | Conexant Systems, Inc. | Speech encoder using voice activity detection in coding noise |
US6192335B1 (en) * | 1998-09-01 | 2001-02-20 | Telefonaktieboiaget Lm Ericsson (Publ) | Adaptive combining of multi-mode coding for voiced speech and noise-like signals |
JP3372908B2 (ja) * | 1999-09-17 | 2003-02-04 | エヌイーシーマイクロシステム株式会社 | マルチパルス探索処理方法と音声符号化装置 |
JP4367808B2 (ja) * | 1999-12-03 | 2009-11-18 | 富士通株式会社 | 音声データ圧縮・解凍装置及び方法 |
US6842733B1 (en) | 2000-09-15 | 2005-01-11 | Mindspeed Technologies, Inc. | Signal processing system for filtering spectral content of a signal for speech coding |
US6850884B2 (en) * | 2000-09-15 | 2005-02-01 | Mindspeed Technologies, Inc. | Selection of coding parameters based on spectral content of a speech signal |
US7047188B2 (en) * | 2002-11-08 | 2006-05-16 | Motorola, Inc. | Method and apparatus for improvement coding of the subframe gain in a speech coding system |
CN101320565B (zh) * | 2007-06-08 | 2011-05-11 | 华为技术有限公司 | 感知加权滤波方法及感知加权滤波器 |
US9626982B2 (en) * | 2011-02-15 | 2017-04-18 | Voiceage Corporation | Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec |
RU2669139C1 (ru) | 2011-04-21 | 2018-10-08 | Самсунг Электроникс Ко., Лтд. | Устройство для квантования коэффициентов кодирования с линейным предсказанием, устройство кодирования звука, устройство для деквантования коэффициентов кодирования с линейным предсказанием, устройство декодирования звука, и электронное устройство для этого |
MY185091A (en) | 2011-04-21 | 2021-04-30 | Samsung Electronics Co Ltd | Method of quantizing linear predictive coding coefficients, sound encoding method, method of de-quantizing linear predictive coding coefficients, sound decoding method, and recording medium |
US9208134B2 (en) * | 2012-01-10 | 2015-12-08 | King Abdulaziz City For Science And Technology | Methods and systems for tokenizing multilingual textual documents |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
NL8302985A (nl) * | 1983-08-26 | 1985-03-18 | Philips Nv | Multipulse excitatie lineair predictieve spraakcodeerder. |
CA1223365A (en) * | 1984-02-02 | 1987-06-23 | Shigeru Ono | Method and apparatus for speech coding |
NL8500843A (nl) * | 1985-03-22 | 1986-10-16 | Koninkl Philips Electronics Nv | Multipuls-excitatie lineair-predictieve spraakcoder. |
US4831624A (en) * | 1987-06-04 | 1989-05-16 | Motorola, Inc. | Error detection method for sub-band coding |
US4802171A (en) * | 1987-06-04 | 1989-01-31 | Motorola, Inc. | Method for error correction in digitally encoded speech |
CA1337217C (en) * | 1987-08-28 | 1995-10-03 | Daniel Kenneth Freeman | Speech coding |
ES2039655T3 (es) * | 1988-01-05 | 1993-10-01 | British Telecommunications Public Limited Company | Codificacion del habla. |
SE463691B (sv) * | 1989-05-11 | 1991-01-07 | Ericsson Telefon Ab L M | Foerfarande att utplacera excitationspulser foer en lineaerprediktiv kodare (lpc) som arbetar enligt multipulsprincipen |
US5060269A (en) * | 1989-05-18 | 1991-10-22 | General Electric Company | Hybrid switched multi-pulse/stochastic speech coding technique |
US5097508A (en) * | 1989-08-31 | 1992-03-17 | Codex Corporation | Digital speech coder having improved long term lag parameter determination |
SG47028A1 (en) * | 1989-09-01 | 1998-03-20 | Motorola Inc | Digital speech coder having improved sub-sample resolution long-term predictor |
DE69033011T2 (de) * | 1989-10-17 | 2001-10-04 | Motorola Inc | Digitaler sprachdekodierer unter verwendung einer nachfilterung mit einer reduzierten spektralverzerrung |
US5073940A (en) * | 1989-11-24 | 1991-12-17 | General Electric Company | Method for protecting multi-pulse coders from fading and random pattern bit errors |
US5097507A (en) * | 1989-12-22 | 1992-03-17 | General Electric Company | Fading bit error protection for digital cellular multi-pulse speech coder |
US5265219A (en) * | 1990-06-07 | 1993-11-23 | Motorola, Inc. | Speech encoder using a soft interpolation decision for spectral parameters |
FI98104C (fi) * | 1991-05-20 | 1997-04-10 | Nokia Mobile Phones Ltd | Menetelmä herätevektorin generoimiseksi ja digitaalinen puhekooderi |
US5253269A (en) * | 1991-09-05 | 1993-10-12 | Motorola, Inc. | Delta-coded lag information for use in a speech coder |
DE9218980U1 (de) * | 1991-09-05 | 1996-08-22 | Motorola Inc., Schaumburg, Ill. | Fehlerschutz für Mehrmoden-Sprachcoder |
TW224191B (zh) * | 1992-01-28 | 1994-05-21 | Qualcomm Inc | |
FI95085C (fi) * | 1992-05-11 | 1995-12-11 | Nokia Mobile Phones Ltd | Menetelmä puhesignaalin digitaaliseksi koodaamiseksi sekä puhekooderi menetelmän suorittamiseksi |
US5495555A (en) * | 1992-06-01 | 1996-02-27 | Hughes Aircraft Company | High quality low bit rate celp-based speech codec |
US5317595A (en) * | 1992-06-30 | 1994-05-31 | Nokia Mobile Phones Ltd. | Rapidly adaptable channel equalizer |
IT1264766B1 (it) * | 1993-04-09 | 1996-10-04 | Sip | Codificatore della voce utilizzante tecniche di analisi con un'eccitazione a impulsi. |
-
1995
- 1995-01-06 FR FR9500124A patent/FR2729244B1/fr not_active Expired - Fee Related
-
1996
- 1996-01-03 DE DE69601068T patent/DE69601068T2/de not_active Expired - Fee Related
- 1996-01-03 EP EP96901009A patent/EP0801789B1/fr not_active Expired - Lifetime
- 1996-01-03 WO PCT/FR1996/000005 patent/WO1996021219A1/fr active IP Right Grant
- 1996-01-03 AU AU44902/96A patent/AU4490296A/en not_active Abandoned
- 1996-01-03 CN CNB961917954A patent/CN1134761C/zh not_active Expired - Fee Related
- 1996-01-03 AT AT96901009T patent/ATE174147T1/de not_active IP Right Cessation
- 1996-01-03 US US08/860,799 patent/US5899968A/en not_active Expired - Fee Related
- 1996-01-05 AT AT96400028T patent/ATE183600T1/de not_active IP Right Cessation
- 1996-01-05 EP EP96400028A patent/EP0721180B1/fr not_active Expired - Lifetime
- 1996-01-05 DE DE69603755T patent/DE69603755T2/de not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
US5899968A (en) | 1999-05-04 |
FR2729244A1 (fr) | 1996-07-12 |
WO1996021219A1 (fr) | 1996-07-11 |
EP0801789A1 (fr) | 1997-10-22 |
CN1134761C (zh) | 2004-01-14 |
AU4490296A (en) | 1996-07-24 |
FR2729244B1 (fr) | 1997-03-28 |
CN1173940A (zh) | 1998-02-18 |
DE69601068D1 (de) | 1999-01-14 |
EP0721180B1 (fr) | 1999-08-18 |
ATE174147T1 (de) | 1998-12-15 |
DE69603755T2 (de) | 2000-07-06 |
DE69601068T2 (de) | 1999-07-15 |
ATE183600T1 (de) | 1999-09-15 |
EP0721180A1 (fr) | 1996-07-10 |
DE69603755D1 (de) | 1999-09-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0801790B1 (fr) | Procede de codage de parole a analyse par synthese | |
EP0801788B1 (fr) | Procede de codage de parole a analyse par synthese | |
EP0801789B1 (fr) | Procede de codage de parole a analyse par synthese | |
EP0782128B1 (fr) | Procédé d'analyse par prédiction linéaire d'un signal audiofréquence, et procédés de codage et de décodage d'un signal audiofréquence en comportant application | |
EP0749626B1 (fr) | Procede de codage de parole a prediction lineaire et excitation par codes algebriques | |
EP1994531B1 (fr) | Codage ou decodage perfectionnes d'un signal audionumerique, en technique celp | |
US8401843B2 (en) | Method and device for coding transition frames in speech signals | |
FR2734389A1 (fr) | Procede d'adaptation du niveau de masquage du bruit dans un codeur de parole a analyse par synthese utilisant un filtre de ponderation perceptuelle a court terme | |
EP1692689B1 (fr) | Procede de codage multiple optimise | |
EP0616315A1 (fr) | Dispositif de codage et de décodage numérique de la parole, procédé d'exploration d'un dictionnaire pseudo-logarithmique de délais LTP, et procédé d'analyse LTP | |
EP1192619B1 (fr) | Codage et decodage audio par interpolation | |
WO2002029786A1 (fr) | Procede et dispositif de codage segmental d'un signal audio | |
EP1194923B1 (fr) | Procedes et dispositifs d'analyse et de synthese audio | |
EP1192621B1 (fr) | Codage audio avec composants harmoniques | |
EP1192618B1 (fr) | Codage audio avec liftrage adaptif | |
Jung et al. | Efficient implementation of ITU-T G. 723.1 speech coder for multichannel voice transmission and storage | |
JP2000330594A (ja) | 音声符号化装置及び方法並びに音声符号化プログラムを記録した記憶媒体 | |
WO2001003119A1 (fr) | Codage et decodage audio incluant des composantes non harmoniques du signal | |
WO2013135997A1 (fr) | Modification des caractéristiques spectrales d'un filtre de prédiction linéaire d'un signal audionumérique représenté par ses coefficients lsf ou isf |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 19970725 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE CH DE FR GB IT LI LU NL SE |
|
GRAG | Despatch of communication of intention to grant |
Free format text: ORIGINAL CODE: EPIDOS AGRA |
|
17Q | First examination report despatched |
Effective date: 19980225 |
|
GRAG | Despatch of communication of intention to grant |
Free format text: ORIGINAL CODE: EPIDOS AGRA |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: MATRA NORTEL COMMUNICATIONS |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE CH DE FR GB IT LI LU NL SE |
|
REF | Corresponds to: |
Ref document number: 174147 Country of ref document: AT Date of ref document: 19981215 Kind code of ref document: T |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: AT Payment date: 19981221 Year of fee payment: 4 |
|
REF | Corresponds to: |
Ref document number: 69601068 Country of ref document: DE Date of ref document: 19990114 |
|
ITF | It: translation for a ep patent filed |
Owner name: BARZANO' E ZANARDO MILANO S.P.A. |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: LU Payment date: 19990121 Year of fee payment: 4 |
|
GBT | Gb: translation of ep patent filed (gb section 77(6)(a)/1977) |
Effective date: 19990105 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: NV Representative=s name: KELLER & PARTNER PATENTANWAELTE AG |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: BE Payment date: 19990212 Year of fee payment: 4 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed | ||
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20000103 Ref country code: AT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20000103 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20000131 Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20000131 Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20000131 |
|
BERE | Be: lapsed |
Owner name: MATRA NORTEL COMMUNICATIONS Effective date: 20000131 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20000801 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
NLV4 | Nl: lapsed or anulled due to non-payment of the annual fee |
Effective date: 20000801 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: SE Payment date: 20001218 Year of fee payment: 6 |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: IF02 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20020104 |
|
EUG | Se: european patent has lapsed |
Ref document number: 96901009.9 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: CD Ref country code: FR Ref legal event code: CA |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20040130 Year of fee payment: 9 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20041210 Year of fee payment: 10 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED. Effective date: 20050103 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20050105 Year of fee payment: 10 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20050802 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20060103 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20060131 |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20060103 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: ST Effective date: 20060929 |