WO2002077977A1

WO2002077977A1 - Method and device for centralised correction of speech tone on a telephone communication network

Info

Publication number: WO2002077977A1
Application number: PCT/FR2002/001022
Authority: WO
Inventors: Gaël MAHE; André Gilloire
Original assignee: France Telecom (Sa)
Priority date: 2001-03-28
Filing date: 2002-03-25
Publication date: 2002-10-03
Also published as: FR2822999B1; FR2822999A1

Abstract

The invention concerns a method and a device for correcting speech tone transmitted in the form of a signal through a telephone network transmission link. The correction being carried out after the analog-to-digital conversion of the signal in the network, and comprises a pre-equalization (23) of the digital signal with a fixed filter having a frequency response in a Fc-Fh frequency band corresponding to the reciprocal of a reference spectral deformation introduced by the telephone link, with Fc < 300Hz and Fh ≥ 3150Hz and an adapted equalization (24) wherein a filter is used having a frequency response automatically adapted to the real distortion introduced by the telephone link based on the ratio between the reference spectrum and a spectrum corresponding to the signal long term spectrum.

Description

METHOD AND DEVICE FOR CENTRALIZED CORRECTION OF SPEECH TIMER ON A COMMUNICATIONS NETWORK

TELEPHONE

The subject of the invention is a method and a device for centralized correction of the timbre of speech on a telephone communications network.

The invention applies to any type of communication network (fixed, mobile or other) introducing spectral distortions and modifications of the signal level, by means of the appropriate choice of certain parameters.

In the case of a wired telephone network, speech suffers two spectral distortions.

The first distortion is bandpass filtering

(300-3400Hz) at the ends of each analog subscriber line (transmitting telephone terminal respectively receiver - and analog-digital conversion point - respectively digital-analog), standardized under. name of "System of

Intermediate Reference ”(SRI) which is defined by the ITU recommendation p48.

Reference may be made to FIGS. 1a and 1b representing the frequency responses of the transmission and reception systems defined by the ITU.

This bandpass filtering degrades the timbre by strongly attenuating the low-frequency components of speech. The second distortion _. is that introduced by the analog lines themselves, which constitute low-pass filters whose slope is all the steeper the longer the line. In a simple line model analog, the attenuation in dB is proportional to the square root of the frequency:

H _dB (f) = _dB (m Hz) ^ (1)

with Hdβ (800Hz), worth 3 dB for medium lines and 9.5 dB for longest lines.

The frequency responses of different lines (short, medium and long) are shown in Figure 2.

This low-pass filtering mutes the speaker's voice.

In the case of a mobile network, the signal undergoes only 300-3400Hz bandpass filtering at the transmitter and receiver terminals. This bandpass filtering must comply with a template defined by ITU recommendation P.313.

Up to now, the compensation of the spectral distortions introduced into the speech signal by the various elements of the telephone link has been achieved by devices based on equalization. This can be fixed or adapt according to the transmission conditions.

A first state of the art relates to centralized fixed equalization devices.

Indeed, centralized equalization devices have been proposed in US patents 5,333,195 and US 5,471,527. These equalizers are fixed filters which restore the level of low frequencies attenuated by the transmitter. US patent 5,333,195, for example, proposes a gain of 10 to 15 dB on the 100-300Hz band. These methods have two drawbacks: the equalizer only compensates for the filtering of the transmitter, so that at reception the low-frequency components remain strongly weakened by the filtering SRI of reception. - This fixed equalization compensates for average transmission conditions (line and transmission system). If the real conditions are too different (for example if the analog lines are long, the device does not correct the timbre enough, or even alters it more than the link without equalization.

,. A second state: from there. technique is based on adaptive equalization devices. The device described in US Pat. No. 5,915,235 aims to correct the non-ideal frequency response of a mobile telephone transducer. The equalizer is described as being placed between an analog-digital converter and a CELP (Code Exited Linear Predictive Coding) coder, but can be in the telephone terminal as well as in the network.

Two methods are then proposed in this document:

- The first consists in calculating the long-term auto-correlation coefficients R _L τ:

R _LT (n, i) = aR τ (nl, i) + (l ~) R (n, i), (2)

With: R _LT (n, i) i ^lth long-term auto-correlation coefficient in the n ^th frame, R (n, i) i ^{lth autocorrelation} coefficient specific to the n ^th frame, and α constant smoothing set for example at 0.995. From these coefficients are derived the long term LPC (Linear Predictive Coding) coefficients, which are the coefficients. a whitening filter. At the output of this filter, the signal is filtered by a fixed filter which prints to it the long-term spectral characteristics, ideal, that is to say those it would have at the output of a transducer having the frequency response ideal. These two filters are supplemented by a multiplicative gain equal to the ratio between the long-term energies of the inlet of the whitening filter and the outlet of the second filter. - The second method consists in dividing the signal into sub-bands and, for each sub-band, applying a

multiplicative gain - so as to reach a target energy, ... this gain being defined as the ratio between the target energy of the sub-band and the long-term energy (obtained by smoothing the energy signal) in this sub-band.

These two methods have the drawback of only correcting the non-ideal response of the transmission system and not that of the reception system.

- The device described in patent FR 9408741

(US 5905969) aims to compensate for the filtering of the transmission system and of the transmission subscriber line in order to improve the centralized recognition of the speech and / or the quality of the transmitted speech. As shown in Figure 3a of this patent, the signal spectrum is divided into 24 sub-bands and each sub-band energy is multiplied by an adaptive gain. An adaptation of the gain is carried out according to the algorithm of the stochastic gradient, by minimization of the quadratic error, the error being defined as the difference between the energy of sub-band and a reference energy defined for each sub-band. The reference energy is modulated with each frame by the energy overall of the current frame, so as to respect the natural variations in short-term level of the speech signal. The convergence of the algorithm allows the 24 equalized sub-band signals to be output.

The device does not correct the filtering of the reception system and the analogue reception line.

This patent does not mention results in terms of improvement of the voice quality and recognizes that the method is sub-optimal, because one operates a transformation in the frequency domain with an error. due to the operation of., .convolution. implicit circular linked to this transformation.

In addition, there is an oscillation of the system around the optimal solution linked to the fact that the correction is carried out by an adaptive closed-loop algorithm (feedback loop).

Line effect compensation is described in the article "On line adaptation of a speech recognizer to variation in telephone lines conditions", Eurospeech, pp 1247-1250, Sep 1993 by C. Mokbel, J. Monné and D Jouvet by the cepstral subtraction method, in order to improve the robustness of speech recognition.

It is shown in this document that the cepstrum of the transmission channel can be estimated by the mean cepstrum of the received signal, the latter being previously whitened by a pre-accentuation filter. This method allows a marked improvement in the performance of speech recognition systems, but is considered to be an “offline” method, 2 to 4 seconds being necessary to estimate the average cepstrum. Therefore she cannot apply to the correction of distortions on speech introduced by the transmission channel of a telephone network.

The present invention aims .amélioration the quality of speech transmitted over the communication networks, by providing means for correcting the spectral distortion of the speech signal and the level differences with ^'respect to the nominal level desirable for speech perception, distortions and deviations caused by different links in the chain of transmission.

The invention relates more particularly to a method for correcting the timbre of speech transmitted in the form of a signal by means of a transmission link of a telephone network, the correction taking place after the analog-digital conversion of the signal in the network, mainly characterized in that it comprises at least one step of pre-equalization of the digital signal by a fixed filter having 'a frequency response in a frequency band Fc-Fh corresponding to the inverse of a deformation reference spectral introduced by the transmission link, with Fc <300Hz and Fh> 3150Hz.

According to another characteristic, the spectral distortion taken as reference is characterized, in the case of a connection of a switched telephone network (PSTN), from the cascading of an intermediate reference system (SRI) of the type defined by ITU-T recommendation P.48, and two medium analog lines (transmit and receive).

In the case where the sending terminal is a mobile telephone and the receiving terminal is an extension fixed, the spectral deformation taken as reference is that resulting from the cascading of a filter. respecting the efficiency mask of the mobiles in transmission defined by the recommendation P.313 of the ITU, 5 of a line, analog average and. of an IRS receiver system as defined by recommendation ITU-TP.48.

In the case where the transmitting terminal is a landline and the receiving terminal is a mobile phone, 0 the spectral response taken as reference is that resulting from the cascading of a transmitting system

_. IRS, such .. as. defined by the recommendation. ITU-TP.4.8,

^' .. • ^{• •} • "of - an average analog line and of a filter respecting the efficiency template of the mobiles in reception 5 defined by the recommendation P.313 of the ITU.

In the case of a link between mobile terminals, the spectral deformation taken as reference is, that resulting from the cascading of a filter respecting the efficiency template of the mobiles in 0 emission defined by recommendation P.313 of l 'And a filter respecting the efficiency template of the mobiles in reception defined by the ITU recommendation P.313.

According to another characteristic, the method of correcting the timbre of the speech further comprises an adapted equalization step in which a filter is used having a frequency response automatically adapted to the actual distortion introduced by the telephone link as a function of the ratio between a reference spectrum and a spectrum corresponding to the long-term spectrum of the signal.

According to another characteristic, the adapted equalization step comprises: - the detection of a voice activity on the line to trigger a sequence of treatments for the calculation of the coefficients, of the digital filter as a function of the ratio between the reference spectrum and the spectrum corresponding to the long-term spectrum of the signal,

- controlling the filter with the coefficients obtained and updating said coefficients,

the filtering of the signal leaving the pre-equalizer by said filter. According to another characteristic, the sequence of treatments comprises: .... - -_ le. long-term spectrum calculation of ..signal., in. des-., successive time windows - partially overlapping, this calculation being carried out in the frequency band Fc-Fh,

- the calculation of the modulus of the frequency response of the adapted equalizer by carrying out the ratio of the square root of the long-term spectrum obtained in a time window, to the square root of the reference spectrum, the square root of the reference spectrum being compensated at each frequency by a predetermined factor A (f) depending on the frequency.

According to another characteristic, the processing sequence comprises an extrapolation of the module of the frequency response of the adapted equalizer, for the frequencies outside the band Fc-Fh, the frequency response being defined for all the frequencies between 0- 4000Hz.

The chain of processing operations furthermore comprises the calculation of the impulse response of the digital filter from the module of the frequency response of the adapted equalizer extrapolated for the frequencies outside the band Fc-Fh. The long-term signal spectrum calculation includes a stiff Fourier transform operation.

The calculation of the impulse response of the .filtre consists in calculating the coefficients of the filter by operating an inverse Fourier transform on the modulus of the frequency response, followed by symmetrization, windowing and an offset.

The application of a time window corresponds to a smoothing of the initial frequency response calculated. ... The calculation of the long-term spectrum of the signal in. successive time windows. partially overlapping includes: - signal sampling in a time window, fast Fourier transform (FFT) operation of the sampled signal, calculation of power spectral density, - calculation of average spectral density of power, over a predetermined period. Advantageously, the calculation of the average of the power spectral density consists of:

for the first N time windows from the detection of the presence of speech in the signal, calculating the arithmetic mean of the power spectral densities of all the time windows elapsed since said detection, N being a predetermined number of time windows, typically but not exclusively the number of time windows in 4 seconds of speech;

- for the following time windows, to adjust the average of the spectral density of power calculated at the previous time window by a first order recursive smoothing taking into account the power spectral density of the current time window. This results in the following generic formula:

E [γ _x (f ⁾ ] _n = α (n) γ _x (f, n) + (l-α (n)) E [γχ (f)] ni _' (9)

where E [γ _x (f)] n is the spectrum in ^the long term of x at the ^n'th frame γ _x (f, n) the power spectral density of the nth frame, and

α (n) = mι.n} (", N _λ )

The method further includes an automatic gain control step.

According to one embodiment, the automatic gain control is carried out during the sequence of treatments of the equalization step by choosing a reference spectral density γ _ref corresponding to the desired level in reception.

According to another embodiment, the automatic gain control is carried out by amplification of the signal obtained after equalization with a gain α as a function of the ratio between the power spectral density of the output signal of the reception terminal when a pre -equalization and suitable equalization of the signal and, the power spectral density of the output signal from the reception terminal in the absence of pre-equalization and suitable equalization of the signal.

Another object of the invention is a fixed digital filter intended for the correction of the timbre of speech in a telephone transmission network, mainly characterized in that said filter has a frequency response in a frequency band Fc-Fh, corresponding to the inverse of a reference spectral deformation introduced by the link, with Fc <300Hz and Fh> 3150Hz.

Another object of the invention is a suitable digital filter intended for the correction of the timbre of speech in a telephone transmission network, mainly characterized in that it comprises means for processing the speech signal having a frequency response adapted automatically at the actual distortion .. introduced by the telephone link as a function of the ratio between a reference spectrum and a spectrum corresponding to the long-term spectrum of the signal.

Another object of the invention is a device for correcting the timbre of speech in a telephone transmission network, mainly characterized in that it comprises a fixed filter followed by a suitable filter and means of automatic gain control such as as previously described.

Other features and advantages of the invention will become clear on reading the description which is given below and which is given by way of non-limiting example and with reference to the drawings in which: the figure represents the template of the transmission system, - Figure 1b represents the template of the reception system, - Figure 2 represents the responses of different lines of analog subscribers, FIG. 3 represents a simplified telephone link including correction, FIG. 4 represents the functional diagram of a correction device, - FIG. 5 represents the frequency response of the pre-equalizer for Fc = 250 Hz,

FIG. 6 represents a triangular window applied to the impulse response of the filter,

FIG. 7 represents the functional diagram of the adapted equalizer,

- Figure 8 shows the spectral distortion between the signal .. emitted and the signal, received for one. speaker 1 and for speaker 2.. ^,

The description given hereinafter makes explicit reference to the transmission of speech over “conventional” telephone lines (that is to say wired lines), but of course as has been said the invention applies to any type of communication network (fixed, mobile or other) introducing spectral distortions and changes in the signal level, subject to the appropriate choice of certain parameters.

The object of the invention is to correct the spectral distortions by centralized processing, that is to say by a device 20 installed in the digital part of the telephone network between the analog-digital converter (law A in the particular case of the European PSTN or μ law in the United States) 12 and digital (law A) -analog 32, as illustrated by the diagram in FIG. 3. The correction device 20 is preceded by a module 21 for converting the digital signal (the law A) in linear and followed by a module for converting the linear signal into law A. FIG. 4 illustrates the correction device according to the invention.

A satisfactory correction of the average distortions due to the transmission 10, reception 30 system and to the analog lines 11, 31 is obtained by a pre-equalizer 23.

In order to take account of the fact that the transmission conditions are not always average transmission conditions, • ^• (the lines used are not always of average length and the transmission and reception systems may deviate from the recommendations of ITU), the correction system

_^ additionally comprises an "equalizer., adapted - 24. and an automatic gain correction (AGC) 25. As will be seen hereinafter, the automatic gain control can be either integrated into the adapted equalizer, or make the subject of a separate module.

The pre-equalizer 23 is a fixed filter, the frequency response of which on a band Fc-Fh, such that Fc <300Hz and Fh> 3150Hz, is the inverse of the overall response of the average analog channel of a telephone link. This medium channel is defined as consisting of two medium subscriber lines and a transmission and reception system which respects the nominal frequency responses defined in the ITU recommendations.

Fc is the low frequency equalization limit. It must be less than 300 Hz in order to restore the low-frequency (LF) components of the voice.

Fh is for example at 3150Hz.

FIG. 5 represents the typical frequency response of the pre-equalizer for Fc = 250 Hz. This response is calculated from the SRI and “average line” models.

The pre-equalizer 23 having the frequency response shown in FIG. 5 is produced for example by a filter with infinite impulse response IIR, the coefficients of the transfer function at z are:

_A s we have seen, the pre-equalizer 23 compensates for average conditions of transmission.

It can be used alone. However, if used alone and one of the analog lines is long, the voice seems muted at the reception. If on the contrary a line is very short, the high frequency components are too present. Other distortions of the timbre can appear if the emission and reception systems have frequency responses too far from the ITU specifications. _This is why the pre-equalization is completed by an adapted equalizer, which adapts the correction more precisely to the real transmission conditions. The equalizer is designed so that its frequency response automatically adapts to the actual distortion introduced by the telephone link according to the ratio between a reference spectrum and the long-term spectrum of the signal.

The principle is as follows:

Let s be the speech signal emitted by the speaker, y the signal received at the end of the chain, and h the filter constituted by the complete analog channel (transmission and reception) and the pre-equalizer.

According to the interference formula,

γ _y (f) = | H (f) | ² . γ _s (f), (3)

where γ _s is the power spectral density of s, γ _y that of y and H the frequency response of h.

If the channel is assumed to be time invariant,

E [γ _y (f)] = | H (f) | ² . E [γ _s (f)], (4), where E denotes the mean. As E [γ _s (f)] is not known, we approach it by the average spectrum of speech defined by 1UIT, which is called reference spectrum noted γ _re f (f) •

Thus the frequency response of the filter is estimated by:

The frequency response of the adapted equalizer then has as expression:

As the equalizer is centralized in the network, γ _y is not known. It is expressed as a function of γ _x power spectral density of the output x of the pre-equalizer, in the case where there is no suitable equalizer ,:

where L_RX is the frequency response of the reception line and S_RX the frequency response of the reception system. As these responses are unknown a priori, they are approached by the responses of a mean line and of a reception system respecting the specification of the ITU, and denoted respectively L_RXo and S_RXo. The frequency response of the adapted equalizer sought is then:

We see in this formula that the square root of the reference spectrum γ _ref is weighted by the compensation factor A (f) due to the correction already made by the pre-equalizer. This factor is a function of the frequency as shown below:

In a preferred embodiment, the output of the pre-equalizer 23 is analyzed in 32 ms frames, with an overlap of 50%.

The adapted equalizer 24 is a RIF 251 filter whose coefficients are adapted to each activity frame voice according to equation (8), as described below and shown in Figure 7.

A voice activity frame detector 240 makes it possible to trigger a processing chain corresponding to the functional modules referenced 241 to 251 in FIG. 7 to obtain the coefficients of the filter 251.

A window for analyzing the sampled signal 241 is opened. A Fourier transform 242 is applied to the samples. Typically, but not exclusively, the long-term spectrum of x, E [γ _x ], is first calculated from

1_ 'initial instant of vocal activity) by averaging 244 of γ _x over a time window increasing from 0 to 4 seconds, then recursively adjusted for each subsequent frame, which results in the following generic formula:

E [γ _x (f)] _n = α (n) γ _x (f, n) + (l-α (n)) E [γ _x (f)] _n . ₁ - (9)

where E [γ _x (f)] _n is the long-term spectrum from x to the n ^th frame γ _x (f, n) the power spectral density of the nth frame, and

α (n) ≈ ^l mm (n, N)

where Ν is the number of frames in 4 seconds.

In practice γ _x is calculated by taking the modulus squared of the fast Fourier transform 242 in FIG. 7. The frequency response of the equalizer 24 is therefore calculated according to equation (8) for the frequencies between F _c and F _H , the choice having been made to equalize the signal only on this band. The values of | EQ | outside this frequency band are calculated in a known manner by linear extrapolation 247 of the dB value of | EQ | [ _FC -

FH] • The impulse response of the equalizer is calculated by an inverse Fourier transform IFFT 248 of | EQ | followed by symmetrization 249, so as to obtain a linear phase filter.

The frequency response of this filter, however, is very irregular and, due to the approximations which tainted its calculation, only its general form is relevant. This is why one ^" proceeds to a narrow symmetrical windowing 250 of the impulse response resulting from operations 248 (inverse Fourier transform) and 249 (symmetrization). This windowing corresponds to a smoothing of the frequency response of the filter.

The windowing is followed by an offset so as to obtain a filter of the length of the window, without additional delay.

For this, a triangular window of length 11 (samples) is used for this, the coefficients of which are shown in FIG. 6 for an initial impulse response on 256 points. Once multiplied by this window, the impulse response of the adapted equalizer is shifted by 123 points to the left. This makes it possible not to delay the signal which would be the case if the zeros in front of the window intervened in the calculation of the output of the filter.

Then an automatic gain control is carried out. This control typically has one or the other of the following two objectives: - normalize the level,

- ensure the transparency of the system vis-à-vis the overall level of speech at reception. -

Two embodiments are proposed in the following corresponding respectively to these two objectives.

In a first embodiment, the gain control is carried out by the adapted equalizer 24. The choice of γ _ref indeed corresponds to a desirable nominal level for speech. Thus, depending on the level chosen for γ _r ef / the appropriate equalizer automatically corrects the speech level to reach the desired level in reception. ....'.The goal of. the second achievement is to correct the timbre while ensuring conservation of the overall level of speech compared to the same connection without the device.

For this, the long-term spectrum of the reception signal must have the same energy with the device as without. The gain α defined by the theoretical formula is therefore applied to the output of the adapted equalizer 24:

I n. — υ with γaec (k) e γ _sa πs (k) power spectral density of the signal received at the discrete frequency k, respectively with and without the pre-equalizer set 23 plus adapted equalizer 24. As the channel is invariant in time, the adapted equalizer converges towards a response varying little so that at each frame:

However, j EQι ± _ss j is not known directly, since the smoothing of the frequency response of the equalizer is carried out by windowing the impulse response from | EQ]. As the quantity | EQ | is directly available (calculated in the adapted equalization module), to simplify the implementation, we approach | EQn _ss | by λ | EQii _SS | , λ being a correction factor for the energy difference between | EQii _SS | and | EQ | related to the windowing of the impulse response. If we write W the frequency response of the window,

EQii _ss (k) = i-EQ (k) ® (k) (12)

NOT

where ® denotes the circular convolution and N the number of points of the FFT, 256 for example. | EQ | being very irregular, if we compare it to noise,

E [| EQ _liss (k) | ] =

-E [[EQ (k) j ² ] ∑ (k) ² = E [| EQ (k) | ² ]. ∑ w (n) ² (13)

^N k = 0 n = 0

w designating the time window. So,

For a triangular window of length 11, λ = 2 On the other hand, E [γ _s years (k)] [γ _x (k)] (15)

where γ _x (k) is the power spectral density of the output of the pre-equalizer and PRE_EQ (k) the frequency response of the pre-equalizer.

So,

The gain is therefore calculated with reduced complexity, since E [γ _x (k)] and | EQ (k) | are already calculated in the adapted equalization module and the other factors are constants.

If F _c is too weak, the received signal is affected by a strong quantization noise. Indeed, as the attenuation of the reception system is all the stronger the lower the frequency and that, the equalization compensating for this attenuation on the band Fc-Fh, is placed before the reception system, this anticipated equalization induces at the output of the device, differences in level between the high and low frequency components are all the greater as F _c is low. Thus, for certain phonemes, the level of the quantization noise during the conversion into A-law is close to that of the mid and high components. After the attenuation of the LF components by the system reception noise is as energetic as the speech signal.

In a typical embodiment, we have chosen: - F _c = 250 Hz, which allows an acceptable compromise between the restoration of the LF components and the limitation of the quantization noise.

For all the speakers who have been tested, the timbre of the voice at reception is much closer to the original with the correction by the device presented than without the device.

The adjustment time of the equalizer is very fast: an improvement. of the timbre is perceptible in less than a second and a stable estimate of the equalizer is obtained in less than 4 seconds for most speakers.

Figure 8 shows for two speakers the spectral distortion between the received speech signal and the original signal after 4 seconds of speech, in the case of a link composed of an ITU-compliant IRS, of a line , long on transmission and an average line on reception.

A zero distortion would be represented by an average flat curve (+++) over the entire frequency band, at around -9dB if the reception level is not changed compared to the system without correction.

The distortion is represented in three cases: without correction (dashes), with only pre-equalizer (fine dotted lines) and with the complete device (solid line). The curve in the latter case is not completely flat on the equalized band (250-3150 Hz), but the differences are less than 2.5 dB, which is hardly noticeable. Level correction is here carried out according to the second method (conservation of the overall level).

Claims

1. Method for correcting the timbre of speech transmitted in the form of a signal by means of a transmission link of a telephone network, the correction taking place after the analog-digital conversion of the signal in the network, characterized in that it comprises at least one step of pre-equalization of the digital signal by a fixed filter having a frequency response in a frequency band Fc-Fh corresponding to the inverse of a reference spectral deformation introduced by the telephone link , with Fc <300Hz and Fh> 3150Hz.

2. Method for correcting the timbre of speech according to claim 1, characterized in that the spectral distortion taken as reference is characterized, for a connection of the 'switched telephone network (PSTN), from the cascading of an intermediate reference system (IRS) of the type defined by ITU-T recommendation P.48, and two medium analog lines (transmit and receive).

3. Method for correcting the timbre of speech according to claim 1, characterized in that the spectral deformation taken as a reference is in the case where the transmitting terminal is a mobile telephone and the receiving terminal is a fixed station, that resulting from the cascading of a filter respecting the efficiency template of the mobiles in transmission defined by the ITU recommendation P.313, of a line analog medium and an IRS receiver system as defined by Recommendation ITU-TP.48.

4. Method for correcting the timbre of speech according to claim 1. characterized .in this. that the spectral distortion taken as a reference is, in the case where the transmitting terminal is a landline and the receiving terminal is a mobile telephone, that resulting from the cascading of an SRI transmitting system as defined by the ITU recommendation -TP.48, an average analog line and a filter respecting the efficiency template of the mobiles in reception defined by ITU recommendation P.313.

5. Method for correcting the timbre of speech according to claim 1, characterized in that the spectral deformation taken as reference is, in the case of a link between mobile terminals, that resulting from the cascading of a filter respecting the transmission efficiency mask of the mobiles defined ^• by the ITU recommendation P.313 and a filter- respecting the reception mobiles efficiency mask defined by the ITU recommendation P.313.

6. Method for correcting the timbre of speech according to any one of the preceding claims, characterized in that the filter used is a filter of type IIR infinite impulse response.

7. Method for correcting the timbre of speech according to any one of the preceding claims, characterized in that the low frequency Fc of the filter filter band is of the order of 250Hz.

8. Method for correcting the timbre of speech according to any one of the preceding claims, characterized in that it further comprises an adapted equalization step in which a filter having a frequency response adapted automatically to the actual distortion is used introduced by the telephone link as a function of the ratio between a reference spectrum and a spectrum corresponding to the long-term spectrum of the signal.

9. Method for correcting the timbre of speech according to claim 8, characterized in that the adapted equalization step comprises:

- the detection of a vocal activity on the line to trigger a sequence of processing operations for the calculation of the coefficients of the digital filter as a function of the ratio between the reference spectrum and the spectrum corresponding to the long-term spectrum of the signal, the filter control with the coefficients obtained and the updating of said coefficients,

- the filtering of the signal leaving the pre-equalizer by said filter.

10. Method for correcting the timbre of speech according to claim 9, characterized in that

1 sequence of treatments includes:

- the calculation of the long-term spectrum of the signal in successive partially overlapping time windows, - the calculation of the modulus of the frequency response of the adapted equalizer on the Fc-Fh band by performing the ratio of the square root of the spectrum to long term obtained in a time window at the square root of the reference spectrum, the square root of the spectrum reference being compensated at each frequency by a predetermined factor A (f).

11. Method for correcting the timbre of speech according to claim 9 or 8, characterized in that the processing sequence comprises an extrapolation of the module of the frequency response of the adapted equalizer, for frequencies outside the band Fc -Fh, typically for a 0-4000Hz band.

12. Method for correcting the timbre of speech according to any one of claims 9 to 11, characterized in that the sequence of. processing includes calculating the impulse response of the digital filter from the module of the frequency response of the adapted equalizer extrapolated for frequencies outside the Fc-Fh band.

13. Method for correcting the timbre of speech according to any one of claims 8 to 12, characterized in that the calculation of the spectrum of the long-term signal comprises a fast Fourier transform operation.

14. Method for correcting the timbre of speech according to any one of claims 8 to 12, characterized in that the calculation of the impulse response of the filter consists in calculating the coefficients of the filter by operating an inverse Fourier transform on the response frequency of the adapted equalizer, symmetrization then an operation ensuring the smoothing of the frequency response.

15. Method for correcting the timbre of speech according to claim 14, characterized in that the smoothing operation is carried out by the application of a time window on the impulse response.

16. Method for correcting the timbre of speech according to any one of claims 8 to 15, characterized in that the calculation of the long-term spectrum of the signal in successive partially overlapping time windows comprises: sampling the signal in a time window, the fast Fourier transform operation

(FFT) of the sampled signal, - the calculation of the power spectral density, the calculation of the average of the power spectral density, over a predetermined duration.

17. Method for correcting the timbre of speech according to claim 16, characterized in that the calculation of the average of the power spectral density consists of:

- for the following time windows, to adjust the average of the power spectral density calculated in the time window previous by a first order recursive smoothing taking into account the power spectral density of the current time window.

18. Method of. correction of the stamp of the. speech according to any one of claims 8 to 17, characterized in that it comprises an automatic gain control step.

19. Method for correcting the timbre of speech according to claim 18, characterized in that the automatic control of. gain. is performed during the sequence of treatments of the equalization step, adapted by choosing a reference spectral density γ _ref corresponding to the desired level in reception.

20. Method for correcting the timbre of speech according to claim 18, characterized in that the automatic gain control is carried out by amplification of the signal - obtained after suitable equalization with a gain α as a function of the ratio between the power spectral density of the signal reception when a pre-equalization and a suitable equalization of the signal have been carried out and, the spectral power density of the signal in the absence of pre-equalization and equalization of the signal.

21. Fixed digital filter intended for the correction of the timbre of speech in a telephone transmission network, characterized in that it has a frequency response in a frequency band Fc-Fh corresponding to the inverse of a spectral distortion of .. reference introduced by the telephone link, with Fc <300Hz and Fh> 3150Hz.

22. Adapted digital filter intended for the correction of the timbre of speech in a network, of telephone transmission, characterized in that it comprises means for processing the speech signal having a frequency response automatically adapted to the actual distortion introduced by the telephone link 0 as a function of the ratio between a reference spectrum and a spectrum corresponding to the long-term spectrum of the signal.

23. Device for correcting the timbre of speech 5 in a telephone transmission network, characterized in that it comprises a fixed filter according to claim 20 followed by a suitable filter according to claim 21 and means for automatic gain control . 0

24. Device for correcting the timbre of speech in a transmission network according to claim 23, characterized in that the means for automatic gain control are produced by the suitable filter. 5

25. device for correcting the timbre of speech in a transmission network according to claim 23, characterized in that the automatic gain control means are made an amplifier of the gain signal α as a function of the ratio between the power spectral density of the reception signal when a pre-equalization and a suitable equalization of the signal have been carried out and, the power spectral density signal in the absence of pre-equalization and suitable signal equalization.