WO2005027541A1 - Signal-processing methods and apparatus - Google Patents

Signal-processing methods and apparatus Download PDF

Info

Publication number
WO2005027541A1
WO2005027541A1 PCT/GB2004/003929 GB2004003929W WO2005027541A1 WO 2005027541 A1 WO2005027541 A1 WO 2005027541A1 GB 2004003929 W GB2004003929 W GB 2004003929W WO 2005027541 A1 WO2005027541 A1 WO 2005027541A1
Authority
WO
WIPO (PCT)
Prior art keywords
input signal
signal
energy
content
dtmf
Prior art date
Application number
PCT/GB2004/003929
Other languages
French (fr)
Inventor
Fredrik Gustaf Rodin
Original Assignee
Fulcrum Systems Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fulcrum Systems Limited filed Critical Fulcrum Systems Limited
Publication of WO2005027541A1 publication Critical patent/WO2005027541A1/en

Links

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03HIMPEDANCE NETWORKS, e.g. RESONANT CIRCUITS; RESONATORS
    • H03H21/00Adaptive networks
    • H03H21/0012Digital adaptive filters
    • H03H21/0025Particular filtering methods
    • H03H21/0027Particular filtering methods filtering in the frequency domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02085Periodic noise

Definitions

  • This invention relates to signal-processing methods and apparatus .
  • the invention is concerned especially, though not exclusively, with signal-processing methods and apparatus for use in the context of telephony.
  • amplification may have to be used in order to enable the operator to hear the caller, so that bursts of interference on the telephone line or wireless lin may result in pain or even injury to the operator.
  • the caller may start to enter that number into their telephone with the result that the operator is subjected to the stress of hearing DTMF dial tones.
  • a method of processing an input signal having a multiplicity of different frequency components within a range of frequencies wherein synthesis of the signal in a plurality of its components within the range is carried out to derive representations of the energy-content of the signal at the frequencies of those respective components, and wherein the energy-content representations are utilised as discriminants in the processing of the signal.
  • apparatus for processing an input signal having a multiplicity of different frequency components within a range of frequencies comprises means for synthesis of the signal in a plurality of its components within the range, means for deriving representations of the energy-content of the signal at the frequencies of those respective components, and means for processing the input signal in dependence upon the derived energy-content representations.
  • the input signal of both aspects of the invention may be sampled repetitively and the synthesis of the input signal may be carried out in respect of each sample in turn to derive fresh energy-content representations in relation to each sample.
  • Frequency analysis of digital- signal samples is currently performed within the framework of the digital Fourier transform (DFT) ; the fast Fourier transform (FFT) is employed when computational efficiency is crucial, which is often the case in real-time applications.
  • DFT digital Fourier transform
  • FFT fast Fourier transform
  • the DFT operates on blocks of input-signal samples, each referred to as a frame, and cannot operate sample by sample as in the case of the method and apparatus of the present invention.
  • the synthesis of the input signal in the method and apparatus of the present invention may involve RLS (recursive least squares) filtering of the input signal to derive coefficients related to the individual components.
  • the filtering which may be LMS (least mean squares) filtering, may involve quadrature-frequency signal pairs weighted individually in accordance with coefficients derived iteratively in the filtering, and the energy-content representations may each be derived as the square root of the sum of the squares of the two coefficients of each weighted signal pair.
  • the processing carried out by the method and apparatus of the invention may be, for example, for limiting the sound pressure level produced by an earphone or other electro- acoustic transducer that is driven by the input signal.
  • the method and apparatus of the invention may be applied to the detection and suppression of DTMF tones in a telephone system.
  • FIG. 1 is a schematic representation of the signal- processing apparatus according to the invention.
  • Figure 2 is a schematic representation of a signal- limiter that forms part of the apparatus of Figure 1;
  • Figure 3 is illustrative of an adaptive filter that forms part of the signal-limiter of Figure 2 ;
  • Figure 4 is a schematic representation of a DTMF-tone detector that forms part of the apparatus of Figure 1;
  • Figures 5 and 6 are illustrative of part of the operation of the DTMF-tone detector of Figure 4.
  • Figure 1 is carried out by electronic processing circuits in accordance with software, but for ease of understanding its operation will be described in terms of separate stages with dedicated function. It is to be understood in this connection that it is not of the essence of the invention whether dedicated hardware or software-controlled apparatus is utilised.
  • the signal-processing apparatus 1 is connected as part of a telephone system between the received-signal output 2 of a land-line or wireless terminal and an electro-acoustic transducer 3 (for example an earphone of a telephone headset) .
  • the signal from the output 2 is supplied to an analogue-digital converter 4 so as to produce a digital signal y (n) with a sampling frequency f s , that is supplied to a notch filter 5 used in the suppression of DTMF tones detected by a
  • the output from the notch filter 5 is supplied via a limiter 7 to a digital-analogue converter 8 and thence to the transducer 3.
  • the limiter 7 is responsive to the frequency spectrum of the signal- samples y (n) it receives from the filter 5 to limit the sound energy emitted by the transducer 3.
  • the limiter 7 has an effective structure as illustrated in Figure 2, and will now be described.
  • the limiter 7 includes a first energy-tracking section 9 that involves an LMS (least- mean-squares) filter stage 10.
  • the filter stage 10 is used to synthesize the input sample y (n) in every time instant n as a sum of N equally-spaced narrow-band frequency components; the sum is identified as synthesized signal y (n) .
  • Synthesis is achieved by choosing a mix of narrow-band components that minimizes the error (y(n) -y (n) ) , so that the resulting synthesized signal y (n) has a discrete power spectrum that is comparable with the continuous spectrum of the input signal y (n) .
  • An error e y (n) -y (n) is introduced since spectral information is lost in y (n) , however, the larger the value of N the smaller the error e.
  • the LMS filter stage 10 has the effective structure illustrated in Figure 3 where N (for example, thirteen) pairs of quadrature-frequency signals c n sin (2 ⁇ f ln n) and c 12 cos (2 ⁇ rf ln n) to c N1 sin (2 ⁇ f m n) and c m cos (2 ⁇ f N2 n) are generated using respective pairs of weighting stages 11.
  • N pairs of quadrature-frequency signals generated are summed in a stage 12 and the result subtracted from the input sample signal y (n) in a comparator stage 13 to derive an instantaneous-error signal.
  • This error signal is fed back to the stages 11 to adjust iteratively the weighting coefficients c n and c 12 to c m and c m applied to the pairs of quadrature-frequency signals respectively.
  • the iterative adjustment is towards reducing the error signal to zero so that the signal derived by the stage 12 is a synthesized signal y (n) approximating to the input signal y (n) .
  • signals in accordance with the adjusted pairs of weighting coefficients c u and c n to c m and c N2 in stages 11, are applied within the energy-tracking section 9 of unit 7 to respective computation stages 13.
  • the stages 13 derive N signals -?y 7 to Sy N representative of the energy content of the N frequency-component signals respectively, of the synthesized signal y (n) .
  • each energy- content signal Sy 2 to Sy N is computed as the square root of the sum of the squares of the respective pair of weighting coefficients c n and c J2 to c N1 and c N2 .
  • the phase information involved in each pair of frequency-component signals is not significant to the approximation provided in the energy-signals Sy t to Sy N .
  • the signals Sy 1 to Sy N are supplied from the energy- tracking section 9 of the limiter 7 to a second, frequency-weighting section 14 which comprises N weighting stages 15.
  • the weighting stages 15 impose weighting coefficients w 2 to w N on the signals _3y 2 to Sy N respectively, and pass the signals as so weighted, on to a third, attenuation-computing section 16 of the limiter 7.
  • a computing stage 17 derives the square root of the sum of the squares of the weighted signals Sy 1 .w 1 to Sy N .w N , as an estimate E ⁇ of the total energy content of the signal y (n) .
  • This estimate is made up of the sum of approximations to the energy content of each discrete frequency component of the signal y (n) as adjusted by its respective weighting coefficient w 2 to w N .
  • a signal representative of E ⁇ is supplied from stage 17 as an input to a non-linear stage 18 within the section 16.
  • the stage 16 operates according to a non-linear function to produce an output signal representative of E ou ⁇ .
  • the non-linear function imposes a maximum magnitude of (a + b) on the signal-representation of E o ⁇ , with E ou ⁇ related proportionately to E ⁇ for energy-magnitudes of E ⁇ less than or equal to magnitude b , but of reducing proportionality to E ⁇ as the magnitude of E ⁇ increases above magnitude b .
  • the transfer function of stage 18 is as represented at (1) in the annexed sheet of formulae.
  • the signal-representation of E ou ⁇ is compared with the signal-representation of E ⁇ in a comparator stage 19 to derive an attenuation factor.
  • This factor is applied via a fourth section 20 of the limiter 7, to the input signal y (n) before this latter signal is supplied to the converter 8 for application to the transducer 3.
  • the essence of the frequency-dependent limiting carried out by the limiter 7 is the sample-by-sample spectral analysis provided by the power spectrum S yl to S yN .
  • the LMS adaptive filter stage 10 minimizes the error e between signals y (n) and y (n) for every time-instant n according to the formula (2) with respect to the matrix (3) , of the annexed sheet.
  • the desired accuracy (that is to say, the maximal tolerated mean error) is dependent on the number of iterations of the LMS filter stage 10 that take place, and for which there is repeated updating of the matrix (3) , for every time instant n .
  • the weighting coefficients w t to w N and limit-maximum (a + b) are set by sweeping a narrow-band tone through the frequencies covered in the bandwidth of the input signal
  • the output of the transducer 3 is set to a limiting level for every discrete frequency by first adjusting the overall maximum (a + b) and then each weighting coefficient w ; to w N individually.
  • the transmission of DTMF tones to an operator's headset is a principal cause of nuisance and even injury, and although the limiter 7 is capable of suppressing such tones to some extent, the fast and efficient suppression of them is achieved in the apparatus of Figure 1 via the notch filter 5.
  • the notch filter 5 acts to block DTMF tones that are detected by the DTMF-tone detector 6. The detector 6 and its mode of operation will now be described with reference to Figure 4.
  • the DTMF-tone detector 6 receives signal y (n) containing speech and/or DTMF signal components. For every time instant n , the detector 6 classifies the content of the current sample y (n) as DTMF or DTMF, namely, including or not-including, respectively, DTMF tones. The classification is made according to three functional blocks or stages 22, 23 and 24.
  • Stages 22 and 23 act together as a feature extraction engine, providing the classifier stage 24 with features significant for DTMF tones and for speech respectively.
  • Block 24 makes a decision on the extracted features of y (n) and supplies a DTMF-tone index to the notch filter 5 if a DTMF tone is detected.
  • the method of detection is based on a-priori knowledge of the characteristic spectral shapes of speech and of pairs of narrow-band tones.
  • the major energy-content of speech is concentrated in the lower end of the frequency domain of 200-2000 Hz.
  • a signal containing a DTMF tone alone has an energy spectrum dominated by two narrow energy components, and a basic assumption in the present context is that the energy of all DTMF tones is larger than that of any harmonic-frequency component of speech.
  • stage 22 includes an LMS filter stage 25 that has the same effective structure as that illustrated in Figure 3 for generating pairs of quadrature-frequency signals.
  • stage 22 includes an LMS filter stage 25 that has the same effective structure as that illustrated in Figure 3 for generating pairs of quadrature-frequency signals.
  • just eight pairs of quadrature-frequency signals c n sin (2 ⁇ f ln n) and c 12 cos (2 ⁇ f ln ⁇ ) to c SI sin (2 ⁇ f 81 n) and c S2 cos (2 ⁇ f 82 n) are generated using the respective pairs of weighting stages 11.
  • the eight pairs of quadrature- frequency signals generated in this case have the same frequencies as the eight standard DTMF-tone frequencies respectively, and are summed in the stage 12.
  • the resultant sum is subtracted from the input sample signal y(n) in the comparator stage 13 to derive an instantaneous-error signal.
  • This error signal is fed back to the stages 11 to adjust iteratively the weighting coefficients c n and c 12 to c 81 and c S2 applied to the pairs of quadrature-frequency signals respectively.
  • the iterative adjustment is towards reducing the error signal to zero so that the signal derived by the stage 12 is a synthesized signal y (n) approximating to the DTMF-tone frequency components in input signal y (n) .
  • signals in accordance with the adjusted pairs of weighting coefficients c u and c 12 to c 81 and c 82 are applied within section 9 of the detector apparatus 6 to respective computation stages 26.
  • the stages 26 derive eight signals Sy 2 to Sy 8 representative respectively of the energy content of the eight DTMF-tone frequency components of the synthesized signal y (n) .
  • each energy-content signal Sy ; to Sy 8 is computed as the square root of the sum of the squares of the respective pair of weighting coefficients c n and c 12 to c 81 and c 82 .
  • the phase information involved in each pair of frequency-component signals is not significant to the approximation provided in the energy-signals S ; to Sy 8 .
  • the signals Sy t to Sy 8 are supplied from the first section 22 to a sorter 27 of the second section 23.
  • the sorter 27 acts to sort the signals Sy 2 to Sy 8 in ascending order of magnitude, so as to produce representative signals c sl to c s8 in that order.
  • the three ordered signals c s6 to c s8 correspond to whichever three of the energy-signals Sy 2 to Sy 8 are of the highest magnitude, and are supplied for classification to the classifying stage 24.
  • the classification carried out in stage 24 is based upon three criteria (represented by blocks 28 to 30) of which two (blocks 28 and 29) are derived from the spectral properties of speech and DTMF signals respectively.
  • two (blocks 28 and 29) are derived from the spectral properties of speech and DTMF signals respectively.
  • Figure 5 illustrates the energy distribution for the component frequencies where y (n) contains speech alone, showing that the highest energy is to be found in only one of the DTMF-tone frequencies.
  • Figure 6 illustrates the corresponding energy distribution where y(n) contains DTMF tones; the presence of a pair of DTMF-tone frequencies is indicated by the high energy concentrated in two of the component frequencies.
  • the first criterion used for classification is applied in block 28 to determine whether:
  • A has a value close to 1. If that criterion is satisfied, the signal y (n) is likely to contain a DTMF tone. In this latter case the block 28 signals a "1", whereas otherwise it signals a "0". The second criterion is applied in block 28 to determine whether :
  • B has a value less than 1, and preferably 0.5. This criterion is based on the fact that where the signal y (n) contains speech the difference (c s8 - c s7 ) is likely to increase as some coefficients become excited by a harmonic speech-component (see Figure 5) , and the sum (c s8 + c s7 ) will stay relatively small. On the other hand, if a DTMF-tone pair is received the value of the fraction (c s8 - c s7 ) I (c s8 + c s7 ) will be forced down. If the value of the fraction is less than B , the block 29 signals "l" if it is not, it signals a "0".
  • the third criterion, applied in the block 30, is based on the consideration that DTMF tones are each generated as a sum of two narrow-band sinusoids, with frequencies determined by the DTMF matrix.
  • This criterion can be summarised as:
  • the block 30 signals "l 11 if it is not, it signals a "0".
  • the signals from the blocks 28 to 30 are supplied to an AND gate 31. It is only in the event that each block 28 to 30 signals a "1", that the detector 6 signals to the notch filter 5 that the signal y (n) is classified DTMF, otherwise it signals that it is classified DTMF.
  • the notch filter 5 responds to the condition in which the signal y (n) is classified DTMF, to suppress from that signal as applied to the limiter 7, frequencies corresponding to those represented by the coefficients c s7 and c s8 . Identification of those two frequencies are provided by signals supplied to the notch filter 5 from the sorter 27.
  • LMS filters for stages 10 and 25
  • any form of filter from the family of RLS (recursive least squares) filter-family may be used instead.
  • the LMS filter has been found to provide high computational efficiency in the context of the described telephony system.

Abstract

Processing of a received telephone-signal to limit soundpressure from an earpiece(3) involves frequency-dependent limitation (7) after DTMF-tone suppression (5, 6). The energies (SyN) of synthesised frequency-components (1-N) of the sampled signal are computed within the limiter (7) from weighting coefficients (CNI,CN2) derived iteratively by an adaptive LMS filter(10) for the sine and cosine pairs of the synthesised components. The computed energies (SyN) after weighting (WN) according to characteristics of the earpiece (3) are combined (17) to give an estimate (ET) of the energy-content of the signal-sample. Comparison (19) of the estimate (ET) with itself before and after application of a non-linear function (18) produces an attenuation factor that is applied (20) to limit the signal supplied to the earpiece (3). DTMF-tone detection (6) uses LMS-filter synthesis (25) of the sample-­signal at DTMF-tone frequencies, and sorting (27) of their computed energies (SY1-8). The presence of DTMF tones in the sample-signal is detected if signals (CS6-CS8) representing the three highest energies are found (31) to satisfy three criteria (28-30) attributable to the presence of DTMF-tone pairs.

Description

Signal-Processing Methods and Apparatus
This invention relates to signal-processing methods and apparatus .
The invention is concerned especially, though not exclusively, with signal-processing methods and apparatus for use in the context of telephony.
Extensive use is made of the telephone for calls to emergency and enquiry services, and the operators receiving such calls are very often subjected to high sound-pressure levels that can be injurious, or sounds that are otherwise of a nature to create undesirable stress. For example, amplification may have to be used in order to enable the operator to hear the caller, so that bursts of interference on the telephone line or wireless lin may result in pain or even injury to the operator. Furthermore, for example, where the operator is communicating a telephone number to a caller, the caller may start to enter that number into their telephone with the result that the operator is subjected to the stress of hearing DTMF dial tones.
It is one of the objects of the present invention to provide signal-processing methods and apparatus that may be used to reduce significantly the likelihood of injury and stress in the above context.
According to one aspect of the present invention there is provided a method of processing an input signal having a multiplicity of different frequency components within a range of frequencies, wherein synthesis of the signal in a plurality of its components within the range is carried out to derive representations of the energy-content of the signal at the frequencies of those respective components, and wherein the energy-content representations are utilised as discriminants in the processing of the signal.
According to another aspect of the invention apparatus for processing an input signal having a multiplicity of different frequency components within a range of frequencies, comprises means for synthesis of the signal in a plurality of its components within the range, means for deriving representations of the energy-content of the signal at the frequencies of those respective components, and means for processing the input signal in dependence upon the derived energy-content representations.
The input signal of both aspects of the invention may be sampled repetitively and the synthesis of the input signal may be carried out in respect of each sample in turn to derive fresh energy-content representations in relation to each sample. Frequency analysis of digital- signal samples is currently performed within the framework of the digital Fourier transform (DFT) ; the fast Fourier transform (FFT) is employed when computational efficiency is crucial, which is often the case in real-time applications. The DFT operates on blocks of input-signal samples, each referred to as a frame, and cannot operate sample by sample as in the case of the method and apparatus of the present invention.
The synthesis of the input signal in the method and apparatus of the present invention may involve RLS (recursive least squares) filtering of the input signal to derive coefficients related to the individual components. The filtering, which may be LMS (least mean squares) filtering, may involve quadrature-frequency signal pairs weighted individually in accordance with coefficients derived iteratively in the filtering, and the energy-content representations may each be derived as the square root of the sum of the squares of the two coefficients of each weighted signal pair.
The processing carried out by the method and apparatus of the invention may be, for example, for limiting the sound pressure level produced by an earphone or other electro- acoustic transducer that is driven by the input signal. As an alternative, or in addition, the method and apparatus of the invention may be applied to the detection and suppression of DTMF tones in a telephone system.
Signal-processing methods and apparatus according to the present invention will now be described, by way of example, with reference to the accompanying drawings, in which:
Figure 1 is a schematic representation of the signal- processing apparatus according to the invention;
Figure 2 is a schematic representation of a signal- limiter that forms part of the apparatus of Figure 1;
Figure 3 is illustrative of an adaptive filter that forms part of the signal-limiter of Figure 2 ;
Figure 4 is a schematic representation of a DTMF-tone detector that forms part of the apparatus of Figure 1; and
Figures 5 and 6 are illustrative of part of the operation of the DTMF-tone detector of Figure 4.
The operation of the signal-processing apparatus of
Figure 1 is carried out by electronic processing circuits in accordance with software, but for ease of understanding its operation will be described in terms of separate stages with dedicated function. It is to be understood in this connection that it is not of the essence of the invention whether dedicated hardware or software-controlled apparatus is utilised.
Referring to Figure 1, the signal-processing apparatus 1 is connected as part of a telephone system between the received-signal output 2 of a land-line or wireless terminal and an electro-acoustic transducer 3 (for example an earphone of a telephone headset) . The signal from the output 2 is supplied to an analogue-digital converter 4 so as to produce a digital signal y (n) with a sampling frequency fs, that is supplied to a notch filter 5 used in the suppression of DTMF tones detected by a
DTMF-tone detector 6. The output from the notch filter 5 is supplied via a limiter 7 to a digital-analogue converter 8 and thence to the transducer 3. The limiter 7 is responsive to the frequency spectrum of the signal- samples y (n) it receives from the filter 5 to limit the sound energy emitted by the transducer 3.
The limiter 7 has an effective structure as illustrated in Figure 2, and will now be described.
Referring to Figure 2, the limiter 7 includes a first energy-tracking section 9 that involves an LMS (least- mean-squares) filter stage 10. The filter stage 10 is used to synthesize the input sample y (n) in every time instant n as a sum of N equally-spaced narrow-band frequency components; the sum is identified as synthesized signal y (n) . Synthesis is achieved by choosing a mix of narrow-band components that minimizes the error (y(n) -y (n) ) , so that the resulting synthesized signal y (n) has a discrete power spectrum that is comparable with the continuous spectrum of the input signal y (n) . An error e = y (n) -y (n) is introduced since spectral information is lost in y (n) , however, the larger the value of N the smaller the error e.
The LMS filter stage 10 has the effective structure illustrated in Figure 3 where N (for example, thirteen) pairs of quadrature-frequency signals cnsin (2Ηflnn) and c12cos (2ιrflnn) to cN1sin (2πfmn) and cmcos (2πfN2n) are generated using respective pairs of weighting stages 11. The N pairs of quadrature-frequency signals generated are summed in a stage 12 and the result subtracted from the input sample signal y (n) in a comparator stage 13 to derive an instantaneous-error signal. This error signal is fed back to the stages 11 to adjust iteratively the weighting coefficients cn and c12 to cm and cm applied to the pairs of quadrature-frequency signals respectively. The iterative adjustment is towards reducing the error signal to zero so that the signal derived by the stage 12 is a synthesized signal y (n) approximating to the input signal y (n) .
Returning to consideration of Figure 2, signals in accordance with the adjusted pairs of weighting coefficients cu and cn to cm and cN2 in stages 11, are applied within the energy-tracking section 9 of unit 7 to respective computation stages 13. The stages 13 derive N signals -?y7 to SyN representative of the energy content of the N frequency-component signals respectively, of the synthesized signal y (n) . In this regard each energy- content signal Sy2 to SyN is computed as the square root of the sum of the squares of the respective pair of weighting coefficients cn and cJ2 to cN1 and cN2. The phase information involved in each pair of frequency-component signals is not significant to the approximation provided in the energy-signals Syt to SyN.
The signals Sy1 to SyN are supplied from the energy- tracking section 9 of the limiter 7 to a second, frequency-weighting section 14 which comprises N weighting stages 15. The weighting stages 15 impose weighting coefficients w2 to wN on the signals _3y2 to SyN respectively, and pass the signals as so weighted, on to a third, attenuation-computing section 16 of the limiter 7.
In the third section 16, a computing stage 17 derives the square root of the sum of the squares of the weighted signals Sy1.w1 to SyN.wN, as an estimate Eτ of the total energy content of the signal y (n) . This estimate is made up of the sum of approximations to the energy content of each discrete frequency component of the signal y (n) as adjusted by its respective weighting coefficient w2 to wN. By setting these coefficients w; to wN in accordance with the frequency-dependent transfer function of the transducer 3 , a correlation can be achieved between the energy estimate Eτ and the frequency-dependent sound pressure that would be produced by the transducer 3 if fed with the synthesized signal y (n) .
A signal representative of Eτ is supplied from stage 17 as an input to a non-linear stage 18 within the section 16. The stage 16 operates according to a non-linear function to produce an output signal representative of Eouτ. The non-linear function imposes a maximum magnitude of (a + b) on the signal-representation of Eoυτ, with Eouτ related proportionately to Eτ for energy-magnitudes of Eτ less than or equal to magnitude b , but of reducing proportionality to Eτ as the magnitude of Eτ increases above magnitude b . The transfer function of stage 18 is as represented at (1) in the annexed sheet of formulae.
The signal-representation of Eouτ is compared with the signal-representation of Eτ in a comparator stage 19 to derive an attenuation factor. This factor is applied via a fourth section 20 of the limiter 7, to the input signal y (n) before this latter signal is supplied to the converter 8 for application to the transducer 3.
The essence of the frequency-dependent limiting carried out by the limiter 7 is the sample-by-sample spectral analysis provided by the power spectrum Syl to SyN. The LMS adaptive filter stage 10 minimizes the error e between signals y (n) and y (n) for every time-instant n according to the formula (2) with respect to the matrix (3) , of the annexed sheet. The desired accuracy (that is to say, the maximal tolerated mean error) is dependent on the number of iterations of the LMS filter stage 10 that take place, and for which there is repeated updating of the matrix (3) , for every time instant n .
Tuning and calibration of the limiter 7 with respect to the specific transducer 3 is necessary before use. The weighting coefficients wt to wN and limit-maximum (a + b) are set by sweeping a narrow-band tone through the frequencies covered in the bandwidth of the input signal
(for example, 200 to 4000 Hz). The resulting sound pressure at the transducer 3 is logged by a sound analyzer, measuring the maximum sound-pressure level in 1/3 octave bands. Through superposition, and assuming that the input signal is a stationary sinusoid with variance greater than (a + b) , the output of the transducer 3 is set to a limiting level for every discrete frequency by first adjusting the overall maximum (a + b) and then each weighting coefficient w; to wN individually.
The transmission of DTMF tones to an operator's headset is a principal cause of nuisance and even injury, and although the limiter 7 is capable of suppressing such tones to some extent, the fast and efficient suppression of them is achieved in the apparatus of Figure 1 via the notch filter 5. The notch filter 5 acts to block DTMF tones that are detected by the DTMF-tone detector 6. The detector 6 and its mode of operation will now be described with reference to Figure 4.
Referring to Figure 4 , the DTMF-tone detector 6 receives signal y (n) containing speech and/or DTMF signal components. For every time instant n , the detector 6 classifies the content of the current sample y (n) as DTMF or DTMF, namely, including or not-including, respectively, DTMF tones. The classification is made according to three functional blocks or stages 22, 23 and 24.
Stages 22 and 23 act together as a feature extraction engine, providing the classifier stage 24 with features significant for DTMF tones and for speech respectively. Block 24 makes a decision on the extracted features of y (n) and supplies a DTMF-tone index to the notch filter 5 if a DTMF tone is detected.
The method of detection is based on a-priori knowledge of the characteristic spectral shapes of speech and of pairs of narrow-band tones. The major energy-content of speech is concentrated in the lower end of the frequency domain of 200-2000 Hz. Also, speech is generally composed of a fundamental frequency fi and a set of narrow-band energy components located at multiples (i.e. harmonics) of r* x. Since these additional energy components are equally spaced within the frequency domain, the probability that more than one of them coincides with the eight standardised DTMF-tone frequencies Fυ i={l,8}, is small. Moreover, a signal containing a DTMF tone alone, has an energy spectrum dominated by two narrow energy components, and a basic assumption in the present context is that the energy of all DTMF tones is larger than that of any harmonic-frequency component of speech. The operation of stages 22, 23 and 24 will now be described in more detail with reference to Figure 4.
Referring to Figure 4, stage 22 includes an LMS filter stage 25 that has the same effective structure as that illustrated in Figure 3 for generating pairs of quadrature-frequency signals. In this case, however, just eight pairs of quadrature-frequency signals cnsin (2Ηflnn) and c12cos (2πflnιι) to cSIsin (2πf81n) and cS2cos (2πf82n) are generated using the respective pairs of weighting stages 11. The eight pairs of quadrature- frequency signals generated in this case have the same frequencies as the eight standard DTMF-tone frequencies respectively, and are summed in the stage 12. The resultant sum is subtracted from the input sample signal y(n) in the comparator stage 13 to derive an instantaneous-error signal. This error signal is fed back to the stages 11 to adjust iteratively the weighting coefficients cn and c12 to c81 and cS2 applied to the pairs of quadrature-frequency signals respectively. The iterative adjustment is towards reducing the error signal to zero so that the signal derived by the stage 12 is a synthesized signal y (n) approximating to the DTMF-tone frequency components in input signal y (n) .
Returning to consideration of Figure 4, signals in accordance with the adjusted pairs of weighting coefficients cu and c12 to c81 and c82 are applied within section 9 of the detector apparatus 6 to respective computation stages 26. The stages 26 derive eight signals Sy2 to Sy8 representative respectively of the energy content of the eight DTMF-tone frequency components of the synthesized signal y (n) . In this regard each energy-content signal Sy; to Sy8 is computed as the square root of the sum of the squares of the respective pair of weighting coefficients cn and c12 to c81 and c82. The phase information involved in each pair of frequency-component signals is not significant to the approximation provided in the energy-signals S ; to Sy8.
The signals Syt to Sy8 are supplied from the first section 22 to a sorter 27 of the second section 23. The sorter 27 acts to sort the signals Sy2 to Sy8 in ascending order of magnitude, so as to produce representative signals csl to cs8 in that order. The three ordered signals cs6 to cs8 correspond to whichever three of the energy-signals Sy2 to Sy8 are of the highest magnitude, and are supplied for classification to the classifying stage 24.
The classification carried out in stage 24 is based upon three criteria (represented by blocks 28 to 30) of which two (blocks 28 and 29) are derived from the spectral properties of speech and DTMF signals respectively. To visualise the concept of classification, the distribution of energy content within two typical sets of the signals csl to cs8 are illustrated in Figures 5 and 6. Figure 5 illustrates the energy distribution for the component frequencies where y (n) contains speech alone, showing that the highest energy is to be found in only one of the DTMF-tone frequencies. Figure 6 illustrates the corresponding energy distribution where y(n) contains DTMF tones; the presence of a pair of DTMF-tone frequencies is indicated by the high energy concentrated in two of the component frequencies.
The first criterion used for classification is applied in block 28 to determine whether:
(cs7 - cs6) I (cs7 + cs6) > A
where A has a value close to 1. If that criterion is satisfied, the signal y (n) is likely to contain a DTMF tone. In this latter case the block 28 signals a "1", whereas otherwise it signals a "0". The second criterion is applied in block 28 to determine whether :
(Css " cs?) I (css + crs7) < B
where B has a value less than 1, and preferably 0.5. This criterion is based on the fact that where the signal y (n) contains speech the difference (cs8 - cs7) is likely to increase as some coefficients become excited by a harmonic speech-component (see Figure 5) , and the sum (cs8 + cs7) will stay relatively small. On the other hand, if a DTMF-tone pair is received the value of the fraction (cs8 - cs7) I (cs8 + cs7) will be forced down. If the value of the fraction is less than B , the block 29 signals "l" if it is not, it signals a "0".
The third criterion, applied in the block 30, is based on the consideration that DTMF tones are each generated as a sum of two narrow-band sinusoids, with frequencies determined by the DTMF matrix. This matrix has four rows and four columns, each associated with a particular frequency, with each matrix element FiJ f i={ l , 4} , ={5 , 8} corresponding to a DTMF tone.
The third criterion exploits the property that if a DTMF tone is indicated by block 28 as likely to be present, one of the two signals cs7 and cS8 must correspond to Fi, i={ l , } and the other to Fj, j = { 5 , 8 } . This criterion can be summarised as:
°S7 ~ Ci 'S8 = C, CS7 ~ Cj → CS8 ~ Ci
and if satisfied indicates that the signal y (n) contains a DTMF tone. If the criterion is satisfied the block 30 signals "l11 if it is not, it signals a "0". The signals from the blocks 28 to 30 are supplied to an AND gate 31. It is only in the event that each block 28 to 30 signals a "1", that the detector 6 signals to the notch filter 5 that the signal y (n) is classified DTMF, otherwise it signals that it is classified DTMF.
The notch filter 5 responds to the condition in which the signal y (n) is classified DTMF, to suppress from that signal as applied to the limiter 7, frequencies corresponding to those represented by the coefficients cs7 and cs8. Identification of those two frequencies are provided by signals supplied to the notch filter 5 from the sorter 27.
Although the system described above utilises LMS filters for stages 10 and 25, any form of filter from the family of RLS (recursive least squares) filter-family may be used instead. However, the LMS filter has been found to provide high computational efficiency in the context of the described telephony system.
Formulae
Figure imgf000015_0001
Figure imgf000015_0002
Figure imgf000015_0003

Claims

Claims :
1. A method of processing an input signal having a multiplicity of different frequency components within a range of frequencies, wherein synthesis of the signal in a plurality of its components within the range is carried out to derive representations of the energy-content of the signal at the frequencies of those respective components, and wherein the energy-content representations are utilised as discriminants in the processing of the signal.
2. A method according to Claim 1 wherein the input signal is sampled repetitively and the synthesis of the input signal is carried out in respect of each sample in turn to derive fresh energy-content representations in relation to each sample.
3. A method according to Claim 1 or Claim 2 wherein synthesis of the input signal in said plurality of components involves recursive least squares filtering of the signal to derive coefficients related to those individual components .
4. A method according to Claim 3 wherein the filtering involves quadrature-frequency signal pairs weighted individually in accordance with coefficients derived iteratively in the filtering, and the energy-content representations are each derived as the square root of the sum of the squares of the two coefficients of each weighted signal pair.
5. A method according to Claim 3 or Claim 4 wherein the filtering is LMS (least mean squares) filtering.
6. A method according to any one of Claims 1 to 5 wherein comparison is made between the square root of the sum of the squares of the energy-content representations, and the result of submitting that sum to a non-linear attenuation function, and wherein an attenuation factor for application to the input signal is derived from the result of the comparison.
7. A method according to Claim 6 for use in a telephony system, wherein the input signal contains speech components and is supplied to an electro-acoustic transducer via a limiter, and wherein the limiter is responsive to the attenuation factor to attenuate the input signal accordingly and thereby limit the sound pressure produced by the transducer.
8. A method according to Claim 6 or Claim 7 wherein the energy-content representations include weightings in accordance with the frequency-dependent transfer function of the transducer.
9. A method according to any one of Claims 1 to 5 wherein the energy-content representation representing the highest energy-content is selected, the selected representation is utilised in one or more comparisons between the energy-content representations, and processing of the input signal is carried out in dependence upon the results of the comparisons.
10. A method according to Claim 9 wherein the synthesis of the input signal is in more than three of its components, and wherein three energy-content representations representing the highest energy-content are selected, the selected representations are utilised in one or more comparisons between them, and processing of the input signal is carried out in dependence upon the results of the comparisons.
11. A method according to Claim 10 for use in a telephony system for suppressing DTMF tones in the input signal, the synthesis of the input signal is in the eight DTMF-tone component frequencies, and the processing of the input signal involves suppression of the DTMF-tone frequencies in the input signal in dependence upon the results of the comparisons.
12. Apparatus for processing an input signal having a multiplicity of different frequency components within a range of frequencies, comprising means for synthesis of the signal in a plurality of its components within the range, means for deriving representations of the energy- content of the signal at the frequencies of those respective components, and means for processing the input signal in dependence upon the derived energy-content representations .
13. Apparatus according to Claim 12 wherein the input signal is sampled repetitively and the synthesis of the input signal is carried out in respect of each sample in turn to derive fresh energy-content representations in relation to each sample.
14. Apparatus according to Claim 12 or Claim 13 wherein synthesis of the input signal in said plurality of components involves recursive least squares filtering of the signal to derive coefficients related to those individual components.
15. Apparatus according to Claim 14 wherein the filtering involves quadrature-frequency signal pairs weighted individually in accordance with coefficients derived iteratively in the filtering, and the energy- content representations are each derived as the square root of the sum of the squares of the two coefficients of each weighted signal pair.
16. Apparatus according to Claim 14 or Claim 15 wherein the filtering is LMS (least mean squares) filtering.
17. Apparatus according to any one of Claims 12 to 16 wherein comparison is made between the square root of the sum of the squares of the energy-content representations, and the result of submitting that sum to a non-linear attenuation function, and wherein an attenuation factor for application to the input signal is derived from the result of the comparison.
18. Apparatus according to Claim 17 for use in a telephony system, wherein the input signal contains speech components and is supplied to an electro-acoustic transducer via a limiter, and wherein the limiter is responsive to the attenuation factor to attenuate the input signal accordingly and thereby limit the sound pressure produced by the transducer.
19. Apparatus according to Claim 17 or Claim 18 wherein the energy-content representations include weightings in accordance with the frequency-dependent transfer function of the transducer.
20. Apparatus according to any one of Claims 12 to 16 wherein the energy-content representation representing the highest energy-content is selected, the selected representation is utilised in one or more comparisons between the energy-content representations, and processing of the input signal is carried out in dependence upon the results of the comparisons.
21. Apparatus according to Claim 20 wherein the synthesis of the input signal is in more than three of its components, and wherein three energy-content representations representing the highest energy-content are selected, the selected representations are utilised in one or more comparisons between them, and processing of the input signal is carried out in dependence upon the results of the comparisons.
22. Apparatus according to Claim 21 for use in a telephony system for suppressing DTMF tones in the input signal, the synthesis of the input signal is in the eight DTMF-tone component frequencies, and the processing of the input signal involves suppression of the DTMF-tone frequencies in the input signal in dependence upon the results of the comparisons.
PCT/GB2004/003929 2003-09-15 2004-09-15 Signal-processing methods and apparatus WO2005027541A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB0321565A GB0321565D0 (en) 2003-09-15 2003-09-15 Limiter and detector apparatus and methods
GB0321565.4 2003-09-15

Publications (1)

Publication Number Publication Date
WO2005027541A1 true WO2005027541A1 (en) 2005-03-24

Family

ID=29227100

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2004/003929 WO2005027541A1 (en) 2003-09-15 2004-09-15 Signal-processing methods and apparatus

Country Status (2)

Country Link
GB (2) GB0321565D0 (en)
WO (1) WO2005027541A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1867147A1 (en) * 2005-03-26 2007-12-19 Deutsche Telekom AG Method and system incorporating audioclips during a conversation
CN114582374A (en) * 2022-04-29 2022-06-03 北京百瑞互联技术有限公司 Dual-tone multi-frequency detection method, system, medium and equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5889851A (en) * 1996-02-26 1999-03-30 Lucent Technologies Inc. DTMF signal detection/removal using adaptive filters
US5970156A (en) * 1997-02-14 1999-10-19 Telefonaktiebolaget Lm Ericsson Method and apparatus for reducing periodic interference in audio signals
US6347134B1 (en) * 1998-02-05 2002-02-12 Picazo Communications Recording and playing voicemail with improved DTMF detection

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4538296A (en) * 1983-07-22 1985-08-27 Short Robert S Sound inhibitor for audio transducers
US5029217A (en) * 1986-01-21 1991-07-02 Harold Antin Digital hearing enhancement apparatus
US5631965A (en) * 1992-06-19 1997-05-20 Chang; Joseph S. Hearing protector
US5448646A (en) * 1993-11-01 1995-09-05 Unex Corporation Headset interface assembly
GB9905788D0 (en) * 1999-03-12 1999-05-05 Fulcrum Systems Ltd Background-noise reduction

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5889851A (en) * 1996-02-26 1999-03-30 Lucent Technologies Inc. DTMF signal detection/removal using adaptive filters
US5970156A (en) * 1997-02-14 1999-10-19 Telefonaktiebolaget Lm Ericsson Method and apparatus for reducing periodic interference in audio signals
US6347134B1 (en) * 1998-02-05 2002-02-12 Picazo Communications Recording and playing voicemail with improved DTMF detection

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1867147A1 (en) * 2005-03-26 2007-12-19 Deutsche Telekom AG Method and system incorporating audioclips during a conversation
CN114582374A (en) * 2022-04-29 2022-06-03 北京百瑞互联技术有限公司 Dual-tone multi-frequency detection method, system, medium and equipment
CN114582374B (en) * 2022-04-29 2022-07-05 北京百瑞互联技术有限公司 Dual-tone multi-frequency detection method, system, medium and equipment

Also Published As

Publication number Publication date
GB2406745A (en) 2005-04-06
GB0321565D0 (en) 2003-10-15
GB2406745B (en) 2007-05-09
GB0420544D0 (en) 2004-10-20

Similar Documents

Publication Publication Date Title
US6092040A (en) Audio signal time offset estimation algorithm and measuring normalizing block algorithms for the perceptually-consistent comparison of speech signals
EP1402517B1 (en) Speech feature extraction system
US6591234B1 (en) Method and apparatus for adaptively suppressing noise
JP4764995B2 (en) Improve the quality of acoustic signals including noise
US6820053B1 (en) Method and apparatus for suppressing audible noise in speech transmission
US5602962A (en) Mobile radio set comprising a speech processing arrangement
AU751333B2 (en) Method and device for blind equalizing of transmission channel effects on a digital speech signal
EP1592282B1 (en) Teleconferencing method and system
US5878389A (en) Method and system for generating an estimated clean speech signal from a noisy speech signal
WO1999003091A1 (en) Methods and apparatus for measuring signal level and delay at multiple sensors
JP2003534570A (en) How to suppress noise in adaptive beamformers
EP0976208B1 (en) Acoustic feedback elimination using adaptive notch filter algorithm
EP1093112B1 (en) A method for generating speech feature signals and an apparatus for carrying through this method
US20080004868A1 (en) Sub-band periodic signal enhancement system
US5768392A (en) Blind adaptive filtering of unknown signals in unknown noise in quasi-closed loop system
US20080219457A1 (en) Enhancement of Speech Intelligibility in a Mobile Communication Device by Controlling the Operation of a Vibrator of a Vibrator in Dependance of the Background Noise
JPH0667691A (en) Noise eliminating device
US6674863B2 (en) Microphone-speaker apparatus
WO2005027541A1 (en) Signal-processing methods and apparatus
JP2914332B2 (en) Spectrum feature parameter extraction device based on frequency weight evaluation function
JP3403549B2 (en) Echo canceller
Puder Kalman‐filters in subbands for noise reduction with enhanced pitch‐adaptive speech model estimation
EP1748426A2 (en) Method and apparatus for adaptively suppressing noise
US7010556B1 (en) Antenna treatment method and system
Zhang Designs, experiments, and applications of multichannel structures for hearing aids

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BW BY BZ CA CH CN CO CR CU CZ DK DM DZ EC EE EG ES FI GB GD GE GM HR HU ID IL IN IS JP KE KG KP KZ LC LK LR LS LT LU LV MA MD MK MN MW MX MZ NA NI NO NZ PG PH PL PT RO RU SC SD SE SG SK SY TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SZ TZ UG ZM ZW AM AZ BY KG MD RU TJ TM AT BE BG CH CY DE DK EE ES FI FR GB GR HU IE IT MC NL PL PT RO SE SI SK TR BF CF CG CI CM GA GN GQ GW ML MR SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase