EP1892703A1 - Method and system for providing an acoustic signal with extended bandwidth - Google Patents


Info

Publication number
EP1892703A1
EP1892703A1 EP06017456A EP06017456A EP1892703A1 EP 1892703 A1 EP1892703 A1 EP 1892703A1 EP 06017456 A EP06017456 A EP 06017456A EP 06017456 A EP06017456 A EP 06017456A EP 1892703 A1 EP1892703 A1 EP 1892703A1
Authority
EP
European Patent Office
Prior art keywords
signal
broadband
bandwidth
bandwidth limit
acoustic signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP06017456A
Other languages
German (de)
French (fr)
Other versions
EP1892703B1 (en)
Inventor
Tim Haulick
Bernd Iser
Gerhard Schmidt
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harman Becker Automotive Systems GmbH
Original Assignee
Harman Becker Automotive Systems GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harman Becker Automotive Systems GmbH
Priority to DE602006009927T (DE602006009927D1)
Priority to AT06017456T (ATE446572T1)
Priority to EP06017456A (EP1892703B1)
Priority to CA002596411A (CA2596411A1)
Priority to JP2007214930A (JP5150165B2)
Priority to KR1020070084306A (KR101433833B1)
Priority to CN2007101466102A (CN101141533B)
Publication of EP1892703A1
Application granted
Publication of EP1892703B1
Legal status: not in force
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks

Definitions

  • the invention is directed to a method and a system for providing an acoustic signal, in particular a speech signal, with extended bandwidth.
  • Acoustic signals transmitted via an analog or digital signal path usually suffer from the drawback that the signal path only has a restricted bandwidth such that the transmitted acoustic signal differs considerably from the original signal. For example, in the case of conventional telephone connections, a sampling rate of 8 kHz is used, resulting in a maximal signal bandwidth of 4 kHz. Compared to the case of audio CDs, the speech and audio quality is significantly reduced.
  • the bandwidth of telephone connections could be increased by using broadband or wideband digital coding and decoding methods (so-called broadband codecs).
  • both the transmitter and the receiver have to support corresponding coding and decoding methods which would require the implementation of a new standard.
  • systems for bandwidth extension can be used as described, for example, in P. Jax, Enhancement of Bandlimited Speech Signals: Algorithms and Theoretical Bounds, Dissertation, Aachen, Germany, 2002 or E. Larsen, R. M. Aarts, Audio Bandwidth Extension, Wiley, Hoboken, NJ, USA, 2004.
  • These systems are to be implemented on the receiver's side only such that existing telephone connections do not have to be changed.
  • the missing frequency components of an input signal with small bandwidth are estimated and added to the input signal.
  • An example of the structure and the corresponding signal flow in such a state of the art bandwidth extension system is illustrated in Fig. 6.
  • both the lower and the upper frequency ranges are re-synthesized.
  • an incoming or received acoustic signal $x(n)$ in digitized form is processed by sub-sampling and block extraction so as to obtain signal vectors $\mathbf{x}(n)$.
  • the variable n denotes the time.
  • the bandwidth extension is performed only within the missing frequency ranges.
  • the extension concerns low frequency (for example from 0 to 300 Hz) and/or high frequency (for example 3400 Hz to half of the desired sampling rate) ranges.
  • a narrowband spectral envelope is extracted from the narrowband signal, the narrowband signal being restricted by the bandwidth restrictions of the telephone channel.
  • a corresponding broadband envelope signal is estimated from the narrowband envelope.
  • the mappings are based, for example, on codebook pairs (see J. Epps, W. H. Holmes, A New Technique for Wideband Enhancement of Coded Narrowband Speech, IEEE Workshop on Speech Coding, Conference Proceedings, pages 174 to 176, June 1999) or on neural networks (see J.-M. Valin, R. Lefebvre, Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding, IEEE Workshop on Speech Coding, Conference Proceedings, pages 130 to 132, September 2000). In these methods, the entries of the codebooks or the weights of the neural networks are generated using training methods requiring large processor and memory resources.
  • a broadband or wideband excitation signal having a spectrally flat envelope is generated from the narrowband signal.
  • This excitation signal corresponds to the signal which would be recorded directly behind the vocal cords, i.e. the excitation signal contains information about voicing and pitch, but not about form and structures or the spectral shaping in general.
  • the excitation signal has to be weighted with the spectral envelope.
  • non-linear characteristics (see U. Kornagel, Spectral Widening of the Excitation Signal for Telephone-Band Speech Enhancement, IWAENC 01, Conference Proceedings, pages 215 to 218, September 2001)
  • two-way rectifying or squaring, for example.
  • the excitation signal $x_{\mathrm{exc}}(n)$ is spectrally colored using the envelope in block 604.
  • the spectral ranges used for the extension are extracted using a band stop filter in block 606 resulting in signal vectors $\mathbf{y}_{\mathrm{ext}}(n)$.
  • the band stop filter can be effective, for example, in the range from 200 to 3700 Hz.
  • the signal vectors $\mathbf{x}(n)$ of the received signal are passed through a complementary band pass filter in block 605. Then, the signal components $\mathbf{y}_{\mathrm{ext}}(n)$ and $\mathbf{y}_{\mathrm{tel}}(n)$ are added to obtain a signal vector $\mathbf{y}(n)$ with extended bandwidth. In block 607, the different signal vectors are assembled again and an over-sampling is performed resulting in a signal $y(n)$.
  • a method for providing an acoustic signal with extended bandwidth comprising:
  • the method according to the invention allows an adaptation of the bandwidth extension to the acoustic signal actually received. For example, when the transmitter uses an ISDN telephone, a broader frequency range is used compared to the case of a mobile phone with a hands-free system. Therefore, the bandwidth of a received acoustic signal will be extended only in those ranges where it is necessary so that the quality of the resulting signal is very high.
  • the received acoustic signal may be a digital signal or may be digitized.
  • steps (a) to (c) may be preceded by the step of converting the received acoustic signal to a predetermined sampling rate.
  • steps (a) to (c) may be preceded by the step of extracting a signal vector from the acoustic signal, in particular, the converted acoustic signal.
  • the signal vector may be obtained by sub-sampling the acoustic signal and may comprise a predefined number of entries. Then, subsequent (in time) signal vectors may overlap. The use of signal vectors simplifies further processing of the signals.
  • Steps (a) to (c) may be preceded by the step of determining a spectral vector of the received acoustic signal.
  • a window function may be applied to signal vectors of the received acoustic signal.
  • a Hann or a Hamming window may be used (see K. D. Kammeyer, K. Kroschel, Digitale Signaltechnik, 4th Edition, Teubner, Stuttgart, Germany, 1997).
  • Signal vectors, in particular the signal vectors weighted in this way may be transformed into the Fourier domain using a discrete Fourier transform.
  • the resulting vector is a short-term spectral vector. This allows for further processing in the Fourier domain.
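The block extraction, windowing and transform steps described above can be sketched as follows. This is only a minimal illustration under stated assumptions: the function names, the frame length of 32 samples, the hop of 16 samples and the naive DFT are hypothetical choices made for brevity and are not taken from the patent.

```python
import cmath
import math

def extract_frames(x, frame_len, hop):
    """Cut x into overlapping signal vectors of frame_len samples, advancing by hop."""
    return [x[s:s + frame_len] for s in range(0, len(x) - frame_len + 1, hop)]

def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * k / n) for k in range(n)]

def dft(v):
    """Naive O(N^2) discrete Fourier transform (sufficient for a short sketch)."""
    n = len(v)
    return [sum(v[k] * cmath.exp(-2j * math.pi * m * k / n) for k in range(n))
            for m in range(n)]

def short_term_spectrum(frame):
    """Weight one signal vector with a Hann window and transform it."""
    window = hann(len(frame))
    return dft([w * s for w, s in zip(window, frame)])

# Hypothetical test signal: a cosine with 4 cycles per 32-sample frame.
signal = [math.cos(2 * math.pi * 4 * k / 32) for k in range(64)]
frames = extract_frames(signal, frame_len=32, hop=16)
spectrum = short_term_spectrum(frames[0])
peak_bin = max(range(17), key=lambda m: abs(spectrum[m]))
```

With a hop smaller than the frame length, subsequent signal vectors overlap, as mentioned above; the peak of the short-term spectrum falls on the bin of the test cosine.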
  • step (b) may comprise determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
  • Such a decomposition into an envelope signal and an excitation signal simplifies determining the current bandwidth limits and increases the accuracy when determining a complementary signal.
  • Step (a) may comprise comparing a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal. It turned out that the long-term power spectrum is a suitable basis for determining current bandwidth limits of the acoustic signal.
  • determining a complementary signal in step (b) based on these current bandwidth limits, including determination of an envelope signal, makes it possible to iteratively adapt the current bandwidth limits by again comparing the (newly) determined envelope signal and a long-term power spectrum.
  • determining current bandwidth limits in step (a) may use a spectral envelope signal determined according to step (b), particularly in a preceding step or in a preceding iteration of the method.
  • determining a long-term power spectrum may comprise performing a first order recursive smoothing of the absolute values squared of the sub-band signals corresponding to the acoustic signal. This can be done, in particular, only if a wanted signal, such as a speech signal, has been detected in the received acoustic signal.
  • the long-term power spectrum may be normalized, particularly with respect to a long-term power spectrum within predetermined frequency limits.
  • the long-term power spectrum may be determined in the time domain. This can be done by determining the auto-correlation and performing an LPC analysis to obtain corresponding prediction coefficients.
  • the comparing step may comprise selecting the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
  • the predetermined constant can be chosen based on empirical or theoretical data.
  • the predetermined constant may be negative.
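The comparing step can be illustrated with a short sketch. It assumes, purely for illustration, that both spectra are given in dB on a common frequency grid, so that adding a negative constant lowers the envelope; the function name, the grid and all numbers are hypothetical.

```python
def current_band_limits(long_term_db, envelope_db, offset_db, freqs_hz):
    """Return (lowest, highest) frequency where long_term >= envelope + offset.

    Returns None if the condition holds for no frequency.
    """
    hits = [f for f, s, c in zip(freqs_hz, long_term_db, envelope_db)
            if s >= c + offset_db]
    if not hits:
        return None
    return min(hits), max(hits)

# Hypothetical data: a received signal band-limited to roughly 500..3000 Hz.
freqs = [0, 500, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000]
long_term = [-40, -3, 0, 0, -1, -2, -4, -30, -45, -50, -50]
envelope = [-5, -2, 0, 0, -1, -2, -3, -5, -8, -10, -12]
limits = current_band_limits(long_term, envelope, offset_db=-10.0, freqs_hz=freqs)
```

Here the long-term spectrum stays above the lowered envelope only between 500 Hz and 3000 Hz, so these two frequencies are returned as the current limits.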
  • determining a broadband spectral envelope signal may comprise selecting an envelope signal from a codebook according to a predetermined criterion.
  • By using codebooks, the required computing power can be reduced for determining an envelope signal.
  • different kinds of criteria can be used when selecting an envelope signal from a codebook.
  • a predetermined distance criterion such as a cepstral distance can be used, particularly if the codebook entries have the form of cepstral vectors.
  • selecting an envelope signal may comprise equalizing the received acoustic signal and selecting an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
  • Equalizing the acoustic signal modifies it such that a comparison with envelope signals from the codebook is simplified.
  • the received acoustic signal can be equalized in such a way that the resulting signal shows a long-term power spectrum corresponding to the long-term power spectrum of the signal used for training the codebook.
  • Equalizing can be restricted to frequencies between the current upper and lower bandwidth limits of the received acoustic signal; outside these limits, the signal may remain unchanged.
  • equalizing the received acoustic signal can be performed using a normalized long-term power spectrum of the signal used for training the codebooks, particularly using the normalized long-term power spectrum divided by the normalized long-term power spectrum of the received acoustic signal itself.
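A minimal sketch of such an equalization is given below. It assumes that the correction is applied to spectral amplitudes as the square root of the ratio of the two normalized long-term power spectra; the square root, the function name and the small floor value are assumptions made for this sketch and are not stated in the text.

```python
import math

def equalize(spectrum, s_train_norm, s_recv_norm, lo_bin, hi_bin, floor=1e-12):
    """Equalize spectrum inside [lo_bin, hi_bin]; bins outside stay unchanged.

    s_train_norm: normalized long-term power spectrum of the training signal.
    s_recv_norm:  normalized long-term power spectrum of the received signal.
    """
    out = list(spectrum)
    for m in range(lo_bin, hi_bin + 1):
        # Assumed amplitude correction: square root of the power ratio.
        gain = math.sqrt(s_train_norm[m] / max(s_recv_norm[m], floor))
        out[m] = spectrum[m] * gain
    return out

# Hypothetical five-bin example; bins 1..3 lie within the current limits.
spectrum = [1.0, 2.0, 2.0, 2.0, 1.0]
s_train = [1.0, 1.0, 4.0, 1.0, 1.0]
s_recv = [4.0, 4.0, 1.0, 1.0, 4.0]
equalized = equalize(spectrum, s_train, s_recv, lo_bin=1, hi_bin=3)
```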
  • the codebook may comprise pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit, and selecting an envelope signal may comprise determining a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and selecting the corresponding broadband envelope signal of this pair.
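The pair-wise lookup can be sketched as follows, assuming that the codebook is stored as a list of (narrowband, broadband) cepstral vector pairs and that the cepstral distance is taken as a plain squared Euclidean distance. Both the data layout and the entries are hypothetical.

```python
def cepstral_distance(a, b):
    """Squared Euclidean distance between two cepstral vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def select_broadband_entry(cepstrum, codebook):
    """Find the closest narrowband entry and return its paired broadband entry.

    codebook: list of (narrowband_cepstrum, broadband_cepstrum) pairs.
    """
    best = min(range(len(codebook)),
               key=lambda i: cepstral_distance(cepstrum, codebook[i][0]))
    return best, codebook[best][1]

# Hypothetical codebook with three pairs.
codebook = [
    ([0.9, 0.1], [1.0, 0.2, 0.05]),
    ([0.2, -0.4], [0.3, -0.5, 0.10]),
    ([-0.5, 0.3], [-0.4, 0.4, -0.20]),
]
index, broadband = select_broadband_entry([0.25, -0.35], codebook)
```

The search only ever compares against the narrowband half of each pair; the broadband half is returned without being part of the distance computation.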
  • When using a cepstral distance to select an envelope signal, the received acoustic signal, particularly in its equalized form, has to be transformed into the cepstral domain.
  • the step of selecting an envelope signal can further comprise the steps of determining the absolute value squared of the sub-band signals of the received acoustic signal, determining an auto-correlation in the time domain, particularly by performing an inverse discrete Fourier transform on the vector of the absolute value squared, determining prediction coefficients, particularly using the Levinson-Durbin algorithm, and performing a recursion to obtain the cepstral coefficients.
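The chain from power spectrum to cepstral coefficients can be sketched as below. The sketch assumes a prediction error filter of the form A(z) = 1 + a_1 z^-1 + ... + a_p z^-p and uses the standard recursion for the cepstrum of the all-pole model 1/A(z), ignoring the gain term c_0. Function names, the model order and the test spectrum are hypothetical.

```python
import math

def autocorrelation_from_power(power, max_lag):
    """Inverse DFT of a real, symmetric power spectrum for lags 0..max_lag."""
    n = len(power)
    return [sum(p * math.cos(2 * math.pi * m * lag / n)
                for m, p in enumerate(power)) / n
            for lag in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve for prediction error coefficients a (a[0] = 1) and error power."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[k] * r[i - k] for k in range(1, i))
        refl = -acc / err
        new_a = a[:]
        for k in range(1, i):
            new_a[k] = a[k] + refl * a[i - k]
        new_a[i] = refl
        a = new_a
        err *= 1.0 - refl * refl
    return a, err

def lpc_to_cepstrum(a):
    """Cepstrum c[1..p] of 1/A(z); c[0] (gain term) is left at zero."""
    order = len(a) - 1
    c = [0.0] * (order + 1)
    for n in range(1, order + 1):
        acc = sum(k * c[k] * a[n - k] for k in range(1, n))
        c[n] = -a[n] - acc / n
    return c

# Hypothetical test spectrum: 1 / |1 - 0.5 e^{-j Omega}|^2 on 64 bins.
n_dft, rho, order = 64, 0.5, 3
power = [1.0 / (1.0 - 2.0 * rho * math.cos(2 * math.pi * m / n_dft) + rho * rho)
         for m in range(n_dft)]
r = autocorrelation_from_power(power, order)
a, err = levinson_durbin(r, order)
c = lpc_to_cepstrum(a)
```

For this first-order test model the recovered coefficient a[1] is close to -0.5, and the cepstral coefficients follow the series 0.5, 0.125, ... expected for ln(1/(1 - 0.5 z^-1)).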
  • the method may further comprise the steps of recursively transforming a cepstral vector into prediction error coefficients, augmenting the prediction error filter vector by adding a predetermined number of zeros and subsequently performing a discrete Fourier transform to obtain an inverse spectrum, determining the reciprocal of each sub-band component to obtain a spectral envelope vector.
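The reverse direction, from a cepstral vector to a spectral envelope, can be sketched in the same spirit. The sketch assumes the same filter convention A(z) = 1 + a_1 z^-1 + ..., ignores the gain term c[0], and takes the reciprocal of the magnitude of each sub-band component; these choices and all names are assumptions made for illustration.

```python
import cmath
import math

def cepstrum_to_lpc(c):
    """Recursively convert cepstral coefficients c[1..p] into a[0..p], a[0] = 1."""
    order = len(c) - 1
    a = [1.0] + [0.0] * order
    for n in range(1, order + 1):
        acc = sum(k * c[k] * a[n - k] for k in range(1, n))
        a[n] = -c[n] - acc / n
    return a

def spectral_envelope(c, n_dft):
    """Zero-pad the prediction error filter, transform it, and invert each bin."""
    a = cepstrum_to_lpc(c)
    padded = a + [0.0] * (n_dft - len(a))
    inverse_spectrum = [
        sum(padded[k] * cmath.exp(-2j * math.pi * m * k / n_dft)
            for k in range(n_dft))
        for m in range(n_dft)
    ]
    return [1.0 / abs(v) for v in inverse_spectrum]

# Hypothetical cepstral vector (c[0] unused) of the model 1 / (1 - 0.5 z^-1).
envelope = spectral_envelope([0.0, 0.5, 0.125], n_dft=8)
```

The envelope of this test vector is largest at frequency zero and smallest at half the sampling rate, as expected for a low-pass all-pole model.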
  • the step of selecting an envelope signal may be preceded by providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
  • Such an adaptation of the codebook entries allows for an improved selection of a corresponding envelope signal from the codebook.
  • the adaptation would result in envelope signals in the codebook having an extended bandwidth. In this way, particularly fricatives can be more reliably detected.
  • the providing step may comprise processing broadband codebook envelope signals using a long-term power spectrum of the received acoustic signal.
  • the long-term power spectrum may be normalized; furthermore, the long-term power spectrum of the received acoustic signal may be divided by a normalized long-term power spectrum of a broadband signal used for training of the codebook.
  • the processing of the broadband codebook envelope signals may be performed only for frequencies outside the current bandwidth limits; within the bandwidth limits, the envelope signals may remain unchanged.
  • Processing using the long-term power spectrum may comprise weighting broadband codebook envelope signal vectors using the long-term power spectrum of the received acoustic signal.
  • determining a broadband excitation signal may be based on prediction error filtering and/or a non-linear characteristic. In this way, suitable excitation signals can be generated. Possible non-linear characteristics are disclosed, for example, in U. Kornagel, Spectral Widening of the Excitation Signal for Telephone-Band Speech Enhancement.
  • the at least one complementary signal may be based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and step (c) may comprise summing the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit.
  • the complementary signal is based on spectrally coloring the excitation signal using the envelope signal.
  • Step (c) may also comprise adapting the power of the complementary signal and/or the received acoustic signal. With this step, the power of the received acoustic signal can be maintained.
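The summation in step (c) together with a simple power adaptation can be sketched as follows. The sketch assumes that the complementary signal is scaled so that its power inside the current band matches that of the received signal; this particular matching rule, the function names and the data are assumptions made for illustration.

```python
import math

def band_power(spectrum, lo_bin, hi_bin):
    """Sum of squared magnitudes over the bins lo_bin..hi_bin (inclusive)."""
    return sum(abs(v) ** 2 for v in spectrum[lo_bin:hi_bin + 1])

def assemble_extended(received, synthesized, lo_bin, hi_bin):
    """Keep received bins inside the current limits, fill the rest synthetically.

    synthesized: product of broadband envelope and broadband excitation.
    """
    reference = band_power(synthesized, lo_bin, hi_bin)
    # Assumed power adaptation: match in-band power of the synthesized signal.
    gain = (math.sqrt(band_power(received, lo_bin, hi_bin) / reference)
            if reference > 0.0 else 1.0)
    return [received[m] if lo_bin <= m <= hi_bin else gain * synthesized[m]
            for m in range(len(received))]

# Hypothetical four-bin example; bins 1..2 lie within the current limits.
received = [0.0, 2.0, 2.0, 0.0]
synthesized = [1.0, 1.0, 1.0, 1.0]
extended = assemble_extended(received, synthesized, lo_bin=1, hi_bin=2)
```

Inside the current limits the received bins pass through untouched, so the power of the received acoustic signal is maintained.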
  • At least one of the steps may be performed in the cepstral domain. Particularly if the entries of the codebook are cepstral vectors, this allows for performing the method in a simpler way.
  • Steps (a) to (c) of the above methods may be repeated at predetermined time intervals. Then, the repeated adaptation to the currently received acoustic signal leads to a permanent high quality of the resulting broadband signal.
  • Steps (a) to (c) of the above methods may be repeated only if a wanted signal component, such as speech activity, is detected in the received acoustic signal.
  • an extension of the bandwidth of the received acoustic signal is advantageous.
  • restricting the method to the case of detected speech activity reduces the required computing power and avoids the presence of artifacts due to mal-adaptation.
  • the invention also provides a computer program product comprising one or more computer-readable media having computer-executable instructions for performing the steps of the above-described methods when run on a computer.
  • an apparatus for providing an acoustic signal with extended bandwidth comprising:
  • such an apparatus provides an advantageous way to extend the bandwidth of a received acoustic signal.
  • the quality of the resulting output signal is increased compared to the case of bandwidth extension systems with fixed parameters.
  • the complementary signal means may comprise a means for determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
  • the bandwidth determining means may be configured to compare a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal.
  • the bandwidth determining means may be configured to select the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
  • the means for determining a broadband spectral envelope signal may comprise a means for selecting an envelope signal from a codebook according to a predetermined criterion.
  • the means for selecting an envelope signal may be configured to equalize the received acoustic signal and select an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
  • the codebook may comprise pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit
  • the means for selecting an envelope signal may be configured to determine a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and to select the corresponding broadband envelope signal of this pair.
  • the means for determining a broadband spectral envelope signal may comprise a means for providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
  • the means for providing may be configured to process the broadband codebook envelope signal using a long-term power spectrum of the received acoustic signal.
  • the means for determining a broadband excitation signal may be configured to determine the broadband excitation signal based on prediction error filtering and/or a non-linear characteristic.
  • the at least one complementary signal may be based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and the assembling means may be configured to sum the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit.
  • At least one of the means may be configured to perform at least part of its function in the cepstral domain.
  • the means of the above-described apparatus may be configured to perform their respective function repeatedly at predetermined time intervals.
  • the apparatus may further comprise a wanted signal detector, in particular, a speech detector, and the means may be configured to perform their respective function only if a wanted signal component is detected in the received acoustic signal.
  • Fig. 1 shows the structure of the signal flow in an apparatus for providing an acoustic signal with extended bandwidth.
  • Fig. 2 is a flow diagram illustrating an example of a method for providing an acoustic signal with extended bandwidth which could be performed by the apparatus corresponding to Fig. 1. In view of this, Figs. 1 and 2 will be described simultaneously in the following.
  • an acoustic signal such as a speech signal
  • Because of the restricted bandwidth of the telephone line, an extension of the bandwidth is desired to improve the signal quality.
  • the signal is to be augmented so as to obtain a predetermined broader bandwidth. It is to be understood that the method described in the following can be used for bandwidth extension independent of the type of incoming signal and independent of the type of transmission line, i.e., it need not be a telephone line.
  • the acoustic signal x(n) received by block 101 has already been pre-processed by increasing the sampling rate up to the predetermined broadband or wideband bandwidth. In this way, however, no additional frequency components are generated. This can be achieved, for example, by using suitable anti-aliasing or anti-imaging filters.
  • This kind of bandwidth extension is preferably performed only for the "missing" frequency ranges; in the case of an analog telephone line, these ranges may be between 0 and 300 Hz and from 3400 Hz up to half of the desired sampling rate, for example, up to 3700 Hz.
  • signal vectors $\mathbf{x}(n)$ are generated (step 202). This can be achieved by taking every r-th sampling value up to a certain length.
  • the elements of this matrix can be chosen corresponding to different kinds of windows. Typical windows are the Hann or Hamming window.
  • the resulting short-term spectral vector has the form: $\mathbf{X}_w(n) = \left[ X(e^{j\Omega_0}, n),\, X(e^{j\Omega_1}, n),\, \ldots,\, X(e^{j\Omega_\mu}, n),\, \ldots,\, X(e^{j\Omega_{N_{\mathrm{DFT}}-1}}, n) \right]^T$, wherein $\Omega_\mu$ denotes the frequency variable.
  • a long-term power spectrum of the received acoustic signal is determined in block 102 (step 204).
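  • $S_{xx}(\Omega_\mu, n) = \begin{cases} \beta_{\mathrm{fre}}\, S_{xx}(\Omega_\mu, n-1) + (1 - \beta_{\mathrm{fre}})\, \left|X(e^{j\Omega_\mu}, n)\right|^2, & \text{during speech activity,} \\ S_{xx}(\Omega_\mu, n-1), & \text{else.} \end{cases}$
  • the time constant $\beta_{\mathrm{fre}}$ is chosen to be close to 1 ($0 < \beta_{\mathrm{fre}} < 1$) so as to obtain a sufficiently large averaging time.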
  • the recursive smoothing according to the first line of the above equation may be performed continuously. However, in order to avoid any artefacts, it may be performed only if a wanted signal component is present in the received acoustic signal, for example, if speech activity is detected.
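A minimal sketch of the first-order recursive smoothing with speech-activity gating is given below. The parameter name `beta`, its default value and the function name are hypothetical; the speech detector itself is assumed to be provided elsewhere and is represented only by a boolean flag.

```python
def update_long_term_spectrum(previous, spectrum, speech_active, beta=0.99):
    """One update of the long-term power spectrum for all sub-bands.

    previous:      long-term power spectrum from the preceding frame.
    spectrum:      current short-term spectrum (real or complex values).
    speech_active: result of an external speech detector (assumed given).
    beta:          smoothing constant close to 1 (hypothetical default).
    """
    if not speech_active:
        # Without a wanted signal the previous estimate is kept unchanged.
        return list(previous)
    return [beta * s + (1.0 - beta) * abs(x) ** 2
            for s, x in zip(previous, spectrum)]

# Hypothetical two-bin example with a deliberately small beta.
active = update_long_term_spectrum([1.0, 1.0], [2.0, 0.0], True, beta=0.5)
paused = update_long_term_spectrum([1.0, 1.0], [2.0, 0.0], False, beta=0.5)
```

A value of `beta` close to 1 makes each new frame contribute only slightly, which corresponds to the long averaging time mentioned above.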
  • a speech detector may be provided as described, for example, in E. Hänsler, G. Schmidt, Acoustic Echo and Noise Control - A Practical Approach, Wiley, Hoboken, NJ, USA, 2004.
  • the band limits $\Omega_{\mu_l}$ and $\Omega_{\mu_u}$ denote the lower and upper limits of a predefined frequency band.
  • this frequency band may correspond to a telephone band with minimal bandwidth for which the present method is to be used, for example, the limits may be 400 Hz and 3300 Hz.
  • the limits correspond to a band which is smaller than or at most equal to the narrow frequency band within which the codebook described below has been trained; these limits being denoted by $\Omega_l$ and $\Omega_u$.
  • an estimation can be performed in the time domain as well. For this purpose, an auto-correlation is estimated for about 10 to 20 sampling cycles of offset. Afterwards, prediction coefficients can be determined using an LPC (linear predictive coding) analysis.
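The time-domain alternative starts from an auto-correlation estimate for a small number of lags. A minimal sketch, assuming the biased estimator (normalization by the full frame length) and a hypothetical function name, is:

```python
def autocorrelation(x, max_lag):
    """Biased auto-correlation estimate of x for lags 0..max_lag."""
    n = len(x)
    return [sum(x[k] * x[k + lag] for k in range(n - lag)) / n
            for lag in range(max_lag + 1)]

# Hypothetical short frame; in practice max_lag would be about 10 to 20.
r = autocorrelation([1.0, 2.0, 3.0, 4.0], max_lag=2)
```

The resulting values would then serve as input to the LPC analysis that yields the prediction coefficients.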
  • the acoustic signal is equalized.
  • $\Omega_l(n-1)$ and $\Omega_u(n-1)$ denote the current lower and upper bandwidth limits of the received acoustic signal.
  • the bandwidth limits at time $(n-1)$ are taken as the current bandwidth limits.
  • $S_{\tilde{x}\tilde{x},\mathrm{norm}}(\Omega_\mu, n)$ denotes the normalized long-term power spectrum of the broadband signal which has been used for training the codebook. Normalizing of such a power spectrum is performed analogously to the case of the long-term power spectrum of the received acoustic signal described above. An example of such a normalized long-term power spectrum used for training a codebook is shown in Figure 3.
  • the acoustic signal is equalized only within the current bandwidth limits one time step before. Outside these bandwidth limits, no equalizing takes place.
  • An envelope signal corresponding to the received acoustic signal will be determined using a codebook.
  • the used codebook comprises a number of pairs of corresponding narrowband and broadband envelope signals.
  • the codebook has been obtained by training with a large database on the basis of a starting long-term power spectrum (see Y. Linde, A. Buzo, R. M. Gray, An Algorithm for Vector Quantizer Design, IEEE Trans. Comm., vol. COM-28, no. 1, pages 84-95, Jan. 1980).
  • the codebook entries are adapted in step 206 (block 104).
  • the narrowband codebook entries $\mathbf{c}_{i,s}(n)$ are adapted.
  • the broadband envelope signals are provided as cepstral vectors $\mathbf{c}_{i,b}(n)$.
  • the corresponding spectra $\mathbf{C}_{i,b}(n)$ are determined.
  • cepstral vectors are determined from the resulting spectral narrowband envelopes.
  • The conversion from spectral vectors to cepstral vectors and vice versa will be described in the following with respect to step 207 in which broadband spectral envelopes are determined (block 105).
  • a broadband spectral envelope from the codebook matching the acoustic signal best is determined by comparing the narrowband codebook entries with the spectral envelope of the spectrum of the acoustic signal (after equalizing).
  • the narrowband codebook entry is selected that has the smallest distance to the acoustic signal spectrum. In principle, different distance criteria can be used.
  • the cepstral distance is particularly useful as the codebook entries are provided in the form of cepstral vectors.
  • the corresponding broadband codebook entry is determined as the optimal broadband spectral envelope for the received acoustic signal. Due to the adaptation of the narrowband codebook entries as described above, an optimal narrowband envelope can be selected in a very reliable way.
  • Converting a spectral vector, particularly of the received acoustic signal, to a cepstral vector can be achieved by:
  • the optimal cepstral vector of the broadband codebook is designated by $\mathbf{c}_{\mathrm{opt},b}(n)$.
  • Fig. 4 illustrates an example of a codebook with four pairs of entries.
  • a corresponding original narrowband envelope, and a corresponding adapted narrowband envelope are shown.
  • the original broadband and narrowband codebook entries have been obtained on the basis of a large database for an ISDN telephone connection.
  • the resulting optimized entries have a higher upper limit frequency. This allows for an improved detection of fricatives.
  • In step 208, an excitation signal corresponding to the received acoustic signal is generated.
  • This broadband excitation signal shows a spectrally flat envelope. It corresponds to a signal which would be recorded directly behind the vocal cords.
  • the spectral envelope of the equalized short-term spectrum X eq ( n ) is estimated in the form of prediction error filter coefficients. Applying an inverse discrete Fourier transform to this spectral vector yields the corresponding time signal. After that, the vector in the time domain is filtered by a prediction error filter. The corresponding filter coefficients are those that have been determined previously.
  • a non-linear characteristic such as a two-way rectification or squaring, is applied to the filtered time domain vector. This generates the missing low frequency and high frequency signal components.
  • a transformation in the Fourier domain provides, then, the spectrum of the extended excitation signal X exc ( n ) .
  • determining an excitation signal can be performed in the time sub-band or Fourier domain as well. Examples for this alternative can be found in B. Iser, G. Schmidt, Bandwidth Extension of Telephony Speech, Eurasip Newsletter, Volume 16, Number 2, pages 2 to 24, June 2005 .
  • … $\left| Y_{erw}(e^{j\Omega_\mu}, n) \right|^2$ …, wherein $\Omega_l$ and $\Omega_u$ denote the same bandwidth limits as in the estimation of the long-term power spectrum above.
  • the current bandwidth limits are adapted in step 210 (block 108).
  • … $\left| C_{opt,b}(e^{j\Omega_\mu}, n) \right|^2 + K_C$ … $\Omega_u(n)$ … $\min$ …
  • Fig. 5 an example for determining the bandwidth limits is illustrated.
  • the above, intermediate limit values are given by the points of intersection between the lowered broadband spectral envelope and the spectrum of the received acoustic signal.
  • These intermediate limit values may be recursively smoothed to eliminate temporary mal-estimations.
  • smoothing is performed only if speech activity is detected in the current signal frame.
  • the received acoustic signal is passed through an adaptive band pass filter to retain only components within the current bandwidth limits (block 109) to obtain a spectral vector Y tel ( n ).
  • the spectrally colored excitation signal is passed through a complementary adaptive band stop filter (block 110) so as to obtain a vector Y ext ( n ).
  • $Y_{tel}(n) = G_{tel}(n)\, X_w(n)$
  • $Y_{ext}(n) = G_{ext}(n)\, X_{ext}(n)$
  • the weighting matrices G tel ( n ) and G ext ( n ) are diagonal matrices:
  • $G_{tel}(n) = \mathrm{diag}\{ G_{tel}(e^{j\Omega_0}, n), \ldots \}$
  • $G_{ext}(n) = \mathrm{diag}\{ G_{ext}(e^{j\Omega_0}, n), \ldots \}$
  • the transitions at the bandwidth limits can be realized in a smoother way.
  • the resulting time domain vectors are, then, assembled using an overlap add method (as described in K. D. Kammeyer, K. Kroschel, Digitale Signalmaschine ) to obtain the final output signal y ( n ).
  • the steps performed in the Fourier domain may also be performed in the time domain.
  • equalizing the acoustic signal may be performed when adapting the narrowband codebook entries.
  • the above-described equalizing step may be augmented. For example, if an amplification or an attenuation is detected at particular frequencies, it may be adjusted within the bandwidth limits as well. In this case, the output vector Y tel ( n ) is modified with the weighting matrix H mod ( n ).

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)

Abstract

The invention is directed to a method for providing an acoustic signal with extended bandwidth, comprising automatically determining a current upper and a current lower bandwidth limit of a received acoustic signal, automatically determining at least one complementary signal to complement the received acoustic signal between a predefined lower broadband bandwidth limit and the current lower bandwidth limit and/or between the current upper bandwidth limit and a predefined upper broadband bandwidth limit, wherein the predefined lower broadband bandwidth limit is smaller than the current lower bandwidth limit and the predefined upper broadband bandwidth limit is larger than the current upper bandwidth limit, and automatically assembling the at least one complementary signal and the received acoustic signal to obtain an acoustic signal with extended bandwidth.

Description

  • The invention is directed to a method and a system for providing an acoustic signal, in particular a speech signal, with extended bandwidth.
  • Acoustic signals transmitted via an analog or digital signal path usually suffer from the drawback that the signal path only has a restricted bandwidth such that the transmitted acoustic signal differs considerably from the original signal. For example, in the case of conventional telephone connections, a sampling rate of 8 kHz is used, resulting in a maximal signal bandwidth of 4 kHz. Compared to the case of audio CDs, the speech and audio quality is significantly reduced.
  • Furthermore, many kinds of transmissions show additional bandwidth restrictions. In the case of an analog telephone connection, only frequencies between 300 Hz and 3.4 kHz are transmitted. As a result, only 3.1 kHz of bandwidth is available.
  • In principle, the bandwidth of telephone connections could be increased by using broadband or wideband digital coding and decoding methods (so-called broadband codecs). In such a case, however, both the transmitter and the receiver have to support corresponding coding and decoding methods which would require the implementation of a new standard.
  • As an alternative, systems for bandwidth extension can be used as described, for example, in P. Jax, Enhancement of Bandlimited Speech Signals: Algorithms and Theoretical Bounds, Dissertation, Aachen, Germany, 2002 or E. Larsen, R. M. Aarts, Audio Bandwidth Extension, Wiley, Hoboken, NJ, USA, 2004. These systems are to be implemented on the receiver's side only such that existing telephone connections do not have to be changed. In these systems, the missing frequency components of an input signal with small bandwidth are estimated and added to the input signal.
  • An example of the structure and the corresponding signal flow in such a state of the art bandwidth extension system is illustrated in Fig. 6. In general, both the lower and the upper frequency ranges are re-synthesized.
  • At block 601, an incoming or received acoustic signal x(n) in digitized form is processed by sub-sampling and block extraction so as to obtain signal vectors x(n). Here, the variable n denotes the time. In this Figure, it is assumed that the incoming signal x(n) has already been converted to the desired bandwidth by increasing the sampling rate. In this conversion step, no additional frequency components are to be generated, which can be achieved, for example, by using appropriate anti-aliasing or anti-imaging filter elements. In order not to alter the transmitted signal, the bandwidth extension is performed only within the missing frequency ranges. Depending on the transmission method, the extension concerns low frequency (for example from 0 to 300 Hz) and/or high frequency (for example 3400 Hz to half of the desired sampling rate) ranges.
  • In block 602, a narrowband spectral envelope is extracted from the narrowband signal, the narrowband signal being restricted by the bandwidth restrictions of the telephone channel. Via a non-linear mapping, a corresponding broadband envelope signal is estimated from the narrowband envelope. The mappings are based, for example, on codebook pairs (see J. Epps, W. H. Holmes, A New Technique for Wideband Enhancement of Coded Narrowband Speech, IEEE Workshop on Speech Coding, Conference proceedings, pages 174 to 176 June 1999) or on Neural Networks (see J.-M. Valin R. Lefebvre, Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding, IEEE Workshop on Speech Coding, Conference Proceedings, pages 130 to 132, September 2000). In these methods, the entries of the codebooks or the weights of the neural networks are generated using training methods requiring large processor and memory resources.
  • Furthermore, in block 603, a broadband or wideband excitation signal having a spectrally flat envelope is generated from the narrowband signal. This excitation signal corresponds to the signal which would be recorded directly behind the vocal cords, i.e. the excitation signal contains information about voicing and pitch, but not about formant structures or the spectral shaping in general. Thus, to retrieve a complete signal, such as a speech signal, the excitation signal has to be weighted with the spectral envelope. For the generation of excitation signals, non-linear characteristics (see U. Kornagel, Spectral Widening of the Excitation Signal for Telephone-Band Speech Enhancement, IWAENC 01, Conference Proceedings, pages 215 to 218, September 2001) such as two-way rectifying or squaring, for example, may be used.
  • For bandwidth extension, the excitation signal x exc (n) is spectrally colored using the envelope in block 604. After that, the spectral ranges used for the extension are extracted using a band stop filter in block 606 resulting in signal vectors y ext (n). The band stop filter can be effective, for example, in the range from 200 to 3700 Hz.
  • The signal vectors x(n) of the received signal are passed through a complementary band pass filter in block 605. Then, the signal components y ext ( n ) and y tel ( n ) are added to obtain a signal vector y(n) with extended bandwidth. In block 607, the different signal vectors are assembled again and an over-sampling is performed resulting in a signal y(n).
  • In these prior art systems, the elements and their parameters are implemented once and, then, remain unchanged. Thus, all incoming acoustic signals are treated the same way. In view of this, it is an object underlying the present invention to provide a more flexible method and apparatus for providing an acoustic signal with extended bandwidth.
  • This problem is solved by the method according to claim 1 and the apparatus according to claim 16.
  • In accordance with the invention, a method for providing an acoustic signal with extended bandwidth is provided, comprising:
    1. (a) automatically determining a current upper and a current lower bandwidth limit of a received acoustic signal,
    2. (b) automatically determining at least one complementary signal to complement the received acoustic signal between a predefined lower broadband bandwidth limit and the current lower bandwidth limit and/or between the current upper bandwidth limit and a predefined upper broadband bandwidth limit, wherein the predefined lower broadband bandwidth limit is smaller than the current lower bandwidth limit and the predefined upper broadband bandwidth limit is larger than the current upper bandwidth limit,
    3. (c) automatically assembling the at least one complementary signal and the received acoustic signal to obtain an acoustic signal with extended bandwidth.
  • By determining current upper and lower bandwidth limits of a received acoustic signal and determining a complementary signal between the current bandwidth limits and the respective predefined broadband (or wideband) bandwidth limits, the method according to the invention allows an adaptation of the bandwidth extension to the acoustic signal actually received. For example, when the transmitter uses an ISDN telephone, a broader frequency range is used compared to the case of a mobile phone with a hands-free system. Therefore, the bandwidth of a received acoustic signal will be extended only in those ranges where it is necessary so that the quality of the resulting signal is very high.
  • In this way, on the one hand, no spectral gaps will occur even if the received signal covers only a very narrow frequency range. On the other hand, when receiving signals covering a relatively broad frequency range, no frequencies are cut-off when determining the complementary signal.
  • The received acoustic signal may be a digital signal or may be digitized. In the above method, steps (a) to (c) may be preceded by the step of converting the received acoustic signal to a predetermined sampling rate. Furthermore, steps (a) to (c) may be preceded by the step of extracting a signal vector from the acoustic signal, in particular, the converted acoustic signal. The signal vector may be obtained by sub-sampling the acoustic signal and may comprise a predefined number of entries. Then, subsequent (in time) signal vectors may overlap. The use of signal vectors simplifies further processing of the signals.
  • Steps (a) to (c) may be preceded by the step of determining a spectral vector of the received acoustic signal. In particular, a window function may be applied to signal vectors of the received acoustic signal. For example, a Hann or a Hamming window may be used (see K. D. Kammeyer, K. Kroschel, Digitale Signalverarbeitung, 4 th Edition, Teubner, Stuttgart, Germany 1997). Signal vectors, in particular the signal vectors weighted in this way, may be transformed into the Fourier domain using a discrete Fourier transform. The resulting vector is a short-term spectral vector. This allows for further processing in the Fourier domain.
  • In the above methods, step (b) may comprise determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
  • Such a decomposition into an envelope signal and an excitation signal simplifies determining the current bandwidth limits and increases the accuracy when determining a complementary signal.
  • Step (a) may comprise comparing a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal. It turned out that the long-term power spectrum is a suitable basis for determining current bandwidth limits of the acoustic signal.
  • Thus, if current bandwidth limits have been determined in step (a) in this way using a broadband spectral envelope signal of the received acoustic signal, and a complementary signal is determined in step (b) based on these current bandwidth limits, including determination of an envelope signal, the current bandwidth limits can be adapted iteratively by comparing the newly determined envelope signal with a long-term power spectrum again. In other words, determining current bandwidth limits in step (a) may use a spectral envelope signal determined according to step (b), particularly in a preceding step or in a preceding iteration of the method.
  • In particular, if the received acoustic signal has been transformed into the Fourier domain, determining a long-term power spectrum may comprise performing a first order recursive smoothing of the absolute values squared of the sub-band signals corresponding to the acoustic signal. This can be done, in particular, only if a wanted signal, such as a speech signal, has been detected in the received acoustic signal.
  • In addition, the long-term power spectrum may be normalized, particularly with respect to a long-term power spectrum within predetermined frequency limits.
  • Alternatively, the long-term power spectrum may be determined in the time domain. This can be done by determining the auto-correlation and performing an LPC analysis to obtain corresponding prediction coefficients.
  • The comparing step may comprise selecting the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
  • This is a particularly simple and reliable way to determine the bandwidth limits. The predetermined constant can be chosen based on empirical or theoretical data. The predetermined constant may be negative.
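  • The comparison described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `bandwidth_limits`, the use of dB values, the bin-index representation of frequencies and the default constant of -10 dB are all assumptions made for the example.

```python
import numpy as np

def bandwidth_limits(s_xx_db, env_db, k_c_db=-10.0):
    """Return (lowest, highest) bin index where s_xx_db >= env_db + k_c_db.

    s_xx_db : long-term power spectrum of the received signal (dB, assumed)
    env_db  : power spectrum of the selected broadband envelope (dB, assumed)
    k_c_db  : predetermined constant; a negative value lowers the envelope
    Returns None if no bin satisfies the condition.
    """
    idx = np.flatnonzero(s_xx_db >= env_db + k_c_db)
    if idx.size == 0:
        return None
    return int(idx[0]), int(idx[-1])

# Toy example: flat 0 dB envelope, signal energy present only in bins 10..40.
env_db = np.zeros(64)
s_xx_db = np.full(64, -40.0)
s_xx_db[10:41] = 0.0
limits = bandwidth_limits(s_xx_db, env_db)
```

    In this toy case `limits` holds the first and last bin in which the signal spectrum reaches the lowered envelope.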
  • In the above methods, determining a broadband spectral envelope signal may comprise selecting an envelope signal from a codebook according to a predetermined criterion.
  • By using codebooks, the computing power required for determining an envelope signal can be reduced. In principle, different kinds of criteria can be used when selecting an envelope signal from a codebook. In particular, a predetermined distance criterion such as a cepstral distance can be used, particularly if the codebook entries have the form of cepstral vectors.
  • In particular, selecting an envelope signal may comprise equalizing the received acoustic signal and selecting an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
  • Equalizing the acoustic signal modifies it such that a comparison with envelope signals from the codebook is simplified. In particular, the received acoustic signal can be equalized in such a way that the resulting signal shows a long-term power spectrum corresponding to the long-term power spectrum of the signal used for training the codebook. Equalizing can be restricted to frequencies between the current upper and lower bandwidth limits of the received acoustic signal; outside these limits, the signal may remain unchanged. In particular, equalizing the received acoustic signal can be performed using a normalized long-term power spectrum of the signal used for training the codebooks, particularly using the normalized long-term power spectrum divided by the normalized long-term power spectrum of the received acoustic signal itself.
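  • A possible form of such an equalizer is sketched below. The text only states that the ratio of the two normalized long-term power spectra is used; taking the square root of that ratio as an amplitude gain, the function name `equalize`, the bin-index limits `lo` and `hi` and the guard value `eps` are assumptions of this sketch.

```python
import numpy as np

def equalize(X_w, s_xx_norm, s_train_norm, lo, hi, eps=1e-12):
    """Equalize a short-term spectrum inside the bins lo..hi (inclusive).

    Inside the current bandwidth limits the gain is
    sqrt(s_train_norm / s_xx_norm) (assumed amplitude gain);
    outside the limits the spectrum is left unchanged.
    """
    gain = np.ones(len(X_w))
    band = slice(lo, hi + 1)
    gain[band] = np.sqrt(s_train_norm[band] / np.maximum(s_xx_norm[band], eps))
    return gain * X_w

# Toy example: received spectrum with constant power 0.5 in every bin.
s_train = np.linspace(1.0, 2.0, 32)
s_xx = np.full(32, 0.5)
X_w = np.sqrt(s_xx).astype(complex)
X_eq = equalize(X_w, s_xx, s_train, lo=4, hi=20)
```

    After equalization, the power of `X_eq` inside bins 4 to 20 follows the training spectrum, while the remaining bins are untouched.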
  • The codebook may comprise pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit, and selecting an envelope signal may comprise determining a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and selecting the corresponding broadband envelope signal of this pair.
  • In this way, a simple comparison between the received acoustic signal and the elements of the codebook can be performed as narrowband signals usually match a received acoustic signal with a narrow bandwidth more closely.
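  • The pair-wise codebook lookup can be illustrated with a short sketch. The squared Euclidean distance between cepstral vectors is used here as the cepstral distance, which is a common choice but an assumption with respect to this document; the function name `select_broadband_entry` and the toy codebook values are hypothetical.

```python
import numpy as np

def select_broadband_entry(c_signal, narrow_cb, broad_cb):
    """Pick the broadband entry paired with the closest narrowband entry.

    c_signal  : cepstral vector of the (equalized) received signal
    narrow_cb : array (entries, dims) of narrowband cepstral vectors
    broad_cb  : array (entries, dims_b) of paired broadband cepstral vectors
    """
    dist = np.sum((narrow_cb - c_signal) ** 2, axis=1)  # distance per entry
    i = int(np.argmin(dist))
    return i, broad_cb[i]

# Toy codebook with four pairs of entries (values are illustrative only).
narrow_cb = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]])
broad_cb = np.array([[1.0, 0.0, 0.5], [0.0, 1.0, 0.4],
                     [-1.0, 0.0, 0.3], [0.0, -1.0, 0.2]])
index, c_opt_b = select_broadband_entry(np.array([0.1, 0.8]), narrow_cb, broad_cb)
```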
  • When using a cepstral distance to select an envelope signal, the received acoustic signal, particularly in its equalized form, has to be transformed into the cepstral domain. Thus, the step of selecting an envelope signal can further comprise the steps of determining the absolute value squared of the sub-band signals of the received acoustic signal, determining an auto-correlation in the time domain, particularly by performing an inverse discrete Fourier transform on the vector of the absolute value squared, determining prediction coefficients, particularly using the Levinson-Durbin algorithm, performing a recursion to obtain the cepstral coefficients.
  • In order to determine a spectral envelope from the cepstral vectors, the method may further comprise the steps of recursively transforming a cepstral vector into prediction error coefficients, augmenting the prediction error filter vector by adding a predetermined number of zeros and subsequently performing a discrete Fourier transform to obtain an inverse spectrum, determining the reciprocal of each sub-band component to obtain a spectral envelope vector.
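  • The two conversion chains described above (spectrum to cepstrum, and cepstrum back to a spectral envelope) can be sketched as follows. The sketch assumes a prediction error filter of the form A(z) = 1 + a_1 z^-1 + ... + a_p z^-p, ignores the gain term of the cepstrum, and returns the magnitude of the envelope; all function names are hypothetical.

```python
import numpy as np

def levinson_durbin(r, order):
    """Solve the normal equations for the prediction error filter a (a[0] = 1)."""
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                      # reflection coefficient
        prev = a.copy()
        a[1:i] = prev[1:i] + k * prev[i - 1:0:-1]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

def lpc_to_cepstrum(a, n_cep):
    """Recursion from prediction error coefficients to cepstral coefficients c_1..c_n."""
    p = len(a) - 1
    c = np.zeros(n_cep + 1)                 # c[0] (gain term) is not used
    for n in range(1, n_cep + 1):
        a_n = a[n] if n <= p else 0.0
        s = sum(k * c[k] * a[n - k] for k in range(1, n) if n - k <= p)
        c[n] = -a_n - s / n
    return c[1:]

def cepstrum_to_lpc(c, order):
    """Inverse recursion: cepstral coefficients c_1.. to prediction error coefficients."""
    a = np.zeros(order + 1)
    a[0] = 1.0
    for n in range(1, order + 1):
        s = sum(k * c[k - 1] * a[n - k] for k in range(1, n))
        a[n] = -c[n - 1] - s / n
    return a

def spectrum_to_cepstrum(X, order, n_cep):
    """|X|^2 -> autocorrelation (inverse DFT) -> Levinson-Durbin -> cepstrum."""
    r = np.fft.ifft(np.abs(X) ** 2).real
    a, _ = levinson_durbin(r, order)
    return lpc_to_cepstrum(a, n_cep)

def envelope_from_cepstrum(c, order, n_dft):
    """Cepstrum -> prediction error filter -> zero-padded DFT -> reciprocal magnitude."""
    a = cepstrum_to_lpc(c, order)
    inverse_spectrum = np.fft.fft(a, n=n_dft)   # fft pads a with zeros up to n_dft
    return 1.0 / np.abs(inverse_spectrum)

# Toy check with a known second-order filter.
a_demo = np.array([1.0, -0.9, 0.2])
c_demo = lpc_to_cepstrum(a_demo, 4)
env_demo = envelope_from_cepstrum(c_demo, order=2, n_dft=1024)
```

    For an all-pole spectrum such as the toy case, running `spectrum_to_cepstrum` on the envelope should reproduce `c_demo` up to numerical precision.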
  • In the above methods, the step of selecting an envelope signal may be preceded by providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
  • Such an adaptation of the codebook entries allows for an improved selection of a corresponding envelope signal from the codebook. In particular, if the received acoustic signal shows a broader bandwidth than the original narrowband envelope signals of the codebook, the adaptation would result in envelope signals in the codebook having an extended bandwidth. In this way, particularly fricatives can be more reliably detected.
  • The providing step may comprise processing broadband codebook envelope signals using a long-term power spectrum of the received acoustic signal.
  • Due to the use of the power spectrum of the received acoustic signal, a suitable adaptation to the acoustic signal can be obtained. The long-term power spectrum may be normalized; furthermore, the long-term power spectrum of the received acoustic signal may be divided by a normalized long-term power spectrum of a broadband signal used for training of the codebook. The processing of the broadband codebook envelope signals may be performed only for frequencies outside the current bandwidth limits; within the bandwidth limits, the envelope signals may remain unchanged. Processing using the long-term power spectrum may comprise weighting broadband codebook envelope signal vectors using the long-term power spectrum of the received acoustic signal.
  • In the above methods, determining a broadband excitation signal may be based on prediction error filtering and/or a non-linear characteristic. In this way, suitable excitation signals can be generated. Possible non-linear characteristics are disclosed, for example, in U. Kornagel, Spectral Widening of the Excitation Signal for Telephone-Band Speech Enhancement.
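  • A minimal sketch of this excitation generation is given below, assuming a time-domain frame, a prediction error filter given by its coefficient vector `a`, and two-way (full-wave) rectification as the non-linear characteristic. The function name `broadband_excitation` and the toy signal are hypothetical.

```python
import numpy as np

def broadband_excitation(x, a):
    """Prediction error filtering followed by full-wave rectification.

    x : time-domain frame of the (narrowband) signal
    a : prediction error filter coefficients, a[0] = 1
    """
    e = np.convolve(x, a)[: len(x)]   # e(k) = sum_i a[i] * x(k - i)
    return np.abs(e)                  # non-linearity creates new harmonics

# Toy example: a pure tone in DFT bin 8, identity filter for clarity.
k = np.arange(256)
x = np.sin(2 * np.pi * 8 * k / 256)
exc = broadband_excitation(x, np.array([1.0]))
spec_in = np.abs(np.fft.fft(x))
spec_exc = np.abs(np.fft.fft(exc))
```

    In the toy case the input has no energy in bin 16, whereas the rectified signal does, which illustrates how the non-linear characteristic produces components outside the original band.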
  • In the above methods, the at least one complementary signal may be based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and step (c) may comprise summing the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit.
  • Thus, the complementary signal is based on spectrally coloring the excitation signal using the envelope signal. By adding a complementary signal only outside the current bandwidth limits of the received acoustic signal, artifacts are avoided in the resulting signal with extended bandwidth.
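  • The summation in step (c) can be sketched with two complementary binary masks over the frequency bins. Binary (hard) masks, the bin-index limits and the function name `assemble` are assumptions of this sketch; smoother transitions at the limits are equally conceivable.

```python
import numpy as np

def assemble(X_tel, X_col, lo, hi, bb_lo, bb_hi):
    """Combine the received spectrum and the spectrally colored excitation.

    X_tel        : spectrum of the received signal
    X_col        : excitation spectrum weighted with the broadband envelope
    lo, hi       : current lower/upper bandwidth limits (bin indices)
    bb_lo, bb_hi : predefined broadband limits (bin indices)
    """
    bins = np.arange(len(X_tel))
    g_tel = ((bins >= lo) & (bins <= hi)).astype(float)            # band pass
    g_ext = (((bins >= bb_lo) & (bins < lo)) |
             ((bins > hi) & (bins <= bb_hi))).astype(float)        # band stop
    return g_tel * X_tel + g_ext * X_col

# Toy example: received spectrum of ones, colored excitation of twos.
Y = assemble(np.ones(16), 2.0 * np.ones(16), lo=4, hi=9, bb_lo=1, bb_hi=13)
```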
  • Step (c) may also comprise adapting the power of the complementary signal and/or the received acoustic signal. With this step, the power of the received acoustic signal can be maintained.
  • In the above-described methods, at least one of the steps may be performed in the cepstral domain. Particularly if the entries of the codebook are cepstral vectors, this allows for performing the method in a simpler way.
  • Steps (a) to (c) of the above methods may be repeated at predetermined time intervals. Then, the repeated adaptation to the currently received acoustic signal leads to a permanent high quality of the resulting broadband signal.
  • Steps (a) to (c) of the above methods may be repeated only if a wanted signal component, such as speech activity, is detected in the received acoustic signal. Particularly in the case of speech signals, an extension of the bandwidth of the received acoustic signal is advantageous. Thus, restricting the method to the case of detected speech activity reduces the required computing power and avoids the presence of artifacts due to mal-adaptation.
  • The invention also provides a computer program product comprising one or more computer-readable media having computer-executable instructions for performing the steps of the above-described methods when run on a computer.
  • Furthermore, an apparatus for providing an acoustic signal with extended bandwidth is provided, comprising:
    • bandwidth determining means for automatically determining a current upper and a current lower bandwidth limit of a received acoustic signal,
    • complementary signal means for automatically determining at least one complementary signal to complement the received acoustic signal between a predefined lower broadband bandwidth limit and the current lower bandwidth limit and/or between the current upper bandwidth limit and a predefined upper broadband bandwidth limit, wherein the predefined lower broadband bandwidth limit is smaller than the current lower bandwidth limit and the predefined upper broadband bandwidth limit is larger than the current upper bandwidth limit, and
    • assembling means for automatically assembling the at least one complementary signal and the received acoustic signal to obtain an acoustic signal with extended bandwidth.
  • Analogous to the above-described method, such an apparatus provides an advantageous way to extend the bandwidth of a received acoustic signal. In particular, due to the determination of current upper and lower bandwidth limits of the received acoustic signal and a corresponding determination of a complementary signal, the quality of the resulting output signal is increased compared to the case of bandwidth extension systems with fixed parameters.
  • The complementary signal means may comprise a means for determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
  • The bandwidth determining means may be configured to compare a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal.
  • The bandwidth determining means may be configured to select the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
  • In the above-described apparatus, the means for determining a broadband spectral envelope signal may comprise a means for selecting an envelope signal from a codebook according to a predetermined criterion.
  • The means for selecting an envelope signal may be configured to equalize the received acoustic signal and select an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
  • In the above-described apparatus, the codebook may comprise pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit, and the means for selecting an envelope signal may be configured to determine a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and to select the corresponding broadband envelope signal of this pair.
  • The means for determining a broadband spectral envelope signal may comprise a means for providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
  • The means for providing may be configured to process the broadband codebook envelope signal using a long-term power spectrum of the received acoustic signal.
  • In the above-described apparatus, the means for determining a broadband excitation signal may be configured to determine the broadband excitation signal based on prediction error filtering and/or a non-linear characteristic.
  • The at least one complementary signal may be based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and the assembling means may be configured to sum the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit.
  • In the above-described apparatus, at least one of the means may be configured to perform at least part of its function in the cepstral domain.
  • The means of the above-described apparatus may be configured to perform their respective function repeatedly at predetermined time intervals.
  • The apparatus may further comprise a wanted signal detector, in particular, a speech detector, and the means may be configured to perform their respective function only if a wanted signal component is detected in the received acoustic signal.
  • Further features and advantages of the invention will be described in the following with reference to the figures.
  • Fig. 1
    illustrates the structure of an example of an apparatus for providing an acoustic signal with extended bandwidth;
    Fig. 2
    is a flow diagram of an example of a method for providing an acoustic signal with extended bandwidth;
    Fig. 3
    illustrates an example of a normalized long-term power spectrum for training a codebook;
    Fig. 4
    illustrates examples of codebook entries;
    Fig. 5
    illustrates the determination of current bandwidth limits;
    Fig. 6
    illustrates the structure of a prior art system.
  • Fig. 1 shows the structure of the signal flow in an apparatus for providing an acoustic signal with extended bandwidth. Fig. 2 is a flow diagram illustrating an example of a method for providing an acoustic signal with extended bandwidth which could be performed by the apparatus corresponding to Fig. 1. In view of this, Figs. 1 and 2 will be described together in the following.
  • According to step 201, an acoustic signal, such as a speech signal, is received via a telephone line. Because of the restricted bandwidth of the telephone line, an extension of the bandwidth is desired to improve the signal quality. Thus, the signal is to be augmented so as to obtain a predetermined broader bandwidth. It is to be understood that the method described in the following can be used for bandwidth extension independent of the type of incoming signal and independent of the type of transmission line, i.e., it need not be a telephone line.
  • The acoustic signal x(n) received by block 101 has already been pre-processed by increasing the sampling rate up to the predetermined broadband or wideband bandwidth. In this way, however, no additional frequency components are generated. This can be achieved, for example, by using suitable anti-aliasing or anti-imaging filters. This kind of bandwidth extension, preferably, is performed only for the "missing" frequency ranges; in the case of an analog telephone line, these ranges may be between 0 and 300 Hz and 3400 Hz up to half of the desired sampling rate, for example, up to 3700 Hz.
  • From the resulting signal x(n), n denoting the time variable, signal vectors x(n) are generated (step 202). This can be achieved by extracting a vector of a certain length every r sampling values. Thus, a signal vector with $N_{ana}$ elements has the form:
    $$\mathbf{x}(n) = \left[ x(nr),\ x(nr-1),\ \ldots,\ x(nr - N_{ana} + 1) \right]^T .$$
  • It is to be noted that an overlap may exist between neighboring signal vectors. For a desired or final sampling rate of 11.025 kHz, one may take the following values: $r = 64$ and $N_{\mathrm{ana}} = 256$.
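  • The framing described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `signal_vector` and the zero padding before the start of the signal are assumptions made for the example.

```python
from typing import List, Sequence

R = 64        # frame shift r from the text
N_ANA = 256   # frame length N_ana from the text


def signal_vector(x: Sequence[float], n: int, r: int = R, n_ana: int = N_ANA) -> List[float]:
    """Return [x(nr), x(nr-1), ..., x(nr-n_ana+1)], newest sample first.

    Samples outside the recorded signal are treated as zero (an assumption).
    """
    newest = n * r
    return [x[newest - k] if 0 <= newest - k < len(x) else 0.0 for k in range(n_ana)]


x = [float(i) for i in range(1024)]   # toy signal: the sample value equals its index
frame_a = signal_vector(x, 4)         # covers samples 256 down to 1
frame_b = signal_vector(x, 5)         # covers samples 320 down to 65
shared = len(set(frame_a) & set(frame_b))
```

    With r = 64 and N_ana = 256, neighboring vectors share N_ana - r samples, which is the overlap mentioned in the text.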
  • After that (step 203), a windowing procedure is performed on the signal vector so as to obtain a windowed signal vector $\mathbf{x}_w(n)$:
    $$\mathbf{x}_w(n) = \mathbf{F}\,\mathbf{x}(n).$$
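  • The window matrix $\mathbf{F}$ is a diagonal matrix of the form
    $$\mathbf{F} = \begin{bmatrix} h_0 & 0 & 0 & \cdots & 0 \\ 0 & h_1 & 0 & \cdots & 0 \\ 0 & 0 & h_2 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & h_{N_{\mathrm{ana}}-1} \end{bmatrix}.$$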
  • The elements of this matrix can be chosen corresponding to different kinds of windows. Typical windows are the Hann or Hamming window. The weighted signal vectors are transformed into the Fourier domain using a discrete Fourier transform:
    $$\mathbf{X}_w(n) = \mathrm{DFT}\left\{\mathbf{x}_w(n)\right\}.$$
  • The resulting short-term spectral vector has the form:
    $$\mathbf{X}_w(n) = \left[X_w(e^{j\Omega_0}, n),\, X_w(e^{j\Omega_1}, n),\, \ldots,\, X_w(e^{j\Omega_\mu}, n),\, \ldots,\, X_w(e^{j\Omega_{N_{\mathrm{DFT}}-1}}, n)\right]^{T},$$
    wherein $\Omega_\mu$ denotes the frequency variable.
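  • As an illustration of the windowing and transform steps, the following sketch applies a Hann window and a plain discrete Fourier transform to one frame. It assumes a periodic Hann window and a DFT length equal to the frame length; the helper names are hypothetical, and a production system would use an FFT instead of the O(N²) sum.

```python
import cmath
import math
from typing import List, Sequence


def hann_window(n_ana: int) -> List[float]:
    """Periodic Hann window h_0 ... h_{n_ana-1}, i.e. the diagonal of F."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * k / n_ana) for k in range(n_ana)]


def dft(v: Sequence[float]) -> List[complex]:
    """Plain O(N^2) DFT; bin mu corresponds to Omega_mu = 2*pi*mu/N."""
    n = len(v)
    return [sum(v[k] * cmath.exp(-2j * math.pi * mu * k / n) for k in range(n))
            for mu in range(n)]


def short_term_spectrum(frame: Sequence[float]) -> List[complex]:
    """X_w(n) = DFT{F x(n)} with F = diag(h_0, ..., h_{N-1})."""
    h = hann_window(len(frame))
    windowed = [h_k * x_k for h_k, x_k in zip(h, frame)]
    return dft(windowed)


# Toy frame: a cosine centred exactly on bin 4 of a 32-point frame.
N = 32
frame = [math.cos(2.0 * math.pi * 4 * k / N) for k in range(N)]
spectrum = short_term_spectrum(frame)
peak_bin = max(range(N // 2 + 1), key=lambda mu: abs(spectrum[mu]))
```

    The Hann window spreads the tone over the peak bin and its two neighbours, which is the usual price paid for suppressing leakage into more distant bins.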
  • Based on the spectral vectors, a long-term power spectrum of the received acoustic signal is determined in block 102 (step 204). There are different possibilities to estimate such a long-term power spectrum. According to one alternative, a first-order recursive smoothing is performed on the squared absolute value of the sub-band signals $X_w(e^{j\Omega_\mu}, n)$:
    $$\hat{S}_{xx}(\Omega_\mu, n) = \begin{cases} \beta_{\mathrm{fre}}\,\hat{S}_{xx}(\Omega_\mu, n-1) + (1 - \beta_{\mathrm{fre}})\,\left|X_w(e^{j\Omega_\mu}, n)\right|^2, & \text{during speech activity}, \\ \hat{S}_{xx}(\Omega_\mu, n-1), & \text{else}. \end{cases}$$
  • Preferably, the time constant β fre is chosen to be close to 1 (0 << β fre < 1) so as to obtain a sufficiently large averaging time.
  • In principle, the recursive smoothing according to the first line of the above equation may be performed continuously. However, in order to avoid any artefacts, it may be performed only if a wanted signal component is present in the received acoustic signal, for example, if speech activity is detected. For this purpose, a speech detector may be provided as described, for example, in E. Hänsler, G. Schmidt, Acoustic Echo and Noise Control - A Practical Approach, Wiley, Hoboken, NJ, USA, 2004.
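  • The gated recursive smoothing can be sketched as a per-bin update. The default value 0.995 for the smoothing constant and the boolean `speech_active` flag (standing in for the output of a speech detector) are assumptions chosen for illustration only.

```python
from typing import List, Sequence


def update_long_term_psd(prev: Sequence[float], spectrum: Sequence[complex],
                         speech_active: bool, beta: float = 0.995) -> List[float]:
    """One step of first-order recursive smoothing per frequency bin.

    The estimate is frozen whenever no speech activity is reported.
    """
    if not speech_active:
        return list(prev)
    return [beta * s + (1.0 - beta) * abs(x) ** 2 for s, x in zip(prev, spectrum)]


psd = [0.0, 0.0]
psd = update_long_term_psd(psd, [2 + 0j, 1j], speech_active=True, beta=0.9)
frozen = update_long_term_psd(psd, [10 + 0j, 10j], speech_active=False, beta=0.9)
```

    A value of beta closer to 1 lengthens the effective averaging time, at the cost of a slower reaction to changes of the transmission channel.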
  • In order to simplify the further processing, the long-term power spectrum may be normalized to the long-term power within a predefined frequency band:
    $$\hat{S}_{xx,\mathrm{norm}}(\Omega_\mu, n) = \frac{\hat{S}_{xx}(\Omega_\mu, n)}{\sum_{\mu=\mu_l}^{\mu_u} \hat{S}_{xx}(\Omega_\mu, n)}.$$
  • The band limits $\Omega_{\mu_l}$ and $\Omega_{\mu_u}$ denote the lower and upper limits of a predefined frequency band. For example, this frequency band may correspond to the telephone band with the smallest bandwidth for which the present method is to be used; the limits may be, for example, 400 Hz and 3300 Hz. Preferably, the limits correspond to a band which is narrower than or at most equal to the narrow frequency band within which the codebook described below has been trained, these limits being denoted by $\Omega_l$ and $\Omega_u$.
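  • A minimal sketch of the normalization follows. The mapping from Hz to bin indices, the DFT length of 256 and the toy spectrum are assumptions; only the band limits of 400 Hz and 3300 Hz and the sampling rate of 11.025 kHz are taken from the text.

```python
from typing import List, Sequence


def hz_to_bin(freq_hz: float, n_dft: int, sample_rate_hz: float) -> int:
    """Nearest DFT bin index for a frequency in Hz."""
    return round(freq_hz * n_dft / sample_rate_hz)


def normalize_psd(psd: Sequence[float], mu_l: int, mu_u: int) -> List[float]:
    """Divide every bin by the power summed over bins mu_l..mu_u (inclusive)."""
    band_power = sum(psd[mu_l:mu_u + 1])
    if band_power <= 0.0:
        raise ValueError("no power inside the normalization band")
    return [s / band_power for s in psd]


mu_l = hz_to_bin(400.0, 256, 11025.0)
mu_u = hz_to_bin(3300.0, 256, 11025.0)
psd = [1.0 + 0.01 * mu for mu in range(129)]   # toy one-sided power spectrum
psd_norm = normalize_psd(psd, mu_l, mu_u)
```

    After normalization the power inside the reference band sums to one, so spectra recorded at different levels become directly comparable.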
  • As an alternative to determining the long-term power spectrum in the frequency domain, an estimation can be performed in the time domain as well. For this purpose, an auto-correlation is estimated for offsets of about 10 to 20 sampling cycles. Afterwards, prediction coefficients can be determined using an LPC (linear predictive coding) analysis. The long-term power spectrum is then obtained via a discrete Fourier transform and a division.
  • In block 103 (step 205), the acoustic signal is equalized. The equalizing is performed on the spectral vector determined above:
    $$\mathbf{X}_{\mathrm{eq}}(n) = \mathbf{H}_{\mathrm{eq}}(n)\,\mathbf{X}_w(n).$$
  • The equalizing matrix $\mathbf{H}_{\mathrm{eq}}(n)$ is a diagonal matrix of the form
    $$\mathbf{H}_{\mathrm{eq}}(n) = \begin{bmatrix} H_{\mathrm{eq}}(e^{j\Omega_0}, n) & 0 & \cdots & 0 \\ 0 & H_{\mathrm{eq}}(e^{j\Omega_1}, n) & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & H_{\mathrm{eq}}(e^{j\Omega_{N_{\mathrm{DFT}}-1}}, n) \end{bmatrix}$$
    with the entries
    $$H_{\mathrm{eq}}(e^{j\Omega_\mu}, n) = \begin{cases} 1, & \text{if } \Omega_\mu < \Omega_l(n-1) \text{ or } \Omega_\mu > \Omega_u(n-1), \\ \dfrac{\hat{S}_{bb,\mathrm{norm}}(\Omega_\mu, n)}{\hat{S}_{xx,\mathrm{norm}}(\Omega_\mu, n)}, & \text{else}, \end{cases}$$
    and
    $$H_{\mathrm{eq}}(e^{j\Omega_\mu}, n) = \begin{cases} H_{\mathrm{eq,max}}, & \text{if } H_{\mathrm{eq}}(e^{j\Omega_\mu}, n) > H_{\mathrm{eq,max}}, \\ H_{\mathrm{eq,min}}, & \text{if } H_{\mathrm{eq}}(e^{j\Omega_\mu}, n) < H_{\mathrm{eq,min}}, \\ H_{\mathrm{eq}}(e^{j\Omega_\mu}, n), & \text{else}. \end{cases}$$
  • In the equations above, $\Omega_l(n-1)$ and $\Omega_u(n-1)$ denote the current lower and upper bandwidth limits of the received acoustic signal. Thus, for obtaining an updated equalized signal, the bandwidth limits at time (n-1) are taken as the current bandwidth limits. Furthermore, $\hat{S}_{bb,\mathrm{norm}}(\Omega_\mu, n)$ denotes the normalized long-term power spectrum of the broadband signal which has been used for training the codebook. Normalizing of such a power spectrum is performed analogously to the case of the long-term power spectrum of the received acoustic signal described above. An example of such a normalized long-term power spectrum used for training a codebook is shown in Figure 3.
  • The equalizing is restricted to minimum and maximum values, for example, to $H_{\mathrm{eq,min}} = -12\ \mathrm{dB}$ and $H_{\mathrm{eq,max}} = 12\ \mathrm{dB}$.
  • As can be seen from the above, the acoustic signal is equalized only within the bandwidth limits determined one time step earlier. Outside these bandwidth limits, no equalizing takes place.
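  • The equalizer weights can be sketched as follows. Several points are assumptions made for the example: the bandwidth limits are given as bin indices, the weight inside the band is the plain ratio of the normalized training spectrum to the normalized spectrum of the received signal, and the limits of ±12 dB are converted to linear gains with the 20·log10 convention.

```python
from typing import List, Sequence


def db_to_gain(level_db: float) -> float:
    """Convert a level in dB to a linear gain (20*log10 convention, assumed)."""
    return 10.0 ** (level_db / 20.0)


def equalizer_gains(psd_rx_norm: Sequence[float], psd_train_norm: Sequence[float],
                    mu_l_prev: int, mu_u_prev: int,
                    h_min_db: float = -12.0, h_max_db: float = 12.0) -> List[float]:
    """Per-bin equalizer weights; bins outside the previous limits pass unchanged."""
    h_min, h_max = db_to_gain(h_min_db), db_to_gain(h_max_db)
    gains = []
    for mu, (s_rx, s_train) in enumerate(zip(psd_rx_norm, psd_train_norm)):
        if mu < mu_l_prev or mu > mu_u_prev:
            gains.append(1.0)                       # no equalizing outside the band
            continue
        raw = s_train / s_rx if s_rx > 0.0 else h_max
        gains.append(min(max(raw, h_min), h_max))   # restrict to [h_min, h_max]
    return gains


gains = equalizer_gains([1.0, 1.0, 1.0, 1.0, 1.0],
                        [9.0, 2.0, 100.0, 0.001, 9.0],
                        mu_l_prev=1, mu_u_prev=3)
```

    In the toy call, bins 0 and 4 lie outside the band and keep unit gain, bin 1 is boosted by the raw ratio, and bins 2 and 3 hit the upper and lower limits.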
  • In the following, determining a broadband spectral envelope will be described in more detail. An envelope signal corresponding to the received acoustic signal will be determined using a codebook. The codebook used comprises a number of pairs of corresponding narrowband and broadband envelope signals. The codebook has been obtained by training with a large database on the basis of a starting long-term power spectrum (see Y. Linde, A. Buzo, R. M. Gray, An Algorithm for Vector Quantizer Design, IEEE Trans. Comm., vol. COM-28, no. 1, pages 84 - 95, Jan. 1980).
  • As indicated in Figure 2, the codebook entries are adapted in step 206 (block 104). In particular, the narrowband codebook entries c i,s (n) are adapted.
  • This is achieved by starting with the broadband entries of the codebook. If the broadband envelope signals are provided as cepstral vectors $\mathbf{c}_{i,b}(n)$, the corresponding spectra $\mathbf{C}_{i,b}(n)$ are determined. Based on these broadband spectral envelopes, the adapted or optimized narrowband spectra are determined by a multiplication with a weighting matrix:
    $$\mathbf{C}_{i,s}(n) = \mathbf{H}_{\mathrm{mod}}(n)\,\mathbf{C}_{i,b}(n).$$
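  • The weighting matrix is a diagonal matrix of the form:
    $$\mathbf{H}_{\mathrm{mod}}(n) = \begin{bmatrix} H_{\mathrm{mod}}(e^{j\Omega_0}, n) & 0 & \cdots & 0 \\ 0 & H_{\mathrm{mod}}(e^{j\Omega_1}, n) & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & H_{\mathrm{mod}}(e^{j\Omega_{N_{\mathrm{DFT}}-1}}, n) \end{bmatrix}$$
    with the entries
    $$H_{\mathrm{mod}}(e^{j\Omega_\mu}, n) = \begin{cases} 1, & \text{if } \Omega_l(n-1) < \Omega_\mu < \Omega_u(n-1), \\ \dfrac{\hat{S}_{xx,\mathrm{norm}}(\Omega_\mu, n)}{\hat{S}_{bb,\mathrm{norm}}(\Omega_\mu, n)}, & \text{else}, \end{cases}$$
    wherein $\hat{S}_{bb,\mathrm{norm}}(\Omega_\mu, n)$ denotes the normalized long-term power spectrum of the broadband signal used for training the codebook.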
  • Afterwards, cepstral vectors are determined from the resulting spectral narrowband envelopes.
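  • The adaptation of one codebook entry can be sketched as follows. The sketch assumes that the limits are given as bin indices, that the envelopes are magnitude values per bin, and that the weight outside the band is the ratio of the normalized spectrum of the received signal to the normalized training spectrum; the function name is hypothetical.

```python
from typing import List, Sequence


def adapt_narrowband_entry(broadband_env: Sequence[float],
                           psd_rx_norm: Sequence[float],
                           psd_train_norm: Sequence[float],
                           mu_l_prev: int, mu_u_prev: int) -> List[float]:
    """Derive an adapted narrowband envelope from a broadband codebook envelope."""
    adapted = []
    for mu, c_b in enumerate(broadband_env):
        if mu_l_prev < mu < mu_u_prev:
            weight = 1.0                                   # inside the band: unchanged
        elif psd_train_norm[mu] > 0.0:
            weight = psd_rx_norm[mu] / psd_train_norm[mu]  # outside: follow the channel
        else:
            weight = 0.0
        adapted.append(weight * c_b)
    return adapted


adapted = adapt_narrowband_entry([2.0, 2.0, 2.0, 2.0, 2.0],
                                 [0.1, 1.0, 1.0, 1.0, 0.5],
                                 [1.0, 1.0, 1.0, 1.0, 1.0],
                                 mu_l_prev=0, mu_u_prev=4)
```

    The adapted entry thus inherits the roll-off of the actual transmission channel at the band edges instead of the roll-off seen during training.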
  • The conversion from spectral vectors to cepstral vectors and vice versa will be described in the following with respect to step 207 in which broadband spectral envelopes are determined (block 105).
  • A broadband spectral envelope from the codebook matching the acoustic signal best is determined by comparing the narrowband codebook entries with the spectral envelope of the spectrum of the acoustic signal (after equalizing). The narrowband codebook entry having the smallest distance to the acoustic signal spectrum is selected. In principle, different distance criteria can be used. The cepstral distance is particularly useful as the codebook entries are provided in the form of cepstral vectors.
  • When an optimal narrowband codebook entry has been selected, the corresponding broadband codebook entry is determined as the optimal broadband spectral envelope for the received acoustic signal. Due to the adaptation of the narrowband codebook entries as described above, an optimal narrowband envelope can be selected in a very reliable way.
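  • The codebook search can be sketched as a nearest-neighbour lookup. The squared Euclidean distance between cepstral vectors is used here as one common form of a cepstral distance; this choice, the function names and the toy codebook are assumptions.

```python
from typing import List, Sequence, Tuple


def cepstral_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two cepstral vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def select_broadband_entry(signal_cepstrum: Sequence[float],
                           narrowband_entries: Sequence[Sequence[float]],
                           broadband_entries: Sequence[Sequence[float]]
                           ) -> Tuple[int, List[float]]:
    """Pick the closest narrowband entry and return its broadband partner."""
    best = min(range(len(narrowband_entries)),
               key=lambda i: cepstral_distance(signal_cepstrum, narrowband_entries[i]))
    return best, list(broadband_entries[best])


narrow = [[0.0, 0.0], [1.0, 0.5], [-1.0, 0.2]]   # toy adapted narrowband entries
broad = [[0.1, 0.0], [1.2, 0.6], [-0.9, 0.3]]    # corresponding broadband entries
index, c_opt_b = select_broadband_entry([0.9, 0.4], narrow, broad)
```

    The comparison is made only on the narrowband entries; the broadband entry of the winning pair is returned without being compared to the signal itself.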
  • Converting a spectral vector, particularly of the received acoustic signal, to a cepstral vector can be achieved by:
    1. Determining the squared absolute value of each sub-band signal $X_{\mathrm{eq}}(e^{j\Omega_\mu}, n)$.
    2. Applying an inverse discrete Fourier transform to this vector, which results in an estimate of the auto-correlation in the time domain.
    3. Determining prediction coefficients (with an order of about 10 to 20) from the auto-correlation using the Levinson-Durbin algorithm.
    4. Using the prediction coefficients to determine the cepstral coefficients by performing a recursion with respect to the order. Usually, the cepstral order corresponds to one and a half times the prediction order.
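  • The conversion steps listed above can be sketched as follows. The sketch assumes the prediction error filter convention A(z) = 1 + a_1 z^-1 + ... + a_p z^-p and computes the cepstrum of the all-pole envelope 1/A(z) without the gain term c_0; the function names are hypothetical.

```python
import cmath
import math
from typing import List, Sequence


def autocorrelation_from_power(power: Sequence[float], max_lag: int) -> List[float]:
    """Inverse DFT of a power spectrum, evaluated for lags 0..max_lag."""
    n = len(power)
    return [sum(p * cmath.exp(2j * math.pi * mu * lag / n)
                for mu, p in enumerate(power)).real / n
            for lag in range(max_lag + 1)]


def levinson_durbin(r: Sequence[float], order: int) -> List[float]:
    """Return prediction error filter coefficients [1, a_1, ..., a_order]."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # degenerate auto-correlation; keep the coefficients found so far
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient of stage i
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k                  # remaining prediction error power
    return a


def lpc_to_cepstrum(a: Sequence[float], n_ceps: int) -> List[float]:
    """Cepstral coefficients c_1..c_n_ceps of 1/A(z) via the order recursion."""
    p = len(a) - 1
    c = [0.0] * (n_ceps + 1)
    for m in range(1, n_ceps + 1):
        a_m = a[m] if m <= p else 0.0
        acc = sum(k * c[k] * a[m - k] for k in range(max(1, m - p), m))
        c[m] = -a_m - acc / m
    return c[1:]


r_flat = autocorrelation_from_power([1.0] * 8, max_lag=2)   # white spectrum
a = levinson_durbin([1.0, 0.5, 0.25], order=2)              # first-order process
c = lpc_to_cepstrum(a, n_ceps=3)                            # 1.5 times the order
```

    For the auto-correlation 1, 0.5, 0.25 the recursion recovers the single pole at 0.5, and the cepstral coefficients follow the series 0.5^m / m.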
  • The optimal cepstral vector of the broadband codebook is designated by $\mathbf{c}_{\mathrm{opt},b}(n)$. The resulting broadband spectral envelope has the form:
    $$\mathbf{C}_{\mathrm{opt},b}(n) = \left[C_{\mathrm{opt},b}(e^{j\Omega_0}, n),\, C_{\mathrm{opt},b}(e^{j\Omega_1}, n),\, \ldots,\, C_{\mathrm{opt},b}(e^{j\Omega_{N_{\mathrm{DFT}}-1}}, n)\right]^{T}.$$
  • Conversion of cepstral vectors into spectral vectors is achieved by:
    1. Converting the cepstral vectors using a recursion with respect to the order (as above) to obtain prediction error filter coefficients.
    2. Augmenting the prediction error filter vector by a predetermined number of zeros and subsequently performing a discrete Fourier transform, whereby an inverse spectrum is obtained.
    3. Determining the reciprocal of each sub-band component, whereby the vector $\mathbf{C}_{\mathrm{opt},b}(n)$ is generated. Divisions by zero have to be treated separately, for example by adding a suitable constant.
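  • The reverse conversion can be sketched in the same way. The sketch assumes the filter convention A(z) = 1 + a_1 z^-1 + ... + a_p z^-p, returns the magnitude of the envelope, and guards the reciprocal with a small constant `eps`; these choices and the function names are assumptions.

```python
import cmath
import math
from typing import List, Sequence


def cepstrum_to_lpc(c: Sequence[float], order: int) -> List[float]:
    """Invert the LPC-to-cepstrum recursion; c holds c_1, c_2, ... ."""
    a = [1.0] + [0.0] * order
    for m in range(1, order + 1):
        c_m = c[m - 1] if m <= len(c) else 0.0
        acc = sum(k * c[k - 1] * a[m - k] for k in range(1, min(m, len(c) + 1)))
        a[m] = -c_m - acc / m
    return a


def envelope_from_lpc(a: Sequence[float], n_dft: int, eps: float = 1e-12) -> List[float]:
    """Zero-pad [1, a_1, ..., a_p], transform, and take the guarded reciprocal."""
    padded = list(a) + [0.0] * (n_dft - len(a))
    inverse_spectrum = [
        sum(padded[k] * cmath.exp(-2j * math.pi * mu * k / n_dft) for k in range(n_dft))
        for mu in range(n_dft)
    ]
    # eps keeps the reciprocal finite if a bin of the inverse spectrum is zero.
    return [1.0 / (abs(v) + eps) for v in inverse_spectrum]


a = cepstrum_to_lpc([0.5, 0.125, 0.125 / 3.0], order=2)
envelope = envelope_from_lpc(a, n_dft=8)
```

    In the toy example the envelope is largest at bin 0 and smallest at bin 4, as expected for a single pole at 0.5 on the positive real axis.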
  • Fig. 4 illustrates an example of a codebook with four pairs of entries. In each diagram, a corresponding original narrowband envelope, and a corresponding adapted narrowband envelope are shown. The original broadband and narrowband codebook entries have been obtained on the basis of a large database for an ISDN telephone connection. As can be seen in this figure, after the adaptation, the resulting optimized entries have a higher upper limit frequency. This allows for an improved detection of fricatives.
  • In step 208 (block 103), an excitation signal corresponding to the received acoustic signal is generated. This broadband excitation signal shows a spectrally flat envelope. It corresponds to a signal which would be recorded directly behind the vocal cords.
  • For determining a broadband excitation signal, first of all, the spectral envelope of the equalized short-term spectrum $\mathbf{X}_{\mathrm{eq}}(n)$ is estimated in the form of prediction error filter coefficients. Applying an inverse discrete Fourier transform to this spectral vector yields the corresponding time signal. After that, the vector in the time domain is filtered by a prediction error filter, using the filter coefficients determined previously.
  • Then, a non-linear characteristic, such as a full-wave (two-way) rectification or squaring, is applied to the filtered time domain vector. This generates the missing low-frequency and high-frequency signal components. A transformation into the Fourier domain then provides the spectrum of the extended excitation signal $\mathbf{X}_{\mathrm{exc}}(n)$.
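  • The effect of the non-linear characteristic can be illustrated with a single tone. The sketch below is only a demonstration of how squaring and full-wave rectification create components at frequencies absent from the input; a pure cosine stands in for the prediction-error-filtered signal, which is an assumption made to keep the example small.

```python
import cmath
import math
from typing import List, Sequence


def dft_magnitudes(v: Sequence[float]) -> List[float]:
    """Magnitudes of DFT bins 0..N/2 (plain O(N^2) sum)."""
    n = len(v)
    return [abs(sum(v[k] * cmath.exp(-2j * math.pi * mu * k / n) for k in range(n)))
            for mu in range(n // 2 + 1)]


N = 64
tone = [math.cos(2.0 * math.pi * 5 * k / N) for k in range(N)]   # energy in bin 5 only
squared = [s * s for s in tone]                                  # squaring
rectified = [abs(s) for s in tone]                               # full-wave rectification

mag_tone = dft_magnitudes(tone)
mag_squared = dft_magnitudes(squared)
mag_rectified = dft_magnitudes(rectified)
```

    The input has energy in bin 5 only, whereas both non-linear outputs contain a component at bin 0 and at bin 10, i.e. below and above the original tone.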
  • Alternatively, determining an excitation signal can be performed in the time sub-band or Fourier domain as well. Examples for this alternative can be found in B. Iser, G. Schmidt, Bandwidth Extension of Telephony Speech, Eurasip Newsletter, Volume 16, .
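  • In the following step 209 (block 107), the broadband spectral envelope is used for spectrally coloring the excitation signal. This can be achieved by multiplication in the sub-band or Fourier domain:
    $$\tilde{\mathbf{Y}}_{\mathrm{ext}}(n) = \mathrm{diag}\left\{\mathbf{C}_{\mathrm{opt},b}(n)\right\}\,\mathbf{X}_{\mathrm{exc}}(n).$$
  • The diagonal matrix $\mathrm{diag}\{\mathbf{C}_{\mathrm{opt},b}(n)\}$ has the form:
    $$\mathrm{diag}\left\{\mathbf{C}_{\mathrm{opt},b}(n)\right\} = \begin{bmatrix} C_{\mathrm{opt},b}(e^{j\Omega_0}, n) & 0 & \cdots & 0 \\ 0 & C_{\mathrm{opt},b}(e^{j\Omega_1}, n) & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & C_{\mathrm{opt},b}(e^{j\Omega_{N_{\mathrm{DFT}}-1}}, n) \end{bmatrix}.$$
  • Because of the non-linearity or the prediction error filtering when generating the excitation signal, the power of the acoustic signal is not necessarily maintained. Therefore, a power adaptation may be performed, the power-adapted vector being denoted by $\mathbf{X}_{\mathrm{ext}}(n)$:
    $$\mathbf{X}_{\mathrm{ext}}(n) = K(n)\,\tilde{\mathbf{Y}}_{\mathrm{ext}}(n).$$
  • The correction factor K(n) can be chosen to be
    $$K(n) = \frac{\sum_{\mu=\mu_l}^{\mu_u} \left|X_w(e^{j\Omega_\mu}, n)\right|^2}{\sum_{\mu=\mu_l}^{\mu_u} \left|\tilde{Y}_{\mathrm{ext}}(e^{j\Omega_\mu}, n)\right|^2},$$
    wherein $\Omega_{\mu_l}$ and $\Omega_{\mu_u}$ denote the same bandwidth limits as in the estimation of the long-term power spectrum above.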
  • The current bandwidth limits are adapted in step 210 (block 108). According to one possibility, intermediate bandwidth limits $\tilde{\Omega}_l(n)$ and $\tilde{\Omega}_u(n)$ are first determined by comparing the spectrum of the received acoustic signal with the broadband spectral envelope reduced by a predefined constant:
    $$\tilde{\Omega}_l(n) = \min\left\{\Omega_\mu \,:\, \left|X_w(e^{j\Omega_\mu}, n)\right|^2 \ge \left|C_{\mathrm{opt},b}(e^{j\Omega_\mu}, n)\right|^2 + K_C\right\},$$
    $$\tilde{\Omega}_u(n) = \max\left\{\Omega_\mu \,:\, \left|X_w(e^{j\Omega_\mu}, n)\right|^2 \ge \left|C_{\mathrm{opt},b}(e^{j\Omega_\mu}, n)\right|^2 + K_C\right\}.$$
  • The parameter $K_C$ can have the value $K_C = -12\ \mathrm{dB}$.
  • In Fig. 5, an example for determining the bandwidth limits is illustrated. The intermediate limit values above are given by the points of intersection between the lowered broadband spectral envelope and the spectrum of the received acoustic signal.
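  • The determination of the intermediate limits can be sketched as a search for the lowest and highest bin at which the signal power reaches the lowered envelope. The sketch assumes one-sided power spectra, a comparison carried out in dB, and limits returned as bin indices; the function names are hypothetical.

```python
import math
from typing import Optional, Sequence, Tuple


def to_db(power: float, floor: float = 1e-12) -> float:
    """Power in dB, with a floor to avoid log(0)."""
    return 10.0 * math.log10(max(power, floor))


def intermediate_limits(signal_power: Sequence[float],
                        envelope_power: Sequence[float],
                        k_c_db: float = -12.0) -> Optional[Tuple[int, int]]:
    """Lowest and highest bin where the signal reaches the envelope lowered by k_c_db."""
    hits = [mu for mu, (x, c) in enumerate(zip(signal_power, envelope_power))
            if to_db(x) >= to_db(c) + k_c_db]
    if not hits:
        return None   # no bin qualifies, e.g. during a speech pause
    return hits[0], hits[-1]


envelope = [1.0] * 8
signal = [0.001, 0.01, 0.5, 1.0, 1.0, 0.2, 0.01, 0.001]   # band-limited toy spectrum
limits = intermediate_limits(signal, envelope)
```

    With the constant of -12 dB, bins that lie 20 dB or more below the envelope are treated as outside the transmitted band, while the bin at about -7 dB still counts as inside.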
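  • These intermediate limit values, denoted $\tilde{\Omega}_l(n)$ and $\tilde{\Omega}_u(n)$ in the following, may be recursively smoothed to eliminate temporary mal-estimations. In this case, preferably, smoothing is performed only if speech activity is detected in the current signal frame:
    $$\Omega_l(n) = \begin{cases} \beta_{\mathrm{bandl}}\,\Omega_l(n-1) + (1 - \beta_{\mathrm{bandl}})\,\tilde{\Omega}_l(n), & \text{during speech activity}, \\ \Omega_l(n-1), & \text{else}, \end{cases}$$
    $$\Omega_u(n) = \begin{cases} \beta_{\mathrm{bandl}}\,\Omega_u(n-1) + (1 - \beta_{\mathrm{bandl}})\,\tilde{\Omega}_u(n), & \text{during speech activity}, \\ \Omega_u(n-1), & \text{else}. \end{cases}$$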
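  • A short sketch of the smoothing of one bandwidth limit follows. The smoothing constant 0.9 and the limit values in Hz are assumptions for illustration; the text does not state a value for the constant.

```python
def smooth_limit(previous: float, intermediate: float,
                 speech_active: bool, beta: float = 0.9) -> float:
    """First-order recursive smoothing of a bandwidth limit, gated by speech activity."""
    if not speech_active:
        return previous
    return beta * previous + (1.0 - beta) * intermediate


lower = 300.0                                               # current lower limit in Hz
lower_after_outlier = smooth_limit(lower, 400.0, speech_active=True)
lower_in_pause = smooth_limit(lower, 400.0, speech_active=False)
```

    A single outlying intermediate value moves the limit by only a fraction (1 - beta) of the deviation, and the limit stays frozen during speech pauses.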
  • Then, the received acoustic signal is passed through an adaptive band pass filter to retain only components within the current bandwidth limits (block 109) to obtain a spectral vector Y tel (n). Similarly, the spectrally colored excitation signal is passed through a complementary adaptive band stop filter (block 110) so as to obtain a vector Y ext (n).
  • An output signal with a standard bandwidth is generated (step 211) by first summing these two spectral vectors:
    $$\mathbf{Y}(n) = \mathbf{Y}_{\mathrm{tel}}(n) + \mathbf{Y}_{\mathrm{ext}}(n).$$
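  • These vectors are generated as:
    $$\mathbf{Y}_{\mathrm{tel}}(n) = \mathbf{G}_{\mathrm{tel}}(n)\,\mathbf{X}_w(n),$$
    $$\mathbf{Y}_{\mathrm{ext}}(n) = \mathbf{G}_{\mathrm{ext}}(n)\,\mathbf{X}_{\mathrm{ext}}(n),$$
    wherein the weighting matrices $\mathbf{G}_{\mathrm{tel}}(n)$ and $\mathbf{G}_{\mathrm{ext}}(n)$ are diagonal matrices:
    $$\mathbf{G}_{\mathrm{tel}}(n) = \begin{bmatrix} G_{\mathrm{tel}}(e^{j\Omega_0}, n) & 0 & \cdots & 0 \\ 0 & G_{\mathrm{tel}}(e^{j\Omega_1}, n) & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & G_{\mathrm{tel}}(e^{j\Omega_{N_{\mathrm{DFT}}-1}}, n) \end{bmatrix},$$
    $$\mathbf{G}_{\mathrm{ext}}(n) = \begin{bmatrix} G_{\mathrm{ext}}(e^{j\Omega_0}, n) & 0 & \cdots & 0 \\ 0 & G_{\mathrm{ext}}(e^{j\Omega_1}, n) & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & G_{\mathrm{ext}}(e^{j\Omega_{N_{\mathrm{DFT}}-1}}, n) \end{bmatrix}.$$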
  • The elements of the matrix $\mathbf{G}_{\mathrm{tel}}(n)$ are determined as:
    $$G_{\mathrm{tel}}(e^{j\Omega_\mu}, n) = \begin{cases} 1, & \text{if } \Omega_l(n) \le \Omega_\mu \le \Omega_u(n), \\ 0, & \text{else}. \end{cases}$$
  • The weights of the complementary weighting matrix are determined so that the two weighting matrices sum to the identity matrix:
    $$G_{\mathrm{ext}}(e^{j\Omega_\mu}, n) = 1 - G_{\mathrm{tel}}(e^{j\Omega_\mu}, n).$$
  • Alternatively, the transitions at the bandwidth limits can be realized in a smoother way.
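  • The complementary band pass and band stop weighting can be sketched per bin as follows. The sketch assumes one-sided spectra (bins 0 to N/2), limits given as bin indices, and hard 0/1 transitions; for a full DFT vector the mirrored bins would need the same weights.

```python
from typing import List, Sequence


def combine_spectra(x_tel: Sequence[complex], x_ext: Sequence[complex],
                    mu_l: int, mu_u: int) -> List[complex]:
    """Y = G_tel * X_tel + G_ext * X_ext with G_ext = 1 - G_tel per bin."""
    combined = []
    for mu, (tel, ext) in enumerate(zip(x_tel, x_ext)):
        g_tel = 1.0 if mu_l <= mu <= mu_u else 0.0   # band pass for the received signal
        g_ext = 1.0 - g_tel                          # complementary band stop
        combined.append(g_tel * tel + g_ext * ext)
    return combined


y = combine_spectra([1.0, 2.0, 3.0, 4.0, 5.0],
                    [10.0, 20.0, 30.0, 40.0, 50.0],
                    mu_l=1, mu_u=3)
```

    Inside the current bandwidth limits the output equals the received signal, and outside it equals the synthesized extension, so no bin receives contributions from both.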
  • The resulting output spectrum $\mathbf{Y}(n)$ is then transformed into the time domain via an inverse discrete Fourier transform:
    $$\mathbf{y}(n) = \mathrm{IDFT}\left\{\mathbf{Y}(n)\right\},$$
    followed by windowing the resulting vector. Particularly when using the above-indicated values for $N_{\mathrm{ana}}$ and r and a Hann window, this window function can be used again to obtain windowed time domain vectors:
    $$\mathbf{y}_w(n) = \mathbf{F}\,\mathbf{y}(n).$$
  • The resulting time domain vectors are, then, assembled using an overlap add method (as described in K. D. Kammeyer, K. Kroschel, Digitale Signalverarbeitung) to obtain the final output signal y(n).
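  • The synthesis windowing and overlap-add step can be sketched as follows. The example uses a frame length of 16 and a shift of 4, which has the same ratio as N_ana = 256 and r = 64, a periodic Hann window, and frames stored oldest sample first; all of these are assumptions made to keep the demonstration small.

```python
import math
from typing import List, Sequence


def hann_window(n: int) -> List[float]:
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * k / n) for k in range(n)]


def overlap_add(frames: Sequence[Sequence[float]], hop: int,
                window: Sequence[float]) -> List[float]:
    """Apply the synthesis window to each frame and sum the frames at their offsets."""
    n = len(window)
    out = [0.0] * (hop * (len(frames) - 1) + n)
    for i, frame in enumerate(frames):
        for k in range(n):
            out[i * hop + k] += window[k] * frame[k]
    return out


N, HOP = 16, 4
w = hann_window(N)
# Frames of a constant signal equal to 1 after analysis windowing (no spectral changes).
frames = [list(w) for _ in range(8)]
y = overlap_add(frames, HOP, w)
steady_state = y[N - HOP:HOP * len(frames)]
```

    With a Hann window applied at analysis and synthesis and a shift of a quarter frame, the squared windows sum to a constant, here 1.5, so a fixed scaling restores the original level wherever four frames overlap.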
  • In the above-described steps of the method, more complex filter bank systems may be used instead of the conventional discrete Fourier transform and inverse discrete Fourier transform (see, for example, P. P. Vaidyanathan, Multirate Systems and Filter Banks, Prentice Hall, Englewood Cliffs, NJ, USA, 1992).
  • Further alternatives to the above-described variants are possible as well. For example, the steps performed in the Fourier domain may also be performed in the time domain. Furthermore, equalizing the acoustic signal may be performed when adapting the narrowband codebook entries. Also, the above-described equalizing step may be augmented. For example, if an amplification or an attenuation is detected at particular frequencies, it may be adjusted within the bandwidth limits as well. In this case, the output vector Y tel (n) is modified with the weighting matrix H mod(n).
  • In addition to the above-described codebook analysis for estimating the broadband spectral envelopes, a so-called linear mapping (see B. Iser, G. Schmidt, Bandwidth Extension of Telephony Speech) may be used.
  • Further modifications and variations of the present invention will be apparent to those skilled in the art in view of this description. Accordingly, the description is to be construed as illustrative only and is for the purpose of teaching those skilled in the art the general manner of carrying out the present invention. It is to be understood that the forms of the invention shown and described herein are to be taken as the presently preferred embodiments.

Claims (29)

  1. Method for providing an acoustic signal with extended bandwidth, comprising:
    (a) automatically determining a current upper and a current lower bandwidth limit of a received acoustic signal,
    (b) automatically determining at least one complementary signal to complement the received acoustic signal between a predefined lower broadband bandwidth limit and the current lower bandwidth limit and/or between the current upper bandwidth limit and a predefined upper broadband bandwidth limit, wherein the predefined lower broadband bandwidth limit is smaller than the current bandwidth limit and the predefined upper broadband bandwidth limit is larger than the current upper bandwidth limit,
    (c) automatically assembling the at least one complementary signal and the received acoustic signal to obtain an acoustic signal with extended bandwidth.
  2. Method according to claim 1, wherein step (b) comprises determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
  3. Method according to claim 2, wherein step (a) comprises comparing a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal.
  4. Method according to claim 3, wherein the comparing step comprises selecting the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
  5. Method according to one of the claims 2-4, wherein determining a broadband spectral envelope signal comprises selecting an envelope signal from a codebook according to a predetermined criterion.
  6. Method according to claim 5, wherein selecting an envelope signal comprises equalizing the received acoustic signal and selecting an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
  7. Method according to claim 6, wherein
    the codebook comprises pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit, and
    selecting an envelope signal comprises determining a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and selecting the corresponding broadband envelope signal of this pair.
  8. Method according to claim 7, wherein the step of selecting an envelope signal is preceded by providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
  9. Method according to claim 8, wherein the providing step comprises processing broadband codebook envelope signals using a long-term power spectrum of the received acoustic signal.
  10. Method according to one of the claims 2 - 9, wherein determining a broadband excitation signal is based on prediction error filtering and/or a nonlinear characteristic.
  11. Method according to one of claims 2-10, wherein
    the at least one complementary signal is based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and
    step (c) comprises summing the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit.
  12. Method according to one of the preceding claims, wherein at least one of the steps is performed in the cepstral domain.
  13. Method according to one of the preceding claims, wherein steps (a) to (c) are repeated at predetermined time intervals.
  14. Method according to one of the preceding claims, wherein steps (a) to (c) are repeated only if a wanted signal component, in particular, speech activity, is detected in the received acoustic signal.
  15. Computer program product comprising one or more computer readable media having computer-executable instructions for performing the steps of the method of one of the preceding claims when run on a computer.
  16. Apparatus for providing an acoustic signal with extended bandwidth, comprising:
    bandwidth determining means for automatically determining a current upper and a current lower bandwidth limit of a received acoustic signal,
    complementary signal means for automatically determining at least one complementary signal to complement the received acoustic signal between a predefined lower broadband bandwidth limit and the current lower bandwidth limit and/or between the current upper bandwidth limit and a predefined upper broadband bandwidth limit, wherein the predefined lower broadband bandwidth limit is smaller than the current bandwidth limit and the predefined upper broadband bandwidth limit is larger than the current upper bandwidth limit, and
    assembling means for automatically assembling the at least one complementary signal and the received acoustic signal to obtain an acoustic signal with extended bandwidth.
  17. Apparatus according to claim 16, wherein the complementary signal means comprises a means for determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
  18. Apparatus according to claim 17, wherein the bandwidth determining means is configured to compare a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal.
  19. Apparatus according to claim 18, wherein the bandwidth determining means is configured to select the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
  20. Apparatus according to one of the claims 17 - 19, wherein the means for determining a broadband spectral envelope signal comprises a means for selecting an envelope signal from a codebook according to a predetermined criterion.
  21. Apparatus according to claim 20, wherein the means for selecting an envelope signal is configured to equalize the received acoustic signal and select an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
  22. Apparatus according to claim 21, wherein
    the codebook comprises pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit, and
    the means for selecting an envelope signal is configured to determine a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and to select the corresponding broadband envelope signal of this pair.
  23. Apparatus according to claim 22, wherein the means for determining a broadband spectral envelope signal comprises a means for providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
  24. Apparatus according to claim 23, wherein the means for providing is configured to process the broadband codebook envelope signal using a long-term power spectrum of the received acoustic signal.
  25. Apparatus according to one of the claims 17 - 24, wherein the means for determining a broadband excitation signal is configured to determine the broadband excitation signal based on prediction error filtering and/or a nonlinear characteristic.
  26. Apparatus according to one of claims 17 - 25, wherein
    the at least one complementary signal is based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and
    the assembling means is configured to sum the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit.
  27. Apparatus according to one of the claims 16 - 26, wherein at least one of the means is configured to perform at least part of its function in the cepstral domain.
  28. Apparatus according to one of the claims 16 - 27, wherein the means are configured to perform their respective function repeatedly at predetermined time intervals.
  29. Apparatus according to one of the claims 16 - 28, further comprising a wanted signal detector, in particular, a speech detector, and wherein the means are configured to perform their respective function only if a wanted signal component is detected in the received acoustic signal.
EP06017456A 2006-08-22 2006-08-22 Method and system for providing an acoustic signal with extended bandwidth Not-in-force EP1892703B1 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
DE602006009927T DE602006009927D1 (en) 2006-08-22 2006-08-22 Method and system for providing an extended bandwidth audio signal
AT06017456T ATE446572T1 (en) 2006-08-22 2006-08-22 METHOD AND SYSTEM FOR PROVIDING AN EXTENDED BANDWIDTH AUDIO SIGNAL
EP06017456A EP1892703B1 (en) 2006-08-22 2006-08-22 Method and system for providing an acoustic signal with extended bandwidth
CA002596411A CA2596411A1 (en) 2006-08-22 2007-08-08 Method and system for providing an acoustic signal with extended bandwidth
JP2007214930A JP5150165B2 (en) 2006-08-22 2007-08-21 Method and system for providing an acoustic signal with extended bandwidth
KR1020070084306A KR101433833B1 (en) 2006-08-22 2007-08-22 Method and System for Providing an Acoustic Signal with Extended Bandwidth
CN2007101466102A CN101141533B (en) 2006-08-22 2007-08-22 Method and system for providing an acoustic signal with extended bandwidth

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP06017456A EP1892703B1 (en) 2006-08-22 2006-08-22 Method and system for providing an acoustic signal with extended bandwidth

Publications (2)

Publication Number Publication Date
EP1892703A1 EP1892703A1 (en) 2008-02-27
EP1892703B1 EP1892703B1 (en) 2009-10-21

Family

ID=37000103

Family Applications (1)

Application Number Title Priority Date Filing Date
EP06017456A Not-in-force EP1892703B1 (en) 2006-08-22 2006-08-22 Method and system for providing an acoustic signal with extended bandwidth

Country Status (7)

Country Link
EP (1) EP1892703B1 (en)
JP (1) JP5150165B2 (en)
KR (1) KR101433833B1 (en)
CN (1) CN101141533B (en)
AT (1) ATE446572T1 (en)
CA (1) CA2596411A1 (en)
DE (1) DE602006009927D1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010021804A1 (en) * 2008-08-21 2010-02-25 Motorola, Inc. Method and apparatus to facilitate determining signal bounding frequencies
US8527283B2 (en) 2008-02-07 2013-09-03 Motorola Mobility Llc Method and apparatus for estimating high-band energy in a bandwidth extension system
US10121487B2 (en) 2016-11-18 2018-11-06 Samsung Electronics Co., Ltd. Signaling processor capable of generating and synthesizing high frequency recover signal

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8688441B2 (en) 2007-11-29 2014-04-01 Motorola Mobility Llc Method and apparatus to facilitate provision and use of an energy value to determine a spectral envelope shape for out-of-signal bandwidth content
US8433582B2 (en) 2008-02-01 2013-04-30 Motorola Mobility Llc Method and apparatus for estimating high-band energy in a bandwidth extension system
JP2010079275A (en) * 2008-08-29 2010-04-08 Sony Corp Device and method for expanding frequency band, device and method for encoding, device and method for decoding, and program
US8463599B2 (en) 2009-02-04 2013-06-11 Motorola Mobility Llc Bandwidth extension method and apparatus for a modified discrete cosine transform audio coder
US8706497B2 (en) 2009-12-28 2014-04-22 Mitsubishi Electric Corporation Speech signal restoration device and speech signal restoration method
WO2011128723A1 (en) * 2010-04-12 2011-10-20 Freescale Semiconductor, Inc. Audio communication device, method for outputting an audio signal, and communication system
JP6218855B2 (en) * 2013-01-29 2017-10-25 フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー. AUDIO ENCODER, AUDIO DECODER, SYSTEM, METHOD, AND COMPUTER PROGRAM USING INCREASED TEMPERATURE RESOLUTION IN TEMPERATURE PROXIMITY OF ON-SET OR OFFSET OF FLUSION OR BRUSTING
CN107404625B (en) * 2017-07-18 2020-10-16 海信视像科技股份有限公司 Sound effect processing method and device of terminal
KR102093819B1 (en) * 2018-09-10 2020-03-26 한국과학기술연구원 Apparatus and method for separating sound sources

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0944036A1 (en) 1997-04-30 1999-09-22 Nippon Hoso Kyokai Method and device for detecting voice sections, and speech velocity conversion method and device utilizing said method and device
US20020138268A1 (en) 2001-01-12 2002-09-26 Harald Gustafsson Speech bandwidth extension
US6539355B1 (en) * 1998-10-15 2003-03-25 Sony Corporation Signal band expanding method and apparatus and signal synthesis method and apparatus
EP1298643A1 (en) 2000-06-14 2003-04-02 Kabushiki Kaisha Kenwood Frequency interpolating device and frequency interpolating method
WO2005078707A1 (en) * 2004-02-16 2005-08-25 Koninklijke Philips Electronics N.V. A transcoder and method of transcoding therefore
EP1638083A1 (en) 2004-09-17 2006-03-22 Harman Becker Automotive Systems GmbH Bandwidth extension of bandlimited audio signals

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3483958B2 (en) * 1994-10-28 2004-01-06 三菱電機株式会社 Broadband audio restoration apparatus, wideband audio restoration method, audio transmission system, and audio transmission method
JP3713200B2 (en) * 2000-11-30 2005-11-02 株式会社ケンウッド Signal interpolation device, signal interpolation method and recording medium
WO2003019533A1 (en) * 2001-08-24 2003-03-06 Kabushiki Kaisha Kenwood Device and method for interpolating frequency components of signal adaptively
KR20040035749A (en) * 2001-08-31 2004-04-29 코닌클리케 필립스 일렉트로닉스 엔.브이. Bandwidth extension of a sound signal
JP4281349B2 (en) * 2001-12-25 2009-06-17 パナソニック株式会社 Telephone equipment
WO2005055645A1 (en) * 2003-12-01 2005-06-16 Koninklijke Philips Electronics N.V. Selective audio signal enhancement

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ISER B ET AL: "BANDWIDTH EXTENSION OF TELEPHONY SPEECH", EURASIP NEWS LETTER, XX, XX, June 2005 (2005-06-01), pages 1 - 148, XP002372006, ISSN: 1687-1421 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8527283B2 (en) 2008-02-07 2013-09-03 Motorola Mobility Llc Method and apparatus for estimating high-band energy in a bandwidth extension system
WO2010021804A1 (en) * 2008-08-21 2010-02-25 Motorola, Inc. Method and apparatus to facilitate determining signal bounding frequencies
CN102144258B (en) * 2008-08-21 2013-05-01 摩托罗拉移动公司 Method and apparatus to facilitate determining signal bounding frequencies
US8463412B2 (en) 2008-08-21 2013-06-11 Motorola Mobility Llc Method and apparatus to facilitate determining signal bounding frequencies
US10121487B2 (en) 2016-11-18 2018-11-06 Samsung Electronics Co., Ltd. Signaling processor capable of generating and synthesizing high frequency recover signal

Also Published As

Publication number Publication date
CA2596411A1 (en) 2008-02-22
EP1892703B1 (en) 2009-10-21
JP2008052277A (en) 2008-03-06
CN101141533B (en) 2013-09-04
JP5150165B2 (en) 2013-02-20
KR101433833B1 (en) 2014-08-27
CN101141533A (en) 2008-03-12
KR20080018132A (en) 2008-02-27
ATE446572T1 (en) 2009-11-15
DE602006009927D1 (en) 2009-12-03

Similar Documents

Publication Publication Date Title
EP1892703B1 (en) Method and system for providing an acoustic signal with extended bandwidth
US7035797B2 (en) Data-driven filtering of cepstral time trajectories for robust speech recognition
USRE43191E1 (en) Adaptive Weiner filtering using line spectral frequencies
US5706395A (en) Adaptive weiner filtering using a dynamic suppression factor
CA2210490C (en) Spectral subtraction noise suppression method
US7216074B2 (en) System for bandwidth extension of narrow-band speech
CN1750124B (en) Bandwidth extension of band limited audio signals
US8706497B2 (en) Speech signal restoration device and speech signal restoration method
US6988066B2 (en) Method of bandwidth extension for narrow-band speech
EP1970900A1 (en) Method and apparatus for providing a codebook for bandwidth extension of an acoustic signal
US8392184B2 (en) Filtering of beamformed speech signals
EP1686565B1 (en) Bandwidth extension of bandlimited speech data
EP1918910A1 (en) Model-based enhancement of speech signals
JPH0916194A (en) Noise reduction for voice signal
JPH10307599A (en) Waveform interpolating voice coding using spline
US5806022A (en) Method and system for performing speech recognition
EP1927981B1 (en) Spectral refinement of audio signals
JPH10319996A (en) Efficient decomposition of noise and periodic signal waveform in waveform interpolation
US7603271B2 (en) Speech coding apparatus with perceptual weighting and method therefor
JP3183104B2 (en) Noise reduction device
Puder Kalman‐filters in subbands for noise reduction with enhanced pitch‐adaptive speech model estimation
Thomas et al. Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech
López-Espejo et al. On Speech Pre-emphasis as a Simple and Inexpensive Method to Boost Speech Enhancement
CN115527550A (en) Single-microphone subband domain noise reduction method and system

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA HR MK YU

17P Request for examination filed

Effective date: 20080318

17Q First examination report despatched

Effective date: 20080416

AKX Designation fees paid

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REF Corresponds to:

Ref document number: 602006009927

Country of ref document: DE

Date of ref document: 20091203

Kind code of ref document: P

LTIE Lt: invalidation of european patent or patent extension

Effective date: 20091021

NLV1 Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100222

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100201

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100221

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: BE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100121

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

26N No opposition filed

Effective date: 20100722

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100122

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100831

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100831

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100831

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100822

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 602006009927

Country of ref document: DE

Representative's name: GRUENECKER, KINKELDEY, STOCKMAIR & SCHWANHAEUS, DE

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 602006009927

Country of ref document: DE

Representative's name: GRUENECKER PATENT- UND RECHTSANWAELTE PARTG MB, DE

Effective date: 20120411

Ref country code: DE

Ref legal event code: R081

Ref document number: 602006009927

Country of ref document: DE

Owner name: NUANCE COMMUNICATIONS, INC. (N.D.GES.D. STAATE, US

Free format text: FORMER OWNER: HARMAN BECKER AUTOMOTIVE SYSTEMS GMBH, 76307 KARLSBAD, DE

Effective date: 20120411

Ref country code: DE

Ref legal event code: R082

Ref document number: 602006009927

Country of ref document: DE

Representative's name: GRUENECKER, KINKELDEY, STOCKMAIR & SCHWANHAEUS, DE

Effective date: 20120411

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100822

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100422

REG Reference to a national code

Ref country code: FR

Ref legal event code: TP

Owner name: NUANCE COMMUNICATIONS, INC., US

Effective date: 20120924

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20091021

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 11

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 12

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 13

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20180824

Year of fee payment: 13

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20180831

Year of fee payment: 13

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20181031

Year of fee payment: 13

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 602006009927

Country of ref document: DE

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20190822

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200303

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20190831

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20190822