EP1892703A1 - Method and system for providing an acoustic signal with extended bandwidth - Google Patents
Method and system for providing an acoustic signal with extended bandwidth Download PDFInfo
- Publication number
- EP1892703A1 EP1892703A1 EP06017456A EP06017456A EP1892703A1 EP 1892703 A1 EP1892703 A1 EP 1892703A1 EP 06017456 A EP06017456 A EP 06017456A EP 06017456 A EP06017456 A EP 06017456A EP 1892703 A1 EP1892703 A1 EP 1892703A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- signal
- broadband
- bandwidth
- bandwidth limit
- acoustic signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 52
- 230000000295 complement effect Effects 0.000 claims abstract description 38
- 230000003595 spectral effect Effects 0.000 claims description 60
- 238000001228 spectrum Methods 0.000 claims description 57
- 230000007774 longterm Effects 0.000 claims description 40
- 230000005284 excitation Effects 0.000 claims description 38
- 230000000694 effects Effects 0.000 claims description 8
- 238000012545 processing Methods 0.000 claims description 7
- 238000001914 filtration Methods 0.000 claims description 5
- 238000004590 computer program Methods 0.000 claims description 2
- 230000008569 process Effects 0.000 claims description 2
- 239000013598 vector Substances 0.000 description 58
- 239000011159 matrix material Substances 0.000 description 13
- 238000005070 sampling Methods 0.000 description 12
- 230000006978 adaptation Effects 0.000 description 8
- 238000012549 training Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 5
- 230000003190 augmentative effect Effects 0.000 description 4
- 238000009499 grossing Methods 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000013507 mapping Methods 0.000 description 3
- 230000003044 adaptive effect Effects 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 2
- 238000004040 coloring Methods 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 210000001260 vocal cord Anatomy 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
Definitions
- the invention is directed to a method and a system for providing an acoustic signal, in particular a speech signal, with extended bandwidth.
- Acoustic signals transmitted via an analog or digital signal path usually suffer from the drawback that the signal path only has a restricted bandwidth such that the transmitted acoustic signal differs considerably from the original signal. For example, in the case of conventional telephone connections, a sampling rate of 8 kHz is used resulting in a maximal signal bandwidth of 4 kHz. Compared to the case of audio CD's, the speech and audio quality is significantly reduced.
- the bandwidth of telephone connections could be increased by using broadband or wideband digital coding and decoding methods (so-called broadband codecs).
- broadband codecs wideband digital coding and decoding methods
- both the transmitter and the receiver have to support corresponding coding and decoding methods which would require the implementation of a new standard.
- systems for bandwidth extension can be used as described, for example, in P. Jax, Enhancement of Bandlimited Speech Signals: Algorithms and Theoretical Bounds, Dissertation, Aachen, Germany, 2002 or E. Larsen, R. M. Aarts, Audio Bandwidth Extension, Wiley, Hoboken, NJ, USA, 2004 .
- These systems are to be implemented on the receiver's side only such that existing telephone connections do not have to be changed.
- the missing frequency components of an input signal with small bandwidth are estimated and added to the input signal.
- Fig. 6 An example of the structure and the corresponding signal flow in such a state of the art bandwidth extension system is illustrated in Fig. 6.
- Fig. 6 An example of the structure and the corresponding signal flow in such a state of the art bandwidth extension system is illustrated in Fig. 6.
- both the lower and the upper frequency ranges are re-synthesized.
- an incoming or received acoustic signal x ( n ) in digitized form is processed by sub-sampling and block extraction so as to obtain signal vectors x ( n ).
- the variable n denotes the time.
- the bandwidth extension is performed only within the missing frequency ranges.
- the extension concerns low frequency (for example from 0 to 300 Hz) and/or high frequency (for example 3400 Hz to half of the desired sampling rate) ranges.
- a narrowband spectral envelope is extracted from the narrowband signal, the narrowband signal being restricted by the bandwidth restrictions of the telephone channel.
- a corresponding broadband envelope signal is estimated from the narrowband envelope.
- the mappings are based, for example, on codebook pairs (see J. Epps, W. H. Holmes, A New Technique for Wideband Enhancement of Coded Narrowband Speech, IEEE Workshop on Speech Coding, Conference proceedings, pages 174 to 176 June 1999 ) or on Neural Networks (see J.-M. Valin R. Lefebvre, Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding, IEEE Workshop on Speech Coding, Conference Proceedings, pages 130 to 132, September 2000 ). In these methods, the entries of the codebooks or the weights of the neural networks are generated using training methods requiring large processor and memory resources.
- a broadband or wideband excitation signal having a spectrally flat envelope is generated from the narrowband signal.
- This excitation signal corresponds to the signal which would be recorded directly behind the vocal cords, i.e. the excitation signal contains information about voicing and pitch, but not about form and structures or the spectral shaping in general.
- the excitation signal has to be weighted with the spectral envelope.
- non-linear characteristics see U. Kornagel, Spectral Widening of the Excitation Signal for Telephone-Band Speech Enhancement, IWAENC 01, Conference Proceedings, pages 215 to 218, September 2001
- two-ray rectifying or squaring for example.
- the excitation signal x exc ( n ) is spectrally colored using the envelope in block 604.
- the spectral ranges used for the extension are extracted using a band stop filter in block 606 resulting in signal vectors y ext ( n ).
- the band stop filter can be effective, for example, in the range from 200 to 3700 Hz.
- the signal vectors x(n) of the received signal are passed through a complementary band pass filter in block 605. Then, the signal components y ext ( n ) and y tel ( n ) are added to obtain a signal vector y ( n ) with extended bandwidth. In block 607, the different signal vectors are assembled again and an over-sampling is performed resulting in a signal y (n).
- a method for providing an acoustic signal with extended bandwidth comprising:
- the method according to the invention allows an adaptation of the bandwidth extension to the acoustic signal actually received. For example, when the transmitter uses an ISDN telephone, a broader frequency range is used compared to the case of a mobile phone with a hands-free system. Therefore, the bandwidth of a received acoustic signal will be extended only in those ranges where it is necessary so that the quality of the resulting signal is very high.
- the received acoustic signal may be a digital signal or may be digitized.
- steps (a) to (c) may be preceded by the step of converting the received acoustic signal to a predetermined sampling rate.
- steps (a) to (c) may be preceded by the step of extracting a signal vector from the acoustic signal, in particular, the converted acoustic signal.
- the signal vector may be obtained by sub-sampling the acoustic signal and may comprise a predefined number of entries. Then, subsequent (in time) signal vectors may overlap. The use of signal vectors simplifies further processing of the signals.
- Steps (a) to (c) may be preceded by the step of determining a spectral vector of the received acoustic signal.
- a window function may be applied to signal vectors of the received acoustic signal.
- a Hann or a Hamming window may be used (see K. D. Kammeyer, K. Kroschel, Digitale Signaltechnik, 4 th Edition, Teubner, Stuttgart, Germany 1997 ).
- Signal vectors, in particular the signal vectors weighted in this way may be transformed into the Fourier domain using a discrete Fourier transform.
- the resulting vector is a short-term spectral vector. This allows for further processing in the Fourier domain.
- step (b) may comprise determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
- Such a decomposition into an envelope signal and an excitation signal simplifies determining the current bandwidth limits and increases the accuracy when determining a complementary signal.
- Step (a) may comprise comparing a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal. It turned out that the long-term power spectrum is a suitable basis for determining current bandwidth limits of the acoustic signal.
- determining a complementary signal in step (b) based on these current bandwidth limits and comprising determination of an envelope signal enables to iteratively adapt the current bandwidth limits by comparing again the (newly) determined envelope signal and a long-term power spectrum.
- determining current bandwidth limits in step (a) may use a spectral envelope signal determined according to step (b), particularly in a preceding step or in a preceding iteration of the method.
- determining a long-term power spectrum may comprise performing a first order recursive smoothing of the absolute values squared of the sub-band signals corresponding to the acoustic signal. This can be done, in particular, only if a wanted signal, such as a speech signal, has been detected in the received acoustic signal.
- the long-term power spectrum may be normalized, particularly with respect to a long-term power spectrum within predetermined frequency limits.
- the long-term power spectrum may be determined in the time domain. This can be done by determining the auto-correlation and performing an LPC analysis to obtain corresponding prediction coefficients.
- the comparing step may comprise selecting the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
- the predetermined constant can be chosen based on empirical or theoretical data.
- the predetermined constant may be negative.
- determining a broadband spectral envelope signal may comprise selecting an envelope signal from a codebook according to a predetermined criterion.
- codebooks By using codebooks, the required computing power can be reduced for determining an envelope signal.
- different kinds of criteria can be used when selecting an envelope signal from a codebook.
- using a predetermined distance criterion such as a cepstral distance can be used, particularly if the codebook entries have the form of cepstral vectors.
- selecting an envelope signal may comprise equalizing the received acoustic signal and selecting an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
- Equalizing the acoustic signal allows to modify it such that a comparison with envelope signals from the codebook can be simplified.
- the received acoustic signal can be equalized in such a way that the resulting signal shows a long-term power spectrum corresponding to the long-term power spectrum of the signal used for training the codebook.
- Equalizing can be restricted to frequencies between the current upper and lower bandwidth limits of the received acoustic signal; outside these limits, the signal may remain unchanged.
- equalizing the received acoustic signal can be performed using a normalized long-term power spectrum of the signal used for training the codebooks, particularly using the normalized long-term power spectrum divided by the normalized long-term power spectrum of the received acoustic signal itself.
- the codebook may comprise pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit, and selecting an envelope signal may comprise determining a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and selecting the corresponding broadband envelope signal of this pair.
- the received acoustic signal When using a cepstral distance to select an envelope signal, the received acoustic signal, particularly in its equalized form, has to be transformed into the cepstral domain.
- the step of selecting an envelope signal can further comprise the steps of determining the absolute value squared of the sub-band signals of the received acoustic signal, determining an auto-correlation in the time domain, particularly by performing an inverse discrete Fourier transform on the vector of the absolute value squared, determining prediction coefficients, particularly using the Levinson-Durbin algorithm, performing a recursion to obtain the cepstral coefficients.
- the method may further comprise the steps of recursively transforming a cepstral vector into prediction error coefficients, augmenting the prediction error filter vector by adding a predetermined number of zeros and subsequently performing a discrete Fourier transform to obtain an inverse spectrum, determining the reciprocal of each sub-band component to obtain a spectral envelope vector.
- the step of selecting an envelope signal may be preceded by providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
- Such an adaptation of the codebook entries allows for an improved selection of a corresponding envelope signal from the codebook.
- the adaptation would result in envelope signals in the codebook having an extended bandwidth. In this way, particularly fricatives can be more reliably detected.
- the providing step may comprise processing broadband codebook envelope signals using a long-term power spectrum of the received acoustic signal.
- the long-term power spectrum may be normalized; furthermore, the long-term power spectrum of the received acoustic signal may be divided by a normalized long-term power spectrum of a broadband signal used for training of the codebook.
- the processing of the broadband codebook envelope signals may be performed only for frequencies outside the current bandwidth limits; within the bandwidth limits, the envelope signals may remain unchanged.
- Processing using the long-term power spectrum may comprise weighting broadband codebook envelope signal vectors using the long-term power spectrum of the received acoustic signal.
- determining a broadband excitation signal may be based on prediction error filtering and/or a non-linear characteristic. In this way, suitable excitation signals can be generated. Possible non-linear characteristics are disclosed, for example, in U. Kornagel, Spectral Widening of the Excitation Signal for Telephone-Band Speech Enhancement .
- the at least one complementary signal may be based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and step (c) may comprise summing the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit.
- the complementary signal is based on spectrally coloring the excitation signal using the envelope signal.
- Step (c) may also comprise adapting the power of the complementary signal and/or the received acoustic signal. With this step, the power of the received acoustic signal can be maintained.
- At least one of the steps may be performed in the cepstral domain. Particularly if the entries of the codebook are cepstral vectors, this allows for performing the method in a simpler way.
- Steps (a) to (c) of the above methods may be repeated at predetermined time intervals. Then, the repeated adaptation to the currently received acoustic signal leads to a permanent high quality of the resulting broadband signal.
- Steps (a) to (c) of the above methods may be repeated only if a wanted signal component, such as speech activity, is detected in the received acoustic signal.
- a wanted signal component such as speech activity
- an extension of the bandwidth of the received acoustic signal is advantageous.
- restricting the method to the case of detected speech activity reduces the required computing power and avoids the presence of artifacts due to mal-adaptation.
- the invention also provides a computer program product comprising one or more computer-readable media having computer-executable instructions for performing the steps of the above-described methods when run on a computer.
- an apparatus for providing an acoustic signal with extended bandwidth comprising:
- such an apparatus provides an advantageous way to extend the bandwidth of a received acoustic signal.
- the quality of the resulting output signal is increased compared to the case of bandwidth extension systems with fixed parameters.
- the complementary signal means may comprise a means for determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
- the bandwidth determining means may be configured to compare a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal.
- the bandwidth determining means may be configured to select the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
- the means for determining a broadband spectral envelope signal may comprise a means for selecting an envelope signal from a codebook according to a predetermined criterion.
- the means for selecting an envelope signal may be configured to equalize the received acoustic signal and select an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
- the codebook may comprise pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit
- the means for selecting an envelope signal may be configured to determine a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and to select the corresponding broadband envelope signal of this pair.
- the means for determining a broadband spectral envelope signal may comprise a means for providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
- the means for providing may be configured to process the broadband codebook envelope signal using a long-term power spectrum of the received acoustic signal.
- the means for determining a broadband excitation signal may be configured to determine the broadband excitation signal based on prediction error filtering and/or a non-linear characteristic.
- the at least one complementary signal may be based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and the assembling means may be configured to sum the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit.
- At least one of the means may be configured to perform at least part of its function in the cepstral domain.
- the means of the above-described apparatus may be configured to perform their respective function repeatedly at predetermined time intervals.
- the apparatus may further comprise a wanted signal detector, in particular, a speech detector, and the means may be configured to perform their respective function only if a wanted signal component is detected in the received acoustic signal.
- a wanted signal detector in particular, a speech detector
- Fig. 1 shows the structure of the signal flow in an apparatus for providing an acoustic signal with extended bandwidth.
- Fig. 2 is a flow diagram illustrating an example of a method for providing an acoustic signal with extended bandwidth which could be performed by the apparatus corresponding to Fig. 1. In view of this, Fig.'s 1 and 2 will be described in the following simultaneously.
- an acoustic signal such as a speech signal
- a telephone line Because of the restricted bandwidth of the telephone line, an extension of the bandwidth is desired to improve the signal quality.
- the signal is to be augmented so as to obtain a predetermined broader bandwidth. It is to be understood that the method described in the following can be used for bandwidth extension independent of the type of incoming signal and independent of the type of transmission line, i.e., it need not be a telephone line.
- the acoustic signal x(n) received by block 101 has already been pre-processed by increasing the sampling rate up to the predetermined broadband or wideband bandwidth. In this way, however, no additional frequency components are generated. This can be achieved, for example, by using suitable anti-aliasing or anti-imaging filters.
- This kind of bandwidth extension preferably, is performed only for the "missing" frequency ranges; in the case of an analog telephone line, these ranges may be between 0 and 300 Hz and 3400 Hz up to half of the desired sampling rate, for example, up to 3700 Hz.
- signal vectors x ( n ) are generated (step 202). This can be achieved by taking every r sampling values up to a certain length.
- the elements of this matrix can be chosen corresponding to different kinds of windows. Typical windows are the Hann or Hamming window.
- the resulting short-term spectral vector has the form: X w n [ X e j ⁇ ⁇ 0 ⁇ n , X e j ⁇ ⁇ 1 ⁇ n , ... , X e j ⁇ ⁇ ⁇ ⁇ n , ... , X ⁇ e j ⁇ ⁇ N DFT - 1 ⁇ n ⁇ ] T , wherein ⁇ ⁇ denotes the frequency variable.
- a long-term power spectrum of the received acoustic signal is determined in block 102 (step 204).
- 2 diring speech activity S ⁇ xx ⁇ ⁇ ⁇ , n - 1 , else .
- the time constant ⁇ fre is chosen to be close to 1 (0 ⁇ ⁇ fre ⁇ 1) so as to obtain a sufficiently large averaging time.
- the recursive smoothing according to the first line of the above equation may be performed continuously. However, in order to avoid any artefacts, it may be performed only if a wanted signal component is present in the received acoustic signal, for example, if speech activity is detected.
- a speech detector may be provided as described, for example, in E. Hänsler, G. Schmidt, Acoustic Echo and Noise Control - A Practical Approach, Wiley, Hoboken, NJ, USA, 2004 .
- the band limits ⁇ ⁇ l and ⁇ ⁇ u denote the lower and upper limits of a predefined frequency band.
- this frequency band may correspond to a telephone band with minimal bandwidth for which the present method is to be used, for example, the limits may be 400 Hz and 3300 Hz.
- the limits correspond to a band which is smaller or at most equal to the frequency band of the narrow frequency band within which the codebook described below has been trained; these limits being denoted by ⁇ l and ⁇ u .
- an estimation can be performed in the time domain as well. For this purpose, an auto-correlation is estimated for about 10 to 20 sampling cycles of offset. Afterwards, prediction coefficients can be determined using an LPC (linear predictive coding) analysis.
- LPC linear predictive coding
- the acoustic signal is equalized.
- ⁇ l ( n -1 ) and ⁇ u (n -1) denote the current lower and upper bandwidth limits of the received acoustic signal.
- the bandwidth limits at time ( n -1) are taken as the current bandwidth limits.
- S x ⁇ x ⁇ ,norm ( ⁇ ⁇ , n ) denotes the normalized long-term power spectrum of the broadband signal which has been used for training the codebook. Normalizing of such a power spectrum is performed analogously to the case of the long-term power spectrum of the received acoustic signal described above. An example for such a normalized long-term power spectrum used for training a codebook is shown in Figure 3.
- the acoustic signal is equalized only within the current bandwidth limits one time step before. Outside these bandwidth limits, no equalizing takes place.
- An envelope signal corresponding to the received acoustic signal will be determined using a codebook.
- the used codebook comprises a number of pairs of corresponding narrowband and broadband envelope signals.
- the codebook has been obtained by training with a large database on the basis of a starting long-term power spectrum (see Y. Linde, A. Buzo, R. M. Gray, An Algorithm for Vector Quantizer Design, IEEE Trans. Comm., vol. COM-28, no. 1, pages 84 - 95, Jan. 1980 ).
- the codebook entries are adapted in step 206 (block 104).
- the narrowband codebook entries c i,s ( n ) are adapted.
- the broadband envelope signals are provided as cepstral vectors c i,b ( n )
- the corresponding spectra C i,b ( n ) are determined.
- cepstral vectors are determined from the resulting spectral narrowband envelopes.
- step 207 The conversion from spectral vectors to cepstral vectors and vice versa will be described in the following with respect to step 207 in which broadband spectral envelopes are determined (block 105).
- a broadband spectral envelope from the codebook matching the acoustic signal best is determined by comparing the narrowband codebook entries with the spectral envelope of the spectrum of the acoustic signal (after equalizing).
- the narrowband codebook entry is selected that has the smallness distance to the acoustic signal spectrum. In principle, different distance criteria can be used.
- the cepstral distance is particularly useful as the codebook entries are provided in the form of cepstral vectors.
- the corresponding broadband codebook entry is determined as the optimal broadband spectral envelope for the received acoustic signal. Due to the adaptation of the narrowband codebook entries as described above, an optimal narrowband envelope can be selected in a very reliable way.
- Converting a spectral vector, particularly of the received acoustic signal, to a cepstral vector can be achieved by:
- the optimal cepstral vector of the broadband codebook is designated by c opt,b ( n ).
- Fig. 4 illustrates an example of a codebook with four pairs of entries.
- a corresponding original narrowband envelope, and a corresponding adapted narrowband envelope are shown.
- the original broadband and narrowband codebook entries have been obtained on the basis of a large database for an ISDN telephone connection.
- the resulting optimized entries have a higher upper limit frequency. This allows for an improved detection of fricatives.
- step 208 an excitation signal corresponding to the received acoustic signal is generated.
- This broadband excitation signal shows a spectrally flat envelope. It corresponds to a signal which would be recorded directly behind the vocal cords.
- the spectral envelope of the equalized short-term spectrum X eq ( n ) is estimated in the form of prediction error filter coefficients. Applying an inverse discrete Fourier transform on this spectral vector allows to determine the corresponding time signal. After that, the vector in the time domain is filtered by a prediction error filter. The corresponding filter coefficients are those that have been determined previously.
- a non-linear characteristic such as a two-way rectification or squaring, is applied to the filtered time domain vector. This generates the missing low frequency and high frequency signal components.
- a transformation in the Fourier domain provides, then, the spectrum of the extended excitation signal X exc ( n ) .
- determining an excitation signal can be performed in the time sub-band or Fourier domain as well. Examples for this alternative can be found in B. Iser, G. Schmidt, Bandwidth Extension of Telephony Speech, Eurasip Newsletter, Volume 16, Number 2, pages 2 to 24, June 2005 .
- 2 ⁇ ⁇ ⁇ l ⁇ u Y erw ( e j ⁇ ⁇ ⁇ , n ⁇ ) 2 wherein ⁇ ⁇ l and ⁇ ⁇ u denote the same bandwidth limits as in the estimation of the long-term power spectrum above.
- the current bandwidth limits are adapted in step 210 (block 108).
- 2 ⁇ C opt , b ( e j ⁇ ⁇ ⁇ , n ⁇ ) 2 + K C , ⁇ u n min ⁇ ⁇ ⁇
- Fig. 5 an example for determining the bandwidth limits is illustrated.
- the above, intermediate limit values are given by the points of intersection between the lowered broadband spectral envelope and the spectrum of the received acoustic signal.
- These intermediate limit values may be recursively smoothed to eliminate temporary mal-estimations.
- smoothing is performed only if speech activity is detected in the current signal frame.
- the received acoustic signal is passed through an adaptive band pass filter to retain only components within the current bandwidth limits (block 109) to obtain a spectral vector Y tel ( n ).
- the spectrally colored excitation signal is passed through a complementary adaptive band stop filter (block 110) so as to obtain a vector Y ext ( n ).
- Y tel n G tel n ⁇ X w n
- Y ext n G ext n ⁇ X ext n
- the weighting matrices G tel ( n ) and G ext ( n ) are diagonal matrices:
- G tel n G tel e j ⁇ ⁇ 0 ⁇ n 0 ... 0 0
- G ext n G ext e j ⁇ ⁇ 0 ⁇ n 0 ... 0 0
- the transitions at the bandwidth limits can be realized in a smoother way.
- the resulting time domain vectors are, then, assembled using an overlap add method (as described in K. D. Kammeyer, K. Kroschel, Digitale Signalmaschine ) to obtain the final output signal y ( n ).
- the steps performed in the Fourier domain may also be performed in the time domain.
- equalizing the acoustic signal may be performed when adapting the narrowband codebook entries.
- the above-described equalizing step may be augmented. For example, if an amplification or an attenuation is detected at particular frequencies, it may be adjusted within the bandwidth limits as well. In this case, the output vector Y tel ( n ) is modified with the weighting matrix H mod ( n ).
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Quality & Reliability (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
Abstract
Description
- The invention is directed to a method and a system for providing an acoustic signal, in particular a speech signal, with extended bandwidth.
- Acoustic signals transmitted via an analog or digital signal path usually suffer from the drawback that the signal path only has a restricted bandwidth such that the transmitted acoustic signal differs considerably from the original signal. For example, in the case of conventional telephone connections, a sampling rate of 8 kHz is used resulting in a maximal signal bandwidth of 4 kHz. Compared to the case of audio CD's, the speech and audio quality is significantly reduced.
- Furthermore, many kinds of transmissions show additional bandwidth restrictions. In the case of an analog telephone connection, only frequencies between 300 Hz and 3.4 kHz are transmitted. As a result, only 3.1 kHz bandwidth are available.
- In principle, the bandwidth of telephone connections could be increased by using broadband or wideband digital coding and decoding methods (so-called broadband codecs). In such a case, however, both the transmitter and the receiver have to support corresponding coding and decoding methods which would require the implementation of a new standard.
- As an alternative, systems for bandwidth extension can be used as described, for example, in P. Jax, Enhancement of Bandlimited Speech Signals: Algorithms and Theoretical Bounds, Dissertation, Aachen, Germany, 2002 or E. Larsen, R. M. Aarts, Audio Bandwidth Extension, Wiley, Hoboken, NJ, USA, 2004. These systems are to be implemented on the receiver's side only such that existing telephone connections do not have to be changed. In these systems, the missing frequency components of an input signal with small bandwidth are estimated and added to the input signal.
- An example of the structure and the corresponding signal flow in such a state of the art bandwidth extension system is illustrated in Fig. 6. In general, both the lower and the upper frequency ranges are re-synthesized.
- At
block 601, an incoming or received acoustic signal x(n) in digitized form is processed by sub-sampling and block extraction so as to obtain signal vectors x(n). Here, the variable n denotes the time. In this Figure, it is assumed that the incoming signal x(n) has already been converted to the desired bandwidth by increasing the sampling rate. In this conversion step, no additional frequency components are to be generated which can be achieved, for example, by using appropriate anti-aliasing or anti-imaging filter elements. ln order to not amend the transmitted signal, the bandwidth extension is performed only within the missing frequency ranges. Depending on the transmission method, the extension concerns low frequency (for example from 0 to 300 Hz) and/or high frequency (for example 3400 Hz to half of the desired sampling rate) ranges. - In
block 602, a narrowband spectral envelope is extracted from the narrowband signal, the narrowband signal being restricted by the bandwidth restrictions of the telephone channel. Via a non-linear mapping, a corresponding broadband envelope signal is estimated from the narrowband envelope. The mappings are based, for example, on codebook pairs (see J. Epps, W. H. Holmes, A New Technique for Wideband Enhancement of Coded Narrowband Speech, IEEE Workshop on Speech Coding, Conference proceedings, pages 174 to 176 June 1999) or on Neural Networks (see J.-M. Valin R. Lefebvre, Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding, IEEE Workshop on Speech Coding, Conference Proceedings, pages 130 to 132, September 2000). In these methods, the entries of the codebooks or the weights of the neural networks are generated using training methods requiring large processor and memory resources. - Furthermore, in
block 603, a broadband or wideband excitation signal having a spectrally flat envelope is generated from the narrowband signal. This excitation signal corresponds to the signal which would be recorded directly behind the vocal cords, i.e. the excitation signal contains information about voicing and pitch, but not about form and structures or the spectral shaping in general. Thus, to retrieve a complete signal, such as a speech signal, the excitation signal has to be weighted with the spectral envelope. For the generation of excitation signals, non-linear characteristics (see U. Kornagel, Spectral Widening of the Excitation Signal for Telephone-Band Speech Enhancement, IWAENC 01, Conference Proceedings, pages 215 to 218, September 2001) such as two-ray rectifying or squaring, for example, may be used. - For bandwidth extension, the excitation signal x exc (n) is spectrally colored using the envelope in
block 604. After that, the spectral ranges used for the extension are extracted using a band stop filter inblock 606 resulting in signal vectors y ext (n). The band stop filter can be effective, for example, in the range from 200 to 3700 Hz. - The signal vectors x(n) of the received signal are passed through a complementary band pass filter in
block 605. Then, the signal components y ext ( n ) and y tel ( n ) are added to obtain a signal vector y(n) with extended bandwidth. Inblock 607, the different signal vectors are assembled again and an over-sampling is performed resulting in a signal y(n). - In these prior art systems, the elements and their parameters are implemented once and, then, remain unchanged. Thus, all incoming acoustic signals are treated the same way. In view of this, it is an object underlying the present invention to provide a more flexible method and apparatus for providing an acoustic signal with extended bandwidth.
- This problem is solved by the method according to
claim 1 and the apparatus according to claim 16. - In accordance with the invention, a method for providing an acoustic signal with extended bandwidth is provided, comprising:
- (a) automatically determining a current upper and a current lower bandwidth limit of received acoustic signal,
- (b) automatically determining at least one complementary signal to complement the received acoustic signal between a predefined lower broadband bandwidth limit and the current lower bandwidth limit and/or between the current upper bandwidth limit and a predefined upper broadband bandwidth limit, wherein the predefined lower broadband bandwidth limit is smaller than the current bandwidth limit and the predefined upper broadband bandwidth limit is larger than the current upper bandwidth limit,
- (c) automatically assembling the at least one complementary signal and the received acoustic signal to obtain an acoustic signal with extended bandwidth.
- By determining current upper and lower bandwidth limits of a received acoustic signal and determining a complementary signal between the current bandwidth limits and the respective predefined broadband (or wideband) bandwidth limits, the method according to the invention allows an adaptation of the bandwidth extension to the acoustic signal actually received. For example, when the transmitter uses an ISDN telephone, a broader frequency range is used compared to the case of a mobile phone with a hands-free system. Therefore, the bandwidth of a received acoustic signal will be extended only in those ranges where it is necessary so that the quality of the resulting signal is very high.
- In this way, on the one hand, no spectral gaps will occur even if the received signal covers only a very narrow frequency range. On the other hand, when receiving signals covering a relatively broad frequency range, no frequencies are cut-off when determining the complementary signal.
- The received acoustic signal may be a digital signal or may be digitized. In the above method, steps (a) to (c) may be preceded by the step of converting the received acoustic signal to a predetermined sampling rate. Furthermore, steps (a) to (c) may be preceded by the step of extracting a signal vector from the acoustic signal, in particular, the converted acoustic signal. The signal vector may be obtained by sub-sampling the acoustic signal and may comprise a predefined number of entries. Then, subsequent (in time) signal vectors may overlap. The use of signal vectors simplifies further processing of the signals.
- Steps (a) to (c) may be preceded by the step of determining a spectral vector of the received acoustic signal. In particular, a window function may be applied to signal vectors of the received acoustic signal. For example, a Hann or a Hamming window may be used (see K. D. Kammeyer, K. Kroschel, Digitale Signalverarbeitung, 4 th Edition, Teubner, Stuttgart, Germany 1997). Signal vectors, in particular the signal vectors weighted in this way, may be transformed into the Fourier domain using a discrete Fourier transform. The resulting vector is a short-term spectral vector. This allows for further processing in the Fourier domain.
- In the above methods, step (b) may comprise determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
- Such a decomposition into an envelope signal and an excitation signal simplifies determining the current bandwidth limits and increases the accuracy when determining a complementary signal.
- Step (a) may comprise comparing a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal. It turned out that the long-term power spectrum is a suitable basis for determining current bandwidth limits of the acoustic signal.
- Thus, if current bandwidth limits have been determined in step (a) in this way using a broadband spectral envelope signal of the received acoustic signal, determining a complementary signal in step (b) based on these current bandwidth limits and comprising determination of an envelope signal enables to iteratively adapt the current bandwidth limits by comparing again the (newly) determined envelope signal and a long-term power spectrum. In other words, determining current bandwidth limits in step (a) may use a spectral envelope signal determined according to step (b), particularly in a preceding step or in a preceding iteration of the method.
- In particular, if the received acoustic signal has been transformed into the Fourier domain, determining a long-term power spectrum may comprise performing a first order recursive smoothing of the absolute values squared of the sub-band signals corresponding to the acoustic signal. This can be done, in particular, only if a wanted signal, such as a speech signal, has been detected in the received acoustic signal.
- In addition, the long-term power spectrum may be normalized, particularly with respect to a long-term power spectrum within predetermined frequency limits.
- Alternatively, the long-term power spectrum may be determined in the time domain. This can be done by determining the auto-correlation and performing an LPC analysis to obtain corresponding prediction coefficients.
- The comparing step may comprise selecting the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
- This is a particularly simple and reliable way to determine the bandwidth limits. The predetermined constant can be chosen based on empirical or theoretical data. The predetermined constant may be negative.
- In the above methods, determining a broadband spectral envelope signal may comprise selecting an envelope signal from a codebook according to a predetermined criterion.
- By using codebooks, the required computing power can be reduced for determining an envelope signal. In principle, different kinds of criteria can be used when selecting an envelope signal from a codebook. In particular, using a predetermined distance criterion such as a cepstral distance can be used, particularly if the codebook entries have the form of cepstral vectors.
- In particular, selecting an envelope signal may comprise equalizing the received acoustic signal and selecting an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
- Equalizing the acoustic signal allows to modify it such that a comparison with envelope signals from the codebook can be simplified. In particular, the received acoustic signal can be equalized in such a way that the resulting signal shows a long-term power spectrum corresponding to the long-term power spectrum of the signal used for training the codebook. Equalizing can be restricted to frequencies between the current upper and lower bandwidth limits of the received acoustic signal; outside these limits, the signal may remain unchanged. In particular, equalizing the received acoustic signal can be performed using a normalized long-term power spectrum of the signal used for training the codebooks, particularly using the normalized long-term power spectrum divided by the normalized long-term power spectrum of the received acoustic signal itself.
- The codebook may comprise pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit, and selecting an envelope signal may comprise determining a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and selecting the corresponding broadband envelope signal of this pair.
- In this way, a simple comparison between the received acoustic signal and the elements of the codebook can be performed as narrowband signals usually match a received acoustic signal with a narrow bandwidth more closely.
- When using a cepstral distance to select an envelope signal, the received acoustic signal, particularly in its equalized form, has to be transformed into the cepstral domain. Thus, the step of selecting an envelope signal can further comprise the steps of determining the absolute value squared of the sub-band signals of the received acoustic signal, determining an auto-correlation in the time domain, particularly by performing an inverse discrete Fourier transform on the vector of the absolute value squared, determining prediction coefficients, particularly using the Levinson-Durbin algorithm, performing a recursion to obtain the cepstral coefficients.
- In order to determine a spectral envelope from the cepstral vectors, the method may further comprise the steps of recursively transforming a cepstral vector into prediction error coefficients, augmenting the prediction error filter vector by adding a predetermined number of zeros and subsequently performing a discrete Fourier transform to obtain an inverse spectrum, determining the reciprocal of each sub-band component to obtain a spectral envelope vector.
- In the above methods, the step of selecting an envelope signal may be preceded by providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
- Such an adaptation of the codebook entries allows for an improved selection of a corresponding envelope signal from the codebook. In particular, if the received acoustic signal shows a broader bandwidth than the original narrowband envelope signals of the codebook, the adaptation would result in envelope signals in the codebook having an extended bandwidth. In this way, particularly fricatives can be more reliably detected.
- The providing step may comprise processing broadband codebook envelope signals using a long-term power spectrum of the received acoustic signal.
- Due to the use of the power spectrum of the received acoustic signal, a suitable adaptation to the acoustic signal can be obtained. The long-term power spectrum may be normalized; furthermore, the long-term power spectrum of the received acoustic signal may be divided by a normalized long-term power spectrum of a broadband signal used for training of the codebook. The processing of the broadband codebook envelope signals may be performed only for frequencies outside the current bandwidth limits; within the bandwidth limits, the envelope signals may remain unchanged. Processing using the long-term power spectrum may comprise weighting broadband codebook envelope signal vectors using the long-term power spectrum of the received acoustic signal.
- In the above methods, determining a broadband excitation signal may be based on prediction error filtering and/or a non-linear characteristic. In this way, suitable excitation signals can be generated. Possible non-linear characteristics are disclosed, for example, in U. Kornagel, Spectral Widening of the Excitation Signal for Telephone-Band Speech Enhancement.
- In the above methods, the at least one complementary signal may be based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and step (c) may comprise summing the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit.
- Thus, the complementary signal is based on spectrally coloring the excitation signal using the envelope signal. By adding a complementary signal only outside the current bandwidth limits of the received acoustic signal, artifacts are avoided in the resulting signal with extended bandwidth.
- Step (c) may also comprise adapting the power of the complementary signal and/or the received acoustic signal. With this step, the power of the received acoustic signal can be maintained.
- In the above-described methods, at least one of the steps may be performed in the cepstral domain. Particularly if the entries of the codebook are cepstral vectors, this allows for performing the method in a simpler way.
- Steps (a) to (c) of the above methods may be repeated at predetermined time intervals. Then, the repeated adaptation to the currently received acoustic signal leads to a permanent high quality of the resulting broadband signal.
- Steps (a) to (c) of the above methods may be repeated only if a wanted signal component, such as speech activity, is detected in the received acoustic signal. Particularly in the case of speech signals, an extension of the bandwidth of the received acoustic signal is advantageous. Thus, restricting the method to the case of detected speech activity reduces the required computing power and avoids the presence of artifacts due to mal-adaptation.
- The invention also provides a computer program product comprising one or more computer-readable media having computer-executable instructions for performing the steps of the above-described methods when run on a computer.
- Furthermore, an apparatus for providing an acoustic signal with extended bandwidth is provided, comprising:
- bandwidth determining means for automatically determining a current upper and a current lower bandwidth limit of a received acoustic signal,
- complementary signal means for automatically determining at least one complementary signal to complement the received acoustic signal between a predefined lower broadband bandwidth limit and the current lower bandwidth limit and/or between the current upper bandwidth limit and a predefined upper broadband bandwidth limit, wherein the predefined lower broadband bandwidth limit is smaller than
- the current bandwidth limit and the predefined upper broadband bandwidth limit is larger than the current upper bandwidth limit, and
- assembling means for automatically assembling the at least one complementary signal and the received acoustic signal to obtain an acoustic signal with extended bandwidth.
- Analogous to the above-described method, such an apparatus provides an advantageous way to extend the bandwidth of a received acoustic signal. In particular, due to the determination of current upper and lower bandwidth limits of the received acoustic signal and a corresponding determination of a complementary signal, the quality of the resulting output signal is increased compared to the case of bandwidth extension systems with fixed parameters.
- The complementary signal means may comprise a means for determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
- The bandwidth determining means may be configured to compare a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal.
- The bandwidth determining means may be configured to select the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
- In the above-described apparatus, the means for determining a broadband spectral envelope signal may comprise a means for selecting an envelope signal from a codebook according to a predetermined criterion.
- The means for selecting an envelope signal may be configured to equalize the received acoustic signal and select an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
- In the above-described apparatus, the codebook may comprise pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit, and the means for selecting an envelope signal may be configured to determine a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and to select the corresponding broadband envelope signal of this pair.
- The means for determining a broadband spectral envelope signal may comprise a means for providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
- The means for providing may be configured to process the broadband codebook envelope signal using a long-term power spectrum of the received acoustic signal.
- In the above-described apparatus, the means for determining a broadband excitation signal may be configured to determine the broadband excitation signal based on prediction error filtering and/or a non-linear characteristic.
- The at least one complementary signal may be based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and the assembling means may be configured to sum the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit.
- In the above-described apparatus, at least one of the means may be configured to perform at least part of its function in the cepstral domain.
- The means of the above-described apparatus may be configured to perform their respective function repeatedly at predetermined time intervals.
- The apparatus may further comprise a wanted signal detector, in particular, a speech detector, and the means may be configured to perform their respective function only if a wanted signal component is detected in the received acoustic signal.
- Further features and advantages of the invention will be described in the following with reference to the figures.
- Fig. 1
- illustrates the structure of an example of an apparatus for providing an acoustic signal with extended bandwidth;
- Fig. 2
- is a flow diagram of an example of a method for providing an acoustic signal with extended bandwidth;
- Fig. 3
- illustrates an example of a normalized long-term power spectrum for training a codebook;
- Fig 4
- illustrates examples of codebook entries;
- Fig. 5
- illustrates the determination of current bandwidth limits;
- Fig. 6
- illustrates the structure of a prior art system.
- Fig. 1 shows the structure of the signal flow in an apparatus for providing an acoustic signal with extended bandwidth. Fig. 2 is a flow diagram illustrating an example of a method for providing an acoustic signal with extended bandwidth which could be performed by the apparatus corresponding to Fig. 1. In view of this, Fig.'s 1 and 2 will be described in the following simultaneously.
- According to step 201, an acoustic signal, such as a speech signal, is received via a telephone line. Because of the restricted bandwidth of the telephone line, an extension of the bandwidth is desired to improve the signal quality. Thus, the signal is to be augmented so as to obtain a predetermined broader bandwidth. It is to be understood that the method described in the following can be used for bandwidth extension independent of the type of incoming signal and independent of the type of transmission line, i.e., it need not be a telephone line.
- The acoustic signal x(n) received by
block 101 has already been pre-processed by increasing the sampling rate up to the predetermined broadband or wideband bandwidth. In this way, however, no additional frequency components are generated. This can be achieved, for example, by using suitable anti-aliasing or anti-imaging filters. This kind of bandwidth extension, preferably, is performed only for the "missing" frequency ranges; in the case of an analog telephone line, these ranges may be between 0 and 300 Hz and 3400 Hz up to half of the desired sampling rate, for example, up to 3700 Hz. -
-
-
-
-
-
- Based on the spectral vectors, a long-term power spectrum of the received acoustic signal is determined in block 102 (step 204). There are different possibilities to estimate such a long-term power spectrum. According to one alternative, a first order recursive smoothing is performed on the absolute value squared of the sub-band signals X(e jΩ
µ , n) : - Preferably, the time constant β fre is chosen to be close to 1 (0 << β fre < 1) so as to obtain a sufficiently large averaging time.
- In principle, the recursive smoothing according to the first line of the above equation may be performed continuously. However, in order to avoid any artefacts, it may be performed only if a wanted signal component is present in the received acoustic signal, for example, if speech activity is detected. For this purpose, a speech detector may be provided as described, for example, in E. Hänsler, G. Schmidt, Acoustic Echo and Noise Control - A Practical Approach, Wiley, Hoboken, NJ, USA, 2004.
-
- The band limits Ωµ l and Ωµ u denote the lower and upper limits of a predefined frequency band. For example, this frequency band may correspond to a telephone band with minimal bandwidth for which the present method is to be used, for example, the limits may be 400 Hz and 3300 Hz. Preferably, the limits correspond to a band which is smaller or at most equal to the frequency band of the narrow frequency band within which the codebook described below has been trained; these limits being denoted by Ω l and Ω u .
- Alternatively, to determine the long-term power spectrum in the frequency domain, an estimation can be performed in the time domain as well. For this purpose, an auto-correlation is estimated for about 10 to 20 sampling cycles of offset. Afterwards, prediction coefficients can be determined using an LPC (linear predictive coding) analysis. The long-term power spectrum is obtained via a discrete Fourier transform and a division.
-
-
- In the equations above,
Ω l (n -1) andΩ u (n -1) denote the current lower and upper bandwidth limits of the received acoustic signal. Thus, for obtaining an updated equalized signal, the bandwidth limits at time (n-1) are taken as the current bandwidth limits. Furthermore, Sx̃ x̃ ,norm (Ωµ, n) denotes the normalized long-term power spectrum of the broadband signal which has been used for training the codebook. Normalizing of such a power spectrum is performed analogously to the case of the long-term power spectrum of the received acoustic signal described above. An example for such a normalized long-term power spectrum used for training a codebook is shown in Figure 3. -
- As can be seen from the above, the acoustic signal is equalized only within the current bandwidth limits one time step before. Outside these bandwidth limits, no equalizing takes place.
- In the following, determining a broadband spectrum envelope will be described in more detail. An envelope signal corresponding to the received acoustic signal will be determined using a codebook. The used codebook comprises a number of pairs of corresponding narrowband and broadband envelope signals. The codebook has been obtained by training with a large database on the basis of a starting long-term power spectrum (see Y. Linde, A. Buzo, R. M. Gray, An Algorithm for Vector Quantizer Design, IEEE Trans. Comm., vol. COM-28, no. 1, pages 84 - 95, Jan. 1980).
- As indicated in Figure 2, the codebook entries are adapted in step 206 (block 104). In particular, the narrowband codebook entries c i,s (n) are adapted.
- This is achieved by starting with the broadband entries of the codebook. If the broadband envelope signals are provided as cepstral vectors c i,b (n), the corresponding spectra Ci,b (n) are determined. Based on these broadband spectral envelopes, the adapted or optimized narrowband spectra are determined by a multiplication with a weighting matrix:
-
- Afterwards, cepstral vectors are determined from the resulting spectral narrowband envelopes.
- The conversion from spectral vectors to cepstral vectors and vice versa will be described in the following with respect to step 207 in which broadband spectral envelopes are determined (block 105).
- A broadband spectral envelope from the codebook matching the acoustic signal best is determined by comparing the narrowband codebook entries with the spectral envelope of the spectrum of the acoustic signal (after equalizing). The narrowband codebook entry is selected that has the smallness distance to the acoustic signal spectrum. In principle, different distance criteria can be used. The cepstral distance is particularly useful as the codebook entries are provided in the form of cepstral vectors.
- When an optimal narrowband codebook entry has been selected, the corresponding broadband codebook entry is determined as the optimal broadband spectral envelope for the received acoustic signal. Due to the adaptation of the narrowband codebook entries as described above, an optimal narrowband envelope can be selected in a very reliable way.
- Converting a spectral vector, particularly of the received acoustic signal, to a cepstral vector can be achieved by:
- 1. Determining the absolute value squared of each sub-band signal Xeq (e jΩ
µ , n). - 2. Applying an inverse discrete Fourier transform on this vector results in an estimation of the auto-correlation in the time domain.
- 3. Using the Levinson-Durbin algorithm, prediction coefficients (with an order of about 10 to 20) can be determined from the auto-correlation.
- 4. By performing a recursion with respect to the order, the prediction coefficients are used to determine the cepstral coefficients. Usually, the order corresponds to one and a half of the prediction order.
-
- Conversion of cepstral vectors into spectral vectors is achieved by:
- 1. Converting the cepstral vectors using a recursion with respect to the order (as above) to obtain prediction error filter coefficients.
- 2. By augmenting the prediction error filter vector by a predetermined number of zeros and subsequent performing of a discrete Fourier transform, an inverse spectrum is obtained.
- 3. By determining the reciprocal of each sub-band component, the vector C opt,b (n) is generated. Divisions by zero have to be treated separately, for example by adding a suitable constant.
- Fig. 4 illustrates an example of a codebook with four pairs of entries. In each diagram, a corresponding original narrowband envelope, and a corresponding adapted narrowband envelope are shown. The original broadband and narrowband codebook entries have been obtained on the basis of a large database for an ISDN telephone connection. As can be seen in this figure, after the adaptation, the resulting optimized entries have a higher upper limit frequency. This allows for an improved detection of fricatives.
- In step 208 (block 103), an excitation signal corresponding to the received acoustic signal is generated. This broadband excitation signal shows a spectrally flat envelope. It corresponds to a signal which would be recorded directly behind the vocal cords.
- For determining a broadband excitation signal, first of all, the spectral envelope of the equalized short-term spectrum X eq (n) is estimated in the form of prediction error filter coefficients. Applying an inverse discrete Fourier transform on this spectral vector allows to determine the corresponding time signal. After that, the vector in the time domain is filtered by a prediction error filter. The corresponding filter coefficients are those that have been determined previously.
- Then, a non-linear characteristic, such as a two-way rectification or squaring, is applied to the filtered time domain vector. This generates the missing low frequency and high frequency signal components. A transformation in the Fourier domain provides, then, the spectrum of the extended excitation signal X exc (n).
- Alternatively, determining an excitation signal can be performed in the time sub-band or Fourier domain as well. Examples for this alternative can be found in B. Iser, G. Schmidt, Bandwidth Extension of Telephony Speech, Eurasip Newsletter, Volume 16, .
-
-
-
-
-
-
- In Fig. 5, an example for determining the bandwidth limits is illustrated. The above, intermediate limit values are given by the points of intersection between the lowered broadband spectral envelope and the spectrum of the received acoustic signal.
-
- Then, the received acoustic signal is passed through an adaptive band pass filter to retain only components within the current bandwidth limits (block 109) to obtain a spectral vector Y tel (n). Similarly, the spectrally colored excitation signal is passed through a complementary adaptive band stop filter (block 110) so as to obtain a vector Y ext (n).
-
-
-
-
- Alternatively, the transitions at the bandwidth limits can be realized in a smoother way.
- The resulting output spectrum Y(n), then, is transformed into the time domain via an inverse Fourier transform:
- The resulting time domain vectors are, then, assembled using an overlap add method (as described in K. D. Kammeyer, K. Kroschel, Digitale Signalverarbeitung) to obtain the final output signal y(n).
- In the above-described steps of the method, more complex filter bank systems may be used instead of the conventional discrete Fourier transform and inverse discrete Fourier transform (see, for example, P. P. Vaidyanathan, Multirate Systems and Filter Banks, Prentice Hall, Englewood Cliffs, NJ, USA, 1992).
- Further alternatives to the above-described variants are possible as well. For example, the steps performed in the Fourier domain may also be performed in the time domain. Furthermore, equalizing the acoustic signal may be performed when adapting the narrowband codebook entries. Also, the above-described equalizing step may be augmented. For example, if an amplification or an attenuation is detected at particular frequencies, it may be adjusted within the bandwidth limits as well. In this case, the output vector Y tel (n) is modified with the weighting matrix H mod(n).
- In addition to the above-described codebook analysis for estimating the broadband spectral envelopes, a so-called linear mapping (see B. Iser, G. Schmidt, Bandwidth Extension of Telephony Speech) may be used additionally.
- Further modifications and variations of the present invention will be apparent to those skilled in the art in view of this description. Accordingly, the description is to be construed as illustrated only and is for the purpose of teaching those skilled in the art the general manner of carrying out the present invention. It is to be understood that the forms of the invention shown and described herein are to be taken as the presently preferred embodiments.
Claims (29)
- Method for providing an acoustic signal with extended bandwidth, comprising:(a) automatically determining a current upper and a current lower bandwidth limit of a received acoustic signal,(b) automatically determining at least one complementary signal to complement the received acoustic signal between a predefined lower broadband bandwidth limit and the current lower bandwidth limit and/or between the current upper bandwidth limit and a predefined upper broadband bandwidth limit, wherein the predefined lower broadband bandwidth limit is smaller than the current bandwidth limit and the predefined upper broadband bandwidth limit is larger than the current upper bandwidth limit,(c) automatically assembling the at least one complementary signal and the received acoustic signal to obtain an acoustic signal with extended bandwidth.
- Method according to claim 1, wherein step (b) comprises determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
- Method according claim 2, wherein step (a) comprises comparing a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal.
- Method according to claim 3, wherein the comparing step comprises selecting the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
- Method according to one of the claims 2-4, wherein determining a broadband spectral envelope signal comprises selecting an envelope signal from a codebook according to a predetermined criterion.
- Method according to claim 5, wherein selecting an envelope signal comprises equalizing the received acoustic signal and selecting an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
- Method according to claim 6, wherein
the codebook comprises pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit, and
selecting an envelope signal comprises determining a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and selecting the corresponding broadband envelope signal of this pair. - Method according to claim 7, wherein the step of selecting an envelope signal is preceded by providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
- Method according to claim 8, wherein the providing step comprises processing broadband codebook envelope signals using a long-term power spectrum of the received acoustic signal.
- Method according to one of the claims 2 - 9, wherein determining a broadband excitation signal is based on prediction error filtering and/or a nonlinear characteristic.
- Method according to one of claims 2-10, wherein
the at least one complementary signal is based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and
step (c) comprises summing the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit. - Method according to one of the preceding claims, wherein at least one of the steps is performed in the cepstral domain.
- Method according to one of the preceding claims, wherein steps (a) to (c) are repeated at predetermined time intervals.
- Method according to one of the preceding claims, wherein steps (a) to (c) are repeated only if a wanted signal component, in particular, speech activity, is detected in the received acoustic signal.
- Computer program product comprising one or more computer readable media having computer-executable instructions for performing the steps of the method of one of the preceding claims when run on a computer.
- Apparatus for providing an acoustic signal with extended bandwidth, comprising:bandwidth determining means for automatically determining a current upper and a current lower bandwidth limit of a received acoustic signal,complementary signal means for automatically determining at least one complementary signal to complement the received acoustic signal between a predefined lower broadband bandwidth limit and the current lower bandwidth limit and/or between the current upper bandwidth limit and a predefined upper broadband bandwidth limit, wherein the predefined lower broadband bandwidth limit is smaller than the current bandwidth limit and the predefined upper broadband bandwidth limit is larger than the current upper bandwidth limit, andassembling means for automatically assembling the at least one complementary signal and the received acoustic signal to obtain an acoustic signal with extended bandwidth.
- Apparatus according to claim 16, wherein the complementary signal means comprises a means for determining a broadband spectral envelope signal and a broadband excitation signal between the lower and upper broadband bandwidth limits such that the product of spectral envelope signal and excitation signal corresponds to the received acoustic signal according to a predetermined criterion.
- Apparatus according to claim 17, wherein the bandwidth determining means is configured to compare a determined broadband spectral envelope signal and a long-term power spectrum of the received acoustic signal.
- Apparatus according to claim 18, wherein the bandwidth determining means is configured to select the minimal and maximal frequency for which the long-term power spectrum is larger than or equal to the power spectrum of the determined broadband spectral envelope signal plus a predetermined constant.
- Apparatus according to one of the claims 17 - 19, wherein the means for determining a broadband spectral envelope signal comprises a means for selecting an envelope signal from a codebook according to a predetermined criterion.
- Apparatus according to claim 20, wherein the means for selecting an envelope signal is configured to equalize the received acoustic signal and select an envelope signal from the codebook having minimal distance to the equalized acoustic signal according to a predetermined distance criterion, in particular, having a minimal cepstral distance.
- Apparatus according to claim 21, wherein
the codebook comprises pairs of corresponding envelope signals, each pair comprising a broadband envelope signal between the lower and upper broadband bandwidth limits and a corresponding narrowband envelope signal between a lower narrowband bandwidth limit being larger than the lower broadband bandwidth limit and an upper narrowband bandwidth limit being smaller than the upper broadband bandwidth limit, and
the means for selecting an envelope signal is configured to determine a narrowband envelope signal having minimal distance to the equalized acoustic signal according to the predetermined distance criterion and to select the corresponding broadband envelope signal of this pair. - Apparatus according to claim 22, wherein the means for determining a broadband spectral envelope signal comprises a means for providing adapted narrowband codebook envelope signals being adapted to the current lower and upper bandwidth limits.
- Apparatus according to claim 23, wherein the means for providing is configured to process the broadband codebook envelope signal using a long-term power spectrum of the received acoustic signal.
- Apparatus according to one of the claims 17 - 24, wherein the means for determining a broadband excitation signal is configured to determine the broadband excitation signal based on prediction error filtering and/or a nonlinear characteristic.
- Apparatus according to one of claims 17 - 25, wherein
the at least one complementary signal is based on a product of the determined broadband spectral envelope and the determined broadband excitation signal, and
the assembling means is configured to sum the received acoustic signal between the current lower and upper bandwidth limits and the at least one complementary signal being restricted to the band between the lower broadband bandwidth limit and a current lower bandwidth limit and/or to the band between the current upper bandwidth limit and the upper broadband bandwidth limit. - Apparatus according to one of the claims 16 - 26, wherein at least one of the means is configured to perform at least part of its function in the cepstral domain.
- Apparatus according to one of the claims 16 - 27, wherein the means are configured to perform their respective function repeatedly at predetermined time intervals.
- Apparatus according to one of the claims 16 - 28, further comprising a wanted signal detector, in particular, a speech detector, and wherein the means are configured to perform their respective function only if a wanted signal component is detected in the received acoustic signal.
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE602006009927T DE602006009927D1 (en) | 2006-08-22 | 2006-08-22 | Method and system for providing an extended bandwidth audio signal |
AT06017456T ATE446572T1 (en) | 2006-08-22 | 2006-08-22 | METHOD AND SYSTEM FOR PROVIDING AN EXTENDED BANDWIDTH AUDIO SIGNAL |
EP06017456A EP1892703B1 (en) | 2006-08-22 | 2006-08-22 | Method and system for providing an acoustic signal with extended bandwidth |
CA002596411A CA2596411A1 (en) | 2006-08-22 | 2007-08-08 | Method and system for providing an acoustic signal with extended bandwidth |
JP2007214930A JP5150165B2 (en) | 2006-08-22 | 2007-08-21 | Method and system for providing an acoustic signal with extended bandwidth |
KR1020070084306A KR101433833B1 (en) | 2006-08-22 | 2007-08-22 | Method and System for Providing an Acoustic Signal with Extended Bandwidth |
CN2007101466102A CN101141533B (en) | 2006-08-22 | 2007-08-22 | Method and system for providing an acoustic signal with extended bandwidth |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP06017456A EP1892703B1 (en) | 2006-08-22 | 2006-08-22 | Method and system for providing an acoustic signal with extended bandwidth |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1892703A1 true EP1892703A1 (en) | 2008-02-27 |
EP1892703B1 EP1892703B1 (en) | 2009-10-21 |
Family
ID=37000103
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP06017456A Not-in-force EP1892703B1 (en) | 2006-08-22 | 2006-08-22 | Method and system for providing an acoustic signal with extended bandwidth |
Country Status (7)
Country | Link |
---|---|
EP (1) | EP1892703B1 (en) |
JP (1) | JP5150165B2 (en) |
KR (1) | KR101433833B1 (en) |
CN (1) | CN101141533B (en) |
AT (1) | ATE446572T1 (en) |
CA (1) | CA2596411A1 (en) |
DE (1) | DE602006009927D1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010021804A1 (en) * | 2008-08-21 | 2010-02-25 | Motorola, Inc. | Method and apparatus to facilitate determining signal bounding frequencies |
US8527283B2 (en) | 2008-02-07 | 2013-09-03 | Motorola Mobility Llc | Method and apparatus for estimating high-band energy in a bandwidth extension system |
US10121487B2 (en) | 2016-11-18 | 2018-11-06 | Samsung Electronics Co., Ltd. | Signaling processor capable of generating and synthesizing high frequency recover signal |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8688441B2 (en) | 2007-11-29 | 2014-04-01 | Motorola Mobility Llc | Method and apparatus to facilitate provision and use of an energy value to determine a spectral envelope shape for out-of-signal bandwidth content |
US8433582B2 (en) | 2008-02-01 | 2013-04-30 | Motorola Mobility Llc | Method and apparatus for estimating high-band energy in a bandwidth extension system |
JP2010079275A (en) * | 2008-08-29 | 2010-04-08 | Sony Corp | Device and method for expanding frequency band, device and method for encoding, device and method for decoding, and program |
US8463599B2 (en) | 2009-02-04 | 2013-06-11 | Motorola Mobility Llc | Bandwidth extension method and apparatus for a modified discrete cosine transform audio coder |
US8706497B2 (en) | 2009-12-28 | 2014-04-22 | Mitsubishi Electric Corporation | Speech signal restoration device and speech signal restoration method |
WO2011128723A1 (en) * | 2010-04-12 | 2011-10-20 | Freescale Semiconductor, Inc. | Audio communication device, method for outputting an audio signal, and communication system |
JP6218855B2 (en) * | 2013-01-29 | 2017-10-25 | フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー. | AUDIO ENCODER, AUDIO DECODER, SYSTEM, METHOD, AND COMPUTER PROGRAM USING INCREASED TEMPERATURE RESOLUTION IN TEMPERATURE PROXIMITY OF ON-SET OR OFFSET OF FLUSION OR BRUSTING |
CN107404625B (en) * | 2017-07-18 | 2020-10-16 | 海信视像科技股份有限公司 | Sound effect processing method and device of terminal |
KR102093819B1 (en) * | 2018-09-10 | 2020-03-26 | 한국과학기술연구원 | Apparatus and method for separating sound sources |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0944036A1 (en) | 1997-04-30 | 1999-09-22 | Nippon Hoso Kyokai | Method and device for detecting voice sections, and speech velocity conversion method and device utilizing said method and device |
US20020138268A1 (en) | 2001-01-12 | 2002-09-26 | Harald Gustafsson | Speech bandwidth extension |
US6539355B1 (en) * | 1998-10-15 | 2003-03-25 | Sony Corporation | Signal band expanding method and apparatus and signal synthesis method and apparatus |
EP1298643A1 (en) | 2000-06-14 | 2003-04-02 | Kabushiki Kaisha Kenwood | Frequency interpolating device and frequency interpolating method |
WO2005078707A1 (en) * | 2004-02-16 | 2005-08-25 | Koninklijke Philips Electronics N.V. | A transcoder and method of transcoding therefore |
EP1638083A1 (en) | 2004-09-17 | 2006-03-22 | Harman Becker Automotive Systems GmbH | Bandwidth extension of bandlimited audio signals |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3483958B2 (en) * | 1994-10-28 | 2004-01-06 | 三菱電機株式会社 | Broadband audio restoration apparatus, wideband audio restoration method, audio transmission system, and audio transmission method |
JP3713200B2 (en) * | 2000-11-30 | 2005-11-02 | 株式会社ケンウッド | Signal interpolation device, signal interpolation method and recording medium |
WO2003019533A1 (en) * | 2001-08-24 | 2003-03-06 | Kabushiki Kaisha Kenwood | Device and method for interpolating frequency components of signal adaptively |
KR20040035749A (en) * | 2001-08-31 | 2004-04-29 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | Bandwidth extension of a sound signal |
JP4281349B2 (en) * | 2001-12-25 | 2009-06-17 | パナソニック株式会社 | Telephone equipment |
WO2005055645A1 (en) * | 2003-12-01 | 2005-06-16 | Koninklijke Philips Electronics N.V. | Selective audio signal enhancement |
-
2006
- 2006-08-22 EP EP06017456A patent/EP1892703B1/en not_active Not-in-force
- 2006-08-22 AT AT06017456T patent/ATE446572T1/en not_active IP Right Cessation
- 2006-08-22 DE DE602006009927T patent/DE602006009927D1/en active Active
-
2007
- 2007-08-08 CA CA002596411A patent/CA2596411A1/en not_active Abandoned
- 2007-08-21 JP JP2007214930A patent/JP5150165B2/en not_active Expired - Fee Related
- 2007-08-22 CN CN2007101466102A patent/CN101141533B/en active Active
- 2007-08-22 KR KR1020070084306A patent/KR101433833B1/en active IP Right Grant
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0944036A1 (en) | 1997-04-30 | 1999-09-22 | Nippon Hoso Kyokai | Method and device for detecting voice sections, and speech velocity conversion method and device utilizing said method and device |
US6539355B1 (en) * | 1998-10-15 | 2003-03-25 | Sony Corporation | Signal band expanding method and apparatus and signal synthesis method and apparatus |
EP1298643A1 (en) | 2000-06-14 | 2003-04-02 | Kabushiki Kaisha Kenwood | Frequency interpolating device and frequency interpolating method |
US20020138268A1 (en) | 2001-01-12 | 2002-09-26 | Harald Gustafsson | Speech bandwidth extension |
WO2005078707A1 (en) * | 2004-02-16 | 2005-08-25 | Koninklijke Philips Electronics N.V. | A transcoder and method of transcoding therefore |
EP1638083A1 (en) | 2004-09-17 | 2006-03-22 | Harman Becker Automotive Systems GmbH | Bandwidth extension of bandlimited audio signals |
Non-Patent Citations (1)
Title |
---|
ISER B ET AL: "BANDWIDTH EXTENSION OF TELEPHONY SPEECH", EURASIP NEWS LETTER, XX, XX, June 2005 (2005-06-01), pages 1 - 148, XP002372006, ISSN: 1687-1421 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8527283B2 (en) | 2008-02-07 | 2013-09-03 | Motorola Mobility Llc | Method and apparatus for estimating high-band energy in a bandwidth extension system |
WO2010021804A1 (en) * | 2008-08-21 | 2010-02-25 | Motorola, Inc. | Method and apparatus to facilitate determining signal bounding frequencies |
CN102144258B (en) * | 2008-08-21 | 2013-05-01 | 摩托罗拉移动公司 | Method and apparatus to facilitate determining signal bounding frequencies |
US8463412B2 (en) | 2008-08-21 | 2013-06-11 | Motorola Mobility Llc | Method and apparatus to facilitate determining signal bounding frequencies |
US10121487B2 (en) | 2016-11-18 | 2018-11-06 | Samsung Electronics Co., Ltd. | Signaling processor capable of generating and synthesizing high frequency recover signal |
Also Published As
Publication number | Publication date |
---|---|
CA2596411A1 (en) | 2008-02-22 |
EP1892703B1 (en) | 2009-10-21 |
JP2008052277A (en) | 2008-03-06 |
CN101141533B (en) | 2013-09-04 |
JP5150165B2 (en) | 2013-02-20 |
KR101433833B1 (en) | 2014-08-27 |
CN101141533A (en) | 2008-03-12 |
KR20080018132A (en) | 2008-02-27 |
ATE446572T1 (en) | 2009-11-15 |
DE602006009927D1 (en) | 2009-12-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1892703B1 (en) | Method and system for providing an acoustic signal with extended bandwidth | |
US7035797B2 (en) | Data-driven filtering of cepstral time trajectories for robust speech recognition | |
USRE43191E1 (en) | Adaptive Weiner filtering using line spectral frequencies | |
US5706395A (en) | Adaptive weiner filtering using a dynamic suppression factor | |
CA2210490C (en) | Spectral subtraction noise suppression method | |
US7216074B2 (en) | System for bandwidth extension of narrow-band speech | |
CN1750124B (en) | Bandwidth extension of band limited audio signals | |
US8706497B2 (en) | Speech signal restoration device and speech signal restoration method | |
US6988066B2 (en) | Method of bandwidth extension for narrow-band speech | |
EP1970900A1 (en) | Method and apparatus for providing a codebook for bandwidth extension of an acoustic signal | |
US8392184B2 (en) | Filtering of beamformed speech signals | |
EP1686565B1 (en) | Bandwidth extension of bandlimited speech data | |
EP1918910A1 (en) | Model-based enhancement of speech signals | |
JPH0916194A (en) | Noise reduction for voice signal | |
JPH10307599A (en) | Waveform interpolating voice coding using spline | |
US5806022A (en) | Method and system for performing speech recognition | |
EP1927981B1 (en) | Spectral refinement of audio signals | |
JPH10319996A (en) | Efficient decomposition of noise and periodic signal waveform in waveform interpolation | |
US7603271B2 (en) | Speech coding apparatus with perceptual weighting and method therefor | |
JP3183104B2 (en) | Noise reduction device | |
Puder | Kalman‐filters in subbands for noise reduction with enhanced pitch‐adaptive speech model estimation | |
Thomas et al. | Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech | |
López-Espejo et al. | On Speech Pre-emphasis as a Simple and Inexpensive Method to Boost Speech Enhancement | |
CN115527550A (en) | Single-microphone subband domain noise reduction method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA HR MK YU |
|
17P | Request for examination filed |
Effective date: 20080318 |
|
17Q | First examination report despatched |
Effective date: 20080416 |
|
AKX | Designation fees paid |
Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D |
|
REF | Corresponds to: |
Ref document number: 602006009927 Country of ref document: DE Date of ref document: 20091203 Kind code of ref document: P |
|
LTIE | Lt: invalidation of european patent or patent extension |
Effective date: 20091021 |
|
NLV1 | Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act | ||
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100222 Ref country code: ES Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100201 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100221 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LV Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: BE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100121 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
26N | No opposition filed |
Effective date: 20100722 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100122 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100831 Ref country code: IT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100831 Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100831 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100822 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 602006009927 Country of ref document: DE Representative=s name: GRUENECKER, KINKELDEY, STOCKMAIR & SCHWANHAEUS, DE |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 602006009927 Country of ref document: DE Representative=s name: GRUENECKER PATENT- UND RECHTSANWAELTE PARTG MB, DE Effective date: 20120411 Ref country code: DE Ref legal event code: R081 Ref document number: 602006009927 Country of ref document: DE Owner name: NUANCE COMMUNICATIONS, INC. (N.D.GES.D. STAATE, US Free format text: FORMER OWNER: HARMAN BECKER AUTOMOTIVE SYSTEMS GMBH, 76307 KARLSBAD, DE Effective date: 20120411 Ref country code: DE Ref legal event code: R082 Ref document number: 602006009927 Country of ref document: DE Representative=s name: GRUENECKER, KINKELDEY, STOCKMAIR & SCHWANHAEUS, DE Effective date: 20120411 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100822 Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20100422 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: TP Owner name: NUANCE COMMUNICATIONS, INC., US Effective date: 20120924 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20091021 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 11 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 12 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 13 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20180824 Year of fee payment: 13 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20180831 Year of fee payment: 13 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20181031 Year of fee payment: 13 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R119 Ref document number: 602006009927 Country of ref document: DE |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20190822 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20200303 Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20190831 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20190822 |