EP1557825B1 - Bandwidth expanding device and method - Google Patents

Bandwidth expanding device and method Download PDF

Info

Publication number
EP1557825B1
EP1557825B1 EP03756637A EP03756637A EP1557825B1 EP 1557825 B1 EP1557825 B1 EP 1557825B1 EP 03756637 A EP03756637 A EP 03756637A EP 03756637 A EP03756637 A EP 03756637A EP 1557825 B1 EP1557825 B1 EP 1557825B1
Authority
EP
European Patent Office
Prior art keywords
signal
circuit
voiced
gain
filter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
EP03756637A
Other languages
German (de)
French (fr)
Other versions
EP1557825A4 (en
EP1557825A1 (en
Inventor
Kazunori; c/o NEC CORPORATION OZAWA
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Publication of EP1557825A1 publication Critical patent/EP1557825A1/en
Publication of EP1557825A4 publication Critical patent/EP1557825A4/en
Application granted granted Critical
Publication of EP1557825B1 publication Critical patent/EP1557825B1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques

Definitions

  • This invention relates to a method and an apparatus for extending the band, according to which a narrow-band signal is entered as input signal and a band extended signal having enlarged frequency range of the input signal is output to improve the acoustic sound quality.
  • Non-Patent Publication 1
  • HMM model parameters need to be determined off-line at the outset from a voluminous speech database in a manner which entails prolonged computing time and increased cost.
  • retrieval by an HMM model is needed for the receiving side to carry out band extension processing in real time, for which a large volume of calculations are required.
  • EP-A 1 420 389 A further state of the art according to Article 54(3) (EPC) has been disclosed in EP-A 1 420 389 .
  • EPC Article 54(3)
  • US 5455888 and WO 01/35395 describe bandwidth extension techniques based on the LPC principle.
  • this object is solved by a band extending apparatus according to claim 1 and a band extending method according to claim 4.
  • a band extended signal (e.g. 7 kHz band signal) may be generated by generating a high frequency signal with processing for a narrow-band input signal (e.g. 4 kHz band signal) and by summing the resulting high frequency signal to a signal corresponding to the narrow-band input signal having its sampling frequency changed.
  • a narrow-band input signal e.g. 4 kHz band signal
  • the present invention has such meritorious effect that a band extended signal with optimum sound quality may be generated in case periodicity is required for a high frequency part of the signal, such as a vowel, by generating an adaptive codebook signal, using a delay calculated from the narrow-band input signal, and by multiplying the so generated adaptive codebook signal with a gain and by summing the resulting signal to a noise signal.
  • the present invention also has such meritorious effect that a band extended signal for higher sound quality may be generated by employing a pitch pre-filter for a sound source signal, using the delay, or by weighting the coefficients from the coefficient calculating circuit for use for the post-filter.
  • a band extension apparatus includes a spectral parameter calculating circuit 100, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 140, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200 and a gain adjustment circuit 210.
  • the spectral parameter calculating circuit 100 divides the input signal into plural frames, each being e.g. of 10 ms, and calculates spectral parameters of a predetermined number of orders P from frame to frame. It is noted that the spectral parameters represent parameters showing the outline shape of spectrum of a speech signal in terms of a frame as a unit.
  • LPC analysis as known per se, for example, is used.
  • For converting the linear prediction coefficients into LSP reference is made e.g. to the following treatises (for example see Non-Patent Publication 2):
  • Non-Patent Publication 2
  • the coefficient calculating circuit 130 is supplied with the spectral parameters and converts the parameters into coefficients of the band extended signal.
  • well-known techniques such as a technique for simply shifting the LSP frequency to a higher frequency, a technique for non-linear conversion or a technique for linear conversion, may be used.
  • the frequency band in which the LSPs are present is shifted to a higher frequency range, using all or part of the LSP parameters, for conversion to order-P linear prediction coefficients, which order-P linear prediction coefficients are then output to the synthesis filter circuit 170.
  • the noise generating circuit 120 generates a band-limited noise signal, having an average amplitude value normalized to a predetermined level, for a time duration equal to the frame duration, and outputs the so generated noise signal to the gain circuit 140.
  • the noise signal the white noise is here used. However, other noise signal may also be used.
  • the voiced/unvoiced discriminating circuit 200 is supplied with the narrow-band input signal x(n) to verify whether the frame-based signal is voiced or unvoiced.
  • the voiced/unvoiced discriminating circuit 200 outputs the voiced/unvoiced discrimination information to the gain adjustment circuit 210.
  • N denotes the number of samples for calculating the normalized autocorrelation.
  • the gain adjustment circuit 210 is supplied with the voiced/unvoiced discrimination information from the voiced/unvoiced discriminating circuit 200 and adjusts the gain to be imparted to the noise signal depending on whether the input signal is voiced or unvoiced, to output the so adjusted gain to the gain circuit 140.
  • the gain circuit 140 is supplied with the gain from the voiced/unvoiced discriminating circuit 200 and multiplies the output signal of the noise generating circuit 120 with the gain to output the resulting signal to the synthesis filter circuit 170.
  • the synthesis filter circuit 170 is supplied with the output signal of the gain circuit 140 and with coefficients of a predetermined number of orders, from the coefficient calculating circuit 130, to form a filter, and outputs a high frequency range signal y(n) needed for band extension.
  • the sampling frequency converting circuit 180 up-samples the narrow-band input signal x(n) to a predetermined sampling frequency to output the resulting up-sampled signal.
  • the adder 190 sums an output signal y(n) of the synthesis filter circuit 170 and an output signal s(n) of the sampling frequency converting circuit 180 to each other to form and output an ultimately band extended signal.
  • Fig.2 shows the configuration of a second embodiment not belonging to the present invention.
  • the band extending apparatus includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 340, a synthesis filter circuit 170, a sampling frequency converting circuit 180, adders 160, 190, a voiced/unvoiced discriminating circuit 200, and a gain adjustment circuit 310.
  • the same reference numerals are used to depict the same parts or components as those shown in Fig.1 . In the following, only the points of difference from Fig.1 are explained, whilst the same parts or components as those of Fig.1 are sometimes not explained.
  • the present second embodiment of the present invention includes the adaptive codebook circuit 110 and the adder 160, in addition to the components of Fig.1 .
  • the voiced/unvoiced discriminating circuit 200 is supplied with the narrow-band input signal x(n) to verify whether a frame-based signal is voiced or unvoiced.
  • a normalized autocorrelation function D(T) up to the predetermined delay time m is derived for the narrow-band input signal x(n) in accordance with the equation (1), and a maximum value of D(T) is found. If the maximum value of D(T) is larger than a predetermined threshold value, the input signal is determined to be voiced. If otherwise, the input signal is determined to be unvoiced.
  • the voiced/unvoiced discriminating circuit 200 sends the value of T, maximizing the normalized autocorrelation function D(T), as a pitch period T to the adaptive codebook circuit 110.
  • the gain circuit 340 is supplied from the gain adjustment circuit 310 with a gain which is then multiplied with an output signal of at least one of the adaptive codebook circuit 110 and the noise generating circuit 120. The resulting signal is output to the adder 160.
  • the adder 160 sums the two signals, output from the gain circuit 340, and outputs the resulting sum signal to the synthesis filter circuit 170 and to the adaptive codebook circuit 110.
  • the synthesis filter circuit 170 is supplied with an output signal (sound source signal) of the adder 160 and with a filter coefficient of a predetermined number of orders from the coefficient calculating circuit 130 to form a synthesis filter, and outputs a signal y(n) of a high frequency range needed for band extension.
  • the gain adjustment circuit 310 is supplied with the voiced/unvoiced discrimination information from the voiced/unvoiced discriminating circuit 200, and adjusts the gain of the adaptive codebook signal and the gain of the noise signal, depending on whether the input signal is voiced or unvoiced, to send the gain-adjusted signal to the gain circuit 340.
  • the adder 190 sums the output signal y(n) of the synthesis filter circuit 170 to the output signal s(n) of the sampling frequency converting circuit 180 to form and output an ultimately band extended signal.
  • an adaptive codebook signal is generated, using a delay calculated from the narrow-band input signal, based on the past sound source signal of high frequency portion, and are then multiplied with a proper gain.
  • the resulting signal is then summed to e.g. a noise signal, whereby a band extended signal with superior sound quality may be generated for e.g. a vowel in case periodicity is needed for a high frequency portion.
  • a pitch generating circuit 115 may be provided in place of the adaptive codebook circuit 110, as shown in Fig.6 .
  • the pitch generating circuit 115 calculates a pitch period from an input signal and generates a periodic signal based on the pitch period to output the so generated pitch signal to the gain circuit 340. Except for the pitch generating circuit 115, the modification is the same in the configuration as the above-described second embodiment, not belonging to the present invention.
  • Fig.3 shows the configuration of a first embodiment of the present invention.
  • the band extending apparatus of the third embodiment includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 300, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200, a gain adjustment circuit 310, and a pitch pre-filter 400.
  • the same reference numerals are used to depict the parts or components which are the same as those shown in Figs.1 and 2 . In the following, only the points of difference from the second embodiment not belonging to the present invention are explained, whilst the same parts or components as those of Fig.2 are sometimes not explained.
  • the gain circuit 300 is supplied with the gain from the gain adjustment circuit 310 and multiplies the output signals of the adaptive codebook circuit 110 and the noise generating circuit 120 with the gain. The resulting two signals are summed together and the resulting sum signal is output to the pitch pre-filter 400.
  • An output of the pitch pre-filter 400 is also supplied to the adaptive codebook circuit 110.
  • the synthesis filter circuit 170 is supplied with an output signal of the pitch pre-filter 400 and with coefficients of a predetermined number of orders from the coefficient calculating circuit 130 to form a filter, and outputs a signal y(n) of a high frequency range needed for band extension.
  • a pitch generating circuit may, of course, be used in place of the adaptive codebook circuit 110.
  • Fig.4 shows the configuration of a second embodiment of the present invention.
  • the band extending apparatus of the second embodiment includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 340, an adder 160, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200, a gain adjustment circuit 310, and a low-pass filter circuit 500.
  • the same reference numerals are used to depict the parts or components which are the same as those shown in Fig.2 .
  • the low-pass filter 500 is added to the configuration of the above-described second embodiment not belonging to the present invention shown in Fig.2 .
  • the same parts or components as those of Fig.2 are explained only as necessary.
  • the cut-off frequency of the low-pass filter 500 may be predetermined to, for example, 6 kHz.
  • h(n) denotes the impulse response of a low-pass filter
  • a symbol "*" denotes the operation of convolution.
  • a pitch generating circuit may be used in place of the adaptive codebook circuit 110, by way of a modification of the present second embodiment, as in the modification of the second embodiment described above.
  • Fig.5 shows the configuration of a third embodiment of the present invention.
  • the band extending apparatus of the third embodiment includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 300, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200, a gain adjustment circuit 310, a pitch pre-filter 400, and a post-filter 600.
  • the same reference numerals are used to depict the same parts or components as those shown in Fig.3 .
  • the third embodiment of the present invention includes the post-filter 600 in addition to the configuration of the above-described first embodiment.
  • the post-filter 600 in addition to the configuration of the above-described first embodiment.
  • the post-filter 600 is supplied from the coefficient calculating circuit 130 with coefficients (filter coefficients), which then are weighted.
  • a pitch generating circuit may also be used in place of the codebook circuit 110, by way of a modification of the second embodiment, as in the modification of the second embodiment not belonging to the present invention described above.

Abstract

A bandwidth expanding device comprising a spectrum parameter calculating circuit (100) for calculating a spectrum parameter of a narrow-bandwidth input signal x(n), a coefficient calculating circuit (130) for receiving the spectrum parameter and converting it into the coefficient of a signal the bandwidth of which is expanded, a gain circuit (140) for receiving a gain from a gain control circuit (210), multiplying the output signal from a noise generating circuit (120) and the gain, and outputting the product to a combined filter circuit (170), the combined filter circuit (170) for receiving the coefficient from coefficient calculating circuit (130) to constitute a filter and passing the signal from the gain circuit (140) through the filter to output a high-frequency signal y(n) for bandwidth expansion, a sampling frequency converting circuit (180) for receiving the narrow-bandwidth input signal x(n) and outputting a signal s(n) the frequency of which is up-sampled to a predetermined sampling frequency, and an adder (190) for adding the high-pass signal y(n) to the signal s(n) and outputting an expanded bandwidth signal.

Description

    TECHNICAL FIELD
  • This invention relates to a method and an apparatus for extending the band, according to which a narrow-band signal is entered as input signal and a band extended signal having enlarged frequency range of the input signal is output to improve the acoustic sound quality.
  • BACKGROUND ART
  • There has been known a system in which the frequency range of a speech signal, encoded at a low bit rate and reproduced, is extended on the receiving side without the transmitting side having to send the auxiliary information for band extension (for example, see Non-Patent Publication 1).
  • Non-Patent Publication 1:
  • With this state-of-the-art system, filter coefficients after band extension using HMM (Hidden Markov Model) are retrieved on the receiving side.
  • On the other hand, the processing for directly extending the band of the narrow-band input signal is unprecedented.
  • In the state-of-the-art method, shown in the Publication 1, in which modeling by HMM of filter coefficients or the broadband spectral envelope of speech is required, the following problem arises. That is, HMM model parameters need to be determined off-line at the outset from a voluminous speech database in a manner which entails prolonged computing time and increased cost. In addition, retrieval by an HMM model is needed for the receiving side to carry out band extension processing in real time, for which a large volume of calculations are required.
  • A further state of the art according to Article 54(3) (EPC) has been disclosed in EP-A 1 420 389 . An addition, US 5455888 and WO 01/35395 describe bandwidth extension techniques based on the LPC principle.
  • Accordingly, it is an object of the present invention to overcome the aforementioned problem and to provide a method and an apparatus for directly extending the frequency range of a narrow-band input signal. It is another object of the present invention to provide a method and an apparatus for extending the frequency range whereby the band-extended speech of optimum sound quality may be obtained with computational complexity less than that of the state-of-the-art system.
  • DISCLOSURE OF THE INVENTION
  • According to the present invention this object is solved by a band extending apparatus according to claim 1 and a band extending method according to claim 4.
  • Further advantageous features of the band extending apparatus and the band extending method are indicated in the dependent claims.
  • The present invention has such meritorious effect that a band extended signal (e.g. 7 kHz band signal) may be generated by generating a high frequency signal with processing for a narrow-band input signal (e.g. 4 kHz band signal) and by summing the resulting high frequency signal to a signal corresponding to the narrow-band input signal having its sampling frequency changed.
  • The present invention has such meritorious effect that a band extended signal with optimum sound quality may be generated in case periodicity is required for a high frequency part of the signal, such as a vowel, by generating an adaptive codebook signal, using a delay calculated from the narrow-band input signal, and by multiplying the so generated adaptive codebook signal with a gain and by summing the resulting signal to a noise signal.
  • The present invention also has such meritorious effect that a band extended signal for higher sound quality may be generated by employing a pitch pre-filter for a sound source signal, using the delay, or by weighting the coefficients from the coefficient calculating circuit for use for the post-filter.
  • BRIEF DESCRIPTION OF THE DRAWINGS
    • Fig.1 is a diagram showing a configuration of a first embodiment not belonging to the present invention.
    • Fig.2 is a diagram showing a configuration of a second embodiment not belonging to the present invention.
    • Fig.3 is a diagram showing a configuration of a first embodiment of the present invention.
    • Fig.4 is a diagram showing a configuration of a second embodiment of the present invention.
    • Fig.5 is a diagram showing a configuration of a third embodiment of the present invention.
    • FIG. 6 is a diagram showing a modification of the second embodiment not belonging to the present invention.
    PREFERRED EMBODIMENTS OF THE INVENTION
  • For more detailed explanation of the present invention, preferred embodiments of the present invention will be explained with reference to the drawings. It is presupposed in the following that a narrow-band input signal of a 4 kHz range is extended in band to a 5 kHz band or to a 7 kHz band.
  • Fig.1 shows the configuration of a first embodiment not belonging to the present invention. Referring to Fig.1, a band extension apparatus includes a spectral parameter calculating circuit 100, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 140, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200 and a gain adjustment circuit 210.
  • In the band extending apparatus, supplied with a narrow-band input signal x(n), the spectral parameter calculating circuit 100 divides the input signal into plural frames, each being e.g. of 10 ms, and calculates spectral parameters of a predetermined number of orders P from frame to frame. It is noted that the spectral parameters represent parameters showing the outline shape of spectrum of a speech signal in terms of a frame as a unit. For the calculation, LPC analysis, as known per se, for example, is used. The spectral parameter calculating circuit 100 also converts the linear prediction coefficients α i (i = 1, ...P), calculated by the LPC analysis, into LPC parameters suitable for quantization or interpolation, to output the so formed LPC parameters. For converting the linear prediction coefficients into LSP, reference is made e.g. to the following treatises (for example see Non-Patent Publication 2):
  • Non-Patent Publication 2:
  • The coefficient calculating circuit 130 is supplied with the spectral parameters and converts the parameters into coefficients of the band extended signal. For this conversion, well-known techniques, such as a technique for simply shifting the LSP frequency to a higher frequency, a technique for non-linear conversion or a technique for linear conversion, may be used. Here, the frequency band in which the LSPs are present is shifted to a higher frequency range, using all or part of the LSP parameters, for conversion to order-P linear prediction coefficients, which order-P linear prediction coefficients are then output to the synthesis filter circuit 170.
  • The noise generating circuit 120 generates a band-limited noise signal, having an average amplitude value normalized to a predetermined level, for a time duration equal to the frame duration, and outputs the so generated noise signal to the gain circuit 140. As the noise signal, the white noise is here used. However, other noise signal may also be used.
  • The voiced/unvoiced discriminating circuit 200 is supplied with the narrow-band input signal x(n) to verify whether the frame-based signal is voiced or unvoiced. For verifying whether the frame-based signal is voiced or unvoiced, a normalized autocorrelation function D(T) up to a predetermined delay time m is derived for the narrow-band input signal x(n) in accordance with the equation (1): D T = n = 0 N - 1 x n x n - T / n = 0 N - 1 x 2 n - T
    Figure imgb0001

    and a maximum value of D(T) is found. If the maximum value of D(T) is larger than a predetermined threshold value, the input signal is determined to be voiced. If otherwise, the input signal is determined to be unvoiced.
  • The voiced/unvoiced discriminating circuit 200 outputs the voiced/unvoiced discrimination information to the gain adjustment circuit 210. In the above equation (1), N denotes the number of samples for calculating the normalized autocorrelation.
  • The gain adjustment circuit 210 is supplied with the voiced/unvoiced discrimination information from the voiced/unvoiced discriminating circuit 200 and adjusts the gain to be imparted to the noise signal depending on whether the input signal is voiced or unvoiced, to output the so adjusted gain to the gain circuit 140.
  • The gain circuit 140 is supplied with the gain from the voiced/unvoiced discriminating circuit 200 and multiplies the output signal of the noise generating circuit 120 with the gain to output the resulting signal to the synthesis filter circuit 170.
  • The synthesis filter circuit 170 is supplied with the output signal of the gain circuit 140 and with coefficients of a predetermined number of orders, from the coefficient calculating circuit 130, to form a filter, and outputs a high frequency range signal y(n) needed for band extension.
  • The sampling frequency converting circuit 180 up-samples the narrow-band input signal x(n) to a predetermined sampling frequency to output the resulting up-sampled signal.
  • The adder 190 sums an output signal y(n) of the synthesis filter circuit 170 and an output signal s(n) of the sampling frequency converting circuit 180 to each other to form and output an ultimately band extended signal.
  • The above completes the explanation of the first embodiment.
  • Fig.2 shows the configuration of a second embodiment not belonging to the present invention. Referring to Fig.2, the band extending apparatus includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 340, a synthesis filter circuit 170, a sampling frequency converting circuit 180, adders 160, 190, a voiced/unvoiced discriminating circuit 200, and a gain adjustment circuit 310. In Fig.2, the same reference numerals are used to depict the same parts or components as those shown in Fig.1. In the following, only the points of difference from Fig.1 are explained, whilst the same parts or components as those of Fig.1 are sometimes not explained. The present second embodiment of the present invention includes the adaptive codebook circuit 110 and the adder 160, in addition to the components of Fig.1.
  • The voiced/unvoiced discriminating circuit 200 is supplied with the narrow-band input signal x(n) to verify whether a frame-based signal is voiced or unvoiced. For verifying whether the frame-based signal is voiced or unvoiced, a normalized autocorrelation function D(T) up to the predetermined delay time m is derived for the narrow-band input signal x(n) in accordance with the equation (1), and a maximum value of D(T) is found. If the maximum value of D(T) is larger than a predetermined threshold value, the input signal is determined to be voiced. If otherwise, the input signal is determined to be unvoiced.
  • For the voiced frame, the voiced/unvoiced discriminating circuit 200 sends the value of T, maximizing the normalized autocorrelation function D(T), as a pitch period T to the adaptive codebook circuit 110.
  • The adaptive codebook circuit 110 is supplied from the voiced/unvoiced discriminating circuit 200 with the delay T of the adaptive codebook and, based on the past sound source signal v(n), generates an adaptive code vector p(n), in accordance with the following equation (2): p n = v n - T
    Figure imgb0002

    and outputs the so generated vector to the gain circuit 340.
  • The gain circuit 340 is supplied from the gain adjustment circuit 310 with a gain which is then multiplied with an output signal of at least one of the adaptive codebook circuit 110 and the noise generating circuit 120. The resulting signal is output to the adder 160.
  • The adder 160 sums the two signals, output from the gain circuit 340, and outputs the resulting sum signal to the synthesis filter circuit 170 and to the adaptive codebook circuit 110.
  • The synthesis filter circuit 170 is supplied with an output signal (sound source signal) of the adder 160 and with a filter coefficient of a predetermined number of orders from the coefficient calculating circuit 130 to form a synthesis filter, and outputs a signal y(n) of a high frequency range needed for band extension.
  • The gain adjustment circuit 310 is supplied with the voiced/unvoiced discrimination information from the voiced/unvoiced discriminating circuit 200, and adjusts the gain of the adaptive codebook signal and the gain of the noise signal, depending on whether the input signal is voiced or unvoiced, to send the gain-adjusted signal to the gain circuit 340.
  • The adder 190 sums the output signal y(n) of the synthesis filter circuit 170 to the output signal s(n) of the sampling frequency converting circuit 180 to form and output an ultimately band extended signal.
  • With the second embodiment of not belonging to the present invention, an adaptive codebook signal is generated, using a delay calculated from the narrow-band input signal, based on the past sound source signal of high frequency portion, and are then multiplied with a proper gain. The resulting signal is then summed to e.g. a noise signal, whereby a band extended signal with superior sound quality may be generated for e.g. a vowel in case periodicity is needed for a high frequency portion. The above completes explanation of the second embodiment. As a modification of the second embodiment not belonging to the present invention, a pitch generating circuit 115 may be provided in place of the adaptive codebook circuit 110, as shown in Fig.6. The pitch generating circuit 115 calculates a pitch period from an input signal and generates a periodic signal based on the pitch period to output the so generated pitch signal to the gain circuit 340. Except for the pitch generating circuit 115, the modification is the same in the configuration as the above-described second embodiment, not belonging to the present invention.
  • Fig.3 shows the configuration of a first embodiment of the present invention. Referring to Fig.3, the band extending apparatus of the third embodiment includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 300, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200, a gain adjustment circuit 310, and a pitch pre-filter 400. In Fig.3, the same reference numerals are used to depict the parts or components which are the same as those shown in Figs.1 and 2. In the following, only the points of difference from the second embodiment not belonging to the present invention are explained, whilst the same parts or components as those of Fig.2 are sometimes not explained.
  • The gain circuit 300 is supplied with the gain from the gain adjustment circuit 310 and multiplies the output signals of the adaptive codebook circuit 110 and the noise generating circuit 120 with the gain. The resulting two signals are summed together and the resulting sum signal is output to the pitch pre-filter 400.
  • The pitch pre-filter 400 is supplied with the delay T from the voiced/unvoiced discriminating circuit 200, and performs pre-filtering on the sound source signal v(n) in accordance with the following equation (3): n = v n + βp n - T
    Figure imgb0003

    to output the resulting signal to the synthesis filter circuit 170.
  • An output of the pitch pre-filter 400 is also supplied to the adaptive codebook circuit 110.
  • The synthesis filter circuit 170 is supplied with an output signal of the pitch pre-filter 400 and with coefficients of a predetermined number of orders from the coefficient calculating circuit 130 to form a filter, and outputs a signal y(n) of a high frequency range needed for band extension.
  • By employing the pitch pre-filter 400 for pre-filtering the sound source signal, using the delay, a band extended signal of superior sound quality may be produced. The above completes the explanation of the third embodiment. In the present embodiment, as in the modification of the second embodiment, not belonging to the invention, a pitch generating circuit may, of course, be used in place of the adaptive codebook circuit 110.
  • Fig.4 shows the configuration of a second embodiment of the present invention. Referring to Fig.4, the band extending apparatus of the second embodiment includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 340, an adder 160, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200, a gain adjustment circuit 310, and a low-pass filter circuit 500. In Fig.4, the same reference numerals are used to depict the parts or components which are the same as those shown in Fig.2. In the second embodiment, the low-pass filter 500 is added to the configuration of the above-described second embodiment not belonging to the present invention shown in Fig.2. In the following, only the points of difference from this second embodiment are explained, whilst the same parts or components as those of Fig.2 are explained only as necessary.
  • The low-pass filter 500 filters the output signal of the adaptive codebook circuit 110 in accordance with the equation: n = p n * h n
    Figure imgb0004

    to permit a signal with a frequency not higher than a predetermined cut-off frequency to pass therethrough to the gain circuit 340. The cut-off frequency of the low-pass filter 500 may be predetermined to, for example, 6 kHz. Meanwhile, in Fig.4, h(n) denotes the impulse response of a low-pass filter, and a symbol "*" denotes the operation of convolution.
  • The foregoing completes the explanation of the second embodiment of the present invention. Meanwhile, a pitch generating circuit may be used in place of the adaptive codebook circuit 110, by way of a modification of the present second embodiment, as in the modification of the second embodiment described above.
  • Fig.5 shows the configuration of a third embodiment of the present invention. Referring to Fig.5, the band extending apparatus of the third embodiment includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 300, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200, a gain adjustment circuit 310, a pitch pre-filter 400, and a post-filter 600. In Fig.5, the same reference numerals are used to depict the same parts or components as those shown in Fig.3. The third embodiment of the present invention includes the post-filter 600 in addition to the configuration of the above-described first embodiment. In the following, only the points of difference from the first embodiment are explained, whilst the same parts or components as those of Fig.3 are explained only as necessary.
  • The post-filter 600 is supplied from the coefficient calculating circuit 130 with coefficients (filter coefficients), which then are weighted. The post-filter then performs post-filtering in accordance with the equation (5): n = y n - Σ a i γ 1 1 y n - i + Σ a i γ 2 1 n - i
    Figure imgb0005

    in order to deliver an output to the adder 190.
  • By employing the post-filter 600, it is possible to generate a band extended signal of superior quality. The above completes the explanation of the third embodiment. It is noted that a pitch generating circuit may also be used in place of the codebook circuit 110, by way of a modification of the second embodiment, as in the modification of the second embodiment not belonging to the present invention described above.
  • The configurations of the above-described embodiments may also be combined together, such as by employing the post-filter, explained in the third embodiment, for the above-described first embodiment. In the present invention, plural sorts of the preset frequency band signal (narrow-band signal) may be input, in place of only one sort of the signals. Although the present invention has been explained with reference to the above specific embodiments, it is to be noted that the present invention may encompass various modifications or corrections that may be occur to those skilled in the art within the scope of the invention as defined in the claims.

Claims (6)

  1. A band extending apparatus receiving at least an input signal of a preset frequency band to output a band extended signal corresponding to said input signal extended in a frequency band thereof,
    said apparatus comprising:
    (A) a spectral parameter calculating unit (100), adapted to receive at least an input signal of a preset frequency band to calculate spectral parameters representing spectral characteristics;
    (B) a coefficient calculating unit (130) adapted to shift the frequency of said spectral parameters to then calculate filter coefficients;
    (C) a voiced/unvoiced discriminating circuit (200), adapted to supply for a voiced frame, a preset delay derived from a voiced/unvoiced decision, as a pitch period to an adaptive codebook circuit (110);
    (D) said adaptive codebook circuit (110), being adapted to receive the delay from said voiced/unvoiced discriminating circuit, as a delay of said adaptive codebook, to generate an adaptive codebook signal based on the past sound source signal outputted from a synthesis filter circuit (170) and to output the adaptive codebook signal generated;
    (E) a noise generating circuit (120) adapted to generate a noise signal;
    (F) a gain adjustment circuit (310), adapted to receive voiced/unvoiced discrimination information output from said voiced/unvoiced discrimination circuit (200), to output a resulting gain adjustment signal for adjusting the gain of the output signal of said adaptive codebook circuit (110) and the gain of the output signal of noise generating unit (120), depending on whether said voiced/unvoiced discrimination information indicates voiced or unvoiced;
    (G) a gain circuit (300) adapted to receive said gain adjustment signal from said gain adjustment circuit (310) to multiply the output signal of said adaptive codebook circuit (110) and the output signal of said noise generating circuit (120) with said gain adjustment signal to output two signals that are summed together to form a resulting sum signal;
    (H) a pitch pre-filter (400) adapted to filter said resulting sum signal from said gain circuit (300), using said pitch period supplied from the voiced/unvoiced discriminating circuit (200) and to supply the output signal to the adaptive codebook circuit (110) and to the synthesis filter circuit (170);
    (I) said synthesis filter circuit (170) being adapted to pass the output signal of said pitch pre-filter (400) through a synthesis filter, formed using said filter coefficients, to reproduce a signal for band extension; and
    (J) an adder (190) adapted to add a signal corresponding to said input signal converted in a sampling frequency thereof to an output signal of said synthesis filter circuit (170) to generate the band extended signal.
  2. The band extending apparatus as defined in claim 1, further comprising:
    a low-pass filter (500), receiving an output signal of said adaptive codebook unit (110) as an input.
  3. The band extending apparatus as defined in any one of claims 1 or 2, wherein
    a post-filter (600) is formed using weighting coefficients as weighted version of filter coefficients output from said coefficient calculating unit (130), and wherein an output signal of said synthesis filter unit (170) is passed through said post-filter (600) to reproduce the signal for band extension.
  4. A band extending method for receiving at least an input signal of a preset frequency band to output a band extended signal corresponding to said input signal extended in a frequency band thereof,
    said method comprising:
    (A) a spectral parameter-calculating step for receiving at least an input signal of a preset frequency band to calculate spectral parameters representing spectral characteristics;
    (B) a coefficient-calculating step for shifting the frequency of said spectral parameters to then calculate filter coefficients;
    (C) a voiced/unvoiced-discriminating step for supplying, for a voiced frame, a preset delay derived from a voiced/unvoiced discrimination, as a pitch period to an adaptive codebook generating step;
    (D) said adaptive generating step receiving the delay from said voiced/ unvoiced discriminating step, as a delay of said adaptive codebook, generating an adaptive codebook signal based on the past sound source signal outputted from a synthesis filtering step and outputting the adaptive codebook signal generated;
    (E) a step of generating a noise signal;
    (F) a gain-adjusting step for receiving voiced/unvoiced discrimination information resulting from said voiced/unvoiced discrimination, to output a resulting gain adjustment signal for adjusting the gain of said adaptive codebook signal and the gain of the generated noise signal, depending on whether said voiced/unvoiced discrimination information indicates voiced or unvoiced;
    (G) a gain-multiplying step for receiving said gain adjustment signal from said gain adjusting step and multiplying the output signal of said adaptive codebook and the generated noise signal with said gain adjustment signal to output two signals that are summed together to form a resulting sum signal;
    (H) a pitch pre-filtering step for filtering said resulting sum signal from said gain multiplying step, using said pitch period, supplying the output signal of said pitch pre-filtering step to the adaptive codebook generating step and to the synthesis filtering step;
    (I) said synthesis filtering step passing an output signal of said pitch pre-filtering step through a synthesis filter, formed using said filter coefficients, to reproduce a signal for band extension; and
    (J) a step for adding a signal corresponding to said input signal converted in a sampling frequency thereof to an output signal of said synthesis filtering step to generate the band extended signal.
  5. The band extending method as defined in claim 4, further comprising:
    processing said adaptive codebook signal with a low-pass filter (500) to allow frequency components not higher than a preset cut-off frequency to pass there through.
  6. The band extending method as defined in any one of claims 4 or 5, comprising:
    passing an output signal of said synthesis filtering step through a post-filter (600) formed using weighting coefficients corresponding to weighted version of said filter coefficients to reproduce the signal for band extension.
EP03756637A 2002-10-31 2003-10-16 Bandwidth expanding device and method Expired - Lifetime EP1557825B1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2002317203 2002-10-31
JP2002317203A JP4433668B2 (en) 2002-10-31 2002-10-31 Bandwidth expansion apparatus and method
PCT/JP2003/013231 WO2004040553A1 (en) 2002-10-31 2003-10-16 Bandwidth expanding device and method

Publications (3)

Publication Number Publication Date
EP1557825A1 EP1557825A1 (en) 2005-07-27
EP1557825A4 EP1557825A4 (en) 2006-01-18
EP1557825B1 true EP1557825B1 (en) 2010-12-22

Family

ID=32211713

Family Applications (1)

Application Number Title Priority Date Filing Date
EP03756637A Expired - Lifetime EP1557825B1 (en) 2002-10-31 2003-10-16 Bandwidth expanding device and method

Country Status (9)

Country Link
US (1) US7684979B2 (en)
EP (1) EP1557825B1 (en)
JP (1) JP4433668B2 (en)
KR (1) KR100715013B1 (en)
CN (1) CN1708785B (en)
AU (1) AU2003301711A1 (en)
CA (1) CA2504175A1 (en)
DE (1) DE60335486D1 (en)
WO (1) WO2004040553A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1482482A1 (en) * 2003-05-27 2004-12-01 Siemens Aktiengesellschaft Frequency expansion for Synthesiser
US8712768B2 (en) * 2004-05-25 2014-04-29 Nokia Corporation System and method for enhanced artificial bandwidth expansion
CN101023472B (en) * 2004-09-06 2010-06-23 松下电器产业株式会社 Scalable encoding device and scalable encoding method
JP5063364B2 (en) * 2005-02-10 2012-10-31 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Speech synthesis method
KR101414375B1 (en) 2008-06-13 2014-07-04 삼성전자주식회사 Apparatus and method for encoding/decoding using bandwidth extension

Family Cites Families (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61107400A (en) * 1984-10-31 1986-05-26 日本電気株式会社 Voice synthesizer
JPS63217732A (en) 1987-03-05 1988-09-09 Kokusai Electric Co Ltd Coding transmission system for voice signal
JP3088121B2 (en) * 1991-04-12 2000-09-18 沖電気工業株式会社 Statistical excitation code vector optimization method
US5734789A (en) * 1992-06-01 1998-03-31 Hughes Electronics Voiced, unvoiced or noise modes in a CELP vocoder
US5455888A (en) * 1992-12-04 1995-10-03 Northern Telecom Limited Speech bandwidth extension method and apparatus
JP3297156B2 (en) * 1993-08-17 2002-07-02 三菱電機株式会社 Voice discrimination device
JP3483958B2 (en) * 1994-10-28 2004-01-06 三菱電機株式会社 Broadband audio restoration apparatus, wideband audio restoration method, audio transmission system, and audio transmission method
JP3328080B2 (en) * 1994-11-22 2002-09-24 沖電気工業株式会社 Code-excited linear predictive decoder
JP3189614B2 (en) * 1995-03-13 2001-07-16 松下電器産業株式会社 Voice band expansion device
EP0732687B2 (en) * 1995-03-13 2005-10-12 Matsushita Electric Industrial Co., Ltd. Apparatus for expanding speech bandwidth
US5699485A (en) 1995-06-07 1997-12-16 Lucent Technologies Inc. Pitch delay modification during frame erasures
JPH0955778A (en) * 1995-08-15 1997-02-25 Fujitsu Ltd Bandwidth widening device for sound signal
JPH09127985A (en) 1995-10-26 1997-05-16 Sony Corp Signal coding method and device therefor
EP0788091A3 (en) 1996-01-31 1999-02-24 Kabushiki Kaisha Toshiba Speech encoding and decoding method and apparatus therefor
JP3350340B2 (en) * 1996-03-29 2002-11-25 株式会社東芝 Voice coding method and voice decoding method
EP0945852A1 (en) * 1998-03-25 1999-09-29 BRITISH TELECOMMUNICATIONS public limited company Speech synthesis
TW376611B (en) 1998-05-26 1999-12-11 Koninkl Philips Electronics Nv Transmission system with improved speech encoder
JP3502268B2 (en) 1998-06-16 2004-03-02 ヤマハ株式会社 Audio signal processing device and audio signal processing method
JP3540159B2 (en) 1998-06-18 2004-07-07 ヤマハ株式会社 Voice conversion device and voice conversion method
CA2252170A1 (en) 1998-10-27 2000-04-27 Bruno Bessette A method and device for high quality coding of wideband speech and audio signals
JP2000267700A (en) * 1999-03-17 2000-09-29 Yrp Kokino Idotai Tsushin Kenkyusho:Kk Method and device for encoding and decoding voice
US6377915B1 (en) * 1999-03-17 2002-04-23 Yrp Advanced Mobile Communication Systems Research Laboratories Co., Ltd. Speech decoding using mix ratio table
JP3583945B2 (en) * 1999-04-15 2004-11-04 日本電信電話株式会社 Audio coding method
JP4464488B2 (en) 1999-06-30 2010-05-19 パナソニック株式会社 Speech decoding apparatus, code error compensation method, speech decoding method
EP1147515A1 (en) * 1999-11-10 2001-10-24 Koninklijke Philips Electronics N.V. Wide band speech synthesis by means of a mapping matrix
JP2002055699A (en) 2000-08-10 2002-02-20 Mitsubishi Electric Corp Device and method for encoding voice
DE10041512B4 (en) * 2000-08-24 2005-05-04 Infineon Technologies Ag Method and device for artificially expanding the bandwidth of speech signals
JP3462464B2 (en) * 2000-10-20 2003-11-05 株式会社東芝 Audio encoding method, audio decoding method, and electronic device
US6889182B2 (en) * 2001-01-12 2005-05-03 Telefonaktiebolaget L M Ericsson (Publ) Speech bandwidth extension
JP2003044098A (en) * 2001-07-26 2003-02-14 Nec Corp Device and method for expanding voice band
US6895375B2 (en) * 2001-10-04 2005-05-17 At&T Corp. System for bandwidth extension of Narrow-band speech
CN1282156C (en) * 2001-11-23 2006-10-25 皇家飞利浦电子股份有限公司 Audio signal bandwidth extension

Also Published As

Publication number Publication date
EP1557825A4 (en) 2006-01-18
KR20050062643A (en) 2005-06-23
WO2004040553A1 (en) 2004-05-13
CN1708785B (en) 2010-05-12
KR100715013B1 (en) 2007-05-09
EP1557825A1 (en) 2005-07-27
DE60335486D1 (en) 2011-02-03
AU2003301711A1 (en) 2004-05-25
CN1708785A (en) 2005-12-14
CA2504175A1 (en) 2004-05-13
JP2004151423A (en) 2004-05-27
US20050256709A1 (en) 2005-11-17
JP4433668B2 (en) 2010-03-17
US7684979B2 (en) 2010-03-23

Similar Documents

Publication Publication Date Title
US7454330B1 (en) Method and apparatus for speech encoding and decoding by sinusoidal analysis and waveform encoding with phase reproducibility
US7013270B2 (en) Determining linear predictive coding filter parameters for encoding a voice signal
EP0673013A1 (en) Signal encoding and decoding system
JPH10124088A (en) Device and method for expanding voice frequency band width
WO2003010752A1 (en) Speech bandwidth extension apparatus and speech bandwidth extension method
US7486719B2 (en) Transcoder and code conversion method
AU669788B2 (en) Method for generating a spectral noise weighting filter for use in a speech coder
US7684979B2 (en) Band extending apparatus and method
JPH10124089A (en) Processor and method for speech signal processing and device and method for expanding voice bandwidth
EP1564723B1 (en) Transcoder and coder conversion method
JPH08305396A (en) Device and method for expanding voice band
JPH0782360B2 (en) Speech analysis and synthesis method
EP1204094B1 (en) Excitation signal low pass filtering for speech coding
JP3583945B2 (en) Audio coding method
JP3481027B2 (en) Audio coding device
US6983241B2 (en) Method and apparatus for performing harmonic noise weighting in digital speech coders
JP3578933B2 (en) Method of creating weight codebook, method of setting initial value of MA prediction coefficient during learning at the time of codebook design, method of encoding audio signal, method of decoding the same, and computer-readable storage medium storing encoding program And computer-readable storage medium storing decryption program
JP2583883B2 (en) Speech analyzer and speech synthesizer
JP3192051B2 (en) Audio coding device
JP3199128B2 (en) Audio encoding method
JPS61128299A (en) Voice analysis/analytic synthesization system
JPH11184499A (en) Voice encoding method and voice encoding method
Yim AutoRegressive Moving Average modelling in low bit rate speech coding

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20050513

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL LT LV MK

REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1075735

Country of ref document: HK

A4 Supplementary search report drawn up and despatched

Effective date: 20051206

DAX Request for extension of the european patent (deleted)
RBV Designated contracting states (corrected)

Designated state(s): DE FR GB

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): DE FR GB

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REF Corresponds to:

Ref document number: 60335486

Country of ref document: DE

Date of ref document: 20110203

Kind code of ref document: P

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 60335486

Country of ref document: DE

Effective date: 20110203

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20110923

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 60335486

Country of ref document: DE

Effective date: 20110923

REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1075735

Country of ref document: HK

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 14

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 15

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 16

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20221028

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20221019

Year of fee payment: 20

Ref country code: DE

Payment date: 20221019

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 60335486

Country of ref document: DE

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20231015

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20231015

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20231015