EP1557825B1 - Bandwidth expanding device and method - Google Patents
Bandwidth expanding device and method Download PDFInfo
- Publication number
- EP1557825B1 EP1557825B1 EP03756637A EP03756637A EP1557825B1 EP 1557825 B1 EP1557825 B1 EP 1557825B1 EP 03756637 A EP03756637 A EP 03756637A EP 03756637 A EP03756637 A EP 03756637A EP 1557825 B1 EP1557825 B1 EP 1557825B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- signal
- circuit
- voiced
- gain
- filter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000000034 method Methods 0.000 title claims description 16
- 238000005070 sampling Methods 0.000 claims abstract description 14
- 230000003044 adaptive effect Effects 0.000 claims description 39
- 230000015572 biosynthetic process Effects 0.000 claims description 28
- 238000003786 synthesis reaction Methods 0.000 claims description 28
- 230000003595 spectral effect Effects 0.000 claims description 19
- 238000001914 filtration Methods 0.000 claims description 12
- 238000011045 prefiltration Methods 0.000 claims description 10
- 238000012545 processing Methods 0.000 claims description 4
- 238000001228 spectrum Methods 0.000 abstract description 4
- 238000012986 modification Methods 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 238000010586 diagram Methods 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 4
- 238000005311 autocorrelation function Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
Definitions
- This invention relates to a method and an apparatus for extending the band, according to which a narrow-band signal is entered as input signal and a band extended signal having enlarged frequency range of the input signal is output to improve the acoustic sound quality.
- Non-Patent Publication 1
- HMM model parameters need to be determined off-line at the outset from a voluminous speech database in a manner which entails prolonged computing time and increased cost.
- retrieval by an HMM model is needed for the receiving side to carry out band extension processing in real time, for which a large volume of calculations are required.
- EP-A 1 420 389 A further state of the art according to Article 54(3) (EPC) has been disclosed in EP-A 1 420 389 .
- EPC Article 54(3)
- US 5455888 and WO 01/35395 describe bandwidth extension techniques based on the LPC principle.
- this object is solved by a band extending apparatus according to claim 1 and a band extending method according to claim 4.
- a band extended signal (e.g. 7 kHz band signal) may be generated by generating a high frequency signal with processing for a narrow-band input signal (e.g. 4 kHz band signal) and by summing the resulting high frequency signal to a signal corresponding to the narrow-band input signal having its sampling frequency changed.
- a narrow-band input signal e.g. 4 kHz band signal
- the present invention has such meritorious effect that a band extended signal with optimum sound quality may be generated in case periodicity is required for a high frequency part of the signal, such as a vowel, by generating an adaptive codebook signal, using a delay calculated from the narrow-band input signal, and by multiplying the so generated adaptive codebook signal with a gain and by summing the resulting signal to a noise signal.
- the present invention also has such meritorious effect that a band extended signal for higher sound quality may be generated by employing a pitch pre-filter for a sound source signal, using the delay, or by weighting the coefficients from the coefficient calculating circuit for use for the post-filter.
- a band extension apparatus includes a spectral parameter calculating circuit 100, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 140, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200 and a gain adjustment circuit 210.
- the spectral parameter calculating circuit 100 divides the input signal into plural frames, each being e.g. of 10 ms, and calculates spectral parameters of a predetermined number of orders P from frame to frame. It is noted that the spectral parameters represent parameters showing the outline shape of spectrum of a speech signal in terms of a frame as a unit.
- LPC analysis as known per se, for example, is used.
- For converting the linear prediction coefficients into LSP reference is made e.g. to the following treatises (for example see Non-Patent Publication 2):
- Non-Patent Publication 2
- the coefficient calculating circuit 130 is supplied with the spectral parameters and converts the parameters into coefficients of the band extended signal.
- well-known techniques such as a technique for simply shifting the LSP frequency to a higher frequency, a technique for non-linear conversion or a technique for linear conversion, may be used.
- the frequency band in which the LSPs are present is shifted to a higher frequency range, using all or part of the LSP parameters, for conversion to order-P linear prediction coefficients, which order-P linear prediction coefficients are then output to the synthesis filter circuit 170.
- the noise generating circuit 120 generates a band-limited noise signal, having an average amplitude value normalized to a predetermined level, for a time duration equal to the frame duration, and outputs the so generated noise signal to the gain circuit 140.
- the noise signal the white noise is here used. However, other noise signal may also be used.
- the voiced/unvoiced discriminating circuit 200 is supplied with the narrow-band input signal x(n) to verify whether the frame-based signal is voiced or unvoiced.
- the voiced/unvoiced discriminating circuit 200 outputs the voiced/unvoiced discrimination information to the gain adjustment circuit 210.
- N denotes the number of samples for calculating the normalized autocorrelation.
- the gain adjustment circuit 210 is supplied with the voiced/unvoiced discrimination information from the voiced/unvoiced discriminating circuit 200 and adjusts the gain to be imparted to the noise signal depending on whether the input signal is voiced or unvoiced, to output the so adjusted gain to the gain circuit 140.
- the gain circuit 140 is supplied with the gain from the voiced/unvoiced discriminating circuit 200 and multiplies the output signal of the noise generating circuit 120 with the gain to output the resulting signal to the synthesis filter circuit 170.
- the synthesis filter circuit 170 is supplied with the output signal of the gain circuit 140 and with coefficients of a predetermined number of orders, from the coefficient calculating circuit 130, to form a filter, and outputs a high frequency range signal y(n) needed for band extension.
- the sampling frequency converting circuit 180 up-samples the narrow-band input signal x(n) to a predetermined sampling frequency to output the resulting up-sampled signal.
- the adder 190 sums an output signal y(n) of the synthesis filter circuit 170 and an output signal s(n) of the sampling frequency converting circuit 180 to each other to form and output an ultimately band extended signal.
- Fig.2 shows the configuration of a second embodiment not belonging to the present invention.
- the band extending apparatus includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 340, a synthesis filter circuit 170, a sampling frequency converting circuit 180, adders 160, 190, a voiced/unvoiced discriminating circuit 200, and a gain adjustment circuit 310.
- the same reference numerals are used to depict the same parts or components as those shown in Fig.1 . In the following, only the points of difference from Fig.1 are explained, whilst the same parts or components as those of Fig.1 are sometimes not explained.
- the present second embodiment of the present invention includes the adaptive codebook circuit 110 and the adder 160, in addition to the components of Fig.1 .
- the voiced/unvoiced discriminating circuit 200 is supplied with the narrow-band input signal x(n) to verify whether a frame-based signal is voiced or unvoiced.
- a normalized autocorrelation function D(T) up to the predetermined delay time m is derived for the narrow-band input signal x(n) in accordance with the equation (1), and a maximum value of D(T) is found. If the maximum value of D(T) is larger than a predetermined threshold value, the input signal is determined to be voiced. If otherwise, the input signal is determined to be unvoiced.
- the voiced/unvoiced discriminating circuit 200 sends the value of T, maximizing the normalized autocorrelation function D(T), as a pitch period T to the adaptive codebook circuit 110.
- the gain circuit 340 is supplied from the gain adjustment circuit 310 with a gain which is then multiplied with an output signal of at least one of the adaptive codebook circuit 110 and the noise generating circuit 120. The resulting signal is output to the adder 160.
- the adder 160 sums the two signals, output from the gain circuit 340, and outputs the resulting sum signal to the synthesis filter circuit 170 and to the adaptive codebook circuit 110.
- the synthesis filter circuit 170 is supplied with an output signal (sound source signal) of the adder 160 and with a filter coefficient of a predetermined number of orders from the coefficient calculating circuit 130 to form a synthesis filter, and outputs a signal y(n) of a high frequency range needed for band extension.
- the gain adjustment circuit 310 is supplied with the voiced/unvoiced discrimination information from the voiced/unvoiced discriminating circuit 200, and adjusts the gain of the adaptive codebook signal and the gain of the noise signal, depending on whether the input signal is voiced or unvoiced, to send the gain-adjusted signal to the gain circuit 340.
- the adder 190 sums the output signal y(n) of the synthesis filter circuit 170 to the output signal s(n) of the sampling frequency converting circuit 180 to form and output an ultimately band extended signal.
- an adaptive codebook signal is generated, using a delay calculated from the narrow-band input signal, based on the past sound source signal of high frequency portion, and are then multiplied with a proper gain.
- the resulting signal is then summed to e.g. a noise signal, whereby a band extended signal with superior sound quality may be generated for e.g. a vowel in case periodicity is needed for a high frequency portion.
- a pitch generating circuit 115 may be provided in place of the adaptive codebook circuit 110, as shown in Fig.6 .
- the pitch generating circuit 115 calculates a pitch period from an input signal and generates a periodic signal based on the pitch period to output the so generated pitch signal to the gain circuit 340. Except for the pitch generating circuit 115, the modification is the same in the configuration as the above-described second embodiment, not belonging to the present invention.
- Fig.3 shows the configuration of a first embodiment of the present invention.
- the band extending apparatus of the third embodiment includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 300, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200, a gain adjustment circuit 310, and a pitch pre-filter 400.
- the same reference numerals are used to depict the parts or components which are the same as those shown in Figs.1 and 2 . In the following, only the points of difference from the second embodiment not belonging to the present invention are explained, whilst the same parts or components as those of Fig.2 are sometimes not explained.
- the gain circuit 300 is supplied with the gain from the gain adjustment circuit 310 and multiplies the output signals of the adaptive codebook circuit 110 and the noise generating circuit 120 with the gain. The resulting two signals are summed together and the resulting sum signal is output to the pitch pre-filter 400.
- An output of the pitch pre-filter 400 is also supplied to the adaptive codebook circuit 110.
- the synthesis filter circuit 170 is supplied with an output signal of the pitch pre-filter 400 and with coefficients of a predetermined number of orders from the coefficient calculating circuit 130 to form a filter, and outputs a signal y(n) of a high frequency range needed for band extension.
- a pitch generating circuit may, of course, be used in place of the adaptive codebook circuit 110.
- Fig.4 shows the configuration of a second embodiment of the present invention.
- the band extending apparatus of the second embodiment includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 340, an adder 160, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200, a gain adjustment circuit 310, and a low-pass filter circuit 500.
- the same reference numerals are used to depict the parts or components which are the same as those shown in Fig.2 .
- the low-pass filter 500 is added to the configuration of the above-described second embodiment not belonging to the present invention shown in Fig.2 .
- the same parts or components as those of Fig.2 are explained only as necessary.
- the cut-off frequency of the low-pass filter 500 may be predetermined to, for example, 6 kHz.
- h(n) denotes the impulse response of a low-pass filter
- a symbol "*" denotes the operation of convolution.
- a pitch generating circuit may be used in place of the adaptive codebook circuit 110, by way of a modification of the present second embodiment, as in the modification of the second embodiment described above.
- Fig.5 shows the configuration of a third embodiment of the present invention.
- the band extending apparatus of the third embodiment includes a spectral parameter calculating circuit 100, an adaptive codebook circuit 110, a noise generating circuit 120, a coefficient calculating circuit 130, a gain circuit 300, a synthesis filter circuit 170, a sampling frequency converting circuit 180, an adder 190, a voiced/unvoiced discriminating circuit 200, a gain adjustment circuit 310, a pitch pre-filter 400, and a post-filter 600.
- the same reference numerals are used to depict the same parts or components as those shown in Fig.3 .
- the third embodiment of the present invention includes the post-filter 600 in addition to the configuration of the above-described first embodiment.
- the post-filter 600 in addition to the configuration of the above-described first embodiment.
- the post-filter 600 is supplied from the coefficient calculating circuit 130 with coefficients (filter coefficients), which then are weighted.
- a pitch generating circuit may also be used in place of the codebook circuit 110, by way of a modification of the second embodiment, as in the modification of the second embodiment not belonging to the present invention described above.
Abstract
Description
- This invention relates to a method and an apparatus for extending the band, according to which a narrow-band signal is entered as input signal and a band extended signal having enlarged frequency range of the input signal is output to improve the acoustic sound quality.
- There has been known a system in which the frequency range of a speech signal, encoded at a low bit rate and reproduced, is extended on the receiving side without the transmitting side having to send the auxiliary information for band extension (for example, see Non-Patent Publication 1).
- Non-Patent Publication 1:
- P. Jax, P.Vary, "Wideband extension of telephone speech using hidden Markov model", Proc. IEEE Speech Coding Workshop, pp.133-135, 2000.
- With this state-of-the-art system, filter coefficients after band extension using HMM (Hidden Markov Model) are retrieved on the receiving side.
- On the other hand, the processing for directly extending the band of the narrow-band input signal is unprecedented.
- In the state-of-the-art method, shown in the Publication 1, in which modeling by HMM of filter coefficients or the broadband spectral envelope of speech is required, the following problem arises. That is, HMM model parameters need to be determined off-line at the outset from a voluminous speech database in a manner which entails prolonged computing time and increased cost. In addition, retrieval by an HMM model is needed for the receiving side to carry out band extension processing in real time, for which a large volume of calculations are required.
- A further state of the art according to Article 54(3) (EPC) has been disclosed in
EP-A 1 420 389 . An addition,US 5455888 andWO 01/35395 - Accordingly, it is an object of the present invention to overcome the aforementioned problem and to provide a method and an apparatus for directly extending the frequency range of a narrow-band input signal. It is another object of the present invention to provide a method and an apparatus for extending the frequency range whereby the band-extended speech of optimum sound quality may be obtained with computational complexity less than that of the state-of-the-art system.
- According to the present invention this object is solved by a band extending apparatus according to claim 1 and a band extending method according to claim 4.
- Further advantageous features of the band extending apparatus and the band extending method are indicated in the dependent claims.
- The present invention has such meritorious effect that a band extended signal (e.g. 7 kHz band signal) may be generated by generating a high frequency signal with processing for a narrow-band input signal (e.g. 4 kHz band signal) and by summing the resulting high frequency signal to a signal corresponding to the narrow-band input signal having its sampling frequency changed.
- The present invention has such meritorious effect that a band extended signal with optimum sound quality may be generated in case periodicity is required for a high frequency part of the signal, such as a vowel, by generating an adaptive codebook signal, using a delay calculated from the narrow-band input signal, and by multiplying the so generated adaptive codebook signal with a gain and by summing the resulting signal to a noise signal.
- The present invention also has such meritorious effect that a band extended signal for higher sound quality may be generated by employing a pitch pre-filter for a sound source signal, using the delay, or by weighting the coefficients from the coefficient calculating circuit for use for the post-filter.
-
-
Fig.1 is a diagram showing a configuration of a first embodiment not belonging to the present invention. -
Fig.2 is a diagram showing a configuration of a second embodiment not belonging to the present invention. -
Fig.3 is a diagram showing a configuration of a first embodiment of the present invention. -
Fig.4 is a diagram showing a configuration of a second embodiment of the present invention. -
Fig.5 is a diagram showing a configuration of a third embodiment of the present invention. -
FIG. 6 is a diagram showing a modification of the second embodiment not belonging to the present invention. - For more detailed explanation of the present invention, preferred embodiments of the present invention will be explained with reference to the drawings. It is presupposed in the following that a narrow-band input signal of a 4 kHz range is extended in band to a 5 kHz band or to a 7 kHz band.
-
Fig.1 shows the configuration of a first embodiment not belonging to the present invention. Referring toFig.1 , a band extension apparatus includes a spectralparameter calculating circuit 100, anoise generating circuit 120, acoefficient calculating circuit 130, again circuit 140, asynthesis filter circuit 170, a samplingfrequency converting circuit 180, anadder 190, a voiced/unvoiceddiscriminating circuit 200 and again adjustment circuit 210. - In the band extending apparatus, supplied with a narrow-band input signal x(n), the spectral
parameter calculating circuit 100 divides the input signal into plural frames, each being e.g. of 10 ms, and calculates spectral parameters of a predetermined number of orders P from frame to frame. It is noted that the spectral parameters represent parameters showing the outline shape of spectrum of a speech signal in terms of a frame as a unit. For the calculation, LPC analysis, as known per se, for example, is used. The spectralparameter calculating circuit 100 also converts the linear prediction coefficients α i (i = 1, ...P), calculated by the LPC analysis, into LPC parameters suitable for quantization or interpolation, to output the so formed LPC parameters. For converting the linear prediction coefficients into LSP, reference is made e.g. to the following treatises (for example see Non-Patent Publication 2): - Non-Patent Publication 2:
- Sugamura and Itakura: "Speech Information Compression by Voice Analysis Synthesis System", Extended Abstract to Society of Electronic Communication, J64-A, pp.599 t- 606, 1981
- The
coefficient calculating circuit 130 is supplied with the spectral parameters and converts the parameters into coefficients of the band extended signal. For this conversion, well-known techniques, such as a technique for simply shifting the LSP frequency to a higher frequency, a technique for non-linear conversion or a technique for linear conversion, may be used. Here, the frequency band in which the LSPs are present is shifted to a higher frequency range, using all or part of the LSP parameters, for conversion to order-P linear prediction coefficients, which order-P linear prediction coefficients are then output to thesynthesis filter circuit 170. - The
noise generating circuit 120 generates a band-limited noise signal, having an average amplitude value normalized to a predetermined level, for a time duration equal to the frame duration, and outputs the so generated noise signal to thegain circuit 140. As the noise signal, the white noise is here used. However, other noise signal may also be used. - The voiced/unvoiced
discriminating circuit 200 is supplied with the narrow-band input signal x(n) to verify whether the frame-based signal is voiced or unvoiced. For verifying whether the frame-based signal is voiced or unvoiced, a normalized autocorrelation function D(T) up to a predetermined delay time m is derived for the narrow-band input signal x(n) in accordance with the equation (1):
and a maximum value of D(T) is found. If the maximum value of D(T) is larger than a predetermined threshold value, the input signal is determined to be voiced. If otherwise, the input signal is determined to be unvoiced. - The voiced/unvoiced
discriminating circuit 200 outputs the voiced/unvoiced discrimination information to thegain adjustment circuit 210. In the above equation (1), N denotes the number of samples for calculating the normalized autocorrelation. - The
gain adjustment circuit 210 is supplied with the voiced/unvoiced discrimination information from the voiced/unvoiceddiscriminating circuit 200 and adjusts the gain to be imparted to the noise signal depending on whether the input signal is voiced or unvoiced, to output the so adjusted gain to thegain circuit 140. - The
gain circuit 140 is supplied with the gain from the voiced/unvoiceddiscriminating circuit 200 and multiplies the output signal of thenoise generating circuit 120 with the gain to output the resulting signal to thesynthesis filter circuit 170. - The
synthesis filter circuit 170 is supplied with the output signal of thegain circuit 140 and with coefficients of a predetermined number of orders, from thecoefficient calculating circuit 130, to form a filter, and outputs a high frequency range signal y(n) needed for band extension. - The sampling
frequency converting circuit 180 up-samples the narrow-band input signal x(n) to a predetermined sampling frequency to output the resulting up-sampled signal. - The
adder 190 sums an output signal y(n) of thesynthesis filter circuit 170 and an output signal s(n) of the samplingfrequency converting circuit 180 to each other to form and output an ultimately band extended signal. - The above completes the explanation of the first embodiment.
-
Fig.2 shows the configuration of a second embodiment not belonging to the present invention. Referring toFig.2 , the band extending apparatus includes a spectralparameter calculating circuit 100, anadaptive codebook circuit 110, anoise generating circuit 120, acoefficient calculating circuit 130, again circuit 340, asynthesis filter circuit 170, a samplingfrequency converting circuit 180,adders discriminating circuit 200, and again adjustment circuit 310. InFig.2 , the same reference numerals are used to depict the same parts or components as those shown inFig.1 . In the following, only the points of difference fromFig.1 are explained, whilst the same parts or components as those ofFig.1 are sometimes not explained. The present second embodiment of the present invention includes theadaptive codebook circuit 110 and theadder 160, in addition to the components ofFig.1 . - The voiced/unvoiced
discriminating circuit 200 is supplied with the narrow-band input signal x(n) to verify whether a frame-based signal is voiced or unvoiced. For verifying whether the frame-based signal is voiced or unvoiced, a normalized autocorrelation function D(T) up to the predetermined delay time m is derived for the narrow-band input signal x(n) in accordance with the equation (1), and a maximum value of D(T) is found. If the maximum value of D(T) is larger than a predetermined threshold value, the input signal is determined to be voiced. If otherwise, the input signal is determined to be unvoiced. - For the voiced frame, the voiced/unvoiced
discriminating circuit 200 sends the value of T, maximizing the normalized autocorrelation function D(T), as a pitch period T to theadaptive codebook circuit 110. - The
adaptive codebook circuit 110 is supplied from the voiced/unvoiceddiscriminating circuit 200 with the delay T of the adaptive codebook and, based on the past sound source signal v(n), generates an adaptive code vector p(n), in accordance with the following equation (2):
and outputs the so generated vector to thegain circuit 340. - The
gain circuit 340 is supplied from thegain adjustment circuit 310 with a gain which is then multiplied with an output signal of at least one of theadaptive codebook circuit 110 and thenoise generating circuit 120. The resulting signal is output to theadder 160. - The
adder 160 sums the two signals, output from thegain circuit 340, and outputs the resulting sum signal to thesynthesis filter circuit 170 and to theadaptive codebook circuit 110. - The
synthesis filter circuit 170 is supplied with an output signal (sound source signal) of theadder 160 and with a filter coefficient of a predetermined number of orders from thecoefficient calculating circuit 130 to form a synthesis filter, and outputs a signal y(n) of a high frequency range needed for band extension. - The
gain adjustment circuit 310 is supplied with the voiced/unvoiced discrimination information from the voiced/unvoiceddiscriminating circuit 200, and adjusts the gain of the adaptive codebook signal and the gain of the noise signal, depending on whether the input signal is voiced or unvoiced, to send the gain-adjusted signal to thegain circuit 340. - The
adder 190 sums the output signal y(n) of thesynthesis filter circuit 170 to the output signal s(n) of the samplingfrequency converting circuit 180 to form and output an ultimately band extended signal. - With the second embodiment of not belonging to the present invention, an adaptive codebook signal is generated, using a delay calculated from the narrow-band input signal, based on the past sound source signal of high frequency portion, and are then multiplied with a proper gain. The resulting signal is then summed to e.g. a noise signal, whereby a band extended signal with superior sound quality may be generated for e.g. a vowel in case periodicity is needed for a high frequency portion. The above completes explanation of the second embodiment. As a modification of the second embodiment not belonging to the present invention, a
pitch generating circuit 115 may be provided in place of theadaptive codebook circuit 110, as shown inFig.6 . Thepitch generating circuit 115 calculates a pitch period from an input signal and generates a periodic signal based on the pitch period to output the so generated pitch signal to thegain circuit 340. Except for thepitch generating circuit 115, the modification is the same in the configuration as the above-described second embodiment, not belonging to the present invention. -
Fig.3 shows the configuration of a first embodiment of the present invention. Referring toFig.3 , the band extending apparatus of the third embodiment includes a spectralparameter calculating circuit 100, anadaptive codebook circuit 110, anoise generating circuit 120, acoefficient calculating circuit 130, again circuit 300, asynthesis filter circuit 170, a samplingfrequency converting circuit 180, anadder 190, a voiced/unvoiceddiscriminating circuit 200, again adjustment circuit 310, and apitch pre-filter 400. InFig.3 , the same reference numerals are used to depict the parts or components which are the same as those shown inFigs.1 and2 . In the following, only the points of difference from the second embodiment not belonging to the present invention are explained, whilst the same parts or components as those ofFig.2 are sometimes not explained. - The
gain circuit 300 is supplied with the gain from thegain adjustment circuit 310 and multiplies the output signals of theadaptive codebook circuit 110 and thenoise generating circuit 120 with the gain. The resulting two signals are summed together and the resulting sum signal is output to thepitch pre-filter 400. -
- An output of the
pitch pre-filter 400 is also supplied to theadaptive codebook circuit 110. - The
synthesis filter circuit 170 is supplied with an output signal of thepitch pre-filter 400 and with coefficients of a predetermined number of orders from thecoefficient calculating circuit 130 to form a filter, and outputs a signal y(n) of a high frequency range needed for band extension. - By employing the
pitch pre-filter 400 for pre-filtering the sound source signal, using the delay, a band extended signal of superior sound quality may be produced. The above completes the explanation of the third embodiment. In the present embodiment, as in the modification of the second embodiment, not belonging to the invention, a pitch generating circuit may, of course, be used in place of theadaptive codebook circuit 110. -
Fig.4 shows the configuration of a second embodiment of the present invention. Referring toFig.4 , the band extending apparatus of the second embodiment includes a spectralparameter calculating circuit 100, anadaptive codebook circuit 110, anoise generating circuit 120, acoefficient calculating circuit 130, again circuit 340, anadder 160, asynthesis filter circuit 170, a samplingfrequency converting circuit 180, anadder 190, a voiced/unvoiceddiscriminating circuit 200, again adjustment circuit 310, and a low-pass filter circuit 500. InFig.4 , the same reference numerals are used to depict the parts or components which are the same as those shown inFig.2 . In the second embodiment, the low-pass filter 500 is added to the configuration of the above-described second embodiment not belonging to the present invention shown inFig.2 . In the following, only the points of difference from this second embodiment are explained, whilst the same parts or components as those ofFig.2 are explained only as necessary. - The low-
pass filter 500 filters the output signal of theadaptive codebook circuit 110 in accordance with the equation:
to permit a signal with a frequency not higher than a predetermined cut-off frequency to pass therethrough to thegain circuit 340. The cut-off frequency of the low-pass filter 500 may be predetermined to, for example, 6 kHz. Meanwhile, inFig.4 , h(n) denotes the impulse response of a low-pass filter, and a symbol "*" denotes the operation of convolution. - The foregoing completes the explanation of the second embodiment of the present invention. Meanwhile, a pitch generating circuit may be used in place of the
adaptive codebook circuit 110, by way of a modification of the present second embodiment, as in the modification of the second embodiment described above. -
Fig.5 shows the configuration of a third embodiment of the present invention. Referring toFig.5 , the band extending apparatus of the third embodiment includes a spectralparameter calculating circuit 100, anadaptive codebook circuit 110, anoise generating circuit 120, acoefficient calculating circuit 130, again circuit 300, asynthesis filter circuit 170, a samplingfrequency converting circuit 180, anadder 190, a voiced/unvoiceddiscriminating circuit 200, again adjustment circuit 310, apitch pre-filter 400, and a post-filter 600. InFig.5 , the same reference numerals are used to depict the same parts or components as those shown inFig.3 . The third embodiment of the present invention includes the post-filter 600 in addition to the configuration of the above-described first embodiment. In the following, only the points of difference from the first embodiment are explained, whilst the same parts or components as those ofFig.3 are explained only as necessary. -
- By employing the post-filter 600, it is possible to generate a band extended signal of superior quality. The above completes the explanation of the third embodiment. It is noted that a pitch generating circuit may also be used in place of the
codebook circuit 110, by way of a modification of the second embodiment, as in the modification of the second embodiment not belonging to the present invention described above. - The configurations of the above-described embodiments may also be combined together, such as by employing the post-filter, explained in the third embodiment, for the above-described first embodiment. In the present invention, plural sorts of the preset frequency band signal (narrow-band signal) may be input, in place of only one sort of the signals. Although the present invention has been explained with reference to the above specific embodiments, it is to be noted that the present invention may encompass various modifications or corrections that may be occur to those skilled in the art within the scope of the invention as defined in the claims.
Claims (6)
- A band extending apparatus receiving at least an input signal of a preset frequency band to output a band extended signal corresponding to said input signal extended in a frequency band thereof,
said apparatus comprising:(A) a spectral parameter calculating unit (100), adapted to receive at least an input signal of a preset frequency band to calculate spectral parameters representing spectral characteristics;(B) a coefficient calculating unit (130) adapted to shift the frequency of said spectral parameters to then calculate filter coefficients;(C) a voiced/unvoiced discriminating circuit (200), adapted to supply for a voiced frame, a preset delay derived from a voiced/unvoiced decision, as a pitch period to an adaptive codebook circuit (110);(D) said adaptive codebook circuit (110), being adapted to receive the delay from said voiced/unvoiced discriminating circuit, as a delay of said adaptive codebook, to generate an adaptive codebook signal based on the past sound source signal outputted from a synthesis filter circuit (170) and to output the adaptive codebook signal generated;(E) a noise generating circuit (120) adapted to generate a noise signal;(F) a gain adjustment circuit (310), adapted to receive voiced/unvoiced discrimination information output from said voiced/unvoiced discrimination circuit (200), to output a resulting gain adjustment signal for adjusting the gain of the output signal of said adaptive codebook circuit (110) and the gain of the output signal of noise generating unit (120), depending on whether said voiced/unvoiced discrimination information indicates voiced or unvoiced;(G) a gain circuit (300) adapted to receive said gain adjustment signal from said gain adjustment circuit (310) to multiply the output signal of said adaptive codebook circuit (110) and the output signal of said noise generating circuit (120) with said gain adjustment signal to output two signals that are summed together to form a resulting sum signal;(H) a pitch pre-filter (400) adapted to filter said resulting sum signal from said gain circuit (300), using said pitch period supplied from the voiced/unvoiced discriminating circuit (200) and to supply the output signal to the adaptive codebook circuit (110) and to the synthesis filter circuit (170);(I) said synthesis filter circuit (170) being adapted to pass the output signal of said pitch pre-filter (400) through a synthesis filter, formed using said filter coefficients, to reproduce a signal for band extension; and(J) an adder (190) adapted to add a signal corresponding to said input signal converted in a sampling frequency thereof to an output signal of said synthesis filter circuit (170) to generate the band extended signal. - The band extending apparatus as defined in claim 1, further comprising:a low-pass filter (500), receiving an output signal of said adaptive codebook unit (110) as an input.
- The band extending apparatus as defined in any one of claims 1 or 2, wherein
a post-filter (600) is formed using weighting coefficients as weighted version of filter coefficients output from said coefficient calculating unit (130), and wherein an output signal of said synthesis filter unit (170) is passed through said post-filter (600) to reproduce the signal for band extension. - A band extending method for receiving at least an input signal of a preset frequency band to output a band extended signal corresponding to said input signal extended in a frequency band thereof,
said method comprising:(A) a spectral parameter-calculating step for receiving at least an input signal of a preset frequency band to calculate spectral parameters representing spectral characteristics;(B) a coefficient-calculating step for shifting the frequency of said spectral parameters to then calculate filter coefficients;(C) a voiced/unvoiced-discriminating step for supplying, for a voiced frame, a preset delay derived from a voiced/unvoiced discrimination, as a pitch period to an adaptive codebook generating step;(D) said adaptive generating step receiving the delay from said voiced/ unvoiced discriminating step, as a delay of said adaptive codebook, generating an adaptive codebook signal based on the past sound source signal outputted from a synthesis filtering step and outputting the adaptive codebook signal generated;(E) a step of generating a noise signal;(F) a gain-adjusting step for receiving voiced/unvoiced discrimination information resulting from said voiced/unvoiced discrimination, to output a resulting gain adjustment signal for adjusting the gain of said adaptive codebook signal and the gain of the generated noise signal, depending on whether said voiced/unvoiced discrimination information indicates voiced or unvoiced;(G) a gain-multiplying step for receiving said gain adjustment signal from said gain adjusting step and multiplying the output signal of said adaptive codebook and the generated noise signal with said gain adjustment signal to output two signals that are summed together to form a resulting sum signal;(H) a pitch pre-filtering step for filtering said resulting sum signal from said gain multiplying step, using said pitch period, supplying the output signal of said pitch pre-filtering step to the adaptive codebook generating step and to the synthesis filtering step;(I) said synthesis filtering step passing an output signal of said pitch pre-filtering step through a synthesis filter, formed using said filter coefficients, to reproduce a signal for band extension; and(J) a step for adding a signal corresponding to said input signal converted in a sampling frequency thereof to an output signal of said synthesis filtering step to generate the band extended signal. - The band extending method as defined in claim 4, further comprising:processing said adaptive codebook signal with a low-pass filter (500) to allow frequency components not higher than a preset cut-off frequency to pass there through.
- The band extending method as defined in any one of claims 4 or 5, comprising:passing an output signal of said synthesis filtering step through a post-filter (600) formed using weighting coefficients corresponding to weighted version of said filter coefficients to reproduce the signal for band extension.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2002317203 | 2002-10-31 | ||
JP2002317203A JP4433668B2 (en) | 2002-10-31 | 2002-10-31 | Bandwidth expansion apparatus and method |
PCT/JP2003/013231 WO2004040553A1 (en) | 2002-10-31 | 2003-10-16 | Bandwidth expanding device and method |
Publications (3)
Publication Number | Publication Date |
---|---|
EP1557825A1 EP1557825A1 (en) | 2005-07-27 |
EP1557825A4 EP1557825A4 (en) | 2006-01-18 |
EP1557825B1 true EP1557825B1 (en) | 2010-12-22 |
Family
ID=32211713
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP03756637A Expired - Lifetime EP1557825B1 (en) | 2002-10-31 | 2003-10-16 | Bandwidth expanding device and method |
Country Status (9)
Country | Link |
---|---|
US (1) | US7684979B2 (en) |
EP (1) | EP1557825B1 (en) |
JP (1) | JP4433668B2 (en) |
KR (1) | KR100715013B1 (en) |
CN (1) | CN1708785B (en) |
AU (1) | AU2003301711A1 (en) |
CA (1) | CA2504175A1 (en) |
DE (1) | DE60335486D1 (en) |
WO (1) | WO2004040553A1 (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1482482A1 (en) * | 2003-05-27 | 2004-12-01 | Siemens Aktiengesellschaft | Frequency expansion for Synthesiser |
US8712768B2 (en) * | 2004-05-25 | 2014-04-29 | Nokia Corporation | System and method for enhanced artificial bandwidth expansion |
CN101023472B (en) * | 2004-09-06 | 2010-06-23 | 松下电器产业株式会社 | Scalable encoding device and scalable encoding method |
JP5063364B2 (en) * | 2005-02-10 | 2012-10-31 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | Speech synthesis method |
KR101414375B1 (en) | 2008-06-13 | 2014-07-04 | 삼성전자주식회사 | Apparatus and method for encoding/decoding using bandwidth extension |
Family Cites Families (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS61107400A (en) * | 1984-10-31 | 1986-05-26 | 日本電気株式会社 | Voice synthesizer |
JPS63217732A (en) | 1987-03-05 | 1988-09-09 | Kokusai Electric Co Ltd | Coding transmission system for voice signal |
JP3088121B2 (en) * | 1991-04-12 | 2000-09-18 | 沖電気工業株式会社 | Statistical excitation code vector optimization method |
US5734789A (en) * | 1992-06-01 | 1998-03-31 | Hughes Electronics | Voiced, unvoiced or noise modes in a CELP vocoder |
US5455888A (en) * | 1992-12-04 | 1995-10-03 | Northern Telecom Limited | Speech bandwidth extension method and apparatus |
JP3297156B2 (en) * | 1993-08-17 | 2002-07-02 | 三菱電機株式会社 | Voice discrimination device |
JP3483958B2 (en) * | 1994-10-28 | 2004-01-06 | 三菱電機株式会社 | Broadband audio restoration apparatus, wideband audio restoration method, audio transmission system, and audio transmission method |
JP3328080B2 (en) * | 1994-11-22 | 2002-09-24 | 沖電気工業株式会社 | Code-excited linear predictive decoder |
JP3189614B2 (en) * | 1995-03-13 | 2001-07-16 | 松下電器産業株式会社 | Voice band expansion device |
EP0732687B2 (en) * | 1995-03-13 | 2005-10-12 | Matsushita Electric Industrial Co., Ltd. | Apparatus for expanding speech bandwidth |
US5699485A (en) | 1995-06-07 | 1997-12-16 | Lucent Technologies Inc. | Pitch delay modification during frame erasures |
JPH0955778A (en) * | 1995-08-15 | 1997-02-25 | Fujitsu Ltd | Bandwidth widening device for sound signal |
JPH09127985A (en) | 1995-10-26 | 1997-05-16 | Sony Corp | Signal coding method and device therefor |
EP0788091A3 (en) | 1996-01-31 | 1999-02-24 | Kabushiki Kaisha Toshiba | Speech encoding and decoding method and apparatus therefor |
JP3350340B2 (en) * | 1996-03-29 | 2002-11-25 | 株式会社東芝 | Voice coding method and voice decoding method |
EP0945852A1 (en) * | 1998-03-25 | 1999-09-29 | BRITISH TELECOMMUNICATIONS public limited company | Speech synthesis |
TW376611B (en) | 1998-05-26 | 1999-12-11 | Koninkl Philips Electronics Nv | Transmission system with improved speech encoder |
JP3502268B2 (en) | 1998-06-16 | 2004-03-02 | ヤマハ株式会社 | Audio signal processing device and audio signal processing method |
JP3540159B2 (en) | 1998-06-18 | 2004-07-07 | ヤマハ株式会社 | Voice conversion device and voice conversion method |
CA2252170A1 (en) | 1998-10-27 | 2000-04-27 | Bruno Bessette | A method and device for high quality coding of wideband speech and audio signals |
JP2000267700A (en) * | 1999-03-17 | 2000-09-29 | Yrp Kokino Idotai Tsushin Kenkyusho:Kk | Method and device for encoding and decoding voice |
US6377915B1 (en) * | 1999-03-17 | 2002-04-23 | Yrp Advanced Mobile Communication Systems Research Laboratories Co., Ltd. | Speech decoding using mix ratio table |
JP3583945B2 (en) * | 1999-04-15 | 2004-11-04 | 日本電信電話株式会社 | Audio coding method |
JP4464488B2 (en) | 1999-06-30 | 2010-05-19 | パナソニック株式会社 | Speech decoding apparatus, code error compensation method, speech decoding method |
EP1147515A1 (en) * | 1999-11-10 | 2001-10-24 | Koninklijke Philips Electronics N.V. | Wide band speech synthesis by means of a mapping matrix |
JP2002055699A (en) | 2000-08-10 | 2002-02-20 | Mitsubishi Electric Corp | Device and method for encoding voice |
DE10041512B4 (en) * | 2000-08-24 | 2005-05-04 | Infineon Technologies Ag | Method and device for artificially expanding the bandwidth of speech signals |
JP3462464B2 (en) * | 2000-10-20 | 2003-11-05 | 株式会社東芝 | Audio encoding method, audio decoding method, and electronic device |
US6889182B2 (en) * | 2001-01-12 | 2005-05-03 | Telefonaktiebolaget L M Ericsson (Publ) | Speech bandwidth extension |
JP2003044098A (en) * | 2001-07-26 | 2003-02-14 | Nec Corp | Device and method for expanding voice band |
US6895375B2 (en) * | 2001-10-04 | 2005-05-17 | At&T Corp. | System for bandwidth extension of Narrow-band speech |
CN1282156C (en) * | 2001-11-23 | 2006-10-25 | 皇家飞利浦电子股份有限公司 | Audio signal bandwidth extension |
-
2002
- 2002-10-31 JP JP2002317203A patent/JP4433668B2/en not_active Expired - Lifetime
-
2003
- 2003-10-16 CA CA002504175A patent/CA2504175A1/en not_active Abandoned
- 2003-10-16 AU AU2003301711A patent/AU2003301711A1/en not_active Abandoned
- 2003-10-16 EP EP03756637A patent/EP1557825B1/en not_active Expired - Lifetime
- 2003-10-16 CN CN200380102290.0A patent/CN1708785B/en not_active Expired - Lifetime
- 2003-10-16 KR KR1020057007431A patent/KR100715013B1/en active IP Right Grant
- 2003-10-16 WO PCT/JP2003/013231 patent/WO2004040553A1/en active Application Filing
- 2003-10-16 DE DE60335486T patent/DE60335486D1/en not_active Expired - Lifetime
-
2005
- 2005-05-02 US US11/118,337 patent/US7684979B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
EP1557825A4 (en) | 2006-01-18 |
KR20050062643A (en) | 2005-06-23 |
WO2004040553A1 (en) | 2004-05-13 |
CN1708785B (en) | 2010-05-12 |
KR100715013B1 (en) | 2007-05-09 |
EP1557825A1 (en) | 2005-07-27 |
DE60335486D1 (en) | 2011-02-03 |
AU2003301711A1 (en) | 2004-05-25 |
CN1708785A (en) | 2005-12-14 |
CA2504175A1 (en) | 2004-05-13 |
JP2004151423A (en) | 2004-05-27 |
US20050256709A1 (en) | 2005-11-17 |
JP4433668B2 (en) | 2010-03-17 |
US7684979B2 (en) | 2010-03-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7454330B1 (en) | Method and apparatus for speech encoding and decoding by sinusoidal analysis and waveform encoding with phase reproducibility | |
US7013270B2 (en) | Determining linear predictive coding filter parameters for encoding a voice signal | |
EP0673013A1 (en) | Signal encoding and decoding system | |
JPH10124088A (en) | Device and method for expanding voice frequency band width | |
WO2003010752A1 (en) | Speech bandwidth extension apparatus and speech bandwidth extension method | |
US7486719B2 (en) | Transcoder and code conversion method | |
AU669788B2 (en) | Method for generating a spectral noise weighting filter for use in a speech coder | |
US7684979B2 (en) | Band extending apparatus and method | |
JPH10124089A (en) | Processor and method for speech signal processing and device and method for expanding voice bandwidth | |
EP1564723B1 (en) | Transcoder and coder conversion method | |
JPH08305396A (en) | Device and method for expanding voice band | |
JPH0782360B2 (en) | Speech analysis and synthesis method | |
EP1204094B1 (en) | Excitation signal low pass filtering for speech coding | |
JP3583945B2 (en) | Audio coding method | |
JP3481027B2 (en) | Audio coding device | |
US6983241B2 (en) | Method and apparatus for performing harmonic noise weighting in digital speech coders | |
JP3578933B2 (en) | Method of creating weight codebook, method of setting initial value of MA prediction coefficient during learning at the time of codebook design, method of encoding audio signal, method of decoding the same, and computer-readable storage medium storing encoding program And computer-readable storage medium storing decryption program | |
JP2583883B2 (en) | Speech analyzer and speech synthesizer | |
JP3192051B2 (en) | Audio coding device | |
JP3199128B2 (en) | Audio encoding method | |
JPS61128299A (en) | Voice analysis/analytic synthesization system | |
JPH11184499A (en) | Voice encoding method and voice encoding method | |
Yim | AutoRegressive Moving Average modelling in low bit rate speech coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20050513 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK |
|
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1075735 Country of ref document: HK |
|
A4 | Supplementary search report drawn up and despatched |
Effective date: 20051206 |
|
DAX | Request for extension of the european patent (deleted) | ||
RBV | Designated contracting states (corrected) |
Designated state(s): DE FR GB |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE FR GB |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REF | Corresponds to: |
Ref document number: 60335486 Country of ref document: DE Date of ref document: 20110203 Kind code of ref document: P |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 60335486 Country of ref document: DE Effective date: 20110203 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20110923 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 60335486 Country of ref document: DE Effective date: 20110923 |
|
REG | Reference to a national code |
Ref country code: HK Ref legal event code: WD Ref document number: 1075735 Country of ref document: HK |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 14 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 15 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 16 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20221028 Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20221019 Year of fee payment: 20 Ref country code: DE Payment date: 20221019 Year of fee payment: 20 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R071 Ref document number: 60335486 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: PE20 Expiry date: 20231015 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION Effective date: 20231015 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION Effective date: 20231015 |