CN101458930B

CN101458930B - Excitation signal generation in bandwidth spreading and signal reconstruction method and apparatus

Info

Publication number: CN101458930B
Application number: CN200710198774XA
Authority: CN
Inventors: 胡瑞敏; 张勇; 谢昭; 王晓晨; 肖玮; 马付伟; 王庭红
Original assignee: Huawei Technologies Co Ltd; Wuhan University WHU
Current assignee: Huawei Technologies Co Ltd; Wuhan University WHU
Priority date: 2007-12-12
Filing date: 2007-12-12
Publication date: 2011-09-14
Anticipated expiration: 2027-12-12
Also published as: WO2009076871A1; CN101458930A

Abstract

The invention discloses a method for generating exciting signals in bandwidth extension, which processes narrow-band low frequency signals via frequency spectrum folding and synthesis to generate required high frequency exciting signals. The invention further provides a reconstruction method for the high frequency signals in bandwidth extension and a device thereof. The technical scheme utilizes low frequency signals to generate high frequency signals, and is based on the harmonic characteristic of low and high frequency spectrums of the signals, therefore, the method can extend speech and music signals effectively, and the frequency spectrum folding method can confirm the signal frequency spectrum continuity of high and low frequencies at joint part. Tests prove that the technical scheme is suitable for extending ultra wideband signals of 7 to 14 kHz.

Description

The generation and the signal reconstruction method and apparatus of pumping signal in the bandwidth expansion

Technical field

The present invention relates to the bandwidth expansion technique field, be specifically related to the generation method of pumping signal in the bandwidth expansion and the method for reconstructing and the corresponding device thereof of high-frequency signal, the present invention is specially adapted to the wide expansion of ultrabroad band.

Background technology

Bandwidth expansion (BWE:BandWidth Extension) technology is a kind of by the suitable parameter model of selection, and the signal extension that frequency band range is narrower is to the wideer technology of frequency band range, thus raising sensing audio quality of signals.

Usually under the limited condition of encoder bit rate, for example move and network environment in, based on people's ear for sharper this auditory properties of low frequency signal, in order to obtain the effect of encoding preferably, generally most available bits can be distributed to low frequency signal, but, therefore still wish at decoding end reconstruction high-frequency signal as well as possible owing to the subjective impression of radio-frequency component to sound quality still plays an important role.A kind of (0～3.4kHz) expands to broadband voice (0～7kHz) method (ITU-T from narrowband speech at present employed below, G.729.1) be introduced, it adopts the mode of time domain bandwidth expansion (TDBWE:Time Domain BandWidth Extension), and concrete scheme comprises:

One, coding side

1. pre-service

To input carry out spectrum folding with the 16kHz sampling rate high-frequency signal that obtains of sampling, that is, 4～8kHz frequency range of the HFS of input signal is folded to 0～4kHz part, 160 time domain sampling points that this process is equivalent to HFS all multiply by (1) ⁿSignal after will folding again is by 3/4 low-pass filter, the frequency range of its 3～4kHz of filtering, promptly corresponding to the part of 7～8kHz in the former frequency range, through pretreated signal be S ' (n), n=0 ..., 159.

2. time domain spectrum envelope parameters extraction

The S ' of 20ms is the frame segment that to be subdivided into 16 length be 1.25ms (n), and each fragment comprises 10 sampled points.Per 10 sampling points are carried out time domain spectrum envelope CALCULATION OF PARAMETERS one time, and computing formula is as follows:

T_{env} (i) = \frac{1}{2} \log_{2} {Σ_{n = 0}^{9} {[S^{'} (n + i \times 10)]}^{2}}, i = 0, . . ., 15,

Obtain 16 time domain spectrum envelope parameter T altogether _Env(i).

3. the extraction of frequency domain spectra envelope parameters

In G.729.1, coding side only carries out the extraction of frequency domain parameter to the back 10ms subframe (80 sampling points) of 20ms frame, the frequency domain parameter of 10ms subframe before being obtained by interpolation by decoding end.When calculating frequency domain parameter, to S ' (n) behind the frame sequence of 10ms subframe add 128 Hanning window w _F, this window is made of back 56 of preceding 72 and 112 s' of 144 rising Hanning window decline Hanning window, and the junction is at the 72nd sampling point; See 32 sampling points before this window, after see 16 sampling points, 80 sampling points that add current subframe are 128 points altogether.Signal after the windowing is:

S ^w(n)＝S’(n)·w _F(n+31)，n＝-31，…，96。

To S ^w(n) to frequency domain, the length of FFT conversion is 64 to employing fast fourier transform (FFT:Fast Fourier Transform), obtains S by spatial transform ^Fft(n), n=0 ..., 64.Owing to carried out 3/4 low-pass filtering in preprocessing process, therefore after being converted into frequency domain, the frequency spectrum data that has only front 3/4 is effective; And because the FFT conversion has symmetry, so only need choose preceding 24 data just is enough to express the frequency range of 0～3kHz in 32 frequency domain datas in front, calculates the frequency domain spectra envelope parameters according to preceding 20 frequency domain datas and is:

F_{env} (j) = \frac{1}{2} \log_{2} {{Σ_{n = 2 j}^{2 (j + 1)} W_{F} (n - 2 j) {[S^{fft} (n)]}^{2}}}, j = 0, . . ., 11,

W wherein _F(n) be weighting function, W _F(0)=W _F(2)=0.5, W _F(1)=1.

4. the quantification of parameter

To 16 T _Env(i) and 12 F _Env(j) remove average division vector quantization.At first calculate T _Env(i) mean value M _T, at log-domain 5bit scalar quantization M _TCalculate T respectively _Env(i) and F _Env(j) with the residual error of quantization scalar; Then 16 time domain residual errors are split into 28 n dimensional vector ns, use same code book to quantize with 7bit respectively, 12 frequency-domain residual are split into 34 n dimensional vector ns, use different code books, preceding two 4 n dimensional vector ns quantize with 5bit respectively, and last 4 n dimensional vector n quantizes with 4bit.

Two, decoding end

1. excitation generates

The pumping signal (Excitation Signal) of bandwidth expansion is rebuild by the core layer decoding parametric and is obtained.Following core layer decoding parametric is used to generate the pumping signal of bandwidth expansion: integer pitch delay T0, mark fundamental tone time-delay frac; The ENERGY E p of the ENERGY E c of fixed codebook contribution, adaptive codebook contribution; The gain g of basic layer constant codebook excitations c (n), c (n) in the core layer _c, the gain g of adaptive codebook excitation v (n), v (n) _pEnhancement layer in the core layer strengthen excitation c ' (n), c ' gain g (n) _Enh

By the ratio of estimating clearly, the voiced sound gain contribution is calculated each frame adaptive code book and fixed codebook (comprising the enhancement layer code book) encourages, multiply by gain by the excitation separately of pure and impure sound then and form preliminary pumping signal, again preliminary pumping signal is carried out the aftertreatment that fundamental tone is delayed time according to parameters such as pitch delays, obtain final pumping signal exc (n).Exc (n) also needs by 3/4 low-pass filter frequency range to be restricted to 0～3kHz.

2. the decoding of parameter

From code stream, decode 16 time domain spectrum envelope parameter T _Env(i) and 12 frequency domain spectra envelope parameters F _Env(j), decode procedure is the inverse process of the quantization encoding process of coding side.

3. time domain spectrum envelope shaping

The time domain shaping mainly is that the energy of pumping signal is adjusted.According to coding side T _Env(i) computing method are calculated the time domain spectrum envelope parameter of pumping signal exc (n), obtain 16 T ' _Env(i), again by T _Env(i) deduct T ' respectively _Env(i) draw both energy differences, thus the energy discharge amplitude gain that obtains to adjust:

gain＝2^[T _env(i)-T’ _env(i)]；

Multiply by corresponding gain respectively by the pumping signal exc (n) of 160 sampling points then and recover the adjusted signal S of time domain ^T(n).

4. frequency domain spectra envelope shaping

The frequency domain parameter F that decodes _Env(j) characterized the back 10ms of 20ms frame, the frequency domain parameter of its preceding 10ms frame can obtain by the frequency domain parameter interpolation of present frame and previous 20ms frame, and the frequency domain parameter of 10ms before and after the present frame is designated as F respectively _{Env, 1}(j), F _{Env, 2}(j).

Disposal route with time domain is similar then, with S ^T(n) carry out frequency domain parameter according to the computing method of coding side and extract, every 10ms extracts once, calculates two groups of frequency domain parameters, is designated as F ' _{Env, 1}(j), F ' _{Env, 2}(j).By F _{Env, 1}(j), F _{Env, 2}(j) respectively with F ' _{Env, 1}(j), F ' _{Env, 2}(j) difference obtains the adjusting range G of two subframes _{F, 1}(j), G _{F, 2}(j).Because frequency-domain calculations is that frequency-division section carries out, therefore adopt a bank of filters that the spectrum envelope of corresponding with each frequency domain parameter respectively signal frequency range is adjusted respectively, obviously have 12 wave filters, adopt G _{F, 1}(j), G _{F, 2}(j) respectively the coefficient of bank of filters is weighted, respectively front and back 10ms subframe is carried out filtering then, obtain the signal output S after the frequency-domain shaping _HB(n).

5. the aftertreatment of BWE

Owing to after time domain and frequency domain two readjust, may produce the part burr, therefore adopt self-adaptation amplitude compression function to carry out aftertreatment to reduce departing from of envelope.Post-processing approach is that per 80 sampling points are handled once, and it is divided into three sections, 6 sampling points of leading portion, and 70 sampling points in stage casing, last 4 sampling points, adjusted being output as of envelope (every row is followed successively by leading portion, interruption and back segment) of process aftertreatment:

Wherein, T _Env(i) be the time domain spectrum envelope parameter corresponding with the sampling point of current adjustment.

Owing at coding side the high-frequency signal of 4～8kHz is folded to 0～4kHz, therefore when decoding end is reduced, should carries out spectrum folding once more.The spectrum folding mode of method for folding and coding side is similar, because the output signal of rebuilding is 0～3kHz, therefore the frequency coefficient of 3～4kHz can be mended and fold the high-frequency reconstruction signal that obtains 4～7kHz after 0.

In proposing process of the present invention, the inventor finds, the decoding end excitation of above-mentioned bandwidth expansion technique generates the binary excitation production method that adopts in the similar speech production model and produces, and is fit to speech signal coding, and is then relatively poor to the coding effect of class music signal; And experimental results show that under above-mentioned excitation generating mode as if the ultra broadband expansion that this bandwidth expansion technique is used for 7～14kHz, noise is big, the coding weak effect illustrates that this technology is not suitable for being applied in the ultra broadband expansion.

The invention provides the generation method of pumping signal in a kind of bandwidth expansion and the method for reconstructing and the device of corresponding high-frequency signal, be applicable in broadband and ultra broadband expansion sound signals such as voice and music are carried out high-frequency reconstruction.

The generation method of pumping signal in a kind of bandwidth expansion, comprising: the generated frequency scope is 0～B ₀The first pumping signal exc (n), n=0 ..., N-1; Exc (n) is carried out spectrum folding, and the generated frequency scope is B ₀～2B ₀The second pumping signal exc ^Fold(n); To exc (n) and exc ^Fold(n) carry out synthetic filtering, reference frequency output is 0～2B ₀The 3rd pumping signal exc _HB(m), m=0 ..., 2N-1, described the 3rd pumping signal exc _HB(m) be used for carrying out the reconstruction of high-frequency signal as high-frequency excitation signal.

A kind of method for reconstructing of bandwidth expansion medium-high frequency signal, comprising: the generation method according to aforementioned pumping signal generates pumping signal exc _HB(m), m=0 ..., 2N-1; Decoding obtains time domain spectrum envelope parameter T _Env(i) and frequency domain spectra envelope parameters F _Env(j), i=0 wherein ..., I-1, j=0 ..., J-1; According to T _Env(i) to exc _HB(m) time domain spectrum envelope is adjusted, each T _Env(i) the corresponding exc that adjusts _HB(m) comprise a section of A time domain sampling point in, A≤2N/I generates the adjusted signal S of time domain ^T(m); According to F _Env(j) to S ^T(m) frequency domain spectra envelope is adjusted, each F _Env(j) the corresponding S that adjusts ^T(m) bandwidth is B in the frequency domain ₁A subband, B ₁≤ B ₂/ J, B ₂Be S ^T(m) frequency span generates the adjusted reconstruction signal S of frequency domain ^F(m); To S ^F(m) carry out spectrum folding, the generated frequency scope is 2B ₀～2B ₀+ B ₂High-frequency reconstruction signal S _HB(m).

The generating apparatus of pumping signal in a kind of bandwidth expansion, comprising: the core codec module, being used for reference frequency output is 0～B ₀The first pumping signal exc (n), n=0 ..., N-1; The spectrum folding module is used for exc (n) is carried out spectrum folding, and reference frequency output is B ₀～2B ₀The second pumping signal exc ^Fold(n); The synthetic filtering module is used for exc (n) and exc ^Fold(n) carry out synthetic filtering, reference frequency output is 0～2B ₀The 3rd pumping signal exc _HB(m), m=0 ..., 2N-1, described the 3rd pumping signal exc _HB(m) be used for carrying out the reconstruction of high-frequency signal as high-frequency excitation signal.

A kind of reconstructing device of bandwidth expansion medium-high frequency signal comprises: the pumping signal generation unit, the logical organization of the generating apparatus of any described pumping signal of employing claim 15～17 is used to generate pumping signal exc _HB(m), m=0 ..., 2N-1; Decoding unit is used for decoding output time domain spectrum envelope parameter T _Env(i) and frequency domain spectra envelope parameters F _Env(j), i=0 wherein ..., I-1, j=0 ..., J-1; The time domain shaping unit is used for according to T _Env(i) to exc _HB(m) time domain spectrum envelope is adjusted, each T _Env(i) the corresponding exc that adjusts _HB(m) comprise a section of A time domain sampling point in, A≤2N/I, the adjusted signal S of output time domain ^T(m); The frequency-domain shaping unit is used for according to F _Env(j) to S ^T(m) frequency domain spectra envelope is adjusted, each F _Env(j) the corresponding S that adjusts ^T(m) bandwidth is B in the frequency domain ₁A subband, B ₁≤ B ₂/ J, B ₂Be S ^T(m) frequency span, the adjusted reconstruction signal S of output frequency domain ^F(m); The spectrum folding unit is used for the S to input ^F(m) carry out spectrum folding, the generated frequency scope is 2B ₀～2B ₀+ B ₂High-frequency reconstruction signal S _HB(m).

Technique scheme adopts the arrowband low frequency signal is generated needed high-frequency excitation signal by the synthetic again mode of spectrum folding; Owing to utilize low frequency signal to produce high-frequency signal, the mediation characteristic that has based on signal low frequency and high frequency spectrum, can all expand preferably voice and music signal, the spectrum folding mode that is adopted also guaranteed low-and high-frequency the joining place signal spectrum continuously; Experiment showed, not only to be fit to 4～7kHz band signal is carried out the bandwidth expansion, and be fit to 7～14kHz ultra-broadband signal is expanded.

Description of drawings

Fig. 1 is the step synoptic diagram of generation method of the pumping signal of the embodiment of the invention;

Fig. 2 is the logical organization synoptic diagram of generating apparatus of the pumping signal of the embodiment of the invention;

Fig. 3 is the step synoptic diagram of method for reconstructing of the high-frequency signal of the embodiment of the invention;

Fig. 4 is the logical organization synoptic diagram of reconstructing device of the high-frequency signal of the embodiment of the invention.

Embodiment

The embodiment of the invention provides the generation method of pumping signal in a kind of bandwidth expansion, and the arrowband low frequency signal is synthetic again by spectrum folding, generates needed high-frequency excitation signal.The embodiment of the invention also provides the method for reconstructing of corresponding bandwidth expansion medium-high frequency signal, and the generating apparatus of pumping signal and the reconstructing device of high-frequency signal in the bandwidth expansion.Below be elaborated respectively.

With reference to figure 1, the generation method of pumping signal mainly comprises step in the expansion of the bandwidth of the embodiment of the invention:

A1, generated frequency scope are 0～B ₀First pumping signal, this first pumping signal is generally a kind of arrowband pumping signal.

In the present embodiment, as the arrowband pumping signal exc (n) of first pumping signal, n=0 ..., N-1, the parameter reconstruction that is obtained by decoding core layer code stream obtains.Exc (n) can adopt code book Excited Linear Prediction (CELP:Code Excited Linear Prediction) to rebuild and obtain based on the core layer coded system of coding side, for example the pumping signal reconstruction mode in the aforementioned background art.

Process reduces computational complexity to simplify the process, and a kind of simple and effective exc (n) generating mode based on CELP is provided in the present embodiment, comprising:

1. the core code stream of decoding obtains constant codebook excitations and adaptive codebook excitation and gain separately.

According to the coded system of coding side core layer, constant codebook excitations can by basic layer constant codebook excitations c (n) and enhancement layer strengthen excitation c ' (n) two parts forms, gaining accordingly is respectively g _cAnd g _Enh

2. obtain exc (n) according to separately gain weighting superposition constant codebook excitations and adaptive codebook excitation.

Comprise under the two-part situation that at constant codebook excitations the computing formula of exc (n) is:

exc(n)＝g _p·v(n)+g _c·c(n)+g _enh·c’(n)

Wherein, v (n) is adaptive codebook excitation, g _pGain for v (n).

Usually the frequency range of exc (n) is 0～4kHz, and a frame is that 160 time domain sampling points of 20ms are formed by duration, i.e. B ₀=4kHz, N=160.

A2, exc (n) is carried out spectrum folding, the generated frequency scope is B ₀～2B ₀Second pumping signal; Corresponding to the character of exc (n) arrowband, low frequency, this second pumping signal can be considered arrowband high-frequency signal exc ^Fold(n).

N the time domain sampling point that this process is equivalent to exc (n) all multiply by (1) ⁿ

A3, to exc (n) and exc ^Fold(n) carry out synthetic filtering, reference frequency output is 0～2B ₀The 3rd pumping signal, the 3rd pumping signal is the high-frequency excitation signal of bandwidth expansion.

Alleged synthetic filtering is with exc (n) and exc ^Fold(n) frequency spectrum merges, and obtains bandwidth and expands to 0～2B ₀High-frequency excitation signal exc _HB(m), m=0 ..., 2N-1.A kind of optional synthesis mode is:

Adopt Quadrature Mirror Filter QMF (QMF:Quandrature Mirror Filter) to exc (n) and exc ^Fold(n) carry out the orthogonal mirror image synthetic filtering.

In addition, can also be 0～2B further according to the needs of practical application to frequency range ₀Exc _HB(m) carry out low pass, high pass or bandpass filtering, the exc of output frequency range _HB(m).Based on the requirement of present audio-frequency signal coding to frequency range, generally the frequency range that requires for broadband signal is 0～7kHz, comprises the low frequency part of 0～4kHz and the HFS of 4～7kHz; The frequency range that requires for ultra-broadband signal is 0～14kHz, comprise the low frequency part of 0～8kHz and the HFS of 8～14kHz, as seen the bandwidth of HFS coding is generally 3/4 of low frequency part, therefore in this case, also need the high frequency pumping that generates based on low-frequency excitation is further processed, that is:

A4, be 0～2B to frequency range ₀Exc _HB(m) carry out 3/4 low-pass filtering, reference frequency output is 0～3B ₀/ 2 exc _HB(m).

This frequency range is 0～3B ₀It is 2B that/2 high-frequency excitation signal promptly can be used for rebuilding frequency range ₀～3.5B ₀Broadband or ultra broadband high-frequency signal.

Generating apparatus to the bandwidth expansion pumping signal of the embodiment of the invention that is used for carrying out above-mentioned pumping signal generation method describes below, and with reference to figure 2, its basic logical structure comprises:

Core codec module 101, being used for reference frequency output is 0～B ₀The first pumping signal exc (n), n=0 ..., N-1; This core codec module 101 can adopt the processing module based on CELP, and the exc of output (n) can be divided into two-way, offers spectrum folding module 102 and synthetic filtering module 103 respectively;

Spectrum folding module 102 is used for exc (n) is carried out spectrum folding, and reference frequency output is B ₀～2B ₀The second pumping signal exc ^Fold(n);

Synthetic filtering module 103 is used for exc (n) and exc ^Fold(n) carry out synthetic filtering, reference frequency output is 0～2B ₀The 3rd pumping signal exc _HB(m), m=0 ..., 2N-1; This synthetic filtering module 103 can adopt the orthogonal mirror image composite filter.

In addition, based in the aforementioned generation method to the description of exciting signal frequency area requirement, the pumping signal generating apparatus of present embodiment also can comprise:

3/4 low-pass filter 104, being used for the incoming frequency scope is 0～2B ₀Exc _HB(m), it is carried out 3/4 low-pass filtering, reference frequency output is 0～3B ₀/ 2 exc _HB(m).

For better understanding the foregoing description, with a kind of example that is applied as in the expansion of ultra broadband bandwidth, above-mentioned pumping signal generative process is described below: a frame pumping signal exc (n) (160 sampling points) who at first extracts 0～4kHz by core layer based on the CELP coding; Mode by spectrum folding folds into 4～8kHz frequency range then, generates the pumping signal exc of 4～8kHz frequency range ^Fold(n) (160 sampling points); Pass through the QMF composite filter then, with exc (n) and exc ^Fold(n) synthetic required full frequency band encourages exc ^Qmf(m) (320 sampling points), this moment, the bandwidth of signal was 0～8kHz; Again with full frequency band pumping signal exc ^Qmf(m), obtain the pumping signal exc of 0～6kHz by 3/4 low-pass filter filtering _HB(m) (320 sampling points).

Above-mentioned pumping signal generates among the method and apparatus embodiment, adopts the arrowband low frequency signal is generated needed high-frequency excitation signal by the synthetic again mode of spectrum folding; Owing to utilize low frequency signal to produce high-frequency signal, the mediation characteristic that has based on signal low frequency and high frequency spectrum, can all expand preferably voice and music signal, the binary that has solved in the similar speech production model that adopts in the existing time domain bandwidth expansion encourages production method for the poor problem of the coding effect of class music signal.In addition, the spectrum folding mode that adopted also guaranteed low-and high-frequency the joining place signal spectrum continuously; Experiment showed, that above-mentioned pumping signal generates scheme and not only is fit to 4～7kHz band signal is carried out the bandwidth expansion, and be fit to 7～14kHz ultra-broadband signal is expanded.

Below the method for reconstructing based on the bandwidth expansion medium-high frequency signal of the embodiment of the invention of above-mentioned pumping signal generation method is described.With reference to figure 3, mainly comprise step:

B1, generation high-frequency excitation signal.

High-frequency excitation signal exc _HB(m), m=0 ..., 2N-1, the generation method with reference to previous embodiment, its bandwidth is B ₂, B ₂=2B ₀Or 3B ₀/ 2, use the latter usually.

B2, decoding obtain time domain spectrum envelope parameter and frequency domain spectra envelope parameters.

From code stream, decode time domain spectrum envelope parameter T according to the decoding process corresponding with the coded system of coding side _Env(i), i=0 ..., I-1 and frequency domain spectra envelope parameters F _Env(j), j=0 ..., J-1, concrete code encoding/decoding mode present embodiment does not limit.Need to prove that the step of this decoding there is no strict logical order requirement in whole process of reconstruction, can carry out synchronously or in proper order with other steps, and not necessarily require to decode simultaneously T _Env(i) and F _Env(j), as long as the decoding of executed relevant parameter before certain parameter of use in process of reconstruction.

B3, according to T _Env(i) to exc _HB(m) time domain spectrum envelope is adjusted, and generates the adjusted signal S of time domain ^T(m).

Time domain spectrum envelope adjustment process is carried out corresponding to coding side time domain spectrum envelope Parameter Extraction process, each T _Env(i) the corresponding exc that adjusts _HB(m) comprise a section of A time domain sampling point in, A≤2N/I, that is, the sampling point number of being adjusted can be all or part of of 2N sampling point.Each T _Env(i) corresponding relation with the adjustment sampling point is identical with the corresponding relation in the coding side leaching process.Concrete adjustment mode can adopt the time domain spectrum envelope adjustment mode in the aforementioned background art for example etc.

For better adjustment effect is provided, provide a kind of time domain spectrum envelope to adjust mode in the present embodiment, comprising:

1. calculate T according to coding side _Env(i) mode is calculated exc _HB(m) time domain spectrum envelope parameter T ' _Env(i).

Alleged coding side calculates T _Env(i) mode is the high-frequency signal S that coding side extracts needs coding _Hb(m) T _Env(i) process.S _Hb(m) by coding side the HFS that needs encoded signals being carried out pre-service usually obtains: the high-frequency signal that the back frequency division of at first will sampling obtains folds into low-frequency range, carries out low-pass filtering by the frequency range requirement of coding then.A kind of T ' _Env(i) account form example is as follows:

With exc _HB(m) a 2N sampling point is divided into the I section, and every section A sampling point calculates every section log-domain energy

{T^{'}}_{env} (i) = \frac{1}{2} \log_{2} {Σ_{a = 0}^{A - 1} {[{exc}_{HB} (a + i \times A)]}^{2}}, i = 0, . . ., I - 1 .

Usually desirable 10 sampling points are one section, i.e. A=10, T ' at this moment _Env(i) number is I=N/5.

2. according to T _Env(i) and T ' _Env(i) the energy difference between is calculated the preliminary gain factor g of time domain _T(i).

A kind of g _T(i) account form example is as follows:

g _T(i)＝2^[T _env(i)-T’ _env(i)]，

Obviously, each g _T(i) corresponding to exc _HB(m) comprise a section of A time domain sampling point, corresponding relation and T ' in _Env(i) and exc _HB(m) corresponding relation of sampling point is identical in.

3. each g of interpolation _T(i) obtain A gain factor.

Can adopt various interpolation methods with each g as required _T(i) expand to A gain factor g _{T, i}(a), a=0 ..., A-1 for example can simply make each g _{T, i}(a) be equal to g _T(i).For obtaining the effect of time domain adjustment preferably, under the situation of A=10, provide a kind of level and smooth interpolation algorithm to calculate g in the present embodiment _{T, i}(a):

g _{T, i}(a)=w _T(a) g _T(i)+[1-w _T(a)] g ^Last _{T, i}(a); Wherein, w _T(a) be window function, g ^Last _{T, i}(a) be previous frame exc _HB(m) gain factor of corresponding sampling point.w _T(a) be specially:

w_{T} (a) = \{\begin{matrix} \frac{1}{2} {1 - \cos [(a + 1) \frac{π}{6}]} & , a = 0, . . ., 4 \\ 1 & , a = 5, . . ., 9 \end{matrix}

Above-mentioned interpolation algorithm can be understood as, to preceding 5 g _{T, i}(a) the corresponding g that adopts the level and smooth interpolation of previous frame to obtain ^Last _{T, i}(a) carry out smoothing processing, to back 5 g _{T, i}(a) then adopt g _T(i) value.

4. according to g _{T, i}(a) adjust exc _HBThe gain of the sampling point of A * I (m) obtains S ^T(m).

Exc _HB(m) sample value and corresponding gain factor g of time domain spectrum envelope shaping by accepting to adjust _{T, i}(a) obtain by simply multiplying each other:

S ^T(m)＝g _T，i(a)·exc _HB(m)。

B4, according to F _Env(j) to S ^T(m) frequency domain spectra envelope is adjusted, and generates the adjusted reconstruction signal S of frequency domain ^F(m).

Similar with time domain spectrum envelope adjustment process, frequency domain spectra envelope adjustment process is carried out each F corresponding to the leaching process of coding side frequency domain spectra envelope parameters equally _Env(i) the corresponding S that adjusts ^T(m) bandwidth is B in the frequency domain ₁A subband, B ₁≤ B ₂/ J, B ₂Be S ^T(m) also be exc _HB(m) frequency span.Each F _Env(j) corresponding relation with the adjustment frequency band is identical with the corresponding relation in the coding side leaching process.Concrete adjustment mode can adopt the frequency domain spectra envelope adjustment mode in the aforementioned background art for example etc.

For reducing computational complexity, improve and adjust effect, provide a kind of frequency domain spectra envelope to adjust mode in the present embodiment, comprising:

1. calculate F according to coding side _Env(j) mode is to S ^T(m) carry out time-frequency conversion and generate frequency-region signal S ^F1(m) and calculate S ^F1(m) frequency domain spectra envelope parameters F ' _Env(j).

Alleged coding side calculates F _Env(j) mode is the high-frequency signal S that coding side extracts needs coding _Hb(m) F _Env(j) process.A kind of F ' _Env(i) account form example is as follows:

Be S ^T(m) and previous frame S ^{T, last}(m) windowing w _TDAC(k) the signal S after the acquisition windowing ^w(k), k=0 ..., 4N-1, wherein,

S ^w(k)＝w _TDAC(k)·S ^T，last(k)，k＝0，…，2N-1，

S ^w(k)＝w _TDAC(k)·S ^T(k-2N)，k＝2N，…，4N-1；

To S ^w(k) carry out discrete cosine transform (DCT:Diserete Cosine Transform) and generate S ^F1(m), concrete mapping mode can adopt modified discrete cosine transform (MDCT:Modified DCT),

S^{F 1} (m) = Σ_{k = 0}^{4 N - 1} S^{w} (k) \cos [\frac{π}{8 N} (2 k + 1 + 2 N) (2 m + 1)];

Extract S ^F1(m) preceding D * J sampling point calculates F ' _Env(j),

{F^{'}}_{env} (j) = \frac{1}{2} \log_{2} {Σ_{d = 0}^{D - 1} {[S^{F 1} (d + j \times D)]}^{2}} .

Because exc _HB(m) may carry out 3/4 low-pass filtering treatment of restricted band scope in the generative process, 0～3B has only been arranged in this case ₀The data of/2 frequency ranges are effectively, and therefore, after carrying out time-frequency conversion, preceding 3/2N the point that only needs to extract 2N frequency domain sampling point is used to calculate F ' _Env(j) get final product, at this moment D * J=3/2N.

Usually desirable 16 sampling points are as a sub-frequency bands, i.e. D=16, this moment F ' _Env(j) number is J=3N/32.In addition, employed window function w _TDAC(k) can select following sinusoidal windows:

w _TDAC(k)＝sin[(k+0.5)π/4N]。

2. according to F _Env(j) and F ' _Env(j) the energy difference between is calculated the preliminary gain factor g of frequency domain _F(j), each g _F(j) corresponding to S ^F1(m) comprise a section of D frequency domain sampling point, D * J≤2N in.

A kind of g _F(j) account form example is as follows:

g _F(i)＝2^[F _env(j)-F’ _env(j)]，

Each g _F(i) and S ^F1(m) corresponding relation of sub-band and F ' _Env(i) and S ^F1(m) corresponding relation of sub-band is identical.

3. each g of interpolation _F(j) obtain D gain factor g _{F, j}(d), d=0 ..., D-1.

Concrete interpolation method can certainly adopt other interpolation methods with reference to the interpolation method of aforementioned time domain gain factor, repeats no more.

4. according to g _{F, j}(d) adjust S ^F1The gain of the sampling point of D * J (m) generates adjusted frequency-region signal S ^F2(m).Similar with the adjustment of time domain spectrum envelope, with frequency domain sample value and corresponding gain factor g _{F, j}(d) simply multiply each other and get final product:

S ^F2(m)＝g _F，j(d)·S ^F1(m)。

5. to S ^F2(m) carry out the inverse transformation of described time-frequency conversion, obtain S ^F(m).

For example, if before the frequency domain adjustment, adopt MDCT to transform to frequency domain, then adopt this moment contrary MDCT (IMDCT) to transform to time domain.

B5, to S ^F(m) carry out spectrum folding, the generated frequency scope is 2B ₀～2B ₀+ B ₂High-frequency reconstruction signal S _HB(m).

Because at coding side is that high-frequency signal is folded to low-frequency range, therefore when decoding end is reduced, should carry out spectrum folding once more.Spectrum folding mode when method for folding and coding side carry out the high-frequency signal pre-service is similar.If in process of reconstruction, based on coding the requirement of frequency range has been carried out low-pass filtering to pumping signal, the frequency coefficient of the HFS that can remove filtering this moment is mended to fold after 0 and is obtained final high-frequency reconstruction signal.

Further,, make reconstruction signal burr occur probably owing in above-mentioned signal reconstruction process, crossed time domain and frequency domain two readjust, in order to eliminate these burrs, can be earlier to the adjusted signal S of time-frequency before carrying out spectrum folding ^F(m) carry out aftertreatment, that is, before step B5, increase following steps:

B51, use envelope are adjusted threshold value limit ₁(i), limit ₂(i) to S ^F(m) carry out the envelope adjustment.Adjusted S ^F(m) be:

At m=m ₁～m ₂Part in, if | S ^{F, old}(m) |＜limit ₁(i), S then ^F(m)=S ^{F, old}(m),

At m=m ₂+ 1～m ₃Part in, if limit ₁(i)≤| S ^{F, old}(m) |≤limit ₂(i), S then ^F(m)=[S ^{F, old}(m)-limit ₁(i)]/2+limit ₁(i),

At m=m ₃+ 1～m ₄Part in, if | S ^{F, old}(m) |＞limit ₂(i), S then ^F(m)=[S ^{F, old}(m)-limit ₂(i)]/16+limit ₂(i), wherein, S ^{F, old}(m) adjust preceding S for envelope ^F(m); Limit ₁(i), limit ₂(i) and S ^F(m) corresponding relation of time domain sampling point in, and T _Env(i) and S ^F(m) corresponding relation of time domain sampling point is identical in.

In above-mentioned last handling process, a kind of limit of threshold value preferably ₁(i), limit ₂(i) set-up mode is:

limit ₁(i)＝2^T _env(i)，

limit ₂(i)＝[2^T _env(i)]×2.5。

In addition, above-mentioned last handling process can be handled once per 80 sampling points, per 80 sampling points is divided into three sections, preceding 6 sampling point (m ₁～m ₂Part), middle 70 sampling point (m ₂+ 1～m ₃Part), last 4 sampling point (m ₃+ 1～m ₄Part).Illustrate as follows: if N=160, then the adjusted signal of time-frequency is 320 sampling points, can divide and carry out aftertreatment 4 times; M wherein ₁～m ₂Part be 0～5,80～85,160～165,240～245 part; m ₂+ 1～m ₃Part be 6～75,86～155,166～235,246～315 part; m ₃+ 1～m ₄Part be 76～79,156～159,236～239,316～319 part.

Reconstructing device to the bandwidth expansion medium-high frequency signal of the embodiment of the invention that is used to carry out above-mentioned high-frequency signal method for reconstructing describes below, and with reference to figure 4, its basic logical structure comprises:

Pumping signal generation unit 201, the logical organization of the generating apparatus of the pumping signal of employing previous embodiment is used to generate pumping signal exc _HB(m), m=0 ..., 2N-1;

Decoding unit 202 is used for decoding output time domain spectrum envelope parameter T _Env(i) and frequency domain spectra envelope parameters F _Env(j), i=0 wherein ..., I-1, j=0 ..., J-1;

Time domain shaping unit 203 is used for the T according to decoding unit 202 outputs _Env(i) exc that pumping signal generation unit 201 is exported _HB(m) time domain spectrum envelope is adjusted, each T _Env(i) the corresponding exc that adjusts _HB(m) comprise a section of A time domain sampling point in, A≤2N/I, the adjusted signal S of output time domain ^T(m);

Frequency-domain shaping unit 204 is used for the F according to decoding unit 202 outputs _Env(j) S that time domain shaping unit 203 is exported ^T(m) frequency domain spectra envelope is adjusted, each F _Env(j) the corresponding S that adjusts ^T(m) bandwidth is B in the frequency domain ₁A subband, B ₁≤ B ₂/ J, B ₂Be S ^T(m) frequency span, the adjusted reconstruction signal S of output frequency domain ^F(m);

Spectrum folding unit 205 is used for the S to input ^F(m) carry out spectrum folding, the generated frequency scope is 2B ₀～2B ₀+ B ₂High-frequency reconstruction signal S _HB(m).

In addition, based on the last handling process that uses for eliminate signal burr in the aforementioned method for reconstructing, the high-frequency signal reconstructing device of present embodiment also can comprise:

Post-processing unit 206 is used to use envelope to adjust threshold value limit ₁(i), limit ₂(i) S that frequency-domain shaping unit 204 is exported ^F(m) carry out the envelope adjustment, adjusted S ^F(m) be: at m=m ₁～m ₂Part in, if | S ^{F, old}(m) |＜limit ₁(i), S then ^F(m)=S ^{F, old}(m); At m=m ₂+ 1～m ₃Part in, if limit ₁(i)≤| S ^{F, old}(m) |≤limit ₂(i), S then ^F(m)=[S ^{F, old}(m)-limit ₁(i)]/2+limit ₁(i); At m=m ₃+ 1～m ₄Part in, if | S ^{F, old}(m) |＞limit ₂(i), S then ^F(m)=[S ^{F, old}(m)-limit ₂(i)]/16+limit ₂(i); Wherein, S ^{F, old}(m) adjust preceding S for envelope ^F(m); Limit ₁(i), limit ₂(i) and S ^F(m) corresponding relation of time domain sampling point in, and T _Env(i) and S ^F(m) corresponding relation of time domain sampling point is identical in; With adjusted S ^F(m) export to spectrum folding unit 205.

The level and smooth interpolation method of time domain gain factor that further provides among above-mentioned high-frequency signal method for reconstructing and the device embodiment can obtain better time domain and adjust effect; The concrete frequency domain spectra envelope adjustment mode that further provides has been avoided using multinomial bank of filters frequency-division section to signal filtering in decoding end, has simplified processing procedure, has reduced computational complexity; The shaping post processing mode that further provides can better be eliminated the burr that the shaping process occurs.

For better understanding the foregoing description, with a kind of example that is applied as in the expansion of ultra broadband bandwidth, above-mentioned high-frequency signal process of reconstruction is described below:

1. generate the high-frequency excitation signal exc of 0～6kHz _HB(m), 320 sampling points of the every frame of time domain.That is, 2N=320, B ₀=4kHz, B ₂=3B ₀/ 2=6kHz.

2. decoding obtains 32 time domain spectrum envelope parameter T from code stream _Env(i), i=0 ..., 31, each corresponding 10 time domain sampling point, i.e. I=32, A=10.

3. with exc _HB(m) be divided into 32 segments equally, every section 10 sampling points calculate corresponding T ' _Env(i):

{T^{'}}_{env} (i) = \frac{1}{2} \log_{2} {Σ_{a = 0}^{9} {[{exc}_{HB} (a + i \times 10)]}^{2}} .

Calculate time domain gain g then _T(i)=2^[T _Env(i)-T ' _Env(i)], and with level and smooth each g of interpolation algorithm interpolation _T(i):

g _T，i(a)＝w _T(a)·g _T(i)+[1-w _T(a)]·g ^last _T，i(a)，a＝0，…，4。

g _T，i(a)＝gT(i)，a＝5，…，9。

Wherein, w _T(a)=0.0669872981f, 0.2500000000f, 0.5000000000f, 0.7500000000f, 0.9330127019f}, a is followed successively by 0～4, and f represents floating number.Calculate the signal after the time domain shaping then:

S ^T(m)＝g _T，i(a)·exc _HB(m)。

4. decoding obtains 15 frequency domain spectra envelope parameters F from code stream _Env(j), j=0 ..., 14, the sub-band of each corresponding 0.4kHz bandwidth, i.e. J=15.

5. to S ^T(m) and previous frame S ^{T, last}(m) add sinusoidal windows w _TDAC(k),

w _TDAC(k)=sin[(k+0.5) π/640], k=0 ..., 639; Signal S after the acquisition windowing ^w(k),

S ^w(k)＝w _TDAC(k)·S ^T，last(k)，k＝0，…，319，

S ^w(k)＝w _TDAC(k)·S ^T(k-2N)，k＝320，…，639；

Then to the S after the windowing ^w(k) sequence is carried out 640 MDCT, generates frequency-region signal S ^F1(m),

S^{F 1} (m) = Σ_{k = 0}^{639} S^{w} (k) \cos [\frac{π}{1280} (2 k + 1 + 320) (2 m + 1)];

Owing to generate exc _HB(m) carried out 3/4 low-pass filtering in the process, filtering the frequency range data of 6～8kHz, the data of therefore having only 0～6kHz frequency range are effectively, therefore extract S ^F1(m) preceding 240 points are used to calculate 15 F ' _Env(j), one group of per 16 point, i.e. D=16,

{F^{'}}_{env} (j) = \frac{1}{2} \log_{2} {Σ_{d = 0}^{15} {[S^{F 1} (d + j \times 16)]}^{2}} .

Calculate frequency domain gain g then _F(i)=2^[F _Env(j)-F ' _Env(j)], the signal S after the acquisition frequency-domain shaping ^F2(m)=g _F(i) S ^F1(m).Again to S ^F1(m) carry out IMDCT and obtain S ^F(m).

6. to S ^F(m) per 80 sampling points of 320 sampling points are handled once, are divided into three sections at every turn, preceding 6 sampling points, and middle 70 sampling points, last 4 sampling points are according to limit ₁(i)=2^T _Env(i), limit ₂(i)=[2^T _Env(i)] * 2.5 carry out the envelope adjustment.

7. then the signal of the adjusted 0～6kHz of envelope is carried out spectrum folding, obtain the high-frequency reconstruction signal S of 8～14kHz _HB(m).

With S _HB(m) (0～8kHz) merges (for example synthetic by QMF) can obtain complete ultra broadband reconstruction signal (0～14kHz) to the low frequency signal that obtains with core code stream decoding.

One of ordinary skill in the art will appreciate that all or part of step in the whole bag of tricks of the foregoing description is to instruct relevant hardware to finish by program, this program can be stored in the computer-readable recording medium, and storage medium can comprise: ROM, RAM, disk or CD etc.

More than the generation method of pumping signal in the bandwidth provided by the present invention expansion and the method for reconstructing and the device of corresponding high-frequency signal are described in detail, used specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, the part that all can change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims

1. the generation method of pumping signal is characterized in that during a bandwidth was expanded, and comprising:

The generated frequency scope is 0～B ₀The first pumping signal exc (n), n=0 ..., N-1;

Exc (n) is carried out spectrum folding, and the generated frequency scope is B ₀～2B ₀The second pumping signal exc ^Fold(n);

To exc (n) and exc ^Fold(n) carry out synthetic filtering, reference frequency output is 0～2B ₀The 3rd pumping signal exc _HB(m), m=0 ..., 2N-1, described the 3rd pumping signal exc _HB(m) be used for carrying out the reconstruction of high-frequency signal as high-frequency excitation signal.

2. the generation method of pumping signal according to claim 1 is characterized in that, and is described to exc (n) and exc ^Fold(n) step of carrying out synthetic filtering is specially: to exc (n) and exc ^Fold(n) carry out the orthogonal mirror image synthetic filtering.

3. the generation method of pumping signal according to claim 1 and 2 is characterized in that, also comprises:

To frequency range is 0～2B ₀Exc _HB(m) carry out 3/4 low-pass filtering, reference frequency output is 0～3B ₀/ 2 exc _HB(m).

4. the generation method of pumping signal according to claim 3 is characterized in that, the step of described generation exc (n) is specially:

The decoding core code stream obtains constant codebook excitations and adaptive codebook excitation and gain separately;

According to described constant codebook excitations of gain weighting superposition and adaptive codebook excitation acquisition exc (n) separately.

5. the generation method of pumping signal according to claim 4 is characterized in that:

Described constant codebook excitations comprises that basic layer constant codebook excitations c (n) and enhancement layer strengthen excitation c ' (n), and corresponding gain is respectively g _cAnd g _Enh

Calculate exc (n) according to following formula:

Exc (n)=g _pV (n)+g _cC (n)+g _EnhC ' (n) wherein, v (n) is adaptive codebook excitation, g _pBe the gain of v (n), N=160, B ₀=4kHz.

6. the method for reconstructing of a bandwidth expansion medium-high frequency signal is characterized in that, comprising:

Generate pumping signal exc according to any described method of claim 1～5 _HB(m), m=0 ..., 2N-1;

Decoding obtains time domain spectrum envelope parameter T _Env(i) and frequency domain spectra envelope parameters F _Env(j), i=0 wherein ..., I-1, j=0 ..., J-1;

According to T _Env(i) to exc _HB(m) time domain spectrum envelope is adjusted, each T _Env(i) the corresponding exc that adjusts _HB(m) comprise a section of A time domain sampling point in, A≤2N/I generates the adjusted signal S of time domain ^T(m);

According to F _Env(j) to S ^T(m) frequency domain spectra envelope is adjusted, each F _Env(j) the corresponding S that adjusts ^T(m) bandwidth is B in the frequency domain ₁A subband, B ₁≤ B ₂/ J, B ₂Be S ^T(m) frequency span generates the adjusted reconstruction signal S of frequency domain ^F(m);

To S ^F(m) carry out spectrum folding, the generated frequency scope is 2B ₀～2B ₀+ B ₂High-frequency reconstruction signal S _HB(m).

7. the method for reconstructing of high-frequency signal according to claim 6 is characterized in that, and is described according to T _Env(i) to exc _HB(m) step that time domain spectrum envelope is adjusted comprises:

Calculate T according to coding side _Env(i) mode is calculated exc _HB(m) time domain spectrum envelope parameter T ' _Env(i);

According to T _Env(i) and T ' _Env(i) the energy difference between is calculated the preliminary gain factor g of time domain _T(i), each g _T(i) corresponding to exc _HB(m) comprise a section of A time domain sampling point in;

Each g of interpolation _T(i) obtain A gain factor g _{T, i}(a), a=0 ..., A-1;

According to g _{T, i}(a) adjust exc _HBThe gain of the sampling point of A * I (m) obtains S ^T(m).

8. the method for reconstructing of high-frequency signal according to claim 7 is characterized in that, A=10, and I=N/5, described according to T _Env(i) and T ' _Env(i) the energy difference between is calculated g _T(i) step is specially:

g _T(i)＝2^[T _env(i)-T’ _env(i)]；

Each g of described interpolation _T(i) obtain A g _{T, i}(a) step is specially:

g _{T, i}(a)=w _T(a) g _T(i)+[1-w _T(a)] g ^Last _{T, i}(a); Wherein, w _T(a) be window function, work as a=0 ..., 4 o'clock, w _T(a)=1/2{1-cos[(a+1) π/6] }, work as a=5 ..., 9 o'clock, w _T(a)=1; g ^Last _{T, i}(a) be previous frame exc _HB(m) gain factor of corresponding sampling point.

9. according to the method for reconstructing of any described high-frequency signal of claim 6～8, it is characterized in that, described according to F _Env(j) to S ^T(m) step that frequency domain spectra envelope is adjusted comprises:

Calculate F according to coding side _Env(j) mode is to S ^T(m) carry out time-frequency conversion and generate frequency-region signal S ^F1(m) and calculate S ^F1(m) frequency domain spectra envelope parameters F ' _Env(j);

According to F _Env(j) and F ' _Env(j) the energy difference between is calculated the preliminary gain factor g of frequency domain _F(j), each g _F(j) corresponding to S ^F1(m) comprise a section of D frequency domain sampling point, D * J≤2N in;

Each g of interpolation _F(j) obtain D gain factor g _{F, j}(d), d=0 ..., D-1;

According to g _{F, j}(d) adjust S ^F1The gain of the sampling point of D * J (m) generates adjusted frequency-region signal S ^F2(m);

To S ^F2(m) carry out the inverse transformation of described time-frequency conversion, obtain S ^F(m).

10. the method for reconstructing of high-frequency signal according to claim 9 is characterized in that, and is described according to coding side calculating F _Env(j) mode generates S ^F1(m) and calculate F ' _Env(j) step comprises:

S ^w(k)＝w _TDAC(k)·S ^T，last(k)，k＝0，…，2N-1，

S ^w(k)＝w _TDAC(k)·S ^T(k-2N)，k＝2N，…，4N-1；

To S ^w(k) carry out discrete cosine transform and generate S ^F1(m),

Extract S ^F1(m) preceding D * J sampling point calculates F ' _Env(j),

11. the method for reconstructing of high-frequency signal according to claim 10 is characterized in that: D=16, J=3N/32, described window function w _TDAC(k) be:

w _TDAC(k)＝sin[(k+0.5)π/4N]。

12. the method for reconstructing according to any described high-frequency signal of claim 6～11 is characterized in that, to S ^F(m) carry out spectrum folding before, also comprise:

Use envelope to adjust threshold value limit ₁(i), limit ₂(i) to S ^F(m) carry out the envelope adjustment, adjusted S ^F(m) be:

13. the method for reconstructing of high-frequency signal according to claim 12 is characterized in that: described limit ₁(i), limit ₂(i) be,

limit ₁(i)＝2^T _env(i)，limit ₂(i)＝[2^T _env(i)]×2.5。

14. the method for reconstructing of high-frequency signal according to claim 13 is characterized in that: N=160; Described m ₁～m ₂Part be 0～5,80～85,160～165,240～245 part; Described m ₂+ 1～m ₃Part be 6～75,86～155,166～235,246～315 part; Described m ₃+ 1～m ₄Part be 76～79,156～159,236～239,316～319 part.

15. the generating apparatus of pumping signal is characterized in that during a bandwidth was expanded, and comprising:

The core codec module, being used for reference frequency output is 0～B ₀The first pumping signal exc (n), n=0 ..., N-1;

The spectrum folding module is used for exc (n) is carried out spectrum folding, and reference frequency output is B ₀～2B ₀The second pumping signal exc ^Fold(n);

The synthetic filtering module is used for exc (n) and exc ^Fold(n) carry out synthetic filtering, reference frequency output is 0～2B ₀The 3rd pumping signal exc _HB(m), m=0 ..., 2N-1, described the 3rd pumping signal exc _HB(m) be used for carrying out the reconstruction of high-frequency signal as high-frequency excitation signal.

16. the generating apparatus of pumping signal according to claim 15 is characterized in that: described synthetic filtering module is the orthogonal mirror image composite filter.

17. the generating apparatus according to claim 15 or 16 described pumping signals is characterized in that, also comprises:

3/4 low-pass filter, being used for the incoming frequency scope is 0～2B ₀Exc _HB(m), it is carried out 3/4 low-pass filtering, reference frequency output is 0～3B ₀/ 2 exc _HB(m).

18. the reconstructing device of a bandwidth expansion medium-high frequency signal is characterized in that, comprising:

The pumping signal generation unit, the logical organization of the generating apparatus of any described pumping signal of employing claim 15～17 is used to generate pumping signal exc _HB(m), m=0 ..., 2N-1;

Decoding unit is used for decoding output time domain spectrum envelope parameter T _Env(i) and frequency domain spectra envelope parameters F _Env(j), i=0 wherein ..., I-1, j=0 ..., J-1;

The time domain shaping unit is used for according to T _Env(i) to exc _HB(m) time domain spectrum envelope is adjusted, each T _Env(i) the corresponding exc that adjusts _HB(m) comprise a section of A time domain sampling point in, A≤2N/I, the adjusted signal S of output time domain ^T(m);

The frequency-domain shaping unit is used for according to F _Env(j) to S ^T(m) frequency domain spectra envelope is adjusted, each F _Env(j) the corresponding S that adjusts ^T(m) bandwidth is B in the frequency domain ₁A subband, B ₁≤ B ₂/ J, B ₂Be S ^T(m) frequency span, the adjusted reconstruction signal S of output frequency domain ^F(m);

The spectrum folding unit is used for the S to input ^F(m) carry out spectrum folding, the generated frequency scope is 2B ₀～2B ₀+ B ₂High-frequency reconstruction signal S _HB(m).

19. the reconstructing device of high-frequency signal according to claim 18 is characterized in that, also comprises:

Post-processing unit is used to use envelope to adjust threshold value limit ₁(i), limit ₂(i) S that described frequency-domain shaping unit is exported ^F(m) carry out the envelope adjustment, adjusted S ^F(m) be: at m=m ₁～m ₂Part in, if | S ^{F, old}(m) |＜limit ₁(i), S then ^F(m)=S ^{F, old}(m); At m=m ₂+ 1～m ₃Part in, if limit ₁(i)≤| S ^{F, old}(m) |≤limit ₂(i), S then ^F(m)=[S ^{F, old}(m)-limit ₁(i)]/2+limit ₁(i); At m=m ₃+ 1～m ₄Part in, if | S ^{F, old}(m) |＞limit ₂(i), S then ^F(m)=[S ^{F, old}(m)-limit ₂(i)]/16+limit ₂(i); Wherein, S ^{F, old}(m) adjust preceding S for envelope ^F(m); Limit ₁(i), limit ₂(i) and S ^F(m) corresponding relation of time domain sampling point in, and T _Env(i) and S ^F(m) corresponding relation of time domain sampling point is identical in; With adjusted S ^F(m) export to described spectrum folding unit.