CN105047201A - Broadband excitation signal synthesis method based on segmented expansion - Google Patents

Broadband excitation signal synthesis method based on segmented expansion Download PDF

Info

Publication number
CN105047201A
CN105047201A CN201510329767.3A CN201510329767A CN105047201A CN 105047201 A CN105047201 A CN 105047201A CN 201510329767 A CN201510329767 A CN 201510329767A CN 105047201 A CN105047201 A CN 105047201A
Authority
CN
China
Prior art keywords
excitation signal
band
signal
band excitation
frequency
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510329767.3A
Other languages
Chinese (zh)
Inventor
李宇
林胜义
谭洪舟
农革
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sun Yat Sen University
SYSU CMU Shunde International Joint Research Institute
National Sun Yat Sen University
Original Assignee
SYSU CMU Shunde International Joint Research Institute
National Sun Yat Sen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SYSU CMU Shunde International Joint Research Institute, National Sun Yat Sen University filed Critical SYSU CMU Shunde International Joint Research Institute
Priority to CN201510329767.3A priority Critical patent/CN105047201A/en
Publication of CN105047201A publication Critical patent/CN105047201A/en
Pending legal-status Critical Current

Links

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention provides a broadband excitation signal synthesis method based on segmented expansion. According to the method, two methods of narrow-band excitation spectrum translation and white noise excitation are combined, narrow-band excitation signals pass through a low-pass filter, first frequency-band excitation signals are outputted, after the translation of the first frequency-band excitation signal spectrum and high-pass filtering of the first frequency-band excitation signals are performed, second frequency-band excitation signals are obtained, white noise via energy adjustment is multiplexed by gain factors to obtain third frequency-band excitation signals, and finally, the first frequency-band excitation signals, the second frequency-band excitation signals, and the third frequency-band excitation signals are synthesized to form complete broadband excitation signals. According to the method, the harmonic structure of the excitation signals can be well reserved, too many artificial harmonic waves are not introduced, the operation complexity of the algorism is not high, the realization is easy, the performance of the conventional bandwidth expansion technology is improved, and the communication quality of narrow-band voice is improved.

Description

A kind of wideband excitation signal synthetic method based on segment extension
Technical field
The present invention relates to voice process technology field, more specifically, relate to a kind of wideband excitation signal synthetic method based on segment extension.
Background technology
Traditional telephone network, can only narrowband speech communication owing to being subject to the restriction of its transmission bandwidth, although narrowband speech can meet basic communication requirement, communication quality but reduces greatly.Due to economic cause, old telephone network can not realize real wideband transmit at short notice.The appearance of artificial speech bandwidth expansion technique, solve this difficult problem, this technology does some process at the receiving end of telephone network, some disappearance band component are added to the narrow band voice signal artificially received, receiving terminal is made to export the higher wideband speech signal of acoustical quality, after the linear prediction model proposition of voice, present most artificial speech bandwidth expansion technique is all realize based on this model (also referred to as source-filter model) design, as shown in Figure 1, the bandwidth expansion of voice is divided into independently two steps to carry out by this model: spectrum envelope expansion and pumping signal expansion.
Wideband excitation signal after expansion exports wideband speech signal eventually through composite filter (filter coefficient is the parameter calculating gained in spectrum envelope expansion process) synthesis.Therefore, pumping signal expansion is a requisite step in bandwidth expansion technique.At present, main pumping signal expansion algorithm mainly contains three kinds: narrowband excitation spectrum folds/spectrum translation, white-noise excitation and harmonic excitation.Although these three kinds of methods all comparative maturities at present, their shortcoming is: should retain suitable harmonic structure in the pumping signal of synthesis, can not introduce again too much artificial harmonic wave, because too much artificial harmonic wave can damage original harmonic structure; The computational complexity of algorithm can not be too high, otherwise this algorithm will be difficult to realize.
Summary of the invention
The invention provides a kind of wideband excitation signal synthetic method based on segment extension, the method computational complexity not high and be easy to realize.
In order to reach above-mentioned technique effect, technical scheme of the present invention is as follows:
Based on a wideband excitation signal synthetic method for segment extension, comprise the following steps:
S1: narrow band voice signal is successively passed through analysis filter after up-sampling, framing pre-service, obtains narrowband excitation signal u n;
S2: by narrowband excitation signal u nby the low-pass filter that cutoff frequency is α 1, export the first band excitation signal u l, to the first band excitation signal u lcarry out spectrum translation, be the Hi-pass filter of α 1 again by a cutoff frequency by the signal after spectrum translation, obtain the second band excitation signal u m;
S3: a unit white noise generator is set and produces white noise signal u f, calculate the first band excitation signal u lvariance obtain gain factor adjustment white noise signal u fenergy and be multiplied by gain factor then be the Hi-pass filter of α 2 by a cutoff frequency, obtain the 3rd band excitation signal u h;
S4: by the first band excitation signal u l, the second band excitation signal u mwith the 3rd band excitation signal u hmerge and obtain wideband excitation signal u.
Further, white noise signal u after energy adjusting in described step S3 ffrequency band without limit for width, the Hi-pass filter being α 2 by a cutoff frequency carrys out the spectrum component of filtering white noise α less than 2, obtains the 3rd band excitation signal u that frequency band range is 6500Hz-8000Hz h.
Preferably, to be the span of α 1 be cutoff frequency: 3000-3500HZ.
Preferably, to be the span of α 2 be cutoff frequency: 6000-6500HZ.
Further, the first band excitation signal u in described step S4 l, the second band excitation signal u mwith the 3rd band excitation signal u hall be band limiting and the pumping signal of respective band component non-overlapping copies, three pumping signals carried out sum operation in time domain, obtains the wideband excitation signal u that a frequency band is complete.
Compared with prior art, the beneficial effect of technical solution of the present invention is:
Narrowband excitation is composed translation and white-noise excitation two kinds of methods combining by the present invention, narrowband excitation signal is passed through low-pass filter, export the first band excitation signal, the second band excitation signal is obtained by after the first band excitation signal spectrum translation and high-pass filtering, by using the white noise of energy adjusting and as the 3rd band excitation signal after being multiplied by gain factor, finally by the first band excitation signal, second band excitation signal and the 3rd band excitation signal synthesize complete wideband excitation signal, the method can retain the harmonic structure of pumping signal preferably and not introduce too much artificial harmonic wave, the computational complexity of algorithm itself not high and be easy to realize, improve the performance of existing bandwidth expansion technique, improve the communication quality of narrowband speech.
Accompanying drawing explanation
Fig. 1 is the bandwidth expanding method schematic diagram based on source filter model;
Fig. 2 is the pumping signal synthetic method schematic diagram based on segment extension;
Fig. 3 is the structural representation of spectrum shift method;
Fig. 4 is the oscillogram of voice, the comparison diagram of sound spectrograph of broadband voice, narrowband speech, the present invention's synthesis.
Embodiment
Accompanying drawing, only for exemplary illustration, can not be interpreted as the restriction to this patent;
In order to better the present embodiment is described, some parts of accompanying drawing have omission, zoom in or out, and do not represent the size of actual product;
To those skilled in the art, in accompanying drawing, some known features and explanation thereof may be omitted is understandable.
Below in conjunction with drawings and Examples, technical scheme of the present invention is described further.
Embodiment 1
As shown in Figure 2, a kind of wideband excitation signal synthetic method based on segment extension, comprises the following steps:
S1: narrow band voice signal is successively passed through analysis filter after up-sampling, framing pre-service, obtains narrowband excitation signal u n;
S2: by narrowband excitation signal u nby the low-pass filter that cutoff frequency is α 1, export the first band excitation signal u l, to the first band excitation signal u lcarry out spectrum translation, be the Hi-pass filter of α 1 again by a cutoff frequency by the signal after spectrum translation, obtain the second band excitation signal u m; Wherein, to be the span of α 1 be cutoff frequency: 3000-3500HZ;
S3: a unit white noise generator is set and produces white noise signal u f, calculate the first band excitation signal u lvariance obtain gain factor adjustment white noise signal u fenergy and be multiplied by gain factor then be the Hi-pass filter of α 2 by a cutoff frequency, obtain the 3rd band excitation signal u h; The span of α 2 is: 6000-6500HZ;
S4: by the first band excitation signal u l, the second band excitation signal u mwith the 3rd band excitation signal u hmerge and obtain wideband excitation signal u.
In the present embodiment, first narrow band voice signal carries out up-sampling (sampling rate is 8000Hz), the pre-service such as framing, doing up-sampling process to narrow band signal is in order to the frequency band range by increasing its signal, expand for follow-up band component, pretreated narrow band signal is by obtaining narrowband excitation signal after analysis filter, this filter coefficient is determined by the parameter of speech linear predictive analytical calculation gained, the input signal that narrowband excitation signal is expanded as pumping signal, it is the condition precedent realizing pumping signal expansion, narrowband excitation signal is after up-sampling, sample frequency becomes 16000Hz from original 8000Hz, this signal is by obtaining low band excitation signal after low-pass filter, the parameter of corresponding low-pass filter is as follows:
Sample frequency: 16000Hz
Cut-off frequecy of passband: 3400Hz pass band damping: 1dB
Stopband cutoff frequency: 3450Hz stopband attenuation: 60dB
The the first band excitation signal u obtained lbe low band excitation signal, its frequency band range is 0-3400Hz.
Due to before acquisition narrowband excitation signal, narrow band signal is through up-sampling pre-service, so there will be unnecessary band component in the frequency spectrum of narrowband excitation signal, so the low-pass filter that must be 3400Hz by a cutoff frequency carrys out the outer unnecessary composition of filtering narrowband excitation signal low-frequency band, obtain the low band excitation signal u that frequency band range is 0-3400Hz l; Low band excitation signal u lthe second band excitation signal u is exported by spectrum shift operations module m, u mbe midband pumping signal, compose the specific implementation step of shift operations module as shown in Figure 3: lower band excitation u lbe 3100Hz with a frequency, amplitude be 2 cosine function be multiplied after be the Hi-pass filter of 3400Hz by a cutoff frequency again, output band scope is the midband pumping signal u of 3400-6500Hz m; From the modulating characteristic of Fourier transform, by signal times with an angular frequency for ω mcosine function, frequency domain is equal to the frequency band of this signal has been carried out length is ω mtranslation, spectrum translation time-domain expression respectively Ru shown in (1) formula:
u ^ e ( k ; m ) = u ^ n ( k ; m ) · 2 c o s ( ω M k ) - - - ( 1 )
Wherein, represent the midband pumping signal u of expansion mthe kth data point of m frame, similarly, represent low band excitation signal u lthe kth data point of m frame, ω in the method mbe chosen for 2 π 3100 (rad/s).
(1) frequency-domain expression corresponding to formula is as shown in (2) formula:
Φ e(ω)=Φ n(ω-ω M)+Φ n(ω+ω M)(2)
Wherein, Φ e(ω) power spectrum of signal after expression spectrum translation, Φ n(ω) power spectrum of low band excitation signal.
As can be seen from formula (2), the many Φ of the power spectrum of signal after spectrum translation n(ω-ω m) power spectrum composition.Therefore, the signal demand after spectrum translation carrys out this unnecessary composition of filtering by a Hi-pass filter, the band component needed for reservation, and the optimum configurations of this wave filter is as follows:
Sample frequency: 16000Hz
Stopband cutoff frequency: 3350Hz stopband attenuation: 60dB
Cut-off frequecy of passband: 3400Hz pass band damping: 1dB
The power spectrum of gained midband pumping signal is such as formula shown in (3):
Φ ^ e ( ω ) = Φ n ( ω + ω M ) - - - ( 3 )
Wherein, represent the power spectrum of midband pumping signal.
Calculate low band excitation signal u lvariance a unit white noise generator is set and produces white noise signal, will the variance of gained be calculated be multiplied by white noise signal as gain factor and be the Hi-pass filter of 6500Hz by a cutoff frequency, form the 3rd band excitation signal u of frequency band range 6500-8000Hz h, u hit is high band excitation signal.Low band excitation signal u l, midband pumping signal u mwith high band excitation signal u hmerge, export complete wideband excitation signal; Calculate the standard deviation of low band excitation signal, for the energy adjusting of high frequency band unit white noise, computing formula is for shown in formula (4):
σ u ^ n = 1 N k Σ k - 1 N k ( u ^ n ( k ) - μ ) 2 - - - ( 4 )
In formula, for a kth data point of low band excitation signal, μ is the average of low band excitation signal.
Produce unit white noise signal with random function, this signal obeys standardized normal distribution, and its probability density function is such as formula shown in (5):
f ( x ) = 1 2 π exp ( - x 2 2 ) - - - ( 5 )
Growth factor be multiplied by unit white noise signal obtain energy adjusting after white noise signal, the frequency band due to white noise is without limit for width, and at present required be the white-noise excitation signal of high frequency band.Therefore, the band component that the white noise signal after energy adjusting will come beyond this signal higher frequency band of filtering by a Hi-pass filter.The optimum configurations of this wave filter is as follows:
Sample frequency: 16000Hz
Stopband cutoff frequency: 6450Hz stopband attenuation: 60dB
Cut-off frequecy of passband: 6500Hz pass band damping: 1dB
By the white noise signal after Hi-pass filter, be exactly required high band excitation signal, its frequency band range is 6500-8000Hz.
Low band excitation signal, midband pumping signal and high band excitation signal are carried out sum operation in time domain, obtain complete wideband excitation signal, shown in (6):
u ^ w ( k ; m ) = u ^ n ( k ; m ) + u ^ e ( k ; m ) + u ^ h ( k ; m ) - - - ( 6 )
In formula, represent the estimated value of a m frame kth data point of wideband excitation signal, similarly, with represent low band excitation signal u respectively l, midband pumping signal u mwith high band excitation signal u ha m frame kth data dot values, experimental result is as shown in Figure 4.
The corresponding same or analogous parts of same or analogous label;
Describe in accompanying drawing position relationship for only for exemplary illustration, the restriction to this patent can not be interpreted as;
Obviously, the above embodiment of the present invention is only for example of the present invention is clearly described, and is not the restriction to embodiments of the present invention.For those of ordinary skill in the field, can also make other changes in different forms on the basis of the above description.Here exhaustive without the need to also giving all embodiments.All any amendments done within the spirit and principles in the present invention, equivalent to replace and improvement etc., within the protection domain that all should be included in the claims in the present invention.

Claims (5)

1., based on a wideband excitation signal synthetic method for segment extension, it is characterized in that, comprise the following steps:
S1: narrow band voice signal is successively passed through analysis filter after up-sampling, framing pre-service, obtains narrowband excitation signal u n;
S2: by narrowband excitation signal u nby the low-pass filter that cutoff frequency is α 1, export the first band excitation signal u l, to the first band excitation signal u lcarry out spectrum translation, be the Hi-pass filter of α 1 again by a cutoff frequency by the signal after spectrum translation, obtain the second band excitation signal u m;
S3: a unit white noise generator is set and produces white noise signal u f, calculate the first band excitation signal u lvariance obtain gain factor adjustment white noise signal u fenergy and be multiplied by gain factor then be the Hi-pass filter of α 2 by a cutoff frequency, obtain the 3rd band excitation signal u h;
S4: by the first band excitation signal u l, the second band excitation signal u mwith the 3rd band excitation signal u hmerge and obtain wideband excitation signal u.
2. the wideband excitation signal synthetic method based on segment extension according to claim 1, is characterized in that, white noise signal u after energy adjusting in described step S3 ffrequency band without limit for width, the Hi-pass filter being α 2 by a cutoff frequency carrys out the spectrum component of filtering white noise α less than 2, obtains the 3rd band excitation signal u that frequency band range is 6500Hz-8000Hz h.
3. the wideband excitation signal synthetic method based on segment extension according to claim 1, is characterized in that, to be the span of α 1 be cutoff frequency: 3000-3500HZ.
4. the wideband excitation signal synthetic method based on segment extension according to any one of claim 1-2, is characterized in that, to be the span of α 2 be cutoff frequency: 6000-6500HZ.
5. the wideband excitation signal synthetic method based on segment extension according to claim 1, is characterized in that, the first band excitation signal u in described step S4 l, the second band excitation signal u mwith the 3rd band excitation signal u hall be band limiting and the pumping signal of respective band component non-overlapping copies, three pumping signals carried out sum operation in time domain, obtains the wideband excitation signal u that a frequency band is complete.
CN201510329767.3A 2015-06-15 2015-06-15 Broadband excitation signal synthesis method based on segmented expansion Pending CN105047201A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510329767.3A CN105047201A (en) 2015-06-15 2015-06-15 Broadband excitation signal synthesis method based on segmented expansion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510329767.3A CN105047201A (en) 2015-06-15 2015-06-15 Broadband excitation signal synthesis method based on segmented expansion

Publications (1)

Publication Number Publication Date
CN105047201A true CN105047201A (en) 2015-11-11

Family

ID=54453688

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510329767.3A Pending CN105047201A (en) 2015-06-15 2015-06-15 Broadband excitation signal synthesis method based on segmented expansion

Country Status (1)

Country Link
CN (1) CN105047201A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107659290A (en) * 2017-08-09 2018-02-02 湖南艾科诺维科技有限公司 Bandwidth expansion wave filter and its design method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1328683A (en) * 1998-10-27 2001-12-26 沃斯艾格公司 High frequency content recovering methd and device for over-sampled synthesized wideband signal
US20070296614A1 (en) * 2006-06-21 2007-12-27 Samsung Electronics Co., Ltd Wideband signal encoding, decoding and transmission
CN101288614A (en) * 2008-05-21 2008-10-22 清华大学深圳研究生院 Electronic cochlea telephony adaptation device and method based on spectrum extension technique
CN102934163A (en) * 2010-06-01 2013-02-13 高通股份有限公司 Systems, methods, apparatus, and computer program products for wideband speech coding
CN103218539A (en) * 2013-05-06 2013-07-24 重庆科技学院 Rapid determining method of structural response driven by broadband and unsteady-state signal
CN103714822A (en) * 2013-12-27 2014-04-09 广州华多网络科技有限公司 Sub-band coding and decoding method and device based on SILK coder decoder

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1328683A (en) * 1998-10-27 2001-12-26 沃斯艾格公司 High frequency content recovering methd and device for over-sampled synthesized wideband signal
US20070296614A1 (en) * 2006-06-21 2007-12-27 Samsung Electronics Co., Ltd Wideband signal encoding, decoding and transmission
CN101288614A (en) * 2008-05-21 2008-10-22 清华大学深圳研究生院 Electronic cochlea telephony adaptation device and method based on spectrum extension technique
CN102934163A (en) * 2010-06-01 2013-02-13 高通股份有限公司 Systems, methods, apparatus, and computer program products for wideband speech coding
CN103218539A (en) * 2013-05-06 2013-07-24 重庆科技学院 Rapid determining method of structural response driven by broadband and unsteady-state signal
CN103714822A (en) * 2013-12-27 2014-04-09 广州华多网络科技有限公司 Sub-band coding and decoding method and device based on SILK coder decoder

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107659290A (en) * 2017-08-09 2018-02-02 湖南艾科诺维科技有限公司 Bandwidth expansion wave filter and its design method

Similar Documents

Publication Publication Date Title
EP3291231B1 (en) Oversampling in a combined transposer filterbank
CA2550654C (en) Frequency extension of harmonic signals
CN102246231B (en) Audio encoder and bandwidth extension decoder
RU2490728C2 (en) Efficient combined harmonic transposition
DE69821089T2 (en) IMPROVE SOURCE ENCODING USING SPECTRAL BAND REPLICATION
US7742927B2 (en) Spectral enhancing method and device
CN101458930B (en) Excitation signal generation in bandwidth spreading and signal reconstruction method and apparatus
EP0657873B1 (en) Speech signal bandwidth compression and expansion apparatus, and bandwidth compressing speech signal transmission method, and reproducing method
DE60110796T2 (en) FREQUENCY INTERPOLATION DEVICE AND FREQUENCY INTERPOLATION METHOD
JP2007535849A (en) Information signal processing by transformation in spectral / modulated spectral domain representation
CN103258543B (en) Method for expanding artificial voice bandwidth
EP3608910B1 (en) Decoding device and method, and program
WO2016002551A1 (en) Signal processing device and signal processing method
EP1657710B1 (en) Coding apparatus and decoding apparatus
GB2351889A (en) Speech band expansion
CN117975976A (en) Band expansion method, device, electronic equipment and computer readable storage medium
CN105047201A (en) Broadband excitation signal synthesis method based on segmented expansion
KR20030007528A (en) Spectral enhancing method and device
CN103155035A (en) Audio signal bandwidth extension in celp-based speech coder
KR20170080387A (en) Apparatus and method for extending bandwidth of earset with in-ear microphone
CN105280189A (en) Method and apparatus for high-frequency generation during bandwidth extension coding and decoding
JP2010515090A (en) Speech coding method and apparatus
CN101763629B (en) Construction method of integrative filter bank with cellular structure
CN111010146A (en) Signal reconstruction structure based on fast filter bank and design method thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20151111