CN102339607A - Method and device for spreading frequency bands - Google Patents

Method and device for spreading frequency bands Download PDF

Info

Publication number
CN102339607A
CN102339607A CN2010102330332A CN201010233033A CN102339607A CN 102339607 A CN102339607 A CN 102339607A CN 2010102330332 A CN2010102330332 A CN 2010102330332A CN 201010233033 A CN201010233033 A CN 201010233033A CN 102339607 A CN102339607 A CN 102339607A
Authority
CN
China
Prior art keywords
frequency band
high frequency
statistical property
property classification
parameters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102330332A
Other languages
Chinese (zh)
Inventor
本特·凯瑟
马格纳斯·斯卡弗
皮特·瓦里
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN2010102330332A priority Critical patent/CN102339607A/en
Priority to PCT/CN2011/075079 priority patent/WO2011144130A1/en
Publication of CN102339607A publication Critical patent/CN102339607A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The embodiment of the invention discloses a method and device for spreading frequency bands. Taking the implementation of the method for instance, the method comprises the following steps of: acquiring the characteristic vectors of low-frequency band signals; carrying out classification on the low-frequency band signals according to the characteristic vectors and the preset statistical characteristic classification characteristic vector set pairs and obtaining a statistical characteristic classification result; obtaining predicted high-frequency band parameters according to the statistical characteristic classification result, the characteristic vectors and the preset statistical characteristic classification state transition matrix; adjusting the predicted high-frequency band parameters according to the statistical characteristic classification result and the preset post-processing smooth factor set and obtaining adjusted high-frequency band parameters; and reconstructing high-frequency band signals. With the adoption of the method, the obtained parameter information for spreading the frequency bands can be more directive, the interframe transition is smoother, and the obtained spread signals have higher hearing perception.

Description

A kind of method and apparatus of band spread
Technical field
The present invention relates to communication technical field, particularly a kind of method and apparatus of band spread.
Background technology
Along with the development of bearing technology, People more and more is not content with the quality of narrowband speech codec, thus audio coder & decoder (codec) progressively to the broadband (Windband, WB), ultra broadband (Super Windband, SWB) expansion.(the International Telecommunication Union of International Telecommunications Union (ITU) for example; ITU) released G.722, G.722.1, G.722.2, G.729.1 waited broadband encoding and decoding speech standard; 3G (Third Generation) Moblie partnership projects (Third Generation Partnership Project; 3GPP) released AMR-WB (Adaptive Multi-Rate Windband; AMR-WB) (be ITU G.722.2) this broadband voice encoding and decoding standard, 3GPP2 then released the variable Rate multi-mode wideband (Variable-Rate Multimode Windband, VMR-WB).G.711, WB&G.722 associating ultra broadband etc. ITU has proposed G.729.1&G.718 to unite ultra broadband recently again in addition.These standards are all from arrowband expansion, and core layer is generally Code Excited Linear Prediction (Code-Excited Linear-Prediction, CELP) coding, and broadband, ultra broadband partly use transition coding technological.Transition coding has a lot, and the discrete cosine transform of for example revising (Modified Discrete Cosine Transform, MDCT), the conversion code excitation (Transform Coded exciation, TCX) etc.
Band spread is used in the voice/audio coding field very widely, can improve the perceived quality of limit band voice/music effectively, and the quality enhancement techniques of on the terminal, using based on band spread is exactly one type of good application example.Band spreading technique also is applied in the embedded variable rate speech coding device by wide model, and the audio bandwidth that particularly when transmission channel conditions changes, produces switches.Common bandwidth switch mainly contain the arrowband (Narrow band, NB), broadband, ultra broadband, full band (Full Band, the switching between FB).
The method that realizes band spread can be divided into two kinds of the band spreads of band spread that side information is arranged and no side information.There is the band spread of side information to extract some characteristic informations of treating extending bandwidth, and these information products are delivered to decoding end, instruct decoding end to carry out the frequency band corresponding expansion at coding side.The band spread of no side information is called blind expansion again, need be in the coding side information extraction, and the information of the partial-band that only need obtain according to decoding end is through the information of the required extending bandwidth of the artificial generation of certain algorithm for estimating.
The method of band spread can also be divided into based on the expansion of time domain with based on the expansion of frequency domain.The time-domain information of the partial-band that normally obtains based on decoding end based on the expansion of time domain carries out obtaining the time-domain information of required extending bandwidth after the shaping of time domain and frequency domain, thereby realizes band spread.The frequency domain information of the partial-band that normally obtains based on decoding end based on the expansion of frequency domain carries out obtaining the frequency domain information of required extending bandwidth after the shaping of frequency domain and time domain, thereby realizes band spread.
At present, the processing that the band spreading technique of no side information generally carries out in time domain wherein has a kind of method to be based on the piecewise linear maps frequency range expanding method of statistical property.The performing step of this method:
1, extracts the eigenvector of the partial-band that decoding obtains;
2, compare through the statistical property characteristic of division vector set that training in advance before the eigenvector that extracts and the band spread is obtained, signal is classified; Above-mentioned training is meant: according to certain rule, between data are concentrated, extract useful information, use the guidance of these useful informations that these data are divided into different classes, represent with its corresponding useful information for of a sort data.
3, according to type corresponding predefined state-transition matrix of above-mentioned branch, obtain the parameter information of required extending bandwidth, thereby realize band spread.
The inventor finds in realizing process of the present invention: because signal is divided into limited several types; The parameter information of the extending bandwidth that therefore can generate has only limited several kinds; Can't adaptive signal characteristic widely, cause the interframe transition unsmooth, cause auditory perception poor.
Summary of the invention
The technical matters that the embodiment of the invention will solve provides a kind of method and apparatus of band spread, improves auditory perception.
For solving the problems of the technologies described above, the method embodiment of band spread provided by the present invention can realize through following technical scheme:
Obtain the eigenvector of low band signal;
Reach preset statistical property characteristic of division vector set to said low band signal is classified according to said eigenvector, obtain the statistical property classification results;
According to said statistical property classification results, eigenvector and preset statistical property classification state-transition matrix, the high frequency band parameters that obtains estimating;
According to said statistical property classification results and preset aftertreatment smoothing factor collection, the said high frequency band parameters of estimating is adjusted, obtain adjusted high frequency band parameters;
According to adjusted high frequency band parameters, rebuild high-frequency band signals.
A kind of device of band spread comprises:
The vector acquiring unit is used to obtain the eigenvector of low band signal;
Taxon is used for reaching preset statistical property characteristic of division vector set to said low band signal is classified according to said eigenvector, obtains the statistical property classification results;
Estimate the unit, be used for according to said statistical property classification results, eigenvector and preset statistical property classification state-transition matrix, the high frequency band parameters that obtains estimating;
Adjustment unit is used for according to said statistical property classification results and preset aftertreatment smoothing factor collection the said high frequency band parameters of estimating being adjusted, and obtains adjusted high frequency band parameters;
The signal reconstruction unit is used for according to adjusted high frequency band parameters, rebuilds high-frequency band signals.
Technique scheme has following beneficial effect: on the basis based on the piecewise linear maps band spread algorithm of statistical property, increased a self-adaptation aftertreatment; This method has effectively utilized the classified information that obtains in the piecewise linear maps band spread algorithm; The parameter information of the extending bandwidth that piecewise linear maps band spread algorithm is obtained carries out adaptive aftertreatment again by class; Make that the extending bandwidth parameter information that obtains is more targeted; The interframe transition is more level and smooth, and the signal that expands that obtains has higher auditory perception.
Description of drawings
In order to be illustrated more clearly in the technical scheme in the embodiment of the invention; The accompanying drawing of required use is done to introduce simply in will describing embodiment below; Obviously, the accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills; Under the prerequisite of not paying creative work property, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the embodiment of the invention one a method flow synoptic diagram;
Fig. 2 is the embodiment of the invention two method flow synoptic diagram;
Fig. 3 is the embodiment of the invention three method schematic flow sheets;
Fig. 4 is the embodiment of the invention four apparatus structure synoptic diagram;
Fig. 5 is the embodiment of the invention four apparatus structure synoptic diagram;
Fig. 6 is the embodiment of the invention four apparatus structure synoptic diagram;
Fig. 7 is the embodiment of the invention four apparatus structure synoptic diagram.
Embodiment
To combine the accompanying drawing in the embodiment of the invention below, the technical scheme in the embodiment of the invention is carried out clear, intactly description, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills are not making the every other embodiment that is obtained under the creative work prerequisite, all belong to the scope of the present invention's protection.
Embodiment one, and the embodiment of the invention provides a kind of method of band spread, and is as shown in Figure 1, comprising:
101: the eigenvector that obtains low band signal;
Particularly, above-mentioned eigenvector can comprise: temporal envelope and linear predictor coefficient, above-mentioned temporal envelope are represented the energy size of each subframe signal in the time domain, the resonance peak position and the amplitude of above-mentioned linear predictor coefficient expression signal.Certain above-mentioned eigenvector can also comprise: parameters such as the frequency domain envelope of low band signal, frequency-domain linear prediction coefficient will not limit this embodiment of the invention.
102: according to above-mentioned eigenvector and preset statistical property characteristic of division vector set above-mentioned low band signal is classified, obtain the statistical property classification results;
103: according to above-mentioned statistical property classification results, eigenvector and preset statistical property classification state-transition matrix, the high frequency band parameters that obtains estimating;
Particularly, in above-mentioned 103, according to statistical property classification results, eigenvector and preset statistical property classification state-transition matrix, the high frequency band parameters that obtains estimating comprises:
According to above-mentioned statistical property classification results, the corresponding state-transition matrix of the above-mentioned classification results of inquiry in preset statistical property classification state-transition matrix; The high frequency band parameters that obtains estimating according to above-mentioned state-transition matrix and above-mentioned eigenvector.
The statistical property classification results that certain above-mentioned basis obtains, the eigenvector of low band signal and preset statistical property classification state-transition matrix; The high frequency band parameters that obtains estimating; Other modes can also be arranged; For example following dual mode: (1) is inquired about the corresponding state-transition matrix of this classification results according to the statistical property classification results that obtains in preset statistical property classification state-transition matrix; Multiply by the eigenvector of low band signal, the high frequency band parameters that obtains estimating according to the corresponding state-transition matrix of the statistical property classification results that obtains.(2) according to the statistical property classification results that obtains, the corresponding state transitions map index value of this classification results of inquiry in preset statistical property classification state-transition matrix; Table look-up and obtain the corresponding high frequency band parameters of estimating according to obtaining state transitions map index value.
Particularly, above-mentioned high frequency band parameters can comprise: temporal envelope and frequency domain envelope; Above-mentioned temporal envelope is represented the energy size of each subframe signal in the time domain, and above-mentioned frequency domain envelope is represented the gain size of each subband signal in the frequency domain.Certain above-mentioned high frequency band parameters can also comprise: parameters such as the time domain linear predictive coefficient of high-frequency band signals, frequency-domain linear prediction coefficient will not limit this embodiment of the invention.
104: according to above-mentioned statistical property classification results and preset aftertreatment smoothing factor collection, the above-mentioned high frequency band parameters of estimating is adjusted, obtained adjusted high frequency band parameters;
If said high frequency band parameters comprises: temporal envelope and frequency domain envelope; Said aftertreatment factor parameter comprises: smoothing factor in the frame; Then said the high frequency band parameters of estimating is adjusted; Obtain adjusted high frequency band parameters can for: according to smoothing factor in the corresponding frame of statistical property classification; Temporal envelope parameter and frequency domain envelope parameters in the high frequency band parameters that adjustment is estimated obtain adjusted high frequency band parameters.
Particularly; Above-mentioned according to statistical property classification results and preset aftertreatment smoothing factor collection; The above-mentioned high frequency band parameters adjustment of estimating is comprised:, concentrate the corresponding aftertreatment factor parameter of the above-mentioned statistical property classification results of inquiry at preset aftertreatment smoothing factor according to above-mentioned statistical property classification results; According to the aftertreatment factor parameter corresponding with above-mentioned statistical property classification results, adjust the above-mentioned high frequency band parameters of estimating, obtain adjusted high frequency band parameters.
Certainly; Classification results that above-mentioned basis obtains and preset aftertreatment smoothing factor collection; The method that the high frequency band parameters of estimating is adjusted can also have other mode; Give an example for for example following 3: the aftertreatment factor parameter of this classification results correspondence of inquiry is concentrated according to the statistical property classification results that obtains in (1) at preset aftertreatment smoothing factor; According to the corresponding aftertreatment factor parameter of the statistical property classification results that obtains; According to the aftertreatment factor parameter carry out in the frame to the high frequency band parameters of estimating and/or interframe level and smooth; The high frequency band parameters that adjustment is adaptively estimated obtains adjusted high frequency band parameters.(2), concentrate the corresponding aftertreatment factor parameter of this classification results of inquiry at preset aftertreatment smoothing factor according to the statistical property classification results that obtains; According to the corresponding aftertreatment factor parameter of the statistical property classification results that obtains, the high frequency band parameters of estimating carried out parameter decay or strengthen according to the aftertreatment factor parameter, the high frequency band parameters that adjustment is adaptively estimated obtains adjusted high frequency band parameters.(3), concentrate the corresponding aftertreatment factor parameter of this classification results of inquiry at preset aftertreatment smoothing factor according to the statistical property classification results that obtains; According to the corresponding aftertreatment factor parameter of the statistical property classification results that obtains; According to the aftertreatment factor parameter carry out in the frame to the high frequency band parameters of estimating and/or interframe level and smooth; The line parameter of going forward side by side decay or enhancing; The high frequency band parameters that adjustment is adaptively estimated obtains adjusted high frequency band parameters.
105:, rebuild high-frequency band signals according to above-mentioned adjusted high frequency band parameters.
Particularly, above-mentioned aftertreatment factor parameter can comprise: smoothing factor and interframe smoothing factor in the frame.
More specifically, the high frequency band parameters that above-mentioned adjustment is estimated comprises: according to smoothing factor in the corresponding frame of statistical property classification, the temporal envelope parameter in the high frequency band parameters that adjustment is estimated; According to the corresponding interframe smoothing factor of statistical property classification, the frequency domain envelope parameters in the high frequency band parameters that preliminary adjustment is estimated; According to smoothing factor in the corresponding frame of statistical property classification, preliminary adjusted high frequency band frequency domain envelope parameters is adjusted again, obtain adjusted high frequency band parameters.
The executive agent of said method can be any device that carries out band spread; The method that the embodiment of the invention provides has increased a self-adaptation aftertreatment on the basis based on the piecewise linear maps band spread algorithm of statistical property; This method has effectively utilized the classified information that obtains in the piecewise linear maps band spread algorithm; The parameter information of the extending bandwidth that piecewise linear maps band spread algorithm is obtained carries out adaptive aftertreatment again by class; Make that the extending bandwidth parameter information that obtains is more targeted, the interframe transition is more level and smooth, and the signal that expands that obtains has higher auditory perception.
Embodiment two, and the method that the embodiment of the invention gives band spread is instance more specifically, and is as shown in Figure 2, comprises the steps:
201: decoding obtains low band signal;
202: the eigenvector X that extracts low band signal f
Eigenvector can have various array modes, only needs to reflect that the characteristic of low band signal gets final product.For example, eigenvector can comprise the temporal envelope and the linear predictor coefficient of low band signal, also can comprise the temporal envelope and the frequency domain envelope of low band signal.
203: according to the eigenvector X of the low band signal that obtains fAnd preset statistical property characteristic of division vector set X F, j(j ∈ 1 ..., M}), low band signal is classified, obtain statistical property classification i.
Concrete realization can be used the method for vector quantization, and preset statistical property characteristic of division vector set as code book, is searched for and eigenvector X in code book fThe code word that distance is minimum; This code word corresponding index in code book is statistical property classification i: code book is an array; Comprised tactic M the characteristic of division vector that classification is corresponding, each eigenvector is exactly a code word, and M eigenvector is exactly M code word.The position of this code word in code book represented in the index of code word, index is corresponding classification number.
i = arg min j ∈ { 1 , . . . , M } | | X f - X f , j | | 2
Wherein || || 2Square error is calculated in expression.
204: according to the statistical property classification i that obtains, the eigenvector X of low band signal fAnd preset statistical property classification state-transition matrix collection H, estimate high frequency band parameters
Figure BSA00000199994200072
Wherein
Figure BSA00000199994200073
The statistical property classification state-transition matrix H that expression is corresponding to statistical property classification i iCalculate transposition (property sort i is corresponding i vector H among the state-transition matrix H i).Statistical property category set and statistical property classification state-transition matrix have kept one-to-one relationship when training out simultaneously.
Above-mentioned high frequency band parameters can have the different combinations mode, as long as can reflect the characteristic of high-frequency band signals.For example, high frequency band parameters can comprise the temporal envelope and the linear predictor coefficient of high-frequency band signals, also can comprise the temporal envelope and the frequency domain envelope of high-frequency band signals.The array mode that need to prove the eigenvector in high frequency band parameters and the preceding text is can be inconsistent, does not influence the realization of the embodiment of the invention.
205: according to statistical property classification i and preset aftertreatment factor set; Carry out the self-adaptation adjustment to estimating high frequency band parameters, obtain adjusted high frequency band parameters
Figure BSA00000199994200074
Above-mentioned preset aftertreatment factor set can have the different combinations mode, can only comprise the interframe smoothing factor, also can comprise smoothing factor in interframe smoothing factor and the frame, can also comprise the different aftertreatment factors such as the state transition factor.Each parameter in the preset aftertreatment factor set can be directed against the different statistic property sort respectively, thereby embodies the adaptive characteristic of post-processing approach and fully use the characteristics of statistical property classification.
206: rebuild high-frequency band signals according to adjusted high frequency band parameters.
Rebuild high-frequency band signals according to the high frequency band parameters that obtains; The particular content that mainly comprises according to high frequency band parameters; For example, when high frequency band parameters has comprised temporal envelope and frequency domain envelope, can adopt the method for frequency domain band spread; The frequency domain spectra of low-frequency band is transformed to time domain after according to the frequency domain envelope shaping, carry out the high-frequency band signals that shaping obtains rebuilding according to temporal envelope again; Also can adopt the method for time domain band spread, the time domain pumping signal of low-frequency band is carried out transforming to frequency domain after the shaping according to temporal envelope, carry out shaping according to the frequency domain envelope again, time domain is returned in conversion more at last, the high-frequency band signals that obtains rebuilding.When high frequency band parameters has comprised the high frequency band linear predictor coefficient, can use the composite filter of the time domain pumping signal of low-frequency band, the high-frequency band signals that obtains rebuilding through high frequency band linear predictor coefficient formation.
Below to preset statistical property characteristic of division vector set, preset statistical property classification state-transition matrix collection and preset aftertreatment factor set, be that the statistical property according to a large amount of signals obtains, concrete training method is following:
At first, from training set, extract the low-frequency band eigenvector and the corresponding high frequency band parameters vector of each signal, form low-frequency band eigenvector training set and high frequency band parameters vector training set respectively; Above-mentioned training set is the data set that is used to train, and this data set is previously selected voice/audio language material.
Then, according to the different statistic characteristic, according to the method for cluster, training obtains low-frequency band feature vector set X from low-frequency band eigenvector training set F, j(j ∈ 1 ..., M}), the cluster according to statistical property characteristic of division vector set obtains corresponding high frequency band parameters vector set Y simultaneously F, j(j ∈ 1 ..., M}).
According to each statistical nature classification j ∈ 1 ..., the training data that M} is corresponding (the corresponding low-frequency band eigenvector X of statistical nature classification j jWith the high frequency band parameters vector Y i), calculate corresponding state-transition matrix:
H j=X j +·Y j=(X j TX j) -1X j T·Y j
X j +Vector X is asked in expression jPseudo-inverse operation, X j +=(X j TX j) -1X j TEach statistical nature classification j ∈ 1 ..., the state-transition matrix that M} is corresponding has just constituted state-transition matrix collection H j(j ∈ 1 ..., M}).
According to trained low-frequency band feature vector set and state-transition matrix collection; According to the method for classification linear mapping band spread, each signal estimates high frequency band vector
Figure BSA00000199994200081
in the calculation training
Calculate each statistical nature classification corresponding reliable sex factor, and with it as the parameter in the aftertreatment factor set.Above-mentioned reliability factor is α iThe scope of the aftertreatment factor is more wide in range, can only comprise α i, also can comprise other factors except that reliability factor.The corresponding average error in classification of statistical nature classification i is:
e i = 1 N Σ n = 1 N 1 N y Σ l = 0 N y | | y ^ n i ( l ) - y n i ( l ) | | 2
Wherein
Figure BSA00000199994200083
Expression belongs to n the 1st component estimating the high frequency band vector of i statistical nature classification, Expression belongs to the 1st component of n actual high frequency band parameters vector of i statistical nature classification, and N is the number that belongs to all vectors of i statistical nature classification in the training set, N yDimension for the high frequency band parameters vector.
Can obtain statistical nature classification i corresponding reliable sex factor according to the corresponding average error in classification of statistical nature classification i:
α i = c e i · 1 max j ( 1 / e j )
Wherein c is a constant.
In the practical application, can only calculate one group of reliability factor α iAlso can calculate several groups of reliability factors such as α respectively to the different parameter that comprises in the high frequency band parameters vector i, β iDeng, form the reliability factor vector and with it as the aftertreatment factor set, will not limit counting the embodiment of the invention with the group of reliability factor.Use the instance of many group reliability factors, for example,, can use top method to calculate each statistical nature classification corresponding reliable sex factor note respectively and make α if comprised temporal envelope parameter and frequency domain envelope parameters in the high frequency band parameters vector iAnd β i, form reliability factor vector { α i, β i.
In above-mentioned instance, the low-frequency band feature vector set X that trains F, j(j ∈ 1 ..., M}) be preset statistical property characteristic of division vector set, the state-transition matrix collection H that trains j(j ∈ 1 ..., M}) being preset statistical property classification state-transition matrix collection, the aftertreatment factor set that trains is preset aftertreatment factor set.
The method that the embodiment of the invention provides has increased a self-adaptation aftertreatment on the basis based on the piecewise linear maps band spread algorithm of statistical property; This method has effectively utilized the classified information that obtains in the piecewise linear maps band spread algorithm; The parameter information of the extending bandwidth that piecewise linear maps band spread algorithm is obtained carries out adaptive aftertreatment again by class; Make that the extending bandwidth parameter information that obtains is more targeted; The interframe transition is more level and smooth, and the signal that expands that obtains has higher auditory perception.
Embodiment three, present embodiment provide one to be applied in the frequency expansion method of leniently taking ultra broadband in the ultra broadband demoder to; The method that it is understandable that present embodiment also can be applied to from the arrowband to the broadband; Expansion from the arrowband to the ultra broadband, present embodiment should not be construed as the qualification to the embodiment of the invention as an instance.Present embodiment medium and low frequency band signal is broadband signal, and signal bandwidth is 0~7KHz, and high-frequency band signals is ultra-broadband signal, and signal bandwidth is 7~14KHz.Composite signal sampling rate 32KHz, signal is a frame with 20ms, i.e. N=160 point/frame.The method of concrete band spread is as shown in Figure 3, comprises the steps:
301: decoding obtains broadband signal, and extracts the eigenvector X of broadband signal f
According to wideband decoded method in the ultra broadband demoder, obtain the wideband decoded signal of present frame, note is made x nIn the present embodiment, the eigenvector of broadband signal is that example describes with temporal envelope and the linear predictor coefficient that comprises broadband signal.At first, find the solution the N rank linear predictor coefficient LPC of broadband signal Low={ LPC Low(0), LPC Low(1) ..., LPC Low(K-1) }, concrete grammar can use Lay Wen Xun-Du Bin algorithm.K=64 in the present embodiment can certainly choose other exponent number.Then, calculate temporal envelope: a frame signal is divided into the L sub-frame, and each subframe N/L sampling point calculates the energy of each subframe respectively, with the signal temporal envelope E of the energy of L sub-frame Low={ E Low(0), E Low(1) ..., E Low(L-1) }, the energy E of i sub-frame wherein Low(i):
E low ( i ) = 1 N / L Σ l = 0 N / L x n 2 ( i · ( N / L ) + l )
L=8 in the present embodiment can certainly choose other time domain sub-frame division mode.
So, the eigenvector X of broadband signal fCan remember work:
X f={E low(0),E low(1),…,E low(L-1),LPC low(0),LPC low(1),…,LPC low(N-1)}
302: according to the eigenvector X of the broadband signal that obtains fAnd preset statistical property characteristic of division vector set X F, j(j ∈ 1 ..., M}), broadband signal is classified, obtain statistical property classification i.
Obtain the concrete realization of statistical property classification i and can use the method for vector quantization, preset statistical property characteristic of division vector set as code book, is searched for and eigenvector X in code book fThe code word that distance is minimum, the corresponding index of this code word are statistical property classification i:
i = arg min j ∈ { 1 , . . . , M } | | X f - X f , j | | 2
Wherein || || 2Square error, M=8 in the present embodiment are calculated in expression.
303: according to the statistical property classification i that obtains, the eigenvector X of broadband signal fAnd preset statistical property classification state-transition matrix collection H, estimate the ultra broadband parameter
Figure BSA00000199994200103
y ^ = H i T · X f
Wherein
Figure BSA00000199994200105
The statistical property classification state-transition matrix H that expression is corresponding to statistical property classification i iCalculate transposition.The ultra broadband parameter can have the different combinations mode, as long as can reflect the characteristic of ultra-broadband signal.In the present embodiment, the ultra broadband parameter has comprised the temporal envelope and the frequency domain envelope of ultra-broadband signal.Estimate ultra broadband parameter and can remember work:
y ^ = { E high ( 0 ) , E high ( 1 ) , . . . , E high ( L - 1 ) , G high ( 0 ) , G high ( 1 ) , . . . , G high ( P - 1 ) }
E wherein High(0), E High(1) ..., E High(L-1) estimate temporal envelope parameter, E for ultra-broadband signal High(i) the time domain energy parameter of expression i sub-frame, i=0 ..., L-1, L=8 in the present embodiment can certainly choose other time domain sub-frame division mode.G High(0), G High(1) ..., G High(P-1) estimate frequency domain envelope parameters, G for ultra-broadband signal High(i) the frequency domain gain factor of i subband of expression, i=0 ..., P-1, P=18 in the present embodiment can certainly choose other frequency domain sub-band division mode.
304: according to statistical property classification i and preset aftertreatment factor set; Carry out the self-adaptation adjustment to estimating the ultra broadband parameter, obtain adjusted ultra broadband parameter
Figure BSA00000199994200111
Preset aftertreatment factor set can have the different combinations mode, can only comprise the interframe smoothing factor, also can comprise smoothing factor in interframe smoothing factor and the frame, can also comprise the different aftertreatment factors such as the state transition factor.Each parameter in the preset aftertreatment factor set has embodied the adaptive characteristic of post-processing approach and has fully used the characteristics of statistical property classification respectively to the different statistic property sort.In the present embodiment, the aftertreatment factor set has comprised smoothing factor collection and interframe smoothing factor collection in the frame.The interior smoothing factor note of the frame that statistical property classification i is corresponding is made α i, interframe smoothing factor note is made β iAccording to statistical property classification i and preset aftertreatment factor set, carry out the process of self-adaptation adjustment and comprise estimating the ultra broadband parameter:
(1) according to smoothing factor α in the corresponding frame of statistical property classification i i, the temporal envelope parameter in the ultra broadband parameter is estimated in adjustment.
E high , post ( n ) ( 0 ) = α i · E high ( n ) ( 0 ) + ( 1 - α i ) · E high ( n - 1 ) ( L - 1 )
E high , post ( n ) ( k ) = α i · E high ( n ) ( k ) + ( 1 - α i ) · E high ( n ) ( k - 1 ) , k=1,…,L-1
Wherein the L-1 of
Figure BSA00000199994200114
expression former frame is individual without the ultra broadband temporal envelope of crossing adjustment, the adjusted ultra broadband temporal envelope of
Figure BSA00000199994200115
expression present frame.
(2) according to the corresponding interframe smoothing factor β of statistical property classification i i, the frequency domain envelope parameters in the ultra broadband parameter is estimated in preliminary adjustment.
G high , post ( n ) ( k ) = α i · G high ( n ) ( k ) + ( 1 - α i ) · G high ( n - 1 ) ( k ) , k=0,…,P-1
Wherein
Figure BSA00000199994200117
expression present frame is without the ultra broadband frequency domain envelope of crossing adjustment;
Figure BSA00000199994200118
expression former frame is without the ultra broadband frequency domain envelope of crossing adjustment, and
Figure BSA00000199994200119
representes the preliminary adjusted ultra broadband frequency domain envelope of present frame.
(3) according to statistical property classification i, preliminary adjusted ultra broadband frequency domain envelope parameters is adjusted again.
G high , post 2 ( n ) ( k ) = G high , post ( n ) ( k ) - const · i · ( 1 - α i ) , k=0,…,P-1
Wherein const is a constant factor, and const carries the assembly average of energy in the present embodiment for the ultra broadband frequency domain.
So adjusted ultra broadband parameter note is done:
y ^ post = E high , post ( n ) ( 0 ) , E high , post ( n ) ( 1 ) , . . . , E high , post ( n ) ( L - 1 ) , G high , post 2 ( n ) ( 0 ) , G high , post 2 ( n ) ( 1 ) , . . . , G high , post 2 ( n ) ( P - 1 )
305: rebuild high-frequency band signals according to adjusted ultra broadband parameter.
Rebuild ultra-broadband signal according to the ultra broadband parameter that obtains, the particular content that mainly comprises according to the ultra broadband parameter.In the present embodiment; The ultra broadband parameter has comprised temporal envelope and frequency domain envelope, can adopt the method for frequency domain band spread, at first duplicates the frequency domain spectra in broadband and composes as ultra broadband; According to adjusted frequency domain envelope the ultra broadband spectrum is carried out shaping then; After the shaping ultra broadband spectrum signal is transformed to time domain, carry out the time domain shaping according to adjusted temporal envelope again, the high-frequency band signals that obtains rebuilding.
The method that the embodiment of the invention provides has increased a self-adaptation post-processing approach on the basis based on the piecewise linear maps band spread algorithm of statistical property; This method has effectively utilized the classified information that obtains in the piecewise linear maps band spread algorithm; The parameter information of the extending bandwidth that piecewise linear maps band spread algorithm is obtained carries out adaptive aftertreatment again by class; Make that the extending bandwidth parameter information that obtains is more targeted; The interframe transition is more level and smooth, and the signal that expands that obtains has higher auditory perception.
Embodiment four, the embodiment of the invention also provide a kind of device of band spread, and be as shown in Figure 4, comprising:
Vector acquiring unit 401 is used to obtain the eigenvector of low band signal;
Taxon 402 is used for reaching preset statistical property characteristic of division vector set to above-mentioned low band signal is classified according to above-mentioned eigenvector, obtains the statistical property classification results;
Estimate unit 403, be used for according to above-mentioned statistical property classification results, eigenvector and preset statistical property classification state-transition matrix, the high frequency band parameters that obtains estimating;
Adjustment unit 404 is used for according to above-mentioned statistical property classification results and preset aftertreatment smoothing factor collection the above-mentioned high frequency band parameters of estimating being adjusted, and obtains adjusted high frequency band parameters;
Signal reconstruction unit 405 is used for according to adjusted high frequency band parameters, rebuilds high-frequency band signals.
Particularly, above-mentioned adjustment unit 404 as shown in Figure 5 comprises:
Aftertreatment factor query unit 501 is used for according to above-mentioned statistical property classification results, concentrates the corresponding aftertreatment factor parameter of the above-mentioned statistical property classification results of inquiry at preset aftertreatment smoothing factor;
Adjustment subelement 502 is used for the basis aftertreatment factor parameter corresponding with above-mentioned statistical property classification results, adjusts the above-mentioned high frequency band parameters of estimating, and obtains adjusted high frequency band parameters.
Particularly, as shown in Figure 6, the above-mentioned unit 403 of estimating comprises:
Matrix query unit 601 is used for according to above-mentioned statistical property classification results, the corresponding state-transition matrix of the above-mentioned classification results of inquiry in preset statistical property classification state-transition matrix;
Estimate subelement 602, be used for the high frequency band parameters that obtains estimating according to above-mentioned state-transition matrix and above-mentioned eigenvector.
Particularly, as shown in Figure 7, above-mentioned adjustment unit 404 comprises:
First adjustment unit 701 is used for according to smoothing factor in the corresponding frame of statistical property classification, the temporal envelope parameter in the high frequency band parameters that adjustment is estimated;
Second adjustment unit 702 is used for according to the corresponding interframe smoothing factor of statistical property classification, the frequency domain envelope parameters in the high frequency band parameters that preliminary adjustment is estimated;
The 3rd adjustment unit 703 is used for according to smoothing factor in the corresponding frame of statistical property classification preliminary adjusted high frequency band frequency domain envelope parameters being adjusted again, obtains adjusted high frequency band parameters.
The device that the embodiment of the invention provides has increased a self-adaptation post-processing approach on the basis based on the piecewise linear maps band spread algorithm of statistical property; This method has effectively utilized the classified information that obtains in the piecewise linear maps band spread algorithm; The parameter information of the extending bandwidth that piecewise linear maps band spread algorithm is obtained carries out adaptive aftertreatment again by class; Make that the extending bandwidth parameter information that obtains is more targeted; The interframe transition is more level and smooth, and the signal that expands that obtains has higher auditory perception.
One of ordinary skill in the art will appreciate that all or part of step that realizes in the foregoing description method is to instruct relevant hardware to accomplish through program; Above-mentioned program can be stored in a kind of computer-readable recording medium; The above-mentioned storage medium of mentioning can be a ROM (read-only memory), disk or CD etc.
More than the method and apparatus of a kind of band spread that the embodiment of the invention provided has been carried out detailed introduction; Used concrete example among this paper principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, the part that on embodiment and range of application, all can change, to sum up, this description should not be construed as limitation of the present invention.

Claims (11)

1. the method for a band spread is characterized in that, comprising:
Obtain the eigenvector of low band signal;
Reach preset statistical property characteristic of division vector set to said low band signal is classified according to said eigenvector, obtain the statistical property classification results;
According to said statistical property classification results, eigenvector and preset statistical property classification state-transition matrix, the high frequency band parameters that obtains estimating;
According to said statistical property classification results and preset aftertreatment smoothing factor collection, the said high frequency band parameters of estimating is adjusted, obtain adjusted high frequency band parameters;
According to adjusted high frequency band parameters, rebuild high-frequency band signals.
2. according to the said method of claim 1, it is characterized in that, said according to statistical property classification results and preset aftertreatment smoothing factor collection, the said high frequency band parameters adjustment of estimating is comprised:
According to said statistical property classification results, concentrate the corresponding aftertreatment factor parameter of the said statistical property classification results of inquiry at preset aftertreatment smoothing factor;
According to the aftertreatment factor parameter corresponding with said statistical property classification results, adjust the said high frequency band parameters of estimating, obtain adjusted high frequency band parameters.
3. according to claim 1 or 2 said methods, it is characterized in that said according to statistical property classification results, eigenvector and preset statistical property classification state-transition matrix, the high frequency band parameters that obtains estimating comprises:
According to said statistical property classification results, the corresponding state-transition matrix of the said classification results of inquiry in preset statistical property classification state-transition matrix;
The high frequency band parameters that obtains estimating according to said state-transition matrix and said eigenvector.
4. according to claim 1 or 2 said methods; It is characterized in that said eigenvector comprises: temporal envelope and linear predictor coefficient; Said temporal envelope is represented the energy size of each subframe signal in the time domain, the resonance peak position and the amplitude of said linear predictor coefficient expression signal.
5. according to claim 1 or 2 said methods, it is characterized in that said high frequency band parameters comprises: temporal envelope and frequency domain envelope; Said aftertreatment factor parameter comprises: smoothing factor in the frame; Said the high frequency band parameters of estimating is adjusted, is obtained adjusted high frequency band parameters and comprise:
According to smoothing factor in the corresponding frame of statistical property classification, temporal envelope parameter and frequency domain envelope parameters in the high frequency band parameters that adjustment is estimated obtain adjusted high frequency band parameters.
6. according to claim 1 or 2 said methods, it is characterized in that said aftertreatment factor parameter comprises: smoothing factor and interframe smoothing factor in the frame.
7. according to the said method of claim 6, it is characterized in that the high frequency band parameters that said adjustment is estimated comprises:
According to smoothing factor in the corresponding frame of statistical property classification, the temporal envelope parameter in the high frequency band parameters that adjustment is estimated;
According to the corresponding interframe smoothing factor of statistical property classification, the frequency domain envelope parameters in the high frequency band parameters that preliminary adjustment is estimated;
According to smoothing factor in the corresponding frame of statistical property classification, preliminary adjusted high frequency band frequency domain envelope parameters is adjusted again, obtain adjusted high frequency band parameters.
8. the device of a band spread is characterized in that, comprising:
The vector acquiring unit is used to obtain the eigenvector of low band signal;
Taxon is used for reaching preset statistical property characteristic of division vector set to said low band signal is classified according to said eigenvector, obtains the statistical property classification results;
Estimate the unit, be used for according to said statistical property classification results, eigenvector and preset statistical property classification state-transition matrix, the high frequency band parameters that obtains estimating;
Adjustment unit is used for according to said statistical property classification results and preset aftertreatment smoothing factor collection the said high frequency band parameters of estimating being adjusted, and obtains adjusted high frequency band parameters;
The signal reconstruction unit is used for according to adjusted high frequency band parameters, rebuilds high-frequency band signals.
9. said according to Claim 8 device is characterized in that said adjustment unit comprises:
Aftertreatment factor query unit is used for according to said statistical property classification results, concentrates the corresponding aftertreatment factor parameter of the said statistical property classification results of inquiry at preset aftertreatment smoothing factor;
The adjustment subelement is used for the basis aftertreatment factor parameter corresponding with said statistical property classification results, adjusts the said high frequency band parameters of estimating, and obtains adjusted high frequency band parameters.
10. according to Claim 8 or 9 said devices, it is characterized in that the said unit of estimating comprises:
The matrix query unit is used for according to said statistical property classification results, the corresponding state-transition matrix of the said classification results of inquiry in preset statistical property classification state-transition matrix;
Estimate subelement, be used for the high frequency band parameters that obtains estimating according to said state-transition matrix and said eigenvector.
11., it is characterized in that said adjustment unit comprises according to the said device of claim 10:
First adjustment unit is used for according to smoothing factor in the corresponding frame of statistical property classification, the temporal envelope parameter in the high frequency band parameters that adjustment is estimated;
Second adjustment unit is used for according to the corresponding interframe smoothing factor of statistical property classification, the frequency domain envelope parameters in the high frequency band parameters that preliminary adjustment is estimated;
The 3rd adjustment unit is used for according to smoothing factor in the corresponding frame of statistical property classification preliminary adjusted high frequency band frequency domain envelope parameters being adjusted again, obtains adjusted high frequency band parameters.
CN2010102330332A 2010-07-16 2010-07-16 Method and device for spreading frequency bands Pending CN102339607A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN2010102330332A CN102339607A (en) 2010-07-16 2010-07-16 Method and device for spreading frequency bands
PCT/CN2011/075079 WO2011144130A1 (en) 2010-07-16 2011-06-01 Method and device for frequency band expanding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010102330332A CN102339607A (en) 2010-07-16 2010-07-16 Method and device for spreading frequency bands

Publications (1)

Publication Number Publication Date
CN102339607A true CN102339607A (en) 2012-02-01

Family

ID=44991196

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102330332A Pending CN102339607A (en) 2010-07-16 2010-07-16 Method and device for spreading frequency bands

Country Status (2)

Country Link
CN (1) CN102339607A (en)
WO (1) WO2011144130A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015043161A1 (en) * 2013-09-26 2015-04-02 华为技术有限公司 Method and device for bandwidth extension
CN109788922A (en) * 2016-10-14 2019-05-21 公立大学法人大阪府立大学 Swallow diagnostic device and program

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1520590A (en) * 2001-06-28 2004-08-11 �ʼҷ����ֵ������޹�˾ Wideband signal transmission system
CN1992533A (en) * 2005-12-26 2007-07-04 索尼株式会社 Signal encoding device and signal encoding method, signal decoding device and signal decoding method, program, and medium
CN101076853A (en) * 2004-12-10 2007-11-21 松下电器产业株式会社 Wide-band encoding device, wide-band lsp prediction device, band scalable encoding device, wide-band encoding method
WO2008101324A1 (en) * 2007-02-23 2008-08-28 Qnx Software Systems (Wavemakers), Inc. High-frequency bandwidth extension in the time domain
US20080221905A1 (en) * 2006-10-18 2008-09-11 Markus Schnell Encoding an Information Signal
CN101620854A (en) * 2008-06-30 2010-01-06 华为技术有限公司 Method, system and device for frequency band expansion
CN101751926A (en) * 2008-12-10 2010-06-23 华为技术有限公司 Signal coding and decoding method and device, and coding and decoding system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101304261B (en) * 2007-05-12 2011-11-09 华为技术有限公司 Method and apparatus for spreading frequency band
CN101770777B (en) * 2008-12-31 2012-04-25 华为技术有限公司 LPC (linear predictive coding) bandwidth expansion method, device and coding/decoding system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1520590A (en) * 2001-06-28 2004-08-11 �ʼҷ����ֵ������޹�˾ Wideband signal transmission system
CN101076853A (en) * 2004-12-10 2007-11-21 松下电器产业株式会社 Wide-band encoding device, wide-band lsp prediction device, band scalable encoding device, wide-band encoding method
CN1992533A (en) * 2005-12-26 2007-07-04 索尼株式会社 Signal encoding device and signal encoding method, signal decoding device and signal decoding method, program, and medium
US20080221905A1 (en) * 2006-10-18 2008-09-11 Markus Schnell Encoding an Information Signal
WO2008101324A1 (en) * 2007-02-23 2008-08-28 Qnx Software Systems (Wavemakers), Inc. High-frequency bandwidth extension in the time domain
CN101620854A (en) * 2008-06-30 2010-01-06 华为技术有限公司 Method, system and device for frequency band expansion
CN101751926A (en) * 2008-12-10 2010-06-23 华为技术有限公司 Signal coding and decoding method and device, and coding and decoding system

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015043161A1 (en) * 2013-09-26 2015-04-02 华为技术有限公司 Method and device for bandwidth extension
CN104517610A (en) * 2013-09-26 2015-04-15 华为技术有限公司 Band spreading method and apparatus
US9666201B2 (en) 2013-09-26 2017-05-30 Huawei Technologies Co., Ltd. Bandwidth extension method and apparatus using high frequency excitation signal and high frequency energy
CN104517610B (en) * 2013-09-26 2018-03-06 华为技术有限公司 The method and device of bandspreading
US10186272B2 (en) 2013-09-26 2019-01-22 Huawei Technologies Co., Ltd. Bandwidth extension with line spectral frequency parameters
CN109788922A (en) * 2016-10-14 2019-05-21 公立大学法人大阪府立大学 Swallow diagnostic device and program
US11246526B2 (en) 2016-10-14 2022-02-15 University Public Corporation Osaka Swallowing diagnosis apparatus and storage medium

Also Published As

Publication number Publication date
WO2011144130A1 (en) 2011-11-24

Similar Documents

Publication Publication Date Title
US11580998B2 (en) Method and device for encoding a high frequency signal, and method and device for decoding a high frequency signal
EP3910630B1 (en) Transient speech or audio signal encoding method and device, decoding method and device, processing system and computer-readable storage medium
CN102177426B (en) Multi-resolution switched audio encoding/decoding scheme
AU2007206167B2 (en) Apparatus and method for encoding and decoding signal
CN108711431B (en) Method and apparatus for concealing frame errors
EP2791937B1 (en) Generation of a high band extension of a bandwidth extended audio signal
EP2770503B1 (en) Method and apparatus for concealing frame errors and method and apparatus for audio decoding
US20070219785A1 (en) Speech post-processing using MDCT coefficients
CN106128473A (en) For producing equipment and the method for bandwidth expansion signal
EP2774145B1 (en) Improving non-speech content for low rate celp decoder
CN105706166B (en) Audio decoder apparatus and method for decoding a bitstream
Bessette et al. Universal speech/audio coding using hybrid ACELP/TCX techniques
CN104025189A (en) Method for encoding voice signal, method for decoding voice signal, and apparatus using same
CN103493129B (en) For using Transient detection and quality results by the apparatus and method of the code segment of audio signal
CN104978970A (en) Noise signal processing and generation method, encoder/decoder and encoding/decoding system
EP2951816B1 (en) Noise filling without side information for celp-like coders
CN105304090A (en) Apparatus and method for encoding and decoding an audio signal using an aligned look-ahead portion
CN106233112A (en) Coding method and equipment and signal decoding method and equipment
CN102339607A (en) Method and device for spreading frequency bands
CN101622668B (en) Methods and arrangements in a telecommunications network
Zhao et al. A CNN postprocessor to enhance coded speech
Park et al. Artificial bandwidth extension of narrowband speech signals for the improvement of perceptual speech communication quality
Ali et al. A long term harmonic plus noise model for narrow-band speech coding at very low bit-rates
CN1327408C (en) A low bit-rate speech encoder
Choi et al. Efficient harmonic-CELP based hybrid coding of speech at low bit rates.

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20120201