CN101436407A - Method for encoding and decoding audio - Google Patents

Method for encoding and decoding audio Download PDF

Info

Publication number
CN101436407A
CN101436407A CNA200810232760XA CN200810232760A CN101436407A CN 101436407 A CN101436407 A CN 101436407A CN A200810232760X A CNA200810232760X A CN A200810232760XA CN 200810232760 A CN200810232760 A CN 200810232760A CN 101436407 A CN101436407 A CN 101436407A
Authority
CN
China
Prior art keywords
frequency
residual error
high frequency
domain
residual
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA200810232760XA
Other languages
Chinese (zh)
Other versions
CN101436407B (en
Inventor
马鸿飞
郭小川
熊静
徐雅俊
吴礼仲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University filed Critical Xidian University
Priority to CN200810232760XA priority Critical patent/CN101436407B/en
Publication of CN101436407A publication Critical patent/CN101436407A/en
Application granted granted Critical
Publication of CN101436407B publication Critical patent/CN101436407B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention discloses a method for coding and decoding an audio, which mainly solves the problems that the prior method for coding and decoding the audio has low compression ratio and poor reconstruction audio quality. The method comprises the following steps: adopting a time frequency translation and frequency domain filtering method or a time domain filtering and time frequency translation method to analyze an audio signal so as to obtain a frequency domain residual error signal; dividing the frequency domain residual error signal into a low frequency residual error signal and a high frequency residual error signal, and performing direct coding on the low frequency residual error signal and performing parameter coding on the high frequency residual error signal respectively; reconstructing the high frequency residual error signal through the decoded low frequency residual error signal and the decoded high frequency residual error; recombining the decoded low frequency residual error signal and the reconstructed high frequency residual error signal to obtain a reconstructed frequency domain residual error signal; and finally adopting a frequency domain inverse filtering and time frequency inverse transform method or a time frequency inverse transform and time domain inverse filtering method to obtain a reconstructed audio signal. The method eliminates the redundancy in the frequency domain residual error signal and improves the compression ratio of audio coding, the channel utilization rate and the audio transmission quality, and is applied to multimedia communication and consumer electronics.

Description

Audio encoding and decoding method
Technical field
The present invention relates to the audio encoding and decoding technique field, be specifically related to a kind of signal reconfiguring method, be used for multimedia communication and field of consumer electronics based on the reconstruct of high frequency residual error.
Background technology
In field of multimedia communication, comprise that the audio frequency of voice, especially wideband audio become one of main communication service gradually.But sound signal frequency band broad, amount of coded data are bigger, this give real-time Transmission of sound signal and effectively storage bring very big difficulty.Though audio coding algorithms such as MP3, AAC, EAAC and EAAC+ preferably sound signal carry out compressed encoding, satisfy the requirement of certain application, but also can't be competent at business such as present mobile multimedia communication that is developing and various mobile platforms preferably.So be necessary to study the audio coding algorithm of the higher and better quality of efficient.In recent years, in the research field of audio compression coding, the processing of sound signal radio-frequency component, compression and reconstruct become one of gordian technique of correlative study, and how utilizing the signal of low frequency frequency range to rebuild high-frequency signal is important research contents.
The method that prior art utilizes the frequency domain low frequency to carry out the frequency domain high-frequency reconstruction mainly contains two kinds, simply is described below:
Prior art 1
The low frequency signal of audio frequency or voice is handled by a digital filter bank, obtained one group of low frequency sub-band signal; Again this group low frequency sub-band is carried out duplicating of high-frequency signal as a monoblock signal.The clone method of whole high-frequency band signal is that high-frequency signal is divided into some frequency ranges from low to high according to frequency, and every section bandwidth with above-mentioned monoblock low frequency signal is roughly the same; Then with monoblock low frequency sub-band group continuous compound rate each section to high-frequency band.Like this, monoblock low frequency sub-band group can periodically be used several times in high-frequency band, up to the whole high-frequency band that needs to recover all be replicated finish till.Concrete mode has two kinds: the one, monoblock low frequency sub-band group is moved to corresponding high-frequency band, and the 2nd, folding earlier monoblock low frequency sub-band group, promptly put upside down subband and put in order, again monoblock low frequency sub-band group is moved to corresponding high-frequency band; In reproduction process, this dual mode may intersect use.Like this, monoblock low frequency sub-band group can periodically be used, up to the whole high-frequency band of need recovering all be replicated finish till.
Prior art 2
Low frequency signal is handled by the low-pass filter group, obtained one group of low frequency sub-band signal.Here no longer as prior art 1, the low frequency sub-band group of choosing to be done as a whole, whole section HFS that the ground continuous compound rate need be recovered, but utilize subband in the low frequency sub-band group recovers the high-frequency sub-band of some Discrete Distribution respectively accordingly.If at HFS profuse harmonic component is arranged, then the frequency of its harmonic component may be exactly the integral multiple of its corresponding fundamental frequency.Under the guidance of this thought, prior art 2 proposes, if the sub-band serial number of some subband of HFS is natural integral multiples such as 2,3,4,5, it is the corresponding relation that has multiple between some high-frequency sub-band and the low frequency sub-band, there is abundant harmonic components probably in these subbands, need emphasis to recover.Like this, the process of recovering the high-frequency sub-band of Discrete Distribution with continuous low frequency sub-band group has just been finished.At last,, also to choose the close with it low frequency sub-band of waveform, the high-frequency sub-band of omitting be recovered, thereby finished duplicating of all high-frequency sub-band for the high-frequency sub-band that this method is omitted.
No matter low frequency sub-band is carried out periodically shift copy or folding duplicating as a monoblock in above-mentioned two kinds of prior aries by prior art 1, still carrying out frequency multiplication by prior art 2 duplicates, all be mechanically to recover harmonic wave, do not consider the diversity and the variability of audio speech signal, be to extract successively according to sub-band serial number to duplicate when duplicating in addition, because the waveform of low frequency sub-band and high-frequency sub-band is original just different, so the high-frequency sub-band that is replicated is compared with original high-frequency sub-band, to there be big shape differences or peak value difference, therefore the high-frequency signal accuracy of rebuilding is not too high, influences the quality of reconstructed audio signal.
Summary of the invention
One of purpose of the present invention provides a kind of audio encoding and decoding method based on frequency domain filtering and the reconstruct of high frequency residual error, two of purpose provides a kind of audio encoding and decoding method based on time-domain filtering and the reconstruct of high frequency residual error, to avoid the deficiency of above-mentioned prior art, improve the accuracy of reconstruction high frequency residual error and the ratio of compression of audio coding decoding, guarantee the reconstructed audio signal quality.
Technical scheme of the present invention is achieved in that
Technical scheme one:
Audio encoding and decoding method based on frequency domain filtering and the reconstruct of high frequency residual error comprises the steps:
1) at coding side, the original time-domain signal of audio frequency is handled through time-frequency conversion, obtains original frequency-region signal;
2) original frequency-region signal obtains original frequency domain residual error signal through frequency domain perception Filtering Processing;
3) original frequency domain residual error signal obtains low frequency residual signals coding and high frequency residual error parameter coding, and outputs to transmission channel or storage medium through frequency-domain residual analysis and encoding process.
4) in decoding end, receive from the low frequency residual signals coding and the high frequency residual error parameter coding that output to transmission channel or storage medium, and it is carried out frequency-domain residual decoding and reconstruction processing, obtain the reconstructed frequency domain residual signals;
5) the reconstructed frequency domain residual signals is handled through frequency domain perception liftering, obtains the reconstructed frequency domain signal;
6) the reconstruct frequency-region signal is carried out the time-frequency inverse transformation and handle, obtain the audio reconstruction time-domain signal.
Technical scheme two:
Audio encoding and decoding method based on time-domain filtering and the reconstruct of high frequency residual error comprises the steps:
T1) at coding side, the original time-domain signal of audio frequency obtains original time domain residual signals through time domain perception Filtering Processing;
T2) original time domain residual signals is handled through time-frequency conversion, obtains original frequency domain residual error signal;
T3) original frequency domain residual error signal obtains low frequency residual signals coding and high frequency residual error parameter coding, and outputs to transmission channel or storage medium through frequency-domain residual analysis and encoding process.
T4) in decoding end, receive from the low frequency residual signals coding and the high frequency residual error parameter coding that output to transmission channel or storage medium, and it is carried out frequency-domain residual decoding and reconstruction processing, obtain the reconstructed frequency domain residual signals;
T5) the reconstruct frequency domain residual error signal is carried out the time-frequency inversion process, obtain reconstruct time domain residual signals;
T6) reconstruct time domain residual signals is carried out the time domain liftering and handle, obtain the audio reconstruction time-domain signal.
The audio encoding and decoding method of above-mentioned two kinds of schemes, wherein said frequency-domain residual analysis and encoding process comprise the steps:
(A1) according to equiband frequency band or critical band or sound interval frequency band band division methods frequently, earlier original frequency domain residual error signal is carried out frequency band division, the code rate of setting according to audio coder is selected a frequency band division end points then, original frequency domain residual error signal is divided into original low frequency residual signals and original high frequency residual error two parts, makes these two parts respectively have several frequency bands;
(A2) original low frequency residual signals is encoded, obtain the output of low frequency residual signals coding; Again the low frequency residual signals is coded in coding side and carries out local decode, obtain the decoded low frequency residual signals;
(A3) according to the similarity or the correlativity of decoded low frequency residual signals and original high frequency residual error, high frequency residual error parameter is analyzed, promptly select a frequency band matching strategy that decoded low frequency residual signals and original high frequency residual error are carried out the frequency band coupling, and the high frequency residual error parameter of calculating optimum coupling, obtain comprising the original high frequency residual error parameter of frequency band division method, optimum matching frequency band position, energy matching attribute, sound channel coupling parameter and interframe spreading parameter;
(A4) original high frequency residual error parameter is encoded, obtain the output of high frequency residual error parameter coding.
The audio encoding and decoding method of above-mentioned two kinds of schemes, wherein said frequency-domain residual decoding and reconstruction processing comprise the steps:
(C1) the low frequency residual signals coding that receives is decoded, obtain the decoded low frequency residual signals;
(C2) the high frequency residual error parameter coding that receives is decoded, high frequency residual error parameter obtains decoding;
(C3) utilize optimum matching frequency band position and energy matching attribute in decoded low frequency residual signals, the decoding high frequency residual error parameter, reconstructed high frequency residual error frequency band R H(q 0, l);
(C4) frequency band division method, sound channel coupling parameter and the interframe spreading parameter in the utilization decoding high frequency residual error parameter makes up the reconstructed high frequency residual error frequency band that obtains, and obtains reconstructed high frequency residual error;
(C5) the frequency-domain residual height frequency range boundary frequency in the utilization decoding high frequency residual error parameter makes up decoded low frequency residual signals and reconstructed high frequency residual error, obtains the reconstructed frequency domain residual signals.
The audio encoding and decoding method of above-mentioned two kinds of schemes, wherein said frequency band matching strategy comprises: the high frequency residual error frequency band mates in low frequency residual signals zone, the high frequency residual error frequency band mates in low frequency residual signals zone and extended area thereof, carry out the high frequency residual error coupling and carry out the high frequency residual error coupling by the interframe expansion by sound channel coupling.
The present invention is owing to taken into full account the albefaction characteristic of frequency domain residual error signal frequency spectrum and correlativity and the similarity between high frequency residual error and the low frequency residual signals, by selecting the matching strategy of high frequency residual error and low frequency residual signals, the reconstruction parameter of estimation high frequency residual error, and with high frequency residual error parameter reconstructed high frequency residual error accurately, and based on this, realize the high efficient coding and the decoding of sound signal, thereby ratio of compression or the code efficiency of audio sources coding have been improved, save the required transmission bandwidth of transmitting audio signal and saved the required memory capacity of stored audio signal, improved the audio compression coding quality simultaneously.
Description of drawings
Fig. 1 is based on the audio encoding and decoding method process flow diagram of frequency domain filtering;
Fig. 2 is based on the audio encoding and decoding method process flow diagram of time-domain filtering;
Fig. 3 frequency domain residual error signal is analyzed and the coding method process flow diagram;
Decoding of Fig. 4 frequency domain residual error signal and reconstructing method process flow diagram;
Fig. 5 low-and high-frequency frequency domain residual error signal and high frequency residual error reconstruct synoptic diagram;
Fig. 6 high frequency residual error frequency band carries out the matching strategy synoptic diagram in low frequency residual signals zone;
Fig. 7 high frequency residual error frequency band in low frequency residual signals zone and extended area carry out the matching strategy synoptic diagram;
Fig. 8 carries out high frequency residual error matching strategy synoptic diagram by the sound channel coupling;
Fig. 9 carries out high frequency residual error matching strategy synoptic diagram by the interframe expansion.
Embodiment
Embodiment one
Present embodiment provides a kind of method of audio coding decoding, and this method is based on the audio coding method of frequency domain filtering and the reconstruct of high frequency residual error.High frequency residual error is the HFS of frequency domain residual error signal, and the analysis of high frequency residual error and reconstruct are in order effectively to compress the data volume of frequency domain residual error signal, to improve audio-frequency signal coding and transfer efficiency.The analysis of high frequency residual error and reconstruct are according to certain rule frequency domain residual error signal to be divided into low frequency residual signals and high frequency residual error two parts, utilize the correlativity or the similarity of high frequency residual error and low frequency residual signals, extract high frequency residual error reconstruct parameters needed, abandon high frequency residual error, come reconstructed high frequency residual error with high frequency residual error parameter then, thus the whole frequency domain residual error signal of reconstruct.
With reference to Fig. 1, the audio coding decoding step of present embodiment is as follows:
Step 101, at coding side, the original time-domain signal of audio frequency is handled through time-frequency conversion, obtains original frequency-region signal.
The original time-domain signal of audio frequency is the morbid sound of the various voice signals that comprise that voice signal, sound signal or anyone ear can be heard; The frequency range of sound signal mainly at 0Hz between the 20kHz, the sample frequency of sound signal is 96kHz, 48kHz, 44.1kHz, 32kHz, 22.05kHz, 16kHz, 11.025kHz and 8kH; The coding of sound signal normally is unit with the audio frame, the size of audio frame commonly used according to practical application generally within 50 milliseconds.Spatial transform adopts but is not limited to revise discrete cosine transform, correction lapped transform and Fast Fourier Transform (FFT) method and carries out conversion.
Step 102, original frequency-region signal obtains original frequency domain residual error signal through frequency domain perception Filtering Processing.
The frequency domain perceptual filter is the frequency domain filter of reflection human hearing characteristic, and it carries out frequency domain filtering to the frequency-region signal that comes step 101, obtain on perception meaning albefaction frequency domain residual error signal, if use H M(f) transition function of expression frequency domain perceptual filter is with the perception curve of M (f) expression by the perceptual parameters sign, then H M(f) can be expressed as H M ( f ) = 1 M ( f ) , Wherein f represents frequency, and unit is Hz.
Step 103, frequency-domain residual analysis and coding.
Shown in 3, the concrete steps of frequency-domain residual analysis and coding comprise:
Step 301, frequency band division and low-and high-frequency residual signals are cut apart, promptly according to methods such as equiband frequency band or critical band or sound interval frequency bands, earlier original frequency domain residual error signal is carried out frequency band division, select the end points of one of them divided band that original frequency domain residual error signal is divided into original low frequency residual signals and original high frequency residual error two parts according to the code rate of audio coder then, make these two parts respectively have several frequency bands.
As shown in Figure 5, complete frequency domain residual error signal is represented with R, for effectively to frequency domain residual error signal analyze, coding and reconstruct, frequency domain residual error signal is carried out frequency band division according to methods such as equiband frequency band or critical band or sound interval frequency bands, having certain attribute, be divided in the identical frequency band as the adjacent frequency components of auditory properties and go.If f Ci, f LiAnd f HiBe respectively centre frequency, low side edge frequency and the high-end edge frequency of i frequency band, b iBe band bandwidth, select the low side edge frequency f of certain frequency band LiOr high-end edge frequency f HiBe boundary frequency f D, frequency domain residual error signal is divided into low frequency residual signals R in frequency domain LWith high frequency residual error R HTwo parts are 0 to f DBetween just have N continuous LIndividual low frequency residual signals frequency band is at f DWith f ZBetween just have N continuous HIndividual high frequency residual error frequency band is higher than frequency f ZThe high frequency residual error frequency band in signal be zero.If f represents frequency, unit is H Z, f SThe expression sample frequency, normalized frequency is used f ^ = f / f S Expression; If use f BThe bandwidth of expression time-domain audio signal, then f ^ B = f B / f S = 0.5 .
Step 302 is encoded to original low frequency residual signals, obtains the output of low frequency residual signals coding.The coding of original low frequency residual signals adopts the various genuine coding methods of losing, and as linear or non-linear scalar quantization coding, vector quantization coding, perhaps adopts various undistorted coding methods simultaneously, as Huffman coding and arithmetic coding;
Step 303 is coded in coding side to the low frequency residual signals and carries out local decode, obtains the coding of decoded low frequency residual signals and low frequency residual signals;
Step 304, similarity or correlativity according to decoded low frequency residual signals and original high frequency residual error, high frequency residual error parameter is analyzed, promptly select a frequency band matching strategy that decoded low frequency residual signals and original high frequency residual error are carried out the frequency band coupling, and the high frequency residual error parameter of calculating optimum coupling, obtain original high frequency residual error parameter.
As shown in Figure 5, frequency band coupling is exactly to seek and the immediate frequency band of high frequency residual error frequency bandwidth characteristics in the low frequency residual signals, and purpose is in decoding end reconstructed high frequency residual error in high quality.Frequency band coupling adopts different frequency band matching strategies, and so-called frequency band matching strategy is meant how at low frequency residual signals R LIn select one section frequency domain residual error signal, and come reconstructed high frequency residual error R with it HIn have the method for the high frequency residual error of same frequency band width.In Fig. 5, because frequency f ZAbove residual signals makes zero, so there is no need this part residual error to be reconstructed again.
Described frequency band matching strategy includes but not limited to following several:
(P1) the high frequency residual error frequency band mates in low frequency residual signals zone
As shown in Figure 6, will be in f DWith f ZBetween N HIndividual high frequency residual error frequency band is being in 0 to f respectively independently DBetween low frequency residual signals region R LMate, seek the optimum matching frequency band of error minimum or correlativity maximum.At this moment, frequency band division method, optimum matching frequency band position and energy matching attribute etc. are high frequency residual error reconstruction parameter.
(P2) the high frequency residual error frequency band mates in low frequency residual signals zone and extended area thereof
As shown in Figure 7, adopt above-mentioned frequency band matching strategy (P1), will be in f earlier DWith f ZBetween the minimum high frequency residual error frequency band R of frequency H1Be in 0 to f DBetween low frequency residual signals region R LMate, seek the optimum matching frequency band of error minimum or correlativity maximum, and come the minimum high frequency residual error frequency band R of reconfiguration frequency with this optimum matching frequency band H1Again that the frequency of this reconstruct is minimum high frequency residual error frequency band R H1Add original low frequency residual signals R L, form a new low frequency residual signals R who forms L1=R L+ R H1Then,, adopt above-mentioned frequency band matching strategy (P1), continue the high frequency residual error frequency band of reconstruct higher frequency based on the low frequency residual signals of this new composition; The rest may be inferred, until finishing whole f DTo f ZBetween the coupling and the reconstruct of all high frequency residual error frequency bands.At this moment, frequency band division method, optimum matching frequency band position, energy matching attribute and low frequency residual error area extension parameter are high frequency residual error reconstruction parameter.
(P3) carry out the high frequency residual error coupling by the sound channel coupling
As shown in Figure 8, establishing audio sources is the multichannel audio information source with the individual main sound channel of C (C 〉=2), just has 5 main sound channels and a low stress to imitate sound channel such as 5.1 channel audios, at this moment has the low frequency residual signals of C sound channel.Because the sound signal of each sound channel has big correlativity, so the frequency domain residual error signal of each sound channel also has big correlativity, therefore any one sound channel not only can utilize the low frequency residual signals of its place sound channel to carry out high frequency residual error reconstruct, but also can utilize the low frequency residual signals of other sound channel to carry out high frequency residual error reconstruct; Can obtain more match selection like this, improve the high frequency residual error reconstruction quality.With the 1st sound channel is that example is given explanation, at first can adopt above-mentioned frequency band matching strategy (P1) and (P2), utilizes being in of place sound channel 0 to arrive f DBetween low frequency residual signals R LCoupling and reconstruct are in f DWith f ZBetween high frequency residual error.That in addition, can also utilize other sound channel is in 0 to f DBetween low frequency residual signals R LBe in f in coupling and reconstruct the 1st sound channel DWith f ZBetween high frequency residual error.The same with the 1st sound channel, the frequency domain residual error signal of all C sound channels is many can to adopt identical method to handle.
(P4) carry out the high frequency residual error coupling by the interframe expansion
Above-mentioned frequency band matching strategy (P1), (P2) and (P3) matching strategy of described high frequency residual error be the high frequency residual error of utilizing the low frequency residual signals reconstruct present frame of present frame.Because have very big correlativity between the consecutive frame sound signal usually, so the residual error of consecutive frame also has bigger correlativity.Therefore, the high frequency residual error of current audio frame not only can be reconstructed with the low frequency residual signals of present frame, also can be reconstructed with the reconstruct residual signals that comprises reconstruct low frequency residual signals and reconstructed high frequency residual error of some frames before the present frame.As shown in Figure 9, adopt low frequency residual signals that above-mentioned frequency band matching strategy (P1), (P2) and method (P3) utilize the j frame that the high frequency residual error of j frame is reconstructed, or adopt low frequency residual signals that above-mentioned frequency band matching strategy (P1), (P2) and method (P3) utilize the j-1 frame that the high frequency residual error of j frame is reconstructed, can also adopt reconstructed high frequency residual error that above-mentioned frequency band matching strategy (P1), (P2) and method (P3) utilize the j-1 frame that the high frequency residual error of j frame is reconstructed.In addition, these methods can also expand to the j-2 frame, the j-3 frame goes.
Above-mentioned various frequency band matching strategy all need be sought the optimum matching frequency band between low frequency residual signals and high frequency residual error, said here the best can be expressed with error or distortion minimum, also can represent with the correlativity maximum.The step of the high frequency residual error parameter of concrete calculating optimum coupling comprises:
(B1) calculate normalization low frequency residual error band signal
Figure A200810232760D00111
With normalization high frequency residual error band signal
Figure A200810232760D00112
If the frequency span of high frequency residual error of mating and low frequency residual signals is L frequency, R L(p, l), l=0,1 ... L-1 is low frequency residual signals frequency band, R H(q, l), l=0,1 ... L-1 is the high frequency residual error frequency band, and wherein p and q represent R respectively L(p, l) and R H(q, the l) reference position of place frequency band, p ∈ [0, fD], q ∈ [f D, f Z], calculate With Wherein,
Figure A200810232760D00123
Expression R L(p, l), l=0,1 ... L-1 is by the maximal value R of himself absolute value L max(p) carry out normalization low frequency residual signals that normalization obtains, Expression R H(p, l), l=0,1 ... L-1 is by the maximal value R of himself absolute value H max(q) carry out the normalization high frequency residual error that normalization obtains.
(B2) calculate coupling distortion measure d (p, q) or coupling related function r (p, q), if carry out The matching analysis with the distortion minimum, then Pi Pei distortion measure is expressed as:
d ( p , q ) = w ( p , q ) · Σ l = 0 l = L - 1 w ( | R ^ H ( q , l ) | ) · ( R ^ L ( p , l ) - R ^ H ( q , l ) ) 2 - - - ( 1 )
Wherein w (p q) is the frequency influence factor,
Figure A200810232760D00126
It is residual error amplitude factor of influence.If w (p, q)=1.0, w ( | R ^ H ( q , l ) | ) = 1.0 , Formula (14) can be reduced to so:
d ( p , q ) = Σ l = 0 l = L - 1 ( R ^ L ( p , l ) - R ^ H ( q , l ) ) 2 - - - ( 2 )
If carry out The matching analysis with the correlativity maximum, normalization low frequency residual signals
Figure A200810232760D00129
With the normalization high frequency residual error
Figure A200810232760D001210
Related function be expressed as:
r ( p , q ) = w ( p , q ) · Σ l = 0 l = L - 1 w ( | R ^ H ( q , l ) | ) · ( R ^ L ( p , l ) · R ^ H ( q , l ) ) - - - ( 3 )
Equally, w (p, q) the expression frequency influence factor,
Figure A200810232760D001212
Expression residual error amplitude factor of influence.If w (p, q)=1.0, w ( | R ^ H ( q , l ) | ) = 1.0 , Formula (16) can be reduced to so:
r ( p , q ) = Σ l = 0 l = N - 1 R ^ L ( p , l ) R ^ H ( q , l ) - - - ( 4 )
(B3) determine best frequency band matched position p 0And q 0, this optimum matching is exactly distortion measure d (p, minimum value q) or related function r (p, the optimum value p of pairing p of maximal value q) and q 0And q 0, they have determined the frequency band that high frequency residual error and low frequency residual error are mated.
(B4) utilize following formula calculating optimum energy matching attribute:
G R ( p 0 , q 0 ) = R H max ( q 0 ) R Lp max ( p 0 ) - - - ( 5 )
Like this, the resulting frequency band division result of step 304, optimum matching frequency band position, frequency-domain residual height frequency range boundary frequency f D, energy matching attribute, sound channel coupling parameter and interframe spreading parameter, be exactly original high frequency residual error parameter.
Step 305, high frequency residual error parameter coding.
The original high frequency residual error parameter that step 304 produces is encoded, obtain the output of high frequency residual error coding.The coding of original high frequency residual error parameter adopts the various genuine coding methods of losing, and as linear or non-linear scalar quantization coding, vector quantization coding, perhaps adopts various undistorted coding methods simultaneously, as Huffman coding and arithmetic coding.
Step 104, frequency-domain residual decoding and reconstruct.
In decoding end, low frequency residual signals coding that receives and high frequency residual error parameter coding obtain the reconstructed frequency domain residual signals, as shown in Figure 4 through frequency-domain residual decoding and reconstruction processing.Concrete steps comprise:
Step 401 is decoded to the low frequency residual signals coding that receives, and obtains the decoded low frequency residual signals.
Step 402 is decoded to the high frequency residual error parameter coding that receives, and the high frequency residual error parameter that obtains decoding, this decoding high frequency residual error parameter comprise frequency band division result, frequency-domain residual height frequency range boundary frequency f D, optimum matching frequency band position, energy matching attribute, sound channel coupling parameter and interframe spreading parameter.
Step 403, high frequency residual error reconstruct.
According to decoding high frequency residual error parameter, utilize the decoded low frequency residual signals to duplicate and reconstructed high frequency residual error, concrete steps comprise:
(D1) the decoded low frequency residual signals that obtains according to step 401 and 402, optimum matching frequency band position and energy matching attribute, carry out high frequency residual error with following formula and duplicate with energy flux matched:
R H(q 0,l)=G R(p 0,q 0)·R L(p 0,l),l=0,1,...L-1 (6)
Obtain reconstructed high frequency residual error frequency band R H(q 0, l);
(D2) the frequency band division result who obtains according to step 401 and 402, sound channel coupling parameter and interframe spreading parameter make up all reconstructed high frequency residual error frequency bands, obtain reconstructed high frequency residual error.
Step 404, the frequency-domain residual that obtains according to decoding height frequency range boundary frequency f D, decoded low frequency residual signals and reconstructed high frequency residual error are made up, obtain the reconstructed frequency domain residual signals.
Step 105, the reconstructed frequency domain residual signals is handled through frequency domain inverse filtering, obtains the reconstructed frequency domain signal; If use H R(f) expression frequency domain perception inverse filter, then H R(f) be expressed as H R ( f ) = 1 H M ( f ) = M ( f ) , Wherein f represents frequency, and unit is Hz.
Step 106 is carried out the time-frequency inverse transformation to the reconstruct frequency-region signal and is handled, and obtains the audio reconstruction time-domain signal.Corresponding with time-frequency conversion, the time domain inverse transformation adopts inverse modified discrete cosine transform, the overlapping inverse transformation of inverse modified or inverse FFT method.
Embodiment two
Present embodiment provides a kind of audio encoding and decoding method based on time-domain filtering and the reconstruct of high frequency residual error.
Referring to Fig. 2, this method step is as follows:
Step 201, at coding side, the original time-domain signal of audio frequency obtains original time domain residual signals through time domain perception Filtering Processing.Wherein, the time domain perceptual filter is the time domain filtering of reflection human hearing characteristic, and it carries out time-domain filtering to the original time-domain signal of audio frequency, obtain on perception meaning albefaction the time domain residual signals; The transition function H of time domain perceptual filter M(z) expression, the time domain perceptual filter adopts but is not limited to linear prediction filter.
Step 202, original time domain residual signals is handled through time-frequency conversion, obtains original frequency domain residual error signal, and wherein, spatial transform adopts but is not limited to revise discrete cosine transform, correction lapped transform and Fast Fourier Transform (FFT) method and carries out conversion.
Step 203, frequency-domain residual analysis and coding.
Shown in 3, the concrete steps of frequency-domain residual analysis and coding comprise:
Step 301, frequency band division and low-and high-frequency residual signals are cut apart, promptly according to methods such as equiband frequency band or critical band or sound interval frequency bands, earlier original frequency domain residual error signal is carried out frequency band division, select the end points of one of them divided band that original frequency domain residual error signal is divided into original low frequency residual signals and original high frequency residual error two parts according to the code rate of audio coder then, make these two parts respectively have several frequency bands.
As shown in Figure 5, complete frequency domain residual error signal is represented with R, for effectively to frequency domain residual error signal analyze, coding and reconstruct, frequency domain residual error signal is carried out frequency band division according to methods such as equiband frequency band or critical band or sound interval frequency bands, having certain attribute, be divided in the identical frequency band as the adjacent frequency components of auditory properties and go.If f Ci, f LiAnd f HiBe respectively centre frequency, low side edge frequency and the high-end edge frequency of i frequency band, b iBe band bandwidth, select the low side edge frequency f of certain frequency band LiOr high-end edge frequency f HiBe boundary frequency f D, frequency domain residual error signal is divided into low frequency residual signals R in frequency domain LWith high frequency residual error R HTwo parts are 0 to f DBetween just have N continuous LIndividual low frequency residual signals frequency band is at f DWith f ZBetween just have N continuous HIndividual high frequency residual error frequency band is higher than frequency f ZThe high frequency residual error frequency band in signal be zero.If f represents frequency, unit is H Z, f SThe expression sample frequency, normalized frequency is used f ^ = f / f S Expression; If use f BThe bandwidth of expression time-domain audio signal, then f ^ B = f B / f S = 0.5 .
Step 302 is encoded to original low frequency residual signals, obtains the output of low frequency residual signals coding.The coding of original low frequency residual signals adopts the various genuine coding methods of losing, and as linear or non-linear scalar quantization coding, vector quantization coding, perhaps adopts various undistorted coding methods simultaneously, as Huffman coding and arithmetic coding;
Step 303 is coded in coding side to the low frequency residual signals and carries out local decode, obtains the coding of decoded low frequency residual signals and low frequency residual signals;
Step 304, similarity or correlativity according to decoded low frequency residual signals and original high frequency residual error, high frequency residual error parameter is analyzed, promptly select a frequency band matching strategy that decoded low frequency residual signals and original high frequency residual error are carried out the frequency band coupling, and the high frequency residual error parameter of calculating optimum coupling, obtain original high frequency residual error parameter.
As shown in Figure 5, frequency band coupling is exactly to seek and the immediate frequency band of high frequency residual error frequency bandwidth characteristics in the low frequency residual signals, and purpose is in decoding end reconstructed high frequency residual error in high quality.Frequency band coupling adopts different frequency band matching strategies, and so-called frequency band matching strategy is meant how at low frequency residual signals R LIn select one section frequency domain residual error signal, and come reconstructed high frequency residual error R with it HIn have the method for the high frequency residual error of same frequency band width.In Fig. 5, because frequency f ZAbove residual signals makes zero, so there is no need this part residual error to be reconstructed again.
Described frequency band matching strategy includes but not limited to following several:
(P1) the high frequency residual error frequency band mates in low frequency residual signals zone
As shown in Figure 6, will be in f DWith f ZBetween N HIndividual high frequency residual error frequency band is being in 0 to f respectively independently DBetween low frequency residual signals region R LMate, seek the optimum matching frequency band of error minimum or correlativity maximum.At this moment, frequency band division method, optimum matching frequency band position and energy matching attribute etc. are high frequency residual error reconstruction parameter.
(P2) the high frequency residual error frequency band mates in low frequency residual signals zone and extended area thereof
As shown in Figure 7, adopt above-mentioned frequency band matching strategy (P1), will be in f earlier DWith f ZBetween the minimum high frequency residual error frequency band R of frequency H1Be in 0 to f DBetween low frequency residual signals region R LMate, seek the optimum matching frequency band of error minimum or correlativity maximum, and come the minimum high frequency residual error frequency band R of reconfiguration frequency with this optimum matching frequency band H1Again that the frequency of this reconstruct is minimum high frequency residual error frequency band R H1Add original low frequency residual signals R L, form a new low frequency residual signals R who forms L1=R L+ R H1Then,, adopt above-mentioned frequency band matching strategy (P1), continue the high frequency residual error frequency band of reconstruct higher frequency based on the low frequency residual signals of this new composition; The rest may be inferred, until finishing whole f DTo f ZBetween the coupling and the reconstruct of all high frequency residual error frequency bands.At this moment, frequency band division method, optimum matching frequency band position, energy matching attribute and low frequency residual error area extension parameter are high frequency residual error reconstruction parameter.
(P3) carry out the high frequency residual error coupling by the sound channel coupling
As shown in Figure 8, establishing audio sources is the multichannel audio information source with the individual main sound channel of C (C 〉=2), just has 5 main sound channels and a low stress to imitate sound channel such as 5.1 channel audios, at this moment has the low frequency residual signals of C sound channel.Because the sound signal of each sound channel has big correlativity, so the frequency domain residual error signal of each sound channel also has big correlativity, therefore any one sound channel not only can utilize the low frequency residual signals of its place sound channel to carry out high frequency residual error reconstruct, but also can utilize the low frequency residual signals of other sound channel to carry out high frequency residual error reconstruct; Can obtain more match selection like this, improve the high frequency residual error reconstruction quality.With the 1st sound channel is that example is given explanation, at first can adopt above-mentioned frequency band matching strategy (P1) and (P2), utilizes being in of place sound channel 0 to arrive f DBetween low frequency residual signals R LCoupling and reconstruct are in f DWith f ZBetween high frequency residual error.That in addition, can also utilize other sound channel is in 0 to f DBetween low frequency residual signals R LBe in f in coupling and reconstruct the 1st sound channel DWith f ZBetween high frequency residual error.The same with the 1st sound channel, the frequency domain residual error signal of all C sound channels is many can to adopt identical method to handle.
(P4) carry out the high frequency residual error coupling by the interframe expansion
Above-mentioned frequency band matching strategy (P1), (P2) and (P3) matching strategy of described high frequency residual error be the high frequency residual error of utilizing the low frequency residual signals reconstruct present frame of present frame.Because have very big correlativity between the consecutive frame sound signal usually, so the residual error of consecutive frame also has bigger correlativity.Therefore, the high frequency residual error of current audio frame not only can be reconstructed with the low frequency residual signals of present frame, also can be reconstructed with the reconstruct residual signals that comprises reconstruct low frequency residual signals and reconstructed high frequency residual error of some frames before the present frame.As shown in Figure 9, adopt low frequency residual signals that above-mentioned frequency band matching strategy (P1), (P2) and method (P3) utilize the j frame that the high frequency residual error of j frame is reconstructed, or adopt low frequency residual signals that above-mentioned frequency band matching strategy (P1), (P2) and method (P3) utilize the j-1 frame that the high frequency residual error of j frame is reconstructed, can also adopt reconstructed high frequency residual error that above-mentioned frequency band matching strategy (P1), (P2) and method (P3) utilize the j-1 frame that the high frequency residual error of j frame is reconstructed.In addition, these methods can also expand to the j-2 frame, the j-3 frame goes.
Above-mentioned various frequency band matching strategy all need be sought the optimum matching frequency band between low frequency residual signals and high frequency residual error, said here the best can be expressed with error or distortion minimum, also can represent with the correlativity maximum.The step of the high frequency residual error parameter of concrete calculating optimum coupling comprises:
(B1) calculate normalization low frequency residual error band signal
Figure A200810232760D00161
With normalization high frequency residual error band signal
Figure A200810232760D00162
If the frequency span of high frequency residual error of mating and low frequency residual signals is L frequency, R L(p, l), l=0,1 ... L-1 is low frequency residual signals frequency band, R H(q, l), l=0,1 ... L-1 is the high frequency residual error frequency band, and wherein p and q represent R respectively L(p, l) and R H(q, the l) reference position of place frequency band, p ∈ [0, f D], q ∈ [f D, f Z], calculate
Figure A200810232760D00171
With
Figure A200810232760D00172
Wherein, Expression R L(p, l), l=0,1 ... L-1 is by the maximal value R of himself absolute value L max(p) carry out normalization low frequency residual signals that normalization obtains,
Figure A200810232760D00174
Expression R H(p, l), l=0,1 ... L-1 is by the maximal value R of himself absolute value H max(q) carry out the normalization high frequency residual error that normalization obtains.
(B2) calculate coupling distortion measure d (p, q) or coupling related function r (p, q), if carry out The matching analysis with the distortion minimum, then Pi Pei distortion measure is expressed as:
d ( p , q ) = w ( p , q ) · Σ l = 0 l = L - 1 w ( | R ^ H ( q , l ) | ) · ( R ^ L ( p , l ) - R ^ H ( q , l ) ) 2 - - - ( 1 )
Wherein w (p q) is the frequency influence factor,
Figure A200810232760D00176
It is residual error amplitude factor of influence.If w (p, q)=1.0, w ( | R ^ H ( q , l ) | ) = 1.0 , Formula (14) can be reduced to so:
d ( p , q ) = Σ l = 0 l = L - 1 ( R ^ L ( p , l ) - R ^ H ( q , l ) ) 2 - - - ( 2 )
If carry out The matching analysis with the correlativity maximum, normalization low frequency residual signals
Figure A200810232760D00179
With the normalization high frequency residual error
Figure A200810232760D001710
Related function be expressed as:
r ( p , q ) = w ( p , q ) · Σ l = 0 l = L - 1 w ( | R ^ H ( q , l ) | ) · ( R ^ L ( p , l ) · R ^ H ( q , l ) ) - - - ( 3 )
Equally, w (p, q) the expression frequency influence factor,
Figure A200810232760D001712
Expression residual error amplitude factor of influence.If w (p, q)=1.0, w ( | R ^ H ( q , l ) | ) = 1.0 , Formula (16) can be reduced to so:
r ( p , q ) = Σ l = 0 l = N - 1 R ^ L ( p , l ) R ^ H ( q , l ) - - - ( 4 )
(B3) determine best frequency band matched position p 0And q 0, this optimum matching is exactly distortion measure d (p, minimum value q) or related function r (p, the optimum value p of pairing p of maximal value q) and q 0And q 0, they have determined the frequency band that high frequency residual error and low frequency residual error are mated.
(B4) utilize following formula calculating optimum energy matching attribute:
G R ( p 0 , q 0 ) = R H max ( q 0 ) R Lp max ( p 0 ) - - - ( 5 )
Like this, the resulting frequency band division result of step 304, optimum matching frequency band position, frequency-domain residual height frequency range boundary frequency f D, energy matching attribute, sound channel coupling parameter and interframe spreading parameter, be exactly original high frequency residual error parameter.
Step 305, high frequency residual error parameter coding.
The original high frequency residual error parameter that step 304 produces is encoded, obtain the output of high frequency residual error coding.The coding of original high frequency residual error parameter adopts the various genuine coding methods of losing, and as linear or non-linear scalar quantization coding, vector quantization coding, perhaps adopts various undistorted coding methods simultaneously, as Huffman coding and arithmetic coding.
Step 204, frequency-domain residual decoding and reconstruct.
In decoding end, low frequency residual signals coding that receives and high frequency residual error parameter coding obtain the reconstructed frequency domain residual signals, as shown in Figure 4 through frequency-domain residual decoding and reconstruction processing.Concrete steps comprise:
Step 401 is decoded to the low frequency residual signals coding that receives, and obtains the decoded low frequency residual signals.
Step 402 is decoded to the high frequency residual error parameter coding that receives, and the high frequency residual error parameter that obtains decoding, this decoding high frequency residual error parameter comprise frequency band division result, frequency-domain residual height frequency range boundary frequency f D, optimum matching frequency band position, energy matching attribute, sound channel coupling parameter and interframe spreading parameter.
Step 403, high frequency residual error reconstruct.
According to decoding high frequency residual error parameter, utilize the decoded low frequency residual signals to duplicate and reconstructed high frequency residual error, concrete steps comprise:
(D1) the decoded low frequency residual signals that obtains according to step 401 and 402, optimum matching frequency band position and energy matching attribute, carry out high frequency residual error with following formula and duplicate with energy flux matched:
R H(q 0,l)=G R(p 0,q 0)·R L(p 0,l),l=0,1,...L-1 (6)
Obtain reconstructed high frequency residual error frequency band R H(q 0, l);
(D2) the frequency band division result who obtains according to step 401 and 402, sound channel coupling parameter and interframe spreading parameter make up all reconstructed high frequency residual error frequency bands, obtain reconstructed high frequency residual error.
Step 404, the frequency-domain residual that obtains according to decoding height frequency range boundary frequency f D, decoded low frequency residual signals and reconstructed high frequency residual error are made up, obtain the reconstructed frequency domain residual signals.
Step 205, the reconstructed frequency domain residual signals is handled through the time-frequency inverse transformation, obtains reconstruct time domain residual signals; Corresponding with time-frequency conversion, the time domain inverse transformation adopts inverse modified discrete cosine transform, the overlapping inverse transformation of inverse modified or inverse FFT method.
Step 206 is carried out time domain perception liftering to reconstruct time domain residual signals and is handled, and obtains the output of audio reconstruction time-domain signal; If use H R(z) expression time domain perception inverse filter, then H R(f) can be expressed as H R ( z ) = 1 H M ( z ) .
Audio coding method that the above embodiment of the present invention provides and coding/decoding method can carry out efficient high-quality compressed encoding to the sound signal that comprises voice signal, improve the audio frequency transfer efficiency.
Above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, the part that all can change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (10)

1. the audio encoding and decoding method based on frequency domain filtering and the reconstruct of high frequency residual error comprises the steps:
1) at coding side, the original time-domain signal of audio frequency is handled through time-frequency conversion, obtains original frequency-region signal;
2) original frequency-region signal obtains original frequency domain residual error signal through frequency domain perception Filtering Processing;
3) original frequency domain residual error signal obtains low frequency residual signals coding and high frequency residual error parameter coding, and outputs to transmission channel or storage medium through frequency-domain residual analysis and encoding process.
4) in decoding end, receive from the low frequency residual signals coding and the high frequency residual error parameter coding that output to transmission channel or storage medium, and it is carried out frequency-domain residual decoding and reconstruction processing, obtain the reconstructed frequency domain residual signals;
5) the reconstructed frequency domain residual signals is handled through frequency domain perception liftering, obtains the reconstructed frequency domain signal;
6) the reconstruct frequency-region signal is carried out the time-frequency inverse transformation and handle, obtain the audio reconstruction time-domain signal.
2. encode/decode audio signal method according to claim 1, wherein described frequency-domain residual analysis of step 3) and encoding process comprise the steps:
(A1) according to the frequency band division method of equiband frequency band or critical band or sound interval frequency band, earlier original frequency domain residual error signal is carried out frequency band division, the code rate of setting according to audio coder is selected a frequency band division end points then, original frequency domain residual error signal is divided into original low frequency residual signals and original high frequency residual error two parts, makes these two parts respectively have several frequency bands;
(A2) original low frequency residual signals is encoded, obtain the output of low frequency residual signals coding; Again the low frequency residual signals is coded in coding side and carries out local decode, obtain the decoded low frequency residual signals;
(A3) according to the similarity or the correlativity of decoded low frequency residual signals and original high frequency residual error, high frequency residual error parameter is analyzed, promptly select a frequency band matching strategy that decoded low frequency residual signals and original high frequency residual error are carried out the frequency band coupling, and the high frequency residual error parameter of calculating optimum coupling, obtain comprising the just original high frequency residual error parameter of frequency range boundary frequency, optimum matching frequency band position, energy matching attribute, sound channel coupling parameter and interframe spreading parameter of frequency band division result, frequency-domain residual;
(A4) original high frequency residual error parameter is encoded, obtain the output of high frequency residual error parameter coding.
3. encode/decode audio signal method according to claim 2, wherein the described frequency band matching strategy of step (A3) comprises: the high frequency residual error frequency band mates in low frequency residual signals zone, the high frequency residual error frequency band mates in low frequency residual signals zone and extended area thereof, carry out the high frequency residual error coupling and carry out the high frequency residual error coupling by the interframe expansion by sound channel coupling.
4. encode/decode audio signal method according to claim 3, wherein said high frequency residual error frequency band mates in low frequency residual signals zone, be to mate in low frequency residual signals zone, seek the optimum matching frequency band of error minimum or correlativity maximum with each high frequency residual error frequency band.
5. encode/decode audio signal method according to claim 3, wherein said high frequency residual error frequency band mates in low frequency residual signals zone and extended area thereof, be that the high frequency residual error frequency band that frequency is minimum mates in low frequency residual signals zone earlier, seek the optimum matching frequency band of error minimum or correlativity maximum, and with the minimum high frequency residual error frequency band of this optimum matching frequency band reconfiguration frequency; Again that the frequency of this reconstruct is minimum high frequency residual error frequency band adds original low frequency residual signals, forms a new low frequency residual signals of forming; Then based on the low frequency residual signals of this new composition, the high frequency residual error frequency band of higher frequency is mated, until the coupling of finishing all high frequency residual error frequency bands.
6. encode/decode audio signal method according to claim 3, the wherein said coupling by sound channel carried out the high frequency residual error coupling, be meant under the multi-channel audio signal situation, the high frequency residual error frequency band of any one sound channel adopts the high frequency residual error frequency band to mate in low frequency residual signals zone or the high frequency residual error frequency band mates in low frequency residual signals zone and extended area thereof, utilize the low frequency residual signals of place sound channel that the high frequency residual error of place sound channel is mated, and employing high frequency residual error frequency band mates in low frequency residual signals zone or the high frequency residual error frequency band mates in low frequency residual signals zone and extended area thereof, utilizes the low frequency residual signals of other sound channel that the high frequency residual error of place sound channel is mated.
7. encode/decode audio signal method according to claim 3, the wherein said expansion by interframe carried out the high frequency residual error coupling, the high frequency residual error frequency band that is meant present frame adopts the high frequency residual error frequency band to mate in low frequency residual signals zone, the high frequency residual error frequency band is in low frequency residual signals zone and extended area mates or carry out the high frequency residual error coupling by the sound channel coupling, utilize the low frequency residual signals of present frame that the high frequency residual error of present frame is mated, and adopt the high frequency residual error frequency band to mate in low frequency residual signals zone, the high frequency residual error frequency band is in low frequency residual signals zone and extended area mates or carry out the high frequency residual error coupling by the sound channel coupling, utilize the reconstructed frequency domain residual signals of former frame or preceding some frames, comprise reconstruct low frequency residual signals and reconstructed high frequency residual error, the high frequency residual error of present frame is mated.
8. encode/decode audio signal method according to claim 2, wherein the high frequency residual error parameter of the described calculating optimum coupling of step (A3) comprises the steps:
(B1) p ∈ [0, f D], q ∈ [f D, f Z] scope, calculate normalization low frequency residual error band signal
Figure A200810232760C00031
With normalization high frequency residual error band signal
Figure A200810232760C00032
Wherein, p and q represent respectively
Figure A200810232760C00033
With
Figure A200810232760C00034
The reference position of place frequency band, f DBe low frequency residual signals and high frequency residual error boundary frequency, l is a frequency pointer in the frequency band.
(B2) p ∈ [0, f D], q ∈ [f D, f Z] scope, calculate coupling distortion measure d (p, q) or coupling related function r (p, q);
(B3) p ∈ [0, f D], q ∈ [f D, f Z] scope, with distortion measure d (p, minimum value q) or related function r (p, the value p of pairing p of maximal value q) and q 0And q 0Be best frequency band matched position;
(B4) calculating optimum energy matching attribute G R(p 0, q 0).
9. encode/decode audio signal method according to claim 1, wherein described frequency-domain residual decoding of step 4) and reconstruction processing comprise the steps:
(C1) the low frequency residual signals coding that receives is decoded, obtain the decoded low frequency residual signals;
(C2) the high frequency residual error parameter coding that receives is decoded, high frequency residual error parameter obtains decoding;
(C3) utilize optimum matching frequency band position and energy matching attribute in decoded low frequency residual signals, the decoding high frequency residual error parameter, reconstructed high frequency residual error frequency band R H(q 0, l);
(C4) frequency band division method, sound channel coupling parameter and the interframe spreading parameter in the utilization decoding high frequency residual error parameter makes up the reconstructed high frequency residual error frequency band that obtains, and obtains reconstructed high frequency residual error;
(C5) the frequency-domain residual height frequency range boundary frequency in the utilization decoding high frequency residual error parameter makes up decoded low frequency residual signals and reconstructed high frequency residual error, obtains the reconstructed frequency domain residual signals.
10. the audio encoding and decoding method based on time-domain filtering and the reconstruct of high frequency residual error comprises the steps:
T1) at coding side, the original time-domain signal of audio frequency obtains original time domain residual signals through time domain perception Filtering Processing;
T2) original time domain residual signals is handled through time-frequency conversion, obtains original frequency domain residual error signal;
T3) original frequency domain residual error signal obtains low frequency residual signals coding and high frequency residual error parameter coding, and outputs to transmission channel or storage medium through frequency-domain residual analysis and encoding process.
T4) in decoding end, receive from the low frequency residual signals coding and the high frequency residual error parameter coding that output to transmission channel or storage medium, and it is carried out frequency-domain residual decoding and reconstruction processing, obtain the reconstructed frequency domain residual signals;
T5) the reconstruct frequency domain residual error signal is carried out the time-frequency inversion process, obtain reconstruct time domain residual signals;
T6) reconstruct time domain residual signals is carried out the time domain liftering and handle, obtain the audio reconstruction time-domain signal.
CN200810232760XA 2008-12-22 2008-12-22 Method for encoding and decoding audio Expired - Fee Related CN101436407B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN200810232760XA CN101436407B (en) 2008-12-22 2008-12-22 Method for encoding and decoding audio

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200810232760XA CN101436407B (en) 2008-12-22 2008-12-22 Method for encoding and decoding audio

Publications (2)

Publication Number Publication Date
CN101436407A true CN101436407A (en) 2009-05-20
CN101436407B CN101436407B (en) 2011-08-24

Family

ID=40710814

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200810232760XA Expired - Fee Related CN101436407B (en) 2008-12-22 2008-12-22 Method for encoding and decoding audio

Country Status (1)

Country Link
CN (1) CN101436407B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9537694B2 (en) 2012-03-29 2017-01-03 Huawei Technologies Co., Ltd. Signal coding and decoding methods and devices
US9626972B2 (en) 2012-12-06 2017-04-18 Huawei Technologies Co., Ltd. Method and device for decoding signal
CN109243485A (en) * 2018-09-13 2019-01-18 广州酷狗计算机科技有限公司 Restore the method and apparatus of high-frequency signal
CN111371460A (en) * 2020-04-26 2020-07-03 宁夏隆基宁光仪表股份有限公司 High-low frequency matching data compression method suitable for intelligent electric meter
CN113160842A (en) * 2021-03-06 2021-07-23 西安电子科技大学 Voice dereverberation method and system based on MCLP
CN116366014A (en) * 2023-05-26 2023-06-30 苏州至盛半导体科技有限公司 Dynamic range control circuit, audio processing chip and method based on frequency domain segmentation
WO2024094006A1 (en) * 2022-11-01 2024-05-10 抖音视界有限公司 Audio signal coding method and apparatus, and audio signal decoding method and apparatus

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003044098A (en) * 2001-07-26 2003-02-14 Nec Corp Device and method for expanding voice band
EP2071565B1 (en) * 2003-09-16 2011-05-04 Panasonic Corporation Coding apparatus and decoding apparatus
BRPI0517716B1 (en) * 2004-11-05 2019-03-12 Panasonic Intellectual Property Management Co., Ltd. CODING DEVICE, DECODING DEVICE, CODING METHOD AND DECODING METHOD.
US20090319277A1 (en) * 2005-03-30 2009-12-24 Nokia Corporation Source Coding and/or Decoding
CN101276587B (en) * 2007-03-27 2012-02-01 北京天籁传音数字技术有限公司 Audio encoding apparatus and method thereof, audio decoding device and method thereof

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10600430B2 (en) 2012-03-29 2020-03-24 Huawei Technologies Co., Ltd. Signal decoding method, audio signal decoder and non-transitory computer-readable medium
US9537694B2 (en) 2012-03-29 2017-01-03 Huawei Technologies Co., Ltd. Signal coding and decoding methods and devices
US9899033B2 (en) 2012-03-29 2018-02-20 Huawei Technologies Co., Ltd. Signal coding and decoding methods and devices
US9786293B2 (en) 2012-03-29 2017-10-10 Huawei Technologies Co., Ltd. Signal coding and decoding methods and devices
US10971162B2 (en) 2012-12-06 2021-04-06 Huawei Technologies Co., Ltd. Method and device for decoding signal
US9830914B2 (en) 2012-12-06 2017-11-28 Huawei Technologies Co., Ltd. Method and device for decoding signal
US9626972B2 (en) 2012-12-06 2017-04-18 Huawei Technologies Co., Ltd. Method and device for decoding signal
US10236002B2 (en) 2012-12-06 2019-03-19 Huawei Technologies Co., Ltd. Method and device for decoding signal
US10546589B2 (en) 2012-12-06 2020-01-28 Huawei Technologies Co., Ltd. Method and device for decoding signal
US11610592B2 (en) 2012-12-06 2023-03-21 Huawei Technologies Co., Ltd. Method and device for decoding signal
CN109243485A (en) * 2018-09-13 2019-01-18 广州酷狗计算机科技有限公司 Restore the method and apparatus of high-frequency signal
CN109243485B (en) * 2018-09-13 2021-08-13 广州酷狗计算机科技有限公司 Method and apparatus for recovering high frequency signal
CN111371460B (en) * 2020-04-26 2023-05-02 宁夏隆基宁光仪表股份有限公司 High-low frequency matching data compression method suitable for intelligent ammeter
CN111371460A (en) * 2020-04-26 2020-07-03 宁夏隆基宁光仪表股份有限公司 High-low frequency matching data compression method suitable for intelligent electric meter
CN113160842A (en) * 2021-03-06 2021-07-23 西安电子科技大学 Voice dereverberation method and system based on MCLP
CN113160842B (en) * 2021-03-06 2024-04-09 西安电子科技大学 MCLP-based voice dereverberation method and system
WO2024094006A1 (en) * 2022-11-01 2024-05-10 抖音视界有限公司 Audio signal coding method and apparatus, and audio signal decoding method and apparatus
CN116366014A (en) * 2023-05-26 2023-06-30 苏州至盛半导体科技有限公司 Dynamic range control circuit, audio processing chip and method based on frequency domain segmentation
CN116366014B (en) * 2023-05-26 2023-08-18 苏州至盛半导体科技有限公司 Dynamic range control circuit, audio processing chip and method based on frequency domain segmentation

Also Published As

Publication number Publication date
CN101436407B (en) 2011-08-24

Similar Documents

Publication Publication Date Title
CN101436407B (en) Method for encoding and decoding audio
CN102280109B (en) Code device, decoding device and their method
CN101067931B (en) Efficient configurable frequency domain parameter stereo-sound and multi-sound channel coding and decoding method and system
CN1910655B (en) Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
CN102016983B (en) Apparatus for mixing plurality of input data streams
CN102158198B (en) Filter generator, filter system and method for providing intermediate filters defined signal
CN102150207B (en) Compression of audio scale-factors by two-dimensional transformation
KR100954179B1 (en) Near-transparent or transparent multi-channel encoder/decoder scheme
CN101031962B (en) Partially complex modulated filter bank
CN101053019B (en) Device and method for encoding and decoding of audio signals using complex-valued filter banks
CN101276587B (en) Audio encoding apparatus and method thereof, audio decoding device and method thereof
NO20171179A1 (en) System and method for processing spectral values, codes and decoders for audio signals
CN100481733C (en) Coder for compressing coding of multiple sound track digital audio signal
CN103177730B (en) Decoding device, communication terminal, base station apparatus and coding/decoding method
CN102656628B (en) Optimized low-throughput parametric coding/decoding
CN101662288B (en) Method, device and system for encoding and decoding audios
CN102547551A (en) Binaural multi-channel decoder in the context of non-energy-conserving upmix rules
CN105023578A (en) Decoder system and decoding method
CN104603872A (en) Apparatus and method for reproducing an audio signal, apparatus and method for generating a coded audio signal, computer program and coded audio signal
CN103366755A (en) Method and apparatus for encoding and decoding audio signal
CN1973319A (en) Method and apparatus to encode and decode multi-channel audio signals
CN101800049B (en) Coding apparatus and decoding apparatus
CN101436406B (en) Audio encoder and decoder
CN100349207C (en) High frequency coupled pseudo small wave 5-tracks audio encoding/decoding method
MXPA98010783A (en) Audio signal encoder, audio signal decoder, and method for encoding and decoding audio signal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110824

Termination date: 20141222

EXPY Termination of patent right or utility model