CN101185119B - Method and apparatus for decoding an audio signal - Google Patents

Method and apparatus for decoding an audio signal Download PDF

Info

Publication number
CN101185119B
CN101185119B CN2006800182450A CN200680018245A CN101185119B CN 101185119 B CN101185119 B CN 101185119B CN 2006800182450 A CN2006800182450 A CN 2006800182450A CN 200680018245 A CN200680018245 A CN 200680018245A CN 101185119 B CN101185119 B CN 101185119B
Authority
CN
China
Prior art keywords
information
channel
around
filtering
coefficient
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN2006800182450A
Other languages
Chinese (zh)
Other versions
CN101185119A (en
Inventor
吴贤午
郑亮源
房熙锡
金东秀
林宰显
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020060030670A external-priority patent/KR20060122695A/en
Application filed by LG Electronics Inc filed Critical LG Electronics Inc
Priority claimed from PCT/KR2006/002016 external-priority patent/WO2006126855A2/en
Publication of CN101185119A publication Critical patent/CN101185119A/en
Application granted granted Critical
Publication of CN101185119B publication Critical patent/CN101185119B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

Method and apparatus for processing audio signals are provided. The method for decoding an audio signal includes receiving filter information, applying spatial information to the filter information to generate surround converting information, and outputting the surround converting information. The apparatus for decoding an audio signal includes a filter information receiving part receiving filter information; an information converting part applying spatial information to the filter information to generate surround converting information, and a surround converting information output part outputting the surround converting information.

Description

The method and apparatus of decoded audio signal
Technical field
The present invention relates to Audio Signal Processing, and more specifically say, relate to the method and apparatus that is used for audio signal, it can produce pseudo-in signal.
Background technology
Recently, develop the various technology and the method that are used for the coded digital sound signal, and also made its relevant product.In addition, developed many methods, wherein had multi channel sound signal and be used the psychoacoustic model coding.
This psychoacoustic model is the method that a kind of principle of using human voice recognition mode is reduced in the data volume when removing signal unnecessary in the process of encoding process effectively.For example, human ear can't be discerned quiet sound immediately after the sound of noise and excitement, and also only hears frequency at 20-20, the sound between the 000Hz.
Though developed the above existing technology and method, do not had the known audio signal that is used for to produce pseudo-method around signal from the audio bitstream that comprises spatial information.
Summary of the invention
The invention provides the method and apparatus and the data structure thereof that are used for decoded audio signal, it can provide pseudo-surrounding effect in audio system.
According to aspect of the present invention, a kind of method that is used for decoded audio signal is provided, this method comprises: the information that accepts filter, spatial information is applied to filtering information should be around transitional information around transitional information and output to produce.
According to another aspect of the present invention, a kind of device that is used for decoded audio signal is provided, this device comprises: the filtering information receiving unit of the information that accepts filter, with spatial information be applied to filtering information with produce around the information translation part of transitional information and output around transitional information around the transitional information output.
Provide a kind of data structure of sound signal more on the one hand according to of the present invention, this data structure comprises filtering information and spatial information.Here, this filtering information spatial information of being utilized application is converted to around transitional information.
Description of drawings
This accompanying drawing of following is included to provide further to be understood the present invention, and it illustrates embodiments of the invention, and works to explain the principle of the invention with this instructions.
In the accompanying drawings:
Fig. 1 illustrates the signal processing system according to one embodiment of the invention;
Fig. 2 illustrates according to the puppet of one embodiment of the invention around the schematic block diagram that produces part;
Fig. 3 illustrates the schematic block diagram according to the information translation part of one embodiment of the invention;
Fig. 4 illustrates according to one embodiment of the invention and is used to describe puppet around the schematic block diagram that presents process and spatial information transfer process;
Fig. 5 illustrates according to another embodiment of the present invention and is used to describe puppet around the schematic block diagram that presents process and spatial information transfer process;
Fig. 6 and Fig. 7 illustrate the schematic block diagram that is used to describe the channel mapping process according to one embodiment of the invention;
Fig. 7 illustrates the schematic block diagram that is used to describe the channel mapping process according to one embodiment of the invention;
Fig. 8 illustrates according to one embodiment of the invention and is used for via the synoptic diagram of describing filter factor with channel; With
Fig. 9 to Figure 11 illustrates according to the embodiment of the invention and is used to describe the schematic block diagram that is used to produce around the process of transitional information.
Embodiment
To at length be introduced embodiments of the invention now, the accompanying drawing illustrated that its example is being followed.
At first, the present invention is by term description, and this term uses in its relevant technology usually.But, defined some terms in the present invention clearly to describe the present invention.Therefore, the present invention must be based on the term that defines in the following description and understands.
" spatial information " expression in the present invention produces the information that multichannel needs by the signal of mixing under the uppermixing.Though with hypothesis space information is that spatial parameter is described the present invention, to understand easily, this spatial information is not subjected to the restriction of spatial parameter.Here, this spatial parameter comprises channel level poor (CLD), interchannel coherence (ICC) and channel estimating coefficient (CPC) etc.This channel level poor (CLD) is illustrated in the energy difference of two interchannels.This interchannel coherence (ICC) is illustrated in the cross correlation of two interchannels.This channel estimating coefficient (CPC) expression is from the predictive coefficient of three channels of two channel estimatings.
" core codec " expression in the present invention is used for the codec of coding audio signal.This core codec is space encoder information not.The present invention will suppose that the mixing sound signal is to be described by the sound signal of core codec coding down.In addition, this core codec can comprise the layer-II of Motion Picture Experts Group (MPEG), mpeg audio layer-III (MP3), AC-3, OggVorbis, DTS, Window media audio (WMA), Advanced Audio Coding (AAC) or AAC (HE-AAC) efficiently.But, this core codec can be provided.In this case, use unpressed PCM signal.This codec can be an existing codec and at the codec in the future of exploitation in the future.
" channel distribution part " expression can be divided into the input channel of given number the division part of the delivery channel of another given number, and wherein the delivery channel number is different from the number of input channel.This channel distribution partly comprises two to three (TTT) box, and it is converted to three delivery channels with two input channels.In addition, this channel distribution partly comprises one to two (OTT) box, and it is converted to two delivery channels with an input channel.Channel distribution of the present invention partly is not limited to TTT and OTT box, but understands easily, can be to use this channel distribution part in the system arbitrarily at its input channel number and delivery channel number.
Fig. 1 illustrates the signal processing system according to one embodiment of the invention.As shown in Figure 1, this signal processing system comprises encoding device 100 and decoding device 150.Though the present invention will describe based on sound signal, to understand easily, signal processing system of the present invention can be handled all signals except that sound signal.
This encoding device 100 comprises mixing part 110, core encoder part 120 and multiplexing section 130 down.This time mixing part 110 comprises mixing part 111 and spatial information estimation part 112 under the channel.
N multi channel audio signal X of mixing part 110 under input 1, X 2..., X NThe time, depend on certain following frequency mixing method or descend frequency mixing method to produce sound signal arbitrarily.Output to the number of sound signal of core encoder part 120 less than the number " N " of input multi channel audio signal from following mixing part 110 here.This spatial information estimation part 112 is extracted spatial information from the input multi channel audio signal, sends the spatial information that extracts to multiplexing section 130 then.Here, the number of following mixing channel can be one or two, perhaps can be the specific number according to following mixing order.The number of mixing channel can be set down.In addition, descend mixed frequency signal to be used as mixing sound signal down alternatively arbitrarily.
These core encoder part 120 codings are the mixing sound signal down, and it is via mixing channel transmission down.The following mixing sound signal of this coding is inputed to multiplexing section 130.
The following mixing sound signal of these multiplexing section 130 multiplexed codings and spatial information send the bit stream that produces to decoding device 150 then to produce bit stream.Here, this bit stream can comprise core codec bit stream and spatial information bit stream.
This decoding device 150 comprises multichannel decomposition part 160, core codec part 170 and pseudo-surround decoder part 180.This puppet surround decoder part 180 can comprise pseudo-in producing part 200 and information translation part 300.In addition, this puppet surround decoder part 180 may further include the filtering information receiving unit (not shown) that is used to the information of accepting filter and the context conversion information output part that is used for the output environment transitional information divided (not shown).In addition, this decoding device 150 may further include spatial information decoded portion 190.This multichannel is decomposed part 160 and is received this bit stream, and the bit stream multichannel that receives is decomposed into core codec bit stream and spatial information bit stream.This multichannel is decomposed part 160 and extract mixed frequency signal and spatial information down from the bit stream that receives.
This core codec part 170 is decomposed part 160 from multichannel and is received the bit stream that the core codec bit stream receives with decoding, then decoded result is exported to pseudo-surround decoder part 180 as the following mixed frequency signal of decoding.For example, when 100 times mixing multi-channel signals of encoding device were single channel signal or stereo channels signal, the following mixed frequency signal of this decoding can be single channel signal or stereo channels signal.Though the single channel of mixing channel or stereo channels were described under embodiments of the invention were based on and are used as, and understood easily, the present invention is not subject to down the number of mixing channel.
This spatial information decoded portion 190 is decomposed part 160 from multichannel and is received the spatial information bit streams, this spatial information bit stream of decoding, and export this decoded results as spatial information.
This puppet surround decoder part 180 plays usage space information from the mixed frequency signal generation is pseudo-in signal down.Below be that it is included in the pseudo-surround decoder part 180 for the description of puppet around generation part 200 and information translation part 300.
This information translation part 300 receives spatial information and filtering information.In addition, these information translation part 300 usage space information and filtering information produce around transitional information.Here, having around transitional information of this generation is suitable for producing pseudo-pattern around signal.Should represent filter factor in puppet around producing under the situation that part 200 is specific wave filters around transitional information.Though the present invention is based on as what describe around the filter factor of transitional information, understands easily, should not limited by this filter factor around transitional information.In addition, be the relevant transport function (HRTF) of head though this filtering information is assumed to be, to understand easily, this filtering information is not limited to HRTF.
In the present invention, filter factor described above is represented the coefficient of specific wave filter.For example, this filter factor can be as giving a definition.Prototype HRTF filter factor is represented the original filter factor of specific hrtf filter, and can be represented as GL_L or the like.The HRTF filter factor of conversion is represented from the filter factor of prototype HRTF filter factor conversion, and can be represented as GL_L ' or the like.The HRTF filter factor of spatialization be by spatialization prototype HRTF filter factor producing the pseudo-filter factor that obtains around signal, and can be represented as FL_L1 or the like.The main coefficient table that presents is shown to carry out and presents necessary filter factor.And can be represented as HL_L or the like.The master of interpolation presents coefficient and represents by interpolation and/or the main filter factor that presents the coefficient acquisition of obfuscation, and can be represented as HL_L ' or the like.According to the present invention, to understand easily, filter factor is not limited to above-described filter factor.
This puppet receives around transitional information from the following mixed frequency signal of core codec part 170 reception decodings with from information translation part 300 around producing part 200, and the following mixed frequency signal of use decoding and pseudo-in signal around the transitional information generation.For example, pseudo-ly be used in stereo audio system, providing virtual multichannel (perhaps around) sound around signal.According to the present invention, to understand easily, puppet will play above-described effect around signal in any equipment except that stereo audio system.This puppet can be carried out presenting of various kinds according to pattern is set around producing part 200.
Suppose that encoding device 100 transmits monophony or stereo mixed frequency signal rather than multi channel audio signal down, and this time mixed frequency signal is sent by the spatial information with multi channel audio signal.In this case, though the delivery channel of this equipment 150 is stereo channels rather than multichannel, comprise that this decoding device 150 of pseudo-surround decoder part 180 can provide the user to have the effect that virtual three-dimensional sound is listened to impression.
Below be according to the description of one embodiment of the invention, as shown in Figure 1 for sound signal structure 140.When sending sound signal based on useful load, it can receive via each channel or individual channel.The audio frequency useful load of 1 frame is made up of the voice data field and the auxiliary data field of coding.Here, this auxiliary data field can comprise the spatial information of coding.For example, if the data rate of audio frequency useful load is 48~128kbps, the data rate of spatial information can be 5~32kbps.Such example will not limit the scope of the invention.
Fig. 2 illustrates according to the puppet of one embodiment of the invention around the schematic block diagram that produces part 200.
The territory of Miao Shuing comprises the wherein decoding following mixing territory of mixed frequency signal down in the present invention, wherein handle spatial information to produce spatial information territory around transitional information, wherein usage space information is presenting the territory and wherein exporting the domain output of the puppet of time domain around signal of mixed frequency signal now.Here, this domain output sound signal can be heard by the mankind.This domain output refers to time domain.This puppet comprises and presents part 220 and domain output conversion portion 230 around producing part 200.In addition, this puppet may further include around generation part 200 and presents territory conversion portion 210, and it is different from when presenting the territory in mixing territory instantly, will descend the mixing territory to be converted to and present the territory.
Below be respectively by the description that is included in three territory conversion methods that three territory conversion portions presenting in the territory conversion portion 210 carry out.At first, be set to the sub-band territory and describe though following embodiment hypothesis presents the territory, understand easily, this presents the territory can be set to any territory.According to the first territory conversion method, be under the situation of time domain in following mixing territory, time domain is converted into and presents the territory.According to the second territory conversion method, be under the situation of discrete frequency domain in following mixing territory, discrete frequency domain is converted into and presents the territory.According to the 3rd territory conversion method, be under the situation of discrete frequency domain in following mixing territory, discrete frequency domain is converted into time domain, and the time domain of conversion is converted into and presents the territory afterwards.
This presents part 220 uses and is used for the puppet of following mixed frequency signal around the transitional information execution around presenting to produce puppet around signal.Here, this puppet of exporting from pseudo-surround decoder part 180 with stereo delivery channel becomes the pseudo-surround sound output with virtual surround sound sound around signal.In addition, because be signal during presenting the territory around signal, when being not time domain, the territory needs the territory conversion when presenting from the puppet that presents part 220 output.Though the present invention describes under the situation of stereo channels, understands easily, can use the present invention, and irrelevant with the number of delivery channel.
For example, can realize puppet around rendering method by the HRTF filtering method, wherein input signal experiences one group of hrtf filter.Here, spatial information can be the value that can use in mixing filter group territory, and mixing filter group territory is around undefined at MPEG.This puppet can realize as following embodiment according to the type in following mixing territory and spatial information territory around rendering method.For this reason, make down mixing territory and spatial information territory with present the territory and overlap.
According to the embodiment of puppet around rendering method, exist a kind of wherein in sub-band territory (QMF), carry out under the puppet of mixed frequency signal around the method that presents.This sub-band territory comprises simple sub-band territory and hybrid domain.For example, mixed frequency signal is a PCM signal and when down the mixing territory is not the sub-band territory instantly, presents territory conversion portion 210 and will descend the mixing territory to be converted to the sub-band territory.On the other hand, when the mixing territory was the sub-band territory instantly, following mixing territory did not need to be converted.Sometimes, in order to make down mixed frequency signal and spatial information synchronous, need to descend mixed frequency signal or spatial information to postpone.Here, when the spatial information territory was the sub-band territory, the spatial information territory did not need to be converted.In addition, pseudo-in signal in order to produce in time domain, this domain output conversion portion 230 will present the territory and be converted to time domain.
According to puppet another embodiment around rendering method, exist a kind of wherein in discrete frequency domain, carry out under the puppet of mixed frequency signal around the method that presents.Here, this discrete frequency domain is represented the frequency domain except that the sub-band territory.That is, this frequency domain can comprise at least one of discrete frequency domain and sub-band territory.For example, when the mixing territory was not discrete frequency domain instantly, this presented territory conversion portion 210 and will descend the mixing territory to be converted to discrete frequency domain.Here, when the spatial information territory was the sub-band territory, the spatial information territory need be converted into discrete frequency domain.This method is used for replacing filtering in time domain with the operation in discrete frequency domain, makes operating speed relatively promptly to carry out.In addition, pseudo-in signal in order to produce in time domain, this domain output conversion portion 230 can be converted to time domain with presenting the territory.
According to puppet another embodiment around rendering method, exist a kind of wherein in time domain, carry out under the puppet of mixed frequency signal around the method that presents.For example, when the mixing territory was not time domain instantly, this presented time domain conversion portion 210 and will descend the mixing territory to be converted to time domain.Here, when the spatial information territory was the sub-band territory, the spatial information territory also was converted into time domain.In this case, be time domain because this presents the territory, this domain output conversion portion 230 does not need to be converted to time domain with presenting the territory.
Fig. 3 illustrates the schematic block diagram according to the information translation part 300 of the embodiment of the invention.As shown in Figure 3, this information translation part 300 comprises that channel mapping part 310, coefficient produce part 320 and integral part 330.In addition, this information translation part 300 may further include and is used for handling in addition the additional treatments part (not shown) of filter factor and/or presents territory conversion portion 340.
This information translation part 300 accept filter information and spatial information are applied to filtering information to produce around transitional information with this spatial information, and output should be around transitional information then.Here, the territory of this filtering information can be mutually identical with the spatial information territory, so that this spatial information is applied to this filtering information.When the territory of the filtering information of this reception and spatial information territory when being inequality, the territory of this filtering information can be converted, and makes that two territories can be identical mutually.From now on, be that the present invention is described in the sub-band territory with the hypothesis space information field, still, should be appreciated that, the invention is not restricted to this hypothesis.
For example, when the territory conversion was applied to the filtering information of this reception, because the territory of filtering information and spatial information territory are inequality, this filtering information appeared in each sub-band.Here, when this filtering information appears in each sub-frequency bands, use it and do not make amendment, this causes a large amount of operations.Therefore, need in the sub-band territory, reduce the amount of filtering information.The embodiment of a minimizing method is a parameterization.For convenience, below the filtering information before the parameterization, be called the prototype filtering information in sub-band for short, and be called parametric filtering information below the filtering information after parameterization for short.Last in addition parametric filtering information is that the territory by translation filtering information obtains, and this filtering information of parametrization in the territory of conversion then.Last parametric filtering information is called as the filtering information of modification, and it can comprise parametric filtering information.
This channel mapping part 310 is carried out the channel mapping, makes the spatial information of this input can be mapped at least one channel signal of multi-channel signal, produces the channel mapping output valve as the channel map information then.
This coefficient produces part 320 and produces channel coefficients information.This channel coefficients information can comprise the coefficient information of channel or the coefficient information of interchannel.Here, the coefficient information of channel is represented at least one of size information and energy information or the like, and the coefficient information of this interchannel represents the relevant information of interchannel, and it is to use filter factor and channel mapping output valve to calculate.This coefficient produces a plurality of coefficients generation parts that part 320 can comprise channel.This coefficient produces part 320 and uses filtering information and channel mapping output valve to produce channel coefficients information.Here, this channel can comprise at least one of multichannel, following mixing channel and delivery channel.From now on, this channel will be described as multichannel, and the coefficient information of channel will also be described as size information.Though channel and coefficient information will be described based on the above embodiments, understand easily, there is the modification of many admissible embodiment.In addition, this coefficient produces part 320 and can produce channel coefficients information according to channel number or other feature.
Integral part 330 integrations of the coefficient information of this receive channel or add with the coefficient information of channel to produce the coefficient information of integration.In addition, this integral part 330 uses the integral coefficient of integral coefficient information to produce filter factor.This integral part 330 can produce this integral coefficient by the coefficient of further integration additional information and this channel.This integral part 330 can be according to the coefficient of at least one channel of characteristic-integration of channel coefficients information.For example, this integral part 330 can be according to the feature of channel coefficients information, following mixing channel, and delivery channel, integration is carried out in the combination of a channel that combines with delivery channel and the channel of listing.In addition, this integral part 330 can produce additional processing coefficient information by the coefficient of handling this integration in addition.That is, this integral part 330 can produce filter factor by additional processing.For example, this integral part 330 can pass through to handle in addition this integral coefficient, such as, by using specific function, perhaps produce filter factor by merging a plurality of integral coefficients in integral coefficient.Here, this integral coefficient information is at least one in delivery channel amplitude information, delivery channel energy information and the delivery channel relevant information.
When the spatial information territory is different from when presenting the territory, this presents territory conversion portion 340 can make the spatial information territory and present the territory and overlap.This presents territory conversion portion 340 and can be converted to and present the territory being used for pseudo-territory around the filter factor that presents.
Because this integral part 330 plays a part to reduce pseudo-in the workload that presents, it can be omitted.In addition, under the situation of stereo mixed frequency signal down, in the process of the coefficient information that produces channel, produce the coefficient sets that is applied to a left side and bottom right mixed frequency signal.Here, sets of filter coefficients can comprise the filter factor that sends to their channels from each channel, with the filter factor that sends to their relative channel from each channel.
Fig. 4 illustrates according to one embodiment of the invention and is used to describe puppet around the schematic block diagram that presents process and spatial information transfer process.Then, this embodiment stereo down mixed frequency signal of illustrating decoding is received pseudo-in the situation that produces part 410.
Information translation part 400 can be created in pseudo-in producing the coefficient that sends its own channel in the part 410 to, with the coefficient that sends relative channel in puppet in around generation part 410 to.This information translation part 400 produces coefficient HL_L and coefficient HL_R, and the coefficient HL_L that will produce and HL_R export to first and present part 413.Here, this coefficient HL_L is transmitted to pseudo-in the left output terminal that produces part 410, and this coefficient HL_R is transmitted to pseudo-in the right output terminal that produces part 410.In addition, this information translation part 400 produces coefficient HR_R and HR_L, and the coefficient HR_R that produces and HR_L are exported to second presents part 414.Here, this coefficient HR_R is transmitted to pseudo-in the right output terminal that produces part 410, and this coefficient HR_L is transmitted to pseudo-in the left output terminal that produces part 410.
This puppet comprises that around producing part 410 first presents part 413, second and present part 414 and totalizer 415 and 416.In addition, this puppet may further include territory conversion portion 411 and 412 around producing part 410, and it will descend the mixing territory to overlap with presenting the territory, when two territories mutually not simultaneously, for example, when the mixing territory is not the sub-band territory instantly, and to present the territory be the sub-band territory.Here, this puppet may further include anti-territory conversion portion 417 and 418 around producing part 410, and it will present the territory, and for example the sub-band territory is converted to time domain.Therefore, the user hears the audio frequency with virtual multichannel sound via the earphone with stereo channels etc.
First and second present part 413 and 414 receives stereo mixed frequency signal and one group of filter factor down.This sets of filter coefficients is applied to a left side and bottom right mixed frequency signal respectively, and from integral part 403 outputs.
For example, first and second present part 413 and 414 and use four filter factor HL_L, HL_R, HR_L and HR_R to carry out to present to produce pseudo-in signal from mixed frequency signal down.
More particularly, first presents part 413 can use filter factor HL_L and HL_R to carry out to present, and wherein filter factor HL_L is transmitted to its oneself channel, and this filter factor HL_R is transmitted to and its oneself the relative channel of channel.First presents part 413 can comprise that son presents part (not shown) 1-1 and 1-2.Here, this son presents part 1-1 and uses filter factor HL_L execution to present, this filter factor HL_L is transmitted to pseudo-in the left output terminal that produces part 410, and this son presents part 1-2 and uses filter factor HL_R execution to present, and this filter factor HL_R is transmitted to pseudo-in the right output terminal that produces part 410.In addition, second presents part 414 uses sets of filter coefficients HR_R and HR_L to carry out to present, and wherein filter factor HR_R is transmitted to its oneself channel, and this filter factor HR_L is transmitted to and its oneself the relative channel of channel.Second presents part 414 can comprise that son presents part (not shown) 2-1 and 2-2.Here, this son presents part 2-1 and uses filter factor HR_R execution to present, this filter factor HR_R is transmitted to pseudo-in the right output terminal that produces part 410, and this son presents part 2-2 and uses filter factor HR_L execution to present, and this filter factor HR_L is transmitted to pseudo-in the left output terminal that produces part 410.HL_R and HR_R addition in totalizer 416, and HL_L and HR_L addition in totalizer 415.Here, in case of necessity, HL_R and HR_L vanishing, this coefficient that refers to cross term is zero.Here, when HL_R and HR_L were zero, two transmission did not in addition interact.
On the other hand, under monophone, under the situation of mixed frequency signal, can present by embodiment execution with the structure that is similar to Fig. 4.More particularly, original monophone input is called as first channel signal, and the signal that obtains by decorrelation first channel signal is called as the second channel signal.In this case, first and second present part 413 and 414 can receive first and second channel signals, and their execution are presented.
With reference to figure 4, the stereo mixed frequency signal down of its definition input is represented by " x ", represent by " D " by spatial information being shone upon the channel mapping coefficient that obtains to channel, the prototype HRTF filter factor of outside input is represented by " G ", interim multi-channel signal is represented by " p ", and has been experienced the output signal that presents and represented by " y ".This mark " x ", " D ", " G ", " p " and " y " can represent by the matrix form of following formula 1.Formula 1 is based on to be represented as the prototype HRTF filter factor of prototype filter factor.But when the HRTF filter factor as the modification of the filter factor of revising was used for following formula, G must be with G ' replacement in following formula.
[formula 1]
x = Li Ri , p = L Ls R Rs C LFE , D = D _ L 1 D _ L 2 D _ Ls 1 D _ Ls 2 D _ R 1 D _ R 2 D _ Rs 1 D _ Rs 2 D _ C 1 D _ C 2 D _ LFE 1 D _ LFE 2 ,
G = GL _ L GLs _ L GR _ L GRs _ L GC _ L GLFE _ L GL _ R GLs _ R GR _ R GRs _ R GC _ R GLFE _ R y = Lo Ro
Here, when each coefficient was the value of frequency domain, interim multi-channel signal " p " can be passed through channel mapping coefficient " D " and the stereo product representation of mixed frequency signal " x " down, shown in following formula 2.
[formula 2]
p=D·x, L Ls R Rs C LFE = D _ L 1 D _ L 2 D _ Ls 1 D _ Ls 2 D _ R 1 D _ R 2 D _ Rs 1 D _ Rs 2 D _ C 1 D _ C 2 D _ LFE 1 D _ LFE 2 Li Ri
Then, when using prototype HRTF filter factor " G " to present interim multichannel " p ", can be by formula 3 expression these output signals " y ".
[formula 3]
y=G·p
Then, if insert p=Dx, can be by formula 4 expressions " y ".
[formula 4]
y=GDx
Here, if definition H=GD, this output signal " y " and stereo mixed frequency signal " x " down have the relation of following formula 5.
[formula 5]
H = HL _ L HR _ L HL _ R HR _ R , y = Hx
Therefore, the product of this filter factor allows to obtain " H ".Then, can be by acquisition this output signal " y " that stereo down mixed frequency signal " x " and " H " are multiplied each other.
The coefficient F that can obtain after a while to describe by following formula 6 (FL_L1, FL_L2 ...).
[formula 6]
H = GD =
GL _ L GLs _ L GR _ L GRs _ L GC _ L GLFE _ L GL _ R GLs _ R GR _ R GRs _ R GC _ R GLFE _ R
D _ L 1 D _ L 2 D _ Ls 1 D _ Ls 2 D _ R 1 D _ R 2 D _ Rs 1 D _ Rs 2 D _ C 1 D _ C 2 D _ LFE 1 D _ LFE 2
Fig. 5 illustrates according to another embodiment of the present invention and is used to describe puppet around the schematic block diagram that presents process and spatial information transfer process.Then, this embodiment illustrates that mixed frequency signal is received pseudo-in the situation that produces part 510 under the monophone of decoding.As shown in the drawing, information translation part 500 comprises that channel mapping part 501, coefficient produce part 502 and integral part 503.Because the above-mentioned element of this information translation part 500 is carried out information translation part 400 identical functions with Fig. 4, will omit its detailed description below.Here, this information translation part 500 can produce last filter factor, and its territory overlaps around the territory that presents that presents with wherein execution is pseudo-.When the following mixed frequency signal of decoding is under the monophone mixed frequency signal time, this sets of filter coefficients can comprise sets of filter coefficients HM_L and HM_R.This filter factor HM_L is used to carry out presenting of mixed frequency signal under the monophone, exports to pseudo-in the left channel that produces part 510 will present the result.This filter factor HM_R is used to carry out presenting of mixed frequency signal under the monophone, exports to pseudo-in the right channel that produces part 510 will present the result.
This puppet comprises that around producing part 510 the 3rd presents part 512.In addition, this puppet may further include territory conversion portion 511 and reverse territory conversion portion 513 and 514 around producing part 510.This puppet is different from the puppet of Fig. 4 around the element that produces part 410 around the element that produces part 510, this is because the following mixed frequency signal of decoding is mixed frequency signal under the monophone in Fig. 5, and this puppet comprises that around producing part 510 carrying out puppet presents part 512 and a territory conversion portion 511 around one the 3rd that presents.The 3rd presents part 512 from integral part 503 accept filter coefficient sets HM_L and HM_R, and can use puppet that the filter factor of reception carries out mixed frequency signal under the monophone around presenting, and produces pseudo-in signal.
Simultaneously, be under the situation of monophonic signal at following mixed frequency signal, can obtain the stereo output of mixing down by the puppet of carrying out mixed frequency signal under the monophone around presenting according to two kinds of following methods.
According to first method, the 3rd presents part 512 (for example, hrtf filter) does not use and is used for pseudo-filter factor around sound, and is to use the value of use when handling stereo mixing down.Here, the value of when handling stereo mixing down, using can be coefficient (left front=1, right front=0 ... or the like), this coefficient " left front " is to be used for left side output here, and this coefficient " right front " is to be used for right output.
Secondly, from the centre that following mixed frequency signal produces the decode procedure of multi-channel signal, obtain to have the stereo output of mixing down of required channel number in usage space information.
With reference to figure 5, mixed frequency signal is represented by " x " under its definition input monophone, the channel mapping coefficient is represented by " D ", the prototype HRTF filter factor of outside input is represented by " G ", interim multi-channel signal is represented by " p ", and experienced the output signal that presents and represented that by " y " this mark " x ", " D ", " G ", " p " and " y " can be represented by the matrix form of following formula 7.
[formula 7]
X=[Mi] p = L Ls R Rs C LFE , D = D _ L D _ Ls D _ R D _ Rs D _ C D _ LFE
G = GL _ L GLs _ L GR _ L GRs _ L GC _ L GLFE _ L GL _ R GLs _ R GR _ R GRs _ R GC _ R GLFE _ R , y = Lo Ro
Relation between the matrix in formula 7 is described in the explanation of Fig. 4.Therefore, following description is with the descriptions thereof are omitted.Here, Fig. 4 illustrates and receives the stereo situation of mixed frequency signal down, and Fig. 5 illustrates the situation that receives mixed frequency signal under the monophone.
Fig. 6 and Fig. 7 illustrate the schematic block diagram that is used to describe the channel mapping process according to one embodiment of the invention.This channel mapping process refers to a process, and wherein at least one of channel mapping output valve is to be mapped as multi channel at least one channel by the spatial information that will receive to produce, with compatible around producing part with puppet.This channel mapping process is carried out in channel mapping part 401 and 501.Here, spatial information, for example, energy can be mapped at least two of a plurality of channels.Can divide Lfe channel and central channel C here.In this case, because above-mentioned process does not need channel distribution part 604 or 705, it can simplify calculating.
For example, under receiving monophone, mixed frequency signal the time, can coefficient of performance CLD1 to CLD5, ICC1 to ICC5 etc. produce channels mapping output valves.This channel mapping output valve can be D L, D R, D c, D LEF, D Ls, D RsDeng.Because this channel mapping output valve by the usage space information acquisition, can obtain the channel mapping output valve of various kinds according to different formula.Can change the generation of channel mapping output valve here, according to the scope of the tree structure of the spatial information that receives by decoding device 150 and the spatial information that in decoding device 150, uses.
Fig. 6 and Fig. 7 illustrate the schematic block diagram that is used to describe the channel mapping structure according to one embodiment of the invention.Here, the channel mapping structure can comprise at least one channel distribution part of expression OTT box.The channel architecture of Fig. 6 has 5151 structures.
With reference to figure 6, can use OTT box 601,602,603,604,605 and spatial information,, CLD for example 0, CLD 1, CLD 2, CLD 3, CLD 4, ICC 0, ICC 1, ICC 2, ICC 3Deng producing multi-channel signal L, R, C, LFE, Ls, Rs from following mixed frequency signal " m ".For example, when this tree structure has as shown in Figure 6 5151 structures, can only use CLD to obtain this channel mapping output valve, as shown in Equation 8.
[formula 8]
L R C LFE Ls Rs = D L D R D C D LFE D Ls D Rs m = c 1 , OTT 3 c 1 , OTT 1 c 1 , OTT 0 c 2 , OTT 3 c 1 , OTT 1 c 1 , OTT 0 c 1 , OTT 4 c 2 , OTT 1 c 1 , OTT 0 c 2 , OTT 4 c 2 , OTT 1 c 1 , OTT 0 c 1 , OTT 2 c 2 , OTT 0 c 2 , OTT 2 c 2 , OTT 0 m
Wherein,
C 1 , OT T n 1 , m = 10 CL D S 1 , m 10 1 + 1 0 CL D S 1 , m 10 , C 2 , O TT S 1 , m = 1 1 + 10 CLD S 1 , m 10
With reference to figure 7, can use OTT box 701,702,703,704,705 and spatial information,, CLD for example 0, CLD 1, CLD 2, CLD 3, CLD 4, ICC 0, ICC 1, ICC 3, ICC 4Or the like produce multi-channel signal L, Ls, R, Rs, C, LFE from following mixed frequency signal " m ".
For example, when this tree structure has as shown in Figure 7 5152 structures, can only use CLD to obtain this channel mapping output valve, as shown in Equation 9.
[formula 9]
L Ls R Rs C LFE = D L D Ls D R D Rs D C D LFE m = c 1 , OTT 3 c 1 , OTT 1 c 1 , OTT 0 c 2 , OTT 3 c 1 , OTT 1 c 1 , OTT 0 c 1 , OTT 4 c 2 , OTT 1 c 1 , OTT 0 c 2 , OTT 4 c 2 , OTT 1 c 1 , OTT 0 c 1 , OTT 2 c 2 , OTT 0 c 2 , OTT 2 c 2 , OTT 0 m
This channel mapping output valve can change according to the time slot of frequency band range, parameter band and/or transmission.Here, pseudo-when presenting if enlarging between the adjacent frequency band or in the difference that forms the channel mapping output valve between the time slot on border when carrying out, distortion may appear.In order to prevent above-mentioned distortion, may in frequency and time domain, need the obfuscation of channel mapping output valve.More particularly, prevent that the method for distortion is as follows.At first, this method can adopt frequency ambiguityization and time ambiguityization, perhaps adopts any puppet that is suitable for around the other technologies that present in addition.In addition, can prevent this distortion by each channel mapping output valve be multiply by specific gain.
Fig. 8 illustrates the synoptic diagram that is used to describe the filter factor of channel according to one embodiment of the invention.For example, this filter factor can be the HRTF coefficient.
Pseudo-in presenting in order to carry out, the filter filtering by having filter factor GL_L is from the signal of left channel information source " L " 810, and then, this filtering L*GL_L as a result is used as left side output and transmits.In addition, by the filter filtering with filter factor GL_R, then, this filtering L*GL_R as a result is used as right output and transmits from the signal of left channel information source " L " 810.For example, left and right output can arrive user's left ear and auris dextra respectively.So, all left sides and right output obtain by channel.Then, the left side of this acquisition output is added and (for example, Lo), and the right side of this acquisitions is exported and added and export (for example, Ro) to produce the last right side to produce last left side output.Therefore, having experienced puppet can be by 10 expression of following formula around a last left side that presents and right output.
[formula 10]
Lo=L*GL_L+C*GC_L+R*GR_L+Ls*GLs_L+Rs*GRs_L
Ro=L*GL_R+C*GC_R+R*GR_R+Ls*GLs_R+Rs*GRs_R
According to embodiments of the invention, the method that is used to obtain L (810), C (800), R (820), Ls (830) and Rs (840) is as follows.At first, can obtain L (810), C (800), R (820), Ls (830) and Rs (840) by the coding/decoding method that is used to use down mixed frequency signal and spatial information to produce multi-channel signal.For example, can produce this multi-channel signal by MPEG surround decoder method.Secondly, can obtain L (810), C (800), R (820), Ls (830) and Rs (840) by only relevant formula with spatial information.
Fig. 9 to Figure 11 illustrates according to the embodiment of the invention and is used to describe the schematic block diagram of generation around the process of transitional information.
Fig. 9 illustrates according to one embodiment of the invention and is used to describe the schematic block diagram of generation around the process of transitional information.As shown in Figure 9, except that channel mapping part, the information translation part can comprise that coefficient produces part 900 and integral part 910.Here, this coefficient produce part 900 comprise the subsystem number produce part (coef_1 produce part 900_1, coef_2 produce part 900_2 ..., coef_N produces part 900_N) at least one.Here, this information translation partly may further include interpolation part 920 and territory conversion portion 930 so that handle filter factor in addition.
This coefficient produces part 900 usage space information and filtering information produces coefficient.Below be to produce part at specific subsystem number, for example, coef_1 produces the description that coefficient produces among the part 900_1 (it is called as the first subsystem number and produces part).
For example, mixed frequency signal the time, the first subsystem number produces part 900_1 and uses the value D_L that produces from spatial information to produce coefficient FL_L and the FL_R that is used for multi channel left channel under the input monophone.The coefficient FL_L of this generation and FL_R can be by 11 expressions of following formula.
[formula 11]
FL_L=D_L*GL_L (be used for mixed frequency signal produces the coefficient of left side output under the monophone of input)
FL_R=D_L*GL_R (being used for producing the coefficient of right output) from the monophone channel signal of input
Here, this D_L is the channel mapping output valve that produces from spatial information in the channel mapping process.The process that is used to obtain D_L can be according to the tree structure information change that encoding device transmits and decoding device receives.Similarly, produce part 900_2 at coef_2 and be called as second subsystem number generation part, and coef_3 produces part 900_3 and is called as under the 3rd subsystem number generation situation partly, the second subsystem number produces part 900_2 can produce coefficient FR_L and FR_R, and the 3rd subsystem number generation part 900_3 can produce FC_L and FC_R or the like.
For example, when under the input stereo audio mixed frequency signal time, the first subsystem number produces part 900_1 and uses the value D_L1 and the D_L2 that produce from spatial information to produce coefficient FL_L1, FL_L2, FL_R1 and the FL_R2 that is used for multi channel left channel.Coefficient FL_L1, FL_L2, FL_R1 and the FL_R2 of this generation can be by 12 expressions of following formula.
[formula 12]
FL_L1=D_L1*GL_L (the lower-left mixed frequency signal that is used for mixed frequency signal under the input stereo audio produces the coefficient of left side output)
FL_L2=D_L2*GL_L (the bottom right mixed frequency signal that is used for mixed frequency signal under the input stereo audio produces the coefficient of right output)
FL_R1=D_L1*GL_R (the lower-left mixed frequency signal that is used for mixed frequency signal under the input stereo audio produces the coefficient of right output)
FL_R2=D_L2*GL_R (the bottom right mixed frequency signal that is used for mixed frequency signal under the input stereo audio produces the coefficient of right output)
, be similar to the situation of mixed frequency signal under this monophone of input here, when under the input stereo audio mixed frequency signal time, at least one that can produce part 900_1 to 900_N by coefficient produces a plurality of coefficients.
This integral part 910 produces filter factor by integral coefficient, and this integral coefficient produces according to channel.The integration of monophone that this integral part 910 is used to import and stereo mixed frequency signal situation down can be by 13 expressions of following formula.
[formula 13]
Under the situation of mixed frequency signal under the input monophone:
HM_L=FL_L+FR_L+FC_L+FLS_L+FRS_L+FLFE_L
HM_R=FL_R+FR_R+FC_R+FLS_R+FRS_R+FLFE_R
Under input stereo audio under the situation of mixed frequency signal:
HL_L=FL_L1+FR_L1+FC_L1+FLS_L1+FRS_L1+FLFE_L1
HR_L=FL_L2+FR_L2+FC_L2+FLS_L2+FRS_L2+FLFE_L2
HL_R=FL_R1+FR_R1+FC_R1+FLS_R1+FRS_R1+FLFE_R1
HR_R=FL_R2+FR_R2+FC_R2+FLS_R2+FRS_R2+FLFE_R2
Here, HM_L and HM_R are illustrated in and are used for puppet under the situation of importing mixed frequency signal under the monophone around the filter factor that presents.On the other hand, HL_L, HR_L, HL_R and HR_R be illustrated in be used under the situation of mixed frequency signal under the input stereo audio pseudo-in the filter factor that presents.
This interpolation part 920 can this filter factor of interpolation.In addition, can be used as the obfuscation that filter factor is carried out in aftertreatment.This time ambiguityization can be carried out in time ambiguity part (not shown).When the spatial information that transmits and produce when time shaft has wide interval, this this filter factor of interpolation part 920 interpolations is to obtain non-existent spatial information between the spatial information that transmits and produce.For example, when spatial information was present in n parameter time slot and n+K parameter time slot (K>1), the embodiment of linear interpolation can be by 14 expressions of following formula.In the embodiment of formula 14, can use the filter factor of generation, for example HL_L, HR_L, HL_R and HR_R obtain the spatial information in not having the parameter time slot that transmits.Should be appreciated that this interpolation part 920 can be passed through several different methods interpolation filter coefficient.
[formula 14]
Under the situation of mixed frequency signal under the input monophone:
HM_L(n+j)=HM_L(n)*a+HM_L(n+k)*(1-a)
HM_R(n+j)=HM_R(n)*a+HM_R(n+k)*(1-a)
Under input stereo audio under the situation of mixed frequency signal:
HL_L(n+j)=HL_L(n)*a+HL_L(n+k)*(1-a)
HR_L(n+j)=HR_L(n)*a+HR_L(n+k)*(1-a)
HL_R(n+j)=HL_R(n)*a+HL_R(n+k)*(1-a)
HR_R(n+j)=HR_R(n)*a+HR_R(n+k)*(1-a)
Here, HM_L (n+j) and HM_R (n+j) expression is used for pseudo-coefficient around the filter factor acquisition that presents by interpolation mixed frequency signal the time under the input monophone.In addition, HL_L (n+j), HR_L (n+j), HL_R (n+j) and HR_R (n+j) expression is when being used for pseudo-coefficient around the filter factor acquisition that presents by interpolation mixed frequency signal the time under the input stereo audio.Here, " j " and " k " is integer, 0<j<k.In addition, " a " is real number (0<a<1), and by 15 expressions of following formula.
[formula 15]
a=j/k
By the linear interpolation of formula 14, can use the spatial information in n and n+K parameter time slot to obtain at the spatial information that does not have in the parameter time slot that between n and n+K parameter time slot, transmits.That is, can in two parameter time slots, on the straight line that the connection value by spatial information forms, obtain the unknown-value of spatial information according to formula 15.
When promptly being changed, the coefficient value between proximity modules in time domain can produce discrete point.Then, can be by the execution time obfuscation of time ambiguity part to prevent by the caused distortion of discrete point.Can operate this time ambiguity operation of executed in parallel with interpolation.In addition, can differently handle this time ambiguityization and interpolation operation according to their sequence of operation.
Under the situation of mixing channel, the time ambiguityization of this filter factor can be by 16 expressions of following formula under monophone.
[formula 16]
HM_L(n)′=HM_L(n)*b+HM_L(n-1)′*(1-b)
HM-R(n)′=HM_R(n)*b+HM_R(n-1)′*(1-b)
The obfuscation that formula 16 is described via 1 utmost point iir filter, wherein this obfuscation result can followingly obtain.That is, this filter factor HM_L (n) and HM_R (n) multiply by " b " respectively in current module (n).Then, this filter factor HM_L (n-1) ' and HM_R (n-1) ' multiply by (1-b) respectively in the module formerly (n-1).This multiplied result is added, as shown in Equation 16.Here, " b " is constant (0<b<1).The value of " b " is more little, and this obfuscation effect increase is many more.On the contrary, the value of " b " is big more, and this obfuscation effect increase is few more.Be similar to above-described method, can carry out the obfuscation of remaining filter factor.
Use formula 16 to be used for time ambiguityization, interpolation and obfuscation can be by formula 17 expressions.
[formula 17]
HM_L(n+j)′=(HM_L(n)*a+HM_L(n+k)*(1-a))*b+HM_L(n+j-1)′*(1-b)
HM_R(n+j)′=(HM_R(n)*a+HM_R(n+k)*(1-a))*b+HM_R(n+j-1)1*(1-b)
On the other hand, when interpolation part 920 and/or time ambiguityization part is carried out interpolation and time ambiguity respectively, can obtain the filter factor that its energy value is different from original filter factor.Under the sort of situation, can further need the energy scale processing to prevent above-mentioned problem.When presenting the territory and do not overlap with the spatial information territory, this territory conversion portion 930 is converted to the spatial information territory and presents the territory.But, if presenting the territory, this overlaps with the spatial information territory, do not need above-mentioned territory conversion.Here, when the spatial information territory is sub-band territory and when presenting the territory and be frequency domain, above-mentioned territory conversion can relate to wherein that coefficient is expanded or is reduced to the processing that meets frequency range and be used for the time range of each sub-band.
Figure 10 illustrates according to another embodiment of the present invention and is used to describe the schematic block diagram of generation around the transitional information process.As shown in figure 10, except that channel mapping part, the information translation part can comprise that coefficient produces part 1000 and integral part 1020.Here, this coefficient produce part 1000 comprise the subsystem number produce part (coef_1 produce part 1000_1, coef_2 produce part 1000_2 ... and coef_N produces part 1000_N) at least one.In addition, this information translation partly may further include interpolation part 1010 and territory conversion portion 1030 so that handle filter factor in addition.Here, this interpolation part 1010 comprise sub-interpolation part 1010_1,1010_2 ... and at least one of 1010_N.Different with the embodiment of Fig. 9, in the embodiment of Figure 10, this coefficient of these interpolation part 1010 interpolations produces the corresponding coefficient that part 1000 produces according to channel.For example, under the situation of mixing channel, this coefficient produces part 1000 and produces coefficient FL_L and FL_R under monophone, and under stereo mixing channel situation down, produces coefficient FL_L1, FL_L2, FL_R1 and FL_R2.
Figure 11 illustrates and is used to describe the schematic block diagram of generation around the process of transitional information according to another embodiment of the present invention.Different with the embodiment of Fig. 9 and 10, in the embodiment of Figure 11, each channel mapping output valve of interpolation part 1100 interpolations, coefficient produces the coefficient that part 1110 use interpolation results produce channel then.
In the embodiment of Fig. 9 to Figure 11, described because channel mapping output valve is in frequency domain (for example, the parameter band unit has single value), handle and in frequency domain, carry out, produce such as filter factor.In addition, pseudo-when presenting when carrying out in the sub-band territory, this territory conversion portion 930 or 1030 is not carried out the territory conversion, but the filter coefficient in sub-band territory along separate routes perhaps can be carried out conversion with the decomposition of adjusting frequency, and exports this transformation result then.
As mentioned above, the present invention can in addition therein decoding device can't produce in the environment of multi-channel signal, in decoding device, provide to have pseudo-sound signal the sound bit stream of the spatial information of mixed frequency signal and multi-channel signal under this decoding device receives and comprises around sound.
In addition, the invention provides a kind of method and apparatus that is used to produce around transitional information, and the data structure and the medium that are used for this method and apparatus, it can be used to change down mixed frequency signal is pseudo-in signal.
In addition, the invention provides a kind of be used for application space information in filtering information to produce method and a kind of method that is used for the pre-service filtering information around transitional information.
Apparent for those skilled in the art, do not break away from spirit of the present invention or scope, can carry out various improvement and variation in the present invention.Therefore, this invention is intended to cover it improvement of the present invention and variation that is provided within appended claim and its equivalent scope is provided.

Claims (22)

1. method that is used for decoded audio signal, this method comprises:
Information accepts filter;
Described filtering information is applied to spatial information to produce around transitional information; With
Output should be around transitional information.
2. according to the method for claim 1, it is pseudo-in signal to comprise that further use will be converted to corresponding to the following mixed frequency signal of spatial information around transitional information.
3. according to the process of claim 1 wherein, this filtering information comprises the filtering information of modification.
4. according to the process of claim 1 wherein, the reception of this filtering information comprises: the filtering information that filtering information is converted to modification.
5. according to the process of claim 1 wherein, the step of this application filtering information comprises:
By producing the channel map information according to channel mapping space information;
Use channel map information and filtering information to produce channel coefficients information; With
Use channel coefficients information to produce around transitional information.
6. according to the method for claim 5, wherein:
Should be at least one of integral coefficient information and additional treatments coefficient information around transitional information, this integral coefficient information be by integration channel coefficients information acquisition, and this additional treatments coefficient information is by additional treatments integral coefficient information acquisition; With
This integral coefficient information is at least one of delivery channel amplitude information, delivery channel energy information and delivery channel relevant information.
7. according to the process of claim 1 wherein, the application of described filtering information comprises:
By producing the channel map information according to channel mapping space information;
Use described channel map information and described filtering information to produce around transitional information.
8. according to the process of claim 1 wherein, the application of described filtering information comprises:
According to the process of claim 1 wherein that the generation around transitional information comprises:
Use described spatial information and described filtering information to produce channel coefficients information; With
Use described channel coefficients information to produce around transitional information.
9. according to the method for claim 1, further comprise receiving mixed frequency signal and spatial information down.
10. according to the method for claim 9, further comprise:
The sound signal of mixed frequency signal and spatial information under reception comprises,
Wherein, described mixed frequency signal down and described spatial information extract from this sound signal.
11. according to the process of claim 1 wherein that this spatial information comprises at least one among channel level difference and the interchannel coherence.
12. a device that is used for decoded audio signal, this device comprises:
The filtering information receiving unit of the information that accepts filter;
Described filtering information is applied to spatial information to produce the information translation part around transitional information; With
Export described around transitional information around the transitional information output.
13., comprise that further use will be converted to pseudo-puppet around signal corresponding to the following mixed frequency signal of spatial information around producing part around transitional information according to the device of claim 12.
14. according to the device of claim 12, wherein, this filtering information comprises the filtering information of modification.
15. according to the device of claim 12, wherein, described information translation part is converted to described filtering information the filtering information of modification.
16. according to the device of claim 12, wherein, described information translation partly comprises:
By produce the channel mapping part of channel map information according to channel mapping space information;
The coefficient that uses described channel map information and described filtering information to produce channel coefficients information produces part; With
Use the integral part of described channel coefficients information generation around transitional information.
17. according to the device of claim 16, wherein:
Described is at least one of integral coefficient information and additional treatments coefficient information around transitional information, described integral coefficient information is by integration channel coefficients information acquisition, and described additional treatments coefficient information is by additionally handling this integral coefficient information acquisition; And described integral coefficient information is delivery channel amplitude information, delivery channel energy information and delivery channel relevant information at least one.
18. according to the device of claim 12, wherein said information translation part is by producing the channel map information according to channel mapping space information, and uses described channel map information and described filtering information to produce described around transitional information.
19. according to the device of claim 12, wherein this information translation partly uses described spatial information and described filtering information to produce channel coefficients information, and uses this channel coefficients information to produce around transitional information.
20. the device according to claim 12 further comprises:
The multichannel of mixed frequency signal and described spatial information is decomposed part under receiving.
21. according to the device of claim 20, wherein, this multichannel is decomposed part and received the sound signal that comprises described mixed frequency signal down and described spatial information, wherein said mixed frequency signal down and described spatial information extract from this sound signal.
22. according to the device of claim 12, wherein said spatial information comprises at least one among channel level difference and the interchannel coherence.
CN2006800182450A 2005-05-26 2006-05-26 Method and apparatus for decoding an audio signal Active CN101185119B (en)

Applications Claiming Priority (16)

Application Number Priority Date Filing Date Title
US68457905P 2005-05-26 2005-05-26
US60/684,579 2005-05-26
US75998006P 2006-01-19 2006-01-19
US60/759,980 2006-01-19
US77672406P 2006-02-27 2006-02-27
US60/776,724 2006-02-27
US77944106P 2006-03-07 2006-03-07
US77944206P 2006-03-07 2006-03-07
US77941706P 2006-03-07 2006-03-07
US60/779,442 2006-03-07
US60/779,417 2006-03-07
US60/779,441 2006-03-07
KR10-2006-0030670 2006-04-04
KR1020060030670A KR20060122695A (en) 2005-05-26 2006-04-04 Method and apparatus for decoding audio signal
KR1020060030670 2006-04-04
PCT/KR2006/002016 WO2006126855A2 (en) 2005-05-26 2006-05-26 Method and apparatus for decoding audio signal

Publications (2)

Publication Number Publication Date
CN101185119A CN101185119A (en) 2008-05-21
CN101185119B true CN101185119B (en) 2011-07-27

Family

ID=39449512

Family Applications (3)

Application Number Title Priority Date Filing Date
CN2006800182380A Active CN101185117B (en) 2005-05-26 2006-05-25 Method and apparatus for decoding an audio signal
CN2006800182446A Active CN101185118B (en) 2005-05-26 2006-05-25 Method and apparatus for decoding an audio signal
CN2006800182450A Active CN101185119B (en) 2005-05-26 2006-05-26 Method and apparatus for decoding an audio signal

Family Applications Before (2)

Application Number Title Priority Date Filing Date
CN2006800182380A Active CN101185117B (en) 2005-05-26 2006-05-25 Method and apparatus for decoding an audio signal
CN2006800182446A Active CN101185118B (en) 2005-05-26 2006-05-25 Method and apparatus for decoding an audio signal

Country Status (1)

Country Link
CN (3) CN101185117B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8311810B2 (en) * 2008-07-29 2012-11-13 Panasonic Corporation Reduced delay spatial coding and decoding apparatus and teleconferencing system
WO2010085083A2 (en) 2009-01-20 2010-07-29 Lg Electronics Inc. An apparatus for processing an audio signal and method thereof
KR101187075B1 (en) * 2009-01-20 2012-09-27 엘지전자 주식회사 A method for processing an audio signal and an apparatus for processing an audio signal
CH703501A2 (en) * 2010-08-03 2012-02-15 Stormingswiss Gmbh Device and method for evaluating and optimizing signals on the basis of algebraic invariants.
EP2892250A1 (en) * 2014-01-07 2015-07-08 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for generating a plurality of audio channels

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003090208A1 (en) * 2002-04-22 2003-10-30 Koninklijke Philips Electronics N.V. pARAMETRIC REPRESENTATION OF SPATIAL AUDIO
WO2004036549A1 (en) * 2002-10-14 2004-04-29 Koninklijke Philips Electronics N.V. Signal filtering
CN1495705A (en) * 1995-12-01 2004-05-12 ���־糡ϵͳ�ɷ����޹�˾ Multichannel vocoder

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5970152A (en) * 1996-04-30 1999-10-19 Srs Labs, Inc. Audio enhancement system for use in a surround sound environment
US6931370B1 (en) * 1999-11-02 2005-08-16 Digital Theater Systems, Inc. System and method for providing interactive audio in a multi-channel audio environment
JP4399362B2 (en) * 2002-09-23 2010-01-13 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Audio signal generation

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1495705A (en) * 1995-12-01 2004-05-12 ���־糡ϵͳ�ɷ����޹�˾ Multichannel vocoder
WO2003090208A1 (en) * 2002-04-22 2003-10-30 Koninklijke Philips Electronics N.V. pARAMETRIC REPRESENTATION OF SPATIAL AUDIO
WO2004036549A1 (en) * 2002-10-14 2004-04-29 Koninklijke Philips Electronics N.V. Signal filtering

Also Published As

Publication number Publication date
CN101185117B (en) 2012-09-26
CN101185118B (en) 2013-01-16
CN101185117A (en) 2008-05-21
CN101185119A (en) 2008-05-21
CN101185118A (en) 2008-05-21

Similar Documents

Publication Publication Date Title
US8577686B2 (en) Method and apparatus for decoding an audio signal
CN103400583B (en) Enhancing coding and the Parametric Representation of object coding is mixed under multichannel
CN101553866B (en) A method and an apparatus for processing an audio signal
CN101401151B (en) Device and method for graduated encoding of a multichannel audio signal based on a principal component analysis
CN1914668B (en) Method and apparatus for time scaling of a signal
EP2088580B1 (en) Audio decoding
CN101887726B (en) Stereo coding and decoding methods and apparatuses thereof
EP3022735B1 (en) Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals
US20080052089A1 (en) Acoustic Signal Encoding Device and Acoustic Signal Decoding Device
CN101253806B (en) Method and apparatus for encoding and decoding an audio signal
US9595267B2 (en) Method and apparatus for decoding an audio signal
CN101821799A (en) Audio coding using upmix
CN101361121B (en) Method and apparatus for processing a media signal
CN101185119B (en) Method and apparatus for decoding an audio signal
CN101635145B (en) Method, device and system for coding and decoding
KR20060122695A (en) Method and apparatus for decoding audio signal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1119821

Country of ref document: HK

C14 Grant of patent or utility model
GR01 Patent grant
REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1119821

Country of ref document: HK