CN102239520A - A method and an apparatus for processing an audio signal - Google Patents

A method and an apparatus for processing an audio signal


Publication number
CN102239520A
Authority
CN
China
Prior art keywords
signal
information
background object
mixed
object signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2009801490217A
Other languages
Chinese (zh)
Inventor
吴贤午
郑亮源
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LG Electronics Inc
Priority claimed from PCT/KR2009/007265 (WO2010064877A2)
Publication of CN102239520A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/20 Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/03 Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/09 Electronic reduction of distortion of stereophonic sound systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used in stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03 Application of parametric coding in stereophonic audio systems

Abstract

Disclosed is a method of processing an audio signal, comprising: receiving a downmix signal, a residual signal, and object information; extracting at least one of a background-object signal and a foreground-object signal from the downmix signal using the residual signal; receiving mix information comprising gain control information for the background-object signal; generating downmix processing information based on the object information and the mix information; and generating a processed downmix signal by applying the downmix processing information to the at least one of the background-object signal and the foreground-object signal, the processed downmix signal comprising a modified background-object signal to which an adjusted gain corresponding to the gain control information is applied.

Description

Method and apparatus for processing an audio signal
Technical field
The present invention relates to an apparatus for processing an audio signal and a method thereof. Although the present invention is suitable for a wide range of applications, it is particularly suitable for encoding or decoding an audio signal.
Background art
Generally, in the process of downmixing a plurality of objects into a mono or stereo signal, parameters are extracted from each object signal. These parameters can be used by a decoder, and the panning and gain of each object can be controlled according to a selection made by the user.
Summary of the invention
Technical problem
However, in order to control each object signal, each source contained in the downmix should be appropriately positioned or panned.
Moreover, in order to provide backward compatibility with a channel-oriented decoding scheme, the object parameters should be converted into multi-channel parameters for upmixing.
Technical solution
Accordingly, the present invention is directed to an apparatus for processing an audio signal and a method thereof that substantially obviate one or more problems due to the limitations and disadvantages of the related art.
An object of the present invention is to provide an apparatus for processing an audio signal and a method thereof, by which a mono signal, a stereo signal, and a multi-channel signal can be output by controlling the gain and panning of an object.
Another object of the present invention is to provide an apparatus for processing an audio signal and a method thereof, by which distortion of sound quality can be prevented when the gain of a vocal or of background music is adjusted by a considerable amount.
A further object of the present invention is to provide an apparatus for processing an audio signal and a method thereof, by which the gain of background music can be adjusted when a mono or stereo signal is output without using a multi-channel decoder.
Advantageous effects
Accordingly, the present invention provides the following effects or advantages.
First, the present invention can control the gain and panning of an object without restriction.
Second, the present invention can control the gain and panning of an object based on a selection made by the user.
Third, even when a vocal or background music is completely suppressed, the present invention can prevent the sound quality distortion that gain adjustment would otherwise cause.
Fourth, when a mono or stereo signal is output, the present invention can adjust the gain of background music, so that a karaoke mode can be implemented freely.
Description of drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention.
In the drawings:
Fig. 1 is a block diagram of an encoder of an audio signal processing apparatus according to an embodiment of the present invention;
Fig. 2 is a block diagram of the NTT/NTO module included in the object encoder 120A/120B;
Fig. 3 is a block diagram of a decoder of an audio signal processing apparatus according to an embodiment of the present invention;
Fig. 4 is a flowchart of an audio signal processing method according to an embodiment of the present invention;
Fig. 5 is a block diagram of the OTN/TTN module included in the extraction unit 222;
Fig. 6 and Fig. 7 are block diagrams of a first example and a second example of a decoder for extracting a multi-channel background object (MBO) signal in karaoke mode;
Fig. 8 is a block diagram of an example of a decoder for extracting a mono/stereo background object (BGO) signal in karaoke mode;
Fig. 9 is a view for explaining the concept of outputting a mono background object (BGO) signal based on the 5-1-5₁ tree structure;
Fig. 10 is a view for explaining the concept of outputting a mono background object (BGO) signal based on the 5-1-5₂ tree structure;
Fig. 11 is a view for explaining the concept of outputting a stereo background object (BGO) signal based on the 5-2-5 tree structure;
Fig. 12 is a block diagram of an example of a decoder for extracting a foreground object (FGO) signal in solo mode;
Fig. 13 is a block diagram of an example of a decoder for extracting at least two foreground object (FGO) signals in solo mode;
Fig. 14 is a schematic block diagram of a product in which an audio signal processing apparatus according to an embodiment of the present invention is implemented; and
Fig. 15 is a view for explaining the relations between products in which an audio signal processing apparatus according to an embodiment of the present invention is implemented.
Detailed description of the embodiments
Additional features and advantages of the invention will be set forth in the description that follows, and in part will be apparent from the description or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
To achieve these and other advantages and in accordance with the purpose of the present invention, as embodied and broadly described, a method for processing an audio signal is provided, comprising: receiving a downmix signal, a residual signal, and object information; extracting at least one of a background-object signal and a foreground-object signal from the downmix signal using the residual signal; receiving mix information comprising gain control information for the background-object signal; generating downmix processing information based on the object information and the mix information; and generating a processed downmix signal comprising a modified background-object signal by applying the downmix processing information to the at least one of the background-object signal and the foreground-object signal, wherein an adjusted gain corresponding to the gain control information is applied to the modified background-object signal.
According to the present invention, the at least one of the background-object signal and the foreground-object signal is extracted further using the object information.
According to the present invention, the background-object signal corresponds to one of a mono signal and a stereo signal.
According to the present invention, the processed downmix signal corresponds to a time-domain signal.
According to the present invention, the method further comprises: generating multi-channel information using the object information and the mix information; and generating a multi-channel signal using the multi-channel information and the processed downmix signal.
To further achieve these and other advantages and in accordance with the purpose of the present invention, an apparatus for processing an audio signal is provided, comprising: a multiplexer receiving a downmix signal, a residual signal, and object information; an extraction unit extracting at least one of a background-object signal and a foreground-object signal from the downmix signal using the residual signal; an information generating unit receiving mix information comprising gain control information for the background-object signal and generating downmix processing information based on the object information and the mix information; and a rendering unit generating a processed downmix signal by applying the downmix processing information to the at least one of the background-object signal and the foreground-object signal, wherein, when the mix information comprises the gain control information for the background-object signal, the processed downmix signal comprises a modified background-object signal to which an adjusted gain corresponding to the gain control information is applied.
According to the present invention, the at least one of the background-object signal and the foreground-object signal is extracted further using the object information.
According to the present invention, the background-object signal corresponds to one of a mono signal and a stereo signal.
According to the present invention, the processed downmix signal corresponds to a time-domain signal.
According to the present invention, the apparatus further comprises a multi-channel decoder generating a multi-channel signal using multi-channel information and the processed downmix signal, wherein the information generating unit generates the multi-channel information using the object information and the mix information.
To further achieve these and other advantages and in accordance with the purpose of the present invention, a computer-readable medium having instructions stored thereon is provided, the instructions, when executed by a processor, causing the processor to perform operations comprising: receiving a downmix signal, a residual signal, and object information; extracting at least one of a background-object signal and a foreground-object signal from the downmix signal using the residual signal; generating downmix processing information based on the object information and mix information; and generating a processed downmix signal by applying the downmix processing information to the at least one of the background-object signal and the foreground-object signal, wherein, when the mix information comprises gain control information for the background-object signal, the processed downmix signal comprises a modified background-object signal to which an adjusted gain corresponding to the gain control information is applied.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are intended to provide further explanation of the invention as claimed.
Mode for the invention
Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings. First of all, terms or words used in this specification and claims should not be construed as limited to their general or dictionary meanings; they should be construed as having the meanings and concepts that match the technical idea of the present invention, based on the principle that an inventor may appropriately define the concepts of terms to describe the invention in the best way. The embodiments disclosed herein and the configurations shown in the drawings are merely preferred embodiments and do not represent the entire technical idea of the present invention. Therefore, it should be understood that the present invention covers modifications and variations of this invention, provided they come within the scope of the appended claims and their equivalents as of the time of filing this application.
According to the present invention, terms not defined in this specification can be construed as having the meanings and concepts that match the technical idea of the present invention. In particular, "information" in this disclosure is a term that generally covers values, parameters, coefficients, elements, and the like, and its meaning may be construed differently in some cases; the present invention is not limited thereby.
Fig. 1 is a block diagram of an encoder of an audio signal processing apparatus according to an embodiment of the present invention. Fig. 1(A) shows the case in which the background object (BGO) is a mono or stereo signal, and Fig. 1(B) shows the case in which the background object (BGO) is a multi-channel signal.
Referring to Fig. 1(A), the encoder 100A includes an object encoder 120A. The object encoder 120A generates a downmix signal DMX by downmixing a background object (BGO) and at least one foreground object (FGO) onto a mono or stereo channel according to an object-based scheme. In the course of this downmixing, the object encoder 120A also generates object information and a residual.
Here, the background object (BGO) is background music or the like that contains several source signals (for example, musical instrument signals). When several instrument sounds are to be controlled together rather than individually, the background object BGO can be composed of several instrument signals. If the background object BGO is a mono signal, that mono signal becomes one object. If the background object BGO is a stereo signal, the left channel signal and the right channel signal each become an object, so there are two object signals.
In contrast, the foreground object FGO corresponds to a single source signal and can correspond to, for example, at least one vocal signal. The foreground object FGO corresponds to a general object signal controlled by an object-based encoder/decoder.
If the level of the foreground object FGO is adjusted to 0, only the background object BGO is played back, so a karaoke mode can be implemented. Conversely, if the level of the background object BGO is lowered to 0, only the foreground object FGO is played back, so a solo mode can be implemented. If there are at least two foreground objects, an a cappella mode can be implemented.
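The relation between these playback modes and the object levels can be sketched in a few lines of Python. This is only an illustration: the function name `gains_for_mode`, the mode strings, and the use of linear gains with 1.0 as the unmodified level are assumptions and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ObjectGains:
    bgo: float        # linear gain for the background object (assumed convention)
    fgo: List[float]  # one linear gain per foreground object


def gains_for_mode(mode: str, num_fgo: int, bgo_gain: float = 1.0) -> ObjectGains:
    """Map a playback mode to hypothetical object gains."""
    if mode == "karaoke":
        # FGO level set to 0: only the background object is played back.
        return ObjectGains(bgo=bgo_gain, fgo=[0.0] * num_fgo)
    if mode == "solo":
        # BGO level lowered to 0: only the foreground object(s) remain.
        if num_fgo < 1:
            raise ValueError("solo mode needs at least one foreground object")
        return ObjectGains(bgo=0.0, fgo=[1.0] * num_fgo)
    if mode == "a_cappella":
        # Same as solo, but with at least two foreground objects.
        if num_fgo < 2:
            raise ValueError("a cappella mode needs at least two foreground objects")
        return ObjectGains(bgo=0.0, fgo=[1.0] * num_fgo)
    raise ValueError(f"unknown mode: {mode}")
```

For example, `gains_for_mode("karaoke", 2, bgo_gain=0.5)` keeps the background at half level and mutes both foreground objects.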
As mentioned in the foregoing description, the object encoder 120A generates the downmix DMX by downmixing objects including the background object BGO and the foreground object FGO, and generates object information in the course of the downmixing. Here, the object information (OI) is information about the objects contained in the downmix signal and is the information needed to generate a plurality of object signals from the downmix signal DMX. The object information can include object level information, object correlation information, and the like; the present invention is not limited thereto.
Meanwhile, in the downmixing process, the object encoder 120A can generate a residual signal corresponding to information about the difference between the background object BGO and the foreground object FGO. Specifically, the object encoder 120A can include an NTO module 122-1 or an NTT module 122-2, which will be described later with reference to Fig. 2.
Referring to Fig. 1(B), if the background object BGO is a multi-channel signal, the encoder 100B further includes a spatial encoder 110B. The spatial encoder 110B generates a mono or stereo downmix by downmixing the multi-channel background object MBO according to a channel-based scheme, and extracts spatial information in this downmixing process. Here, the spatial information is information for upmixing the downmix DMX into a multi-channel signal and includes channel level information, channel correlation information, and the like.
The spatial encoder 110B thus generates a mono or stereo downmix and spatial information. The spatial information is delivered to the decoder by being carried in the bitstream, and the mono or stereo downmix is input to the object encoder 120B as one or two objects. The object encoder 120B can have the same configuration as the object encoder 120A shown in Fig. 1(A), so its details are omitted from the following description.
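As a rough illustration of the channel-based downmixing performed by the spatial encoder 110B, the following Python sketch averages the channels of a multi-channel background object into a mono downmix and records each channel's share of the total energy as a stand-in for channel level information. The function name `spatial_encode`, the equal-weight averaging, and the energy-ratio parameter are assumptions made for illustration; the text does not specify how the downmix or the spatial information is computed.

```python
def spatial_encode(channels):
    """Downmix equal-length channels to mono and derive toy level parameters.

    channels: list of channels, each a list of float samples.
    Returns (mono_downmix, level_ratios), one ratio per input channel.
    """
    num_samples = len(channels[0])
    # Equal-weight average of all channels (assumed downmix rule).
    mono = [sum(ch[t] for ch in channels) / len(channels)
            for t in range(num_samples)]
    # Per-channel energy share, a hypothetical form of channel level information.
    energies = [sum(s * s for s in ch) for ch in channels]
    total = sum(energies) or 1.0  # avoid dividing by zero for silent input
    levels = [e / total for e in energies]
    return mono, levels
```

The mono signal would then be handed to the object encoder as a single object, while the level ratios travel alongside it as side information.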
Fig. 2 shows examples of the NTO module 122-1 and the NTT module 122-2.
Referring to Fig. 2(A), the NTO (N-to-1) module 122-1 generates a mono downmix DMX_m by downmixing a BGO on a mono channel (BGO_m) and two FGOs (FGO_1, FGO_2), and generates two residual signals, residual_1 and residual_2. For example, there may be two vocals over mono-channel background music. Because the background object is a mono signal, the downmix signal can also correspond to a mono signal. The first residual residual_1 can include a signal determined when a first interim downmix is generated by combining the first FGO (FGO_1) with the mono background object BGO_m; the present invention is not limited thereto. The second residual residual_2 can include a signal extracted when the final downmix DMX_m is generated by combining the second FGO (FGO_2) with the first interim downmix; the present invention is not limited thereto.
Referring to Fig. 2(B), the NTT (N-to-2) module 122-2 generates a stereo downmix DMX_L and DMX_R by downmixing a stereo BGO (BGO_L and BGO_R) and three FGOs, and also extracts first to third residuals, residual_1 to residual_3, in this downmixing process. In this case, because the BGO corresponds to stereo channels, the downmix signal can also correspond to stereo channels. As with the NTO module 122-1, the first residual residual_1 can include a signal determined when a first interim downmix is generated by combining the first FGO (FGO_1) with the stereo background object BGO_L and BGO_R; the present invention is not limited thereto. The second residual residual_2 can include a signal determined when a second interim downmix is generated by combining the second FGO (FGO_2) with the first interim downmix; the present invention is not limited thereto. The third residual residual_3 can include a signal determined when the final downmix DMX_L and DMX_R is generated by combining the third FGO (FGO_3) with the second interim downmix; the present invention is not limited thereto.
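The cascaded structure of the mono case in Fig. 2(A) can be sketched as follows. Each stage folds one FGO into the running downmix and emits one residual. The sum/difference rule used in `combine` is a toy assumption chosen so that each stage is exactly invertible; the text does not define how the residuals are actually computed, and the names `combine` and `nto_encode` are hypothetical.

```python
def combine(a, b):
    """Merge two equal-length signals into (downmix, residual).

    Toy rule (assumed): downmix is the half-sum, residual is the half-difference.
    """
    down = [(x + y) / 2 for x, y in zip(a, b)]
    resid = [(x - y) / 2 for x, y in zip(a, b)]
    return down, resid


def nto_encode(bgo, fgos):
    """Cascade N-to-1 downmix: one interim downmix and one residual per FGO."""
    downmix = list(bgo)
    residuals = []
    for fgo in fgos:
        # The running downmix plays the role of the interim downmix.
        downmix, resid = combine(downmix, fgo)
        residuals.append(resid)
    return downmix, residuals
```

With one BGO and two FGOs, this yields one final mono downmix and two residuals, mirroring the arrangement described for the NTO module 122-1.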
Fig. 3 is a block diagram of a decoder of an audio signal processing apparatus according to an embodiment of the present invention, and Fig. 4 is a flowchart of an audio signal processing method according to an embodiment of the present invention.
Referring to Fig. 3, the decoder includes a downmix processing unit 220 and an information generating unit 240, and may further include a multiplexer (not shown) and a multi-channel decoder 260. The downmix processing unit 220 can include an extraction unit 222 and a rendering unit 224.
Referring to Fig. 3 and Fig. 4, the multiplexer (not shown) receives a downmix signal, a residual signal, and object information via a bitstream [S110]. The downmix signal can correspond to a signal generated by downmixing a background object (BGO) and at least one foreground object (FGO) according to the method described with reference to Fig. 1 and Fig. 2. The residual signal can correspond to the residual signal described with reference to Fig. 1 and Fig. 2. The object information can be the same as the object information described with reference to Fig. 1, so its details are omitted here.
The extraction unit 222 extracts the background object BGO and at least one foreground object FGO from the downmix signal DMX [S120]. As mentioned above, the downmix signal DMX can correspond to a mono or stereo channel, and the background object BGO can correspond to a mono or stereo signal. The extraction unit 222 can include an OTN (1-to-N) module and a TTN (2-to-N) module, whose configurations are described below with reference to Fig. 5.
Fig. 5 is a block diagram of the OTN/TTN module included in the extraction unit 222.
Referring to Fig. 5, the OTN module 222-1 extracts at least one FGO from the mono downmix DMX_m, and the TTN module 222-2 extracts at least one FGO from the stereo downmix DMX_L and DMX_R. The OTN module 222-1 can perform the inverse of the processing of the NTO module 122-1 described with reference to Fig. 2, and the TTN module 222-2 can perform the inverse of the processing of the NTT module 122-2 described with reference to Fig. 2. Details of the OTN and TTN modules are therefore omitted from the following description.
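To illustrate what "inverse processing" can mean for the mono case, the sketch below undoes a cascaded downmix stage by stage, starting from the last residual. It assumes a toy encoder in which each stage produced a half-sum downmix and a half-difference residual; that rule and the names `split` and `otn_decode` are assumptions for illustration, not the patent's definition of the OTN module.

```python
def split(down, resid):
    """Invert one toy stage: recover the two signals from (downmix, residual)."""
    a = [d + r for d, r in zip(down, resid)]  # signal that was the running downmix
    b = [d - r for d, r in zip(down, resid)]  # FGO folded in at this stage
    return a, b


def otn_decode(downmix, residuals):
    """Recover the BGO and all FGOs from a mono downmix and its residuals."""
    current = list(downmix)
    fgos = []
    # Residuals were produced first-to-last, so they are consumed last-to-first.
    for resid in reversed(residuals):
        current, fgo = split(current, resid)
        fgos.append(fgo)
    fgos.reverse()  # restore the original FGO order
    return current, fgos
```

After the loop, `current` holds the recovered background object, and `fgos` holds the foreground objects in their original order.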
Referring again to Fig. 3 and Fig. 4, the extraction unit 222 can further use the object information to extract the background object and at least one foreground object from the mono or stereo downmix DMX. The object information can be obtained by being parsed directly by the extraction unit 222 or by being delivered from the information generating unit 240; the present invention is not limited thereto.
Meanwhile, the information generating unit 240 receives mix information MXI [S130]. The mix information MXI can include gain control information for the BGO. The mix information (MXI) is information generated based on object position information, object gain information, playback configuration information, and the like. The object position information and the object gain information are information for controlling the objects contained in the downmix. Here, "object" covers both the background object BGO and the foreground object FGO described above.
Specifically, the object position information is information input by the user to control the position or panning of each object. The object gain information is information input by the user to control the gain of each object. The object gain information can therefore include gain control information for the BGO as well as gain control information for the FGO.
Meanwhile, the object position information or the object gain information can be one selected from preset modes. A preset mode is a set of predetermined values for the gain or position of an object. Preset mode information can be a value received from another device or a value stored in the device. Which of one or more preset modes is selected (for example, preset mode not used, preset mode 1, preset mode 2, and so on) can be determined by user input.
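The preset selection described above amounts to a lookup that falls back to the user's own values when no preset is in use. The following Python sketch shows one way to express that; the preset table, its field names, and the function `resolve_mix` are hypothetical and serve only as an example.

```python
# Hypothetical preset table: values stored in the device or received from another one.
PRESETS = {
    1: {"bgo_gain": 1.0, "fgo_gains": [0.0]},  # e.g. background only
    2: {"bgo_gain": 0.0, "fgo_gains": [1.0]},  # e.g. foreground only
}


def resolve_mix(preset_id, user_mix):
    """Return the gain settings to use.

    preset_id: None means "preset mode not used"; otherwise a key of PRESETS.
    user_mix:  settings entered directly by the user.
    """
    if preset_id is None:
        return dict(user_mix)
    if preset_id not in PRESETS:
        raise KeyError(f"unknown preset mode: {preset_id}")
    return dict(PRESETS[preset_id])
```

Copies are returned so that a caller can modify the result without altering the stored presets.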
The playback configuration information is information including the number of speakers, the positions of the speakers, ambient information (virtual positions of the speakers), and the like. It can be input by the user, stored in advance, or received from another device.
The information generating unit 240 can receive output mode information (OM) as well as the mix information (MXI). The output mode information (OM) is information about the output mode; for example, it can include information indicating how many signals are used for output. This information can correspond to one selected from the group consisting of a mono output mode, a stereo output mode, and a multi-channel output mode. The output mode information (OM) can be identical to the number of speakers in the mix information (MXI). If the output mode information (OM) is stored in advance, it is based on device information. If it is input by the user, it is based on user input information, which can be included in the mix information (MXI).
The information generating unit 240 generates downmix processing information based on the object information received in step S110 and the mix information received in step S130 [S140]. The mix information can include gain control information for the BGO and gain and/or position information for the FGO. For example, in karaoke mode, the gain of the FGO is adjusted to 0, and the gain of the BGO can be adjusted within a preset range. Conversely, in solo or a cappella mode, the gain of the BGO is adjusted to 0, and the gain and/or position of at least one FGO can be controlled.
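One simple way to picture downmix processing information is as a rendering matrix with one row per output channel and one column per object, built from the per-object gains and positions in the mix information. The sketch below follows that idea. The matrix form, the constant-power panning law, the pan range of -1 (left) to +1 (right), and the name `make_dpi` are all assumptions for illustration; the text does not specify the format of the downmix processing information.

```python
import math


def make_dpi(object_gains, object_pans, stereo=True):
    """Build a toy rendering matrix from per-object gain and pan values.

    object_gains: linear gain per object (BGO first, then FGOs, by assumption).
    object_pans:  pan per object in [-1.0, 1.0]; ignored for mono output.
    Returns a list of rows, one per output channel.
    """
    if not stereo:
        return [list(object_gains)]
    left, right = [], []
    for gain, pan in zip(object_gains, object_pans):
        theta = (pan + 1.0) * math.pi / 4.0  # 0 = hard left, pi/2 = hard right
        left.append(gain * math.cos(theta))
        right.append(gain * math.sin(theta))
    return [left, right]
```

In karaoke mode, for instance, the FGO column of the matrix is all zeros while the BGO column carries the user's adjusted gain.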
The rendering unit 224 generates a processed downmix signal by applying the downmix processing information generated in step S140 to at least one of the background object BGO and the at least one foreground object FGO [S150].
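If the downmix processing information is pictured as a matrix of per-object gains, the rendering step is a weighted sum of the extracted object signals for each output channel. The following sketch shows that operation under this assumption; the matrix representation and the name `render` are hypothetical, not taken from the text.

```python
def render(dpi, objects):
    """Apply a gain matrix to object signals.

    dpi:     list of rows, one per output channel; each row has one gain per object.
    objects: list of equal-length object signals (lists of float samples).
    Returns one output signal per row of dpi.
    """
    num_samples = len(objects[0])
    output = []
    for row in dpi:
        if len(row) != len(objects):
            raise ValueError("each row needs exactly one gain per object")
        # Each output sample is the gain-weighted sum of the object samples.
        channel = [sum(gain * obj[t] for gain, obj in zip(row, objects))
                   for t in range(num_samples)]
        output.append(channel)
    return output
```

With a row such as `[0.5, 0.0]`, the first object (for example the BGO) is attenuated to half level and the second object is removed from the processed downmix.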
Subsequently, if the output mode (OM) is the mono or stereo output mode, the rendering unit 224 generates and outputs the processed downmix signal as a time-domain signal [S160]. If the output mode (OM) is the multi-channel output mode, the information generating unit 240 generates multi-channel information (MI) based on the object information and the mix information (MXI). Here, the multi-channel information (MI) is information for upmixing the downmix (DMX) into a multi-channel signal and can include channel level information, channel correlation information, and the like.
If the multi-channel information (MI) is generated, the multi-channel decoder 260 generates a multi-channel output signal using the downmix (DMX) and the multi-channel information (MI) [S160].
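The branch on the output mode can be summarized in a short dispatch function. In the sketch below, the two callables stand in for the information generating unit and the multi-channel decoder; their names, signatures, and the mode strings are assumptions for illustration only.

```python
def produce_output(output_mode, processed_downmix, object_info, mix_info,
                   make_multichannel_info, multichannel_decode):
    """Route the processed downmix according to the output mode.

    make_multichannel_info(object_info, mix_info) -> multi-channel information
    multichannel_decode(downmix, multichannel_info) -> multi-channel signal
    Both callables are hypothetical stand-ins supplied by the caller.
    """
    if output_mode in ("mono", "stereo"):
        # No multi-channel decoder involved: the processed downmix is the output.
        return processed_downmix
    if output_mode == "multichannel":
        mi = make_multichannel_info(object_info, mix_info)
        return multichannel_decode(processed_downmix, mi)
    raise ValueError(f"unknown output mode: {output_mode}")
```

Passing the two stages as parameters keeps the sketch independent of any particular upmixing algorithm.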
Fig. 6 and Fig. 7 are respectively first example and second examples that is used to extract the demoder of multichannel background object (MBO) signal under the situation of karaoke mode.
With reference to figure 6, demoder 200A.1 comprises the element that the element with the above-mentioned demoder of describing with reference to figure 3 200 has same names, and the functionally similar function of execution and above-mentioned demoder 200 shown in Figure 3.In the following description, the element of carrying out the function different with the function of above-mentioned demoder 200 shown in Figure 3 will be explained.
At first, as the said extracted unit 222 with reference to description among the figure 3, extraction unit 222A extracts background object and at least one foreground object from following mixing.In this case, if background object corresponding to multichannel background object (MBO), then the multiplexer (not shown) receives spatial information.In this case, spatial information is that being used for following mixing background object uppermixing is the information of multi-channel signal, and can be identical with the above-mentioned spatial information that generates by spatial encoder 1210B shown in Figure 1.
If the background object BGO corresponds to a signal downmixed from a multi-channel background object MBO, and karaoke mode is selected according to the mix information (MXI) (i.e., the gain of the FGO is adjusted to 0), the multi-channel decoder 240A can use the received spatial information as it is, instead of having the information generating unit 230A.1 generate multi-channel information (MI). This is because the spatial information is the information that was generated when the mono/stereo background object BGO was produced from the MBO.
In doing so, before the extracted BGO is input to the multi-channel decoder 260A, a control that raises or lowers the overall gain of the BGO can be performed. Information on this control is included in the mix information (MXI), which is then reflected in the downmix processing information (DPI). Accordingly, the corresponding gain can be adjusted before the BGO is upmixed into a multi-channel signal.
As in the case shown in Fig. 6, Fig. 7 shows a case in which a BGO is downmixed from an MBO and the gain of the BGO is adjusted before the BGO is upmixed into the MBO. The decoder 220A.1 shown in Fig. 6 reflects this control in the downmix processing information. In contrast, the decoder 220A.2 shown in Fig. 7 converts this control into an arbitrary downmix gain (ADG), which can be included in the spatial information input to the multi-channel decoder 260A.1. Here, the arbitrary downmix gain is a factor that the multi-channel decoder uses to adjust the gain of the downmix signal. It is applied to the downmix signal, i.e., only to the mono or stereo BGO, before that signal is upmixed into a multi-channel signal. Accordingly, the gain of the mono or stereo BGO can be adjusted using the arbitrary downmix gain.
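The idea of adjusting the BGO gain ahead of the upmix can be illustrated with a minimal Python sketch. It assumes a single broadband gain given in dB and a BGO held as per-channel lists of samples. The function names are hypothetical, and an actual arbitrary downmix gain would be carried as a parameter inside the spatial information rather than applied by a helper like this.

```python
def db_to_linear(gain_db):
    """Convert a gain in dB to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)


def apply_gain_before_upmix(bgo_channels, gain_db):
    """Scale every channel of a mono/stereo BGO before it is upmixed.

    bgo_channels: list of channels, each a list of samples (assumed layout).
    gain_db: requested gain change in dB (assumed to be broadband).
    """
    factor = db_to_linear(gain_db)
    return [[factor * sample for sample in channel] for channel in bgo_channels]


# A stereo BGO attenuated by 20 dB ahead of the upmix stage.
stereo_bgo = [[1.0, -0.5], [0.25, 0.0]]
quieter = apply_gain_before_upmix(stereo_bgo, -20.0)
```

Because the gain is applied to the one or two downmix channels only, the upmix stage itself does not need to change.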
Fig. 8 is a block diagram of an example of a decoder for extracting a mono/stereo background object (BGO) signal in karaoke mode.
Referring to Fig. 8, as in Figs. 6 and 7, the decoder 200B includes elements having the same names as those of the decoder 200 described with reference to Fig. 3, and most of them perform functions similar to those of the decoder 200 shown in Fig. 3. The following description covers only the differences.
First, unlike the cases shown in Figs. 6 and 7, the background object BGO is not a multi-channel background object MBO, so the decoder 200B does not receive spatial information from the encoder. Therefore, the mono/stereo background object BGO is not input to the multi-channel decoder 260B, but can be output as a time-domain signal from the downmix processing unit 220B. However, if the user has multi-channel loudspeakers (e.g., 5.1 channels) and the BGO is input to the multi-channel decoder 260B, the BGO may need to be mapped to the center channel, or to the left and right channels, of the 5.1 channels. The user may also attempt to map a mono BGO to the left and right channels at the same level. Automatic BGO rendering according to the output mode and BGO rendering according to the user's intention are described in detail below.
1. Automatic BGO rendering according to the output mode
First, when the number of channels of the mono or stereo BGO matches the number of channels of the output mode, the decoder 200B needs no additional processing. For example, if the BGO is a mono signal and the output mode (OM) of the decoder is mono, the rendering unit 224B outputs a time-domain mono signal. Likewise, if the BGO is a stereo signal and the output mode (OM) of the decoder is stereo, the rendering unit 224B outputs a time-domain stereo signal.
However, if the BGO is mono or stereo and the output mode has at least three channels, such as 5.1 channels, the multi-channel decoder 260B should be activated. Specifically, to map the mono or stereo BGO appropriately onto the multiple channels, the information generating unit 240B generates multi-channel information (MI). For example, a mono BGO is mapped to the center channel (C) of the multi-channel output, and a stereo BGO can be rendered to the left channel L and the right channel R of the multi-channel output, respectively. To perform this rendering, spatial parameters corresponding to the various tree structures should be generated and included in the multi-channel information (MI). The details are described below with reference to Figs. 9, 10 and 11.
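The decision described in this subsection can be summarized in a short Python sketch. The function name and the returned dictionary are hypothetical conveniences, not part of the described decoder, and combinations the text does not discuss (for example a stereo BGO with a mono output) are deliberately left unimplemented.

```python
def plan_bgo_output(bgo_channels, output_channels):
    """Decide how a mono/stereo BGO reaches the output (illustrative only)."""
    if bgo_channels not in (1, 2):
        raise ValueError("this sketch covers a mono or stereo BGO only")
    if output_channels == bgo_channels:
        # Channel counts match: the rendering unit outputs the time-domain
        # signal directly and the multi-channel decoder stays inactive.
        return {"multichannel_decoder": False, "targets": None}
    if output_channels >= 3:
        # Multi-channel output: the multi-channel decoder is activated and the
        # multi-channel information (MI) maps the BGO to default channels.
        targets = ["C"] if bgo_channels == 1 else ["L", "R"]
        return {"multichannel_decoder": True, "targets": targets}
    raise NotImplementedError("combination not covered by the description")


# A mono BGO played back on a 5.1 system (six output channels).
plan = plan_bgo_output(1, 6)
```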
Fig. 9 is a view for explaining the concept of outputting a mono background object (BGO) signal based on the 5-1-5₁ tree structure, and Fig. 10 is a view for explaining the concept of outputting a mono background object (BGO) signal based on the 5-1-5₂ tree structure.
Referring to Fig. 9, the 5-1-5₁ tree structure (first tree structure) is provided for the multi-channel decoder 260B to upmix a mono input into 5.1 channels. To map a mono BGO (M₀) to the center channel (C) in this 5-1-5₁ configuration, the channel level difference (CLD) corresponding to each channel dividing module OTT can be set. For example, setting the channel level difference CLD₀ corresponding to OTT₀ to its maximum value (+150 dB) maps the entire level of the input channel to the upper of the two outputs of OTT₀ (i.e., the channel input to OTT₁). On the same principle, CLD₁ is set to -150 dB so that the signal is mapped to the lower output. If CLD₄ is then set to +150 dB, the entire mono BGO is automatically mapped to the center channel of the 5-1-5₁ tree structure. The remaining CLDs (CLD₃ and CLD₂) may each be set to an arbitrary value.
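The effect of these CLD settings can be checked numerically with a small Python sketch. Two things here are assumptions rather than statements from the text: each OTT box is modelled as an energy-preserving split whose upper/lower power ratio equals the CLD, and the wiring of the boxes that the text does not spell out (OTT₂ feeding the surround pair, OTT₃ feeding L/R, OTT₄ feeding C/LFE) is filled in for illustration.

```python
import math


def ott_gains(cld_db):
    """Return (upper, lower) amplitude gains of one OTT box for a CLD in dB.

    Assumed energy-preserving split: upper**2 + lower**2 == 1.
    """
    ratio = 10.0 ** (cld_db / 10.0)
    upper = math.sqrt(ratio / (1.0 + ratio))
    lower = math.sqrt(1.0 / (1.0 + ratio))
    return upper, lower


def gains_5151(cld):
    """Per-channel gains of a mono input in the assumed 5-1-5_1 wiring."""
    u0, l0 = ott_gains(cld[0])  # OTT0: upper -> OTT1, lower -> OTT2
    u1, l1 = ott_gains(cld[1])  # OTT1: upper -> OTT3, lower -> OTT4
    u2, l2 = ott_gains(cld[2])  # OTT2: Ls / Rs (assumed)
    u3, l3 = ott_gains(cld[3])  # OTT3: L / R (assumed)
    u4, l4 = ott_gains(cld[4])  # OTT4: C / LFE (assumed)
    return {
        "L": u0 * u1 * u3, "R": u0 * u1 * l3,
        "C": u0 * l1 * u4, "LFE": u0 * l1 * l4,
        "Ls": l0 * u2, "Rs": l0 * l2,
    }


# Settings from the text; CLD2 and CLD3 are arbitrary, so 0 dB is used here.
center_only = gains_5151({0: 150.0, 1: -150.0, 2: 0.0, 3: 0.0, 4: 150.0})
```

Under these assumptions the gain on C is essentially 1 and every other channel receives a negligible share, which is why the values of CLD₂ and CLD₃ do not matter.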
Fig. 10 shows the 5-1-5₂ tree structure (second tree structure) for upmixing a mono input into 5.1 channels. The channel level differences can be set on the same principle as for the 5-1-5₁ tree structure. Specifically, to output a mono BGO to the center channel C, CLD₀ is set to -150 dB, CLD₁ is set to -150 dB, and CLD₂ is set to +150 dB. The remaining CLDs (CLD₃, CLD₂) may each be set to an arbitrary value.
Fig. 11 is a view for explaining the concept of outputting a stereo background object (BGO) signal based on the 5-2-5 tree structure.
Referring to Fig. 11, a 5-2-5 configuration is provided, which is a tree structure for upmixing a stereo input into 5.1 channels. The TTT parameters of the TTT₀ module can be determined so that its output is [L, R, 0]. By setting each of CLD₂ and CLD₁ to +150 dB, the stereo BGO can be mapped to the left channel L and the right channel R, respectively. Since only a signal of negligible level is input to OTT₀, CLD₀ may be set to an arbitrary value.
2. BGO rendering according to the user's intention
In the automatic BGO rendering according to the output mode described above, a mono BGO is automatically mapped to the center channel, and a stereo BGO is automatically mapped to the left and right channels. However, a mono/stereo BGO can also be rendered according to the user's intention. In that case, the user's control over BGO rendering can be input as mix information (MXI).
For example, under the user's control, a mono BGO can be rendered to the left channel and the right channel at the same level. To do so with the 5-1-5₁ tree structure shown in Fig. 9, CLD₀ is set to +150 dB, CLD₁ is set to +150 dB, and CLD₃ is set to 0. If, under the user's control, a mono BGO is output to all of the 5.1 channels at the same level, CLD₀ to CLD₄ may each be set to a value in the range of -2 to 2 dB.
In general, following the above scheme, any CLD value can be set according to the user's intention by the following formula.
(Formula 1)
$$\mathrm{CLD}_k^{l,m} = 20\log_{10}\left\{\frac{m_{k,\mathrm{upper}}^{l,m}}{m_{k,\mathrm{lower}}^{l,m}}\right\}$$
In Formula 1, $l$ denotes the time slot, $m$ denotes the hybrid subband index, and $k$ denotes the index of the OTT box. $m_{k,\mathrm{upper}}^{l,m}$ denotes the desired amount for the upper path, and $m_{k,\mathrm{lower}}^{l,m}$ denotes the desired amount for the lower path.
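Formula 1 can be turned into a small helper, sketched here in Python for a single time slot and subband. The clamping to ±150 dB is an assumption, motivated by the text treating +150 dB as the maximum CLD value; it also gives a finite result when one of the two desired amounts is zero. The function name is hypothetical.

```python
import math

CLD_LIMIT_DB = 150.0  # assumed bound, taken from the +/-150 dB values in the text


def cld_from_amounts(upper, lower):
    """CLD in dB for one OTT box from the desired upper/lower path amounts."""
    if upper < 0.0 or lower < 0.0:
        raise ValueError("desired amounts must be non-negative")
    if upper == 0.0 and lower == 0.0:
        raise ValueError("CLD is undefined when both amounts are zero")
    if lower == 0.0:
        return CLD_LIMIT_DB       # everything goes to the upper path
    if upper == 0.0:
        return -CLD_LIMIT_DB      # everything goes to the lower path
    cld = 20.0 * math.log10(upper / lower)
    return max(-CLD_LIMIT_DB, min(CLD_LIMIT_DB, cld))


# Equal desired amounts on both paths give a CLD of 0 dB.
equal_split = cld_from_amounts(1.0, 1.0)
```

With equal amounts the helper returns 0 dB, which matches the CLD₃ = 0 setting used above to render a mono BGO to the left and right channels at the same level.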
Fig. 12 is a block diagram of an example of a decoder for extracting a foreground object (FGO) signal in solo mode.
Referring to Fig. 12, the decoder 200C includes elements having the same names as those of the decoder 200 shown in Fig. 3. The decoders 200A.1, 200A.2 and 200B shown in Figs. 6, 7 and 8 operate in karaoke mode and output the BGO. In contrast, the decoder 200C corresponds to solo mode (or a cappella mode) and outputs at least one FGO. Specifically, according to the downmix processing information (DPI), the rendering unit 224C suppresses all of the background object BGO and outputs only the FGO. If the output mode has at least three channels, the multi-channel decoder 260C is activated, and the information generating unit 240C generates multi-channel information (MI) for upmixing the FGO.
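The contrast between karaoke mode and solo mode can be sketched in Python as a choice of per-object gains. This is a simplified, single-channel illustration under the assumption that the extracted BGO and FGOs are available as equal-length sample lists; the function name and the `bgo_gain` parameter are hypothetical.

```python
def render_mode(bgo, fgos, mode, bgo_gain=1.0):
    """Mix an extracted BGO and FGOs into one channel for the given mode.

    karaoke: FGO gain is 0, the BGO is kept (optionally scaled by bgo_gain).
    solo:    the BGO is fully suppressed, the FGOs are kept.
    """
    if mode == "karaoke":
        bgo_factor, fgo_factor = bgo_gain, 0.0
    elif mode == "solo":
        bgo_factor, fgo_factor = 0.0, 1.0
    else:
        raise ValueError("mode must be 'karaoke' or 'solo'")
    output = []
    for t in range(len(bgo)):
        fgo_sum = sum(fgo[t] for fgo in fgos)
        output.append(bgo_factor * bgo[t] + fgo_factor * fgo_sum)
    return output


background = [1.0, 2.0]
foregrounds = [[0.5, 0.5], [0.25, 0.25]]
karaoke_out = render_mode(background, foregrounds, "karaoke")
solo_out = render_mode(background, foregrounds, "solo")
```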
In this case, spatial parameters such as the CLDs in the multi-channel information (MI) can be used to set how the at least one FGO is mapped to the multiple channels. If a single FGO is input to the multi-channel decoder 260C, the CLD values can be determined by the following formula according to preset information or the user's intention.
(Formula 2)
$$\mathrm{CLD}_k^{l,m} = 20\log_{10}\left\{\frac{m_{k,\mathrm{upper}}^{l,m}}{m_{k,\mathrm{lower}}^{l,m}}\right\}$$
In Formula 2, $l$ denotes the time slot, $m$ denotes the hybrid subband index, and $k$ denotes the index of the OTT box. $m_{k,\mathrm{upper}}^{l,m}$ denotes the desired amount for the upper path, and $m_{k,\mathrm{lower}}^{l,m}$ denotes the desired amount for the lower path.
In the case of multiple FGOs rather than a single FGO, the CLD can be determined by the following formula.
(Formula 3)
$$\mathrm{CLD}_k^{l,m} = 10\log_{10}\left\{\frac{\sum_i \left(m_{i,k,\mathrm{upper}}^{l,m}\,\mathrm{OLD}_i^{l,m}\right)^2}{\sum_i \left(m_{i,k,\mathrm{lower}}^{l,m}\,\mathrm{OLD}_i^{l,m}\right)^2}\right\}$$
In Formula 3, $l$ denotes the time slot, $m$ denotes the hybrid subband index, and $k$ denotes the index of the OTT box. $m_{i,k,\mathrm{upper}}^{l,m}$ denotes the desired amount for the upper path for the i-th FGO, $m_{i,k,\mathrm{lower}}^{l,m}$ denotes the desired amount for the lower path for the i-th FGO, and $\mathrm{OLD}_i$ denotes the object level difference for the i-th FGO.
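Formula 3 can be sketched in Python for one time slot and subband as follows. The function name is hypothetical, and the clamp to ±150 dB is an assumption based on the extreme CLD values used elsewhere in the description. With a single FGO whose OLD is 1, the result reduces to the single-object expression 20·log₁₀(upper/lower).

```python
import math

CLD_LIMIT_DB = 150.0  # assumed bound, taken from the +/-150 dB values in the text


def cld_for_fgos(upper_amounts, lower_amounts, olds):
    """CLD in dB for one OTT box when several FGOs share it (Formula 3).

    upper_amounts[i], lower_amounts[i]: desired path amounts of the i-th FGO.
    olds[i]: object level difference of the i-th FGO.
    """
    if not (len(upper_amounts) == len(lower_amounts) == len(olds)):
        raise ValueError("all inputs need one entry per FGO")
    upper_energy = sum((m * o) ** 2 for m, o in zip(upper_amounts, olds))
    lower_energy = sum((m * o) ** 2 for m, o in zip(lower_amounts, olds))
    if upper_energy == 0.0 and lower_energy == 0.0:
        raise ValueError("CLD is undefined when both paths are empty")
    if lower_energy == 0.0:
        return CLD_LIMIT_DB
    if upper_energy == 0.0:
        return -CLD_LIMIT_DB
    cld = 10.0 * math.log10(upper_energy / lower_energy)
    return max(-CLD_LIMIT_DB, min(CLD_LIMIT_DB, cld))


# Two FGOs: the first goes up, the second (half the level) goes down.
two_fgo_cld = cld_for_fgos([1.0, 0.0], [0.0, 1.0], [1.0, 0.5])
```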
Fig. 13 is a block diagram of an example of a decoder for extracting at least two foreground objects (FGOs) in solo mode.
Referring to Fig. 13, the decoder 200D includes elements having the same names as those of the decoder 200 shown in Fig. 3, and these elements perform functions similar to those of the decoder 200 shown in Fig. 3. However, the extraction unit 222D extracts at least two FGOs from the downmix. In this case, a first FGO (FGO₁) and a second FGO (FGO₂) are completely reconstructed. Subsequently, the rendering unit 224D performs solo mode, in which the BGO is completely suppressed and the at least two FGOs are output.
Consider a case in which the first FGO (FGO₁) is mono and the second FGO (FGO₂) is stereo. When the user renders the mono FGO₁ to the center channel of the 5.1 channels and the stereo FGO₂ to the left and right channels of the 5.1 channels, the rendering unit 224D does not output the FGOs directly but activates the multi-channel decoder 260D.
The rendering unit 224D generates a combined FGO (FGO_C) by combining the at least two FGOs (FGO₁ and FGO₂). In this case, the combined FGO (FGO_C) can be generated by the following formula.
(Formula 4)
$$L = \sum_i m_i \cdot \mathrm{FGO}_i$$
$$R = \sum_i n_i \cdot \mathrm{FGO}_i$$
Here, $m_i$ and $n_i$ are the mixing gains with which the i-th FGO is mixed into the left channel and the right channel, respectively.
The processing for generating the combined FGO can be performed in the time domain or in the subband domain.
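Formula 4 amounts to a weighted sum per output channel, as the following time-domain Python sketch shows. For simplicity it assumes each FGO is a single sequence of samples of equal length; the function and variable names are hypothetical.

```python
def combine_fgos(fgos, left_gains, right_gains):
    """Mix several FGOs into a left/right pair using per-object gains.

    fgos: list of FGO signals, each a list of samples of equal length.
    left_gains[i], right_gains[i]: mixing gains m_i and n_i of the i-th FGO.
    """
    if not (len(fgos) == len(left_gains) == len(right_gains)):
        raise ValueError("one left gain and one right gain per FGO are required")
    length = len(fgos[0])
    left = [sum(m * fgo[t] for m, fgo in zip(left_gains, fgos))
            for t in range(length)]
    right = [sum(n * fgo[t] for n, fgo in zip(right_gains, fgos))
             for t in range(length)]
    return left, right


# The first FGO goes only to the left, the second (doubled) only to the right.
fgo_signals = [[1.0, 2.0], [0.5, 0.5]]
left_channel, right_channel = combine_fgos(fgo_signals, [1.0, 0.0], [0.0, 2.0])
```

The same weighted sum could be applied per subband instead of per time-domain sample.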
In the process of generating the combined FGO by an OTT⁻¹ or TTT⁻¹ module, a residual (residual_C) is extracted and then delivered to the multi-channel decoder 260D. This residual (residual_C) may be delivered to the multi-channel decoder 260D separately. Alternatively, the residual (residual_C) may be encoded by the signal generating unit 240D in accordance with the multi-channel information (MI) bitstream scheme and then delivered to the multi-channel decoder.
Subsequently, the multi-channel decoder 260D can use the residual (residual_C) to completely reconstruct the at least two FGOs (FGO₁ and FGO₂) from the combined FGO (FGO_C). Because the TTT (2-to-3) module of a related-art multi-channel decoder is imperfect, the FGOs (FGO₁ and FGO₂) may not be completely separated from each other. The present invention, however, uses the residual to prevent the degradation caused by such incomplete separation.
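Why a residual makes complete reconstruction possible can be seen from a toy Python example. This is only an analogy: it uses a plain sum/difference pair on two mono signals, whereas the modules described here operate on parametric downmixes and their prediction residuals. All names are hypothetical.

```python
def combine_with_residual(first, second):
    """Combine two signals and keep the residual needed to undo the combination."""
    combined = [(a + b) / 2.0 for a, b in zip(first, second)]
    residual = [(a - b) / 2.0 for a, b in zip(first, second)]
    return combined, residual


def reconstruct(combined, residual):
    """Recover both original signals from the combined signal and the residual."""
    first = [c + r for c, r in zip(combined, residual)]
    second = [c - r for c, r in zip(combined, residual)]
    return first, second


fgo_1 = [1.0, -0.5, 0.25]
fgo_2 = [0.5, 0.5, -1.0]
fgo_c, residual_c = combine_with_residual(fgo_1, fgo_2)
restored_1, restored_2 = reconstruct(fgo_c, residual_c)
```

Without the residual, only the combined signal would be available and the two objects could be separated only approximately.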
The audio signal processing apparatus according to the present invention can be used in various products. These products can be broadly divided into a stand-alone group and a portable group. The stand-alone group may include TVs, monitors, set-top boxes, and the like. The portable group may include PMPs, mobile phones, navigation systems, and the like.
Fig. 14 is a schematic block diagram of a product in which an audio signal processing apparatus according to an embodiment of the present invention is implemented.
Referring to Fig. 14, the wire/wireless communication unit 310 receives a bitstream via a wire/wireless communication system. Specifically, the wire/wireless communication unit 310 may include at least one of a wired communication unit 310A, an infrared unit 310B, a Bluetooth unit 310C, and a wireless LAN unit 310D.
The user authentication unit 320 receives user information and then performs user authentication. The user authentication unit 320 may include at least one of a fingerprint recognition unit 320A, an iris recognition unit 320B, a face recognition unit 320C, and a voice recognition unit 320D. The fingerprint recognition unit 320A, the iris recognition unit 320B, the face recognition unit 320C, and the voice recognition unit 320D receive fingerprint information, iris information, face information, and voice information, respectively, and convert it into user information. User authentication is performed by determining whether each piece of user information matches previously registered user data.
The input unit 330 is an input device that enables the user to enter various commands, and may include at least one of a keypad unit 330A, a touchpad unit 330B, and a remote controller unit 330C, although the present invention is not limited thereto.
The signal coding unit 340 encodes or decodes an audio signal and/or a video signal received via the wire/wireless communication unit 310, and then outputs an audio signal in the time domain. The signal coding unit 340 includes an audio signal processing apparatus 345. As mentioned in the foregoing description, the audio signal processing apparatus 345 corresponds to the above-described embodiments of the present invention (i.e., the encoder side 100 and/or the decoder side 200). Thus, the audio signal processing apparatus 345 and the signal coding unit that includes it can be implemented by one or more processors.
The control unit 350 receives input signals from the input devices and controls all processes of the signal coding unit 340 and the output unit 360. The output unit 360 is an element configured to output the output signals generated by the signal coding unit 340 and the like, and may include a loudspeaker unit 360A and a display unit 360B. If the output signal is an audio signal, it is output through the loudspeaker. If the output signal is a video signal, it is output through the display.
Fig. 15 is a view for explaining the relationship between products in which an audio signal processing apparatus according to an embodiment of the present invention is implemented. Specifically, Fig. 15 shows the relationship between a terminal and a server, each corresponding to the product shown in Fig. 14.
Referring to Fig. 15(A), a first terminal 300.1 and a second terminal 300.2 can exchange data or bitstreams bidirectionally with each other via their wire/wireless communication units. Referring to Fig. 15(B), a server 400 and the first terminal 300.1 can perform wire/wireless communication with each other.
The audio signal processing method according to the present invention can be implemented as a computer-executable program and stored in a computer-readable recording medium. Multimedia data having the data structure of the present invention can also be stored in a computer-readable recording medium. Computer-readable media include all kinds of recording devices in which data readable by a computer system is stored. Examples include ROM, RAM, CD-ROM, magnetic tape, floppy disks, and optical data storage devices, as well as carrier-wave type implementations (e.g., transmission via the Internet). A bitstream generated by the above encoding method can be stored in a computer-readable recording medium or transmitted via a wired/wireless communication network.
Industrial Applicability
Accordingly, the present invention is applicable to processing and outputting audio signals.
Although the present invention has been described and illustrated herein with reference to its preferred embodiments, it will be apparent to those skilled in the art that various modifications and variations can be made without departing from the spirit and scope of the invention. Thus, the present invention is intended to cover the modifications and variations of this invention that come within the scope of the appended claims and their equivalents.

Claims (11)

1. A method for processing an audio signal, comprising:
receiving a downmix signal, a residual signal, and object information;
extracting at least one of a background object signal and a foreground object signal from the downmix signal using the residual signal;
receiving mix information including gain control information for the background object signal;
generating downmix processing information based on the object information and the mix information; and
generating a processed downmix signal including a modified background object signal by applying the downmix processing information to the at least one of the background object signal and the foreground object signal, wherein an adjusted gain corresponding to the gain control information is applied to the modified background object signal.
2. The method according to claim 1, wherein the at least one of the background object signal and the foreground object signal is extracted further using the object information.
3. The method according to claim 1, wherein the background object signal corresponds to one of a mono signal and a stereo signal.
4. The method according to claim 1, wherein the processed downmix signal corresponds to a time-domain signal.
5. The method according to claim 1, further comprising:
generating multi-channel information using the object information and the mix information; and
generating a multi-channel signal using the multi-channel information and the processed downmix signal.
6. An apparatus for processing an audio signal, comprising:
a multiplexer that receives a downmix signal, a residual signal, and object information;
an extraction unit that extracts at least one of a background object signal and a foreground object signal from the downmix signal using the residual signal;
an information generating unit that receives mix information including gain control information for the background object signal, and generates downmix processing information based on the object information and the mix information; and
a rendering unit that generates a processed downmix signal including a modified background object signal by applying the downmix processing information to the at least one of the background object signal and the foreground object signal, wherein an adjusted gain corresponding to the gain control information is applied to the modified background object signal,
wherein, when the mix information includes the gain control information for the background object signal, the processed downmix signal includes the modified background object signal, and wherein the adjusted gain corresponding to the gain control information is applied to the modified background object signal.
7. The apparatus according to claim 6, wherein the at least one of the background object signal and the foreground object signal is extracted further using the object information.
8. The apparatus according to claim 6, wherein the background object signal corresponds to one of a mono signal and a stereo signal.
9. The apparatus according to claim 6, wherein the processed downmix signal corresponds to a time-domain signal.
10. The apparatus according to claim 6, further comprising:
a multi-channel decoder that generates a multi-channel signal using multi-channel information and the processed downmix signal,
wherein the information generating unit generates the multi-channel information using the object information and the mix information.
11. A computer-readable medium having instructions stored thereon which, when executed by a processor, cause the processor to perform operations comprising:
receiving a downmix signal, a residual signal, and object information;
extracting at least one of a background object signal and a foreground object signal from the downmix signal using the residual signal;
generating downmix processing information based on the object information and mix information; and
generating a processed downmix signal by applying the downmix processing information to the at least one of the background object signal and the foreground object signal,
wherein, when the mix information includes gain control information for the background object signal, the processed downmix signal includes a modified background object signal, and wherein an adjusted gain corresponding to the gain control information is applied to the modified background object signal.
CN2009801490217A 2008-12-05 2009-12-07 A method and an apparatus for processing an audio signal Pending CN102239520A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US12005708P 2008-12-05 2008-12-05
US61/120,057 2008-12-05
KR1020090119980A KR20100065121A (en) 2008-12-05 2009-12-04 Method and apparatus for processing an audio signal
KR10-2009-0119980 2009-12-04
PCT/KR2009/007265 WO2010064877A2 (en) 2008-12-05 2009-12-07 A method and an apparatus for processing an audio signal

Publications (1)

Publication Number Publication Date
CN102239520A true CN102239520A (en) 2011-11-09

Family

ID=42364598

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2009801490217A Pending CN102239520A (en) 2008-12-05 2009-12-07 A method and an apparatus for processing an audio signal

Country Status (2)

Country Link
KR (1) KR20100065121A (en)
CN (1) CN102239520A (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015152661A1 (en) * 2014-04-02 2015-10-08 삼성전자 주식회사 Method and apparatus for rendering audio object

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008046531A1 (en) * 2006-10-16 2008-04-24 Dolby Sweden Ab Enhanced coding and parameter representation of multichannel downmixed object coding
WO2008063035A1 (en) * 2006-11-24 2008-05-29 Lg Electronics Inc. Method for encoding and decoding object-based audio signal and apparatus thereof
US20080205670A1 (en) * 2006-12-07 2008-08-28 Lg Electronics, Inc. Method and an Apparatus for Decoding an Audio Signal

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ISO/IEC: "Call for Proposals on Spatial Audio Object Coding", 《ISO/IEC JTC1/SC29/WG11 MPEG2007/N8853》 *
JONAS ENGDEGARD ET AL: "Spatial Audio Object Coding (SAOC) – Upcoming MPEG Standard", 《124TH CONVENTION OF AUDIO ENGINEERING SOCIETY》 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10341800B2 (en) 2012-12-04 2019-07-02 Samsung Electronics Co., Ltd. Audio providing apparatus and audio providing method
CN104969576A (en) * 2012-12-04 2015-10-07 三星电子株式会社 Audio providing apparatus and audio providing method
US9774973B2 (en) 2012-12-04 2017-09-26 Samsung Electronics Co., Ltd. Audio providing apparatus and audio providing method
US10149084B2 (en) 2012-12-04 2018-12-04 Samsung Electronics Co., Ltd. Audio providing apparatus and audio providing method
US10659900B2 (en) 2013-07-22 2020-05-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for low delay object metadata coding
US10277998B2 (en) 2013-07-22 2019-04-30 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for low delay object metadata coding
US10249311B2 (en) 2013-07-22 2019-04-02 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Concept for audio encoding and decoding for audio channels and audio objects
CN105593930B (en) * 2013-07-22 2019-11-08 弗朗霍夫应用科学研究促进协会 The device and method that Spatial Audio Object for enhancing encodes
CN105593930A (en) * 2013-07-22 2016-05-18 弗朗霍夫应用科学研究促进协会 Apparatus and method for enhanced spatial audio object coding
US10701504B2 (en) 2013-07-22 2020-06-30 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for realizing a SAOC downmix of 3D audio content
US10715943B2 (en) 2013-07-22 2020-07-14 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for efficient object metadata coding
US11227616B2 (en) 2013-07-22 2022-01-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Concept for audio encoding and decoding for audio channels and audio objects
US11330386B2 (en) 2013-07-22 2022-05-10 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for realizing a SAOC downmix of 3D audio content
US11337019B2 (en) 2013-07-22 2022-05-17 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for low delay object metadata coding
US11463831B2 (en) 2013-07-22 2022-10-04 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for efficient object metadata coding
US11910176B2 (en) 2013-07-22 2024-02-20 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for low delay object metadata coding
US11984131B2 (en) 2013-07-22 2024-05-14 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Concept for audio encoding and decoding for audio channels and audio objects

Also Published As

Publication number Publication date
KR20100065121A (en) 2010-06-15

Similar Documents

Publication Publication Date Title
CN101248483B (en) Generation of multi-channel audio signals
US9311919B2 (en) Apparatus and method for coding and decoding multi-object audio signal with various channel
EP2209328B1 (en) An apparatus for processing an audio signal and method thereof
CN101553867B (en) A method and an apparatus for processing an audio signal
CN102100009B (en) A method and an apparatus for processing an audio signal
CN102099854B (en) A method and an apparatus for processing an audio signal
US9502043B2 (en) Method and an apparatus for processing an audio signal
CN102334158A (en) Upmixer, method and computer program for upmixing a downmix audio signal
CN102239520A (en) A method and an apparatus for processing an audio signal
CN102007532A (en) A method and an apparatus for processing an audio signal
CN102696070A (en) An apparatus for processing an audio signal and method thereof
CN104756186A (en) Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases
CN102007533B (en) A method and an apparatus for processing an audio signal
CN102292768B (en) A method and an apparatus for processing an audio signal
CN101361114A (en) Apparatus for processing media signal and method thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20111109