CN101233570B - Method for generating encoded audio signal and method for processing audio signal - Google Patents

Method for generating encoded audio signal and method for processing audio signal Download PDF

Info

Publication number
CN101233570B
CN101233570B CN2006800277662A CN200680027766A CN101233570B CN 101233570 B CN101233570 B CN 101233570B CN 2006800277662 A CN2006800277662 A CN 2006800277662A CN 200680027766 A CN200680027766 A CN 200680027766A CN 101233570 B CN101233570 B CN 101233570B
Authority
CN
China
Prior art keywords
channel
configuration information
layer
sound
cut apart
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN2006800277662A
Other languages
Chinese (zh)
Other versions
CN101233570A (en
Inventor
吴贤午
房熙锡
金东秀
林宰显
金孝镇
郑亮源
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020060004048A external-priority patent/KR20070031212A/en
Priority claimed from KR1020060017659A external-priority patent/KR20070014936A/en
Application filed by LG Electronics Inc filed Critical LG Electronics Inc
Publication of CN101233570A publication Critical patent/CN101233570A/en
Application granted granted Critical
Publication of CN101233570B publication Critical patent/CN101233570B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

Abstract

A method for generating an encoded audio signal, and a method for processing the same during the multi-channel audio coding are disclosed. The present invention provides the method for generating an encoded audio signal comprising: including basic configuration information requisite for a multi-channel audio coding process; and including extension configuration information, wherein the extension configuration information includes configuration information of extension environment which is identified by a type identifier (ID) .

Description

Generate the method for encoded sound signal and the method for audio signal
Technical field
The present invention relates to a kind of multi-channel encoder method, relate in particular to a kind of method of encoded sound signal and a kind of method of audio signal of generating.
Background of invention
In general, signal (for example piece, frequency band and sound channel) configuration in every way.In signal can keep the static period of the statistical property of being scheduled to, above-mentioned signal need not to be divided into some unit can be processed, because this is favourable for compressed signal.
Transient state that characteristics of signals suddenly changes in the period preferably with the partitioning scheme processing signals, because prevented distorted signals.
Yet, if the user wants to handle aforementioned signal with partitioning scheme, the but detailed method of information signalingization that will be through not cutting apart.Therefore, be difficult to handle effectively described signal.
Summary of the invention
Therefore, the present invention relates to a kind of can the elimination in essence because the limitation of correlation technique and the method with the carve information signalingization of one or more problems that defective causes.
The one object of the present invention that is used to deal with problems is a kind of method of signal signalingization that effectively will be through cutting apart.
Purpose of the present invention can be reached by a kind of method that generates encoded sound signal is provided, and described method comprises: introduce the necessary basic configuration information of multi-channel audio coding process; And introduce expanded configuration information, wherein this expanded configuration information comprises the configuration information by the expansion environment of type identifier (ID) sign.
Brief Description Of Drawings
Be included in this and so that the accompanying drawing to further understanding of the present invention to be provided embodiments of the present invention be shown, it can be used to explain principle of the present invention with instructions.
In the accompanying drawings:
Fig. 1 is the concept map that illustrates according to the method for signaling of the piece carve information of one embodiment of the present invention;
Fig. 2 and Fig. 3 are the concept maps that illustrates according to the method for signaling of the frequency band of one embodiment of the present invention and sound channel carve information;
Fig. 4 is the concept map of method that the establishment multi-channel signal of another embodiment according to the present invention is shown; With
Fig. 5 is the concept map of method for signaling that the sound channel carve information of another embodiment according to the present invention is shown.
Embodiment
Below will be in detail with reference to better embodiment of the present invention, its concrete exemplary plot is shown in the drawings.
Describe below in conjunction with the method for signaling of accompanying drawing carve information according to the present invention (being also referred to as " splitting information ").
Method for signaling according to carve information of the present invention is classified according to the signal classification.
Before describing the present invention, should be understood that described signal disposes in every way, for example piece, frequency band and sound channel.
Described " method for signaling " can comprise the implication of " signalingization " or the implication of " signal of identification signalingization ".
Term " node " is that the indication signal has been cut apart or undivided point.
Term " spatial information " is can multi-channel audio or the information of channel expansion audio mixing multi-channel signal.
Should be pointed out that " spatial information " but the representation space parameter, yet it is not limited to described example, but can be applied to other example when needed.
Described spatial parameter is the levels of channels poor (CLD) of energy difference between two sound channels of indication, the sound channel predictive coefficient (CPC) of indicating the inter-channel coherence (ICC) of correlativity between two sound channels and being used for creating from two sound channels three sound channels.
Below to piece cut apart, band segmentation and sound channel cut apart and be elaborated.
1) piece is cut apart
Requirement is handled to compress continuous data in the time domain in the mode identical with sound signal with piece.
Term " piece processing " is illustrated on the interval of preset distance and handles input signal with partitioning scheme.
In this case, described interval is defined as " piece ", and the one or more formations " frame " that combine.
Described frame can represent to be used to send/store the unit of data.
Term " piece is cut apart " or " piece partition " can be represented a kind of detailed process, input signal are become the piece of different length in this process during signal Processing.
Term " block length information " is meant and is shown in the customizing messages of handling the block length that is obtained when input signal becomes a plurality of of different length with input signal.
In general, if signal with the configuration of the form of piece, then signal Processing is finished with long piece or short block.
Using under the situation of short block,, and making piece after the combination corresponding to single long piece with some short blocks combinations.
Yet for each at interval, signal has various characteristic, therefore is difficult to determine that fatefully all signals all can be handled according to long block signal processing scheme and short block signal Processing scheme.
Preferably, select the piece of a length-specific in specific interval from the piece of the different length that is fit to characteristics of signals, execution block is cut apart on selected block subsequently.
In more detail, all is configured to have two or more different length.Can from frame, select the piece of the predetermined length in these two or more different length pieces in every way.
For this reason, need indicate to comprise which piece in the present frame, so need be used for the method for signaling of aforesaid operations.
Described method for signaling is divided into order method for signaling and classification method for signaling.
Order method for signaling predefine frame length (i.e. the length of being represented by " N ") also uses the number M of minimum length piece to carry out the signaling process.
In this case, frame length " N " is the multiple of particular value M.Frame length can be a fixed value, perhaps can be the occurrence that can send to the destination as additional information.
For example, suppose that N is 2048 (N=2048), M is 256 (M=256), and all piece arranged with 256 → 256 → 1024 → 512 order, and then block length information can be carried out the signaling processing by M*1, M*1, M*4, M*2 → 1,1,4,2 → 0,0,3,1 order.
The classification method for signaling can be divided into method that sends layer depth information and the method that does not send layer depth information, below in conjunction with accompanying drawing it is elaborated.
Fig. 1 is the concept map that illustrates according to the method for signaling of the piece carve information of one embodiment of the present invention;
With reference to Fig. 1, each layer is represented with " layer ", and layer depth is set as " 5 ".
" layer 1 " comprises first 210, and it is the longest piece of the base unit cut apart as piece, and first 210 length is N.
Reference numeral (1), (2) ..., (a) and (b), (c) and (d) the exemplary binary signaling sequence of expression.
According to present embodiment, indicator dog whether represent by cutting apart ID (identifier) and not cutting apart ID by divided carve information.Optional network specific digit " 1 " is used as cuts apart ID, and optional network specific digit " 0 " is used as and does not cut apart ID.
The described ID of cutting apart and do not cut apart ID and in the node of each layer, represent.
Cut apart ID indication and be included in predetermined block in higher level's layer and be divided into halves in subordinate's layer, and also indicate to this subordinate's layer and distributed downstream site.
The predetermined block of not cutting apart in the ID indication higher level layer is not cut apart by subordinate's layer, and also indication does not have to this subordinate's layer distribution and do not cut apart the corresponding any downstream site of node that ID represents by this.Do not distribute downstream site to mean and do not carry out other signaling operation.
Because the value of first 210 piece carve information (1) is 1 in top (i.e. layer 1), therefore ground floor 210 execution blocks is cut apart.
Layer 2 as level layer under the layer 1 comprises two pieces 220 and 221, and the length of each piece is N/2.
The piece carve information (2) that is included in the piece 220 in the layer 2 has value " 1 ", and the piece carve information (3) of piece 221 has value " 1 ", thereby comprises four pieces 230,231,232 and 233 as the layer 3 of level layer under the layer 2, and each block length is N/4.
Be " 0 " with the value that is included in the piece carve information (4) that the piece 230 of layer in 3 be associated.The value of the piece carve information (5) that is associated with piece 231 is " 1 ".The value of the piece carve information (6) that is associated with piece 232 is " 1 ".Be " 0 " with the value that is included in the piece carve information (7) that the piece 233 of layer in 3 be associated.
Therefore, according to the piece carve information of layer 3, the piece 230 and 233 execution blocks of layer 3 are not cut apart, but the piece 231 and 232 execution blocks of layer 3 are cut apart.
In this case, do not distribute downstream sites to layer 4 as subordinate's layer of the described piece of cutting apart without piece 230 of layer 3 and 233.
The piece of cutting apart through piece 231 and 232 of layer 3 distributes downstream site to subordinate's layer.Whether the existence that piece is cut apart shows in downstream site.
Layer 4 length is N/8, and is included in the piece 240 and 241 that is partitioned on the basis of piece 231 of layer 3, also is included in other piece 242 and 243 that is partitioned on the basis of piece 232 of layer 3.
The value of the piece carve information (8) that is associated with the piece 240 of layer 4 is " 0 ".The value of the piece carve information (9) that is associated with the piece 241 of layer 4 is " 1." value of the piece carve information (a) that is associated with layer 4 piece 242 is " 0 ".The value of the piece carve information (b) that is associated with the piece 243 of layer 4 is " 0 ".
Therefore, according to the piece carve information of layer 4, execution block is not cut apart on the piece 240,242 and 243 of layer 4, but execution block is cut apart on the piece 241 of layer 4.
In this case, do not distribute downstream sites to layer 5 as subordinate's layer of the described piece of cutting apart without piece 240,242 of layer 4 and 243.
The piece of cutting apart through piece 241 of layer 4 distributes a downstream site to layer 5, thereby it indicates whether to exist piece to cut apart in described downstream site.
Layer 5 length is N/16, and is included in the piece 250 and 251 that is partitioned on the basis of piece 241 of layer 4.
The value of the piece carve information (c) that is associated with the piece 250 of layer 5 is " 0 ".The value of the piece carve information (d) that is associated with the piece 251 of layer 5 is " 0 ".
Therefore, the value of each contained piece is " 0 " in the layer 5, cuts apart thereby no longer carry out the classification piece, can identify so the piece of piece is cut apart the degree of depth.The layout structure of the piece that can be cut apart by the classification piece comprises N/4 piece (being that length is the piece of N/4), N/8 piece, N/16 piece, N/16 piece, N/8 piece, N/8 piece
If signal length is N, then the piece of cutting apart through piece has formula " N/x i" expression (and wherein i=1,2 ..., P, P is integer and x=2) any one (being N/2, N/4, N/8, N/16 and N/32 ...) in the length of expression.Expression can according to binary signaling sequence (1) (2) (3) (4) (5) (6) (7) (8) (9) (a) (b) (c) under the situation of (d) information of being cut apart by the piece of binary number representation, the piece carve information can be represented by 13 bits " 1110110010000 ".
Above explanation an example scenario is disclosed, the depth information in its middle level is not represented but separately only by being discerned by cutting apart ID and not cutting apart the piece carve information that ID represents.
Yet, should be noted that other piece carve information of other presentation layer depth information also can carry out the signaling processing.
For example, layer depth information can stop ID and cut apart continuation ID representing by cutting apart.
Stop ID can be illustrated in wherein the lowermost layer that execution block is no longer cut apart described cutting apart.Continue ID can represent all the other each layers except that lowermost layer described cutting apart.In this case, cut apart and continue ID and cut apart by " 1 " expression and stop ID and represent by " 0 ".
Layer depth shown in Fig. 1 is " 5 ", also can use to cut apart to stop ID " 0 " and cut apart continuation ID " 1 " representing with " 11110 ".
Sub-block length can be discerned by described method for signaling.Like this, in the situation of representing depth information separately, only can represent not cut apart ID, thereby the signaling processing procedure can carried out from working as anterior layer to the scope of the last layer of lowermost layer at the node place that distributes to lowermost layer.
For example, supposing to cut apart ID is not cut apart ID and is represented by " 0 " by " 1 " expression, and cut apart and continue ID by " 1 " expression and cut apart and stop ID by " 0 " expression, the node that indication distributes to lowermost layer whether divided particular value can be cut apart " 0 " expression of termination by indication.
2) band segmentation
Below in conjunction with Fig. 2-3 band segmentation is described.
Fig. 2 is the concept map of method for signaling that the band segmentation information of another embodiment according to the present invention is shown.
Fig. 2 illustrates the classification band segmentation of the tree structure that is configured to Methods of Subband Filter Banks.The frequency resolution of subband can define in every way, will be elaborated to it below.
The piece of comparing Fig. 1 is cut apart, and the band segmentation of Fig. 2 comprises a plurality of frequency bands in top, and the top of Fig. 1 is made of single long piece.
According to present embodiment, the indication frequency band whether represent by cutting apart ID and not cutting apart ID by divided band segmentation information.To be worth " 1 " as cutting apart ID, and be used as and do not cut apart ID and will be worth " 0 ".
Cut apart ID and do not cut apart ID and can indicate at every layer node place.
Cutting apart ID indicates the frequency band of M layer to be divided into halves at (M+1) layer.
Do not cut apart ID and indicate the frequency band of M layer not cut apart at (M+1) layer, also indication is not to subordinate's layer distribution and by not cutting apart the corresponding any downstream site of node that ID represents.Do not distribute downstream site to mean and do not carry out other signaling operation.
Comprise first to the 6th frequency band 310,311,312,313,314 and 315 as top layer 1.
The band segmentation information (1) of first frequency band 310 is represented by " 1 ".The band segmentation information (2) of second frequency band 311 is represented by " 1 ".The band segmentation information (3) of the 3rd frequency band 312 is represented by " 0 ".The band segmentation information (4) of the 4th frequency band 313 is represented by " 0 ".The band segmentation information (5) of the 5th frequency band 314 is represented by " 0 ".The band segmentation information (6) of the 6th frequency band 315 is represented by " 0 ".
Described band segmentation information is indicated in the node place that distributes to layer 1.According to band segmentation information (1) and (2), first frequency band 310 produces signal conversion module 310T, and second frequency band 311 produces signal conversion module 311T, thereby produces subordinate's frequency band 320,321,322 and 323 in layer 2.Distributed downstream site to subordinate's frequency band 320,321,322 and 323.It should be noted that described signal conversion module is also referred to as " frequency band conversion module " in the present embodiment.
Simultaneously, it is not carried out band shared the 3rd, the 4th, the 5th or the 6th frequency band 312,313,314 or 315 and do not produce the frequency band conversion module.Equally, in the 3rd, the 4th, the 5th or the 6th frequency band 312,313,314 or 315, do not produce and layer 2 corresponding subordinate frequency band.Therefore, do not distribute and 312,313,314 and 315 corresponding any downstream sites to layer 2.
Layer 2 is included in two frequency bands 320,321 that are partitioned on layer 1 frequency band 320 basis, and is included in two frequency bands 322 and 323 that are partitioned on the frequency band 311 of layer 1.
The band segmentation information (7) of frequency band 320 by " 1 " expression.The band segmentation information (8) of frequency band 321 by " 1 " expression.The band segmentation information (9) of frequency band 322 is represented by " 0 ".The band segmentation information (10) of frequency band 323 is represented by " 0 ".
According to described band segmentation information (7) and (8), frequency band 320 produces frequency band conversion module 320T, and frequency band 321 produces frequency band conversion module 321T, thereby produces subordinate's frequency band 330,331,332 and 333 in layer 3.Distributed downstream site to subordinate's frequency band 330,331,332 and 333.
Simultaneously, it is not carried out band shared frequency band 322 and 323 and do not produce the frequency band conversion module.In frequency band 322 and 323, do not produce and layer 3 corresponding subordinate frequency band yet.Therefore, do not distribute downstream site to frequency band 322 and 323 yet.
Layer 3 is included in two frequency bands 330,331 that are partitioned on layer 2 frequency band 320 basis, and is included in two frequency bands 332 and 333 that are partitioned on the frequency band 321 of layer 2.
The band segmentation information (11) of frequency band 330 by " 1 " expression.The band segmentation information (12) of frequency band 331 is represented by " 0 ".The band segmentation information (13) of the 3rd frequency band 332 is represented by " 0 ".The band segmentation information (14) of frequency band 333 is represented by " 0 ".
According to described band segmentation information (11), frequency band 330 produces signal conversion module 330T, and produces subordinate's frequency band 340 and 341 in layer 4.Distributed downstream site to subordinate's frequency band 340 and 341.
Simultaneously, it is not carried out band shared frequency band 331,332 and 333 and do not produce the frequency band conversion module.In frequency band 331,332 and 333, do not produce and layer 4 corresponding subordinate layer yet.Therefore, do not distribute downstream site to frequency band 322 and 323 yet.Therefore, do not distribute downstream site to frequency band 331,332 and 333 yet.
Layer 4 is included in two frequency bands 340 and 341 that are partitioned on the basis of layer 3 frequency band 330.
The band segmentation information (15) of frequency band 340 is represented by " 0 ".The band segmentation information (16) of frequency band 341 is represented by " 0 ".
Therefore, no longer include and can carry out band shared subordinate layer, the signaling processing procedure stops.In this case, lowermost layer equals layer 4.Under expression can be according to the situation of binary signaling sequence (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) (16) by binary number representation piece carve information, the piece carve information can be represented by 16 bits " 1100001100100000 ".
Fig. 3 is the block scheme of method for signaling that the band segmentation information of another embodiment according to the present invention is shown.
Compare with Fig. 2, with regard to carrying out band shared method, the band segmentation of Fig. 3 is similar to the situation of Fig. 2.
Yet as shown in Figure 3, the binary sequence of the band segmentation information among Fig. 3 is different with Fig. 2.
Therefore, under expression can be according to the situation of binary signaling sequence (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) (16) by binary number representation piece carve information, the piece carve information can be represented by 16 bits " 1110001001000000 ".
Above explanation an example scenario is disclosed, the depth information in its middle level is not represented but separately only by being discerned by cutting apart ID and not cutting apart the band segmentation information that ID represents.
Yet, should be noted that other band segmentation information of presentation layer depth information also can be carried out the signaling processing separately.For example, layer depth information can stop ID and cut apart continuation ID representing by cutting apart.
The described termination ID of cutting apart represents no longer to carry out band shared lowermost layer.Continue ID can represent all the other each layers except that lowermost layer described cutting apart.In this case, cut apart and continue ID and cut apart by " 1 " expression and stop ID and represent by " 0 ".
Layer depth shown in Fig. 2~3 is " 4 ", also can use to cut apart to stop ID " 0 " and cut apart continuation ID " 1 " being represented with " 1110 ".Subband length can be discerned by described method for signaling.
Like this, in the situation of representing depth information separately, only can represent not cut apart ID, thereby the signaling processing procedure can carried out from working as anterior layer to the scope of the last layer of lowermost layer at the node place that distributes to lowermost layer.
For example, supposing to cut apart ID is not cut apart ID and is represented by " 0 " by " 1 " expression, cut apart and continue ID by " 1 " expression and cut apart and stop ID by " 0 " expression, then indicate the node that distributes to lowermost layer whether divided particular value can cut apart " 0 " expression of termination by indication.
3) sound channel is cut apart
The sound channel carve information relates to the channel configuration information that is used for channel configuration, therefore hereinafter with reference to described channel configuration information sound channel is cut apart to be elaborated.
Especially, an example of the channel configuration that is obtained will at length be set forth when multi-channel audio signal is carried out Code And Decode.Fundamental space information is needed when multi-channel audio signal is encoded.Described fundamental space information comprise the basic configuration information that can represent the configuration information related with basic environmental facies and with the corresponding master data of described basic configuration information.In addition, multi-channel audio coding optionally requires extending space information.Described extending space information comprise indication and the expanded configuration information of expanding the configuration information that environment is associated and with the corresponding growth data of described expanded configuration information.The configuration information of described expansion environment can exist one or more.Described expansion environment can be identified by type i D.
Simultaneously, the channel configuration by the reference of described multi-channel signal coding mainly is divided into two kinds of channel configuration, promptly basic channel configuration and expansion channel configuration.
One or more channel configuration information are used as described basic channel configuration information.Especially, basic channel configuration information is indicated a channel configuration information of selecting from some channel configuration information.
For ease of explanation, basic channel configuration information is called as " fixedly channel configuration information ", and a plurality of sound channels (being multichannel) of creating according to fixing channel configuration information are called as " fixedly output channels ".
Fixedly channel configuration information is that the described fixing output channels of establishment is desired with the channel configuration data that is associated.
Fixedly channel configuration information can be represented a channel configuration constituent element in some channel configuration constituent elements of setting up in advance.The described channel configuration of setting up in advance can be represented in every way.For example, sound channel can be configured to the form of " 5-1-5 ", " 5-2-5 ", " 7-2-7 " or " 7-5-7 ".
Described " 5-2-5 " configuration can be represented a kind of concrete channel structure, and wherein six input sound channels are become two sound channels by multi-channel audio (downmixed), and are output to six sound channels through the sound channel of multi-channel audio." 5-2-5 " is configured to all the other outer channel configuration to have and the identical channel structure of " 5-2-5 " configuration.
Described fixedly channel configuration information is comprised in the basic configuration information, and the data that are associated with fixing channel configuration information are comprised in the master data.
Various parameters can be used as described master data, for example, inter-channel coherence (ICC) parameter of the correlativity between levels of channels poor (CLD) parameter of energy difference, two sound channels of indication and be used for creating sound channel predictive coefficient (CPC) parameter of three sound channels between two sound channels of indication from two sound channels.
The indication of described expansion channel configuration is according to the fixing channel configuration of channel configuration formation.
Described expansion channel configuration is to be formed arbitrarily by encoded signal.For ease of explanation, the expansion channel configuration information is called as any channel configuration information, and the multichannel of being created by any channel configuration information is called as any output channels.Described any channel configuration information is comprised in the expanded configuration information, and is identified by the type i D that is called sound channel ID.
Be comprised in the growth data with the corresponding any channel configuration data of any channel configuration information.
If desired, for simple to operate, described any channel configuration data can only be used the CLD parameter of energy difference between two sound channels of expression.Channel configuration information is represented by cutting apart ID and not cutting apart ID arbitrarily.The increase of cutting apart ID indication sound channel number as the ingredient of described any channel configuration information.Do not cut apart ID and indicate a kind of particular case, wherein the sound channel number does not change.
For example, cut apart input sound channel of ID indication and be converted into two output channels.Not cutting apart ID indication input sound channel does not do any change and promptly is output on the sound channel number.
Represented to cut apart under the situation of ID at higher level's node layer place, in subordinate's layer, created subordinate's sound channel, and distributed the downstream site corresponding with the sound channel of being created to subordinate's layer to higher level's layer channel allocation.
Yet, represented not cut apart in the situation of ID at higher level's node layer place to the channel allocation of higher level's layer, in subordinate's layer, do not create subordinate's sound channel, therefore do not distribute and the corresponding downstream site of subordinate's sound channel to subordinate's layer.
Cut apart ID and do not cut apart the method that ID represents described any channel configuration information below in conjunction with Fig. 2~3 pair use and describe.
Fig. 2~3 not only illustrate described band segmentation and sound channel also is shown cuts apart.
At first Fig. 2 is described in detail as follows.
Comprise six frequency bands 310,311,312,313,314 and 315 as top layer 1.Described frequency band 310,311,312,313,314 and 315 can serve as described fixing multichannel respectively.According to the present invention, cut apart ID and do not cut apart ID and represent by " 0 " by " 1 " expression.
The method of representing any channel configuration information is sequentially represented to be included in to the sound channel 310,311,312,313,314 of layer 1 and value " 0 " or " 1 " in 315 nodes that distribute.
The method of representing any channel configuration information is sequentially represented to be included in to the sound channel 320,321,322 of layer 2 and value " 0 " or " 1 " in 323 nodes that distribute.
The method of representing any channel configuration information is sequentially represented to be included in to the sound channel 330,331,332 of layer 3 and value " 0 " or " 1 " in 333 nodes that distribute.
The method of representing any channel configuration information is sequentially represented to be included in to the sound channel 340 of layer 4 and value " 0 " or " 1 " in 341 nodes that distribute.
Whether in other words, whether described method sequentially indicates the sound channel number to increase at the node place of higher level's layer, and sequentially indicate the sound channel number to increase at the node place of subordinate's layer subsequently.
Any channel configuration information according to described method is represented by 16 bits " 1100001100100000 ".
For ease of explanation, represent that the method for any channel configuration information is called as " hierarchical priority method ".
Method according to any channel configuration information of expression shown in Figure 3, if when the first node that obtains signaling higher level's layer as a result the time from the first node of higher level's layer by " 1 " expression, then whether sequentially increase with the corresponding all downstream site indication sound channel numbers of the first node of higher level's layer.If " when the first node that obtains signaling higher level's layer as a result the time from the first node of higher level's layer by " 0 " expression, then present node moves to higher level's Section Point, so that whether Section Point indication sound channel number sequentially increases.Therefore, any channel configuration information that is obtained by described method is represented by 16 bits " 1110001001000000 ".
For ease of explanation, represent that the method for any channel configuration information is called as " branch's priority approach ".
Describe the fixedly method of output channels and any output channels of creating in detail below with reference to Fig. 4.
Fig. 4 is the concept map that illustrates according to the method for establishment multi-channel signal of the present invention.
With reference to Fig. 4, create out any output channels (y) by the calculating between down-mix audio signal (x) and the fundamental matrix (m1), and create out another any output channels (z) by the fixing calculating between output channels (y) and the rearmounted matrix (m2).Can there be two or more fundamental matrixs (m1) where necessary.
The configuration element that can use at least one and described fixedly channel configuration information among CLD, ICC, the CPC to obtain fundamental matrix (m1).
Can use CLD and described any channel configuration information to obtain the configuration element of rearmounted matrix (m2).
To method that create any output channels be elaborated below.
At first, the method for using any channel configuration information to dispose any sound channel is elaborated.
Below to using described branch priority approach to represent that the illustrative methods of described any channel configuration information describes.
Described illustrative methods was sequentially discerned as the cutting apart ID and do not cut apart ID of the configuration constituent element of any channel configuration information, and according to the ID execution signal Processing that is identified.
If the ID that is identified is confirmed as cutting apart ID, then input sound channel is connected to the sound channel modular converter as an example of conversion of signals, consequently creates out two subordinate's sound channels.
Otherwise,, then the sound channel number is not made the aforementioned input sound channel of any change ground output if the ID that is identified is confirmed as not cutting apart ID.
To provide its detailed description below.
In the phase one, the initial value of ID number that be decoded is changed to " 1 ", and the initial value of output channels number is changed to " 0 " arbitrarily, and the initial value of sound channel modular converter number is changed to " 0 ".
In subordinate phase, decoded ID is wanted in identification.
In the phase III, if the ID that is identified is confirmed as cutting apart ID, then sound channel modular converter number increases progressively 1, and the ID number that will be identified increases progressively 1.
If the ID that is identified is confirmed as not cutting apart ID, then the output channels number increases progressively 1 arbitrarily, and the ID number that will be identified successively decreases 1.
Repeat aforementioned second and the phase III, up to wanting decoded ID number to arrive " 0 ".
Repeat aforementioned signal processing method according to fixing output channels number.For example, any channel configuration that obtains when any channel configuration information is represented by " 11100010010000 " is shown among Fig. 3.In this case, " 1 " expression is cut apart ID and ID is not cut apart in " 0 " expression.
The number of " 1 " is represented the number of sound channel modular converter (i.e. the signal conversion module of figure 03), and the number of " 0 " is represented the number of any output channels.
Simultaneously, fixedly output channels can be reset (that is, remapping) by different order, and can create out any output channels subsequently as shown in Figure 5.Fig. 5 is the concept map that illustrates according to the method for signaling of sound channel carve information of the present invention.
With reference to Fig. 5, fixedly output channels 310,311,312,313,314 and 315 is reset by remapping module 100.Fixedly output channels 310 ', 311 ', 312 ', 313 ', 314 ' and the 315 ' sound channel as the superiors after the rearrangement is to create described any output channels.Needless to say, can reset or remap described any output channels by different order.
Simultaneously, if comprising the sound channel map information that the sound channel of any channel configuration information is mapped to loudspeaker in the channel configuration information arbitrarily, then output channels also can be mapped to this loudspeaker arbitrarily.
The explanation of front discloses a kind of exemplary cases, presentation layer depth information separately not wherein, but can be by coming the identification layer depth information by cutting apart ID and not cutting apart any channel configuration information that ID represents.
Yet, should be noted that and also can represent other any channel configuration information of presentation layer depth information separately.For example, layer depth information can stop ID and cut apart continuation ID representing by cutting apart.
Stop ID can be illustrated in and wherein no longer carry out the lowermost layer that sound channel is cut apart described cutting apart.Continue ID can represent all the other each layers except that lowermost layer described cutting apart.In this case, cut apart and continue ID and cut apart by " 1 " expression and stop ID and represent by " 0 ".
Layer depth shown in Fig. 2~3 is " 4 ", also can use to cut apart to stop ID " 0 " and cut apart continuation ID " 1 " being represented with " 1110 ".
Like this, in the situation of representing depth information separately, only can represent not cut apart ID, thereby the signaling processing procedure can carried out from working as anterior layer to the scope of the last layer of lowermost layer at the node place that distributes to lowermost layer.
For example, supposing to cut apart ID is not cut apart ID and is represented by " 0 " by " 1 " expression, cut apart and continue ID by " 1 " expression and cut apart and stop ID by " 0 " expression, then indicate the node that distributes to lowermost layer whether divided particular value can cut apart " 0 " expression of termination by indication.
Although aforementioned circumstances is actual generation, lowermost layer can be discerned by described depth information, and infers and have abridged value " 0 ", thereby described any output channels is disposed.
Simultaneously, although described any channel configuration information is sent to demoder, should be noted that demoder can not use any channel configuration information that receives in case of necessity.The aforementioned operation of demoder may occur in a kind of exemplary cases, and wherein demoder identifies the size of any channel configuration information and any channel configuration information, but skips and the corresponding preset range of described size.
Those skilled in that art are appreciated that and can make various modifications and variations and not break away from the spirit or scope of the present invention the present invention.Therefore, the present invention is intended to cover modifications and variations of the present invention, as long as they drop in the scope of appended claims and equivalents thereof.
Industrial applicability
According to the method for signaling of carve information of the present invention have Xia row Xiao fruit.
At first, if the long piece of Yu measured length is divided into the short block of different length, but the fewest bit number of Ze method for signaling Yong according to the present invention is realized the signalling of classification piece carve information.
Secondly, according to method for signaling of the present invention not Xu Yao send separately the customizing messages that index signal is processed employed bit number, and not only can be according to identifying the layer depth after cutting apart through Xin number of signalling but also can identifying Xin number end through signalling.
Moreover, can use minimum bit number a plurality of subbands to be divided into the subband (subband that for example has the different frequency bandwidth) of a plurality of different sizes according to method for signaling of the present invention.
The 4th, can carry out the signalingization of the customizing messages that is associated with channel expansion audio mixing processing procedure according to method for signaling of the present invention, this signal that allows to receive in input sound channel is via the output channels output more much more than input sound channel number.

Claims (15)

1. method that is used to generate encoded sound signal, it comprises:
Introduce the necessary basic configuration information of multi-channel audio coding process; And
Introduce expanded configuration information,
Wherein said expanded configuration information comprises the configuration information by the expansion environment of type identifier (ID) sign.
2. the method for an audio signal comprises:
Reception comprises the necessary basic configuration information of multi-channel audio coding process and by the encoded sound signal of the configuration information of the expansion environment of type identifier (ID) sign; And
Dispose the sound channel of multi-channel audio signal according to described encoded sound signal.
3. method as claimed in claim 2 is characterized in that, described basic configuration information comprises the fixedly channel configuration information as the configuration information of predetermined output channels.
4. method as claimed in claim 3 is characterized in that, described type i D comprises channel configuration identifier (ID).
5. method as claimed in claim 4 is characterized in that, any channel configuration information of described channel configuration identifier (ID) sign expansion environment.
6. method as claimed in claim 5, it is characterized in that, whether described any channel configuration information is used to cut apart identifier (ID) and do not cut apart identifier (ID) indication increases in the node place of layer sound channel number, wherein, if the node of higher level's layer is represented by cutting apart ID, then distribute corresponding to cutting apart several downstream sites to the subordinate layer, and if the node of higher level's layer is represented by not cutting apart ID, then do not distribute downstream site to subordinate's layer.
7. the device of an audio signal comprises:
Be used for receiving and comprise the necessary basic configuration information of multi-channel audio coding process and by the device of the encoded sound signal of the configuration information of the expansion environment of type identifier (ID) sign; And
Be used for disposing the device of the sound channel of multi-channel audio signal according to described encoded sound signal.
8. device as claimed in claim 7 is characterized in that, described basic configuration information comprises the fixedly channel configuration information as the configuration information of predetermined output channels.
9. device as claimed in claim 8 is characterized in that, described type i D comprises channel configuration identifier (ID).
10. device as claimed in claim 9 is characterized in that, any channel configuration information of described channel configuration identifier sign expansion environment.
11. device as claimed in claim 10, it is characterized in that, whether described any channel configuration information is used to cut apart identifier (ID) and do not cut apart identifier (ID) indication increases in the node place of layer sound channel number, wherein, if the node of higher level's layer is represented by cutting apart ID, then distribute corresponding to cutting apart several downstream sites to the subordinate layer, and if the node of higher level's layer is represented by not cutting apart ID, then do not distribute downstream site to subordinate's layer.
12. device as claimed in claim 11, it is characterized in that, if the first node of described higher level's layer is represented by cutting apart ID, whether then described any channel configuration information indication increases to the described subordinate layer sound channel number with the corresponding downstream site of first node described higher level's layer that distribute, if and the first node of described higher level's layer is represented by not cutting apart ID, whether then described any channel configuration information indicates the sound channel number of the Section Point of described higher level's layer to increase.
13. device as claimed in claim 12, it is characterized in that, the described device that is used to dispose the sound channel of multi-channel audio signal uses described fixedly channel configuration information to generate fixedly output channels, and uses described any channel configuration information to generate any output channels.
14. device as claimed in claim 13, it is characterized in that, the device sequence ground identification of the described sound channel that is used to dispose multi-channel audio signal is cut apart identifier (ID) or is not cut apart identifier (ID) as the configuration constituent element of described any channel configuration information, and according to the ID execution signal Processing that is identified, if wherein the ID that is identified is cut apart ID, then single input sound channel is connected in the sound channel modular converter to generate two subordinate's sound channels, if and the ID that identifies is not cut apart ID, then input sound channel is output as any output channels.
15. device as claimed in claim 11, it is characterized in that, described any channel configuration information further comprises the sound channel map information that is used for any output channels is mapped to loudspeaker position, and the described device that is used to dispose the sound channel of multi-channel audio signal maps to loudspeaker to described any output channels according to described sound channel map information.
CN2006800277662A 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal Active CN101233570B (en)

Applications Claiming Priority (16)

Application Number Priority Date Filing Date Title
US70346305P 2005-07-29 2005-07-29
US60/703,463 2005-07-29
US71652605P 2005-09-14 2005-09-14
US60/716,526 2005-09-14
KR1020060004048A KR20070031212A (en) 2005-09-14 2006-01-13 Method and Apparatus for encoding/decoding audio signal
KR1020060004048 2006-01-13
KR10-2006-0004048 2006-01-13
KR10-2006-0017659 2006-02-23
KR1020060017659A KR20070014936A (en) 2005-07-29 2006-02-23 Method and apparatus for encoding/decoding audio signal
KR1020060017660 2006-02-23
KR1020060017660A KR20070014937A (en) 2005-07-29 2006-02-23 Method and apparatus for encoding/decoding audio signal
KR1020060017659 2006-02-23
KR10-2006-0017660 2006-02-23
US81602206P 2006-06-22 2006-06-22
US60/816,022 2006-06-22
PCT/KR2006/002985 WO2007013784A1 (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal amd method for processing audio signal

Publications (2)

Publication Number Publication Date
CN101233570A CN101233570A (en) 2008-07-30
CN101233570B true CN101233570B (en) 2011-06-22

Family

ID=37683623

Family Applications (5)

Application Number Title Priority Date Filing Date
CN2006800274842A Active CN101233567B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal
CN2006800274861A Active CN101233568B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal
CN2006800277662A Active CN101233570B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal
CN2006800277709A Active CN101233571B (en) 2005-07-29 2006-07-28 Method and device for processing audio signal
CN2006800274908A Active CN101233569B (en) 2005-07-29 2006-07-28 Method for signaling of splitting information

Family Applications Before (2)

Application Number Title Priority Date Filing Date
CN2006800274842A Active CN101233567B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal
CN2006800274861A Active CN101233568B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal

Family Applications After (2)

Application Number Title Priority Date Filing Date
CN2006800277709A Active CN101233571B (en) 2005-07-29 2006-07-28 Method and device for processing audio signal
CN2006800274908A Active CN101233569B (en) 2005-07-29 2006-07-28 Method for signaling of splitting information

Country Status (7)

Country Link
EP (5) EP1915756A4 (en)
KR (5) KR100888970B1 (en)
CN (5) CN101233567B (en)
AU (1) AU2006273012B2 (en)
CA (1) CA2617050C (en)
RU (1) RU2414741C2 (en)
WO (5) WO2007013784A1 (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ATE354161T1 (en) 2002-04-22 2007-03-15 Koninkl Philips Electronics Nv SIGNAL SYNTHESIS
EP1853092B1 (en) 2006-05-04 2011-10-05 LG Electronics, Inc. Enhancing stereo audio with remix capability
WO2008044901A1 (en) 2006-10-12 2008-04-17 Lg Electronics Inc., Apparatus for processing a mix signal and method thereof
JP4838361B2 (en) 2006-11-15 2011-12-14 エルジー エレクトロニクス インコーポレイティド Audio signal decoding method and apparatus
WO2008069584A2 (en) 2006-12-07 2008-06-12 Lg Electronics Inc. A method and an apparatus for decoding an audio signal
JP5270566B2 (en) 2006-12-07 2013-08-21 エルジー エレクトロニクス インコーポレイティド Audio processing method and apparatus
KR20080082916A (en) 2007-03-09 2008-09-12 엘지전자 주식회사 A method and an apparatus for processing an audio signal
ES2593822T3 (en) * 2007-06-08 2016-12-13 Lg Electronics Inc. Method and apparatus for processing an audio signal
KR101569032B1 (en) 2007-09-06 2015-11-13 엘지전자 주식회사 A method and an apparatus of decoding an audio signal
EP2215630B1 (en) * 2007-12-06 2016-03-02 Lg Electronics Inc. A method and an apparatus for processing an audio signal
JP4917189B2 (en) 2009-09-01 2012-04-18 パナソニック株式会社 Digital broadcast transmission apparatus, digital broadcast reception apparatus, and digital broadcast transmission / reception system
TWI444989B (en) 2010-01-22 2014-07-11 Dolby Lab Licensing Corp Using multichannel decorrelation for improved multichannel upmixing
KR101421201B1 (en) * 2013-04-23 2014-07-22 한국산업은행 Method and apparatus for encoding/decoding scalable digital audio using uncompressed audio channel data and compressed audio channel data
US9679572B2 (en) 2013-04-23 2017-06-13 The Korea Development Bank Method and apparatus for encoding/decoding scalable digital audio using direct audio channel data and indirect audio channel data
TWI758146B (en) * 2015-03-13 2022-03-11 瑞典商杜比國際公司 Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
AU2016325879B2 (en) 2015-09-25 2021-07-08 Voiceage Corporation Method and system for decoding left and right channels of a stereo sound signal

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ATE138238T1 (en) * 1991-01-08 1996-06-15 Dolby Lab Licensing Corp ENCODER/DECODER FOR MULTI-DIMENSIONAL SOUND FIELDS
DE4209544A1 (en) * 1992-03-24 1993-09-30 Inst Rundfunktechnik Gmbh Method for transmitting or storing digitized, multi-channel audio signals
KR100265112B1 (en) * 1997-03-31 2000-10-02 윤종용 Dvd dics and method and apparatus for dvd disc
AU750877C (en) * 1997-09-05 2004-04-29 Lexicon, Inc. 5-2-5 matrix encoder and decoder system
US20030035553A1 (en) * 2001-08-10 2003-02-20 Frank Baumgarte Backwards-compatible perceptual coding of spatial cues
WO2003090208A1 (en) * 2002-04-22 2003-10-30 Koninklijke Philips Electronics N.V. pARAMETRIC REPRESENTATION OF SPATIAL AUDIO
ATE354161T1 (en) * 2002-04-22 2007-03-15 Koninkl Philips Electronics Nv SIGNAL SYNTHESIS
CN1647156B (en) * 2002-04-22 2010-05-26 皇家飞利浦电子股份有限公司 Parameter coding method, parameter coder, device for providing audio frequency signal, decoding method, decoder, device for providing multi-channel audio signal
SE0402650D0 (en) * 2004-11-02 2004-11-02 Coding Tech Ab Improved parametric stereo compatible coding or spatial audio
US7787631B2 (en) * 2004-11-30 2010-08-31 Agere Systems Inc. Parametric coding of spatial audio with cues based on transmitted channels
KR100682904B1 (en) * 2004-12-01 2007-02-15 삼성전자주식회사 Apparatus and method for processing multichannel audio signal using space information
US7903824B2 (en) * 2005-01-10 2011-03-08 Agere Systems Inc. Compact side information for parametric coding of spatial audio

Also Published As

Publication number Publication date
AU2006273012B2 (en) 2010-06-24
CA2617050A1 (en) 2007-02-01
CN101233568A (en) 2008-07-30
AU2006273012A1 (en) 2007-02-01
KR100888970B1 (en) 2009-03-17
EP1915757A1 (en) 2008-04-30
WO2007013784A1 (en) 2007-02-01
CN101233571B (en) 2012-12-05
CA2617050C (en) 2012-10-09
WO2007013775A1 (en) 2007-02-01
WO2007013783A1 (en) 2007-02-01
WO2007013780A1 (en) 2007-02-01
KR100857104B1 (en) 2008-09-05
CN101233567B (en) 2011-06-15
WO2007013781A1 (en) 2007-02-01
CN101233570A (en) 2008-07-30
RU2008107773A (en) 2009-09-10
KR20080033452A (en) 2008-04-16
KR20080035656A (en) 2008-04-23
CN101233567A (en) 2008-07-30
CN101233568B (en) 2010-10-27
EP1915756A1 (en) 2008-04-30
EP1915756A4 (en) 2010-01-06
EP1920437A4 (en) 2010-01-06
CN101233569B (en) 2010-09-01
EP1920438A1 (en) 2008-05-14
CN101233571A (en) 2008-07-30
EP1920437A1 (en) 2008-05-14
EP1920439A1 (en) 2008-05-14
KR20080036119A (en) 2008-04-24
RU2414741C2 (en) 2011-03-20
CN101233569A (en) 2008-07-30
KR100841332B1 (en) 2008-06-25
KR100857102B1 (en) 2008-09-08
KR20080034002A (en) 2008-04-17
EP1915757A4 (en) 2010-01-06
EP1920438A4 (en) 2010-01-06
EP1920439A4 (en) 2010-01-06
KR20080030686A (en) 2008-04-04
KR100857103B1 (en) 2008-09-08

Similar Documents

Publication Publication Date Title
CN101233570B (en) Method for generating encoded audio signal and method for processing audio signal
JP5123351B2 (en) Decoding method of signal division information
KR100880642B1 (en) Method and apparatus for decoding an audio signal
JP2010156822A (en) Sound compression coding device and decoding device of multi-channel sound signal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant