CN101233571B

CN101233571B - Method and device for processing audio signal

Info

Publication number: CN101233571B
Application number: CN2006800277709A
Authority: CN
Inventors: 吴贤午; 房熙锡; 金东秀; 林宰显; 金孝镇; 郑亮源
Original assignee: LG Electronics Inc
Current assignee: LG Electronics Inc
Priority date: 2005-07-29
Filing date: 2006-07-28
Publication date: 2012-12-05
Anticipated expiration: 2026-07-28
Also published as: CN101233567B; CN101233568A; CN101233569B; KR20080036119A; KR100888970B1; EP1920438A4; EP1920437A1; CN101233570B; CN101233571A; KR100841332B1; EP1915757A1; RU2008107773A; EP1915756A1; WO2007013781A1; CA2617050C; KR100857104B1; WO2007013780A1; EP1920439A4; AU2006273012A1; EP1915757A4

Abstract

A method for processing an audio signal during the multi-channel audio coding is disclosed. The present invention provides the method for processing an audio signal comprising: generating a fixed output channel using a down-mix signal and a basic matrix; and generating an arbitrary output channel using the fixed output channel and a post matrix.

Description

The method and apparatus of audio signal

Technical field

The present invention relates to the multi-channel encoder method, relate in particular to the method for audio signal.

Background of invention

In general, signal (for example piece, frequency band and sound channel) configuration in every way.In the static period of the statistical property that signal can keep being scheduled to, above-mentioned signal need not to be divided into some unit and can be processed, because this is favourable for compressed signal.

Transient state that characteristics of signals suddenly changes in the period preferably with the partitioning scheme processing signals, because prevented distorted signals.

Yet, if the user wants to handle aforementioned signal with partitioning scheme, but will be through the detailed method of the information signalingization cut apart.Therefore, be difficult to handle effectively said signal.

Summary of the invention

Therefore, the present invention relates to a kind of can the elimination in essence because the limitation of correlation technique and the method with the carve information signalingization of one or more problems that defective causes.

The one object of the present invention that is used to deal with problems be a kind of effectively will be through the method for the signal signalingization cut apart.The object of the invention can realize that this method comprises through a kind of method that is used for audio signal is provided: utilize down-mix audio signal and fundamental matrix to generate fixedly output channels; And utilize fixedly that output channels and rearmounted matrix generate any output channels.

Brief Description Of Drawings

Be included in this and so that the accompanying drawing to further understanding of the present invention to be provided embodiment of the present invention be shown, it can be used to explain principle of the present invention with instructions.

In the accompanying drawings:

Fig. 1 is the concept map that illustrates according to the method for signaling of the piece carve information of one embodiment of the present invention;

Fig. 2 and Fig. 3 are the concept maps that illustrates according to the method for signaling of the frequency band of one embodiment of the present invention and sound channel carve information;

Fig. 4 is the concept map of method that the establishment multi-channel signal of another embodiment according to the present invention is shown; And

Fig. 5 is the concept map of method for signaling that the sound channel carve information of another embodiment according to the present invention is shown.

Embodiment

Below will be in detail with reference to preferred embodiments of the present invention, its concrete exemplary plot is shown in the drawings.

Describe below in conjunction with the method for signaling of accompanying drawing carve information according to the present invention (being also referred to as " splitting information ").

Method for signaling according to carve information of the present invention is classified according to the signal classification.

Before describing the present invention, should be understood that said signal disposes in every way, for example piece, frequency band and sound channel.Described " method for signaling " can comprise the implication that the implication of " signalingization " perhaps " is discerned the signal of signalingization ".

Term " node " is that the indication signal has been cut apart or undivided point.

Term " spatial information " is can multi-channel audio or the information of channel expansion audio mixing multi-channel signal.

Should be pointed out that " spatial information " but the representation space parameter, yet it is not limited to said example, but can be applied to other example when needed.Said spatial parameter is the levels of channels poor (CLD) of energy difference between two sound channels of indication, the sound channel predictive coefficient (CPC) of indicating the inter-channel coherence (ICC) of correlativity between two sound channels and being used for creating from two sound channels three sound channels.

Down in the face of piece is cut apart, band segmentation and sound channel are cut apart and be elaborated.

1) piece is cut apart

Requirement is handled to compress the continuous data in the time domain with the mode identical with sound signal with piece.

Term " piece processing " is illustrated on the interval of preset distance and handles input signal with partitioning scheme.

In this case, said interval is defined as " piece ", and the one or more formations " frame " that combine.

Said frame can represent to be used to send/store the unit of data.

Term " piece is cut apart " or " piece partition " can be represented a kind of detailed process, in this process, during signal Processing, input signal are become the piece of different length.

Term " block length information " is meant and is shown in the customizing messages of handling the block length that is obtained when input signal becomes a plurality of of different length with input signal.In general, if signal with the configuration of the form of piece, then signal Processing is with long piece or short block completion.

Using under the situation of short block,, and making piece after the combination corresponding to single long piece with some short blocks combinations.

Yet for each at interval, signal has various characteristic, therefore is difficult to definite fatefully all signals and all can handles according to long block signal processing scheme and short block signal Processing scheme.

Preferably, in specific interval, from the piece of the different length that is fit to characteristics of signals, select the piece of a length-specific, execution block is cut apart on selected block subsequently.

In more detail, all pieces are configured to have two or more different length.Can from frame, select the piece of the predetermined length in these two or more different length pieces in every way.For this reason, need indicate to comprise which piece in the present frame, so need be used for the method for signaling of aforesaid operations.

Said method for signaling is divided into order method for signaling and classification method for signaling.

Order method for signaling predefine frame length (length of promptly being represented by " N ") also uses the number M of minimum length piece to carry out the signaling process.

In this case, frame length " N " is the multiple of particular value M.Frame length can be a fixed value, perhaps can be the occurrence that can send to the destination as additional information.

For example; Suppose that N is 2048 (N=2048); M is 256 (M=256), and all piece arranged with 256 → 256 → 1024 → 512 order, and then block length information can be carried out the signaling processing by M*1, M*1, M*4, M*2 → 1,1,4,2 → 0,0,3,1 order.

The classification method for signaling can be divided into method of sending layer depth information and the method for not sending layer depth information, below in conjunction with accompanying drawing it is elaborated.

Fig. 1 is the concept map that illustrates according to the method for signaling of the piece carve information of one embodiment of the present invention; With reference to Fig. 1, each layer is with " layer " expression, and layer depth is set as " 5 ".

" layer 1 " comprises first 210, and it is the longest piece of the base unit cut apart as piece, and first 210 length is N.

Reference numeral (1), (2) ..., (a) and (b), (c) and (d) the exemplary binary signaling sequence of expression.

According to this embodiment, indicator dog whether represent by cutting apart ID (identifier) and not cutting apart ID by divided carve information.Optional network specific digit " 1 " is used as cuts apart ID, and optional network specific digit " 0 " is used as and does not cut apart ID.

The said ID of cutting apart and do not cut apart ID and in the node of each layer, represent.

Cut apart ID and indicate the predetermined block that is included in higher level's layer to be divided into the halves in subordinate's layer, and also indicate to this subordinate's layer and distributed downstream site.The predetermined block of not cutting apart in the ID indication higher level layer is not cut apart by subordinate's layer, and also indication does not have to this subordinate's layer distribution and do not cut apart the corresponding any downstream site of node that ID representes by this.Do not distribute downstream site to mean and do not carry out other signaling operation.

Because the value of first 210 piece carve information (1) is 1 in top (i.e. layer 1), therefore ground floor 210 execution blocks is cut apart.

Layer 2 as level layer under the layer 1 comprises two

pieces

220 and 221, and the length of each piece is N/2.

The piece carve information (2) that is included in the piece 220 in the layer 2 has value " 1 ", and the piece carve information (3) of piece 221 has value " 1 ", thereby comprises four pieces 230,231,232 and 233 as the layer 3 of level layer under the layer 2, and each block length is N/4.

Be " 0 " with the value that is included in the piece carve information (4) that the piece 230 of layer in 3 be associated.The value of the piece carve information (5) that is associated with piece 231 is " 1 ".The value of the piece carve information (6) that is associated with piece 232 is " 1 ".Be " 0 " with the value that is included in the piece carve information (7) that the piece 233 of layer in 3 be associated.

Therefore, according to the piece carve information of layer 3, the

piece

230 and 233 execution blocks of layer 3 are not cut apart, but the

piece

231 and 232 execution blocks of layer 3 are cut apart.

In this case, do not distribute downstream sites to layer 4 as subordinate's layer of the said piece of cutting apart without piece 230 of

layer

3 and 233.

The piece of cutting apart through

piece

231 and 232 of layer 3 distributes downstream site to subordinate's layer.Whether the existence that piece is cut apart shows in downstream site.

Layer 4 length is N/8, and is included in the

piece

240 and 241 that is partitioned on the basis of piece 231 of layer 3, also is included in

other piece

242 and 243 that is partitioned on the basis of layers 3 piece 232.The value of the piece carve information (8) that is associated with the piece 240 of layer 4 is " 0 ".The value of the piece carve information (9) that is associated with the piece 241 of layer 4 is " 1 ".The value of the piece carve information (a) that is associated with the piece 242 of layer 4 is " 0 ".The value of the piece carve information (b) that is associated with the piece 243 of layer 4 is " 0 ".

Therefore, according to the piece carve information of layer 4, execution block is not cut apart on the piece 240,242 and 243 of layer 4, but execution block is cut apart on the piece 241 of layer 4.

In this case, do not distribute downstream sites to layer 5 as subordinate's layer of the said piece of cutting apart without piece 240,242 of

layer

4 and 243.

The piece of cutting apart through piece 241 of layer 4 distributes a downstream site to layer 5, thereby it indicates whether to exist piece to cut apart in said downstream site.

Layer 5 length is N/16, and is included in the

piece

250 and 251 that is partitioned on the basis of piece 241 of layer 4.

The value of the piece carve information (c) that is associated with the piece 250 of layer 5 is " 0 ".The value of the piece carve information (d) that is associated with the piece 251 of layer 5 is " 0 ".

Therefore, the value of each contained piece is " 0 " in the layer 5, cuts apart thereby no longer carry out the classification piece, can identify so the piece of piece is cut apart the degree of depth.

The layout structure of the piece that can be cut apart by the classification piece comprises N/4 piece (being that length is the piece of N/4), N/8 piece, N/16 piece, N/16 piece, N/8 piece, N/8 piece and N/8 piece.

If signal length is N, the piece of then cutting apart through piece has formula " N/x ⁱ" expression (and wherein i=1,2 ..., P, P is integer and x=2) any one (being N/2, N/4, N/8, N/16 and N/32 ...) in the length of expression.

Expression can according to binary signaling sequence (1) (2) (3) (4) (5) (6) (7) (8) (9) (a) (b) (c) under the situation of (d) information of being cut apart by the piece of binary number representation, the piece carve information can be represented by 13 bits " 1110110010000 ".

Above explanation an example scenario is disclosed, the depth information in its middle level is by expression separately but only through discerning by cutting apart ID and not cutting apart the piece carve information that ID representes.

Yet, should be noted that other piece carve information of other presentation layer depth information also can carry out the signaling processing.

For example, layer depth information can stop ID and cut apart continuation ID representing by cutting apart.

Stop ID can be illustrated in wherein the lowermost layer that execution block is no longer cut apart said cutting apart.The said continuation ID of cutting apart can represent all the other each layers except that lowermost layer.In this situation, cut apart and continue ID and cut apart by " 1 " expression and stop ID and represent by " 0 ".

Layer depth shown in Fig. 1 is " 5 ", also can use to cut apart to stop ID " 0 " and cut apart continuation ID " 1 " representing with " 11110 ".

Sub-block length can be discerned by said method for signaling.

Like this, in the situation of representing depth information separately, only can represent not cut apart ID, thereby the signaling processing procedure can carried out within the scope of the last layer of current layer to lowermost layer at the node place that distributes to lowermost layer.

For example; Supposing to cut apart ID is not cut apart ID and is represented by " 0 " by " 1 " expression; And cut apart and continue ID by " 1 " expression and cut apart and stop ID by " 0 " expression, the node that indication distributes to lowermost layer whether divided particular value can be cut apart " 0 " expression of termination by indication.

2) band segmentation

Below in conjunction with Fig. 2-3 band segmentation is described.

Fig. 2 is the concept map of method for signaling that the band segmentation information of another embodiment according to the present invention is shown.

Fig. 2 illustrates the classification band segmentation of the tree structure that is configured to Methods of Subband Filter Banks.The frequency resolution of subband can define in every way, will be elaborated to it below.

The piece of comparing Fig. 1 is cut apart, and the band segmentation of Fig. 2 comprises a plurality of frequency bands in top, and the top of Fig. 1 is made up of single long piece.

According to this embodiment, the indication frequency band whether represent by cutting apart ID and not cutting apart ID by divided band segmentation information.To be worth " 1 " as cutting apart ID, and be used as and do not cut apart ID and will be worth " 0 ".

Cut apart ID and do not cut apart ID and can indicate at every layer node place.

Cutting apart ID indicates the frequency band of M layer to be divided into halves at (M+1) layer.Do not cut apart ID and indicate the frequency band of M layer not cut apart at (M+1) layer, also indication is not to subordinate's layer distribution and by not cutting apart the corresponding any downstream site of node that ID representes.Do not distribute downstream site to mean and do not carry out other signaling operation.

Comprise first to the 6th frequency band 310,311,312,313,314 and 315 as top layer 1.

The band segmentation information (1) of first frequency band 310 is represented by " 1 ".The band segmentation information (2) of second frequency band 311 is represented by " 1 ".The band segmentation information (3) of the 3rd frequency band 312 is represented by " 0 ".The band segmentation information (4) of the 4th frequency band 313 is represented by " 0 ".The band segmentation information (5) of the 5th frequency band 314 is represented by " 0 ".The band segmentation information (6) of the 6th frequency band 315 is represented by " 0 ".

Said band segmentation information is indicated in the node place that distributes to layer 1.According to band segmentation information (1) and (2), first frequency band 310 produces signal conversion module 310T, and second frequency band 311 produces signal conversion module 311T, thereby in layer 2, produces subordinate's frequency band 320,321,322 and 323.Distributed downstream site to subordinate's frequency band 320,321,322 and 323.It should be noted that said signal conversion module also is called as " frequency band conversion module " in this embodiment.

Simultaneously, it is not carried out band shared the 3rd, the 4th, the 5th or the 6th frequency band 312,313,314 or 315 and do not produce the frequency band conversion module.Equally, in the 3rd, the 4th, the 5th or the 6th frequency band 312,313,314 or 315, do not produce and layer 2 corresponding subordinate frequency band.Therefore, do not distribute and 312,313,314 and 315 corresponding any downstream sites to layer 2.

Layer 2 is included in two frequency bands 320,321 that are partitioned on layer 1 frequency band 320 basis, and is included in two

frequency bands

322 and 323 that are partitioned on the frequency band 311 of layer 1.

The band segmentation information (7) of frequency band 320 by " 1 " expression.The band segmentation information (8) of frequency band 321 by " 1 " expression.The band segmentation information (9) of frequency band 322 is represented by " 0 ".The band segmentation information (10) of frequency band 323 is represented by " 0 ".

According to said band segmentation information (7) and (8), frequency band 320 produces frequency band conversion module 320T, and frequency band 321 produces frequency band conversion module 321T, thereby in layer 3, produces subordinate's frequency band 330,331,332 and 333.Distributed downstream site to subordinate's frequency band 330,331,332 and 333.

Simultaneously, it is not carried out band shared

frequency band

322 and 323 and do not produce the frequency band conversion

module.In frequency band

322 and 323, do not produce and layer 3 corresponding subordinate frequency band yet.Therefore, do not distribute downstream site to

frequency band

322 and 323 yet.

Layer 3 is included in two frequency bands 330,331 that are partitioned on layer 2 frequency band 320 basis, and is included in two

frequency bands

332 and 333 that are partitioned on the frequency band 321 of layer 2.

The band segmentation information (11) of frequency band 330 by " 1 " expression.The band segmentation information (12) of frequency band 331 is represented by " 0 ".The band segmentation information (13) of the 3rd frequency band 332 is represented by " 0 ".The band segmentation information (14) of frequency band 333 is represented by " 0 ".

According to said band segmentation information (11), frequency band 330 produces signal conversion module 330T, and in layer 4, produces subordinate's frequency band 340 and 341.Distributed downstream site to subordinate's

frequency band

340 and 341.

Simultaneously, it is not carried out band shared frequency band 331,332 and 333 and do not produce the frequency band conversion module.In frequency band 331,332 and 333, do not produce and layer 4 corresponding subordinate layer yet.Therefore, do not distribute downstream site to

frequency band

322 and 323 yet.Therefore, do not distribute downstream site to frequency band 331,332 and 333 yet.

Layer 4 is included in two

frequency bands

340 and 341 that are partitioned on the basis of layer 3 frequency band 330.

The band segmentation information (15) of frequency band 340 is represented by " 0 ".The band segmentation information (16) of frequency band 341 is represented by " 0 ".

Therefore, no longer include and can carry out band shared subordinate layer, the signaling processing procedure stops.In this case, lowermost layer equals layer 4.

Under expression can be according to the situation of binary signaling sequence (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) (16) by binary number representation piece carve information, the piece carve information can be represented by 16 bits " 1100001100100000 ".

Fig. 3 is the block scheme of method for signaling that the band segmentation information of another embodiment according to the present invention is shown.

Compare with Fig. 2, with regard to carrying out band shared method, the band segmentation of Fig. 3 is similar with the situation of Fig. 2.

Yet as shown in Figure 3, the binary sequence of the band segmentation information among Fig. 3 is different with Fig. 2.

Therefore; Under expression can be according to the situation of binary signaling sequence (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) (16) by binary number representation piece carve information, the piece carve information can be represented by 16 bits " 1110001001000000 ".

Above explanation an example scenario is disclosed, the depth information in its middle level is by expression separately but only through discerning by cutting apart ID and not cutting apart the band segmentation information that ID representes.

Yet, should be noted that other band segmentation information of presentation layer depth information also can be carried out the signaling processing separately.

The described termination ID of cutting apart representes no longer to carry out band shared lowermost layer.The said continuation ID of cutting apart can represent all the other each layers except that lowermost layer.In this situation, cut apart and continue ID and cut apart by " 1 " expression and stop ID and represent by " 0 ".

Layer depth shown in Fig. 2～3 is " 4 ", also can use to cut apart to stop ID " 0 " and cut apart continuation ID " 1 " representing with " 1110 ".Subband length can be discerned by described method for signaling.

For example; Supposing to cut apart ID is not cut apart ID and is represented by " 0 " by " 1 " expression; Cut apart and continue ID by " 1 " expression and cut apart and stop ID by " 0 " expression, then indicate the node that distributes to lowermost layer whether divided particular value can cut apart " 0 " expression of termination by indication.

3) sound channel is cut apart

The sound channel carve information relates to the channel configuration information that is used for channel configuration, so hereinafter will be cut apart sound channel with reference to described channel configuration information and be elaborated.

Especially, with an example at length setting forth the channel configuration that when multi-channel audio signal is carried out Code And Decode, is obtained.

Fundamental space information is needed when multi-channel audio signal is encoded.Said fundamental space information comprise the basic configuration information that can represent the configuration information related with basic environmental facies and with the corresponding master data of said basic configuration information.In addition, multi-channel audio coding optionally requires extending space information.Said extending space information comprise indication and the expanded configuration information of expanding the configuration information that environment is associated and with the corresponding growth data of said expanded configuration information.The configuration information of said expansion environment can exist one or more.Said expansion environment can be identified by type I D.

Simultaneously, the channel configuration by the reference of said multi-channel signal coding mainly is divided into two kinds of channel configuration, promptly basic channel configuration and expansion channel configuration.

One or more channel configuration information are used as said basic channel configuration information.Especially, basic channel configuration information is indicated a channel configuration information of from some channel configuration information, selecting.

For ease of explanation, basic channel configuration information is called as " fixedly channel configuration information ", and a plurality of sound channels (being multichannel) of creating according to fixing channel configuration information are called as " fixedly output channels ".

Fixedly channel configuration information is that the said fixing output channels of establishment is desired with the channel configuration data that is associated.

Fixedly channel configuration information can be represented a channel configuration constituent element in some channel configuration constituent elements of setting up in advance.The said channel configuration of setting up in advance can be represented in every way.For example, sound channel can be configured to the form of " 5-1-5 ", " 5-2-5 ", " 7-2-7 " or " 7-5-7 ".

Said " 5-2-5 " configuration can be represented a kind of concrete channel structure, and wherein six input sound channels are become two sound channels by multi-channel audio (downmixed), and are exported to six sound channels through the sound channel of multi-channel audio." 5-2-5 " is configured to all the other outer channel configuration to have and the identical channel structure of " 5-2-5 " configuration.

Said fixedly channel configuration information is comprised in the basic configuration information, and the data that are associated with fixing channel configuration information are comprised in the master data.

Various parameters can be used as said master data; For example, inter-channel coherence (ICC) parameter of the correlativity between levels of channels poor (CLD) parameter of energy difference, two sound channels of indication and be used for creating sound channel predictive coefficient (CPC) parameter of three sound channels between two sound channels of indication from two sound channels.

Said expansion channel configuration indication accordings to the fixedly channel configuration of channel configuration formation.

Said expansion channel configuration is by forming arbitrarily through encoded signals.For ease of explanation, the expansion channel configuration information is called as any channel configuration information, and the multichannel of being created by any channel configuration information is called as any output channels.Said any channel configuration information is comprised in the expanded configuration information, and is identified by the type I D that is called sound channel ID.

Be comprised in the growth data with the corresponding any channel configuration data of any channel configuration information.

If desired, for simple to operate, said any channel configuration data can only be used the CLD parameter of energy difference between two sound channels of expression.

Channel configuration information is represented by cutting apart ID and not cutting apart ID arbitrarily.The increase of cutting apart ID indication sound channel number as the ingredient of said any channel configuration information.Do not cut apart ID and indicate a kind of particular case, wherein the sound channel number does not change.

For example, cut apart input sound channel of ID indication and be converted into two output channels.Not cutting apart ID indication input sound channel does not do any change and is promptly exported on the sound channel number.

Represented to cut apart under the situation of ID at higher level's node layer place, in subordinate's layer, created subordinate's sound channel, and distributed the downstream site corresponding with the sound channel of being created to subordinate's layer to higher level's layer channel allocation.

Yet, represented not cut apart in the situation of ID at higher level's node layer place to the channel allocation of higher level's layer, in subordinate's layer, do not create subordinate's sound channel, therefore distribute and the corresponding downstream site of subordinate's sound channel to subordinate's layer.

Cut apart ID and do not cut apart the method that ID representes said any channel configuration information below in conjunction with Fig. 2～3 pair use and describe.

Fig. 2～3 not only illustrate said band segmentation and sound channel also is shown cuts apart.

At first Fig. 2 is specified as follows.

Comprise six frequency bands 310,311,312,313,314 and 315 as top layer 1.Said frequency band 310,311,312,313,314 and 315 can serve as said fixing multichannel respectively.According to the present invention, cut apart ID and do not cut apart ID and represent by " 0 " by " 1 " expression.The method of representing any channel configuration information sequentially representes to be included in value " 0 " or " 1 " in the node that the sound channel 310,311,312,313,314 and 315 of layer 1 is distributed.

The method of representing any channel configuration information sequentially representes to be included in value " 0 " or " 1 " in the node that the sound channel 320,321,322 and 323 of layer 2 is distributed.

The method of representing any channel configuration information sequentially representes to be included in value " 0 " or " 1 " in the node that the sound channel 330,331,332 and 333 of layer 3 is distributed.

The method of representing any channel configuration information sequentially representes to be included in value " 0 " or " 1 " in the node that the

sound channel

340 and 341 of layer 4 is distributed.

Whether in other words, whether said method sequentially indicates the sound channel number to increase at the node place of higher level's layer, and sequentially indicate the sound channel number to increase at the node place of subordinate's layer subsequently.

Any channel configuration information according to said method is represented by 16 bits " 1100001100100000 ".For ease of explanation, represent that the method for any channel configuration information is called as " hierarchical priority method ".

Method according to any channel configuration information of expression shown in Figure 3; If when the first node that obtains signaling higher level's layer as a result the time from the first node of higher level's layer by " 1 " expression, then whether sequentially increase with the corresponding all downstream site indication sound channel numbers of the first node of higher level's layer.If when the first node that obtains signaling higher level's layer as a result the time from the first node of higher level's layer by " 0 " expression, then present node moves to higher level's Section Point, so that whether Section Point indication sound channel number sequentially increases.Therefore, any channel configuration information that is obtained by said method is represented by 16 bits " 1110001001000000 ".

For ease of explanation, represent that the method for any channel configuration information is called as " branch's priority approach ".

Specify with reference to Fig. 4 below and create fixedly output channels and the method for output channels arbitrarily.

Fig. 4 is the concept map that illustrates according to the method for establishment multi-channel signal of the present invention.

With reference to Fig. 4, create out any output channels (y) through the calculating between down-mix audio signal (x) and the fundamental matrix (m1), and create out another any output channels (z) through the fixing calculating between output channels (y) and the rearmounted matrix (m2).Can there be two or more fundamental matrixs (m1) where necessary.

The configuration element that can use at least one and said fixedly channel configuration information among CLD, ICC, the CPC to obtain fundamental matrix (m1).

Can use CLD and said any channel configuration information to obtain the configuration element of rearmounted matrix (m2).

To method that create any output channels be elaborated below.

At first, the method for using any channel configuration information to dispose any sound channel is elaborated.

Down in the face of using said branch priority approach to represent that the illustrative methods of said any channel configuration information describes.

Said illustrative methods was sequentially discerned as the cutting apart ID and do not cut apart ID of the configuration constituent element of any channel configuration information, and according to the ID execution signal Processing that is identified.

Cut apart ID if the ID that is identified is confirmed as, then an input sound channel is connected to the sound channel modular converter as an example of conversion of signals, consequently creates out two subordinate's sound channels.

Otherwise, if being confirmed as, the ID that is identified do not cut apart ID, then the sound channel number is not made the aforementioned input sound channel of any change ground output.

To provide its detailed description below.

In the phase one, the initial value of ID number that be decoded is changed to " 1 ", and the initial value of output channels number is changed to " 0 " arbitrarily, and the initial value of sound channel modular converter number is changed to " 0 ".

In subordinate phase, decoded ID is wanted in identification.

In the phase III, if being confirmed as, the ID that is identified cuts apart ID, then sound channel modular converter number increases progressively 1, and the ID number that will be identified increases progressively 1.

Do not cut apart ID if the ID that is identified is confirmed as, then the output channels number increases progressively 1 arbitrarily, and the ID number that will be identified successively decreases 1.

Repeat aforementioned second and the phase III, up to wanting decoded ID number to arrive " 0 ".

Output channels number according to fixing repeats aforementioned signal processing method.

For example, any channel configuration that when any channel configuration information is represented by " 11100010010000 ", obtains is shown among Fig. 3.In this case, " 1 " expression is cut apart ID and ID is not cut apart in " 0 " expression.The number of " 1 " is represented the number of sound channel modular converter (being the signal conversion module of Fig. 3), and the number of " 0 " is represented the number of any output channels.

Simultaneously, fixedly output channels can be reset (that is, remapping) by different order, and can that kind as shown in Figure 5 subsequently create out any output channels.Fig. 5 is the concept map that illustrates according to the method for signaling of sound channel carve information of the present invention.

With reference to Fig. 5, fixedly output channels 310,311,312,313,314 and 315 is reset by remapping module 100.Fixedly output channels 310 ', 311 ', 312 ', 313 ', 314 ' and the 315 ' sound channel as the superiors after the rearrangement is to create said any output channels.Needless to say, can reset or remap said any output channels by different order.

Simultaneously, if comprising the sound channel map information that the sound channel of any channel configuration information is mapped to loudspeaker in the channel configuration information arbitrarily, then output channels also can be mapped to this loudspeaker arbitrarily.

The explanation of front discloses a kind of exemplary cases, presentation layer depth information separately not wherein, but can be through coming the identification layer depth information by cutting apart ID and not cutting apart any channel configuration information that ID representes.

Yet, should be noted that and also can represent other any channel configuration information of presentation layer depth information separately.

Stop ID can be illustrated in and wherein no longer carry out the lowermost layer that sound channel is cut apart said cutting apart.The said continuation ID of cutting apart can represent all the other each layers except that lowermost layer.In this situation, cut apart and continue ID and cut apart by " 1 " expression and stop ID and represent by " 0 ".

Layer depth shown in Fig. 2～3 is " 4 ", also can use to cut apart to stop ID " 0 " and cut apart continuation ID " 1 " representing with " 1110 ".

Although aforementioned circumstances is actual generation, lowermost layer can be discerned through said depth information, and infers and have abridged value " 0 ", thereby said any output channels is able to configuration.

Simultaneously, although said any channel configuration information is sent to demoder, should be noted that demoder can not use any channel configuration information that receives in case of necessity.The aforementioned operation of demoder may occur in a kind of exemplary cases, and wherein demoder identifies the size of any channel configuration information and any channel configuration information, but skips and the corresponding preset range of said size.

Those skilled in that art are appreciated that and can make various modifications and variation to the present invention and do not break away from the spirit or scope of the present invention.Therefore, the present invention is intended to cover modification of the present invention and variation, as long as they drop in the scope of appended claims and equivalents thereof.

Industrial applicability

Method for signaling according to carve information of the present invention has following effect.

At first, if the long piece of predetermined length is divided into the short block of different length, method for signaling then according to the present invention can use minimum bit number to realize the signalingization of classification piece carve information.

Secondly, according to method for signaling of the present invention not needs send the customizing messages that indicator signal is handled employed bit number separately, and not only can identify the layer depth after cutting apart according to signal but also can discern end through the signal of signalingization through signalingization.

Moreover, can use minimum bit number a plurality of subbands to be divided into the subband (subband that for example has the different frequency bandwidth) of a plurality of different sizes according to method for signaling of the present invention.

The 4th, can carry out the signalingization of the customizing messages that is associated with channel expansion audio mixing processing procedure according to method for signaling of the present invention, this signal that allows in input sound channel, to receive is via the output channels output more much more than input sound channel number.

Claims

1. the method for an audio signal comprises:

Utilize the configuration element of down-mix audio signal and fundamental matrix to generate fixedly output channels; And

Utilize the configuration element of said fixedly output channels and rearmounted matrix to generate any output channels,

The configuration element of wherein said fundamental matrix be utilize that levels of channels is poor, in inter-channel coherence and the sound channel predictive coefficient at least one and fixedly channel configuration information obtain,

The configuration element of said rearmounted matrix utilizes levels of channels difference and any channel configuration information to obtain, and

Said any channel configuration information indication uses to cut apart identifier ID and do not cut apart identifier ID whether increase the sound channel number.

2. the method for claim 1 is characterized in that, if the node of higher level's layer represent by cutting apart ID, then distribute corresponding to cutting apart several downstream sites to the subordinate layer,

And if the node of higher level's layer is represented by not cutting apart ID, then do not distribute downstream site to subordinate's layer.

3. method as claimed in claim 2 is characterized in that, said any channel configuration information indicates sequentially whether the sound channel number increases at the node place of said higher level's layer, and indicates sequentially whether the sound channel number increases at the downstream site place of said subordinate layer.

4. method as claimed in claim 2; It is characterized in that; If the first node of said higher level's layer representes by cutting apart ID whether then said any channel configuration information indication increases to the said subordinate layer sound channel number with the corresponding downstream site of first node said higher level's layer that distribute

And if the first node of said higher level's layer is represented by not cutting apart ID, whether then said any channel configuration information indicates the sound channel number of the Section Point of said higher level's layer to increase.

5. method as claimed in claim 4 is characterized in that, any output channels of said generation comprises:

Sequentially discern cutting apart ID or not cutting apart ID as the configuration constituent element of said any channel configuration information; And

Carry out signal Processing according to the ID that is identified, if the ID that is wherein identified is cut apart ID, then single input sound channel is connected in the sound channel modular converter and generates two subordinate's sound channels,

And if the ID that identifies is not cut apart ID, then with the output of said input sound channel with as any output channels.

6. method as claimed in claim 5 is characterized in that, any output channels of said generation comprises:

Set the initial value of ID number, the initial value of any output channels number and the initial value of sound channel modular converter number;

Identification id;

If the ID that identifies is cut apart ID, then ID number and sound channel modular converter number are increased predetermined increment unit,

If the ID that identifies is not cut apart ID, then any output channels number is increased predetermined increment unit and the ID number is reduced predetermined increment unit; And

Repeat said identification, increase ID number and sound channel modular converter number and increase any output channels number and reduce the ID number, till the ID number arrives " 0 ".

7. like each the described method in the claim 2 to 6, also comprise:

Discern the length of said any channel configuration information and any channel configuration data without the length ground of said any channel configuration information of decoding and any channel configuration data corresponding with said any channel configuration information.

8. the device of an audio signal comprises:

The first sound channel generation unit is used to utilize the configuration element of down-mix audio signal and fundamental matrix to generate fixedly output channels; And

The second sound channel generation unit is used to utilize the configuration element of said fixedly output channels and rearmounted matrix to generate any output channels,

9. device as claimed in claim 8 is characterized in that, if the node of higher level's layer represent by cutting apart ID, then distribute corresponding to cutting apart several downstream sites to the subordinate layer,

10. device as claimed in claim 9; It is characterized in that; If the first node of said higher level's layer representes by cutting apart ID whether then said any channel configuration information indication increases to the said subordinate layer sound channel number with the corresponding downstream site of first node said higher level's layer that distribute

11. device as claimed in claim 10 is characterized in that, the said second sound channel generation unit

Sequentially discern cutting apart ID or not cutting apart ID as the configuration constituent element of said any channel configuration information;

And according to the ID execution signal Processing that is identified,

If the ID that is wherein identified is cut apart ID, then single input sound channel is connected in the sound channel modular converter and generates two subordinate's sound channels,