CN101233567A - Method for generating encoded audio signal and method for processing audio signal - Google Patents
- Publication number
- CN101233567A (application CNA2006800274842, CN200680027484)
- Authority
- CN
- China
- Prior art keywords
- layer
- channel
- configuration information
- channel configuration
- output channels
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Mobile Radio Communication Systems (AREA)
- Time-Division Multiplex Systems (AREA)
- Stereophonic System (AREA)
Abstract
A method for generating an encoded audio signal and a method for processing that signal in multi-channel audio coding are disclosed. The method for generating an encoded audio signal comprises: including fixed channel configuration information, which is the configuration information of predetermined output channels; and including arbitrary channel configuration information.
Description
Technical field
The present invention relates to a multi-channel encoding method, and more particularly to a method for generating an encoded audio signal and a method for processing an audio signal.
Background of invention
In general, a signal can be configured in various ways, for example as blocks, frequency bands, and channels. During a stationary period, in which the signal keeps predetermined statistical characteristics, the signal can be processed without being divided into units, because this is favorable for compressing the signal.
During a transient period, in which the signal characteristics change abruptly, the signal is preferably processed in a divided manner, because this prevents signal distortion.
However, if a user wants to process such a signal in a divided manner, there has been no detailed method for signaling the division information. It is therefore difficult to process the signal efficiently.
Summary of the invention
Accordingly, the present invention is directed to a method for signaling division information that substantially obviates one or more problems caused by the limitations and disadvantages of the related art.
An object of the present invention is to provide a method for efficiently signaling a divided signal.
The object of the present invention can be achieved by providing a method for generating an encoded audio signal, the method comprising: including fixed channel configuration information, which serves as the configuration information of predetermined output channels; and including arbitrary channel configuration information.
Brief description of the drawings
The accompanying drawings, which are included to provide a further understanding of the invention, illustrate embodiments of the invention and together with the description serve to explain the principle of the invention.
In the drawings:
Fig. 1 is a conceptual diagram illustrating a method for signaling block division information according to one embodiment of the present invention;
Figs. 2 and 3 are conceptual diagrams illustrating a method for signaling band division information and channel division information according to one embodiment of the present invention;
Fig. 4 is a conceptual diagram illustrating a method for creating a multi-channel signal according to another embodiment of the present invention; and
Fig. 5 is a conceptual diagram illustrating a method for signaling channel division information according to another embodiment of the present invention.
Embodiments
Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings. A method for signaling division information (also called "split information") according to the present invention is described below with reference to the drawings.
The method for signaling division information according to the present invention is classified according to the signal category.
Before describing the present invention, it should be understood that a signal can be configured in various ways, for example as blocks, frequency bands, and channels.
The term "signaling method" may cover both "signaling" and "identifying a signaled signal".
The term "node" refers to a point that indicates whether a signal has been divided or not.
The term "spatial information" refers to information that makes it possible to downmix or upmix a multi-channel signal.
It should be noted that "spatial information" may denote spatial parameters; however, it is not limited to this example and may be applied to other examples as needed.
The spatial parameters are the channel level difference (CLD), which indicates the energy difference between two channels; the inter-channel coherence (ICC), which indicates the correlation between two channels; and the channel prediction coefficient (CPC), which is used to create three channels from two channels.
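The text describes CLD and ICC only in words. As a minimal sketch, assuming the commonly used definitions (CLD as an energy ratio in decibels, ICC as a normalized cross-correlation), the two parameters could be computed as follows. The function names, the `eps` guard against division by zero, and the formulas themselves are illustrative assumptions and are not taken from the patent text.

```python
import math


def channel_level_difference_db(ch1, ch2, eps=1e-12):
    """Assumed definition: energy ratio of two channels in dB."""
    e1 = sum(x * x for x in ch1)
    e2 = sum(x * x for x in ch2)
    return 10.0 * math.log10((e1 + eps) / (e2 + eps))


def inter_channel_coherence(ch1, ch2, eps=1e-12):
    """Assumed definition: normalized cross-correlation at zero lag."""
    cross = sum(a * b for a, b in zip(ch1, ch2))
    e1 = sum(x * x for x in ch1)
    e2 = sum(x * x for x in ch2)
    return cross / math.sqrt(e1 * e2 + eps)


left = [2.0, 4.0, 6.0]
right = [1.0, 2.0, 3.0]
print(round(channel_level_difference_db(left, right), 2))  # about 6.02 dB
print(round(inter_channel_coherence(left, right), 3))       # about 1.0
```

Doubling the amplitude of one channel quadruples its energy, which is why the level difference comes out near 6 dB while the coherence stays near 1.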
Block division, band division, and channel division are described in detail below.
1) Block division
Data that is continuous in the time domain, such as an audio signal, must be processed in blocks in order to be compressed.
The term "block processing" means processing an input signal in a divided manner over intervals of a predetermined length.
Here, each such interval is defined as a "block", and one or more blocks combined form a "frame".
A frame may denote the unit used to transmit or store data.
The term "block division" or "block partitioning" may denote a process in which the input signal is divided into blocks of different lengths during signal processing.
The term "block length information" refers to information indicating the block lengths obtained when the input signal is divided into a plurality of blocks of different lengths.
In general, if a signal is configured in the form of blocks, signal processing is performed with long blocks or short blocks.
When short blocks are used, several short blocks are combined so that the combined blocks correspond to a single long block.
However, a signal has different characteristics in each interval, so it is difficult to decide once and for all that every signal can be handled by either the long-block or the short-block processing scheme.
Preferably, a block of a specific length suited to the signal characteristics is selected for each interval from among blocks of different lengths, and block division is then performed on the selected block.
More specifically, blocks are configured to have two or more different lengths. Blocks of predetermined lengths from among these two or more lengths can be selected within a frame in various ways.
It must therefore be indicated which blocks the current frame contains, and a signaling method is needed for this purpose.
The signaling method is divided into a sequential signaling method and a hierarchical signaling method.
The sequential signaling method predefines the frame length (denoted "N") and performs signaling in units of the minimum block length M.
In this case, the frame length N is a multiple of M. The frame size may be a fixed value, or a value that is sent to the destination as additional information.
For example, suppose that N is 2048 (N=2048), M is 256 (M=256), and the blocks are arranged in the order 256 → 256 → 1024 → 512. The block length information can then be signaled as M*1, M*1, M*4, M*2 → 1, 1, 4, 2 → 0, 0, 3, 1.
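The sequential example can be sketched in a few lines. This is an illustration under stated assumptions: the function name is hypothetical, and the final "subtract one" step is inferred from the mapping 1, 1, 4, 2 → 0, 0, 3, 1 shown in the example rather than stated as a rule in the text.

```python
def sequential_block_codes(block_lengths, frame_length, min_block):
    """Map block lengths to codes: (length / M) - 1, as in the worked example."""
    if frame_length % min_block != 0:
        raise ValueError("frame length N must be a multiple of M")
    if sum(block_lengths) != frame_length:
        raise ValueError("block lengths must add up to the frame length N")
    codes = []
    for length in block_lengths:
        if length % min_block != 0:
            raise ValueError("each block length must be a multiple of M")
        codes.append(length // min_block - 1)
    return codes


print(sequential_block_codes([256, 256, 1024, 512], frame_length=2048, min_block=256))
# [0, 0, 3, 1]
```

The check that the block lengths add up to N reflects the premise that the blocks fill exactly one frame (256 + 256 + 1024 + 512 = 2048).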
The hierarchical signaling method can be divided into a method that sends layer depth information and a method that does not; both are described in detail below with reference to the drawings.
Fig. 1 is a conceptual diagram illustrating a method for signaling block division information according to one embodiment of the present invention.
With reference to Fig. 1, each level is denoted as a "layer", and the layer depth is set to "5".
"Layer 1" contains the first block 210, which is the longest block and serves as the base unit of block division; the length of the first block 210 is N.
Reference symbols (1), (2), ..., (a), (b), (c), and (d) denote an exemplary binary signaling sequence. According to the present embodiment, the block division information, which indicates whether a block is divided, is represented by a division ID (identifier) and a non-division ID. The digit "1" is used as the division ID, and the digit "0" is used as the non-division ID.
The division ID and the non-division ID are represented at the nodes of each layer.
The division ID indicates that a given block in an upper layer is divided into two halves in the lower layer, and also indicates that lower nodes are allocated to that lower layer.
The non-division ID indicates that a given block in an upper layer is not divided in the lower layer, and also indicates that no lower node corresponding to the node carrying the non-division ID is allocated to the lower layer. Allocating no lower node means that no further signaling is performed.
Since the block division information (1) of the first block 210 in the top layer (layer 1) has the value "1", block division is performed on the first block 210.
The block division information (2) of block 220 in layer 2 has the value "1", and the block division information (3) of block 221 has the value "1", so layer 3, the layer below layer 2, contains four blocks 230, 231, 232, and 233, each of length N/4.
The value of the block division information (4) associated with block 230 of layer 3 is "0". The value of the block division information (5) associated with block 231 is "1". The value of the block division information (6) associated with block 232 is "1". The value of the block division information (7) associated with block 233 of layer 3 is "0".
Therefore, according to the block division information of layer 3, block division is not performed on blocks 230 and 233 of layer 3, but is performed on blocks 231 and 232 of layer 3.
In this case, the undivided blocks 230 and 233 of layer 3 allocate no lower nodes to layer 4, the layer below.
The divided blocks 231 and 232 of layer 3 allocate lower nodes to the lower layer, and those lower nodes indicate whether block division is present.
The value of the block division information (8) associated with block 240 of layer 4 is "0". The value of the block division information (9) associated with block 241 of layer 4 is "1". The value of the block division information (a) associated with block 242 of layer 4 is "0".
The value of the block division information (b) associated with block 243 of layer 4 is "0". Therefore, according to the block division information of layer 4, block division is not performed on blocks 240, 242, and 243 of layer 4, but is performed on block 241 of layer 4.
In this case, the undivided blocks 240, 242, and 243 of layer 4 allocate no lower nodes to layer 5, the layer below.
The divided block 241 allocates lower nodes to layer 5, and those lower nodes indicate whether block division is present.
The value of each block contained in layer 5 is "0", so hierarchical block division goes no further, and the division depth of the blocks can thus be identified.
The block layout produced by this hierarchical block division consists of an N/4 block (that is, a block of length N/4), an N/8 block, an N/16 block, an N/16 block, an N/8 block, an N/8 block, and an N/4 block, which together span the full length N.
If the signal length is N, each block produced by block division has one of the lengths given by the formula N/x^i (where i = 1, 2, ..., P, P is an integer, and x = 2), that is, N/2, N/4, N/8, N/16, or N/32.
When the block division information is expressed as a binary number following the binary signaling sequence (1)(2)(3)(4)(5)(6)(7)(8)(9)(a)(b)(c)(d), it can be represented by the 13 bits "1110110010000".
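The walk through Fig. 1 can be checked with a short decoder. The sketch below assumes the bits are read layer by layer, left to right, which is the order in which (1) to (d) are described above. The function name and the dictionary-based tree are illustrative choices, not part of the patent.

```python
from fractions import Fraction


def decode_block_division(bits):
    """Return leaf block lengths (as fractions of N), left to right."""
    root = {"length": Fraction(1), "children": []}
    layer = [root]  # nodes of the current layer, left to right
    pos = 0         # index of the next unread bit
    while layer:
        next_layer = []
        for node in layer:
            if pos >= len(bits):
                raise ValueError("bit string ended before all nodes were signaled")
            if bits[pos] == "1":  # division ID: split into two halves
                half = node["length"] / 2
                node["children"] = [
                    {"length": half, "children": []},
                    {"length": half, "children": []},
                ]
                next_layer.extend(node["children"])
            pos += 1
        layer = next_layer
    if pos != len(bits):
        raise ValueError("unused bits remain")

    def leaves(node):
        if not node["children"]:
            return [node["length"]]
        return [length for child in node["children"] for length in leaves(child)]

    return leaves(root)


layout = decode_block_division("1110110010000")
print([str(length) for length in layout])
# ['1/4', '1/8', '1/16', '1/16', '1/8', '1/8', '1/4']
```

The decoded lengths add up to exactly N: the first and last blocks (230 and 233, which are not divided) each have length N/4.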
The above description discloses an example in which the layer depth information is not represented separately but is identified solely from the block division information represented by the division ID and the non-division ID.
However, it should be noted that additional information representing the layer depth can also be signaled separately.
For example, the layer depth information can be represented by a division termination ID and a division continuation ID.
The division termination ID denotes the lowest layer, in which block division is no longer performed. The division continuation ID denotes each of the remaining layers other than the lowest layer. In this case, the division continuation ID is represented by "1" and the division termination ID by "0".
The layer depth shown in Fig. 1 is "5", which can be expressed as "11110" using the division termination ID "0" and the division continuation ID "1".
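The depth code described here is effectively a unary code: one continuation ID per layer above the lowest, followed by one termination ID. A minimal sketch, with hypothetical function names:

```python
def encode_layer_depth(depth):
    """One '1' per layer above the lowest layer, then a terminating '0'."""
    if depth < 1:
        raise ValueError("layer depth must be at least 1")
    return "1" * (depth - 1) + "0"


def decode_layer_depth(bits):
    """Count bits up to and including the first '0' (the termination ID)."""
    return bits.index("0") + 1  # raises ValueError if no termination ID is present


print(encode_layer_depth(5))        # 11110
print(decode_layer_depth("11110"))  # 5
```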
The sub-block lengths can be identified by this signaling method.
Thus, when the depth information is represented separately, the nodes allocated to the lowest layer can only carry the non-division ID, so signaling needs to be performed only from the current layer down to the layer immediately above the lowest layer.
For example, assuming that the division ID is represented by "1", the non-division ID by "0", the division continuation ID by "1", and the division termination ID by "0", the value indicating whether a node allocated to the lowest layer is divided can be conveyed by the "0" that indicates division termination.
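One way to read this is that, once the depth is known, the bits for the lowest layer can be dropped from the division bit string because they are necessarily "0". The sketch below follows that reading, which is an interpretation rather than a procedure spelled out in the text. The function name and the `top_count` parameter (number of nodes in layer 1) are assumptions.

```python
def division_bits_per_layer(bits, depth, top_count=1):
    """Split a division bit string into layers when the depth is signaled separately.

    Bits for the lowest layer are not read; they are implied to be '0'.
    """
    layers = []
    pos, count = 0, top_count
    for layer in range(1, depth + 1):
        if count == 0:
            raise ValueError("tree ends above the signaled depth")
        if layer == depth:
            layers.append("0" * count)  # implied non-division IDs
            break
        layer_bits = bits[pos:pos + count]
        if len(layer_bits) != count:
            raise ValueError("bit string too short")
        layers.append(layer_bits)
        pos += count
        count = 2 * layer_bits.count("1")  # each division creates two lower nodes
    if pos != len(bits):
        raise ValueError("unused bits remain")
    return layers


# Fig. 1 with depth 5: the 13-bit string shrinks to 11 explicit bits.
print(division_bits_per_layer("11101100100", depth=5))
# ['1', '11', '0110', '0100', '00']
```

Under this reading the Fig. 1 example needs 11 division bits plus the 5-bit depth code instead of 13 division bits, so the saving depends on how many nodes sit in the lowest layer.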
2) Band division
Band division is described below with reference to Figs. 2 and 3.
Fig. 2 is a conceptual diagram illustrating a method for signaling band division information according to another embodiment of the present invention.
Fig. 2 illustrates hierarchical band division configured as a tree-structured subband filter bank. The frequency resolution of the subbands can be defined in various ways, as described in detail below.
Compared with the block division of Fig. 1, the top layer in the band division of Fig. 2 contains a plurality of bands, whereas the top layer of Fig. 1 consists of a single long block.
According to the present embodiment, the band division information, which indicates whether a band is divided, is represented by a division ID and a non-division ID. The value "1" is used as the division ID, and the value "0" is used as the non-division ID.
The division ID and the non-division ID can be indicated at the nodes of each layer. The division ID indicates that a band of layer M is divided into two halves in layer (M+1).
The non-division ID indicates that a band of layer M is not divided in layer (M+1), and also indicates that no lower node corresponding to the node carrying the non-division ID is allocated to the lower layer. Allocating no lower node means that no further signaling is performed.
Layer 1, the top layer, contains the first to sixth bands 310, 311, 312, 313, 314, and 315.
The band division information (1) of the first band 310 is "1". The band division information (2) of the second band 311 is "1". The band division information (3) of the third band 312 is "0". The band division information (4) of the fourth band 313 is "0". The band division information (5) of the fifth band 314 is "0". The band division information (6) of the sixth band 315 is "0".
This band division information is indicated at the nodes allocated to layer 1.
According to band division information (1) and (2), the first band 310 gives rise to signal conversion module 310T and the second band 311 gives rise to signal conversion module 311T, producing lower bands 320, 321, 322, and 323 in layer 2. Lower nodes are allocated to the lower bands 320, 321, 322, and 323. Note that in the present embodiment the signal conversion module is also called a "band conversion module".
Meanwhile, the third to sixth bands 312, 313, 314, and 315, on which band division is not performed, produce no band conversion module. Likewise, no lower bands corresponding to layer 2 are produced from the third to sixth bands. Therefore, no lower nodes corresponding to bands 312, 313, 314, and 315 are allocated to layer 2.
The band division information (7) of band 320 is "1". The band division information (8) of band 321 is "1". The band division information (9) of band 322 is "0". The band division information (10) of band 323 is "0".
According to band division information (7) and (8), band 320 gives rise to band conversion module 320T and band 321 gives rise to band conversion module 321T, producing lower bands 330, 331, 332, and 333 in layer 3. Lower nodes are allocated to the lower bands 330, 331, 332, and 333.
Meanwhile, bands 322 and 323, on which band division is not performed, produce no band conversion module. No lower bands corresponding to layer 3 are produced from bands 322 and 323 either. Therefore, no lower nodes are allocated for bands 322 and 323.
The band division information (11) of band 330 is "1". The band division information (12) of band 331 is "0". The band division information (13) of band 332 is "0". The band division information (14) of band 333 is "0".
According to band division information (11), band 330 gives rise to signal conversion module 330T and produces lower bands 340 and 341 in layer 4. Lower nodes are allocated to the lower bands 340 and 341.
Meanwhile, bands 331, 332, and 333, on which band division is not performed, produce no band conversion module. No lower bands corresponding to layer 4 are produced from bands 331, 332, and 333 either.
Therefore, no lower nodes are allocated for bands 331, 332, and 333.
The band division information (15) of band 340 is "0". The band division information (16) of band 341 is "0".
Therefore, no lower layer remains on which band division can be performed, and the signaling process ends. In this case, the lowest layer is layer 4.
When the band division information is expressed as a binary number following the binary signaling sequence (1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14)(15)(16), it can be represented by the 16 bits "1100001100100000".
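The 16-bit string can be split back into its layers mechanically, because the number of nodes in each layer is twice the number of division IDs in the layer above. A minimal sketch follows; the function name and the `top_count` parameter (six top-layer bands in Fig. 2) are illustrative assumptions.

```python
def split_bits_by_layer(bits, top_count):
    """Split a layer-by-layer division bit string into one substring per layer."""
    layers = []
    pos, count = 0, top_count
    while count > 0:
        layer_bits = bits[pos:pos + count]
        if len(layer_bits) != count:
            raise ValueError("bit string too short")
        layers.append(layer_bits)
        pos += count
        count = 2 * layer_bits.count("1")  # each divided band yields two lower bands
    if pos != len(bits):
        raise ValueError("unused bits remain")
    return layers


layers = split_bits_by_layer("1100001100100000", top_count=6)
print(layers)                                      # ['110000', '1100', '1000', '00']
print(len(layers))                                 # 4 layers
print(sum(layer.count("0") for layer in layers))   # 11 undivided bands in total
```

The four substrings correspond to bits (1) to (6), (7) to (10), (11) to (14), and (15) to (16), and the last layer found is layer 4, matching the description.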
Fig. 3 is a block diagram illustrating a method for signaling band division information according to another embodiment of the present invention.
With regard to how band division is performed, the band division of Fig. 3 is similar to that of Fig. 2.
However, as shown in Fig. 3, the binary signaling sequence of the band division information in Fig. 3 differs from that in Fig. 2.
When the band division information is expressed as a binary number following the binary signaling sequence (1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14)(15)(16), it can be represented by the 16 bits "1110001001000000".
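The text does not say how the ordering in Fig. 3 differs from that in Fig. 2. One reading that reproduces both bit strings from the same tree is that Fig. 2 lists nodes layer by layer, while Fig. 3 lists each band immediately followed by its sub-bands (a depth-first, pre-order walk). The sketch below checks that reading. The tuple-based tree and the function names are illustrative, and the pre-order interpretation is an inference from the bits, not a statement in the patent.

```python
from collections import deque

LEAF = ()  # an undivided band has no sub-bands

# Band tree as described for Fig. 2 (a tuple lists the two sub-bands).
band_330 = (LEAF, LEAF)          # sub-bands 340, 341
band_320 = (band_330, LEAF)      # sub-bands 330, 331
band_321 = (LEAF, LEAF)          # sub-bands 332, 333
band_310 = (band_320, band_321)  # sub-bands 320, 321
band_311 = (LEAF, LEAF)          # sub-bands 322, 323
top_layer = [band_310, band_311, LEAF, LEAF, LEAF, LEAF]


def breadth_first_bits(roots):
    """Emit one bit per node, layer by layer."""
    bits = []
    queue = deque(roots)
    while queue:
        node = queue.popleft()
        bits.append("1" if node else "0")
        queue.extend(node)
    return "".join(bits)


def depth_first_bits(roots):
    """Emit one bit per node, each node followed directly by its sub-bands."""
    bits = []

    def visit(node):
        bits.append("1" if node else "0")
        for child in node:
            visit(child)

    for root in roots:
        visit(root)
    return "".join(bits)


print(breadth_first_bits(top_layer))  # 1100001100100000
print(depth_first_bits(top_layer))    # 1110001001000000
```

Both orderings carry the same tree in 16 bits; they differ only in when a decoder learns about the deeper layers.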
The above description discloses an example in which the layer depth information is not represented separately but is identified solely from the band division information represented by the division ID and the non-division ID.
However, it should be noted that additional information representing the layer depth can also be signaled separately. For example, the layer depth information can be represented by a division termination ID and a division continuation ID.
The division termination ID denotes the lowest layer, in which band division is no longer performed. The division continuation ID denotes each of the remaining layers other than the lowest layer. In this case, the division continuation ID is represented by "1" and the division termination ID by "0".
The layer depth shown in Figs. 2 and 3 is "4", which can be expressed as "1110" using the division termination ID "0" and the division continuation ID "1".
The subband lengths can be identified by this signaling method.
Thus, when the depth information is represented separately, the nodes allocated to the lowest layer can only carry the non-division ID, so signaling needs to be performed only from the current layer down to the layer immediately above the lowest layer.
For example, assuming that the division ID is represented by "1", the non-division ID by "0", the division continuation ID by "1", and the division termination ID by "0", the value indicating whether a node allocated to the lowest layer is divided can be conveyed by the "0" that indicates division termination.
3) Channel division
Channel division information relates to the channel configuration information used for channel configuration, so channel division is described in detail below with reference to the channel configuration information.
In particular, an example of the channel configuration obtained when a multi-channel audio signal is encoded and decoded is described in detail.
Basic spatial information is required when a multi-channel audio signal is encoded. The basic spatial information comprises basic configuration information, which represents the configuration information associated with a basic environment, and basic data corresponding to the basic configuration information.
In addition, multi-channel audio coding may optionally require extended spatial information. The extended spatial information comprises extended configuration information, which indicates the configuration information associated with an extended environment, and extended data corresponding to the extended configuration information. One or more items of configuration information may exist for the extended environment. The extended environment can be identified by a type ID.
Meanwhile, the channel configurations referred to in multi-channel signal coding are mainly divided into two kinds: the basic channel configuration and the extended channel configuration.
One or more items of channel configuration information are used as the basic channel configuration information. In particular, the basic channel configuration information indicates one item of channel configuration information selected from several.
For convenience of description, the basic channel configuration information is called "fixed channel configuration information", and the plurality of channels (that is, the multi-channel) created according to the fixed channel configuration information are called "fixed output channels".
The fixed channel configuration information and its associated channel configuration data are required to create the fixed output channels.
The fixed channel configuration information may denote one of several channel configurations established in advance. The pre-established channel configurations can be expressed in various ways. For example, the channels can be configured in the form "5-1-5", "5-2-5", "7-2-7", or "7-5-7".
The "5-2-5" configuration denotes a channel structure in which six input channels are downmixed to two channels and the downmixed channels are output to six channels. The channel configurations other than "5-2-5" have channel structures analogous to that of the "5-2-5" configuration.
The fixed channel configuration information is contained in the basic configuration information, and the data associated with the fixed channel configuration information is contained in the basic data.
Various parameters can be used as the basic data, for example the channel level difference (CLD) parameter, which indicates the energy difference between two channels; the inter-channel coherence (ICC) parameter, which indicates the correlation between two channels; and the channel prediction coefficient (CPC) parameter, which is used to create three channels from two channels.
The extended channel configuration indicates a channel configuration formed on the basis of the fixed channel configuration.
The extended channel configuration is formed arbitrarily by the encoded signal. For convenience of description, the extended channel configuration information is called "arbitrary channel configuration information", and the multi-channel created by the arbitrary channel configuration information is called "arbitrary output channels".
The arbitrary channel configuration information is contained in the extended configuration information and is identified by a type ID called the channel ID.
The arbitrary channel configuration data corresponding to the arbitrary channel configuration information is contained in the extended data.
If necessary, to keep operation simple, the arbitrary channel configuration data may use only the CLD parameter, which represents the energy difference between two channels.
The arbitrary channel configuration information is represented by a division ID and a non-division ID. The division ID, as a component of the arbitrary channel configuration information, indicates an increase in the number of channels. The non-division ID indicates the case in which the number of channels does not change.
For example, the division ID indicates that one input channel is converted into two output channels. The non-division ID indicates that the input channel is output without any change in the number of channels.
When a division ID is represented at the upper-layer node allocated to a channel of the upper layer, lower channels are created in the lower layer, and lower nodes corresponding to the created channels are allocated to the lower layer.
However, when a non-division ID is represented at the upper-layer node allocated to a channel of the upper layer, no lower channel is created in the lower layer, and therefore no lower node corresponding to a lower channel is allocated to the lower layer.
A method of representing the arbitrary channel configuration information using split IDs and non-split IDs is described below with reference to Figs. 2 and 3.
Figs. 2 and 3 illustrate channel splitting as well as band splitting.
Fig. 2 is described in detail first.
Layer 1, the top layer, comprises six bands 310, 311, 312, 313, 314 and 315. The bands 310, 311, 312, 313, 314 and 315 can each serve as one of the fixed multi-channels. According to the present invention, a split ID is represented by "1" and a non-split ID is represented by "0".
The method of representing the arbitrary channel configuration information first sequentially signals the value "0" or "1" contained in the nodes assigned to channels 310, 311, 312, 313, 314 and 315 of layer 1.
It then sequentially signals the value "0" or "1" contained in the nodes assigned to channels 320, 321, 322 and 323 of layer 2.
It then sequentially signals the value "0" or "1" contained in the nodes assigned to channels 330, 331, 332 and 333 of layer 3.
Finally, it sequentially signals the value "0" or "1" contained in the nodes assigned to channels 340 and 341 of layer 4.
In other words, the method sequentially indicates whether the number of channels increases at each node of the upper layer, and only then sequentially indicates whether the number of channels increases at each node of the lower layer.
The arbitrary channel configuration information according to this method is represented by the 16 bits "1100001100100000".
For convenience of description, this method of representing the arbitrary channel configuration information is referred to as the "hierarchical priority method".
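The hierarchical priority method can be sketched as a layer-by-layer (breadth-first) traversal. The sketch below is illustrative only: the list-based tree model, the helper names leaf, split and encode_layer_first, and the exact tree shape are assumptions, the shape being inferred from the two 16-bit strings quoted in this description rather than taken from the drawing itself.

```python
from collections import deque

# Hypothetical tree model: a node is the list of its children.
# A non-split channel is an empty list; a split channel has two children.
def leaf():
    return []

def split(first, second):
    return [first, second]

# Assumed shape of Fig. 2: six top-layer channels (310..315);
# 310 and 311 are split, then 320 and 321, then 330.
FIG2_TOP_LAYER = [
    split(split(split(leaf(), leaf()), leaf()), split(leaf(), leaf())),  # 310
    split(leaf(), leaf()),                                               # 311
    leaf(), leaf(), leaf(), leaf(),                                      # 312..315
]

def encode_layer_first(top_layer):
    """Signal every node of one layer before any node of the next layer."""
    bits = []
    queue = deque(top_layer)
    while queue:
        node = queue.popleft()
        bits.append("1" if node else "0")  # "1" = split ID, "0" = non-split ID
        queue.extend(node)                 # children are queued for the next layer
    return "".join(bits)

print(encode_layer_first(FIG2_TOP_LAYER))  # 1100001100100000
```

Under these assumptions the six layer-1 flags, four layer-2 flags, four layer-3 flags and two layer-4 flags concatenate to the 16-bit string given above.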
According to the method of representing the arbitrary channel configuration information shown in Fig. 3, if signaling the first node of the upper layer yields "1", all lower nodes corresponding to that first node sequentially indicate whether the number of channels increases. If signaling the first node of the upper layer yields "0", the current node moves to the second node of the upper layer, so that the second node indicates whether the number of channels increases. The arbitrary channel configuration information obtained by this method is therefore represented by the 16 bits "1110001001000000".
For convenience of description, this method of representing the arbitrary channel configuration information is referred to as the "branch priority method".
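The branch priority method corresponds to a pre-order (depth-first) traversal. The following sketch is an illustration under assumptions: the list-based tree model, the helper names and the tree shape are hypothetical, with the shape inferred from the bit strings quoted in this description.

```python
# Hypothetical tree model: a node is the list of its children.
# A non-split channel is an empty list; a split channel has two children.
def leaf():
    return []

def split(first, second):
    return [first, second]

# Assumed shape of Fig. 3: six top-layer channels, the first two of which are split.
FIG3_TOP_LAYER = [
    split(split(split(leaf(), leaf()), leaf()), split(leaf(), leaf())),  # 310
    split(leaf(), leaf()),                                               # 311
    leaf(), leaf(), leaf(), leaf(),                                      # 312..315
]

def encode_branch_first(top_layer):
    """Signal a node, then its whole subtree, before moving to the next node."""
    bits = []

    def visit(node):
        bits.append("1" if node else "0")  # "1" = split ID, "0" = non-split ID
        for child in node:
            visit(child)

    for channel in top_layer:
        visit(channel)
    return "".join(bits)

print(encode_branch_first(FIG3_TOP_LAYER))  # 1110001001000000
```

The same assumed tree therefore yields a different bit order from the hierarchical priority method, while the number of "1" and "0" values stays the same.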
A method of creating the fixed output channels and the arbitrary output channels is described in detail below with reference to Fig. 4.
Fig. 4 is a conceptual diagram illustrating a method of creating a multi-channel signal according to the present invention.
Referring to Fig. 4, the fixed output channels (y) are created by a calculation between the downmix audio signal (x) and the basic matrix (m1), and the arbitrary output channels (z) are created by a calculation between the fixed output channels (y) and the post matrix (m2). Two or more basic matrices (m1) may be present where necessary.
The elements of the basic matrix (m1) can be obtained using at least one of the CLD, ICC and CPC together with the fixed channel configuration information.
The elements of the post matrix (m2) can be obtained using the CLD and the arbitrary channel configuration information.
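As an illustration of how a CLD value could determine the gains of a single one-to-two split, the sketch below assumes that the CLD is expressed in dB as the energy ratio of the first output to the second, and that the split preserves total energy. Neither assumption is stated in this description, and the function names split_gains and split_channel are hypothetical.

```python
import math

def split_gains(cld_db):
    """Return (g1, g2) such that g1**2 / g2**2 == 10**(cld_db / 10) and g1**2 + g2**2 == 1.

    Assumption: cld_db is the energy difference in dB between the two output channels.
    """
    ratio = 10.0 ** (cld_db / 10.0)
    g1 = math.sqrt(ratio / (1.0 + ratio))
    g2 = math.sqrt(1.0 / (1.0 + ratio))
    return g1, g2

def split_channel(samples, cld_db):
    """Convert one input channel into two lower channels scaled by the CLD gains."""
    g1, g2 = split_gains(cld_db)
    return [g1 * s for s in samples], [g2 * s for s in samples]

first, second = split_channel([1.0, -0.5, 0.25], cld_db=0.0)
print(first, second)  # with a CLD of 0 dB both outputs carry equal energy
```

In this simplified view, each split contributes one such gain pair, and the pairs for all splits together would populate the post matrix (m2).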
The method of creating the arbitrary output channels is described in detail below.
First, a method of configuring the arbitrary channels using the arbitrary channel configuration information is described in detail.
An exemplary method in which the arbitrary channel configuration information is represented using the branch priority method is described below.
The exemplary method sequentially identifies the split IDs and non-split IDs that are the constituent elements of the arbitrary channel configuration information, and performs signal processing according to each identified ID.
If the identified ID is determined to be a split ID, the input channel is connected to a channel conversion module, which is an example of a signal conversion module, so that two lower channels are created.
Otherwise, if the identified ID is determined to be a non-split ID, the input channel is output without any change in the number of channels.
A detailed description is given below.
In the first stage, the initial value of the number of IDs to be decoded is set to "1", the initial value of the number of arbitrary output channels is set to "0", and the initial value of the number of channel conversion modules is set to "0".
In the second stage, the ID to be decoded is identified.
In the third stage, if the identified ID is determined to be a split ID, the number of channel conversion modules is incremented by 1 and the number of IDs to be decoded is incremented by 1.
If the identified ID is determined to be a non-split ID, the number of arbitrary output channels is incremented by 1 and the number of IDs to be decoded is decremented by 1.
The second and third stages are repeated until the number of IDs to be decoded reaches "0".
The above signal processing method is repeated according to the number of fixed output channels.
For example, the arbitrary channel configuration obtained when the arbitrary channel configuration information is represented by "1110001001000000" is shown in Fig. 3. In this case, "1" represents a split ID and "0" represents a non-split ID.
The number of "1" values represents the number of channel conversion modules (i.e. the signal conversion modules of Fig. 3), and the number of "0" values represents the number of arbitrary output channels.
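The three stages above can be sketched as a simple counter loop. This is a minimal illustration, not a definitive implementation: the function name parse_branch_first is hypothetical, and accumulating the module and output counts across all fixed output channels (rather than resetting them for each one) is an assumption made so that the totals can be compared with the counts of "1" and "0" values.

```python
def parse_branch_first(bits, fixed_channel_count):
    """Walk a branch-priority bit string and count modules and output channels.

    Returns (channel_conversion_modules, arbitrary_output_channels, bits_consumed).
    """
    position = 0
    modules = 0
    outputs = 0
    for _ in range(fixed_channel_count):
        pending = 1                      # stage 1: one ID to decode per fixed channel
        while pending > 0:
            if position >= len(bits):
                raise ValueError("bit string ended before all IDs were decoded")
            bit = bits[position]         # stage 2: identify the next ID
            position += 1
            if bit == "1":               # stage 3, split ID
                modules += 1
                pending += 1
            else:                        # stage 3, non-split ID
                outputs += 1
                pending -= 1
    return modules, outputs, position

print(parse_branch_first("1110001001000000", 6))  # (5, 11, 16)
```

For the six fixed output channels of Fig. 3, the loop consumes all 16 bits and reports five channel conversion modules and eleven arbitrary output channels, matching the number of "1" and "0" values in the string. Because the loop stops by itself when the pending count reaches zero, the end of the signaled information is found without a separate length field.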
Meanwhile, the fixed output channels may be rearranged (i.e. remapped) in a different order, after which the arbitrary output channels may be created, as shown in Fig. 5.
Fig. 5 is a conceptual diagram illustrating a method of signaling channel splitting information according to the present invention.
Referring to Fig. 5, the fixed output channels 310, 311, 312, 313, 314 and 315 are rearranged by the remapping module 110. The rearranged fixed output channels 310', 311', 312', 313', 314' and 315' serve as the channels of the top layer from which the arbitrary output channels are created. Needless to say, the arbitrary output channels may also be rearranged or remapped in a different order.
Meanwhile, if the arbitrary channel configuration information includes channel mapping information that maps the channels of the arbitrary channel configuration information to loudspeakers, the arbitrary output channels may also be mapped to those loudspeakers.
The preceding description discloses an exemplary case in which the layer depth information is not represented separately, but can be identified from the arbitrary channel configuration information represented by split IDs and non-split IDs.
It should be noted, however, that arbitrary channel configuration information that represents the layer depth information separately may also be used.
For example, the layer depth information may be represented by a split termination ID and a split continuation ID.
The split termination ID indicates the lowest layer, in which no further channel splitting is performed. The split continuation ID indicates each of the remaining layers other than the lowest layer. In this case, the split continuation ID is represented by "1" and the split termination ID is represented by "0".
The layer depth shown in Figs. 2 and 3 is "4", which can be represented by "1110" using the split termination ID "0" and the split continuation ID "1".
Thus, when the depth information is represented separately, only non-split IDs can occur at the nodes assigned to the lowest layer, so the signaling process need only be performed over the range from the current layer to the layer immediately above the lowest layer.
For example, suppose that a split ID is represented by "1" and a non-split ID by "0", and that the split continuation ID is represented by "1" and the split termination ID by "0". The value indicating whether a node assigned to the lowest layer is split can then be represented by the "0" that indicates split termination.
Even in this case, the lowest layer can be identified from the depth information, and the omitted "0" values can be inferred to exist, so that the arbitrary output channels are configured.
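The use of separate depth information can be sketched as follows. The sketch assumes the hierarchical priority ordering and a hypothetical 14-bit stream obtained by dropping the two lowest-layer "0" values from the 16-bit string given for Fig. 2; the function names parse_depth and expand_layer_first are likewise hypothetical.

```python
def parse_depth(bits):
    """Read split continuation IDs ("1") up to the split termination ID ("0").

    Returns (layer_depth, bits_consumed).
    """
    depth = 0
    for position, bit in enumerate(bits):
        depth += 1
        if bit == "0":
            return depth, position + 1
    raise ValueError("no split termination ID found")

def expand_layer_first(bits, top_layer_size, depth):
    """Rebuild the per-layer flags, inferring "0" for every node of the lowest layer."""
    layers = []
    position = 0
    node_count = top_layer_size
    for layer in range(1, depth + 1):
        if layer == depth:
            flags = "0" * node_count              # not transmitted, inferred from the depth
        else:
            flags = bits[position:position + node_count]
            if len(flags) < node_count:
                raise ValueError("bit string is shorter than the signaled layers")
            position += node_count
        layers.append(flags)
        node_count = 2 * flags.count("1")         # each split ID creates two lower nodes
    return layers

depth, _ = parse_depth("1110")
print(depth)                                              # 4
print(expand_layer_first("11000011001000", 6, depth))     # ['110000', '1100', '1000', '00']
```

Under these assumptions, signaling the depth costs four bits and saves the two lowest-layer flags in this example; the saving would grow with the number of nodes in the lowest layer.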
Meanwhile, although the arbitrary channel configuration information is transmitted to the decoder, it should be noted that the decoder need not use the received arbitrary channel configuration information. This may occur in an exemplary case in which the decoder identifies the arbitrary channel configuration information and its size, but skips the range corresponding to that size.
Those skilled in the art will appreciate that various modifications and variations can be made to the present invention without departing from its spirit or scope. The present invention is therefore intended to cover such modifications and variations, provided they fall within the scope of the appended claims and their equivalents.
Industrial applicability
The method of signaling splitting information according to the present invention has the following effects.
First, if a long block of predetermined length is split into short blocks of different lengths, the signaling method according to the present invention can signal the hierarchical block splitting information with a minimum number of bits.
Second, the signaling method according to the present invention does not need to separately transmit specific information indicating the number of bits used in the signaling process, and can identify both the layer depth after splitting and the end of the signaled signal from the signaled signal itself.
Third, the signaling method according to the present invention can split a plurality of subbands into a plurality of subbands of different sizes (for example, subbands having different frequency bandwidths) using a minimum number of bits.
Fourth, the signaling method according to the present invention can signal specific information associated with a channel-extension upmixing process, which allows a signal received on the input channels to be output via a larger number of output channels than input channels.
Claims (15)
1. A method of generating an encoded audio signal, comprising:
incorporating fixed channel configuration information serving as configuration information of predetermined output channels; and
incorporating arbitrary channel configuration information.
2. The method of claim 1, wherein the arbitrary channel configuration information uses a split identifier (ID) and a non-split identifier (ID) to indicate whether the number of channels increases at a node of a layer; if a node of an upper layer is represented by the split ID, lower nodes equal in number to the number of splits are assigned to a lower layer; and if the node of the upper layer is represented by the non-split ID, no lower node is assigned to the lower layer.
3. The method of claim 2, wherein the arbitrary channel configuration information sequentially indicates whether the number of channels increases at the nodes of the upper layer, and then sequentially indicates whether the number of channels increases at the lower nodes of the lower layer.
4. The method of claim 2, wherein, if a first node of the upper layer is represented by the split ID, the arbitrary channel configuration information indicates whether the number of channels increases at the lower nodes that are assigned to the lower layer and correspond to the first node of the upper layer; and if the first node of the upper layer is represented by the non-split ID, the arbitrary channel configuration information indicates whether the number of channels increases at a second node of the upper layer.
5. The method of claim 3 or 4, wherein the split ID is information indicating that one input channel is converted into two output channels.
6. The method of claim 5, wherein the non-split ID is information indicating that the input channel is output without any change in the number of channels.
7. The method of claim 1 or 6, wherein the arbitrary channel configuration information further comprises channel mapping information, the channel mapping information mapping the arbitrary output channels obtained using the arbitrary channel configuration information to loudspeaker positions.
8. A method of processing an audio signal, comprising:
receiving an encoded audio signal that includes fixed channel configuration information and arbitrary channel configuration information; and
configuring output channels using the fixed channel configuration information and the arbitrary channel configuration information.
9. The method of claim 8, wherein the arbitrary channel configuration information uses a split identifier (ID) and a non-split identifier (ID) to indicate whether the number of channels increases at a node of a layer; if a node of an upper layer is represented by the split ID, lower nodes equal in number to the number of splits are assigned to a lower layer; and if the node of the upper layer is represented by the non-split ID, no lower node is assigned to the lower layer.
10. The method of claim 9, wherein the arbitrary channel configuration information further comprises channel mapping information, the channel mapping information mapping the arbitrary output channels obtained using the arbitrary channel configuration information to loudspeaker positions.
11. The method of claim 8, wherein configuring the output channels comprises generating fixed output channels corresponding to predetermined output channels using the fixed channel configuration information, defining the fixed output channels as top-layer channels, and generating arbitrary output channels using the arbitrary channel configuration information.
12. The method of claim 11, wherein generating the arbitrary output channels comprises sequentially identifying the split identifiers (IDs) and non-split identifiers (IDs) that are the constituent elements of the arbitrary channel configuration information, and performing signal processing according to each identified ID; if the identified ID is the split ID, a single input channel is connected to a channel conversion module and two lower channels are generated; and if the identified ID is the non-split ID, the input channel is output without any change in the number of channels.
13. The method of claim 9, wherein generating the arbitrary output channels comprises: setting an initial value of the number of IDs, an initial value of the number of arbitrary output channels, and an initial value of the number of channel conversion modules; identifying an ID; if the identified ID is the split ID, increasing the number of IDs and the number of channel conversion modules by a predetermined increment unit, and if the identified ID is the non-split ID, increasing the number of arbitrary output channels by the predetermined increment unit and decreasing the number of IDs by the predetermined increment unit; and repeating the identifying, the increasing of the number of IDs and the number of channel conversion modules, and the increasing of the number of arbitrary output channels and decreasing of the number of IDs, until the number of IDs reaches zero ("0").
14. The method of claim 13, wherein generating the arbitrary output channels further comprises mapping the arbitrary output channels to loudspeakers according to the channel mapping information.
15. The method of claim 9, further comprising: identifying the length of the arbitrary channel configuration information and of the arbitrary channel configuration data corresponding to the arbitrary channel configuration information, and skipping over that length without decoding the arbitrary channel configuration information and the arbitrary channel configuration data.
Applications Claiming Priority (16)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US70346305P | 2005-07-29 | 2005-07-29 | |
US60/703,463 | 2005-07-29 | ||
US71652605P | 2005-09-14 | 2005-09-14 | |
US60/716,526 | 2005-09-14 | ||
KR1020060004048A KR20070031212A (en) | 2005-09-14 | 2006-01-13 | Method and Apparatus for encoding/decoding audio signal |
KR10-2006-0004048 | 2006-01-13 | ||
KR1020060004048 | 2006-01-13 | ||
KR1020060017659 | 2006-02-23 | ||
KR10-2006-0017660 | 2006-02-23 | ||
KR1020060017659A KR20070014936A (en) | 2005-07-29 | 2006-02-23 | Method and apparatus for encoding/decoding audio signal |
KR1020060017660 | 2006-02-23 | ||
KR10-2006-0017659 | 2006-02-23 | ||
KR1020060017660A KR20070014937A (en) | 2005-07-29 | 2006-02-23 | Method and apparatus for encoding/decoding audio signal |
US81602206P | 2006-06-22 | 2006-06-22 | |
US60/816,022 | 2006-06-22 | ||
PCT/KR2006/002982 WO2007013781A1 (en) | 2005-07-29 | 2006-07-28 | Method for generating encoded audio signal and method for processing audio signal |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101233567A (en) | 2008-07-30 |
CN101233567B (en) | 2011-06-15 |
Family
ID=37683623
Family Applications (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2006800274908A Active CN101233569B (en) | 2005-07-29 | 2006-07-28 | Method for signaling of splitting information |
CN2006800277709A Active CN101233571B (en) | 2005-07-29 | 2006-07-28 | Method and device for processing audio signal |
CN2006800274861A Active CN101233568B (en) | 2005-07-29 | 2006-07-28 | Method for generating encoded audio signal and method for processing audio signal |
CN2006800277662A Active CN101233570B (en) | 2005-07-29 | 2006-07-28 | Method for generating encoded audio signal and method for processing audio signal |
CN2006800274842A Active CN101233567B (en) | 2005-07-29 | 2006-07-28 | Method for generating encoded audio signal and method for processing audio signal |
Country Status (7)
Country | Link |
---|---|
EP (5) | EP1915756A4 (en) |
KR (5) | KR100857102B1 (en) |
CN (5) | CN101233569B (en) |
AU (1) | AU2006273012B2 (en) |
CA (1) | CA2617050C (en) |
RU (1) | RU2414741C2 (en) |
WO (5) | WO2007013781A1 (en) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |