CN101233571B - Method and device for processing audio signal - Google Patents

Method and device for processing audio signal Download PDF

Info

Publication number
CN101233571B
CN101233571B CN2006800277709A CN200680027770A CN101233571B CN 101233571 B CN101233571 B CN 101233571B CN 2006800277709 A CN2006800277709 A CN 2006800277709A CN 200680027770 A CN200680027770 A CN 200680027770A CN 101233571 B CN101233571 B CN 101233571B
Authority
CN
China
Prior art keywords
layer
channel
configuration information
sound channel
cut apart
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN2006800277709A
Other languages
Chinese (zh)
Other versions
CN101233571A (en
Inventor
吴贤午
房熙锡
金东秀
林宰显
金孝镇
郑亮源
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020060004048A external-priority patent/KR20070031212A/en
Priority claimed from KR1020060017660A external-priority patent/KR20070014937A/en
Application filed by LG Electronics Inc filed Critical LG Electronics Inc
Publication of CN101233571A publication Critical patent/CN101233571A/en
Application granted granted Critical
Publication of CN101233571B publication Critical patent/CN101233571B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Time-Division Multiplex Systems (AREA)
  • Stereophonic System (AREA)

Abstract

A method for processing an audio signal during the multi-channel audio coding is disclosed. The present invention provides the method for processing an audio signal comprising: generating a fixed output channel using a down-mix signal and a basic matrix; and generating an arbitrary output channel using the fixed output channel and a post matrix.

Description

The method and apparatus of audio signal
Technical field
The present invention relates to the multi-channel encoder method, relate in particular to the method for audio signal.
Background of invention
In general, signal (for example piece, frequency band and sound channel) configuration in every way.In the static period of the statistical property that signal can keep being scheduled to, above-mentioned signal need not to be divided into some unit and can be processed, because this is favourable for compressed signal.
Transient state that characteristics of signals suddenly changes in the period preferably with the partitioning scheme processing signals, because prevented distorted signals.
Yet, if the user wants to handle aforementioned signal with partitioning scheme, but will be through the detailed method of the information signalingization cut apart.Therefore, be difficult to handle effectively said signal.
Summary of the invention
Therefore, the present invention relates to a kind of can the elimination in essence because the limitation of correlation technique and the method with the carve information signalingization of one or more problems that defective causes.
The one object of the present invention that is used to deal with problems be a kind of effectively will be through the method for the signal signalingization cut apart.The object of the invention can realize that this method comprises through a kind of method that is used for audio signal is provided: utilize down-mix audio signal and fundamental matrix to generate fixedly output channels; And utilize fixedly that output channels and rearmounted matrix generate any output channels.
Brief Description Of Drawings
Be included in this and so that the accompanying drawing to further understanding of the present invention to be provided embodiment of the present invention be shown, it can be used to explain principle of the present invention with instructions.
In the accompanying drawings:
Fig. 1 is the concept map that illustrates according to the method for signaling of the piece carve information of one embodiment of the present invention;
Fig. 2 and Fig. 3 are the concept maps that illustrates according to the method for signaling of the frequency band of one embodiment of the present invention and sound channel carve information;
Fig. 4 is the concept map of method that the establishment multi-channel signal of another embodiment according to the present invention is shown; And
Fig. 5 is the concept map of method for signaling that the sound channel carve information of another embodiment according to the present invention is shown.
Embodiment
Below will be in detail with reference to preferred embodiments of the present invention, its concrete exemplary plot is shown in the drawings.
Describe below in conjunction with the method for signaling of accompanying drawing carve information according to the present invention (being also referred to as " splitting information ").
Method for signaling according to carve information of the present invention is classified according to the signal classification.
Before describing the present invention, should be understood that said signal disposes in every way, for example piece, frequency band and sound channel.Described " method for signaling " can comprise the implication that the implication of " signalingization " perhaps " is discerned the signal of signalingization ".
Term " node " is that the indication signal has been cut apart or undivided point.
Term " spatial information " is can multi-channel audio or the information of channel expansion audio mixing multi-channel signal.
Should be pointed out that " spatial information " but the representation space parameter, yet it is not limited to said example, but can be applied to other example when needed.Said spatial parameter is the levels of channels poor (CLD) of energy difference between two sound channels of indication, the sound channel predictive coefficient (CPC) of indicating the inter-channel coherence (ICC) of correlativity between two sound channels and being used for creating from two sound channels three sound channels.
Down in the face of piece is cut apart, band segmentation and sound channel are cut apart and be elaborated.
1) piece is cut apart
Requirement is handled to compress the continuous data in the time domain with the mode identical with sound signal with piece.
Term " piece processing " is illustrated on the interval of preset distance and handles input signal with partitioning scheme.
In this case, said interval is defined as " piece ", and the one or more formations " frame " that combine.
Said frame can represent to be used to send/store the unit of data.
Term " piece is cut apart " or " piece partition " can be represented a kind of detailed process, in this process, during signal Processing, input signal are become the piece of different length.
Term " block length information " is meant and is shown in the customizing messages of handling the block length that is obtained when input signal becomes a plurality of of different length with input signal.In general, if signal with the configuration of the form of piece, then signal Processing is with long piece or short block completion.
Using under the situation of short block,, and making piece after the combination corresponding to single long piece with some short blocks combinations.
Yet for each at interval, signal has various characteristic, therefore is difficult to definite fatefully all signals and all can handles according to long block signal processing scheme and short block signal Processing scheme.
Preferably, in specific interval, from the piece of the different length that is fit to characteristics of signals, select the piece of a length-specific, execution block is cut apart on selected block subsequently.
In more detail, all pieces are configured to have two or more different length.Can from frame, select the piece of the predetermined length in these two or more different length pieces in every way.For this reason, need indicate to comprise which piece in the present frame, so need be used for the method for signaling of aforesaid operations.
Said method for signaling is divided into order method for signaling and classification method for signaling.
Order method for signaling predefine frame length (length of promptly being represented by " N ") also uses the number M of minimum length piece to carry out the signaling process.
In this case, frame length " N " is the multiple of particular value M.Frame length can be a fixed value, perhaps can be the occurrence that can send to the destination as additional information.
For example; Suppose that N is 2048 (N=2048); M is 256 (M=256), and all piece arranged with 256 → 256 → 1024 → 512 order, and then block length information can be carried out the signaling processing by M*1, M*1, M*4, M*2 → 1,1,4,2 → 0,0,3,1 order.
The classification method for signaling can be divided into method of sending layer depth information and the method for not sending layer depth information, below in conjunction with accompanying drawing it is elaborated.
Fig. 1 is the concept map that illustrates according to the method for signaling of the piece carve information of one embodiment of the present invention; With reference to Fig. 1, each layer is with " layer " expression, and layer depth is set as " 5 ".
" layer 1 " comprises first 210, and it is the longest piece of the base unit cut apart as piece, and first 210 length is N.
Reference numeral (1), (2) ..., (a) and (b), (c) and (d) the exemplary binary signaling sequence of expression.
According to this embodiment, indicator dog whether represent by cutting apart ID (identifier) and not cutting apart ID by divided carve information.Optional network specific digit " 1 " is used as cuts apart ID, and optional network specific digit " 0 " is used as and does not cut apart ID.
The said ID of cutting apart and do not cut apart ID and in the node of each layer, represent.
Cut apart ID and indicate the predetermined block that is included in higher level's layer to be divided into the halves in subordinate's layer, and also indicate to this subordinate's layer and distributed downstream site.The predetermined block of not cutting apart in the ID indication higher level layer is not cut apart by subordinate's layer, and also indication does not have to this subordinate's layer distribution and do not cut apart the corresponding any downstream site of node that ID representes by this.Do not distribute downstream site to mean and do not carry out other signaling operation.
Because the value of first 210 piece carve information (1) is 1 in top (i.e. layer 1), therefore ground floor 210 execution blocks is cut apart.
Layer 2 as level layer under the layer 1 comprises two pieces 220 and 221, and the length of each piece is N/2.
The piece carve information (2) that is included in the piece 220 in the layer 2 has value " 1 ", and the piece carve information (3) of piece 221 has value " 1 ", thereby comprises four pieces 230,231,232 and 233 as the layer 3 of level layer under the layer 2, and each block length is N/4.
Be " 0 " with the value that is included in the piece carve information (4) that the piece 230 of layer in 3 be associated.The value of the piece carve information (5) that is associated with piece 231 is " 1 ".The value of the piece carve information (6) that is associated with piece 232 is " 1 ".Be " 0 " with the value that is included in the piece carve information (7) that the piece 233 of layer in 3 be associated.
Therefore, according to the piece carve information of layer 3, the piece 230 and 233 execution blocks of layer 3 are not cut apart, but the piece 231 and 232 execution blocks of layer 3 are cut apart.
In this case, do not distribute downstream sites to layer 4 as subordinate's layer of the said piece of cutting apart without piece 230 of layer 3 and 233.
The piece of cutting apart through piece 231 and 232 of layer 3 distributes downstream site to subordinate's layer.Whether the existence that piece is cut apart shows in downstream site.
Layer 4 length is N/8, and is included in the piece 240 and 241 that is partitioned on the basis of piece 231 of layer 3, also is included in other piece 242 and 243 that is partitioned on the basis of layers 3 piece 232.The value of the piece carve information (8) that is associated with the piece 240 of layer 4 is " 0 ".The value of the piece carve information (9) that is associated with the piece 241 of layer 4 is " 1 ".The value of the piece carve information (a) that is associated with the piece 242 of layer 4 is " 0 ".The value of the piece carve information (b) that is associated with the piece 243 of layer 4 is " 0 ".
Therefore, according to the piece carve information of layer 4, execution block is not cut apart on the piece 240,242 and 243 of layer 4, but execution block is cut apart on the piece 241 of layer 4.
In this case, do not distribute downstream sites to layer 5 as subordinate's layer of the said piece of cutting apart without piece 240,242 of layer 4 and 243.
The piece of cutting apart through piece 241 of layer 4 distributes a downstream site to layer 5, thereby it indicates whether to exist piece to cut apart in said downstream site.
Layer 5 length is N/16, and is included in the piece 250 and 251 that is partitioned on the basis of piece 241 of layer 4.
The value of the piece carve information (c) that is associated with the piece 250 of layer 5 is " 0 ".The value of the piece carve information (d) that is associated with the piece 251 of layer 5 is " 0 ".
Therefore, the value of each contained piece is " 0 " in the layer 5, cuts apart thereby no longer carry out the classification piece, can identify so the piece of piece is cut apart the degree of depth.
The layout structure of the piece that can be cut apart by the classification piece comprises N/4 piece (being that length is the piece of N/4), N/8 piece, N/16 piece, N/16 piece, N/8 piece, N/8 piece and N/8 piece.
If signal length is N, the piece of then cutting apart through piece has formula " N/x i" expression (and wherein i=1,2 ..., P, P is integer and x=2) any one (being N/2, N/4, N/8, N/16 and N/32 ...) in the length of expression.
Expression can according to binary signaling sequence (1) (2) (3) (4) (5) (6) (7) (8) (9) (a) (b) (c) under the situation of (d) information of being cut apart by the piece of binary number representation, the piece carve information can be represented by 13 bits " 1110110010000 ".
Above explanation an example scenario is disclosed, the depth information in its middle level is by expression separately but only through discerning by cutting apart ID and not cutting apart the piece carve information that ID representes.
Yet, should be noted that other piece carve information of other presentation layer depth information also can carry out the signaling processing.
For example, layer depth information can stop ID and cut apart continuation ID representing by cutting apart.
Stop ID can be illustrated in wherein the lowermost layer that execution block is no longer cut apart said cutting apart.The said continuation ID of cutting apart can represent all the other each layers except that lowermost layer.In this situation, cut apart and continue ID and cut apart by " 1 " expression and stop ID and represent by " 0 ".
Layer depth shown in Fig. 1 is " 5 ", also can use to cut apart to stop ID " 0 " and cut apart continuation ID " 1 " representing with " 11110 ".
Sub-block length can be discerned by said method for signaling.
Like this, in the situation of representing depth information separately, only can represent not cut apart ID, thereby the signaling processing procedure can carried out within the scope of the last layer of current layer to lowermost layer at the node place that distributes to lowermost layer.
For example; Supposing to cut apart ID is not cut apart ID and is represented by " 0 " by " 1 " expression; And cut apart and continue ID by " 1 " expression and cut apart and stop ID by " 0 " expression, the node that indication distributes to lowermost layer whether divided particular value can be cut apart " 0 " expression of termination by indication.
2) band segmentation
Below in conjunction with Fig. 2-3 band segmentation is described.
Fig. 2 is the concept map of method for signaling that the band segmentation information of another embodiment according to the present invention is shown.
Fig. 2 illustrates the classification band segmentation of the tree structure that is configured to Methods of Subband Filter Banks.The frequency resolution of subband can define in every way, will be elaborated to it below.
The piece of comparing Fig. 1 is cut apart, and the band segmentation of Fig. 2 comprises a plurality of frequency bands in top, and the top of Fig. 1 is made up of single long piece.
According to this embodiment, the indication frequency band whether represent by cutting apart ID and not cutting apart ID by divided band segmentation information.To be worth " 1 " as cutting apart ID, and be used as and do not cut apart ID and will be worth " 0 ".
Cut apart ID and do not cut apart ID and can indicate at every layer node place.
Cutting apart ID indicates the frequency band of M layer to be divided into halves at (M+1) layer.Do not cut apart ID and indicate the frequency band of M layer not cut apart at (M+1) layer, also indication is not to subordinate's layer distribution and by not cutting apart the corresponding any downstream site of node that ID representes.Do not distribute downstream site to mean and do not carry out other signaling operation.
Comprise first to the 6th frequency band 310,311,312,313,314 and 315 as top layer 1.
The band segmentation information (1) of first frequency band 310 is represented by " 1 ".The band segmentation information (2) of second frequency band 311 is represented by " 1 ".The band segmentation information (3) of the 3rd frequency band 312 is represented by " 0 ".The band segmentation information (4) of the 4th frequency band 313 is represented by " 0 ".The band segmentation information (5) of the 5th frequency band 314 is represented by " 0 ".The band segmentation information (6) of the 6th frequency band 315 is represented by " 0 ".
Said band segmentation information is indicated in the node place that distributes to layer 1.According to band segmentation information (1) and (2), first frequency band 310 produces signal conversion module 310T, and second frequency band 311 produces signal conversion module 311T, thereby in layer 2, produces subordinate's frequency band 320,321,322 and 323.Distributed downstream site to subordinate's frequency band 320,321,322 and 323.It should be noted that said signal conversion module also is called as " frequency band conversion module " in this embodiment.
Simultaneously, it is not carried out band shared the 3rd, the 4th, the 5th or the 6th frequency band 312,313,314 or 315 and do not produce the frequency band conversion module.Equally, in the 3rd, the 4th, the 5th or the 6th frequency band 312,313,314 or 315, do not produce and layer 2 corresponding subordinate frequency band.Therefore, do not distribute and 312,313,314 and 315 corresponding any downstream sites to layer 2.
Layer 2 is included in two frequency bands 320,321 that are partitioned on layer 1 frequency band 320 basis, and is included in two frequency bands 322 and 323 that are partitioned on the frequency band 311 of layer 1.
The band segmentation information (7) of frequency band 320 by " 1 " expression.The band segmentation information (8) of frequency band 321 by " 1 " expression.The band segmentation information (9) of frequency band 322 is represented by " 0 ".The band segmentation information (10) of frequency band 323 is represented by " 0 ".
According to said band segmentation information (7) and (8), frequency band 320 produces frequency band conversion module 320T, and frequency band 321 produces frequency band conversion module 321T, thereby in layer 3, produces subordinate's frequency band 330,331,332 and 333.Distributed downstream site to subordinate's frequency band 330,331,332 and 333.
Simultaneously, it is not carried out band shared frequency band 322 and 323 and do not produce the frequency band conversion module.In frequency band 322 and 323, do not produce and layer 3 corresponding subordinate frequency band yet.Therefore, do not distribute downstream site to frequency band 322 and 323 yet.
Layer 3 is included in two frequency bands 330,331 that are partitioned on layer 2 frequency band 320 basis, and is included in two frequency bands 332 and 333 that are partitioned on the frequency band 321 of layer 2.
The band segmentation information (11) of frequency band 330 by " 1 " expression.The band segmentation information (12) of frequency band 331 is represented by " 0 ".The band segmentation information (13) of the 3rd frequency band 332 is represented by " 0 ".The band segmentation information (14) of frequency band 333 is represented by " 0 ".
According to said band segmentation information (11), frequency band 330 produces signal conversion module 330T, and in layer 4, produces subordinate's frequency band 340 and 341.Distributed downstream site to subordinate's frequency band 340 and 341.
Simultaneously, it is not carried out band shared frequency band 331,332 and 333 and do not produce the frequency band conversion module.In frequency band 331,332 and 333, do not produce and layer 4 corresponding subordinate layer yet.Therefore, do not distribute downstream site to frequency band 322 and 323 yet.Therefore, do not distribute downstream site to frequency band 331,332 and 333 yet.
Layer 4 is included in two frequency bands 340 and 341 that are partitioned on the basis of layer 3 frequency band 330.
The band segmentation information (15) of frequency band 340 is represented by " 0 ".The band segmentation information (16) of frequency band 341 is represented by " 0 ".
Therefore, no longer include and can carry out band shared subordinate layer, the signaling processing procedure stops.In this case, lowermost layer equals layer 4.
Under expression can be according to the situation of binary signaling sequence (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) (16) by binary number representation piece carve information, the piece carve information can be represented by 16 bits " 1100001100100000 ".
Fig. 3 is the block scheme of method for signaling that the band segmentation information of another embodiment according to the present invention is shown.
Compare with Fig. 2, with regard to carrying out band shared method, the band segmentation of Fig. 3 is similar with the situation of Fig. 2.
Yet as shown in Figure 3, the binary sequence of the band segmentation information among Fig. 3 is different with Fig. 2.
Therefore; Under expression can be according to the situation of binary signaling sequence (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) (16) by binary number representation piece carve information, the piece carve information can be represented by 16 bits " 1110001001000000 ".
Above explanation an example scenario is disclosed, the depth information in its middle level is by expression separately but only through discerning by cutting apart ID and not cutting apart the band segmentation information that ID representes.
Yet, should be noted that other band segmentation information of presentation layer depth information also can be carried out the signaling processing separately.
For example, layer depth information can stop ID and cut apart continuation ID representing by cutting apart.
The described termination ID of cutting apart representes no longer to carry out band shared lowermost layer.The said continuation ID of cutting apart can represent all the other each layers except that lowermost layer.In this situation, cut apart and continue ID and cut apart by " 1 " expression and stop ID and represent by " 0 ".
Layer depth shown in Fig. 2~3 is " 4 ", also can use to cut apart to stop ID " 0 " and cut apart continuation ID " 1 " representing with " 1110 ".Subband length can be discerned by described method for signaling.
Like this, in the situation of representing depth information separately, only can represent not cut apart ID, thereby the signaling processing procedure can carried out within the scope of the last layer of current layer to lowermost layer at the node place that distributes to lowermost layer.
For example; Supposing to cut apart ID is not cut apart ID and is represented by " 0 " by " 1 " expression; Cut apart and continue ID by " 1 " expression and cut apart and stop ID by " 0 " expression, then indicate the node that distributes to lowermost layer whether divided particular value can cut apart " 0 " expression of termination by indication.
3) sound channel is cut apart
The sound channel carve information relates to the channel configuration information that is used for channel configuration, so hereinafter will be cut apart sound channel with reference to described channel configuration information and be elaborated.
Especially, with an example at length setting forth the channel configuration that when multi-channel audio signal is carried out Code And Decode, is obtained.
Fundamental space information is needed when multi-channel audio signal is encoded.Said fundamental space information comprise the basic configuration information that can represent the configuration information related with basic environmental facies and with the corresponding master data of said basic configuration information.In addition, multi-channel audio coding optionally requires extending space information.Said extending space information comprise indication and the expanded configuration information of expanding the configuration information that environment is associated and with the corresponding growth data of said expanded configuration information.The configuration information of said expansion environment can exist one or more.Said expansion environment can be identified by type I D.
Simultaneously, the channel configuration by the reference of said multi-channel signal coding mainly is divided into two kinds of channel configuration, promptly basic channel configuration and expansion channel configuration.
One or more channel configuration information are used as said basic channel configuration information.Especially, basic channel configuration information is indicated a channel configuration information of from some channel configuration information, selecting.
For ease of explanation, basic channel configuration information is called as " fixedly channel configuration information ", and a plurality of sound channels (being multichannel) of creating according to fixing channel configuration information are called as " fixedly output channels ".
Fixedly channel configuration information is that the said fixing output channels of establishment is desired with the channel configuration data that is associated.
Fixedly channel configuration information can be represented a channel configuration constituent element in some channel configuration constituent elements of setting up in advance.The said channel configuration of setting up in advance can be represented in every way.For example, sound channel can be configured to the form of " 5-1-5 ", " 5-2-5 ", " 7-2-7 " or " 7-5-7 ".
Said " 5-2-5 " configuration can be represented a kind of concrete channel structure, and wherein six input sound channels are become two sound channels by multi-channel audio (downmixed), and are exported to six sound channels through the sound channel of multi-channel audio." 5-2-5 " is configured to all the other outer channel configuration to have and the identical channel structure of " 5-2-5 " configuration.
Said fixedly channel configuration information is comprised in the basic configuration information, and the data that are associated with fixing channel configuration information are comprised in the master data.
Various parameters can be used as said master data; For example, inter-channel coherence (ICC) parameter of the correlativity between levels of channels poor (CLD) parameter of energy difference, two sound channels of indication and be used for creating sound channel predictive coefficient (CPC) parameter of three sound channels between two sound channels of indication from two sound channels.
Said expansion channel configuration indication accordings to the fixedly channel configuration of channel configuration formation.
Said expansion channel configuration is by forming arbitrarily through encoded signals.For ease of explanation, the expansion channel configuration information is called as any channel configuration information, and the multichannel of being created by any channel configuration information is called as any output channels.Said any channel configuration information is comprised in the expanded configuration information, and is identified by the type I D that is called sound channel ID.
Be comprised in the growth data with the corresponding any channel configuration data of any channel configuration information.
If desired, for simple to operate, said any channel configuration data can only be used the CLD parameter of energy difference between two sound channels of expression.
Channel configuration information is represented by cutting apart ID and not cutting apart ID arbitrarily.The increase of cutting apart ID indication sound channel number as the ingredient of said any channel configuration information.Do not cut apart ID and indicate a kind of particular case, wherein the sound channel number does not change.
For example, cut apart input sound channel of ID indication and be converted into two output channels.Not cutting apart ID indication input sound channel does not do any change and is promptly exported on the sound channel number.
Represented to cut apart under the situation of ID at higher level's node layer place, in subordinate's layer, created subordinate's sound channel, and distributed the downstream site corresponding with the sound channel of being created to subordinate's layer to higher level's layer channel allocation.
Yet, represented not cut apart in the situation of ID at higher level's node layer place to the channel allocation of higher level's layer, in subordinate's layer, do not create subordinate's sound channel, therefore distribute and the corresponding downstream site of subordinate's sound channel to subordinate's layer.
Cut apart ID and do not cut apart the method that ID representes said any channel configuration information below in conjunction with Fig. 2~3 pair use and describe.
Fig. 2~3 not only illustrate said band segmentation and sound channel also is shown cuts apart.
At first Fig. 2 is specified as follows.
Comprise six frequency bands 310,311,312,313,314 and 315 as top layer 1.Said frequency band 310,311,312,313,314 and 315 can serve as said fixing multichannel respectively.According to the present invention, cut apart ID and do not cut apart ID and represent by " 0 " by " 1 " expression.The method of representing any channel configuration information sequentially representes to be included in value " 0 " or " 1 " in the node that the sound channel 310,311,312,313,314 and 315 of layer 1 is distributed.
The method of representing any channel configuration information sequentially representes to be included in value " 0 " or " 1 " in the node that the sound channel 320,321,322 and 323 of layer 2 is distributed.
The method of representing any channel configuration information sequentially representes to be included in value " 0 " or " 1 " in the node that the sound channel 330,331,332 and 333 of layer 3 is distributed.
The method of representing any channel configuration information sequentially representes to be included in value " 0 " or " 1 " in the node that the sound channel 340 and 341 of layer 4 is distributed.
Whether in other words, whether said method sequentially indicates the sound channel number to increase at the node place of higher level's layer, and sequentially indicate the sound channel number to increase at the node place of subordinate's layer subsequently.
Any channel configuration information according to said method is represented by 16 bits " 1100001100100000 ".For ease of explanation, represent that the method for any channel configuration information is called as " hierarchical priority method ".
Method according to any channel configuration information of expression shown in Figure 3; If when the first node that obtains signaling higher level's layer as a result the time from the first node of higher level's layer by " 1 " expression, then whether sequentially increase with the corresponding all downstream site indication sound channel numbers of the first node of higher level's layer.If when the first node that obtains signaling higher level's layer as a result the time from the first node of higher level's layer by " 0 " expression, then present node moves to higher level's Section Point, so that whether Section Point indication sound channel number sequentially increases.Therefore, any channel configuration information that is obtained by said method is represented by 16 bits " 1110001001000000 ".
For ease of explanation, represent that the method for any channel configuration information is called as " branch's priority approach ".
Specify with reference to Fig. 4 below and create fixedly output channels and the method for output channels arbitrarily.
Fig. 4 is the concept map that illustrates according to the method for establishment multi-channel signal of the present invention.
With reference to Fig. 4, create out any output channels (y) through the calculating between down-mix audio signal (x) and the fundamental matrix (m1), and create out another any output channels (z) through the fixing calculating between output channels (y) and the rearmounted matrix (m2).Can there be two or more fundamental matrixs (m1) where necessary.
The configuration element that can use at least one and said fixedly channel configuration information among CLD, ICC, the CPC to obtain fundamental matrix (m1).
Can use CLD and said any channel configuration information to obtain the configuration element of rearmounted matrix (m2).
To method that create any output channels be elaborated below.
At first, the method for using any channel configuration information to dispose any sound channel is elaborated.
Down in the face of using said branch priority approach to represent that the illustrative methods of said any channel configuration information describes.
Said illustrative methods was sequentially discerned as the cutting apart ID and do not cut apart ID of the configuration constituent element of any channel configuration information, and according to the ID execution signal Processing that is identified.
Cut apart ID if the ID that is identified is confirmed as, then an input sound channel is connected to the sound channel modular converter as an example of conversion of signals, consequently creates out two subordinate's sound channels.
Otherwise, if being confirmed as, the ID that is identified do not cut apart ID, then the sound channel number is not made the aforementioned input sound channel of any change ground output.
To provide its detailed description below.
In the phase one, the initial value of ID number that be decoded is changed to " 1 ", and the initial value of output channels number is changed to " 0 " arbitrarily, and the initial value of sound channel modular converter number is changed to " 0 ".
In subordinate phase, decoded ID is wanted in identification.
In the phase III, if being confirmed as, the ID that is identified cuts apart ID, then sound channel modular converter number increases progressively 1, and the ID number that will be identified increases progressively 1.
Do not cut apart ID if the ID that is identified is confirmed as, then the output channels number increases progressively 1 arbitrarily, and the ID number that will be identified successively decreases 1.
Repeat aforementioned second and the phase III, up to wanting decoded ID number to arrive " 0 ".
Output channels number according to fixing repeats aforementioned signal processing method.
For example, any channel configuration that when any channel configuration information is represented by " 11100010010000 ", obtains is shown among Fig. 3.In this case, " 1 " expression is cut apart ID and ID is not cut apart in " 0 " expression.The number of " 1 " is represented the number of sound channel modular converter (being the signal conversion module of Fig. 3), and the number of " 0 " is represented the number of any output channels.
Simultaneously, fixedly output channels can be reset (that is, remapping) by different order, and can that kind as shown in Figure 5 subsequently create out any output channels.Fig. 5 is the concept map that illustrates according to the method for signaling of sound channel carve information of the present invention.
With reference to Fig. 5, fixedly output channels 310,311,312,313,314 and 315 is reset by remapping module 100.Fixedly output channels 310 ', 311 ', 312 ', 313 ', 314 ' and the 315 ' sound channel as the superiors after the rearrangement is to create said any output channels.Needless to say, can reset or remap said any output channels by different order.
Simultaneously, if comprising the sound channel map information that the sound channel of any channel configuration information is mapped to loudspeaker in the channel configuration information arbitrarily, then output channels also can be mapped to this loudspeaker arbitrarily.
The explanation of front discloses a kind of exemplary cases, presentation layer depth information separately not wherein, but can be through coming the identification layer depth information by cutting apart ID and not cutting apart any channel configuration information that ID representes.
Yet, should be noted that and also can represent other any channel configuration information of presentation layer depth information separately.
For example, layer depth information can stop ID and cut apart continuation ID representing by cutting apart.
Stop ID can be illustrated in and wherein no longer carry out the lowermost layer that sound channel is cut apart said cutting apart.The said continuation ID of cutting apart can represent all the other each layers except that lowermost layer.In this situation, cut apart and continue ID and cut apart by " 1 " expression and stop ID and represent by " 0 ".
Layer depth shown in Fig. 2~3 is " 4 ", also can use to cut apart to stop ID " 0 " and cut apart continuation ID " 1 " representing with " 1110 ".
Like this, in the situation of representing depth information separately, only can represent not cut apart ID, thereby the signaling processing procedure can carried out within the scope of the last layer of current layer to lowermost layer at the node place that distributes to lowermost layer.
For example; Supposing to cut apart ID is not cut apart ID and is represented by " 0 " by " 1 " expression; Cut apart and continue ID by " 1 " expression and cut apart and stop ID by " 0 " expression, then indicate the node that distributes to lowermost layer whether divided particular value can cut apart " 0 " expression of termination by indication.
Although aforementioned circumstances is actual generation, lowermost layer can be discerned through said depth information, and infers and have abridged value " 0 ", thereby said any output channels is able to configuration.
Simultaneously, although said any channel configuration information is sent to demoder, should be noted that demoder can not use any channel configuration information that receives in case of necessity.The aforementioned operation of demoder may occur in a kind of exemplary cases, and wherein demoder identifies the size of any channel configuration information and any channel configuration information, but skips and the corresponding preset range of said size.
Those skilled in that art are appreciated that and can make various modifications and variation to the present invention and do not break away from the spirit or scope of the present invention.Therefore, the present invention is intended to cover modification of the present invention and variation, as long as they drop in the scope of appended claims and equivalents thereof.
Industrial applicability
Method for signaling according to carve information of the present invention has following effect.
At first, if the long piece of predetermined length is divided into the short block of different length, method for signaling then according to the present invention can use minimum bit number to realize the signalingization of classification piece carve information.
Secondly, according to method for signaling of the present invention not needs send the customizing messages that indicator signal is handled employed bit number separately, and not only can identify the layer depth after cutting apart according to signal but also can discern end through the signal of signalingization through signalingization.
Moreover, can use minimum bit number a plurality of subbands to be divided into the subband (subband that for example has the different frequency bandwidth) of a plurality of different sizes according to method for signaling of the present invention.
The 4th, can carry out the signalingization of the customizing messages that is associated with channel expansion audio mixing processing procedure according to method for signaling of the present invention, this signal that allows in input sound channel, to receive is via the output channels output more much more than input sound channel number.

Claims (11)

1. the method for an audio signal comprises:
Utilize the configuration element of down-mix audio signal and fundamental matrix to generate fixedly output channels; And
Utilize the configuration element of said fixedly output channels and rearmounted matrix to generate any output channels,
The configuration element of wherein said fundamental matrix be utilize that levels of channels is poor, in inter-channel coherence and the sound channel predictive coefficient at least one and fixedly channel configuration information obtain,
The configuration element of said rearmounted matrix utilizes levels of channels difference and any channel configuration information to obtain, and
Said any channel configuration information indication uses to cut apart identifier ID and do not cut apart identifier ID whether increase the sound channel number.
2. the method for claim 1 is characterized in that, if the node of higher level's layer represent by cutting apart ID, then distribute corresponding to cutting apart several downstream sites to the subordinate layer,
And if the node of higher level's layer is represented by not cutting apart ID, then do not distribute downstream site to subordinate's layer.
3. method as claimed in claim 2 is characterized in that, said any channel configuration information indicates sequentially whether the sound channel number increases at the node place of said higher level's layer, and indicates sequentially whether the sound channel number increases at the downstream site place of said subordinate layer.
4. method as claimed in claim 2; It is characterized in that; If the first node of said higher level's layer representes by cutting apart ID whether then said any channel configuration information indication increases to the said subordinate layer sound channel number with the corresponding downstream site of first node said higher level's layer that distribute
And if the first node of said higher level's layer is represented by not cutting apart ID, whether then said any channel configuration information indicates the sound channel number of the Section Point of said higher level's layer to increase.
5. method as claimed in claim 4 is characterized in that, any output channels of said generation comprises:
Sequentially discern cutting apart ID or not cutting apart ID as the configuration constituent element of said any channel configuration information; And
Carry out signal Processing according to the ID that is identified, if the ID that is wherein identified is cut apart ID, then single input sound channel is connected in the sound channel modular converter and generates two subordinate's sound channels,
And if the ID that identifies is not cut apart ID, then with the output of said input sound channel with as any output channels.
6. method as claimed in claim 5 is characterized in that, any output channels of said generation comprises:
Set the initial value of ID number, the initial value of any output channels number and the initial value of sound channel modular converter number;
Identification id;
If the ID that identifies is cut apart ID, then ID number and sound channel modular converter number are increased predetermined increment unit,
If the ID that identifies is not cut apart ID, then any output channels number is increased predetermined increment unit and the ID number is reduced predetermined increment unit; And
Repeat said identification, increase ID number and sound channel modular converter number and increase any output channels number and reduce the ID number, till the ID number arrives " 0 ".
7. like each the described method in the claim 2 to 6, also comprise:
Discern the length of said any channel configuration information and any channel configuration data without the length ground of said any channel configuration information of decoding and any channel configuration data corresponding with said any channel configuration information.
8. the device of an audio signal comprises:
The first sound channel generation unit is used to utilize the configuration element of down-mix audio signal and fundamental matrix to generate fixedly output channels; And
The second sound channel generation unit is used to utilize the configuration element of said fixedly output channels and rearmounted matrix to generate any output channels,
The configuration element of wherein said fundamental matrix be utilize that levels of channels is poor, in inter-channel coherence and the sound channel predictive coefficient at least one and fixedly channel configuration information obtain,
The configuration element of said rearmounted matrix utilizes levels of channels difference and any channel configuration information to obtain, and
Said any channel configuration information indication uses to cut apart identifier ID and do not cut apart identifier ID whether increase the sound channel number.
9. device as claimed in claim 8 is characterized in that, if the node of higher level's layer represent by cutting apart ID, then distribute corresponding to cutting apart several downstream sites to the subordinate layer,
And if the node of higher level's layer is represented by not cutting apart ID, then do not distribute downstream site to subordinate's layer.
10. device as claimed in claim 9; It is characterized in that; If the first node of said higher level's layer representes by cutting apart ID whether then said any channel configuration information indication increases to the said subordinate layer sound channel number with the corresponding downstream site of first node said higher level's layer that distribute
And if the first node of said higher level's layer is represented by not cutting apart ID, whether then said any channel configuration information indicates the sound channel number of the Section Point of said higher level's layer to increase.
11. device as claimed in claim 10 is characterized in that, the said second sound channel generation unit
Sequentially discern cutting apart ID or not cutting apart ID as the configuration constituent element of said any channel configuration information;
And according to the ID execution signal Processing that is identified,
If the ID that is wherein identified is cut apart ID, then single input sound channel is connected in the sound channel modular converter and generates two subordinate's sound channels,
And if the ID that identifies is not cut apart ID, then with the output of said input sound channel with as any output channels.
CN2006800277709A 2005-07-29 2006-07-28 Method and device for processing audio signal Active CN101233571B (en)

Applications Claiming Priority (16)

Application Number Priority Date Filing Date Title
US70346305P 2005-07-29 2005-07-29
US60/703,463 2005-07-29
US71652605P 2005-09-14 2005-09-14
US60/716,526 2005-09-14
KR1020060004048A KR20070031212A (en) 2005-09-14 2006-01-13 Method and Apparatus for encoding/decoding audio signal
KR10-2006-0004048 2006-01-13
KR1020060004048 2006-01-13
KR10-2006-0017659 2006-02-23
KR1020060017660 2006-02-23
KR10-2006-0017660 2006-02-23
KR1020060017660A KR20070014937A (en) 2005-07-29 2006-02-23 Method and apparatus for encoding/decoding audio signal
KR1020060017659 2006-02-23
KR1020060017659A KR20070014936A (en) 2005-07-29 2006-02-23 Method and apparatus for encoding/decoding audio signal
US81602206P 2006-06-22 2006-06-22
US60/816,022 2006-06-22
PCT/KR2006/002984 WO2007013783A1 (en) 2005-07-29 2006-07-28 Method for processing audio signal

Publications (2)

Publication Number Publication Date
CN101233571A CN101233571A (en) 2008-07-30
CN101233571B true CN101233571B (en) 2012-12-05

Family

ID=37683623

Family Applications (5)

Application Number Title Priority Date Filing Date
CN2006800277662A Active CN101233570B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal
CN2006800277709A Active CN101233571B (en) 2005-07-29 2006-07-28 Method and device for processing audio signal
CN2006800274861A Active CN101233568B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal
CN2006800274908A Active CN101233569B (en) 2005-07-29 2006-07-28 Method for signaling of splitting information
CN2006800274842A Active CN101233567B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN2006800277662A Active CN101233570B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal

Family Applications After (3)

Application Number Title Priority Date Filing Date
CN2006800274861A Active CN101233568B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal
CN2006800274908A Active CN101233569B (en) 2005-07-29 2006-07-28 Method for signaling of splitting information
CN2006800274842A Active CN101233567B (en) 2005-07-29 2006-07-28 Method for generating encoded audio signal and method for processing audio signal

Country Status (7)

Country Link
EP (5) EP1915756A4 (en)
KR (5) KR100857103B1 (en)
CN (5) CN101233570B (en)
AU (1) AU2006273012B2 (en)
CA (1) CA2617050C (en)
RU (1) RU2414741C2 (en)
WO (5) WO2007013780A1 (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1312660C (en) 2002-04-22 2007-04-25 皇家飞利浦电子股份有限公司 Signal synthesizing
ATE527833T1 (en) 2006-05-04 2011-10-15 Lg Electronics Inc IMPROVE STEREO AUDIO SIGNALS WITH REMIXING
WO2008044901A1 (en) 2006-10-12 2008-04-17 Lg Electronics Inc., Apparatus for processing a mix signal and method thereof
KR101100221B1 (en) 2006-11-15 2011-12-28 엘지전자 주식회사 A method and an apparatus for decoding an audio signal
WO2008069584A2 (en) 2006-12-07 2008-06-12 Lg Electronics Inc. A method and an apparatus for decoding an audio signal
KR101100222B1 (en) 2006-12-07 2011-12-28 엘지전자 주식회사 A method an apparatus for processing an audio signal
KR20080082916A (en) 2007-03-09 2008-09-12 엘지전자 주식회사 A method and an apparatus for processing an audio signal
CN103299363B (en) * 2007-06-08 2015-07-08 Lg电子株式会社 A method and an apparatus for processing an audio signal
CN101828219B (en) 2007-09-06 2012-05-09 Lg电子株式会社 A method and an apparatus of decoding an audio signal
WO2009072685A1 (en) * 2007-12-06 2009-06-11 Lg Electronics Inc. A method and an apparatus for processing an audio signal
EP2475116A4 (en) 2009-09-01 2013-11-06 Panasonic Corp Digital broadcasting transmission device, digital broadcasting reception device, digital broadcasting reception system
TWI444989B (en) 2010-01-22 2014-07-11 Dolby Lab Licensing Corp Using multichannel decorrelation for improved multichannel upmixing
US9679572B2 (en) 2013-04-23 2017-06-13 The Korea Development Bank Method and apparatus for encoding/decoding scalable digital audio using direct audio channel data and indirect audio channel data
KR101421201B1 (en) * 2013-04-23 2014-07-22 한국산업은행 Method and apparatus for encoding/decoding scalable digital audio using uncompressed audio channel data and compressed audio channel data
TWI771266B (en) 2015-03-13 2022-07-11 瑞典商杜比國際公司 Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
RU2728535C2 (en) * 2015-09-25 2020-07-30 Войсэйдж Корпорейшн Method and system using difference of long-term correlations between left and right channels for downmixing in time area of stereophonic audio signal to primary and secondary channels

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1278996A (en) * 1997-09-05 2001-01-03 雷克西康公司 5-2-5 Matrix encoder and decoder system
CN1647157A (en) * 2002-04-22 2005-07-27 皇家飞利浦电子股份有限公司 Signal synthesizing

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ES2087522T3 (en) * 1991-01-08 1996-07-16 Dolby Lab Licensing Corp DECODING / CODING FOR MULTIDIMENSIONAL SOUND FIELDS.
DE4209544A1 (en) * 1992-03-24 1993-09-30 Inst Rundfunktechnik Gmbh Method for transmitting or storing digitized, multi-channel audio signals
KR100265112B1 (en) * 1997-03-31 2000-10-02 윤종용 Dvd dics and method and apparatus for dvd disc
US20030035553A1 (en) * 2001-08-10 2003-02-20 Frank Baumgarte Backwards-compatible perceptual coding of spatial cues
KR100978018B1 (en) * 2002-04-22 2010-08-25 코닌클리케 필립스 일렉트로닉스 엔.브이. Parametric representation of spatial audio
BR0304542A (en) * 2002-04-22 2004-07-20 Koninkl Philips Electronics Nv Method and encoder for encoding a multichannel audio signal, apparatus for providing an audio signal, encoded audio signal, storage medium, and method and decoder for decoding an audio signal
SE0402650D0 (en) * 2004-11-02 2004-11-02 Coding Tech Ab Improved parametric stereo compatible coding or spatial audio
US7787631B2 (en) * 2004-11-30 2010-08-31 Agere Systems Inc. Parametric coding of spatial audio with cues based on transmitted channels
KR100682904B1 (en) * 2004-12-01 2007-02-15 삼성전자주식회사 Apparatus and method for processing multichannel audio signal using space information
US7903824B2 (en) * 2005-01-10 2011-03-08 Agere Systems Inc. Compact side information for parametric coding of spatial audio

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1278996A (en) * 1997-09-05 2001-01-03 雷克西康公司 5-2-5 Matrix encoder and decoder system
CN1647157A (en) * 2002-04-22 2005-07-27 皇家飞利浦电子股份有限公司 Signal synthesizing

Also Published As

Publication number Publication date
CN101233567B (en) 2011-06-15
CN101233568A (en) 2008-07-30
CN101233569B (en) 2010-09-01
KR20080036119A (en) 2008-04-24
KR100888970B1 (en) 2009-03-17
EP1920438A4 (en) 2010-01-06
EP1920437A1 (en) 2008-05-14
CN101233570B (en) 2011-06-22
CN101233571A (en) 2008-07-30
KR100841332B1 (en) 2008-06-25
EP1915757A1 (en) 2008-04-30
RU2008107773A (en) 2009-09-10
EP1915756A1 (en) 2008-04-30
WO2007013781A1 (en) 2007-02-01
CA2617050C (en) 2012-10-09
KR100857104B1 (en) 2008-09-05
WO2007013780A1 (en) 2007-02-01
EP1920439A4 (en) 2010-01-06
AU2006273012A1 (en) 2007-02-01
EP1915757A4 (en) 2010-01-06
WO2007013775A1 (en) 2007-02-01
KR100857103B1 (en) 2008-09-08
WO2007013783A1 (en) 2007-02-01
KR100857102B1 (en) 2008-09-08
KR20080034002A (en) 2008-04-17
CN101233567A (en) 2008-07-30
CN101233568B (en) 2010-10-27
CN101233569A (en) 2008-07-30
KR20080033452A (en) 2008-04-16
CA2617050A1 (en) 2007-02-01
CN101233570A (en) 2008-07-30
AU2006273012B2 (en) 2010-06-24
WO2007013784A1 (en) 2007-02-01
EP1915756A4 (en) 2010-01-06
KR20080035656A (en) 2008-04-23
KR20080030686A (en) 2008-04-04
RU2414741C2 (en) 2011-03-20
EP1920439A1 (en) 2008-05-14
EP1920437A4 (en) 2010-01-06
EP1920438A1 (en) 2008-05-14

Similar Documents

Publication Publication Date Title
CN101233571B (en) Method and device for processing audio signal
US7702407B2 (en) Method for generating encoded audio signal and method for processing audio signal
CN101297353B (en) Apparatus for encoding and decoding audio signal and method thereof
KR100880642B1 (en) Method and apparatus for decoding an audio signal
JP2010156822A (en) Sound compression coding device and decoding device of multi-channel sound signal
KR20070014936A (en) Method and apparatus for encoding/decoding audio signal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant