WO2007027056A1 - A method for decoding an audio signal - Google Patents
A method for decoding an audio signal Download PDFInfo
- Publication number
- WO2007027056A1 WO2007027056A1 PCT/KR2006/003435 KR2006003435W WO2007027056A1 WO 2007027056 A1 WO2007027056 A1 WO 2007027056A1 KR 2006003435 W KR2006003435 W KR 2006003435W WO 2007027056 A1 WO2007027056 A1 WO 2007027056A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- audio signal
- timeslot
- information
- channel
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 264
- 238000000034 method Methods 0.000 title claims abstract description 58
- 238000013507 mapping Methods 0.000 claims description 22
- 238000012546 transfer Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 6
- 239000000284 extract Substances 0.000 description 6
- 108010076504 Protein Sorting Signals Proteins 0.000 description 3
- 230000006835 compression Effects 0.000 description 3
- 238000007906 compression Methods 0.000 description 3
- 210000002370 ICC Anatomy 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000010988 intraclass correlation coefficient Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
Definitions
- the present invention relates to an audio signal processing, and more particularly, to an apparatus for decoding an audio signal and method thereof.
- an audio signal encoding apparatus compresses the audio signal into a mono or stereo type downmix signal instead of compressing each multi-channel audio signal.
- the audio signal encoding apparatus transfers the compressed downmix signal to a decoding apparatus together with a spatial information signal or stores the compressed downmix signal and a spatial information signal in a storage medium.
- a spatial information signal which is extracted in downmixing a multi-channel audio signal, is used in restoring an original multi-channel audio signal from a downmix signal.
- Configuration information is non-changeable in general and a header including this information is inserted in an audio signal once. Since configuration information is transmitted by being initially inserted in an audio signal once, an audio signal decoding apparatus has a problem in decoding spatial information due to non-existence of configuration information in case of reproducing the audio signal from a random timing point.
- An audio signal encoding apparatus generates a downmix signal and a spatial information signal into bitstreams together or respectively and then transfers them to the audio signal decoding apparatus. So, if unnecessary information and the like are included in the spatial information signal, signal compression and transfer efficiencies are reduced.
- An object of the present invention is to provide an apparatus for decoding an audio signal and method thereof, by which the audio signal can be reproduced from a random timing point by selectively including a spatial information signal in a header.
- Another object of the present invention is to provide an apparatus for decoding an audio signal and method thereof, by which a position of a timeslot to which a parameter set will be applied can be efficiently represented using a variable bit number.
- Another object of the present invention is to provide an apparatus for decoding an audio signal and method thereof, by which audio signal compression and transfer efficiencies can be raised by representing an information quantity required for performing a downmix signal arrangement or mapping multi-channel to a speaker as a minimal variable bit number.
- a further object of the present invention is to provide an apparatus for decoding an audio signal and method thereof, by which an information quantity required for signal arrangement can be reduced by mapping multichannel to a speaker without performing downmix signal arrangement .
- FIG. 1 is a configurational diagram of an audio signal transferred to an audio signal decoding apparatus from an audio signal encoding apparatus according to one embodiment of the present invention.
- an audio signal includes an audio descriptor 101, a downmix signal 103 and a spatial information signal 105.
- the audio signal is able to include ancillary data as well as the audio descriptor 101 and the downmix signal 103.
- the present invention includes the spatial information signal 105 as the ancillary data.
- the audio signal is able to selectively include the audio descriptor 101.
- the audio descriptor 101 is configured with small number of basic informations necessary for audio decoding such as a transmission rate of a transmitted audio signal, a number of channels, a sampling frequency of compressed data, an identifier indicating a currently used codec and the like.
- An audio signal decoding apparatus is able to know a type of a codec done to an audio signal using the audio descriptor 101.
- the audio signal decoding apparatus is able to know whether an audio signal configures multi-channel using the spatial information signal 105 and the downmix signal 103.
- the audio descriptor 101 is located independently from the downmix signal 103 or the spatial information signal 105 included in the audio signal. For instance, the audio descriptor 101 is located within a separate field indicating an audio signal. In case that a header is not included in the downmix signal 103, the audio signal decoding apparatus is able to decode the downmix signal 103 using the audio descriptor 101.
- the downmix signal 103 is a signal generated from downmixing multi-channel. And, the downmix signal 103 can be generated from a downmixing unit included in an audio signal encoding apparatus or generated artificially.
- the downmix signal 103 can be categorized into a case of including a header and a case of not including a header. In case that the downmix signal 103 includes a header, the header is included in each frame by a frame unit. In case that the downmix signal 103 does not include a header, as mentioned in the foregoing description, the downmix signal 103 can be decoded using the audio descriptor 101.
- the downmix signal 103 takes either a form of including a header for each frame or a form of not including a header in a frame. And, the downmix signal 103 is included in an audio signal in a same manner until contents end.
- the spatial information signal 105 is also categorized into a case of including a header 107 and spatial information 111 and a case of including spatial information 111 only without including a header.
- the header 107 of the spatial information signal 105 differs from that of the downmix signal 103 in that it is unnecessary to be inserted in each frame identically.
- the spatial information signal 105 is able to use both a frame including a header and a frame not including a header together.
- Most of information included in the header 107 of the spatial information signal 105 is configuration information 109 that decodes spatial information 111 by- interpreting the spatial information 111.
- the spatial information 111 is configured with frames each of which includes timeslots.
- the timeslot means each time interval in case of dividing the frame by time intervals.
- the number of timeslots included in one frame is included in the configuration information 109.
- Configuration information 109 includes signal arrangement information, the number of signal converting units, channel configuration information, speaker mapping information and the like as well as the timeslot number.
- the signal arrangement information is an identifier that indicates whether an audio signal will be arranged for upmixing prior to restoring the decoded downmix signal 103 into multi-channel .
- the signal converting unit means an OTT (one-to-two) box converting one downmix signal 103 to two signals or a TTT (two-to-three) box converting two downmix signals 103 to three signals in generating multi-channel by upmixing the downmix signal 103.
- the OTT or TTT box is a conceptional box used in restoring multi-channel by being included in an upmixing unit (not shown in the drawing) of the audio signal decoding apparatus.
- information for types and number of the signal converting units is included in the spatial information signal 105.
- the channel configuration information is the information indicating a configuration of the upmixing unit included in the audio signal decoding apparatus.
- the channel configuration information includes an identifier indicating whether an audio signal passes through the signal converting unit or not.
- the audio signal decoding apparatus is able to know whether an audio signal inputted to the upmixing unit passes through the signal converting unit or not using the channel configuration information.
- the audio signal decoding apparatus upmixes the downmix signal 103 into a multi-channel audio signal using the information for the signal converting unit, the channel configuration information and the like.
- the audio signal decoding apparatus generates multi-channel by upmixing the downmix signal 103 using the signal converting unit information, the channel configuration information and the like included in the spatial information 111.
- the speaker mapping information is the information indicating that the multi-channel audio signal will be mapped to which speaker in outputting the multi-channel audio signals generated by upmixing to speakers, respectively.
- the audio signal decoding apparatus outputs the multi-channel audio signal to the corresponding speaker using the speaker mapping information included in the configuration information 109.
- the spatial information 111 is the information used to give a spatial sense in generating multi-channel audio signals by the combination with the downmix signal.
- the spatial information includes CLDs (Channel Level Differences) indicating an energy difference between audio signals, ICCs (Interchannel Correlations) indicating close correlation or similarity between audio signals, CPCs (Channel Prediction Coefficients) indicating a coefficient to predict an audio signal value using other signals and the like. And, a parameter set indicates a bundle of these parameters .
- FIG. 2 is a flowchart of a method of decoding an audio signal according to another embodiment of the present invention.
- an audio signal decoding apparatus receives a spatial information signal 105 transferred in a bitstream form by an audio signal encoding apparatus (S201) .
- the spatial information signal 105 can be transferred in a stream form separate from that of a downmix signal 103 or transferred by being included in ancillary data or extension data of the downmix signal 103.
- a demultiplexing unit (not shown in the drawing) of an audio signal decoding apparatus separates the received audio signal into an encoded downmix signal 103 and an encoded spatial information signal 105.
- the encoded spatial information 105 signal includes a header 107 and spatial information 111.
- the audio signal decoding apparatus decides whether the header 107 is included in the spatial information signal 105 (S203) .
- the audio signal decoding apparatus extracts configuration information 109 from the header 107 (S205) .
- the audio signal decoding apparatus decides whether the configuration information is extracted from a first header 107 included in the spatial information signal 105 (S207) .
- the audio signal decoding apparatus decodes the configuration information 109 (S215) and decodes the spatial information 111 transferred behind the configuration information 109 according to the decoded configuration information 109.
- the audio signal decoding apparatus decides whether the configuration information 109 extracted from the header 107 is identical to the configuration information 109 extracted from a first header 107 (S209) .
- the audio signal decoding apparatus decodes the spatial information 111 using the decoded configuration information 109 extracted from the first header 107. If the extracted configuration information 109 is not identical to the configuration information 109 extracted from the first header 107, the audio signal decoding apparatus decides whether an error occurs in the audio signal on a transfer path from the audio signal encoding apparatus to the audio signal decoding apparatus (S211) .
- the audio signal decoding apparatus updates the header 107 into a variable header 107 (S213) .
- the audio signal decoding apparatus then decodes configuration information 109 extracted from the updated header 107 (S215) .
- the audio signal decoding apparatus decodes spatial information 111 transferred behind the configuration information 109 according to the decoded configuration information 109.
- FIG. 3 is a flowchart of a method of decoding an audio signal according to another embodiment of the present invention .
- an audio signal decoding apparatus receives an audio signal including a downmix signal 103 and a spatial information signal 105 from an audio signal encoding apparatus (S301) .
- the audio signal decoding apparatus separates the received audio signal into the spatial information signal 105 and the downmix signal 103 (S303) and then sends the separated spatial information 105 and the separated downmix signal 103 to a core decoding unit (not shown in the drawing) and a spatial information decoding unit (not shown in the drawing) , respectively.
- the audio signal decoding apparatus extracts the number of timeslots and the number of parameter sets from the spatial information signal 105.
- the audio signal decoding apparatus finds a position of a timeslot to which a parameter set will be applied using the extracted numbers of the timeslots and the parameter sets.
- the position of the timeslot to which the corresponding parameter set will be applied is represented as a variable bit number.
- the bit number representing the position of the timeslot to which the corresponding parameter set will be applied it is able to efficiently represent the spatial information signal 105.
- the position of the timeslot, to which the corresponding parameter set will be applied will be explained in detail with reference to FIG. 4 and FIG. 5.
- the audio signal decoding apparatus decodes the spatial information signal 105 by applying the corresponding parameter set to the corresponding position (S305) . And, the audio signal decoding apparatus decodes the downmix signal 103 in the core decoding unit (S305) .
- the audio signal decoding apparatus is able to generate multi-channel by upmixing the decoded downmix signal 103 as it is. But the audio signal decoding apparatus is able to arrange a sequence of the decoded downmix signals 103 before the audio signal decoding apparatus upmix the corresponding signals (S307).
- the audio signal decoding apparatus generates multi- channel using the decoded downmix signal 103 and the decoded spatial information signal 105 (S309) .
- the audio signal decoding apparatus uses the spatial information signal 105 to generate the downmix signal 103 into multichannel.
- the spatial information signal 105 includes the number of signal converting units and channel configuration information for representing whether the downmix signal 103 passes through the signal converting unit in being upmixed or is outputted without passing through the signal converting unit.
- the audio signal decoding apparatus upmixes the downmix signal 103 using the number of signal converting units, the channel configuration information and the like (S309) .
- a method of representing the channel configuration information and a method of configuring the channel configuration information using the less number of bits will be explained with reference to FIG. 6 and FIG. 7 later.
- the audio signal decoding apparatus maps a multichannel audio signal to a speaker in a preset sequence to output the generated multi-channel audio signals (S311). In this case, as the mapped audio signal sequence increases, the bit number for mapping the multi-channel audio signal to the speaker becomes reduced. In particular, in case that numbers are given to multi-channel audio signals in order, since a first audio signal can be mapped to one of the entire speakers, an information quantity required for mapping an audio signal to a speaker is greater than that required for mapping a second or subsequent audio signal. As the second or subsequent audio signal is mapped to one of the rest of the speakers excluding the former speaker mapped with the former audio signal, the information quantity required for the mapping is reduced.
- FIG. 4 is syntax of position information of a timeslot to which a parameter set is applied according to one embodiment of the present invention.
- ⁇ FramingInfo' 401 to represent information for a number of parameter sets and information for a timeslot to which a parameter set is applied.
- ⁇ bsFramingType' field 403 indicates whether a frame included in the spatial information signal 105 is a fixed frame or a variable frame.
- the fixed frame means a frame in which a timeslot position to which a parameter set will be applied is previously set. In particular, a position of a timeslot to which a parameter set will be applied is decided according to a preset rule.
- the variable frame means a frame in which a timeslot position to which a parameter set will be applied is not set yet.
- variable frame further needs timeslot position information for representing a position of a timeslot to which a parameter set will be applied.
- the ⁇ bsFramingType' 403 shall be named ⁇ frame identifier' indicating whether a frame is a fixed frame or a variable frame.
- ⁇ bsParamSlot' field 407 or 411 indicates position information of a timeslot to which a parameter set will be applied.
- the ⁇ bsParamSlot [0] ' field 407 indicates a position of a timeslot to which a first parameter set will be applied
- the ⁇ bsParamSlot [ps] ' field 411 indicates a position of a timeslot to which a second or subsequent parameter set will be applied.
- the position of the timeslot to which the first parameter set will be applied is represented as an initial value, and a position of the timeslot to which the second or subsequent parameter set will be applied is represented as a difference value ⁇ bsDiffParamSlot [ps] ' 409, i.e., a difference between ⁇ bsParamSlot [ps] ' and ⁇ bsParamSlot [ps- 1] ' .
- ⁇ ps' means a parameter set.
- ⁇ ps' is able to represent value ranging from 0 to a value smaller than the number of total parameter sets.
- a timeslot position 407 or 409 to which a parameter set will be applied increases as a ps value increases (bsParamSlot [ps] > bsParamSlot [ps-1] ) .
- a maximum value of a timeslot position to which a first parameter set will be applied corresponds to a value resulting from adding 1 to a difference between a timeslot number and a parameter set number and a timeslot position is represented as an information quantity of ⁇ nBitsParamSlot (0) ' 413.
- a timeslot position to which an Nth parameter set will be applied is greater by at least 1 than a timeslot position to which an (N-I) th parameter set will be applied and is even able to have a value resulting from adding a value N to a value resulting from subtracting a parameter set number from a timeslot number.
- a timeslot position ⁇ bsParamSlot [ps] ' to which a second or subsequent parameter set will be applied is represented as a difference value ⁇ bsDiffParamSlot [ps] ' 409. And, this value is represented as an information quantity of
- the corresponding position is applicable to one of timeslots belonging to a range between 1 to maximum 8.
- the timeslot position 407 to which the first parameter set will be applied needs three bits to indicate 1 to 8 , which can be represented as ceil ⁇ Iog 2 (k-i+1) ⁇ .
- ⁇ k' is the number of timeslots
- ⁇ i' is the number of parameters .
- the timeslot position 407 to which the first parameter set will be applied is ⁇ 5'
- the timeslot position to which the second parameter set will be applied can be represented as a value resulting from adding a difference value ⁇ bsDiffParamSlot [ps] ' 409 to a value resulting from adding 1 to the timeslot position to which the first parameter set will be applied- So, the difference value 409 is able to correspond to 0 to 3, which can be represented as two bits.
- the bit number For the second or subsequent parameter set, by representing a timeslot position to which a parameter set will be applied as the difference value 409 instead of representing the timeslot position in direct, it is able to reduce the bit number.
- four bits are needed to represent one of 6 to 9 in case of representing the timeslot position in direct.
- only two bits are needed to represent a timeslot position as the difference value .
- a position information indicating quantity 'nBitsParamSlot (0) ' or ⁇ nBitsParamSlot (ps) ' 413 or 415 of a timeslot to which a parameter set will be applied can be represented not as a fixed bit number but as a variable bit number.
- FIG. 5 is a flowchart of a method of decoding a spatial information signal by applying a parameter set to a timeslot according to another embodiment of the present invention.
- an audio signal decoding apparatus receives an audio signal including a downmix signal 103 and a spatial information signal 105 (S501).
- the audio signal decoding apparatus extracts the number of timeslots included in a frame from configuration information 109 included in the header 107 (S503) . If a header 107 is not included in the spatial information signal 105, the audio signal decoding apparatus extracts the number of timeslots from the configuration information 109 included in a previously extracted header 107.
- the audio signal decoding apparatus extracts the number of parameter sets to be applied to a frame from the spatial information signal 105 (S505) .
- the audio signal decoding apparatus decides whether positions of timeslots, to which parameter sets will be applied, in a frame are fixed or variable using a frame identifier included in the spatial information signal 105 (S507) .
- the audio signal decoding apparatus decodes the spatial information signal 105 by applying the parameter set to the corresponding slot according to a preset rule (S513) .
- the audio signal decoding apparatus extracts information for a timeslot position to which a first parameter set will be applied (S509) .
- the timeslot position to which the first parameter will be applied can maximally be a value resulting from adding 1 to a difference between the timeslot number and the parameter set number.
- the audio signal decoding apparatus obtains information for a timeslot position to which a second or subsequent parameter set will be applied using the information for the timeslot position to which the first parameter set will be applied (S511) .
- a timeslot position to which a parameter set will be applied can be represented as a minimum bit number using a fact that a timeslot position to which an Nth parameter set will be applied is greater by at least 1 than a timeslot position to which an (N-I) th parameter set will be applied and even can have a value resulting from adding N to a value resulting from subtracting the parameter set number from the timeslot number .
- the audio signal decoding apparatus decodes the spatial information signal 105 by applying the parameter set to the obtained timeslot position (S513) .
- FIG. 6 and FIG. 7 are diagrams of an upmixing unit of an audio signal decoding apparatus according to one embodiment of the present invention.
- An audio signal decoding apparatus separates an audio signal received from an audio signal encoding apparatus into a downmix signal 103 and a spatial information signal 105 and then decodes the downmix signal 103 and the spatial information signal 105 respectively.
- the audio signal decoding apparatus decodes the spatial information signal 105 by applying a parameter to a timeslot. And, the audio signal decoding apparatus generates multi-channel audio signals using the decoded downmix signal 103 and the decoded spatial information signal 105.
- the audio signal decoding apparatus restores and output the original N channels.
- This configuration is called an N-M-N structure.
- the audio signal decoding apparatus is unable to restore the N channels, the downmix signal 103 is outputted into two stereo signals without considering the spatial information signal 105. Yet, this will not be further discussed.
- a structure, in which values of N and M are fixed, shall be called a fixed channel structure.
- a structure, in which values of M and N are represented as random values, shall be called a random channel structure.
- the audio signal encoding apparatus transfers an audio signal by having a channel structure included in the audio signal.
- the audio signal decoding apparatus then decodes the audio signal by reading the channel structure.
- the audio signal decoding apparatus uses an upmixing unit including a signal converting unit to restore M audio signals into N multi-channel.
- the signal converting unit is a conceptional box used to convert one downmix signal 103 to two signals or convert two downmix signals 103 to three signals in generating multi-channel by upmixing downmix signals 103.
- the audio signal decoding apparatus is able to obtain information for a structure of the upmixing unit by extracting channel configuration information from the configuration information 109 included in the spatial information signal 105.
- the channel configuration information is the information indicating a configuration of the upmixing unit included in the audio signal decoding apparatus.
- the channel configuration information includes an identifier that indicates whether an audio signal passes through the signal converting unit.
- the channel configuration information can be represented as a segmenting identifier since the numbers of input and output signals of the signal converting unit are changed in case that a decoded downmix signal passes through the signal converting unit in the upmixing unit.
- the channel configuration information can be represented as a non- segmenting identifier since an input signal of the signal converting unit is outputted intact in case that a decoded downmix signal does not pass through the signal converting unit included in the upmixing unit.
- the segmenting identifier shall be represented as ⁇ l' and the non-segmenting identifier shall be represented as ⁇ 0' .
- the channel configuration information can be represented in two ways, a horizontal method and a vertical method.
- an audio signal passes through a signal converting unit, i.e., if channel configuration information is ⁇ l' , whether a lower layer signal outputted via the signal converting unit passes through another signal converting unit is sequentially indicated by the segmenting or non-segmenting identifier. If channel configuration information is ⁇ 0' , whether a next audio signal of a same or upper layer passes through a signal converting unit is indicated by the segmenting or non-segmenting identifier.
- four audio signals Xi to X 4 enter an upmixing unit.
- Xi enters a fist signal converting unit and is then converted to two signals 601 and 603.
- the signal converting unit included in the upmixing unit converts the audio signal using spatial parameters such as CLD, ICC and the like.
- the signals 601 and 603 converted by the first signal converting unit enter a second converting unit and a third converting unit to be outputted as multichannel audio signals Yi to Y 4 .
- X 2 enters a fourth signal converting unit and is then outputted as Y 5 and Ye. And, X 3 and X 4 are directly outputted without passing through signal converting units.
- channel configuration information is represented as a segmenting identifier ⁇ l' . Since the channel configuration information is represented by the horizontal method in FIG. 6, if the channel configuration information is represented as the segmenting identifier, whether the two signals 601 and 603 outputted via the first signal converting unit pass through another signal converting units is sequentially represented as a segmenting or non-segmenting identifier.
- channel configuration information is ⁇ 0' , whether a next audio signal of a same or upper layer passes through a signal converting unit is represented as a segmenting or non-segmenting identifier. So, channel configuration information is represented for the signal X 2 of the upper layer .
- X 2 which passes through the fourth signal converting unit, is represented as a segmenting identifier 1.
- Signals through the fourth signal converting unit are directly outputted as Y 5 and Y 6 , thereby being represented as non- segmenting identifiers 0, respectively.
- X 3 and X 4 which are directly outputted without passing through signal converting units, are represented as non-segmenting identifiers 0, respectively.
- the channel configuration information is represented as 110010010000 by the horizontal method.
- the channel configuration information is extracted through the configuration of the upmixing unit for convenience of understanding.
- the audio signal decoding apparatus reads the channel configuration information to obtain the information for the structure of the upmixing unit in a reverse way.
- FIG. I 1 like FIG. 6, four audio signals Xi to X 4 enter an upmixing unit. Since channel configuration information is represented as a segmenting or non-segmenting identifier from an upper layer to a lower layer by the vertical method, identifiers of audio signals of a first layer 701 as a most upper layer are represented in sequence.
- An audio signal decoding apparatus reads the channel configuration information and then configures an upmixing unit.
- an identifier indicating that whether the channel configuration is represented by the horizontal method or the vertical method should be included in an audio signal.
- channel configuration information is basically represented by the horizontal method.
- an audio signal encoding apparatus may enable an identifier indicating that channel configuration is represented by the vertical method to be included in an audio signal.
- An audio signal decoding apparatus reads channel configuration information represented by the horizontal method and is then able to configure an upmixing unit. Yet, in case of channel configuration information is represented by the vertical method, an audio signal decoding apparatus is able to configure an upmixing unit only if knowing the number of signal converting units included in the upmixing unit or the numbers of input and output channels. So, an audio signal decoding apparatus is able to configure an upmixing unit in a manner of extracting the number of signal converting units or the numbers of input and output channels from the configuration information 109 included in the spatial information signal 105. An audio signal decoding apparatus interprets channel configuration information in sequence from a front.
- the audio signal decoding apparatus needs not to further read the channel configuration information. This is because the number of segmenting identifiers 1 included in the channel configuration information is equal to the number of signal converting units included in the upmixing unit as the segmenting identifier 1 indicates that an audio signal is inputted to the signal converting unit.
- channel configuration information represented by the vertical method is 110011000000
- an audio signal decoding apparatus needs to read total 12 bits in order to decode the channel configuration information. Yet, if the audio signal decoding apparatus detects that the number of signal converting units is 4, the audio signal decoding apparatus decodes the channel configuration information until the number of Is included in the channel configuration information appears four times. Namely, the audio signal decoding apparatus decodes the channel configuration information up to 110011 only. This is because the rest of values are represented as non-segmenting identifiers 0 despite not using the channel configuration information further. Hence, as it is unnecessary for the audio signal decoding apparatus to decode six bits, decoding efficiency can be enhanced.
- the number of output channels becomes a value resulting from adding the number of OTT or TTT boxes to the input signal.
- the number of the signal converting units becomes a value resulting from subtracting the number of input signals and the number of TTT boxes from the number of output channels. Since it is able to use maximum 32 output channels in general, information for indicating signal converting units can be represented as a value within five bits.
- an audio signal encoding apparatus separately should represent the number of signal converting units as maximum five bits in the spatial information signal 105.
- 6-bit channel configuration information and 5-bit information for indicating signal converting units are needed. Namely, total eleven bits are required. This indicates that a bit quantity required for configuring an upmixing unit is reduced rather than the channel configuration information represented by the horizontal method. Therefore, if channel configuration information is represented by the vertical method, the bit number can be reduced.
- FIG. 8 is a block diagram of an audio signal decoding apparatus according to one embodiment of the present invention.
- an audio signal decoding apparatus includes a receiving unit, a demultiplexing unit, a core decoding unit, a spatial information decoding unit, a signal arranging unit, a multi-channel generating unit and a speaker mapping unit.
- the receiving unit 801 receives an audio signal including a downmix signal 103 and a spatial information signal 105.
- the demultiplexing unit 803 parses the audio signal received by the receiving unit 801 into an encoded downmix signal 103 and an encoded spatial information signal 105 and then sends the encoded downmix signal 103 and the encoded spatial information signal to the core decoding unit 805 and the spatial information decoding unit 807, respectively.
- the coder decoding unit 805 and the spatial information decoding unit 807 decode the encoded downmix signal and the encoded spatial information signal, respectively.
- the spatial information decoding unit 807 decodes the spatial information signal 105 by extracting a frame identifier, a timeslot number, a parameter set number, timeslot position information and the like from the spatial information signal 105 and by applying a parameter set to a corresponding timeslot.
- the audio signal decoding apparatus is able to include the signal arranging unit 809.
- the signal arranging unit 809 arranges a plurality of downmix signals according to a preset arrangement to upmix the decoded downmix signal 103.
- the signal arranging unit 809 arranges M downmix signals into M' audio signals in an N-M-N channel configuration .
- the audio signal decoding apparatus directly can upmix downmix signals according to a seguence that the downmix signals have passed through the core decoding unit 805. Yet, in some cases, the audio signal decoding apparatus may perform upmixing after the audio signal decoding apparatus arranges a sequence of downmix signals.
- signal arrangement can be performed on signals entering a signal converting unit that upmixes two downmix signals into three signals.
- signal arrangement information indicating the corresponding case should be included in the audio signal by the audio signal encoding apparatus.
- the signal arrangement information is an identifier indicating whether signal sequences will be arranged for upmixing prior to restoring an audio signal into multi-channel, whether arrangement will be performed on a specific signal only, or the like. If a header 107 is included in the spatial information signal 105, the audio signal decoding apparatus arranges downmix signals using the audio signal arrangement information included in configuration information 109 extracted from the header 107.
- the audio signal decoding apparatus is able to arrange audio signals using the audio signal arrangement information extracted from configuration information 109 included in a previous header 107.
- the audio signal decoding apparatus may not perform the downmix signal arrangement.
- the audio signal decoding apparatus is able to generate multi-channel by directly upmixing the signal decoded and transferred to the multi-channel generating unit 811 by the core decoding unit 805 instead of performing downmix signal arrangement. This is because a desired purpose of the signal arrangement can be achieved by mapping the generated multi-channel to speakers. In this case, it is able to compress and transfer an audio signal more efficiently by not inserting information for the downmix signal arrangement in the audio signal.
- the signal arranging unit 809 sends the arranged downmix signal to the multi-channel generating unit 811.
- the spatial information decoding unit 809 sends the decoded spatial information signal 105 to the multi-channel generating unit 811 as well.
- the multi-channel generating unit 811 generates a multi-channel audio signal using the downmix signal 103 and the spatial information signal 105.
- the audio signal decoding apparatus includes the speaker mapping unit 813 to output an audio signal through the multi-channel generating unit 811 to a speaker.
- the speaker mapping unit 813 decides that the multichannel audio signal will be outputted by being mapped to which speaker. And, types of speakers used to output audio signals in general are shown in Table 1 as follows. [Table 1 ]
- the speaker mapping unit 813 enables the audio signal to be mapped to the speaker (Loudspeaker) corresponding to each number in a manner of giving a specific one of numbers (bsOutputCahnnelPos) between 0 and 31 to the multi-channel audio signal.
- the speaker mapping unit 813 since one of total 32 speakers should be selected to map a first audio signal among multi-channel audio signals outputted from the multi-channel generating unit 811 to a speaker, 5 bits are needed. Since one of the remaining 31 speakers should be selected to map a second audio signal to a speaker, 5 bits are needed as well.
- a header can be selectively included in a spatial information signal.
- a transferred data quantity can be reduced in a manner of representing a position of a timeslot to which a parameter set will be applied as a variable bit number.
- audio signal compression and transfer efficiencies can be raised in a manner of representing an information quantity required for performing downmix signal arrangement or for mapping multi-channel to a speaker as a minimum variable bit number.
- an audio signal can be more efficiently compressed and transferred and complexity of an audio signal decoding apparatus can be reduced, in a manner of upmixing signals decoded and transferred to a multi-channel generating unit by a core decoding unit in a sequence without performing downmix signal arrangement.
- FIG. 1 is a configurational diagram of an audio signal according to one embodiment of the present invention.
- FIG. 2 is a flowchart of a method of decoding an audio signal according to another embodiment of the present invention.
- FIG. 3 is a flowchart of a method of decoding an audio signal according to another embodiment of the present invention.
- FIG. 4 is syntax of position information of a timeslot to which a parameter set is applied according to one embodiment of the present invention.
- FIG. 5 is a flowchart of a method of decoding a spatial information signal by applying a parameter set to a timeslot according to another embodiment of the present invention.
- FIG. 6 and FIG. 7 are diagrams of an upmixing unit of an audio signal decoding apparatus according to one embodiment of the present invention.
- FIG. 8 is a block diagram of an audio signal decoding apparatus according to one embodiment of the present invention.
- a method of decoding an audio signal including receiving an audio signal including a spatial information signal and a downmix signal, obtaining position information of a timeslot using a timeslot number and a parameter number included in the audio signal, generating a multi-channel audio signal by applying the spatial information signal to the downmix signal according to the position information of the timeslot, and arranging multi-channel audio signal correspondingly to an output channel.
- the position information of the timeslot may be represented as a variable bit number.
- the position information may include an initial value and a difference value, wherein the initial value indicates the position information of the timeslot to which a first parameter is applied and wherein the difference value indicates the position information of the timeslot to which a second or subsequent parameter is applied.
- the initial value may be represented as a variable bit number decided using at least one of the timeslot number and the parameter number.
- the difference value may be represented as a variable bit number decided using at least one of the timeslot number, the parameter number and the position information of the timeslot to which a previous parameter is applied.
- the method may further include arranging downmix signal for the downmix signal according to a preset method. And arranging the downmix signal may be performed on the downmix signal entering a signal converting unit upmixing two downmix signals into three signals.
- the downmix signal arrangement may be to arrange the downmix signal using audio signal arrangement information included in configuration information extracted from the header.
- information quantity required for mapping an ith audio signal or for arranging an ith downmix signal may be an minimum integer equal to or greater than Iog 2 [(the number of total audio signals or the number of total downmix signals) -(a value of the ⁇ i')+1].
- the arranging of the multi-channel audio signal may further include arranging the audio signal correspondingly to a speaker.
- an apparatus for decoding an audio signal including an upmixing unit upmixing an audio signal into a multi-channel audio signal and a multi-channel arranging unit mapping the multi-channel audio signal to output channels according to a preset arrangement.
- an apparatus for decoding an audio signal including a core decoding unit decoding an encoded downmix signal, an arranging unit arranging the decoded audio signal according to a preset arrangement, and an upmixing unit upmixing the arranged audio signal into a multi ⁇ channel audio signal.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (10)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2008528949A JP4568363B2 (en) | 2005-08-30 | 2006-08-30 | Audio signal decoding method and apparatus |
AU2006285544A AU2006285544B2 (en) | 2005-08-30 | 2006-08-30 | A method for decoding an audio signal |
CA2620030A CA2620030C (en) | 2005-08-30 | 2006-08-30 | Method and apparatus for decoding an audio signal |
US12/065,269 US7788107B2 (en) | 2005-08-30 | 2006-08-30 | Method for decoding an audio signal |
KR1020087021436A KR101169280B1 (en) | 2005-08-30 | 2006-08-30 | Method and apparatus for decoding an audio signal |
MX2008002760A MX2008002760A (en) | 2005-08-30 | 2006-08-30 | A method for decoding an audio signal. |
BRPI0615317-8A BRPI0615317A2 (en) | 2005-08-30 | 2006-08-30 | method for decoding an audio signal |
CN2006800316239A CN101253552B (en) | 2005-08-30 | 2006-08-30 | Method for decoding an audio signal |
EP06798588A EP1932147A4 (en) | 2005-08-30 | 2006-08-30 | A method for decoding an audio signal |
HK09101884.8A HK1124682A1 (en) | 2005-08-30 | 2009-02-27 | A method and apparatus for decoding an audio signal |
Applications Claiming Priority (26)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US71211905P | 2005-08-30 | 2005-08-30 | |
US60/712,119 | 2005-08-30 | ||
US71920205P | 2005-09-22 | 2005-09-22 | |
US60/719,202 | 2005-09-22 | ||
US72300705P | 2005-10-04 | 2005-10-04 | |
US60/723,007 | 2005-10-04 | ||
US72622805P | 2005-10-14 | 2005-10-14 | |
US60/726,228 | 2005-10-14 | ||
US72922505P | 2005-10-24 | 2005-10-24 | |
US60/729,225 | 2005-10-24 | ||
US73562805P | 2005-11-12 | 2005-11-12 | |
US60/735,628 | 2005-11-12 | ||
US74860705P | 2005-12-09 | 2005-12-09 | |
US60/748,607 | 2005-12-09 | ||
KR10-2006-0004065 | 2006-01-13 | ||
KR20060004065 | 2006-01-13 | ||
KR10-2006-0004056 | 2006-01-13 | ||
KR20060004056 | 2006-01-13 | ||
KR20060004055 | 2006-01-13 | ||
KR10-2006-0004055 | 2006-01-13 | ||
US76253606P | 2006-01-27 | 2006-01-27 | |
US60/762,536 | 2006-01-27 | ||
US80382506P | 2006-06-02 | 2006-06-02 | |
US60/803,825 | 2006-06-02 | ||
KR1020060056480A KR20070003574A (en) | 2005-06-30 | 2006-06-22 | Method and apparatus for encoding and decoding an audio signal |
KR10-2006-0056480 | 2006-06-22 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2007027056A1 true WO2007027056A1 (en) | 2007-03-08 |
Family
ID=37809094
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2006/003436 WO2007027057A1 (en) | 2005-08-30 | 2006-08-30 | A method for decoding an audio signal |
PCT/KR2006/003434 WO2007027055A1 (en) | 2005-08-30 | 2006-08-30 | A method for decoding an audio signal |
PCT/KR2006/003435 WO2007027056A1 (en) | 2005-08-30 | 2006-08-30 | A method for decoding an audio signal |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2006/003436 WO2007027057A1 (en) | 2005-08-30 | 2006-08-30 | A method for decoding an audio signal |
PCT/KR2006/003434 WO2007027055A1 (en) | 2005-08-30 | 2006-08-30 | A method for decoding an audio signal |
Country Status (5)
Country | Link |
---|---|
EP (3) | EP1932147A4 (en) |
KR (1) | KR100830472B1 (en) |
AU (1) | AU2006285544B2 (en) |
CA (1) | CA2620030C (en) |
WO (3) | WO2007027057A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112951250A (en) * | 2014-09-12 | 2021-06-11 | 索尼公司 | Transmission device, transmission method, reception device, and reception method |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8615088B2 (en) | 2008-01-23 | 2013-12-24 | Lg Electronics Inc. | Method and an apparatus for processing an audio signal using preset matrix for controlling gain or panning |
WO2009093867A2 (en) | 2008-01-23 | 2009-07-30 | Lg Electronics Inc. | A method and an apparatus for processing audio signal |
KR101024924B1 (en) * | 2008-01-23 | 2011-03-31 | 엘지전자 주식회사 | A method and an apparatus for processing an audio signal |
KR20140046980A (en) | 2012-10-11 | 2014-04-21 | 한국전자통신연구원 | Apparatus and method for generating audio data, apparatus and method for playing audio data |
WO2014058275A1 (en) * | 2012-10-11 | 2014-04-17 | 한국전자통신연구원 | Device and method for generating audio data, and device and method for playing audio data |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH09275544A (en) * | 1996-02-07 | 1997-10-21 | Matsushita Electric Ind Co Ltd | Decoder and decoding method |
JP2001188578A (en) * | 1998-11-16 | 2001-07-10 | Victor Co Of Japan Ltd | Voice coding method and voice decoding method |
US6339760B1 (en) * | 1998-04-28 | 2002-01-15 | Hitachi, Ltd. | Method and system for synchronization of decoded audio and video by adding dummy data to compressed audio data |
US6631352B1 (en) * | 1999-01-08 | 2003-10-07 | Matushita Electric Industrial Co. Ltd. | Decoding circuit and reproduction apparatus which mutes audio after header parameter changes |
US20040199276A1 (en) * | 2003-04-03 | 2004-10-07 | Wai-Leong Poon | Method and apparatus for audio synchronization |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
NL9000338A (en) * | 1989-06-02 | 1991-01-02 | Koninkl Philips Electronics Nv | DIGITAL TRANSMISSION SYSTEM, TRANSMITTER AND RECEIVER FOR USE IN THE TRANSMISSION SYSTEM AND RECORD CARRIED OUT WITH THE TRANSMITTER IN THE FORM OF A RECORDING DEVICE. |
EP0827312A3 (en) * | 1996-08-22 | 2003-10-01 | Marconi Communications GmbH | Method for changing the configuration of data packets |
JPH11330980A (en) * | 1998-05-13 | 1999-11-30 | Matsushita Electric Ind Co Ltd | Decoding device and method and recording medium recording decoding procedure |
MY149792A (en) | 1999-04-07 | 2013-10-14 | Dolby Lab Licensing Corp | Matrix improvements to lossless encoding and decoding |
BR0304540A (en) | 2002-04-22 | 2004-07-20 | Koninkl Philips Electronics Nv | Methods for encoding an audio signal, and for decoding an encoded audio signal, encoder for encoding an audio signal, apparatus for providing an audio signal, encoded audio signal, storage medium, and decoder for decoding an audio signal. encoded audio |
WO2004008806A1 (en) * | 2002-07-16 | 2004-01-22 | Koninklijke Philips Electronics N.V. | Audio coding |
JP4084990B2 (en) * | 2002-11-19 | 2008-04-30 | 株式会社ケンウッド | Encoding device, decoding device, encoding method and decoding method |
JP2005352396A (en) | 2004-06-14 | 2005-12-22 | Matsushita Electric Ind Co Ltd | Sound signal encoding device and sound signal decoding device |
KR100663729B1 (en) * | 2004-07-09 | 2007-01-02 | 한국전자통신연구원 | Method and apparatus for encoding and decoding multi-channel audio signal using virtual source location information |
US8204261B2 (en) * | 2004-10-20 | 2012-06-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Diffuse sound shaping for BCC schemes and the like |
KR100682904B1 (en) * | 2004-12-01 | 2007-02-15 | 삼성전자주식회사 | Apparatus and method for processing multichannel audio signal using space information |
-
2006
- 2006-08-30 AU AU2006285544A patent/AU2006285544B2/en not_active Ceased
- 2006-08-30 EP EP06798588A patent/EP1932147A4/en not_active Withdrawn
- 2006-08-30 WO PCT/KR2006/003436 patent/WO2007027057A1/en active Application Filing
- 2006-08-30 EP EP06798587A patent/EP1922721A4/en not_active Withdrawn
- 2006-08-30 WO PCT/KR2006/003434 patent/WO2007027055A1/en active Application Filing
- 2006-08-30 WO PCT/KR2006/003435 patent/WO2007027056A1/en active Application Filing
- 2006-08-30 EP EP06798589A patent/EP1922722A4/en not_active Withdrawn
- 2006-08-30 KR KR1020060083010A patent/KR100830472B1/en not_active IP Right Cessation
- 2006-08-30 CA CA2620030A patent/CA2620030C/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH09275544A (en) * | 1996-02-07 | 1997-10-21 | Matsushita Electric Ind Co Ltd | Decoder and decoding method |
US6339760B1 (en) * | 1998-04-28 | 2002-01-15 | Hitachi, Ltd. | Method and system for synchronization of decoded audio and video by adding dummy data to compressed audio data |
JP2001188578A (en) * | 1998-11-16 | 2001-07-10 | Victor Co Of Japan Ltd | Voice coding method and voice decoding method |
US6631352B1 (en) * | 1999-01-08 | 2003-10-07 | Matushita Electric Industrial Co. Ltd. | Decoding circuit and reproduction apparatus which mutes audio after header parameter changes |
US20040199276A1 (en) * | 2003-04-03 | 2004-10-07 | Wai-Leong Poon | Method and apparatus for audio synchronization |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112951250A (en) * | 2014-09-12 | 2021-06-11 | 索尼公司 | Transmission device, transmission method, reception device, and reception method |
Also Published As
Publication number | Publication date |
---|---|
WO2007027057A1 (en) | 2007-03-08 |
EP1932147A4 (en) | 2011-03-30 |
KR20070061280A (en) | 2007-06-13 |
CA2620030C (en) | 2011-08-23 |
EP1922722A1 (en) | 2008-05-21 |
EP1922721A4 (en) | 2011-04-13 |
EP1922722A4 (en) | 2011-03-30 |
KR100830472B1 (en) | 2008-05-20 |
AU2006285544A1 (en) | 2007-03-08 |
WO2007027055A1 (en) | 2007-03-08 |
EP1932147A1 (en) | 2008-06-18 |
CA2620030A1 (en) | 2007-03-08 |
AU2006285544B2 (en) | 2012-01-12 |
EP1922721A1 (en) | 2008-05-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7987097B2 (en) | Method for decoding an audio signal | |
US8577483B2 (en) | Method for decoding an audio signal | |
JP5006315B2 (en) | Audio signal encoding and decoding method and apparatus | |
JP4601669B2 (en) | Apparatus and method for generating a multi-channel signal or parameter data set | |
CN101253554B (en) | Method and device for decoding an audio signal | |
EP1987594A1 (en) | Method and apparatus for processing an audio signal | |
US7788107B2 (en) | Method for decoding an audio signal | |
CA2620030C (en) | Method and apparatus for decoding an audio signal | |
RU2383942C2 (en) | Method and device for audio signal decoding | |
KR20070108314A (en) | Method and apparatus for encoding/decoding an audio signal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200680031623.9 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2006285544 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 468/KOLNP/2008 Country of ref document: IN |
|
ENP | Entry into the national phase |
Ref document number: 2620030 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: MX/a/2008/002760 Country of ref document: MX |
|
ENP | Entry into the national phase |
Ref document number: 2006285544 Country of ref document: AU Date of ref document: 20060830 Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2008528949 Country of ref document: JP Ref document number: 12065269 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006798588 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2008112174 Country of ref document: RU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020087021436 Country of ref document: KR |
|
ENP | Entry into the national phase |
Ref document number: PI0615317 Country of ref document: BR Kind code of ref document: A2 Effective date: 20080229 |