CN101243491A - Method and apparatus for encoding and decoding an audio signal - Google Patents
Method and apparatus for encoding and decoding an audio signal
- Publication number
- CN101243491A (application CNA2006800296076A, CN200680029607A)
- Authority
- CN
- China
- Prior art keywords
- audio signal
- channel
- gain
- energy level
- signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
Abstract
Disclosed is a method and/or apparatus for encoding and/or decoding an audio signal, in which an encoding apparatus applies a downmix gain to a downmix signal and transmits to a decoding apparatus a bitstream containing information about the applied downmix gain. The decoding apparatus recovers the downmix signal using the downmix gain information. Also disclosed is a method and/or apparatus for encoding and/or decoding an audio signal in which the encoding apparatus can apply an arbitrary downmix gain (ADG) to the downmix signal and transmit a bitstream containing information about the applied ADG to the decoding apparatus. The decoding apparatus recovers the downmix signal using the ADG information. Also disclosed is a method and/or apparatus for encoding and/or decoding an audio signal that can vary the energy level of a specific channel and recover the varied energy level.
Description
Technical field
The present invention relates to a method and/or apparatus for encoding and/or decoding an audio signal.
Background of invention
The present invention relates to encoding and/or decoding the spatial information of a multi-channel audio signal. Recently, various coding techniques and methods for digital audio signals have been developed, and various related products have been produced.
However, when a multi-channel audio signal is downmixed into a mono or stereo audio signal, the audio signal may suffer level loss. In particular, the level loss persists even after the signal has been encoded by a core codec, because the encoded signal has a limited size, for example 16 bits. This level loss affects the output characteristics of the audio signal and degrades sound quality.
Summary of the invention
One object of the present invention, devised to solve the above problem, is to solve the level loss problem of a multi-channel audio signal by applying a downmix gain to the multi-channel audio signal.
Another object of the present invention is to solve the level loss problem of a multi-channel audio signal by applying an arbitrary downmix gain to the downmix signal of the multi-channel audio signal.
Another object of the present invention is to solve the level loss problem of a multi-channel audio signal by applying a specific channel gain to a specific channel of the multi-channel audio signal.
Another object of the present invention is to solve the level loss problem of a multi-channel audio signal by using at least two of the downmix gain, the arbitrary downmix gain, and the specific channel gain.
To achieve these and other advantages and in accordance with the purpose of the present invention, a method of decoding an audio signal according to the present invention comprises: separating a downmix signal and a spatial information signal from a bitstream of the audio signal; converting the downmix signal into a multi-channel audio signal using the spatial information signal; and applying a specific channel gain to a specific channel of the multi-channel audio signal that exhibits an energy level variation, thereby modifying the energy level of that channel.
To further achieve these and other advantages and in accordance with the purpose of the present invention, a method of decoding an audio signal according to the present invention comprises: separating a downmix signal from a bitstream of the audio signal; and applying a specific channel gain to a specific channel of the downmix signal that exhibits an energy level variation, thereby modifying the energy level of that channel.
To further achieve these and other advantages and in accordance with the purpose of the present invention, a method of encoding an audio signal according to the present invention comprises: applying a specific channel gain to a specific channel of a multi-channel audio signal; generating a downmix signal and a spatial information signal from the multi-channel audio signal to which the specific channel gain has been applied; and generating a bitstream that includes the downmix signal and the spatial information signal.
To further achieve these and other advantages and in accordance with the purpose of the present invention, a method of encoding an audio signal according to the present invention comprises: generating a downmix signal and a spatial information signal from a multi-channel audio signal; applying a specific channel gain to a specific channel of the downmix signal; and generating a bitstream that includes the spatial information signal and the downmix signal to which the specific channel gain has been applied.
To further achieve these and other advantages and in accordance with the purpose of the present invention, a data structure according to the present invention comprises: a downmix signal of a multi-channel audio signal; and information about a specific channel gain applied to a specific channel of the multi-channel audio signal.
To further achieve these and other advantages and in accordance with the purpose of the present invention, an apparatus for decoding an audio signal according to the present invention comprises: a demultiplexer that separates a downmix signal and a spatial information signal from a bitstream of the audio signal; a multi-channel generation unit that converts the downmix signal into a multi-channel audio signal using the spatial information signal; and a specific channel level processing unit that applies a specific channel gain to a specific channel of the multi-channel audio signal that exhibits an energy level variation, thereby modifying the energy level of that specific channel.
To further achieve these and other advantages and in accordance with the purpose of the present invention, an apparatus for encoding an audio signal according to the present invention comprises: a channel level processing unit that applies a specific channel gain to a specific channel of a multi-channel audio signal; a downmix unit that generates a downmix signal from the multi-channel audio signal to which the specific channel gain has been applied; and a spatial information generation unit that extracts spatial information from the multi-channel audio signal.
Brief Description Of Drawings
The accompanying drawings, which are included to provide a further understanding of the invention, illustrate embodiments of the invention and together with the description serve to explain the principle of the invention. In the drawings:
Fig. 1 is a schematic diagram illustrating how a human perceives the spatial information contained in an audio signal;
Fig. 2 is a waveform diagram illustrating the level loss of an audio signal that occurs while the audio signal is encoded;
Fig. 3 is a block diagram of a first encoding apparatus according to an embodiment of the present invention, in which a downmix gain is applied to a downmix signal to correct the downmix signal;
Fig. 4 is a block diagram of a first decoding apparatus according to an embodiment of the present invention, in which a downmix gain is applied to a downmix signal to correct the downmix signal;
Fig. 5 is a block diagram of a second encoding apparatus according to an embodiment of the present invention, in which a downmix gain is applied to a multi-channel audio signal to correct the multi-channel audio signal;
Fig. 6 is a block diagram of a second decoding apparatus according to an embodiment of the present invention, in which a downmix gain is applied to a multi-channel audio signal to correct the multi-channel audio signal;
Fig. 7 is a block diagram of a third encoding apparatus according to an embodiment of the present invention, in which a downmix gain is applied to a downmix signal to correct the downmix signal;
Fig. 8 is a block diagram of a third decoding apparatus according to an embodiment of the present invention, in which a downmix gain is applied to a downmix signal to correct the downmix signal;
Fig. 9 is a diagram of bitstreams, each containing downmix gain information, according to embodiments of the present invention;
Figs. 10A and 10B are tables showing various forms of the downmix gain according to an embodiment of the present invention;
Fig. 11 is a graph illustrating a method according to the present invention for preventing the sound quality deterioration around frames that is caused by applying the downmix gain;
Fig. 12 is a flowchart of an audio signal encoding method according to an embodiment of the present invention that applies a downmix gain to a downmix signal;
Fig. 13 is a flowchart of an audio signal decoding method according to an embodiment of the present invention that applies a downmix gain to a downmix signal;
Fig. 14 is a block diagram of an encoding apparatus according to an embodiment of the present invention, in which an arbitrary downmix gain (ADG) is applied to a downmix signal to correct the downmix signal;
Fig. 15 is a block diagram of a decoding apparatus according to an embodiment of the present invention, in which an ADG is applied to a downmix signal to correct the downmix signal;
Fig. 16 is a block diagram of an encoding apparatus according to an embodiment of the present invention, in which a downmix gain and an ADG are applied to a downmix signal to correct the downmix signal;
Fig. 17 is a block diagram of a decoding apparatus according to an embodiment of the present invention, in which a downmix gain and an ADG are applied to a downmix signal to correct the downmix signal;
Fig. 18 is a table showing a plurality of frequency bands to which the ADG is applied according to an embodiment of the present invention;
Fig. 19 is a flowchart of an audio signal encoding method according to an embodiment of the present invention, in which an ADG is applied to a downmix signal to correct the downmix signal;
Fig. 20 is a flowchart of an audio signal decoding method according to an embodiment of the present invention, in which an ADG is applied to a downmix signal to correct the downmix signal;
Fig. 21 is a block diagram of an encoding apparatus according to an embodiment of the present invention that modifies the level of a specific channel;
Fig. 22 is a block diagram of a decoding apparatus according to an embodiment of the present invention that modifies the level of a specific channel; and
Fig. 23 is a block diagram of a decoding apparatus according to an embodiment of the present invention that modifies the level of a specific channel.
Detailed description of embodiments
Reference will now be made in detail to preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings.
Fig. 1 illustrates how a human perceives the spatial information in an audio signal.
Coding of a multi-channel audio signal exploits the fact that, because humans perceive audio signals three-dimensionally, the three-dimensional spatial information of an audio signal can be expressed by a plurality of parameter sets.
The "spatial parameters" that represent the spatial information of a multi-channel audio signal include the channel level difference (CLD), the inter-channel coherence (ICC), and the channel time difference (CTD). CLD represents the energy difference between two channels. ICC represents the correlation between two channels. CTD represents the time difference between two channels.
Fig. 1 shows how a human perceives an audio signal spatially and how the concept of spatial parameters arises.
Referring to Fig. 1, a direct sound wave 103 from a remote sound source 101 reaches the human left ear 107, while another direct sound wave 102 reaches the human right ear 106 after being diffracted around the head.
The two sound waves 102 and 103 differ in arrival time and energy level. These differences give rise to the CTD and CLD parameters described above.
On the other hand, if reflected sound waves 104 and 105 reach both ears, or if the sound source 101 comprises a plurality of dispersed sound sources, sound waves with little correlation reach the two ears. This gives rise to the ICC parameter described above.
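The three spatial parameters can be illustrated with a minimal Python sketch. The formulas below (energy ratio in dB for CLD, peak normalized cross-correlation for ICC, and the lag of that peak for CTD) are common textbook definitions assumed here for illustration; the text does not specify how the parameters are computed, and all function names are hypothetical.

```python
import math

def channel_level_difference(left, right):
    """CLD in dB, assumed here to be the energy ratio of the two channels."""
    e_left = sum(s * s for s in left)
    e_right = sum(s * s for s in right)
    return 10.0 * math.log10(e_left / e_right)

def cross_correlation(left, right, lag):
    """Normalized correlation between left[n] and right[n + lag]."""
    num = 0.0
    for n, s in enumerate(left):
        m = n + lag
        if 0 <= m < len(right):
            num += s * right[m]
    e_left = sum(s * s for s in left)
    e_right = sum(s * s for s in right)
    return num / math.sqrt(e_left * e_right)

def inter_channel_coherence(left, right, max_lag):
    """ICC, assumed here to be the peak absolute normalized correlation."""
    lags = range(-max_lag, max_lag + 1)
    return max(abs(cross_correlation(left, right, lag)) for lag in lags)

def channel_time_difference(left, right, max_lag):
    """CTD in samples: the lag with the highest correlation.
    A positive value means the right channel lags the left channel."""
    lags = range(-max_lag, max_lag + 1)
    return max(lags, key=lambda lag: cross_correlation(left, right, lag))

# Hypothetical test signal: the right channel is the left channel
# at half amplitude, delayed by three samples.
left = [0.0] * 4 + [1.0, 2.0, -1.0, 0.5] + [0.0] * 4
right = [0.0] * 3 + [0.5 * s for s in left[:-3]]

print(channel_level_difference(left, right))    # about 6.02 dB
print(channel_time_difference(left, right, 5))  # 3 samples
print(inter_channel_coherence(left, right, 5))  # 1.0 (fully coherent)
```

In this toy case the two channels are scaled, delayed copies of each other, so the coherence is at its maximum; reflected or dispersed sources, as described above, would lower it.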
Using spatial parameters created according to the above principle, a multi-channel audio signal can be transmitted in the form of a mono or stereo signal, and the transmitted mono or stereo signal can be output in the form of a multi-channel audio signal.
The present invention provides a method of correcting the downmix signal when the downmix signal is converted into a multi-channel audio signal using the above spatial information.
Fig. 2 illustrates the level loss of an audio signal that occurs during encoding. The level loss arises mainly from two factors. First, level loss can occur when the level of the original signal is very high. Second, level loss can occur when the number of input channels to be downmixed is large. For example, level loss occurs more often when 7 channels are downmixed to one channel than when 3 channels are downmixed to one channel. The level loss in Fig. 2 corresponds to the case in which 5 channels are downmixed to one channel. However, the present invention is not limited to this case. Level loss can also arise from various other factors, such as clipping.
Fig. 2(a) shows the level of an original signal consisting of 5 channels. Each channel of the original signal can use almost the full range of the limited size (for example, 16 bits). Fig. 2(b) shows the downmix signal generated by downmixing the 5 channels. As shown in Fig. 2(b), the downmix signal may have a plurality of peaks that exceed the limited size. Fig. 2(c) shows the audio signal produced after the downmix signal is encoded and decoded by a core codec (for example, an AAC codec). Even this signal still exhibits level loss, because it is expressed in the limited size (for example, 16 bits). This level loss affects the output characteristics of the multi-channel audio signal and degrades sound quality.
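The effect shown in Fig. 2 can be reproduced with a small Python sketch. It assumes a plain weighted sum as the downmix and hard clipping to the signed 16-bit range as the size limit; the sample values, unit weights, and function names are illustrative assumptions and are not taken from the figure.

```python
INT16_MAX = 32767
INT16_MIN = -32768

def downmix(channels, weights):
    """Weighted sum of equally long channels (assumed downmix rule)."""
    length = len(channels[0])
    return [sum(w * ch[n] for w, ch in zip(weights, channels))
            for n in range(length)]

def count_clipped(samples):
    """Number of samples that do not fit in signed 16 bits."""
    return sum(1 for s in samples if s > INT16_MAX or s < INT16_MIN)

def clip_to_int16(samples):
    """Hard-limit samples to the signed 16-bit range."""
    return [max(INT16_MIN, min(INT16_MAX, int(round(s)))) for s in samples]

# Five hypothetical channels, each well inside the 16-bit range.
channels = [[10000, 20000, -30000, 5000] for _ in range(5)]
mix = downmix(channels, [1.0] * 5)

print(mix)                 # [50000.0, 100000.0, -150000.0, 25000.0]
print(count_clipped(mix))  # 3 of the 4 samples exceed the 16-bit range
print(clip_to_int16(mix))  # [32767, 32767, -32768, 25000]
```

Each input channel fits in 16 bits, yet three of the four downmixed samples are limited, which is the level loss the text describes for Fig. 2(b) and 2(c).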
Fig. 3 illustrates a first encoding apparatus according to an embodiment of the present invention, which applies a downmix gain to a downmix signal to correct the downmix signal. The first encoding apparatus includes a downmix unit 302, a spatial information generation unit 303, a downmix gain application unit 306, and a multiplexer 308.
Referring to Fig. 3, the downmix unit 302 downmixes a multi-channel audio signal 301, thereby generating a downmix signal 304. In Fig. 3, "n" denotes the number of input channels. The downmix signal 304 may be a mono, stereo, or multi-channel audio signal.
The spatial information generation unit 303 extracts spatial information from the multi-channel audio signal 301. Here, spatial information means information about the audio channels that is used when a downmix signal, generated by downmixing a multi-channel audio signal, is upmixed to a multi-channel audio signal.
The downmix gain application unit 306 applies a downmix gain to the downmix signal 304 to reduce the level of the downmix signal 304. Here, the downmix gain means a value that is applied to (for example, multiplied by) a downmix signal or a multi-channel audio signal to change the level of that signal. In the encoding apparatus, the downmix gain is applied to the downmix signal mainly to reduce its level. For example, when a downmix gain greater than 1 is used, the downmix signal is multiplied by the reciprocal of the downmix gain to reduce its overall level.
A specific channel gain, for example a low-frequency effects (LFE) gain or a surround gain, can be applied to at least one channel of the multi-channel audio signal 301. The downmix unit 302 can generate the downmix signal 304 from the multi-channel audio signal 301 after the specific channel gain has been applied to at least one channel as described above. The downmix gain is then applied to the downmix signal 304. Alternatively, the downmix gain application unit 306 can apply the downmix gain during the process of generating the downmix signal 304 from the multi-channel audio signal 301.
The multiplexer 308 generates a bitstream 309 that includes the downmix signal 307, to which the downmix gain has been applied, and a spatial information signal 305. The spatial information signal 305 consists of the spatial information extracted by the spatial information generation unit 303. The bitstream 309 is transmitted to a decoding apparatus. The bitstream 309 may also include information about the downmix gain, that is, downmix gain information.
Fig. 4 illustrates a first decoding apparatus according to an embodiment of the present invention, which applies a downmix gain to a downmix signal to correct the downmix signal. The first decoding apparatus includes a demultiplexer 402, a downmix signal decoding unit 405, a spatial information signal decoding unit 406, a downmix gain application unit 409, and a multi-channel generation unit 411.
Referring to Fig. 4, the demultiplexer 402 receives a bitstream 401 of an audio signal and separates an encoded downmix signal 403 and an encoded spatial information signal 404 from the bitstream 401.
The downmix signal decoding unit 405 decodes the encoded downmix signal 403 and outputs the resulting decoded signal as a downmix signal 407. The spatial information signal decoding unit 406 decodes the encoded spatial information signal 404 and outputs the resulting decoded signal as spatial information 408.
The downmix gain application unit 409 applies the downmix gain to the downmix signal 407, thereby outputting a downmix signal 410 having the original level. For example, when the downmix gain is greater than 1, the downmix signal is multiplied by the downmix gain to raise its level. The downmix gain application unit 409 can also apply the downmix gain during the process of converting the downmix signal into a multi-channel audio signal.
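The complementary roles of the encoder-side and decoder-side gain application can be sketched as follows. This is a minimal illustration assuming a single scalar downmix gain greater than 1 and ignoring the core codec between the two steps; the function names and sample values are hypothetical.

```python
def encoder_apply_downmix_gain(downmix_signal, gain):
    """Encoder side: multiply by the reciprocal of the gain to lower the level."""
    if gain <= 0:
        raise ValueError("downmix gain must be positive")
    return [s / gain for s in downmix_signal]

def decoder_apply_downmix_gain(downmix_signal, gain):
    """Decoder side: multiply by the gain to restore the original level."""
    return [s * gain for s in downmix_signal]

# Hypothetical downmix samples that would not fit in signed 16 bits.
original = [50000.0, -150000.0, 25000.0]
gain = 5.0

sent = encoder_apply_downmix_gain(original, gain)
restored = decoder_apply_downmix_gain(sent, gain)

print(sent)      # [10000.0, -30000.0, 5000.0], all within the 16-bit range
print(restored)  # [50000.0, -150000.0, 25000.0]
```

In a real system the scaled signal would be quantized and passed through the core codec before the decoder restores the level, so the recovery would be approximate rather than exact as it is here.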
Fig. 5 illustrates a second encoding apparatus according to an embodiment of the present invention, which applies a downmix gain to a multi-channel audio signal to correct the multi-channel audio signal. Like the first encoding apparatus, the second encoding apparatus includes a downmix unit 504, a spatial information generation unit 505, a downmix gain application unit 502, and a multiplexer 508.
Referring to Fig. 5, the second encoding apparatus is similar to the first encoding apparatus. It differs from the first encoding apparatus in the position of the downmix gain application unit 502. That is, whereas the downmix gain is applied to the downmix signal in the first encoding apparatus, it is applied to the multi-channel audio signal in the second encoding apparatus.
In detail, the downmix gain application unit 502 applies the downmix gain to a multi-channel audio signal 501, thereby generating a multi-channel audio signal 503 to which the downmix gain has been applied. The downmix unit 504 downmixes the multi-channel audio signal 503, thereby generating a downmix signal 506. The spatial information generation unit 505 extracts spatial information from the multi-channel audio signal 503 to which the downmix gain has been applied. The multiplexer 508 generates a bitstream 509 that includes the downmix signal 506 and a spatial information signal 507.
Fig. 6 illustrates a second decoding apparatus according to an embodiment of the present invention, which applies a downmix gain to a multi-channel audio signal to correct the multi-channel audio signal. Like the first decoding apparatus, the second decoding apparatus includes a demultiplexer 602, a downmix signal decoding unit 605, a spatial information signal decoding unit 606, a multi-channel generation unit 609, and a downmix gain application unit 611.
Because the demultiplexer 602, the downmix signal decoding unit 605, and the spatial information signal decoding unit 606 are the same as or similar to the corresponding parts of the first decoding apparatus described with reference to Fig. 4, they are not described in detail again.
The downmix gain application unit 611 applies the downmix gain to a multi-channel audio signal 610, thereby outputting a multi-channel audio signal to which the downmix gain has been applied (out2). When the decoding apparatus cannot output a multi-channel audio signal using the spatial information, the downmix signal 607 can be output directly from the downmix signal decoding unit 605 (out1).
Fig. 7 illustrates a third encoding apparatus according to an embodiment of the present invention, which applies a downmix gain to a downmix signal to correct the downmix signal. The third encoding apparatus includes a downmix unit 702, a spatial information generation unit 703, a downmix gain determination unit 706, a downmix gain application unit 708, and a multiplexer 710.
Referring to Fig. 7, the third encoding apparatus is similar to the first encoding apparatus. It differs from the first encoding apparatus in that it includes the downmix gain determination unit 706. Because the downmix unit 702, the spatial information generation unit 703, the downmix gain application unit 708, and the multiplexer 710 are the same as or similar to the corresponding parts of the first encoding apparatus described with reference to Fig. 3, they are not described in detail.
The downmix gain determination unit 706 determines the downmix gain to be applied to the downmix signal. The downmix gain determination unit 706 can determine the downmix gain by measuring how often, and to what degree, level loss occurs when the downmix signal 704 is generated by downmixing the multi-channel audio signal 701.
Assuming that $x_k(n)$ ($k = 1, 2, 3, \ldots, N$) denotes the signal of each channel of the multi-channel audio signal and that the downmix signal is generated as $\sum_{k=1}^{N} a_k x_k(n)$, the maximum value of the downmix gain can be determined as $\sum_{k=1}^{N} a_k$. For example, when $a_1 = 1$, $a_2 = 1$, $a_3 = 1$, $a_4 = 1/\sqrt{2}$, $a_5 = 1/\sqrt{2}$, and $a_6 = 1/\sqrt{10}$, the maximum value of the downmix gain can be determined as 4.73. When this maximum value is rounded down, it is determined as 4.
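The arithmetic of this example can be checked with a short Python sketch. The six coefficient values below are an assumption: they are a set consistent with the stated total of 4.73, and the function name is hypothetical. The sketch sums absolute values, which equals the plain sum for non-negative coefficients and bounds the worst-case peak of the downmix relative to the peak of any single channel.

```python
import math

def max_downmix_gain(coefficients):
    """Upper bound on the downmix peak relative to the per-channel peak:
    the sum of the absolute downmix coefficients."""
    return sum(abs(a) for a in coefficients)

# Assumed coefficient set, chosen to be consistent with the total of 4.73.
EXAMPLE_COEFFICIENTS = [
    1.0, 1.0, 1.0,
    1.0 / math.sqrt(2.0),
    1.0 / math.sqrt(2.0),
    1.0 / math.sqrt(10.0),
]

gain_max = max_downmix_gain(EXAMPLE_COEFFICIENTS)
print(round(gain_max, 2))    # 4.73
print(math.floor(gain_max))  # 4
```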
Fig. 8 illustrates a third decoding apparatus according to an embodiment of the present invention, which applies a downmix gain to a downmix signal to correct the downmix signal. The third decoding apparatus includes a demultiplexer 802, a downmix signal decoding unit 805, a spatial information signal decoding unit 807, a downmix gain extraction unit 808, a downmix gain application unit 809, and a multi-channel generation unit 812.
Referring to Fig. 8, the third decoding apparatus is similar to the first decoding apparatus. It differs from the first decoding apparatus in that it includes the downmix gain extraction unit 808.
Because the demultiplexer 802, the downmix signal decoding unit 805, the spatial information signal decoding unit 807, the downmix gain application unit 809, and the multi-channel generation unit 812 are the same as or similar to the corresponding parts of the first decoding apparatus described with reference to Fig. 4, they are not described in detail.
The downmix gain extraction unit 808 can extract the downmix gain information from the decoded spatial information signal 804 or from the decoded downmix signal 803.
Fig. 9 illustrates bit streams, each containing multi-channel audio gain information, according to embodiments of the present invention. As shown in Fig. 9(a), the multi-channel audio gain information can be inserted, frame by frame, into the spatial signal information 902 of a bit stream that comprises a down-mix audio signal 901 and the spatial signal information 902.
As shown in Fig. 9(b), the multi-channel audio gain information can also be inserted, frame by frame, into the down-mix audio signal 903 of the bit stream. Likewise, the multi-channel audio gain information can be inserted into the bit stream once every plurality of frames. The multi-channel audio gain can have a constant value for all frames of the bit stream, or a value that varies every frame or every plurality of frames.
According to the present invention, a method can be implemented in which the spatial signal information has a header (or configuration information area) every frame or every plurality of frames, and the header contains the multi-channel audio gain information. Where the spatial signal information has a header in every frame, the decoding device extracts the multi-channel audio gain information from the header and applies the multi-channel audio gain to that frame. Where the spatial signal information has a header only once every plurality of frames, the decoding device extracts the multi-channel audio gain information from the frame that has the header. The decoding device then applies the multi-channel audio gain to the frame that has the header, and applies to each remaining frame without a header the multi-channel audio gain extracted from the most recent header. Headers can be included in the frames of the spatial signal information periodically or aperiodically.
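The header-based scheme can be sketched as follows. This is a minimal illustration only: the `Frame` class, its `header_gain` field, and `apply_header_gains` are hypothetical names, and the gain is modelled as a plain multiplication rather than any actual bit stream syntax.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Frame:
    samples: List[float]
    # Present only when this frame carries a header (hypothetical field).
    header_gain: Optional[float] = None


def apply_header_gains(frames, default_gain=1.0):
    """Apply to each frame the gain from the most recent header."""
    current = default_gain
    out = []
    for frame in frames:
        if frame.header_gain is not None:
            current = frame.header_gain  # a new header updates the gain
        out.append([s * current for s in frame.samples])
    return out


frames = [
    Frame([1.0, 1.0], header_gain=2.0),
    Frame([1.0]),                      # no header: reuses 2.0
    Frame([1.0], header_gain=0.5),
    Frame([1.0]),                      # no header: reuses 0.5
]
result = apply_header_gains(frames)
```

Because the loop only reacts to frames that carry a header, the same code covers both periodic and aperiodic header placement.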
As shown in Fig. 9(c), the multi-channel audio gain information can also be inserted into the header 904 of the bit stream. The header 904 contains configuration information and the like. In this case, the multi-channel audio gain information can be inserted into the header as an independent value, or it can be grouped with other values, such as particular channel gains, and inserted into the header as a group of values.
According to the present invention, another method can be implemented in which the multi-channel audio gain information is inserted into a reserved field of the bit stream, so that no additional bits are used.
In addition, according to the present invention, another method can be implemented that combines the methods shown in Figs. 9(a), 9(b), and 9(c). For example, the multi-channel audio gain can be inserted into the header as shown in Fig. 9(c) and, at the same time, into the spatial signal information as shown in Fig. 9(a). Furthermore, the multi-channel audio gain can be inserted into the bit stream directly, or it can be inserted selectively according to identification information indicating whether the multi-channel audio gain is used. For example, the header of the bit stream can carry first identification information indicating whether the multi-channel audio gain is used. When the first identification information indicates that the multi-channel audio gain is used, each frame of the bit stream carries second identification information indicating whether the multi-channel audio gain is used in that frame. When it is determined that the multi-channel audio gain is used in a frame, the multi-channel audio gain is included in that frame.
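The two-level identification scheme can be sketched as follows. The dictionaries stand in for a parsed header and parsed frames; the keys `use_gain` and `gain` and the neutral value of 1.0 are assumptions for illustration, not the actual bit stream syntax.

```python
def gains_per_frame(header, frames, neutral=1.0):
    """Return the gain to use for each frame under a two-level flag scheme."""
    # First identification information: is the gain used at all?
    if not header.get("use_gain", False):
        return [neutral] * len(frames)
    gains = []
    for frame in frames:
        # Second identification information: is the gain used in this frame?
        if frame.get("use_gain", False):
            gains.append(frame["gain"])
        else:
            gains.append(neutral)
    return gains


header = {"use_gain": True}
frames = [
    {"use_gain": True, "gain": 0.5},
    {"use_gain": False},
    {"use_gain": True, "gain": 0.25},
]
selected = gains_per_frame(header, frames)
```

With this layout a stream that never uses the gain costs only the single header flag, and per-frame gain values are carried only by frames that need them.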
Figures 10A and 10B illustrate various types of multi-channel audio gains according to an embodiment of the present invention. The multi-channel audio gain can take various values. For example, as shown in Figures 10A and 10B, the particular channel gains (for example, the surround gain and the LFE gain) and the multi-channel audio gain can form a table. In each of Tables 1 to 6, "$1/\sqrt{2}$" and "$1/\sqrt{10}$" can be used for the surround gain and the LFE gain, respectively.
With reference to Table 1, "1" or "1/2" can be used for the multi-channel audio gain.
With reference to Table 2, "1", "1/2", or "1/4" can be used for the multi-channel audio gain.
With reference to Table 3, "1", "$1/\sqrt{2}$", or "1/2" can be used for the multi-channel audio gain.
With reference to Table 4, "1", "$1/\sqrt{2}$", "1/2", or "$1/(2\sqrt{2})$" can be used for the multi-channel audio gain.
With reference to Table 5, "1", "3/4", "2/3", or "1/2" can be used for the multi-channel audio gain.
With reference to Table 6, "1", "3/4", "2/4", or "1/4" can be used for the multi-channel audio gain.
Although the surround gain and the LFE gain are described in Figures 10A and 10B as fixed at particular values (for example, "$1/\sqrt{2}$" and "$1/\sqrt{10}$", respectively), the present invention is not limited thereto. According to the present invention, the surround gain and the LFE gain can be selected from a plurality of particular values, as with the multi-channel audio gain. According to the present invention, particular channel gains other than the surround gain and the LFE gain can also be used.
Figure 11 illustrates a method according to the present invention for preventing the sound quality deterioration that applying the multi-channel audio gain can cause around a frame. When applying the multi-channel audio gain changes the sound level, sound quality deterioration can occur around the frame at which the value of the multi-channel audio gain changes abruptly, because the sound level also changes abruptly there. For this reason, a transition period needs to be set so that the effect of the change in the multi-channel audio gain appears smoothly. To this end, smoothing can be performed using the following expression.
$DG(n) = a(n)\,DG_{t-1}(n-1) + (1 - a(n))\,DG_t(n)$,
where $n = 0, 1, 2, \ldots, N$.
In the above expression, "a(n)" can be a first-order linear function or a general n-th-order polynomial function. "a(n)" can also be a function that varies smoothly when the multi-channel audio gain (DG) changes, for example a Gaussian function, a Hanning function, or a Hamming function.
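A minimal sketch of such a transition follows. It simplifies the expression by treating the gain of the previous frame and the gain of the current frame as two constants and blending them with a Hanning-shaped weight a(n) that falls from 1 to 0; the function names and the transition length are assumptions.

```python
import math


def hann_weight(n, length):
    """Weight a(n) that falls smoothly from 1 (n = 0) to 0 (n = length)."""
    return 0.5 * (1.0 + math.cos(math.pi * n / length))


def smoothed_gain(prev_gain, cur_gain, length):
    """Blend a(n) * prev_gain + (1 - a(n)) * cur_gain for n = 0..length."""
    curve = []
    for n in range(length + 1):
        a = hann_weight(n, length)
        curve.append(a * prev_gain + (1.0 - a) * cur_gain)
    return curve


curve = smoothed_gain(prev_gain=1.0, cur_gain=0.5, length=8)
```

The curve starts at the previous gain and ends at the current gain, so multiplying the samples of the transition period by it avoids a step in sound level at the frame boundary.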
Even when the above smoothing is performed, however, adverse effects caused by an abrupt change in the multi-channel audio gain may remain. Therefore, a constraint can be imposed in the encoding process to prevent abrupt changes in the multi-channel audio gain. Of course, even when the encoding device has no configuration for preventing such abrupt changes, the decoding device can perform an analysis to prevent them. For example, when the multi-channel audio gain takes values that change in increments or decrements, an abrupt change can be prevented by limiting the change to an increment or decrement of one unit between successive frames, or to an increment or decrement of one unit every predetermined number of frames (n frames).
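The one-unit-per-frame constraint can be sketched as a clamp on a gain index. The representation of the gain as an integer index into a gain table, and the function name, are assumptions made for illustration.

```python
def limit_gain_steps(requested, max_step=1):
    """Clamp frame-to-frame changes of a gain index to at most max_step."""
    if not requested:
        return []
    limited = [requested[0]]
    for target in requested[1:]:
        prev = limited[-1]
        # Move toward the requested index, but by no more than max_step.
        delta = max(-max_step, min(max_step, target - prev))
        limited.append(prev + delta)
    return limited


indices = limit_gain_steps([0, 3, 3, 3, 0])
print(indices)
```

A jump from index 0 to index 3 is spread over three frames, and the return to 0 likewise proceeds one unit at a time.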
Figure 12 is a flow chart illustrating an audio signal encoding method, according to an embodiment of the present invention, in which a multi-channel audio gain is applied to a down-mix audio signal. With reference to Figure 12, the encoding device that carries out this method first receives a multi-channel audio signal (S1201). The multi-channel audio signal is then down-mixed by the down-mix unit of the encoding device to generate a down-mix audio signal (S1202). Although the down-mix audio signal is obtained here by down-mixing the multi-channel audio signal, a down-mix audio signal input directly from outside the encoding device, for example an arbitrary down-mix audio signal, can also be used. Spatial signal information is generated from the multi-channel audio signal by the spatial information generation unit of the encoding device (S1202).
After this, the multi-channel audio gain applying unit of the encoding device applies the multi-channel audio gain to the down-mix audio signal (S1203). For example, when the multi-channel audio gain is greater than 1, the down-mix audio signal is multiplied by the reciprocal of the multi-channel audio gain to lower its sound level. On the other hand, when the multi-channel audio gain is less than 1, the down-mix audio signal is multiplied by the multi-channel audio gain itself to lower its sound level.
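The rule for step S1203 can be sketched as follows; the function name is hypothetical and the signal is modelled as a plain list of samples.

```python
def apply_encoder_gain(samples, gain):
    """Lower the down-mix level: use 1/gain when gain > 1, else gain itself."""
    if gain <= 0:
        raise ValueError("gain must be positive")
    factor = 1.0 / gain if gain > 1.0 else gain
    return [s * factor for s in samples]


attenuated_a = apply_encoder_gain([1.0, -0.5], gain=2.0)   # multiplied by 1/2
attenuated_b = apply_encoder_gain([1.0, -0.5], gain=0.5)   # multiplied by 1/2
```

Both calls attenuate by the same factor, which shows that the two branches of the rule express one attenuation in two conventions.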
The multiplexer of the encoding device then generates a bit stream comprising the down-mix audio signal to which the multi-channel audio gain has been applied and the spatial signal information (S1204). The generated bit stream can be transmitted to a decoding device (S1204).
The multi-channel audio gain can be applied to all frames of the down-mix audio signal in the bit stream. Although this approach suits down-mix audio signal frames with a high sound level, it has a drawback when applied to frames with a low sound level, because the signal-to-noise ratio (SNR) may deteriorate. Therefore, different multi-channel audio gain values can be used at predetermined time intervals.
A syntax for applying the multi-channel audio gain can be defined per frame in the bit stream. In this case, the multi-channel audio gain can be applied selectively to each frame according to this syntax. For example, the multi-channel audio gain can be applied to the down-mix audio signal in the following ways.
First, the multi-channel audio gain is set in the header of the bit stream. In this case, the multi-channel audio gain can be applied to all frames of the down-mix audio signal that are governed by this header.
Second, an independent multi-channel audio gain is applied to the down-mix audio signal every frame, according to a separately defined syntax.
Third, the first and second methods are combined. That is, a multi-channel audio gain to be applied to all frames of the down-mix audio signal (hereinafter referred to as the "first multi-channel audio gain") is set. The first multi-channel audio gain is used for the whole period or for a long period, for example 1 to 2 seconds. In addition to the first multi-channel audio gain, another multi-channel audio gain (hereinafter referred to as the "second multi-channel audio gain") is applied to the down-mix audio signal every frame, so that gain control can be performed over intervals that the first multi-channel audio gain does not cover.
As described above, when the decoded down-mix audio signal is reproduced in the form of a mono or stereo signal, the down-mix audio signal to which the multi-channel audio gain has been applied can be decoded directly, without taking the applied multi-channel audio gain into account. However, when the down-mix audio signal is decoded and reproduced in the form of a multi-channel audio signal, one of the following methods needs to be used.
The first method is to apply the multi-channel audio gain to the whole range of the down-mix audio signal, or to the range of the down-mix audio signal governed by a header, in order to restore the sound level of the associated audio signal.
The second method is to apply the multi-channel audio gain every frame, or every plurality of frames spanning a range shorter than that governed by a header.
The third method is to combine the first and second methods. That is, a multi-channel audio gain is applied to the down-mix audio signal every frame or every plurality of frames, and another multi-channel audio gain is subsequently applied to the whole range of the down-mix audio signal.
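The third method can be sketched as a two-stage restoration. The sketch assumes that the encoder attenuated every sample by a global factor and a per-frame factor and that the decoder knows both; the function and variable names are hypothetical.

```python
def restore_level(frames, frame_factors, global_factor):
    """Undo the per-frame attenuation first, then the global attenuation."""
    restored = []
    for samples, frame_factor in zip(frames, frame_factors):
        per_frame = [s / frame_factor for s in samples]            # second gain
        restored.append([s / global_factor for s in per_frame])    # first gain
    return restored


original = [[0.8, -0.4], [0.2, 0.1]]
global_factor = 0.5
frame_factors = [0.5, 1.0]

# Encoder side (assumed): attenuate by both factors.
encoded = [
    [s * global_factor * f for s in frame]
    for frame, f in zip(original, frame_factors)
]
decoded = restore_level(encoded, frame_factors, global_factor)
```

Setting every per-frame factor to 1.0 reduces this to the first method, and setting the global factor to 1.0 reduces it to the second.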
Figure 13 is a flow chart illustrating an audio signal decoding method, according to an embodiment of the present invention, in which a multi-channel audio gain is applied to a down-mix audio signal. With reference to Figure 13, the decoding device that uses this audio signal decoding method receives an audio signal bit stream (S1301). The bit stream comprises an encoded down-mix audio signal and encoded spatial signal information.
The demultiplexer of the decoding device separates the encoded down-mix audio signal and the encoded spatial signal information from the received bit stream (S1302). The down-mix audio signal decoding unit of the decoding device decodes the encoded down-mix audio signal and outputs the decoded down-mix audio signal (S1303).
When the decoding device cannot output a multi-channel audio signal using the spatial information (S1304), it can directly output the down-mix audio signal decoded by the down-mix audio signal decoding unit (S1308). On the other hand, when the decoding device can output a multi-channel audio signal (S1304), the following procedure is carried out.
That is, the spatial signal information decoding unit of the decoding device decodes the separated spatial signal information to generate spatial information. The multi-channel audio gain extraction unit of the decoding device extracts the multi-channel audio gain information from the spatial signal information or the down-mix audio signal (S1305). The multi-channel audio gain can be determined based on the extracted multi-channel audio gain information. The multi-channel audio gain applying unit of the decoding device applies the determined multi-channel audio gain to the down-mix audio signal (S1306). The multichannel generation unit of the decoding device uses the spatial information to convert the down-mix audio signal to which the multi-channel audio gain has been applied into a multi-channel audio signal (S1307).
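The control flow of Figure 13 can be sketched as follows. Everything here is a stand-in: the bit stream is a dictionary that is already demultiplexed and decoded, `toy_upmix` merely copies the down-mix into each output channel, and the key names are hypothetical.

```python
def toy_upmix(downmix, spatial):
    """Placeholder up-mix: copy the down-mix into every output channel."""
    return [list(downmix) for _ in range(spatial["channels"])]


def decode(bitstream, can_output_multichannel, upmix=toy_upmix):
    downmix = list(bitstream["downmix"])            # stands in for S1302-S1303
    if not can_output_multichannel:                 # S1304
        return downmix                              # S1308: output as-is
    spatial = bitstream["spatial"]
    gain = spatial.get("gain", 1.0)                 # S1305: extract the gain
    corrected = [s * gain for s in downmix]         # S1306: apply the gain
    return upmix(corrected, spatial)                # S1307: generate channels


stream = {"downmix": [0.25, -0.5], "spatial": {"gain": 2.0, "channels": 3}}
mono_out = decode(stream, can_output_multichannel=False)
multi_out = decode(stream, can_output_multichannel=True)
```

A decoder that cannot up-mix never reads the gain, which is why the down-mix remains playable on its own.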
Figure 14 illustrates an encoding device according to an embodiment of the present invention, which applies an arbitrary multi-channel audio (down-mix) gain (ADG) to a down-mix audio signal in order to correct the down-mix audio signal. The encoding device comprises a down-mix unit 1402, a spatial information generation unit 1403, an ADG generation unit 1407, an ADG applying unit 1409, and a multiplexer 1411.
With reference to Figure 14, the down-mix unit 1402 down-mixes the multi-channel audio signal 1401, thereby generating a down-mix audio signal 1404. In Figure 14, "n" denotes the number of input channels. The spatial information generation unit 1403 extracts spatial information from the multi-channel audio signal 1401.
The multiplexer 1411 generates a bit stream 1412 comprising the ADG-applied down-mix audio signal 1408, that is, the down-mix audio signal to which the ADG has been applied, and the spatial signal information 1406. The spatial signal information 1406 consists of the spatial information extracted by the spatial information generation unit 1403. The bit stream 1412 is transmitted to a decoding device. The bit stream 1412 can also contain information about the ADG.
Figure 15 illustrates a decoding device according to an embodiment of the present invention, which applies an ADG to a down-mix audio signal in order to correct the down-mix audio signal. The decoding device comprises a demultiplexer 1502, a down-mix audio signal decoding unit 1505, a spatial signal information decoding unit 1507, an ADG extraction unit 1508, an ADG applying unit 1509, and a multichannel generation unit 1512.
With reference to Figure 15, the demultiplexer 1502 separates the encoded down-mix audio signal 1503 and the encoded spatial signal information 1504 from the bit stream 1501.
The down-mix audio signal decoding unit 1505 decodes the encoded down-mix audio signal 1503 and outputs the resulting decoded signal as the down-mix audio signal 1506, which can be a mono, stereo, or multi-channel audio signal. The down-mix audio signal decoding unit 1505 can use the decoder of a core codec. When the decoding device cannot process the down-mix audio signal 1506 to output a multi-channel audio signal, the down-mix audio signal can be output directly from the decoding device (out1).
The spatial signal information decoding unit 1507 decodes the encoded spatial signal information 1504 and outputs the resulting decoded signal as the spatial information 1511. The ADG extraction unit 1508 extracts information about the ADG, that is, ADG information, from the spatial signal information 1504. The ADG extraction unit 1508 can also extract the ADG information from the down-mix audio signal 1506. The ADG applying unit 1509 applies the ADG to the down-mix audio signal 1506, where the ADG is determined based on the ADG information extracted by the ADG extraction unit 1508. Using the spatial information 1511, the multichannel generation unit 1512 converts the ADG-applied down-mix audio signal 1510 into a multi-channel audio signal and outputs this multi-channel audio signal (out2).
Figure 16 illustrates an encoding device according to an embodiment of the present invention, which applies a multi-channel audio gain and an ADG to a down-mix audio signal in order to correct the down-mix audio signal. The encoding device comprises a down-mix unit 1602, a spatial information generation unit 1603, a multi-channel audio gain applying unit 1606, an ADG applying unit 1608, and a multiplexer 1610.
With reference to Figure 16, because the down-mix unit 1602, the spatial information generation unit 1603, and the multiplexer 1610 are the same as or similar to the corresponding parts of Figure 14, they are not described again in detail.
The encoding device of Figure 16 differs from the encoding device of Figure 14 in that it comprises both the multi-channel audio gain applying unit 1606 and the ADG applying unit 1608, in order to apply the multi-channel audio gain and the ADG. Although not shown in Figure 16, the encoding device of Figure 16 can also comprise a multi-channel audio gain generation unit and an ADG generation unit.
In detail, the multi-channel audio gain applying unit 1606 applies the multi-channel audio gain to the down-mix audio signal 1604. The multi-channel audio gain can be applied uniformly to the whole range of the down-mix audio signal 1604. Alternatively, the operation of applying the multi-channel audio gain can be carried out in the down-mix unit 1602 while the multi-channel audio signal 1601 is being down-mixed, thereby generating the down-mix audio signal 1604.
The ADG applying unit 1608 applies the ADG to the down-mix audio signal 1607 to which the multi-channel audio gain has been applied. As described above, the operation of applying the ADG to the down-mix audio signal 1607 can be carried out every frame. The effect of applying the ADG on the waveform of the down-mix audio signal may resemble the effect of applying dynamic range control (DRC). The ADG can be applied to the down-mix audio signal in the frequency domain, more specifically in the hybrid domain. According to the present invention, it is also feasible to apply the multi-channel audio gain and the ADG to a down-mix audio signal (not shown) input from outside the encoding device.
The multiplexer 1610 generates a bit stream 1611 comprising the ADG-applied down-mix audio signal 1609 and the spatial signal information 1605.
Figure 17 illustrates a decoding device according to an embodiment of the present invention, which applies a multi-channel audio gain and an ADG to a down-mix audio signal in order to correct the down-mix audio signal. The decoding device comprises a demultiplexer 1702, a down-mix audio signal decoding unit 1705, a spatial signal information decoding unit 1707, a multi-channel audio gain and ADG extraction unit 1708, an ADG applying unit 1709, a multi-channel audio gain applying unit 1711, and a multichannel generation unit 1714.
With reference to Figure 17, the demultiplexer 1702, the down-mix audio signal decoding unit 1705, the spatial signal information decoding unit 1707, and the multichannel generation unit 1714 have functions that are the same as or similar to those of the demultiplexer 1502, the down-mix audio signal decoding unit 1505, the spatial signal information decoding unit 1507, and the multichannel generation unit 1512 shown in Figure 15. Therefore, a detailed description of these components is omitted.
The decoding device of Figure 17 differs from the decoding device of Figure 15 in that it comprises the multi-channel audio gain and ADG extraction unit 1708, the ADG applying unit 1709, and the multi-channel audio gain applying unit 1711, in order to apply the multi-channel audio gain and the ADG.
The multi-channel audio gain and ADG extraction unit 1708 extracts the multi-channel audio gain information and the ADG information from the spatial signal information 1704. The multi-channel audio gain information and the ADG information can be extracted by the same component. Alternatively, they can be extracted by separate components (not shown). The multi-channel audio gain information and the ADG information can also be extracted from the down-mix audio signal 1706.
The multi-channel audio gain applying unit 1711 applies the multi-channel audio gain, generated from the multi-channel audio gain information, to the ADG-applied down-mix audio signal 1710. Using the spatial information 1713, the multichannel generation unit 1714 converts the down-mix audio signal 1712, to which the ADG and the multi-channel audio gain have been applied, into a multi-channel audio signal and outputs it (out2). When the decoding device cannot output such a multi-channel audio signal, it can directly output the down-mix audio signal 1706 generated by the decoding operation of the down-mix audio signal decoding unit 1705 (out1).
Figure 18 illustrates a plurality of frequency bands to which the ADG is applied according to an embodiment of the present invention. When the ADG is applied to frequency bands of the audio signal, the ADG can take the same values as the channel level difference (CLD) of the audio signal. For example, the ADG can have the same number of parameter bands as the CLD. Therefore, when the function of applying the ADG is implemented in a decoding device, the number of groups into which the whole frequency band should be divided can be determined based on "bsFreqResStridexxx", as shown in Figure 18.
When "pbStride" is 1, the frequency bands are not grouped. In this case, an ADG is read for each frequency band and applied to that band. When "pbStride" is 5, an ADG is read for every 5 frequency bands and applied to those 5 bands. When "pbStride" is 28, a single ADG is read and applied to the whole frequency band. Thus, when "pbStride" is 28, gain control is performed over the whole frequency band, and when "pbStride" has a value other than 28, multiband gain control is performed.
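The grouping can be sketched as follows. The sketch assumes 28 parameter bands in total (inferred from the statement that a stride of 28 covers the whole frequency band) and a hypothetical helper `expand_adg` that spreads one ADG value over each group of bands.

```python
def expand_adg(adg_values, num_bands, pb_stride):
    """Spread one ADG value over each group of pb_stride parameter bands."""
    groups = -(-num_bands // pb_stride)  # ceiling division
    if len(adg_values) != groups:
        raise ValueError(f"expected {groups} ADG values, got {len(adg_values)}")
    return [adg_values[band // pb_stride] for band in range(num_bands)]


NUM_BANDS = 28  # assumed total number of parameter bands

full_band = expand_adg([0.9], NUM_BANDS, pb_stride=28)
grouped = expand_adg([1, 2, 3, 4, 5, 6], NUM_BANDS, pb_stride=5)
per_band = expand_adg(list(range(NUM_BANDS)), NUM_BANDS, pb_stride=1)
```

With a stride of 5 the last group holds only the remaining three bands, so six ADG values are read for 28 bands.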
Gain control based on the ADG can also be performed for each channel of the down-mix audio signal.
The ADG can also be applied on a time slot basis. Here, a time slot denotes one of the time intervals into which the audio signal is equally divided in the time domain. Therefore, when the sound level changes abruptly toward a loud level at a particular time position, gain control for the loud sound can be performed at that time position. Step interpolation of the ADG is performed only when the ADG value changes; otherwise, the ADG value is kept constant. Thus, in the case of full-band gain control, there is one ADG per time slot for the whole frequency band. On the other hand, in the case of multiband gain control, there is one ADG per time slot for each of the multiple bands.
Figure 19 is a flow chart illustrating an audio signal encoding method, according to an embodiment of the present invention, in which an ADG is applied to a down-mix audio signal in order to correct the down-mix audio signal. The encoding device that carries out this method first receives a multi-channel audio signal (S1901).
The multi-channel audio signal is then down-mixed by the down-mix unit of the encoding device to generate a first down-mix audio signal (S1902).
Spatial signal information is generated from the multi-channel audio signal by the spatial information generation unit of the encoding device (S1902).
After this, the ADG generation unit of the encoding device compares a down-mix audio signal input directly from outside the encoding device, that is, a second down-mix audio signal, with the first down-mix audio signal. Based on the result of the comparison, the ADG generation unit generates the ADG (S1903). The generated ADG is then applied to the first down-mix audio signal or the second down-mix audio signal in the ADG applying unit of the encoding device (S1904). The multiplexer of the encoding device then generates a bit stream comprising the ADG-applied down-mix audio signal and the spatial signal information (S1905). The generated bit stream is transmitted to a decoding device (S1905).
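One way the comparison in step S1903 could be realised is a per-band energy ratio. This is an assumption made for illustration, since the text does not state how the two down-mix audio signals are compared; the function names and the direction of the ratio (mapping the second down-mix onto the first) are likewise hypothetical.

```python
import math


def band_energy(samples):
    """Sum of squared samples in one band."""
    return sum(s * s for s in samples)


def generate_adg(first_bands, second_bands, eps=1e-12):
    """Per-band gain that maps the second down-mix energy onto the first."""
    return [
        math.sqrt((band_energy(ref) + eps) / (band_energy(cur) + eps))
        for ref, cur in zip(first_bands, second_bands)
    ]


# Two bands per signal; the values are made up for the example.
first_downmix = [[1.0, 1.0], [0.5]]
second_downmix = [[0.5, 0.5], [1.0]]
adg = generate_adg(first_downmix, second_downmix)
```

Multiplying each band of the second down-mix by the matching gain brings its band energy to that of the first down-mix; the small `eps` only guards against division by zero for silent bands.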
According to the present invention, another audio signal encoding method can be implemented in which a multi-channel audio gain and an ADG are applied to the down-mix audio signal in order to correct it. This encoding method is similar to the encoding method shown in Figure 19. It differs in that it further comprises applying the multi-channel audio gain to the down-mix audio signal after the down-mix audio signal and the spatial signal information have been generated as shown in Figure 19. In this encoding method, the ADG is subsequently applied to the down-mix audio signal to which the multi-channel audio gain has been applied.
According to the present invention, the ADG can be generated in the following manner: the low-frequency part of the ADG is generated not as a gain but by performing residual coding on the low-frequency component of the first down-mix audio signal, while the high-frequency part of the ADG is generated as a gain, as in the conventional method, so that the generated ADG can provide progressively improved performance. Here, the term "residual coding" denotes directly coding a part of the down-mix audio signal.
In the above method, the low-frequency part of the ADG is generated by performing residual coding on the low-frequency component of the first down-mix audio signal. However, the low-frequency part of the ADG can also be generated by performing residual coding on the difference between the first and second down-mix audio signals.
The ADG generated as a gain and the ADG generated by residual coding of the low-frequency component of the first down-mix audio signal are applied to the down-mix audio signal to correct it. According to the present invention, recovery information associated with the point at which a loss of sound level occurs in the down-mix audio signal can be added to the ADG, or can be transmitted together with the ADG, so that the ADG with the recovery information can be used to correct the down-mix audio signal in the decoding device.
According to the present invention, the ADG can also include information for correcting the down-mix audio signal (for example, changing its amplitude) and information for recovering the second down-mix audio signal so as to reduce the difference between the second down-mix audio signal and the first down-mix audio signal. The ADG generated in the above manner can be transmitted in a state of being included in the spatial signal information.
Figure 20 is a flow chart illustrating an audio signal decoding method, according to an embodiment of the present invention, in which an ADG is applied to a down-mix audio signal in order to correct the down-mix audio signal. With reference to Figure 20, the decoding device that uses this audio signal decoding method receives the bit stream of an audio signal (S2001). The bit stream comprises an encoded down-mix audio signal and encoded spatial signal information.
The demultiplexer of the decoding device separates the encoded down-mix audio signal and the encoded spatial signal information from the received bit stream (S2002). The separated down-mix audio signal is decoded by the down-mix audio signal decoding unit of the decoding device (S2003).
When the decoding device cannot use the spatial information to output the down-mix audio signal as a multi-channel audio signal (S2004), it can directly output the down-mix audio signal decoded by the down-mix audio signal decoding unit (S2008). On the other hand, when the decoding device can output the down-mix audio signal as a multi-channel audio signal (S2004), the following procedure is carried out.
That is, the spatial signal information decoding unit of the decoding device decodes the separated spatial signal information to generate spatial information. The ADG extraction unit of the decoding device also extracts ADG information from the spatial signal information or the down-mix audio signal (S2005). The ADG can be determined based on the extracted ADG information. The determined ADG is applied to the down-mix audio signal by the ADG applying unit of the decoding device (S2006). The ADG-applied down-mix audio signal is converted into a multi-channel audio signal by the multichannel generation unit of the decoding device based on the spatial information, and this multi-channel audio signal is output from the decoding device (S2007).
According to the present invention, another decoding method can also be implemented in which a multi-channel audio gain and an ADG are applied to the down-mix audio signal in order to correct it. This decoding method is similar to the decoding method shown in Figure 20. It differs in that it further comprises applying the multi-channel audio gain to the down-mix audio signal before the ADG is applied to the down-mix audio signal (S2006). This decoding method is described in more detail below.
The multi-channel audio gain information and the ADG information are extracted from the spatial signal information or the down-mix audio signal by a multi-channel audio gain and ADG extraction unit (not shown). The multi-channel audio gain generated based on the extracted multi-channel audio gain information is then applied to the down-mix audio signal. This multi-channel audio gain can be applied to the whole range of the down-mix audio signal. After this, the ADG generated based on the extracted ADG information is applied to the down-mix audio signal. The operation of applying the ADG to the down-mix audio signal can be carried out every frame.
Figure 21 is a block diagram of an encoding device for correcting the energy level of a particular channel according to an embodiment of the present invention. The encoding device comprises a particular channel energy level processing unit 2102, a down-mix unit 2104, a spatial information generation unit 2105, and a multiplexer 2108.
With reference to Figure 21, the particular channel energy level processing unit 2102 receives a multi-channel audio signal 2101, corrects the energy level of a particular channel in the received multi-channel audio signal 2101, and outputs the corrected multi-channel audio signal 2103. Here, the term "energy level" denotes a value proportional to the amplitude of the associated signal, and includes the sound level. Whether and how the energy level of a particular channel has changed can be determined by measurement or calculation. The energy level is preferably corrected by applying a particular channel gain to the channel signal whose energy level has changed. For example, the energy level can be corrected by applying the surround gain or the LFE gain to the surround channel or the LFE channel. The down-mix unit 2104 down-mixes the energy-level-corrected multi-channel audio signal 2103, thereby generating a down-mix audio signal 2106.
Spatial information can also be extracted from the multi-channel audio signal 2103 by the spatial information generation unit 2105.
The multiplexer 2108 generates a bit stream 2109 comprising the down-mix audio signal 2106 and the spatial signal information 2107. The spatial signal information 2107 consists of the spatial information extracted by the spatial information generation unit 2105. The bit stream 2109 is transmitted to a decoding device. The bit stream 2109 can also contain particular channel gain information.
Figure 22 is a block diagram showing a decoding device for correcting the energy level of a particular channel according to an embodiment of the invention. The decoding device includes a demultiplexer 2202, a down-mix audio signal decoding unit 2205, a spatial signal information decoding unit 2206, a multi-channel generation unit 2210 and a particular channel energy level processing unit 2212.
Referring to Figure 22, the demultiplexer 2202 receives the bit stream 2201 of an audio signal and separates from the bit stream 2201 the encoded down-mix audio signal 2203 and the encoded spatial signal information 2204.
The down-mix audio signal decoding unit 2205 decodes the encoded down-mix audio signal 2203 and outputs the resulting decoded down-mix audio signal 2208. The down-mix audio signal decoding unit 2205 can also generate a down-mix audio signal 2209 in pulse code modulation (PCM) data format by decoding the encoded down-mix audio signal 2203.
The spatial signal information decoding unit 2206 decodes the spatial signal information 2204 and outputs the resulting spatial information 2207. The multi-channel generation unit 2210 transforms the down-mix audio signal 2209 into a multi-channel audio signal 2211.
The particular channel energy level processing unit 2212 receives the multi-channel audio signal 2211, the spatial information 2207 and the down-mix audio signal 2208, and performs energy level correction per channel based on the received signals.
The particular channel energy level processing unit 2212 includes a channel energy level detecting unit 2213, a correction determining unit 2214 and a channel energy level correcting unit 2215. The channel energy level detecting unit 2213 detects, per channel, whether and how the channel energy level of the multi-channel audio signal 2211 has changed. The correction determining unit 2214 determines, per channel, whether energy level correction should be performed, based on the detection result of the channel energy level detecting unit 2213. The channel energy level correcting unit 2215 corrects the energy level of the particular channel based on the determination made by the correction determining unit 2214.
When the decoding device cannot output a multi-channel audio signal, it can directly output the down-mix audio signal 2208 (out1) generated by the decoding operation of the down-mix audio signal decoding unit 2205. On the other hand, when the decoding device can output a multi-channel audio signal, it can output the multi-channel audio signal (out2) after correcting its energy level per channel.
When no energy level correction information about the particular channel is sent from the encoding device, the decoding device shown in Figure 22 can correct the energy level of the particular channel on its own. One feature of this decoding device is that the particular channel energy level processing unit 2212 is configured independently of the multi-channel generation unit 2210. The channel energy level detecting unit 2213 included in the particular channel energy level processing unit 2212 can calculate the energy level of the original audio signal based on the CLD contained in the spatial information and on the down-mix audio signal 2208. The calculated energy level is compared with the energy level of the multi-channel audio signal 2211 input from the multi-channel generation unit 2210.
When the comparison shows that an energy level difference exists, the energy level is corrected in the channel energy level correcting unit 2215. That is, the channel energy level correcting unit 2215 multiplies the energy level of the multi-channel audio signal 2211 by a predetermined particular channel gain to correct the energy level of the multi-channel audio signal 2211. In this case, the correction determining unit 2214 determines that channel energy level correction is required whenever an energy level difference exists. Alternatively, the correction determining unit 2214 can determine that channel energy level correction is required only when the energy level difference exceeds a predetermined threshold.
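The detect, decide and correct steps described for units 2213, 2214 and 2215 can be sketched as three small functions. The energy measure (sum of squared samples), the function names and the numeric values are assumptions for illustration only; in particular, the estimate of the original energy level is taken as a given input rather than derived from the CLD.

```python
def channel_energy(signal):
    """Assumed energy measure: sum of squared samples."""
    return sum(s * s for s in signal)


def needs_correction(original_energy, reproduced_energy, threshold=0.0):
    """Decide whether the energy level difference calls for a correction.

    With threshold=0.0 any difference triggers a correction; a positive
    threshold implements the alternative described in the text.
    """
    return abs(original_energy - reproduced_energy) > threshold


def correct_channel(signal, original_energy, channel_gain, threshold=0.0):
    """Apply the predetermined channel gain only when a correction is needed."""
    if needs_correction(original_energy, channel_energy(signal), threshold):
        return [s * channel_gain for s in signal]
    return list(signal)


# Hypothetical channel whose reproduced energy (2.0) is below the estimate (8.0).
fixed = correct_channel([1.0, 1.0], original_energy=8.0, channel_gain=2.0, threshold=1.0)
# Difference of 0.5 stays under the threshold, so the channel is left as it is.
kept = correct_channel([1.0, 1.0], original_energy=2.5, channel_gain=2.0, threshold=1.0)
```

Keeping the decision separate from the correction mirrors the split between the determining unit and the correcting unit.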
According to the invention, another decoding device can be implemented that is similar to the decoding device shown in Figure 22 but differs in that the channel energy level detecting unit and the correction determining unit are included in the multi-channel generation unit, while the channel energy level correcting unit is configured separately.
According to the invention, yet another decoding device can be implemented that is similar to the decoding device shown in Figure 22 but differs in that the channel energy level detecting unit, the correction determining unit and the channel energy level correcting unit are all included in the multi-channel generation unit. In this case, the energy level can be corrected per channel using functions internal to the multi-channel generation unit. Energy level correction methods that use such internal functions include: adjusting the gain of a filter such as a quadrature mirror filter (QMF) or a compound filter when such a filter is used; adjusting the overall gain; adjusting the values of the pre-matrix or post-matrix; adjusting a function associated with the subband envelope application tool or the temporal envelope application tool; adjusting the gains of the decorrelated signal and the original signal when these signals are summed; or using a dedicated module in place of the above methods. Where decoding is implemented using a QMF or a compound filter, the frequency band characteristics of each channel can be analyzed. Where decoding is implemented using the subband envelope application tool or the temporal envelope application tool, the final signal can give the user a realistic effect.
Figure 23 is a block diagram showing a decoding device for correcting the energy level of a particular channel according to an embodiment of the invention. This decoding device has a configuration similar to that of the decoding device shown in Figure 22. Therefore, the similar parts, including the demultiplexer 2302, the down-mix audio signal decoding unit 2305 and the spatial signal information decoding unit 2303, are not described again in detail. The decoding device of Figure 23 differs from that of Figure 22 in the position of the particular channel energy level processing unit 2308.
Referring to Figure 23, the particular channel energy level processing unit 2308 includes a channel energy level detecting unit 2309, a correction determining unit 2310 and a channel energy level correcting unit 2311. The particular channel energy level processing unit 2308 can correct, per channel, the energy level of the down-mix audio signal 2307, which is in PCM data format.
In detail, assuming that an energy level difference between the original signal and the reproduced signal can be detected by comparing their energy levels, the channel energy level correcting unit 2311 corrects the energy level of the down-mix audio signal 2307 on a per-channel basis.
The particular channel energy level processing unit 2308 sends the down-mix audio signal 2312 to the multi-channel generation unit 2313. The multi-channel generation unit 2313 processes the down-mix audio signal 2312 using the spatial information 2304, which is generated by the decoding operation of the spatial signal information decoding unit 2303 on the spatial signal information, and outputs the result as a multi-channel audio signal 2314 (out2).
Meanwhile, according to the invention, the function of correcting the energy level of a particular channel can be implemented using the bit stream of the associated audio signal. In detail, when the encoding device corrects the energy level of a particular channel and sends information about the correction by including it in the bit stream, the decoding device that receives the bit stream can extract the correction information from the bit stream and restore the energy level of the particular channel based on the extracted correction information. For example, the encoding device defines surround gains having various different values, applies a selected one of these surround gains to the surround channels, and includes in the bit stream information about the applied surround gain, i.e. surround gain information. In this case, the surround gain information can be included in the spatial signal information of the bit stream. The decoding device extracts the surround gain information from the bit stream. Using the extracted information, the decoding device can restore the energy level of the surround channels to the original energy level. A method of inserting the correction information into the bit stream is described in detail below.
First, the spatial signal information is formatted so that it has one header per frame or per plurality of frames. Correction information about the particular channel (for example, surround gain information) is included in the header. Where the spatial signal information has one header per plurality of frames, the header can be included in the spatial signal information periodically or aperiodically.
The bit stream can also include bit information indicating which channel should be amplified or attenuated and by how much (in dB). In this case, the bit stream can include information about whether the energy level of the particular channel is corrected and, when the correction is performed continuously, whether previous data should be used. The bit stream can also include information about which channel is to be corrected. In addition, the bit stream can include information about the attenuation or amplification level (dB) of the channel to be corrected.
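The kinds of information listed here can be pictured as a small record carried in the header. The sketch below is purely illustrative: the field names, their types and the way a decoder might use them are hypothetical, since the text does not define a bit-level syntax. The dB value is assumed to be an amplitude ratio, so the linear gain is 10^(dB/20).

```python
from dataclasses import dataclass


@dataclass
class ChannelCorrectionInfo:
    """Hypothetical container for the correction information in a header."""
    correction_enabled: bool  # whether the channel's energy level is corrected
    reuse_previous: bool      # whether earlier data is reused for a continued correction
    channel: str              # which channel is corrected, e.g. "Ls"
    level_db: float           # amplification (> 0) or attenuation (< 0) in dB


def db_to_linear(level_db):
    """Convert an amplitude level in dB to a linear gain factor."""
    return 10.0 ** (level_db / 20.0)


def apply_correction(signal, info, previous_gain=1.0):
    """Scale `signal` according to `info`; leave it untouched if disabled."""
    if not info.correction_enabled:
        return list(signal)
    gain = previous_gain if info.reuse_previous else db_to_linear(info.level_db)
    return [s * gain for s in signal]


# Hypothetical header entry: amplify the left surround channel by 20 dB.
info = ChannelCorrectionInfo(True, False, "Ls", 20.0)
boosted = apply_correction([1.0, 2.0], info)
```

The `reuse_previous` flag lets a continued correction avoid resending the same level in every header.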
According to the invention, a method can be implemented in which particular channels are grouped and the particular channel gain is adjusted per group. That is, in the encoding device, different channel gains are applied to the particular channels of different groups. After the down-mix operation, the encoding device sends the particular channel gain information by including it in the bit stream generated from the down-mix. The decoding device restores the energy level of the multi-channel audio signal to the original energy level by applying, per group, the inverse of the channel gain used in the encoding device to the multi-channel audio signal.
For example, all channels of the audio signal can be divided into three groups: a first group consisting of the center channel, the front-left channel and the front-right channel; a second group consisting of the rear-left channel and the rear-right channel; and a third group consisting of the LFE channel. In this case, a first particular channel gain adjusting method can be used, in which the particular channel gain is applied to each channel per group and the resulting channels are summed to generate a mono down-mix audio signal. In the decoding device, the mono down-mix audio signal is transformed into multiple channels, and each of these channels is multiplied, per group, by the associated particular channel gain so that it is output after being restored to its original energy level. This multiplication can be carried out after or during the transformation.
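The first method can be sketched with the three groups named above. The channel labels, group names and gain values are hypothetical, and the step that transforms the mono down-mix back into multiple channels is omitted: the sketch only shows that applying the inverse of each group gain on the decoder side undoes the encoder-side scaling.

```python
# Hypothetical labels for the three groups described in the text.
GROUPS = {
    "front": ("C", "Lf", "Rf"),
    "rear": ("Ls", "Rs"),
    "lfe": ("LFE",),
}


def scale_by_group(channels, group_gains, invert=False):
    """Apply one gain per group; with invert=True apply its inverse instead."""
    out = {}
    for group, names in GROUPS.items():
        gain = group_gains[group]
        if invert:
            gain = 1.0 / gain
        for name in names:
            out[name] = [s * gain for s in channels[name]]
    return out


def mono_downmix(channels):
    """Sum all (already scaled) channels sample by sample."""
    return [sum(column) for column in zip(*channels.values())]


channels = {name: [1.0, 2.0] for names in GROUPS.values() for name in names}
gains = {"front": 1.0, "rear": 0.5, "lfe": 0.25}

encoded = scale_by_group(channels, gains)                # encoder side
mono = mono_downmix(encoded)                             # mono down-mix
restored = scale_by_group(encoded, gains, invert=True)   # decoder side
```

Sending one gain per group rather than per channel keeps the side information small while still treating front, rear and LFE channels differently.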
A second particular channel gain adjusting method can also be used. According to the second method, the particular channel gain is applied to each channel per group. After that, the front-left channel and the rear-left channel are summed to generate a left channel, and the front-right channel and the rear-right channel are summed to generate a right channel. Each of the center channel and the LFE channel has the particular channel gain applied and is then multiplied by 1/2^(1/2). The resulting channels are added to the left channel and the right channel respectively to generate a stereo down-mix audio signal. When the stereo down-mix signal generated in this way is decoded to generate the final signal, the particular channel gain is applied per channel. Specifically, the signals extracted from the left channel and the right channel of the down-mix audio signal are multiplied by 2^(1/2) and added to the center channel and the LFE channel. Although embodiments associated with mono or stereo down-mix audio signals have been described, the invention is not limited to these.
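The encoder side of the second method might be sketched as below. The channel labels and gain values are hypothetical, and the sketch assumes that the scaled center and LFE contributions are added to both the left and the right channel, which is one reading of "respectively" in the text.

```python
import math

INV_SQRT2 = 1.0 / math.sqrt(2.0)  # the 1/2^(1/2) factor from the text


def stereo_downmix(ch, g_front, g_rear, g_lfe):
    """Build a stereo down-mix from six channels with per-group gains.

    The center channel belongs to the front group, so it uses g_front.
    """
    left, right = [], []
    for i in range(len(ch["C"])):
        # Center and LFE: group gain first, then the 1/sqrt(2) factor.
        shared = (ch["C"][i] * g_front + ch["LFE"][i] * g_lfe) * INV_SQRT2
        left.append(ch["Lf"][i] * g_front + ch["Ls"][i] * g_rear + shared)
        right.append(ch["Rf"][i] * g_front + ch["Rs"][i] * g_rear + shared)
    return left, right


# Hypothetical one-sample input with all group gains set to 1.0.
ch = {"C": [math.sqrt(2.0)], "LFE": [0.0],
      "Lf": [1.0], "Ls": [1.0], "Rf": [2.0], "Rs": [2.0]}
left, right = stereo_downmix(ch, g_front=1.0, g_rear=1.0, g_lfe=1.0)
```

With these values the center contributes about 1.0 to each side, so `left` is close to 3.0 and `right` is close to 5.0.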
According to the invention, another method can be implemented in which the down-mix audio signal is generated after the particular channel gain has been applied to each channel per group, and the down-mix gain is then applied to the generated down-mix audio signal.
Those skilled in the art will appreciate that various modifications and variations can be made to the invention without departing from its spirit or scope. Therefore, the invention is intended to cover such modifications and variations, provided they fall within the scope of the appended claims and their equivalents.
Industrial applicability
As is apparent from the above description, according to the invention, sound level loss of a multi-channel audio signal can be effectively prevented by applying a down-mix gain to the down-mix audio signal generated by down-mixing the multi-channel audio signal, or by down-mixing the multi-channel audio signal after a down-mix gain has been applied to it.
The sound level loss problem of a multi-channel audio signal can also be prevented by applying an ADG to the down-mix audio signal generated by down-mixing the multi-channel audio signal, or by applying an ADG to the down-mix audio signal after a down-mix gain has been applied to the down-mix audio signal.
In addition, the energy level loss problem of a multi-channel audio signal can be prevented by correcting the energy level of a particular channel of the multi-channel audio signal and down-mixing the corrected multi-channel audio signal to generate a down-mix audio signal.
Claims (19)
1. A method of decoding an audio signal, the method comprising:
separating a down-mix audio signal and spatial signal information from a bit stream of the audio signal;
transforming the down-mix audio signal into a multi-channel audio signal using the spatial signal information; and
applying a particular channel gain to a particular channel of the multi-channel audio signal that exhibits an energy level change, thereby correcting the energy level of that channel.
2. The method of claim 1, further comprising detecting, per channel, at least one of whether and how the energy level of the multi-channel audio signal has changed.
3. The method of claim 2, further comprising determining, based on the degree of change of the energy level, whether to correct the energy level of the channel exhibiting the energy level change.
4. The method of claim 1, further comprising extracting information about the particular channel gain from a header of the spatial signal information included in the bit stream.
5. The method of claim 4, wherein the header is included in the spatial signal information every frame or every plurality of frames.
6. The method of claim 5, wherein the header is included in the spatial signal information periodically or aperiodically every plurality of frames.
7. The method of claim 1, wherein the particular channel gain is applied in the time domain to the channel exhibiting the energy level change.
8. The method of claim 1, wherein applying the particular channel gain comprises:
grouping the particular channels of the multi-channel audio signal; and
applying the particular channel gain to the particular channels per group, thereby correcting the energy level of the particular channels.
9. A method of decoding an audio signal, the method comprising:
separating a down-mix audio signal from a bit stream of the audio signal; and
applying a particular channel gain to a particular channel of the down-mix audio signal that exhibits an energy level change, thereby correcting the energy level of that channel.
10. The method of claim 9, further comprising detecting, per channel, at least one of whether and how the energy level of the down-mix audio signal has changed.
11. The method of claim 10, further comprising determining, based on the degree of change of the energy level, whether to correct the energy level of the channel exhibiting the energy level change.
12. A method of encoding an audio signal, the method comprising:
applying a particular channel gain to a particular channel of a multi-channel audio signal;
generating a down-mix audio signal and spatial signal information from the multi-channel audio signal to which the particular channel gain has been applied; and
generating a bit stream that includes the down-mix audio signal and the spatial signal information.
13. The method of claim 12, wherein applying the particular channel gain comprises:
grouping the particular channels of the multi-channel audio signal; and
applying the particular channel gain to the particular channels per group, thereby correcting the energy level of the particular channels.
14. The method of claim 12, further comprising inserting information about the particular channel gain into the spatial signal information.
15. A method of encoding an audio signal, the method comprising:
generating a down-mix audio signal and spatial signal information from a multi-channel audio signal;
applying a particular channel gain to a particular channel of the down-mix audio signal; and
generating a bit stream that includes the down-mix audio signal to which the particular channel gain has been applied and the spatial signal information.
16. A data structure comprising:
a down-mix audio signal of a multi-channel audio signal; and
information about a particular channel gain applied to a particular channel of the multi-channel audio signal.
17. A device for decoding an audio signal, comprising:
a demultiplexer for separating a down-mix audio signal and spatial signal information from a bit stream of the audio signal;
a multi-channel generation unit that transforms the down-mix audio signal into a multi-channel audio signal using the spatial signal information; and
a particular channel energy level processing unit for applying a particular channel gain to a particular channel of the multi-channel audio signal that exhibits an energy level change, thereby correcting the energy level of that particular channel.
18. The device of claim 17, wherein the particular channel energy level processing unit comprises:
a channel energy level detecting unit for detecting whether and how the energy level of the particular channel exhibiting the energy level change has changed;
a correction determining unit for determining whether to correct the energy level of the channel exhibiting the energy level change; and
a channel energy level correcting unit for applying the particular channel gain to the particular channel exhibiting the energy level change, thereby correcting the energy level of that particular channel.
19. A device for encoding an audio signal, comprising:
a channel energy level processing unit for applying a particular channel gain to a particular channel of a multi-channel audio signal;
a down-mix unit for generating a down-mix audio signal from the multi-channel audio signal to which the particular channel gain has been applied; and
a spatial information generation unit for extracting spatial information from the multi-channel audio signal.
Applications Claiming Priority (52)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US69500705P | 2005-06-30 | 2005-06-30 | |
US60/695,007 | 2005-06-30 | ||
US69585805P | 2005-07-05 | 2005-07-05 | |
US60/695,858 | 2005-07-05 | ||
US74860805P | 2005-12-09 | 2005-12-09 | |
US60/748,608 | 2005-12-09 | ||
US75700406P | 2006-01-09 | 2006-01-09 | |
US60/757,004 | 2006-01-09 | ||
US75823606P | 2006-01-12 | 2006-01-12 | |
US60/758,236 | 2006-01-12 | ||
US75860906P | 2006-01-13 | 2006-01-13 | |
KR10-2006-0004056 | 2006-01-13 | ||
KR20060004065 | 2006-01-13 | ||
KR1020060004056 | 2006-01-13 | ||
KR1020060004065 | 2006-01-13 | ||
KR10-2006-0004055 | 2006-01-13 | ||
KR10-2006-0004065 | 2006-01-13 | ||
US60/758,609 | 2006-01-13 | ||
KR20060004055 | 2006-01-13 | ||
KR20060004056 | 2006-01-13 | ||
KR1020060004055 | 2006-01-13 | ||
US75962306P | 2006-01-18 | 2006-01-18 | |
US60/759,623 | 2006-01-18 | ||
US76035906P | 2006-01-20 | 2006-01-20 | |
US60/760,359 | 2006-01-20 | ||
US77807006P | 2006-03-02 | 2006-03-02 | |
US60/778,070 | 2006-03-02 | ||
KR1020060030653 | 2006-04-04 | ||
KR1020060030671A KR20070003545A (en) | 2005-06-30 | 2006-04-04 | Clipping restoration for multi-channel audio coding |
KR10-2006-0030653 | 2006-04-04 | ||
KR1020060030671 | 2006-04-04 | ||
KR10-2006-0030671 | 2006-04-04 | ||
KR1020060030653A KR20070003544A (en) | 2005-06-30 | 2006-04-04 | Clipping restoration by arbitrary downmix gain |
KR1020060056480A KR20070003574A (en) | 2005-06-30 | 2006-06-22 | Method and apparatus for encoding and decoding an audio signal |
KR10-2006-0056480 | 2006-06-22 | ||
KR1020060056480 | 2006-06-22 | ||
KR10-2006-0058142 | 2006-06-24 | ||
KR1020060058141 | 2006-06-27 | ||
KR10-2006-0058141 | 2006-06-27 | ||
KR10-2006-0058120 | 2006-06-27 | ||
KR1020060058141A KR20070075237A (en) | 2006-01-12 | 2006-06-27 | Encoding and decoding method of multi-channel audio signal |
KR1020060058140 | 2006-06-27 | ||
KR1020060058142 | 2006-06-27 | ||
KR10-2006-0058140 | 2006-06-27 | ||
KR1020060058139 | 2006-06-27 | ||
KR1020060058120A KR20070005477A (en) | 2005-07-05 | 2006-06-27 | Method for compensation of energy levels in specific channel signals for multi-channel audio coding and aparatuses for encoding and deconding multi-channel audio signals performancing the compensation |
KR1020060058142A KR20070076363A (en) | 2006-01-18 | 2006-06-27 | Method of encoding and decoding an audio signal |
KR1020060058120 | 2006-06-27 | ||
KR1020060058139A KR20070003593A (en) | 2005-06-30 | 2006-06-27 | Encoding and decoding method of multi-channel audio signal |
KR1020060058140A KR20070003594A (en) | 2005-06-30 | 2006-06-27 | Method of clipping sound restoration for multi-channel audio signal |
KR10-2006-0058139 | 2006-06-27 | ||
PCT/KR2006/002578 WO2007004829A2 (en) | 2005-06-30 | 2006-06-30 | Apparatus for encoding and decoding audio signal and method thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101243491A true CN101243491A (en) | 2008-08-13 |
CN101243491B CN101243491B (en) | 2014-03-05 |
Family
ID=37870372
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2006800292910A Active CN101297352B (en) | 2005-06-30 | 2006-06-30 | Apparatus for encoding and decoding audio signal and method thereof |
CN2006800294367A Active CN101243490B (en) | 2005-06-30 | 2006-06-30 | Method and apparatus for encoding and decoding an audio signal |
CN200680029607.6A Active CN101243491B (en) | 2005-06-30 | 2006-06-30 | Method and apparatus for encoding and decoding audio signal |
Country Status (2)
Country | Link |
---|---|
KR (2) | KR20070003593A (en) |
CN (3) | CN101297352B (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102484547A (en) * | 2009-09-01 | 2012-05-30 | 松下电器产业株式会社 | Digital broadcasting transmission device, digital broadcasting reception device, digital broadcasting reception system |
US9514768B2 (en) | 2010-08-06 | 2016-12-06 | Samsung Electronics Co., Ltd. | Audio reproducing method, audio reproducing apparatus therefor, and information storage medium |
US8762158B2 (en) | 2010-08-06 | 2014-06-24 | Samsung Electronics Co., Ltd. | Decoding method and decoding apparatus therefor |
US8948403B2 (en) | 2010-08-06 | 2015-02-03 | Samsung Electronics Co., Ltd. | Method of processing signal, encoding apparatus thereof, decoding apparatus thereof, and signal processing system |
US8948406B2 (en) | 2010-08-06 | 2015-02-03 | Samsung Electronics Co., Ltd. | Signal processing method, encoding apparatus using the signal processing method, decoding apparatus using the signal processing method, and information storage medium |
EP2477188A1 (en) * | 2011-01-18 | 2012-07-18 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Encoding and decoding of slot positions of events in an audio signal frame |
JPWO2014007097A1 (en) | 2012-07-02 | 2016-06-02 | ソニー株式会社 | Decoding device and method, encoding device and method, and program |
TWI517142B (en) | 2012-07-02 | 2016-01-11 | Sony Corp | Audio decoding apparatus and method, audio coding apparatus and method, and program |
BR112014004127A2 (en) * | 2012-07-02 | 2017-04-04 | Sony Corp | device and decoding method, program, and, device and encoding method |
EP3198594B1 (en) * | 2014-09-25 | 2018-11-28 | Dolby Laboratories Licensing Corporation | Insertion of sound objects into a downmixed audio signal |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH11282496A (en) * | 1998-03-30 | 1999-10-15 | Matsushita Electric Ind Co Ltd | Decoding device |
JPH11330980A (en) * | 1998-05-13 | 1999-11-30 | Matsushita Electric Ind Co Ltd | Decoding device and method and recording medium recording decoding procedure |
US7536305B2 (en) * | 2002-09-04 | 2009-05-19 | Microsoft Corporation | Mixed lossless audio compression |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102272829A (en) * | 2008-12-29 | 2011-12-07 | 摩托罗拉移动公司 | Method and apparatus for generating an enhancement layer within a multiple-channel audio coding system |
CN102272829B (en) * | 2008-12-29 | 2013-07-31 | 摩托罗拉移动公司 | Method and apparatus for generating an enhancement layer within a multiple-channel audio coding system |
CN103400581A (en) * | 2010-02-18 | 2013-11-20 | 杜比实验室特许公司 | Audio decoding using efficient downmixing and decoding method |
CN103400581B (en) * | 2010-02-18 | 2016-05-11 | 杜比实验室特许公司 | Use efficient lower audio decoder and the coding/decoding method mixing |
CN103201792A (en) * | 2010-11-12 | 2013-07-10 | 杜比实验室特许公司 | Downmix limiting |
CN103201792B (en) * | 2010-11-12 | 2015-09-09 | 杜比实验室特许公司 | Downmix limits |
US9224400B2 (en) | 2010-11-12 | 2015-12-29 | Dolby Laboratories Licensing Corporation | Downmix limiting |
Also Published As
Publication number | Publication date |
---|---|
KR20070003594A (en) | 2007-01-05 |
CN101243490A (en) | 2008-08-13 |
CN101297352A (en) | 2008-10-29 |
CN101243491B (en) | 2014-03-05 |
KR20070003593A (en) | 2007-01-05 |
CN101243490B (en) | 2013-01-09 |
CN101297352B (en) | 2013-02-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101297352B (en) | Apparatus for encoding and decoding audio signal and method thereof | |
CA2613731C (en) | Apparatus for encoding and decoding audio signal and method thereof | |
US8494667B2 (en) | Apparatus for encoding and decoding audio signal and method thereof | |
CN101543098B (en) | decorrelator and method for generation of output signal, and audio decoder for producing multi-channel output signals | |
CN101542597B (en) | Methods and apparatuses for encoding and decoding object-based audio signals | |
CN101479785B (en) | Method for encoding and decoding object-based audio signal and apparatus thereof | |
AU2006301612B2 (en) | Temporal and spatial shaping of multi-channel audio signals | |
EP2109861B1 (en) | Audio decoder | |
US8817992B2 (en) | Multichannel audio coder and decoder | |
CN108307272A (en) | Acoustic signal processing method and equipment | |
CN101868821A (en) | The method and apparatus that is used for processing signals | |
CN101243488B (en) | Apparatus for encoding and decoding audio signal and method thereof | |
RU2393550C2 (en) | Device and method for coding and decoding of sound signal | |
CN104205211A (en) | Multi-channel audio encoder and method for encoding a multi-channel audio signal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||