CN101223598A - Virtual source location information based channel level difference quantization and dequantization method - Google Patents

Virtual source location information based channel level difference quantization and dequantization method Download PDF

Info

Publication number
CN101223598A
CN101223598A CNA2006800259842A CN200680025984A CN101223598A CN 101223598 A CN101223598 A CN 101223598A CN A2006800259842 A CNA2006800259842 A CN A2006800259842A CN 200680025984 A CN200680025984 A CN 200680025984A CN 101223598 A CN101223598 A CN 101223598A
Authority
CN
China
Prior art keywords
cld
quantization
vsli
quantized value
coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2006800259842A
Other languages
Chinese (zh)
Other versions
CN101223598B (en
Inventor
徐廷一
姜京玉
洪镇佑
金光基
白承权
韩敏洙
田相培
成宏模
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Electronics and Telecommunications Research Institute ETRI
Original Assignee
Electronics and Telecommunications Research Institute ETRI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Electronics and Telecommunications Research Institute ETRI filed Critical Electronics and Telecommunications Research Institute ETRI
Priority claimed from PCT/KR2006/002824 external-priority patent/WO2007011157A1/en
Publication of CN101223598A publication Critical patent/CN101223598A/en
Application granted granted Critical
Publication of CN101223598B publication Critical patent/CN101223598B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components

Abstract

Methods for Spatial Audio Coding (SAC) of a multi-channel audio signal and decoding of an audio bitstream generated by the SAC are provided. More particularly, methods of efficient quantization and dequantization of Channel Level Difference (CLD) used as a spatial parameter when SAC -based encoding of a multi-channel audio signal is performed are provided. A method of CLD quantization includes extracting sub-band-specific CLDs from an N-channel audio signal (N>1), and quantizing the CLDs by reference to a Virtual Source Location Information (VSLI)-based CLD quantization table designed using CLD quantization values derived from VSLI quantization values of the N-channel audio signal.

Description

Channel grade residual quantity reconciliation quantization method based on virtual source location information
Technical field
The decoding of the audio bitstream that the present invention relates to the spatial audio coding (SAC) of multi-channel audio signal and generate by SAC, and relate more specifically to when carry out multi-channel audio signal based on the coding of SAC the time, as the effective quantification and the de-quantization of the channel grade poor (CLD) of spatial parameter.
Background technology
Spatial audio coding (SAC) is to be used for compressing effectively multi-channel audio signal to keep technology with the compatibility of existing stereo audio system simultaneously.In Motion Picture Experts Group (MPEG), the SAC technology from 2002 by standardization and called after " around MPEG ", and, describe in detail among the ISO/IEC CD 14996-x (on February 18th, 2005 published, and after this be called " SAC normative document ") at ISO/IEC work document.
Particularly, the SAC method is to be used for by using down mixed frequency signal and N multi-channel audio signal of one group of auxiliary space parameter coding (N>2) to improve the coding method of transfer efficiency, wherein descend mixed frequency signal to be mixed down monophony or stereo, the human consciousness characteristic of auxiliary space parametric representation multi-channel audio signal.Spatial parameter can comprise expression according to T/F the channel grade of the grade difference between two passages poor (CLD), expression according to correlativity or coherence's the interchannel correlativity/coherence (ICC) of T/F between two passages, make can be by prediction from the passage predictive coefficient (COC) of two channel reproduction third channels etc.
CLD is the core element that recovers the power gain of each passage, and is extracted in every way in the processing of SAC coding.Shown in Figure 1A, based on a reference channel, CLD expresses by each power ratio of reference channel and other passages.For example,, can obtain five power ratios based on a reference channel so if six channel signal L, R, C, LFE, Ls and Rs are arranged, and CLD1 to the CLD5 correspondence by denary logarithm being applied to each grade that obtains of five power ratios.
Simultaneously, shown in Figure 1B, it is right that hyperchannel is divided into a plurality of passages, and each passage is to based on stereo analyzed, and in each analytical procedure, extract a CLD value.This progressively use by a plurality of a pair of two (OTT) module is carried out, and these a pair of two modules are got two input channels to an output channel.In each OTT, any one input stereo audio signal is identified as reference channel, and the denary logarithm value of the power ratio of reference channel and other passages is used as the output of CLD value.
The CLD value has the dynamic range from-∞ to+∞.Therefore, in order to express the CLD value, need useful quantitative with limited figure place.Typically, CLD quantizes by using normalized quantization table to carry out.An example of such quantization table provides in SAC normative document (seeing the 41st page, table 57).By this way, because only can not express all CLD values, so the dynamic range of CLD value is limited to predetermined grade or still less with limited figure place.Thus, introduce quantization error, and therefore spectrum information distortion.For example, when using 5 to carry out CLD when quantizing, the dynamic range of CLD value will be limited to-and 25dB arrives+scope between the 25dB.
Summary of the invention
The technical matters that solves
The present invention is directed to channel grade poor (CLD) and quantize conciliates quantization method, its can minimize multi-channel audio signal based on the sound deterioration in the cataloged procedure of spatial audio coding (SAC).
The present invention also is directed to CLD and quantize conciliates quantization method, and it can be in the process based on the SAC coding of multi-channel audio signal, uses the advantage of the quantification of the virtual source location information (VSLI) that available CLD replaces to minimize the sound deterioration.
In addition, the present invention is directed to by providing the CLD quantization table improvement sound quality based on VSLI not have extra complexity, and this table can be replaced by the CLD quantification that is used for SAC system of Motion Picture Experts Group (MPEG)-4 and the CLD quantization table of de-quantization.
Technical scheme
A first aspect of the present invention provides a kind of method, be used for when carry out N channel audio signal (N>1) based on the coding of spatial audio coding (SAC) time, quantize channel grade poor (CLD) parameter as spatial parameter.This CLD quantization method comprises the steps: to extract from the N channel audio signal CLD of each frequency band; And by with reference to quantizing CLD based on the CLD quantization table of virtual source location information (VSLI), this quantization table is to use the CLD quantized value design that obtains from the VSLI quantized value of N channel audio signal.
A second aspect of the present invention provides a kind of computer readable recording medium storing program for performing, records the computer program that is used to carry out the CLD quantization method on it.
A third aspect of the present invention provides a kind of method based on spatial audio coding (SAC) coding N channel audio signal (N>1).This method comprises the steps: down mixing and coding N channel audio signal; Extract the spatial parameter that comprises channel grade poor (CLD), interchannel correlativity/coherence (ICC) and passage predictive coefficient (CPC) for each frequency band from the N channel audio signal; And the spatial parameter that quantizes extraction.In the step of the spatial parameter that quantize to extract, by quantizing CLD with reference to the CLD quantization table based on VSLI, this quantization table is to use the CLD quantized value that obtains from the VSLI quantized value of N channel audio signal to design.
A fourth aspect of the present invention provides a kind of device based on spatial audio coding (SAC) coding N channel audio signal (N>1).This device comprises: the SAC code device, be used for down mixing N channel audio signal generating down mixed frequency signal, and comprise the spatial parameter of channel grade poor (CLD), interchannel correlativity/coherence (ICC) and passage predictive coefficient (CPC) from the N channel audio signal for each frequency band extraction; Audio coding apparatus is used for generating the audio bitstream that compresses from the following mixed frequency signal that is generated by the SAC code device; The spatial parameter quantization device is used to quantize the spatial parameter that is extracted by the SAC code device; And the spatial parameter code device, be used for the spatial parameter grade of coded quantization.The spatial parameter quantization device is by quantizing CLD with reference to the CLD quantization table based on virtual source location information (VSLI), and this quantization table is to use the CLD quantized value that obtains from the VSLI quantized value of N channel audio signal to design.
A fifth aspect of the present invention provides a kind of method, is used for when based on spatial audio coding (SAC) the N channel audio bit stream (N>1) of coding being decoded channel grade poor (CLD) quantized value of de-quantization coding.This CLD de-quantization method comprises the steps: the CLD quantized value of coding is carried out Hofmann decoding; And by using the CLD quantized value based on the CLD quantization table de-quantization decoding of virtual source location information (VSLI), this quantization table is to use the CLD quantized value design that draws from the VSLI quantized value of N channel audio signal.
A sixth aspect of the present invention provides a kind of computer readable recording medium storing program for performing, records the computer program that is used to carry out CLD de-quantization method on it.
It is a kind of based on the method for spatial audio coding (SAC) to N channel audio bit stream (N>1) decoding of coding that a seventh aspect of the present invention provides.This method comprises the steps: the N channel audio bit stream of decoding and coding; The quantized value of at least one spatial parameter that de-quantization receives with the N channel audio bit stream of coding; And based on the synthetic N channel audio bit stream of decoding of the spatial parameter of de-quantization, to recover the N channel audio signal.In the step of the quantized value of at least one spatial parameter of de-quantization, by with reference to coming de-quantization to be included in channel grade poor (CLD) in the spatial parameter based on the CLD quantization table of virtual source location information (VSLI), this quantization table is to use the CLD quantized value design that obtains from the VSLI quantized value of N channel audio signal.
It is a kind of based on the device of spatial audio coding (SAC) to N channel audio bit stream (N>1) decoding of coding that a eighth aspect of the present invention provides.This device comprises: the device that is used for the N channel audio bit stream of decoding and coding; The device of quantized value of at least one spatial parameter of receiving with the N channel audio bit stream of encoding is used to decode; The device that is used for the quantized value of de-quantization spatial parameter; And be used for based on the synthetic N channel audio bit stream of decoding of the spatial parameter of de-quantization to recover the device of N channel audio signal.The device that is used for the quantized value of de-quantization spatial parameter passes through with reference to the CLD quantization table based on virtual source location information (VSLI), de-quantization is included in the channel grade poor (CLD) in the spatial parameter, and this quantization table is to use the CLD quantized value design that obtains from the VSLI quantized value of N channel audio signal.
Beneficial effect
The CLD quantization table of creating according to the present invention based on VSLI can substitute the CLD quantization table that uses in existing SAC system.By using, can prevent the sound deterioration as much as possible according to the CLD quantization table based on VSLI of the present invention.In addition, by in compression CLD index, using the Huffman code book that proposes in the present invention, can reduce the required bit rate of transmission CLD.
Description of drawings
The conceptive process of from multi channel signals, extracting channel grade poor (CLD) value that illustrates of Figure 1A and Figure 1B;
Fig. 2 schematically illustrates the configuration of having used spatial audio coding of the present invention (SAC) system;
Fig. 3 A and Fig. 3 B are the views of notion that is used to explain the VSLI of the reference that quantizes as CLD according to the present invention; And
Fig. 4 is the figure that illustrates from the CLD quantized value of VSLI quantized value conversion according to the present invention.
Embodiment
Below, will describe exemplary embodiment of the present invention in detail.Yet, the exemplary embodiment that the invention is not restricted to describe below, and can realize with many forms.Therefore, provide these exemplary embodiments, be used for complete open the present invention, and fully pass on scope of the present invention those of ordinary skills.
Fig. 2 schematically illustrates the configuration of using spatial audio coding of the present invention (SAC) system.As shown, the SAC system can be divided into from the N channel audio signal generate, coding and transmission mixed frequency signal and the coded portion of spatial parameter and the decoded portion of recovering the N channel audio signal from following mixed frequency signal and spatial parameter down by the coded portion transmission.Coded portion comprises SAC scrambler 210, audio coder 220, spatial parameter quantizer 230 and spatial parameter scrambler 240.Decoded portion comprises audio decoder 250, spatial parameter demoder 260, spatial parameter de-quantizer 270 and SAC demoder 280.
SAC scrambler 210 generates mixed frequency signal down from the N channel audio signal of input, and analyze the spatial character of N channel audio signal, extract spatial parameter thus, as channel grade poor (CLD), interchannel correlativity/coherence (ICC) and passage predictive coefficient (CPC).
Particularly, N (N>1) multi channel signals that is input to SAC scrambler 210 is broken down into each frequency band by analysis filterbank.In order to separate the signal into each sub-band of frequency domain with low complex degree, use quadrature mirror filter (QMF).The spatial character relevant with spatial perception be from the sub-band signal analysis, and extract spatial parameter as CLD, ICC and CPC selectively according to the encoding operation pattern.In addition, sub-band signal is by the synthetic following mixed frequency signal of organizing the following mixing of quilt and being converted to time domain of QMF.
As an alternative, following mixed frequency signal can be replaced by the following mixed frequency signal that produces in advance by acoustic engineers (or following mixed frequency signal of art/manual mixing).At this moment, SAC scrambler 210 is based on the following mixed frequency signal adjustment and the transmission space parameter that produce in advance, and the hyperchannel of optimizing thus at the demoder place recovers.
Audio coder 220 is by (for example using existing audio compression techniques, the algorithm coding (BSAC) that Motion Picture Experts Group (MPEG)-4, Advanced Audio Coding (AAC), the efficient Advanced Audio Coding of MPEG-4 (HE-AAC), MPEG-4 position are cut apart etc.), compression generates the audio bitstream of compression thus by following mixed frequency signal or artistic mixed frequency signal down that SAC scrambler 210 generates.
Simultaneously, the spatial parameter that is generated by SAC scrambler 210 transmits after by spatial parameter quantizer 230 and 240 quantifications of spatial parameter scrambler and coding.Spatial parameter quantizer 230 provides quantization table, and it is used to quantize each CLD, ICC and CPC.As described below, in order to minimize the sound deterioration that causes by using existing normalization CLD quantization table to quantize CLD, can in spatial parameter quantizer 230, use CLD quantization table based on virtual source location information (VSLI).
Spatial parameter scrambler 240 is carried out entropy coding, so that the spatial parameter that compression is quantized by spatial parameter quantizer 230, and preferably use the Huffman code book that the quantization index of spatial parameter is carried out huffman coding.As described below, the present invention proposes new Huffman code book, so that the transfer efficiency of maximization CLD quantization index.
Audio decoder 250 decodings are by the audio bitstream of existing audio compression techniques (for example, MPEG-4, AAC, MPEG-4HE-AAC, MPEG-4BSAC etc.) compression.
Spatial parameter demoder 260 and spatial parameter de-quantizer 270 are the modules that are used to carry out the reverse operating of the quantification carried out by spatial parameter quantizer 230 and spatial parameter scrambler 240 and coding.Spatial parameter demoder 260 is based on the quantization index of the coding of parameter between Huffman code book decode empty, and spatial parameter de-quantizer 270 is from the spatial parameter of quantization table acquisition corresponding to quantization index.With the quantification of spatial parameter with encode similarly, CLD quantization table and the Huffman code book based on the VSLI that propose among the present invention are used for the decoding of spatial parameter and the processing of de-quantization.
SAC demoder 280 by the audio bitstream of audio decoder 250 decodings and the spatial parameter that is obtained by spatial parameter de-quantizer 270, recovers the N multi-channel audio signal by synthetic.As an alternative, when can not decoding during multi-channel audio signal,, make that it is possible independently serving by using the existing audio decoder mixed frequency signal of only can decoding down.Therefore, the SAC system can provide the compatibility with existing monophony or stereo audio coding system.
The present invention relates to provide CLD to quantize, the quantification advantage of the VSLI of the space audio image that it can be by utilizing the expression multi-channel audio signal minimizes the sound deterioration that produces from quantizing.The present invention is based on such fact: when the orientation angles of expression of space AV, people's ear is difficult to discern 3 ° or littler error.The VSLI that expresses with orientation angles has 90 ° limited dynamic range, makes the quantization error that can avoid when quantizing the restriction by dynamic range to cause.When the quantification advantage based on VSLI designs the CLD quantization table, can minimize the sound deterioration that produces from quantizing.
Fig. 3 A and Fig. 3 B are used to explain the view of notion that quantizes the VSLI of reference as CLD according to the present invention.Fig. 3 A diagram wherein two loudspeakers is positioned at the boombox environment of 60 ° angle, and Fig. 3 B is that stereo audio signal in the boombox environment of wherein Fig. 3 A is by the power of mixed frequency signal and the view that VSLI represents down.As shown, stereo or multi-channel audio signal can be by the amplitude vector and the VSLI of mixing sound signal represent that the latter can obtain by each channel power of analyzing multi-channel audio signal down.Biao Shi multi-channel audio signal can recover by the position vector projection amplitude vector according to sound source by this way.
As shown in Fig. 3 A and Fig. 3 B, the signal power of supposing left-hand loudspeaker is P L, the signal power of right-hand loudspeaker is P R, and the angle of left-hand loudspeaker and right-hand loudspeaker is respectively A LAnd A R, then the VSLI of sound source can draw by equation 1 and 2.
Equation 1
θ = tan - 1 P R P L
Equation 2
VSLI=θ×(A R-A L)/90+A L
The VSLI of Ji Suaning has A by this way LAnd A RBetween value.P LAnd P RCan followingly recover: at first, as equation 3, use firm power to shake (CPP, constant powerpanning) rule VSLI is mapped to value VSLI ' between 0 ° and 90 ° from VSLI.
Equation 3
VSLI ′ = VSLI - A L A R - A L × 90
The VSLI ' by using mapping by this way and the power P of following mixed frequency signal D, utilize equation 4 and 5 to calculate P LAnd P R
Equation 4
P L=P D×(cos(VSLI′)) 2
Equation 5
P R=P D×(sin(VSLI′) 2
As previously mentioned, theme of the present invention relates to the quantification that the quantification advantage of VSLI is applied to spatial parameter CLD.In the boombox environment of Fig. 3 A, CLD can express as equation 6.
Equation 6
CLD = 10 lo g 10 P R P L
CLD can draw from VSLI according to equation 7.
Equation 7
CLD = 10 log 10 ( sin ( VSLI ′ ) ) 2 ( cos ( VSLI ′ ) ) 2 = 20 log 10 ( tan ( VSLI ′ ) )
= 20 log 10 ( tan ( VSLI - A L A R - A L × 90 ) )
In addition, such as following equation 8 definition, CLD can natural logarithm rather than denary logarithm by getting VSLI obtain.
Equation 8
CLD = 20 log e ( tan ( VSLI ′ ) ) = 20 log e ( tan ( VSLI - A L A R - A L × 90 ) )
The CLD values that obtain by equation 7 and 8 can directly be used as the spatial parameter of general SAC system.
As previously mentioned, because CLD have-∞ is to the dynamic range the between+∞, so use limited figure place to quantize to go wrong.Subject matter is the quantization error that the restriction by dynamic range causes.Because can not only express all dynamic ranges of CLD, so the dynamic range of CLD is limited in predetermine level or lower with limited figure place.As a result, introduce quantization error, and the spectrum information distortion.If use 5 to be used for CLD and to quantize, then the dynamic range of CLD be restricted to-25dB is between+the 25dB.
On the contrary, because VSLI has 90 ° limited dynamic range, so this quantization error that can avoid when quantizing the restriction by dynamic range to cause.
In one embodiment, when VSLI quantizes,, and use linear quantizer if use 5 to be used for CLD and to quantize, then the quantification gradation number be 31 and quantized interval be 3 °.The validity of VSLI quantization method can be verified from the following fact: when discerning the spatial image of sound signal, people can not discern 3 ° or littler difference.
The CLD that the advantage that this VSLI quantizes is applied to stereo encoding method quantizes, and the CLD quantization table that uses in the existing SAC system can be replaced by the quantization table based on VSLI.
In one embodiment, provided among Fig. 1 with 3 ° quantized interval and carried out the quantized value of the VSLI that 5 bit linear quantize and corresponding to the CLD conversion level of VSLI quantized value.
Table 1VSLI quantized value and CLD value
Index The VSLI quantized value The CLD value Index The VSLI quantized value The CLD value
-15 0 -324.2604 1 48 0.9113
-14 3 -25.6121 2 51 1.8326
-13 6 -19.5676 3 54 2.7748
-12 9 -16.0057 4 57 3.7497
-11 12 -13.4505 5 60 4.7712
-10 15 -11.4390 6 63 5.8567
-9 18 -9.7645 7 66 7.0283
-8 21 -8.3165 8 69 8.3165
-7 24 -7.0283 9 72 9.7645
-6 27 -5.8567 10 75 11.4390
-5 30 -4.7712 11 78 13.4505
-4 33 -3.7497 12 81 16.0057
-3 36 -2.7748 13 84 19.5676
-2 39 -1.8326 14 87 25.6121
-1 42 -0.9113 15 90 324.2604
0 45 0.000
In addition, the VSLI judgement grade that quantizes by the judgement of the intermediate value between adjacent quantized value VSLI.The judgement grade that intermediate value is converted into CLD and quantizes as CLD.Quantize the judgement grade based on the CLD of VSLI and have value beyond the intermediate value between the adjacent quantized value as shown in table 2, unlike common CLD quantizes, adjudicate grade and have intermediate value between the adjacent quantized value.
Fig. 4 is the figure that illustrates from the CLD quantized value of VSLI quantized value conversion according to the present invention.As shown, when when quantizing VSLI based on 45 ° unified angle, the judgement grade between the angle of quantification is two intermediate values between the angle.Yet, when this VSLI judgement grade is converted into the CLD value, can find that VSLI judgement grade has two intermediate values between adjacent C LD value value in addition.Following table 2 has been listed judgement grade and corresponding C LD value that VSLI quantizes.
Table 2
VSLI VSLI adjudicates grade The CLD value VSLI VSLI adjudicates grade The CLD value
0 -- -- 45 -- --
1.5 -31.6386 46.5 0.4550
3 48
4.5 -22.0803 49.5 1.3700
6 51
7.5 -17.6114 52.5 2.3004
9 54
10.5 -14.6407 55.5 3.2573
12 57
13.5 -12.3929 58.5 4.2536
15 60
16.5 -10.5679 61.5 5.3047
18 63
19.5 -9.0170 64.5 6.4301
21 66
22.5 -7.6555 67.5 7.6555
24 69
25.5 -6.4301 70.5 9.0170
27 72
28.5 -5.3047 73.5 10.5679
30 75
31.5 -4.2536 76.5 12.3929
33 78
34.5 -3.2573 79.5 14.6407
36 81
37.5 -2.3004 82.5 17.6114
39 84
40.5 -1.3700 85.5 22.0803
42 87
43.5 -0.4550 88.5 31.6386
45 90
-- -- -- --
Following table 3-7 is the CLD quantization table of creating by use table 1 and table 2 based on VSLI, and wherein table 3 has provided time the CLD quantized value of the 4th decimal place, and table 4 is time to the 3rd decimal place, and table 5 is time to second decimal place, and table 6 is time to first decimal place, and table 7 is to integer.
Use the CLD quantized value of VSLI can be by getting denary logarithm or natural logarithm is calculated.When taking from right logarithm, when spectrum information recovers by using the CLD value, use e rather than 10 end of as.
Table 3 is based on the CLD quantization table (the 4th decimal place) of VSLI
Index CLD Index CLD
Denary logarithm Natural logarithm Denary logarithm Natural logarithm
-15 -65.1400 -150.0000 1 0.9113 2.0982
-14 -25.6121 -58.9740 2 1.8326 4.2198
-13 -19.5676 -45.0561 3 2.7748 6.3892
-12 -16.0057 -36.8546 4 3.7497 8.6339
-11 -13.4505 -30.9709 5 4.7712 10.9861
-10 -11.4390 -26.3392 6 5.8567 13.4855
-9 -9.7645 -22.4835 7 7.0283 16.1833
-8 -8.3165 -19.1493 8 8.3165 19.1493
-7 -7.0283 -16.1833 9 9.7645 22.4835
-6 -5.8567 -13.4855 10 11.4390 26.3392
-5 -4.7712 -10.9861 11 13.4505 30.9709
-4 -3.7497 -8.6339 12 16.0057 36.8546
-3 -2.7748 -6.3892 13 19.5676 45.0561
-2 -1.8326 -4.2198 14 25.6121 58.9740
-1 -0.9113 -2.0982 15 65.1400 150.0000
0 0.0000 0.0000
Table 4 is based on the CLD quantization table (the 3rd decimal place) of VSLI
Index CLD Index CLD
Denary logarithm Natural logarithm Denary logarithm Natural logarithm
-15 -65.140 -150.000 1 0.911 2.098
-14 -25.612 -58.974 2 1.832 4.219
-13 -19.567 -45.056 3 2.774 6.389
-12 -16.005 -36.854 4 3.749 8.633
-11 -13.450 -30.970 5 4.771 10.986
-10 -11.439 -26.339 6 5.856 13.485
-9 -9.764 -22.483 7 7.028 16.183
-8 -8.316 -19.149 8 8.316 19.149
-7 -7.028 -16.183 9 9.764 22.483
-6 -5.856 -13.485 10 11.439 26.339
-5 -4.771 -10.986 11 13.450 30.970
-4 -3.749 -8.633 12 16.005 36.854
-3 -2.774 -6.389 13 19.567 45.056
-2 -1.832 -4.219 14 25.612 58.974
-1 -0.911 -2.098 15 65.140 150.000
0 0.000 0.000
Table 5 is based on the CLD quantization table (second decimal place) of VSLI
Index CLD Index CLD
Denary logarithm Natural logarithm Denary logarithm Natural logarithm
-15 -65.14 -150.00 1 0.91 2.09
-14 -25.61 -58.97 2 1.83 4.21
-13 -19.56 -45.05 3 2.77 6.38
-12 -16.00 -36.85 4 3.74 8.63
-11 -13.45 -30.97 5 4.77 10.98
-10 -11.43 -26.33 6 5.85 13.48
-9 -9.76 -22.48 7 7.02 16.18
-8 -8.31 -19.14 8 8.31 19.14
-7 -7.02 -16.18 9 9.76 22.48
-6 -5.85 -13.48 10 11.43 26.33
-5 -4.77 -10.98 11 13.45 30.97
-4 -3.74 -8.63 12 16.00 36.85
-3 -2.77 -6.38 13 19.56 45.05
-2 -1.83 -4.21 14 25.61 58.97
-1 -0.91 -2.09 15 65.14 150.00
0 0.00 0.00
Table 6 is based on the CLD quantization table (first decimal place) of VSLI
Index CLD Index CLD
Denary logarithm Natural logarithm Denary logarithm Natural logarithm
-15 -65.1 -150.0 1 0.9 2.0
-14 -25.6 -58.9 2 1.8 4.2
-13 -19.5 -45.0 3 2.7 6.3
-12 -16.0 -36.8 4 3.7 8.6
-11 -13.4 -30.9 5 4.7 10.9
-10 -11.4 -26.3 6 5.8 13.4
-9 -9.7 -22.4 7 7.0 16.1
-8 -8.3 -19.1 8 8.3 19.1
-7 -7.0 -16.1 9 9.7 22.4
-6 -5.8 -13.4 10 11.4 26.3
-5 -4.7 -10.9 11 13.4 30.9
-4 -3.7 -8.6 12 16.0 36.8
-3 -2.7 -6.3 13 19.5 45.0
-2 -1.8 -4.2 14 25.6 58.9
-1 -0.9 -2.0 15 65.1 150.0
0 0.0 0.0
Table 7 is based on the CLD quantization table (integer) of VSLI
Index CLD Index CLD
Denary logarithm Natural logarithm Denary logarithm Natural logarithm
-15 -65 -150 1 0 2
-14 -25 -58 2 1 4
-13 -19 -45 3 2 6
-12 -16 -36 4 3 8
-11 -13 -30 5 4 10
-10 -11 -26 6 5 13
-9 -9 -22 7 7 16
-8 -8 -19 8 8 19
-7 -7 -16 9 9 22
-6 -5 -13 10 11 26
-5 -4 -10 11 13 30
-4 -3 -8 12 16 36
-3 -2 -6 13 19 45
-2 -1 -4 14 25 58
-1 -0 -2 15 65 150
0 0 0
Next, in table 8, table 9, table 10, table 11 and table 12, provide judgement grade based on the CLD quantization table of VSLI by decimal place classification.
Table 8 quantizes judgement grade (the 4th decimal place) based on the CLD of VSLI
Denary logarithm Natural logarithm
CLD The judgement grade CLD The judgement grade CLD The judgement grade CLD The judgement grade
-65.140 -- 0.0000 -- -150.000 -- 0.0000 --
-31.638 0.4550 -72.850 1.0477
-25.612 0.9113 -58.9740 2.0982
-22.080 1.3700 -50.841 3.1546
-19.567 1.8326 -45.0561 4.2198
-17.611 2.3004 -40.551 5.2968
-16.005 2.7748 -36.8546 6.3892
-14.640 3.2573 -33.711 7.5002
-13.450 3.7497 -30.9709 8.6339
-12.392 4.2536 -28.535 9.7943
-11.439 4.7712 -26.3392 10.9861
-10.567 5.3047 -24.333 12.2146
-9.7645 5.8567 -22.4835 13.4855
-9.0170 6.4301 -20.762 14.8058
-8.3165 7.0283 -19.1493 16.1833
-7.6555 7.6555 -17.627 17.6275
-7.0283 8.3165 -16.1833 19.1493
-6.4301 9.0170 -14.805 20.7625
-5.8567 9.7645 -13.4855 22.4835
-5.3047 10.567 -12.214 24.3335
-4.7712 11.439 -10.9861 26.3392
-4.2536 12.392 -9.7943 28.5358
-3.7497 13.450 -8.6339 30.9709
-3.2573 14.640 -7.5002 33.7114
-2.7748 16.005 -6.3892 36.8546
-2.3004 17.611 -5.2968 40.5518
-1.8326 19.567 -4.2198 45.0561
-1.3700 22.080 -3.1546 50.8418
-0.9113 25.612 -2.0982 58.9740
-0.4550 31.638 -1.0477 72.8507
0.0000 65.140 0.0000 150.0000
-- -- -- --
Table 9 quantizes judgement grade (the 3rd decimal place) based on the CLD of VSLI
Denary logarithm Natural logarithm
CLD The judgement grade CLD The judgement grade CLD The judgement grade CLD The judgement grade
-65.140 -- 0.000 -- -150.000 -- 0.000 --
-31.638 0.455 -72.850 1.047
-25.612 0.911 -58.974 2.098
-22.080 1.370 -50.841 3.154
-19.567 1.832 -45.056 4.219
-17.611 2.300 -40.551 5.296
-16.005 2.774 -36.854 6.389
-14.640 3.257 -33.711 7.500
-13.450 3.749 -30.970 8.633
-12.392 4.253 -28.535 9.794
-11.439 4.771 -26.339 10.986
-10.567 5.304 -24.333 12.214
-9.764 5.856 -22.483 13.485
-9.017 6.430 -20.762 14.805
-8.316 7.028 -19.149 16.183
-7.655 7.655 -17.627 17.627
-7.028 8.316 -16.183 19.149
-6.430 9.017 -14.805 20.762
-5.856 9.764 -13.485 22.483
-5.304 10.567 -12.214 24.333
-4.771 11.439 -10.986 26.339
-4.253 12.392 -9.794 28.535
-3.749 13.450 -8.633 30.970
-3.257 14.640 -7.500 33.711
-2.774 16.005 -6.389 36.854
-2.300 17.611 -5.296 40.551
-1.832 19.567 -4.219 45.056
-1.370 22.080 -3.154 50.841
-0.911 25.612 -2.098 58.974
-0.455 31.638 -1.047 72.850
0.000 65.140 0.000 150.000
-- -- -- --
Table 10 quantizes judgement grade (second decimal place) based on the CLD of VSLI
Denary logarithm Natural logarithm
CLD The judgement grade CLD The judgement grade CLD The judgement grade CLD The judgement grade
-65.14 -- 0.00 -- -150.00 -- 0.00 --
-31.63 0.45 -72.85 1.04
-25.61 0.91 -58.97 2.09
-22.08 1.37 -50.84 3.15
-19.56 1.83 -45.05 4.21
-17.61 2.30 -40.55 5.29
-16.00 2.77 -36.85 6.38
-14.64 3.25 -33.71 7.50
-13.45 3.74 -30.97 8.63
-12.39 4.25 -28.53 9.79
-11.43 4.77 -26.33 10.98
-10.56 5.30 -24.33 12.21
-9.76 5.85 -22.48 13.48
-9.01 6.43 -20.76 14.80
-8.31 7.02 -19.14 16.18
-7.65 7.65 -17.62 17.62
-7.02 8.31 -16.18 19.14
-6.43 9.01 -14.80 20.76
-5.85 9.76 -13.48 22.48
-5.30 10.56 -12.21 24.33
-4.77 11.43 -10.98 26.33
-4.25 12.39 -9.79 28.53
-3.74 13.45 -8.63 30.97
-3.25 14.64 -7.50 33.71
-2.77 16.00 -6.38 36.85
-2.30 17.61 -5.29 40.55
-1.83 19.56 -4.21 45.05
-1.37 22.08 -3.15 50.84
-0.91 25.61 -2.09 58.97
-0.45 31.63 -1.04 72.85
0.00 65.14 0.00 150.00
-- -- -- --
Table 11 quantizes judgement grade (first decimal place) based on the CLD of VSLI
Denary logarithm Natural logarithm
CLD The judgement grade CLD The judgement grade CLD The judgement grade CLD The judgement grade
-65.1 -- 0.0 -- -150.0 -- 0.0 --
-31.6 0.4 -72.8 1.0
-25.6 0.9 -58.9 2.0
-22.0 1.3 -50.8 3.1
-19.5 1.8 -45.0 4.2
-17.6 2.3 -40.5 5.2
-16.0 2.7 -36.8 6.3
-14.6 3.2 -33.7 7.5
-13.4 3.7 -30.9 8.6
-12.3 4.2 -28.5 9.7
-11.4 4.7 -26.3 10.9
-10.5 5.3 -24.3 12.2
-9.7 5.8 -22.4 13.4
-9.0 6.4 -20.7 14.8
-8.3 7.0 -19.1 16.1
-7.6 7.6 -17.6 17.6
-7.0 8.3 -16.1 19.1
-6.4 9.0 -14.8 20.7
-5.8 9.7 -13.4 22.4
-5.3 10.5 -12.2 24.3
-4.7 11.4 -10.9 26.3
-4.2 12.3 -9.7 28.5
-3.7 13.4 -8.6 30.9
-3.2 14.6 -7.5 33.7
-2.7 16.0 -6.3 36.8
-2.3 17.6 -5.2 40.5
-1.8 19.5 -4.2 45.0
-1.3 22.0 -3.1 50.8
-0.9 25.6 -2.0 58.9
-0.4 31.6 -1.0 72.8
0.0 65.1 0.0 150.0
-- -- -- --
Table 12 quantizes judgement grade (integer) based on the CLD of VSLI
Denary logarithm Natural logarithm
CLD The judgement grade CLD The judgement grade CLD The judgement grade CLD The judgement grade
-65 -- 0 -- -150 -- 0 --
-31 0 -72 1
-25 0 -58 2
-22 1 -50 3
-19 1 -45 4
-17 2 -40 5
-16 2 -36 6
-14 3 -33 7
-13 3 -30 8
-12 4 -28 9
-11 4 -26 10
-10 5 -24 12
-9 5 -22 13
-9 6 -20 14
-8 7 -19 16
-7 7 -17 17
-7 8 -16 19
-6 9 -14 20
-5 9 -13 22
-5 10 -12 24
-4 11 -10 26
-4 12 -9 28
-3 13 -8 30
-3 14 -7 33
-2 16 -6 36
-2 17 -5 40
-1 19 -4 45
-1 22 -3 50
-0 25 -2 58
-0 31 -1 72
0 65 0 150
-- -- -- --
Shown in table 7 and table 12, when when getting denary logarithm CLD quantized value and CLD are quantized the judgement grade and be expressed as integer, can see, exist number of C LD quantized value and number of C LD to quantize to adjudicate the problem that grade equates.Therefore, use the CLD quantized value of natural logarithm and judgement grade to be preferably used for actual quantization.In other words, when intention use be expressed as integer quantize the judgement grade based on the CLD quantization table of VSLI with based on the CLD of VSLI the time, obtain the CLD quantized value by natural logarithm rather than the denary logarithm of getting VSLI.
That creates by this way adopts in the spatial parameter quantizer 230 of SAC system shown in Figure 2 and spatial parameter de-quantizer 270 based on the CLD quantization table of VSLI, makes to minimize the sound deterioration that is produced by the CLD quantization error.
In addition, the present invention proposes the Huffman code book, it can optimize the huffman coding according to the above-mentioned CLD quantization index that obtains based on the CLD quantization table of VSLI.
In the SAC system, multi-channel audio signal is processed behind the sub-band that is divided into frequency domain by bank of filters.When multi-channel audio signal was divided into 20 sub-frequency bands, Differential video coding method was applied to the quantization index of each sub-band, thus quantization index was categorized as the quantization index of first sub-band and the differential indices between other 19 adjacent sub-bands.As an alternative, they can be divided into the differential indices between the consecutive frame.Each class calculating probability to the three class index divided by this way distributes, and then the huffman coding method is applied to each class of three class index.Thus, can obtain the Huffman code book of description in following table 13 and the table 14.
Table 13 is the Huffman code books for the index of first sub-band, and table 14 is the Huffman code books for the index between other adjacent sub-bands.
Table 13
Index Figure place Code word (sexadecimal) Index Figure place Code word (sexadecimal)
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 5 8 8 8 8 7 7 6 6 6 6 5 5 5 4 5 0x17 0x64 0x65 0xf0 0xf1 0x33 0x79 0x18 0x22 0x23 0x3d 0x0b 0x12 0x1a 0x04 0x1f 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 5 5 5 5 5 5 5 5 5 5 5 5 5 5 2 0x1d 0x19 0x1c 0x16 0x18 0x14 0x13 0x15 0x1b 0x10 0x0e 0x0f 0x0d 0x0a 0x00
Table 14
Index Between the nearby frequency bands Between the consecutive frame
Figure place Code word Figure place Code word
0 2 0x00003 1 0x0000
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 2 3 3 4 4 5 5 6 6 7 7 8 8 9 9 10 10 11 11 12 12 13 13 14 14 14 15 16 17 17 0x00001 0x00005 0x00001 0x00009 0x00001 0x00011 0x00001 0x00021 0x00001 0x00041 0x00001 0x00080 0x00000 0x00102 0x00002 0x00206 0x00006 0x0040e 0x0000e 0x0081f 0x0001f 0x0103c 0x0003d 0x0207a 0x00079 0x00078 0x040f6 0x081ef 0x103dd 0x103dc 2 4 4 5 5 6 6 7 7 8 8 9 10 10 11 11 11 12 12 12 12 13 13 13 13 13 13 13 13 11 0x0002 0x000f 0x000d 0x001d 0x0019 0x0039 0x0031 0x0071 0x0061 0x00e0 0x00c0 0x0183 0x0386 0x0305 0x070b 0x0708 0x0609 0x0e1f 0x0e15 0x0c10 0x0e14 0x1c3a 0x1c3d 0x1c38 0x1c39 0x1823 0x1822 0x1c3c 0x1c3b 0x0709
By this way, the Huffman code book that proposes among the present invention is used to the spatial parameter scrambler 240 and the spatial parameter demoder 260 of illustrated SAC system among Fig. 2, makes to reduce the bit rate that transmission CLD quantization index needs.
As an alternative, when the figure place of the huffman coding that is used for 20 sub-frequency bands surpasses 100, can carry out 5 digit pulse sign indicating numbers modulation (PCM) coding to each sub-band.
Industrial applicibility
The present invention may be provided in at least one product (as, floppy disk, hard disk, CD ROM, flash card, PROM, RAM, ROM or tape) form be stored in computer program at least one computer-readable medium.Usually, computer program can be write as with any programming language (as C, C++ or JAVA).
Although illustrate and described the present invention with reference to its some exemplary embodiment, but it will be understood by those skilled in the art that and wherein can carry out the variation on various forms and the details and do not deviate from as the spirit and scope of the present invention by additional claims definition.

Claims (26)

1. channel grade difference CLD quantization method, be used for when carry out N channel audio signal (N>1) based on the coding of spatial audio coding SAC the time, quantize CLD parameter as spatial parameter, this CLD quantization method comprises the steps:
Extract the CLD of each sub-band from the N channel audio signal; And
By with reference to quantizing CLD based on the CLD quantization table of virtual source location information VSLI, this quantization table is to use the CLD quantized value design that obtains from the VSLI quantized value of N channel audio signal.
2. CLD quantization method as claimed in claim 1, wherein the VSLI quantized value quantizes at interval with the predetermined quantitative in the scope between 0 ° and 90 °.
3. CLD quantization method as claimed in claim 2, wherein predetermined quantitative is 3 ° at interval.
4. CLD quantization method as claimed in claim 1, wherein the CLD quantized value draws from the VSLI quantized value according to following equation:
CLD = 20 log 10 { tan ( VSLI - A L A R - A L × 90 ) }
5. CLD quantization method as claimed in claim 1, wherein the CLD quantized value draws from the VSLI quantized value according to following equation:
CLD = 20 log e { tan ( VSLI - A L A R - A L × 90 ) }
6. CLD quantization method as claimed in claim 1, wherein the judgement grade of CLD quantification draws from the VSLI judgement grade that VSLI quantizes.
7. CLD quantization method as claimed in claim 1, wherein as follows based on the CLD quantization table of VSLI:
Index CLD Index CLD Denary logarithm Natural logarithm Denary logarithm Natural logarithm -15 -65.1 -150.0 1 0.9 2.0 -14 -25.6 -58.9 2 1.8 4.2 -13 -19.5 -45.0 3 2.7 6.3 -12 -16.0 -36.8 4 3.7 8.6
-11 -13.4 -30.9 5 4.7 10.9 -10 -11.4 -26.3 6 5.8 13.4 -9 -9.7 -22.4 7 7.0 16.1 -8 -8.3 -19.1 8 8.3 19.1 -7 -7.0 -16.1 9 9.7 22.4 -6 -5.8 -13.4 10 11.4 26.3 -5 -4.7 -10.9 11 13.4 30.9 -4 -3.7 -8.6 12 16.0 36.8 -3 -2.7 -6.3 13 19.5 45.0 -2 -1.8 -4.2 14 25.6 58.9 -1 -0.9 -2.0 15 65.1 150.0 0 0.0 0.0
8. CLD quantization method as claimed in claim 7 wherein quantizes the judgement rank correlation based on the CLD quantization table of VSLI and following CLD:
Denary logarithm Natural logarithm CLD The judgement grade CLD The judgement grade CLD The judgement grade CLD The judgement grade -65.1 -- 0.0 -- -150.0 -- 0.0 -- -31.6 0.4 -72.8 1.0 -25.6 0.9 -58.9 2.0 -22.0 1.3 -50.8 3.1 -19.5 1.8 -45.0 4.2 -17.6 2.3 -40.5 5.2 -16.0 2.7 -36.8 6.3 -14.6 3.2 -33.7 7.5 -13.4 3.7 -30.9 8.6 -12.3 4.2 -28.5 9.7 -11.4 4.7 -26.3 10.9 -10.5 5.3 -24.3 12.2 -9.7 5.8 -22.4 13.4 -9.0 6.4 -20.7 14.8 -8.3 7.0 -19.1 16.1 -7.6 7.6 -17.6 17.6 -7.0 8.3 -16.1 19.1 -6.4 9.0 -14.8 20.7 -5.8 9.7 -13.4 22.4 -5.3 10.5 -12.2 24.3 -4.7 11.4 -10.9 26.3 -4.2 12.3 -9.7 28.5 -3.7 13.4 -8.6 30.9 -3.2 14.6 -7.5 33.7 -2.7 16.0 -6.3 36.8 -2.3 17.6 -5.2 40.5 -1.8 19.5 -4.2 45.0 -1.3 22.0 -3.1 50.8 -0.9 25.6 -2.0 58.9 -0.4 31.6 -1.0 72.8 0.0 65.1 0.0 150.0 -- -- -- --
9. CLD quantization method as claimed in claim 1 also comprises the step of the quantization index of CLD being carried out huffman coding.
10. CLD quantization method as claimed in claim 9, wherein huffman coding is by carrying out the quantization index of first sub-band with reference to following Huffman code book:
Index Figure place Code word (sexadecimal) Index Figure place Code word (sexadecimal) 0 5 0x17 16 5 0x1d 1 8 0x64 17 5 0x19 2 8 0x65 18 5 0x1c 3 8 0xf0 19 5 0x16 4 8 0xf1 20 5 0x18 5 7 0x33 21 5 0x14 6 7 0x79 22 5 0x13 7 6 0x18 23 5 0x15 8 6 0x22 24 5 0x1b 9 6 0x23 25 5 0x10 10 6 0x3d 26 5 0x0e 11 5 0x0b 27 5 0x0f 12 5 0x12 28 5 0x0d 13 5 0x1a 29 5 0x0a 14 4 0x04 30 2 0x00 15 5 0x1f
11. CLD quantization method as claimed in claim 10, wherein huffman coding is carried out the quantization index of all the other sub-bands beyond first sub-band by the following Huffman code book of reference:
Index Figure place Code word (sexadecimal) Index Figure place Code word (sexadecimal) 0 2 0x00003 16 10 0x00206 1 2 0x00001 17 10 0x00006 2 3 0x00005 18 11 0x0040e 3 3 0x00001 19 11 0x0000e 4 4 0x00009 20 12 0x0081f 5 4 0x00001 21 12 0x0001f 6 5 0x00011 22 13 0x0103c 7 5 0x00001 23 13 0x0003d 8 6 0x00021 24 14 0x0207a
9 6 0x00001 25 14 0x00079 10 7 0x00041 26 14 0x00078 11 7 0x00001 27 15 0x040f6 12 8 0x00080 28 16 0x081ef 13 8 0x00000 29 17 0x103dd 14 9 0x00102 30 17 0x103dc 15 9 0x00002
12. a computer readable recording medium storing program for performing records the computer program that is used to carry out according to arbitrary CLD quantization method of claim 1 to 11 on it.
13. the method based on spatial audio coding SAC coding N channel audio signal (N>1), this method comprises the steps:
Following mixing and coding N channel audio signal;
For each sub-band extracts spatial parameter, comprise channel grade difference CLD, interchannel correlativity/coherence ICC and passage predictive coefficient CPC from the N channel audio signal; And
The spatial parameter that quantification is extracted,
Wherein, in the step of the spatial parameter that quantizes to be extracted, by quantizing CLD with reference to the CLD quantization table based on virtual source location information VSLI, this quantization table is to use the CLD quantized value that obtains from the VSLI quantized value of N channel audio signal to design.
14. the device based on spatial audio coding SAC coding N channel audio signal (N>1), this device comprises:
The SAC code device, mixing N channel audio signal is to generate down mixed frequency signal under being used for, and for each sub-band extracts spatial parameter, comprise channel grade difference CLD, interchannel correlativity/coherence ICC and passage predictive coefficient CPC from the N channel audio signal;
Audio coding apparatus is used for generating the audio bitstream that compresses from the following mixed frequency signal that is generated by the SAC code device;
The spatial parameter quantization device is used to quantize the spatial parameter that is extracted by the SAC code device; And
The spatial parameter code device is used for the spatial parameter of coded quantization,
Wherein the spatial parameter quantization device is by quantizing CLD with reference to the CLD quantization table based on virtual source location information VSLI, and this quantization table is to use the CLD quantized value that obtains from the VSLI quantized value of N channel audio signal to design.
15. device as claimed in claim 14 is wherein as follows based on the CLD quantization table of VSLI:
Index CLD Index CLD Denary logarithm Natural logarithm Denary logarithm Natural logarithm -15 -65.1 -150.0 1 0.9 2.0 -14 -25.6 -58.9 2 1.8 4.2 -13 -19.5 -45.0 3 2.7 6.3 -12 -16.0 -36.8 4 3.7 8.6 -11 -13.4 -30.9 5 4.7 10.9 -10 -11.4 -26.3 6 5.8 13.4 -9 -9.7 -22.4 7 7.0 16.1 -8 -8.3 -19.1 8 8.3 19.1 -7 -7.0 -16.1 9 9.7 22.4 -6 -5.8 -13.4 10 11.4 26.3 -5 -4.7 -10.9 11 13.4 30.9 -4 -3.7 -8.6 12 16.0 36.8 -3 -2.7 -6.3 13 19.5 45.0 -2 -1.8 -4.2 14 25.6 58.9 -1 -0.9 -2.0 15 65.1 150.0 0 0.0 0.0
16. device as claimed in claim 15 wherein quantizes the judgement rank correlation based on the CLD quantization table of VSLI and following CLD:
Denary logarithm Natural logarithm CLD The judgement grade CLD The judgement grade CLD The judgement grade CLD The judgement grade -65.1 -- 0.0 -- -150.0 -- 0.0 -- -31.6 0.4 -72.8 1.0 -25.6 0.9 -58.9 2.0 -22.0 1.3 -50.8 3.1 -19.5 1.8 -45.0 4.2 -17.6 2.3 -40.5 5.2 -16.0 2.7 -36.8 6.3 -14.6 3.2 -33.7 7.5 -13.4 3.7 -30.9 8.6 -12.3 4.2 -28.5 9.7 -11.4 4.7 -26.3 10.9 -10.5 5.3 -24.3 12.2 -9.7 5.8 -22.4 13.4 -9.0 6.4 -20.7 14.8 -8.3 7.0 -19.1 16.1 -7.6 7.6 -17.6 17.6 -7.0 8.3 -16.1 19.1 -6.4 9.0 -14.8 20.7 -5.8 9.7 -13.4 22.4 -5.3 10.5 -12.2 24.3 -4.7 11.4 -10.9 26.3 -4.2 12.3 -9.7 28.5 -3.7 13.4 -8.6 30.9 -3.2 14.6 -7.5 33.7 -2.7 16.0 -6.3 36.8 -2.3 17.6 -5.2 40.5 -1.8 19.5 -4.2 45.0 -1.3 22.0 -3.1 50.8 -0.9 25.6 -2.0 58.9 -0.4 31.6 -1.0 72.8 0.0 65.1 0.0 150.0 -- -- -- --
17. a method is used for when based on spatial audio coding SAC the N channel audio bit stream (N>1) of coding being decoded, the channel grade difference CLD quantized value of de-quantization coding, and this method comprises the steps:
CLD quantized value to coding is carried out Hofmann decoding; And
By using the CLD quantized value based on the CLD quantization table de-quantization decoding of virtual source location information VSLI, this quantization table is to use the CLD quantized value design that draws from the VSLI quantized value of N channel audio signal.
18. method as claimed in claim 18 is wherein as follows based on the CLD quantization table of VSLI:
Index CLD Index CLD Denary logarithm Natural logarithm Denary logarithm Natural logarithm -15 -65.1 -150.0 1 0.9 2.0 -14 -25.6 -58.9 2 1.8 4.2
-13 -19.5 -45.0 3 2.7 6.3 -12 -16.0 -36.8 4 3.7 8.6 -11 -13.4 -30.9 5 4.7 10.9 -10 -11.4 -26.3 6 5.8 13.4 -9 -9.7 -22.4 7 7.0 16.1 -8 -8.3 -19.1 8 8.3 19.1 -7 -7.0 -16.1 9 9.7 22.4 -6 -5.8 -13.4 10 11.4 26.3 -5 -4.7 -10.9 11 13.4 30.9 -4 -3.7 -8.6 12 16.0 36.8 -3 -2.7 -6.3 13 19.5 45.0 -2 -1.8 -4.2 14 25.6 58.9 -1 -0.9 -2.0 15 65.1 150.0 0 0.0 0.0
19. method as claimed in claim 18, wherein CLD quantization table and the following CLD based on VSLI quantizes the judgement rank correlation:
Denary logarithm Natural logarithm CLD The judgement grade CLD The judgement grade CLD The judgement grade CLD The judgement grade -65.1 -- 0.0 -- -150.0 -- 0.0 -- -31.6 0.4 -72.8 1.0 -25.6 0.9 -58.9 2.0 -22.0 1.3 -50.8 3.1 -19.5 1.8 -45.0 4.2 -17.6 2.3 -40.5 5.2 -16.0 2.7 -36.8 6.3 -14.6 3.2 -33.7 7.5 -13.4 3.7 -30.9 8.6 -12.3 4.2 -28.5 9.7 -11.4 4.7 -26.3 10.9 -10.5 5.3 -24.3 12.2 -9.7 5.8 -22.4 13.4 -9.0 6.4 -20.7 14.8 -8.3 7.0 -19.1 16.1 -7.6 7.6 -17.6 17.6 -7.0 8.3 -16.1 19.1 -6.4 9.0 -14.8 20.7 -5.8 9.7 -13.4 22.4 -5.3 10.5 -12.2 24.3 -4.7 11.4 -10.9 26.3 -4.2 12.3 -9.7 28.5 -3.7 13.4 -8.6 30.9 -3.2 14.6 -7.5 33.7 -2.7 16.0 -6.3 36.8 -2.3 17.6 -5.2 40.5 -1.8 19.5 -4.2 45.0 -1.3 22.0 -3.1 50.8 -0.9 25.6 -2.0 58.9 -0.4 31.6 -1.0 72.8 0.0 65.1 0.0 150.0 -- -- -- --
20. method as claimed in claim 17 is wherein being carried out the CLD quantized value of coding in the step of Hofmann decoding, by the CLD quantized value of first sub-band of decoding with reference to following Huffman code book:
Index Figure place Code word (sexadecimal) Index Figure place Code word (sexadecimal) 0 5 0x17 16 5 0x1d 1 8 0x64 17 5 0x19 2 8 0x65 18 5 0x1c 3 8 0xf0 19 5 0x16 4 8 0xf1 20 5 0x18 5 7 0x33 21 5 0x14 6 7 0x79 22 5 0x13 7 6 0x18 23 5 0x15 8 6 0x22 24 5 0x1b 9 6 0x23 25 5 0x10 10 6 0x3d 26 5 0x0e 11 5 0x0b 27 5 0x0f 12 5 0x12 28 5 0x0d 13 5 0x1a 29 5 0x0a 14 4 0x04 30 2 0x00 15 5 0x1f
21. method as claimed in claim 20 wherein by the following Huffman code book of reference, is carried out huffman coding to the quantization index of all the other sub-bands beyond first sub-band:
Index Figure place Code word (sexadecimal) Index Figure place Code word (sexadecimal) 0 2 0x00003 16 10 0x00206 1 2 0x00001 17 10 0x00006 2 3 0x00005 18 11 0x0040e 3 3 0x00001 19 11 0x0000e 4 4 0x00009 20 12 0x0081f 5 4 0x00001 21 12 0x0001f 6 5 0x00011 22 13 0x0103c 7 5 0x00001 23 13 0x0003d 8 6 0x00021 24 14 0x0207a 9 6 0x00001 25 14 0x00079
10 7 0x00041 26 14 0x00078 11 7 0x00001 27 15 0x040f6 12 8 0x00080 28 16 0x081ef 13 8 0x00000 29 17 0x103dd 14 9 0x00102 30 17 0x103dc 15 9 0x00002
22. a computer readable recording medium storing program for performing records the computer program that is used for carrying out according to the arbitrary described CLD de-quantization method of claim 17 to 21 on it.
23. a method of the N channel audio bit stream (N>1) of coding being decoded based on spatial audio coding SAC, this method comprises the steps:
N channel audio bit stream decoding to coding;
The quantized value of at least one spatial parameter that de-quantization receives with the N channel audio bit stream of coding; And
Based on the synthetic N channel audio bit stream of decoding of the spatial parameter of de-quantization, with recovery N channel audio signal,
Wherein, in the step of the quantized value of at least one spatial parameter of de-quantization, by with reference to coming de-quantization to be included in CLD in the spatial parameter based on the CLD quantization table of virtual source location information VSLI, this quantization table is to use the CLD quantized value design that obtains from the VSLI quantized value of N channel audio signal.
24. a device of the N channel audio bit stream (N>1) of coding being decoded based on spatial audio coding SAC, this device comprises:
Be used for device to the N channel audio bit stream decoding of coding;
The device of quantized value of at least one spatial parameter of receiving with the N channel audio bit stream of encoding is used to decode;
The device that is used for the quantized value of de-quantization spatial parameter; And
Based on the synthetic N channel audio bit stream of decoding of the spatial parameter of de-quantization, with recovery N channel audio signal,
The device that wherein is used for the quantized value of de-quantization spatial parameter passes through with reference to the CLD quantization table based on virtual source location information VSLI, de-quantization is included in the CLD in the spatial parameter, and this quantization table is to use the CLD quantized value design that obtains from the VSLI quantized value of N channel audio signal.
25. device as claimed in claim 24 is wherein as follows based on the CLD quantization table of VSLI:
Index CLD Index CLD Denary logarithm Natural logarithm Denary logarithm Natural logarithm
-15 -65.1 -150.0 1 0.9 2.0 -14 -25.6 -58.9 2 1.8 4.2 -13 -19.5 -45.0 3 2.7 6.3 -12 -16.0 -36.8 4 3.7 8.6 -11 -13.4 -30.9 5 4.7 10.9 -10 -11.4 -26.3 6 5.8 13.4 -9 -9.7 -22.4 7 7.0 16.1 -8 -8.3 -19.1 8 8.3 19.1 -7 -7.0 -16.1 9 9.7 22.4 -6 -5.8 -13.4 10 11.4 26.3 -5 -4.7 -10.9 11 13.4 30.9 -4 -3.7 -8.6 12 16.0 36.8 -3 -2.7 -6.3 13 19.5 45.0 -2 -1.8 -4.2 14 25.6 58.9 -1 -0.9 -2.0 15 65.1 150.0 0 0.0 0.0
26. device as claimed in claim 25, wherein CLD quantization table and the following CLD based on VSLI quantizes the judgement rank correlation:
Denary logarithm Natural logarithm CLD The judgement grade CLD The judgement grade CLD The judgement grade CLD The judgement grade -65.1 -- 0.0 -- -150.0 -- 0.0 -- -31.6 0.4 -72.8 1.0 -25.6 0.9 -58.9 2.0 -22.0 1.3 -50.8 3.1 -19.5 1.8 -45.0 4.2 -17.6 2.3 -40.5 5.2 -16.0 2.7 -36.8 6.3 -14.6 3.2 -33.7 7.5 -13.4 3.7 -30.9 8.6 -12.3 4.2 -28.5 9.7 -11.4 4.7 -26.3 10.9 -10.5 5.3 -24.3 12.2 -9.7 5.8 -22.4 13.4 -9.0 6.4 -20.7 14.8 -8.3 7.0 -19.1 16.1 -7.6 7.6 -17.6 17.6 -7.0 8.3 -16.1 19.1 -6.4 9.0 -14.8 20.7 -5.8 9.7 -13.4 22.4 -5.3 10.5 -12.2 24.3 -4.7 11.4 -10.9 26.3 -4.2 12.3 -9.7 28.5 -3.7 13.4 -8.6 30.9 -3.2 14.6 -7.5 33.7 -2.7 16.0 -6.3 36.8 -2.3 17.6 -5.2 40.5 -1.8 19.5 -4.2 45.0 -1.3 22.0 -3.1 50.8 -0.9 25.6 -2.0 58.9 -0.4 31.6 -1.0 72.8 0.0 65.1 0.0 -- 150.0 -- -- -- --
CN2006800259842A 2005-07-19 2006-07-19 Virtual source location information based channel level difference quantization and dequantization method Expired - Fee Related CN101223598B (en)

Applications Claiming Priority (10)

Application Number Priority Date Filing Date Title
KR1020050065515 2005-07-19
KR20050065515 2005-07-19
KR10-2005-0065515 2005-07-19
KR20050096256 2005-10-12
KR1020050096256 2005-10-12
KR10-2005-0096256 2005-10-12
KR10-2006-0066822 2006-07-18
KR1020060066822 2006-07-18
KR1020060066822A KR100755471B1 (en) 2005-07-19 2006-07-18 Virtual source location information based channel level difference quantization and dequantization method
PCT/KR2006/002824 WO2007011157A1 (en) 2005-07-19 2006-07-19 Virtual source location information based channel level difference quantization and dequantization method

Publications (2)

Publication Number Publication Date
CN101223598A true CN101223598A (en) 2008-07-16
CN101223598B CN101223598B (en) 2011-07-20

Family

ID=38012181

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2006800259842A Expired - Fee Related CN101223598B (en) 2005-07-19 2006-07-19 Virtual source location information based channel level difference quantization and dequantization method

Country Status (4)

Country Link
JP (1) JP4685165B2 (en)
KR (1) KR100755471B1 (en)
CN (1) CN101223598B (en)
AT (1) ATE511691T1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102157153A (en) * 2010-02-11 2011-08-17 华为技术有限公司 Multichannel signal encoding method, device and system as well as multichannel signal decoding method, device and system
CN102157151A (en) * 2010-02-11 2011-08-17 华为技术有限公司 Encoding method, decoding method, device and system of multichannel signals
CN102483921A (en) * 2009-08-18 2012-05-30 三星电子株式会社 Method and apparatus for encoding multi-channel audio signal and method and apparatus for decoding multi-channel audio signal
CN103493127A (en) * 2012-04-05 2014-01-01 华为技术有限公司 Method for parametric spatial audio coding and decoding, parametric spatial audio coder and parametric spatial audio decoder
CN104464742A (en) * 2014-12-31 2015-03-25 武汉大学 System and method for carrying out comprehensive non-uniform quantitative coding on 3D audio space parameters

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101422745B1 (en) * 2007-03-30 2014-07-24 한국전자통신연구원 Apparatus and method for coding and decoding multi object audio signal with multi channel
JP5098458B2 (en) * 2007-06-20 2012-12-12 カシオ計算機株式会社 Speech coding apparatus, speech coding method, and program
KR101268616B1 (en) * 2008-07-14 2013-05-29 한국전자통신연구원 Method and device about channel information parameter quantization for enhancement of audio channel coding
JP5533502B2 (en) * 2010-09-28 2014-06-25 富士通株式会社 Audio encoding apparatus, audio encoding method, and audio encoding computer program
KR20120038311A (en) 2010-10-13 2012-04-23 삼성전자주식회사 Apparatus and method for encoding and decoding spatial parameter

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3664416B2 (en) * 1996-06-18 2005-06-29 Ykk Ap株式会社 Building fence structure
US6016473A (en) * 1998-04-07 2000-01-18 Dolby; Ray M. Low bit-rate spatial coding method and system
US6446037B1 (en) * 1999-08-09 2002-09-03 Dolby Laboratories Licensing Corporation Scalable coding method for high quality audio
JP3582589B2 (en) * 2001-03-07 2004-10-27 日本電気株式会社 Speech coding apparatus and speech decoding apparatus
KR100378796B1 (en) * 2001-04-03 2003-04-03 엘지전자 주식회사 Digital audio encoder and decoding method
US20030035553A1 (en) * 2001-08-10 2003-02-20 Frank Baumgarte Backwards-compatible perceptual coding of spatial cues
US7292901B2 (en) * 2002-06-24 2007-11-06 Agere Systems Inc. Hybrid multi-channel/cue coding/decoding of audio signals
ATE426235T1 (en) * 2002-04-22 2009-04-15 Koninkl Philips Electronics Nv DECODING DEVICE WITH DECORORATION UNIT
KR100923297B1 (en) * 2002-12-14 2009-10-23 삼성전자주식회사 Method for encoding stereo audio, apparatus thereof, method for decoding audio stream and apparatus thereof
KR100773539B1 (en) * 2004-07-14 2007-11-05 삼성전자주식회사 Multi channel audio data encoding/decoding method and apparatus

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102483921A (en) * 2009-08-18 2012-05-30 三星电子株式会社 Method and apparatus for encoding multi-channel audio signal and method and apparatus for decoding multi-channel audio signal
US8798276B2 (en) 2009-08-18 2014-08-05 Samsung Electronics Co., Ltd. Method and apparatus for encoding multi-channel audio signal and method and apparatus for decoding multi-channel audio signal
CN102483921B (en) * 2009-08-18 2014-07-30 三星电子株式会社 Method and apparatus for encoding multi-channel audio signal and method and apparatus for decoding multi-channel audio signal
US8626518B2 (en) 2010-02-11 2014-01-07 Huawei Technologies Co., Ltd. Multi-channel signal encoding and decoding method, apparatus, and system
WO2011097903A1 (en) * 2010-02-11 2011-08-18 华为技术有限公司 Multi-channel signal coding, decoding method and device, and coding-decoding system
CN102157151B (en) * 2010-02-11 2012-10-03 华为技术有限公司 Encoding method, decoding method, device and system of multichannel signals
CN102157153B (en) * 2010-02-11 2013-04-24 华为技术有限公司 Multichannel signal encoding method, device and system as well as multichannel signal decoding method, device and system
CN102157153A (en) * 2010-02-11 2011-08-17 华为技术有限公司 Multichannel signal encoding method, device and system as well as multichannel signal decoding method, device and system
WO2011097912A1 (en) * 2010-02-11 2011-08-18 华为技术有限公司 Multi-channel signal encoding and decoding method, device and system
CN102157151A (en) * 2010-02-11 2011-08-17 华为技术有限公司 Encoding method, decoding method, device and system of multichannel signals
US10008210B2 (en) 2010-02-11 2018-06-26 Huawei Technologies Co., Ltd. Method, apparatus, and system for encoding and decoding multi-channel signals
CN103493127A (en) * 2012-04-05 2014-01-01 华为技术有限公司 Method for parametric spatial audio coding and decoding, parametric spatial audio coder and parametric spatial audio decoder
CN103493127B (en) * 2012-04-05 2015-03-11 华为技术有限公司 Method for parametric spatial audio coding and decoding, parametric spatial audio coder and parametric spatial audio decoder
US9324329B2 (en) 2012-04-05 2016-04-26 Huawei Technologies Co., Ltd. Method for parametric spatial audio coding and decoding, parametric spatial audio coder and parametric spatial audio decoder
CN104464742A (en) * 2014-12-31 2015-03-25 武汉大学 System and method for carrying out comprehensive non-uniform quantitative coding on 3D audio space parameters
CN104464742B (en) * 2014-12-31 2017-07-11 武汉大学 A kind of comprehensive non-uniform quantizing coded system of 3D audio spaces parameter and method

Also Published As

Publication number Publication date
KR100755471B1 (en) 2007-09-05
ATE511691T1 (en) 2011-06-15
KR20070011136A (en) 2007-01-24
JP4685165B2 (en) 2011-05-18
JP2009502086A (en) 2009-01-22
CN101223598B (en) 2011-07-20

Similar Documents

Publication Publication Date Title
CN101223598B (en) Virtual source location information based channel level difference quantization and dequantization method
EP1905034B1 (en) Virtual source location information based channel level difference quantization and dequantization
EP1749296B1 (en) Multichannel audio extension
CN103329197B (en) For the stereo parameter coding/decoding of the improvement of anti-phase sound channel
JP5265853B2 (en) Processing device, processing method, recording medium, encoding method, and decoding method
CN101055720B (en) Method and apparatus for encoding and decoding an audio signal
EP2054883B1 (en) Audio encoding system
US8489405B2 (en) Compression coding and decoding method, coder, decoder, and coding device
US6735339B1 (en) Multi-stage encoding of signal components that are classified according to component value
CN101199121A (en) Scalable compressed audio bit stream and codec using a hierarchical filterbank and multichannel joint coding
US8515770B2 (en) Method and apparatus for encoding and decoding excitation patterns from which the masking levels for an audio signal encoding and decoding are determined
WO1995032499A1 (en) Encoding method, decoding method, encoding-decoding method, encoder, decoder, and encoder-decoder
WO2023241254A1 (en) Audio encoding and decoding method and apparatus, electronic device, computer readable storage medium, and computer program product
WO2023241222A1 (en) Audio processing method and apparatus, and device, storage medium and computer program product
KR101381602B1 (en) Method and apparatus for scalable encoding and decoding
EP2993665A1 (en) Method and apparatus for coding or decoding subband configuration data for subband groups
US11176954B2 (en) Encoding and decoding of multichannel or stereo audio signals
CN113314131B (en) Multistep audio object coding and decoding method based on two-stage filtering
Derrien Detection of genuine lossless audio files: Application to the MPEG-AAC codec
Joseph et al. Speech coding based on orthogonal and biorthogonal wavelet
Werner et al. An enhanced SBR tool for low-delay applications
Kurth An audio codec for multiple generations compression without loss of perceptual quality
CN113948094A (en) Audio encoding and decoding method and related device and computer readable storage medium
CN117198301A (en) Audio encoding method, audio decoding method, apparatus, and readable storage medium
CN115410585A (en) Audio data encoding and decoding method, related device and computer readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20080716

Assignee: Neo Lab Convergence Inc.

Assignor: Korea Electronic Communication Institute

Contract record no.: 2016990000259

Denomination of invention: Virtual source location information based channel level difference quantization and dequantization method

Granted publication date: 20110720

License type: Exclusive License

Record date: 20160630

LICC Enforcement, change and cancellation of record of contracts on the licence for exploitation of a patent or utility model
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110720

Termination date: 20170719

CF01 Termination of patent right due to non-payment of annual fee