CN1110955C

CN1110955C - Apparatus for adaptively encoding input digital audio signals from plurality of channels

Info

Publication number: CN1110955C
Application number: CN95101987A
Authority: CN
Inventors: 金钟一
Original assignee: Daewoo Electronics Co Ltd
Current assignee: Fengye Vision Technology Co., Ltd.
Priority date: 1995-02-13
Filing date: 1995-02-13
Publication date: 2003-06-04
Anticipated expiration: 2015-02-13
Also published as: CN1145567A

Abstract

The present invention relates to a device for encoding input digital audio signals from N channels in a self-adaptive mode. The device comprises N filters, a first estimating device, a second estimating device, a first-bit allocating unit, a second-bit allocating unit, N quantizers and a formatting circuit, wherein the first estimating device is used for estimating a first parameter for each input signal; the second estimating device is used for estimating the sensation entropy of each frame of each input signal according to respective first parameter, and in this way, the average deviation and the standard deviation of a first frame group is derived, and a second frame group is generated for each channel; the first-bit allocating unit is used for determining bits and generating bit allocating information for all the second frame groups according to the sensation entropies and the parameters of average deviation and standard deviation of the second frame groups; the second-bit allocating unit is used for determining bits of partial bands of frames in all the second frame groups; the quantizers are used for quantifying filtered audio signals.

Description

Encode adaptively from the device of the input digital audio signal of many passages

Technical field

The present invention relates to a kind of digital audio encoding apparatus; More specifically, the perceptual entropy that relates to a kind of basis each input digital audio signal consistent with the mankind's the sense of hearing is encoded adaptively from the improved device of the input digital audio signal of many passages.

Background technology

The transmission of digital audio signal makes transmission become possibility with the high-quality audio signal that the signal of CD and/or digital audiotape is equal to mutually.When representing audio signal, just need to send considerable data volume, especially in the situation of high definition TV with digital form.Yet, because it is limited distributing to the available frequency bandwidth of these digital audio and video signals, in order to send considerable amount of digital data, for example send every channel 768Kbps (kbps) of 16 PCM (pulse code modulated) audio signal of 48KHz sampling frequency, just will compress this digital audio-frequency data inevitably by limited audio bandwidth (approximately 128KHz).

In various audio compression devices or technology, adopt MPEG (active images expert group) audio algorithm that is called of psychologic acoustics algorithm to be proposed and to be used in during HDTV uses.

The mpeg audio algorithm uses four major parts: subrane filtering, psychoacoustic modelization, quantification and coding and frame formatting.Subrane filtering is that an input pcm digital audio signal is mapped to a process on the frequency domain from time domain.Can adopt a bank of filters with the individual subrane of B (such as 32).In each subrane, make up 12 or 36 sampling for its processing; And from the sampling of the combination of a described B subrane, promptly B * 12 or 36 constitute one " frame ", it be used to encode, transmission and decoded audio signal one handle unit.Psychoacoustic model turns to each subrane or subrane is set up upright one group of data, such as SMR (signal-mask ratio) data, quantizes and coding to control it.Then, in quantizing the process of sampling, adaptively available position is distributed to each subrane of a frame with reference to SMR with the coding subrane.A frame formatter is to be suitable for the additional information formatted frame data of transmission manner together with other needs.

Yet, in above-mentioned mpeg audio technology because what distribute to each frame is the position of fixed number, just can not reflect the input digital audio signal that may change continuously in interframe such as average, standard deviation, and statistical property such as perceptual entropy.Moreover if the input digital audio signal that will have different perceptual entropy with this prior art offers traditional device, it is encoded these input digital audio signals and is not taken perceptual entropy into account, and what draw is the signal inferior to the human auditory.

Summary of the invention

Therefore, a main purpose of the present invention can be the novel apparatus of many channel adaptive ground coding input digital audio signals according to the perceptual entropy of each input digital audio signal for providing a kind of, thereby promotes code efficiency and sound quality.

According to one aspect of the present invention, providing a kind of encodes adaptively from the novel apparatus of the input digital audio signal of N bar passage, wherein each input digital audio signal comprises a plurality of frames and each frame comprises a plurality of subranes, wherein N is a positive integer, this device comprises: parellel arranged N subrane filter, be used in the basis of subrane, reception and these input digital audio signals of filtering respectively one by one; First estimating device is used to each subrane that is included in this input digital audio signal respectively, estimation smnr data, sound pressure level and mask threshold value; Second estimating device, be used for according to signal-mask of estimating than data, sound pressure level and mask threshold value, estimation is included in the perceptual entropy of each frame in this input digital audio signal respectively, be derive average and standard deviation parameter of an interchannel frame group of the frame that comprises N * F current and front of whole N bar passages whereby, described interchannel frame group is corresponding to the perceptual entropy of estimation, and be used to respectively this N bar passage, generation comprises frame group in the passage of the current frame with the front of F, and wherein F is a positive integer; First bit allocation apparatus is used for according to perceptual entropy and average and standard deviation parameter for frame group estimation in each passage, adaptively for frame group in this passage respectively determines the position, and be used to generate corresponding to determine for frame group in this passage respectively allocation information; Second bit allocation apparatus is used for signal-mask according to estimation than the allocation information of data and generation, determines the position for being included in this passage respectively each subrane among the frame group, and be used to generate corresponding to determine for each subrane allocation information; A parellel arranged N quantization device is used to respond the corresponding allocation information that each subrane into frame group in this passage respectively generates, and is this N bar passage, the subrane audio signal behind the quantification filtering; And the audio signal that is used for formaing this quantification together with the allocation information that is generated.

Description of drawings

From the description of the preferred embodiment of getting below in conjunction with accompanying drawing, above-mentioned and other purpose of the present invention and feature will be conspicuous, in the accompanying drawing:

Fig. 1 illustrates and schematically illustrates according to of the present invention and encode adaptively from the block diagram of the novel apparatus of the input digital audio signal of many passages; And

Fig. 2 shows the more detailed block diagram of first allocation units shown in Fig. 1.

Embodiment

Referring to Fig. 1, wherein show the block diagram that schematically illustrates according to a kind of code device 100 of the present invention.

Code device 100 comprises a subrane filter 10, first and second feeling parameter estimating device 20 and 30, first, second and 40,50 and 60, quantization devices 70 of the 3rd allocation units and a formating circuit 80.

In code device 100, will be by N (such as 2) bar passage (promptly first (or a left side) and second (or right) passage) the input digital audio signal X1 (m of the i frame (or present frame) that receives, i) with X2 (s, i) act on the first feeling parameter estimating device 20 and the subrane filter 10, wherein each input digital audio signal comprises M and S sampling, thus m=0,1,, M-1; S=0,1 ..., S-1, and M and S be positive integer, wherein each M and the number of S sampling are to be directly proportional with sampling frequency from respectively this input digital audio signal of these passages.Here used " frame " represented a part corresponding to the digital audio and video signals of the audio sample of a fixed number, and is a processing unit of encoding and decoding digital audio and video signals.

As shown in the figure, subrane filter 10 comprises a plurality of subrane filters, such as two subrane filters 11 and 12, they are connected in parallel, so that receive the input digital audio signal of present frame simultaneously, and a kind of subrane filtering technique of knowing in the employing present technique is carried out the filtering from the input digital audio signal of first and second passage, this filtering technique is such as being disclosed in IS0/IEC JTCI/SC2/WG11, " part 3; audio frequency suggestion ", the method in the so-called mpeg audio algorithm described in the CD-11172-3 (1991).This is that respectively this subrane filter 11 and 12 plays and will have sampling frequency f _sInput digital audio signal split into the individual f of having of B (such as 32) _sThe effect of the uniformly-spaced subrane of the sampling frequency of/B, and with the division the subrane audio sample offer quantization device 70.

On the other hand, the input digital audio signal that the first feeling parameter estimating device 20 receives from the present frame of first and second passage, and utilize such as a kind of psychoacoustic model of in the mpeg audio algorithm, discussing, carry out estimating signal-mask than (SMR) data SMR1 and SMR2, sound pressure level P1 and P2 and mask threshold value M1 and M2 for being included in from each subrane in the present frame of first and second passage.SMR1 from each subrane of first passage draws from following formula:

SMR1(j)＝P1(j)-M1(j)

Formula (1) wherein j is a partial wave hop count mark, j=0, and 1 ..., B-1, B are the sum of the subrane in the frame; SMR1 (j) is the signal-mask ratio among the subrane j of first passage; P1 (j) is the sound pressure level from the subrane j of the first passage of FFT (fast Fourier transform) estimation; M1 (j) is the mask threshold value among the subrane j of first passage; And SMR1 (j), P1 (j) and the unit of M1 (j) are dB (decibels).

Similarly, the SMR2 from each subrane of second channel can derive with following formula: SMR2 (j)=P2 (j)-M2 (j) dB

Formula (2) wherein j has previously defined identical meanings.

The mask threshold value is represented the limit that can hear, and it is that a kind of intrinsic of sound can be heard the increment sum that the appearance other tone and composition non-pitch of the limit or threshold value and audio signal causes.Then, signal-the mask of first and second passage is presented to the 3rd allocation units 60 than data SMR1 (j) and SMR2 (j), simultaneously the sound pressure level P1 (j) of first and second passage and P2 (j) and mask threshold value M1 (j) and M2 (j) are coupled on the second feeling parameter estimating device 30 that comprises the average and standard deviation estimator 34 of a perceptual entropy estimation device 32 and.

Perceptual entropy estimation device 32 plays the sound pressure level P1 (j) and P2 (j) and mask threshold value M1 (j) and M2 (j) according to 20 feed-ins of the first feeling parameter estimating device, is the i frame estimation perceptual entropy PE1 (i) of first and second sound channel and the effect of PE2 (i).The perceptual entropy of the i frame of first passage as knowing in the present technique, can be expressed as follows:

PE 1 (i) =

\frac{1}{B} Σ_{j = 0}^{B - 1} MAX [0, \frac{1}{2} \log_{2} \frac{P 1 (j)}{M 1 (j)}] dB

Formula (3) is i wherein, and j and B have previously defined identical meanings.

Similarly, the perceptual entropy PE2 (i) of the i frame of second channel as knowing in the present technique, can draw from following formula:

PE 2 (i) = \frac{1}{B} Σ_{j = 0}^{B - 1} MAX [0, \frac{1}{2} \log_{2} \frac{P 2 (j)}{M 2 (j)}] dB

Formula (4) is i wherein, and j has previously defined identical implication with B.

Formula (3) can be used so-called rate distortion theory with (4) and draw; And corresponding to perceptual entropy according to the human auditory.Then, will estimate on the device 32 that the perceptual entropy of i frame of first and second passage of estimation delivers to second allocation units 50 and average and standard deviation estimator 34 in perceptual entropy.

Adopt average and standard deviation estimator 34 to make up the perceptual entropy of estimation of the frame of individual current and its front of the Q (such as 4) of first and second passage, be PE1 (i-1), PE1 (i), PE2 (i-1) and PE2 (i), they are to divide coordination adaptively between first and second passage for the processing according to first allocation units 40, and estimate what device was presented from perceptual entropy, first allocation units 40 will contrast Fig. 2 below and describe in detail; And utilize an interchannel frame group's of the current frame with the front of four of comprising first and second passage the panesthesia entropy of estimation, of their statistical property of estimation expression average with a standard deviation parameter.The mean parameter PEM of interchannel frame group's panesthesia entropy, as knowing in the present technique, can estimate as follows:

PEm = \frac{1}{Q} Σ_{p = 0}^{Q - 1} PE (p)

Formula (5) wherein p is a frame number mark that is used among the interchannel frame group, p=0, and 1 ... Q-1, Q are this interchannel frame group's totalframes; PE (p) then represents the perceptual entropy of the p frame among this interchannel frame group.

Correspondingly, the standard deviation parameter PEstd of this interchannel frame group's panesthesia entropy as knowing in the present technique, can be expressed as follows:

PEstd = \sqrt{\frac{1}{Q} Σ_{p = 0}^{Q - 1} [PE (p) - PEm]^{2}}

Formula (6) wherein p and Q has previously defined identical meanings.

In a preferred embodiment of the present invention, in order adaptively the position to be distributed to each bar in first and second passage according to their perceptual entropy and to be included in each frame in each passage, average and standard deviation estimator 34 also make up the perceptual entropy of estimation of the frame of individual current and its front of the F (such as 2) of each bar of first and second passage, be PE1 (i) and PE1 (i-1) and PE2 (i) and PE2 (i-1), and become perceptual entropy PE (1) and PE (2) for frame all living creatures in two passages of first and second passage.Then, will on this average and standard deviation estimator 34, generate with the perceptual entropy PE (1) of first and second passage of estimation and PE (2) and on average and standard deviation parameter PEm and PEstd be transferred to first allocation units 40.

Adopt first allocation units 40 according to from the perceptual entropy PE (1) of average and standard deviation estimator 34 and PE (2) and average to come with standard deviation parameter PEm and PEstd be that frame group in two passages of first and second passage determines the position, and will be corresponding to being that determined of frame group's allocation information CBI1 and CBI2 offer second allocation units 50 and formating circuit 80 in two passages of first and second passage.

Referring to Fig. 2, wherein show the more detailed block diagram of first allocation units 40 shown in Fig. 1.First allocation units 40 comprise that one is differentiated level estimation device 41, multiplier 42, subtracter 43, a divider 44 and a bit allocation apparatus 45.

Differentiating level estimation device 41 plays according to average and standard deviation parameter PEm and PEstd from the average and standard deviation estimator 34 shown in Fig. 1, respectively the position is distributed to frame group in two passages of first and second passage for bit allocation apparatus 45, and the best of estimation interchannel frame group is differentiated the effect of level.According to a preferred embodiment of the present invention, differentiate the available following formula of level D (k) and draw for k of interchannel frame group:

D(k)＝NF·PEstd·k

Formula (7) wherein k is a differentiation level number mark, and k=-q to q, q are a positive integer, and NF is a normalization factor among the interchannel frame group.

From formula (7) as seen, interchannel frame group's k the level spacing of differentiating between level D (k) and (k-1) the individual differentiation level D (k-1) depends on from average and the standard deviation PEstd of standard deviation estimator 34 and this interchannel frame group's normalization factor NF, differentiates the sum (for example 2q+1) of level and then is scheduled to.Should be appreciated that the sum of differentiating level can be definite according to the code efficiency and the sound quality of desired code device.Be used in the normalization factor NF that differentiates the interchannel frame group in the level estimation device 41 and preferably can be used to determine with average and standard deviation parameter PEm and the PEstd of standard deviation estimator 34 and a whole process (global) mean parameter PEgm and the PEstd average and an omnidistance standard deviation that is stored in advance in the memory (not shown), so that derive closely the best differentiation level of the interchannel frame group of mating with the human auditory of reality from average.Each omnidistance average mean parameter with omnidistance standard deviation can utilize the average and standard deviation parameter of estimating in a predetermined time duration easily to measure respectively.According to the present invention, interchannel frame group's normalization factor NF can draw from following formula:

NF = (\frac{PEgstd}{PEgm}) \cdot (\frac{PEm}{PEstd})

Formula (8)

As seen, it should be noted that from formula (7) and (8) interchannel frame group's differentiation level can be used as an integral multiple of mean parameter and determines.To on differentiation level estimation device 41, estimation offer bit allocation apparatus 45 with differentiation level of being scheduled to and differentiation level sum.

On the other hand, multiplier 42 is presented the mean parameter PEm that comes with the average and standard deviation estimator 34 shown in Fig. 1 and be multiply by a coefficient F (for example being 2 in this example) who is stored in sum in the memory (not shown), that represent to be included in the frame in any one among the frame group in two passages in advance, generates an output parameter 2PEm after taking advantage of whereby.Then, the perceptual entropy PE (1) that the output parameter 2PEm that multiplier 42 is generated reaches from frame group in two passages of the average and standard deviation estimator 34 shown in Fig. 1 delivers to subtracter 43 with PE (2), this subtracter 43 is used for calculating the difference signal E (1) and E (2) of frame group in two passages by deduct each perceptual entropy PE (1) respectively from output parameter 2PEm with PE (2), and generates the difference signal E (1) and E (2) that calculates.Divider 44 is used for receiving the difference signal E (1) and E (2) from subtracter 43, whereby, each difference signal is divided into the predetermined frame number F (for example being 2 in this example) that is included in the passage in any one among the frame group, be respectively applied for the best differentiation level of dividing equally difference signal so that draw, and generate difference signal E (1)/2, E (2)/2 after removing.Subsequently, will differentiate estimation and predetermined differentiation level D (k) and the sum (being 2q+1) of differentiating level on the level estimation device 41, and the difference signal E (1)/2 that divider 44 generates offers bit allocation apparatus 45 simultaneously with E (2)/2.

Bit allocation apparatus 45 play according to from the differentiation level of differentiating level estimation device 41 with differentiate the level sum and from the difference signal of divider 44, be that among the frame group in two passages of first and second passage each is determined effect of position.According to a preferred embodiment of the present invention, the position of each passage distributes FB can determine as follows:

FB (r) = F \cdot FBm + \frac{F \cdot BV}{2 q + 1} \cdot I

Formula (9) wherein r is as positive integer of port number target; FBm is an average bit, such as 3072 of every frames of 16 PCM (pulse code modulated) that have the 48KHz sampling frequency on the message transmission rate of every passage 128Kbps; BV is a predetermined position changing value; 2p+1 is the sum of predetermined differentiation level; I is a level number mark among the passage r; And F has previously defined identical meanings.

As seen, the position of r bar passage distributes FB (r) to determine by add the variation figure place that can derive from its second on the total FFBm of average bit from formula (9).It is next definite that pre-determined bit changing value BV wherein can be used as a value of the position excursion between minimum position that expression can distribute in the position that each interchannel changes and the multidigit; And frame group's level number mark I can be according to drawing with E (2)/2 from the decision level D (k) that differentiates level estimation device 41 and from each difference signal E (1)/2 of divider 44 in the passage of r bar passage.According to the present invention, in the passage of r bar passage frame group's level number mark I can be expressed as shown in Table (wherein supposed differentiate level be spaced apart 1.27 and differentiate level number mark k and be-2 to 2):

As seen, if frame group's difference signal E (r) is present between differentiation level-2.55 and-1.28 in the passage of r passage, then its level number mark I may be selected to be-1 from table 1; And if it is being differentiated between the level-1.27 and 1.26, then level number mark I may be selected to be 0, by that analogy.In this way, frame group's position distributes FB (r) can use formula (9) to determine expediently in the passage of r passage.

Then, will act on simultaneously on second allocation units 50 corresponding to the allocation information CBI1 of first and second passage of the position of determining for each passage on the allocation units 45 on the throne and CBI2 and from the perceptual entropy PEI (i) and the PE2 (i) of the estimation of the perceptual entropy shown in Fig. 1 device 32; And the allocation information CBI1 and the CBI2 of each passage offered formating circuit 80.

Later referring to Fig. 1, second allocation units 50 are according to from the perceptual entropy of the correspondence of perceptual entropy estimation device 32 and from the allocation information CBI1 and the CBI2 of first allocation units 40, for each frame in each among the frame group in two passages that are included in first and second passage is determined the position.On second allocation units 50, in two passages among the frame group position of each frame in each distribute a kind of the distribution technique that preferably can be used in the digital audio encoding apparatus to determine unsettled the owning together in the U.S. Patent application that this device is disclosed in " adaptive digital audio encoding apparatus and a kind of bit allocation method thereof " by name.

Subsequently, will be corresponding on second allocation units 50, being the allocation information FBI1 and the FBI2 of first and second passage of being included in two passages of first and second passage among the frame group the determined position of each frame in each, and offer the 3rd allocation units 60 simultaneously than data SMR1 (j) and SMR2 (j) from the signal-mask of the first feeling parameter estimating device 20 shown in Fig. 1; And allocation information FBI1 and FBI2 acted on the formating circuit 80.

The 3rd allocation units 60 receive the signal-mask of the first feeling parameter estimating device, 20 feed-ins than data SMR1 and SMR2, and the allocation information FBI1 and the FBI2 of each frame of providing of second allocation units 50, for each subrane in each frame that is included in frame group in two passages is determined the position, and will offer quantizer 70 and formating circuit 80 corresponding to the allocation information SBI1 and the SBI2 of the position of determining for each subrane of first and second passage respectively.Be used in the 3rd principle in the allocation units 60 and be to be no more than under the restriction of the figure place that this frame of sending from second allocation units 50 can utilize, optimize the total signal-mask ratio on the frame in employed figure place.Immediately, will act on simultaneously on the quantization device 70 that comprises a plurality of quantizers (such as 71 and 72) from the allocation information SBI1 of each subrane of first and second passage of the 3rd allocation units 60 and SBI2 and from the subrane audio sample of the part of subrane filter 11 and 12.

Each quantizer 71 and 72 plays according to the corresponding allocation information from the 3rd allocation units 60, quantize adaptively from the effect of this subrane filter 11 respectively, and respectively the quantization audio signal of this first and second passage offers formating circuit 80 with the subrane audio sample of 12 corresponding part.

On formating circuit 80, format is from the quantization audio sampling of this quantizer 71 respectively and 72 and from the allocation information of first, second and the 3rd allocation units 40,50 and 60, and they are transferred to transmitter (not shown) usefulness for transmission, improve code efficiency and sound quality whereby from the input digital audio signal of first and second passage.The principle of subrane filter 10, the first feeling parameter estimating device 20, the 3rd allocation units 60, quantization device 70 and formating circuit 80 and function basically with in the mpeg audio algorithm, can see those are identical.

Though the present invention illustrates and describes with reference to certain embodiments, person skilled in the art person is clear, can make many changes and correction under the condition that does not break away from defined invention spirit and scope in the appending claims.

Table 1

Differentiate the scope of level D	Level number mark (I)
Differentiate the scope of level D	Level number mark (I)	～-2.56	-2
-2.55～-1.28	-1	～-2.56	-2
-2.55～-1.28	-1	-1.27～1.26	0
1.27～2.54	1	-1.27～1.26	0
1.27～2.54	1	2.55～	2

Claims

1, a kind of input digital audio signal (X that is used for encoding adaptively from N bar passage ₁(m, i), X ₂The device of (S, i)), wherein respectively this input digital audio signal comprises a plurality of frames and each frame comprises a plurality of subranes, and wherein N is a positive integer, and this device comprises:

Parellel arranged N subrane filter (11,12) is respectively applied on the basis of subrane one by one, receives and this input digital audio signal of filtering;

First estimating device (20) is used to each the subrane estimating signal-mask that is included in this input digital audio signal respectively than data (SMR1 (j), SMR2 (j)), sound pressure level (P2 (j), P2 (j)) and mask threshold value (M1 (j), M2 (j));

Second estimating device (30), be used for comparing data according to the signal-mask of estimation, sound pressure level and mask threshold value, estimation is included in the first perceptual entropy (PE1 (i) of each frame in this input digital audio signal respectively, PE2 (i)), an interchannel frame group of the frame of current for the N * F that comprises whole N bar passages whereby and front derive one average (PEm) and a standard deviation parameter (PEstd), wherein F is a positive integer, representative is included in the number of frames among the frame group in each passage, described interchannel frame group is corresponding to the perceptual entropy of estimation, and be used to F the interior frame group of passage current and previous frame that comprise of each passage of N passage to estimate second perceptual entropy (PE (1), PE (2));

First bit allocation apparatus (40), be used for according to second perceptual entropy and average and standard deviation parameter for the respectively interior frame group estimation of this passage, determine the position for frame group in this passage respectively adaptively, and be used to generate corresponding to for first allocation information of the determined position of frame group in this passage respectively (CBI1, CBI2);

Second bit allocation apparatus (50), being used for based on the first estimation perceptual entropy and first allocation information is that each frame that is included in each group of frame group in the passage is determined the position, and be used for corresponding to determined each frame the position produce second allocation information (FBI1, FBI2);

The 3rd bit allocation apparatus (60), be used for according to second allocation information of signal-mask of estimating than data and generation, determine the position for being included in this passage respectively each subrane among the frame group, and be used to generate the 3rd allocation information corresponding to the position of determining for this subrane respectively (SBI1, SBI2);

A parellel arranged N quantization device (71,72) is used for responding the 3rd allocation information into each subrane generation of frame group in the passage, quantizes the filtered subrane audio signal of N bar passage; And

Be used for formaing the device (80) of the audio signal of this quantification together with first, second and the 3rd allocation information that are generated.

2, the device described in the claim 1, wherein said first bit allocation apparatus (40) comprising:

Be used for the perceptual entropy according to estimation, average and standard deviation parameter, the device (41) of estimation interchannel frame group's differentiation level (D (K));

Be used for generating the device (42) of an average function (2PEm) by the predetermined factor that mean parameter be multiply by the frame number that an expression is included in frame group in this passage respectively;

A difference signal (the E (1) of the difference between the average function that is used to generate respectively this perceptual entropy of frame group in the expression passage and is generated, E (2)), and be used for difference signal by respectively this generation and be divided into the predetermined factor difference signal of dividing equally (E (1)/2 is provided, E (2)/2) device (43,44); And

Bit allocation apparatus (45), differentiation level according to estimation, the sum of differentiation level and the difference signal that respectively provides are provided, adaptively for frame group in this passage respectively determines the position, and be used to generate first allocation information corresponding to the position of determining for frame group in this passage respectively (CBI1, CBI2).

3, the device described in the claim 2, wherein interchannel frame group's the level D that respectively differentiates determines as follows:

D=NFPEstdk wherein k is a differentiation level number mark, and k=-q to q, q are a positive integer, and NF is a normalization factor among this interchannel frame group; PEstd then is this interchannel frame group's a standard deviation parameter.

4, the device described in the claim 2, wherein the position of the r passage in the N bar passage distributes FB (r) to draw from following formula:

FB (r) = F \cdot FBm + \frac{F \cdot BV}{2 q + 1} \cdot I

Wherein r is for being used as positive integer of port number target; FBm is a function of the average bit of a frame; F is included in a factor of the frame number among the frame group in this passage respectively for expression; BV is a predetermined position changing value; 2q+1 is the sum of predetermined differentiation level; I is a level number mark in the r passage.