WO2006003813A1 - Audio encoding and decoding apparatus - Google Patents

Audio encoding and decoding apparatus Download PDF

Info

Publication number
WO2006003813A1
WO2006003813A1 PCT/JP2005/011340 JP2005011340W WO2006003813A1 WO 2006003813 A1 WO2006003813 A1 WO 2006003813A1 JP 2005011340 W JP2005011340 W JP 2005011340W WO 2006003813 A1 WO2006003813 A1 WO 2006003813A1
Authority
WO
Grant status
Application
Patent type
Prior art keywords
signal
transience
channel
information
original
Prior art date
Application number
PCT/JP2005/011340
Other languages
French (fr)
Japanese (ja)
Inventor
Naoya Tanaka
Kok Seng Chong
Mineo Tsushima
Original Assignee
Matsushita Electric Industrial Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/007Two-channel systems in which the audio signals are in digital form

Abstract

An audio decoding apparatus for improving the signal separation process based on spatial acoustic information to improve the sound quality, wherein signal separating means (104) performs, based on both an inter-channel phase difference information IPD parameter (112) and a signal transition degree TF parameter (113), a signal separation process for an inputted monaural FFT coefficient (111). FFT coefficients (116) as separated for a plurality of channels are adjusted in inter-channel correlation by correction control means (105) and thereafter adjusted in gain by gain control means (106). The signal separation process is achieved by shifting, based on a phase shift amount as calculated from the IPD parameter, the phase of the monaural FFT coefficient. The shift amount for each channel is adjusted in accordance with the TF parameter, thereby improving the sound quality against a transient signal, particularly, an attack sound or the like.

Description

Specification

Audio encoding and decoding apparatus

Technical field

[0001] The present invention is efficiently encoded with a small amount of information to the audio signal, also relates to a device for decoding the encoded information.

BACKGROUND

[0002] The object of the audio coding, as efficiently as possible compression encoding digital I spoon audio signal, transmitted by decoding I spoon processing in the decoder is to reproduce the audio signal of the highest possible quality .

[0003] converts an input audio signal into a frequency Supegairu signal, and take advantage of human auditory characteristics, but efficiently compressed code I spoon technique has many known, as one , spatial acoustic information (Spatial information) or, there is a technique for utilizing the information called auditory acoustic information (Binaural Cue). Examples of such techniques, there is a parametric stereo (PARAMET ric Stereo) scheme defined in an ISO international standard M PEG- 4 Audio (ISO / IEC 14496- 3) (see Non-Patent Document 1). Further, as another example, there is a method disclosed Te Patent Document 1 smell.

[0004] As a typical example, show the latter configuration in FIG. In this configuration, the information to be input is a monaural time signal 504, a BCC (Binaural Cue Coding) parameters 506. Monaural time signal 504 is typically a downmix signal of the original signal, for example, (L + R) / 2. TF conversion unit 501 converts the monaural time signal 504 to the frequency parameter 505. Type of transformation used is the Fourier transform, cosine transform or the like, may be used any known transformation. Here it will be described as using a fast Fourier transform (FFT).

[0005] Auditory Tekioto field generating means 502, of the monaural FFT coefficients 505 are input, performs processing based on spatial audio information provided by the BCC Bruno meter, FFT coefficients of the stereo representing the predetermined acoustic space 507 to generate. Inverse TF transform unit 503 inverse transform the FFT coefficients 50 7, and outputs a stereo time signal 508. Here, spatial audio information provided by the BCC parameters, ITD (Inter- aura Fireflys the Inter- channel Time Difference), ILD (Inter- aural also ί or Inter- channel Level Difference), ICC (Inter- aural Also ί or Inter- represented by channel Coherence). Generally, these spatial audio information is transmitted to FFT coefficients 507 in units of subband which is divided into a plurality of sections in the frequency direction, a signal having a Nozomu Tokoro characteristics in each sub-band are processed to be generated.

[0006] Auditory Tekioto field generating means 502, of the monaural FFT coefficient 505 to generate a stereo signal having a time difference Ts represented by the ITD. In general, with respect to monaural FFT coefficient 505, respectively + Ts / 2 and - realized by performing Ts / 2 of the delay or phase shift operation causes a time difference. Then, to reflect the inter-channel level difference indicated by the ILD, adjusting the gain of each channel.

[0007] Furthermore, ICC is, as shown in Patent Document 2, a parameter indicating the spread of the acoustic space. For example, in Patent Document 2, the gain difference between the channels is pseudo-randomly change, further, by adjusting accordingly the ICC transmitted the gain difference changes, a method of controlling the spread of the acoustic space is shown ing. Further, by adjusting the time difference indicated by the ITD accordance I CC, it is shown a method of controlling the spread of the acoustic space.

[0008] Thus, according to BCC parameters, the time difference between the channels, the stereo signal 508 which is generated is adjusted feeling Ri level differences and wide force S is the original stereo signal equivalent spatial acoustic characteristics as a reference will have, hearing on, the sound quality close to the original sound stereo signal can be realized.

Non-Patent Document l: ISO / IEC 14496-3: 2001 AMD2 "Parametric Coding for High Quality Audio

Patent Document 1: US Patent Publication No. US2003 / 0035553 "Backwards- compatible Perceptual Cod ing of Spatial Cues

Patent Document 2: US Patent Publication No. US2003 / 0219130 "Coherence- based Audio Coding and S ynthesis"

Disclosure of the Invention

Problems that the Invention is to you'll solve

While [0009] to force all spatial acoustic characteristics of the original signal, ITD, ILD, it is difficult to express in three parameters ICC. For example, transient components to the original sound signal, particularly when contains § tack sound, that exists in a narrow range of the attack sound power acoustic space, tight sound image of the or, or with a sense of spaciousness sound there is a need to represent the nano force.

[0010] Conventionally, as an index indicating the spread of the sound image, force the value scale has been used, called coherence (Coheren ce) typified by ICC is a measure of the correlation between channels, attack sound etc., relative to the original signal having a low correlation between channels, the reliability of the measure is lowered. For this problem, for example, ISO / IEC 14496- 3: The parametric stereo method disclosed in 2001 AMD2, detecting the transient of the signal in the decoder, for transient of the signal, the reverberation component amount to be added by adjusting a control is performed in the spread of the sound image.

While [0011] is the force, in this process, because the transience detection is performed with respect to the downmix signal, it is impossible to representable the transience of each channel needed to represent the original spatial acoustic characteristics. Moreover, it requires the detection processing to the decoder side, resulting a problem that requires the amount of calculation definitive playback device is increased.

[0012] The present invention has been made to solve the above problems, to solve the problems and to provide an audio code I spoon and decoding I 匕技 surgery can be more accurately represent the spatial sound characteristics for attack sound means for

[0013] Audio coding I spoon device of the present invention, the original sound signal of m channels (m is a natural number of 2 or more), n-channel (n is a natural number smaller than m) and a down-mix signal, the channel of the original signal an audio encoding apparatus for generating a spatial sound information signal representing the phase difference between, by downmixing the original signal, the downmix means for generating the downmix signal, analyzing the original signal by, together with the spatial audio information signal, the spatial audio information analysis means for generating transience information signal representing the transience degree indicating the transience of the original signal for each channel, the downmix signal, before Symbol spatial audio information signal, and a bit stream multiplexing means and outputting the multiplexing the transience information signals into a single bit stream. [0014] The audio decoder I spoon device of the present invention, Daunmi box of m channels (m is a natural number of 2 or more) (natural number smaller than the n m) n-channel obtained by downmixing an original sound signal from the signal, an audio decoding apparatus generates a decoded signal of the m channel based on the phase difference determined between the channels, transience information signal representative of the transience of case indicating the transience of the original signal for each channel a signal acquisition means for acquiring, from said Daunmitsu task signal, said decoded signal Te based! ヽ to said transience degree and the phase difference generated for each channel, signal generating means for outputting the generated decoded signal provided with a door.

[0015] Furthermore, the present invention provides an audio encoding device, and an audio decoding Nag just can be implemented as an apparatus or Rukoto be implemented as an audio transmission system composed of both forces, characteristic units of these devices comprises it is also possible to implement the processing to be executed audio coding method shall be the steps, and as the audio decoding I 匕方 method by. Effect of the invention

According to [0016] the present invention, the characteristic configuration called using transience degree of each channel representing the transience of the original signal, when the voice playback, the phase difference between the channels, correlation level difference such spatial sound information, can be selectively applied to each channel in accordance with the transience of speech. Therefore, the audio spatial audio information accurately determine it is difficult transitional speech (e.g., attack sound) to be, if indicated by the transience degree, spatial audio information is applied during the reproduction the degree by inhibiting that, the sound is reproduced according to inaccurate spatial sound information, the sound image of the resulting reproduced sound will spread against the intention, inconvenience is eliminated that.

BRIEF DESCRIPTION OF THE DRAWINGS

[0017] FIG 1 is a diagram showing an exemplary configuration of an audio transmission system according to Embodiment 1 of the present invention.

FIG. 2 is a diagram showing an example of the configuration of an audio decoding apparatus according to the first embodiment of the present invention

FIG. 3 is a diagram for explaining a smoothing I spoon processing of the phase shift amount.

FIG. 4 is FIG. 5 is a diagram illustrating a configuration example of an audio decoding apparatus according to the second embodiment of the present invention, an example of the configuration of an audio decoding apparatus according to the third embodiment of the present invention is a diagram illustrating 1-

FIG. 6 is a diagram illustrating a configuration example of a prior art audio decoding apparatus.

DESCRIPTION OF SYMBOLS

Encoding device

20 decoder

100 signal generation means

101 bit stream separating unit

102 core decoding means

103 FFT means

104 signal separating means

105 correlation control means

106 gain control means

107 IFFT means

108 input bit stream

109 Core bit stream

110 monaural PCM signal

111 monaural FFT coefficient

112 IPD parameters

113 TF parameters

114 ICC parameters

115 gain parameter

116 separated FFT coefficient

117 channel isolation FFT coefficients after correlation adjustment

118 output stereo FFT coefficient

119 output stereo PCM signal

201 frequency sub-bands 0

202 frequency subband 1 203 frequency subband 2

301 transience detection means

302 transience degree

303 TC parameters

401 second signal separating means

501 TF conversion means

502 hearing Tekioto field generating means

503 reverse TF conversion means

504 input monaural signal

505 Monoranore FFT coefficient

506 input BCC parameters

507 stereo FFT coefficient

508 output stereo signal

601 down-mixing means

602 core encoding means

603 spatial audio information analysis means

604 transience detection means

605 bit stream multiplexing means

BEST MODE FOR CARRYING OUT THE INVENTION

[0019] (Embodiment 1)

Figure 1 is a functional block diagram showing an example of a configuration of an audio transmission system according to a first embodiment of the present invention. The present audio transmission system, the stereo of the original signal L, and R, mono downmix signal M of, be an audio transmission system for transmitting represent to the spatial sound information signal representing the phase difference between the channels of at least the original signal , composed of O one Do encoder 10 and the audio decoder 20.

[0020] Audio transmission system of the present invention, particularly, the with the downmix signal and the spatial audio information signal, characterized in that transmitting the transience information signal representing the transience degree indicating the transience of the original signal for each channel attached. The technical significance of this transience degree will be described in detail later.

[0021] Here, if the original sound signal of the stereo is an example of a case m of the original signal of the m channels is 2, the downmix signal monaural, n downmix signal of the n-channel is Ru 1 der it is an example of.

[0022] Audio encoder 10 is a device for outputting the multiplexed down-mixed signal M, spatial sound-information signal, and the transience information signals into a single bit stream 108 obtained from the original signal L, R, downmixing unit 601, a core encoding unit 602, the spatial audio information analysis unit 603, is composed of transience detecting means 604 and bit stream multiplexing means 605,.

[0023] The downmix unit 601, the original signal L inputted to obtain a down-mixed signal M Te cowpea to downmixing R outputs to the core encoding means 602. Specifically, the dust Unmikkusu process, the original signal L, and obtains an average of R, it may be processing shall be the downmix signal M.

[0024] Core encoding means 602 to encode the downmix signal M inputted Thus obtained core bit stream and outputs it to the bit stream multiplexing means 605. The E down code, specifically, can be conducted by using conventional encoding techniques such as MPEG-4 AAC system.

[0025] spatial audio information analyzing unit 603, the original signal L, by analyzing the R, detects a phase difference between the channels of the original signal, the spatial audio information signal representing the detected phase difference, the bit stream multiplexing and outputs to means 605. Spatial audio information analyzing unit 603, in addition to the phase difference, detects a correlation and level differences between the channels of the original signal, the detected correlation and level difference may represent the spatial audio information signal with the phase difference . Such (a time difference mathematically synonymous) phase difference from the original signal, correlation, techniques for handling them in search of level differences as spatial audio information, as described in the Background section, is a known , a detailed description thereof will be omitted.

[0026] transience detecting means 604, the original signal L, to detect the transience of case showing the transience Te R respectively Nitsu ヽ. Specifically, the transience detecting means 604, for each channel of the original signal, in a time determined Me pre, for example by detecting a change value of the change value and the signal amplitude of the signal energy, the detected change value in advance binary information indicating a force whether it exceeds a defined threshold, or the detected change values ​​in the form of quantized number value information, Ru give transience degree. Then, the spatial audio information analysis means 603 outputs the transience information signal representing the transience degree of each detected channel in transience detecting means 604 into the bit stream multiplexing means 6 05.

[0027] transience degree of each channel, thus in addition, expressed as transience degree itself, in deviation of the form of each channel in relation to the standard (channel mean for example transience degree) transience degree force It represented also contemplated.

[0028] bit stream multiplexing means 605, core bit stream inputted, to obtain a bit stream 108 the acoustic spatial information signal, and the transience information signals by multiplexing, the bit stream 108, not shown communication line, broadcasting link, and outputs to the recording medium or the like.

[0029] Audio decoder 20, the illustrated such ヽ communication line, broadcasting link, from the recording medium or the like, to obtain a bit stream 108, the original signal L, the decoded signal L which simulates the R ', R' and an obtaining unit, and a bit stream separating unit 101, the core decoding unit 102, the signal generating means 10 0.

[0030] The bit stream separating unit 101 is an example of a signal acquisition means.

[0031] bit stream separating unit 101 obtains the bit stream 108, the bitstream 108 obtained, to multiplex separation core bit stream, the spatial audio information signal, and the transience information signal. Spatial audio information signal, as described above, represents the phase difference between the channels of at least the original signal, in addition to the phase difference, may further represent a correlation and level differences between the channels of the original signal.

[0032] core decoding unit 102 outputs to the signal generator 100 to obtain a down-mixed signal M by decoding the separated core bit stream. The core decoding unit 10 2 obtains a by connexion down-mixed signal M is also But Nag the inverse of the encoding by the core encoding unit 602 mentioned above refers.

[0033] signal generating means 100, based on the down-mixed signal M that is input, the phase difference, and determined circle transience degree from the separated transience information signals between channels is represented by the separated space environment signal Te, decoded signals L ', R' generates and outputs for each channel.

[0034] Hereinafter, while applying a functional detailed configuration and focus on processing performed therein of the signal generation means 100, the audio decoder I spoon device 20 continuously described.

[0035] FIG. 2 is a diagram illustrating a functional example of configuration of the audio decoding I spoon device 20 according to the first embodiment of the present invention. 2 shows the internal functional configuration of the signal generating means 100 is shown in detail. Signal generating means 100 is constituted from the FFT unit 103, signal separating means 104, the correlation control hand stage 105, gain control means 106 and the IFFT unit 107,.

[0036] Input bitstream 108, in the bit stream separating unit 101, is separated into a core bit string over beam 109, IPD Roh parameter 112, TF parameters 113, ICC parameters 114 and Gain parameters 115,. The core bitstream 109 is decoded processed Te core decoding unit 102 smell, mono PCM signal 110 is generated. Corresponding to the down-mixed signal M described in the monophonic PCM signal 110 forces the destination. For the core bitstream 109 and co Adekodo unit 102 is, for example, MPEG-4 AAC method, and the like, may be used any existing marks Goi 匕方 expression. Mono PCM signal 110 decoded is have you the FFT unit 103, it is converted to mono FFT coefficients 111.

[0037] IPD parameter 112 is a parameter representing the phase difference information (Inter- channel Phase Difference) between channels is ITD previously described mathematically synonymous. The IPD parameter 1 12, ICC parameters 114 and Gain parameters 115, is the table by spatial audio information signals are transmitted. Further, TF parameters 113 are parameters representing the transience degree described above, is transmitted represented by the transience information signal.

[0038] signal separating means 104, to the monaural FFT coefficients 111, by applying a separation process based on the IPD parameter 112 and TF parameters 113, and generates a separate FFT coefficients 116. Correlation control unit 105 for separating FFT coefficients 116, to adjust the correlation between Chiyane Le accordance ICC parameter 114. The operation of the signal separating unit 104 and the correlation control unit 105 will be described later in detail. FFT coefficients 117 adjusted inter-channel correlation is the gain adjustment in the gain control unit 106, a stereo FFT coefficients 118 are generated. Finally, the inverse FFT unit 107, inverse FFT converts the stereo FFT coefficients 118 to output the stereo PCM signal 119. [0039] Next, will be described in detail the operation of the signal separating means 104. IPD encoder (audio marks Goka device 10, a is an example of the encoder) is the phase difference information between the channels of the encoded original stereo signal is analyzed in. IPD In general, Te ヽ Tsu the FFT coefficients are divided into a plurality of sub-bands are encoded as a representative value for a plurality of FFT coefficients included in each subband. Therefore, the signal separation processing is performed in units of subband. In the following description, representative of the channel between the phase difference at the subbands 0. Signal separating means 104, a monaural FFT coefficient 111 is separated into two signals having a phase difference Θ between the channels. Examples of the separation process, of the monaural FFT coefficient 111 - two FFT coefficients with theta / 2 and + theta / 2 phase is calculated as follows.

[0040] [number 1]

[0041] Here, s is monaural FFT coefficient 111, k is served FFT coefficients shaped, N represents a point number of the FFT to be used. More inter-channel phase difference Θ is large, the stereo signal is generated, the acoustically more spread sound image. Stereo FFT coefficient d and the original monaural FFT coefficient si 16 are separated is input to the correlation control unit 105, according to ICC parameter 114, inter-channel-correlation is adjusted, generated stereo FFT coefficients 105 with the desired spaciousness It is. Adjustment of the correlation between Chiyane Le can be realized by adding or subtracting a separate stereo FFT coefficients and the original monaural FFT coefficients, for example, it is carried out according to the equation shown below.

[0042] [number 2] L '(k) = co ^ {a) sk) + sin ^^ (k)

R '(k) = cos ( a) s (k) + sin (a) d R (k)

a - A - arccos (/ CC)

[0043] Here, the ICC parameter value indicating the inter-channel correlation, the normalized cross-correlation value of, for example, both channel signals, -1 <= ICC <= 1 or, assuming 0 <= value of the ICC <= 1 . Further, A is a predetermined constant, for example, an A = 0.5. This process, the maximum correlation between the two channels indicated by the ICC parameter, i.e., when the ICC = 1, the correlation of the correlation adjusted signals between channels becomes a maximum, with the ICC value decreases, Chiya correlation channel correlation adjusted signal is also reduced.

While [0044] is a force, as mentioned problem above, if the input original stereo signal is transient signal, calculated in the encoder, the reliability of the ICC parameter transmitted decreases. To solve this problem, in the embodiment of the present invention, a TF parameters 113 representing the transience degree of the input signal. TF parameters 113, in the encoder, calculated for each channel of the input original stereo signal is transmitted. The measure to be used, Oh change value of the signal energy in Rakajime determined time frame, the signal amplitude can be used change values ​​or the like, binary information by the presence or absence of transience or the degree of transience It is transmitted as a number value information representing a plurality step. Further, TF parameters for each Chiyane Le, calculated as a value representative of the entire frequency band, or to transmit, or the frequency band is divided into a plurality of sub-bands, calculated for each sub-band, and transmits .

[0045] In the decoder, TF parameters separated in the bit stream separating unit 101

113 is input to the signal separation unit 104. (1) represented by the formula, signal separation processing in the signal separating unit 1 04, by using the TF parameters 113, for example, is modified as follows.

[0046] [number 3]

e R = e - (\ .o ~~ TF R)

[0047] Here, it indicates that the subscript subscript L, R is L-channel, respectively, the parameters of the R channel. TF parameters are transient of the signal is the code I spoon so that TF = 0. 0 when TF = 1.0, the minimum at the maximum. This process, based on the degree of transience of the input signal to each channel independently, it is possible to adjust the amount of phase shift. The closer to the amount of phase shift is 0 in transience channel, spaciousness is suppressed Erare as sound separation processing after the signal, required in the attack sound or the like to obtain a firm sound.

[0048] In addition, inter-channel phase difference Θ Nitsu, Te is high, relative to the signal of the frequency sub-bands, is not necessary to transmit coded. This is because the human hearing characteristic, for example, for a 2kHz about more sound, because the detection sensitivity with respect to the phase difference between the channels is reduced. However, although difficult the detection of the phase difference between the accurate channel, due to the fact that between the channels there is a phase difference, spaciousness of the sound can be detected still. Therefore, for the high-frequency signal wave number subband, randomly, or by applying the phase difference Θ between the predetermined channels in the sub-band unit, to be able to reproduce the spread of the sound image. Further, in the very low bit rate, there because that the amount of information that can be transmitted is very small, also be omitted entirely the transmission of inter-channel phase difference information by IPD parameter. In such a configuration, the phase difference between channels, and control is performed based on the analysis results in the encoder, Do, so, by performing the control of the phase shift amount by TF parameters introduced, greater effect on speech quality enhancement it is possible to obtain.

[0049] In the above description, contact within each frequency subband!, Te is the phase difference Θ between the channels is constant, by introducing a process of spoon smooth I the phase shift between adjacent frequency subbands it may be. Figure 3 is a diagram showing an example of the phase shift smoothness I spoon. 0 1 the phase shift amount of the frequency subbands 202, as 0 0, 0 2 respectively are the phase shift amount of the both sides of the sub-band 201, 203 adjacent the phase shift amount with respect to FFT coefficients included in each subband, based on its position on the frequency, from 1 theta 0 theta, to theta 2, it is controlled to change stepwise. By performing such a phase shift amount of smoothing I spoon, it is possible to prevent sound quality degradation caused by sudden phase change in sub-band boundaries. In the example shown in FIG. 3, the flat Namerai匕 is being done by linear interpolation, etc. curve interpolation by polynomial use in any interpolation method can Rukoto.

[0050] Further, contact the above description, Te is the inter-channel phase difference information by IPD parameter, L, this force being transmitted as a phase difference information between R Ji Yaneru, downmixed monophonic signal (M for expressed) in, L, the phase difference information theta LM and between the M-channel, R, it may be transmitted as a phase difference information theta RM between M channels. Signal separation processing in this case is, for example, can be realized as follows.

[0051] [number 4]

R = ^ - (io-r ^)

[0052] In addition, L, R, the phase difference, the phase difference between any two sets of channels (in the above example between the two channels arbitrarily selected from the three channels of the M, L, and the combination of M, R, combination of M) if it is transmitted, all can be reproduced. Thus, inter-channel phase Sajo report is any arbitrary two pairs of channel combinations Nitsu, it may also be transmitted Te.

[0053] FIG. 4 (Embodiment 2) is a diagram showing an example of the functional configuration of the audio decoding apparatus 20a according to a second embodiment of the present invention. Audio decoding apparatus 20a adds the first implementation mode of the audio decoding device 20 described above, the signal generating unit 100a, a new transient characteristic detection unit 301, also transience degree the instead of TF parameters 113 shown, the point was Unishi by the use of TC (transient channel) parameter 303 is transience channel information is different. Transience detecting unit 301 analyzes the output monaural PCM signal 110 of the core decoding unit 102, calculates the TF parameter 302 indicating the transience of the degree of signal. TC parameters 303 are calculated in the encoder, a value indicating the degree of relative transience between channels transmitted, for example, in the stereo signal, when the transient of the L channel is maximum, TC = -1.0 when transience of L and R channels are equal, TC = 0.0, when the transience of the R channel is maximum, taking the value of TC = 1.0. Using TC parameters, transience degree for each channel can be calculated as follows.

[0054] [number 5]

[0055] After the transience degree is calculated for each channel, the previously described (3) may be performed signal separation process based on the equation. According to this configuration, it is necessary to provide a new transience detected hand stage decoder, generally information Horyou the required TC parameters in order to transmit the code I spoon is code I and TF parameters for each channel spoon small compared with the amount of information required to transmit. Therefore, a smaller amount of information, it is possible to realize high quality coding transmission.

[0056] In the above description, the transience detecting unit 301 calculates the transience degree with mono PCM signal 110 which is output from the core decoding unit, but Ru, when the FFT transform unit 10 3 a configuration in which to use a change in monaural FFT coefficient 111 which is outputted to sequence and connexion may.

[0057] (Embodiment 3)

Figure 5 is a diagram showing a functional configuration of an audio decoding apparatus 20b according to a third embodiment of the present invention. Audio decoding apparatus 20b, to the first exemplary form status of the audio decoding device 20 described above, the signal generating unit 100b, deprived of the signal separation hand stage 104 and the correlation control unit 105, newly, a second signal separating means 401 is provided, or in the point which is adapted to enter an ICC parameter 114 to the second signal separator 401 differs. In this configuration, the signal separation processing and the inter-channel correlation control process is integrated, according TF parameters 113 and ICC parameters 114, are performed simultaneously. Processing in the second signal separation hand stage 401, depending on whether Chikaraina the inter-channel phase difference information by IPD parameter 112 is transmitted. This is because, in the phase difference and the inter-channel correlation co between channels, a measure of the degree of separation signals between channels, generally, inter-channel correlation between processed signals as with the correct inter-channel level phase difference It is consistent with the correlation between the desired channel. Thus, for the sub-band IPD parameter 112 it is transmitted to control the phase shift amount according to the phase difference information. IPD parameter is transmitted processed in, Ru frequency subband is expressed by the following equation.

[0058] [6]

f 2τάθ τ.

R '(k) = s (k) oxxi

,, W IN j

0 L = ^ - (l -O -T j

0 R =

[0059] In addition, IPD parameter is transmitted! A ヽ in the frequency subband from inter-channel correlation represented connexion by the ICC parameter, it is necessary to estimate the phase difference between the channels. Processing in the frequency subbands IPD Roh meter has not been transmitted is expressed by the following equation. [0060] [Equation 7]

2τ ± θ

L '(k) = xp L

S () e

IN Roh

0 L = a - (l .0 -TF L)

θ κ = a - (l.0- TF R)

a = A - arccos (lCC)

[0061] Here, a is the phase difference between the channels in which the channel correlation ICC force is also estimated. A is a constant which is determined because Ji rough force, for example, A = 1.0.

[0062] According to this arrangement, per cent in each frequency subband Te, depending on the type of the parameter to be transmitted, it is possible to perform appropriate signal separation processing, to improve the sound quality of the encoded signal Rukoto It can be.

[0063] In the example explained above, For a frequency subband IPD parameter is transmitted, without requiring an ICC parameter, it is possible to omit the transmission, a more low V ヽ amount of information, high it is possible to realize the sound quality coding transmission.

[0064] In addition, inter-channel phase difference information transmitted by the IPD parameter are the values ​​representing a certain frequency sub-bands may not be obtained correlation between the correct channel. Therefore, even in the process in the frequency subband IPD parameter is transmitted, it may be introduced correction terms by ICC parameter. The correction term performs ICC parameters thus correction as the amount of phase shift increases as the inter-channel correlation is large enough phase shift amount is small instrument channel correlation force S decreases represented, for example, (6) in the process shown may be corrected as follows.

[0065] [number 8] 2nk6j ヽ

L '(k) = (;) exp |

IN Roh

[0066] In the description of the embodiments 1, 2 and 3 configuration of the present invention, instead of the force FFT using an example of performing decrypt processing using the FFT coefficients, DCT, filter bank or the like, known be used squid becomes time-frequency transform, it is easy to implement a similar configuration.

[0067] In the description of the embodiments 1, 2 and 3 configuration of the present invention, for the stereo original signal, a mono signal that is down-mixed, the spatial acoustic parameters for representing a stereo acoustic space code feed I 匕伝 force configuration has been described to decode the stereo signal on the basis of the information transmitted this configuration is also applicable for multi-channel original signal of the number of any channel. That is, for the input original signal of the n-channel, generating a downmix signal m channel to be m rather n, encoded and transmitted as a core bitstream. For the remaining nm channel, and represented in the spatial acoustic parameters, transmission coding. In the decoder decodes the signals of m channels in the core decoding unit, connection V Te, be generated a signal line ヽ nm channel decoding according to the configuration of the present invention, signals of the same n-channel as the input original signal it can be generated.

[0068] (Summary)

As described above, the audio coding apparatus of the present invention, an audio decoding apparatus, and according to both force becomes audio transmission system, the characteristic configuration of using a transience degree of each channel of the original signal, the phase difference between the channels, correlation, and level differences! /, One has conventionally force also used, Ru spatial audio information accurately determine that the flame, transient specific voice (for example, attack sound) selectively for the degree of spatial sound-information when the reproduction is applied it is possible to suppress. [0069] This Yotsute, originally narrow!, Is reproduced in accordance with the transient voice is inaccurate spatial sound information to the sound image is expected, would spread the sound image of the result of the reproduced sound, inconvenience is eliminated that that.

[0070] (Modification)

The present invention forces the invention has been described based on the embodiment, it is of course not limited to the embodiments described above. The following cases are also included in the present invention.

[0071] The present invention is an audio coding method for audio encoding apparatus described above, and the processing that will be performed by only Nag characteristic units included in these apparatuses may be realized as an audio decoding apparatus as step and Chide monkey be realized as an audio decoding method.

[0072] For example, the following three steps performed in the audio code I spoon device 10,

(1) the original signal by downmixing, Daunmi Tsu box generating a downmix signal,

(2) by analyzing the original signal, together with spatial audio information signal representing the phase difference between the channels of the original signal, transience information signal representing the transience degree indicating the transience of the original sound signal every Chiyane Le spatial audio information analysis step generates the and,

(3) the downmix signal, the spatial audio information signal, and a signal output step of outputting the transience information signal

Audio encoding method, are included in the present invention.

[0073] Further, for example, the following two steps performed in the audio decoder spoon device 20,

(1) the signal acquisition step of acquiring transience information signals, and

(2) from said down-mix signal, the audio decoding comprising a signal generating step of said generating a phase difference between the decrypt signal Te based! ヽ said to the transience degree for each channel, and outputs the generated decoded signal methods are included in the present invention.

[0074] Further, to perform using these methods computer, computer executable program, and also a program recording medium storing the program also included in the present invention.

INDUSTRIAL APPLICABILITY The present invention can transmit the audio signal of the stereo or multi-channel low ヽ amount of information with high quality, a technique that makes it possible to reproduce, broadcast, communication, internet including music, etc. in, and enables high-quality services with less bandwidth, also, CD, DVD, the media such as a hard disk is one that enables the recording and preserving a longer high quality audio signal.

Claims

The scope of the claims
[1] m channel (m is a natural number of 2 or more) from the down-mix signal n Chiyane Le obtained by downmixing original sound signal (n is a natural number smaller than m), and based on the phase difference between the channels an audio decoding apparatus generates a decoded signal of the m channels,
A signal acquisition means for acquiring table to the transience information signals for each channel transience degree indicating the magnitude of the transients contained in the original signal,
Audio, characterized in that it comprises from the downmix signal, and the phase difference, the decrypt signals based on said transience degree was generated for each channel, and a signal generating means for outputting the generated decoded signal decryption spoon apparatus.
[2] the transience information signal have been multiplexed into one bit stream along with spatial audio information signal representing the downmix signal and the phase difference,
The signal acquisition unit acquires the bit stream, the obtained bit stream or, et al., The downmix signal, the spatial audio information signal, and the transience information signal demultiplexes
It said signal generating means, transience degree of the space to calculate the phase shift amount for each Chiyane Le from the phase difference represented by the acoustic information signals, represented connexion by the phase shift amount calculated in the transient property information signal is corrected according to the, by providing the phase shift amount corrected in the Daunmitsu task signal, to generate the decoded signal
Audio decoder spoon device of claim 1, wherein the.
[3] the transience information signal represents the transience degree of each channel of the original signal
Audio decoder spoon device of claim 2, characterized in that.
[4] the transience information signal represents the channel your capital deviations of the original signal from the transience degree as a reference
Audio decoder spoon device of claim 2, characterized in that.
[5] The transience information signal represents a deviation from the mean of the transience degree,
Said signal generating means includes a wherein said calculated from the down-mix signal the average of the transience degree, determine the transience degree of each channel from the deviation represented calculated mean and by the transience information signal audio decoder spoon device of claim 4,.
[6] The phase difference is defined for each of the plurality of representative frequency,
Said signal generating means, the phase difference and calculates a representative phase shift amount for each of the representative frequency using said transience degree, calculated representative phase shift smoothness I spoon thing Accordingly plurality of frequencies for calculating the phase shift amount, by providing a phase shift of the calculated amount for each of the frequency to the down-mix signal, and generates the decoded signal
Audio decoder spoon device of claim 1, wherein the.
[7] The phase difference, the force is the phase difference between the channels of the original signal, or is the phase difference defined between each channel of the decoded signal or random depending on the sound that will be intended
Audio decoder spoon device of claim 1, wherein the.
[8] The signal generating means is further based on the correlation defined between the channels, and adjusting the correlation between the respective channels of the raw made the decoded signal, and outputs the decoded signal after correlation adjustment
Audio decoder spoon device of claim 1, wherein the.
[9] The signal generating unit calculates the phase shift amount in each channel using said correlation and the phase difference, depending calculated phase shift amount in the transience degree represented by the transience information signal correction Te, and I the phase shift amount of the corrected child stranded 〖be given to the downmix signal, performs a correlation adjustment with the phase shift imparted to generate and output a decoded signal which is correlated adjustment has been subjected
Audio decoder spoon device of claim 8, characterized in that.
[10] The signal generating means, further wherein adjusting the gain of each channel of the decoded signal after correlation adjustment, and outputs the decoded signal after the gain adjustment
Audio decoder spoon device of claim 8, characterized in that.
[11] The m is 2, wherein n is 1
Audio decoder spoon device of claim 1, wherein the.
[12] from the original sound signal of m channels (m is a natural number of 2 or more), n-channel (n is a natural number smaller than m) spatial sound-information representing a downmix signal, the phase difference between the channels of the original signal an audio encoding apparatus for generating a signal,
By downmixing the original signal, the downmix means for generating the down-mix signal,
By analyzing the original signal, together with the spatial audio information signal, the spatial audio information analysis means for generating a transient Seijo report signal representative of the transience degree indicating the magnitude of the transients contained in the original signal for each channel ,
The downmix signal and the spatial audio information signal, and the bit stream multiplexing means and outputting the multiplexing the transience information signal to one of the bit streams
Audio code spoon apparatus comprising: a.
The original signal of the m channels (m is a natural number of 2 or more), n-channel (n is a small natural number than m) and a down-mix signal, and table and spatial sound information signal representing the phase difference between the channels an audio transmission system for transmitting,
And a audio coding device and audio decoding device,
The audio encoding apparatus,
By downmixing the original signal, the downmix means for generating the down-mix signal,
By analyzing the original signal, together with the spatial audio information signal, the spatial audio information analysis means for generating a transient Seijo report signal representative of the transience degree indicating the magnitude of the transients contained in the original signal for each channel ,
The downmix signal and the spatial audio information signal, and the bit stream multiplexing means and outputting the multiplexing the transience information signal to one of the bit streams
Equipped with a,
The audio decoding apparatus,
It acquires the bit stream, from the obtained bit stream, the downmix signal, and the spatial audio information signal, and the signal acquisition means for demultiplexing the transience information signal,
Wherein the downmix signal, wherein the Te spatial audio information signal based ヽ to the transience degree obtained from the phase difference and the prior SL transience information signal represented by, and generates a decoded signal of the m channels per channel, generated and signal generating means for outputting a decoded signal
Audio transmission system comprising: a.
[14] m channel (m is a natural number of 2 or more) from the down-mix signal n Chiyane Le obtained by downmixing original sound signal (n is a natural number smaller than m), and based on the phase difference between the channels an audio decoding it 匕方 method for generating a decoded signal of the m channels,
A signal acquisition step of acquiring table to transience information signals transience degree for each channel indicative of the magnitude of the transients contained in the original signal,
From the downmix signal, said decoded signal Te based! ヽ to said transience degree and the phase difference generated in each channel, characterized in that it comprises a signal generation step of outputting the generated decoded signal audio decoding 匕方 method.
[15] from the original sound signal of m channels (m is a natural number of 2 or more), n-channel (n is a natural number smaller than m) spatial sound-information representing a downmix signal, the phase difference between the channels of the original signal an audio encoding method for generating a signal,
By downmixing the original signal, the downmix step of generating the down-mix signal,
By analyzing the original signal, together with the spatial audio information signal, the spatial audio information analysis step of generating a transient Seijo report signal representative of the transience degree indicating the magnitude of the transients contained in the original signal for each channel ,
The downmix signal, the spatial audio information signal, and a signal output step of output the said transience information signal
Audio encoding I 匕方 method characterized by comprising.
[16] from m channels (m is a natural number of 2 or more) down-mix signal n Chiyane Le obtained by downmixing original sound signal (n is a natural number smaller than m), based V to the phase difference between the channels , for generating a decoded signal of the m channels Te, a computer executable program,
An acquisition step of acquiring the table to the transience information signals for each channel transience degree indicating the magnitude of the transients contained in the original signal, from said down-mix signal, the phase difference between the transience degree and based on Dzu! generates the decoded signal for each channel Te ヽ, program characterized by executing a signal generation step of outputting the generated decoded signal to the computer.
[17] from the original sound signal of m channels (m is a natural number of 2 or more), n-channel (n is a natural number smaller than m) spatial sound-information representing a downmix signal, the phase difference between the channels of the original signal for generating a signal, a computer executable program, by downmixing the original signal, the downmix step of generating the down-mix signal,
By analyzing the original signal, together with the spatial audio information signal, the spatial audio information analysis step of generating a transient Seijo report signal representative of the transience degree indicating the magnitude of the transients contained in the original signal for each channel ,
The downmix signal, the spatial audio information signal, and a signal output step of output the said transience information signal
A program characterized by causing a computer to execute the.
[18] Claim 15 and a computer-readable recording medium characterized by storing a program according to at least one of claims 16.
PCT/JP2005/011340 2004-07-02 2005-06-21 Audio encoding and decoding apparatus WO2006003813A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2004197339 2004-07-02
JP2004-197339 2004-07-02

Publications (1)

Publication Number Publication Date
WO2006003813A1 true true WO2006003813A1 (en) 2006-01-12

Family

ID=35782624

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2005/011340 WO2006003813A1 (en) 2004-07-02 2005-06-21 Audio encoding and decoding apparatus

Country Status (1)

Country Link
WO (1) WO2006003813A1 (en)

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007116809A1 (en) * 2006-03-31 2007-10-18 Matsushita Electric Industrial Co., Ltd. Stereo audio encoding device, stereo audio decoding device, and method thereof
JP2009524338A (en) * 2006-01-19 2009-06-25 エルジー エレクトロニクス インコーポレイティド Processing method and apparatus of a media signal
JP2009543112A (en) * 2006-06-29 2009-12-03 エヌエックスピー ビー ヴィNxp B.V. Decoding the speech parameters
WO2010036059A2 (en) * 2008-09-25 2010-04-01 Lg Electronics Inc. A method and an apparatus for processing a signal
JP2010521909A (en) * 2007-03-21 2010-06-24 フラウンホファー・ゲゼルシャフト・ツール・フォルデルング・デル・アンゲバンテン・フォルシュング・アインゲトラーゲネル・フェライン Method and apparatus for enhancing the reproduction of sound
JP2011509429A (en) * 2008-01-01 2011-03-24 エルジー エレクトロニクス インコーポレイティド Signal processing method and apparatus
KR101108061B1 (en) 2008-09-25 2012-01-25 엘지전자 주식회사 A method and an apparatus for processing a signal
EP2467851A2 (en) * 2009-08-18 2012-06-27 Samsung Electronics Co., Ltd. Multi-channel audio decoding method and apparatus therefor
US8258849B2 (en) 2008-09-25 2012-09-04 Lg Electronics Inc. Method and an apparatus for processing a signal
US8290167B2 (en) 2007-03-21 2012-10-16 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for conversion between multi-channel audio formats
US8346380B2 (en) 2008-09-25 2013-01-01 Lg Electronics Inc. Method and an apparatus for processing a signal
EP2612322A1 (en) * 2010-10-05 2013-07-10 Huawei Technologies Co., Ltd. Method and apparatus for encoding/decoding multichannel audio signal
JP5340378B2 (en) * 2009-02-26 2013-11-13 パナソニック株式会社 Channel signal generating device, an acoustic signal encoding apparatus, acoustic signal decoding apparatus, the audio signal coding method and an acoustic signal decoding method
US8908873B2 (en) 2007-03-21 2014-12-09 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for conversion between multi-channel audio formats
US8917874B2 (en) 2005-05-26 2014-12-23 Lg Electronics Inc. Method and apparatus for decoding an audio signal
US9015051B2 (en) 2007-03-21 2015-04-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Reconstruction of audio channels with direction parameters indicating direction of origin
US9595267B2 (en) 2005-05-26 2017-03-14 Lg Electronics Inc. Method and apparatus for decoding an audio signal
US9626976B2 (en) 2006-02-07 2017-04-18 Lg Electronics Inc. Apparatus and method for encoding/decoding signal

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002237143A (en) * 1997-03-07 2002-08-23 Victor Co Of Japan Ltd Encoding method and decoding method of audio signal
JP2003263864A (en) * 1997-11-28 2003-09-19 Victor Co Of Japan Ltd Method for encoding and decoding audio signal
JP2003303487A (en) * 1997-11-28 2003-10-24 Victor Co Of Japan Ltd Audio disk and decoding device for audio signal

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002237143A (en) * 1997-03-07 2002-08-23 Victor Co Of Japan Ltd Encoding method and decoding method of audio signal
JP2003263864A (en) * 1997-11-28 2003-09-19 Victor Co Of Japan Ltd Method for encoding and decoding audio signal
JP2003303487A (en) * 1997-11-28 2003-10-24 Victor Co Of Japan Ltd Audio disk and decoding device for audio signal

Cited By (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9595267B2 (en) 2005-05-26 2017-03-14 Lg Electronics Inc. Method and apparatus for decoding an audio signal
US8917874B2 (en) 2005-05-26 2014-12-23 Lg Electronics Inc. Method and apparatus for decoding an audio signal
JP4787331B2 (en) * 2006-01-19 2011-10-05 エルジー エレクトロニクス インコーポレイティド Processing method and apparatus of a media signal
JP2009524338A (en) * 2006-01-19 2009-06-25 エルジー エレクトロニクス インコーポレイティド Processing method and apparatus of a media signal
JP2009524339A (en) * 2006-01-19 2009-06-25 エルジー エレクトロニクス インコーポレイティド Processing method and apparatus of a media signal
JP2009524341A (en) * 2006-01-19 2009-06-25 エルジー エレクトロニクス インコーポレイティド Processing method and apparatus of a media signal
JP4806031B2 (en) * 2006-01-19 2011-11-02 エルジー エレクトロニクス インコーポレイティド Processing method and apparatus of a media signal
JP4801174B2 (en) * 2006-01-19 2011-10-26 エルジー エレクトロニクス インコーポレイティド Processing method and apparatus of a media signal
US9626976B2 (en) 2006-02-07 2017-04-18 Lg Electronics Inc. Apparatus and method for encoding/decoding signal
WO2007116809A1 (en) * 2006-03-31 2007-10-18 Matsushita Electric Industrial Co., Ltd. Stereo audio encoding device, stereo audio decoding device, and method thereof
JP2009543112A (en) * 2006-06-29 2009-12-03 エヌエックスピー ビー ヴィNxp B.V. Decoding the speech parameters
JP2010521909A (en) * 2007-03-21 2010-06-24 フラウンホファー・ゲゼルシャフト・ツール・フォルデルング・デル・アンゲバンテン・フォルシュング・アインゲトラーゲネル・フェライン Method and apparatus for enhancing the reproduction of sound
US9015051B2 (en) 2007-03-21 2015-04-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Reconstruction of audio channels with direction parameters indicating direction of origin
US8908873B2 (en) 2007-03-21 2014-12-09 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for conversion between multi-channel audio formats
US8290167B2 (en) 2007-03-21 2012-10-16 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for conversion between multi-channel audio formats
US8483411B2 (en) 2008-01-01 2013-07-09 Lg Electronics Inc. Method and an apparatus for processing a signal
JP2011509429A (en) * 2008-01-01 2011-03-24 エルジー エレクトロニクス インコーポレイティド Signal processing method and apparatus
WO2010036059A3 (en) * 2008-09-25 2010-07-22 Lg Electronics Inc. A method and an apparatus for processing a signal
CN102165520B (en) 2008-09-25 2012-11-28 Lg电子株式会社 A method and an apparatus for processing a signal
US8346380B2 (en) 2008-09-25 2013-01-01 Lg Electronics Inc. Method and an apparatus for processing a signal
US8346379B2 (en) 2008-09-25 2013-01-01 Lg Electronics Inc. Method and an apparatus for processing a signal
KR101108061B1 (en) 2008-09-25 2012-01-25 엘지전자 주식회사 A method and an apparatus for processing a signal
JP2012503791A (en) * 2008-09-25 2012-02-09 エルジー エレクトロニクス インコーポレイティド Signal processing method and apparatus
WO2010036059A2 (en) * 2008-09-25 2010-04-01 Lg Electronics Inc. A method and an apparatus for processing a signal
JP2012503792A (en) * 2008-09-25 2012-02-09 エルジー エレクトロニクス インコーポレイティド Signal processing method and apparatus
US8258849B2 (en) 2008-09-25 2012-09-04 Lg Electronics Inc. Method and an apparatus for processing a signal
JP5340378B2 (en) * 2009-02-26 2013-11-13 パナソニック株式会社 Channel signal generating device, an acoustic signal encoding apparatus, acoustic signal decoding apparatus, the audio signal coding method and an acoustic signal decoding method
US9053701B2 (en) 2009-02-26 2015-06-09 Panasonic Intellectual Property Corporation Of America Channel signal generation device, acoustic signal encoding device, acoustic signal decoding device, acoustic signal encoding method, and acoustic signal decoding method
EP2467851A4 (en) * 2009-08-18 2014-01-08 Samsung Electronics Co Ltd Multi-channel audio decoding method and apparatus therefor
JP2013502607A (en) * 2009-08-18 2013-01-24 サムスン エレクトロニクス カンパニー リミテッド Multi-channel audio decoding method and apparatus
EP2467851A2 (en) * 2009-08-18 2012-06-27 Samsung Electronics Co., Ltd. Multi-channel audio decoding method and apparatus therefor
EP2612322A4 (en) * 2010-10-05 2013-08-14 Huawei Tech Co Ltd Method and apparatus for encoding/decoding multichannel audio signal
EP2612322A1 (en) * 2010-10-05 2013-07-10 Huawei Technologies Co., Ltd. Method and apparatus for encoding/decoding multichannel audio signal
CN103262159A (en) * 2010-10-05 2013-08-21 华为技术有限公司 Method and apparatus for encoding/decoding multichannel audio signal

Similar Documents

Publication Publication Date Title
EP1376538A1 (en) Hybrid multi-channel/cue coding/decoding of audio signals
Faller Coding of spatial audio compatible with different playback formats
US20070081597A1 (en) Temporal and spatial shaping of multi-channel audio signals
EP1691348A1 (en) Parametric joint-coding of audio sources
US20060153408A1 (en) Compact side information for parametric coding of spatial audio
US7761304B2 (en) Synchronizing parametric coding of spatial audio with externally provided downmix
US20090210238A1 (en) Methods and Apparatuses for Encoding and Decoding Object-Based Audio Signals
US7181019B2 (en) Audio coding
US7720230B2 (en) Individual channel shaping for BCC schemes and the like
US7916873B2 (en) Stereo compatible multi-channel audio coding
US20070244706A1 (en) Audio Signal Encoder and Audio Signal Decoder
US20080013614A1 (en) Device and method for generating a data stream and for generating a multi-channel representation
US20060085200A1 (en) Diffuse sound shaping for BCC schemes and the like
US7979282B2 (en) Methods and apparatuses for encoding and decoding object-based audio signals
US20110046964A1 (en) Method and apparatus for encoding multi-channel audio signal and method and apparatus for decoding multi-channel audio signal
US20120314876A1 (en) Apparatus and method for extracting a direct/ambience signal from a downmix signal and spatial parametric information
US8255211B2 (en) Temporal envelope shaping for spatial audio coding using frequency domain wiener filtering
US20110173005A1 (en) Efficient Use of Phase Information in Audio Encoding and Decoding
US7542896B2 (en) Audio coding/decoding with spatial parameters and non-uniform segmentation for transients
Faller et al. Binaural cue coding applied to stereo and multi-channel audio compression
US20100114582A1 (en) Apparatus and method for coding and decoding multi-object audio signal with various channel including information bitstream conversion
US20090010440A1 (en) Apparatus and Method for Encoding/Decoding Signal
US20090182564A1 (en) Apparatus and method for visualization of multichannel audio signals
WO2005098826A1 (en) Method, device, encoder apparatus, decoder apparatus and audio system
KR100754220B1 (en) Binaural decoder for spatial stereo sound and method for decoding thereof

Legal Events

Date Code Title Description
AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase in:

Ref country code: DE

WWW Wipo information: withdrawn in national office

Country of ref document: DE

122 Ep: pct app. not ent. europ. phase
NENP Non-entry into the national phase in:

Ref country code: JP