EP1057292A1 - A fast frequency transformation techique for transform audio coders - Google Patents

A fast frequency transformation techique for transform audio coders

Info

Publication number
EP1057292A1
EP1057292A1 EP98909964A EP98909964A EP1057292A1 EP 1057292 A1 EP1057292 A1 EP 1057292A1 EP 98909964 A EP98909964 A EP 98909964A EP 98909964 A EP98909964 A EP 98909964A EP 1057292 A1 EP1057292 A1 EP 1057292A1
Authority
EP
European Patent Office
Prior art keywords
sequence
transform coefficient
samples
transform
complex
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP98909964A
Other languages
German (de)
French (fr)
Other versions
EP1057292B1 (en
Inventor
Mohammed Javed Absar
Sapna George
Antonio Mario Alvarez-Tinoco
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
STMicroelectronics Asia Pacific Pte Ltd
Original Assignee
SGS Thomson Microelectronics Pte Ltd
SGS Thomson Microelectronics Asia Pacific Pte Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SGS Thomson Microelectronics Pte Ltd, SGS Thomson Microelectronics Asia Pacific Pte Ltd filed Critical SGS Thomson Microelectronics Pte Ltd
Publication of EP1057292A1 publication Critical patent/EP1057292A1/en
Application granted granted Critical
Publication of EP1057292B1 publication Critical patent/EP1057292B1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H20/00Arrangements for broadcast or for distribution combined with broadcast
    • H04H20/86Arrangements characterised by the broadcast information itself
    • H04H20/88Stereophonic broadcast systems
    • H04H20/89Stereophonic broadcast systems using three or more audio channels, e.g. triphonic or quadraphonic

Definitions

  • This invention is applicable in the field of multi-channel audio coders which use modified discrete cosine transform as a step in the compression of audio signals.
  • the amount of information required to represent the audio signals may be reduced.
  • the amount of digital information needed to accurately reproduce the original pulse code modulation (PCM) samples may be reduced by applying a digital compression algorithm, resulting in a digitally compressed representation of the original signal.
  • the goal of the digital compression algorithm is to produce a digital representation of an audio signal which, when decoded and reproduced, sounds the same as the original signal, while using a minimum of digital information for the compressed or encoded representation.
  • the time domain audio signal is first converted to the frequency domain using a bank of filters.
  • the frequency domain coefficients are converted to fixed point representation.
  • each coefficient is represented as a mantissa and an exponent.
  • the bulk of the compressed bitstream transmitted to the decoder comprises these exponents and mantissas. - 2 -
  • each mantissa must be truncated to a fixed or variable number of decimal places.
  • the number of bits to be used for coding each mantissa is obtained from a bit allocation algorithm which may be based on the masking property of the human auditory system. Lower numbers of bits result in higher compression ratios because less space is required to transmit the coefficients. However, this may cause high quantization errors, leading to audible distortion.
  • a good distribution of available bits to each mantissa forms the core of the advanced audio coders.
  • the frequency transformation phase has one of the greatest computation requirements in a transform coder. Therefore, an efficient implementation of this phase can decrease the computation requirement of the system significantly and make real time operation of the encoder more easily attainable.
  • the frequency domain transformation of signals is performed by the modified discrete cosine transform (MDCT).
  • MDCT discrete cosine transform
  • the MDCT requires OfiV 2 ) additions and multiplications.
  • FFT Fast Fourier Transform
  • a method for coding audio data comprising a sequence of digital audio samples, including d e steps of: i) multiplying the input samples with a first trigonometric function factor to generate an intermediate sample sequence; ii) computing a fast Fourier transform of the intermediate sample sequence to generate a Fourier transform coefficient sequence; - 3 - iii) for each transform coefficient in the sequence, multiplying die real and imaginary components of e transform coefficient by respective second trigonometric function factors, adding the multiplied real and imaginary transform coefficient components to generate an addition stream coefficient, and subtracting me multiplied real and imaginary transform coefficient components to generate a subtraction stream coefficient; iv) multiplying the addition and subtraction stream coefficients with respective third trigonometric function factors; and v) subtracting the corresponding multiplied addition and subtraction stream coefficients to generate audio coded frequency domain coefficients.
  • the present invention also provides a method for coding audio data, including the steps of: combining first and second sequences of digital audio samples from first and second audio channels into a single complex sample sequence; determining a Fourier transform coefficient sequence as defined above; generating first and second transform coefficient sequences by combining and/or differencing first and second selected transform coefficients from said Fourier transform coefficient sequence; and for each of me first and second transform coefficient sequences, generating audio coded frequency domain coefficients as defined above, so as to generate respective sequences of said audio coded frequency domain coefficients for the first and second audio channels.
  • the present invention also provides a method for coding audio data including d e steps of: obtaining at least one input sequence of digital audio samples; pre-processing die input sequence samples including applying a pre-multiplication factor to obtain modified input sequence samples; transforming e modified input sequence samples into a transform coefficient sequence utilising a fast Fourier transform; and post-processing the sequence of transform coefficients including applying first post- - 4 - multiplication factors to the real and imaginary coefficient components, differencing and combining die post-multiplied real and imaginary components, applying second post- multiplication factors to die difference and combination results, and differencing to obtain a sequence of modified discrete cosine transform coefficients representing said input sequence of digital audio samples.
  • the present invention also provides a method for coding audio data including die steps of: obtaining first and second input sequences of digital audio samples corresponding to respective first and second audio channels; combining me first and second input sequences of digital audio samples into a single complex input sample sequence; pre-processing the complex input sequence samples including applying a pre- multiplication factor to obtain modified complex input sequence samples; transforming die modified complex input sequence samples into a complex transform coefficient sequence utilising a fast Fourier transform; and post-processing the sequence of complex transform coefficients to obtain first and second sequences of audio coded frequency domain coefficients corresponding to me first and second audio channels including, for each corresponding frequency domain coefficient in d e first and second sequences, selecting first and second complex transform coefficients from said sequence of complex transform coefficients, combining the first complex transform coefficient and die complex conjugate of me second complex transform coefficient for said first channel and differencing die first complex transform coefficient and the complex conjugate of the second complex transform coefficient for said second channel, and applying respective post-multiplication factors to the combination and difference to obtain said audio coded frequency domain coefficient
  • G k is a transform coefficient sequence for the first channel
  • G' k is a transform coefficient sequence for the second channel; g Kr and g u are the real and imaginary transform coefficient components of G k ; g' k r and g' k ! are the real and imaginary transform coefficient components of G' k ;
  • the modified discrete cosine transform equation can be expressed as - 6 -
  • xfnj is the input sequence for a channel and N is die transform length.
  • X k cosy *(g kr cos( ⁇ (k+l/2 N)-g k ⁇ sm( ⁇ (k+ ⁇ /2)/N)) -smy *(g kr sin( ⁇ (k+l/2)/N)+g k ⁇ cos( ⁇ (k+V2)/N)) g kr ,g kl e 3t(set of real numbers)
  • n N- ⁇ where G k - g kr + jg k y ⁇ ( ⁇ [n ⁇ e j ⁇ nlN )*e j2 ⁇ " m .
  • Figure 1 is a diagrammatic representation of a stream of audio data and die substructure arrangement thereof;
  • Figure 2 is a functional block diagram of a digital audio encoder
  • Figure 3 is a functional block diagram of a system for encoding a single audio channel
  • Figure 4 is a functional block diagram of a system for encoding a pair of audio channels.
  • the input to an audio coder comprises a stream of digitised samples of the time domain analog signal.
  • the stream consists of interleaved samples for each channel.
  • the input stream is sectioned into blocks, each block containing N consecutive samples of each channel (see Fig. 1).
  • N samples of a channel form a sequence ⁇ x[0], x[l], x[2], ... , x[N-l] ⁇ .
  • the time domain samples are next converted to die frequency domain using an analysis filter bank (see Fig. 2).
  • the frequency domain coefficients, thus generated, form a coefficient set which can be identified as (X 0 , X ⁇ , X 2 , ..., X N/2 ⁇ ). Since die signal is real only the first Nil frequency components are considered.
  • X 0 is the lowest frequency (DC) component while X N/2 _, is the highest frequency component of the signal.
  • DC lowest frequency
  • X N/2 _ is the highest frequency component of the signal.
  • Audio compression essentially entails finding how much of the information in the set (X 0 , X_, X 2 , .... X N/2 ._) is necessary to reproduce the original analog signal at the decoder with minimal audible distortion.
  • the coefficient set is normally converted into floating point format, where each coefficient is represented by an exponent and mantissa.
  • the exponent set is usually transmitted in its original form.
  • the mantissa is truncated to a fixed or variable number of decimal places.
  • the value of number of bits for coding a mantissa is usually obtained from a bit allocation algorithm which for advanced psychoacoustic coders may be based on the masking property of the human auditory system.
  • a low number of bits results in high compression ratio because less space is required to transmit the coefficients. However this causes very high quantization error leading to audible distortion.
  • a good distribution of available bits to each mantissa forms the core of the most advanced encoders.
  • d e frequency domain transformation of signals is performed by die (MDCT) modified discrete cosine transform (Eq. 1).
  • die MDCT requires O ⁇ N 2 ) additions and multiplications.
  • G k g kr + jg i is computed in O(NogN) operation by use of FFT algorithms.
  • the additional operation outlined in Eq. 16 to extract the final X k is only of order 9( ⁇ - Therefore the MDCT can now be computed in 0(Mog 2 N) time.
  • the operations required to obtain die MDCT are illustrated in Fig. 3.
  • die multi-channel encoder is required to process m audio channels. Instead of computing an FFT for each channel as described in die previous section, it is possible to further reduce the computational requirement of the coder by combining two channels and using a single FFT only. In effect, instead of m FFTs only m/2 FFTS need to be computed.
  • DFT for any two channels can be computed with only one FFT block by considering the input as a complex number.
  • the real part is formed from the sequence for any one channel and the imaginary part is from data of another channel. After the Fourier Transform is computed for the resulting complex variable, the resulting transform for each channel can be easily retrieved.
  • the input data to the FFT block is actually a complex number (formed by multiplying the real data by complex variable e ' m/N ).
  • e ' m/N complex variable
  • using some processing after the FFT one can still compute the DFT of two channel using a single FFT block. -13-
  • the frequency transform lengtii N is decided by die encoder based on temporal and spectral resolution requirements.
  • the input signal is usually analysed witii a high frequency bandpass filter to detect die presence of transients. This information is used to adjust die block lengtii, restricting quantization noise associated witii die transient within a small temporal region about die transient, avoiding temporal masking.
  • two short transform of length N/2 each are taken.
  • a single long transform of length N is used, thus providing higher spectral resolution.
  • a short transform is required for restricting quantization noise associated with die transient within a small temporal region about the transient, avoiding temporal masking.
  • a long transform gives slight better frequency resolution but die error is not much compared to die case when in die presence of transient a long transform is utilised. Forcing a long transform onto a channel in the presence of transient leads to greater distortion in die final produced music. This conjecture was proven true by experimental studies on benchmark music streams.
  • x[n] Before d e time domain signal x[n] is transformed to die frequency domain, a windowing function is usually applied.
  • the invention has been described herein primarily in terms of its mathematical derivation and application, and the procedures required for implementation, it will be readily recognised by those skilled in the art that die procedures described can be implemented by means of any desired computational apparatus.
  • the invention may be embodied in computer software operating on general purpose computing equipment, or may be embodied in purpose built circuitry or contained in microcode or the like in an integrated circuit or set of integrated circuits.
  • G * ⁇ (x[n]*e ]nnlN )*e ⁇ " m
  • T 2 M2j( ⁇ k * myN G k -e - ** ⁇ G ⁇

Abstract

A method for coding digital audio data in which coded Fast Modified Discrete Cosine Transform (FMDCT) coefficients are computed utilising a Fast Fourier Transform (FFT) method. The described method allows a significant reduction in computations as compared to an ordinary DCT coding procedure. Also, pairs of audio channels can be combined to use a single FFT computation, where the selected transform length for the paired channels is the same. In such cases where pairing of identical transform length channels is not possible, a long transform length channel is combined with a short transform length channel and converted in two short transforms. A windowing function is also combined with a pre-processing stage to the transformation, further decreasing computational requireements.

Description

A FAST FREQUENCY TRANSFORMAΗON TECHNIQUE FOR TRANSFORM AUDIO CODERS
Technical Field
This invention is applicable in the field of multi-channel audio coders which use modified discrete cosine transform as a step in the compression of audio signals.
Background Art
In order to more efficiently broadcast or record audio signals, the amount of information required to represent the audio signals may be reduced. In the case of digital audio signals, the amount of digital information needed to accurately reproduce the original pulse code modulation (PCM) samples may be reduced by applying a digital compression algorithm, resulting in a digitally compressed representation of the original signal. The goal of the digital compression algorithm is to produce a digital representation of an audio signal which, when decoded and reproduced, sounds the same as the original signal, while using a minimum of digital information for the compressed or encoded representation.
Recent advances in audio coding technology have led to high compression ratios while keeping audible degradation in the compressed signal to a minimum. These coders are intended for a variety of applications, including 5.1 channel film soundtracks, HDTV, laser discs and multimedia. Description of one applicable method can be found in the Advanced Television Systems Committee (ATSC) Standard document entitled "Digital Audio Compression (AC-3) Standard", Document A/52, 20 December, 1995.
In the basic approach, at the encoder the time domain audio signal is first converted to the frequency domain using a bank of filters. The frequency domain coefficients, thus generated, are converted to fixed point representation. In fixed point syntax, each coefficient is represented as a mantissa and an exponent. The bulk of the compressed bitstream transmitted to the decoder comprises these exponents and mantissas. - 2 -
The exponents are usually transmitted in eir original form. However, each mantissa must be truncated to a fixed or variable number of decimal places. The number of bits to be used for coding each mantissa is obtained from a bit allocation algorithm which may be based on the masking property of the human auditory system. Lower numbers of bits result in higher compression ratios because less space is required to transmit the coefficients. However, this may cause high quantization errors, leading to audible distortion. A good distribution of available bits to each mantissa forms the core of the advanced audio coders.
The frequency transformation phase has one of the greatest computation requirements in a transform coder. Therefore, an efficient implementation of this phase can decrease the computation requirement of the system significantly and make real time operation of the encoder more easily attainable.
In some encoders such as those specified in the AC-3 standard, the frequency domain transformation of signals is performed by the modified discrete cosine transform (MDCT). If directiy implemented, the MDCT requires OfiV2) additions and multiplications. However it has been found possible to reduce the number of required operations significantly if the MDCT equation is able to be computed in a from that is amenable to the use of the well known Fast Fourier Transform (FFT) method of J.W. Cooley and J.W. Tukey (1960). Moreover, using a single FFT for two channels can result in greater reduction in computational requirements of the system.
Summary of the Invention
In accordance with the present invention there is provided a method for coding audio data comprising a sequence of digital audio samples, including d e steps of: i) multiplying the input samples with a first trigonometric function factor to generate an intermediate sample sequence; ii) computing a fast Fourier transform of the intermediate sample sequence to generate a Fourier transform coefficient sequence; - 3 - iii) for each transform coefficient in the sequence, multiplying die real and imaginary components of e transform coefficient by respective second trigonometric function factors, adding the multiplied real and imaginary transform coefficient components to generate an addition stream coefficient, and subtracting me multiplied real and imaginary transform coefficient components to generate a subtraction stream coefficient; iv) multiplying the addition and subtraction stream coefficients with respective third trigonometric function factors; and v) subtracting the corresponding multiplied addition and subtraction stream coefficients to generate audio coded frequency domain coefficients.
The present invention also provides a method for coding audio data, including the steps of: combining first and second sequences of digital audio samples from first and second audio channels into a single complex sample sequence; determining a Fourier transform coefficient sequence as defined above; generating first and second transform coefficient sequences by combining and/or differencing first and second selected transform coefficients from said Fourier transform coefficient sequence; and for each of me first and second transform coefficient sequences, generating audio coded frequency domain coefficients as defined above, so as to generate respective sequences of said audio coded frequency domain coefficients for the first and second audio channels.
The present invention also provides a method for coding audio data including d e steps of: obtaining at least one input sequence of digital audio samples; pre-processing die input sequence samples including applying a pre-multiplication factor to obtain modified input sequence samples; transforming e modified input sequence samples into a transform coefficient sequence utilising a fast Fourier transform; and post-processing the sequence of transform coefficients including applying first post- - 4 - multiplication factors to the real and imaginary coefficient components, differencing and combining die post-multiplied real and imaginary components, applying second post- multiplication factors to die difference and combination results, and differencing to obtain a sequence of modified discrete cosine transform coefficients representing said input sequence of digital audio samples.
The present invention also provides a method for coding audio data including die steps of: obtaining first and second input sequences of digital audio samples corresponding to respective first and second audio channels; combining me first and second input sequences of digital audio samples into a single complex input sample sequence; pre-processing the complex input sequence samples including applying a pre- multiplication factor to obtain modified complex input sequence samples; transforming die modified complex input sequence samples into a complex transform coefficient sequence utilising a fast Fourier transform; and post-processing the sequence of complex transform coefficients to obtain first and second sequences of audio coded frequency domain coefficients corresponding to me first and second audio channels including, for each corresponding frequency domain coefficient in d e first and second sequences, selecting first and second complex transform coefficients from said sequence of complex transform coefficients, combining the first complex transform coefficient and die complex conjugate of me second complex transform coefficient for said first channel and differencing die first complex transform coefficient and the complex conjugate of the second complex transform coefficient for said second channel, and applying respective post-multiplication factors to the combination and difference to obtain said audio coded frequency domain coefficients corresponding to the first and second audio channels.
The present invention further provides A method for coding audio data including the steps of: obtaining first and second input sequences of digital audio samples x[n], y[n] corresponding to respective first and second audio channels; - 5 - combining the first and second input sequences of digital audio samples into a single complex input sample sequence z[n], where z[n] = x[n] + jy[n]; pre-processing the complex input sequence samples including applying a pre- multiplication factor cos(7m/N) + jsin(πn/N) to obtain modified complex input sequence samples, where N is die number of audio samples in each of die first and second input sequences and n = 0,.... ,(N-l); transforming the modified complex input sequence samples into a complex transform coefficient sequence Zk utilising a fast Fourier transform, wherein k = 0,.... ,(M2-l); and post-processing the sequence of complex transform coefficients to obtain first and second sequences of audio coded frequency domain coefficients corresponding to die first and second audio channels Xk, Yk according to:
Gk = (Zt+Z;.fc.,)/2 A=0.JV72-1
G'k = (Zk-Z _k7/2j *=0.JV72-1
Xk = cosγ * (gk Cθs(π(k+\l2)IN)-gk m(π(k+V2)IN)
- sinγ * (gkιrsm(π(k+l/2)/N)+gcos(π(k+V2)/N)
Yk = cosγ * ^cos^^+l^yΛ -^si^π^+l^VΛ
- sinγ * (g; ijrsm(π(£+ l/2)/N) +g^,cos(Tt(A:+l/2)/JV)
where Gk is a transform coefficient sequence for the first channel;
G'k is a transform coefficient sequence for the second channel; gKr and gu are the real and imaginary transform coefficient components of Gk; g'k r and g'k ! are the real and imaginary transform coefficient components of G'k;
2 N_k_, is the complex conjugate of ZN.k.,; and y(k) = π(2k+ 1)/4.
The modified discrete cosine transform equation can be expressed as - 6 -
X k = Σ *M * cos(2π*(2«+l)*(2£+l)/4N + π*(2/fc+l)/4) Jfc=0...( 2-l) n=0
where xfnj is the input sequence for a channel and N is die transform length.
Instead of evaluating Xk in die form given above it could be computed as
Xk = cosy *(gkrcos(π(k+l/2 N)-gsm(π(k+\/2)/N)) -smy *(gkrsin(π(k+l/2)/N)+gcos(π(k+V2)/N)) gkr,gkl e 3t(set of real numbers)
n=N-\ where Gk - gkr + jgky ∑ (χ[n}ejπnlN)*ej2π"m . The symbol j represents the n=0
imaginary number <f . The expression ∑ (χ[n]eJ7tnW)*ej2l"'m is obtained from
die well known FFT metiiod, by first using transformation x'[n] =x[n] * eiπn Ν and dien n=jV-l computing the FFT Gk = ∑ xf[n]*ej2mkaf .
For a two channel approach, a complex variable z[n] = x[n]*e'm N + jy[n]*ejm/N is defined, where x[n] and y[n] are sample sequence for the two channels and e} , N represents die pre-multiplication factor. Using FFT approach, d e frequency coefficient Zk for the variable z[ ] is computed. From Zk the value Gk = (Zk + Z*^.ti)/2 and G'k =(Zk - 77N.k.^/2), required to compute the final MDCT for each channel, respectively, is calculated.
If either or both the channels require short length transformers, two short transforms are taken using the above approach. If neitiier need short transform, a single long transform is used. As an additional step in reducing computation, the windowing function can be combined with the pre-processing stage. - 7 - Brief Description of die Drawings
The invention is described in detail hereinafter, by way of example only, with reference to preferred embodiments diereof and witii aid of d e accompanying drawings, wherein: Figure 1 is a diagrammatic representation of a stream of audio data and die substructure arrangement thereof;
Figure 2 is a functional block diagram of a digital audio encoder; Figure 3 is a functional block diagram of a system for encoding a single audio channel; and Figure 4 is a functional block diagram of a system for encoding a pair of audio channels.
Detailed Description of die Preferred Embodiments
The above mentioned Advanced Television Systems Committee (ATSC) Standard document entitled "Digital Audio Compression (AC-3) Standard" (Document A/52, 20 December, 1995) describes med ods for encoding and decoding audio signals, and is hereby expressly incorporated herein by reference.
In general, the input to an audio coder comprises a stream of digitised samples of the time domain analog signal. For a multi-channel encoder the stream consists of interleaved samples for each channel. The input stream is sectioned into blocks, each block containing N consecutive samples of each channel (see Fig. 1). Thus within a block die N samples of a channel form a sequence {x[0], x[l], x[2], ... , x[N-l]}.
The time domain samples are next converted to die frequency domain using an analysis filter bank (see Fig. 2). The frequency domain coefficients, thus generated, form a coefficient set which can be identified as (X0, X{, X2, ..., X N/2Λ). Since die signal is real only the first Nil frequency components are considered. Here X0 is the lowest frequency (DC) component while XN/2_, is the highest frequency component of the signal. - 8 -
Audio compression essentially entails finding how much of the information in the set (X0, X_, X2, .... XN/2._) is necessary to reproduce the original analog signal at the decoder with minimal audible distortion.
The coefficient set is normally converted into floating point format, where each coefficient is represented by an exponent and mantissa. The exponent set is usually transmitted in its original form. However, the mantissa is truncated to a fixed or variable number of decimal places. The value of number of bits for coding a mantissa is usually obtained from a bit allocation algorithm which for advanced psychoacoustic coders may be based on the masking property of the human auditory system. A low number of bits results in high compression ratio because less space is required to transmit the coefficients. However this causes very high quantization error leading to audible distortion. A good distribution of available bits to each mantissa forms the core of the most advanced encoders.
In some encoders such as die AC-3, d e frequency domain transformation of signals is performed by die (MDCT) modified discrete cosine transform (Eq. 1). n=Λ x k = Σ XW * cos(2π*(2n+l)*(2 +l)/4N + π*(2 +l)/4) Jfc=0...(M2-l) Eq. 1
If directly implemented in die form given above, die MDCT requires OζN2) additions and multiplications.
Single Channel FFT
It is possible to reduce the number of required operations significantly if one is able to evaluate Eq. 1 using the well known Fast Fourier Transform method of J.W. Cooley and J.W. Tukey (1960). The general Discrete Fourier Transform (DFT) is given below (Eq. 2). It requires OiN2) complex additions and multiplications. By using the Fast Fourier Transform method the DFT in Eq. 2 can be computed with 0(N oglN) operations only. n=N-l
F k = Σ (x[n] *e2*J"m) k=0. -l Eq. 2
Here j is the symbol for imaginary number, i.e. j = 7-1.
Although it may not be immediately apparent how Eq. 1 can be transformed to Eq. 2, a careful analysis shows tiiat this is indeed possible. To simplify Eq. 1, two functions can be defined a(n,k) = 2π(2n+ l)(2jfc+ l)/4N ' Eq. 3 (k) = π(2k+ 1)/4 Eq. 4
Then, using tiiese functions, Eq. 1 can be rewritten as
n=ΛM x k = ∑ x[n] *∞s(a(n,k)+y(k)) Eq. 5 n=tf-l
= ∑ x[n] *(cos (n,k)cosy(k)-sm (n,k)siny(k)) Eq. 6
«=o
In Eq. 6 the trigonometric equality, cos(α-f-b) = cosα cosb-sinα sinb is used for simplification. Furthermore, since the function γ(k) is not dependant on variable n, it can be brought outside the summation expression to give
Xk = cosγ(A) ∑ x[n] *cos (n,k) - sinγ(λ) ∑ x[ ] *sina(n,k)
= ^cosγ^) - r2sinγ(Aτ) Eq η
n=N- \ n=N-l where Tχ = ∑∑ xx[[nn]] **ccoossaa((nn,,kk)) aanndd TT22 == ∑∑ x[n] *sina(n,k) n=0 n=0
The two terms, T, and T2, can now be evaluated separately. Using Euler's identity e'θ = cosθ + 'sinθ, we can express: cosα(Λ,£)=(eiα(n'k) + e-j"<,α))/2 and sinα(n,jt) =(eiα("Λ)-e"jα("-lc))/2/". - 10- Therefore we can rewrite the term T, as n=N-\ n=V-l
T, = ∑ x[«]*(eα+e-)/2 = l/2( ∑ x[n]*ea + ∑ x[n\*e^a) n Σ=0 n=0 π=0 Eq.8
1/2 ( j _42)
n=/V-l n=V-l where A, = ∑ x[n]*eja and A2 = ∑ x[ri\*e'ja n=0 n=Q
Similarly
n=N-l Λ=ΛM Λ=Λ
T 2 = Σ ]*(eα-e";α)/2 = 1/2/(5 xW*^α - Σ M*e) Eq.9
= \l2j{A, - A2)
The term A, can thus be evaluated from Eq.8 and Eq.9
Aχ = ∑ x[n]*eja n=.V-l = ^ χrj!)tgΛ2π(2n*l)(^l)4Jv) n=0 _ ejn(k+V2yN * ^ (χ[n] *eJ*"M\ * ej2vnUN n=o Eq.10
If a complex variable is defined as: x,[n]= n]*eiπn/N Eq.11 then Eq.10 is simply: n=N-\
A _ e k*mw^ x /[n]*ej2lt"m
= e^i *Gk Eq. l2
n-W-l where GA = ∑ x '[«] *<?-'ΛM,fcW n=0 - 11 -
The complex term Gk = g,r+gk-i, where gk r and gk _, e 3t (set of real numbers) in Eq.12 is essentially die same as Fk in Eq.2. Therefore the FFT approach can be used to evaluate Gk. This brings down computation from OiN2) to O(NlogN). Similarly, the second term A2 in Eq.8 and Eq.9 can be evaluated n=ιV-l n=W-l
A2 = ∑ x[n]*e'M"'k) = e-MV"inyN* (x[n]*e'jππ,N)*e'j2πnk/N n-Q n-0
= e-^'W * Gk ' Eq.13
n=/V-l where Gk = ∑ (χ[n]*e-J/N)*e-j2πnm n=0
Note that Gk * is actually the complex conjugate of Gk which was obtained by Eq.12. That is, if Gk = gk,r+gk,i» where gk r and gki e Si as defined earlier, then Gk * = gk-r - ygki. Therefore Gk * in Eq.13 does not need to be computed again, and the result from Eq.12 can be re-used. That is, only one FFT needs to be computed for the evaluation of T. The result of Eq.8 to Eq.13 is thus
T = \l2( ^υWN Gk+e-^k+myN Gk) Eq.14
Next, the term T2 can be analysed
T2 = ∑ x[nW-e- i2j = \l2j{A -A2)
= \l2j(e^k I2 N Gk-e -^^w Q^ £q 15
Finally, after simplifications of Eq.7, 14 and 15 12
Xk = cosγ(/fc) \l2(e^ N Gk^ -^ ),N Gk ')
- sinγ(/t) \l2j(e^mW Gk-e -J**W Gk')
= cosγ * (gircos(π fc+l/2)/Λ -gi in(π(Λ:+l/2yΛ')
- sinγ * (g^rsin(π(A:+l/2yΛ +^f,cos(π(ιt+l/2yΛ
= cosγ * T, - sinγ * T2 Eq 16
The term Gk = gkr + jg i is computed in O(NogN) operation by use of FFT algorithms. The additional operation outlined in Eq. 16 to extract the final Xk is only of order 9(Λ - Therefore the MDCT can now be computed in 0(Mog2N) time. The operations required to obtain die MDCT are illustrated in Fig. 3.
Combining Two Channels into Single FFT
Suppose die multi-channel encoder is required to process m audio channels. Instead of computing an FFT for each channel as described in die previous section, it is possible to further reduce the computational requirement of the coder by combining two channels and using a single FFT only. In effect, instead of m FFTs only m/2 FFTS need to be computed.
If the input sequence are real numbers then it is known that DFT for any two channels can be computed with only one FFT block by considering the input as a complex number. The real part is formed from the sequence for any one channel and the imaginary part is from data of another channel. After the Fourier Transform is computed for the resulting complex variable, the resulting transform for each channel can be easily retrieved.
However, in the present case the input data to the FFT block is actually a complex number (formed by multiplying the real data by complex variable e 'm/N). In this case, there is no straightforward way of retrieving the frequency transform after having combined two channels. However, using some processing after the FFT one can still compute the DFT of two channel using a single FFT block. -13-
Let {x[0],x[l],x[2],...,x[N-l]} be N input samples of the first channel and {y[0].)'[l]');[2].---}'[N-l]} be the samples for the second channel. As described above, the n-ΛM frequency coefficients Gk = ∑ x[ri\e>™'N * e j2κnk,N (Eq. 12 and 13) must be
obtained for the first channel; and similarly, for the second channel n-N-\ G = ∑ y[n]eJIN * ej2π"m
Defming complex variable z[n] = x[/z]*ejπnN + ; [n]*ejπnN Eq. 17 and computing its DFT using the FFT method, yields n=W-l
Z k = Σ 7*e2m k=0..N-\
= ∑ ( ] + jy[n])ejππ/N*eJ2πnm n=0
= ∑ (Φl + jy[n])*ej2πn(k /2yN n=0 Eq.18
Now substituting N-k for k in the above expression,
Z N-k = Σ (Φ] + jy[n])*eJ2*"N-k*U2VN k=0..7l-l
- ∑ (Φ] + jy[n])*ej2π"(~k*ιnyM*e-j2
17=0 n=W-l
= Σ (Φl + jy[n])*ej2πn('k nyN
«=o Eq.19
Since <-2πa = l,nel (the set of integers), the term 2na vanishes in the above expression. Taking the complex conjugate of ZN_k:
-k - "∑ (x[ri\-jy[ή\)*e-Wk^N n=0 n=W-l
= Σ (x[n]-jy[n])*ej2*"V<-mW n--o Eq.20 - 14 -
Using Eq. 18 and 20, separate expressions for Gk and G'k are required. In a simple case die conjugates in Eq. 18 and 20 should add and subtract to give die required expressions. However in this instance tiiat is not the case. But, substituting N-k by N-k-] in Eq. 18, the following is obtained
Zs-k-x = Σ (x[n]-jy[n])*eJ2™^V2 /N Eq. 21 π=0
Now the term e i2πn<lc+1/2) N is common in both Eq. 17 and 19, and it is possible to isolate.
ZyZύ-k-x = "∑ x[ri\ *e^"^myN + j"~∑ y[n]*EJ~2«"^υ2W π=0 rt-0
+ ("∑ x[n] *ej2πn^l/2yN - j '∑ y[n] *Ej2πn(k*υ2W) n=0 n=0 n=N- l
= 2 ∑ {x[ri\ei™'N)*ej2πnm n=0
= 2Gk
Similarly,
ZyZ^-. = "∑ χ[ri *eW*m N + j ∑ y[ri *E^n^m N rt=0 π=0 π=W-l n=ΛT-l
- ( ∑ χ[n] *ei2πnik nyN - j ∑ y[n]*Ej2*"(k+U2)/N) n=0 n=0 π=Λ
= 2 ∑ (y[n]ejπn/N)*ej2l"'m n=0
2jG
That is
Gk = (Zk+Z^k_ )l2 /t=0.JV/2-l Eq. 22
and
G'k = ( k-Zύ-k__)l2j k=0..N/2-l Eq. 23 - 15 -
From the expression from Eq. 22 and 23 into Eq. 16, the MDCT for each channel is obtained. The overall process is illustrated in Fig. 4.
Transform Lengtii Adjustment Technique
The frequency transform lengtii N is decided by die encoder based on temporal and spectral resolution requirements. The input signal is usually analysed witii a high frequency bandpass filter to detect die presence of transients. This information is used to adjust die block lengtii, restricting quantization noise associated witii die transient within a small temporal region about die transient, avoiding temporal masking. Thus, if transient is detected in a channel, two short transform of length N/2 each are taken. In the absence of transient, a single long transform of length N is used, thus providing higher spectral resolution.
From the method described in the previous section for computing MDCT for two channels using a single FFT block, it is evident that the transform length for the two paired channels must be the same. Therefore, pairing for the transformation phase much be such that channels with identical transform length are grouped together.
It is however possible tiiat not all channels can be paired with such convenience. Assume that die total number of channels are an even number (if not, take a single FFT for one channel and the rest form an even group). Suppose out of die m channels, need long transform and tiierefore m-l require short transform.
If / is an even number, then since die total is even, it follows that l-m is also even. In this case, from the / channels that need long transform, 1/2 pairs are formed and for each of the 1/2 pairs a single FFT is computed to estimate the MDCT for the original paired channels. Similarly, the l-m channels are paired to form (l-m)ll pairs and for the (l-m)ll pairs two short FFTs are computed.
Now consider the case when 1 / = 2r + 1 is an odd number. Therefore m - 1 = 2s + 1 is - 16 - also an odd number. The 2r channels requiring long transform are paired togedier to form r pairs and dien 2r transforms are computed using r FFTs only. Similarly, for the Is channels s pairs are formed. What remains is one channel requiring long transform and another requiring two short transforms. Both of tiiese channels are paired togedier and two short FFTs are computed to derive the MDCT.
The rationale for constraining die long transform to two short ones is as follows. A short transform is required for restricting quantization noise associated with die transient within a small temporal region about the transient, avoiding temporal masking. A long transform gives slight better frequency resolution but die error is not much compared to die case when in die presence of transient a long transform is utilised. Forcing a long transform onto a channel in the presence of transient leads to greater distortion in die final produced music. This conjecture was proven true by experimental studies on benchmark music streams.
Combining Windowing with pre-processing
Before d e time domain signal x[n] is transformed to die frequency domain, a windowing function is usually applied. Thus, if the sampled signal is p[ή] then die sequence that is applied to die frequency transformation block is x[n] = p[n] * w[n], where w[n] is the windowing function. From die previous sections we noted tiiat before the FFT is computed for a block a pre-processing is performed as given in Eq. 11 (reproduced below for convenience). Thus x'[n] = x[n] * ejm N = (p[n] * w[n]) * eJm N
= ip[n]* w[n])* (cosm/N + j sin τm/N)
= p[n] * ((w[n] * cosππ/Λ + yXvv[/z]*sin πrι/Λ)) Eq. 24
From Eq. 24 we note that the windowing function can be combined with the cosine and sine multiplication required in Eq. 11. This brings down the computation even further since the sine and cosine are usually implemented in a real time system as table-lookup. If - 17 - two tables are constructed as defined below
rcos[«] = w[n] * cos( τm/N) rs [ή = w[n]* sin( m/N)
then Eq. 11 can be rewritten as
'M = (PW * rcos[n]) + j(p[n]* r sin [n]) Eq. 25
Although the invention has been described herein primarily in terms of its mathematical derivation and application, and the procedures required for implementation, it will be readily recognised by those skilled in the art that die procedures described can be implemented by means of any desired computational apparatus. For example, the invention may be embodied in computer software operating on general purpose computing equipment, or may be embodied in purpose built circuitry or contained in microcode or the like in an integrated circuit or set of integrated circuits.
The foregoing detailed description of embodiments of die invention has been presented by way of example only, and is not intended to be considered limiting to die invention as defined in die claims appended hereto.
-18- Glossary of Equations:
MDCT n=N*\ x k = Σ Φl * cos(2π*(2«+l)*(2 +l)/4N + π*(2/fc+l)/4) k=0...(N/2-l)
»=0
= cosγ * gk^os(π(k+\l2)IN)-gkμn(π(k+\l2)IN) - sinγ * ( A.jrsin(π(A:+l/2)/Λr)+gtι;cos(π(Λ+l/2)/Λ = Tlcosy(k) - T2siny(k)
n=A/-l n=Λ r, = ∑ x[n] *cosa(n,k) T2 = ∑ x[n] *sina(n,k) n=0 n-Q = 1204, + A2) = \l2j(Ax - A2)
Al = Σ Φ]* "* A2 = Σ Φ]*e"yα n=0 Λ=0
_ ejπ(k+V2 N_Q _ e -n(2k+myN ^ Q «
»=ΛM n=iV-l
G* = Σ (x[n]*e]nnlN)*e^"m Gk = ∑ (x[n]*e-jnn/N)*e-j2m n=0 „=0
T2 = M2j( ^k*myN Gk-e - **^ G^
Gk = (^+^-*-ιV2 k=0..M2-\
G', = (Zfc-Z .4-1)/2/- *=O.JV72-l
α(#ι,Jfc) = 2π(2rt+l)(2:+l)/4N γ(Λ) = π(2*+l)/4

Claims

- 19 - Claims
1. A method for coding audio data comprising a sequence of digital audio samples, including the steps of: i) multiplying die input samples with a first trigonometric function factor to generate an intermediate sample sequence; ii) computing a fast Fourier transform of the intermediate sample sequence to generate a Fourier transform coefficient sequence; iii) for each transform coefficient in d e sequence, multiplying die real and imaginary components of die transform coefficient by respective second trigonometric function factors, adding the multiplied real and imaginary transform coefficient components to generate an addition stream coefficient, and subtracting the multiplied real and imaginary transform coefficient components to generate a subtraction stream coefficient; iv) multiplying the addition and subtraction stream coefficients with respective tiiird trigonometric function factors; and v) subtracting the corresponding multiplied addition and subtraction stream coefficients to generate audio coded frequency domain coefficients.
2. A metiiod for coding audio data as claimed in claim 1, wherein the audio coded frequency domain coefficients comprise modified discrete cosine transform coefficients.
3. A method for coding audio data as claimed in claim 1 or 2, wherein the first trigonometric function factor for each audio sample is a function of the audio sample sequence position and die number of samples in die sequence.
4. A method for coding audio data as claimed in claim 3, wherein the respective second trigonometric function factors for each transform coefficient in the sequence are respective functions of the transform coefficient sequence position and the number of coefficients in the sequence. - 20 -
5. A method for coding audio data as claimed in claim 4, wherein the respective third trigonometric function factors are respective functions of the transform coefficient sequence position.
6. A method for coding audio data as claimed in claim 5, wherein step i) comprises multiplying the input sequence samples xfnj by the first trigonometric function factor cos(m/N) to generate the intermediate sample sequence, where: xfnj are the input sequence audio samples; N is the number of input sequence audio samples; and n = 0,.... ,N-l.
7. A method for coding audio data as claimed in claim 6, wherein step ii) comprises computing die fast Fourier transform of the intermediate sample sequence so as to generate said transform coefficient sequence Gk = gk.r + jg !, where: Gk is the transform coefficient sequence; g r are the real transform coefficient components; g i are the imaginary transform coefficient components; and k = 0,....,(N/2-l).
8. A method for coding audio data as claimed in claim 7, wherein step iii) comprises determining the addition stream coefficients T2 and subtraction stream coefficients Tt according to:
Tt = g„r cos(π(k+l/2)/N) - g , sin(π(k+l/2)/N) T2 = gkr cos(π(k+l/2)/N) + g i sin(π(k+l/2)/N) where T, and T2 are the subtraction stream and addition stream coefficients, respectively.
9. A method for coding audio data as claimed in claim 8, wherein steps iv) and v) comprise generating die audio coded frequency domain coefficients Xk according to: Xk = T, cos(π(2k+l)/4) - T2 sin(π(2k+l)/4) where Xk are the audio coded frequency domain coefficients; and cos(π(2k+l)/4) and sin(π(2k+l)/4) are the third trigonometric function factors. - 21 -
10. A method for coding audio data, including the steps of: combining first and second sequences of digital audio samples from first and second audio channels into a single complex sample sequence; determining a Fourier transform coefficient sequence as defined in any preceding claim; generating first and second transform coefficient sequences by combining and/or differencing first and second selected transform coefficients from said Fourier transform coefficient sequence; and for each of the first and second transform coefficient sequences, generating audio coded frequency domain coefficients as defined in any preceding claim, so as to generate respective sequences of said audio coded frequency domain coefficients for die first and second audio channels.
11. A method for coding audio data as claimed in claim 10, wherein the step of generating first and second transform coefficient sequences comprises, for each corresponding coefficient in the first and second transform coefficient sequences, selecting first and second transform coefficients from said Fourier transform coefficient sequence, determining a complex conjugate of said second transform coefficient, combining said first transform coefficient and said complex conjugate for said first transform coefficient sequence and differencing said first transform coefficient and said complex conjugate for said second transform coefficient sequence.
12. A method for coding audio data as claimed in claim 10 or 11, wherein the multiplying step i) comprises multiplying the input sequence samples zfnj by the first trigonometric function factor cosfiτn/N) + jsin(τm/N) to generate the intermediate sample sequence, where: zfnj — x[n] + jy[n] is die complex sample sequence; x[n] is the first sequence of digital audio samples; y[n] is the second sequence of digital audio samples; N is the number of input sequence audio samples in each sequence; n = 0,.... ,N-1; and - 22 - j is the complex constant.
13. A method for coding audio data as claimed in claim 11 or 12, wherein said first and second transform coefficient sequences are generated according to:
5 ck = (zk + r^n
G'k = (Zk - Zr╬╣^1)/2i where Gk is said first transform coefficient sequence;
G'k is said second transform coefficient sequence; N is the number of input sequence audio samples; 10 k = 0,.... ,(N/2-l);
Zk is said first transform coefficient; 7N.M is the complex conjugate of said second transform coefficient; and j is the complex constant.
15 14. A method for coding audio data as claimed in any one of claims 10 to 13, including examining said first and second sequences of digital audio samples to determine a short or long transform length, and coding the audio samples using a short or long transform length as determined.
20 15. A method for coding audio data comprising sequences of digital audio samples from a plurality of audio channels, comprising determining a transform length for each of the channels, pairing the channels according to tiieir determined transform length, and coding the audio samples of first and second channels in each pair, as defined in any one of claims 10 to 13, according to die determined transform length.
25
16. A method for coding audio data as claimed in any preceding claim, including applying a windowing function in combination witii said multiplying step i).
17. A method for coding audio data including the steps of:
30 obtaining at least one input sequence of digital audio samples; pre-processing the input sequence samples including applying a pre-multiplication - 23 - factor to obtain modified input sequence samples; transforming die modified input sequence samples into a transform coefficient sequence utilising a fast Fourier transform- and post-processing the sequence of transform coefficients including applying first post- multiplication factors to the real and imaginary coefficient components, differencing and combining die post-multiplied real and imaginary components, applying second post- multiplication factors to die difference and combination results, and differencing to obtain a sequence of modified discrete cosine transform coefficients representing said input sequence of digital audio samples.
18. A method as claimed in claim 17, wherein the pre-multiplication factor, and first and second post-multiplication factors are trigonometric function factors.
19. A method as claimed in claim 18, wherein the pre-multiplication factor applied to each digital audio sample in die input sequence is a trigonometric function of die audio sample sequence position and die number of samples in the sequence.
20. A method as claimed in claim 18, wherein the first post-multiplication factors for each transform coefficient in the sequence are trigonometric functions of d e transform coefficient sequence position and die number of coefficients in the sequence.
21. A method as claimed in claim 18, wherein the second post-multiplication factor for each difference or combination result is trigonometric functions of the transform coefficient sequence position of die coefficients used in the difference or combination.
22. A method as claimed in any one of claims 17 to 21, wherein the pre-processing operations are performed on each sample in die input sequence individually.
23. A metiiod as claimed in any one of claims 17 to 22, wherein the post-processing operations are performed on each transform coefficient in the sequence individually. - 24 -
24. A method for coding audio data including die steps of: obtaining first and second input sequences of digital audio samples corresponding to respective first and second audio channels; combining the first and second input sequences of digital audio samples into a single complex input sample sequence; pre-processing me complex input sequence samples including applying a pre- multiplication factor to obtain modified complex input sequence samples; transforming the modified complex input sequence samples into a complex transform coefficient sequence utilising a fast Fourier transform; and post-processing die sequence of complex transform coefficients to obtain first and second sequences of audio coded frequency domain coefficients corresponding to the first and second audio channels including, for each corresponding frequency domain coefficient in the first and second sequences, selecting first and second complex transform coefficients from said sequence of complex transform coefficients, combining the first complex transform coefficient and the complex conjugate of the second complex transform coefficient for said first channel and differencing the first complex transform coefficient and the complex conjugate of the second complex transform coefficient for said second channel, and applying respective post-multiplication factors to the combination and difference to obtain said audio coded frequency domain coefficients corresponding to the first and second audio channels.
25. A method as claimed in claim 24, wherein the pre-multiplication factor for each sample in the complex input sample sequence comprises a complex trigonometric function of the complex input sample sequence position and the number of samples in the sequence.
26. A method as claimed in claim 24 or 25, wherein the post-processing for each of the first and second channels includes applying first post-multiplication factors to the real and imaginary coefficient components, differencing and combining the post-multiplied real and imaginary components, applying second post-multiplication factors to the difference and combination results, and differencing to obtain a sequence of modified discrete cosine transform coefficients representing said input sequence of digital audio samples. - 25 -
27. A metiiod for coding audio data including the steps of: obtaining first and second input sequences of digital audio samples x[n], y[n] corresponding to respective first and second audio channels; combining the first and second input sequences of digital audio samples into a single complex input sample sequence z[n], where z[ = x[ ] + jy[n]; pre-processing the complex input sequence samples including applying a pre- multiplication factor cos(τm/N) + jsin(τm/N) to obtain modified complex input sequence samples, where N is the number of audio samples in each of the first and second input sequences and n = 0,....,(N-1); transforming the modified complex input sequence samples into a complex transform coefficient sequence Zk utilising a fast Fourier transform, wherein k = 0,....,(N/2-l); and post-processing the sequence of complex transform coefficients to obtain first and second sequences of audio coded frequency domain coefficients corresponding to die first and second audio channels Xk, Yk according to:
Gk = (Zfe+z .fc.j)/2 *=0.JV72-1
G = (Zk-Z^_k. )l2j *=0..N72-1
Xk = cosγ * (g-^cos^^+l^yTV -g-^sin^^+l^yΛ^ - sinγ * ( fcjrsin(π(Ar+l/2yN)+^/cos(π(A:+l/2yΛ
Yk = cosγ * C?, fc)rcos(π(t+l/2yΛ -g'A),.sin(π(Λ:+l/2)/Λ - sinγ * (g' rsm(π(k+V2)/N)+g'kticos(π(k+V2)/N)
where Gk is a transform coefficient sequence for the first channel;
G is a transform coefficient sequence for the second channel; g r and gk i are the real and imaginary transform coefficient components of Gk; g'Kr and g'κι are the real and imaginary transform coefficient components of G'k; Z *./ is the complex conjugate of ZN.k.,; and y(k) = π(2£+ l)/4.
EP98909964A 1998-02-21 1998-02-21 A fast frequency transformation techique for transform audio coders Expired - Lifetime EP1057292B1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/SG1998/000014 WO1999043110A1 (en) 1998-02-21 1998-02-21 A fast frequency transformation techique for transform audio coders

Publications (2)

Publication Number Publication Date
EP1057292A1 true EP1057292A1 (en) 2000-12-06
EP1057292B1 EP1057292B1 (en) 2004-04-28

Family

ID=20429840

Family Applications (1)

Application Number Title Priority Date Filing Date
EP98909964A Expired - Lifetime EP1057292B1 (en) 1998-02-21 1998-02-21 A fast frequency transformation techique for transform audio coders

Country Status (3)

Country Link
EP (1) EP1057292B1 (en)
DE (1) DE69823557T2 (en)
WO (1) WO1999043110A1 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1125235B1 (en) 1998-10-26 2003-04-23 STMicroelectronics Asia Pacific Pte Ltd. Multi-precision technique for digital audio encoder
DE19959156C2 (en) 1999-12-08 2002-01-31 Fraunhofer Ges Forschung Method and device for processing a stereo audio signal to be encoded
US6934677B2 (en) 2001-12-14 2005-08-23 Microsoft Corporation Quantization matrices based on critical band pattern information for digital audio wherein quantization bands differ from critical bands
US7240001B2 (en) 2001-12-14 2007-07-03 Microsoft Corporation Quality improvement techniques in an audio encoder
US7502743B2 (en) 2002-09-04 2009-03-10 Microsoft Corporation Multi-channel audio encoding and decoding with multi-channel transform selection
US7299190B2 (en) 2002-09-04 2007-11-20 Microsoft Corporation Quantization and inverse quantization for audio
GB2423451A (en) * 2005-02-16 2006-08-23 Ishce Ltd Inserting a watermark code into a digitally compressed audio or audio-visual signal or file
US7539612B2 (en) 2005-07-15 2009-05-26 Microsoft Corporation Coding and decoding scale factor information
US7831434B2 (en) 2006-01-20 2010-11-09 Microsoft Corporation Complex-transform channel coding with extended-band frequency coding
US7953604B2 (en) 2006-01-20 2011-05-31 Microsoft Corporation Shape and scale parameters for extended-band frequency coding
US7885819B2 (en) 2007-06-29 2011-02-08 Microsoft Corporation Bitstream syntax for multi-process audio decoding

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2646778B2 (en) * 1990-01-17 1997-08-27 日本電気株式会社 Digital signal processor
JP2866754B2 (en) * 1991-03-27 1999-03-08 三菱電機株式会社 Arithmetic processing unit
CA2090052C (en) * 1992-03-02 1998-11-24 Anibal Joao De Sousa Ferreira Method and apparatus for the perceptual coding of audio signals
JPH06112909A (en) * 1992-09-28 1994-04-22 Sony Corp Signal converter for improved dct
EP0718746B1 (en) * 1994-12-21 2005-03-23 Koninklijke Philips Electronics N.V. Booth multiplier for trigonometric functions

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO9943110A1 *

Also Published As

Publication number Publication date
EP1057292B1 (en) 2004-04-28
DE69823557T2 (en) 2005-02-03
DE69823557D1 (en) 2004-06-03
WO1999043110A1 (en) 1999-08-26

Similar Documents

Publication Publication Date Title
US11557304B2 (en) Methods and apparatus for performing variable block length watermarking of media
KR100253136B1 (en) Low computational complexity digital filter bank
CN102150207B (en) Compression of audio scale-factors by two-dimensional transformation
JP4242450B2 (en) An efficient implementation of a single sideband filter bank for phasor measurements.
JP5414684B2 (en) Method and apparatus for performing audio watermarking, watermark detection, and watermark extraction
EP0990368A1 (en) Method and apparatus for frequency-domain downmixing with block-switch forcing for audio decoding functions
JP2009271554A (en) Parametric representation of spatial audio
CN101887726A (en) The method of stereo coding and decoding and equipment thereof
EP1057292A1 (en) A fast frequency transformation techique for transform audio coders
US20070036228A1 (en) Method and apparatus for audio encoding and decoding
KR20120095920A (en) Optimized low-throughput parametric coding/decoding
EP0775389B1 (en) Encoding system and encoding method for encoding a digital signal having at least a first and a second digital signal component
Brandenburg Introduction to perceptual coding
CN1862969B (en) Adaptive block length, constant converting audio frequency decoding method
US7203717B1 (en) Fast modified discrete cosine transform method
EP0707761B1 (en) Arrangement for determining a signal spectrum of a wideband digital signal and for deriving bit allocation information in response thereto
EP1076295A1 (en) Method and encoder for bit-rate saving encoding of audio signals
KR100424036B1 (en) Disassembly / synthesis filtering system with efficient nodal stack single-sided band filter bank using time-domain aliasing
Lanciani Compressed-domain processing of MPEG audio signals
JP3292228B2 (en) Signal encoding device and signal decoding device
Fielder et al. Audio Coding Tools for Digital Television Distribution
Wylie apt-X100: Low-Delay, Low-Bit-Rate Subband ADPCM Digital: Audio Coding
Sandler Linear time-invariant systems in the wavelet domain
Orglmeister Data reduction in high-quality audio signals
JPH06169289A (en) Compressed data reproduction device

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20000920

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): DE FR GB IT

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: STMICROELECTRONICS ASIA PACIFIC PTE LTD.

17Q First examination report despatched

Effective date: 20030305

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): DE FR GB IT

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REF Corresponds to:

Ref document number: 69823557

Country of ref document: DE

Date of ref document: 20040603

Kind code of ref document: P

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20050131

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20080213

Year of fee payment: 11

Ref country code: GB

Payment date: 20080129

Year of fee payment: 11

Ref country code: DE

Payment date: 20080208

Year of fee payment: 11

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20080228

Year of fee payment: 11

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20090221

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20091030

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20090901

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20090221

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20090302

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20090221