WO2001069593A1 - Laguerre fonction for audio coding - Google Patents

Laguerre fonction for audio coding Download PDF

Info

Publication number
WO2001069593A1
WO2001069593A1 PCT/EP2001/002424 EP0102424W WO0169593A1 WO 2001069593 A1 WO2001069593 A1 WO 2001069593A1 EP 0102424 W EP0102424 W EP 0102424W WO 0169593 A1 WO0169593 A1 WO 0169593A1
Authority
WO
WIPO (PCT)
Prior art keywords
shape
function
audio
transient
signal component
Prior art date
Application number
PCT/EP2001/002424
Other languages
French (fr)
Inventor
Arnoldus W. J. Oomen
Albertus C. Den Brinker
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V. filed Critical Koninklijke Philips Electronics N.V.
Priority to DE60129771T priority Critical patent/DE60129771T2/en
Priority to EP01929398A priority patent/EP1190415B1/en
Priority to JP2001567585A priority patent/JP4803938B2/en
Publication of WO2001069593A1 publication Critical patent/WO2001069593A1/en

Links

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

Definitions

  • the shape parameters include a step indication in case the transient signal component is a step-like change in amplitude.
  • the signal after the step-like change is advantageously coded in sustained coders.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Cereal-Derived Products (AREA)
  • Stereophonic System (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)

Abstract

Coding (1) of an audio signal is provided including estimating (110) a position of a transient signal component in the audio signal, matching (111, 112) a shape function on the transient signal component in case the transient signal component is gradually declining after an initial increase, which shape function has a substantially exponential initial behavior and a substantially logarithmic declining behavior; and including (15) the position and shape parameters describing the shape function in an audio stream (AS).

Description

AGUERRE FUNCTION FOR AUDIO CODING
The invention relates to coding of audio signals, in which transient signal components are coded.
The invention further relates to decoding of audio signals. The invention also relates to an audio coder, an audio player, an audio system, an audio stream and a storage medium.
The article from Purnhagen and Edler, "Objektbasierter Analyse/Synthese Audio Coder fur sehr niedrige Datenraten", ITG Fachbericht 1998, No. 146, pp. 35-40 discloses a device for coding of audio signals at low bit-rates. A model-based Analysis- Synthesis arrangement is used, in which an input signal is divided in three parts: single sinusoids, harmonic tones, and noise. The input signal is further divided in fixed frames of 32 ms. For all blocks and signal parts, parameters are derived based on a source-model. To improve the representation of transient signal parts, an envelope function a(t) is derived from the input signal and applied on selected sinusoids.. The envelope function consists of two line segments determined by the parameters ratk, rdec, tmax as shown in Fig. 1.
An object of the invention is to provide audio coding that is advantageous in terms of bit-rate and perception. To this end, the invention provides a method of coding and decoding, an audio coder, an audio player, an audio system, an audio stream and a storage medium as defined in the independent claims. Advantageous embodiments are defined in the dependent claims.
A first embodiment of the invention comprises estimating a position of a transient signal component in the audio signal, matching a shape function on the transient signal component in case the transient signal component is gradually declining after an initial increase, which shape function has a substantially exponential initial behavior and a substantially logarithmic declining behavior; and including the position and parameters describing the shape function in an audio stream. Such a function has an initial behavior substantially according to t" and a declining behavior after the initial increase substantially according to e'a where t is a time, and n and a are parameters which describe a form of the shape function. The invention is based on the insight that such a function gives a better representation of transient signal components while the function may be described by a small number of parameters, which is advantageous in terms of bit-rate and perceptual quality. The invention is especially advantageous in embodiments where transient signal components are separately encoded from a sustained signal component, because especially in these embodiments a good representation of the transient signal components is important.
According to a further aspect of the invention, the shape function is a Laguerre function, which is in continuous time given by c - t"e-" (1) where c is a scaling parameter (which may be taken one). In a practical embodiment, a time- discrete Laguerre function is used.
Transient signal components are conceivable as a sudden change in power (or amplitude) level or as a sudden change in waveform pattern. Detection of transient signal components as such, is known in the art. For example, in J. Kliewer and A. Mertins, 'Audio subband coding with improved representation of transient signal segments', Proc. of
EUSIPCO-98, Signal Processing IX, Theories and applications, Rhodos, Greece, Sept. 1998, pp. 2345-2348, a transient detection mechanism is proposed, that is based on the difference in energy levels before and after an attack start position. In a practical embodiment according to the invention, sudden changes in amplitude level are considered. In a preferred embodiment of the invention, the shape function is a generalized discrete Laguerre function. Meixner and eixner-like functions are practical in use and give a surprisingly good result. Such functions are discussed in A.C. den Brinker, 'Meixner-like functions having a rational z-transform', Int. J. Circuit Theory Appi, 23, 1995, pp. 237-246. Parameters of these shape functions are derived in a simple way. In another embodiment of the invention, the shape parameters include a step indication in case the transient signal component is a step-like change in amplitude. The signal after the step-like change is advantageously coded in sustained coders.
In another preferred embodiment of the invention, the position of the transient signal component is a start position. It is convenient to give the start position of the transient signal component for adaptive framing, wherein a frame starts at the start position of a transient signal component. The start position is used both for the shape function and the adaptive framing, which results in efficient coding. If the start position is given, it is not necessary to determine the start position by combining two parameters as would be necessary in the embodiment described by Edler. The aforementioned and other aspects of the invention will be apparent from and elucidated with reference to the embodiments described hereinafter.
In the drawings: Fig. 1 shows a known envelope function, as already discussed;
Fig. 2 shows an embodiment of an audio coder according to the invention; Fig. 3 shows an example of a shape function according to the invention; Fig. 4 shows a diagram of first and second order running central moments of an input audio signal; Fig. 5 shows an example of a shape function derived for an input audio signal;
Fig. 6 shows an embodiment of an audio player according to the invention; and
Fig. 7 shows a system comprising an audio coder and an audio player; The drawings only show those elements that are necessary to understand the invention.
Fig. 2 shows an audio coder 1 according to the invention, comprising an input unit 10 for obtaining an input audio signal x(t). The audio coder 1 separates the input signal into three components: transient signal components, sustained deterministic components, and sustained stochastic components. The audio coder 1 comprises a transient coder 11, a sinusoidal coder 13 and a noise coder 14. The audio coder optionally comprises a gain compression mechanism (GC) 12.
In this advantageous embodiment of the invention, transient coding is performed before sustained coding. This is advantageous because transient signal components are not efficiently and optimally coded in sustained coders. If sustained coders are used to code transient signal components, a lot of coding effort is necessary, e.g. one can imagine that it is difficult to code a transient signal component with only sustained sinusoids. Therefore, the removal of transient signal components from the audio signal to be coded before sustained coding is advantageous. A transient start position derived in the transient coder is used in the sustained coders for adaptive segmentation (adaptive framing) which results in a further improvement of performance of the sustained coding.
The transient coder 11 comprises a transient detector (TD) 1 10, a transient analyzer (TA) 11 1 and a transient synthesizer (TS) 112. First, the signal x(t) enters the transient detector 110. This detector 1 10 estimates if there is a transient signal component, and at which position. This information is fed to the transient analyzer 11 1. This information may also be used in the sinusoidal coder 13 and the noise coder 14 to obtain advantageous signal-induced segmentation. If the position of the transient signal component is determined, the transient analyzer 1 11 tries to extract (the main part of) the transient signal component. It matches a shape function to a signal segment preferably starting at an estimated start position, and determines content underneath the shape function, e.g. a (small) number of sinusoidal components. This information is contained in the transient code Cj. The transient code C is furnished to the transient synthesizer 112. The synthesized transient signal component is subtracted from the input signal x(t) in subtractor 16, resulting in a signal x\. In case, the GC 12 is omitted, x\ = x2. The signal x is furnished to the sinusoidal coder 13 where it is analyzed in a sinusoidal analyzer (SA) 130, which determines the (deterministic) sinusoidal components. This information is contained in the sinusoidal code Cs. From the sinusoidal code Cs, the sinusoidal signal component is reconstructed by a sinusoidal synthesizer (SS) 131. This signal is subtracted in subtractor 17 from the input 2 to the sinusoidal coder 13, resulting in a remaining signal xj devoid of (large) transient signal components and (main) deterministic sinusoidal components. Therefore, the remaining signal x3 is assumed to mainly consist of noise. It is analyzed for its power content according to an ERB scale in a noise analyzer (NA) 14. The noise analyzer 14 produces a noise code CN- Similar to the situation in the sinusoidal coder 13, the noise analyzer 14 may also use the start position of the transients signal component as a position for starting a new analysis block. The segment sizes of the sinusoidal analyzer 130 and the noise analyzer 14 are not necessarily equal. In a multiplexer 15, an audio stream AS is constituted which includes the codes Or, s and N- The audio stream AS is furnished to e.g. a data bus, an antenna system, a storage medium etc. In the following, a representation of transient signal components according to the invention will be discussed. In this embodiment, the code for transient components Or consists of either a parametric shape plus the additional main frequency components (or other content) underneath the shape or a code for identifying a step-like change. According to a preferred embodiment of the invention, the shape function for a transient that is gradually declining after an initial increase, is preferably a generalized discrete Laguerre function. For other types of transient signal components, other functions may be used.
An example of a generalized discrete Laguerre function, is a Meixner function. A discrete zeroth-order Meixner function g(t) is given by: g(t) = ^(l - ξ2)b,1ξ' (2) where t = 0,1,2,.. and (b)t = b(b+l)...(b+t-l) is a Poclihammer symbol. The parameter b denotes an order of generalization (b > 0) and determines the initial shape of the function: approximately / oc t h~l)/2 for small t. The parameter denotes a pole with 0 < ξ< 1 and determines the decay for larger t. The function g(t) is a positive function for all values of t. For b = 1 , a discrete Laguerre function is obtained. Furthermore, for b = 1 , the z-transform of g is a rational function in z and can thus be realized as an impulse response of a first order infinite impulse response (IIR) filter. For all other values of b there is no rational z-
transform. The function g(t) is energy normalized, i.e. ^ g2 (t) = 1 . The zeroth-order
1 = 0 Meixner-function may be created recursively by: g(0) = (l - ξ2 )bl 2 (3)
g(l) = A^ ±ξg(t - 1) for t >0 (4)
In another embodiment according to the invention, Meixner-like functions are used, because they have a rational z-transform. An example of a Meixner-like function is shown in Fig. 3. A discrete zeroth-order Meixner-like function h(t) is given by its z- transform:
Figure imgf000006_0001
where a = 0,1,2,... and Ca is given by:
Figure imgf000006_0002
where _°a is an αth order Legendre polynomial, given by:
Figure imgf000006_0003
The parameter a denotes the order of generalization (a is a non-negative integer) and ξ is the pole with 0 < ξ < 1. The parameter a determines the initial shape of the function: / κ ta for small t. The parameter determines the decay for large t. The function h is a positive function for all values of t and is energy normalized. For all values of a, the function h has a rational z-transform and can be realized as the impulse response of an IIR filter (of order +1).
The function h(t) can be expressed in a finite discrete Laguerre-series according to:
A(0 = ∑ *„ (0 (8) m=0 where φm are discrete Laguerre functions, see the article of A.C. den Brinker. Bm is given by:
5„ = C ξ <
2 \ α+l / 2 (9)
( - ξy \mJ
First and second order running central moments of a given function f(t) are defined by:
Figure imgf000007_0001
Σ^, ( where k0 is the start position of the transient signal component.
With a good estimation of the running moments T\ and T2 of an input audio signal (take f(t) = x(t) in equations 10 and 1 1), the shape parameters may be deduced. Unfortunately, in real data a transient signal component is usually followed by a sustained excitation phase, disturbing a possible measurement of the running moments. Fig. 4 shows the first and second order running central moments of an input audio signal. It appears that the running moments initially increase linearly from the assumed starting position and later on tend to saturate. Although the shape parameters may be deduced from this curve, because the saturation is not as clear as desired for parameter extraction, i.e. it is not clear enough at which k good estimates of Tj and T2 are obtained. In an advantageous embodiment of the invention, a ratio in initial increase of the running moments T; and r_> is used to deduct the shape parameters. This measurement is advantageous in determining b (and in case of the zeroth-order Meixner function a), since b determines the initial behavior of the shape. From a ratio between slopes of running moments T\ and 72 a good estimation for b is obtained. From simulation results has been obtained that to a very good degree, a linear relation exists between the ratio slope Tj/ slope T2 and the parameter b, which is, in contrast to a Laguerre function, slightly dependent on the decay parameter ξ. As a description may be used (derived by experiments): for Meixner: slope Ti/ slope T2 = b + 1/2 (12) for Meixner-like: slope T slope T2 = 2a + 3/2 (13) wherein a ξ dependence is ignored. Because T and T2 are zero for k = ko, slope Tj/ slope T2 may be approximated by 7// T2 for a suitable k.
The pole ξ of the shape may be estimated in the following way. A second order polynomial is fitted to a running central moment, e.g. Tj. This polynomial is fitted to a signal segment of T\ with observation time T such that leveling off is clearly visible, i.e. a clear second order term in the polynomial fit at T. Next, the second-order polynomial is extrapolated to its maximum and this value is assumed to be the saturation level of T\. From this value for T\ and b, ξis calculated with use of equations 2 and 10, with///) = g(t). For a Meixner-like function, ξ is calculated from the value for T\ and a, with use of equations 8-10, vή iffl = h(t). A procedure for estimation of the decay parameter ξ is as follows: start with some value of T fit a second order polynomial to the data on 0 to T, i.e. Tλ (t) ~ c0 +c] t + c2t2 for t = [0,7] where co,ι,2 are fitting parameters check if the quadratic term of this polynomial is essential at t = T: Tx (T) < (1 - ε)(c0 + cxT) where ε represents a relative contribution of the quadratic term at t =
T. if this is satisfied, then extrapolate 7 (t) to its maximum and equate this with
Figure imgf000008_0001
2
T = c - - 1 ° 4c2 calculate the decay parameter ξ from T\ and b (or a) For Meixner-like functions, the shape parameter a is preferably rounded to integer values.
Fig. 5 shows an example of a shape function derived for an input audio signal. Some pre-processing, like performing a Hubert transform of the data, may be performed in order to get a first approximation of the shape, although pre-processing is not essential to the invention. When the value at which the running moments saturate is large, i.e. in the order of segment/ frame length, the Meixner (-like) shape is discarded. In case the transient is a step-like change in amplitude, the position of the transient is retained for a proper segmentation in the sinusoidal coder and the noise code.
After the start position and the shape of a transient have been determined, the signal content underneath the shape is estimated. A (small) number of sinusoids is estimated underneath the shape. This is done in an analysis-by-synthesis procedure as known in the art. The data that is used to estimate the sinusoids, is a segment which is windowed in order to encompass the transient but not any consequent sustained response. Therefore, a time window is applied to the data before entering the analysis-by-synthesis method. In essence, the signal which is considered extends from the start position to some sample where the shape is reduced to a certain percentage of its maximum. The windowed data may be transformed to a frequency domain, e.g. by a Discrete Fourier Transform (DFT). In order to avoid low-frequency components, which presumably extend beyond the estimated transient, a window in the frequency domain is also applied. Next the maximum response is determined and the frequency associated with this maximum response. The estimated shape is modulated by this frequency, and the best possible fit is made to the data according to some predetermined criterion, e.g. a psycho-acoustic model or in a least-squares sense. This estimated transient segment is subtracted from the original transient and the procedure is repeated until a maximum number of sinusoidal components is exceeded, or hardly any energy is left in the segment. In essence, a transient is represented by a sum of modulated Meixner functions. In a practical embodiment, 6 sinusoids are estimated. If the underlying content mainly contains noise, a noise estimation is used or arbitrary values are given for the frequencies of the sinusoids.
The transient code Or includes a start position of a transient and a type of transient. The code for a transient in the case of a Meixner (-like) shape includes: the start position of the transient an indication that the shape is a Meixner (-like) function shape parameters b (or a) and ξ modulation terms: Np frequency parameters and amplitudes for (co)sine modulated shape In case that the transient is essentially a sudden increase in amplitude level where there is no clear decay in this level (relatively) shortly after the starting position, the transient cannot be encoded with a Meixner (-like) shape. In that case, the start position is retained in order to obtain proper signal segmentation. The code for step-transients includes: the start position of the transient an indicator for the step The performance of the subsequent sustained coding stages (sinusoidal and noise) is improved by using the transient position in the segmentation of the signal. The sinusoidal coder and the noise coder start at a new frame at the position of a detected transient. In this way, one prevents averaging over signal parts, which are known to exhibit non-stationary behavior. This implies that a segment in front of a transient segment has to be shortened, shifted or to be concatenated with a previous frame.
The audio coder 1 according to the invention optionally comprises a gain- control element 12 in front of the sustained coders 13 and 14. It is advantageous for the sustained coders, to prevent changes in amplitude level. For a step-transient, this problem is solved by using a segmentation in accordance with the transients. For transients represented with an shape, the problem is partly solved by extracting the transient from the input signal. The remnant signal still may include a significant dynamic change in amplitude level, presumably shaped similar to the estimated shape. In order to flatten the remnant signal, the gain control element may be used. A compression rate may be defined by: 8 (t) = —±-~ (12)
1 + dh(t) wherein h(t) is the estimated shape and d is a parameter describing a compression rate. The gain-control element assumes that after a transient, a stationary phase occurs with amplitude excursions amounting to about 0.2 times the maximum in the estimated shape. A ratio r is defined by: . - 0.2 „ r = — '- (13)
0.2 . wherein Mτ is the maximum of the remnant signal.
The compression rate parameter d is equal to r if r > 2, otherwise d is taken 0. For the compression, only d needs to be transmitted.
Fig. 6 shows an audio player 3 according to the invention. An audio stream AS', e.g. generated by an encoder according to Fig. 2, is obtained from a data bus, an antenna system, a storage medium etc. The audio stream AS is de-multiplexed in a de-multiplexer 30 to obtain the codes Or', Cs' and CN' . These codes are furnished to a transient synthesizer 31, a sinusoidal synthesizer 32 and a noise synthesizer 33 respectively. From the transient code Or', the transient signal components are calculated in the transient synthesizer 31. In case the transient code indicates an shape function, the shape is calculated based on the received parameters. Further, the shape content is calculated based on the frequencies and amplitudes of the sinusoidal components. If the transient code Or' indicates a step, then no transient is calculated. The total transient signal yγ is a sum of all transients.
In case the decompression parameter d is used, i.e. if derived in the coder 1 and included in the audio stream AS', a decompression mechanism 34 is used. The gain signal g(t) is initialized at unity, and the total amplitude decompression factor is calculated as the product of all the different decompression factors. In case the transient is a step, no amplitude decompression factor is calculated.
From two subsequent transient positions, a segmentation for the sinusoidal synthesis SS 32 and the noise synthesis NS 33 is calculated. The sinusoidal code Cs is used to generate signal ys, described as a sum of sinusoids on a given segment. The noise code CN is used to generate a noise signal y^. Subsequent segments are added by, e.g. an overlap-add method.
The total signal y(t) consists of the sum of the transient signal yγ and the product of the amplitude decompression g and the sum of the sinusoidal signal y and the noise signal y^. The audio player comprises two adders 36 and 37 to sum respective signals. The total signal is furnished to an output unit 35, which is e.g. a speaker.
Fig. 7 shows an audio system according to the invention comprising an audio coder 1 as shown in Fig. 2 and an audio player 3 as shown in Fig. 6. Such a system offers playing and recording features. The audio stream AS is furnished from the audio coder to the audio player over a communication channel 2, which may be a wireless connection, a data bus or a storage medium. In case the communication channel 2 is a storage medium, the storage medium may be fixed in the system or may also be a removable disc, memory stick etc. The communication channel 2 may be part of the audio system, but will however often be outside the audio system. It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design many alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word 'comprising' does not exclude the presence of other elements or steps than those listed in a claim. The invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a device claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.
In summary, the invention provides coding and decoding of an audio signal including estimating a position of a transient signal component in the audio signal, matching a shape function on the transient signal component in case the transient signal component is gradually declining after an initial increase, which shape function has a substantially exponential initial behavior and a substantially logarithmic declining behavior; and including the position and parameters describing the shape function in an audio stream.

Claims

CLAIMS:
1. A method of encoding (1) an audio signal (x), the method comprising the steps of: estimating (110) a position of a transient signal component in the audio signal; matching (1 11,112) a shape function on the transient signal component in case the transient signal component is gradually declining after an initial increase, which shape function has a substantially exponential initial behavior and a substantially logarithmic declining behavior; and including (15) the position and shape parameters describing the shape function in an audio stream (AS).
2. A method as claimed in claim 1 , wherein the shape function is a Laguerre function or a generalized discrete Laguerre function.
3. A method as claimed in claim 2, wherein shape function is a Meixner function or a Meixner-like function.
4. A method as claimed in claim 2, wherein at least one of the shape parameters is determined by a ratio of slopes of running first and a second order moments of the audio signal (x).
5. A method as claimed in claim 1, wherein the shape parameters include a step indication in case the transient signal component is a step-like change in amplitude.
6. A method as claimed in claim 1, wherein the position of the transient signal component is a start position.
7. A method as claimed in claim 1, the method further comprising: flattening a part of the audio signal that is furnished to at least one sustained coding stage by using the shape function in a gain control mechanism.
8. Method of decoding an audio stream, the method comprising the steps of: generating (31) a transient signal component at a given position; and calculating (31) a shape function based on received shape parameters, which shape function has a substantially exponential initial behavior and a substantially logarithmic declining behavior.
9. Audio coder (1), comprising: means for estimating (110) a position of a transient signal component in the audio signal; means for matching (111 ,112) a shape function on the transient signal component in case the transient signal component is gradually declining after an initial increase, which shape function has a substantially exponential initial behavior and has a substantially logarithmic declining behavior; and means for including (15) the position and shape parameters describing the shape function in an audio stream (AS).
10. Audio player (3), comprising means for generating (31) a transient signal component at a given position; and means for calculating (31 ) a shape function based on received shape parameters, which shape function has a substantially exponential initial behavior and a substantially logarithmic declining behavior.
11. Audio system comprising an audio coder (1) as claimed in claim 9 and an audio player (2) as claimed in claim 10.
12. Audio stream (AS) comprising: a position of a transient signal component; and shape parameters describing an shape function which shape function has a substantially exponential initial behavior and a substantially logarithmic declining behavior.
13. Storage medium (2) on which an audio stream (AS) as claimed in claim 12 has been stored.
PCT/EP2001/002424 2000-03-15 2001-03-05 Laguerre fonction for audio coding WO2001069593A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
DE60129771T DE60129771T2 (en) 2000-03-15 2001-03-05 LAGUERRE FUNCTION FOR AUDIO CODING
EP01929398A EP1190415B1 (en) 2000-03-15 2001-03-05 Laguerre function for audio coding
JP2001567585A JP4803938B2 (en) 2000-03-15 2001-03-05 Laguerre function for audio coding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP00200939.7 2000-03-15
EP00200939 2000-03-15

Publications (1)

Publication Number Publication Date
WO2001069593A1 true WO2001069593A1 (en) 2001-09-20

Family

ID=8171205

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2001/002424 WO2001069593A1 (en) 2000-03-15 2001-03-05 Laguerre fonction for audio coding

Country Status (9)

Country Link
US (2) US6925434B2 (en)
EP (1) EP1190415B1 (en)
JP (1) JP4803938B2 (en)
KR (1) KR100780561B1 (en)
CN (1) CN1154975C (en)
AT (1) ATE369600T1 (en)
DE (1) DE60129771T2 (en)
ES (1) ES2292581T3 (en)
WO (1) WO2001069593A1 (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1385150A1 (en) * 2002-07-24 2004-01-28 STMicroelectronics Asia Pacific Pte Ltd. Method and system for parametric characterization of transient audio signals
JP2004538502A (en) * 2001-06-08 2004-12-24 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Editing audio signals
WO2005008628A1 (en) 2003-07-18 2005-01-27 Koninklijke Philips Electronics N.V. Low bit-rate audio encoding
WO2005024783A1 (en) 2003-09-05 2005-03-17 Koninklijke Philips Electronics N.V. Low bit-rate audio encoding
WO2005036529A1 (en) 2003-10-13 2005-04-21 Koninklijke Philips Electronics N.V. Audio encoding
WO2005122146A1 (en) * 2004-06-08 2005-12-22 Koninklijke Philips Electronics N.V. Audio encoding
WO2006051451A1 (en) * 2004-11-09 2006-05-18 Koninklijke Philips Electronics N.V. Audio coding and decoding
CN1319043C (en) * 2001-10-26 2007-05-30 皇家飞利浦电子股份有限公司 Tracking of sine parameter in audio coder
US7516066B2 (en) 2002-07-16 2009-04-07 Koninklijke Philips Electronics N.V. Audio coding
US7542896B2 (en) 2002-07-16 2009-06-02 Koninklijke Philips Electronics N.V. Audio coding/decoding with spatial parameters and non-uniform segmentation for transients
US7587313B2 (en) 2004-03-17 2009-09-08 Koninklijke Philips Electronics N.V. Audio coding
US7664633B2 (en) 2002-11-29 2010-02-16 Koninklijke Philips Electronics N.V. Audio coding via creation of sinusoidal tracks and phase determination

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1190415B1 (en) * 2000-03-15 2007-08-08 Koninklijke Philips Electronics N.V. Laguerre function for audio coding
ATE334556T1 (en) * 2001-04-18 2006-08-15 Koninkl Philips Electronics Nv AUDIO CODING WITH PARTIAL ENCRYPTION
WO2005024784A1 (en) * 2003-09-09 2005-03-17 Koninklijke Philips Electronics N.V. Encoding of transient audio signal components
JP2007512572A (en) * 2003-12-01 2007-05-17 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Audio encoding
EP1914722B1 (en) 2004-03-01 2009-04-29 Dolby Laboratories Licensing Corporation Multichannel audio decoding
JP4318119B2 (en) * 2004-06-18 2009-08-19 国立大学法人京都大学 Acoustic signal processing method, acoustic signal processing apparatus, acoustic signal processing system, and computer program
US20090106030A1 (en) * 2004-11-09 2009-04-23 Koninklijke Philips Electronics, N.V. Method of signal encoding
CN101479790B (en) * 2006-06-29 2012-05-23 Nxp股份有限公司 Noise synthesis
KR20080073925A (en) * 2007-02-07 2008-08-12 삼성전자주식회사 Method and apparatus for decoding parametric-encoded audio signal
KR101317269B1 (en) * 2007-06-07 2013-10-14 삼성전자주식회사 Method and apparatus for sinusoidal audio coding, and method and apparatus for sinusoidal audio decoding
KR20090008611A (en) * 2007-07-18 2009-01-22 삼성전자주식회사 Audio signal encoding method and appartus therefor
KR101441897B1 (en) * 2008-01-31 2014-09-23 삼성전자주식회사 Method and apparatus for encoding residual signals and method and apparatus for decoding residual signals
US9111525B1 (en) * 2008-02-14 2015-08-18 Foundation for Research and Technology—Hellas (FORTH) Institute of Computer Science (ICS) Apparatuses, methods and systems for audio processing and transmission
CN101770776B (en) * 2008-12-29 2011-06-08 华为技术有限公司 Coding method and device, decoding method and device for instantaneous signal and processing system
US9055374B2 (en) * 2009-06-24 2015-06-09 Arizona Board Of Regents For And On Behalf Of Arizona State University Method and system for determining an auditory pattern of an audio segment
CN102419977B (en) * 2011-01-14 2013-10-02 展讯通信(上海)有限公司 Method for discriminating transient audio signals
EP3382700A1 (en) * 2017-03-31 2018-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for post-processing an audio signal using a transient location detection

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1986003873A1 (en) * 1984-12-20 1986-07-03 Gte Laboratories Incorporated Method and apparatus for encoding speech
JPH01165000A (en) * 1987-12-21 1989-06-29 Sony Corp Vocal sound section information forming apparatus
JPH02226300A (en) * 1989-02-28 1990-09-07 Sony Corp Phoneme section information generating device
DK46493D0 (en) * 1993-04-22 1993-04-22 Frank Uldall Leonhard METHOD OF SIGNAL TREATMENT FOR DETERMINING TRANSIT CONDITIONS IN AUDITIVE SIGNALS
CA2168327C (en) 1995-01-30 2000-04-11 Shinichi Kikuchi A recording medium on which a data containing navigation data is recorded, a method and apparatus for reproducing a data according to navigationdata, a method and apparatus for recording a data containing navigation data on a recording medium.
JP3266819B2 (en) * 1996-07-30 2002-03-18 株式会社エイ・ティ・アール人間情報通信研究所 Periodic signal conversion method, sound conversion method, and signal analysis method
GB2318029B (en) * 1996-10-01 2000-11-08 Nokia Mobile Phones Ltd Audio coding method and apparatus
JPH10282995A (en) * 1997-04-01 1998-10-23 Matsushita Electric Ind Co Ltd Method of encoding missing voice interpolation, missing voice interpolation encoding device, and recording medium
SE512719C2 (en) * 1997-06-10 2000-05-02 Lars Gustaf Liljeryd A method and apparatus for reducing data flow based on harmonic bandwidth expansion
DE19747132C2 (en) * 1997-10-24 2002-11-28 Fraunhofer Ges Forschung Methods and devices for encoding audio signals and methods and devices for decoding a bit stream
DE69932861T2 (en) * 1999-10-30 2007-03-15 Stmicroelectronics Asia Pacific Pte Ltd. METHOD FOR CODING AN AUDIO SIGNAL WITH A QUALITY VALUE FOR BIT ASSIGNMENT
EP1190415B1 (en) * 2000-03-15 2007-08-08 Koninklijke Philips Electronics N.V. Laguerre function for audio coding

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
AL JABRI A K ET AL: "LAGUERRE TRANSFORM FOR SPEECH COMPRESSION", PROCEEDINGS OF THE INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE,US,NEW YORK, IEEE, vol. CONF. 10, 10 May 1994 (1994-05-10), pages 737 - 740, XP000509610, ISBN: 0-7803-1881-1 *
BELT H J W ET AL: "Filter synthesis by discrete-time generalized Laguerre networks", ECCTD '95 PROCEEDINGS OF THE 12TH EUROPEAN CONFERENCE ON CIRCUIT THEORY AND DESIGN, PROCEEDINGS OF EUROPEAN CONFERENCE ON CIRCUIT THEORY AND DESIGN: ECCTD '95 (ISBN 975 561 061 8) SET OF TWO VOL, ISTANBUL, TURKEY, 27-31 AUG. 1995, 1995, Istanbul, Turkey, Istanbul Tech. Univ, Turkey, pages 971 - 974 vol.2, XP002940549, ISBN: 975-561-061-8 *
MANDYAM G ET AL: "Application of the discrete Laguerre transform to speech coding", CONFERENCE RECORD OF THE TWENTY-NINTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, CONFERENCE RECORD OF THE TWENTY-NINTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, PACIFIC GROVE, CA, USA, 30 OCT.-2 NOV. 1995, 1996, Los Alamitos, CA, USA, IEEE Comput. Soc. Press, USA, pages 1225 - 1228 vol.2, XP002171527, ISBN: 0-8186-7370-2 *
SARROUHK B E ET AL: "Optimal parameters in modulated Laguerre series expansions", ISSPA '99. PROCEEDINGS OF THE FIFTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS (IEEE CAT. NO.99EX359), BRISBANE, QLD., AUSTRALIA, 22-25 AUG, 1999, Brisbane, Qld., Australia, Queensland Univ. Technol, Australia, pages 187 - 190 vol.1, XP002171528, ISBN: 1-86435-451-8 *
SOHN J ET AL: "A LOW RESOLUTION PULSE POSITION CODING METHOD FOR IMPROVED EXCITATION MODELING OF SPEECH TRANSITION", PHOENIX, AZ, MARCH 15 - 19, 1999,NEW YORK, NY: IEEE,US, 15 March 1999 (1999-03-15), pages 265 - 268, XP000900109, ISBN: 0-7803-5042-1 *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004538502A (en) * 2001-06-08 2004-12-24 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Editing audio signals
CN1319043C (en) * 2001-10-26 2007-05-30 皇家飞利浦电子股份有限公司 Tracking of sine parameter in audio coder
US7516066B2 (en) 2002-07-16 2009-04-07 Koninklijke Philips Electronics N.V. Audio coding
US7542896B2 (en) 2002-07-16 2009-06-02 Koninklijke Philips Electronics N.V. Audio coding/decoding with spatial parameters and non-uniform segmentation for transients
EP1385150A1 (en) * 2002-07-24 2004-01-28 STMicroelectronics Asia Pacific Pte Ltd. Method and system for parametric characterization of transient audio signals
US7664633B2 (en) 2002-11-29 2010-02-16 Koninklijke Philips Electronics N.V. Audio coding via creation of sinusoidal tracks and phase determination
WO2005008628A1 (en) 2003-07-18 2005-01-27 Koninklijke Philips Electronics N.V. Low bit-rate audio encoding
JP4782006B2 (en) * 2003-07-18 2011-09-28 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Low bit rate audio encoding
JP2007519027A (en) * 2003-07-18 2007-07-12 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Low bit rate audio encoding
WO2005024783A1 (en) 2003-09-05 2005-03-17 Koninklijke Philips Electronics N.V. Low bit-rate audio encoding
US7725310B2 (en) 2003-10-13 2010-05-25 Koninklijke Philips Electronics N.V. Audio encoding
WO2005036529A1 (en) 2003-10-13 2005-04-21 Koninklijke Philips Electronics N.V. Audio encoding
US7587313B2 (en) 2004-03-17 2009-09-08 Koninklijke Philips Electronics N.V. Audio coding
WO2005122146A1 (en) * 2004-06-08 2005-12-22 Koninklijke Philips Electronics N.V. Audio encoding
WO2006051451A1 (en) * 2004-11-09 2006-05-18 Koninklijke Philips Electronics N.V. Audio coding and decoding

Also Published As

Publication number Publication date
US20050187760A1 (en) 2005-08-25
US20010032087A1 (en) 2001-10-18
KR20010113950A (en) 2001-12-28
DE60129771D1 (en) 2007-09-20
DE60129771T2 (en) 2008-04-30
US7499852B2 (en) 2009-03-03
EP1190415B1 (en) 2007-08-08
JP4803938B2 (en) 2011-10-26
US6925434B2 (en) 2005-08-02
CN1364290A (en) 2002-08-14
EP1190415A1 (en) 2002-03-27
JP2003527632A (en) 2003-09-16
CN1154975C (en) 2004-06-23
ATE369600T1 (en) 2007-08-15
KR100780561B1 (en) 2007-11-29
ES2292581T3 (en) 2008-03-16

Similar Documents

Publication Publication Date Title
US7499852B2 (en) Audio coding using a shape function
JP5185254B2 (en) Audio signal volume measurement and improvement in MDCT region
US5781880A (en) Pitch lag estimation using frequency-domain lowpass filtering of the linear predictive coding (LPC) residual
EP2207170A1 (en) System for audio decoding with filling of spectral holes
CN104395958B (en) Effective pre-echo attenuation in digital audio and video signals
EP1903558B1 (en) Audio signal interpolation method and device
RU2719543C1 (en) Apparatus and method for determining a predetermined characteristic relating to processing of artificial audio signal frequency band limitation
US20020120445A1 (en) Coding signals
JP6181773B2 (en) Noise filling without side information for CELP coder
US7197454B2 (en) Audio coding
US20060015328A1 (en) Sinusoidal audio coding
JP4359499B2 (en) Editing audio signals
JP2020170187A (en) Methods and Devices for Identifying and Attenuating Pre-Echoes in Digital Audio Signals
KR102426050B1 (en) Pitch Delay Selection
KR101008529B1 (en) Sinusoid selection in audio encoding
Helen et al. Perceptually motivated parametric representation for harmonic sounds for data compression purposes
EP0713208B1 (en) Pitch lag estimation system
KR20050085761A (en) Sinusoid selection in audio encoding
KR100427053B1 (en) Effective section detecting method using the freqency domain energy, and the iterative playing method using the same
US20060149539A1 (en) Method for separating a sound frame into sinusoidal components and residual noise
Yim et al. Comparison of arma modelling methods for low bit rate speech coding
Brown Solid-State Liquid Chemical Sensor Testing Issues
Hernandez-Gomez et al. Short-time synthesis procedures in vector adaptive transform coding of speech

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 01800515.2

Country of ref document: CN

AK Designated states

Kind code of ref document: A1

Designated state(s): CN JP KR

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

WWE Wipo information: entry into national phase

Ref document number: 2001929398

Country of ref document: EP

121 Ep: the epo has been informed by wipo that ep was designated in this application
ENP Entry into the national phase

Ref document number: 2001 567585

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 1020017014581

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 1020017014581

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2001929398

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 2001929398

Country of ref document: EP