GB2322776A - Backward adaptive prediction of audio signals - Google Patents

Backward adaptive prediction of audio signals

Info

Publication number
GB2322776A
Authority
GB
United Kingdom
Prior art keywords
spectral
values
value
predicted
prediction coefficients
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB9802611A
Other versions
GB2322776B (en)
GB9802611D0 (en)
Inventor
Lin Yin
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Oyj
Original Assignee
Nokia Mobile Phones Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Mobile Phones Ltd filed Critical Nokia Mobile Phones Ltd
Publication of GB9802611D0 publication Critical patent/GB9802611D0/en
Publication of GB2322776A publication Critical patent/GB2322776A/en
Application granted granted Critical
Publication of GB2322776B publication Critical patent/GB2322776B/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)

Abstract

Consecutive frames of an audio signal are transformed into the frequency domain using a modified discrete cosine transform (MDCT) so as to generate a stream of spectral data values for each spectral component. A set of linear prediction coefficients are calculated for each spectral value and are based on a predetermined number of previously determined reconstructed values. The coefficients are recalculated only after receiving several spectral values and thus the same coefficients are used for several consecutive predictions before being updated. A predicted spectral value is generated using the coefficients and the error between the predicted and actual spectral values is calculated. The rate at which the coefficients are updated may be variable. A decoder for receiving and decoding the error values is also disclosed. Application is to a mobile phone system.

Description

Audio Coding Method and Apparatus
The present invention relates to a method for coding and decoding electronic signals and to apparatus for carrying out such a method.
It is well known that the transmission of data in digital form provides for increased signal to noise ratios and increased information capacity along the transmission channel. There is however a continuing desire to further increase channel capacity by compressing digital signals to an ever greater extent. In relation to audio signals, two basic compression principles are conventionally applied. The first of these involves removing the statistical or deterministic redundancies in the source signal whilst the second involves suppressing or eliminating from the source signal elements which are redundant in so far as human perception is concerned. Recently, the latter principle has become predominant in high quality audio applications and typically involves the separation of an audio signal into frequency components (sometimes called 'sub-bands'), each of which is analysed and quantized with a quantisation accuracy determined to remove data irrelevancy (to the listener). The ISO (International Standards Organisation) MPEG (Moving Pictures Expert Group) audio coding standard and other audio coding standards employ and further define this principle. However, MPEG (and other standards) also employs a technique known as 'adaptive prediction' to produce a further reduction in data rate.
A particular form of adaptive prediction is known as 'backward adaptive lattice prediction'. Fuchs et al, 'Improving MPEG Audio Coding by Backward Adaptive Linear Stereo Prediction', AES Convention, New York, Preprint 4086, Oct. 1995, describes one such backward adaptive lattice prediction algorithm. For each spectral value (the 'current' value) of each frequency component, backward adaptive lattice prediction generates a set of prediction coefficients in the coder from the previously calculated spectral values of that component (via the intermediate calculation of quantized spectral values). These coefficients are then used to predict the value of the current spectral value. The error between the current spectral value and the predicted spectral value is determined and it is this error value (after quantisation) which is transmitted to the receiver. It will be appreciated that at any given time, the current prediction coefficients have effectively been derived from all previously received sample values. At the receiver, the coefficients are similarly calculated and reconstructed spectral values obtained by combining the predicted spectral values with the received error values.
In certain algorithms employing backward adaptive prediction, it is often the case that a measure of the compression achieved is determined during the compression process and the error values sent only if positive compression gain is achieved. If not, then the actual quantized frequency component signals are transmitted instead.
The new MPEG-2 AAC standard employs psychoacoustic modeling and backward adaptive linear prediction with 1024 frequency components. It is envisaged that the new MPEG-4 VM standard will have similar requirements. However, such a large number of frequency components results in a large computational overhead due to the complexity of the prediction algorithm and also requires the availability of large areas of memory to store the calculated coefficients. Additionally, with backward adaptive lattice prediction, even when the predictors are turned 'off' (e.g. when no compression advantage can be obtained by transmitting the error values), the decoder must continue to determine the coefficients so that the predictors can be turned 'on' again when required without any temporary degradation in performance. This provides an additional computation overhead.
According to a first aspect of the present invention there is provided a method of coding an audio electrical signal using backward adaptive prediction, the method comprising the steps of:
(a) receiving a first time frame of an audio electrical signal to be coded; (b) transforming the time frame into the frequency domain to generate a frequency spectrum having 512 or more spectral components; (c) receiving subsequent time frames of said audio electrical signal and repeating step (b) for these frames in sequence to generate a stream of spectral data values for each spectral component; (e) for each said stream, calculating a set of prediction coefficients for each spectral value using the covariances of a predetermined number of previously determined reconstructed spectral values of the stream, using said set of prediction coefficients to generate a predicted spectral value, and calculating the error between the predicted spectral value and the corresponding actual spectral value, wherein the calculated errors provide a coded representation of the spectral value stream and said errors can be recombined with predicted spectral values to obtain reconstructed spectral values.
The method of the present invention does not directly calculate a set of prediction coefficients from all preceding spectral components as is the case with conventional backward adaptive prediction algorithms. That is to say that the prediction coefficients are recalculated for each spectral value and are not merely adapted from the previously calculated set. Thus, during periods when the predictor is turned off, there is no requirement to continue updating the coefficients at the decoder.
The present invention overcomes or at least mitigates one or more of the above disadvantages by utilising a backward adaptive prediction algorithm which acts upon a relatively large number of frequency components of an audio signal to be coded and which calculates prediction coefficients for a component from a predetermined number of previously received sample values of that component.
It has been discovered that, whilst backward adaptive prediction algorithms which calculate prediction coefficients from the covariances of a predetermined number of previous spectral values are generally not suitable for coding audio signals sub-divided into a relatively small number of frequency sub-bands (e.g. 32), such prediction algorithms are appropriate when the audio signal is sub-divided into a relatively large number of frequency sub-bands (e.g. 1024 as defined in the draft MPEG-4 standard). This is because, when a large number of sub-bands are defined, the order of the prediction algorithm (that is the number of prediction coefficients) can be low and algorithms embodying the present invention offer high performance and are computationally efficient for low orders. Preferably, the prediction order is one or two. More preferably, the prediction order is two.
Preferably, said predetermined number of previously received consecutive spectral values are used to derive a corresponding number of quantized spectral values. It is then the quantized values which are used to calculate said prediction coefficients.
Preferably, the time windows taken from the audio signal are overlapping. For example, each window may contain 2048 sample points with adjacent windows having a 50% overlap. However, the windows may also be contiguous.
In certain embodiments of the invention, a new set of prediction coefficients may be calculated for each and every spectral value. However, in other embodiments it may be more computationally efficient to recalculate the prediction coefficients for only every second or third (or other multiple) spectral value and to use the same coefficients for several consecutive spectral values. It may also be appropriate to provide for switching between a low coefficient update rate (e.g. every second value) and a high update rate (e.g. for every spectral value) immediately upon detection of a transient in the audio signal.
The lower limit on the predetermined number of previously received sample points used to calculate each set of prediction coefficients is determined by the coding quality required. Preferably, however, the number is four or more. The upper limit on this number is determined by memory and computational constraints. Preferably the number is ten or less. More preferably the predetermined number is six.
Any suitable method for evaluating the prediction coefficients may be used, e.g. an autocorrelation method. However, it has been found that the least squares method is particularly advantageous.
Preferably, the prediction coefficients used to calculate predicted spectral values are linear prediction coefficients.
It will be appreciated that the present invention is intended for use with psychoacoustic compensation and that quantisation of the error signals may be controlled accordingly.
According to a second aspect of the present invention there is provided a method of decoding an audio electrical signal encoded using the method of the above first aspect, the decoding method comprising the steps of:
receiving as an input signal a sequence of error values corresponding to the coded audio signal and separating these values into spectral component streams; for each stream, determining a corresponding predicted spectral component value for each error value using a set of prediction coefficients, the prediction coefficients being calculated using covariances of a predetermined number of previously determined consecutive predicted spectral component values for that stream, and combining the error value and the predicted spectral value to provide a reconstructed spectral value; and substantially reconstructing said audio signal by combining and frequency-to-time transforming the reconstructed spectral values of all of the streams.
It will be appreciated that the specific implementation details of the coding method will to a large extent determine the implementation details of the decoding method, e.g. prediction order.
According to a third aspect of the present invention there is provided apparatus for coding an audio electrical signal using backward adaptive prediction, the apparatus comprising:
an input for receiving an audio electrical signal to be coded; a time-to-frequency domain transformer for transforming sequentially received time frames of the received signal from the time domain to the frequency domain to provide frequency spectra having 512 or more spectral components; signal processing means associated with each spectral component for receiving as a stream the associated spectral values, for calculating for each spectral value a set of prediction coefficients using covariances of a predetermined number of previously reconstructed spectral values, for using said set of prediction coefficients to generate a predicted spectral value, and for calculating the error between the predicted value and the corresponding actual spectral value, the calculated errors providing a coded representation of the received spectral value stream and wherein said errors can be recombined with predicted spectral values to obtain reconstructed spectral values.
According to a fourth aspect of the present invention there is provided apparatus for decoding an audio electrical signal encoded using the apparatus of the above third aspect of the present invention, the apparatus comprising:
an input for receiving a sequence of error values corresponding to the coded audio signal; and signal processing means for separating said sequence of values into separate spectral component streams and for determining for each error value a corresponding predicted spectral value using a set of prediction coefficients, the signal processing means being arranged to calculate the prediction coefficients using covariances of a predetermined number of previously determined consecutive reconstructed spectral values, the signal processing means being further arranged to combine each error value with the corresponding predicted spectral value to provide a reconstructed spectral value and to substantially reconstruct said audio signal by combining and frequency-to-time transforming the reconstructed spectral values of all of the sub-bands.
According to a fifth aspect of the present invention there is provided a communications system comprising in combination the apparatus of the third and fourth aspect of the present invention.
According to a sixth aspect of the present invention there is provided a mobile communication device comprising apparatus according to the third and fourth aspect of the present invention.
For a better understanding of the present invention and in order to show how the same may be carried into effect reference will now be made, by way of example, to the accompanying drawings, in which:
Figure 1 shows schematically apparatus for coding an audio signal using backward adaptive prediction according to an embodiment of the present invention; Figure 2 shows schematically apparatus for decoding an audio signal encoded with the apparatus of Figure 1; and Figure 3 shows a mobile telephone incorporating the apparatus of Figures 1 and 2.
With reference to Figure 1, a pulse code modulated (PCM) audio input signal g(t) to be coded is provided at the input to a first signal processing unit 1 of a coding apparatus. This first unit 1 is arranged to transform the input signal g(t) from the time to the frequency domain on a frame by frame basis, each frame n consisting of 2048 sample values and adjacent frames having a 50% overlap. More particularly, the unit 1 employs a modified discrete cosine transform (MDCT) to transform the signal into the frequency domain such that the output of the unit 1 consists of 1024 separate streams of spectral values $x_j(n)$, each stream j corresponding to a different spectral component. It is noted that other transform methods may be used, e.g. a Fourier transform.
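The framing and transform step can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, no analysis window is applied because none is specified in the text above, the MDCT is evaluated directly from its textbook definition rather than with a fast algorithm, and a 16-sample frame stands in for the 2048-sample frame so that the direct form runs quickly.

```python
import math

def frames_with_overlap(signal, frame_len=2048):
    """Split a sequence into frames of frame_len samples with 50% overlap."""
    hop = frame_len // 2
    return [signal[s:s + frame_len]
            for s in range(0, len(signal) - frame_len + 1, hop)]

def mdct(frame):
    """Direct O(N^2) MDCT: 2N input samples give N spectral values.

    No window is applied here (assumption); a practical coder would
    window each frame before transforming it.
    """
    two_n = len(frame)
    n_half = two_n // 2
    n0 = 0.5 + n_half / 2.0
    return [sum(frame[n] * math.cos(math.pi / n_half * (n + n0) * (k + 0.5))
                for n in range(two_n))
            for k in range(n_half)]

# Small illustrative sizes: 16-sample frames give 8 spectral components.
signal = [math.sin(0.3 * t) for t in range(64)]
frames = frames_with_overlap(signal, frame_len=16)
spectra = [mdct(f) for f in frames]

# streams[j][n] is the value of spectral component j in frame n,
# i.e. one stream per spectral component as described in the text.
streams = [list(column) for column in zip(*spectra)]
```

With the sizes given in the text (2048-sample frames, hop of 1024), the same layout would yield 1024 streams, each gaining one new value per frame.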
Each stream of data values $x_j(n)$ is provided to the corresponding input of a backward adaptive predictor 2, the operation of which is described in detail below. In general terms, for each spectral value $x_j(n)$ of each stream, the predictor 2 calculates a set of prediction coefficients $a_j(n)$ using subsequently derived reconstructed quantized spectral values, in turn derived from previously received spectral values of that stream. The prediction coefficients are in turn used to calculate an error value $e_j(n)$ for the spectral value. The error values for each stream are provided to the input of a quantiser 3 which is arranged to generate quantized errors $\tilde{e}_j(n)$ for subsequent digital transmission. The quantized errors $\tilde{e}_j(n)$ are provided to a multiplexer 4, which generates a multiplexed error signal 9 for transmission, and are also fed back to the predictor 2.
A further signal processing unit 5 is also provided for controlling the operation of the signal processing unit 1 and the quantiser 3 in dependence upon the psychoacoustic characteristics of the input audio signal g(t). The operation of this unit is conventional and will not be described in detail here.
For each spectral component j, $x(n)$, $\hat{x}(n)$ and $\tilde{x}(n)$ are respectively the input signal to the predictor 2, the predictor output signal, and the reconstructed quantized signal, and $e(n)$ and $\tilde{e}(n)$ are the prediction error signal and the quantized prediction error signal. The set of prediction coefficients can be represented by:
$$\mathbf{a}(n) = [a_1(n), a_2(n), \ldots, a_P(n)]^T$$
which is time dependent and where superscript T represents the transpose. The output signal of the predictor 2, $\hat{x}(n)$, is calculated by:
$$\hat{x}(n) = \mathbf{a}(n)^T \tilde{\mathbf{x}}(n) = \sum_{i=1}^{P} a_i(n)\,\tilde{x}(n-i)$$
where
$$\tilde{\mathbf{x}}(n) = [\tilde{x}(n-1), \tilde{x}(n-2), \ldots, \tilde{x}(n-P)]^T$$
and P is the prediction order, i.e. the number of coefficients. The predictor error is
$$e(n) = x(n) - \hat{x}(n)$$
and the reconstructed quantized signal is
$$\tilde{x}(n) = \hat{x}(n) + \tilde{e}(n)$$
The calculation of the predictor coefficients is based on minimizing the mean square prediction error. $\mathbf{a}(n)$ can be expressed as
$$\mathbf{a}(n) = \mathbf{R}^{-1}(n)\,\mathbf{r}(n)$$
where
$$\mathbf{R}(n) = E[\tilde{\mathbf{x}}(n)\,\tilde{\mathbf{x}}^T(n)] \quad \text{and} \quad \mathbf{r}(n) = E[\tilde{\mathbf{x}}(n)\,\tilde{x}(n)]$$
and the symbol E represents the expectation.
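The prediction, error and reconstruction relations can be traced for a single stream with a short Python sketch. It is an illustration only: the coefficients are held fixed rather than adapted, the uniform quantizer and its step size are assumptions (the text leaves quantisation to the psychoacoustic control), the history is seeded with zeros, and all names are hypothetical.

```python
def quantize(value, step):
    """Uniform quantizer (assumed): nearest multiple of step."""
    return step * round(value / step)

def encode_stream(x, coeffs, step):
    """Return (quantized errors, reconstructed values) for one stream.

    coeffs[i - 1] multiplies the reconstructed value i samples back,
    so the prediction is the sum over i of a_i * x~(n - i).
    """
    order = len(coeffs)
    recon = [0.0] * order                     # zero-seeded history (assumption)
    q_errors = []
    for sample in x:
        past = recon[-1:-order - 1:-1]        # [x~(n-1), ..., x~(n-P)]
        predicted = sum(a * v for a, v in zip(coeffs, past))
        q_err = quantize(sample - predicted, step)   # quantized e(n)
        q_errors.append(q_err)
        recon.append(predicted + q_err)       # x~(n) = x^(n) + e~(n)
    return q_errors, recon[order:]

# Illustrative values only; the coefficients are not taken from the text.
x = [1.0, 1.5, 1.8, 1.9, 1.7, 1.2, 0.6, 0.1]
q_errors, recon = encode_stream(x, coeffs=[0.9, -0.2], step=0.1)
```

Because the quantized error is added back to the same prediction it was measured against, each reconstructed value stays within half a quantizer step of the input regardless of how good the prediction is.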
It will be appreciated that once the autocorrelation functions r(n) are obtained, the linear predictors can be obtained by solving the normal equation. However, here a least squares algorithm is presented to estimate the linear predictor coefficients sample by sample. The least squares method often gives a better estimate of the linear prediction coefficients than the autocorrelation method, especially when the amount of available data is small. It will be shown in the following that when the order of the predictor is low, in particular only two, the complexity of the least squares algorithm is comparable to or less than that of the adaptive lattice algorithm of the prior art.
Assume again that the reconstructed quantized signal is denoted by $\tilde{x}(n)$. For a prediction order of two and a block length of L, the covariances of the reconstructed signal are computed by
$$r_{0,0} = \sum_{i=2}^{L-1} \tilde{x}^2(n-i), \quad r_{1,1} = \sum_{i=2}^{L-1} \tilde{x}^2(n-i+1), \quad r_{0,1} = r_{1,0} = \sum_{i=2}^{L-1} \tilde{x}(n-i+1)\,\tilde{x}(n-i)$$
$$r_1 = \sum_{i=2}^{L-1} \tilde{x}(n-i+2)\,\tilde{x}(n-i), \quad r_2 = \sum_{i=2}^{L-1} \tilde{x}(n-i+2)\,\tilde{x}(n-i+1)$$
An efficient algorithm would be
$$temp_1 = \sum_{i=2}^{L-2} \tilde{x}^2(n-i), \quad r_{0,0} = \tilde{x}^2(n-L+1) + temp_1, \quad r_{1,1} = temp_1 + \tilde{x}^2(n-1)$$
$$temp_2 = \sum_{i=2}^{L-2} \tilde{x}(n-i+1)\,\tilde{x}(n-i), \quad r_{0,1} = r_{1,0} = \tilde{x}(n-L+1)\,\tilde{x}(n-L+2) + temp_2$$
$$r_2 = temp_2 + \tilde{x}(n-1)\,\tilde{x}(n), \quad r_1 = \sum_{i=2}^{L-1} \tilde{x}(n-i+2)\,\tilde{x}(n-i)$$
With these covariances, the two linear predictor coefficients can be calculated as follows:
$$a_1 = \frac{r_{1,1}\,r_1 - r_{0,1}\,r_2}{r_{0,0}\,r_{1,1} - r_{0,1}^2}, \quad a_2 = \frac{r_{0,0}\,r_2 - r_{0,1}\,r_1}{r_{0,0}\,r_{1,1} - r_{0,1}^2}$$
It will be appreciated that the linear prediction coefficients are derived from a predetermined or fixed, relatively small, number of previous spectral values.
Calculation of the coefficients is not dependent upon every previously received spectral value.
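The covariance sums and the two closed-form quotients can be checked numerically with a short Python sketch. It is not the patent's implementation: the list `recon` is assumed to hold reconstructed values with its last element playing the role of the newest value, the function names and the zero fallback for a vanishing determinant are hypothetical, and the version with shared partial sums follows the window limits of the direct sums. The quotients are named after the regressor they weight when the 2x2 normal equations for these sums are solved, rather than after a_1 and a_2; how a given implementation maps them onto the predictor sum is treated here as an open detail.

```python
def covariances(recon, length):
    """Direct sums over i = 2 .. length-1; recon[-1] is the newest value."""
    n = len(recon) - 1
    idx = range(2, length)
    r00 = sum(recon[n - i] ** 2 for i in idx)
    r11 = sum(recon[n - i + 1] ** 2 for i in idx)
    r01 = sum(recon[n - i + 1] * recon[n - i] for i in idx)
    r1 = sum(recon[n - i + 2] * recon[n - i] for i in idx)
    r2 = sum(recon[n - i + 2] * recon[n - i + 1] for i in idx)
    return r00, r11, r01, r1, r2

def covariances_shared(recon, length):
    """Same quantities, reusing two partial sums over i = 2 .. length-2."""
    n = len(recon) - 1
    idx = range(2, length - 1)
    temp1 = sum(recon[n - i] ** 2 for i in idx)
    temp2 = sum(recon[n - i + 1] * recon[n - i] for i in idx)
    r00 = recon[n - length + 1] ** 2 + temp1
    r11 = temp1 + recon[n - 1] ** 2
    r01 = recon[n - length + 1] * recon[n - length + 2] + temp2
    r2 = temp2 + recon[n - 1] * recon[n]
    r1 = sum(recon[n - i + 2] * recon[n - i] for i in range(2, length))
    return r00, r11, r01, r1, r2

def quotients(r00, r11, r01, r1, r2):
    """Return (weight on older regressor, weight on newer regressor)."""
    det = r00 * r11 - r01 * r01
    if det == 0.0:                       # degenerate window: assumed fallback
        return 0.0, 0.0
    q_old = (r11 * r1 - r01 * r2) / det
    q_new = (r00 * r2 - r01 * r1) / det
    return q_old, q_new

# Exact second-order recursion y(m) = 1.2*y(m-1) - 0.8*y(m-2), used as a check.
y = [1.0, 0.8]
for _ in range(8):
    y.append(1.2 * y[-1] - 0.8 * y[-2])

q_old, q_new = quotients(*covariances(y, 6))
```

On this test sequence the first quotient comes out as the weight on the older sample (about -0.8) and the second as the weight on the newer sample (about 1.2), and both covariance routines agree. With a block length of six, each update touches only four products per sum, which is where the low cost at order two comes from.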
In order to enhance the robustness of the backward adaptive prediction against channel errors and numerical round-off errors, bandwidth expansion can be performed after the linear prediction coefficients are obtained. Let the linear prediction coefficients calculated by the above equations be $a_i$, $i = 0, 1, 2$, where $a_0 = 1$. The bandwidth expansion operation replaces each $a_i$ by $\gamma^i a_i$, where $\gamma$ is a constant slightly less than unity.
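Bandwidth expansion amounts to a per-coefficient scaling, as the following minimal Python sketch shows. The function name and the default value of the constant are assumptions; the text only states that the constant is slightly less than unity.

```python
def expand_bandwidth(coeffs, gamma=0.98):
    """Scale coefficient a_i by gamma**i for i = 1..P.

    coeffs holds [a_1, ..., a_P]; a_0 = 1 is unaffected (gamma**0 = 1)
    and is therefore not stored. The default gamma is illustrative only.
    """
    return [a * gamma ** i for i, a in enumerate(coeffs, start=1)]

expanded = expand_bandwidth([1.2, -0.8])
```

Higher-order coefficients are attenuated more strongly, which pulls the predictor slightly towards a flatter response.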
As can be seen from the previous section, the covariance functions are updated sample by sample. Correspondingly, the linear prediction coefficients can also be obtained sample by sample by solving the normal equation. However, in order to save computation, the linear prediction coefficients can be calculated less frequently. For example, the linear prediction coefficients may be calculated once every two samples. The loss of the average prediction gain is negligible. However, the loss of the prediction gain is clearly noticeable upon occurrence of a transient in the audio signal to be coded. A transient detector 10 is therefore included which switches the predictor from a normal low coefficient update rate (e.g. every second spectral value) to a high update rate (e.g. every spectral value) when a transient is detected. The high update rate may be maintained for a short period after detection of the transient.
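The switching between update rates can be sketched as a small scheduling function in Python. The transient detector itself is not modelled: its output is assumed to be available as one boolean per spectral value, and the function name, the slow period of two and the hold length are all illustrative assumptions.

```python
def update_schedule(transient_flags, slow_period=2, hold=4):
    """Decide, per spectral value, whether coefficients are recalculated.

    Normally an update happens every slow_period values. When a transient
    is flagged, updates happen on every value for `hold` values (including
    the flagged one), after which the slow rate resumes.
    """
    decisions = []
    fast_left = 0
    since_update = slow_period          # forces an update on the first value
    for is_transient in transient_flags:
        if is_transient:
            fast_left = hold
        if fast_left > 0:
            do_update = True
            fast_left -= 1
        else:
            do_update = since_update >= slow_period
        since_update = 1 if do_update else since_update + 1
        decisions.append(do_update)
    return decisions

flags = [False] * 4 + [True] + [False] * 7
schedule = update_schedule(flags, slow_period=2, hold=3)
```

In this run the coefficients are refreshed on every second value until the transient at index 4, then on three consecutive values, then again on every second value.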
Assume that $G_l$ denotes the prediction gain in scalefactor band $l$. If $G_l > 0$, the predictor in this subband can be switched on depending on the overall prediction gain, which is calculated as follows:
$$G = \sum_{l=1,\; G_l > 0}^{N_s} G_l$$
where $N_s$ is the number of scalefactor bands. If G compensates the additional bit need for the predictor side information, i.e. $G > T_1$ (dB), or the prediction gain does not drop dramatically, i.e. $G_{\mathrm{present}} - G_{\mathrm{previous}} < T_2$ (dB), the complete side information is transmitted and the predictors which produce positive gains are switched on; otherwise, the predictors are not used, which also indicates that a transient has occurred. After the transient frames are detected, the backward adaptive prediction coefficients are calculated sample by sample. After a certain number of samples, the prediction coefficients are calculated every second sample.
Figure 2 illustrates apparatus for decoding a signal encoded using the method described in detail above. The received multiplexed error signal 9 is provided at the input of a demultiplexer 6 which separates the signal into 1024 spectral value streams ej(n). These streams are then passed to a signal processing unit 7. For each stream, this unit 7 calculates for each error value a predicted or estimated spectral value. A predetermined number of these predicted values are in turn used to calculate linear prediction coefficients to allow the calculation of a predicted value for a current sample. This process is identical to that described for the coding process. A reconstructed spectral value is obtained by combining the received error signal with the corresponding predicted value. The streams of reconstructed spectral values are provided to a further processing unit 8 which carries out an inverse MDCT on the data to substantially regenerate the original audio signal.
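That the decoder can track the encoder without receiving any coefficients can be illustrated with a round trip over one stream. The Python sketch below is written under several assumptions: order two, a block length of six, a uniform quantizer transmitting integer indices, zero prediction until enough history exists, and a relative guard against a near-singular covariance matrix. All names are hypothetical, and the least-squares weights are applied to the newest and second-newest reconstructed values as obtained from solving the normal equations.

```python
import math

def ls_weights(recon, length):
    """Order-2 least-squares weights (newer, older) from recent history."""
    n = len(recon) - 1
    r00 = r11 = r01 = r1 = r2 = 0.0
    for i in range(2, length):
        old, new, target = recon[n - i], recon[n - i + 1], recon[n - i + 2]
        r00 += old * old
        r11 += new * new
        r01 += new * old
        r1 += target * old
        r2 += target * new
    det = r00 * r11 - r01 * r01
    if det <= 1e-9 * r00 * r11:          # near-singular: assumed fallback
        return 0.0, 0.0
    return (r00 * r2 - r01 * r1) / det, (r11 * r1 - r01 * r2) / det

def predict(recon, length):
    """Predict the next value from reconstructed history only."""
    if len(recon) < length:
        return 0.0                       # not enough history yet (assumption)
    w_new, w_old = ls_weights(recon, length)
    return w_new * recon[-1] + w_old * recon[-2]

def encode(x, step, length=6):
    recon, indices = [], []
    for sample in x:
        p = predict(recon, length)
        q = round((sample - p) / step)   # quantizer index sent to the decoder
        indices.append(q)
        recon.append(p + q * step)
    return indices, recon

def decode(indices, step, length=6):
    recon = []
    for q in indices:
        recon.append(predict(recon, length) + q * step)
    return recon

x = [math.sin(0.2 * t) for t in range(60)]
indices, encoder_recon = encode(x, step=0.05)
decoder_recon = decode(indices, step=0.05)
```

Both sides run the same `predict` on the same reconstructed history, so the decoder output matches the encoder's internal reconstruction. Because the weights depend only on the last few reconstructed values, nothing needs to be kept up to date while a predictor is switched off.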
Figure 3 shows a mobile telephone 11 incorporating in its transmitter, apparatus 12 (corresponding to the apparatus of Figure 1) for coding a radio telephone signal using the coding method described above. The telephone also incorporates in its receiver, apparatus 13 (corresponding to the apparatus of Figure 2) for decoding a received encoded telephone signal.
The present invention includes any novel feature or combination of features disclosed herein either explicitly or any generalisation thereof irrespective of whether or not it relates to the claimed invention or mitigates any or all of the problems addressed.
In view of the foregoing description it will be evident to a person skilled in the art that various modifications may be made within the scope of the invention.

Claims (17)

Claims
1. A method of coding an audio electrical signal using backward adaptive prediction, the method comprising the steps of:
(a) receiving a first time frame of an audio electrical signal to be coded; (b) transforming the time frame into the frequency domain to generate a frequency spectrum having 512 or more spectral components; (c) receiving subsequent time frames of said audio electrical signal and repeating step (b) for these frames in sequence to generate a stream of spectral data values for each spectral component; (e) for each said stream, calculating a set of prediction coefficients for each spectral value using the covariances of a predetermined number of previously determined reconstructed spectral values of the stream, using said set of prediction coefficients to generate a predicted spectral value, and calculating the error between the predicted spectral value and the corresponding actual spectral value, wherein the calculated errors provide a coded representation of the spectral value stream and said errors can be recombined with predicted spectral values to obtain reconstructed spectral values.
2. A method according to claim 1, wherein the prediction order is two.
3. A method according to claim 1 or 2 and comprising recalculating the prediction coefficients only after receipt of multiple spectral values and using the same coefficients for several consecutive spectral values.
4. A method according to claim 3, wherein said multiple is two.
5. A method according to claim 3 or 4 and comprising switching between a low coefficient update rate and a high update rate immediately upon detection of a transient in the audio signal to be coded.
6. A method according to any one of the preceding claims, wherein said predetermined number of spectral values is four or more.
7. A method according to any one of the preceding claims, wherein said predetermined number of spectral values is ten or less.
8. A method according to any one of the preceding claims, wherein a least squares method is used for evaluating the prediction coefficients.
9. A method according to claim 8 when appended to claim 2, wherein said covariances are determined as:
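$$r_{0,0} = \sum_{i=2}^{L-1} \tilde{x}^2(n-i), \quad r_{1,1} = \sum_{i=2}^{L-1} \tilde{x}^2(n-i+1), \quad r_{0,1} = r_{1,0} = \sum_{i=2}^{L-1} \tilde{x}(n-i+1)\,\tilde{x}(n-i)$$
$$r_1 = \sum_{i=2}^{L-1} \tilde{x}(n-i+2)\,\tilde{x}(n-i), \quad r_2 = \sum_{i=2}^{L-1} \tilde{x}(n-i+2)\,\tilde{x}(n-i+1)$$
$$temp_1 = \sum_{i=2}^{L-2} \tilde{x}^2(n-i), \quad r_{0,0} = \tilde{x}^2(n-L+1) + temp_1, \quad r_{1,1} = temp_1 + \tilde{x}^2(n-1)$$
$$temp_2 = \sum_{i=2}^{L-2} \tilde{x}(n-i+1)\,\tilde{x}(n-i), \quad r_{0,1} = r_{1,0} = \tilde{x}(n-L+1)\,\tilde{x}(n-L+2) + temp_2$$
$$r_2 = temp_2 + \tilde{x}(n-1)\,\tilde{x}(n), \quad r_1 = \sum_{i=2}^{L-1} \tilde{x}(n-i+2)\,\tilde{x}(n-i)$$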
10. A method according to claim 9, wherein the prediction coefficients are determined according to:
$$a_1 = \frac{r_{1,1}\,r_1 - r_{0,1}\,r_2}{r_{0,0}\,r_{1,1} - r_{0,1}^2}, \quad a_2 = \frac{r_{0,0}\,r_2 - r_{0,1}\,r_1}{r_{0,0}\,r_{1,1} - r_{0,1}^2}$$
11. A method of decoding an audio electrical signal encoded, the decoding method comprising the steps of:
receiving as an input signal a sequence of error values corresponding to the coded audio signal and separating these values into spectral component streams; for each stream, determining a corresponding predicted spectral component value for each error value using a set of prediction coefficients, the prediction coefficients being calculated using covariances of a predetermined number of previously determined consecutive predicted spectral component values for that stream, and combining the error value and the predicted spectral value to provide a reconstructed spectral value; and substantially reconstructing said audio signal by combining and frequency-to-time transforming the reconstructed spectral values of all of the streams.
12. Apparatus for coding an audio electrical signal using backward adaptive prediction, the apparatus comprising: an input for receiving an audio electrical signal to be coded; a time-to-frequency domain transformer for transforming sequentially received time frames of the received signal from the time domain to the frequency domain to provide frequency spectra having 512 or more spectral components; signal processing means associated with each spectral component for receiving as a stream the associated spectral values, for calculating for each spectral value a set of prediction coefficients using covariances of a predetermined number of previously reconstructed spectral values, for using said set of prediction coefficients to generate a predicted spectral value, and for calculating the error between the predicted value and the corresponding actual spectral value, the calculated errors providing a coded representation of the received spectral value stream and wherein said errors can be recombined with predicted spectral values to obtain reconstructed spectral values.
13. Apparatus for decoding an audio electrical signal encoded, the apparatus comprising: an input for receiving a sequence of error values corresponding to the coded audio signal; and signal processing means for separating said sequence of values into separate spectral component streams and for determining for each error value a corresponding predicted spectral value using a set of prediction coefficients, the signal processing means being arranged to calculate the prediction coefficients using covariances of a predetermined number of previously determined consecutive reconstructed spectral values, the signal processing means being further arranged to combine each error value with the corresponding predicted spectral value to provide a reconstructed spectral value and to substantially reconstruct said audio signal by combining and frequency-to-time transforming the reconstructed spectral values of all of the sub-bands.
14. A communications system comprising in combination the apparatus of claims 12 and 13.
15. A mobile communication device comprising in combination the apparatus of claims 12 and 13.
16. A method substantially as hereinbefore described with reference to Figures 1 to 3 of the accompanying drawings.
17. Apparatus substantially as hereinbefore described with reference to Figures 1 to 3 of the accompanying drawings.
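The coding and decoding loops recited in claims 12 and 13 can be illustrated for a single spectral component stream with a minimal sketch. The predictor order (2), the window of 8 previously reconstructed values, the uniform quantiser step and all function names are assumptions made for illustration only; the claims specify none of them. The point shown is that both sides derive the prediction coefficients from previously reconstructed values alone, so only the errors need to be transmitted.

```python
ORDER = 2    # assumed predictor order (not stated in the claims)
WINDOW = 8   # assumed "predetermined number" of past reconstructed values


def covariance_coeffs(past):
    """Solve the 2x2 covariance-method normal equations over `past`."""
    r11 = r12 = r22 = c1 = c2 = 0.0
    for t in range(ORDER, len(past)):
        x, x1, x2 = past[t], past[t - 1], past[t - 2]
        r11 += x1 * x1
        r12 += x1 * x2
        r22 += x2 * x2
        c1 += x * x1
        c2 += x * x2
    det = r11 * r22 - r12 * r12
    if det < 1e-9:  # too little history or ill-conditioned: predict zero
        return 0.0, 0.0
    return (c1 * r22 - c2 * r12) / det, (c2 * r11 - c1 * r12) / det


def predict(recon):
    """Predict the next spectral value from reconstructed history only."""
    if len(recon) < ORDER:
        return 0.0
    a1, a2 = covariance_coeffs(recon[-WINDOW:])
    return a1 * recon[-1] + a2 * recon[-2]


def encode(stream, step=0.5):
    """Return (quantised errors, encoder-side reconstruction)."""
    recon, errors = [], []
    for x in stream:
        p = predict(recon)
        e = step * round((x - p) / step)  # hypothetical uniform quantiser
        errors.append(e)
        recon.append(p + e)               # same value the decoder will form
    return errors, recon


def decode(errors):
    """Rebuild the spectral values from the error stream alone."""
    recon = []
    for e in errors:
        recon.append(predict(recon) + e)
    return recon
```

In a full coder one such loop would run for each of the spectral components produced by the time-to-frequency transform, and the decoder would pass the reconstructed values of all components through the inverse transform.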
GB9802611A 1997-02-07 1998-02-06 Audio coding method and apparatus Expired - Lifetime GB2322776B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
FI970553A FI970553A (en) 1997-02-07 1997-02-07 Audio coding method and device

Publications (3)

Publication Number Publication Date
GB9802611D0 GB9802611D0 (en) 1998-04-01
GB2322776A true GB2322776A (en) 1998-09-02
GB2322776B GB2322776B (en) 2002-03-13

Family

ID=8548146

Family Applications (1)

Application Number Title Priority Date Filing Date
GB9802611A Expired - Lifetime GB2322776B (en) 1997-02-07 1998-02-06 Audio coding method and apparatus

Country Status (9)

Country Link
JP (1) JPH10260699A (en)
CN (1) CN1202513C (en)
AU (1) AU5664898A (en)
DE (1) DE19804584A1 (en)
FI (1) FI970553A (en)
FR (1) FR2759510A1 (en)
GB (1) GB2322776B (en)
SE (1) SE9800338L (en)
WO (1) WO1998035447A2 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7610195B2 (en) 2006-06-01 2009-10-27 Nokia Corporation Decoding of predictively coded data using buffer adaptation
WO2010104011A1 (en) * 2009-03-10 2010-09-16 日本電信電話株式会社 Encoding method, decoding method, encoding device, decoding device, program, and recording medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5084904A (en) * 1988-11-10 1992-01-28 Pioneer Electronic Corporation Signal transmission device
GB2318029A (en) * 1996-10-01 1998-04-08 Nokia Mobile Phones Ltd Predictive coding of audio signals

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3343965B2 (en) * 1992-10-31 2002-11-11 ソニー株式会社 Voice encoding method and decoding method
EP0692881B1 (en) * 1993-11-09 2005-06-15 Sony Corporation Quantization apparatus, quantization method, high efficiency encoder, high efficiency encoding method, decoder, high efficiency encoder and recording media
US5684920A (en) * 1994-03-17 1997-11-04 Nippon Telegraph And Telephone Acoustic signal transform coding method and decoding method having a high efficiency envelope flattening method therein
DE19526366A1 (en) * 1995-07-20 1997-01-23 Bosch Gmbh Robert Redundancy reduction method for coding multichannel signals and device for decoding redundancy-reduced multichannel signals

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1808851A1 (en) * 2006-01-12 2007-07-18 STMicroelectronics Asia Pacific Pte Ltd. System and method for low power stereo perceptual audio coding using adaptive masking threshold
US8332216B2 (en) 2006-01-12 2012-12-11 Stmicroelectronics Asia Pacific Pte., Ltd. System and method for low power stereo perceptual audio coding using adaptive masking threshold
US10600430B2 (en) 2012-03-29 2020-03-24 Huawei Technologies Co., Ltd. Signal decoding method, audio signal decoder and non-transitory computer-readable medium

Also Published As

Publication number Publication date
FI970553A (en) 1998-08-08
WO1998035447A3 (en) 1998-11-19
SE9800338D0 (en) 1998-02-05
DE19804584A1 (en) 1998-08-13
GB2322776B (en) 2002-03-13
CN1202513C (en) 2005-05-18
JPH10260699A (en) 1998-09-29
FI970553A0 (en) 1997-02-07
GB9802611D0 (en) 1998-04-01
SE9800338L (en) 1998-08-08
AU5664898A (en) 1998-08-26
FR2759510A1 (en) 1998-08-14
CN1199959A (en) 1998-11-25
WO1998035447A2 (en) 1998-08-13

Similar Documents

Publication Publication Date Title
US6104996A (en) Audio coding with low-order adaptive prediction of transients
KR100469002B1 (en) Audio coding method and apparatus
US7613603B2 (en) Audio coding device with fast algorithm for determining quantization step sizes based on psycho-acoustic model
JP3263168B2 (en) Method and decoder for encoding audible sound signal
US5488665A (en) Multi-channel perceptual audio compression system with encoding mode switching among matrixed channels
US6064954A (en) Digital audio signal coding
KR970007661B1 (en) Method and apparatus for coding audio signals based on perceptual model
KR100814673B1 (en) audio coding
CA2199070C (en) Switched filterbank for use in audio signal coding
US5699484A (en) Method and apparatus for applying linear prediction to critical band subbands of split-band perceptual coding systems
EP0918407A2 (en) Scalable stereo audio encoding/decoding method and apparatus
CN101115051B (en) Audio signal processing method, system and audio signal transmitting/receiving device
EP2054882A2 (en) Arbitrary shaping of temporal noise envelope without side-information
JP3237089B2 (en) Acoustic signal encoding / decoding method
JPH11509388A (en) Redundancy reduction method at the time of signal encoding and signal decoding apparatus with reduced redundancy
JPH0683395A (en) Low-delay audio signal coder utilizing analysis technology by synthesis
EP1507256A1 (en) Acoustic signal encoding method and encoding device, acoustic signal decoding method and decoding device, program, and recording medium image display device
KR101033256B1 (en) Scale factor based bit shifting in fine granularity scalability audio coding
KR100848370B1 (en) Audio Encoding
US8665914B2 (en) Signal analysis/control system and method, signal control apparatus and method, and program
US6012025A (en) Audio coding method and apparatus using backward adaptive prediction
GB2322776A (en) Backward adaptive prediction of audio signals
KR100238324B1 (en) Audio signal error concealment method and circuit therefor
JP4721355B2 (en) Coding rule conversion method and apparatus for coded data
KR100370412B1 (en) Audio decoding method for controlling complexity and audio decoder using the same

Legal Events

Date Code Title Description
732E Amendments to the register in respect of changes of name or changes affecting rights (sect. 32/1977)
732E Amendments to the register in respect of changes of name or changes affecting rights (sect. 32/1977)

Free format text: REGISTERED BETWEEN 20150910 AND 20150916

PE20 Patent expired after termination of 20 years

Expiry date: 20180205