WO2005059899A1 - Fidelity-optimised variable frame length encoding - Google Patents

Fidelity-optimised variable frame length encoding Download PDF

Info

Publication number
WO2005059899A1
WO2005059899A1 PCT/SE2004/001867 SE2004001867W WO2005059899A1 WO 2005059899 A1 WO2005059899 A1 WO 2005059899A1 SE 2004001867 W SE2004001867 W SE 2004001867W WO 2005059899 A1 WO2005059899 A1 WO 2005059899A1
Authority
WO
WIPO (PCT)
Prior art keywords
encoding
signal
sub
frames
frame
Prior art date
Application number
PCT/SE2004/001867
Other languages
English (en)
French (fr)
Inventor
Stefan Bruhn
Ingemar Johansson
Anisse Taleb
Daniel ENSTRÖM
Original Assignee
Telefonaktiebolaget Lm Ericsson (Publ)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from SE0303501A external-priority patent/SE0303501D0/xx
Priority to EP04820553A priority Critical patent/EP1623411B1/en
Priority to MXPA05012230A priority patent/MXPA05012230A/es
Priority to BRPI0410856A priority patent/BRPI0410856B8/pt
Priority to BRPI0419281-8A priority patent/BRPI0419281B1/pt
Priority to JP2006518596A priority patent/JP4335917B2/ja
Priority to PL04820553T priority patent/PL1623411T3/pl
Priority to DE602004008613T priority patent/DE602004008613T2/de
Application filed by Telefonaktiebolaget Lm Ericsson (Publ) filed Critical Telefonaktiebolaget Lm Ericsson (Publ)
Priority to AU2004298708A priority patent/AU2004298708B2/en
Priority to CNB2004800186630A priority patent/CN100559465C/zh
Priority to CA2527971A priority patent/CA2527971C/en
Priority to ZA200508980A priority patent/ZA200508980B/xx
Priority to CN200710138487XA priority patent/CN101118747B/zh
Publication of WO2005059899A1 publication Critical patent/WO2005059899A1/en
Priority to HK06112026.7A priority patent/HK1091585A1/xx

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes

Definitions

  • the present invention relates in general to encoding of audio signals, and in particular to encoding of irrulti-channel audio signals.
  • stereophonic or multi-channel coding of audio signals is to encode the signals of the different channels separately as individual and independent signals.
  • Another basic way used in stereo FM radio transmission and which ensures compatibility with legacy mono radio receivers is to transmit a sum and a difference signal of the two involved channels.
  • M/S stereo coding is similar to the described procedure in stereo FM radio, in a sense that it encodes and transmits the sum and difference signals of the channel sub-bands and thereby exploits redundancy between the channel sub-bands.
  • the structure and operation of an encoder based on M/S stereo coding is described, e.g. in US patent 5,285,498 by J.D. Johnston.
  • Intensity stereo on the other hand is able to make use of stereo irrelevancy. It transmits the joint intensity of the channels (of the different sub-bands) along with some location information indicating how the intensity is distributed among the channels. Intensity stereo does only provide spectral magnitude information of the channels. Phase information is not conveyed. For this reason and since the temporal inter-channel information (more specifically the inter-channel time difference) is of major psycho-acoustical relevancy particularly at lower frequencies, intensity stereo can only be used at high frequencies above e.g. 2 kHz.
  • An intensity stereo coding method is described, e.g. in the European patent 0497413 by R. Neldhuis et al.
  • a recently developed stereo coding method is described, e.g. in a conference paper with the title "Binaural cue coding applied to stereo and multi-channel audio compression", 112th AES convention, May 2002, Kunststoff, Germany by C. Faller et al.
  • This method is a parametric multi-channel audio coding method.
  • the basic principle is that at the encoding side, the input signals from ⁇ channels ci, C2, ... C ⁇ are combined to one mono signal m.
  • the mono signal is audio encoded using any conventional monophonic audio codec.
  • parameters are derived from the channel signals, which describe the multi-channel image.
  • the parameters are encoded and transmitted to the decoder, along with the audio bit stream.
  • the decoder first decodes the mono signal n ⁇ and then regenerates the channel signals ci', C2 1 ,..., CN', based on the parametric description of the multi-channel image.
  • the principle of the Binaural Cue Coding (BCC) method is that it transmits the encoded mono signal and so-called BCC parameters.
  • the BCC parameters comprise coded inter-channel level differences and inter-channel time differences for sub-bands of the original multi-channel input signal.
  • the decoder regenerates the different channel signals by applying sub-band- wise level and phase adjustments of the mono signal based on the BCC parameters.
  • M/S or intensity stereo is that stereo information comprising temporal inter-channel information is transmitted at much lower bit rates.
  • this technique requires computational demanding time-frequency transforms on each of the channels, both at the encoder and the decoder.
  • BCC does not handle the fact that a lot of the stereo information, especially at low frequencies, is diffuse, i.e. it does not come from any specific direction. Diffuse sound fields exist in both channels of a stereo recording but they are to a great extent out of phase with respect to each other. If an algorithm such as BCC is subject to recordings with a great amount of diffuse sound fields the reproduced stereo image will become confused, jumping from left to right as the BCC algorithm can only pan the signal in specific frequency bands to the left or right.
  • a possible means to encode the stereo signal and ensure good reproduction of diffuse sound fields is to use an encoding scheme very similar to the technique used in FM stereo radio broadcast, namely to encode the mono (Left+Right) and the difference (Left-Right) signals separately.
  • a technique, described in US patent 5,434,948 by C.E. Holt et al. uses a similar technique as in BCC for encoding the mono signal and side information.
  • side information consists of predictor filters and optionally a residual signal.
  • the predictor filters estimated by a least-mean- square algorithm, when applied to the mono signal allow the prediction of the multi-channel audio signals. With this technique one is able to reach very low bit rate encoding of multi-channel audio sources, however, at the expense of a quality drop, discussed further below.
  • This technique synthesises the right and left channel signals by filtering sound source signals with so-called head-related filters.
  • this technique requires the different sound source signals to be separated and can thus not generally be applied for stereo or multi-channel coding.
  • a further problem with schemes based on encoding of a main and one or several side signals is that they often require relatively large computational resources.
  • handling discontinuities in parameters from one frame to another is a complex task.
  • estimation errors of transient sound may cause very large side signals, in turn increasing the transmission rate demand.
  • An object of the present invention is therefore to provide an encoding method and device improving the perception quality of multi-channel audio signals, in particular to avoid artefacts such as pre-echoing, ghost-like sounds or frame discontinuity artefacts.
  • a further object of the present invention is to provide an encoding method and device requiring less processing power and having more constant transmission bit rate requirements.
  • polyphonic signals are used to create a main signal, typically a mono signal, and a side signal.
  • the main signal is encoded according to prior- art encoding principles.
  • a number of encoding schemes for the side signal are provided.
  • Each encoding scheme is characterised by a set of sub-frames of different lengths.
  • the total length of the sub-frames corresponds to the length of the encoding frame of the encoding scheme.
  • the sets of sub-frames comprise at least one sub-frame.
  • the encoding scheme to be used on the side signal is selected at least partly dependent on the present signal content of the polyphonic signals.
  • the selection takes place, either before the encoding, based on signal characteristics analysis.
  • the side signal is encoded by each of the encoding schemes, and based on measurements of the quality of the encoding, the best encoding scheme is selected.
  • a side residual signal is created as the difference between the side signal and the main signal scaled with a balance factor.
  • the balance factor is selected to minimise the side residual signal.
  • the optimised side residual signal and the balance factor are encoded and provided as parameters representing the side signal. At the decoder side, the balance factor, the side residual signal and the man signal are used to recover the side signal.
  • the encoding of the side signal comprises an energy contour scaling in order to avoid pre-echoing effects.
  • different encoding schemes may comprise different encoding procedures in the separate sub-frames.
  • the main advantage with the present invention is that the preservation of the perception of the audio signals is improved. Furthermore, the present invention still allows multi-channel signal transmission at very low bit rates.
  • FIG. 1 is a block scheme of a system for transmitting polyphonic signals
  • FIG. 2a is a block diagram of an encoder in a transmitter
  • FIG. 2b is a block diagram of a decoder in a receiver
  • FIG. 3a is a diagram illustrating encoding frames of different lengths
  • FIGS. 3b and 3c are block diagrams of embodiments of side signal encoder units according to the present invention
  • FIG. 4 is a block diagram of an embodiment of an encoder using balance factor encoding of side signal
  • FIG. 5 is a block diagram of an embodiment of an encoder for multi-signal systems
  • FIG. 1 is a block scheme of a system for transmitting polyphonic signals
  • FIG. 2a is a block diagram of an encoder in a transmitter
  • FIG. 2b is a block diagram of a decoder in a receiver
  • FIG. 3a is a diagram illustrating encoding frames of different lengths
  • FIGS. 3b and 3c are block diagrams of embodiments
  • FIG. 6 is a block diagram of an embodiment of a decoder suitable for decoding signals from the device of Fig. 5;
  • FIG. 7a and b are diagrams illustrating a pre-echo artefact;
  • FIG. 8 is a block diagram of an embodiment of a side signal encoder unit according to the present invention, employing different encoding principles in different sub-frames;
  • FIG. 9 illustrates the use of different encoding principles in different frequency sub-bands;
  • FIG. 10 is a flow diagram of the basic steps of an embodiment of an encoding method according to the present invention; and
  • FIG. 11 is a flow diagram of the basic steps of an embodiment of a decoding method according to the present invention.
  • FIG. 1 illustrates a typical system 1, in which the present invention advantageously can be utilised.
  • a transmitter 10 comprises an antenna 12 including associated hardware and software to be able to transmit radio signals 5 to a receiver 20.
  • the transmitter 10 comprises among other parts a multi-channel encoder 14, which transforms signals of a number of input channels 16 into output signals suitable for radio transmission. Examples of suitable multi-channel encoders 14 are described in detail further below.
  • the signals of the input channels 16 can be provided from e.g. an audio signal storage 18, such as a data file of digital representation of audio recordings, magnetic tape or vinyl disc recordings of audio etc.
  • the signals of the input channels 16 can also be provided in "live", e.g. from a set of microphones 19.
  • the audio signals are digitised, if not already in digital form, before entering the multi-channel encoder 14.
  • an antenna 22 with associated hardware and software handles the actual reception of radio signals 5 representing polyphonic audio signals.
  • typical functionalities such as e.g. error correction, are performed.
  • a decoder 24 decodes the received radio signals 5 and transforms the audio data carried thereby into signals of a number of output channels 26.
  • the output signals can be provided to e.g. loudspeakers 29 for immediate presentation, or can be stored in an audio signal storage 28 of any kind.
  • the system 1 can for instance be a phone conference system, a system for supplying audio services or other audio applications. In some systems, such as e.g. the phone conference system, the communication has to be of a duplex type, while e.g.
  • distribution of music from a service provider to a subscriber can be essentially of a one-way type.
  • the transmission of signals from the transmitter 10 to the receiver 20 can also be performed by any other means, e.g. by different kinds of electromagnetic waves, cables or fibres as well as combinations thereof.
  • Fig. 2a illustrates an embodiment of an encoder according to the present invention.
  • the polyphonic signal is a stereo signal comprising two channels a and b, received at input 16A and 16B, respectively.
  • the signals of channel a and b are provided to a pre-processing unit 32, where different signal conditioning procedures may be performed.
  • the (perhaps modified) signals from the output of the pre-processing unit 32 are summed in an addition unit 34.
  • This addition unit 34 also divides the sum by a factor of two.
  • the signal xmono produced in this way is a main signal of the stereo signals, since it basically comprises all data from both channels. In this embodiment the main signal thus represents a pure "mono" signal.
  • the main signal Xmono is provided to a main signal encoder unit 38, which encodes the main signal according to any suitable encoding principles. Such principles are available within prior- art and are thus not further discussed here.
  • the main signal encoder unit 38 gives an output signal pmono, being encoding parameters representing a main signal.
  • a difference (divided by a factor of two) of the channel signals is provided as a side signal Xside.
  • the side signal represents the difference between the two channels in the stereo signal.
  • the side signal xside is provided to a side signal encoding unit 30. Preferred embodiments of the side signal encoding unit 30 will be discussed further below.
  • the side signal xside is transferred into encoding parameters pside representing a side signal side. In certain embodiments, this encoding takes place utilising also information of the main signal Xmono.
  • the arrow 42 indicates such a provision, where the original uncoded main signal mono is utilised.
  • the main signal information that is used in the side signal encoding unit 30 can be deduced from the encoding parameters pmono representing the main signal, as indicated by the broken line 44.
  • the encoding parameters p ono representing the main signal mono is a first output signal
  • the encoding parameters pside representing the side signal Xside is a second output signal.
  • these two output signals pmono, pside, together representing the full stereo sound are multiplexed into one transmission signal 52 in a multiplexor unit 40.
  • the transmission of the first and second output signals pmono, pside may take place separately.
  • a decoder 24 In Fig. 2b, an embodiment of a decoder 24 according to the present invention is illustrated as a block scheme.
  • the received signal 54 comprising encoding parameters representing the main and side signal information are provided to a demultiplexor unit 56, which separates a first and second input signal, respectively.
  • the first input signal corresponding to encoding parameters pmono of a main signal, is provided to a main signal decoder unit 64.
  • the encoding parameters pmono representing the main signal are used to generate an decoded main signal x' mono, being as similar to the main signal xmono (Fig. 2a) of the encoder 14 (Fig. 2a) as possible.
  • the second input signal corresponding a side signal
  • the second input signal is provided to a side signal decoder unit 60.
  • the encoding parameters pside representing the side signal are used to recover a decoded side signal x' .
  • the decoding procedure utilises information about the main signal x' mono, as indicated by arrow 65.
  • the decoded main and side signals x' mono, x" s- de are provided to an addition unit 70, which provides an output signal that is a representation of the original signal of channel a.
  • a difference provided by a subtraction unit 68 provides an output signal that is a representation of the original signal of channel b.
  • These channel signals may be post-processed in a postprocessor unit 74 according to prior-art signal processing procedures.
  • the channel signals a and b are provided at the outputs 26A and 26B of the decoder.
  • encoding is typically performed in one frame at a time.
  • a frame comprises audio samples within a pre-defined time period.
  • a frame SF2 of time duration L is illustrated.
  • the audio samples within the unhatched portion are to be encoded together.
  • the preceding samples and the subsequent samples are encoded in other frames.
  • the division of the samples into frames will in any case introduce some discontinuities at the frame borders. Shifting sounds will give shifting encoding parameters, changing basically at each frame border. This will give rise to perceptible errors.
  • One way to compensate somewhat for this is to base the encoding, not only on the samples that are to be encoded, but also on samples in the absolute vicinity of the frame, as indicated by the hatched portions.
  • interpolation techniques are sometimes also utilised for reducing perception artefacts caused by frame borders.
  • all such procedures require large additional computational resources, and for certain specific encoding techniques, it might also be difficult to provide in with any resources.
  • the audio perception will be improved by using a frame length for encoding of the side signal that is dependent on the present signal content. Since the influence of different frame lengths on the audio perception will differ depending on the nature of the sound to be encoded, an improvement can be obtained by letting the nature of the signal itself affect the frame length that is used.
  • the encoding of the main signal is not the object of the present invention and is therefore not described in detail. However, the frame lengths used for the main signal may or may not be equal to the frame lengths used for the side signal.
  • FIG. 3b One embodiment of a side signal encoder unit 30 according to the present invention is illustrated in Fig. 3b, in which a closed loop decision is utilised.
  • a basic encoding frame of length L is used here.
  • a number of encoding schemes 81 characterised by a separate set 80 of sub-frames 90, are created.
  • Each set 80 of sub-frames 90 comprises one or more sub-frames 90 of equal or differing lengths.
  • the total length of the set 80 of sub-frames 90 is, however, always equal to the basic encoding frame length L.
  • the top encoding scheme is characterised by a set of sub-frames comprises only one sub-frame of length L.
  • the next set of sub- frames comprises two frames of length L/2.
  • the third set comprises two frames of length L/4 followed by a L/2 frame.
  • the signal xside provided to the side signal encoder unit 30 is encoded by all encoding schemes 81. In the top encoding scheme, the entire basic encoding frame is encoded in one piece. However, in the other encoding schemes, the signal xside is encoded in each sub-frame separately from each other.
  • the result from each encoding scheme is provided to a selector 85.
  • a fidelity measurement means 83 determines a fidelity measure for each of the encoded signals.
  • the fidelity measure is an objective quality value, preferably a signal-to-noise measure or a weighted signal-to-noise ratio.
  • the fidelity measures associated with each encoding scheme are compared and the result controls a switching means 87 to select the encoding parameters representing the side signal from the encoding scheme giving the best fidelity measure as the output signal pside from the side signal encoder unit 30.
  • the lengths of the sub-frames used are selected according to:
  • l sf are the lengths of the sub-frames
  • l f is the length of the encoding frame
  • n is an integer.
  • n is selected between 0 and 3.
  • any frame lengths will be possible to use as long as the total length of the set is kept constant.
  • Fig. 3c another embodiment of a side signal encoder unit 30 according to the present invention is illustrated.
  • the frame length decision is an open loop decision, based on the statistics of the signal.
  • the spectral characteristics of the side signal will be used as a base for deciding which encoding scheme that is going to be used.
  • different encoding schemes characterised by different sets of sub-frames are available.
  • the selector 85 is placed before the actual encoding.
  • the input side signal xside enters the selector 85 and a signal analysing unit 84.
  • the result of the analysis becomes the input of a switch 86, in which only one of the encoding schemes 81 are utilised.
  • the output from that encoding scheme will also be the output signal pside from the side signal encoder unit 30.
  • the advantage with an open loop decision is that only one actual encoding has to be performed.
  • the disadvantage is, however, that the analysis of the signal characteristics may be very complicated indeed and it may be difficult to predict possible behaviours in advance to be able to give an appropriate choice in the switch 86.
  • a lot of statistical analysis of sound has to be performed and included in the signal analysing unit 84. Any small change in the encoding schemes may turn upside down on the statistical behaviour.
  • variable frame length coding for the side signal is that one can select between a fine temporal resolution and coarse frequency resolution on one side and coarse temporal resolution and fine frequency resolution on the other.
  • the above embodiments will preserve the stereo image in the best possible manner.
  • the method presented in US 5,434,948, uses a filtered version of the mono (main) signal to resemble the side or difference signal.
  • the filter parameters are optimised and allowed to vary in time.
  • the filter parameters are then transmitted representing an encoding of the side signal.
  • a residual side signal is transmitted.
  • Such an approach would be possible to use as side signal encoding method within the scope of the present invention.
  • This approach has, however, some disadvantages.
  • the quantisation of the of the filter coefficients and any residual side signal often require relatively high bit rates for transmission, since the filter order has to be high to provide an accurate side signal estimate.
  • the estimation of the filter itself may be problematic, especially in cases of transient rich music.
  • Estimation errors will give a modified side signal that is sometimes larger in magnitude than the unmodified signal. This will lead to higher bit rate demands. Moreover, if a new set of filter coefficients are computed every N samples, the filter coefficients need to be interpolated to yield a smooth transition from one set of filter coefficients to another, as discussed above. Interpolation of filter coefficients is a complex task and errors in the interpolation will manifest itself in large side error signals leading to higher bit rates needed for the difference error signal encoder.
  • a means to avoid the need for interpolation is to update the filter coefficients on a sample-by-sample basis and rely on backwards-adaptive analysis. For this to work well it is needed that the bit rate of the residual encoder is fairly high. This is therefore not a good alternative for low bit rate stereo coding.
  • the encoding of the side signal is based on the idea to reduce the redundancy between the mono and side signal by using a simple balance factor instead of a complex bit rate consuming predictor filter.
  • the residual of this operation is then encoded.
  • the magnitude of such a residual is relatively small and does not call for very high bit rate need for transfer. This idea is very suitable indeed to combine with the variable frame set approach described earlier, since the computational complexity is low.
  • the use of a balance factor combined with the variable frame length approach removes the need for complex interpolation and the associated problems that interpolation may cause. Moreover, the use of a simple balance factor instead of a complex filter gives fewer problems with estimation as possible estimation errors for the balance factor has less impact. The preferred solution will be able to reproduce both panned signals and diffuse sound fields with good quality and with limited bit rate requirements and computational resources.
  • Fig. 4 illustrates a preferred embodiment of a stereo encoder according to the present invention.
  • This embodiment is very similar to the one shown in Fig. 2a, however, with the details of the side signal encoder unit 30 revealed.
  • the encoder, 14 of this embodiment does not have any pre-processing unit, and the input signals are provided directly to the addition and subtraction units 34, 36.
  • the mono signal mono is multiplied with a certain balance factor gsm in a multiplier 33.
  • a subtraction unit 35 the multiplied mono signal is subtracted from the side signal side, i.e. essentially the difference between the two channels, to produce a side residual signal.
  • the balance factor gsm is determined based on the content of the mono and side signals by the optimiser 37 in order to minimise the side residual signal according to a quality criterion.
  • the quality criterion is preferably a least mean square criterion.
  • the side residual signal is encoded in a side residual encoder 39 according to any encoder procedures.
  • the side residual encoder 39 is a low bit rate transform encoder or a CELP (Codebook Excited Linear Prediction) encoder.
  • the encoding parameters pside representing the side signal then comprises the encoding parameters pside residual representing the side residual signal and the optimised balance factor 49.
  • the mono signal 42 used for synthesising the side signals is the target signal Xmono for the mono encoder 38.
  • the local synthesis signal of the mono encoder 38 can also be utilised. In the latter case, the total encoder delay may be increased and the computational complexity for the side signal may increase. On the other hand, the quality may be better as it is then possible to repair coding errors made in the mono encoder.
  • the basic encoding scheme can be described as follows. Denote the two channel signals as a and b, which may be the left and right channel of a stereo pair. The channel signals are combined into a mono signal by addition and to a side signal by a subtraction. In equation form, the operations are described as:
  • a modified or residual side signal is computed according to:
  • f(xmono,xside) is a balance factor function that based on the block on N samples, i.e. a sub-frame, from the side and mono signals strive to remove as much as possible from the side signal.
  • the balance factor is used to minimise the residual side signal. In the special case where it is minimised in a mean square sense, this is equivalent to minimising the energy of the residual side signal xside residual.
  • xside is the side signal and xmono is the mono signal.
  • the function is based on a block starting at "frame start” and ending at "frame end”. It is possible to add weighting in the frequency domain to the computation of the balance factor. This is done by convoluting the xside and xmono signals with the impulse response of a weighting filter. It is then possible to move the estimation error to a frequency range where they are less easy to hear. This is referred to as perceptual weighting.
  • Q g ⁇ . is a quantization function that is applied to the balance factor given by the function f ⁇ x mom ,x side ) -
  • the balance factor is transmitted on the transmission channel. In normal left-right panned signals the balance factor is limited to the interval [- 1.0 l. ⁇ ] . If on the other hand the channels are out of phase with regards to one another, the balance factor may extend beyond these limits.
  • E s is the encoding function (e.g. a transform encoder) of the residual side signal and E m is the encoding function of the mono signal
  • E m is the encoding function of the mono signal
  • One important benefit from computing the balance factor for each frame is that one avoids the use of interpolation. Instead, normally, as described above, the frame processing is performed with overlapping frames.
  • the encoding principle using balance factors operates particularly well in the case of music signals, where fast changes typically are needed to track the stereo image. Lately, multi-channel coding has become popular.
  • One example is 5.1 channel surround sound in DVD movies.
  • the channels are there arranged as: front left, front centre, front right, rear left, rear right and subwoofer.
  • Fig. 5 an embodiment of an encoder that encodes the three front channels in such an arrangement exploiting interchannel redundancies according to the present invention is shown.
  • Three channel signals L, C, R are provided on three inputs 16A-C, and the mono signal xmono is created by a sum of all three signals.
  • a centre signal encoder unit 130 is added, which receives the centre signal xcentre.
  • the mono signal 42 is in this embodiment the encoded and decoded mono signal x' mono, and is multiplied with a certain balance factor go in a multiplier 133.
  • the multiplied mono signal is subtracted from the centre signal Xcentre, to produce a centre residual signal.
  • the balance factor gQ is determined based on the content of the mono and centre signals by an optimiser 137 in order to minimise the centre residual signal according to the quality criterion.
  • the centre residual signal is encoded in a centre residual encoder 139 according to any encoder procedures.
  • the centre residual encoder 139 is a low bit rate transform encoder or a CELP encoder.
  • the encoding parameters pcentre representing the centre signal then comprises the encoding parameters pcentre residual representing the centre residual signal and the optimised balance factor 149.
  • the centre residual signal and the scaled mono signal are added in an addition unit 235, creating a modified centre signal 142 being compensated for encoding errors.
  • the side signal Xside i.e. the difference between the left L and right R channels is provided to the side signal encoder unit 30 as in earlier embodiments.
  • the optimiser 37 also depends on the modified centre signal 142 provided by the centre signal encoder unit 130.
  • the side residual signal will therefore be created as an optimum linear combination of the mono signal 42, the modified centre signal 142 and the side signal in the subtraction unit 35.
  • the variable frame length concept described above can be applied on either of the side and centre signals, or on both.
  • Fig. 6 illustrates a decoder unit suitable for receiving encoded audio signals from the encoder unit of Fig. 5.
  • the received signal 54 is divided into encoding parameters pmono representing ' the main signal, encoding parameters pcentre representing the centre signal and encoding parameters pside representing the side signal.
  • the encoding parameters pmono representing the main signal are used to generate a main signal x' mono.
  • the encoding parameters pcentre representing the centre signal are used to generate a centre signal x" centre, based on main signal x' mono.
  • the encoding parameters pside representing the side signal are decoded, generating a side signal x"side, based on main signal x' mono and centre signal x" centre.
  • the a , ⁇ and ⁇ values can be either constant or dependent of the signal contents in order to emphasise one or two channels in order to achieve an optimal quality.
  • x centr ⁇ s the centre signal and x mon0 is the mono signal.
  • the mono signal comes from the mono target signal but it is possible to use the local synthesis of the mono encoder as well.
  • the centre residual signal to be encoded is: x centre residual V 1 ) X centre ⁇ n ) SQ X mono ⁇ n )
  • Q g ⁇ . is a quantization function that is applied to the balance factor.
  • the balance factor is transmitted on the transmission channel.
  • E c is the encoding function (e.g. a transform encoder) of the centre residual signal and E m is the encoding function of the mono signal then the decoded x c " entre signal in the decoder end can be described as:
  • the side residual signal to be encoded is:
  • g Qsm and g Qsc are quantized values of the parameters g sm and g sc that minimises the expression: frame end ⁇ (fa* M - *rig*/ (»)) - S OT 0 M ⁇ SscKentre (» )
  • the g sm and g sc parameters can be quantized jointly or separately.
  • FIG. 7a-b diagrams are illustrating such an artefact.
  • a signal component having the time development as shown by curve 100.
  • the signal component is not present in the audio sample.
  • the signal component suddenly appears.
  • the signal component is encoded, using a frame length of t2- tl, the occurrence of the signal component will be "smeared out” over the entire frame, as indicated in curve 101. If a decoding takes place of the curve 101, the signal component appears a time ⁇ t before the intended appearance of the signal component, and a "pre-echo" is perceived .
  • the pre-echoing artefacts become more accentuated if long encoding frames are used. By using shorter frames, the artefact is somewhat suppressed.
  • Another way to deal with the pre-echoing problems described above is to utilise the fact that the mono signal is available at both the encoder and decoder end. This makes it possible to scale the side signal according to the energy contour of the mono signal. In the decoder end, the inverse scaling is performed and thus some of the pre-echo problems may be alleviated.
  • An energy contour of the mono signal is computed over the frame as:
  • w(n) is a windowing function.
  • the simplest windowing function is a rectangular window, but other window types such as a hamming window may be more desirable.
  • the side residual signal is then scaled as:
  • the energy contour is computed on the decoded mono signal and is applied to the decoded side signal as: Z-side ( n ) ⁇ x s "i de ( n )f ⁇ E c ("))» frame start ⁇ n ⁇ frame end .
  • this energy contour scaling in some sense is alternative to the use of shorter frame lengths, this concept is particularly well suited to be combined with the variable frame length concept, described further above.
  • a more flexible set of encoding schemes may be provided.
  • the different encoding schemes 81 comprise hatched sub-frames 91, representing encoding applying the energy contour scaling, and un-hatched sub-frames 92, representing encoding procedures not applying the energy contour scaling.
  • the set of encoding schemes of Fig. 8 comprises schemes that handle e.g. pre-echoing artefacts in different ways. In some schemes, longer sub-frames with pre-echoing minimisation according to the energy contour principle are used. In other schemes, shorter sub-frames without energy contour scaling are utilised. Depending on the signal content, one of the alternatives may be more advantageous. For very severe pre-echoing cases, encoding schemes utilising short sub-frames with energy contour scaling may be necessary.
  • the proposed solution can be used in the full frequency band or in one or more distinct sub bands.
  • the use of sub-band can be applied either on both the main and side signals, or on one of them separately.
  • a preferred embodiment comprises a split of the side signal in several frequency bands. The reason is simply that it is easier to remove the possible redundancy in an isolated frequency band than in the entire frequency band. This is particularly important when encoding music signals with rich spectral content.
  • the pre-determined threshold can preferably be 2 kHz, or even more preferably 1 kHz.
  • the diffuse sound fields generally have little energy content at high frequencies.
  • the natural reason is that sound absorption typically increases with frequency.
  • the diffuse sound field components seem to play a less important role for the human auditory system at higher frequencies. Therefore, it is beneficial to employ this solution at low frequencies (below 1 or 2 kHz) and rely on other, even more bit efficient coding schemes at higher frequencies.
  • the fact that the scheme is only applied at low frequencies gives a large saving in bit rate as the necessary bit rate with the proposed method is proportional to the required bandwidth.
  • the mono encoder can encode the entire frequency band, while the proposed side signal encoding is suggested to be performed only in the lower part of the frequency band, as schematically illustrated by Fig. 9.
  • Reference number 301 refers to an encoding scheme according to the present invention of the side signal
  • reference number 302 refers to any other encoding scheme of the side signal
  • reference number 303 refers to an encoding scheme of the side signal.
  • Fig. 10 the main steps of an embodiment of an encoding method according to the present invention are illustrated as a flow diagram.
  • the procedure starts in step 200.
  • a main signal deduced from the polyphonic signals is encoded.
  • encoding schemes are provided, which comprise sub-frames with differing lengths and/ or order.
  • a side signal deduced in step 214 from the polyphonic signals is encoded by an encoding scheme selected dependent at least partly on the actual signal content of the present polyphonic signals.
  • the procedure ends in step 299.
  • Fig. 11 the main steps of an embodiment of a decoding method according to the present invention are illustrated as a flow diagram.
  • the procedure starts in step 200.
  • a received encoded main signal is decoded.
  • encoding schemes are provided, which comprise sub-frames with differing lengths and/ or order.
  • a received side signal is decoded in step 224 by a selected encoding scheme.
  • the decoded main and side signals are combined to a polyphonic signal.
  • the procedure ends in step 299.
PCT/SE2004/001867 2003-12-19 2004-12-15 Fidelity-optimised variable frame length encoding WO2005059899A1 (en)

Priority Applications (13)

Application Number Priority Date Filing Date Title
CN200710138487XA CN101118747B (zh) 2003-12-19 2004-12-15 保真度优化的预回声抑制编码
DE602004008613T DE602004008613T2 (de) 2003-12-19 2004-12-15 Treueoptimierte kodierung mit variabler rahmenlänge
BRPI0410856A BRPI0410856B8 (pt) 2003-12-19 2004-12-15 métodos de codificar e de decodificar sinais multicanais, aparelho codificador, e, aparelho decodificador
BRPI0419281-8A BRPI0419281B1 (pt) 2003-12-19 2004-12-15 Métodos de codificar e de decodificar sinais multicanais, e, aparelhos codificador e decodificador
JP2006518596A JP4335917B2 (ja) 2003-12-19 2004-12-15 忠実度最適化可変フレーム長符号化
PL04820553T PL1623411T3 (pl) 2003-12-19 2004-12-15 Zoptymalizowane pod względem wierności odtwarzania kodowanie ze zmienną długością ramki
AU2004298708A AU2004298708B2 (en) 2003-12-19 2004-12-15 Fidelity-optimised variable frame length encoding
EP04820553A EP1623411B1 (en) 2003-12-19 2004-12-15 Fidelity-optimised variable frame length encoding
MXPA05012230A MXPA05012230A (es) 2003-12-19 2004-12-15 Codificacion de longitud de cuadro variable optimizada en fidelidad.
CNB2004800186630A CN100559465C (zh) 2003-12-19 2004-12-15 保真度优化的可变帧长编码
CA2527971A CA2527971C (en) 2003-12-19 2004-12-15 Fidelity-optimised variable frame length encoding
ZA200508980A ZA200508980B (en) 2003-12-19 2004-12-15 Fidelity-optimised variable frame length encoding
HK06112026.7A HK1091585A1 (en) 2003-12-19 2006-11-01 Fidelity-optimised variable frame length encoding

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
SE0303501-1 2003-12-19
SE0303501A SE0303501D0 (sv) 2003-12-19 2003-12-19 Filter-based parametric multi-channel coding
SE0400417A SE527670C2 (sv) 2003-12-19 2004-02-20 Naturtrogenhetsoptimerad kodning med variabel ramlängd
SE0400417-2 2004-02-20

Publications (1)

Publication Number Publication Date
WO2005059899A1 true WO2005059899A1 (en) 2005-06-30

Family

ID=31996354

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SE2004/001867 WO2005059899A1 (en) 2003-12-19 2004-12-15 Fidelity-optimised variable frame length encoding

Country Status (15)

Country Link
EP (2) EP1845519B1 (ja)
JP (2) JP4335917B2 (ja)
CN (2) CN101118747B (ja)
AT (2) ATE443317T1 (ja)
AU (1) AU2004298708B2 (ja)
BR (2) BRPI0419281B1 (ja)
CA (2) CA2690885C (ja)
DE (2) DE602004023240D1 (ja)
HK (2) HK1091585A1 (ja)
MX (1) MXPA05012230A (ja)
PL (1) PL1623411T3 (ja)
RU (2) RU2305870C2 (ja)
SE (1) SE527670C2 (ja)
WO (1) WO2005059899A1 (ja)
ZA (1) ZA200508980B (ja)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006337508A (ja) * 2005-05-31 2006-12-14 Yamaha Corp データ圧縮方法およびデータ圧縮回路並びにデータ伸張回路
WO2006126857A3 (en) * 2005-05-26 2007-01-11 Lg Electronics Inc Method of encoding and decoding an audio signal
WO2007091927A1 (en) * 2006-02-06 2007-08-16 Telefonaktiebolaget Lm Ericsson (Publ) Variable frame offset coding
JP2009500685A (ja) * 2005-07-11 2009-01-08 エルジー エレクトロニクス インコーポレイティド オーディオ信号のエンコーディング及びデコーディング装置及び方法
JP2009522895A (ja) * 2006-01-09 2009-06-11 ノキア コーポレイション バイノーラルオーディオ信号の復号
US7646319B2 (en) 2005-10-05 2010-01-12 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7653533B2 (en) 2005-10-24 2010-01-26 Lg Electronics Inc. Removing time delays in signal paths
US7660358B2 (en) 2005-10-05 2010-02-09 Lg Electronics Inc. Signal processing using pilot based coding
US7663513B2 (en) 2005-10-05 2010-02-16 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7672379B2 (en) 2005-10-05 2010-03-02 Lg Electronics Inc. Audio signal processing, encoding, and decoding
US7696907B2 (en) 2005-10-05 2010-04-13 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7751485B2 (en) 2005-10-05 2010-07-06 Lg Electronics Inc. Signal processing using pilot based coding
US7752053B2 (en) 2006-01-13 2010-07-06 Lg Electronics Inc. Audio signal processing using pilot based coding
US7761303B2 (en) 2005-08-30 2010-07-20 Lg Electronics Inc. Slot position coding of TTT syntax of spatial audio coding application
US7788107B2 (en) 2005-08-30 2010-08-31 Lg Electronics Inc. Method for decoding an audio signal
US7987097B2 (en) 2005-08-30 2011-07-26 Lg Electronics Method for decoding an audio signal
US8073702B2 (en) 2005-06-30 2011-12-06 Lg Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
US8082157B2 (en) 2005-06-30 2011-12-20 Lg Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
US8185403B2 (en) 2005-06-30 2012-05-22 Lg Electronics Inc. Method and apparatus for encoding and decoding an audio signal
US8577483B2 (en) 2005-08-30 2013-11-05 Lg Electronics, Inc. Method for decoding an audio signal
WO2017049400A1 (en) 2015-09-25 2017-03-30 Voiceage Corporation Method and system for encoding left and right channels of a stereo sound signal selecting between two and four sub-frames models depending on the bit budget
US9883307B2 (en) 2011-07-05 2018-01-30 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for decomposing a stereo recording using frequency-domain processing employing a spectral weights generator
US11462223B2 (en) 2018-06-29 2022-10-04 Huawei Technologies Co., Ltd. Stereo signal encoding method and apparatus, and stereo signal decoding method and apparatus

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100981699B1 (ko) * 2002-07-12 2010-09-13 코닌클리케 필립스 일렉트로닉스 엔.브이. 오디오 코딩
US7461106B2 (en) 2006-09-12 2008-12-02 Motorola, Inc. Apparatus and method for low complexity combinatorial coding of signals
US8576096B2 (en) 2007-10-11 2013-11-05 Motorola Mobility Llc Apparatus and method for low complexity combinatorial coding of signals
US8209190B2 (en) 2007-10-25 2012-06-26 Motorola Mobility, Inc. Method and apparatus for generating an enhancement layer within an audio coding system
US7889103B2 (en) 2008-03-13 2011-02-15 Motorola Mobility, Inc. Method and apparatus for low complexity combinatorial coding of signals
US8639519B2 (en) 2008-04-09 2014-01-28 Motorola Mobility Llc Method and apparatus for selective signal coding based on core encoder performance
EP2124486A1 (de) * 2008-05-13 2009-11-25 Clemens Par Winkelabhängig operierende Vorrichtung oder Methodik zur Gewinnung eines pseudostereophonen Audiosignals
BR122020009732B1 (pt) 2008-05-23 2021-01-19 Koninklijke Philips N.V. Método para a geração de um sinal esquerdo e de um sinal direito a partir de um sinal de downmix mono com base em parâmetros espaciais, meio legível por computador não transitório, aparelho de downmix estéreo paramétrico para a geração de um sinal de downmix mono a partir de um sinal esquerdo e de um sinal direito com base em parâmetros espaciais e método para a geração de um sinal residual de previsão para um sinal de diferença a partir de um sinal esquerdo e de um sinal direito com base em parâmetros espaciais
WO2010016270A1 (ja) * 2008-08-08 2010-02-11 パナソニック株式会社 量子化装置、符号化装置、量子化方法及び符号化方法
RU2481650C2 (ru) * 2008-09-17 2013-05-10 Франс Телеком Ослабление опережающих эхо-сигналов в цифровом звуковом сигнале
JP5309944B2 (ja) 2008-12-11 2013-10-09 富士通株式会社 オーディオ復号装置、方法、及びプログラム
US8200496B2 (en) 2008-12-29 2012-06-12 Motorola Mobility, Inc. Audio signal decoder and method for producing a scaled reconstructed audio signal
US8175888B2 (en) 2008-12-29 2012-05-08 Motorola Mobility, Inc. Enhanced layered gain factor balancing within a multiple-channel audio coding system
US8219408B2 (en) 2008-12-29 2012-07-10 Motorola Mobility, Inc. Audio signal decoder and method for producing a scaled reconstructed audio signal
US8140342B2 (en) 2008-12-29 2012-03-20 Motorola Mobility, Inc. Selective scaling mask computation based on peak detection
WO2011013381A1 (ja) * 2009-07-31 2011-02-03 パナソニック株式会社 符号化装置および復号装置
WO2011048798A1 (ja) * 2009-10-20 2011-04-28 パナソニック株式会社 符号化装置、復号化装置およびこれらの方法
EP2346028A1 (en) * 2009-12-17 2011-07-20 Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. An apparatus and a method for converting a first parametric spatial audio signal into a second parametric spatial audio signal
CN102770913B (zh) * 2009-12-23 2015-10-07 诺基亚公司 稀疏音频
US8442837B2 (en) 2009-12-31 2013-05-14 Motorola Mobility Llc Embedded speech and audio coding using a switchable model core
US8428936B2 (en) 2010-03-05 2013-04-23 Motorola Mobility Llc Decoder for audio signal including generic audio and speech frames
US8423355B2 (en) 2010-03-05 2013-04-16 Motorola Mobility Llc Encoder for audio signal including generic audio and speech frames
US9129600B2 (en) 2012-09-26 2015-09-08 Google Technology Holdings LLC Method and apparatus for encoding an audio signal
KR102259112B1 (ko) * 2012-11-15 2021-05-31 가부시키가이샤 엔.티.티.도코모 음성 부호화 장치, 음성 부호화 방법, 음성 부호화 프로그램, 음성 복호 장치, 음성 복호 방법 및 음성 복호 프로그램
CN107742521B (zh) 2016-08-10 2021-08-13 华为技术有限公司 多声道信号的编码方法和编码器
CN109215668B (zh) 2017-06-30 2021-01-05 华为技术有限公司 一种声道间相位差参数的编码方法及装置
CN112233682A (zh) * 2019-06-29 2021-01-15 华为技术有限公司 一种立体声编码方法、立体声解码方法和装置

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0497413A1 (en) * 1991-02-01 1992-08-05 Koninklijke Philips Electronics N.V. Subband coding system and a transmitter comprising the coding system
US5285498A (en) * 1992-03-02 1994-02-08 At&T Bell Laboratories Method and apparatus for coding audio signals based on perceptual model
US5434948A (en) * 1989-06-15 1995-07-18 British Telecommunications Public Limited Company Polyphonic coding
US5694332A (en) * 1994-12-13 1997-12-02 Lsi Logic Corporation MPEG audio decoding system with subframe input buffering
US5956674A (en) * 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
US20030061055A1 (en) * 2001-05-08 2003-03-27 Rakesh Taori Audio coding

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5812971A (en) * 1996-03-22 1998-09-22 Lucent Technologies Inc. Enhanced joint stereo coding method using temporal envelope shaping
US5796842A (en) * 1996-06-07 1998-08-18 That Corporation BTSC encoder
US6463410B1 (en) * 1998-10-13 2002-10-08 Victor Company Of Japan, Ltd. Audio signal processing apparatus
US6226616B1 (en) * 1999-06-21 2001-05-01 Digital Theater Systems, Inc. Sound quality of established low bit-rate audio coding systems without loss of decoder compatibility
JP3335605B2 (ja) * 2000-03-13 2002-10-21 日本電信電話株式会社 ステレオ信号符号化方法
JP2003084790A (ja) * 2001-09-17 2003-03-19 Matsushita Electric Ind Co Ltd 台詞成分強調装置
CN1219415C (zh) * 2002-07-23 2005-09-14 华南理工大学 一种5.1通路环绕声的耳机重发的信号处理方法

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5434948A (en) * 1989-06-15 1995-07-18 British Telecommunications Public Limited Company Polyphonic coding
EP0497413A1 (en) * 1991-02-01 1992-08-05 Koninklijke Philips Electronics N.V. Subband coding system and a transmitter comprising the coding system
US5285498A (en) * 1992-03-02 1994-02-08 At&T Bell Laboratories Method and apparatus for coding audio signals based on perceptual model
US5694332A (en) * 1994-12-13 1997-12-02 Lsi Logic Corporation MPEG audio decoding system with subframe input buffering
US5956674A (en) * 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
US20030061055A1 (en) * 2001-05-08 2003-03-27 Rakesh Taori Audio coding

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
FALLER, C. ET AL.: "Binaural cue coding applied to stereo and multi-channel audio compression.", CONVENTION PAPER, 10 May 2002 (2002-05-10) - 13 May 2002 (2002-05-13), XP009024737 *

Cited By (80)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006126857A3 (en) * 2005-05-26 2007-01-11 Lg Electronics Inc Method of encoding and decoding an audio signal
WO2006126859A3 (en) * 2005-05-26 2007-01-11 Lg Electronics Inc Method of encoding and decoding an audio signal
US8090586B2 (en) 2005-05-26 2012-01-03 Lg Electronics Inc. Method and apparatus for embedding spatial information and reproducing embedded signal for an audio signal
US8150701B2 (en) 2005-05-26 2012-04-03 Lg Electronics Inc. Method and apparatus for embedding spatial information and reproducing embedded signal for an audio signal
US8170883B2 (en) 2005-05-26 2012-05-01 Lg Electronics Inc. Method and apparatus for embedding spatial information and reproducing embedded signal for an audio signal
US8214220B2 (en) 2005-05-26 2012-07-03 Lg Electronics Inc. Method and apparatus for embedding spatial information and reproducing embedded signal for an audio signal
JP2006337508A (ja) * 2005-05-31 2006-12-14 Yamaha Corp データ圧縮方法およびデータ圧縮回路並びにデータ伸張回路
JP4639966B2 (ja) * 2005-05-31 2011-02-23 ヤマハ株式会社 オーディオデータ圧縮方法およびオーディオデータ圧縮回路並びにオーディオデータ伸張回路
US8214221B2 (en) 2005-06-30 2012-07-03 Lg Electronics Inc. Method and apparatus for decoding an audio signal and identifying information included in the audio signal
US8073702B2 (en) 2005-06-30 2011-12-06 Lg Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
US8082157B2 (en) 2005-06-30 2011-12-20 Lg Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
US8185403B2 (en) 2005-06-30 2012-05-22 Lg Electronics Inc. Method and apparatus for encoding and decoding an audio signal
US8510119B2 (en) 2005-07-11 2013-08-13 Lg Electronics Inc. Apparatus and method of processing an audio signal, utilizing unique offsets associated with coded-coefficients
US8275476B2 (en) 2005-07-11 2012-09-25 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signals
US8510120B2 (en) 2005-07-11 2013-08-13 Lg Electronics Inc. Apparatus and method of processing an audio signal, utilizing unique offsets associated with coded-coefficients
US8180631B2 (en) 2005-07-11 2012-05-15 Lg Electronics Inc. Apparatus and method of processing an audio signal, utilizing a unique offset associated with each coded-coefficient
US8554568B2 (en) 2005-07-11 2013-10-08 Lg Electronics Inc. Apparatus and method of processing an audio signal, utilizing unique offsets associated with each coded-coefficients
JP2009500681A (ja) * 2005-07-11 2009-01-08 エルジー エレクトロニクス インコーポレイティド オーディオ信号のエンコーディング及びデコーディング装置及び方法
JP2009500689A (ja) * 2005-07-11 2009-01-08 エルジー エレクトロニクス インコーポレイティド オーディオ信号の処理装置及び方法
JP2009500684A (ja) * 2005-07-11 2009-01-08 エルジー エレクトロニクス インコーポレイティド オーディオ信号を処理する方法、オーディオ信号のエンコーディング及びデコーディング装置及び方法
JP2009500685A (ja) * 2005-07-11 2009-01-08 エルジー エレクトロニクス インコーポレイティド オーディオ信号のエンコーディング及びデコーディング装置及び方法
US8577483B2 (en) 2005-08-30 2013-11-05 Lg Electronics, Inc. Method for decoding an audio signal
US7831435B2 (en) 2005-08-30 2010-11-09 Lg Electronics Inc. Slot position coding of OTT syntax of spatial audio coding application
US8103514B2 (en) 2005-08-30 2012-01-24 Lg Electronics Inc. Slot position coding of OTT syntax of spatial audio coding application
US8165889B2 (en) 2005-08-30 2012-04-24 Lg Electronics Inc. Slot position coding of TTT syntax of spatial audio coding application
US8082158B2 (en) 2005-08-30 2011-12-20 Lg Electronics Inc. Time slot position coding of multiple frame types
US8060374B2 (en) 2005-08-30 2011-11-15 Lg Electronics Inc. Slot position coding of residual signals of spatial audio coding application
US7761303B2 (en) 2005-08-30 2010-07-20 Lg Electronics Inc. Slot position coding of TTT syntax of spatial audio coding application
US7765104B2 (en) 2005-08-30 2010-07-27 Lg Electronics Inc. Slot position coding of residual signals of spatial audio coding application
US7987097B2 (en) 2005-08-30 2011-07-26 Lg Electronics Method for decoding an audio signal
US7783494B2 (en) 2005-08-30 2010-08-24 Lg Electronics Inc. Time slot position coding
US7788107B2 (en) 2005-08-30 2010-08-31 Lg Electronics Inc. Method for decoding an audio signal
US7792668B2 (en) 2005-08-30 2010-09-07 Lg Electronics Inc. Slot position coding for non-guided spatial audio coding
US7822616B2 (en) 2005-08-30 2010-10-26 Lg Electronics Inc. Time slot position coding of multiple frame types
US7680194B2 (en) 2005-10-05 2010-03-16 Lg Electronics Inc. Method and apparatus for signal processing, encoding, and decoding
US7751485B2 (en) 2005-10-05 2010-07-06 Lg Electronics Inc. Signal processing using pilot based coding
US7663513B2 (en) 2005-10-05 2010-02-16 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7743016B2 (en) 2005-10-05 2010-06-22 Lg Electronics Inc. Method and apparatus for data processing and encoding and decoding method, and apparatus therefor
US7774199B2 (en) 2005-10-05 2010-08-10 Lg Electronics Inc. Signal processing using pilot based coding
US7660358B2 (en) 2005-10-05 2010-02-09 Lg Electronics Inc. Signal processing using pilot based coding
US8068569B2 (en) 2005-10-05 2011-11-29 Lg Electronics, Inc. Method and apparatus for signal processing and encoding and decoding
US7671766B2 (en) 2005-10-05 2010-03-02 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7756702B2 (en) 2005-10-05 2010-07-13 Lg Electronics Inc. Signal processing using pilot based coding
US7672379B2 (en) 2005-10-05 2010-03-02 Lg Electronics Inc. Audio signal processing, encoding, and decoding
US7696907B2 (en) 2005-10-05 2010-04-13 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7675977B2 (en) 2005-10-05 2010-03-09 Lg Electronics Inc. Method and apparatus for processing audio signal
US7756701B2 (en) 2005-10-05 2010-07-13 Lg Electronics Inc. Audio signal processing using pilot based coding
US7646319B2 (en) 2005-10-05 2010-01-12 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7840401B2 (en) 2005-10-24 2010-11-23 Lg Electronics Inc. Removing time delays in signal paths
US7761289B2 (en) 2005-10-24 2010-07-20 Lg Electronics Inc. Removing time delays in signal paths
US8095358B2 (en) 2005-10-24 2012-01-10 Lg Electronics Inc. Removing time delays in signal paths
US7716043B2 (en) 2005-10-24 2010-05-11 Lg Electronics Inc. Removing time delays in signal paths
US7742913B2 (en) 2005-10-24 2010-06-22 Lg Electronics Inc. Removing time delays in signal paths
US8095357B2 (en) 2005-10-24 2012-01-10 Lg Electronics Inc. Removing time delays in signal paths
US7653533B2 (en) 2005-10-24 2010-01-26 Lg Electronics Inc. Removing time delays in signal paths
JP2009522894A (ja) * 2006-01-09 2009-06-11 ノキア コーポレイション バイノーラルオーディオ信号の復号
JP2009522895A (ja) * 2006-01-09 2009-06-11 ノキア コーポレイション バイノーラルオーディオ信号の復号
US7865369B2 (en) 2006-01-13 2011-01-04 Lg Electronics Inc. Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US7752053B2 (en) 2006-01-13 2010-07-06 Lg Electronics Inc. Audio signal processing using pilot based coding
WO2007091927A1 (en) * 2006-02-06 2007-08-16 Telefonaktiebolaget Lm Ericsson (Publ) Variable frame offset coding
US8204740B2 (en) 2006-02-06 2012-06-19 Telefonaktiebolaget Lm Ericsson (Publ) Variable frame offset coding
US9883307B2 (en) 2011-07-05 2018-01-30 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for decomposing a stereo recording using frequency-domain processing employing a spectral weights generator
CN108352162A (zh) * 2015-09-25 2018-07-31 沃伊斯亚吉公司 用于使用主声道的编码参数编码立体声声音信号以编码辅声道的方法和系统
RU2730548C2 (ru) * 2015-09-25 2020-08-24 Войсэйдж Корпорейшн Способ и система для кодирования левого и правого каналов стереофонического звукового сигнала с выбором между моделями двух и четырех подкадров в зависимости от битового бюджета
EP3353784A1 (en) * 2015-09-25 2018-08-01 VoiceAge Corporation Method and system for encoding left and right channels of a stereo sound signal selecting between two and four sub-frames models depending on the bit budget
EP3353784A4 (en) * 2015-09-25 2019-05-22 VoiceAge Corporation METHOD AND SYSTEM FOR CODING THE LEFT AND RIGHT CHANNELS OF A STEREOTONE SIGNAL WITH SELECTION BETWEEN TWO OR FOUR MODEL MODELS PER BIT HOLIDAY HOUSEHOLD
US10319385B2 (en) 2015-09-25 2019-06-11 Voiceage Corporation Method and system for encoding left and right channels of a stereo sound signal selecting between two and four sub-frames models depending on the bit budget
US10325606B2 (en) 2015-09-25 2019-06-18 Voiceage Corporation Method and system using a long-term correlation difference between left and right channels for time domain down mixing a stereo sound signal into primary and secondary channels
US10339940B2 (en) 2015-09-25 2019-07-02 Voiceage Corporation Method and system for encoding a stereo sound signal using coding parameters of a primary channel to encode a secondary channel
US10522157B2 (en) 2015-09-25 2019-12-31 Voiceage Corporation Method and system for time domain down mixing a stereo sound signal into primary and secondary channels using detecting an out-of-phase condition of the left and right channels
US10573327B2 (en) 2015-09-25 2020-02-25 Voiceage Corporation Method and system using a long-term correlation difference between left and right channels for time domain down mixing a stereo sound signal into primary and secondary channels
WO2017049400A1 (en) 2015-09-25 2017-03-30 Voiceage Corporation Method and system for encoding left and right channels of a stereo sound signal selecting between two and four sub-frames models depending on the bit budget
EP3699909A1 (en) * 2015-09-25 2020-08-26 VoiceAge Corporation Method and system for encoding a stereo sound signal using coding parameters of a primary channel to encode a secondary channel
US10839813B2 (en) 2015-09-25 2020-11-17 Voiceage Corporation Method and system for decoding left and right channels of a stereo sound signal
US10984806B2 (en) 2015-09-25 2021-04-20 Voiceage Corporation Method and system for encoding a stereo sound signal using coding parameters of a primary channel to encode a secondary channel
US11056121B2 (en) 2015-09-25 2021-07-06 Voiceage Corporation Method and system for encoding left and right channels of a stereo sound signal selecting between two and four sub-frames models depending on the bit budget
RU2764287C1 (ru) * 2015-09-25 2022-01-17 Войсэйдж Корпорейшн Способ и система для кодирования левого и правого каналов стереофонического звукового сигнала с выбором между моделями двух и четырех подкадров в зависимости от битового бюджета
EP4235659A3 (en) * 2015-09-25 2023-09-06 VoiceAge Corporation Method and system using a long-term correlation difference between left and right channels for time domain down mixing a stereo sound signal into primary and secondary channels
US11462223B2 (en) 2018-06-29 2022-10-04 Huawei Technologies Co., Ltd. Stereo signal encoding method and apparatus, and stereo signal decoding method and apparatus
US11790923B2 (en) 2018-06-29 2023-10-17 Huawei Technologies Co., Ltd. Stereo signal encoding method and apparatus, and stereo signal decoding method and apparatus

Also Published As

Publication number Publication date
JP4589366B2 (ja) 2010-12-01
EP1623411B1 (en) 2007-08-29
CN101118747A (zh) 2008-02-06
BRPI0419281B1 (pt) 2018-08-14
SE527670C2 (sv) 2006-05-09
RU2007121143A (ru) 2008-12-10
EP1845519A2 (en) 2007-10-17
CN101118747B (zh) 2011-02-23
DE602004008613T2 (de) 2008-06-12
JP2008026914A (ja) 2008-02-07
EP1623411A1 (en) 2006-02-08
AU2004298708B2 (en) 2008-01-03
JP2007529021A (ja) 2007-10-18
MXPA05012230A (es) 2006-02-10
AU2004298708A1 (en) 2005-06-30
EP1845519A3 (en) 2007-11-07
DE602004023240D1 (de) 2009-10-29
DE602004008613D1 (de) 2007-10-11
CA2690885A1 (en) 2005-06-30
SE0400417D0 (sv) 2004-02-20
BRPI0410856B1 (pt) 2019-10-01
JP4335917B2 (ja) 2009-09-30
HK1115665A1 (en) 2008-12-05
CA2527971C (en) 2011-03-15
ATE371924T1 (de) 2007-09-15
BRPI0410856B8 (pt) 2019-10-15
RU2005134365A (ru) 2006-05-27
SE0400417L (sv) 2005-06-20
ATE443317T1 (de) 2009-10-15
BRPI0410856A (pt) 2006-07-04
RU2425340C2 (ru) 2011-07-27
PL1623411T3 (pl) 2008-01-31
CA2527971A1 (en) 2005-06-30
CN1816847A (zh) 2006-08-09
EP1845519B1 (en) 2009-09-16
CA2690885C (en) 2014-01-21
RU2305870C2 (ru) 2007-09-10
HK1091585A1 (en) 2007-01-19
CN100559465C (zh) 2009-11-11
ZA200508980B (en) 2007-03-28

Similar Documents

Publication Publication Date Title
EP1623411B1 (en) Fidelity-optimised variable frame length encoding
US7809579B2 (en) Fidelity-optimized variable frame length encoding
JP4809370B2 (ja) マルチチャネル音声符号化における適応ビット割り当て
US9626973B2 (en) Adaptive bit allocation for multi-channel audio encoding
JP2020091503A (ja) ステレオオーディオ信号を出力する装置及び方法
JP5455647B2 (ja) オーディオデコーダ
US20080312759A1 (en) Flexible frequency and time partitioning in perceptual transform coding of audio
US20140156287A1 (en) Bitstream syntax for multi-process audio decoding
US20090204397A1 (en) Linear predictive coding of an audio signal
US7725324B2 (en) Constrained filter encoding of polyphonic signals
AU2007237227B2 (en) Fidelity-optimised pre-echo suppressing encoding
EP1639580B1 (en) Coding of multi-channel signals

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2005/08980

Country of ref document: ZA

Ref document number: 2005134365

Country of ref document: RU

Ref document number: 200508980

Country of ref document: ZA

WWE Wipo information: entry into national phase

Ref document number: 2004820553

Country of ref document: EP

Ref document number: 2004298708

Country of ref document: AU

Ref document number: 2006518596

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: PA/a/2005/012230

Country of ref document: MX

WWE Wipo information: entry into national phase

Ref document number: 5273/DELNP/2005

Country of ref document: IN

ENP Entry into the national phase

Ref document number: 2004298708

Country of ref document: AU

Date of ref document: 20041215

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2527971

Country of ref document: CA

WWP Wipo information: published in national office

Ref document number: 2004298708

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 20048186630

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 2004820553

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Ref document number: DE

ENP Entry into the national phase

Ref document number: PI0410856

Country of ref document: BR

WWG Wipo information: grant in national office

Ref document number: 2004820553

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2004298708

Country of ref document: AU

Date of ref document: 20041215

Kind code of ref document: B