WO2004097794A2 - Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods - Google Patents

Info

Publication number
WO2004097794A2
Authority
WO
WIPO (PCT)
Prior art keywords
signal
subband
channel
accordance
stereo
Prior art date
Application number
PCT/EP2004/004607
Other languages
French (fr)
Other versions
WO2004097794A3 (en)
Inventor
Jonas Engdegard
Lars Villemoes
Original Assignee
Coding Technologies Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=20291180&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=WO2004097794(A2) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Priority to EP17173333.0A (EP3244637B1)
Priority to PL17173338T (PL3244640T3)
Priority to PL17173334T (PL3244638T3)
Priority to JP2006505342A (JP4527716B2)
Priority to AT04730525T (ATE444655T1)
Priority to CN2004800114628A (CN1781338B)
Priority to EP17173336.3A (EP3244639B1)
Priority to EP17173334.8A (EP3244638B1)
Priority to EP17173338.9A (EP3244640B1)
Application filed by Coding Technologies Ab
Priority to PL17173337T (PL3247135T3)
Priority to EP17173337.1A (EP3247135B1)
Priority to EP04730525A (EP1616461B1)
Priority to PL17173336T (PL3244639T3)
Priority to DE602004023381T (DE602004023381D1)
Priority to EP20193070.8A (EP3823316B1)
Priority to PL17173333T (PL3244637T3)
Publication of WO2004097794A2
Publication of WO2004097794A3
Priority to US11/260,659 (US7487097B2)
Priority to HK06101638.0A (HK1081715A1)
Priority to US11/698,611 (US7564978B2)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03H IMPEDANCE NETWORKS, e.g. RESONANT CIRCUITS; RESONATORS
    • H03H17/00 Networks using digital techniques
    • H03H17/02 Frequency selective networks
    • H03H17/0248 Filters characterised by a particular frequency response or filtering method
    • H03H17/0264 Filter sets with mutual related characteristics
    • H03H17/0266 Filter banks
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S5/00 Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03 Application of parametric coding in stereophonic audio systems

Definitions

  • the present invention relates to audio source coding systems but the same methods could also be applied in many other technical fields. Different techniques that are useful for audio coding systems using parametric representations of stereo properties are introduced.
  • the present invention relates to parametric coding of the stereo image of an audio signal.
  • Typical parameters used for describing stereo image properties are inter-channel intensity difference (IID), inter-channel time difference (ITD), and inter-channel coherence (IC).
  • IID inter-channel intensity difference
  • ITD inter-channel time difference
  • IC inter-channel coherence
  • LTI linear time invariant
  • Frequency domain methods for generating a de-correlated signal by adding a random sequence to the IID values along the frequency axis, where different sequences are used for the different audio channels, are also known from prior art.
  • One problem with frequency domain decorrelation by the random sequence modifications is the introduction of pre-echoes. Subjective tests have shown that for non-stationary signals, pre-echoes are by far more annoying than post-echoes, which is also well supported by established psychoacoustical principles. This problem could be reduced by dynamically adapting transform sizes to the signal characteristics in terms of transient content. However, switching transform sizes is always a hard (i.e., binary) decision that affects the full signal bandwidth and that can be difficult to accomplish in a robust manner.
  • United States patent application publication US 2003/0219130 A1 discloses coherence-based audio coding and synthesis.
  • an auditory scene is synthesized from a mono audio signal by modifying, for each critical band, an auditory scene parameter such as an inter-aural level difference (ILD) and/or an inter-aural time difference (ITD) for each subband within the critical band, where the modification is based on an average estimated coherence for the critical band.
  • the coherence-based modification produces auditory scenes having object widths which more accurately match the widths of the objects in the original input auditory scene.
  • Stereo parameters are the well-known BCC parameters, wherein BCC stands for binaural cue coding.
  • frequency coefficients as obtained by a discrete Fourier transform are grouped together in a single critical band.
  • weighting factors are multiplied by a pseudo-random sequence which is preferably chosen such that the variance is approximately constant for all critical bands, and the average is "0" within each critical band. The same sequence is applied to the spectral coefficients of each different frame.
  • the present invention is based on the finding that, on the decoding side, a good decorrelation signal for generating a first and a second channel of a multi-channel signal based on the input mono signal is obtained when a reverberation filter is used which introduces an integer or preferably a fractional delay into the input signal.
  • this reverberation filter is not applied to the whole input signal. Instead, several reverberation filters are applied to several subbands of the original input signal, i.e., the mono signal so that the reverberation filtering using the reverberation filters is not applied in a time domain or in the frequency domain, i.e., in the domain which is reached, when a Fourier transform is applied.
  • the reverberation filtering using reverberation filters for the subbands is individually performed in the subband domain.
  • a subband signal includes a sequence of at least two subband samples, the sequence of the subband samples representing a bandwidth of the subband signal, which is smaller than the bandwidth of the input signal.
  • the frequency bandwidth of a subband signal is higher than a frequency bandwidth attributed to a frequency coefficient obtained by Fourier transform.
  • the subband signals are preferably generated by means of a filterbank having for example 32 or 64 filterbank channels, while an FFT would have, for the same example, 1024 or 2048 frequency coefficients, i.e., frequency channels.
  • the subband signals can be subband signals obtained by subband-filtering a block of samples of the input signal.
  • the subband filterbank can also be applied continuously without block-wise processing.
  • block-wise processing is preferred.
  • fractional delays in a reverberation filter such as a delay between 0.1 and 0.9 and preferably 0.2 to 0.8 of the sampling period of the subband signal. It is noted that in case of critical sampling, and when 64 subband signals are generated using a filterbank having 64 filterbank channels, the sampling period in a subband signal is 64 times larger than the sampling period of the original input signal.
  • the delays are an integral part of the filtering process used in the reverberation device.
  • the output signal consists of a multitude of delayed versions of the input signal. It is preferred to delay signals by fractions of the subband sampling period, in order to achieve a good reverberation device in the subband domain.
  • the delay, and preferably the fractional delay, introduced by each reverberation filter in each subband is equal for all subbands. Nevertheless, the filter coefficients are different for each subband. It is preferred to use IIR filters. Depending on the actual situation, the fractional delay and the filter coefficients for the different filters can be determined empirically using listening tests.
  • the subbands filtered by the set of reverberation filters constitute a decorrelation signal which is to be mixed with the original input signal, i.e., the mono signal to obtain a decoded left channel and decoded right channel.
  • This mixing of a decorrelation signal with the original signal is performed based on an inter-channel coherence parameter transmitted together with the parametrically encoded signal.
  • mixing of the decorrelation signal with a mono signal to obtain the first output channel is different from mixing the decorrelation signal with the mono signal to obtain the second output channel.
  • an encoder includes, in addition to a means for calculating the mono signal and in addition to a means for generating a stereo parameter set, a means for determining a validity of stereo parameter sets for subsequent portions of the left and right channels.
  • the means for determining is operative to activate the means for generating, when it is determined that the stereo parameter set is not valid anymore so that a second stereo parameter set is calculated for portions of the left and right channels starting at a second time border. This second time border is also determined by the means for determining a validity.
  • the encoded output signal then includes the mono signal, a first stereo parameter set and a first time border associated with the first parameter set and the second stereo parameter set and the second time border associated with the second stereo parameter set.
  • the decoder will use a valid stereo parameter set until a new time border is reached. When this new time border is reached, the decoding operations are performed using the new stereo parameter set.
  • the inventive adaptive determination of stereo parameter sets for different encoder-side determined time borders provides a high coding efficiency on the one hand and a high coding quality on the other hand. This is due to the fact that for relatively stationary signals, the same stereo parameter set can be used for many blocks of the samples of the mono signal without introducing audible errors. On the other hand, when non-stationary signals are concerned, the inventive adaptive stereo parameter determination provides an improved time resolution so that each signal portion has its optimum stereo parameter set.
  • the present invention provides a solution to the prior art problems by using a reverberation unit as a de-correlator implemented with fractional delay lines in a filterbank, and using adaptive level adjustment of the de-correlated reverberated signal.
  • One aspect of the invention is a method for delaying a signal by: filtering a real-valued time domain signal through the analysis part of a complex filterbank; modifying the complex-valued subband signals obtained from the filtering; filtering the modified complex-valued subband signals through the synthesis part of the filterbank; and taking the real part of the complex-valued time domain output signal, where the output signal is the sum of the signals obtained from the synthesis filtering.
  • ter ⁇ T/L
  • the synthesis filter bank has $L$ subbands and the desired delay is $\tau$, measured in output signal sample units.
  • Another aspect of the invention is a method for modifying the complex valued subband signals by filtering where the filter
  • Another aspect of the invention is a method for coding of stereo properties of an input signal by, at an encoder, calculating time grid parameters describing the location in time of each stereo parameter set, where the number of stereo parameter sets is arbitrary, and, at a decoder, applying parametric stereo synthesis according to that time grid.
  • Another aspect of the invention is a method for coding of stereo properties of an input signal, where the time localisation for the first stereo parameter set is, in the case where a time cue for the stereo parameter set coincides with the beginning of a frame, signalled explicitly instead of signalling the time pointer.
  • Another aspect of the invention is a method for generation of stereo decorrelation for parametric stereo reconstruction, by at a decoder, applying an artificial reverberation process to synthesise the side signal.
  • Another aspect of the invention is a method for generation of stereo decorrelation for parametric stereo reconstruction in which, at the decoder, the reverberation process is performed within a complex modulated filterbank using phase delay adjustment in each filterbank channel.
  • Another aspect of the invention is a method for generation of stereo decorrelation for parametric stereo reconstruction in which, at the decoder, the reverberation process utilises a detector designed for finding signals where the reverberation tail could be unwanted, and attenuates or removes the reverberation tail for such signals.
  • Fig. 1 illustrates a block diagram of the inventive apparatus
  • Fig. 2 illustrates a block diagram of the means for generating a de-correlated signal
  • Fig. 3 illustrates the analysis of a single channel and the synthesis of the stereo channel pair based on the reconstructed stereo subband-signals according to the present invention
  • Fig. 4 illustrates a block diagram of the division of the parametric stereo parameters sets into time segments, based on the signal characteristic
  • Fig. 5 illustrates an example of the division of the parametric stereo parameters sets into time segments, based on the signal characteristic.
  • Delaying a signal by a fraction of a sample can be achieved by several prior art interpolation methods. However, special cases arise when the original signal is available as oversampled complex-valued samples. Performing fractional delay in the QMF bank by only applying a phase delay, by a factor for each QMF channel corresponding to a constant time delay, results in severe artefacts.
  • $H_n(\omega) = \sum_k h_n(k)\exp(-ik\omega)$ is the discrete time Fourier transform of the filter applied in frequency band $n$ for $n \ge 0$, and
  • $H_n(\omega) = H_{-1-n}(-\omega)^*$ for $n < 0$. (8)
  • $T$ is the integer closest to $L\tau$.
  • p(k) is extended by zeros outside its support.
  • $v_\tau(k)$ are different from zero, and (14) is a system of linear equations.
  • the number of unknowns $g_\tau(k)$ is typically chosen to be a small number.
  • 3-4 taps already give very good delay performance.
  • the dependence of the filter taps $g_\tau(k)$ on the delay parameter $\tau$ can often be modelled successfully by low order polynomials.
  • Parametric stereo systems always lead to compromises in terms of limited time or frequency resolution in order to minimise conveyed data. It is however well known from psycho-acoustics that some spatial cues can be more important than others, which leads to the possibility to discard the less important cues. Hence, the time resolution does not have to be constant. Great gain in bitrate could be achieved by letting the time grid synchronise with the spatial cues. It can easily be done by sending a variable number of parameter sets for each data frame that corresponds to a time segment of fixed size. In order to synchronise the parameter sets with corresponding spatial cues, additional time grid data describing the location in time for each parameter set has to be sent. The resolution of those time pointers could be chosen to be quite low to keep the total amount of data minimised. A special case where a time cue for a parameter set coincides with the beginning of a frame could be signalled explicitly to avoid sending that time pointer.
  • Fig. 4 illustrates an inventive apparatus for performing parameter analysis for time segments having variable and signal dependent time borders.
  • the inventive apparatus includes means
  • Means 402 uses a detector specially designed for extracting spatial cues that are relevant for deciding where to set the time borders. Means 401 outputs all the
  • Fig. 5 illustrates an example of how the time grid generator can perform for a hypothetical input signal.
  • one parameter set per data frame is used if no other time border information is present.
  • the apparatus for encoding a stereo signal to obtain a mono output signal and the stereo parameter set includes the means for calculating the mono signal by combining a left and a right channel of the stereo signals by weighted addition.
  • a means 403 for generating a first stereo parameter set using a portion of the left channel and a portion of the right channel, the portions starting at a first time border, is connected to the means for determining the validity of the first stereo parameter set for subsequent portions of the left channel and the right channel.
  • the means for determining is collectively formed by the means 402 and 401 in Fig. 4.
  • the means for determining is operative to generate a second time border and to activate the means for generating, when it is determined that this first stereo parameter set is not valid anymore so that a second stereo parameter set for portions of the left and right channels starting at the second time border is generated.
  • Not shown in Fig. 4 are means for outputting the mono signal, the first stereo parameter set and the first time border associated with the first stereo parameter set, and the second stereo parameter set and the second time border associated with the second stereo parameter set, as the parametrically encoded stereo signal.
  • the means for determining a validity of a stereo parameter set can include a transient detector, since the probability is high that, after a transient, a new stereo parameter has to be generated, since a signal has changed its shape significantly.
  • the means for determining a validity can include an analysis-by-synthesis device, which is adapted for decoding the mono signal and the stereo parameter set to obtain a decoded left and a decoded right channel, to compare the decoded left channel and the decoded right channel to the left channel and to the right channel, and to activate the means for generating, when the decoded left channel and the decoded right channel are different from the left channel and the right channel by more than the predetermined threshold.
  • Data frame 1 The time segment corresponding to parameter set 1 starts at the beginning of data frame 1 since no other time border information is present in this data frame.
  • Data frame 2 Two time borders are present in this data frame.
  • the time segment corresponding to parameter set 2 starts at the first time border in this data frame.
  • the time segment corresponding to parameter set 3 starts at the second time border in this data frame.
  • Data frame 3 One time border is present in this data frame.
  • the time segment corresponding to parameter set 4 starts at the time border in this data frame.
  • Data frame 4 One time border is present in this data frame.
  • This time border coincides with the start border of the data frame 4 and does not have to be signalled since this is handled by the default case. Hence, this time border signal can be removed.
  • the time segment corresponding to parameter set 5 starts at the beginning of data frame 4, even without signalling this time border.
  • the filter in question should preferably be of all-pass character.
  • One successful approach is to use similar all-pass filters used for artificial reverberation processes.
  • Artificial reverberation algorithms usually require high time resolution to give an impulse response that is satisfactorily diffuse in time.
  • the filterbank provides excellent possibilities to let the reverberation properties be frequency selective in terms of, for example, reverberation equalisation, decay time, density and timbre.
  • filter bank implementations usually exchange time resolution for higher frequency resolution, which normally makes it hard to implement a reverberation process that is smooth enough in time.
  • Fig. 1 illustrates an inventive apparatus for the de-correlation method of signals as used in a parametric stereo system.
  • the inventive apparatus includes means 101 for providing a plurality of subband signals.
  • the providing means can be a complex QMF filterbank, where every signal is associated with a subband index.
  • the subband signals output by the means 101 of Fig. 1 are input into a means 102 for providing a de-correlated signal, and into means 103 and 106 for modifying the subband signal.
  • the output from 102 is input into means 104 and 105 for modifying the signal, and the outputs of 103, 104, 105 and 106 are input into means 107 and 108 for adding the subband signals.
  • the means 103, 104, 105 and 106 for modifying the subband signals adjust the level of the de-correlated signal and of the unprocessed signal output by 101, by multiplying the subband signal with a gain factor, so that the sum of every pair results in a signal with the amount of de-correlated signal given by the control parameters.
  • the gain factors used in the means for modifying 103 - 106 are not limited to positive values. They can also be negative.
  • the output from the means for adding subband signals 107 and 108, is input to the means for providing a time-domain signal
  • the output from 109 corresponds to the left channel of the re-constructed stereo signal, and the output from
  • the same de-correlator is used for both output channels, while the means for adding the de-correlated signal with the un-processed signal are separate for the two output channels.
  • the presently described embodiment thereby ensures that the two output signals can be identical as well as completely de-correlated, dependent on the control data provided to the means for adjusting the levels of the signals, and the control data provided to the means for adding the signals.
  • In Fig. 2, a block diagram of the means for providing a de-correlated signal is displayed.
  • the input subband signal is input to the means for filtering a subband signal 201.
  • the filtering step is a reverberation unit incorporating all-pass filtering.
  • the filter coefficients used are given by the means for providing filter coefficients 202.
  • the subband index of the currently processed subband signal is input to 202. In one embodiment of the present invention different filter coefficients are calculated based on the subband index provided to 202.
  • the filtering step in 201 relies on delayed samples of the input subband signal as well as delayed samples of intermediate signals in the filtering procedure.
  • means for providing integer subband sample delay and fractional subband sample delay are provided by 203.
  • the output of 201 is input to a means for adjusting the level of the subband signal 204, and also to a means for estimating signal characteristics of the subband signal 205.
  • the characteristic estimated is the transient behaviour of the subband signal.
  • a detected transient is signalled to the means for adjusting the level of a subband signal 204, so that the level of the signal is reduced during transient passages.
  • the output from 204 is the de-correlated signal input to 104 and 105 of Fig. 1.
  • In Fig. 3, the single analysis filterbank and the two synthesis filterbanks are shown.
  • the analysis filterbank 301 operates on the mono input signal, while the synthesis filterbanks 302 and 303 operate on the re-constructed stereo signals.
  • Fig. 1 therefore, shows the inventive apparatus for generating a decorrelation signal which is indicated by reference 102.
  • this apparatus includes means for providing a plurality of subband signals, wherein a subband signal includes the sequence of at least two subband samples, the sequence of the subband samples representing a bandwidth of the subband signal which is smaller than a bandwidth of the input signal.
  • Each subband signal is input into the means 201 for filtering.
  • Each means 201 for filtering includes a reverberation filter so that a plurality of reverberated subband signals are obtained, wherein the plurality of reverberated subband signals together represent the decorrelation signal.
  • there can be a subband-wise postprocessing of reverberated subband signals which is performed by block 204, which is controlled by block 205.
  • Each reverberation filter is set to a certain delay, and preferably a fractional delay, and each reverberation filter has several filter coefficients, which, as it is shown in Fig. 2, depend on the subband index. This means that it is preferred to use the same delay for each subband but to use different sets of filter coefficients for the different subbands. This is symbolized by means 203 and 202 in Fig. 2, although it is to be mentioned here that delays and filter coefficients are preferably fixedly determined when shipping a decorrelation device, wherein the delays and filter coefficients may be determined empirically using listening tests etc.
  • a multi-channel decoder is shown by Fig. 1 and includes the inventive apparatus for generating the decorrelation signal, which is termed 102 in Fig. 1.
  • the multi-channel decoder shown in Fig. 1 is for decoding a mono signal and an associated inter-channel coherence measure, the inter-channel coherence measure representing a coherence between a plurality of original channels, wherein the mono signal is derived from the plurality of original channels.
  • Block 102 in Fig. 1 constitutes a generator for generating a decorrelation signal for the mono signal.
  • Blocks 103, 104, 105, 106 and 107 and 108 constitute a mixer for mixing the mono signal and the decorrelation signal in accordance with the first mixing mode to obtain a first decoded output signal and in accordance with the second mixing mode to obtain a second decoded output signal, wherein the mixer is operative to determine the first mixing mode and the second mixing mode based on the inter-channel coherence measure transmitted as a side information to the mono signal.
  • the mixer is preferably operative to mix in a subband domain based on separate inter-channel coherence measures for different subbands.
  • the multi-channel decoder further comprises means 109 and 110 for converting the first and second decoded output signals from the subband domain into a time domain to obtain a first decoded output signal and a second decoded output signal in the time domain. Therefore, the inventive means 102 for generating a decorrelation signal and the inventive multi-channel decoder as shown in Fig. 1 operate in the subband domain and perform, as the very last step, a subband domain to time domain conversion.
  • the inventive device can be implemented in hardware or in software or in a firmware including hardware constituents and software constituents.
  • the invention also is a computer program having a computer-readable code for carrying out the inventive methods when running on a computer.
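
The time-border rules in the four-data-frame example of the list above can be sketched as follows. This is only an illustration under stated assumptions: the frame length of 32 time slots, the border offsets, and the names `segment_starts` and `active_set` are hypothetical and do not come from the source. An empty border list stands for the default case in which the parameter set starts at the beginning of the data frame.

```python
def segment_starts(frames, frame_length):
    """Return (start_time, parameter_set) pairs in decoding order.

    frames: one (parameter_sets, borders) pair per data frame, where the
    borders are offsets inside the frame. An empty border list means the
    single parameter set starts at the frame start (the default case).
    """
    starts = []
    for index, (param_sets, borders) in enumerate(frames):
        frame_start = index * frame_length
        offsets = borders if borders else [0]
        if len(offsets) != len(param_sets):
            raise ValueError("one time border is needed per parameter set")
        for offset, params in zip(offsets, param_sets):
            starts.append((frame_start + offset, params))
    return starts


def active_set(starts, time):
    """Parameter set in force at `time`: the latest one whose border was reached."""
    current = None
    for start, params in starts:
        if start <= time:
            current = params
    return current


# Hypothetical frame length and border positions, for illustration only.
frames = [
    (["set 1"], []),                  # data frame 1: no border, default start
    (["set 2", "set 3"], [10, 24]),   # data frame 2: two time borders
    (["set 4"], [16]),                # data frame 3: one time border
    (["set 5"], []),                  # data frame 4: border at frame start, not signalled
]
starts = segment_starts(frames, frame_length=32)
print(starts)
print(active_set(starts, 35))  # early in data frame 2, before its first border
```

In this sketch the decoder keeps using a parameter set until the next border is reached, so the start of data frame 2 is still covered by the set from data frame 1.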

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Mathematical Physics (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computer Hardware Design (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Color Television Image Signal Generators (AREA)
  • Picture Signal Circuits (AREA)
  • Epoxy Resins (AREA)
  • Networks Using Active Elements (AREA)
  • Developing Agents For Electrophotography (AREA)
  • Filters That Use Time-Delay Elements (AREA)
  • Wind Motors (AREA)
  • Silver Salt Photography Or Processing Solution Therefor (AREA)

Abstract

A synthesizer for generating a decorrelation signal using an input signal is operative on a plurality of subband signals, wherein a subband signal includes a sequence of at least two subband samples, the sequence of the subband samples representing a bandwidth of the subband signal, which is smaller than a bandwidth of the input signal. The synthesizer includes a filter stage (201) for filtering each subband signal using a reverberation filter to obtain a plurality of reverberated subband signals, wherein a plurality of reverberated subband signals together represent the decorrelation signal. This decorrelation signal is used for reconstructing a signal based on a parametrically encoded stereo signal consisting of a mono signal and a coherence measure.
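
As a rough illustration of the structure summarised in the abstract, the sketch below filters each subband with its own all-pass section and then mixes the mono and decorrelated signals according to a coherence value. Several things here are assumptions rather than the patent's method: a single Schroeder all-pass section stands in for the reverberation filter, the delay is an integer number of subband samples (the text prefers fractional delays), and the mixing gains are one textbook choice that yields the requested correlation when the two inputs have equal power and are uncorrelated. All function names are hypothetical.

```python
import math


def allpass(x, g, delay):
    """Schroeder all-pass section: y[n] = -g*x[n] + x[n-delay] + g*y[n-delay]."""
    y = []
    for n, xn in enumerate(x):
        x_d = x[n - delay] if n >= delay else 0.0
        y_d = y[n - delay] if n >= delay else 0.0
        y.append(-g * xn + x_d + g * y_d)
    return y


def decorrelate(subbands, coeffs, delay):
    """Filter each subband with its own coefficient but a shared delay."""
    return [allpass(x, g, delay) for x, g in zip(subbands, coeffs)]


def mix(mono, decorr, coherence):
    """Mix one subband of mono and decorrelated signal into two outputs.

    Assumes equal-power, uncorrelated inputs; the gains are illustrative.
    The second output uses a negative gain on the decorrelated signal.
    """
    a = math.sqrt((1 + coherence) / 2)
    b = math.sqrt((1 - coherence) / 2)
    first = [a * m + b * d for m, d in zip(mono, decorr)]
    second = [a * m - b * d for m, d in zip(mono, decorr)]
    return first, second


# Two hypothetical subbands carrying a unit impulse, 200 subband samples each.
impulse = [1.0] + [0.0] * 199
reverberated = decorrelate([impulse, impulse], coeffs=[0.5, 0.7], delay=3)
energies = [sum(v * v for v in band) for band in reverberated]
print(energies)  # an all-pass filter preserves the energy of its input

# Orthogonal, equal-power test sequences to check the mixer.
first, second = mix([1.0, 1.0, 1.0, 1.0], [1.0, -1.0, 1.0, -1.0], coherence=0.5)
correlation = sum(f * s for f, s in zip(first, second)) / math.sqrt(
    sum(f * f for f in first) * sum(s * s for s in second))
print(correlation)
```

With a coherence of 1 the two outputs of `mix` are identical, and with a coherence of 0 they are uncorrelated, which mirrors the range of behaviour the description attributes to the mixer.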

Description

ADVANCED PROCESSING BASED ON A COMPLEX-EXPONENTIAL-MODULATED FILTERBANK AND ADAPTIVE TIME SIGNALLING METHODS
TECHNICAL FIELD
The present invention relates to audio source coding systems but the same methods could also be applied in many other technical fields. Different techniques that are useful for audio coding systems using parametric representations of stereo properties are introduced.
BACKGROUND OF THE INVENTION AND PRIOR ART
The present invention relates to parametric coding of the stereo image of an audio signal. Typical parameters used for describing stereo image properties are inter-channel intensity difference (IID), inter-channel time difference (ITD), and inter-channel coherence (IC). In order to re-construct the stereo image based on these parameters, a method is required that can re-construct the correct level of correlation between the two channels, according to the IC parameter. This is accomplished by a de-correlation method.
There are a couple of methods available for the creation of decorrelated signals. Ideally, a linear time invariant (LTI) function with all-pass frequency response is desired. One obvious method for achieving this is to use a constant delay. However, using a delay, or any other LTI all-pass function, will result in a non-all-pass response after adding the non-processed signal. In the case of a delay, the result will be a typical comb-filter. The comb-filter often gives an undesirable "metallic" sound that, even if the stereo widening effect can be efficient, considerably reduces the naturalness of the original.
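The comb-filter effect described above can be illustrated numerically. The following is a minimal sketch, assuming the simplest case in which a delayed copy is added to the unprocessed signal with equal weight; the function name and the delay of 8 samples are illustrative choices, not taken from the specification.

```python
import numpy as np

def comb_magnitude(delay, num_points=1025):
    """Magnitude response of y[n] = x[n] + x[n - delay] on the interval [0, pi]."""
    omega = np.linspace(0.0, np.pi, num_points)
    # The delay alone is all-pass (|exp(-i*omega*delay)| = 1), but the sum is not.
    return omega, np.abs(1.0 + np.exp(-1j * omega * delay))

omega, magnitude = comb_magnitude(delay=8)
print("peak gain:", magnitude.max())
print("deepest notch:", magnitude.min())
```

The response alternates between a gain of 2 and deep notches at regularly spaced frequencies, which is the comb structure associated with the "metallic" sound.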
Frequency domain methods for generating a de-correlated signal by adding a random sequence to the IID values along the frequency axis, where different sequences are used for the different audio channels, are also known from prior art. One problem with frequency domain decorrelation by the random sequence modifications is the introduction of pre-echoes. Subjective tests have shown that for non-stationary signals, pre-echoes are far more annoying than post-echoes, which is also well supported by established psychoacoustic principles. This problem could be reduced by dynamically adapting transform sizes to the signal characteristics in terms of transient content. However, switching transform sizes is always a hard (i.e., binary) decision that affects the full signal bandwidth and that can be difficult to accomplish in a robust manner.
United States patent application publication US 2003/0219130 A1 discloses a coherence-based audio coding and synthesis. In particular, an auditory scene is synthesized from a mono audio signal by modifying, for each critical band, an auditory scene parameter such as an inter-aural level difference (ILD) and/or an inter-aural time difference (ITD) for each subband within the critical band, where the modification is based on an average estimated coherence for the critical band. The coherence-based modification produces auditory scenes having object widths, which more accurately match the widths of the objects in the original input auditory scene. Stereo parameters are the well-known BCC parameters, wherein BCC stands for binaural cue coding. When generating two different decorrelated output channels, frequency coefficients as obtained by a discrete Fourier transform are grouped together in a single critical band. Based on the inter-channel coherence measure, weighting factors are multiplied by a pseudo-random sequence which is preferably chosen such that the variance is approximately constant for all critical bands, and the average is "0" within each critical band. The same sequence is applied to the spectral coefficients of each different frame.
SUMMARY OF THE INVENTION
It is the object of the present invention to provide a decoding concept for parametrically encoded multi-channel signals or an encoding concept for generating such signals which result in a good audio quality and a good coding efficiency.
This object is achieved by an apparatus for generating a decorrelation signal in accordance with claim 1, a multi-channel decoder in accordance with claim 13, a method of generating a decorrelation signal in accordance with claim 20, a method of multi-channel decoding in accordance with claim 21, an apparatus for encoding a stereo signal in accordance with claim 22 or a method of encoding a stereo signal in accordance with claim 26 or a computer program in accordance with claim 27.
The present invention is based on the finding that, on the decoding side, a good decorrelation signal for generating a first and a second channel of a multi-channel signal based on the input mono signal is obtained, when a reverberation filter is used, which introduces an integer or preferably a fractional delay into the input signal. Importantly, this reverberation filter is not applied to the whole input signal. Instead, several reverberation filters are applied to several subbands of the original input signal, i.e., the mono signal so that the reverberation filtering using the reverberation filters is not applied in a time domain or in the frequency domain, i.e., in the domain which is reached, when a Fourier transform is applied. Inventively, the reverberation filtering using reverberation filters for the subbands is individually performed in the subband domain.
A subband signal includes a sequence of at least two subband samples, the sequence of the subband samples representing a bandwidth of the subband signal, which is smaller than the bandwidth of the input signal. Naturally, the frequency bandwidth of a subband signal is higher than a frequency bandwidth attributed to a frequency coefficient obtained by Fourier transform. The subband signals are preferably generated by means of a filterbank having for example 32 or 64 filterbank channels, while an FFT would have, for the same example, 1,024 or 2,048 frequency coefficients, i.e., frequency channels.
The subband signals can be subband signals obtained by subband-filtering a block of samples of the input signal. Alternatively, the subband filterbank can also be applied continuously without a block-wise processing. For the present invention, however, block-wise processing is preferred.
Since the reverberation filtering is not applied to the whole signal, but is applied subband-wise, a "metallic" sound caused by comb-filtering is avoided.
In cases in which a sample period between two subsequent subband samples of the subband is too large for a good sound impression at the decoder end, it is preferred to use fractional delays in a reverberation filter such as a delay between 0.1 and 0.9 and preferably 0.2 to 0.8 of the sampling period of the subband signal. It is noted that in case of critical sampling, and when 64 subband signals are generated using a filterbank having 64 filterbank channels, the sampling period in a subband signal is 64 times larger than the sampling period of the original input signal.
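The arithmetic behind this remark can be made explicit. The following minimal sketch assumes critical sampling, so that one subband sampling period spans as many input samples as there are filterbank channels; the function name is illustrative only.

```python
def subband_delay_in_input_samples(num_channels, fraction):
    """Input-signal samples spanned by a fractional subband delay (critical sampling assumed)."""
    if not 0.0 < fraction < 1.0:
        raise ValueError("fraction must lie strictly between 0 and 1")
    # One subband sampling period equals num_channels input sampling periods.
    return num_channels * fraction

for fraction in (0.2, 0.5, 0.8):
    print(fraction, subband_delay_in_input_samples(64, fraction))
```

For a 64-channel filterbank, a fractional delay of 0.5 subband periods thus corresponds to 32 input samples, a resolution that an integer subband delay cannot offer.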
It is to be noted here that the delays are an integral part of the filtering process used in the reverberation device. The output signal consists of a multitude of delayed versions of the input signal. It is preferred to delay signals by fractions of the subband sampling period, in order to achieve a good reverberation device in the subband domain.
In preferred embodiments of the present invention, the delay, and preferably the fractional delay introduced by each reverberation filter in each subband is equal for all subbands. Nevertheless, the filter coefficients are different for each subband. It is preferred to use IIR filters. Depending on the actual situation, the fractional delay and the filter coefficients for the different filters can be determined empirically using listening tests.
The subbands filtered by the set of reverberation filters constitute a decorrelation signal which is to be mixed with the original input signal, i.e., the mono signal to obtain a decoded left channel and decoded right channel. This mixing of a decorrelation signal with the original signal is performed based on an inter-channel coherence parameter transmitted together with the parametrically encoded signal. To obtain different left and right channels, i.e., different first and second channels, mixing of the decorrelation signal with a mono signal to obtain the first output channel is different from mixing the decorrelation signal with the mono signal to obtain the second output channel.
To obtain higher efficiency on the encoding side, multi-channel encoding is performed using an adaptive determination of the stereo parameter set. To this end, an encoder includes, in addition to a means for calculating the mono signal and in addition to a means for generating a stereo parameter set, a means for determining a validity of stereo parameter sets for subsequent portions of the left and right channels. Preferably, the means for determining is operative to activate the means for generating, when it is determined that the stereo parameter set is not valid anymore so that a second stereo parameter set is calculated for portions of the left and right channels starting at a second time border. This second time border is also determined by the means for determining a validity.
The encoded output signal then includes the mono signal, a first stereo parameter set and a first time border associated with the first parameter set and the second stereo parameter set and the second time border associated with the second stereo parameter set. On the decoding side, the decoder will use a valid stereo parameter set until a new time border is reached. When this new time border is reached, the decoding operations are performed using the new stereo parameter set.
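The decoder-side rule just described (keep using a parameter set until the next time border is reached) can be sketched as a simple lookup. The representation below, a sorted list of time borders with one parameter set per border, and all names and values are assumptions made for illustration; the specification does not prescribe a data structure.

```python
from bisect import bisect_right

def active_parameter_set(time_borders, parameter_sets, t):
    """Return the parameter set whose time border is the latest one not after t."""
    index = bisect_right(time_borders, t) - 1
    if index < 0:
        raise ValueError("t lies before the first time border")
    return parameter_sets[index]

# Hypothetical borders (in subband time slots) and placeholder parameter sets.
borders = [0, 40, 52]
sets = ["set 1", "set 2", "set 3"]
print(active_parameter_set(borders, sets, 39))  # still the first set
print(active_parameter_set(borders, sets, 40))  # the second border has been reached
```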
Compared to prior art methods, which did a block-wise processing and, therefore, a block-wise determination of stereo parameter sets, the inventive adaptive determination of stereo parameter sets for different encoder-side determined time borders provides a high coding efficiency on the one hand and a high coding quality on the other hand. This is due to the fact that for relatively stationary signals, the same stereo parameter set can be used for many blocks of the samples of the mono signal without introducing audible errors. On the other hand, when non-stationary signals are concerned, the inventive adaptive stereo parameter determination provides an improved time resolution so that each signal portion has its optimum stereo parameter set. The present invention provides a solution to the prior art problems by using a reverberation unit as a de-correlator implemented with fractional delay lines in a filterbank, and using adaptive level adjustment of the de-correlated reverberated signal.
Subsequently, several aspects of the present invention are outlined.
One aspect of the invention is a method for delaying a signal by: filtering a real-valued time domain signal through the analysis part of a complex filterbank; modifying the complex-valued subband signals obtained from the filtering; filtering the modified complex-valued subband signals through the synthesis part of the filterbank; and taking the real part of the complex-valued time domain output signal, where the output signal is the sum of the signals obtained from the synthesis filtering.
Another aspect of the invention is a method for modifying the complex-valued subband signals by filtering each complex-valued subband signal with a complex-valued finite impulse response filter, where the finite impulse response filter for subband number n is given by a discrete time Fourier transform of the form
$$H_n(\omega) = \begin{cases} \exp(-i\pi(n+1/2)\tau)\,G_\tau(\omega), & \text{for } n \text{ even;} \\ \exp(-i\pi(n+1/2)\tau)\,G_\tau(\omega+\pi), & \text{for } n \text{ odd,} \end{cases}$$
where the parameter $\tau = T/L$, and where the synthesis filter bank has L subbands and the desired delay is T measured in output signal sample units.
Another aspect of the invention is a method for modifying the complex-valued subband signals by filtering, where the filter $G_\tau(\omega)$ approximately satisfies $V_\tau(\omega)G_\tau(\omega) + V_\tau(\omega+\pi)G_\tau(\omega+\pi) = 1$, where $V_\tau(\omega)$ is the discrete time Fourier transform of the sequence $v_\tau(k) = A\,i^k \sum_l p(l)\,p(l - T - Lk)$, and $p(l)$ is the prototype filter of said complex filterbank and A is an appropriate real normalization factor.
Another aspect of the invention is a method for modifying the complex-valued subband signals by filtering, where the filter $G_\tau(\omega)$ satisfies $G_\tau(-\omega) = G_\tau(\omega+\pi)^*$ such that even indexed impulse response samples are real valued and odd indexed impulse response samples are purely imaginary valued.
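The link between this symmetry and the structure of the filter taps can be checked numerically. Writing $G_\tau(\omega) = \sum_k g_\tau(k)\exp(-ik\omega)$, the condition is equivalent to $g_\tau(k) = (-1)^k g_\tau(k)^*$. The sketch below uses three made-up taps (real for even index, purely imaginary for odd index) purely for illustration.

```python
import numpy as np

def G(taps, first_index, omega):
    """Evaluate G(omega) = sum_k g(k) exp(-i k omega) for taps starting at first_index."""
    k = np.arange(first_index, first_index + len(taps))
    return np.sum(taps * np.exp(-1j * k * omega))

# Hypothetical taps g(-1), g(0), g(1): odd indices imaginary, even index real.
taps = np.array([0.1j, 0.9, -0.2j])
omega = 0.7
lhs = G(taps, -1, -omega)
rhs = np.conj(G(taps, -1, omega + np.pi))
print(abs(lhs - rhs))  # close to zero when the symmetry holds
```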
Another aspect of the invention is a method for coding of stereo properties of an input signal by, at an encoder, calculating time grid parameters describing the location in time for each stereo parameter set, where the number of stereo parameter sets is arbitrary, and, at a decoder, applying parametric stereo synthesis according to that time grid.
Another aspect of the invention is a method for coding of stereo properties of an input signal, where the time localisation for the first stereo parameter set is, in the case where a time cue for the stereo parameter set coincides with the beginning of a frame, signalled explicitly instead of signalling the time pointer.
Another aspect of the invention is a method for generation of stereo decorrelation for parametric stereo reconstruction by, at a decoder, applying an artificial reverberation process to synthesise the side signal.
Another aspect of the invention is a method for generation of stereo decorrelation for parametric stereo reconstruction where, at the decoder, the reverberation process is performed within a complex modulated filterbank using phase delay adjustment in each filter bank channel.
Another aspect of the invention is a method for generation of stereo decorrelation for parametric stereo reconstruction where, at the decoder, the reverberation process utilises a detector designed for finding signals where the reverberation tail could be unwanted and lets the reverberation tail be attenuated or removed.
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention will now be described by way of illustrative examples, not limiting the scope or spirit of the invention, with reference to the accompanying drawings, in which:
Fig. 1 illustrates a block diagram of the inventive apparatus;
Fig. 2 illustrates a block diagram of the means for generating a de-correlated signal;
Fig. 3 illustrates the analysis of a single channel and the synthesis of the stereo channel pair based on the reconstructed stereo subband-signals according to the present invention;
Fig. 4 illustrates a block diagram of the division of the parametric stereo parameters sets into time segments, based on the signal characteristic; and
Fig. 5 illustrates an example of the division of the parametric stereo parameters sets into time segments, based on the signal characteristic.
DESCRIPTION OF PREFERRED EMBODIMENTS
The below-described embodiments are merely illustrative for the principles of the present invention for parametric stereo coding. It is understood that modifications and variations of the arrangements and the details described herein will be apparent to others skilled in the art. It is the intent, therefore, to be limited only by the scope of the impending patent claims and not by the specific details presented by way of description and explanation of the embodiments herein.
Delaying a signal by a fraction of a sample can be achieved by several prior art interpolation methods. However, a special case arises when the original signal is available as oversampled complex-valued samples. Performing fractional delay in the QMF bank by only applying a phase delay, i.e., a factor for each QMF channel corresponding to a constant time delay, results in severe artefacts.
This can efficiently be avoided by using a compensation filter according to a novel approach allowing high quality approximations to arbitrary delays in any complex-exponential-modulated filterbank. A detailed description follows below.
A continuous time model
For ease of computations, a complex-exponential-modulated L-band filterbank will be modelled here by a continuous time windowed transform using the synthesis waveforms
$$u_{n,k}(t) = v(t-k)\exp\left[i\pi(n+1/2)(t-k+\theta)\right], \qquad (1)$$
where n, k are integers with $n \geq 0$ and θ is a fixed phase term. Results for discrete-time signals are obtained by suitable sampling of the t-variable with spacing 1/L. It is assumed that the real valued window v(t) is chosen such that for real valued signals x(t) it holds to very high precision that
[Equations (2) and (3): Figure imgf000013_0001, not reproduced in the text]
where * denotes complex conjugation. It is also assumed that v(t) is essentially band limited to the frequency interval $[-\pi, \pi]$. Consider the modification of each frequency band n by filtering the discrete time analysis samples $c_n(k)$ with a filter with impulse response $h_n(k)$,
$$d_n(k) = \sum_l h_n(l)\,c_n(k-l). \qquad (4)$$
Then the modified synthesis
[Equation (5): Figure imgf000013_0002, not reproduced in the text]
can be computed in the frequency domain to be
$$\hat{y}(\omega) = H(\omega)\,\hat{x}(\omega), \qquad (6)$$
where $\hat{f}(\omega)$ denotes the Fourier transform of f(t) and
[Equation (7): Figure imgf000013_0003, not reproduced in the text]
Here, $H_n(\omega) = \sum_k h_n(k)\exp(-ik\omega)$ is the discrete time Fourier transform of the filter applied in frequency band n for $n \geq 0$, and
$$H_n(\omega) = H_{-1-n}(-\omega)^* \quad \text{for } n < 0. \qquad (8)$$
Observe here that the special case $H_n(\omega) = 1$ leads to $H(\omega) = 1$ in (7) due to the special design of the window v(t). Another case of interest is $H_n(\omega) = \exp(-i\omega)$, which gives $H(\omega) = \exp(-i\omega)$, so that $y(t) = x(t-1)$.
The proposed solution
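In order to achieve a delay of size τ, such that $y(t) = x(t-\tau)$, the problem is to design filters $H_n(\omega)$ for $n \geq 0$ such that
$$H(\omega) = \exp(-i\tau\omega), \qquad (9)$$
where $H(\omega)$ is given by (7) and (8). The particular solution proposed here is to apply the filters
$$H_n(\omega) = \begin{cases} \exp(-i\pi(n+1/2)\tau)\,G_\tau(\omega), & \text{for } n \text{ even;} \\ \exp(-i\pi(n+1/2)\tau)\,G_\tau(\omega+\pi), & \text{for } n \text{ odd.} \end{cases} \qquad (10)$$
Here $G_\tau(-\omega) = G_\tau(\omega+\pi)^*$ implies consistency with (8) for all n. Insertion of (10) into the right hand side of (7) yields
$$H(\omega) = \exp(-i\omega\tau)\left[V_\tau(\omega)G_\tau(\omega) + V_\tau(\omega+\pi)G_\tau(\omega+\pi)\right], \qquad (11)$$
where $V_\tau(\omega) = \sum_n b(\omega - \pi(2n+1/2))$ with $b(\omega)$ given by
[Definition of b(ω): Figure imgf000014_0001, not reproduced in the text]
Elementary computations show that $V_\tau(\omega)$ is the discrete time Fourier transform of
[Equation (12): Figure imgf000014_0002, not reproduced in the text]
Very good approximations to the perfect delay can be obtained by solving the linear system
$$V_\tau(\omega)G_\tau(\omega) + V_\tau(\omega+\pi)G_\tau(\omega+\pi) = 1 \qquad (13)$$
in the least squares sense with an FIR filter $G_\tau(\omega) = \sum_{k \geq -N} g_\tau(k)\exp(-ik\omega)$ having finitely many taps. In terms of filter coefficients, equation (13) can be written
$$2\sum_l v_\tau(2k-l)\,g_\tau(l) = \delta[k], \qquad (14)$$
where $\delta[k] = 1$ for $k = 0$ and $\delta[k] = 0$ for $k \neq 0$.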
In the case of a discrete time L-band filter bank with prototype filter p(k), the obtained delay in sample units is Lτ and the computation (12) is replaced by
$$v_\tau(k) = i^k \sum_l p(l)\,p(l - T - Lk), \qquad (15)$$
where T is the integer closest to Lτ. Here p(k) is extended by zeros outside its support. For a finite length prototype filter, only finitely many $v_\tau(k)$ are different from zero, and (14) is a system of linear equations. The number of unknowns $g_\tau(k)$ is typically chosen to be small. For good QMF filter bank designs, 3-4 taps already give very good delay performance. Moreover, the dependence of the filter taps $g_\tau(k)$ on the delay parameter τ can often be modelled successfully by low order polynomials.
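The least squares design described by equations (14) and (15) can be sketched as follows. This is an illustration under stated assumptions, not the filter design of any particular codec: the prototype filter is a made-up windowed sinc, the tap range -N..N is an assumed choice, the factor $i^k$ follows the reading of (15) that makes odd taps purely imaginary, and the normalization A is chosen here so that v(0) = 1/2 when T = 0.

```python
import numpy as np

I_POW = (1, 1j, -1, -1j)  # exact powers of i, indexed by k mod 4

def v_seq(p, L, T, k):
    """v(k) = A * i^k * sum_l p(l) p(l - T - L*k), with p zero outside its support."""
    P = len(p)
    s = T + L * k
    if abs(s) >= P:
        return 0.0
    r = np.dot(p[s:], p[:P - s]) if s >= 0 else np.dot(p[:P + s], p[-s:])
    A = 0.5 / np.dot(p, p)  # assumed normalisation: v(0) = 1/2 for T = 0
    return A * I_POW[k % 4] * r

def compensation_taps(p, L, T, N=1):
    """Least squares solution of 2 * sum_l v(2k - l) g(l) = delta[k] for taps g(-N..N)."""
    p = np.asarray(p, dtype=float)
    kmax = (len(p) + abs(T)) // L + N + 1  # beyond this range every row is zero
    ks = range(-kmax, kmax + 1)
    ls = range(-N, N + 1)
    M = np.array([[2 * v_seq(p, L, T, 2 * k - l) for l in ls] for k in ks],
                 dtype=complex)
    rhs = np.array([1.0 if k == 0 else 0.0 for k in ks], dtype=complex)
    g, *_ = np.linalg.lstsq(M, rhs, rcond=None)
    return np.array(ls), g, np.linalg.norm(M @ g - rhs)

# Hypothetical prototype: windowed sinc low-pass for an 8-band filterbank.
L_BANDS = 8
n = np.arange(10 * L_BANDS)
prototype = np.sinc((n - (len(n) - 1) / 2) / (2 * L_BANDS)) * np.hanning(len(n))
indices, taps, residual = compensation_taps(prototype, L_BANDS, T=3, N=1)
print(indices, np.round(taps, 4), residual)
```

With a real prototype, the computed even-indexed tap comes out real and the odd-indexed taps purely imaginary, in line with the symmetry $G_\tau(-\omega) = G_\tau(\omega+\pi)^*$. How small the residual becomes depends on the quality of the prototype filter.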
Signalling adaptive time grid for stereo parameters
Parametric stereo systems always lead to compromises in terms of limited time or frequency resolution in order to minimise conveyed data. It is however well known from psycho-acoustics that some spatial cues can be more important than others, which leads to the possibility to discard the less important cues. Hence, the time resolution does not have to be constant. Great gain in bitrate could be achieved by letting the time grid synchronise with the spatial cues. It can easily be done by sending a variable number of parameter sets for each data frame that corresponds to a time segment of fixed size. In order to synchronise the parameter sets with corresponding spatial cues, additional time grid data describing the location in time for each parameter set has to be sent. The resolution of those time pointers could be chosen to be quite low to keep the total amount of data minimised. A special case where a time cue for a parameter set coincides with the beginning of a frame could be signalled explicitly to avoid sending that time pointer.
Fig. 4 illustrates an inventive apparatus for performing parameter analysis for time segments having variable and signal dependent time borders. The inventive apparatus includes means 401 for dividing the input signal into one or several time segments. The time borders that separate the time segments are provided by means 402. Means 402 uses a detector specially designed for extracting spatial cues that are relevant for deciding where to set the time borders. Means 401 outputs the input signal divided into one or several time segments. This output is input to means 403 for separate parameter analysis for each time segment. Means 403 outputs one parameter set per time segment being analysed.
Fig. 5 illustrates an example of how the time grid generator can perform for a hypothetical input signal. In this example one parameter set per data frame is used if no other time border information is present. Hence, when no other time border information is present, the inherent time borders of the data frame are used. The time borders depicted in Fig. 5 are the output from means 402 in Fig. 4. The time segments depicted in Fig. 5 are provided by means 401 in Fig. 4.
The apparatus for encoding a stereo signal to obtain a mono output signal and the stereo parameter set includes the means for calculating the mono signal by combining a left and a right channel of the stereo signal by weighted addition. Additionally, means 403 for generating a first stereo parameter set using a portion of the left channel and a portion of the right channel, the portions starting at a first time border, is connected to the means for determining the validity of the first stereo parameter set for subsequent portions of the left channel and the right channel.
The means for determining is collectively formed by the means 402 and 401 in Fig. 4.
Particularly, the means for determining is operative to generate a second time border and to activate the means for generating, when it is determined that this first stereo parameter set is not valid anymore so that a second stereo parameter set for portions of the left and right channels starting at the second time border is generated.
Not shown in Fig. 4 are means for outputting the mono signal, the first stereo parameter set and the first time border associated with the first stereo parameter set and the second stereo parameter set and the second time border associated with the second stereo parameter set as the parametrically encoded stereo signal. The means for determining a validity of a stereo parameter set can include a transient detector, since the probability is high that, after a transient, a new stereo parameter set has to be generated, because the signal has changed its shape significantly. Alternatively, the means for determining a validity can include an analysis-by-synthesis device, which is adapted for decoding the mono signal and the stereo parameter set to obtain a decoded left and a decoded right channel, to compare the decoded left channel and the decoded right channel to the left channel and to the right channel, and to activate the means for generating, when the decoded left channel and the decoded right channel are different from the left channel and the right channel by more than a predetermined threshold.
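The analysis-by-synthesis variant can be sketched as a comparison between the original channels and their decoded counterparts. The distance measure used below (error energy relative to the energy of the original channels) is an assumption made for illustration; the text only requires that the deviation be compared against a predetermined threshold.

```python
import numpy as np

def parameter_set_still_valid(left, right, decoded_left, decoded_right, threshold):
    """True while the decoded channels stay within the threshold of the originals."""
    error = np.sum((left - decoded_left) ** 2) + np.sum((right - decoded_right) ** 2)
    reference = np.sum(left ** 2) + np.sum(right ** 2)
    if reference == 0.0:
        return bool(error == 0.0)
    # Hypothetical measure: relative error energy over the analysed portion.
    return bool(error / reference <= threshold)

left = np.array([1.0, 0.0, -1.0, 0.0])
right = np.array([0.5, 0.5, -0.5, -0.5])
print(parameter_set_still_valid(left, right, left, right, threshold=0.1))
print(parameter_set_still_valid(left, right, 0 * left, 0 * right, threshold=0.1))
```

When the function reports that the set is no longer valid, the encoder would place a new time border and activate the means for generating a new stereo parameter set.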
Data frame 1: The time segment corresponding to parameter set 1 starts at the beginning of data frame 1 since no other time border information is present in this data frame.
Data frame 2: Two time borders are present in this data frame. The time segment corresponding to parameter set 2 starts at the first time border in this data frame. The time segment corresponding to parameter set 3 starts at the second time border in this data frame.
Data frame 3: One time border is present in this data frame. The time segment corresponding to parameter set 4 starts at the time border in this data frame.
Data frame 4: One time border is present in this data frame. This time border coincides with the start border of the data frame 4 and does not have to be signalled since this is handled by the default case. Hence, this time border signal can be removed. The time segment corresponding to parameter set 5 starts at the beginning of data frame 4, even without signalling this time border.
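The four data frames above can be turned into a small sketch of which time pointers actually need to be transmitted. The border positions (offsets in time slots from the frame start) are made-up values, and the list-based frame description is an assumed representation rather than a bitstream format.

```python
def signalled_time_pointers(frame_borders):
    """For each frame, return (number of parameter sets, time pointers to transmit)."""
    result = []
    for borders in frame_borders:
        # A border at offset 0 coincides with the frame start and is the default case.
        pointers = [b for b in sorted(borders) if b != 0]
        starts_at_frame_begin = (not borders) or (0 in borders)
        num_sets = len(pointers) + (1 if starts_at_frame_begin else 0)
        result.append((num_sets, pointers))
    return result

# Hypothetical offsets mirroring data frames 1 to 4.
frames = [[], [10, 22], [5], [0]]
for number, (num_sets, pointers) in enumerate(signalled_time_pointers(frames), start=1):
    print("data frame", number, "parameter sets:", num_sets, "pointers:", pointers)
```

Data frames 1 and 4 both carry one parameter set and no time pointer: the first because no border information is present, the last because its only border coincides with the frame start.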
Using artificial reverberation as decorrelation method for parametric stereo reconstruction
One vital part of making the stereo synthesis in a parametric stereo system is to decrease the coherence between the left and right channel in order to create wideness of the stereo image. It can be done by adding a filtered version of the original mono signal to the side signal, where the mono and side signals are defined by: mono = (left + right) / 2, and side = (left - right) / 2, respectively.
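The mono and side definitions above, together with their inverse, can be written out directly. The function names in this short sketch are illustrative.

```python
import numpy as np

def to_mono_side(left, right):
    """mono = (left + right) / 2, side = (left - right) / 2."""
    return (left + right) / 2.0, (left - right) / 2.0

def to_left_right(mono, side):
    """Inverse mapping: left = mono + side, right = mono - side."""
    return mono + side, mono - side

left = np.array([1.0, 0.5, -0.25])
right = np.array([0.0, 0.5, 0.75])
mono, side = to_mono_side(left, right)
print(to_left_right(mono, side))
```

A side signal of zero yields identical left and right channels, so stereo width enters solely through the side signal, which is why the filtered (decorrelated) mono signal is added there.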
In order to not change the timbre too much, the filter in question should preferably be of all-pass character. One successful approach is to use all-pass filters similar to those used for artificial reverberation processes. Artificial reverberation algorithms usually require high time resolution to give an impulse response that is satisfactorily diffuse in time. There are great advantages in basing an artificial reverberation algorithm on a complex filter bank such as the complex QMF bank. The filterbank provides excellent possibilities to let the reverberation properties be frequency selective in terms of, for example, reverberation equalisation, decay time, density and timbre. However, filter bank implementations usually exchange time resolution for higher frequency resolution, which normally makes it hard to implement a reverberation process that is smooth enough in time. To deal with this problem a novel method would be to use a fractional delay approximation by only applying a phase delay, i.e., a factor for each QMF channel corresponding to a constant time delay. This primitive fractional delay method introduces severe time smearing that fortunately is very much desired in this case. The time smearing contributes to the time diffusion which is highly desirable for reverberation algorithms and gets bigger as the phase delay approaches pi/2 or -pi/2.
Artificial reverberation processes are by nature processes with an infinite impulse response, and offer natural exponential decays. In [PCT/SE02/01372] it is pointed out that if a reverberation unit is used for generating a stereo signal, the reverberation decay might sometimes be unwanted after the very end of a sound. These unwanted reverb-tails can however easily be attenuated or completely removed by just altering the gain of the reverb signal. A detector designed for finding sound endings can be used for that purpose. If the reverberation unit generates artefacts at some specific signals, e.g., transients, a detector for those signals can also be used for attenuating the same.
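The primitive phase-only fractional delay can be sketched as one complex factor per filterbank channel. The factor used below, exp(-iπ(n + 1/2)τ) for channel n and a delay of τ subband sampling periods, is borrowed from equation (10) with the compensation filter left out; the array layout and function names are assumptions made for illustration.

```python
import numpy as np

def phase_delay_factors(num_channels, tau):
    """One unit-modulus factor per channel: exp(-i * pi * (n + 1/2) * tau)."""
    n = np.arange(num_channels)
    return np.exp(-1j * np.pi * (n + 0.5) * tau)

def apply_phase_delay(subbands, tau):
    """subbands: complex array of shape (num_channels, num_slots)."""
    factors = phase_delay_factors(subbands.shape[0], tau)
    return subbands * factors[:, np.newaxis]

subbands = np.ones((64, 4), dtype=complex)
delayed = apply_phase_delay(subbands, tau=0.4)
print(np.round(delayed[:3, 0], 4))
```

Because each factor has unit modulus, the operation leaves subband magnitudes untouched and changes only the phase, which is what makes it cheap and, without the compensation filter, only an approximation of a true delay.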
Fig. 1 illustrates an inventive apparatus for the de-correlation method of signals as used in a parametric stereo system. The inventive apparatus includes means 101 for providing a plurality of subband signals. The providing means can be a complex QMF filterbank, where every signal is associated with a subband index.
The subband signals output by the means 101 of Fig. 1 are input into a means 102 for providing a de-correlated signal, and into means 103 and 106 for modifying the subband signal. The output from 102 is input into means 104 and 105 for modifying the signal, and the outputs of 103, 104, 105 and 106 are input into means 107 and 108 for adding the subband signals.
In the presently described embodiment of the invention, the means 103, 104, 105 and 106 for modifying the subband signals adjust the level of the de-correlated signal and of the unprocessed signal output by 101 by multiplying the subband signal with a gain factor, so that the sum of every pair results in a signal with the amount of de-correlated signal given by the control parameters. It should be noted that the gain factors used in the means for modifying, 103 - 106, are not limited to positive values. They can also be negative.
The output from the means 107 and 108 for adding subband signals is input to the means 109 and 110 for providing a time-domain signal. The output from 109 corresponds to the left channel of the re-constructed stereo signal, and the output from 110 corresponds to the right channel of the re-constructed stereo signal. In the embodiment described here, the same de-correlator is used for both output channels, while the means for adding the de-correlated signal to the un-processed signal are separate for the two output channels. The presently described embodiment thereby ensures that the two output signals can be identical as well as completely de-correlated, dependent on the control data provided to the means for adjusting the levels of the signals, and the control data provided to the means for adding the signals.
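How gain factors, including a negative one, can steer the output pair between identical and completely de-correlated may be sketched as follows. The gain rule (a rotation by half the arccosine of the target coherence) is one possible choice assumed here for illustration, not the mapping prescribed by the specification, and real-valued noise stands in for the subband signals and for the de-correlator output.

```python
import numpy as np

def mix_gains(coherence):
    """Gains (a, b) so that a*m + b*d and a*m - b*d have the target correlation."""
    theta = 0.5 * np.arccos(np.clip(coherence, -1.0, 1.0))
    return np.cos(theta), np.sin(theta)

def mix(mono, decorrelated, coherence):
    a, b = mix_gains(coherence)
    # The second output uses a negative gain on the de-correlated signal.
    return a * mono + b * decorrelated, a * mono - b * decorrelated

rng = np.random.default_rng(0)
mono = rng.standard_normal(200_000)
decorrelated = rng.standard_normal(200_000)  # stand-in for the reverberator output
left, right = mix(mono, decorrelated, coherence=0.3)
measured = np.dot(left, right) / np.sqrt(np.dot(left, left) * np.dot(right, right))
print(round(float(measured), 3))
```

Under the assumption that the mono and de-correlated signals are uncorrelated and of equal power, a coherence of 1 gives identical outputs and a coherence of 0 gives fully de-correlated outputs.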
In Fig. 2 a block diagram of the means for providing a de-correlated signal is displayed. The input subband signal is input to the means for filtering a subband signal 201. In the presently described embodiment of the present invention the filtering step is a reverberation unit incorporating all-pass filtering. The filter coefficients used are given by the means for providing filter coefficients 202. The subband index of the currently processed subband signal is input to 202. In one embodiment of the present invention different filter coefficients are calculated based on the subband index provided to 202. The filtering step in 201 relies on delayed samples of the input subband signal as well as delayed samples of intermediate signals in the filtering procedure.
It is an essential feature of the present invention that means for providing integer subband sample delay and fractional subband sample delay are provided by 203. The output of 201 is input to a means for adjusting the level of the subband signal 204, and also to a means for estimating signal characteristics of the subband signal 205. In a preferred embodiment of the present invention the characteristic estimated is the transient behaviour of the subband signal. In this embodiment a detected transient is signalled to the means for adjusting the level of a subband signal 204, so that the level of the signal is reduced during transient passages. The output from 204 is the de-correlated signal input to 104 and 105 of Fig. 1.
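The interplay of blocks 205 and 204 can be sketched with a simple energy-based transient detector that lowers the gain of the reverberated subband signal whenever a sudden energy rise is found. The detection rule and all parameter values (ratio, smoothing, attenuation) are hypothetical; the text does not specify how transients are detected.

```python
import numpy as np

def attenuate_transients(subband, ratio=4.0, smoothing=0.9, attenuation=0.25):
    """Reduce the level of samples whose energy jumps above a smoothed energy estimate."""
    energy = np.abs(subband) ** 2
    gains = np.ones(len(subband))
    smoothed = energy[0]
    for n in range(1, len(subband)):
        if energy[n] > ratio * smoothed:   # hypothetical transient criterion (block 205)
            gains[n] = attenuation         # level reduction (block 204)
        smoothed = smoothing * smoothed + (1.0 - smoothing) * energy[n]
    return gains * subband, gains

signal = np.ones(20, dtype=complex)
signal[10] = 10.0  # an isolated burst standing in for a transient
adjusted, gains = attenuate_transients(signal)
print(gains)
```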
In Fig. 3 the single analysis filterbank and the two synthesis filterbanks are shown. The analysis filterbank 301 operates on the mono input signal, while the synthesis filterbanks 302 and 303 operate on the re-constructed stereo signals.
Fig. 1, therefore, shows the inventive apparatus for generating a decorrelation signal which is indicated by reference 102. As it is shown in Fig. 1 or 3, this apparatus includes means for providing a plurality of subband signals, wherein a subband signal includes the sequence of at least two subband samples, the sequence of the subband samples representing a bandwidth of the subband signal which is smaller than a bandwidth of the input signal. Each subband signal is input into the means 201 for filtering. Each means 201 for filtering includes a reverberation filter so that a plurality of reverberated subband signals are obtained, wherein the plurality of reverberated subband signals together represent the decorrelation signal. Preferably, as it is shown in Fig. 2, there can be a subband-wise postprocessing of reverberated subband signals which is performed by block 204, which is controlled by block 205.
Each reverberation filter is set to a certain delay, and preferably a fractional delay, and each reverberation filter has several filter coefficients, which, as it is shown in Fig. 2, depend on the subband index. This means that it is preferred to use the same delay for each subband but to use different sets of filter coefficients for the different subbands. This is symbolized by means 203 and 202 in Fig. 2, although it is to be mentioned here that delays and filter coefficients are preferably fixedly determined when shipping a decorrelation device, wherein the delays and filter coefficients may be determined empirically using listening tests etc.
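A reverberation filter with a common delay and subband-dependent coefficients can be sketched with a single Schroeder all-pass section per subband. This is a simplified stand-in: it uses an integer subband delay only (the fractional part is omitted), a single section rather than a full reverberator, and a made-up rule for deriving the coefficient from the subband index.

```python
import numpy as np

def allpass(x, delay, coefficient):
    """Schroeder all-pass section: y[n] = -a*x[n] + x[n-d] + a*y[n-d]."""
    y = np.zeros(len(x), dtype=complex)
    for n in range(len(x)):
        x_delayed = x[n - delay] if n >= delay else 0.0
        y_delayed = y[n - delay] if n >= delay else 0.0
        y[n] = -coefficient * x[n] + x_delayed + coefficient * y_delayed
    return y

def coefficient_for_subband(index, num_subbands=64):
    # Hypothetical rule: shorter decay (smaller coefficient) for higher subbands.
    return 0.7 - 0.4 * index / (num_subbands - 1)

impulse = np.zeros(512, dtype=complex)
impulse[0] = 1.0
response = allpass(impulse, delay=3, coefficient=coefficient_for_subband(0))
print(np.sum(np.abs(response) ** 2))  # energy of the impulse response
```

The impulse response has unit energy, as expected of an all-pass filter, while its decaying tail provides the time diffusion. Every subband uses the same delay and differs only in its coefficient.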
A multi-channel decoder is shown by Fig. 1 and includes the inventive apparatus for generating the decorrelation signal, which is termed 102 in Fig. 1. The multi-channel decoder shown in Fig. 1 is for decoding a mono signal and an associated inter-channel coherence measure, the inter-channel coherence measure representing a coherence between a plurality of original channels, wherein the mono signal is derived from the plurality of original channels. Block 102 in Fig. 1 constitutes a generator for generating a decorrelation signal for the mono signal. Blocks 103, 104, 105, 106, 107 and 108 constitute a mixer for mixing the mono signal and the decorrelation signal in accordance with the first mixing mode to obtain a first decoded output signal and in accordance with the second mixing mode to obtain a second decoded output signal, wherein the mixer is operative to determine the first mixing mode and the second mixing mode based on the inter-channel coherence measure transmitted as a side information to the mono signal.
The mixer is preferably operative to mix in a subband domain based on separate inter-channel coherence measures for different subbands. In this case, the multi-channel decoder further comprises means 109 and 110 for converting the first and second decoded output signals from the subband domain into the time domain to obtain a first decoded output signal and a second decoded output signal in the time domain. Therefore, the inventive means 102 for generating a decorrelation signal and the inventive multi-channel decoder as shown in Fig. 1 operate in the subband domain and perform, as the very last step, a subband domain to time domain conversion.
Depending on the actual situation, the inventive device can be implemented in hardware or in software or in a firmware including hardware constituents and software constituents. When implemented in software partially or fully, the invention also is a computer program having a computer-readable code for carrying out the inventive methods when running on a computer.
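The mixing of the mono signal and the decorrelation signal under control of a coherence measure can be illustrated as follows. This is a minimal sketch and not the mixing rule of the specification: the function name `mix_with_coherence` and the rotation-angle rule `alpha = 0.5 * arccos(icc)` are assumptions. The rule is chosen so that, when the mono and decorrelation signals have equal power and are uncorrelated, the two outputs have a normalized correlation equal to `icc`.

```python
import numpy as np

def mix_with_coherence(mono, decorr, icc):
    """Mix mono and decorrelation signals into two outputs with target coherence.

    The two mixing modes differ only in the sign applied to the
    decorrelation signal (an assumed, illustrative choice).
    """
    mono = np.asarray(mono)
    decorr = np.asarray(decorr)
    icc = float(np.clip(icc, -1.0, 1.0))
    alpha = 0.5 * np.arccos(icc)
    c, s = np.cos(alpha), np.sin(alpha)
    first = c * mono + s * decorr    # first mixing mode
    second = c * mono - s * decorr   # second mixing mode
    return first, second
```

With `icc = 1` both outputs equal the mono signal; as `icc` decreases, a growing share of the decorrelation signal is added with opposite signs, which lowers the coherence between the outputs.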

Claims
1. Apparatus (102) for generating a decorrelation signal using an input signal, comprising:
means (101) for providing a plurality of subband signals, wherein a subband signal includes a sequence of at least two subband samples, the sequence of the subband samples representing a bandwidth of the subband signal, which is smaller than a bandwidth of the input signal; and
means (201) for filtering each subband signal using a reverberation filter to obtain a plurality of reverberated subband signals, wherein a plurality of reverberated subband signals together represent the decorrelation signal.
2. Apparatus in accordance with claim 1, in which the means (201) for filtering is operative to apply a delay to the subband signal.
3. Apparatus in accordance with claim 2, in which the means (201) for filtering is operative to apply a fractional delay to a subband signal, the fractional delay being greater than "0" and smaller than a sampling period of the subband signal.
4. Apparatus in accordance with claim 3, in which the fractional delay is smaller than 0.9 of the sampling period of the subband signal and greater than 0.1 of the sampling period of the subband signal.
5. Apparatus in accordance with one of the preceding claims, in which the means (201) for filtering is adapted to have an all-pass characteristic.
6. Apparatus in accordance with one of the preceding claims, in which the reverberation filter (201) is operative to apply the same delay to each subband signal.
7. Apparatus in accordance with one of the preceding claims, in which the reverberation filter (201) is adapted to have different sets of filter coefficients for each subband signal.
8. Apparatus in accordance with one of the preceding claims, in which the reverberation filter is operative to introduce predetermined phase delays into the subband signals.
9. Apparatus in accordance with one of the preceding claims, in which the number of subbands is smaller than or equal to 128 and larger than 1.
10. Apparatus in accordance with one of the preceding claims, in which the input signal includes a block of a predetermined number of input samples, and in which the number of subband signals is smaller than the number of input samples.
11. Apparatus in accordance with claim 10, in which the number of subband samples multiplied by the number of subbands results in the predetermined number of input samples.
12. Apparatus in accordance with one of the preceding claims, in which the means (101) for providing is a complex quadrature-mirror-filterbank.
13. Multi-channel decoder for decoding a mono signal and an associated inter-channel coherence measure, the inter-channel coherence measure representing a coherence between a plurality of original channels, the mono signal being derived from the plurality of original channels, comprising:
a generator (102) for generating a decorrelation signal from the mono signal in accordance with one of claims 1 to 12;
a mixer (103, 104, 105, 106, 107, 108) for mixing the mono signal and the decorrelation signal in accordance with a first mixing mode to obtain a first decoded output signal and in accordance with a second mixing mode to obtain a second decoded output signal, wherein the mixer is operative to determine the first mixing mode and the second mixing mode based on the inter-channel coherence measure.
14. Multi-channel decoder in accordance with claim 13, in which the mixer is operative to mix in a subband domain based on separate inter-channel coherence measures for different subbands, and
which further comprises means (109, 110) for converting the first and the second decoded output signals from the subband domain into the time domain to obtain time domain first and second decoded output signals.
15. Multi-channel decoder in accordance with claim 13 or 14, in which the plurality of original channels include a left stereo channel and a right stereo channel, and in which the first decoded output signal is a decoded left stereo channel, and in which the second decoded output signal is a decoded right stereo channel.
16. Multi-channel decoder in accordance with one of claims 13 to 15, in which the mixer includes means (103, 106) for modifying a subband of the mono signal or means (104, 105) for modifying the subband of the decorrelation signal.
17. Multi-channel decoder in accordance with claim 16, in which the means for modifying are implemented as signal level modifier devices.
18. Multi-channel decoder in accordance with claim 16 and 17, in which the mixer includes an adder (107) for adding an unmodified subband of the mono signal and a modified subband of the decorrelation signal or for adding a modified subband of the mono signal and an unmodified subband of the decorrelation signal or for adding a modified subband of the mono signal and a modified subband of the decorrelation signal to obtain a subband of a first decoded output channel or the second decoded output channel.
19. Multi-channel decoder, in which the generator (102) includes a filterbank for providing the plurality of subbands of the mono signal, the filterbank having a subband output connected to the mixer and the reverberation filter (201) for the subband.
20. Method of generating a decorrelation signal using an input signal, comprising:
providing (101) a plurality of subband signals, wherein a subband signal includes a sequence of at least two subband samples, the sequence of the subband samples representing a bandwidth of the subband signal, which is smaller than a bandwidth of the input signal; and
filtering (201) each subband signal using a reverberation filter to obtain a plurality of reverberated subband signals, wherein a plurality of reverberated subband signals together represent the decorrelation signal.
21. Method of multi-channel decoding for decoding a mono signal and an associated inter-channel coherence measure, the inter-channel coherence measure representing a coherence between a plurality of original channels, the mono signal being derived from the plurality of original channels, comprising:
generating (102) a decorrelation signal from the mono signal in accordance with the method of claim 20;
mixing (103, 104, 105, 106, 107, 108) the mono signal and the decorrelation signal in accordance with a first mixing mode to obtain a first decoded output signal and in accordance with a second mixing mode to obtain a second decoded output signal, wherein the mixer is operative to determine the first mixing mode and the second mixing mode based on the inter-channel coherence measure.
22. Apparatus for encoding a stereo signal to obtain a mono output signal and a stereo parameter set, comprising:
means for calculating the mono signal by combining a left and a right channel of the stereo signals;
means (403) for generating a first stereo parameter set using a portion of the left channel and a portion of the right channel, the portion starting at a first time border;
means (401, 402) for determining a validity of the first stereo parameter set for subsequent portions of the left channel and the right channel, wherein the means for determining is operative to:
generate a second time border, and
activate the means for generating, when it is determined that the stereo parameter set is not valid anymore so that a second stereo parameter set for portions of the left and right signals starting at the second time border is generated; and
means for outputting the mono signal and the first stereo parameter set and the first time border associated with the first parameter set, and the second stereo parameter set and the second time border associated with the second stereo parameter set.
23. Apparatus in accordance with claim 22, wherein the means for generating is operative to calculate, as the stereo parameter set, an inter-channel time difference parameter, an inter-channel level difference parameter, and/or an inter-channel coherence parameter.
24. Apparatus in accordance with claim 22 or 23, in which the means for determining includes a transient detector, which is arranged for activating the means for generating, when a transient is detected, and to generate a time instant of the transient as the second time border.
25. Apparatus in accordance with any of claims 22 to 24, in which the means for determining is an analysis-by-synthesis device, which is adapted for:
decoding the mono signal and the stereo parameter set to obtain a decoded left channel and a decoded right channel;
comparing the decoded left channel and the decoded right channel to the left channel and the right channel; and
activating the means for generating, when the decoded left channel and the decoded right channel are different from the left channel and the right channel by more than a predetermined threshold.
26. Method of encoding a stereo signal to obtain a mono output signal and a stereo parameter set, comprising:
calculating the mono signal by combining a left and a right channel of the stereo signals;
generating (403) a first stereo parameter set using a portion of the left channel and a portion of the right channel, the portion starting at a first time border;
determining (401, 402) a validity of the first stereo parameter set for subsequent portions of the left channel and the right channel, by
generating a second time border, and
conducting the step of generating, when it is determined that the stereo parameter set is not valid anymore so that a second stereo parameter set for portions of the left and right signals starting at the second time border is generated; and
outputting the mono signal and the first stereo parameter set and the first time border associated with the first parameter set, and the second stereo parameter set and the second time border associated with the second stereo parameter set.
27. Computer program having a computer-readable code for carrying out a method in accordance with claims 20, 21, 26, when running on a computer.
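The encoding scheme of claims 22 to 26, in which a new stereo parameter set is generated whenever the previous one is no longer valid, can be illustrated by the following minimal sketch. It is not part of the claims: the function names `find_time_borders` and `stereo_parameters`, the block-energy transient detector, and the parameters `block` and `ratio` are assumptions, and only a level difference and a coherence parameter are computed for each portion.

```python
import numpy as np

def find_time_borders(left, right, block=64, ratio=4.0):
    """Return sample indices at which a new stereo parameter set starts.

    Hypothetical transient detector: a border is placed where the energy of a
    block of the mono signal exceeds `ratio` times that of the previous block.
    """
    mono = 0.5 * (np.asarray(left, dtype=float) + np.asarray(right, dtype=float))
    borders = [0]  # the first time border
    prev = None
    for start in range(0, len(mono) - block + 1, block):
        energy = np.sum(mono[start:start + block] ** 2) + 1e-12
        if prev is not None and energy > ratio * prev:
            borders.append(start)
        prev = energy
    return borders

def stereo_parameters(left, right, borders):
    """Compute one parameter set per portion between consecutive time borders."""
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)
    edges = list(borders) + [len(left)]
    sets = []
    for b0, b1 in zip(edges[:-1], edges[1:]):
        l, r = left[b0:b1], right[b0:b1]
        el = np.sum(l ** 2) + 1e-12
        er = np.sum(r ** 2) + 1e-12
        sets.append({
            "border": b0,                              # time border of this set
            "ild_db": 10.0 * np.log10(el / er),        # inter-channel level difference
            "icc": np.sum(l * r) / np.sqrt(el * er),   # inter-channel coherence
        })
    return sets
```

The output pairs each parameter set with the time border at which it becomes valid, so that a decoder can apply the sets at the signalled positions rather than on a fixed frame grid.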
PCT/EP2004/004607 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods WO2004097794A2 (en)

Priority Applications (19)

Application Number Priority Date Filing Date Title
PL17173337T PL3247135T3 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank
EP17173337.1A EP3247135B1 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank
PL17173338T PL3244640T3 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank
JP2006505342A JP4527716B2 (en) 2003-04-30 2004-04-30 A novel processing and adaptive time signaling method based on complex exponential modulation filter bank
AT04730525T ATE444655T1 (en) 2003-04-30 2004-04-30 ADVANCED PROCESSING BASED ON A FILTER BANK MODULATED WITH COMPLEX EXPONENTIAL FUNCTION AND ADAPTIVE TIME SIGNALING METHOD
CN2004800114628A CN1781338B (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods
EP17173336.3A EP3244639B1 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods
EP17173334.8A EP3244638B1 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank
EP04730525A EP1616461B1 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods
EP17173333.0A EP3244637B1 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank
PL17173333T PL3244637T3 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank
PL17173334T PL3244638T3 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank
EP17173338.9A EP3244640B1 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank
PL17173336T PL3244639T3 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods
DE602004023381T DE602004023381D1 (en) 2003-04-30 2004-04-30 MPLEXER EXPONENTIAL FUNCTION MODULATED FILTER BANK AND ADAPTIVE TIME SIGNALING PROCESS
EP20193070.8A EP3823316B1 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods
US11/260,659 US7487097B2 (en) 2003-04-30 2005-10-26 Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods
HK06101638.0A HK1081715A1 (en) 2003-04-30 2006-02-08 Advanced processing based on a complex-exponential modulated filterbank and adaptive time signalling methods
US11/698,611 US7564978B2 (en) 2003-04-30 2007-01-26 Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
SE0301273A SE0301273D0 (en) 2003-04-30 2003-04-30 Advanced processing based on a complex exponential-modulated filter bank and adaptive time signaling methods
SE0301273-9 2003-04-30

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US11/260,659 Continuation US7487097B2 (en) 2003-04-30 2005-10-26 Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods

Publications (2)

Publication Number Publication Date
WO2004097794A2 true WO2004097794A2 (en) 2004-11-11
WO2004097794A3 WO2004097794A3 (en) 2005-09-09

Family

ID=20291180

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2004/004607 WO2004097794A2 (en) 2003-04-30 2004-04-30 Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods

Country Status (13)

Country Link
US (2) US7487097B2 (en)
EP (12) EP3244637B1 (en)
JP (2) JP4527716B2 (en)
KR (1) KR100717604B1 (en)
CN (3) CN101819777B (en)
AT (1) ATE444655T1 (en)
DE (1) DE602004023381D1 (en)
DK (9) DK2124485T3 (en)
ES (9) ES2790886T3 (en)
HK (8) HK1081715A1 (en)
PL (9) PL3244637T3 (en)
SE (1) SE0301273D0 (en)
WO (1) WO2004097794A2 (en)

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007031906A2 (en) 2005-09-13 2007-03-22 Koninklijke Philips Electronics N.V. A method of and a device for generating 3d sound
WO2007085275A1 (en) * 2006-01-27 2007-08-02 Coding Technologies Ab Efficient filtering with a complex modulated filterbank
WO2007102674A1 (en) 2006-03-06 2007-09-13 Samsung Electronics Co., Ltd. Method, medium, and system synthesizing a stereo signal
WO2007110101A1 (en) * 2006-03-28 2007-10-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Enhanced method for signal shaping in multi-channel audio reconstruction
JP2009506376A (en) * 2005-08-30 2009-02-12 エルジー エレクトロニクス インコーポレイティド Apparatus and method for encoding and decoding audio signals
JP2009522894A (en) * 2006-01-09 2009-06-11 ノキア コーポレイション Decoding binaural audio signals
EP2291008A1 (en) * 2006-05-04 2011-03-02 LG Electronics Inc. Enhancing audio with remixing capability
US8184817B2 (en) 2005-09-01 2012-05-22 Panasonic Corporation Multi-channel acoustic signal processing device
US8340963B2 (en) 2007-10-12 2012-12-25 Fujitsu Limited Echo suppressing system, echo suppressing method, recording medium, echo suppressor, sound output device, audio system, navigation system and mobile object
US8438015B2 (en) 2006-10-25 2013-05-07 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio subband values and apparatus and method for generating time-domain audio samples
JP2013137546A (en) * 2005-08-30 2013-07-11 Lg Electronics Inc Apparatus for encoding and decoding audio signal and method thereof
US8885854B2 (en) 2006-08-09 2014-11-11 Samsung Electronics Co., Ltd. Method, medium, and system decoding compressed multi-channel signals into 2-channel binaural signals
US9172342B2 (en) 2010-09-16 2015-10-27 Dolby International Ab Cross product enhanced subband block based harmonic transposition
US9418667B2 (en) 2006-10-12 2016-08-16 Lg Electronics Inc. Apparatus for processing a mix signal and method thereof
US9934789B2 (en) 2006-01-11 2018-04-03 Samsung Electronics Co., Ltd. Method, medium, and apparatus with scalable channel decoding
US11394350B2 (en) 2012-07-23 2022-07-19 Dali Systems Co. Ltd. Method and system for aligning signals widely spaced in frequency for wideband digital predistortion in wireless communication systems
USRE50009E1 (en) 2006-10-25 2024-06-11 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio subband values and apparatus and method for generating time-domain audio samples
USRE50015E1 (en) 2007-10-23 2024-06-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio subband values and apparatus and method for generating time-domain audio samples

Families Citing this family (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE0301273D0 (en) * 2003-04-30 2003-04-30 Coding Technologies Sweden Ab Advanced processing based on a complex exponential-modulated filter bank and adaptive time signaling methods
EP1672618B1 (en) * 2003-10-07 2010-12-15 Panasonic Corporation Method for deciding time boundary for encoding spectrum envelope and frequency resolution
WO2006033058A1 (en) * 2004-09-23 2006-03-30 Koninklijke Philips Electronics N.V. A system and a method of processing audio data, a program element and a computer-readable medium
WO2007106553A1 (en) * 2006-03-15 2007-09-20 Dolby Laboratories Licensing Corporation Binaural rendering using subband filters
JP5103880B2 (en) * 2006-11-24 2012-12-19 富士通株式会社 Decoding device and decoding method
DE102007018032B4 (en) * 2007-04-17 2010-11-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Generation of decorrelated signals
JPWO2008132850A1 (en) * 2007-04-25 2010-07-22 パナソニック株式会社 Stereo speech coding apparatus, stereo speech decoding apparatus, and methods thereof
CN101790756B (en) * 2007-08-27 2012-09-05 爱立信电话股份有限公司 Transient detector and method for supporting encoding of an audio signal
BR122012006269A2 (en) 2008-03-10 2019-07-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. EQUIPMENT AND METHOD FOR HANDLING AN AUDIO SIGN HAVING A TRANSIENT EVENT
US8831936B2 (en) * 2008-05-29 2014-09-09 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement
US8538749B2 (en) * 2008-07-18 2013-09-17 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for enhanced intelligibility
CN102257562B (en) 2008-12-19 2013-09-11 杜比国际公司 Method and apparatus for applying reverb to a multi-channel audio signal using spatial cue parameters
CA3152894C (en) 2009-03-17 2023-09-26 Dolby International Ab Advanced stereo coding based on a combination of adaptively selectable left/right or mid/side stereo coding and of parametric stereo coding
CN101533641B (en) 2009-04-20 2011-07-20 华为技术有限公司 Method for correcting channel delay parameters of multichannel signals and device
US9202456B2 (en) * 2009-04-23 2015-12-01 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for automatic control of active noise cancellation
US11657788B2 (en) 2009-05-27 2023-05-23 Dolby International Ab Efficient combined harmonic transposition
TWI675367B (en) 2009-05-27 2019-10-21 瑞典商杜比國際公司 Systems and methods for generating a high frequency component of a signal from a low frequency component of the signal, a set-top box, a computer program product and storage medium thereof
US9105300B2 (en) 2009-10-19 2015-08-11 Dolby International Ab Metadata time marking information for indicating a section of an audio object
US8718290B2 (en) 2010-01-26 2014-05-06 Audience, Inc. Adaptive noise reduction using level cues
JP5299327B2 (en) 2010-03-17 2013-09-25 ソニー株式会社 Audio processing apparatus, audio processing method, and program
US9378754B1 (en) 2010-04-28 2016-06-28 Knowles Electronics, Llc Adaptive spatial classifier for multi-microphone systems
US9053697B2 (en) 2010-06-01 2015-06-09 Qualcomm Incorporated Systems, methods, devices, apparatus, and computer program products for audio equalization
WO2012009851A1 (en) * 2010-07-20 2012-01-26 Huawei Technologies Co., Ltd. Audio signal synthesizer
MY156027A (en) 2010-08-12 2015-12-31 Fraunhofer Ges Forschung Resampling output signals of qmf based audio codecs
JP5581449B2 (en) * 2010-08-24 2014-08-27 ドルビー・インターナショナル・アーベー Concealment of intermittent mono reception of FM stereo radio receiver
JP6049762B2 (en) 2012-02-24 2016-12-21 ドルビー・インターナショナル・アーベー Audio processing
TWI618050B (en) 2013-02-14 2018-03-11 杜比實驗室特許公司 Method and apparatus for signal decorrelation in an audio processing system
TWI618051B (en) 2013-02-14 2018-03-11 杜比實驗室特許公司 Audio signal processing method and apparatus for audio signal enhancement using estimated spatial parameters
WO2014126688A1 (en) 2013-02-14 2014-08-21 Dolby Laboratories Licensing Corporation Methods for audio signal transient detection and decorrelation control
BR112015018522B1 (en) 2013-02-14 2021-12-14 Dolby Laboratories Licensing Corporation METHOD, DEVICE AND NON-TRANSITORY MEDIA WHICH HAS A METHOD STORED IN IT TO CONTROL COHERENCE BETWEEN AUDIO SIGNAL CHANNELS WITH UPMIX.
EP2830053A1 (en) 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a residual-signal-based adjustment of a contribution of a decorrelated signal
EP3044783B1 (en) 2013-09-12 2017-07-19 Dolby International AB Audio coding
EP3561809B1 (en) 2013-09-12 2023-11-22 Dolby International AB Method for decoding and decoder.
KR101815082B1 (en) * 2013-09-17 2018-01-04 주식회사 윌러스표준기술연구소 Method and apparatus for processing multimedia signals
WO2015060654A1 (en) 2013-10-22 2015-04-30 한국전자통신연구원 Method for generating filter for audio signal and parameterizing device therefor
US9928728B2 (en) * 2014-05-09 2018-03-27 Sony Interactive Entertainment Inc. Scheme for embedding a control signal in an audio signal using pseudo white noise
US10043527B1 (en) * 2015-07-17 2018-08-07 Digimarc Corporation Human auditory system modeling with masking energy adaptation
JP6763194B2 (en) * 2016-05-10 2020-09-30 株式会社Jvcケンウッド Encoding device, decoding device, communication system
KR102201308B1 (en) 2016-11-23 2021-01-11 텔레호낙티에볼라게트 엘엠 에릭슨(피유비엘) Method and apparatus for adaptive control of decorrelation filters
EP3358798A1 (en) * 2017-02-06 2018-08-08 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Receiver, transmitter, communication system for subband communication and methods for subband communication
GB2572650A (en) 2018-04-06 2019-10-09 Nokia Technologies Oy Spatial audio parameters and associated spatial audio playback
GB2574239A (en) 2018-05-31 2019-12-04 Nokia Technologies Oy Signalling of spatial audio parameters
CN110740416B (en) * 2019-09-27 2021-04-06 广州励丰文化科技股份有限公司 Audio signal processing method and device
CN110740404B (en) * 2019-09-27 2020-12-25 广州励丰文化科技股份有限公司 Audio correlation processing method and audio processing device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1991020167A1 (en) * 1990-06-15 1991-12-26 Northwestern University Method and apparatus for creating de-correlated audio output signals and audio recordings made thereby
EP0843503A2 (en) * 1996-11-13 1998-05-20 SANYO ELECTRIC Co., Ltd. Circuit for obtaining a surround sound effect
US6104996A (en) * 1996-10-01 2000-08-15 Nokia Mobile Phones Limited Audio coding with low-order adaptive prediction of transients
GB2353926A (en) * 1999-09-04 2001-03-07 Central Research Lab Ltd Generating a second audio signal from a first audio signal for the reproduction of 3D sound
WO2003007656A1 (en) * 2001-07-10 2003-01-23 Coding Technologies Ab Efficient and scalable parametric stereo coding for low bitrate applications

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6424700A (en) * 1987-07-21 1989-01-26 Nippon Chemicon Piezoelectric acoustic transducer
SG49883A1 (en) * 1991-01-08 1998-06-15 Dolby Lab Licensing Corp Encoder/decoder for multidimensional sound fields
JP2727883B2 (en) * 1992-08-20 1998-03-18 ヤマハ株式会社 Music synthesizer
JP3250577B2 (en) * 1992-12-15 2002-01-28 ソニー株式会社 Adaptive signal processor
SG48273A1 (en) * 1993-11-26 1998-04-17 Philips Electronics Nv A transmission system and a transmitter and a receiver for use in such a system
FR2715527B1 (en) * 1994-01-21 1996-02-23 Thomson Csf Method and device for analysis and synthesis in adaptive sub-bands.
US5956674A (en) * 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
US5848164A (en) * 1996-04-30 1998-12-08 The Board Of Trustees Of The Leland Stanford Junior University System and method for effects processing on audio subband data
DE19632734A1 (en) * 1996-08-14 1998-02-19 Thomson Brandt Gmbh Method and device for generating a multi-tone signal from a mono signal
FR2761550B1 (en) * 1997-03-28 1999-06-25 France Telecom DIGITAL FILTER FOR FRACTIONAL DELAYS
WO1998046045A1 (en) * 1997-04-10 1998-10-15 Sony Corporation Encoding method and device, decoding method and device, and recording medium
JPH1132399A (en) * 1997-05-13 1999-02-02 Sony Corp Coding method and system and recording medium
US5890125A (en) * 1997-07-16 1999-03-30 Dolby Laboratories Licensing Corporation Method and apparatus for encoding and decoding multiple audio channels at low bit rates using adaptive selection of encoding method
US6501860B1 (en) * 1998-01-19 2002-12-31 Canon Kabushiki Kaisha Digital signal coding and decoding based on subbands
DE19900819A1 (en) * 1999-01-12 2000-07-13 Bosch Gmbh Robert Prodder for decoding multi-channel distorted radio signals by extracting spatial information from the data signal and recombining this with mono signal data
FR2793629B1 (en) * 1999-05-12 2001-08-03 Matra Nortel Communications METHOD AND DEVICE FOR CANCELING STEREOPHONIC ECHO WITH FILTERING IN THE FREQUENTIAL DOMAIN
US6978236B1 (en) * 1999-10-01 2005-12-20 Coding Technologies Ab Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching
JP3951690B2 (en) * 2000-12-14 2007-08-01 ソニー株式会社 Encoding apparatus and method, and recording medium
JP4137401B2 (en) * 2001-04-16 2008-08-20 ティーオーエー株式会社 Active noise eliminator
US7006636B2 (en) 2002-05-24 2006-02-28 Agere Systems Inc. Coherence-based audio coding and synthesis
CA2354808A1 (en) * 2001-08-07 2003-02-07 King Tam Sub-band adaptive signal processing in an oversampled filterbank
CA2354858A1 (en) * 2001-08-08 2003-02-08 Dspfactory Ltd. Subband directional audio signal processing using an oversampled filterbank
KR100923297B1 (en) * 2002-12-14 2009-10-23 삼성전자주식회사 Method for encoding stereo audio, apparatus thereof, method for decoding audio stream and apparatus thereof
JP4597967B2 (en) * 2003-04-17 2010-12-15 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Audio signal generation
SE0301273D0 (en) * 2003-04-30 2003-04-30 Coding Technologies Sweden Ab Advanced processing based on a complex exponential-modulated filter bank and adaptive time signaling methods
US7391870B2 (en) 2004-07-09 2008-06-24 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E V Apparatus and method for generating a multi-channel output signal

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1991020167A1 (en) * 1990-06-15 1991-12-26 Northwestern University Method and apparatus for creating de-correlated audio output signals and audio recordings made thereby
US6104996A (en) * 1996-10-01 2000-08-15 Nokia Mobile Phones Limited Audio coding with low-order adaptive prediction of transients
EP0843503A2 (en) * 1996-11-13 1998-05-20 SANYO ELECTRIC Co., Ltd. Circuit for obtaining a surround sound effect
GB2353926A (en) * 1999-09-04 2001-03-07 Central Research Lab Ltd Generating a second audio signal from a first audio signal for the reproduction of 3D sound
WO2003007656A1 (en) * 2001-07-10 2003-01-23 Coding Technologies Ab Efficient and scalable parametric stereo coding for low bitrate applications

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
SCHUIJERS E ET AL: "ADVANCES IN PARAMETRIC CODING FOR HIGH-QUALITY AUDIO" PREPRINTS OF PAPERS PRESENTED AT THE AES CONVENTION, XX, XX, 22 March 2003 (2003-03-22), pages 1-11, XP008021606 *

Cited By (70)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009506377A (en) * 2005-08-30 2009-02-12 エルジー エレクトロニクス インコーポレイティド Apparatus and method for encoding and decoding audio signals
JP2013137546A (en) * 2005-08-30 2013-07-11 Lg Electronics Inc Apparatus for encoding and decoding audio signal and method thereof
JP2009506374A (en) * 2005-08-30 2009-02-12 エルジー エレクトロニクス インコーポレイティド Apparatus and method for encoding and decoding audio signals
JP2009506375A (en) * 2005-08-30 2009-02-12 エルジー エレクトロニクス インコーポレイティド Apparatus and method for encoding and decoding audio signals
JP2009506371A (en) * 2005-08-30 2009-02-12 エルジー エレクトロニクス インコーポレイティド Apparatus and method for encoding and decoding audio signals
JP2009506376A (en) * 2005-08-30 2009-02-12 エルジー エレクトロニクス インコーポレイティド Apparatus and method for encoding and decoding audio signals
JP2009506373A (en) * 2005-08-30 2009-02-12 エルジー エレクトロニクス インコーポレイティド Apparatus and method for encoding and decoding audio signals
KR101277041B1 (en) * 2005-09-01 2013-06-24 파나소닉 주식회사 Multi-channel acoustic signal processing device and method
US8184817B2 (en) 2005-09-01 2012-05-22 Panasonic Corporation Multi-channel acoustic signal processing device
JP5053849B2 (en) * 2005-09-01 2012-10-24 パナソニック株式会社 Multi-channel acoustic signal processing apparatus and multi-channel acoustic signal processing method
WO2007031906A3 (en) * 2005-09-13 2007-09-13 Koninkl Philips Electronics Nv A method of and a device for generating 3d sound
WO2007031906A2 (en) 2005-09-13 2007-03-22 Koninklijke Philips Electronics N.V. A method of and a device for generating 3d sound
US8515082B2 (en) 2005-09-13 2013-08-20 Koninklijke Philips N.V. Method of and a device for generating 3D sound
JP2009522894A (en) * 2006-01-09 2009-06-11 ノキア コーポレイション Decoding binaural audio signals
US9934789B2 (en) 2006-01-11 2018-04-03 Samsung Electronics Co., Ltd. Method, medium, and apparatus with scalable channel decoding
NO342163B1 (en) * 2006-01-27 2018-04-09 Dolby Int Ab Effective filtration with a complex, modulated filter bank
JP2011103662A (en) * 2006-01-27 2011-05-26 Dolby Internatl Ab Efficient filtering using complex modulated filter bank
EP3334043A1 (en) * 2006-01-27 2018-06-13 Dolby International AB Efficient filtering with a complex modulated filterbank
KR100959971B1 (en) 2006-01-27 2010-05-27 돌비 스웨덴 에이비 Efficient filtering with a complex modulated filterbank
NO20161716A1 (en) * 2006-01-27 2008-08-26 Dolby Int Ab Effective filtration with a complex, modulated filter bank
AU2006336954B2 (en) * 2006-01-27 2011-02-03 Dolby International Ab Efficient filtering with a complex modulated filterbank
NO20180322A1 (en) * 2006-01-27 2008-08-26 Dolby Int Ab Effective filtration with a complex, modulated filter bank
EP2306644A1 (en) * 2006-01-27 2011-04-06 Dolby International AB Efficient filtering with a complex modulated filterbank
JP2011103663A (en) * 2006-01-27 2011-05-26 Dolby Internatl Ab Efficient filtering using complex modulated filter bank
JP2009524956A (en) * 2006-01-27 2009-07-02 ドルビー スウェーデン アクチボラゲット Efficient filtering using complex modulation filter banks
EP2337223A3 (en) * 2006-01-27 2012-01-25 Dolby International AB Efficient filtering with a complex modulated filterbank
NO343578B1 (en) * 2006-01-27 2019-04-08 Dolby Int Ab Effective filtration with a complex, modulated filter bank
NO344514B1 (en) * 2006-01-27 2020-01-20 Dolby Int Ab Efficient filtration with a complex modulated filter bank
CN101401305B (en) * 2006-01-27 2012-05-23 杜比国际公司 Filter with a complex modulated filterbank
WO2007085275A1 (en) * 2006-01-27 2007-08-02 Coding Technologies Ab Efficient filtering with a complex modulated filterbank
CN101882441B (en) * 2006-01-27 2013-02-27 杜比国际公司 Efficient filtering with a complex modulated filterbank
US8315859B2 (en) 2006-01-27 2012-11-20 Dolby International Ab Efficient filtering with a complex modulated filterbank
EP1991984A4 (en) * 2006-03-06 2010-03-10 Samsung Electronics Co Ltd Method, medium, and system synthesizing a stereo signal
US8620011B2 (en) 2006-03-06 2013-12-31 Samsung Electronics Co., Ltd. Method, medium, and system synthesizing a stereo signal
WO2007102674A1 (en) 2006-03-06 2007-09-13 Samsung Electronics Co., Ltd. Method, medium, and system synthesizing a stereo signal
EP1991984A1 (en) * 2006-03-06 2008-11-19 Samsung Electronics Co., Ltd. Method, medium, and system synthesizing a stereo signal
EP2495722A1 (en) * 2006-03-06 2012-09-05 Samsung Electronics Co., Ltd. Method, medium, and system synthesizing a stereo signal
EP2495723A1 (en) * 2006-03-06 2012-09-05 Samsung Electronics Co., Ltd. Method, medium, and system synthesizing a stereo signal
US9479871B2 (en) 2006-03-06 2016-10-25 Samsung Electronics Co., Ltd. Method, medium, and system synthesizing a stereo signal
JP2009531724A (en) * 2006-03-28 2009-09-03 フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン An improved method for signal shaping in multi-channel audio reconstruction
NO339914B1 (en) * 2006-03-28 2017-02-13 Fraunhofer Ges Forschung Procedure for Signal Formation in Multichannel Audio Recovery
WO2007110101A1 (en) * 2006-03-28 2007-10-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Enhanced method for signal shaping in multi-channel audio reconstruction
US8116459B2 (en) 2006-03-28 2012-02-14 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Enhanced method for signal shaping in multi-channel audio reconstruction
CN101406073B (en) * 2006-03-28 2013-01-09 弗劳恩霍夫应用研究促进协会 Enhanced method for signal shaping in multi-channel audio reconstruction
AU2006340728B2 (en) * 2006-03-28 2010-08-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Enhanced method for signal shaping in multi-channel audio reconstruction
EP2291008A1 (en) * 2006-05-04 2011-03-02 LG Electronics Inc. Enhancing audio with remixing capability
US8213641B2 (en) 2006-05-04 2012-07-03 Lg Electronics Inc. Enhancing audio with remix capability
US8885854B2 (en) 2006-08-09 2014-11-11 Samsung Electronics Co., Ltd. Method, medium, and system decoding compressed multi-channel signals into 2-channel binaural signals
US9418667B2 (en) 2006-10-12 2016-08-16 Lg Electronics Inc. Apparatus for processing a mix signal and method thereof
US8452605B2 (en) 2006-10-25 2013-05-28 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio subband values and apparatus and method for generating time-domain audio samples
USRE50009E1 (en) 2006-10-25 2024-06-11 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio subband values and apparatus and method for generating time-domain audio samples
US8438015B2 (en) 2006-10-25 2013-05-07 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio subband values and apparatus and method for generating time-domain audio samples
USRE49999E1 (en) 2006-10-25 2024-06-04 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio subband values and apparatus and method for generating time-domain audio samples
US8775193B2 (en) 2006-10-25 2014-07-08 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio subband values and apparatus and method for generating time-domain audio samples
US8340963B2 (en) 2007-10-12 2012-12-25 Fujitsu Limited Echo suppressing system, echo suppressing method, recording medium, echo suppressor, sound output device, audio system, navigation system and mobile object
USRE50015E1 (en) 2007-10-23 2024-06-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio subband values and apparatus and method for generating time-domain audio samples
US10192562B2 (en) 2010-09-16 2019-01-29 Dolby International Ab Cross product enhanced subband block based harmonic transposition
US11817110B2 (en) 2010-09-16 2023-11-14 Dolby International Ab Cross product enhanced subband block based harmonic transposition
EP3503100A1 (en) 2010-09-16 2019-06-26 Dolby International AB Cross product enhanced subband block based harmonic transposition
US10706863B2 (en) 2010-09-16 2020-07-07 Dolby International Ab Cross product enhanced subband block based harmonic transposition
EP3975177A1 (en) 2010-09-16 2022-03-30 Dolby International AB Cross product enhanced subband block based harmonic transposition
EP3975178A1 (en) 2010-09-16 2022-03-30 Dolby International AB Cross product enhanced subband block based harmonic transposition
US11355133B2 (en) 2010-09-16 2022-06-07 Dolby International Ab Cross product enhanced subband block based harmonic transposition
US9172342B2 (en) 2010-09-16 2015-10-27 Dolby International Ab Cross product enhanced subband block based harmonic transposition
EP4145445A1 (en) 2010-09-16 2023-03-08 Dolby International AB Cross product enhanced subband block based harmonic transposition
EP4148732A1 (en) 2010-09-16 2023-03-15 Dolby International AB Cross product enhanced subband block based harmonic transposition
US10446161B2 (en) 2010-09-16 2019-10-15 Dolby International Ab Cross product enhanced subband block based harmonic transposition
US9940941B2 (en) 2010-09-16 2018-04-10 Dolby International Ab Cross product enhanced subband block based harmonic transposition
US9735750B2 (en) 2010-09-16 2017-08-15 Dolby International Ab Cross product enhanced subband block based harmonic transposition
US11394350B2 (en) 2012-07-23 2022-07-19 Dali Systems Co. Ltd. Method and system for aligning signals widely spaced in frequency for wideband digital predistortion in wireless communication systems

Also Published As

Publication number Publication date
EP3244640A1 (en) 2017-11-15
EP2124485A2 (en) 2009-11-25
DK3244639T3 (en) 2020-05-04
HK1245552A1 (en) 2018-08-24
CN101071569A (en) 2007-11-14
EP1768454B1 (en) 2013-06-12
HK1245554A1 (en) 2018-08-24
EP3247135B1 (en) 2020-09-02
ES2686088T3 (en) 2018-10-16
HK1081715A1 (en) 2006-05-19
EP3244640B1 (en) 2020-04-08
EP3244637B1 (en) 2020-04-08
EP3247135A1 (en) 2017-11-22
EP3244638A1 (en) 2017-11-15
DK2265041T3 (en) 2018-03-05
SE0301273D0 (en) 2003-04-30
ATE444655T1 (en) 2009-10-15
EP3244639B1 (en) 2020-04-08
WO2004097794A3 (en) 2005-09-09
ES2790860T3 (en) 2020-10-29
EP1616461B1 (en) 2009-09-30
ES2685508T3 (en) 2018-10-09
ES2749575T3 (en) 2020-03-23
EP2265040A3 (en) 2011-01-26
CN1781338A (en) 2006-05-31
US7487097B2 (en) 2009-02-03
DK3244640T3 (en) 2020-05-04
PL3247135T3 (en) 2020-11-30
ES2420764T3 (en) 2013-08-26
PL2265040T3 (en) 2019-01-31
DK3244637T3 (en) 2020-05-04
EP2265040A2 (en) 2010-12-22
EP3244637A1 (en) 2017-11-15
ES2789575T3 (en) 2020-10-26
EP2124485B1 (en) 2018-06-27
US20070121952A1 (en) 2007-05-31
EP2265041A3 (en) 2011-05-25
EP1768454A3 (en) 2010-09-01
EP2265042A2 (en) 2010-12-22
CN1781338B (en) 2010-04-21
CN101819777A (en) 2010-09-01
PL3244640T3 (en) 2020-07-13
PL3244638T3 (en) 2020-01-31
EP2124485A3 (en) 2009-12-02
EP3823316A1 (en) 2021-05-19
EP2265041B1 (en) 2017-12-13
HK1099882A1 (en) 2007-08-24
EP3244639A1 (en) 2017-11-15
JP2007219542A (en) 2007-08-30
HK1147591A1 (en) 2011-08-12
ES2822163T3 (en) 2021-04-29
KR100717604B1 (en) 2007-05-15
DK1768454T3 (en) 2013-09-08
EP2265040B1 (en) 2018-06-27
EP2265042A3 (en) 2011-03-16
DE602004023381D1 (en) 2009-11-12
PL2265041T3 (en) 2018-05-30
JP2006524832A (en) 2006-11-02
EP3823316B1 (en) 2023-03-29
CN101071569B (en) 2011-06-22
PL2124485T3 (en) 2019-01-31
DK2265040T3 (en) 2018-09-24
DK3244638T3 (en) 2019-10-14
HK1245556A1 (en) 2018-08-24
DK3247135T3 (en) 2020-09-28
US7564978B2 (en) 2009-07-21
EP2265042B1 (en) 2016-10-12
DK2124485T3 (en) 2018-10-01
EP3244638B1 (en) 2019-08-28
PL3244637T3 (en) 2020-08-24
KR20060020613A (en) 2006-03-06
HK1245555A1 (en) 2018-08-24
ES2790886T3 (en) 2020-10-29
EP1616461A2 (en) 2006-01-18
JP4602375B2 (en) 2010-12-22
EP1768454A2 (en) 2007-03-28
PL3244639T3 (en) 2020-06-29
PL1768454T3 (en) 2013-10-31
ES2662671T3 (en) 2018-04-09
US20060053018A1 (en) 2006-03-09
HK1245553A1 (en) 2018-08-24
CN101819777B (en) 2012-02-01
JP4527716B2 (en) 2010-08-18
EP2265041A2 (en) 2010-12-22

Similar Documents

Publication Publication Date Title
EP1616461B1 (en) Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods
RU2402872C2 (en) Efficient filtering with complex modulated filterbank
DK2337224T3 (en) Filter unit and method for generating subband filter pulse response
KR100717607B1 (en) Method and Device for stereo encoding and decoding

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

DPEN Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed from 20040101)
121 EP: the EPO has been informed by WIPO that EP was designated in this application
WWE WIPO information: entry into national phase

Ref document number: 2004730525

Country of ref document: EP

WWE WIPO information: entry into national phase

Ref document number: 11260659

Country of ref document: US

Ref document number: 1020057020368

Country of ref document: KR

WWE WIPO information: entry into national phase

Ref document number: 20048114628

Country of ref document: CN

Ref document number: 2006505342

Country of ref document: JP

WWE WIPO information: entry into national phase

Ref document number: 2162/KOLNP/2005

Country of ref document: IN

WWP WIPO information: published in national office

Ref document number: 2004730525

Country of ref document: EP

WWP WIPO information: published in national office

Ref document number: 1020057020368

Country of ref document: KR

WWP WIPO information: published in national office

Ref document number: 11260659

Country of ref document: US