WO2003044775A1 - Perceptual noise substitution - Google Patents

Perceptual noise substitution Download PDF

Info

Publication number
WO2003044775A1
WO2003044775A1 PCT/IB2002/004601 IB0204601W WO03044775A1 WO 2003044775 A1 WO2003044775 A1 WO 2003044775A1 IB 0204601 W IB0204601 W IB 0204601W WO 03044775 A1 WO03044775 A1 WO 03044775A1
Authority
WO
WIPO (PCT)
Prior art keywords
noise
noise sources
parameters
sources
composition
Prior art date
Application number
PCT/IB2002/004601
Other languages
French (fr)
Inventor
Leon M. Van De Kerkhof
Arnoldus W. J. Oomen
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V. filed Critical Koninklijke Philips Electronics N.V.
Priority to BR0206611-4A priority Critical patent/BR0206611A/en
Priority to JP2003546331A priority patent/JP2005509926A/en
Priority to US10/495,942 priority patent/US20050004791A1/en
Priority to KR10-2004-7007816A priority patent/KR20040063155A/en
Priority to EP02779819A priority patent/EP1451809A1/en
Priority to AU2002343151A priority patent/AU2002343151A1/en
Publication of WO2003044775A1 publication Critical patent/WO2003044775A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012Comfort noise or silence coding
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/24Signal processing not specific to the method of recording or reproducing; Circuits therefor for reducing noise

Definitions

  • the invention relates to a method using synthetic noise sources in a multichannel audio coding system for encoding a set of audio signals wherein correlated noise components are present.
  • Such a straightforward substitution causes an unnatural hearing sensation in the case where multiple audio channels actually exhibit a degree of inter-correlation.
  • This unnatural perception is due to the fact that the human ear is able to identify a correlation between audio signals coming from different directions.
  • the correlation between signals determines the "stereo image", the spatial perception of sound sources. If the left and right signals in a two-channel loudspeaker setup are fully correlated, the human auditory system will perceive this as a single sound source positioned in between the speakers. If the signals are uncorrelated, two separate sound sources positioned at the left and right speakers will be perceived. Partly correlated signals will generally be perceived as a wide sound source in between the speakers. Negative correlation can even lead to perceived sound source positions outside the speakerbase. Therefore, if correlation of the sound in left and right speakers is lost, the intended stereo effect disappears and a listener perceives a less natural hearing sensation.
  • the method of the invention comprises the step of: determining, from the relation between said audio signals, a composition of noise sources, the composition being such that the noise sources in said composition are mutually uncorrelated, so that said composition of noise sources synthesizes said noise components in a relation-preserved way.
  • noise components present in an audio signal are composed from noise sources that synthesize perceptually relevant correlation- preserved noise components present in at least one frequency band of said audio signals. These synthesizing noise sources are mutually uncorrelated.
  • the inventive method further comprises the steps of encoding the noise sources, by determining for each noise source a set of noise parameters for synthesizing said source and a set of transformation parameters for generating said composition of noise sources. Furthermore, a preferred embodiment of the invention comprises the step of transmitting said sets of noise parameters for synthesizing each noise source and transmitting said set of transformation parameters for forming said plurality of noise sources. More specifically, said noise parameters and said transformation parameters are determined by orthogonalizing the correlation matrix of said set of audio channels. This orthogonalisation may be, for a time-varying intercorrelation between audio channels, performed on a frame- by-frame basis. The size of a frame may depend on the time frame through which the inter- channel correlations can be considered to be constant.
  • the invention is preferably applicable in a case wherein the set of audio signals is divided into a selected set of frequency bands, at least one of the frequency bands comprising noise-like signals.
  • Non-noisy components present in said audio signals may be encoded by sinusoidal coding.
  • the invention also relates to a coding method using synthetic noise sources in a multi-channel audio coding system for encoding a set of audio channels, the method comprising the steps of: receiving sets of noise parameters for synthesizing noise sources and receiving a set of transformation parameters determined according to the inventive method; generating, in response to said noise parameters, a set of synthesized noise sources; and generating a set of audio signals by forming each audio signal as a plurality of noise sources according to said transformation parameters.
  • an audio encoder comprising: means for detecting, in at least one frequency band of said audio signals, an auto-correlation and a cross-correlation between each one of a set of audio signals; and processing means for determining, from the relation between said audio signals, a composition of noise sources, the composition being such that the noise sources are mutually uncorrelated, so that said composition of noise sources synthesizes said noise components in a relation-preserved way.
  • the encoder may further comprise means for encoding said noise sources as sets of noise parameters for synthesizing each of said sources, transmitting means for transmitting the sets of noise parameters and for transmitting said set of transformation parameters for forming said plurality of noise sources.
  • the invention relates to an audio decoder comprising: receiving means for receiving sets of noise parameters for synthesizing noise sources and for receiving a set of transformation parameters for forming a plurality of said noise sources, a set of noise generators for generating noise sources, in response to the noise parameters; and synthesizing means for synthesizing audio signals with perceptually relevant correlation-preserved noise components by forming, in response to the set of transformation parameters, for each audio signal a plurality of said set of noise sources.
  • the encoder and decoder may be physically distinct signal processing apparatus or may be present as one or several units in a single signal processing apparatus.
  • the transmission may be a wireless transmission, or a transmission through the Internet, in fact, any kind of transmission.
  • the transmission may also be done via a physical data carrier, such as a magnetic disk or a CD-rom etc.
  • the invention also relates to a data carrier, comprising a set of noise parameters for synthesizing noise sources and comprising a set of transformation parameters for forming a plurality of noise sources according to the above-described method.
  • Fig. 1 is a schematic illustration of an encoding apparatus implementing the coding method according to the invention.
  • Fig. 2 is a schematic illustration of a decoding apparatus implementing the coding method according to the invention.
  • Fig. 1 shows an encoder 1 for encoding a four-channel audio signal.
  • the audio channels are represented by four composite arrows 2, each arrow 2 representing one audio channel of four channels.
  • the audio channel 2 comprises an audio signal which in at least one frequency band comprises noise components.
  • an audio signal with audible frequency components is usually split up into several (usually logarithmically scaled) frequency bands, although the method according to the invention can also be performed directly on full bandwith audio signals. For each, or a specific number, of these frequency bands (especially in relevant frequency bands where the human ear is sensitive to correlated signals), the inventive method can be applied.
  • the multi-channel signal 2 is filtered in a filter stage 3.
  • the filter 3 splits up the audio signals into noisy parts 4 and in non-noisy parts 5.
  • Non-noisy parts 5 of the signal 2 are directed towards a sinusoidal coding circuit 6.
  • This circuit 6 generates compressed encoded data 7, which represents non-noisy audio information of said audio signals 2.
  • the noisy parts 4 are directed towards a circuit 8 encoding the noise in a correlation-preserved way according to the invention.
  • the relation between said audio signals is determined and a composition of noise sources is identified, the composition being such that the noise sources in said composition are mutually uncorrelated, so that said composition of noise sources synthesizes said noise components in a relation- preserved way.
  • the relation between said audio signals is determined by measuring the auto- correlation coefficients and cross-correlation coefficients of the audio channels 2.
  • This correlation information may be represented in a correlation matrix expressing the autocorrelation coefficients and intercorrelation coefficients.
  • the coefficient ⁇ S(i)S(i)> expresses the auto-correlation of a channel S(i);
  • the coefficient ⁇ S(i)S(j)> expresses the intercorrelation between channel S(i) and channel S(j); i and j being some integral numbers denoting a specific one channel of said multi-channel system.
  • a set of transformation parameters 9 is calculated from this correlation matrix.
  • the transformation parameters 9 are fed to a transmitter 10.
  • the transformation parameters 9 relate to relevant parameters for synthesizing the noise sources. These transformation parameters may comprise an auto-correlation of the sources, corresponding to the energy of each uncorrelated noise signal, and an intercorrelation, describing a specific relation between said noise sources. These parameters 9 are to be received by a decoder for performing the inverse transformation on a set of generated noise sources, further explained with reference to Fig. 2.
  • the transformation parameters 9 are then combined with the sinusoidal encoded non-noisy signals 7, and transmitted as an encoded signal 11 by transmitter 10.
  • the transmission may be a wireless transmission, or a transmission via the Internet, in fact, any kind of transmission.
  • the transmission may also be done via a physical data carrier, such as a magnetic disk or a CD-rom etc.
  • a decoder 12 for decoding a signal 11 into a set of audio signals 21.
  • the signal 11 comprises a set of transformation parameters for forming a plurality of noise sources according to the method of the invention.
  • a first splitting stage 13 the transformation parameters 9 and the encoded non-noisy signals 7 are extracted from the signal 11.
  • the non-noisy signals 7 are fed to a sinusoidal decoder 14, outputting non-noisy parts 51 of audio channels 21.
  • the transformation parameters 9 are fed to a noise source generating stage 15 comprising a set of independent (random) noise generators 16.
  • the transformation parameters 9 indicate a noise level of each noise generator 16 (including a possible zero level); additionally, other parameters like, for instance, an enveloping form may be specified for the noise sources.
  • the noise generator 16 generates a set of mutually uncorrelated noise sources that are formed, in response to the set of transformation parameters 9, for each audio signal 1 into a plurality of noise sources, thereby synthesizing perceptually relevant correlation-preserved noise components 41 for audio signals 21.
  • a composition stage 17 the correlation-preserved noise components 41 and the non-noisy parts 51 are combined and audio channels 21 are outputted, which are a perceptually relevant reconstruction of the audio channels 2 of Fig.1
  • non-noisy parts of the signal are encoded using a sinusoidal coding
  • other types of encoding may be applied, like waveform coding or Huffman coding.
  • the audio channels as a whole, including non- noisy parts may be transformed according to the above-mentioned transformation parameters.
  • other types of noise encoding may be applied, using different parameters, etc.
  • the method may be applied for a single relevant frequency band for an audio channel of a multi-channel audio system.
  • the method may also be applied in a selected number of channels of a multi-channel audio system.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereophonic System (AREA)

Abstract

A method using synthetic noise sources in a multi-channel audio coding system for encoding a set of audio signals wherein correlated noise components are present. The method comprises the step of determining, from the relation between said audio signals, a composition of noise sources, the composition being such that the noise sources in said composition are mutually uncorrelated, so that said composition of noise sources synthesizes said noise components in a relation-preserved way. The method may further comprise the step of encoding the noise sources, by determining for each noise source a set of noise parameters for synthesizing said source and a set of transformation parameters for generating said composition of noise sources.

Description

Perceptual noise substitution
The invention relates to a method using synthetic noise sources in a multichannel audio coding system for encoding a set of audio signals wherein correlated noise components are present.
By encoding only perceptually relevant quantities of noise sources, such as, for instance, the total acoustic energy of noise in a specific frequency range, perceptually irrelevant audio information may be discarded so that a considerable signal compression may be gained. International application WO99/04505 describes such a method. In this method, noise-like components of an input signal are detected on a frequency-band basis. The noiselike components are parametrised, and only the total power of the substituted spectral coefficients is transmitted. In a decoder, the encoded audio channels are reconstructed by inserting random noise sources with the desired power for the substituted spectral coefficients.
Such a straightforward substitution causes an unnatural hearing sensation in the case where multiple audio channels actually exhibit a degree of inter-correlation. This unnatural perception is due to the fact that the human ear is able to identify a correlation between audio signals coming from different directions. The correlation between signals determines the "stereo image", the spatial perception of sound sources. If the left and right signals in a two-channel loudspeaker setup are fully correlated, the human auditory system will perceive this as a single sound source positioned in between the speakers. If the signals are uncorrelated, two separate sound sources positioned at the left and right speakers will be perceived. Partly correlated signals will generally be perceived as a wide sound source in between the speakers. Negative correlation can even lead to perceived sound source positions outside the speakerbase. Therefore, if correlation of the sound in left and right speakers is lost, the intended stereo effect disappears and a listener perceives a less natural hearing sensation.
In other words, if a sound produced from multiple audio channels reflects a single audio source that was recorded via said channels, a reconstruction of said audio source with uncorrelated noise sources would appear to be unnatural. In the aforementioned application, it is attempted to compensate for the above- described effect by encoding a bit value, which, in an active state, triggers a synthesizer to use the same noise source for both left and right channel. In a normally inactive state, left and right channels are synthesized from independent noise sources. Although such a provision offers an improvement as compared to a synthesis of audio channels using inherently uncorrelated noise sources, synthesized sounds still lack naturalness because, in practice, information in the encoded audio channels, describing a degree of correlation between the channels is not used. Therefore, a reconstruction of the original sound is only partly possible when using the known method and the ear still perceives a less natural hearing sensation.
The invention aims to obviate the above-mentioned problem and to provide an improved audio coding, wherein a perceptually near original reconstruction of noisy components in multiple audio channels is possible, with a preserved degree of correlation between the channels. Accordingly, the method of the invention comprises the step of: determining, from the relation between said audio signals, a composition of noise sources, the composition being such that the noise sources in said composition are mutually uncorrelated, so that said composition of noise sources synthesizes said noise components in a relation-preserved way. According to the inventive method, noise components present in an audio signal are composed from noise sources that synthesize perceptually relevant correlation- preserved noise components present in at least one frequency band of said audio signals. These synthesizing noise sources are mutually uncorrelated. Therefore, these noise sources can be easily reconstructed by independent noise generators. Although the method can be applied to transmit noise sources that are not encoded, in a preferred embodiment, the inventive method further comprises the steps of encoding the noise sources, by determining for each noise source a set of noise parameters for synthesizing said source and a set of transformation parameters for generating said composition of noise sources. Furthermore, a preferred embodiment of the invention comprises the step of transmitting said sets of noise parameters for synthesizing each noise source and transmitting said set of transformation parameters for forming said plurality of noise sources. More specifically, said noise parameters and said transformation parameters are determined by orthogonalizing the correlation matrix of said set of audio channels. This orthogonalisation may be, for a time-varying intercorrelation between audio channels, performed on a frame- by-frame basis. The size of a frame may depend on the time frame through which the inter- channel correlations can be considered to be constant.
The invention is preferably applicable in a case wherein the set of audio signals is divided into a selected set of frequency bands, at least one of the frequency bands comprising noise-like signals. Non-noisy components present in said audio signals may be encoded by sinusoidal coding.
The invention also relates to a coding method using synthetic noise sources in a multi-channel audio coding system for encoding a set of audio channels, the method comprising the steps of: receiving sets of noise parameters for synthesizing noise sources and receiving a set of transformation parameters determined according to the inventive method; generating, in response to said noise parameters, a set of synthesized noise sources; and generating a set of audio signals by forming each audio signal as a plurality of noise sources according to said transformation parameters.
In this way, encoded and transmitted noisy audio signals may be decoded and a corresponding multi-channel correlation preserved audio signal may be synthesized. Furthermore, the invention relates to an audio encoder, comprising: means for detecting, in at least one frequency band of said audio signals, an auto-correlation and a cross-correlation between each one of a set of audio signals; and processing means for determining, from the relation between said audio signals, a composition of noise sources, the composition being such that the noise sources are mutually uncorrelated, so that said composition of noise sources synthesizes said noise components in a relation-preserved way.
The encoder may further comprise means for encoding said noise sources as sets of noise parameters for synthesizing each of said sources, transmitting means for transmitting the sets of noise parameters and for transmitting said set of transformation parameters for forming said plurality of noise sources. Likewise, the invention relates to an audio decoder comprising: receiving means for receiving sets of noise parameters for synthesizing noise sources and for receiving a set of transformation parameters for forming a plurality of said noise sources, a set of noise generators for generating noise sources, in response to the noise parameters; and synthesizing means for synthesizing audio signals with perceptually relevant correlation-preserved noise components by forming, in response to the set of transformation parameters, for each audio signal a plurality of said set of noise sources.
The encoder and decoder may be physically distinct signal processing apparatus or may be present as one or several units in a single signal processing apparatus. The transmission may be a wireless transmission, or a transmission through the Internet, in fact, any kind of transmission. The transmission may also be done via a physical data carrier, such as a magnetic disk or a CD-rom etc.
The invention also relates to a data carrier, comprising a set of noise parameters for synthesizing noise sources and comprising a set of transformation parameters for forming a plurality of noise sources according to the above-described method.
Further objects and features of the invention will become apparent from the drawings, wherein:
Fig. 1 is a schematic illustration of an encoding apparatus implementing the coding method according to the invention.
Fig. 2 is a schematic illustration of a decoding apparatus implementing the coding method according to the invention.
Fig. 1 shows an encoder 1 for encoding a four-channel audio signal. The audio channels are represented by four composite arrows 2, each arrow 2 representing one audio channel of four channels. For the invention, the actual number of channels is irrelevant, because obviously, the inventive method can be applied in any audio system as long as more than 1 channel is present. The audio channel 2 comprises an audio signal which in at least one frequency band comprises noise components. In actual embodiments, an audio signal with audible frequency components is usually split up into several (usually logarithmically scaled) frequency bands, although the method according to the invention can also be performed directly on full bandwith audio signals. For each, or a specific number, of these frequency bands (especially in relevant frequency bands where the human ear is sensitive to correlated signals), the inventive method can be applied.
The multi-channel signal 2 is filtered in a filter stage 3. The filter 3 splits up the audio signals into noisy parts 4 and in non-noisy parts 5. Non-noisy parts 5 of the signal 2 are directed towards a sinusoidal coding circuit 6. This circuit 6 generates compressed encoded data 7, which represents non-noisy audio information of said audio signals 2.
The noisy parts 4 are directed towards a circuit 8 encoding the noise in a correlation-preserved way according to the invention. In said circuit 8, the relation between said audio signals is determined and a composition of noise sources is identified, the composition being such that the noise sources in said composition are mutually uncorrelated, so that said composition of noise sources synthesizes said noise components in a relation- preserved way.
The relation between said audio signals is determined by measuring the auto- correlation coefficients and cross-correlation coefficients of the audio channels 2. This correlation information may be represented in a correlation matrix expressing the autocorrelation coefficients and intercorrelation coefficients. In this matrix, the coefficient <S(i)S(i)> expresses the auto-correlation of a channel S(i); the coefficient <S(i)S(j)> expresses the intercorrelation between channel S(i) and channel S(j); i and j being some integral numbers denoting a specific one channel of said multi-channel system.
A set of transformation parameters 9 is calculated from this correlation matrix. The transformation parameters 9 are fed to a transmitter 10. The transformation parameters 9 relate to relevant parameters for synthesizing the noise sources. These transformation parameters may comprise an auto-correlation of the sources, corresponding to the energy of each uncorrelated noise signal, and an intercorrelation, describing a specific relation between said noise sources. These parameters 9 are to be received by a decoder for performing the inverse transformation on a set of generated noise sources, further explained with reference to Fig. 2.
The transformation parameters 9 are then combined with the sinusoidal encoded non-noisy signals 7, and transmitted as an encoded signal 11 by transmitter 10. The transmission may be a wireless transmission, or a transmission via the Internet, in fact, any kind of transmission. The transmission may also be done via a physical data carrier, such as a magnetic disk or a CD-rom etc.
In Fig. 2, essentially, the reverse of the scheme of Fig. 1 is illustrated, in a decoder 12 for decoding a signal 11 into a set of audio signals 21. The signal 11 comprises a set of transformation parameters for forming a plurality of noise sources according to the method of the invention. In a first splitting stage 13, the transformation parameters 9 and the encoded non-noisy signals 7 are extracted from the signal 11. The non-noisy signals 7 are fed to a sinusoidal decoder 14, outputting non-noisy parts 51 of audio channels 21.
The transformation parameters 9 are fed to a noise source generating stage 15 comprising a set of independent (random) noise generators 16. The transformation parameters 9 indicate a noise level of each noise generator 16 (including a possible zero level); additionally, other parameters like, for instance, an enveloping form may be specified for the noise sources. The noise generator 16 generates a set of mutually uncorrelated noise sources that are formed, in response to the set of transformation parameters 9, for each audio signal 1 into a plurality of noise sources, thereby synthesizing perceptually relevant correlation-preserved noise components 41 for audio signals 21. In a composition stage 17, the correlation-preserved noise components 41 and the non-noisy parts 51 are combined and audio channels 21 are outputted, which are a perceptually relevant reconstruction of the audio channels 2 of Fig.1 It will be clear to those skilled in the art that the invention is not limited to the embodiments described with reference to the drawing but may comprise all kinds of variations. For instance, although in the embodiments described, non-noisy parts of the signal are encoded using a sinusoidal coding, other types of encoding may be applied, like waveform coding or Huffman coding. Also, the audio channels as a whole, including non- noisy parts, may be transformed according to the above-mentioned transformation parameters. Furthermore, other types of noise encoding may be applied, using different parameters, etc. The method may be applied for a single relevant frequency band for an audio channel of a multi-channel audio system. The method may also be applied in a selected number of channels of a multi-channel audio system. These and other variations are deemed to fall within the scope of protection of the appended claims.
Reference numbers:
1. encoder
2. composite arrows 3. filter stage
4. noisy parts
5. non-noisy parts
6. sinusoidal coding circuit
7. encoded data 8. noise encoding circuit
9. transformation parameters
10. transmitter
11. encoded signal
12. decoder 13. splitting stage
14. sinusoidal decoder
15. noise source generating stage
16. noise generators
17. composition stage

Claims

CLAIMS:
1. A method using synthetic noise sources in a multi-channel audio coding system for encoding a set of audio signals wherein correlated noise components are present, the method comprising the step of:
- determining, from the relation between said audio signals, a composition of noise sources, the composition being such that the noise sources in said composition are mutually uncorrelated, so that said composition of noise sources synthesizes said noise components in a relation-preserved way.
2. A method according to claim 1, further comprising the step of: - encoding the noise sources, by determining for each noise source a set of noise parameters for synthesizing said source and a set of transformation parameters for generating said composition of noise sources.
3. A method according to claim 1 or 2 further comprising the steps of: - transmitting said sets of noise parameters for synthesizing each noise source and transmitting said set of transformation parameters for forming said plurality of noise sources.
4. A method according to any one of the preceding claims, wherein mutually uncorrelated noise sources are determined on a frame-by-frame basis.
5. A method according to any one of the preceding claims, wherein non-noisy components present in said audio signals are encoded by sinusoidal coding.
6. A method according to any one of the preceding claims, wherein said transformation parameters are determined by orthogonalizing the correlation matrix of said set of audio channels.
7. A method according to any one of the preceding claims, wherein the set of audio signals is divided into a selected set of frequency bands, at least one of the frequency bands comprising noise-like signals.
8. A method using synthetic noise sources in a multi-channel audio coding system for encoding a set of audio channels, the method comprising the steps of:
- receiving sets of noise parameters for synthesizing noise sources and receiving a set of transformation parameters determined according to the method of claim 1;
- generating, in response to said noise parameters, a set of synthesized noise sources; and - generating a set of audio signals by forming each audio signal as a plurality of noise sources according to said transformation parameters.
9. An encoder, for encoding audio channels encoded according to the method of any one of claims 1 to 6, the encoder comprising: - means for detecting, in at least one frequency band of said audio signals, an autocorrelation and a cross-correlation between each one of a set of audio signals; and processing means for determining, from the relation between said audio signals, a composition of noise sources, the composition being such that the noise sources in said composition are mutually uncorrelated, so that said composition of noise sources synthesizes said noise components in a relation-preserved way.
10. An encoder of claim 8, further comprising:
- means for encoding said noise sources as sets of noise parameters for synthesizing each of said sources, - transmitting means for transmitting the sets of noise parameters and for transmitting said set of transformation parameters for forming said plurality of noise sources.
11. A decoder for receiving audio channels encoded and transformed according to any one of claims 1 to 6, the decoder comprising: - receiving means for receiving sets of noise parameters for synthesizing noise sources and for receiving a set of transformation parameters for forming a plurality of said noise sources,
- a set of noise generators for generating noise sources, in response to the noise parameters; and - synthesizing means for synthesizing audio signals with perceptually relevant correlation- preserved noise components by forming, in response to the set of transformation parameters, for each audio signal a plurality of said set of noise sources.
12. A data carrier comprising a set of noise parameters for synthesizing uncorrelated noise sources and comprising a set of transformation parameters for forming a plurality of noise sources according to the method of any one of claims 1 to 7.
PCT/IB2002/004601 2001-11-23 2002-11-04 Perceptual noise substitution WO2003044775A1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
BR0206611-4A BR0206611A (en) 2001-11-23 2002-11-04 Method using synthetic noise sources in a multi-channel audio coding system to encode a set of audio signals, encoder to encode encoded audio channels, decoder to receive encoded and transformed audio channels, and, data carrier
JP2003546331A JP2005509926A (en) 2001-11-23 2002-11-04 Replace perceptual noise
US10/495,942 US20050004791A1 (en) 2001-11-23 2002-11-04 Perceptual noise substitution
KR10-2004-7007816A KR20040063155A (en) 2001-11-23 2002-11-04 Perceptual noise substitution
EP02779819A EP1451809A1 (en) 2001-11-23 2002-11-04 Perceptual noise substitution
AU2002343151A AU2002343151A1 (en) 2001-11-23 2002-11-04 Perceptual noise substitution

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP01204533.2 2001-11-23
EP01204533 2001-11-23

Publications (1)

Publication Number Publication Date
WO2003044775A1 true WO2003044775A1 (en) 2003-05-30

Family

ID=8181297

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/IB2002/004601 WO2003044775A1 (en) 2001-11-23 2002-11-04 Perceptual noise substitution
PCT/IB2002/004869 WO2003044776A1 (en) 2001-11-23 2002-11-22 Audio coding

Family Applications After (1)

Application Number Title Priority Date Filing Date
PCT/IB2002/004869 WO2003044776A1 (en) 2001-11-23 2002-11-22 Audio coding

Country Status (10)

Country Link
US (2) US20050004791A1 (en)
EP (2) EP1451809A1 (en)
JP (2) JP2005509926A (en)
KR (2) KR20040063155A (en)
CN (2) CN1288624C (en)
AU (2) AU2002343151A1 (en)
BR (2) BR0206611A (en)
RU (1) RU2004118840A (en)
TW (1) TW200407843A (en)
WO (2) WO2003044775A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPWO2005112002A1 (en) * 2004-05-19 2008-03-27 松下電器産業株式会社 Audio signal encoding apparatus and audio signal decoding apparatus
US9257127B2 (en) 2006-12-27 2016-02-09 Electronics And Telecommunications Research Institute Apparatus and method for coding and decoding multi-object audio signal with various channel including information bitstream conversion
US9743185B2 (en) 2004-04-16 2017-08-22 Dolby International Ab Apparatus and method for generating a level parameter and apparatus and method for generating a multi-channel representation

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7240001B2 (en) * 2001-12-14 2007-07-03 Microsoft Corporation Quality improvement techniques in an audio encoder
US7190449B2 (en) * 2002-10-28 2007-03-13 Nanopoint, Inc. Cell tray
US7460990B2 (en) 2004-01-23 2008-12-02 Microsoft Corporation Efficient coding of digital media spectral data using wide-sense perceptual similarity
ATE527654T1 (en) * 2004-03-01 2011-10-15 Dolby Lab Licensing Corp MULTI-CHANNEL AUDIO CODING
CN101116136B (en) * 2005-02-10 2011-05-18 皇家飞利浦电子股份有限公司 Sound synthesis
WO2006085244A1 (en) * 2005-02-10 2006-08-17 Koninklijke Philips Electronics N.V. Sound synthesis
TWI458365B (en) * 2005-04-12 2014-10-21 Dolby Int Ab Apparatus and method for generating a level parameter, apparatus and method for generating a multi-channel representation and a storage media stored parameter representation
RU2376655C2 (en) * 2005-04-19 2009-12-20 Коудинг Текнолоджиз Аб Energy-dependant quantisation for efficient coding spatial parametres of sound
JP5108767B2 (en) 2005-08-30 2012-12-26 エルジー エレクトロニクス インコーポレイティド Apparatus and method for encoding and decoding audio signals
KR20070025905A (en) * 2005-08-30 2007-03-08 엘지전자 주식회사 Method of effective sampling frequency bitstream composition for multi-channel audio coding
US8046214B2 (en) * 2007-06-22 2011-10-25 Microsoft Corporation Low complexity decoder for complex transform coding of multi-channel sound
US7885819B2 (en) 2007-06-29 2011-02-08 Microsoft Corporation Bitstream syntax for multi-process audio decoding
US8249883B2 (en) * 2007-10-26 2012-08-21 Microsoft Corporation Channel extension coding for multi-channel source
CN101662688B (en) * 2008-08-13 2012-10-03 韩国电子通信研究院 Method and device for encoding and decoding audio signal
US10672408B2 (en) 2015-08-25 2020-06-02 Dolby Laboratories Licensing Corporation Audio decoder and decoding method
CN109215667B (en) 2017-06-29 2020-12-22 华为技术有限公司 Time delay estimation method and device
EP3913626A1 (en) * 2018-04-05 2021-11-24 Telefonaktiebolaget LM Ericsson (publ) Support for generation of comfort noise
CN110267160B (en) * 2019-05-31 2020-09-22 潍坊歌尔电子有限公司 Sound signal processing method, device and equipment

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19730129A1 (en) * 1997-07-14 1999-01-21 Fraunhofer Ges Forschung Method for signaling noise substitution when encoding an audio signal

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6298322B1 (en) * 1999-05-06 2001-10-02 Eric Lindemann Encoding and synthesis of tonal audio signals using dominant sinusoids and a vector-quantized residual tonal signal

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19730129A1 (en) * 1997-07-14 1999-01-21 Fraunhofer Ges Forschung Method for signaling noise substitution when encoding an audio signal

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
BASTIAAN KLEIJN W ET AL: "Identification and reconstruction of the unvoiced component in speech", CONFERENCE RECORD OF THE THIRTY-FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (CAT. NO.00CH37154), CONFERENCE RECORD OF THE THIRTY-FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, PACIFIC GROVE, CA, USA, 29 OCT.-1 NOV. 200, 2000, Piscataway, NJ, USA, IEEE, USA, pages 1459 - 1463 vol.2, XP010535241, ISBN: 0-7803-6514-3 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9743185B2 (en) 2004-04-16 2017-08-22 Dolby International Ab Apparatus and method for generating a level parameter and apparatus and method for generating a multi-channel representation
US10015597B2 (en) 2004-04-16 2018-07-03 Dolby International Ab Method for representing multi-channel audio signals
JPWO2005112002A1 (en) * 2004-05-19 2008-03-27 松下電器産業株式会社 Audio signal encoding apparatus and audio signal decoding apparatus
US9257127B2 (en) 2006-12-27 2016-02-09 Electronics And Telecommunications Research Institute Apparatus and method for coding and decoding multi-object audio signal with various channel including information bitstream conversion

Also Published As

Publication number Publication date
AU2002343151A1 (en) 2003-06-10
TW200407843A (en) 2004-05-16
CN1589467A (en) 2005-03-02
EP1451809A1 (en) 2004-09-01
US20050021328A1 (en) 2005-01-27
CN1288623C (en) 2006-12-06
BR0206615A (en) 2004-02-17
BR0206611A (en) 2004-02-17
CN1589466A (en) 2005-03-02
AU2002347474A1 (en) 2003-06-10
KR20040066839A (en) 2004-07-27
JP2005509927A (en) 2005-04-14
WO2003044776A1 (en) 2003-05-30
KR20040063155A (en) 2004-07-12
CN1288624C (en) 2006-12-06
EP1451810A1 (en) 2004-09-01
RU2004118840A (en) 2005-10-10
US20050004791A1 (en) 2005-01-06
JP2005509926A (en) 2005-04-14

Similar Documents

Publication Publication Date Title
US10555104B2 (en) Binaural decoder to output spatial stereo sound and a decoding method thereof
US20050004791A1 (en) Perceptual noise substitution
KR100717598B1 (en) Frequency-based coding of audio channels in parametric multi-channel coding systems
JP4603037B2 (en) Apparatus and method for displaying a multi-channel audio signal
US7583805B2 (en) Late reverberation-based synthesis of auditory scenes
US7006636B2 (en) Coherence-based audio coding and synthesis
JP4939933B2 (en) Audio signal encoding apparatus and audio signal decoding apparatus
KR100924576B1 (en) Individual channel temporal envelope shaping for binaural cue coding schemes and the like
US8296158B2 (en) Methods and apparatuses for encoding and decoding object-based audio signals
KR100928311B1 (en) Apparatus and method for generating an encoded stereo signal of an audio piece or audio data stream
US11200906B2 (en) Audio encoding method, to which BRIR/RIR parameterization is applied, and method and device for reproducing audio by using parameterized BRIR/RIR information
KR100891666B1 (en) Apparatus for processing audio signal and method thereof
Kelly et al. The continuity illusion revisited: coding of multiple concurrent sound sources
JP2007104601A (en) Apparatus for supporting header transport function in multi-channel encoding
KR20070076363A (en) Method of encoding and decoding an audio signal

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2002779819

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 10495942

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2003546331

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 1120/CHENP/2004

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 20028232267

Country of ref document: CN

Ref document number: 1020047007816

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2002779819

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2002779819

Country of ref document: EP