EP1054575A2 - Directional decoding - Google Patents

Directional decoding Download PDF

Info

Publication number
EP1054575A2
EP1054575A2 EP00304128A EP00304128A EP1054575A2 EP 1054575 A2 EP1054575 A2 EP 1054575A2 EP 00304128 A EP00304128 A EP 00304128A EP 00304128 A EP00304128 A EP 00304128A EP 1054575 A2 EP1054575 A2 EP 1054575A2
Authority
EP
European Patent Office
Prior art keywords
channel
signal
channels
signals
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP00304128A
Other languages
German (de)
French (fr)
Other versions
EP1054575A3 (en
Inventor
Richard J. Aylward
Hilmar Lehnert
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Bose Corp
Original Assignee
Bose Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/313,058 external-priority patent/US7016501B1/en
Application filed by Bose Corp filed Critical Bose Corp
Publication of EP1054575A2 publication Critical patent/EP1054575A2/en
Publication of EP1054575A3 publication Critical patent/EP1054575A3/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S5/00Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation 
    • H04S5/005Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation  of the pseudo five- or more-channel type, e.g. virtual surround
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/05Generation or adaptation of centre channel in multi-channel audio systems

Definitions

  • the invention relates to the decoding of audio signals into directional channels, and more particularly to novel apparatus and methods for decoding input channels into cardinal output channels.
  • novel apparatus and methods for decoding input channels into cardinal output channels For background reference is made to that application and its background.
  • a method for processing multichannel audio signals includes determining the degree of correlation of two of the channels, and normalizing the channels according to first and second normalization modes; in response to determining that the two channels are correlated and uncorrelated, respectively.
  • a method for processing multichannel audio signals includes determining the degree of correlation of two of the channels and responsive to a determining that the two channels are partially correlated and partially uncorrelated, processing the channels according to a combination of a first normalization mode and a second normalization mode.
  • a method for decoding an encoded multichannel audio signal includes determining the correlation of a first channel and a second channel and processing the first channel and the second channel to produce a third channel and a fourth channel.
  • an apparatus for processing multichannel audio signals includes an input characteristics determiner for determining a degree of correlation of two of the channels; a first normalizing multiplier, coupled to the input characteristics determiner, for applying a first normalizing coefficient to a first of the two channels, the normalizing coefficient being responsive to the degree of correlation; and
  • Input channel characteristics determiner 10 is adapted to receive an audio signal from input channels 12, 13 (identified as left input channel Lt 12 and right input channel Rt 13) from a signal source such as a receiver, VCR, or DVD player. Input channel characteristics determiner 10 is adapted to transmit inputs on channels 12, 13 (by signal lines 17, 19), and to transmit other signals as will be described in the discussion of Fig. 3, to output channel synthesizer 14. Output channel synthesizer 14 is adapted to synthesize output signals on output channels 50, 56, 62, 66, 68, 70, 72, 74.
  • a “channel” as used herein, refers to audio information that is encoded in such a manner that it can be decoded or processed or both and reproduced at a location relative to a listener, so that the listener perceives the sound as originating from a direction in space.
  • Input channels may be encoded in such a way that they can be decoded into more than one output channel, or so that the total number of output channels is greater than the total number of input channels.
  • Output channels are typically designated by a directional designator, such as "left,” “right,” “center,” “surround,” “left surround” and “right surround,” depending on the direction from which it is intended the sound is perceived to come.
  • input channels 12, 13, and output channels 50, 56, 62, 66, 68, 70, 72, 74 are shown as separate elements.
  • the number of input channels is not necessarily the same as the number of physical signal lines that transmit the information in the channels.
  • Digital signal transmission systems typically have one signal line for transmitting several input channels.
  • Input channels are typically encoded as analog electrical signals or as digital bitstreams.
  • Presentation channels refer to channels that are available for decoding or reproduction
  • production channels refer to the channels which have been decoded and which are intended for reproduction by a device such as a loudspeaker.
  • the information in the output channels may be in a "cardinal" state if information in the output channel is exclusively and uniquely associated with that output channel associated direction. Stated differently, if the information in the output channel contains only information for that output channel associated direction and no other output channel contains information for that output channel associated direction, that output channel is in a cardinal state and the associated direction is a cardinal direction. So, for example, if the left surround channel contains only left surround signal content and if no other channels contains left surround signal content, the left surround channel is said to be in a cardinal state, the left surround direction is said to be a cardinal direction, and a location in the cardinal direction relative to a listener is said to be a cardinal location.
  • input channel information is encoded as a signal level, typically measured in volts v with respect to time t .
  • signal level in a channel for example input channel Lt
  • Lt the signal level in a channel
  • the time-avenged values of the signal level in a channel (for example input channel Rt) will be referred to as
  • the difference of the signal level in channels Lt and Rt will be referred to as Lt - Rt
  • the time avenged values of the sum of the signal levels will be referred to as Lt + Rt
  • the absolute value of the time-averaged difference of the signal levels in channels Lt and Rt will be represented as
  • a typical time avenging interval is about 5 ms to about 1000 ms. The length of the time averaging interval is discussed below in connection with Fig. 3.
  • Input channel information may also be encoded digitally as a bitstream of signal levels measured at time intervals.
  • Input channels Lt 12 and Rt 13 are inputted into RMS responding level detector and correlation and phase analyzer 40 which generates the following time averaged signal quantities:
  • the quantities are fed to logic 42, which derives a quantity X that is the larger of either (1) or (2), and derives a quantity Y that is the larger of (3) or (4).
  • Signal quantities (1) and (2) are combined with signal quantities (3) and (4) , along with quantities Y and X to construct normalization coefficients A1, A2, A3, and A4.
  • A1
  • a 2
  • a 3
  • a 4
  • normalization coefficients (A2) and (A3) evaluate to zero.
  • normalization coefficients (A1) and (A4) evaluate to zero.
  • normalization coefficients applied to the signals in channels Lt and Rt are different.
  • the normalization coefficient is responsive to the sum or difference of the two signals and the magnitude of Lt
  • the normalization coefficient is responsive to the sum or difference of the two signals and the magnitude of Rt.
  • a normalization mode of this type which applies different normalization coefficients to the input signals, will be referred to as a "differential mode.”
  • the time averaging interval may be adaptive to the contents of the input signals as determined by correlation and phase analyzer 40. If the input signals are uncorrelated, the avenging interval may be relatively long (for example about 1000 ms). If the input signals are correlated; that is, have similar waveforms, the time intervals may be short (for example about 5 ins). If the magnitude of the signals is relatively small, the time avenging interval may be short. The time averaging interval may be short if both of the input signals are close to zero. If the difference of the magnitude of the signals is large (for example if
  • a common method of implementing time averaging intervals is to measure the signal periodically and weight each measurement exponentially less than the preceding measurement Using this measurement the averaging interval is typically expressed as the period of time it takes for the weighting of the measurement to decline to some fraction, such as 1 / 3 of the weighting of the most recent measurement.
  • Fig. 4 there is shown a first portion of the circuitry of output channel synthesizer 14.
  • Lt input channel 12 is fed to multipliers 22 and 24, where it is multiplied by normalization coefficients A1 and A2, respectively, to form post-normalization channels Lc' 30 and Ls' 32 respectively.
  • Rt input channel 13 is fed to multipliers 34 and 36, where it is multiplied by normalization coefficients A3 and A4, respectively, to form post-normalization channels Rs' 38 and Rc' 40, respectively.
  • Lt to Lc' is equal (in magnitude) to the contribution from Rt to Rc' (or Rs'), independent of the relative amplitude difference (if any) imposed at the input terminals Lt and Rt. Furthermore the contribution from Lt to Lc' (or Ls') and Rt to Rc' (or Rs') is equal to the lesser of the two input signal amplitudes at Lt and Rt.
  • the resulting normalized output signals at (A1) through (A4) are equal amplitude monaural contributions from Lt and Rt which are directionally identified as center channel or center surround channel components.
  • the normalization function is singularly responsive to the dominant condition. Accordingly, sum dominant normalized signals appearing at the outputs of (A1) and (A4) can contain a nondominant surround channel signal. Likewise, difference dominant normalized signals appearing at the outputs of (A2) and (A3) can contain a nondominant center channel signal.
  • the surround channel signal which is present at the outputs of (A1) and (A4) during a sum signal dominant condition, is retrieved by subtracting the output of (A4) from (A1).
  • the surround channel signal is identified as containing a 180 degree relative phase difference at input terminals Lt and Rt.
  • the center channel signal appearing at the outputs of (A2) and (A3) during a difference dominant condition is retrieved by summing the output of (A3) with (A2).
  • the center channel signal is identified as being the in-phase signal appearing at input terminals Lt and Rt.
  • the normalization function illustrated in Fig. 4 has an important characteristic. If the input signals at Lt and Rt contain a dominant center channel signal and simultaneously contain uncorrelated unequal amplitude signals, (such that Lt or Rt is in a condition of dominance) the normalized Lt and Rt signal contributions at the outputs of (A1) and (A4) will not contain equal amplitude contributions of the Lt and Rt input signals, but rather, equal magnitude contributions of the normalized Lt and Rt input signals. Subtracting the output of (A4) from (A1) to retrieve a surround channel signal in the presence of a sum signal dominant condition and an Lt or Rt dominant condition will introduce a portion of the center channel signal into the surround channel.
  • the invention includes a method for providing an improved normalization mode for instances in which contents of Lt and lit are other than panned mono.
  • logic 42 outputs the following values for A1, A2, A3, and A4:
  • a 1
  • a 2
  • a 3
  • a 4
  • These normalization coefficients are formed by taking the signal quantities (1) and (2) in combination with the Y variable and do not include the signal quantities
  • normalization coefficients are formed by taking the signal quantities (1) and (2) in combination with the Y variable, which is common to the normalization coefficients applied to both Lt and Rt, and which do not include the signal quantities
  • a normalization mode of this type, which applies a common normalization coefficient to the input signals will be referred to as a "common mode.”
  • the time averaging intervals may vary, as in the discussion above.
  • a 2 A 8 ⁇
  • a 3 A 8 ⁇ [ Lt-Rt ]- A 7 Rt -(1- A 7) Y Y + ⁇ - u (
  • equations A1, A2, A3, and A4 are applicable to all degrees of correlation and phase. In the case of highly correlated signals, these generalized equations reduce to the differential mode normalization coefficients. In the case of the highly uncorrelated signals, these generalized equations reduce to the common mode normalization coefficients. In the case signals that are partially correlated, the generalized equations yield a result that has some differential content and some common content A normalization of this type will be referred to a "complex mode.”
  • Fig. 5 there is shown a second portion of the circuitry of output channel synthesizer 14.
  • Rc Rc' + .5(1s' + Rs')
  • Ls'' Ls' + .5(Lc' - Rc')
  • Rs''' Rs' + .5(Rc' - Lc')
  • the normalization coefficients are singularly (and therefore exclusively) responsive to the dominant input signal condition at Lt and Rt. If the input signals at Lt and Rt are sum signal dominant, the input signals at Lt and Rt are correlated in-phase, and only normalization multipliers (A1) and (A4) are active. If the input signals at Lt and Rt are difference signal dominant the input signals at Lt and Rt are correlated wit a relative 180 degree phase shift, and only normalization multipliers (A2) and (A3) are active. If the input signals at Lt and Rt are uncorrelated (or in phase quadrature), the sum signal magnitude and the difference signal magnitude are equal, and all normalization multipliers (A1) through (A4) are active with the same numerical value.
  • the Lt and Rt input signals are respectively reduced by subtracting the normalized amplitude of Lt from Lt, and the normalized amplitude of Rt from Rt. This produces a corresponding reduction in the amplitudes of Lo' and Ro'.
  • Lc Lc' + .5 (Ls' + Rs')
  • Rc Rc' + .5 (Rs' + Ls')
  • Ls'' Ls' + .5 (Lc' - Rc)
  • Rs'' Rs' + .5 (Rc'- Lc')
  • Lc' and Ls' are components of the normalized Lt input
  • the Lc' signal cumulatively combines with the Ls' signal
  • the Rc' signal cumulatively combines with the Rs' signal.
  • the normalization coefficient variables at (A1) through (A4) are numerically identical when the input signal conditions at Lt and Rt are uncorrelated in nature. For this condition, the Lt contribution to Lc and Ls'' is dominant over the Rt contribution to Lc and Ls'' by a factor of three, or approximately 10 dB. The Rt contribution to Rc' and Rs'' is dominant over the Lt contribution to Rc' and Rs'' by the same factor of three, or approximately 10 dB. As such, the Lc' and Ls'' signals are substantially components of the normalized Lt input, and the Rc' and Rs'' signals are substantially components of the normalized Rt input.
  • a signal processing system reproduces the contributions from Lt to center and Rt to center as separate Lc and Rc signals, whenever separate center channel loudspeakers can be practically utilized in a reproduction system. This is advantageous over audio signal processing systems that derive a center channel signal from matrix encoded Lt and Rt stereophonic signals by summing a portion (or all) of the component signals at Lt and Rt. Recall that the normalization coefficient values of input normalization multipliers (A1) and (A4) are approximately zero whenever the input signals at Lt and Rt are difference signal dominant.
  • Lc and Rc are identical.
  • Summing the component signals of Lt and Rt at Ls' and Rs' to produce Lc and Rc ensures that the Lc and Rc signals do not contain the dominant surround channel signal .
  • Lc and Rc are largely stereophonic when the input conditions at Lt and Rt are uncorrelated or stereophonic, in nature, and that the content of Lc and Rc are monaural whenever the nature of the input signals at Lt and Rt are difference signal (or surround channel) dominant.
  • Channels Lc and Rc are largely monaural in nature whenever the input signal conditions at Lt and Rt are substantially correlated.
  • the normalization coefficient values at input normalization multipliers (A2) and (A3) are approximately zero whenever the input signal conditions at Lt and Rt are sum signal (or center channel) dominant.
  • the surround channel signal which may be present at Lt and Rt during a sum signal dominant condition is derived by subtracting the signal components of Rc' from Lc' to produce Ls'' and similarly subtracting the signal components at Lc' from Rc' to produce Rs''.
  • Subtracting Rc' from Lc' to produce Ls'' and Lc' from Re' to produce Rs'' ensures that Ls'' and Rs'' do not contain any center channel signal components whenever Lt and Rt are substantially sum signal (or center channel) dominant.
  • the content of the interim signals Ls'' and Rs'' are largely stereophonic in nature whenever the input signal conditions at Lt and Rt are uncorrelated or substantially stereophonic in nature.
  • the interim signals at Ls'' and Rs'' are substantially monaural in nature whenever the input signal conditions at Lt and Rt are substantially sum signal (or center channel) dominant.
  • the stereophonic nature of the interim signals, Ls'' and Rs'' for uncorrelated input signals at Lt and Rt is advantageous over audio signal processing sytems that derive a monaural surround channel signal from matrix encoded stereophonic Lt and Rt signals by subtracting a portion (or all) of the Rt input signal from the Lt input signal.
  • the interim signals at Ls'' and Rs'' although largely stereophonic in nature when the input signal conditions at Lt and Rt are uncorrelated, do not exhibit exclusive cardinal states.
  • the encoded Lt and Rt signals are such that an exclusive left surround channel signal or an exclusive right surround channel signal will respectively appear at Lt and Rt as:
  • the Lt or Rt encoded signals are such that a difference signal dominant condition is encoded with an attending Lt or Rt dominant condition.
  • the encoded Lt and Rt signals are panned mono.
  • An audio signal processing system according to the invention is advantageous because it can decode the given encoded Lt and Rt signal conditions as exclusive left only or right only surround channel signals.
  • FIG. 7 there is shown another portion of channels synthesizer 14.
  • Interim channels La'' 52 and Rs'' 54 signals are combined to form left front channel Lo 64, right front channel Ro 66, left center surround channel Lcs 68, right center surround channel Rcs 70, left surround channel Ls 72, and right surround channel Rs 74 according to:
  • Lo Lo' - .5(A5(.75 Ls'' - .25Rs''))
  • Ro Ro' - .5(A6(.75Rs'' - .25 Rs'))
  • the effect of the circuit of Fig. 7 is to re-matrix interim channels Ls'' and Rs'' with the normalization coefficients A5 and A6.
  • the out-of-phase (or surround channel) signals cumulatively combine, whereas the in-phase (or center channel) signals differentially combine.
  • Re-matrixing the Ls'' and Rs'' signals causes a corresponding reduction in amplitude of any center channel signal component which may be present in Ls'' or Rs'' during a difference dominant, uncorrelated input signal condition at Lt and Rt
  • the process of re-matrixing the Ls'' and Rs'' signals further reduces the stereophonic content of Ls'' and Rs''
  • the contribution of Lt to Ls'' is still dominant over the contribution of Rt to Ls''.
  • the contribution of Rt to Rs'' is still dominant over the contribution of Lt to Rs'.
  • the rematrixed Ls'' and Rs'' signals still retain a stereophonic characteristic when the signal conditions at Lt and Rt are substantially uncorrelated.
  • the Lo' interim signal contains the differential surround channel signal that was dominant in Lt.
  • the outputs of multipliers (A5) or (A6) are equal in amplitude to the contribution of Lt or Rt at Ls'' or Rs'' which is a component of the originating encoded Ls or Rs input signal conditions.
  • all Lt dominant and difference signal dominant input signal conditions are defined as Ls dominant output signal conditions.
  • all Rt dominant and difference signal dominant input signal conditions are defined to be Rs dominant output signal conditions.
  • the directionally cardinal Ls or Rs encoded signal conditions are decoded as cardinal Ls or Rs output signal conditions.
  • the decoder is the complement of the encoded signal conditions. It is also instructive to consider that the output signals at Ls and Rs are approximately zero whenever the encoded signals at Lt and Rt are equal amplitude signals. For this condition, the encoded signals are decoded to the Lcs and Rcs output terminals. In this regard, the decoded output signal conditions are the directional complement of the encoded signal conditions.
  • a signal can be cardinally decoded to the following output terminals: Lo 62, Lc 50 Rc 70, Ro 66, Ls 72, Lcs 68, Rcs 56, Rs 74 placed relative to a listener 78 as indicated.
  • Lt and Rt signals it is possible to decode matrix encoded Lt and Rt signals to six directionally cardinal locations in a 360-degree space. Interim directional locations are "phantom" sources based upon the presence of the decoded signal in multiple channels. For example, a signal can be encoded and subsequently decoded in a complementary manner; to appear at any point between the left channel output and left surround channel output. Likewise, a signal can be encoded and subsequently decoded in a complementary manner, to appear anywhere between the right output channel and right surround output channel. Thus a signal can be encoded and subsequently decoded to appear at any point within a 360-degree spatial angle.
  • Down-mixing the decoded output channels does not reduce the number of cardinal directional states, but rather, the way in which the cardinal directions are reproduced.
  • the cardinal Ls and Rs directional states are still retained.
  • the stereophonic nature of the signals at Ls and Rs is likewise preserved.
  • the exclusive Lcs/Rcs output condition is now reproduced as equal amplitude signals at Ls and Rs.
  • the Lc and Rc output signals appear at the single center channel output, thus retaining the cardinal center only direction.
  • Fig. 9 there is shown the combined circuits of Figs. 3, 4, 5, 6, and 8.
  • the composite block diagram of Fig. 9 is constructed from the individual block diagrams of Figs. 4,5,6, and 7.
  • Boolean switches 80 and 82 have been incorporated into Fig. 9 to enable or disable center channel decoding or surround channel decoding or both.
  • the input signals at Lt and Rt are presented at the Lo and Ro output terminals.
  • Setting the surround channel mode switches to the off state presents the surround channel signals at Lt and Rt to the Lo and Ro output terminals.
  • setting the center channel mode switches to the off state presents the center channel signals at Lt and Rt to the Lo and Ro output terminals.
  • the number of provided reproduction channels are fewer than the number of available presentation channels. In these instances, it is advantageous to process the lesser number of reproduction channels such that the derived number of reproduction channels are equal to the number of available presentation channels.
  • contemporary signal transport formats convey as few as one channel or as many as five channels with an attending (spectrally limited) low frequency effects channel.
  • information identifying the intended reproduction channel format are included as supplementary data within the transport format. It is possible to utilize the supplementary data as a means of re-formatting the number of intended reproduction channels for further processing into the number of available presentation channels.
  • the provided reproduction channel information is defined in terms of the number of front and rear (surround) reproduction channels. The most widespread formats are:
  • the channels are processed discretely.
  • format (2) only the provided left and right reproduction channels are processed as Rt and Lt to obtain a new left, right, and (the derived) center presentation channel signal(s).
  • the originating surround channel signals of format (2) are not processed, and the surround channel mode switches 80 in the block diagram of Fig. 9 are set to the off state.
  • the given channel format is first converted to a matrix format for processing. This is accomplished by first down-mixing the given monaural surround channel into the given left channel to form Lnew and further down-mixing (out-of-phase) the given monaural surround channel into the given right channel to form Rnew. Lnew and Rnew are subsequently input into the decoder to obtain new left, right, left surround and right surround presentation channels.
  • the center channel mode switches 80 are set to the off state, since the originating center channel signal is not processed and is reproduced as given.

Abstract

The degree of correlation between two audio signals is determined and the channels are normalised according to first and second normalisation modes responsive to correlation and uncorrelation respectively.

Description

  • The invention relates to the decoding of audio signals into directional channels, and more particularly to novel apparatus and methods for decoding input channels into cardinal output channels. For background reference is made to that application and its background.
  • It is an important object of the invention to provide an improved method and apparatus for decoding audio signals into multiple output channels.
  • According to the invention, a method for processing multichannel audio signals includes determining the degree of correlation of two of the channels, and normalizing the channels according to first and second normalization modes; in response to determining that the two channels are correlated and uncorrelated, respectively.
  • In another aspect of the invention, a method for processing multichannel audio signals includes determining the degree of correlation of two of the channels and responsive to a determining that the two channels are partially correlated and partially uncorrelated, processing the channels according to a combination of a first normalization mode and a second normalization mode.
  • In another aspect of the invention, a method for decoding an encoded multichannel audio signal includes determining the correlation of a first channel and a second channel and processing the first channel and the second channel to produce a third channel and a fourth channel.
  • In still another aspect of the invention, an apparatus for processing multichannel audio signals, includes an input characteristics determiner for determining a degree of correlation of two of the channels; a first normalizing multiplier, coupled to the input characteristics determiner, for applying a first normalizing coefficient to a first of the two channels, the normalizing coefficient being responsive to the degree of correlation; and
  • a second normalizing multiplier, coupled to the input characteristics determiner, for applying a second normalizing coefficient to the second signal, the normalizing coefficient being responsive to the degree of correlation.
  • Other features, objects, and advantages will become apparent from the following detailed description, which refers to the following drawings in which:
  • FIG. 1 is a block diagram of an audio signal processing system;
  • FIG.2 is a representation of an audio signal, helpful in explaining characteristics of the audio signal;
  • FIG. 3 is a block diagram of an input characteristics determiner according to the invention;
  • FIG. 4 is a first portion of the circuitry of an output channel synthesizer according to the invention;
  • FIG. 5 is a second portion of the circuitry of an output channel synthesizer according to the invention;
  • FIG. 6 is a third portion of the circuitry of an output channel synthesizer according to the invention;
  • FIG. 7 is a fourth portion of the circuitry of an output channel synthesizer according to the invention;
  • FIG. 8 is a diagram illustrating the placement of audio reproduction speakers coupled to outputs of an output channels synthesizer according to the invention;
  • FIG. 9 is the combined circuitry of figs. 4-7; and
  • FIG. 10 is circuit illustrating the pre-processing of signals to the audio signal processing system.
  • Referring now to Fig. 1, there is shown a two-input channel, eight-output channel wideband directional decoding audio signal processing system 1 according to the invention. Input channel characteristics determiner 10 is adapted to receive an audio signal from input channels 12, 13 (identified as left input channel Lt 12 and right input channel Rt 13) from a signal source such as a receiver, VCR, or DVD player. Input channel characteristics determiner 10 is adapted to transmit inputs on channels 12, 13 (by signal lines 17, 19), and to transmit other signals as will be described in the discussion of Fig. 3, to output channel synthesizer 14. Output channel synthesizer 14 is adapted to synthesize output signals on output channels 50, 56, 62, 66, 68, 70, 72, 74.
  • A "channel" as used herein, refers to audio information that is encoded in such a manner that it can be decoded or processed or both and reproduced at a location relative to a listener, so that the listener perceives the sound as originating from a direction in space. Input channels may be encoded in such a way that they can be decoded into more than one output channel, or so that the total number of output channels is greater than the total number of input channels. Output channels are typically designated by a directional designator, such as "left," "right," "center," "surround," "left surround" and "right surround," depending on the direction from which it is intended the sound is perceived to come. For purposes of explanation, input channels 12, 13, and output channels 50, 56, 62, 66, 68, 70, 72, 74 are shown as separate elements. The number of input channels is not necessarily the same as the number of physical signal lines that transmit the information in the channels. Digital signal transmission systems, typically have one signal line for transmitting several input channels. Input channels are typically encoded as analog electrical signals or as digital bitstreams.
  • "Presentation channels" refer to channels that are available for decoding or reproduction, and "reproduction channels" refer to the channels which have been decoded and which are intended for reproduction by a device such as a loudspeaker.
  • The information in the output channels may be in a "cardinal" state if information in the output channel is exclusively and uniquely associated with that output channel associated direction. Stated differently, if the information in the output channel contains only information for that output channel associated direction and no other output channel contains information for that output channel associated direction, that output channel is in a cardinal state and the associated direction is a cardinal direction. So, for example, if the left surround channel contains only left surround signal content and if no other channels contains left surround signal content, the left surround channel is said to be in a cardinal state, the left surround direction is said to be a cardinal direction, and a location in the cardinal direction relative to a listener is said to be a cardinal location.
  • Referring now to Fig. 2, there is shown an example of input channel information. In Fig. 2, input channel information is encoded as a signal level, typically measured in volts v with respect to time t. For ease of explanation, the signal level in a channel (for example input channel Lt) will be referred in the equations as Lt . Similarly, for example, the time-avenged values of the signal level in a channel (for example input channel Rt) will be referred to as |Rt| , the difference of the signal level in channels Lt and Rt will be referred to as Lt - Rt , the time avenged values of the sum of the signal levels will be referred to as Lt + Rt , and the absolute value of the time-averaged difference of the signal levels in channels Lt and Rt will be represented as |Lt - Rt| ,and similar references to other signals. A typical time avenging interval is about 5 ms to about 1000 ms. The length of the time averaging interval is discussed below in connection with Fig. 3. Input channel information may also be encoded digitally as a bitstream of signal levels measured at time intervals.
  • Referring now to Fig. 3, there is shown input channel characteristics determiner 10 in more detail. Input channels Lt 12 and Rt 13 are inputted into RMS responding level detector and correlation and phase analyzer 40 which generates the following time averaged signal quantities: |Lt + Rt| |Lt - Rt| |Lt| |Rt| The quantities are fed to logic 42, which derives a quantity X that is the larger of either (1) or (2), and derives a quantity Y that is the larger of (3) or (4). Signal quantities (1) and (2) are combined with signal quantities (3) and (4) , along with quantities Y and X to construct normalization coefficients A1, A2, A3, and A4. The specific combinations of quantities (1), (2), (3), and (4), and quantities X and Y used to construct A1, A2, A3, and A4 are dependent on correlation and phase relationship information as determined by RMS responding level detector and correlation and phase analyzer 40. If the input channels Lt and Rt are correlated (a condition hereinafter referred to as "panned mono") the values of A1, A2, A3, and A4 are: A1 = |Lt+Rt|-|Lt| Y    {0,1} A2 = |Lt-Rt|-|Lt| Y    {0,1} A3 = |Lt-Rt|-|Rt| Y    {0,1} A4 = | Lt+Rt|-|Rt| Y    {0,1}
  • The domains of all normalization coefficients are from 0 to 1 inclusive. Thus, for the condition of sum signal dominance, normalization coefficients (A2) and (A3) evaluate to zero. Similarly, for the condition of difference signal dominance, normalization coefficients (A1) and (A4) evaluate to zero.
  • The normalization coefficients applied to the signals in channels Lt and Rt are different. In the case of normalization coefficients A1 and A2, the normalization coefficient is responsive to the sum or difference of the two signals and the magnitude of Lt, while in the case of normalization coefficients A3 and A4, the normalization coefficient is responsive to the sum or difference of the two signals and the magnitude of Rt. A normalization mode of this type, which applies different normalization coefficients to the input signals, will be referred to as a "differential mode."
  • In one embodiment the time averaging interval may be adaptive to the contents of the input signals as determined by correlation and phase analyzer 40. If the input signals are uncorrelated, the avenging interval may be relatively long (for example about 1000 ms). If the input signals are correlated; that is, have similar waveforms, the time intervals may be short (for example about 5 ins). If the magnitude of the signals is relatively small, the time avenging interval may be short. The time averaging interval may be short if both of the input signals are close to zero. If the difference of the magnitude of the signals is large (for example if |Lt - Rt| ≥ 20dB ) the time averaging interval may be short. A common method of implementing time averaging intervals is to measure the signal periodically and weight each measurement exponentially less than the preceding measurement Using this measurement the averaging interval is typically expressed as the period of time it takes for the weighting of the measurement to decline to some fraction, such as 1 / 3 of the weighting of the most recent measurement.
  • Referring now to Fig. 4, there is shown a first portion of the circuitry of output channel synthesizer 14. Lt input channel 12 is fed to multipliers 22 and 24, where it is multiplied by normalization coefficients A1 and A2, respectively, to form post-normalization channels Lc' 30 and Ls' 32 respectively. Similarly, Rt input channel 13 is fed to multipliers 34 and 36, where it is multiplied by normalization coefficients A3 and A4, respectively, to form post-normalization channels Rs' 38 and Rc' 40, respectively.
  • If input signals at Lt and Rt are correlated and are further constrained to be either in phase, or phase shifted by a 180 degree relative phase difference, the contribution from Lt to Lc' (or Ls'), is equal (in magnitude) to the contribution from Rt to Rc' (or Rs'), independent of the relative amplitude difference (if any) imposed at the input terminals Lt and Rt. Furthermore the contribution from Lt to Lc' (or Ls') and Rt to Rc' (or Rs') is equal to the lesser of the two input signal amplitudes at Lt and Rt. The resulting normalized output signals at (A1) through (A4) are equal amplitude monaural contributions from Lt and Rt which are directionally identified as center channel or center surround channel components. If the input conditions at Lt and Rt are considered to include both a center channel signal and a surround channel signal, but produce either a sum signal dominant or difference signal dominant condition, the normalization function is singularly responsive to the dominant condition. Accordingly, sum dominant normalized signals appearing at the outputs of (A1) and (A4) can contain a nondominant surround channel signal. Likewise, difference dominant normalized signals appearing at the outputs of (A2) and (A3) can contain a nondominant center channel signal. The surround channel signal, which is present at the outputs of (A1) and (A4) during a sum signal dominant condition, is retrieved by subtracting the output of (A4) from (A1). The surround channel signal is identified as containing a 180 degree relative phase difference at input terminals Lt and Rt. Similarly, the center channel signal appearing at the outputs of (A2) and (A3) during a difference dominant condition, is retrieved by summing the output of (A3) with (A2). The center channel signal is identified as being the in-phase signal appearing at input terminals Lt and Rt.
  • The normalization function illustrated in Fig. 4 has an important characteristic. If the input signals at Lt and Rt contain a dominant center channel signal and simultaneously contain uncorrelated unequal amplitude signals, (such that Lt or Rt is in a condition of dominance) the normalized Lt and Rt signal contributions at the outputs of (A1) and (A4) will not contain equal amplitude contributions of the Lt and Rt input signals, but rather, equal magnitude contributions of the normalized Lt and Rt input signals. Subtracting the output of (A4) from (A1) to retrieve a surround channel signal in the presence of a sum signal dominant condition and an Lt or Rt dominant condition will introduce a portion of the center channel signal into the surround channel. Adding the outputs of (A2) and (A3) to retrieve a center channel signal in the presence of a difference dominant input condition at Lt and Rt during an Lt or Rt dominant input condition, will introduce a portion of the surround channel signal into the center channel. Thus, a differentially based normalization function is especially desirable when the input conditions at Lt and Rt are panned mono. However, it is desirable to adapt the normalization function to the input signal conditions at Lt and Rt whenever the inputs are other than panned mono.
  • Another feature of the invention is that the invention includes a method for providing an improved normalization mode for instances in which contents of Lt and lit are other than panned mono. Referring again to Fig. 3, if RMS responding level detector and correlation and phase analyzer 40 detects that the signals at Rt and Lt are uncorrelated, logic 42 outputs the following values for A1, A2, A3, and A4: A1 = |Lt+Rt|-Y Y    {0,1} A2 = |Lt-Rt|-Y Y    {0,1} A3 = |Lt-Rt|-Y Y    {0,1} A4 = |Lt+Rt|-Y Y    {0,1} These normalization coefficients are formed by taking the signal quantities (1) and (2) in combination with the Y variable and do not include the signal quantities |Lt| and |Rt| .
  • These normalization coefficients are formed by taking the signal quantities (1) and (2) in combination with the Y variable, which is common to the normalization coefficients applied to both Lt and Rt, and which do not include the signal quantities |Lt| and |Rt| . A normalization mode of this type, which applies a common normalization coefficient to the input signals will be referred to as a "common mode."
  • The time averaging intervals may vary, as in the discussion above.
  • The substitution of the Y variable for signal quantities (3) and (4) into normalization coefficients (A1) through (A4) transform normalization coefficients (A1) through (A4) from differential mode to common mode. When the signals in input channels Lt and Rt are uncorrelated, the value of A1 for any assumed Lt and Rt input conditions will be equal to the value of A4. Likewise, the value of A2 will also be equal to the value of A3.
  • Referring now to Fig. 4, and using the new values of A1 - A4, the previous input signal conditions at Lt and Rt , wherein Lt or Rt is dominant and simultaneously contain a dominant center channel signal, now produce equal center channel signal contributions from Lt and Rt at the outputs of A1 and A4. Subtracting the output of A4 from A1 no longer introduces a center channel signal into the surround channel. Further, adding the output of A2 to A3 will not introduce a surround channel signal into the center channel if the input signals at Lt and Rt contain a dominant surround channel signal with an attending Lt or Rt dominant signal. Thus a common-mode based normalization function is desirable whenever the input signals at Lt and Rt are uncorrelated. Normalization coefficients (A1) through (A4) can now be linked when the signals in input channels Lt and Rt are correlated with the values of (A1) through (A4) when the signals in input channels Lt and Rt are uncorrelated to form transform coefficient (A7), and further define transform coefficient A7 as: A7 = X- Lt 2+Rt 2 Lt-Rt- Lt 2+Rt 2 and operator A8 as: A8 = Lt+Rt X where e is an arbitrary number, much smaller than any of the other quantities, inserted so that if the remaining terms of the denominator evaluate to zero, the circuit will not attempt to divide by zero.
  • Normalization coefficients (A1) through (A4) can now be generalized as: A1 = A8 ·|Lt-Rt|- A 7Lt-(1-A7)Y Y - u(|Lt - Rt| - |Lt + Rt|) · A7 · 1 - 2L Y [0,1] [0,1] A2 = A8 · |Lt-Rt|- A 7Lt-(1-A7)Y Y - u(|Lt + Rt| - |Lt - Rt|) · A7 · 1 - 2Lt Y [0,1] [0,1] A3 = A8 · [Lt-Rt]- A 7Rt-(1-A7)Y Y - u(|Lt + Rt| - Lt - Rt) · A7 · 1 - 2R Y [0,1] [0,1] A4 = A8 · |Lt-Rt|-A8Rt-(1-At)Y Y - u(|Lt - Rt| - Lt + Rt) · A7 · 1 - 2R Y [0,1] [0,1]
  • The generalized form of equations A1, A2, A3, and A4 is applicable to all degrees of correlation and phase. In the case of highly correlated signals, these generalized equations reduce to the differential mode normalization coefficients. In the case of the highly uncorrelated signals, these generalized equations reduce to the common mode normalization coefficients. In the case signals that are partially correlated, the generalized equations yield a result that has some differential content and some common content A normalization of this type will be referred to a "complex mode."
  • Referring now to Fig. 5, there is shown a second portion of the circuitry of output channel synthesizer 14. The post-normalization channels of Fig. 4 are combined to produce interim channels Lc 50, Ls'' 52, Rs''54, and Rc 56 as Lc = Lc' + .5(Ls' + Rs') Rc = Rc' + .5(1s' + Rs') Ls'' = Ls' + .5(Lc' - Rc') Rs'' = Rs' + .5(Rc' - Lc') Putting the interim channels in terms of the normalization coefficients A1 ― A4 yields: Lc = Lt(A1) + .5 { Lt (A2) + Rt (A3) } Rc = Rt(A4) + .5 { Lt (A2) + Rt (A3) } Ls'' = Lt(A2) + .5 { Lt(A1) - Rt(A4) } Rs'' = Rt(A3) + .5 { Rt(A4) - Lt(A1) }
  • Referring now to Fig. 6, there is shown the circuitry of fig. 5, with the added interim channels Lo' 60 and Ro' 62 that at the outputs of combiners produce: Lo' = Lt - Rc + Rs'' Ro' = Rt - Lc + Ls''
  • The normalization coefficients are singularly (and therefore exclusively) responsive to the dominant input signal condition at Lt and Rt. If the input signals at Lt and Rt are sum signal dominant, the input signals at Lt and Rt are correlated in-phase, and only normalization multipliers (A1) and (A4) are active. If the input signals at Lt and Rt are difference signal dominant the input signals at Lt and Rt are correlated wit a relative 180 degree phase shift, and only normalization multipliers (A2) and (A3) are active. If the input signals at Lt and Rt are uncorrelated (or in phase quadrature), the sum signal magnitude and the difference signal magnitude are equal, and all normalization multipliers (A1) through (A4) are active with the same numerical value.
  • The consequence of subtracting a correlated Rc signal from the Lt input is simply a reduction in the amplitude of the correlated in-phase (or center channel) signal at Lt. This does not reduce the amplitude of the uniquely left channel signal components, since Rc does not contain any uniquely left channel signal components. The amount of Rc signal removed from the Lt input is linearly dependant upon the relative degree of correlation between the Lt and Rt input signals. The same consequence exists when subtracting the Lc signal components from the Rt input. The amplitude of the correlated in-phase signal components at Rt are reduced in proportion to the degree of correlation between the Lt and Rt input signals.
  • The consequence of adding a correlated (but out-of-phase) Rs'' signal to the Lt input is a reduction in the amplitude of the correlated but out-of-phase (or surround) channel signal at Lt. This does not reduce the amplitude of the uniquely left channel signal components, since the Rs'' signal does not contain any uniquely left channel signal components. The amount of Rs'' signal removed from the Lt input is linearly dependant upon the degree to which the Lt and Rt inputs are correlated, out-of-phase. The same consequence exists when adding out-of-phase correlated signal components in Ls'' to Rt. The amplitude of the correlated out-of-phase signal components at Rt are reduced in proportion to the degree of correlation between the Lt and Rt input signals.
  • When the input signal conditions at Lt and Rt are uncorrelated, the matrix of terms Rs'' - Rc and Ls'' - Lc reduce (respectively) to : -.5 { (A1) + (A2) } Lt and -.5 { (A4) + (A3) } Rt
  • Thus, the Lt and Rt input signals are respectively reduced by subtracting the normalized amplitude of Lt from Lt, and the normalized amplitude of Rt from Rt. This produces a corresponding reduction in the amplitudes of Lo' and Ro'.
  • Considering the nature of the signals Lc, Rc, Ls'', and Rs'', recall that Lc = Lc' + .5 (Ls' + Rs') Rc = Rc' + .5 (Rs' + Ls') Ls'' = Ls' + .5 (Lc' - Rc) Rs'' = Rs' + .5 (Rc'- Lc') Since Lc' and Ls' are components of the normalized Lt input, and Rc' and Rs' are components of the normalized Rt input, the Lc' signal cumulatively combines with the Ls' signal and the Rc' signal cumulatively combines with the Rs' signal. The normalization coefficient variables at (A1) through (A4) are numerically identical when the input signal conditions at Lt and Rt are uncorrelated in nature. For this condition, the Lt contribution to Lc and Ls'' is dominant over the Rt contribution to Lc and Ls'' by a factor of three, or approximately 10 dB. The Rt contribution to Rc' and Rs'' is dominant over the Lt contribution to Rc' and Rs'' by the same factor of three, or approximately 10 dB. As such, the Lc' and Ls'' signals are substantially components of the normalized Lt input, and the Rc' and Rs'' signals are substantially components of the normalized Rt input. If the Lc' and Rc' signals are respectively reproduced by separate loudspeakers placed to the left and right of center, the stereophonic content of the uncorrelated signals at Lt and Rt are substantially preserved. A signal processing system according to the invention reproduces the contributions from Lt to center and Rt to center as separate Lc and Rc signals, whenever separate center channel loudspeakers can be practically utilized in a reproduction system. This is advantageous over audio signal processing systems that derive a center channel signal from matrix encoded Lt and Rt stereophonic signals by summing a portion (or all) of the component signals at Lt and Rt. Recall that the normalization coefficient values of input normalization multipliers (A1) and (A4) are approximately zero whenever the input signals at Lt and Rt are difference signal dominant. The center channel signal which can be present at Lt and Rt during a condition of difference signal dominance is defined at Lc and Re by: Lc = .5 (Ls' + Rs') Rc = .5 (Rs' + Ls') For this condition of Lt and Rt input signal assumptions, Lc and Rc are identical. The summation of the signals Ls' and Rs' at Lc and Rc, respectively, force Lc and Rc to be monaural in nature. Summing the component signals of Lt and Rt at Ls' and Rs' to produce Lc and Rc ensures that the Lc and Rc signals do not contain the dominant surround channel signal . The content of Lc and Rc are largely stereophonic when the input conditions at Lt and Rt are uncorrelated or stereophonic, in nature, and that the content of Lc and Rc are monaural whenever the nature of the input signals at Lt and Rt are difference signal (or surround channel) dominant. Channels Lc and Rc are largely monaural in nature whenever the input signal conditions at Lt and Rt are substantially correlated.
  • The interim signals at Ls'' and Rs'' are similarly reduced whenever the input signal conditions at Lt and Rt are substantially sum signal (or center channel) dominant to: Ls'' = .5 (Lc' - Rc') Rs'' = .5 (Rc' - Lc') The normalization coefficient values at input normalization multipliers (A2) and (A3) are approximately zero whenever the input signal conditions at Lt and Rt are sum signal (or center channel) dominant. The surround channel signal which may be present at Lt and Rt during a sum signal dominant condition is derived by subtracting the signal components of Rc' from Lc' to produce Ls'' and similarly subtracting the signal components at Lc' from Rc' to produce Rs''. Subtracting Rc' from Lc' to produce Ls'' and Lc' from Re' to produce Rs'' ensures that Ls'' and Rs'' do not contain any center channel signal components whenever Lt and Rt are substantially sum signal (or center channel) dominant. The content of the interim signals Ls'' and Rs'' are largely stereophonic in nature whenever the input signal conditions at Lt and Rt are uncorrelated or substantially stereophonic in nature. The interim signals at Ls'' and Rs'' are substantially monaural in nature whenever the input signal conditions at Lt and Rt are substantially sum signal (or center channel) dominant. The stereophonic nature of the interim signals, Ls'' and Rs'' for uncorrelated input signals at Lt and Rt is advantageous over audio signal processing sytems that derive a monaural surround channel signal from matrix encoded stereophonic Lt and Rt signals by subtracting a portion (or all) of the Rt input signal from the Lt input signal.
  • The interim signals at Ls'' and Rs'', although largely stereophonic in nature when the input signal conditions at Lt and Rt are uncorrelated, do not exhibit exclusive cardinal states. The encoded Lt and Rt signals are such that an exclusive left surround channel signal or an exclusive right surround channel signal will respectively appear at Lt and Rt as:
  • Lt = Ls, Rt = -.5(Ls) for exclusive left surround channel signal input
  • Rt = -Rs Lt = .5(Rs) for exclusive right surround channel signal input. For exclusive left only or right only surround channel signals, the Lt or Rt encoded signals are such that a difference signal dominant condition is encoded with an attending Lt or Rt dominant condition. Furthermore, the encoded Lt and Rt signals are panned mono. An audio signal processing system according to the invention is advantageous because it can decode the given encoded Lt and Rt signal conditions as exclusive left only or right only surround channel signals.
  • Referring now to Fig. 7, there is shown another portion of channels synthesizer 14. Interim channels La'' 52 and Rs'' 54 signals are combined to form left front channel Lo 64, right front channel Ro 66, left center surround channel Lcs 68, right center surround channel Rcs 70, left surround channel Ls 72, and right surround channel Rs 74 according to: Lo = Lo' - .5(A5(.75 Ls'' - .25Rs'')) Ro = Ro' - .5(A6(.75Rs'' - .25 Rs'')) Lcs = .5(A5(.75Ls'' - .25Rs'')) + .5(A6(.75Ls'' - .25Rs'')) + .75Ls'' - .25Ls'' Rcs = .5(A6(.75Ls'' - .25Rs'')) + .5(A5(.75Rs'' - .25Ls'')) + .75Rs'' - .25Ls'' Ls = A5(.75Ls'' - .25Rs'') Rs = A6(.75Rs'' - .25Ls'') where
    Figure 00160001
  • The effect of the circuit of Fig. 7 is to re-matrix interim channels Ls'' and Rs'' with the normalization coefficients A5 and A6. The out-of-phase (or surround channel) signals cumulatively combine, whereas the in-phase (or center channel) signals differentially combine. Re-matrixing the Ls'' and Rs'' signals causes a corresponding reduction in amplitude of any center channel signal component which may be present in Ls'' or Rs'' during a difference dominant, uncorrelated input signal condition at Lt and Rt Although the process of re-matrixing the Ls'' and Rs'' signals further reduces the stereophonic content of Ls'' and Rs'', the contribution of Lt to Ls'' is still dominant over the contribution of Rt to Ls''. Likewise the contribution of Rt to Rs'' is still dominant over the contribution of Lt to Rs''. Thus the rematrixed Ls'' and Rs'' signals still retain a stereophonic characteristic when the signal conditions at Lt and Rt are substantially uncorrelated. With consideration to panned monaural, correlated out-of-phase input conditions at Lt and Rt, it is helpful to re-examine the nature of the signals Ls'', Rs'', Lo' and Ro'. The normalized contributions of Lt and Rt at Ls'' and Rs'' are substantially monaural in nature when the input signal conditions at Lt and Rt are correlated but out-of-phase, independent of the relative amplitudes of signals Lt and Rt. The normalized contributions of Lt and Rt at Ls'' and Rs'' are equal to the lesser of the two input signals Lt and Rt, whenever their relative amplitudes differ. Thus a correlated , difference dominant, Lt dominant input signal condition at Lt and Rt will result in contributions from Lt and Rt to Ls'' and Rs'' which are equal to the Rt input signal amplitude.
  • Since these signals are removed from Lt and Rt to produce interim signals Lo' and Ro' (as shown in Fig. 6), the Lo' interim signal contains the differential surround channel signal that was dominant in Lt. The same observation can be made of the interim signal Ro' for input signals at Lt and Rt which are correlated out-of-phase and Rt dominant The outputs of multipliers (A5) or (A6) are equal in amplitude to the contribution of Lt or Rt at Ls'' or Rs'' which is a component of the originating encoded Ls or Rs input signal conditions. As such, all Lt dominant and difference signal dominant input signal conditions are defined as Ls dominant output signal conditions. Likewise, all Rt dominant and difference signal dominant input signal conditions are defined to be Rs dominant output signal conditions. The directionally cardinal Ls or Rs encoded signal conditions are decoded as cardinal Ls or Rs output signal conditions. In this regard, the decoder is the complement of the encoded signal conditions. It is also instructive to consider that the output signals at Ls and Rs are approximately zero whenever the encoded signals at Lt and Rt are equal amplitude signals. For this condition, the encoded signals are decoded to the Lcs and Rcs output terminals. In this regard, the decoded output signal conditions are the directional complement of the encoded signal conditions.
  • Referring now to Fig. 8, the nature of the decoding method disclosed is such that a signal can be cardinally decoded to the following output terminals: Lo 62, Lc 50 Rc 70, Ro 66, Ls 72, Lcs 68, Rcs 56, Rs 74 placed relative to a listener 78 as indicated.
  • It is possible to decode matrix encoded Lt and Rt signals to six directionally cardinal locations in a 360-degree space. Interim directional locations are "phantom" sources based upon the presence of the decoded signal in multiple channels. For example, a signal can be encoded and subsequently decoded in a complementary manner; to appear at any point between the left channel output and left surround channel output. Likewise, a signal can be encoded and subsequently decoded in a complementary manner, to appear anywhere between the right output channel and right surround output channel. Thus a signal can be encoded and subsequently decoded to appear at any point within a 360-degree spatial angle.
  • The rendering of sources adjacent to the left or right side of a listener are more readily perceived when a physical reproduction channel exists at the prescribed spatial angle. The availability of a greater number of presentation channels, particularly in larger commercial venues such as motion picture theatres, which use a larger number of reproduction channels, takes special advantage of this aspect of the invention.
  • It is possible to utilize the greater number of reproduction loudspeakers in a commercial system to better advantage, by combining the pair-wise decoding technique disclosed in Fig. 17 and its description on page 16 of co-pending U.S. Patent Application 08/796285 with the decoding technique now disclosed herein, such that the opposite channel information contained in either the matrix decoded Lt /Rt signals or the originating discrete media are processed to produce additional cardinal presentation channels adjacent to the left side and right side of an attending audience.
  • In many applications, it is not practical to employ as many as eight physical reproduction loudspeakers. Contemporary home reproduction systems are more typically configured with five physical reproduction loudspeakers. Furthermore, the introduction of 5.1 channel, discrete media presentation systems has defined the number of physical reproduction loudspeakers typically utilized. For reasons of convenience (i.e., a limited number of physical presentation loudspeakers and compatibility with discrete media presentation formats), it may be desirable to down-mix the number of decoded output channels of the disclosed algorithm for reproduction via five physical reproduction channels. This can be done by combining the channels as indicated: C = .707 (Lc + Rc) Ls = .707 (Lcs + Ls) Rs = .707 (Rcs + Rs)
  • Down-mixing the decoded output channels does not reduce the number of cardinal directional states, but rather, the way in which the cardinal directions are reproduced. The cardinal Ls and Rs directional states are still retained. The stereophonic nature of the signals at Ls and Rs is likewise preserved. The exclusive Lcs/Rcs output condition is now reproduced as equal amplitude signals at Ls and Rs. Similarly, the Lc and Rc output signals appear at the single center channel output, thus retaining the cardinal center only direction.
  • Referring to Fig. 9, there is shown the combined circuits of Figs. 3, 4, 5, 6, and 8. The composite block diagram of Fig. 9 is constructed from the individual block diagrams of Figs. 4,5,6, and 7. Boolean switches 80 and 82 have been incorporated into Fig. 9 to enable or disable center channel decoding or surround channel decoding or both. When both sets of switches are in the off state, the input signals at Lt and Rt are presented at the Lo and Ro output terminals. Setting the surround channel mode switches to the off state, presents the surround channel signals at Lt and Rt to the Lo and Ro output terminals. Similarly, setting the center channel mode switches to the off state presents the center channel signals at Lt and Rt to the Lo and Ro output terminals.
  • In many instances, the number of provided reproduction channels are fewer than the number of available presentation channels. In these instances, it is advantageous to process the lesser number of reproduction channels such that the derived number of reproduction channels are equal to the number of available presentation channels. Moreover, contemporary signal transport formats convey as few as one channel or as many as five channels with an attending (spectrally limited) low frequency effects channel. In some signal transport formats, such as Dolby AC-3, information identifying the intended reproduction channel format are included as supplementary data within the transport format. It is possible to utilize the supplementary data as a means of re-formatting the number of intended reproduction channels for further processing into the number of available presentation channels. The provided reproduction channel information is defined in terms of the number of front and rear (surround) reproduction channels. The most widespread formats are:
  • (1) three front channels two rear channels (stereo surround)
  • (2) two front channel two rear channels (no center channel)
  • (3) three front channels two rear channels (mono surround)
  • (4) two channels (stereo) (no center or surround channels)
  • (5) two channels Lt/Rt matrix encoded
  • It should be understood that other intended reproduction formats are possible, and it is likewise possible to process other intended reproduction formats using the techniques disclosed herein.
  • In all cases, it is desirable to process only the necessary channels to obtain the desired number of presentation channels. For all illustrations to follow, assume the number of presentation channels available to be five. As such, the Lcs, Rcs, Lc and Rc outputs of the decoding system shown in Fig. 9 are assumed to have been down-mixed as previously described. The number of available presentation channel signals, however, are not limited to five.
  • For format (1), the channels are processed discretely.
  • For format (2), only the provided left and right reproduction channels are processed as Rt and Lt to obtain a new left, right, and (the derived) center presentation channel signal(s). The originating surround channel signals of format (2) are not processed, and the surround channel mode switches 80 in the block diagram of Fig. 9 are set to the off state.
  • For format (3) the given channel format is first converted to a matrix format for processing. This is accomplished by first down-mixing the given monaural surround channel into the given left channel to form Lnew and further down-mixing (out-of-phase) the given monaural surround channel into the given right channel to form Rnew. Lnew and Rnew are subsequently input into the decoder to obtain new left, right, left surround and right surround presentation channels. The center channel mode switches 80 are set to the off state, since the originating center channel signal is not processed and is reproduced as given.
  • For formats (4) and (5), the given signals are input into the circuitry of fig. 9, as Lt and Rt.
  • The pre-processing for the various formats is summarized in Fig. 10.
  • Other embodiments are within the claims.

Claims (16)

  1. A method for processing multi-channel audio signals, comprising:
    determining the degree of correlation of two of the channels;
    responsive to a determining that said two channels are correlated, normalizing said channels according to a first normalization mode; and
    responsive to a determining that said two channels are uncorrelated, normalizing said channels according to a second normalization mode.
  2. A method for processing multi-channel audio signals, comprising:
    determining the degree of correlation of two of the channels;
    responsive to a determining that said two channels are partially correlated and partially uncorrelated, processing said channels according to a combination of a first normalization mode and a second normalization mode.
  3. A method in accordance with claim 2, wherein said combination is a linearly weighted combination of said first mode and said second mode.
  4. A method in accordance with any of claims 1 to 3, wherein said first normalization mode is a differential mode.
  5. A method in accordance with any of claims 1 to 4, wherein said second normalization mode is a common mode.
  6. A method in accordance with claim 4, further comprising determining the phase relationship of said two channels.
  7. A method in accordance with claim 6, wherein responsive to a determining that said two channels are substantially out of phase, said differential mode is difference signal dominant.
  8. A method in accordance with claim 3, wherein responsive to a determining that said two channels are substantially in phase, said differential mode is sum signal dominant.
  9. A method in accordance with claim 5, further comprising the step of determining an absolute value of a sum signal of said two channels and an absolute value of a difference of said two channels.
  10. A method in accordance with claim 9, responsive to a determining that said absolute value of said sum signal is greater than said absolute value of said difference signal, said common mode is sum signal dominant.
  11. A method in accordance with claim 9, responsive to a determining that said absolute value of said difference signal is greater than said absolute value of said sum signal, said common mode is difference signal dominant.
  12. A method for decoding an encoded multi-channel audio signal, comprising:
    determining the correlation of a first channel and a second channel;
    processing said first channel and said second channel to produce a third channel and a fourth channel.
  13. A method in accordance with claim 12, further comprising determining an absolute value of a sum of said first channel and said second channel.
  14. A method in accordance with claim 12 or claim 13, wherein, responsive to a determining that said first channel and said second channel are substantially correlated, or respectively responsive to said absolute value of said sum signal being greater that said absolute value of said difference signal, said third channel and said fourth channel are substantially correlated.
  15. A method in accordance with claim 12 or claim 13, wherein, responsive to a determining that said first channel and said second channel are uncorrelated, or respectively responsive to said absolute value of said difference signal being greater than said absolute value of said difference signal, said third channel and said fourth channel are substantially uncorrelated.
  16. An apparatus for processing multi-channel audio signals, comprising:
    an input characteristics determiner for determining a degree of correlation of two of the channels;
    a first normalizing multiplier, coupled to said input characteristics determiner, for applying a first normalizing coefficient to a first of said two channels, said normalizing coefficient being responsive to said degree of correlation; and
    a second normalizing multiplier, coupled to said input characteristics determiner, for applying a second normalizing coefficient to said second signal, said normalizing coefficient being responsive to said degree of correlation.
EP00304128A 1999-05-17 2000-05-16 Directional decoding Withdrawn EP1054575A3 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US313058 1981-10-19
US09/313,058 US7016501B1 (en) 1997-02-07 1999-05-17 Directional decoding

Publications (2)

Publication Number Publication Date
EP1054575A2 true EP1054575A2 (en) 2000-11-22
EP1054575A3 EP1054575A3 (en) 2002-09-18

Family

ID=23214203

Family Applications (1)

Application Number Title Priority Date Filing Date
EP00304128A Withdrawn EP1054575A3 (en) 1999-05-17 2000-05-16 Directional decoding

Country Status (4)

Country Link
EP (1) EP1054575A3 (en)
JP (1) JP2000350300A (en)
CN (1) CN100349497C (en)
HK (1) HK1032705A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002063925A2 (en) * 2001-02-07 2002-08-15 Dolby Laboratories Licensing Corporation Audio channel translation
WO2004019656A2 (en) * 2001-02-07 2004-03-04 Dolby Laboratories Licensing Corporation Audio channel spatial translation
WO2007008004A3 (en) * 2005-07-11 2007-03-15 Lg Electronics Inc Apparatus and method of encoding and decoding audio signal
AU2003278704B2 (en) * 2002-08-07 2009-04-23 Dolby Laboratories Licensing Corporation Audio channel spatial translation
US7660424B2 (en) 2001-02-07 2010-02-09 Dolby Laboratories Licensing Corporation Audio channel spatial translation
US8793125B2 (en) 2004-07-14 2014-07-29 Koninklijke Philips Electronics N.V. Method and device for decorrelation and upmixing of audio channels
WO2017049397A1 (en) * 2015-09-25 2017-03-30 Voiceage Corporation Method and system using a long-term correlation difference between left and right channels for time domain down mixing a stereo sound signal into primary and secondary channels

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005086139A1 (en) * 2004-03-01 2005-09-15 Dolby Laboratories Licensing Corporation Multichannel audio coding
US7283634B2 (en) * 2004-08-31 2007-10-16 Dts, Inc. Method of mixing audio channels using correlated outputs
US7787631B2 (en) * 2004-11-30 2010-08-31 Agere Systems Inc. Parametric coding of spatial audio with cues based on transmitted channels
AU2006291689B2 (en) 2005-09-14 2010-11-25 Lg Electronics Inc. Method and apparatus for decoding an audio signal
CN101341533B (en) * 2005-09-14 2012-04-18 Lg电子株式会社 Method and apparatus for decoding an audio signal
CN101411214B (en) * 2006-03-28 2011-08-10 艾利森电话股份有限公司 Method and arrangement for a decoder for multi-channel surround sound

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0615399A1 (en) 1993-03-09 1994-09-14 Matsushita Electric Industrial Co., Ltd. Sound field controller
US5727068A (en) 1996-03-01 1998-03-10 Cinema Group, Ltd. Matrix decoding method and apparatus

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5046098A (en) * 1985-03-07 1991-09-03 Dolby Laboratories Licensing Corporation Variable matrix decoder with three output channels
JP2512038B2 (en) * 1987-12-01 1996-07-03 松下電器産業株式会社 Sound field playback device
JPH03110999A (en) * 1989-09-25 1991-05-10 Sharp Corp 4-channel stereo emphasis circuit
JPH05236599A (en) * 1992-02-21 1993-09-10 Clarion Co Ltd Acoustic reproducing device with three speakers
EP0593128B1 (en) * 1992-10-15 1999-01-07 Koninklijke Philips Electronics N.V. Deriving system for deriving a centre channel signal from a stereophonic audio signal
US6711266B1 (en) * 1997-02-07 2004-03-23 Bose Corporation Surround sound channel encoding and decoding

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0615399A1 (en) 1993-03-09 1994-09-14 Matsushita Electric Industrial Co., Ltd. Sound field controller
US5727068A (en) 1996-03-01 1998-03-10 Cinema Group, Ltd. Matrix decoding method and apparatus

Cited By (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002063925A2 (en) * 2001-02-07 2002-08-15 Dolby Laboratories Licensing Corporation Audio channel translation
WO2002063925A3 (en) * 2001-02-07 2004-02-19 Dolby Lab Licensing Corp Audio channel translation
WO2004019656A2 (en) * 2001-02-07 2004-03-04 Dolby Laboratories Licensing Corporation Audio channel spatial translation
WO2004019656A3 (en) * 2001-02-07 2004-10-14 Dolby Lab Licensing Corp Audio channel spatial translation
KR100904985B1 (en) * 2001-02-07 2009-06-26 돌비 레버러토리즈 라이쎈싱 코오포레이션 Audio channel translation
US7660424B2 (en) 2001-02-07 2010-02-09 Dolby Laboratories Licensing Corporation Audio channel spatial translation
AU2003278704B2 (en) * 2002-08-07 2009-04-23 Dolby Laboratories Licensing Corporation Audio channel spatial translation
KR100988293B1 (en) * 2002-08-07 2010-10-18 돌비 레버러토리즈 라이쎈싱 코오포레이션 Audio channel spatial translation
US8793125B2 (en) 2004-07-14 2014-07-29 Koninklijke Philips Electronics N.V. Method and device for decorrelation and upmixing of audio channels
WO2007008004A3 (en) * 2005-07-11 2007-03-15 Lg Electronics Inc Apparatus and method of encoding and decoding audio signal
US7411528B2 (en) 2005-07-11 2008-08-12 Lg Electronics Co., Ltd. Apparatus and method of processing an audio signal
US7830921B2 (en) 2005-07-11 2010-11-09 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US7835917B2 (en) 2005-07-11 2010-11-16 Lg Electronics Inc. Apparatus and method of processing an audio signal
US7930177B2 (en) 2005-07-11 2011-04-19 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signals using hierarchical block switching and linear prediction coding
US7949014B2 (en) 2005-07-11 2011-05-24 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US7962332B2 (en) 2005-07-11 2011-06-14 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US7966190B2 (en) 2005-07-11 2011-06-21 Lg Electronics Inc. Apparatus and method for processing an audio signal using linear prediction
US7987008B2 (en) 2005-07-11 2011-07-26 Lg Electronics Inc. Apparatus and method of processing an audio signal
US7987009B2 (en) 2005-07-11 2011-07-26 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signals
US7991272B2 (en) 2005-07-11 2011-08-02 Lg Electronics Inc. Apparatus and method of processing an audio signal
US7991012B2 (en) 2005-07-11 2011-08-02 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US7996216B2 (en) 2005-07-11 2011-08-09 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8010372B2 (en) 2005-07-11 2011-08-30 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8032240B2 (en) 2005-07-11 2011-10-04 Lg Electronics Inc. Apparatus and method of processing an audio signal
US8032386B2 (en) 2005-07-11 2011-10-04 Lg Electronics Inc. Apparatus and method of processing an audio signal
US8032368B2 (en) 2005-07-11 2011-10-04 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signals using hierarchical block swithcing and linear prediction coding
US8046092B2 (en) 2005-07-11 2011-10-25 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8050915B2 (en) 2005-07-11 2011-11-01 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signals using hierarchical block switching and linear prediction coding
US8055507B2 (en) 2005-07-11 2011-11-08 Lg Electronics Inc. Apparatus and method for processing an audio signal using linear prediction
CN101218628B (en) * 2005-07-11 2011-11-09 Lg电子株式会社 Apparatus and method of encoding and decoding an audio signal
US8065158B2 (en) 2005-07-11 2011-11-22 Lg Electronics Inc. Apparatus and method of processing an audio signal
US8108219B2 (en) 2005-07-11 2012-01-31 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8121836B2 (en) 2005-07-11 2012-02-21 Lg Electronics Inc. Apparatus and method of processing an audio signal
US8149877B2 (en) 2005-07-11 2012-04-03 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8149876B2 (en) 2005-07-11 2012-04-03 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8149878B2 (en) 2005-07-11 2012-04-03 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8155152B2 (en) 2005-07-11 2012-04-10 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8155153B2 (en) 2005-07-11 2012-04-10 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8155144B2 (en) 2005-07-11 2012-04-10 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8180631B2 (en) 2005-07-11 2012-05-15 Lg Electronics Inc. Apparatus and method of processing an audio signal, utilizing a unique offset associated with each coded-coefficient
US8255227B2 (en) 2005-07-11 2012-08-28 Lg Electronics, Inc. Scalable encoding and decoding of multichannel audio with up to five levels in subdivision hierarchy
US8275476B2 (en) 2005-07-11 2012-09-25 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signals
US8326132B2 (en) 2005-07-11 2012-12-04 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8417100B2 (en) 2005-07-11 2013-04-09 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US8510119B2 (en) 2005-07-11 2013-08-13 Lg Electronics Inc. Apparatus and method of processing an audio signal, utilizing unique offsets associated with coded-coefficients
US8510120B2 (en) 2005-07-11 2013-08-13 Lg Electronics Inc. Apparatus and method of processing an audio signal, utilizing unique offsets associated with coded-coefficients
US8554568B2 (en) 2005-07-11 2013-10-08 Lg Electronics Inc. Apparatus and method of processing an audio signal, utilizing unique offsets associated with each coded-coefficients
WO2017049397A1 (en) * 2015-09-25 2017-03-30 Voiceage Corporation Method and system using a long-term correlation difference between left and right channels for time domain down mixing a stereo sound signal into primary and secondary channels
US10319385B2 (en) 2015-09-25 2019-06-11 Voiceage Corporation Method and system for encoding left and right channels of a stereo sound signal selecting between two and four sub-frames models depending on the bit budget
US10325606B2 (en) 2015-09-25 2019-06-18 Voiceage Corporation Method and system using a long-term correlation difference between left and right channels for time domain down mixing a stereo sound signal into primary and secondary channels
US10339940B2 (en) 2015-09-25 2019-07-02 Voiceage Corporation Method and system for encoding a stereo sound signal using coding parameters of a primary channel to encode a secondary channel
US10522157B2 (en) 2015-09-25 2019-12-31 Voiceage Corporation Method and system for time domain down mixing a stereo sound signal into primary and secondary channels using detecting an out-of-phase condition of the left and right channels
US10573327B2 (en) 2015-09-25 2020-02-25 Voiceage Corporation Method and system using a long-term correlation difference between left and right channels for time domain down mixing a stereo sound signal into primary and secondary channels
RU2728535C2 (en) * 2015-09-25 2020-07-30 Войсэйдж Корпорейшн Method and system using difference of long-term correlations between left and right channels for downmixing in time area of stereophonic audio signal to primary and secondary channels
US10839813B2 (en) 2015-09-25 2020-11-17 Voiceage Corporation Method and system for decoding left and right channels of a stereo sound signal
US10984806B2 (en) 2015-09-25 2021-04-20 Voiceage Corporation Method and system for encoding a stereo sound signal using coding parameters of a primary channel to encode a secondary channel
US11056121B2 (en) 2015-09-25 2021-07-06 Voiceage Corporation Method and system for encoding left and right channels of a stereo sound signal selecting between two and four sub-frames models depending on the bit budget
RU2763374C2 (en) * 2015-09-25 2021-12-28 Войсэйдж Корпорейшн Method and system using the difference of long-term correlations between the left and right channels for downmixing in the time domain of a stereophonic audio signal into a primary channel and a secondary channel

Also Published As

Publication number Publication date
HK1032705A1 (en) 2001-07-27
CN100349497C (en) 2007-11-14
JP2000350300A (en) 2000-12-15
CN1281329A (en) 2001-01-24
EP1054575A3 (en) 2002-09-18

Similar Documents

Publication Publication Date Title
EP0858243B1 (en) Surround sound channel encoding and decoding
US7003467B1 (en) Method of decoding two-channel matrix encoded audio to reconstruct multichannel audio
US5757927A (en) Surround sound apparatus
EP0966865B1 (en) Multidirectional audio decoding
EP0553832B1 (en) Sound field controller
KR101215872B1 (en) Parametric coding of spatial audio with cues based on transmitted channels
AU2005262025B2 (en) Apparatus and method for generating a multi-channel output signal
US7945054B2 (en) Method and apparatus to reproduce wide mono sound
US8976972B2 (en) Processing of sound data encoded in a sub-band domain
EP1054575A2 (en) Directional decoding
WO2006060278A1 (en) Synchronizing parametric coding of spatial audio with externally provided downmix
US7016501B1 (en) Directional decoding
EP0571455B1 (en) Sound reproduction system
EP0629335B1 (en) Surround sound apparatus
US5844993A (en) Surround signal processing apparatus
US7929054B2 (en) Up-sampling television audio signals for encoding
US6882733B2 (en) Surround headphone output signal generator
CN114143699B (en) Audio signal processing method and device and computer readable storage medium
EP1061775A2 (en) Audio signal processing
WO2024081957A1 (en) Binaural externalization processing

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

17P Request for examination filed

Effective date: 20030305

AKX Designation fees paid

Designated state(s): DE FR

17Q First examination report despatched

Effective date: 20101013

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20111107