Description
APPARATUS AND METHOD OF PROCESSING MULTI¬ CHANNEL AUDIO INPUT SIGNALS TO PRODUCE AT LEAST TWO CHANNEL OUTPUT SIGNALS THEREFROM, AND
COMPUTER READABLE MEDIUM CONTAINING EXECUTABLE CODE TO PERFORM THE METHOD
Technical Field
[1] The present general inventive concept relates to a method of reproducing an input
5.1 channel sound (or another multi-channel group) through a 2 channel speaker system. More particularly, the present general inventive concept relates to audio post¬ processing technology in which input left and right surround channel sounds form a sound image at a left rear and a right of a listener position while input left, center, low frequency effect (LFE), and right channel sounds form a natural surround sound by correcting an output gain and a delay thereof.
Background Art
[2] A conventional virtual speaker device that uses down-mixing technology is disclosed in WO99/49574. FIG. 1 is a block diagram illustrating the conventional virtual speaker device.
[3] The conventional virtual speaker device of FIG. 1 can make a 6-channel surround effect through a 2-channel speaker. The conventional virtual speaker device is comprised of an impulse response for each channel obtained from a head related transfer function (HRTF), a convolution unit for convolving an input signal with an impulse response for each channel, and an adding unit for adding the convolved signals to produce two channels (i.e., a left channel and a right channel).
[4] More specifically, the conventional virtual speaker device of FIG. 1 convolves audio input signals of a left front channel, a right front channel, a center front channel, a left surround channel, a right surround channel, and a low frequency effect (LFE) channel with corresponding impulse responses to produce two signals, namely, left and right signals for each channel (i.e., the left and right channels). Right signals for the 6 channels are added together, and left signals for the 6 channels are added together, thereby producing two channel output signals, namely, a left channel signal and a right channel signal.
[5] When the two channel output signals are reproduced, a surround effect is produced such that a left virtual speaker, a right virtual speaker, a center virtual speaker, a left- surround virtual speaker, and a right surround virtual speaker seem to be placed around
a listener
[6] However, when a correlation between the left surround channel and the right surround channel is high, the conventional virtual speaker device of FIG 1 has difficulty in forming a sound image at a rear of the listener.
[7] A high correlation between two channels indicates that the two channels have almost the same sound characteristics A description of why it is difficult to form the sound image at the rear of the listener when the correlation is high is as follows
[8] A virtual sound source is formed using an HRTF, which is a characteristic of an acoustic signal at the ears of a human depending on shapes of the head and ears of the human. The HRTF can perceive a 3-dimensional audio sound, because of a phenomenon where paths of the acoustic signals differ. The paths include a simple path difference resulting from an inter-aural level difference (ILD) or an mter-aural time difference (ITD) In addition, a complicated path difference resulting from a changing direction of the acoustic signals due to a diffraction at a surface of the head of the human (i e , a listener), a reflection by an auricle of the human, or the like Since the HRTF in each of the horizontal and vertical directions has unique characteristics, the 3-dimensonal audio sound can be produced using the HRTF
[9] The HRTF can easily distinguish between a left side and a right side on a horizontal plane However, an error of the HRTF makes it difficult to distinguish between a front position and a rear position on the horizontal plane In order to distinguish between the front position and the rear position, accurate frequency characteristics of an actual user should be measured However, if a standard dummy head is used to measure frequency characteristics, front/back confusion occurs due to a difference between frequency characteristics of the standard dummy head and the actual human listener
[10] In order to create a surround channel effect, surround channels must form sound images at a left rear and a right rear of the listener. Disclosure of Invention
Technical Problem
[11] However, when a correlation between left and right surround channel audio input signals is high, the sound image is formed at a center rear of the listener instead of at the left and right rears of the listener In addition, due to the use of the standard dummy head, front/back confusion occurs Thus it is difficult to obtain the surround channel effect.
Technical Solution
[12] The present general inventive concept provides an apparatus and method of re¬ producing m-channel audio input signals using an n-channel speaker system, n being less than m, in which a surround effect obtained by an m-channel speaker system can
be obtained even when the n-channel speaker system includes two (or more) speakers.
Additionally, surround channel audio input signals are transformed into virtual speakers at a left rear and a right rear of a listener position so that the listener can perceive a surround effect.
Advantageous Effects [13] As described above, in an apparatus and method of reproducing multi-channel audio input signals through two channels according to various embodiments of the present general inventive concept, even when the multi-channel audio input signals are reproduced through 2 channels, a surround effect provided by a multi-channel speaker system can be obtained. [14] In addition, since left and right surround channel audio input signals are transformed into virtual speakers at a left rear and a right rear of a listener position, the listener can perceive a surround effect [15] Furthermore, even when a correlation between the left and right surround channel audio input signals is high, a localization of a sound can be improved, and a presence is formed. Thus, an enhanced surround sound is provided to the listener.
Description of Drawings [16] FIG. 1 is a schematic configuration illustrating a conventional virtual speaker device based that uses a down-mixing technique;
[17] FIG. 2 is a schematic configuration illustrating an apparatus to process multi¬ channel audio input signals to produce two channel output signals according to an embodiment of the present general inventive concept; [18] FIG. 3 is a block diagram illustrating a virtual surround filter according to an embodiment of the present general inventive concept; [19] FIG. 4 is a block diagram illustrating a first filter unit according to an embodiment of the present general inventive concept; [20] FIG. 5 is a block diagram illustrating a first filter unit according to another embodiment of the present general inventive concept; [21] FIG. 6 is a configuration illustrating a virtual sound source generation unit according to an embodiment of the present general inventive concept; [22] FIG. 7 is a block diagram illustrating a calculation of the virtual sound source generation unit of FIG. 6; [23] FIG. 8 is a block diagram illustrating an output controller according to an embodiment of the present general inventive concept; [24] FIG 9 is a flowchart illustrating a method of processing multi-channel audio input signals to produce two channel output signals; [25] FIG. 10 is a flowchart illustrating a method of reducing a correlation between a left
surround channel audio input signal and a right surround channel audio input signal according to an embodiment of the present general inventive concept;
[26] FIGS. 1 IA and 1 IB are flowcharts illustrating methods of group-delaying specific frequency components of left and right surround channel audio input signals, re¬ spectively, according to another embodiment of the present general inventive concept;
[27] FIG. 12 is a flowchart illustrating a method of transforming left and right surround channel audio input signals into virtual sound sources; and
[28] FIG. 13 is a flowchart illustrating a method of controlling channel audio input signals other than left and right surround channel audio input signals.
Best Mode
[29] The foregoing and/or other aspects of the present general inventive concept are achieved by providing an apparatus to process m-channel audio input signals to produce n channel output signals, n being less than m, the apparatus including a first filter unit to reduce a correlation between at least two channel audio input signals among the m-channel audio input signals, a virtual sound source generation unit to transform the at least two channel audio input signals output by the first filter unit into virtual sound sources at predetermined positions around a listener position, and an output controller to control gams and delays of channel audio input signals other than the at least two channel audio input signals among the multi-channel audio input signals based on gains and delays of the at least two channel audio input signals output from the virtual sound source generation unit
[30] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing an apparatus to process m-channel audio input signals to produce n-channel output signals, n being less than m, the apparatus including a first filter unit to group delay a specific frequency component of at least two channel audio input signals among the m-channel audio input signals, a virtual sound source generation unit to transform the at least two channel audio input signals output from the first filter unit into virtual sound sources at predetermined positions around a listener position, and an output controller to control gains and delays of channel audio input signals other than the at least two channel audio input signals among the multi¬ channel audio input signals based on gains and delays of the at least two channel audio input signal output by the virtual sound source generation unit
[31] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a method of processing multi-channel audio input signals to produce n-channel output signals, n being less than m, the method including reducing a correlation between at least two channel audio input signals among the m-channel audio input signals, transforming the at least two channel audio input signals into virtual sound sources at predetermined positions around a listener position, and
controlling gains and delays of channel audio input signals other than the at least two channel audio input signals among the multi-channel audio input signals based on gams and delays of the at least two channel audio input signal transformed into the virtual sound sources
[32] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a method of processing m-channel audio input signals to produce n channel output signals, n being less than m, the method including group delaying a specific frequency component of at least two channel audio input signals among the multi-channel audio input signals, transforming the at least two channel audio input signals into virtual sound sources at predetermined positions around a listener position, and controlling gains and delays of channel audio input signals other than the at least two channel audio input signals among the multi-channel audio input signals based on gams and delays of the at least two channel audio input signals transformed into the virtual sound sources
[33] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a computer-readable medium containing executable code to perform the methods described above.
[34] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing an apparatus to produce surround sound with a plurality of channel signals in a system having a predetermined number of speakers, the pre¬ determined number of speakers being less than a number of the plurality of channel signals, the apparatus comprising a filter process unit to receive at least first and second channel signals of the plurality of channel signals having similar signal charac¬ teristics and to process the first and second channel signals differently such that the signal characteristics of the first and second channel signals are made different from each other, and a virtual sound unit to produce at least first and second virtual sound sources at predetermined positions within a sound field from the first and second channel signals having the different signal characteristics
[35] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing an apparatus to process n channel signals to produce a surround sound effect in a speaker system having m speakers, m being less than n, the apparatus comprising a filter unit to induce different delays in at least two of the n channel signals, and a virtual sound unit to receive the delayed at least two of the n channel signals and to localize the received at least two of the n channel signals at pre¬ determined positions around a listener position, and an output controller to control gai ns and delays of the n channel signals other than the at least two of the n channel signals according to gams and delays of the at least two of the n channel signals.
[36] The foregoing and/or other aspects of the present general inventive concept are also
achieved by providing a method of producing surround sound with a plurality of channel signals in a system having a predetermined number of speakers, the pre¬ determined number of speakers being less than a number of the plurality of channel signals, the method comprising receiving at least first and second channel signals of the plurality of channel signals having similar signal characteristics, processing the first and second channel signals differently such that the signal characteristics of the first and second channel signals are made different from each other, and producing at least first and second virtual sound sources at predetermined positions within a sound field from the first and second channel signals having the different signal charac¬ teristics.
[37] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a method of processing n channel signals to produce a surround sound effect in a speaker system having m speakers, m being less than n, the method comprising inducing different delays in at least two of the n channel signals, localizing the delayed at least two of the n channel signals at predetermined positions around a listener position, and controlling gains and delays of the n channel signals other than the at least two of the n channel signals according to gains and delays of the at least two of the n channel signals.
Mode for Invention
[38] Reference will now be made in detail to the embodiments of the present general inventive concept, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the like elements throughout. The em¬ bodiments are described below in order to explain the present general inventive concept while referring to the figures.
[39] FIG. 2 is a schematic configuration illustrating an apparatus to process multi¬ channel audio input signals 100 to produce two channel output signals. The apparatus of FIG. 2 includes a virtual surround filter 200, an output controller 300, adders 400, a left channel speaker 500, and a right channel speaker 600.
[40] The multi-channel audio input signals 100 comprise a left channel L, a center channel C, a low frequency effect channel LFE, a right channel R, a left surround channel Ls, and a right surround channel Rs. Although a group of 5.1 channels is described and illustrated in this embodiment, it should be understood that other multi¬ channel groups, such as a group of 6.1 channels or a group of 7.1 channels, may al¬ ternatively be used.
[41] The virtual surround filter 200 receives the left surround channel Ls and the right surround channel Rs from among the multi-channel audio input signals 100.
[42] The virtual surround filter 200 reduces a correlation between the left and right surround channels Ls and Rs and produces virtual sound sources at a left rear and a
right rear of a listener position. The operations of the virtual surround filter 200 are described below m greater detail with reference to FIGS. 3 through 7.
[43] The output controller 300 receives the left channel L, the center channel C, the low frequency effect channel LFE, and the right channel R from among the multi-channel audio input signals 100.
[44] The virtual sound filter 200 changes frequency characteristics of the left and right surround channels Ls and Rs to produce the virtual sound sources. The output controller 300 controls output gams and time delays of the left channel L, the center channel C, the low frequency effect channel LFE, and the right channel R.
[45] One of the adders 400 adds up left signals of the channel audio input signals 100 output by the virtual surround filter 200 and the output controller 300 and outputs the added left signals to the left channel speaker 500. The other adder 400 adds up right signals of the channel audio input signals 100 output by the virtual surround filter 200 and the output controller 300 and outputs the added right signals to the right channel speaker 600
[46] If a 6 1 -channel audio input signal used instead of the 5 1 channel audio input signal, a rear surround channel is used with the 5.1 channels In this case, a virtual surround filter which is the same as the virtual surround filter 200 is further included and receives two signals into which the corresponding rear surround channel audio input signal is divided
[47] If a 7.1 -channel audio input signal is received instead of the 5.1 channel audio input signal, two rear surround channels are used with the 5.1 channels. In this case, a virtual surround filter which is the same as the virtual surround filter 200 is further included and receives the corresponding two rear surround channel audio input signals
[48] FIG. 3 is a block diagram illustrating the virtual surround filter 200 (see FIG. 2) according to an embodiment of the present general inventive concept. The virtual surround filter of FIG. 3 includes a first filter unit 220 and a virtual sound source generation unit 280.
[49] The first filter unit 220 reduces a correlation between the left and right surround channel audio input signals Ls and Rs to improve a localization of a surround channel sound and simultaneously form a presence. When the first filter unit 220 is not used and the correlation between the left and right surround channel audio input signals Ls and Rs is high, a sound image is formed as a phantom image at a center rear of the listener position instead of at the left and right rears of the listener position. The sound image at the center rear of the listener position may be heard at a front of the listener position due to front/back confusion, which makes it difficult for the listener to perceive a surround effect.
[50] Thus, the first filter unit 220 reduces a sound correlation between the left and right
surround channel sound signals Ls and Rs and forms a presence, to thereby produce a natural surround channel effect A configuration of the first filter unit 220 is described below with reference to FIGS 4 and 5
[51] The virtual sound source generation unit 280 receives signals output by the first filter unit 220 and forms the virtual sound sources at the left and right rears of the listener position to produce the surround effect A configuration of the virtual sound source generation unit 280 is described below with reference to FIGS 6 and 7
[52] FIG 4 is a block diagram illustrating a configuration of the first filter unit 220 (see
FIG 3) according to an embodiment of the present general inventive concept The first filter unit 220 is implemented with a plurality of delay units which can be asymmetrical, a plurality of gain units, and a plurality of adders.
[53] The first filter unit 220 includes a first delay unit 221, a second delay unit 222, a third delay unit 223, a fourth delay unit 224, a first gain unit 225, a second gam unit 226, a first adder 228, a second adder 227, a first filter 229, a second filter 230, a third filter 231, a fourth filter 232, a fifth delay unit 233, a sixth delay unit 234, a third gain unit 235, a fourth gam unit 236, a third adder 237, and a fourth adder 238
[54] The first delay unit 221 delays the left surround channel sound signal Ls for a first predetermined period of time In the present embodiment, the first delay unit 221 is a delay filter having a transfer function of Z m
[55] The second delay unit 222 delays the right surround channel sound signal Rs for a second predetermined period of time. In the present embodiment, the second delay unit 222 is a delay filter having a transfer function of Z m J β RR
[56] The first and second delay units 221 and 222 are asymmetrical. In other words, the first and second delay units 221 and 222 delay received signals (i e , the left surround channel signal Ls and the right surround channel signal Rs) for different periods of time (i e., the first and second predetermined periods of time)
[57] The third delay unit 223 delays the left surround channel sound signal Ls for a third predetermined period of time In the present embodiment, the third delay unit 223 is a delay filter having a transfer function of Z m
[58] The fourth delay unit 224 delays the right surround channel sound signal Rs for a fourth predetermined period of time In the present embodiment, the fourth delay unit 224 is a delay filter having a transfer function of Z m
[59] The third and fourth delay units 223 and 224 are also asymmetrical In other words, the third and fourth delay units 223 and 224 delay received signals (i e , the left surround channel signal Ls and the right surround channel signal Rs) for different periods of time (i e , the third and fourth predetermined periods of time)
[60] The first gam unit 225 changes an output gain of the third delay unit 223, and the second gam unit 226 changes a gain of the fourth delay unit 224
[61] The first adder 228 adds an output of the first delay unit 221 and an output of the second gain unit 226. The second adder 227 adds an output of the second delay unit 222 and an output of the first gain unit 225.
[62] The first and second gain units 225 and 226 reduce output gains of the left and right surround channel audio input signals Ls and Rs delayed for the predetermined periods of time by the third and fourth delay units 223 and 224, respectively. The first and second gain units 225 and 226 prevent the two surround channel audio input signals from being mixed.
[63] The first filter 229 filters an output signal of the first adder 227, and the second filter 230 filters an output signal of the second adder 228. Output signals of the first and second filters 229 and 230 are applied to the virtual sound source generation unit 280.
[64] The fifth delay unit 233 delays a signal that passes through the first and third filters
229 and 231 for a fifth predetermined period of time. In the present embodiment, the fifth delay unit 233 is a delay filter having a transfer function of Z"m
[65] The sixth delay unit 234 delays a signal that passes through the second and fourth filters 230 and 232 for a sixth predetermined period of time. In the present embodiment, the sixth delay unit 234 is a delay filter having a transfer function of Z"m . The fifth and sixth delay units 233 and 234 are asymmetrical. In other words, the fifth and sixth delay units 233 and 234 delay received signals for different periods of time (i.e., the fifth and sixth predetermined periods of time). The transfer functions of the delay units 233, 221, 223, 224, 222, and 234 may be selected according to design specifications.
[66] The first, second, third, and fourth filters 229, 230, 231 , and 232 may be low pass filters (LPFs) to filter out a high-band component of a received signal.
[67] The third gam unit 235 changes an output gain of the fifth delay unit 233, and the fourth gain unit 236 changes an output gain of the sixth delay unit 234.
[68] The third adder 237 adds an output signal of the third gain unit 235 and the left surround channel audio input signal Ls. The fourth adder 238 adds an output signal of the fourth gain unit 236 and the right surround channel audio input signal Rs.
[69] FIG. 5 is a block diagram illustrating the first filter unit 220 (see FIG. 3) according to another embodiment of the present general inventive concept. The first filter unit of FIG. 5 has similar characteristics as those of the first filter unit of FIG. 4. However, the first filter unit of FIG. 5 can obtain a more natural presence than the first filter unit of FIG. 4 by using a full band filter applied to an artificial reverberator to artificially create reverberation characteristics of a space. A full band filter has a characteristic of group-delaying a specific frequency component. Due to the application of this char¬ acteristic of the full band filter, a mono signal can become a similar stereo signal.
[70] The first filter unit of FIG 5 is arranged so that two full band filters are de- pendently connected to each of the left and right surround channels Ls and Rs.
[71] The left channel surround audio input signal Ls is input to a first full band filter (Z
) 251. A gain of the left channel surround audio input signal Ls is changed while passing through a gain unit (GL) 252 Signals output by the gain unit (GL) 252 and the first full band filter (Z L ) 251 are added together by an adder 253 A gam of a signal output by the adder 253 is changed while passing through a gain unit ( GL) 254 A signal output by the gam unit ( - GL) 254 and the left channel surround audio input signal Ls are then added together by an adder 255
[72] The signal output by the adder 253 is input to a second full band filter (Z L ) 256. A gam of the signal output by the adder 253 is changed while passing through a gain unit (GL) 257 Signals output by the gam unit (GL) 257 and the second full band filter (Z L ) 256 are added together by an adder 258 A gam of a signal output by the adder 258 is changed while passing through a gam unit ( GL) 259. A signal output by the gain unit ( - GL) 259 and the signal output by the adder 253 are added together by an adder 260
[73] Two full band filters dependently connected to the right surround channel, namely, a third full band filter (Z ) 261 and a fourth full band filter (Z ) 266, are arranged in a similar manner to the first and second full band filters 251 and 256 connected to the left surround channel Operations of adders 263, 265, 268, and 270 may be similar to the adders 253, 255, 258, and 260, and operations of gain units (GR) 262 and 267 and ( GR) 264 and 269 may be similar to the gam units (GL) 252 and 257 and ( GL) 254 and 259, respectively
[74] When a mono signal is received, delay values of the four full band filters are set to be different as given by LO ≠ Ll ≠ RO ≠ Rl, to convert the mono signal into a stereo signal In order to maximize the reduction of the correlation due to the asymmetrical configuration of the first filter unit of FIG. 5, delay values of the two full band filters independently connected to each channel have a relationship of L0>Ll and R0>Rl or a relationship of L0<Ll and R0<Rl
[75] Gam units of each all pass filter typically have identical values, but in some cases, may have different values. In order to prevent the signals from being out-of-phase, the gam units GL and GR may have identical or different signs, but the gam units 254 and 259 should have identical gain, the gam units 252 and 257 should have identical gam, the gain units 264 and 269 should have identical gain, and the gam units 262 and 267 should have identical gam
[76] FIG. 6 is a configuration of the virtual sound source generation unit 280 (see FIG
3) according to an embodiment of the present general inventive concept The virtual sound source generation unit of FIG 6 transforms the left and right surround channel audio input signals Ls and Rs output by the first filter unit of FIG. 4 and/or 5 into
virtual sound sources at the left rear and the right rear of the listener position.
[77] The virtual sound source generation unit is arranged to receive the left and right surround channel audio input signals Ls and Rs output by the first filter unit, to convolve the left and right surround channel audio input signals Ls and Rs with four finite impulse response (FIR) filters KIl, K 12, K21, and K22, and to add the results of the convolutions together.
[78] The left and right surround channel audio input signals Ls and Rs are convolved with the FIR filters Kl 1 and K12, respectively, and two signals produced by these con¬ volutions are added together to produce a left channel output signal. The left and right surround channel audio input signals Ls and Rs are convolved with the FIR filters K21 and K22, respectively, and two signals produced by these convolutions are added together to produce a right channel output signal.
[79] The left and right channel output signals are added to the output signals of the output controller 300 (see FIG. 2) to produce final output signals of two channels.
[80] FIG. 7 is a block diagram illustrating a calculation of the virtual sound source generation unit 280 (see FIG. 3).
[81] The virtual sound source generation unit of FIG. 7 uses a binaural synthesis filter
(B , B , B , and B ), which is implemented as a first matrix of head related transfer functions (HRTFs) between a virtual sound source (i.e., a location thereof) and a virtual listener at the listener r position, ' and a crosstalk cancellation filter ( VC 11 , C 12 , C 21 , and C ), which is implemented as an inverse matrix of a second matrix of HRTFs between the virtual listener and each location of 2 channel outputs. The locations of the 2 channel outputs correspond to locations of 2 speakers.
[82] The binaural synthesis filter (B , B , B , and B ) is implemented using the
HRTFs, each of which is an acoustic transfer function between a sound source and an eardrum.
[83] The HRTF contains various information representing characteristics of a space where a sound is transferred, including a timing difference between right and left ears of the listener, a level difference between the right and left ears of the listener, and shapes of right and left pinnas of the listener. Particularly, the HRTF includes in¬ formation about the pinnas that critically affects localizations of upper and lower sound images. The information about the pinnas is usually obtained through measurements based on a dummy head, because modeling the pinnas having complicated shapes is difficult. A surround speaker is generally located between 90 ° and 110 ° with respect to a front center of the listener position. Hence, to localize a virtual speaker between 90 0 and 110 ° where the surround speaker is located , an HRTF is measured between 90 ° and 110 ° on each of the left and right sides from the front center of the listener position (e.g., by using the dummy head). The present general inventive concept is not
limited thereto. The HRTF may be measured between 80 and 130 degrees [84] HRTFs between a speaker located between 90 ° and 110 ° on the left side of the listener position and the left and right ears of a dummy head, respectively, are referred to as B 11 and B 21. HRTFs between a speaker located between 90 ° and 110 ° on the right side of the listener position and the left and right ears of the dummy head, re¬ spectively, are referred to as B and B . r J 12 22
[85] When the listener hears an output signal of the binaural synthesized filter through a headphone, the listener perceives that a sound image is positioned between 90 ° and 110 ° on each of the left and right sides of the listener position. Binaural synthesis provides best performance when a sound is reproduced through a headphone. On the other hand, when the sound is reproduced through two speakers, crosstalk occurs between the two speakers and the two ears of the listener, thereby degrading a sense of localization of the sound. In other words, although a sound of the left channel should only be heard in the left ear and a sound of the right channel should only be heard in the right ear, the left channel sound is heard by the right ear and the right channel sound is heard by the left ear due to the crosstalk between the two channels. Thus, the sense of localization is degraded, such that the sound image is not positioned at a desired location with respect to the listener position.
[86] Hence, the crosstalk cancellation filter (C , C , C , and C ) is designed so as to remove this crosstalk phenomenon. First, HRTFs between the listener position and two s ^peakers are measured to desig °n the crosstalk cancellation filter (C 11 , C 12 , C 21 , and C 22 ).
HRTFs between a speaker located at a specific position on the left side of the listener position (e.g., between 90° and 100° with respect to the front center of the listener position) and the left and right ears of a dummy head are, respectively, referred to as H and H , and HRTFs between a speaker located at the specific position on the right side of the listener position (e.g., between 90° and 100° with respect to the front center of the listener position) and the left and right ears of the dummy head are, respectively, referred to as H and H . In this case, a crosstalk cancellation filter matrix C(z) is
12 22 designed by inversing the second matrix of the HRTFs H , H , H and H as in
° J ° 11 12 21 22
Equation 1: [87]
[88] The binaural synthesis filter matrix (B , B , B , and B ) localizes virtual speakers at positions of the left and right surround speakers (e.g., between 90° and 100° with
respect to the front center of the listener position). The crosstalk cancellation filter matrix C(z) removes the crosstalk between two speakers and the left and right ears. Hence, a matrix K(z) used by the virtual sound source generation unit is calculated by multiplying the binaural synthesis filter matrix (B , B , B , and B ) and the crosstalk cancellation filter matrix C(z) as in Equation 2: [89]
'Kn (Z) K13 (Z) Cn(Z) Cn (Z) Bn (Z) B12 (Z) K21(Z) K22 (z)_ C21(Z) C22 (Z) _ B21(Z) B22 (Z) _ (?)
[90] FIG. 8 is a block diagram illustrating the output controller 300 (see FIG. 2) according to an embodiment of the present general inventive concept. The output controller includes gain units 310, 320, 330, and 340 and delay units 315, 325, 335, and 345.
[91] A gain of the left channel audio input signal L is changed while passing through the gain unit (Ga) 310. The left channel audio input signal L is then delayed by the delay unit Z 315.
[92] A gain of the center channel audio input signal C is changed while passing through the gain unit (Gb) 320. The center channel audio input signal C is then delayed by the delay unit Z Δ 325.
[93] A gain of the LFE channel audio input signal LFE is changed while passing through the gain unit (Gc) 330. The LFE channel audio input signal LFE is then delayed by the delay unit Z 335.
[94] A gain of the right channel audio input signal R is changed while passing through the gain unit (Gd) 340. The right channel audio input signal R is then delayed by the delay unit Z 345.
[95] While passing through the virtual surround filter 200 (see FIG. 2), the gains and delays of the left and right surround channel audio input signals Ls and Rs are also changed. In other words, the change of gains and delays of the left and right surround channel audio input signals Ls and Rs depends upon the components of the virtual surround filter 200 (see FIG. 2). Accordingly, the gains and delays applied by the gam units 310, 320, 330, 340, and delay units 315, 325, 335, and 345 to the left, center, LFE, and right channel audio input signals L, C, LFE, and R are adjusted based on the characteristics of the virtual surround filter 200 (see FIG. 2) .
[96] Values G a, Gb, Gc, and Gd of the gain units, which relate to an output gain, are determined through a comparison between an RMS power of input and output signals of the virtual surround filter 200 (see FIG. 2). A time delay value Z is obtained through an impulse response of or a group delay of the virtual surround filter 200 (see
FIG 2).
[97] FIG. 9 is a flowchart illustrating a method of processing multi-channel audio input signals to produce two channel output signals First, in operation SlOO, a correlation between a left surround channel audio input signal Ls and a right surround channel audio input signal Rs among the multi-channel audio input signals is reduced The reduction of the correlation is described below in greater detail.
[98] In operation S200, the left and right surround channel audio input signals Ls and
Rs having the reduced correlation are transformed into virtual sound sources at pre¬ determined positions around a listener position, namely, a left surround and a right surround.
[99] Then, in operation S300, gains and delays of channel audio input signals other than the left and right surround channel audio input signals Ls and Rs, namely, left, center, LFE, and right channel audio input signals L, C, LFE, and R, are controlled to correspond to the gains and delays of the virtual sound sources into which the left and right surround channel audio input signals Ls and Rs have been transformed in the operation S200.
[100] In operation S400, signals to be output via a left channel of the multi-channel audio input signals, on which the operations S200 and S300 have been performed, are added together (i.e., summed), and signals to be output via a right channel of the multi¬ channel audio input signals, on which the operations S200 and S300 have been performed, are added together. The signals of the left and right channels can then be output by left and right speakers, respectively.
[101] FIG. 10 is a flowchart illustrating a method of reducing a correlation between a left surround channel audio input signal Ls and a right surround channel audio input signal Rs, according to an embodiment of the present general inventive concept. First, in operation SlOl, the left surround channel audio input signal Ls is delayed for a first period of time. In operation S 102, the right surround channel audio input signal Rs is delayed for a second period of time.
[102] In operation S 103, the left surround channel audio input signal Ls is delayed for a third period of time. In operation S 104, the right surround channel audio input signal Rs is delayed for a fourth period of time
[103] In operation S 105, a gain of the left surround channel audio input signal Ls delayed in the operation S 103 is changed. In operation S 106, a gain of the right surround channel audio input signal Rs delayed in the operation S 104 is changed.
[104] In operation S 107, the left surround channel audio input signal Ls delayed in the operation SlOl is added to the right surround channel audio input signal Rs having the gam changed in the operation S 106. In operation S 108, the right surround channel audio input signal Rs delayed in the operation S 102 is added to a signal obtained by
changing the gain of the left surround channel audio input signal Ls in the operation
S105 [105] In operation S109, a high-band component of a signal obtained by the addition in the operation S 107 is filtered out In operation Sl 10, a high-band component of a signal added in the operation S 108 is filtered out [106] In operation Si ll, a gain of a signal obtained by the filtering in the operation S 109 is changed In operation S 112, a gain of a signal obtained by the filtering in the operation Sl 10 is changed [107] A signal obtained by the changing of the gam in the operation Si ll is then delayed for a fifth period of time in operation S 113, and the delayed signal is fed back to the operations SlOl and S 103. A signal obtained by the changing of the gam in the operation Sl 12 is then delayed for a sixth period of time in operation S114, and the delayed signal is fed back to the operations S 102 and S 104 [108] In the present embodiment, the first, second, third, fourth, fifth, and sixth periods of time are different from each other [109] FIG 1 IA is a flowchart illustrating a method of group-delaymg a specific frequency component of the left surround channel audio input signal Ls according to another embodiment of the present general inventive concept FIG 1 IB is a flowchart illustrating a method of group-delaying a specific frequency component of the right surround channel audio input signal Rs according to another embodiment of the present general inventive concept. [110] Referring to FIG 1 IA, in operation S151, the left surround channel audio input signal Ls is delayed for a first predetermined period of time. [Il l] In operation S 152, a gain of the left surround channel audio input signal Ls is changed. [112] In operation S153, signals obtained in the operations S151 and S152 are added t ogether A gain of the signal obtained by the addition in the operation S 153 is then changed in operation S 154, and the signal obtained in the operation S 154 is fed back to the operation S 151 [113] In operation S155, the signal obtained by the addition in the operation S153 is delayed for a second predetermined period of time. [114] In operation S 156, a gain of the signal obtained by the addition in the operation
S 153 is changed. [115] In operation S157, signals obtained in the operations S155 and S156 are added together. A sum of the signals is output as a virtual sound source. A gam of the signal obtained by the addition in the operation S157 is then changed in operation S158, and the signal obtained in the operation S158 is fed back to the operation S155. [116] The method illustrated in FIG. 1 IB is similar to the method illustrated in FIG 1 IA
with respect to the right surround channel audio input signal Rs
[117] Referring to FIG. 1 IB, in operation S 161, the right surround channel audio input signal Rs is delayed for a first predetermined period of time
[118] In operation S 162, a gain of the right surround channel audio input signal Rs is changed.
[119] In operation S163, signals obtained in the operations S161 and S162 are added together A gain of the signal obtained by the addition in the operation S 163 is then changed in operation S 164, and a signal obtained in the operation S 164 is fed back to the operation S 161.
[120] In operation S 165, the signal obtained by the addition in the operation S 163 is delayed for a second predetermined period of time.
[121] In operation S 166, a gain of the signal obtained by the addition in the operation
S 163 is then changed
[122] In operation S167, signals obtained in the operations S165 and S166 are added together A sum of the signals is output as a virtual sound source A gam of a signal obtained by the addition in the operation S167 is then changed in operation S168, and the signal obtained in the operation S168 is fed back to the operation S165.
[123] The first and second predetermined periods of time in the operations S151. S155,
S161, and S 165 of FIGS HA and HB are set to be different from each other. A delay time m the operation S 151 is longer than a delay time in the operation S 155, and a delay time in the operation S 161 is longer than a delay time in the operation S 165. Al ternatively, the delay time in the operation S 155 may be set to be longer than the delay time in the operation S 151, and the delay time in the operation S 165 may be set to be longer than the delay time in the operation S 161
[124] Gain changes in the above operations are typically identical, but in some cases, may be set to be different from each other.
[125] FIG. 12 is a flowchart illustrating in detail the operation S200 (see FIG. 9) of transforming the left and right surround channel audio input signals Ls and Rs into the virtual sound sources First, in operation S210, a first HRTF matrix B between the virtual sound source and the virtual listener at the listener position is calculated
[126] In operation S220, a second HRTF matrix between the virtual listener and 2 channel output locations (i e , a left speaker and a right speaker) is obtained, and an inverse matrix C of the second HRTF matrix is calculated.
[127] In operation S230, the first HRTF matrix B calculated m the operation S210 is multiplied by the inverse matrix C calculated in the operation S220 to obtain a product matrix
[128] In operation S240, a signal obtained in the operation SlOO (see FIG. 9) is convolved with the product matrix obtained in the operation S230.
[129] FIG. 13 is a flowchart illustrating in detail the operation S300 (see FIG. 9) of controlling the gains and delays of channel audio input signals other than the left and right surround channel audio input signals Ls and Rs. First, in operation S310, an RMS power of the left and right surround channel audio input signals Ls and Rs is compared with an RMS power of a signal obtained by the transformation in the operation S200 (see FIG. 9).
[130] In operation S320, the gains of the channel audio input signals other than the left and right surround channel audio input signals Ls and Rs, namely, the left, center, LFE, and right channel audio input signals L, C, LFE, and R, are controlled based on a difference between the RMS powers obtained in the operation S310.
[131] In operation S330, the left, center, LFE, and right channel audio input signals L, C,
LFE, and R are delayed for a period of time that corresponds to the period of time during which the left and right surround channel audio input signals Ls and Rs are transformed into the virtual sound sources.
Industrial Applicability
[132] The present general inventive concept may be embodied in hardware, software, or a combination thereof. For example, the present general inventive concept may be embodied by a computer running a program from a computer-readable medium, including but not limited to storage media such as magnetic storage media (ROMs, RAMs, floppy disks, magnetic tapes, etc.), optically readable media (CD-ROMs, DVDs, etc.), and carrier waves (transmission over the internet). The present general inventive concept may be embodied as a computer-readable medium having a computer-readable program code to cause a number of computer systems connected via a network to effect distributed processing.