WO1998058522A2 - Sound reproduction system - Google Patents

Info

Publication number
WO1998058522A2
Authority
WO
WIPO (PCT)
Prior art keywords
loudspeakers
centre line
virtual
sound
source
Prior art date
Application number
PCT/GB1998/001527
Other languages
French (fr)
Other versions
WO1998058522A3 (en)
Inventor
Michael Peter Hollier
Kelvin Chee Kin Foo
Malcolm Omar Hawksford
Original Assignee
British Telecommunications Public Limited Company
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by British Telecommunications Public Limited Company
Priority to AU76644/98A (patent AU735233B2)
Priority to DE69816298T (patent DE69816298T2)
Priority to EP98924440A (patent EP0990369B1)
Priority to JP50392399A (patent JP2002505057A)
Publication of WO1998058522A2
Publication of WO1998058522A3

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00 Circuits for transducers, loudspeakers or microphones
    • H04R3/12 Circuits for transducers, loudspeakers or microphones for distributing signals to two or more loudspeakers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01 Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01 Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

A method of sound reproduction for reproducing sound by way of a plurality of loudspeakers comprises the steps of determining where, within a defined space, a virtual sound source is located and, for a particular virtual sound source, applying a crosstalk cancellation process to a sub-set of the loudspeakers, said sub-set being selected on the basis of the location of the virtual sound source in the defined space.

Description

SOUND REPRODUCTION SYSTEM
This invention relates to sound reproduction systems, and in particular to an improved system for binaural synthesis, that is the generation of sound signals such that the pressures at a user's ears correspond to those which would have existed in the presence of the sound source to be simulated. Such sounds will have a true source, which is generally a loudspeaker or array of loudspeakers, but seem to the listener to originate from another source, located at the position of the source being simulated. This perceived source of the sound is known as a "virtual source". The principle of using a conventional stereo loudspeaker setup for binaural synthesis was first conceived by Atal B S & Schroeder M R, "Apparent sound source translator", US Patent 3236949, 1966, and was later further optimised by Cooper D H & Bauck J L, "Prospects for transaural recording", Journal of the Audio Engineering Society, Vol. 37 (3-19), 1989, who introduced the term "Transaural Stereo". See also Schroeder M R, "Models of Hearing", Proc. IEEE, Vol. 63 (1332-1350), 1975 and Cooper D H, Bauck J L, "Generalised transaural stereo", 93rd AES Convention, Preprint 3401, 1992.
If signals at the ears relating to direction of sound sources can be reconstructed accurately, combined with accurate reconstruction of secondary images such as reflections, compelling spatial immersion could be accomplished.
In a loudspeaker listening situation, in order to synthesise the correct signals to the ears to simulate a sound source at some physical point other than the loudspeakers, the signals to the loudspeakers have to be tailored in such a way as to reconstruct, at the listener's ears, sound pressures indistinguishable from those that the ears would have received in a free field setup. The propagation from each loudspeaker L1, L2 to each ear of a listener Z is represented in Figure 1, and can be characterised by the following matrix equation:

$$\begin{bmatrix} X_L \\ X_R \end{bmatrix} = \begin{bmatrix} H_{1L} & H_{2L} \\ H_{1R} & H_{2R} \end{bmatrix} \begin{bmatrix} Y_1 \\ Y_2 \end{bmatrix}$$

where:
$X_L$ is the signal received at the left ear;
$X_R$ is the signal received at the right ear;
$Y_1$ is the signal transmitted by the left source (loudspeaker L1);
$Y_2$ is the signal transmitted by the right source (loudspeaker L2);
$H_{1L}$ = transfer function of left source (loudspeaker L1) to left ear;
$H_{1R}$ = transfer function of left source (loudspeaker L1) to right ear;
$H_{2L}$ = transfer function of right source (loudspeaker L2) to left ear;
$H_{2R}$ = transfer function of right source (loudspeaker L2) to right ear.

Solving for Y with known signals X that describe the sound source at an arbitrary point in space should obtain the appropriate signals to be fed to the loudspeakers. This equation clearly shows that the signal X is required to be filtered through a crosstalk cancellation stage formed by the inverted matrix (hereinafter referred to as the crosstalk cancellation matrix) as depicted in the following equation:

$$\begin{bmatrix} Y_1 \\ Y_2 \end{bmatrix} = \frac{1}{H_{1L} H_{2R} - H_{1R} H_{2L}} \begin{bmatrix} H_{2R} & -H_{2L} \\ -H_{1R} & H_{1L} \end{bmatrix} \begin{bmatrix} X_L \\ X_R \end{bmatrix}$$

In theory, such a derivation method for a crosstalk cancellation solution could be applied to any set-up of a pair of loudspeakers, whether symmetrical or non-symmetrical. In a conventional crosstalk cancellation configuration for a stereo pair of loudspeakers, and for all transaural systems depicted so far, all synthesised sound images are filtered through a crosstalk cancellation process. However, a stereo pair cannot give accurate reconstruction of signals from all directions. A typical stereo pair arranged in front of a listener gives accurate simulation only within approximately ±100° of the direction the listener is facing. Moreover, if more than two loudspeakers are introduced, the crosstalk cancellation technique breaks down as the system would result in an indeterminate system (more unknowns than equations). Cooper and Bauck extended their generalised transaural theory to more than two discrete channels of information, which generalised the crosstalk cancellation to any number of loudspeakers for any number of listeners. However, only approximate solutions were given, in an attempt to solve an indeterminate system for one listener.
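The matrix inversion described above can be sketched for a single frequency as follows. This is a minimal illustration, not the patent's implementation: the function names and the numeric transfer-function values are hypothetical, and each transfer function is reduced to one complex gain, whereas a real system would apply the same algebra per frequency bin or as filters.

```python
def crosstalk_cancel(h1l, h1r, h2l, h2r, xl, xr):
    """Return loudspeaker feeds (y1, y2) that produce ear signals (xl, xr).

    Applies the inverse of the 2x2 loudspeaker-to-ear transfer matrix.
    """
    det = h1l * h2r - h1r * h2l
    if abs(det) < 1e-12:
        raise ValueError("transfer matrix is (nearly) singular at this frequency")
    y1 = (h2r * xl - h2l * xr) / det
    y2 = (-h1r * xl + h1l * xr) / det
    return y1, y2


def ear_signals(h1l, h1r, h2l, h2r, y1, y2):
    """Forward model: ear signals produced by loudspeaker feeds (y1, y2)."""
    return h1l * y1 + h2l * y2, h1r * y1 + h2r * y2


# Hypothetical single-frequency transfer functions for a symmetric pair.
H = dict(h1l=1.0 + 0.0j, h1r=0.4 - 0.2j, h2l=0.4 - 0.2j, h2r=1.0 + 0.0j)
target_xl, target_xr = 0.7 + 0.1j, -0.3j

y1, y2 = crosstalk_cancel(xl=target_xl, xr=target_xr, **H)
got_xl, got_xr = ear_signals(y1=y1, y2=y2, **H)
```

Feeding the computed loudspeaker signals back through the forward model recovers the target ear signals, which is the property the crosstalk cancellation stage is meant to provide. The singularity check is an added safeguard for frequencies at which the determinant vanishes.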
According to the invention, there is provided a sound reproduction system for reproducing sound, the system comprising a plurality of loudspeakers, a processor capable of determining where, within a defined space, a virtual sound source is located and, for each virtual sound source, means for selecting a sub-set of the loudspeakers, said sub-set being selected from the plurality of loudspeakers on the basis of the location of the virtual sound source in the defined space, and means for applying a cross talk cancellation process to the selected sub-set of the loudspeakers. In another aspect of the invention, there is provided a method of sound reproduction for reproducing sound by way of a plurality of speakers, the method comprising the steps of determining where, within a defined space, a virtual sound source is located and, for each virtual sound source, applying a cross talk cancellation process to a sub-set of the loudspeakers, said sub-set being selected from the plurality of loudspeakers on the basis of the location of the virtual sound source in the defined space.
The plurality of loudspeakers from which the subset is selected allows accurate simulation over a greater range of virtual source locations than a single pair of loudspeakers could achieve. However, the selection of a subset (preferably a pair) from this larger plurality of loudspeakers allows the crosstalk processing to be greatly simplified. The pairwise concept introduced here embraces a finite number of independent crosstalk cancellation processes, each identifying with a pair of loudspeakers in a multiple speaker array. The derivation of the crosstalk cancellation matrix process for each pair is identical to that for a conventional pair. The number of independent crosstalk cancellation matrix modules which can be implemented in such an array is governed by the locations of loudspeakers in the multi-loudspeaker array, and the spatial coverage and accuracy achievable by an optimised pair of loudspeakers in that array.
Embodiments of the invention will now be described with reference to the drawings, in which:
Figure 1 shows a conventional stereo pair configuration with the respective transfer functions from sources to ears as already discussed;
Figure 2 illustrates four physical point sources with maximum possible number of crosstalk cancellation processes;
Figure 3 illustrates a lateral set of four loudspeakers, showing the loudspeakers' area of coverage on the lateral plane (the horizontal plane containing the ears);
Figure 4 illustrates the application of binaurally synthesised signals to appropriate crosstalk cancellation processes for the configuration of Figure 3;
Figure 5 illustrates a three-loudspeaker configuration;
Figure 6 illustrates a five-loudspeaker configuration;
Figure 7 illustrates an application of virtual static point sources to overcome limitations in available space;
Figure 8 illustrates another four-loudspeaker configuration;
Figure 9 shows schematically a pairwise crosstalk cancellation implementation circuit for localising five monophonic virtual sources using the four-loudspeaker layout of Figure 8.
Figure 2 shows a loudspeaker layout having four loudspeakers L1, L2, L3, L4. It is not in general necessary to implement all the possible pairwise processes, as in most configurations only adjacent pairs of loudspeakers are used. For some virtual sources, however, non-adjacent pairs may be selected (as will be seen when discussing Figure 6), so the maximum number of crosstalk cancellation processes between pairs of loudspeakers in an array of four loudspeakers is not four, but six, or more generally, for an array of n loudspeakers, n(n-1)/2.
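The count of possible pairwise processes can be checked with a short enumeration. The helper name `speaker_pairs` is hypothetical and is used only for illustration.

```python
from itertools import combinations


def speaker_pairs(speakers):
    """List every unordered pair of loudspeakers, adjacent or not."""
    return list(combinations(speakers, 2))


four = speaker_pairs(["L1", "L2", "L3", "L4"])
five = speaker_pairs(["L1", "L2", "L3", "L4", "L5"])
```

For four loudspeakers this yields six pairs and for five it yields ten, matching n(n-1)/2.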
The selection of an appropriate crosstalk cancellation process is governed by the direction of the synthesised sound source or sources, i.e. if synthesised sound images are to emanate from directions which are covered by one pair of loudspeakers, the processed directional signals are only applied to that pair of loudspeakers and its respective crosstalk cancellation process. If two or more sound sources of different directions are to be synthesised and played back via an array of multiple loudspeakers, respective crosstalk cancellation process modules relating to respective pairs of loudspeakers can be implemented to deliver each pair of directional signals to the ears, taking note that the process is always performed pairwise.
To illustrate the pairwise concept and the explanation given above, consider the lateral setup as shown in Figure 3. The layout consists of a ±30° frontal pair of loudspeakers L1, L2, and a ±120° rear pair of loudspeakers L3, L4 (angles of incidence are measured with respect to the direction due front of the listener Z). Seven virtual images V1 to V7 are shown emanating from different bearings. To deliver correctly each binaurally-synthesised sound signal (carrying directional information) to the listener's ears, each pair of signals is applied to crosstalk cancellation process modules of appropriate pairs of loudspeakers which cover the location of the sound images. Four areas of coverage are shown, with loudspeakers L1, L2 encompassing the frontal sector 31 (±60°), L1, L3 and L2, L4 for left and right sectors 32, 33 respectively and L3, L4 for rear coverage (sector 34). The block diagram in Figure 4 illustrates the strategic switching of a number of processed signals having left and right components (XL, XR) as heard at the ears to appropriate modules 41, 42, 43, 44, each corresponding to the pair of loudspeakers appropriate to the lateral bearings of these signals.
Translating virtual moving sound sources using the pairwise concept can be achieved by correctly switching or directing the synthesised signals to the appropriate pairwise crosstalk cancellation process. Using the example shown in Figure 3, a sound source can be made to translate from the left sector (32) to the frontal sector (31) by first applying the synthesised signal to the crosstalk cancellation processor 42 for the left sector 32, to give its initial position as well as the points of movement within the left sector, depending on the angular step size between synthesised sources. Once the image shifts to the next sector, the synthesised signals are switched to the crosstalk cancellation processor 41 for the front sector 31 to continue projecting the moving source.
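The switching logic for the Figure 3 layout can be sketched as a lookup from source bearing to loudspeaker pair. Several details here are assumptions rather than statements from the text: negative azimuths are taken to lie on the listener's left, the boundary between the side sectors and the rear sector is placed at ±120° (the text gives only the ±60° frontal limit), and the function name `select_pair` is hypothetical.

```python
def select_pair(azimuth_deg):
    """Pick the loudspeaker pair whose sector covers the given bearing.

    Assumed convention: 0 is due front, negative is left, positive is right.
    """
    a = (azimuth_deg + 180.0) % 360.0 - 180.0  # wrap into [-180, 180)
    if -60.0 <= a <= 60.0:
        return ("L1", "L2")  # frontal sector 31
    if -120.0 <= a < -60.0:
        return ("L1", "L3")  # left sector 32 (assumed rear limit)
    if 60.0 < a <= 120.0:
        return ("L2", "L4")  # right sector 33 (assumed rear limit)
    return ("L3", "L4")      # rear sector 34


# A source moving from the left towards the front in 30-degree steps:
# the selected pair changes once the bearing enters the frontal sector.
trajectory = [select_pair(a) for a in range(-90, 1, 30)]
```

In a running system the synthesised left and right signals for each step would be routed to the crosstalk cancellation module of the returned pair.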
The example shown above may appear to suggest that the pairwise concept restricts the crosstalk cancellation to within the angle between the pair of loudspeakers. However, the angle of coverage, be it lateral or spherical, strictly depends on how well a pair of loudspeakers can spatialise within its capability (in the sense of localisation accuracy). The following worked examples were taken from experiments which demonstrate that different paired configurations gave significantly different localisation abilities and reveal advantages of some unconventional loudspeaker placement over current layout practice.
An unusual layout, which may seem to be impractical on initial inspection, is shown in Figure 5. This has just three loudspeakers L1, L2, L3 (Left, Centre Front, and Right), arranged at 0° (Centre Front) and ±90° (Right and Left). It displays good imaging ability within the respective loudspeakers' optimised fields of coverage as shown in Figure 5. The left and right frontal quadrants 51, 52 covered by the Left/Centre pair L1/L2 and Right/Centre pair L2/L3 give good static frontal sources even with a distinct degree of head rotation to face the virtually positioned source. The unconventional Left/Right pair L1/L3 along the axis of the ears gave remarkable rear incidence synthesised images covering the range from +90° to -90°, even on the onsets of the synthesised sound sample. The Left-Right ear axis loudspeaker pair L1/L3 not only gives coverage along the rear half of the lateral plane (sector 53), but it also encompasses the rear hemisphere, i.e. including point sources above or below the lateral plane.
Another example is illustrated in Figure 6. This illustrates that the coverage provided by some paired loudspeakers is limited but, by combining with several other pairs of loudspeakers in the array, the voids are filled and a desired spatialisation is fulfilled. Five loudspeakers are used, arranged at 0° (Centre-Front: L2), ±60° (Right-Front: L3, and Left-Front: L1), and ±120° (Right-Rear: L4 and Left-Rear: L5). The frontal ±60° stereo pair L1/L3 provides poor frontal images in the range covering ±10° (sectors 62/63). Addition of the centre-front unit L2 and implementation of crosstalk cancellation on the left-front/centre-front (L1/L2) and right-front/centre-front (L2/L3) pairs provides sufficient coverage for sound images in these sectors. It can be seen that there is a possibility of extending the coverage between the centre loudspeaker L2 and each of the respective front loudspeakers L1, L3. The pairwise concept employs a strategy of applying the best pair available to achieve good localisation and, in this case, subjective tests have shown that sound images projected at the angles between -10° and -60° (sector 61) and between +10° and +60° (sector 64) are better localised using the left-front/right-front non-adjacent pair L1/L3 than when processed by either the left-front/centre-front or centre-front/right-front pairs (L1/L2, L2/L3). The pairwise concept is not restricted to just these few loudspeaker configurations and locations. The invention delivers a new yet direct general approach to solving three-dimensional sound field spatialisation for multiple loudspeaker applications. The loudspeaker array itself may be designed to comply with other constraints such as cost (in particular the number of loudspeakers to be used) and the availability of locations to site the loudspeakers.
With such a strategy, in general terms, the best localisation effect of a sound source is achieved by engaging a crosstalk cancellation process that relates to the most appropriate pair of loudspeakers available in the array. This is not restricted to just the direct path of sound sources. Each individual reflection of a sound source could be treated as a further virtual source, with a suitable delay with respect to the primary source, to simulate a reflected sound. Applying the appropriate crosstalk cancellation process to each reflection could accurately render their positions in space, an essence of an immersive spatial environment.
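One way to choose the delay for a reflection treated as a further virtual source is sketched below. The text says only that a suitable delay is used; deriving it from the extra path length, the speed-of-sound constant, and the function names are all assumptions made for illustration.

```python
SPEED_OF_SOUND_M_S = 343.0  # assumed value for air at roughly 20 degrees C


def reflection_delay_samples(direct_path_m, reflected_path_m, sample_rate_hz):
    """Delay of a reflection relative to the direct sound, in whole samples."""
    if reflected_path_m < direct_path_m:
        raise ValueError("a reflected path cannot be shorter than the direct path")
    extra_m = reflected_path_m - direct_path_m
    return round(extra_m / SPEED_OF_SOUND_M_S * sample_rate_hz)


def delay_signal(samples, delay):
    """Prepend `delay` zero samples so the copy arrives later than the original."""
    return [0.0] * delay + list(samples)


# Hypothetical geometry: 3 m direct path, 5 m path via a wall, 48 kHz audio.
delay = reflection_delay_samples(3.0, 5.0, 48000)
reflected_copy = delay_signal([1.0, 0.5], 2)
```

The delayed copy would then be binaurally synthesised for the bearing of the reflection and routed to whichever loudspeaker pair covers that bearing, exactly as for a primary source.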
The introduction of unconventional loudspeaker locations also reveals exceptional rear localisation of sound images and, combined with another paired configuration that has good frontal attributes, gives a strong distinction between front and rear virtual images, thereby eliminating front-back and back-front ambiguities.
The ability to project static virtual point sources accurately contributes greatly to teleconferencing and fully immersive personal workstation applications. Further applications also extend to home cinema setups in which the loudspeaker positions intended for a cinema need to be simulated. The home environment is restricted in both the number of available loudspeakers and in the availability of positions to place them. Virtual loudspeakers in such a setup could be rendered in their respective places as shown in Figure 7. In the example illustrated, five virtual units 71, 72, 73, 74, 75 are simulated by only three real units L1, L2, L3, configured as already described with reference to Figure 5. Two of the virtual units 74, 75 are located outside the confines of the room R in which the loudspeakers L1, L2, L3 and the listener Z are located. This could overcome the limitation of physical point sources and available listening space in a room. Directional loudspeakers can be used to reduce the volume of sound audible at locations away from the listener Z, and in particular at the locations of the virtual rear surround units 74, 75 outside the room R.
A simplified example of an implementation of the system, based on a four-loudspeaker array with only two sectors, as shown in Figure 8, will now be described. Figure 9 shows the array set up with pairwise crosstalk cancellation applied to a forward pair L1, L2 set at ±60° and a side pair L3, L4 set at ±90°, i.e. it is based on the assumption that the forward pair L1, L2 provides the best reconstruction of spatialised images in the front sector (Sector 81) and the side pair L3, L4 provides the best reconstruction of spatialised images in the rear sector (Sector 82). The example depicts five virtual sources X0, X1, X2, X3, X4 to be spatialised; however, the implementation of the pairwise concept does not limit the number of input sources.
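A minimal Python sketch of the two-sector pair selection just described. The text states only that the forward pair serves the front sector and the side pair serves the rear sector; the 90° sector boundary, the azimuth convention and the function names are assumptions made for illustration.

```python
FRONT_PAIR = ("L1", "L2")  # forward pair, set at +/-60 degrees in the example
SIDE_PAIR = ("L3", "L4")   # side pair, set at +/-90 degrees in the example


def normalise_azimuth(azimuth_deg):
    """Wrap an angle to (-180, 180]; 0 is straight ahead (assumed convention)."""
    wrapped = (azimuth_deg + 180.0) % 360.0 - 180.0
    return 180.0 if wrapped == -180.0 else wrapped


def select_pair(azimuth_deg, boundary_deg=90.0):
    """Pick the loudspeaker pair for a virtual source at the given azimuth.

    boundary_deg is the assumed angle separating the front and rear sectors.
    """
    if abs(normalise_azimuth(azimuth_deg)) <= boundary_deg:
        return FRONT_PAIR
    return SIDE_PAIR
```

For example, `select_pair(30.0)` returns the forward pair and `select_pair(150.0)` returns the side pair. An array with more sectors would replace the single boundary with a table of sector limits.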
The input sources X0 - X4 are each first subjected to analogue/digital conversion in a bank 91 of converters A/D. The input sources are then treated in a bank of processors 92 with the appropriate hearing response transfer functions (HRTFs), $H_{X0L}$, $H_{X0R}$, $H_{X1L}$, $H_{X1R}$, $H_{X2L}$, $H_{X2R}$, $H_{X3L}$, $H_{X3R}$, $H_{X4L}$, $H_{X4R}$; where $H_{X0L}$ is the HRTF of source X0 to Left Ear, $H_{X0R}$ is the HRTF of source X0 to Right Ear, etc. The left outputs of the front three sources X0L, X1L, X2L are then combined in a combiner 93, and similarly for the right outputs X0R, X1R, X2R (combiner 93a), and the two outputs filtered in a processor 94 by the forward pair crosstalk cancellation matrix for the reconstruction of virtual images in the front sector 81.
The remaining two input sources X3, X4 are similarly filtered by the side pair crosstalk cancellation matrix (processor 94a) for the reconstruction of virtual images in the rear sector 82. The outputs from the cancellation stages 94, 94a are then subject to digital/analogue conversion (D/A) (converters 96) for output to the appropriate loudspeakers L1, L2; L3, L4.
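The HRTF filtering and combining stages (processors 92, combiners 93, 93a) can be sketched in Python for a single frequency bin, where each HRTF reduces to one complex gain. The dictionary layout, the function names and the idea of working one bin at a time are assumptions made for illustration, not details taken from the text.

```python
def binauralise(sources, hrtfs):
    """Sum HRTF-filtered sources into one (left, right) binaural pair.

    sources: source name -> complex amplitude at one frequency bin
    hrtfs:   source name -> (H_left, H_right) complex gains at that bin
    """
    left = sum(hrtfs[name][0] * x for name, x in sources.items())
    right = sum(hrtfs[name][1] * x for name, x in sources.items())
    return left, right


def route(sources, hrtfs, front_names, rear_names):
    """Build the two binaural buses fed to the front and side cancellation stages."""
    front = binauralise({n: sources[n] for n in front_names}, hrtfs)
    rear = binauralise({n: sources[n] for n in rear_names}, hrtfs)
    return front, rear
```

In the example of Figure 9, `front_names` would be X0, X1, X2 and `rear_names` would be X3, X4; each returned pair is the input to the crosstalk cancellation stage for the corresponding loudspeaker pair.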
In the pairwise crosstalk cancellation processes, the following calculations are performed:
for loudspeaker L1: $Y_1 = (H'_{2R}) X_L + (H'_{2L}) X_R$, where:

$$H'_{2L} = \frac{-H_{2L}}{(H_{1L} \cdot H_{2R}) - (H_{1R} \cdot H_{2L})}, \qquad H'_{2R} = \frac{H_{2R}}{(H_{1L} \cdot H_{2R}) - (H_{1R} \cdot H_{2L})}$$

for loudspeaker L2: $Y_2 = (H'_{1R}) X_L + (H'_{1L}) X_R$, where:

$$H'_{1L} = \frac{H_{1L}}{(H_{1L} \cdot H_{2R}) - (H_{1R} \cdot H_{2L})}, \qquad H'_{1R} = \frac{-H_{1R}}{(H_{1L} \cdot H_{2R}) - (H_{1R} \cdot H_{2L})}$$

where:
$H_{1L}$ = HRTF of Loudspeaker L1 to Left Ear
$H_{1R}$ = HRTF of Loudspeaker L1 to Right Ear
$H_{2L}$ = HRTF of Loudspeaker L2 to Left Ear
$H_{2R}$ = HRTF of Loudspeaker L2 to Right Ear

for loudspeaker L3: $Y_3 = (H'_{4R}) X_L + (H'_{4L}) X_R$, where:

$$H'_{4L} = \frac{-H_{4L}}{(H_{3L} \cdot H_{4R}) - (H_{3R} \cdot H_{4L})}, \qquad H'_{4R} = \frac{H_{4R}}{(H_{3L} \cdot H_{4R}) - (H_{3R} \cdot H_{4L})}$$

for loudspeaker L4: $Y_4 = (H'_{3L}) X_R + (H'_{3R}) X_L$, where:

$$H'_{3L} = \frac{H_{3L}}{(H_{3L} \cdot H_{4R}) - (H_{3R} \cdot H_{4L})}, \qquad H'_{3R} = \frac{-H_{3R}}{(H_{3L} \cdot H_{4R}) - (H_{3R} \cdot H_{4L})}$$

where:
$H_{3L}$ = HRTF of Loudspeaker L3 to Left Ear
$H_{3R}$ = HRTF of Loudspeaker L3 to Right Ear
$H_{4L}$ = HRTF of Loudspeaker L4 to Left Ear
$H_{4R}$ = HRTF of Loudspeaker L4 to Right Ear
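The calculations above amount to inverting the 2x2 matrix of loudspeaker-to-ear HRTFs. The Python sketch below checks this at a single frequency, where each HRTF is one complex number. It assumes that each ear receives the sum of both loudspeaker feeds weighted by the corresponding HRTF; the function names and the singularity threshold are illustrative choices, not part of the text.

```python
def crosstalk_filters(h1l, h1r, h2l, h2r):
    """Return the primed filters for one loudspeaker pair at one frequency."""
    det = h1l * h2r - h1r * h2l  # common denominator of all four filters
    if abs(det) < 1e-12:         # assumed threshold for a non-invertible pair
        raise ValueError("loudspeaker-to-ear matrix is not invertible")
    return {
        "H'1L": h1l / det,
        "H'1R": -h1r / det,
        "H'2L": -h2l / det,
        "H'2R": h2r / det,
    }


def speaker_feeds(x_left, x_right, f):
    """Compute Y1 and Y2 from the binaural pair, as in the equations above."""
    y1 = f["H'2R"] * x_left + f["H'2L"] * x_right
    y2 = f["H'1R"] * x_left + f["H'1L"] * x_right
    return y1, y2


def ear_signals(y1, y2, h1l, h1r, h2l, h2r):
    """Assumed acoustic model: each ear hears both loudspeakers via its HRTFs."""
    return h1l * y1 + h2l * y2, h1r * y1 + h2r * y2
```

Feeding the result of `speaker_feeds` into `ear_signals` with the same four HRTFs should return the original binaural pair, which is what cancelling the crosstalk means. The side pair L3, L4 follows the same pattern with its own HRTFs.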

Claims

1. A sound reproduction system for reproducing sound, the system comprising a plurality of loudspeakers, a processor capable of determining where, within a defined space, one or more virtual sound sources are located and, for each virtual sound source, means for selecting a sub-set of the loudspeakers, said sub-set being selected from the plurality of loudspeakers on the basis of the location of the virtual sound source in the defined space, and means for applying a cross talk cancellation process to the selected sub-set of the loudspeakers.
2. A sound reproduction system as claimed in Claim 1, having means for reproducing at least a primary virtual sound source and a secondary virtual sound source, and for delaying the secondary virtual sound source signal with respect to the primary source, to simulate a reflection of the primary source.
3. A sound reproduction system as defined in Claim 1 or Claim 2 wherein the sub-sets of loudspeakers are pairs of loudspeakers.
4. A sound reproduction system as claimed in claim 3 wherein there are four loudspeakers arranged substantially at 30° and 120° to left and right of a predetermined centre line, and wherein four sectors are defined bounded by divisions at substantially 60° and 120° to left and right of the centre line, and wherein for virtual sources in the sector bounded by the divisions to 60° left and right of the centre line the loudspeakers at 30° from the centre line are selected, for positions greater than 120° to left and right of the centre line the loudspeakers at 120° to left and right of the centre line are selected, and for intermediate angles to the left of the centre line the two left-hand loudspeakers are selected and for intermediate angles to the right of the centre line the two right-hand loudspeakers are selected.
5. A sound reproduction system as claimed in claim 3 wherein there are five loudspeakers arranged substantially at 0°, 60° and 120° to left and right of a predetermined centre line, and wherein five sectors are defined bounded by divisions at substantially 0°, 10°, and 120° to left and right of the centre line, and wherein for virtual sources in the sector bounded by the divisions at 0° and 60° left of the centre line the loudspeakers at 0° and 60° left from the centre line are selected, for virtual sources in the sector bounded by the divisions at 0° and 60° right of the centre line the loudspeakers at 0° and 60° right from the centre line are selected, for positions greater than 120° to left and right of the centre line the loudspeakers at 120° to left and right of the centre line are selected, and for intermediate angles between 10° and 120° left or right of the centre line the two loudspeakers at 60° left and right of the centre line are selected.
6. A system according to claim 3 wherein there are three loudspeakers, arranged in front of a listening point and at 90° of the centre line to left and right, wherein virtual sources to the rear of the user are reproduced using the left and right loudspeakers and virtual sources to the front of the user are represented by the central loudspeaker and the left or right speaker according to which side of the centre line the virtual source is.
7. A method of sound reproduction for reproducing sound by way of a plurality of speakers, the method comprising the steps of determining where, within a defined space, one or more virtual sound sources are located and, for each virtual sound source, applying a cross talk cancellation process to a sub-set of the loudspeakers, said sub-set being selected from the plurality of loudspeakers on the basis of the location of the virtual sound source in the defined space.
8. A sound reproduction system as claimed in Claim 7, for reproducing at least a primary virtual sound source and a secondary virtual sound source, wherein the secondary virtual sound source signal is delayed with respect to the primary source, to simulate a reflection of the primary source.
9. A sound reproduction system as defined in claim 7 or 8 wherein the loudspeakers are selected pairwise.
10. A method of sound reproduction according to claim 7, 8, or 9 wherein a plurality of virtual sound sources are operated on simultaneously, with cross-talk cancellation processes applied to appropriate sub-sets of the loudspeakers for each virtual sound source.
11. A sound reproduction system substantially as described with reference to the drawings.
12. A method of sound reproduction for reproducing sound substantially as described with reference to the drawings.
PCT/GB1998/001527 1997-06-19 1998-05-27 Sound reproduction system WO1998058522A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
AU76644/98A AU735233B2 (en) 1997-06-19 1998-05-27 Sound reproduction system
DE69816298T DE69816298T2 (en) 1997-06-19 1998-05-27 A sound reproduction
EP98924440A EP0990369B1 (en) 1997-06-19 1998-05-27 Sound reproduction system
JP50392399A JP2002505057A (en) 1997-06-19 1998-05-27 Sound reproduction system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP97304310 1997-06-19
EP97304310.2 1997-06-19

Publications (2)

Publication Number Publication Date
WO1998058522A2 true WO1998058522A2 (en) 1998-12-23
WO1998058522A3 WO1998058522A3 (en) 1999-03-11

Family

ID=8229384

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB1998/001527 WO1998058522A2 (en) 1997-06-19 1998-05-27 Sound reproduction system

Country Status (5)

Country Link
EP (1) EP0990369B1 (en)
JP (1) JP2002505057A (en)
AU (1) AU735233B2 (en)
DE (1) DE69816298T2 (en)
WO (1) WO1998058522A2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006016156A1 (en) * 2004-08-10 2006-02-16 1...Limited Non-planar transducer arrays
US7078423B2 (en) 2002-07-18 2006-07-18 Inotek Pharmaceuticals Corporation 5-Aryltetrazole compounds, compositions thereof, and uses therefor
US7087631B2 (en) 2002-07-18 2006-08-08 Inotek Pharmaceuticals Corporation Aryltetrazole compounds, and compositions thereof
EP2229012A1 (en) * 2009-03-11 2010-09-15 Yamaha Corporation Device, method, program, and system for canceling crosstalk when reproducing sound through plurality of speakers arranged around listener

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015002517A1 (en) * 2013-07-05 2015-01-08 한국전자통신연구원 Virtual sound image localization method for two dimensional and three dimensional spaces
WO2019079602A1 (en) * 2017-10-18 2019-04-25 Dts, Inc. Preconditioning audio signal for 3d audio virtualization

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5517570A (en) * 1993-12-14 1996-05-14 Taylor Group Of Companies, Inc. Sound reproducing array processor system
US5598478A (en) * 1992-12-18 1997-01-28 Victor Company Of Japan, Ltd. Sound image localization control apparatus

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
BAUCK J ET AL: "GENERALIZED TRANSAURAL STEREO AND APPLICATIONS" JOURNAL OF THE AUDIO ENGINEERING SOCIETY, vol. 44, no. 9, September 1996, pages 683-705, XP000699723 cited in the application *
PULKKI V: "VIRTUAL SOUND SOURCE POSITIONING USING VECTOR BASE AMPLITUDE PANNING" JOURNAL OF THE AUDIO ENGINEERING SOCIETY, vol. 45, no. 6, June 1997, pages 456-466, XP000695381 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7078423B2 (en) 2002-07-18 2006-07-18 Inotek Pharmaceuticals Corporation 5-Aryltetrazole compounds, compositions thereof, and uses therefor
US7087631B2 (en) 2002-07-18 2006-08-08 Inotek Pharmaceuticals Corporation Aryltetrazole compounds, and compositions thereof
US7135491B2 (en) 2002-07-18 2006-11-14 Inotek Pharmaceuticals Corp. 5-Aryltetrazole compounds and uses thereof
WO2006016156A1 (en) * 2004-08-10 2006-02-16 1...Limited Non-planar transducer arrays
GB2431314A (en) * 2004-08-10 2007-04-18 1 Ltd Non-planar transducer arrays
GB2431314B (en) * 2004-08-10 2008-12-24 1 Ltd Non-planar transducer arrays
EP2229012A1 (en) * 2009-03-11 2010-09-15 Yamaha Corporation Device, method, program, and system for canceling crosstalk when reproducing sound through plurality of speakers arranged around listener
US8320590B2 (en) 2009-03-11 2012-11-27 Yamaha Corporation Device, method, program, and system for canceling crosstalk when reproducing sound through plurality of speakers arranged around listener

Also Published As

Publication number Publication date
WO1998058522A3 (en) 1999-03-11
EP0990369B1 (en) 2003-07-09
DE69816298D1 (en) 2003-08-14
AU735233B2 (en) 2001-07-05
JP2002505057A (en) 2002-02-12
AU7664498A (en) 1999-01-04
EP0990369A2 (en) 2000-04-05
DE69816298T2 (en) 2004-05-27

Similar Documents

Publication Publication Date Title
Gardner 3-D audio using loudspeakers
US7787638B2 (en) Method for reproducing natural or modified spatial impression in multichannel listening
Pulkki Spatial sound generation and perception by amplitude panning techniques
US6904152B1 (en) Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions
Kyriakakis Fundamental and technological limitations of immersive audio systems
Algazi et al. Headphone-based spatial sound
US8437485B2 (en) Method and device for improved sound field rendering accuracy within a preferred listening area
CA2162567C (en) Stereophonic reproduction method and apparatus
US5802180A (en) Method and apparatus for efficient presentation of high-quality three-dimensional audio including ambient effects
US8712061B2 (en) Phase-amplitude 3-D stereo encoder and decoder
US6259795B1 (en) Methods and apparatus for processing spatialized audio
EP0036337B1 (en) Sound reproducing system having sonic image localization networks
US8488796B2 (en) 3D audio renderer
EP1275272B1 (en) Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions
MXPA05004091A (en) Dynamic binaural sound capture and reproduction.
JP2005513892A (en) Improving spatial cognition in virtual surround
Jot Interactive 3D audio rendering in flexible playback configurations
Gardner 3D audio and acoustic environment modeling
Jot et al. Binaural simulation of complex acoustic scenes for interactive audio
JPH09121400A (en) Depthwise acoustic reproducing device and stereoscopic acoustic reproducing device
Malham Approaches to spatialisation
EP0990369B1 (en) Sound reproduction system
WO2001019138A2 (en) Method and apparatus for generating a second audio signal from a first audio signal
Tarzan et al. Assessment of sound spatialisation algorithms for sonic rendering with headphones
Kobayashi Ritch Design and Application of a Native-D Recording Format for Optimal Dolby Atmos Reproduction

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM GW HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

AK Designated states

Kind code of ref document: A3

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM GW HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 09117626

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 76644/98

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 1998924440

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1998924440

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

NENP Non-entry into the national phase in:

Ref country code: CA

WWG Wipo information: grant in national office

Ref document number: 76644/98

Country of ref document: AU

WWG Wipo information: grant in national office

Ref document number: 1998924440

Country of ref document: EP