EP0975201B1 - A method of processing a plural channel audio signal - Google Patents
A method of processing a plural channel audio signal Download PDFInfo
- Publication number
- EP0975201B1 EP0975201B1 EP99305562.3A EP99305562A EP0975201B1 EP 0975201 B1 EP0975201 B1 EP 0975201B1 EP 99305562 A EP99305562 A EP 99305562A EP 0975201 B1 EP0975201 B1 EP 0975201B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- distance
- head
- loudspeaker
- ear
- chosen
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
- H04S1/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
Definitions
- This invention relates to a method of processing a plural channel audio signal including left and right channels, the information in the channels representing a three dimensional sound-field for generation by respective left and right loudspeakers arranged at a given distance from the preferred position of a listener in use.
- the fundamental Head Response Transfer Function (HRTF) characteristics which are required to implement a transaural crosstalk cancellation scheme are the left- and right-ear transfer functions associated with the azimuth angle at which the loudspeakers are situated ( Figure 1 ). For most applications, this is commonly accepted to be ⁇ 30°.
- the near-ear function is sometimes referred to as the "same” side function (or “ S " function), and the far-ear function as the “alternate” (or " A ”) function.
- S the near-ear function
- A the far-ear function
- Transaural crosstalk cancellation is described in more detail in WO 95/15069 .
- HRTFs measured by the prior art methods do not contain LF information, although, of course, the LF response is present in reality.
- the results of a typical HRTF measurement are shown in Figure 3 , depicting the A and S functions at 30° azimuth, measured from a commercial artificial head.
- the uncertainty in the non-valid data, below several hundred Hz, is apparent. Accordingly, the missing LF properties must be replaced in order to create valid HRTFs, and this is conveniently done by extrapolating the amplitude data at the lowest valid frequency (200 Hz) back to 0 Hz (or in practise, back to the lowest practical frequency, say 10 Hz).
- Prior art transaural crosstalk cancellation methods have always used A and S functions which tend to the same value at low frequencies (see for example, Atal and Schroeder, US 3,236,949 ). Using such functions, the anticipated crosstalk signal at the far ear is equal to the primary signal at the near ear at low frequencies, hence the ratio of crosstalk signal to primary signal is always 1:1 at low frequencies.
- transaural crosstalk is defined to be the intensity ratio of the far ear signal with respect to the near ear signal. As these two functions have a different frequency dependence, this ratio will in general be a function of frequency. However, in the prior art the ratio approaches unity at low frequencies because A and S are forced to the same value below about 200 Hz. That is, the transaural crosstalk signal (far ear signal) is equal in magnitude to the primary signal (near ear signal) for such low frequencies.
- the transaural crosstalk signal is substantially equal to (100% of) the primary signal-at low frequencies, regardless of loudspeaker distance and/or angle. Consequently, all the prior art methods of transaural crosstalk cancellation have not been optimal for the arrangements/distances of loudspeakers used in practice.
- the invention provides a means for creating optimal transaural crosstalk cancellation particularly, though not exclusively, for users of Personal Computer (PC) - based multimedia systems, in which the loudspeakers are relatively close to the listener and might be at a variety of differing angles and distances, depending on the individual user's set-up configuration and preferences.
- the amount of transaural crosstalk which occurs is also influenced by the angle of the loudspeakers. (Note that this is not to be confused with the use of the appropriate azimuth angle A and S functions, which is well known: i.e. use 30° A and S functions for speakers at 30°;15° A and S functions for speakers at 15°, and so on).
- the present invention is a transaural crosstalk cancellation means based on "standard", 1 metre A and S functions.
- the method employs an algorithm which controls the intensity of the transaural crosstalk cancellation signal relative to the near-ear intensity, using a crosstalk cancellation factor which is a function of loudspeaker proximity and spatial position.
- the invention is based on the observation that when a sound source moves relatively closely towards the head (say, from a distance of several metres), then the individual far- and near-ear properties of the HRTF do not change a great deal in terms of their spectral properties, but their amplitudes, and the amplitude difference between them, do change substantially, caused by a distance ratio effect.
- loudspeaker position angles lie in the range ⁇ 10° (for notebook PCs) to ⁇ 30° (for desktop PCs), and the distances (loudspeaker to ear) range from about 0.2 metres to 1 metre respectively. These ranges will be used here for illustrative purposes, but of course the invention is not restricted to these parameters.
- the distance ratio (far-ear to sound source vs. near-ear to sound source) becomes greater.
- the intensity of a sound source diminishes with distance as the energy of the propagating wave is spread over an increasing area.
- the wavefront is similar to an expanding bubble, and hence the energy density is related to the surface area of the propagating wavefront, which is related by a square law to the distance travelled (the radius of the bubble). This is described in the Appendix.
- the intensity ratios of left and right channels are related to the ratio of the squares of the distances.
- the intensity ratios for the above examples at distances of 1 m, 0.5 m and 0.2 m are approximately 0.80, 0.62 and 0.35 respectively. In dB units, these ratios are -0.97 dB, -2.08 dB and -4.56 dB respectively.
- Figure 5 shows a diagram of the near space around the listener, together with the reference planes and axes which will be referred to during the following descriptions, in which P-P' represents the front-back axis in the horizontal plane, intercepting the centre of the listener's head, and with Q-Q' representing the corresponding lateral axis from left to right.
- the near-ear distance can be determined, for example, by the following calculation.
- Figure 6 shows a plan view of the listener's head, together with the near area surrounding it.
- Figure 7 we are interested in the front-right quadrant in order to derive an expression for the source to near-ear distance.
- the situation is trivial to resolve, as shown in Figure 7 , if the "true" source-to-ear paths for the close frontal positions (such as path "A") are assumed to be similar to the direct distance (indicated by "B"). This simplifies the situation, as is shown on the left diagram of Figure 7 , indicating a sound source S in the front-right quadrant, at an azimuth angle of ⁇ degrees with respect to the listener.
- the angle subtended by S-head_centre-Q' is (90° - ⁇ ).
- the far-ear distance can be determined, for example, by the following calculation.
- Figure 8 shows a plan view of the listener's head, together with the near-field area surrounding it. Once again, we are particularly interested in the front-right quadrant. However, the path between the sound source and the far-ear comprises two serial elements, as is shown dearly in the right hand detail of Figure 8 .
- First there is a direct path from the source, S, tangentially to the head, labelled q
- second there is a circumferential path around the head, C, from the tangent point to the far-ear.
- the distance from the sound source to the centre of the head is d, and the head radius is r.
- the angle subtended by the tangent point and the head centre at the source is angle R.
- the crosstalk factor which is the ratio of (far-ear/near-ear) intensities, as a fraction or percentage of this limiting, 100% value.
- This would define how much attenuation should be applied to the crossfeed path in a transaural crosstalk cancellation system ("C" in Figure 2 ) based on conventional "infinitely distant" A and S functions.
- the crosstalk cancellation factor, X could be converted into dB units of sound intensity, X(dB) and used to define the LF asymptote difference of an A and S function pair, as shown in Figure 9 , which could then be used in a conventional crosstalk cancellation scheme (for example Figure 2 , corresponding to Atal and Schroeder, US 3,236,949 ) to the same effect.
- the A function LF asymptote would be set so as to lie X(dB) below the S asymptote (because the far ( A ) ear is always more distant).
- the crosstalk factor X is the far-ear LF intensity (I F ) expressed as a fraction of the near-ear LF intensity (I N ).
- the intensities are related to the distances from the source to far-ear (D F ) and near-ear (D N ) by the square law relationship (see Appendix), as follows.
- the transaural crosstalk cancellation factor X is incorporated into the filter design procedure, thus allowing a range of different transaural crosstalk cancellation filters to be created from standard low frequency convergent A and S functions, but with differing values of X, for a range of speaker configurations, such that the end user can select the most appropriate one for their particular speaker configuration.
- a range of filters for X values in the range say, 0.5 to 1.0 in 0.05 increments.
- a further disadvantage of this alternative approach is that it would require many measurements at different distances and angles, and would result in quantised-distance effects: an optimum value could not be calculated and easily be provided for all loudspeaker configurations.
- the present invention allows both distance and angle parameters to be used to calculate a single crosstalk cancellation factor, from which an associated filter is selected, based on accurate, 1 metre measurement.
- the pressure fluctuations propagate away from the source in a spherical manner - the wavefront is just like an expanding "bubble".
- the wavefront sphere increases in size, and hence its energy is spread over a larger surface area. Consequently, the energy density - and intensity - of the expanding wavefront diminishes.
- the expanding sphere is relatively small, having radius r, such that I, represents the energy received per second from sound source s.
- the wavefront has expanded to a larger sphere having radius r 2 , and intensity I 2 at the surface.
- I 1 I 2 r 2 2 r 1 2
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Stereophonic System (AREA)
- Circuit For Audible Band Transducer (AREA)
- Stereophonic Arrangements (AREA)
Description
- This invention relates to a method of processing a plural channel audio signal including left and right channels, the information in the channels representing a three dimensional sound-field for generation by respective left and right loudspeakers arranged at a given distance from the preferred position of a listener in use.
- The processing of audio signals to reproduce a three dimensional sound-field on replay to a listener having two ears has been a goal for inventors for many years. One approach has been to use many sound reproduction channels to surround the listener with a multiplicity of sound sources such as loudspeakers. Another approach has been to use a dummy head having microphones positioned in the auditory canals of artificial ears to make sound recordings for headphone listening. An especially promising approach to the binaural synthesis of such a sound-field has been described in
EP-B-0689756 , which describes the synthesis of a sound-field using a pair of loudspeakers and only two signal channels, the sound-field nevertheless having directional information allowing a listener to perceive sound sources appearing to lie anywhere on a sphere surrounding the head of a listener placed at the centre of the sphere. - The goal of researchers developing and studying the synthesis of 3D sound-fields from conventional two speaker systems has been to provide for complete and effective transaural crosstalk cancellation.
- The fundamental Head Response Transfer Function (HRTF) characteristics which are required to implement a transaural crosstalk cancellation scheme are the left- and right-ear transfer functions associated with the azimuth angle at which the loudspeakers are situated (
Figure 1 ). For most applications, this is commonly accepted to be ±30°. The near-ear function is sometimes referred to as the "same" side function (or "S" function), and the far-ear function as the "alternate" (or "A") function. These A and S characteristics form the basis of all transaural crosstalk cancellation schemes (Figure 2 ). Transaural crosstalk cancellation is described in more detail inWO 95/15069
(where C = (-A/S) ), and:
These terms are often compounded together and simplify to form: - 1. Poor LF response of measurement actuator (loudspeaker).
In practise, it is known to make measurements from an artificial head in order to derive a library of HRTF data. It is common practise to make these measurements at distances of 1 metre or thereabouts, for several reasons. Firstly, the sound source used for such measurements is, ideally, a point source, and usually a loudspeaker is used. However, there is a physical limit on the minimum size of loudspeaker diaphragms. Typically, a diameter of several inches is as small as is practical whilst retaining the power capability and low-distortion properties which are needed. Hence, in order to have the effects of these loudspeaker signals representative of a point source, the loudspeaker must be spaced at a distance of around 1 metre from the artificial head. (As it is often required to create sound effects for PC games and the like which possess apparent distances of several metres or greater, and so, because there is little difference between HRTFs measured at 1 mere and those measured at much greater distances, the 1 metre measurement is used.) However, loudspeakers of this size and configuration possess very poor LF performance, and their LF response begins to fail at frequencies of around 200 Hz and below. - 2. Poor LF response of measurement sensor (microphone in artificial head).
- 3. DC offsets in instrumentation.
It is not uncommon to find spurious DC level offsets of 5 - 10 mV in digital tape recorders and other instruments used in HRTF measurements. (A DC offset corresponds directly to a gain error at 0 Hz.) - 4. Wind pressure artefacts.
In an anechoic measurement chamber, external wind pressure can cause significant pressure fluctuations within the chamber, giving rise to substantially large data offsets. Consequently, it is convenient to filter off the LF components of the HRTF signals prior to recording them, thus making the mid and high frequency information reliable and reproducible, but at the expense of loss of LF data. - 5. Standing waves.
Even in an anechoic chamber, residual reflected energy can combine to cause standing waves. and these are most apparent at long wavelengths, hence procedures used for (4), above are doubly useful. - 6. Impulse measurement method
HRTFs are measured by means of impulse responses, and this measurement does not provide LF data, because there is insufficient energy in the transient impulse below around 200 Hz. Even when a "stretch" pulse method is used, this is still the case. - 7. Time domain windowing
When measuring HRTFs, it is essential to "window" the measured impulses in the time domain to a period of several milliseconds in order to eliminate incorporating reflected waves into the measurement (even in an anechoic chamber), and this cuts off the spectrum of the resultant data, again, below around 200 Hz. - As a consequence, HRTFs measured by the prior art methods do not contain LF information, although, of course, the LF response is present in reality. The results of a typical HRTF measurement are shown in
Figure 3 , depicting the A and S functions at 30° azimuth, measured from a commercial artificial head. The uncertainty in the non-valid data, below several hundred Hz, is apparent. Accordingly, the missing LF properties must be replaced in order to create valid HRTFs, and this is conveniently done by extrapolating the amplitude data at the lowest valid frequency (200 Hz) back to 0 Hz (or in practise, back to the lowest practical frequency, say 10 Hz). However, although the LF amplitude data do not contain a great deal of "detail" (unlike the HF characteristics), and therefore it might be supposed that back-extrapolation might be simple, it is not entirely straightforward. This is because the HRTF curves are not flat at the lowest valid frequency, but still curving, and the near- and far-ear characteristics exhibit slightly differently shaped curves. Consequently, one must make an intelligent estimate of the y-axis intercept, and extrapolate both curves accordingly, as is shown inFigure 4 . Any LF errors can create significant quality problems, as low-frequency artefacts are very noticeable in high quality audio applications, often termed "phase errors". For this reason any LF errors in the processing must be avoided), and so in practice both near- and far-ear characteristics of the HRTF are extrapolated to the same value at low frequencies. - Prior art transaural crosstalk cancellation methods have always used A and S functions which tend to the same value at low frequencies (see for example, Atal and Schroeder,
US 3,236,949 ). Using such functions, the anticipated crosstalk signal at the far ear is equal to the primary signal at the near ear at low frequencies, hence the ratio of crosstalk signal to primary signal is always 1:1 at low frequencies. - In
WO 95/15069 - Another document useful for understanding the present invention is the published patent
US 5 384 851 A . - According to a first aspect of the invention there is provided a method as specified in claims 1-7 According to a second aspect of the invention there is provided transaural crosstalk filter means as specified in claim 8.
- Embodiments of the invention will now be described, by way of example only, with reference to the accompanying diagrammatic drawings, in which:-
-
Figure 1 shows a plan view of a listener, loudspeakers, and transfer functions, -
Figure 2 shows a prior art transaural crosstalk cancellation scheme, -
Figure 3 shows typical experimentally measured A and S functions, -
Figure 4 shows prior art modified A and S functions with forced convergence below 200 Hz, -
Figure 5 shows a listener with reference sphere and co-ordinate system, -
Figure 6 shows a plan view of the space around the listener in the horizontal plane, -
Figure 7 shows how near ear distances are calculated in the horizontal plane, -
Figure 8 shows how far ear distances are calculated in the horizontal plane, -
Figure 9 shows A and S functions according to the present invention, and -
Figure 10 shows the transaural crosstalk cancellation factor (X) as a function of speaker angle and distance in the horizontal plane. - The inventor of the present invention has discovered that the amount of transaural crosstalk which actually occurs, relative to the primary signal, is dependent upon the distance of the loudspeakers from the listener (and this distance dependency is also a function of azimuthal position). In the present description and claims the term "transaural crosstalk" is defined to be the intensity ratio of the far ear signal with respect to the near ear signal. As these two functions have a different frequency dependence, this ratio will in general be a function of frequency. However, in the prior art the ratio approaches unity at low frequencies because A and S are forced to the same value below about 200 Hz. That is, the transaural crosstalk signal (far ear signal) is equal in magnitude to the primary signal (near ear signal) for such low frequencies. Hence it can be said that in all the prior art schemes the transaural crosstalk signal is substantially equal to (100% of) the primary signal-at low frequencies, regardless of loudspeaker distance and/or angle. Consequently, all the prior art methods of transaural crosstalk cancellation have not been optimal for the arrangements/distances of loudspeakers used in practice.
- The invention provides a means for creating optimal transaural crosstalk cancellation particularly, though not exclusively, for users of Personal Computer (PC) - based multimedia systems, in which the loudspeakers are relatively close to the listener and might be at a variety of differing angles and distances, depending on the individual user's set-up configuration and preferences. The amount of transaural crosstalk which occurs is also influenced by the angle of the loudspeakers. (Note that this is not to be confused with the use of the appropriate azimuth angle A and S functions, which is well known: i.e. use 30° A and S functions for speakers at 30°;15° A and S functions for speakers at 15°, and so on).
- This realisation enables the precise calculation of the relative transaural crosstalk intensity which occurs for any given loudspeaker distance and angle, which result can in turn be used to control the amount of transaural crosstalk cancellation which is implemented.
WO 95/15069 - It is standard procedure (as described in
WO/15069 - The present invention is a transaural crosstalk cancellation means based on "standard", 1 metre A and S functions. The method employs an algorithm which controls the intensity of the transaural crosstalk cancellation signal relative to the near-ear intensity, using a crosstalk cancellation factor which is a function of loudspeaker proximity and spatial position. The invention is based on the observation that when a sound source moves relatively closely towards the head (say, from a distance of several metres), then the individual far- and near-ear properties of the HRTF do not change a great deal in terms of their spectral properties, but their amplitudes, and the amplitude difference between them, do change substantially, caused by a distance ratio effect.
- For practical reasons, it is useful to consider the typical range of loudspeaker position angles and distances representative of present multimedia loudspeaker configurations. Such loudspeaker azimuthal angles lie in the range ±10° (for notebook PCs) to ±30° (for desktop PCs), and the distances (loudspeaker to ear) range from about 0.2 metres to 1 metre respectively. These ranges will be used here for illustrative purposes, but of course the invention is not restricted to these parameters.
- As a general illustration of the effects of using a relatively close loudspeaker, first consider the approximate relative intensities at the far- and near-ear. When a lateral sound source moves towards the head from, say, 1 metre distance, the distance ratio (far-ear to sound source vs. near-ear to sound source) becomes greater. For example, at 45° azimuth in the horizontal plane, at a distance of 1 metre from the centre of the head, the near ear is about 0.95 metre distance and the far-ear around 1.06 metre. So the distance ratio is (0.95 / 1.06) = 0.90. When the sound source moves to a distance of 0.5 metre, then the ratio becomes (0.45/0.57) =0.79, and when the distance is only 20 cm, then the ratio is approximately (0.16 / 0.27) = 0.59. The intensity of a sound source diminishes with distance as the energy of the propagating wave is spread over an increasing area. The wavefront is similar to an expanding bubble, and hence the energy density is related to the surface area of the propagating wavefront, which is related by a square law to the distance travelled (the radius of the bubble). This is described in the Appendix. Hence the intensity ratios of left and right channels are related to the ratio of the squares of the distances. Hence, the intensity ratios for the above examples at distances of 1 m, 0.5 m and 0.2 m are approximately 0.80, 0.62 and 0.35 respectively. In dB units, these ratios are -0.97 dB, -2.08 dB and -4.56 dB respectively.
- It is important to note, however, that the far ear to near ear intensity ratio differences are position dependent. For example, if the aforementioned situation were repeated for a frontal sound source (azimuth 0°) approaching the head, then there would be no difference between the left and right channel intensities, by symmetry. In this instance, the intensity level of both channels simply would increase according to the 1/R2 law.
- Accordingly, it is desirable to derive an expression which defines the relative intensity ratio at the far- and near-ears, caused by a local sound source, as a function of both the distance and angular position of the source relative to the listener. As a frame of reference,
Figure 5 shows a diagram of the near space around the listener, together with the reference planes and axes which will be referred to during the following descriptions, in which P-P' represents the front-back axis in the horizontal plane, intercepting the centre of the listener's head, and with Q-Q' representing the corresponding lateral axis from left to right. - The near-ear distance can be determined, for example, by the following calculation.
Figure 6 shows a plan view of the listener's head, together with the near area surrounding it. For the present purpose, we are interested in the front-right quadrant in order to derive an expression for the source to near-ear distance. The situation is trivial to resolve, as shown inFigure 7 , if the "true" source-to-ear paths for the close frontal positions (such as path "A") are assumed to be similar to the direct distance (indicated by "B"). This simplifies the situation, as is shown on the left diagram ofFigure 7 , indicating a sound source S in the front-right quadrant, at an azimuth angle of θ degrees with respect to the listener. Also shown is the distance, d, of the sound source from the head centre, and the distance, p, of the sound source from the near-ear. The angle subtended by S-head_centre-Q' is (90° - θ). The near-ear distance can be derived using the cosine rule, from triangle S-head_centre-near_ear:
If we assume the head radius, r, is 7.5 cm, then p is given by: - The far-ear distance can be determined, for example, by the following calculation.
Figure 8 shows a plan view of the listener's head, together with the near-field area surrounding it. Once again, we are particularly interested in the front-right quadrant. However, the path between the sound source and the far-ear comprises two serial elements, as is shown dearly in the right hand detail ofFigure 8 . First, there is a direct path from the source, S, tangentially to the head, labelled q, and second, there is a circumferential path around the head, C, from the tangent point to the far-ear. As before, the distance from the sound source to the centre of the head is d, and the head radius is r. The angle subtended by the tangent point and the head centre at the source is angle R. - The tangential path, q, can be calculated simply from the triangle:
...and also the angle R:
Considering the triangle S-T-head_centre, the angle P-head_centre-T is (90 - θ - R), and so the angle T-head_centre-Q (the angle subtended by the arc itself) must be (θ + R). The circumferential path can be calculated from this angle, and is:
Hence, by substituting (7) into (8), and combining with (6), an expression for the total distance (in cm) from sound source to far-ear for a 7.5 cm radius head can be calculated: - Now that working expressions for the distances to each ear from the sound source have been established, it is possible to derive an expression which defines the distance-dependent (and azimuth position-dependent) amount of crosstalk, relative to 100% (corresponding to equal transaural crosstalk signal and primary signal at low frequencies, as suitable for a distant source). As the source moves closer, the relative intensity between the ears decreases, and so there is relatively less crosstalk. This "crosstalk factor" (call it X) characterises the amount of transaural crosstalk relative to an infinitely distant source, where the near-ear and far-ear signals are virtually equal in amplitude at very low frequency (they tend to the same value at 0 Hz). Thus it is convenient to describe the crosstalk factor, which is the ratio of (far-ear/near-ear) intensities, as a fraction or percentage of this limiting, 100% value. This, in turn, would define how much attenuation should be applied to the crossfeed path in a transaural crosstalk cancellation system ("C" in
Figure 2 ) based on conventional "infinitely distant" A and S functions. - Alternatively, the crosstalk cancellation factor, X, could be converted into dB units of sound intensity, X(dB) and used to define the LF asymptote difference of an A and S function pair, as shown in
Figure 9 , which could then be used in a conventional crosstalk cancellation scheme (for exampleFigure 2 , corresponding to Atal and Schroeder,US 3,236,949 ) to the same effect. Thus the A function LF asymptote would be set so as to lie X(dB) below the S asymptote (because the far (A) ear is always more distant). - The crosstalk factor X is the far-ear LF intensity (IF) expressed as a fraction of the near-ear LF intensity (IN). The intensities are related to the distances from the source to far-ear (DF) and near-ear (DN) by the square law relationship (see Appendix), as follows.
From equation (5), the near-ear distance is:
And from equation (9), the far-ear distance is:
Hence the crosstalk factor X (i.e. the LF intensity ratio), as a function of the distance from the source to the head centre, d, and source azimuth angle, θ, is as shown below in equation 13.
This can be expressed in dB in the usual manner, thus: - It is worthwhile computing the X factor as a function of distance from the listener's head at various azimuth angles. This has been done in the range 10 degrees to 30 degrees, and is both tabulated in Table 1 below and depicted graphically in
Figure 10 , where the X factor has been expressed as a fraction, according to equation 13.TABLE 1 pq12813.xls AS 1 Jul 98 Distance-Dependent Transaural Crosstalk Cancellation (Head radius assumed to be 7.5 cm) X Factor as a Ratio of Intensities (Far/Near) d (cm) <<- Speaker angle (deg) - >> 10 12 14 16 18 20 22 24 26 28 30 20 0.782 0.745 0.709 0.675 0.643 0.612 0.582 0.554 0.527 0.501 0.477 25 0.818 0.786 0.755 0.725 0.697 0.669 0.642 0.617 0.592 0.569 0.546 30 0.844 0.816 0.789 0.762 0.737 0.712 0.689 0.666 0.643 0.622 0.601 35 0.864 0.839 0.815 0.791 0.768 0.746 0.725 0.704 0.684 0.664 0.645 40 0.879 0.857 0.835 0.814 0.793 0.773 0.754 0.735 0.716 0.698 0.681 45 0.892 0.871 0.852 0.832 0.813 0.795 0.777 0.760 0.743 0.726 0.710 50 0.902 0.883 0.865 0.847 0.830 0.813 0.797 0.781 0.765 0.750 0.735 55 0.910 0.893 0.876 0.860 0.844 0.828 0.813 0.798 0.784 0.769 0.755 60 0.917 0.901 0.886 0.871 0.856 0.841 0.827 0.813 0.799 0.786 0.773 65 0.923 0.908 0.894 0.880 0.866 0.852 0.839 0.826 0.813 0.801 0.789 70 0.928 0.915 0.901 0.888 0.875 0.862 0.850 0.837 0.825 0.814 0.802 75 0.933 0.920 0.907 0.895 0.883 0.871 0.859 0.847 0.836 0.825 0814 80 0.937 0.925 0.913 0.901 0.890 0.878 0.867 0.856 0.845 0.835 0.824 85 0.940 0.929 0.918 0.907 0.896 0.885 0.874 0.864 0.854 0.844 0.834 90 0.944 0.933 0.922 0.912 0.901 0.891 0.881 0.871 0.861 0.852 0.842 95 0.947 0.936 0.926 0.916 0.906 0.896 0.887 0.877 0.868 0.859 0.850 100 0.949 0.939 0.930 0.920 0.911 0.901 0.892 0.883 0.874 0.866 0.857 250 0.979 0.975 0.971 0.967 0.963 0.959 0.955 0.952 0.948 0.944 0.940 X Factor in dB d (cm) « - Speaker angle (deg) - » 10 12 14 16 18 20 22 24 26 28 30 20 -1.067 -1.279 -1.491 -1.704 -1.919 -2.134 -2.349 -2.566 -2.783 -3.001 -3.219 25 -0.872 -1.046 -1.220 -1.395 -1.570 -1.746 -1.922 -2.098 -2.274 -2.449 -2.625 30 -0.736 -0.883 -1.030 -1.178 -1.325 -1.473 -1.621 -1.768 -1.915 -2.062 -2.208 35 -0.635 -0.763 -0.890 -1.017 -1.144 -1.272 -1.399 -1.525 -1.651 -1.777 -1.902 40 -0.559 -0.671 -0.783 -0.894 -1.006 -1.118 -1.229 -1.340 -1.450 -1.560 -1.670 45 -0.499 -0.598 -0.698 -0.798 -0.897 -0.996 -1.095 -1.194 -1.292 -1.390 -1.487 50 -0.450 -0.540 -0.630 -0.719 -0.809 -0.898 -0.987 -1.076 -1.164 -1.252 -1.339 55 -0.410 -0.492 -0.573 -0.655 -0.737 -0.818 -0.899 -0.979 -1.060 -1.139 -1.218 60 -0.376 -0.451 -0.526 -0.601 -0.676 -0.750 -0.825 -0.898 -0.972 -1.045 -1.117 65 -0.348 -0.417 -0.486 -0.555 -0.624 -0.693 -0.762 -0.830 -0.897 -0.965 -1.032 70 -0.323 -0.387 -0.452 -0.516 -0.580 -0.644 -0.708 -0.771 -0.834 -0.896 -0.958 75 -0.302 -0.362 -0.422 -0.482 -0.542 -0.601 -0.661 -0.720 -0.778 -0.836 -0.894 80 -0.283 -0.339 -0.396 -0.452 -0.508 -0.564 -0.619 -0.675 -0.730 -0.784 -0.838 85 -0.266 -0.320 -0.373 -0.426 -0.478 -0.531 -0.583 -0.635 -0.687 -0.738 -0.789 90 -0.252 -0.302 -0.352 -0.402 -0.452 -0.501 -0.551 -0.600 -0.649 -0.697 -0.745 95 -0.239 -0.286 -0.334 -0.381 -0.428 -0.475 -0.522 -0.568 -0.614 -0.660 -0.706 100 -0.227 -0.272 -0.317 -0.362 -0.407 -0.451 -0.496 -0.540 -0.584 -0.627 -0.670 250 -0.091 -0.109 -0.127 -0.145 -0.163 -0.180 -0.198 -0.216 -0.233 -0.250 -0267 - From Table 1, the optimal X values for transaural crosstalk cancellation schemes applicable to, say, (a) a hi-fi system, (b) a desktop PC, and (c) a laptop PC can be ascertained, as tabulated below in Table 2.
TABLE 2 Speaker Distance (m) Speaker Angle X Factor X (dB) hi-fi System 2.5 30° 0.940 -0.267 Desktop PC 0.6 30° 0.773 -1.117 Laptop PC 0.3 15° 0.789 -1.030 - The implementation of the invention is straightforward: the transaural crosstalk cancellation factor X is incorporated into the filter design procedure, thus allowing a range of different transaural crosstalk cancellation filters to be created from standard low frequency convergent A and S functions, but with differing values of X, for a range of speaker configurations, such that the end user can select the most appropriate one for their particular speaker configuration. For example, after inspection of the data shown in Table 1 it would be reasonable to create a range of filters for X values in the range, say, 0.5 to 1.0 in 0.05 increments. These 11 filters would cover most situations.
- This is very convenient, because Microsoft's new Windows98 (trademark) operating system includes the provision to select about a dozen different loudspeaker set-ups. The present invention would fit into this system easily, allowing the user to specify (a) the separation between speakers, and (b) the distance from head to speaker centre-line, for example, and then the software could select the optimal transaural crosstalk filtering arrangements.
- In principle, as an alternative to the above method, it is possible to make A and S measurements at differing distances, say 1 metre, 0.9 metre, 0.8 metre and so on, and create different crosstalk filters for these differing distances and for different loudspeaker configurations. This would "build-in" the correct amount of transaural crosstalk cancellation. However, the same problems would exist in attempting to work out what exactly the low-frequency characteristics of A and S were. Also, as already noted above, such close measurements are compromised by the loudspeaker diaphragm dimensions which depart from point-source properties at these distances, and so it is not possible to make accurate measurements doser than about 0.8 metre.
- A further disadvantage of this alternative approach is that it would require many measurements at different distances and angles, and would result in quantised-distance effects: an optimum value could not be calculated and easily be provided for all loudspeaker configurations. The present invention allows both distance and angle parameters to be used to calculate a single crosstalk cancellation factor, from which an associated filter is selected, based on accurate, 1 metre measurement.
- The above description has been related to loudspeakers which lie in the horizontal plane of the listener: this has been for illustrative purposes only, and the invention is not limited horizontal-plane loudspeaker configurations. The principles described above are equally applicable to loudspeakers which do not lie on the horizontal plane, and the equations may be re-formatted accordingly.
- The sound intensity, I, in a specified direction in a medium is defined as the sound energy transmitted per unit area per unit time. This represents the energy in an imaginary column, c, in length and with a unit cross-section. It can be shown that:
where pRMS is the maximum pressure variation divided by the square root of two, and Z is the characteristic acoustic impedance of air, which is equal to the density of air times the velocity of sound in air.
(Note that intensity, I, is proportional to the square of RMS pressure amplitude.) - When sound is generated by a mechanical disturbance, the pressure fluctuations propagate away from the source in a spherical manner - the wavefront is just like an expanding "bubble". As the wave travels further and further from the source, the wavefront sphere increases in size, and hence its energy is spread over a larger surface area. Consequently, the energy density - and intensity - of the expanding wavefront diminishes.
- Imagine that, at a particular time, the expanding sphere is relatively small, having radius r,, such that I, represents the energy received per second from sound source s. Later in time, the wavefront has expanded to a larger sphere having radius r2, and intensity I2 at the surface. The total energy emanating from s is equal to the product of the area of the sphere and intensity at the surface of the sphere, and so, if no energy is lost:
This rearranges to the "inverse square" relationship, as follows.
A consequence of this is, that the intensity of a sound source is inversely proportional to the square of the distance from the source. Also, it is worth noting the following. - (1) In practise, there is no such thing as a point source of sound, and the relationship is generally used for extended sources at a distance.
- (2) Some energy is always lost because of friction in the medium, and so the sound intensity, I, falls off more rapidly than 1/r2.
Claims (8)
- A method of processing a binaural channel audio signal including left and right channels, the information in the channels representing a three dimensional sound-field for generation by respective left and right loudspeakers arranged at a distance from the preferred position of a listener in use, the method comprising:choosing a distance between said loudspeakers and said preferred position;determining, for each of the left and right channels, a near ear transfer function, S, and a far ear transfer function, A, that are equal at frequencies below 200 Hz; determining from the magnitude of this chosen distance a transaural crosstalk compensation factor, X, being a function of the chosen distance; andadjusting the near and/or far ear transfer functions so that they approach different respective values at frequencies below 200 Hz dictated by the transaural crosstalk compensation factor.
- A method as claimed in claim 1 further including choosing an angle between the left channel loudspeaker and the right channel loudspeaker as viewed from said preferred position, and determining from both said chosen angle and said chosen distance the transaural crosstalk compensation factor, said transaural crosstalk compensation factor being a function of both the chosen angle and the chosen distance.
- A method according to claim 2, further comprising determining a plurality of transaural crosstalk factors at a plurality of chosen distances and chosen angles, and selecting the most appropriate one for a particular loudspeaker configuration..
- A method according to any preceding claim, further comprising converting the crosstalk factor into dBs and wherein applying the crosstalk factor comprises using the dB value in a conventional crosstalk cancellation scheme.
- A method according to any preceding claim, further comprising determining a distance from a far ear to the left and right loudspeakers comprising two serial elements, a first element comprising a direct path from the loudspeaker tangentially to the head of the listener, and a second element comprising the circumferential path around the head.
- Transaurual crosstalk filter means being constructed and arranged for performing the method of any preceding claim.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9816059 | 1998-07-24 | ||
GB9816059A GB2340005B (en) | 1998-07-24 | 1998-07-24 | A method of processing a plural channel audio signal |
Publications (3)
Publication Number | Publication Date |
---|---|
EP0975201A2 EP0975201A2 (en) | 2000-01-26 |
EP0975201A3 EP0975201A3 (en) | 2005-06-08 |
EP0975201B1 true EP0975201B1 (en) | 2013-04-10 |
Family
ID=10836042
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP99305562.3A Expired - Lifetime EP0975201B1 (en) | 1998-07-24 | 1999-07-12 | A method of processing a plural channel audio signal |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP0975201B1 (en) |
JP (1) | JP2000059892A (en) |
GB (1) | GB2340005B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111587582A (en) * | 2017-10-18 | 2020-08-25 | Dts公司 | Audio signal preconditioning for 3D audio virtualization |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4835185B2 (en) * | 2006-02-08 | 2011-12-14 | ソニー株式会社 | Audio signal output apparatus and sound leakage reduction method |
US9094771B2 (en) | 2011-04-18 | 2015-07-28 | Dolby Laboratories Licensing Corporation | Method and system for upmixing audio to generate 3D audio |
KR101687493B1 (en) * | 2015-08-12 | 2016-12-16 | 연세대학교 산학협력단 | Method and Device for transmitting signal in FTN system |
CN112840671A (en) * | 2018-11-27 | 2021-05-25 | 深圳市欢太科技有限公司 | Stereo playing method, device, storage medium and electronic equipment |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0730812A1 (en) * | 1993-11-25 | 1996-09-11 | Central Research Laboratories Limited | Apparatus for processing binaural signals |
WO1998020707A1 (en) * | 1996-11-01 | 1998-05-14 | Central Research Laboratories Limited | Stereo sound expander |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5832840B2 (en) * | 1977-09-10 | 1983-07-15 | 日本ビクター株式会社 | 3D sound field expansion device |
JPH07105999B2 (en) * | 1990-10-11 | 1995-11-13 | ヤマハ株式会社 | Sound image localization device |
JPH10108300A (en) * | 1996-09-27 | 1998-04-24 | Yamaha Corp | Sound field reproduction device |
GB2334867A (en) * | 1998-02-25 | 1999-09-01 | Steels Elizabeth Anne | Spatial localisation of sound |
-
1998
- 1998-07-24 GB GB9816059A patent/GB2340005B/en not_active Expired - Lifetime
-
1999
- 1999-07-12 EP EP99305562.3A patent/EP0975201B1/en not_active Expired - Lifetime
- 1999-07-26 JP JP11210550A patent/JP2000059892A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0730812A1 (en) * | 1993-11-25 | 1996-09-11 | Central Research Laboratories Limited | Apparatus for processing binaural signals |
WO1998020707A1 (en) * | 1996-11-01 | 1998-05-14 | Central Research Laboratories Limited | Stereo sound expander |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111587582A (en) * | 2017-10-18 | 2020-08-25 | Dts公司 | Audio signal preconditioning for 3D audio virtualization |
CN111587582B (en) * | 2017-10-18 | 2022-09-02 | Dts公司 | System, method, and storage medium for audio signal preconditioning for 3D audio virtualization |
Also Published As
Publication number | Publication date |
---|---|
GB9816059D0 (en) | 1998-09-23 |
EP0975201A3 (en) | 2005-06-08 |
GB2340005A (en) | 2000-02-09 |
EP0975201A2 (en) | 2000-01-26 |
GB2340005B (en) | 2003-03-19 |
JP2000059892A (en) | 2000-02-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6643375B1 (en) | Method of processing a plural channel audio signal | |
Schärer et al. | Evaluation of equalization methods for binaural signals | |
Best et al. | The role of high frequencies in speech localization | |
US4739513A (en) | Method and apparatus for measuring and correcting acoustic characteristic in sound field | |
CN103222187B (en) | For being eliminated by the non-staining optimization crosstalk of the frequency spectrum of the audio frequency of speaker | |
Hartmann et al. | Transaural experiments and a revised duplex theory for the localization of low-frequency tones | |
US3920904A (en) | Method and apparatus for imparting to headphones the sound-reproducing characteristics of loudspeakers | |
US6424719B1 (en) | Acoustic crosstalk cancellation system | |
Yokoyama et al. | 6-channel recording/reproduction system for 3-dimensional auralization of sound fields | |
Akeroyd et al. | The binaural performance of a cross-talk cancellation system with matched or mismatched setup and playback acoustics | |
WO2002001916A2 (en) | Sound reproduction systems | |
Yost | Sound source localization identification accuracy: Envelope dependencies | |
EP0975201B1 (en) | A method of processing a plural channel audio signal | |
CN111556425A (en) | Tone equalization method for virtual sound reproduction of loudspeaker | |
Vorländer | Acoustic load on the ear caused by headphones | |
Lee et al. | HRTF measurement for accurate sound localization cues | |
Johnson et al. | Perceptual threshold of apparent source width in relation to the azimuth of a single reflection | |
US20240236613A1 (en) | A method, device, storage medium, and headphones of headphone virtual spatial sound playback | |
CN111970612B (en) | Method and system for eliminating crosstalk of bone conduction earphone | |
US11044552B2 (en) | Acoustic radiation control method and system | |
US3670842A (en) | Loudspeakers | |
Takeuchi et al. | Extension of the optimal source distribution for binaural sound reproduction | |
Vidal et al. | HRTF measurements of five dummy heads at two distances | |
WO2021041140A1 (en) | Headphone device for reproducing three-dimensional sound therein, and associated method | |
Liu et al. | A High-Frequency--Band Timbre Equalization Method for Transaural Reproduction With Two Frontal Loudspeakers |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: CREATIVE TECHNOLOGY LTD. |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK RO SI |
|
AKX | Designation fees paid |
Designated state(s): DE FR GB NL |
|
17P | Request for examination filed |
Effective date: 20060207 |
|
17Q | First examination report despatched |
Effective date: 20060717 |
|
APBK | Appeal reference recorded |
Free format text: ORIGINAL CODE: EPIDOSNREFNE |
|
APBN | Date of receipt of notice of appeal recorded |
Free format text: ORIGINAL CODE: EPIDOSNNOA2E |
|
APBR | Date of receipt of statement of grounds of appeal recorded |
Free format text: ORIGINAL CODE: EPIDOSNNOA3E |
|
APBV | Interlocutory revision of appeal recorded |
Free format text: ORIGINAL CODE: EPIDOSNIRAPE |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE FR GB NL |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 69944696 Country of ref document: DE Effective date: 20130606 |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: T3 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20140113 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 69944696 Country of ref document: DE Effective date: 20140113 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 18 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 19 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: NL Payment date: 20170726 Year of fee payment: 19 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20180726 Year of fee payment: 20 Ref country code: DE Payment date: 20180727 Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20180727 Year of fee payment: 20 |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: MM Effective date: 20180801 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20180801 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R071 Ref document number: 69944696 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: PE20 Expiry date: 20190711 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION Effective date: 20190711 |