AU621655B2 - Sound imaging method and apparatus - Google Patents

Sound imaging method and apparatus Download PDF

Info

Publication number
AU621655B2
AU621655B2 AU41000/89A AU4100089A AU621655B2 AU 621655 B2 AU621655 B2 AU 621655B2 AU 41000/89 A AU41000/89 A AU 41000/89A AU 4100089 A AU4100089 A AU 4100089A AU 621655 B2 AU621655 B2 AU 621655B2
Authority
AU
Australia
Prior art keywords
sound
signals
phase
channel
amplitude
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU41000/89A
Other versions
AU4100089A (en
Inventor
John W Lees
Danny D Lowe
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Q Sound Ltd
Original Assignee
Q Sound Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to US07/239,981 priority Critical patent/US5046097A/en
Priority to US239981 priority
Priority to US39898889A priority
Priority to US398988 priority
Application filed by Q Sound Ltd filed Critical Q Sound Ltd
Publication of AU4100089A publication Critical patent/AU4100089A/en
Application granted granted Critical
Publication of AU621655B2 publication Critical patent/AU621655B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/02Spatial or constructional arrangements of loudspeakers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S5/00Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/40Visual indication of stereophonic sound image

Description

L
IU: InM LVUlllMibUINLRE ur r-/11L1J
AUSTRALIA
1586W/GMM I ;i- _:r ct: ~ir: i: I I II I i' S F Ref: 106020 FORM COMMONWEALTH OF AUSTRALIA PATENTS ACT 1952 COMPLETE SPECIFICATION 621655
(ORIGINAL)
FOR OFFICE USE: Class Int Class 9* q 9 Complete Specification Lodged: Accepted: Published: Priority: Related Art: a.
S
U
Name and Address of Applicant: Address for Service: Q Sound Ltd 2748 37th Avenue Calgary Alberta T2P 3M7
CANADA
Spruson Ferguson, Patent Attorneys Level 33 St Martins Tower, 31 Market Street Sydney, New South Wales, 2000, Australia 6 09 Complete Specification for the invention entitled: Sound Imaging Method and Apparatus The following statement is a full description of this invention, including the best method of performing it known to me/us 5845/5 ii p: I- I 3 52 07 ABSTRACT OF THE DISCLOSURE The illusion of distinct sound sources distributed throughout the three-dimensional space containing the listener is possible using only conventional stereo playback equipment by processing monaural sound signals prior to playback on two spaced-apart transducers. A plurality of such processed signals corresponding to different sound source positions may be mixed using conventional techniques without disturbing the positions of the individual images.
Although two loudspeakers are required the sound produced is not conventional stereo, however, each channel of a left/right stereo signal can be separately processed *fee according to the invention and then combined for playback.
The sound processing involves dividing each monaural or single channel signal into two signals and then adjusting the differential phase and amplitude of the two channel signals on a frequency dependent basis in accordance with an empirically derived transfer function that has a specific phase and amplitude adjustment for each predetermined frequency interval over the audio spectrum. Each transfer function is empirically derived to relate to a different sound source location and by providing a number of different transfer functions and selecting them accordingly the soun~ source can be made to appear to move.
!V
1I i 35207 SUMMARY OF THE INVENTION Two ordinary, spaced-apart loudspeakers can produce a sound image that appears to the listener to be emanating from a location other than the actual location of the loudspeakers. The sound signals are processed according to this invention before they are reproduced so that no special playback equipment is required. Although two loudspeakers are required the sound produced is not the same as conventional stereophonic, left and right, sound however, stereo signals can be processed and improved according to this invention. The inventive round processing involves g dividing each monaural or single channel signal into two o signals and then adjusting the differential phase and
C..
oe •amplitude of the two channel signals on a frequency dependent basis in accordance with an empirically derived g• transfer function. The results of this is processing is that the apparent sound source location can be placed as desired, provided that the transfer function is properly derived. Each transfer function has an empirically derived *.phase and amplitude adjustment that is built-up for each predetermined frequency interval over the entire audio spectrum and provides for a separate sound source location.
"By providing a suitable number of different transfer i functions and selecting them accordingly the sound source can appear to the listener to move. The transfer function can be implemented by analog circuit components or the monaural signal can be digitalized and digital filters and the like employed.
-IR i i' 35207 BACKGROUND OF THE INVENTION Field of the Invention This invention relates generally to a method and -apparatus for processing an audio signal and, more particularly, to procLcssing an audio signal so that the resultant sounds appear to the listener to emanate from a location other than the actual location of the loudspeakers.
Human listeners are readily able to estimate the direction and range of a sound source. When multiple sound sources are distributed in space around the listener, the position cf each may be perceived independently and simultaneously. Despite substantial and continuing research over many years, no satisfactory theory has yet been ****developed to account for all of the perceptual abilities of the average listener.
0 process that measures the pressure or velocity 0:004:of a sound wave at a single point, and reproduces that sound 0000 effectively at a single point, will preserve the intelligibility of speech and much of the identity of music.
Nevertheless, such a system removes all of the information :0.0 needed to locate the sound in space. Thus, an orchestra, sees. reproduced by such a system, is perceived as if all *0:0 instruments were playing at the single point of 00.
reproduction.
Efforts were therefore directed to preserving the directional cues contained inherently in the sounds during transmission or recording and reproduction. In U.S. Patent 2,093,540 issued to Alan D. Blumlein in September 1937 1 1 35207 substantial detail for such a two-channel system is given.
The artificial emphasis of the difference between the stereo channels as a means of broadening the stereo image, which is the basis of many present stereo sound enhancement techniques, is described in detail.
Some known stereo enhancement systems rely on cross-coupling the stereo channels in one way or another, to emphasis the existing cues to spatial location contained in a stereo recording. Cross-coupling and its counterpart crosstalk cancellation both rely on the geometry of the loudspeakers and listening area and so must be individually adjusted for each c;.se.
It is clear that attempted. refinements of the •stereo system have not produced great improvement in the se* .Go* systems now in widespread use for entertainment. Real 0*00 listeners like to sit at ease, move or turn their heads, and oooo• S* place their loudspeakers to suit the convenience of room o •o fee* layout and to fit in with other furniture.
OBJECT AND SUMMARY OF THE INVENTION 0 Thus, it is an object of the present invention to provide a method and apparatus for processing an audio signal so that when it is reproduced over two audio transducers the apparent location of the sound source can be e: suitably controlled, so that it seems to the listener that the location of the sound source is separated from the location of the transducers or speakers.
-2- 1-I A J r, -2a- According to one aspect of the present invention there is disclosed a method for producing and locating an apparent origin of a selected sound from an electrical signal corresponding to the selected sound in a predetermined and localized position anywhere within the three-dimensional space containing a listener, said method comprising the steps of: separating said electrical signal into respective first and second channel signals; altering the amplitude and shifting the phase of both said first and second channel signals for successive discrete frequency bands across the audio spectrum and each successive phase shift being different than the preceding phase shift relative to zero degrees and both on a predetermined frequency dependent basis in accordance with an empirically derived transfer function, thereby producing a first channel and a second S 15 channel modified signal and creating a differential phase and amplitude between the two modified channel signals; maintaining the first channel modified signal separate and apart from the second channel modified signal following the step of altering the amplitude and shifting the phase; and 20 respectively applying said first and second channel modified signals to first and second sound transducer means located within the three-dimensional space and spaced apart from the listener to produce a i• o sound apparently originating at a predetermined location In the three-dimensional space that may be different from the location of said 25 sound transducer means.
According to another aspect of the present invention there is disclosed a system for conditioning a signal for producing and locating, using two transducers located in free space, an auditory sensory illusion of an apparent origin for a least one selected sound at a predetermined localized position within the three-dimensional space containing a listener from an electrical signal corresponding to the selected sound, said system comprising: first and second channel means both receiving the electrical signal, said first and second channel means including respective first and second sound processor means each for altering the amplitude and shifting the phase angle of the respective electrical signals on a frequency dependent basis in accordance with empirically derived transfer functions for successive discrete frequency intervals Sang/0614y r -2bacross the audio spectrum to produce respective mclified signals therefrom, wherein the amplitude alteration differential and phase angle shift differential occurring between the two channels are respective predetermined values for each frequency interval of the audio spectrum, said sound processor means shifting the phase angle such that each successive phase angle shift is different and independent of a preceding phase angle shift relative to zero degrees, and said first and second channels being maintained separate and apart before fed to the two transducers.
115 amg/0614y ILmg I 35207 The present invention is based on the discovery that audio reproduction of a monaural using two independent channels and two loudspeakers can produce highly localized images of great clarity in different positions. Observation of this phenomenon by the inventors, under specialized conditions in a recording studio, led to systematic investigations of the conditions required to produce this audio illusion. Some years of work have produced a substantial understanding of the effect, and the ability to reproduce it consistently and at will.
According to theApresent invention, an auditory illusion is produced that is characterized by placing a sound source anywhere in the three-dimensional space *toth surrounding the listener, without constraints imposed by t loudspeaker positions. Multiple images, of independent OeS* r b*O "sources and in independent positions, without known limit to their number, may be reproduced simultaneously using the same two channels. Reproduction requires no more than two independent channels and two loudspeakers and separation distance or rotation of the loudspeakers may be varied o*°Sb within broad limits without destroying the illusion.
Rotation of the listener's head in any plane, for example to to "look at" the image, does not disturb the image.
The processing of audio signals in accordance with theApresent invention is characterized by processing a single channel audio signal to produce a two-channel signal wherein the differential phase and amplitude between the two signals is adjusted on a frequency dependent basis over the -3-
'II
35207 entire audio spectrur. This processing is carried out by dividing the monaural input signal into two signals and then passing one or both of such signals through a transfer function whose amplitude and phase are, in general, non-uniform functions of frequency. The transfer function may involve signal inversion and frequency-dependent delay.
Furthermore, to the bet knowledge of the inventors the transfer functions used in the inventive processing are not derivable from any presently known theory. They must be characterized by empirical means. Each processing transfer tunction places an image in a single position which is determined by the characteristics of the transfer function.
Thus, sound source position is uniquely determined by the transmission function.
*see ~For a given position there may exist a number of ee..
eo different transfer functions, each of which will suffice to place the image generally at the specified position.
If i moving image is required, it may be produced 968o by smoothly changing from one transfer function to another in succession. Thus, a suitably flexible implementation of the process need not be confined to the production of static ee 6 images.
so p~e~e e~\ooc~wnev j7 :.Audio signals processed according to the present invention may be reproduced directly after processing, or be recorded by conventional stereo recording techniques on 000000 S various media such as optical disc, magnetic tape, phono record or optical sound track, or transmitted by any iJL 4 conventional stereo transmission technique such as radio or ~L -1 S I 35207 cable, without any adverse effects on the auditory image provided by theXinvention.
The imaging process of theApresent invention may be also applied recursively. For example, if each channel of a conventional stereo signal is treated as a monophonic signal, and the channels are imaged to two different positions in the listener's space, a complete conventional stereo image along the line joining the positions of the images of the channels will be perceived. In addition, at the time the stereo record or disc is being recorded on multitrack tape, having for example twenty-four channels, each channel can be fed through a transfer function processor so that the recording engineer can locate the various instruments and voices at will to create a see* specialized sound stage. The result of this is still two-channel audio signals that can be played back on conventional reproducing equipment, but that will contain the inventive auditory imaging capability.
BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 is a plan view. representation of a listening geometry for defining parameters of image
S
u* location; o o. S "Fig. 2 is a side view corresponding to Fig, 1; Fig. 3 is a plan view representation of, listening geometry for defining parameters of listener location;
A
I i 35207 Fig. 4 is an elevational view corresponding to Fig. 4; Figs. 5a-5k are plan views of respective listening Isituations with corresponding variations in loudspeaker placement and Fig. 5m is a table of critical dimensions for three listening rooms; Fig. 6 is a plan view of an image transfer experiment carried out in two isolated rooms; Fig. 7 is a process block diagram relating the present invention to prior art practice; Fig. 8 is a schematic in block diagram form of a sound imaging system according to an embodiment of the present invention; Fig. 9 is a pictorial representation of an *operator workstation according to an embodiment of the present invention; .:Fig. 10 depicts a computer-graphic perspective display used in controlling the present invention; Fig. 11 depicts a computer-graphic display of three orthogonal views used in controlling the present ::ilvention; 12 is a schema tic representation of the formation of virtual sound sources by the present invention,A showing a plan view of three isolated rooms; Fig. 13 is a schematic in block diagram form of equipment for demonstrating the present invention; Fig. 14 is a waveform diagram of a test signal plotted as voltage against time; -6-
:A
J
*fee 0
OSSS
eese e g.
o S 0e**
S
e g..
Se
S
C
S.
S 0
S.
*SSO
0 35207 Fig. 15 tabulates data representing a transfer function according to an embodiment of the present invention; Fig. 16 is a schematic in block diagram form of a sound image location system according to an embodiment of the present invention; Figs. 17A and 17B are graphical representations of typical transfer functions employed in the sound processors of Fig. 16; Fig. 18A-18C are schematic block diagrams of a circuit embodying the present invention; and Fig. 19 is a schematic block diagram of additional circuitry which further embodies the present invention, DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS In order to define terms that will allow an unambiguous dersription of the auditory imaging process according to the present invention, Figs. 1-4 show some dimensions and angles involved.
Fig. 1 is a plan view of a stereo listening situation, showing left and right loudspeakers 101 and 102, respectively, a listener 103, and a sound image position 104 that is apparent to listener 103. For purposes of definition only, the listener is shown situated on a line 105 perpendicular to a line 106 joining loudspeakers 101 and 102, and erected at the midpoint of line 106. This listener position will be referred to as the reference listener position, but with this invention the listener is not
II
S
-7-
,I
'1
A
altering the amplitude and shifting the phase; and respectively applying said first and second channel modified signals to first and second sound transducer means located within the three-dimensional space and spaced apart from the listener to produce a .12 _r 1 I
I
35207 confined to this position. From the reference listener position an image azimuth angle is measured counterclockwise from line 105 to a line 107 between !)stener 103 and image position 104. Similarly, the image slant range is defined as the distance from listener 103 to image position 104. This range is the true range measured in three-dimensional space, not the projected range as measured on the plan or other orthogonal view.
In the present invention the possibility arises of images substantially out of the plane of the speakers.
Accordingly, in Fig. 2 an altitude angle for the image is defined. A listener position 201 corresponds with position 103 and an image position 202 corresponds with image position 104 in Fig. 1. Image a.titude angle is measured upwardly from a horizontal line 203 through the head of listener 103 to a line 204 joining the listener's head to image position 202. It should be noted that loudspeakers 101, 102 do not necessarily lie on line 203.
Having defined th image positional parameters with respect to a reference listening configuration, we proceed "to define parameters for possible variations in the listening configuration. Referring to Fig. 3, loudspeakers 301 and 302, and lines 304 and 305 correspond respectively to items 101, 102, 106, and 105 in Fig. 1. A loudspeaker spacing distance is measured along line 304, and a listener distance is measured along line 305. In the case that a listener is arranged parallel to line 304 along line 306 to position 307, we define a lateral displacement I1 -8- IA T" ii 35207 measured along line 306. For each loudspeaker 301 and 302 we define respective azimuth angles and as measured counterclockwise from a line through loudspeakers 301, 302 and perpendicular to a line joining them, in a direction toward the listener. Similarly for the listener we define an azimuth angle countercloqkwise from line 305 in the direction the listener is facing.
In Fig. 4, a loudspeaker height is measured upward from the horizontal line 401 through the head of the listener 303 to the vertical centerline of loudspeaker 302.
The parameters as defined allow more than one description of a given geometry. For example, an image position-may be described as (180,0,x) or (0,180,x) with .cop eIn conventional stereophonic reproduction the *commensurate with or An image may be formed very l close to the listener, at a small fraction of or remote at a distance several times and may simultaneously be I 1 1 at any azimuth angle without reference to the azimuth i angle subtended by the loudspeakers. In addition, the present invention is capable of image placement at any altitude ianrngle n e is tner distance may vary from 0.5m to 30m tor beyond with the image apparently static in space during the variatical c trlonin ofr The parameters as defined allow more than one position ay be described as (180,0,x) or (0,180,x with complete equivalence t prsn In conventional stereophonic reproduction the n I image is confined to lie along line 106 in Fg, whereas commensurate with or An image may be formed very angle subtended by the loudspeakers In addition, the 5845/5 35207 Good image formation has ben achieved with loudspeaker spacings from 0.2m to Bin, using the same signals Ato drive the loudspeakers from all spacings. Azimuth angles at the loudspeakers and may be varied independently over a broad range with no effect on the image.
It is characteristic of this invention that moderate changes in loudspeaker height do not affect the image altitude angle perceived by the listener. This is true for both positive and negative values of that is to say loudspeaker placement above or below the listener's head height.
Since the image formed is extremely realistic, it is natural for the listener to turn to "look at" that is to sees face directly toward, the image. The image remains stable Soso 0 as this is done; listener azimuth angle (in) has no 0Sperceptible effect on thol spatial poito ofteiae or *see.
seeS at least a range of angles (in) from +120 to -120 degrees.
So strong is the impression of a localjized sound source that listeners have no difficulty in "looking at" or pointing to so 0 th image; a group of listeners will report the same image position.
Figs. 5a-5k shows a set of ten listening geometries in which image stability has been tested. In Fig. 5a, a plin view of a listening geometry is shown. Left and right loudspeakers 501 and 502 respectively roproduced sound for listener 503f~ producing a sound image 504.
Sub-figures 5a through 5k show variations in loudspeaker orientation, and are generally similar to sub-figure
-I-
II
t i l t 3 I 35207 All ten geometries were tested in three different listening rooms with different values of loudspeaker spacing and listener distance as tabulated in figure Room 1 was a small studio control area containing considerable amounts of equipment, room 2 as a large recording studio almost competely empty, and room 3 was a small experimental room with sound absorbing material on three walls.
For each test the listener was asked to give the perceived image position for two conditions; listener head angle zero, and head turned to face the apparent image position. Each test was repeated with three different fes* 0" listeners. Thus, the image stability was tested in a total of 180 configurations. Each of these 180 configurations ooo• oooo i used the same input signals to the loudspeakers. In every case the image azimuth angle was perceived as .ooo degrees.
In Fig. 6 an image transfer experiment is shown in which a sound image 601 is formed by signals processed SI. according to the present invention, driving loudspeakers 602 and 603 in a first room 604. A dummy head 605, such as shown for instance in German Patent 1 927 401, carries left and right microphones 606 and 607 in its model ears.
S Electrical signals on lines 608 and 609 from microphones 606, 607 are separately amplified by amplifiers 610 and 611, which drive left and right loudspeakers 612 and 613, respectively, in a second room 614. A listener 615 situated in this second room, which is acoustically isolated from the -11- ji ii
F
the like employed. i i: 35207 I first room, will perceive a sharp secondary image 616 corresponding to the image 601 in the first room.
An example of the relationship of the inventive sound processor to known systems is shown in Fig. 7, in which one or more multi-track signal sources 701, which may be magnetic tape replay machines, feed a plurality of monophonic signals 702 derived from a plurality of sources to a studio mixing console 703. The console may be used to modify the signals, for instance by changing levels and balancing frequency content, in any desired ways.
A plurality of modified monophonic signals 704 produced by console 703 are connected to the inputs of an 0*04 V image processing system 705 according to the present *i66 invention. Within this system each input channel is assigned to an image position, and transfer function processing is applied to produce two-channel signals from 0S 6 each single input signal 704. All of the two-channel signals are mixed to produce a final pair of signals 706, 707, which may then be returned to a mixing console 708. It should be understood that the two-channel signals produced by this invention are not really left and right stereo signals, however, such connotation provides an easy way of referring to these signals. Thus, when all of the two-channel signals are mixed, all of the left signals are combined into one signal and all of the right signals are combined into one signal. In practice, console 703 and console 708 may be separate sections of the same console.
Using console facilities, the processed signals may be -12ii 2,093,540 issued to Alan D. Blumlein in September 1937 -1~1
II
~ii 35207 applied to drive loudspeakers 709, 710 for monitoring purposes. After any required modification and level setting, master stereo signals 711 and 712 are led to master stereo recorder 713, which may be a two-channel magnetic tape recorder. Items subsequent to item 705 are well known in the prior art.
Sound image processing system 705 is shown in more detail in Fig. 8, in which input signals 801 correspond to signals 704 and output signals 807, 808 correspond respectively to signals 711, 712 of Fig. 7. Each monaural input signal 801 is fed to an individual signal processor 802.
.These processors 802 operate independently, with "no intercoupling of audio signals. Each signal processor "operates to produce the two-channel signals having ooooo differential phase and amplitude adjusted on a frequency 0000 dependent basis. These transfer functions will be explained in detail below. The transfer functions, which may be described in the time domain as real impulse responses or equivalently in the frequency domain as complex frequency responses or amplitude and phase responses, characterize only the desired image position to which the input signal is to be projected.
One or more processed signal pairs 803 produced by the signal processors are applied to the inp\lts of stereo mixer 804. Some or all of them may also be applied to the inputs of a storage system 805. This system is capable of storing complete processed stereo audio signals, and of -13- 1! location of the transducers or speakers.
-2- 35207 replaying them simultaneously to appear at outputs 806.
Typically this storage system amy have different numbers of input channel pairs and output channel pairs. A plurality of outputs 806 from the storage system are applied to further inputs of stereo mixer 804. Stereo mixer 804 sums all left inputs to produce left output 807, and all right inputs to produce right output 808, possibly modifying the amplitude of each input before summing. No interaction or coupling of left and right channels takes place in the mixer.
A human operator 809 may control operation of the system via human interface means 810 to specify the desired S...i image position to be assigned to each input channel.
It may be particularly advantageous to implement S.o.
signal processors 802 digitally, so that no limitation is
S'
placed on the position, trajectory, or speed of motion of an o.o.
image. These digital sound processors that provide the necessary differential adjustment of phase and amplitude on a frequency dependent basis will be explained in more detail below. In such a digital implementation it may not always be economic to provide for signal processing to occur in real time, though such operation is entirely feasible. If Areal-time signal processing is not provided, outputs 803 S would be connected to storage system 805, which would be capable of slow recording and real-time replay. Conversely, if an adequate number of real-time signal processors 802 are provided, storage system 805 may be omitted.
-14- _L I_ I i signals on a frequency dependent basis in accordance wi l l a l. derived transfer functions for successive discrete frequency intervals ang/0614y 1 l li -l--MMi--I^MIi"- 1 35207 In Fig. 9, operator 901 controls mixing console 902 equipped with left and right stereo monitor loudspeakers 903, 904. Although stability of the final processed image is good to a loudspeaker spacing as low as 0.2m, it is preferable for the mixing operator to be provided with loudspeakers placed at least 0.5m apart. With such spacing, accurate image placement is more readily achieved. A computer graphic display means 905, a multi-axis control 906, and a keyboard 907 are provided, along with suitable computing and storage facilities to support them.
Computer graphic display means 905 may provide a graphic representation of the position or trajectory of the 6 image in space as shown, for example, in Figs. 10 and 11.
S
Fig. 10 shows a display 1001 of a listening situation in which a typical listener 1002 and an image trajectory 1003 are presented, along with a representation of a motion picture screen 1004 and perspective space cues 1005, 1006.
At the bottom of the display is a menu 1007 of items relating to the particular section of sound track being operated upon, including recording, time s.e synchronization, and editing'information. Menu items may be selected by keyboard 907, or by moving cursor 1008 to the item, using multi-axis control 906. The selected item can S be modified using keyboard 907, or toggled using a button on multi-axis control 906, invoking appropriate system action.
In particular, a menu item 1009 allows an operator to link the multi-axis control 906 by software to control the viewpoint from which the perspective view is projected, or ii V amg/0614y PC.Llq 3 k~uj i 1 i I n i 1 -r i r i. sees
C
so soeS Os*O
C
SeeS *eeee e e e 35207 to control the position/trajectory of the current sound image. Another menu item 1010 allows selection of an alternate display illustrated in Fig. 11.
In the display of Fig. 11 the virtually full-screen perspective presentation 1001 shown in Fig. is replaced by a set of three orthogonal views of the same scene; a top view 1101, a front view 1102, and a side view 1103. To aid in interpretation the remaining screen quadrant is occupied by a reduced and less detailed version 1104 of the perspective view 1001. Again a menu 1105, substantially similar to that shown at 1007 and with similar functions, occupies the bottom of the screen. One particular menu item 1106 allows toggling back to th display of Fig. In Fig. 12, sound sources 1201, 1202, and 1203 in a first room 1204 are detected by two microphones 1205 and 1206 that generate right and left stereo signals, respectively, that are recorded using conventional stereo recording equipment 1207. If replayed on conventional stereo replay equipment 1208, driving right and left loudspeakers 1209, 1210, respectively, with the signals originating from microphones 1205, 1206, conventional stereo images 1211, 1212, 1213 corresponding respectively to sources 1201, 1202, 1203 will be perceived by a listener 1214 in a second room 1215. These images will be at positions that are projections onto the line joining loudspeakers 1209, 1210 of the lateral positions of the sources relative to microphones 1205, 1206.
-16signals is adjusted on a frequency dependent basis over the -3- 35207 If the two pa-Lrs of stereo signals are processed and combined as detailed above using sound processor 1216.
and reproduced by conventional stereo playback equipment 1217 on right and left loudspeakers 1218, 1219 in a third room 1220, crisp spatially localized images of the sound sources are apparent to listener 1226 at positions unrelated to the actual positions of loudspeakers 1218, 1219. Let us suppose that the processing was such as to form an image of the original right channel signal at position 1224, and an image of the original left channel signal at 1225. Each of these images behaves as if it were truly a loudspeaker; we may think of the images as "virtual loudspeakers" *A transfer function in which both differential amplitude and phase of a two-channel signal are adjusted on a frequency dependent basis across the entire audio band is .required to project an image of a monaural audio signal to a son* given position. For general applications to specify each such response, the amplitude and phase differential at intervals not exceeding 40 Hz must be specified independently for each of the two channels over the entire audio spectrum, for best image stability and coherence. For applications not requiring high quality and sound image placement the frequency intervals may be expanded. Hence specification of such a response requires about 1000 real numbers (or equivalently, 500 complex ones). Differences for human perception of auditory spatial location are somewhat indefinite, being based on subjective measurement, but in a true three-dimensional space more than 1000 -17- SLJ.cn l u''i q L LI.QI1D1LL±L L U UY dlly conventional stereo transmission technique such as radio or v. ^y -4- 35207 distinct positions are resolvable by an average listener.
Exhaustive characterization of all responses for all possible positions therefore constitutes a vast body of data, comprising in all more than on million real numbers, the collection of which is in progress.
It should be noted that is the transfer function in the sound processor according to this invention, which provides the differential adjustment between the two channels, is build up piece-by-piece by trail and error t-ing over the audio spectrum for each 40 Hz interval.
Mo.-over, as will be explained below, each transfer function in the sound processor locates the sound relative to two c, o spaced-apart transducers at only one location, that is, one Sazimuth, height, and depth.
In practice, however, we need not represent all transfer function responses explicitly, as mirror-image symmetry generally exists between the right and left channels, If the responses modifying the channels are interchanged, the image azimuth angle is inverted, S. S whilst the altitude and range remain unchanged.
*I o. It is possible to demonstrate the inventive process and the auditory illusion using conventional equipment and by using simplified signals. If a burst of a sine wave at a known frequency is gated smoothly on and off at relatively long intervals, a very narrow and band of the frequency domain is occupied by the resulting signal.
Effectively, this signal will sample the required response at a single frequency. Hence the required responses, that -18- V a f i is, the transfer functions, reduce to simple control of differential amplitude and phase (or delay) between the left and right channels on a frequency dependent basis. Thus, it iswill be appreciated that the transfer function for a specifical sound placement can be built up empirically by making differential phase and amplitude adjustments for each selected frequency interval over the audio spectrum. By Fourier's theorem any signal may be represented as the sum of a series of sine waves, so the signal used is completely general.
An example, of a system for demonstrating the present invention is shown in Fig. 13, in which an audio synthesizer 1302, a Hewlett-Packard Multifunction Synthesizer model 8904A, is controlled by a computer 1301, Hewlett-Packard model 330M, to generate a monaural audio signal that is fed to the inputs 1303, 1304 of two channels of an audio delay line 1305, Eventide Precision Delay model PD860. From delay line 1305 the right channel signal passes to a switchable inverter 1306 and left and right signals i dthen pass through respective variable attentuators 1307, 1308 and hence to two power amplifiers 1309, 1310 driving left and right loidspeakers 1311, 1312, respectively.
Synthesizer 1302 produces smoothly gated sine wave bursts of any desired test frequency 1401, using an envelope as shown in Fig. 14. The sine wave is gated on using a first linear ramp 1402 of 20 ms duration, dwells at constant amplitude 1403 for 45 ms, and is then gated off using a *i P860 Fro deay lne 305the igh chanelsignl psse -19- Fig. 14 is a waveform diagram of a test signal plotted as voltage against time; 13! i :ii 35207 second linear ramp 1404 of 20 ms duration. Bursts are repeated at intervals 1405 of about 1-5 second.
In addition, using the system of Fig. 13 and the waveform of Fig. 14, the present invention can build up a transfer function over the audio spectrum by adjusting the time delay in delay line 1305 and the amplitude by attentuators 1307, 1308. A listener would make the adjustment, listen to the sound placement and determine if it was in the right location. If so, the next frequency interval would be examined. If not, then further adjustment are made and the listening process repeated. In this way the transfer function over the audio spectrum can be *see built-up.
Fig. 15 is a tale of practical data to be used to form a transfer function suitable to allow reproduction of auditory images well off the direction of the loudspeakers •lug for several sine wave frequencies. This table might be developed just as explained above, by trial and error 000S l listening. All of these images were found to be stable and repeatable in all three listening rooms detailed in Fig. for a broad rang of listener head attitudes including directly facing the image, and for a variety of listeners. i We may generalize the placement of narrowband signals, detailed above, in such a manner as to permit broadband signals, representing complicated sources such as speech and music, to be imaged. If the differential amplitudes and phase shifts for the two channels that are derived from a single input signal are specified for all i .7 position will be referred to as the reference listener position, but with this invention the listener is not -7- _iiii O N III III 35207 foe 0O0 o go o 0 0 *see *0 00
S.
S 0 s S frequencies though the audio band, the complete transfer function is specified. In practice, we need only explicitly specify the differential amplitudes and delays for a number of frequencies in the band of interest. Amplitudes and delays at any intermediate frequency, between those specified, may then be found by interpolation. If the frequencies at which the response is specified are not too widely spaced, and taking into account the smoothness or rate of change of the true response represented, the method of interpolation is not too critical.
In the table of Fig. 15, the amplitudes and delays are applied to the signal in each channel and this is shown generally in Fig. 16 in which a separate sound processor 1500, 1501 is provided. The single channel audio signal is fed in at 1502 and fed to both sound processors 1500, 1501 where the amplitude and phase are adjusted on a frequency dependent basis so that the differential at the left and right channel outputs 1503, 1504, respectively, is the correct amount that was empirically determined, as explained above. Tha control parameters fed in on line 1505 change the differential phase and amplitude adjustment so that the sound image can be at a different, desired location. For example, in a digital implementation the sound processors could be finite impulse response (FIR) filters whose coefficients are varied by the control parameter signal to provide different effective transfer functions.
The system of Fig. 16 can be simplified, as shown from the following analysis. Firstly, only the difference -21case tnat a listener is arrangea para±le- to ±uiixn u L amu.i line 306 to position 307, we define a lateral displacement -8- 1I 35207 or differential between the delays of the two channels is of interest. Suppose that the left and right channel delays are t(l) and t(r) respectively. New delays and t'(r) are defined by adding any fixed delay such that: t(1l) t(a) (1) t(r) t(a) (2) The result is that the entire effect is heard a time t(a) later, or earlier where t(a) is negative. This general expression holds in the special case where t(a) se Substituting: o* t(l) t(r) (3) t(r) t(r) 0 (4) By this transformation we can always reduce the delay in one channel to zero. In a practical implementation we must be careful to substract out the smaller delay, so that the need for a negative delay never arises. It may be preferred to avoid this problem by leaving a fixed residual delay in one channel, and changing the delay in the other. If the fixed residual delay is of sufficient magnitude, the variable delay need be nec .ive.
Secondly, we need not control channel amplitudes independently. It is a common operation in audio engineering to change the amplitudes of si;g als either by amplification or attenuation. So long as both stereo -22- U.tbm to .um or Deyona, win JT.1U jLrc1 Lt space during the variation.
i. :1 35207 channels are changed by the same ratio, there is no change in the positional information carried. It is the ratio or differential of amplitudes that is important and must be preserved. So long as this differential is preserved, all of the effects and illusions in this description are entirely independent of the overall sound level of reproduction. Accordingly, by an operation similar to that detailed above for timing or phase control, we may place all of the amplitude control in one channel, leaving the other at a fixed amplitude. Again, it may be convenience to apply a fixed residual attentuation to one channel, so that all S" required ratios are attainable by attenuation of the other.
Full control is then available using a variable attenuator *fee in one channel only.
"We may thus specify all the required information by specifying the differential attentuation and delay as functions of frequency for a single channel. A fixed, frequency-independent attentuation and delay may be specified for the second channel; if these are left unspecified, we assiimr unity gain and zero delay.
ca n Thus, for any one sound image position, and therefore any one left/right transfer function, the differential phase and amplitude adjusting (filtering) may be organized all in one channel or the other or any combination in between. One of sound processors 1500, 1501 can be simplified to no more than a variable impedance or to just a straight wire. It can not be an open circuit.
Assuming that the phase and amplitude adjusting is performed -23orientation, and are generally similar to sub-figure 35207 in only one channel to provide the necessary differential between the two channels the transfer functions would then be represented as in Figs. 17A and 17B.
Figs. 17A represents a typical transfer function for the differential phase of the two channels, wherein the left channel is unaltered and the right channel undergoes phase adjustment on a frequency dependent basis over the audio spectrum. Similarly, Fig. 17B represents generally a typical transfer function for the differential amplitude of the two channels, wherein the amplitude of the left channel is unaltered and the right channel undergoes attentuation on .s a a frequency dependent basis over the audio spectrum.
is It is appreciated that the sound positioners: 1500, 1501 of Fig. 16, for example, can be analog or digital and may include some or all of the following circuit *elements: filters, delays, inventors, summers, amplifiers, and phase shifters. These functional circuit elements can be organized in any fashion that results in the transfer l function.
Several equivalent representations of this information are possible, and are commonly used in related arts.
o For example, the delay may be specified as a phase change at any given frequency, using the equivalences: Phase (degrees) 360 x (delay time) x frequency Phase (radians) 2 x x (delay time) x frequency -24respectively, in a second room 614. A listener 615 situated i in this second room, which is acoustically isolated from the -11- 1 1 35207 Caution in applying this equivalence is required, because it is not sufficient to specify the principal value of phase; the full phase is required if the above equivalences are to hold.
A convenient representation commonly used in electronic engineering is the complex s-plane representation. All filter characteristics realizable using real analog components (any many that are not) may be specified as a ratio of two polynomials in the Laplace complex frequency variable s. The general form is: T(s) Ein(s) N(s) Eout(s) D(s) Where T(s) is the transfer function in the s plane, Ein(s) and Eout(s) are the input and output signals **0 respectively as functions of s, and the numerator and denominator functions N(s) and D(s) are of the form: N(s) a als a2s a3s3 a s (6) 1 2 3 n D(s) b bls b2 b b sn (7) o 1 2 3 s n The attraction of this notation is that it may be very compact. To specify the function completely at all frequencies, without need of interpolation, we need only specify the n+l coefficients a and the n+1 coefficients b.
With these coefficients specified, the amplitude and phase of the transfer function at any frequency may readily be I i l console 708 may be separate sections of the same console.
Using console facilities, the processed signals may be -12i r fV 35207 derived using well-known methods. A further attraction of this notation is that it is the form most readily derived from analysis of an analog circuit, and therefore, stands as the most natuxal, compact, and well-accepted method of specifying the transfer function of such a circuit.
Yet another representation convenient for use in describing the present invention is the z-plane representation. In the preferred embodiment of the present invention, the signal processor will be implemented as digital filters in order to obtain the advantage of flexibility. Since each image position may be defined by a I transfer function, we need a form of filter in which the transfer function may be readily and rapidly realized with a 0000 minimum of restrictions as to which functions may be 0•00 0: achieved. A fully programmable digital filter is appropriate to meet this requirement.
Such a digital filter may operate in the frequency domain, in which case, the signal is first Fourier .transformed to move it from a time domain representation to S94 a frequency domain one. The filter amplitude and phase o• '.response, determined by one of the above methods, is then applied to t:e frequency domain representation of the signal by complex multiplication. Finally, an inverse Fourier transform is applied, bringing the signal back to the time domain for digital to analog conversion.
Alternatively, we may specify the response directly in the time domain as a real impulse response.
This response is mathematically equivalent to the frequency -26i ~Ii [I 1. I:1 inputs of a storage system 805. This system is capau. i ,ui storing complete processed stereo audio signals, and of -13i I l* L77 35207 domain amplitude and phase response, and may be obtained from it by application of an inverse Fourier transform. We may apply this impulse response directly in the time domain by convolving it with the time domain representation of the signal. It may be demonstrated that the operation of convolution in the time domain is mathematically identical with the operation of multiplication in the frequency domain, so that the direct convolution is entirely equivalent to the frequency domain operation detailed in the preceding paragraph.
Since all digital computations are discrete rather than continuous, a discrete notation is preferred to a o. continuous one. It is convenient to specify the response directly in terms of the coefficients which will be applied in a recursive direct convolution digital filter, and this is readily done using a z-plane notation that parallels the s-plane notation. Thus, if T(z) is s time domain response equivalent to T(s) in the frequency domain: t-j d T(z) N(z) D(z) (8) Where N(z) and D(z) have the form: -1 -2 -n N(z) c 0 cl z c 2 z c z (9) D(z) d dlz d 2 z 2 d a 0 12 m -27i 1
'J
-14- I 3 35207 0
S
6 0e00 0 0O*@ *00* 006S 0* *0
SO
00 0* In this notation the coefficients c and d suffice to specify the function as the a and b coefficients did in the s-plane, so equal compactness is possible. The z-plane filter may be implemented directly if the operator z is interpreted such that -l z is a delay of n sampling intervals.
Then the specifying coefficients c and d are directly the multiplying coefficients in the implementation. We must restrict the specification to use only negative powers of z, since these corresponds to positive delays. A positive power of z would correspond to a negative delay, that is a response before a stimulus was applied.
With these notations in hand we may described equipment to allow placement of images of broad and sounds such as speech and music. For these purposes the sound processor of the present invention, for example, processor 802 of Fig. 8, may be embodied as a variable two-path analog filter with variable path coupling attenuators as in Fig.
18A.
In Fig. 18A, a monophonic or monaural input signal 1601 is input to two filters 1610, 1630 and also to two potentiometers 1651, 1652. The outputs from filters 1610, 1630 are connected to potentiometers 1653, 1654. The four potentiometers 1651-1654 are arranged as a so-called joystick control such that they act differentially. One joystick axis allows control of potentiometers 1651, 1652; -28-
I
i 1 d d ::4 the multi-axis control 906 by software to control the viewpoint from which the perspective view is projected, or na m j$ 1 35207 as one moves such as to pass a greater proportion of its input to its output, the other is mechanically reversed and passes a smaller proportion of its input to its output.
Potentiometers 1653, 1654 are similarly differentially operated on a second, independent joystick axis. Output signals from potentiometers 1653, 1654 are passed to unity gain buffers 1655, 1656 respectively, which in turn drive potentiometers 1657, 1658, respectiiely, that are coupled to act together; they increase or decrease the proportion of input passed to the output in step. The output signals from s.e potentiometers 1657, 1658 pass to a reversing switch 1659, ooo: which allows the filter signals to be fed directly or sees oee. interchanged, to first inputs of summing elements 1660, ooo o S 1670.
Each responsive summing element 1660, 1670 receives at its second input an output from potentiome 's 1651, 1652. Summing element 1670 drives inverter 1690, and switch 1691 allows selection of the direct or inverted signal to drive input 1684 of attenuator 1689. The output 0e *of attenuator 1689 is the so-called right-channel signal.
Similarly summing element 1660 drives inverter 1681, and switch 1682 allows selection of the direct or inverted signal at point 1683. Switch 1685 allows selection of the signal 1683 or the input signal 1601 as the drive to attenuator 1686 which produces left channel output 1688.
Filter 1610, 1630 are identical, and one is shown in detail in Fig. 18B. A unity gain buffer 1611 receives the input signal 1601 and is capacitively coupled via I -29iouaspeaers izuu, iziu or r-ne iazutLa-Lvriv l sources relative to microphones 1205, 1206.
-16- 2, 35207 capacitor 1612 to drive filter element 1613. Similar filter elements 1614 to 1618 are cascaded, and final filter element 1618 is coupled via capacitor 1619 and unity gain buffer 1620 to drive inverter 1621. Switch 1622 allows selection of either the output of buffer 1620 or of inverter 1621 at filter output 1623.
Filter elements 1613 through 1618 are identical and are shown in detail in Fig. 18C. They differ only in the value of their respective capacitor 1631. Input 1632 is connected to capacitor 1631 and resistor 1633 and resistor 1633 is coupled to the inverting input of operational amplifier 1634, output 1636 is the filter element output.
j O Feedback resistor 1635 is connected to operational amplifier 1634 in the conventional fashion. The non-inverting input of operational amplifier 1634 is driven from the junction of capacitor 1631 and one of resistors 1637 to 1642, as selected by switch 1643. This filter is an all-pass filter with a phase shift that varies with frequency according to the setting of switch 1643.
Table 1 lists the values of capacitor 1631 used in each filter element 1613-1618, and Table 2 lists the resistor values selected by switch 1642; these resistor values are the same for all filter elements 1613-1618.
One embodiment of summing elements 1660, 1670 is shown in Fig. 18D, in which two inputs 1661, 1662 for summing in operational amplifier 1663 result in a single output 1664. The gains from input to output are determined by the resistors 1665, 1667 and feedback resistor 1666. In Li s but in a true three-dimensional space more than 1000 -17- 35207 both cases input 1662 is d an from switch 1659, and input 1661 from joystick potentiometers 1651, 1652 respectively.
As examples of image placement, Table 3 shows settings and corresponding image positions to "fly" a sound image corresponding to a helicopter at positions well above the plane including the loudspeakers and the listener. To obtain the required monophonic signal for the process according to the present invention, the stereo tracks on the sound effects disc were summed. With the equipment shown set up as tabulated, realistic sound images are projected in space in such a manner that the listener perceives a *eee helicopter at the locations tabulated.
1 0 s e *ee C* :000: ee -31-
OO•
O O
OOO
-31- :j 1 r Effectiveiy, this signal will sample the required response at a single frequency. Hence the required responses, that -18- ~j.
Table).
352 07 Filter 1 2 3 4 5 6 Capacitor 163). 100 47 33 15 10 4.7 Value, nF Table 2
S
S
*0SO
S
S
S
OSSS
55 S* S
S.
S S 5.
S S @5
S
S
Switch 1642 1. 2 3 4 Position 4 Resistor Or 1637 1638 1.639 1640 1641 Resistor 4700 1000 470 390 120 -value, ohms -32cuttp±irucUe Lquj ror 4 ms, and is then gated ott using a -19- 35207 Table 3 Filter 1630 element 1 switch pas. Filter 1630 element 2 switch pos. 5 Filter 1630 element 3 switch pos. 5 Filter 1630 element 4 switch pos. 5 Filter 1630 element 5 switch pas. FiUlter 1630 inverting switch 1622 norm. norm.
Potentiometer 1652 ratio 0.046 0.054 Potentiometer 1654 ratio 0.90 0.76 .*Potentiometer 1658 ratio 0.77 0.77 Inverting switch 1691 position inv. inv.
Selector switch 1685 position 1601 1601 Output attenuator 1686 ratio 0.23 0.23 0O iOutput attenuator 1687 ratio 1.0 too Image azimuth a, degrees -45 Ii.r13 nerigsic 62 o o *Image altitude b, degrees +21 +17 Image range r remote remote Note to table 3: setting of reversing switch 1659 in both cases is such that signals from element 1657 drive element 1660, and those from element 1658 drive element 1670.
OoOU( -33derived from a single input signal are specified for all i however, that this is not essential to the creation of images. The extra elements are shown inFig. 19, in which i 0 0 0 35207 I By addition of two extra elements to the aboveindependent of frequency. They may circuits, a b extra facility for lateral shifting of the listening area is providedl It should be understood, however, that this is not essential to the creation ofwe can images. The extra elements are shown in Fig. 19, in whichll left and right signals 1701, 1702 may be supplied from the outputs 1688, 1689 respectively of the signal processor of Fig. 16. In each channel a delay 1703, 1704 respectively is t(inserted, and the output signals from the delays 1703, 1704 become the sound processor outputs 1705, 1706.
If t(d) is zero, thThe delays introduced into the channels by thisessentially unaffected by the additional equipment are independent of frequency. They may thus each be completely characterized by a single real number. Let the left channel delay be and the right channel delay As in the above case, only the posdifferential between the delays is significant, and we canced completely control the equipment by specifying the3. A difference between the delays. In implementation, we will add a fixed delay to each channel to ensure that at least no negative delay is required to achieve the required ifferential. Defining a differential delay t(d) as: t(d) t(r) t(l) (11) If t(d) is zero, the effects produced will be essentially unaffected by the additional equipment. .If t(d) is positive, the center of'the listening area will be displaced laterally to the right along dimension of Fig. 3. A -34- ^i Tne system or rig. ±o uian ue.= e from the following analysis. Firstly, only the difference -21i 35207 positive value of t(d) will correspond to a positive value of signifying ri Thtward displacement. Similarly, a leftward displacement, corresponding to a negative value of may be obtained by a negative value of By this method the entire listening area, in which listeners perceive the illusion, may be projected laterally to any point between or beyond the loudspeakers. It is readily possible for dimension to exceed half of dimension and good results have been obtained out to extreme shifts at which dimension is 83% of dimension This may not be the limit of the technique, but represents the limit of *cen current experimentation.
0*
*OSO
S S S S S

Claims (4)

1. A method for producing and locating an apparent origin of a selected soui;d ,rom an electrical signal corresponding to the selected sound in a predetermined and localized position anywhere wifhin the three-dimensional space containing a listener, said methoo .mprislng the steps of: separating said electrical signal into respective first and second channel signals; altering the amplitude and shifting the phase of both said first and second channel signals for successive discrete frequency bands across the audio spectrum and each successive phase shift being different than the preceding phase shift relative to zero degrees and both on a predetermined frequency dependent basis in accordance with an empirically derived transfer function, thereby producing a first channel and a second channel modified signal and creating a differential phase and amplitude between the two modified channel signals; apart from the second channel modified signal following the step of altering the amplitude and shifting the phase; and respectively applying said first and second channel modifieds signals to first and second sound transducer means located within the thanthree-dimensional space and spaced apart from the listener to produce a presound apparently originating at a predetermined location in the three-dimensional space that may be different from the location of said sound transducer means.
2. The method of claim I further including the step of applying said first and second channel signals to respective all pass filters, each said filter having a predetermined frequency response and topology as characterized by the empirically derived transfer function T(s) for the Laplace complex frequency variable(s).
3. The method of claim 2 wherein the step of applying said signals to respective all pass filters includes the further step of applying each of said signals to a cascaded series of filters
42. The method of claim 1 further including the step of storing said first and second channel signals and mod fie signals derived therefrom in a medium capable of regenerating said stored signals at a subsequent chara selected time. S g/0614y 0 '4^ L Phase (radians) 2 x x (delay time) x frequency -24- i il -37- The method of claim 1 wherein the step of altering the amplitude and shifting the phase includes respectively passing said first and second channel signals through first and second sound processors having respective predetermined phase transfer functions that were empirically derived to effect said differential phase shift, whereby phase is shifted on a frequency dependent basis across the audio spectrum and in which each phase shift is different than the preceding phase shift, and predetermined amplitude transfer functions that were empirically derived to effect said differential amplitude alteration. 5, The method of claim 4, wherein the predetermined phase and amplitude transfer functions are constructed on a frequency dependent basis of 40 Hz intervals across the audio spectrum. 7. A system for conditioning a signal for producing and locating, using two transducers located in free space, an auditory sensory illusion of an apparent origin for a least one selected sound at a predetermined localized position within the three-dimensional space containing a listener from an electrical signal corresponding to the selected sound, said system comprising: first and second channel means both receiving the electrical signal, said first and second channel means including respective first and second sound processor means each for altering the amplitude and shifting the phase angle of the respective electrical signals on a frequency dependent basis in accordance with empirically derived transfer functions for successive discrete frequency Intervals across the audio spectrum to produce respective modified signals 25 therefrom, wherein the amplitude alteration differential and phase angle shift differential occurring between the two channels are respective predetermined values for each frequency interval of the audio spectrum, said sound processor means shifting the phase angle such that each successive phase angle shift is different and independent of a preceding phase angle shift relative to zero degrees, and said first and second channels being maintained separate and apart before fed to the two transducers. 8. A system as in claim 7 further including storage means connected to said sound processor means for storing said modified signals in a medium capable of regenerating said stored signals at a subsequent selected time. 9. A system as in claim 8, wherein the frequency dependent basis M/0614y u c I -38- on which the first and second sound processor means operate is made up of Hz intervals across the audio spectrum. A method for producing and locating an apparent origin of a selected sound from an electrical signal corresponding to the selected sound in a predetermined and localized position anywhere within the three-dmnensional space containing a listener, said method being substantially as described with reference to the accompanying drawings. 11. A system for conditioning a signal for producing and locating, using two transducers located in free space, an auditory sensory illusion of an apparent origin for a least one selected sound at a predetermined localized position within the three-dimensional space containing a listener from an electrical signal corresponding to the selected sound, said system being substantially as described with reference to the accompanying drawings. Ii DATED this SEVENTH day of JANUARY 1992 Q Sound Ltd :i Patent Attorneys for the Applicant 'Z SPRUSON FERGUSON ,P LL1,V amg/0614y >12 t 1
AU41000/89A 1988-09-02 1989-09-01 Sound imaging method and apparatus Ceased AU621655B2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US07/239,981 US5046097A (en) 1988-09-02 1988-09-02 Sound imaging process
US239981 1988-09-02
US39898889A true 1989-08-28 1989-08-28
US398988 1989-08-28

Publications (2)

Publication Number Publication Date
AU4100089A AU4100089A (en) 1990-03-08
AU621655B2 true AU621655B2 (en) 1992-03-19

Family

ID=26933039

Family Applications (1)

Application Number Title Priority Date Filing Date
AU41000/89A Ceased AU621655B2 (en) 1988-09-02 1989-09-01 Sound imaging method and apparatus

Country Status (18)

Country Link
EP (1) EP0357402B1 (en)
JP (1) JP3205808B2 (en)
KR (1) KR930002147B1 (en)
AR (1) AR245858A1 (en)
AT (1) AT123369T (en)
AU (1) AU621655B2 (en)
BG (1) BG60225B2 (en)
CA (1) CA1329911C (en)
DE (1) DE68922885T2 (en)
DK (1) DK433789A (en)
ES (1) ES2075053T3 (en)
FI (1) FI894143A (en)
HU (1) HUT59523A (en)
IL (1) IL91464A (en)
NO (1) NO175229C (en)
NZ (1) NZ230517A (en)
PL (1) PL163716B1 (en)
RU (1) RU2092979C1 (en)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU625530B2 (en) * 1989-12-07 1992-07-16 Q Sound Ltd Sound imaging system for a video game
KR100228688B1 (en) * 1991-01-08 1999-11-01 쥬더 에드 에이. Decoder for variable-number of channel presentation of multi-dimensional sound fields
JPH05145743A (en) * 1991-11-21 1993-06-11 Ricoh Co Ltd Image producing device
EP0563929B1 (en) * 1992-04-03 1998-12-30 Yamaha Corporation Sound-image position control apparatus
US6490359B1 (en) * 1992-04-27 2002-12-03 David A. Gibson Method and apparatus for using visual images to mix sound
WO1994001981A2 (en) * 1992-07-06 1994-01-20 Adaptive Audio Limited Adaptive audio systems and sound reproduction systems
JP2870562B2 (en) * 1992-11-30 1999-03-17 日本ビクター株式会社 Method of sound image localization control
AU4037693A (en) * 1993-04-20 1994-11-08 Sixgraph Technologies Ltd Interactive sound placement system and process
US5436975A (en) * 1994-02-02 1995-07-25 Qsound Ltd. Apparatus for cross fading out of the head sound locations
US5596644A (en) * 1994-10-27 1997-01-21 Aureal Semiconductor Inc. Method and apparatus for efficient presentation of high-quality three-dimensional audio
US5850453A (en) 1995-07-28 1998-12-15 Srs Labs, Inc. Acoustic correction apparatus
RU2106075C1 (en) * 1996-03-25 1998-02-27 Владимир Анатольевич Ефремов Spatial sound playback system
US5970152A (en) * 1996-04-30 1999-10-19 Srs Labs, Inc. Audio enhancement system for use in a surround sound environment
KR100370413B1 (en) * 1996-06-30 2003-04-10 삼성전자 주식회사 Method and apparatus for converting the number of channels when multi-channel audio data is reproduced
JPH10108300A (en) * 1996-09-27 1998-04-24 Yamaha Corp Sound field reproduction device
US5912976A (en) 1996-11-07 1999-06-15 Srs Labs, Inc. Multi-channel audio enhancement system for use in recording and playback and methods for providing same
US6281749B1 (en) 1997-06-17 2001-08-28 Srs Labs, Inc. Sound enhancement system
US6016473A (en) * 1998-04-07 2000-01-18 Dolby; Ray M. Low bit-rate spatial coding method and system
GB2343347B (en) * 1998-06-20 2002-12-31 Central Research Lab Ltd A method of synthesising an audio signal
JP3781902B2 (en) 1998-07-01 2006-06-07 株式会社リコー Sound image localization control device and sound image localization control method
GB2342024B (en) * 1998-09-23 2004-01-14 Sony Uk Ltd Audio processing
US7031474B1 (en) 1999-10-04 2006-04-18 Srs Labs, Inc. Acoustic correction apparatus
US7277767B2 (en) 1999-12-10 2007-10-02 Srs Labs, Inc. System and method for enhanced streaming audio
GB2370176A (en) * 2000-08-10 2002-06-19 James Gregory Stanier A simple microphone unit for the vertical localisation and enhancement of live sounds
JP4602204B2 (en) 2005-08-31 2010-12-22 ソニー株式会社 Audio signal processing apparatus and audio signal processing method
JP4637725B2 (en) 2005-11-11 2011-02-23 ソニー株式会社 Audio signal processing apparatus, audio signal processing method, and program
JP4894386B2 (en) 2006-07-21 2012-03-14 ソニー株式会社 Audio signal processing apparatus, audio signal processing method, and audio signal processing program
JP4835298B2 (en) 2006-07-21 2011-12-14 ソニー株式会社 Audio signal processing apparatus, audio signal processing method and program
US8050434B1 (en) 2006-12-21 2011-11-01 Srs Labs, Inc. Multi-channel audio enhancement system
US8908873B2 (en) 2007-03-21 2014-12-09 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for conversion between multi-channel audio formats
US9015051B2 (en) 2007-03-21 2015-04-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Reconstruction of audio channels with direction parameters indicating direction of origin
US8290167B2 (en) 2007-03-21 2012-10-16 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for conversion between multi-channel audio formats
EP2124486A1 (en) * 2008-05-13 2009-11-25 Clemens Par Angle-dependent operating device or method for generating a pseudo-stereophonic audio signal
JP5499513B2 (en) * 2009-04-21 2014-05-21 ソニー株式会社 Sound processing apparatus, sound image localization processing method, and sound image localization processing program
CN103329571B (en) 2011-01-04 2016-08-10 Dts有限责任公司 Immersion audio presentation systems
US9823892B2 (en) 2011-08-26 2017-11-21 Dts Llc Audio adjustment system
WO2013108147A1 (en) * 2012-01-17 2013-07-25 Koninklijke Philips N.V. Audio source position estimation
WO2017211448A1 (en) 2016-06-06 2017-12-14 Valenzuela Holding Gmbh Method for generating a two-channel signal from a single-channel signal of a sound source

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU5830686A (en) * 1985-06-07 1986-12-11 Dynavector Systems Ltd. Frequency-dependant delay stereo reproduction
AU1792988A (en) * 1987-05-11 1988-12-06 Jampolsky, David L. Hearing aid for asymmetric hearing perception
AU597089B2 (en) * 1986-09-22 1990-05-24 Harman International Industries Incorporated Automotive sound system

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4152542A (en) * 1971-10-06 1979-05-01 Cooper Duane P Multichannel matrix logic and encoding systems
US4308424A (en) * 1980-04-14 1981-12-29 Bice Jr Robert G Simulated stereo from a monaural source sound reproduction system
NL8303945A (en) * 1983-11-17 1985-06-17 Philips Nv DEVICE FOR REALIZING A PSEUDO STEREO SIGNAL.

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU5830686A (en) * 1985-06-07 1986-12-11 Dynavector Systems Ltd. Frequency-dependant delay stereo reproduction
AU597089B2 (en) * 1986-09-22 1990-05-24 Harman International Industries Incorporated Automotive sound system
AU1792988A (en) * 1987-05-11 1988-12-06 Jampolsky, David L. Hearing aid for asymmetric hearing perception

Also Published As

Publication number Publication date
NZ230517A (en) 1992-10-28
NO175229C (en) 1994-09-14
CA1329911C (en) 1994-05-31
HUT59523A (en) 1992-05-28
DE68922885T2 (en) 1995-10-12
PL163716B1 (en) 1994-04-29
AT123369T (en) 1995-06-15
RU2092979C1 (en) 1997-10-10
AR245858A1 (en) 1994-02-28
IL91464A (en) 1994-11-28
JPH02298200A (en) 1990-12-10
DE68922885D1 (en) 1995-07-06
DK433789A (en) 1990-03-03
AU4100089A (en) 1990-03-08
EP0357402A2 (en) 1990-03-07
FI894143A (en) 1990-03-03
JP3205808B2 (en) 2001-09-04
ES2075053T3 (en) 1995-10-01
BG60225B2 (en) 1993-12-30
NO175229B (en) 1994-06-06
KR900005841A (en) 1990-04-14
EP0357402A3 (en) 1991-10-02
IL91464D0 (en) 1990-04-29
NO893522D0 (en) 1989-09-01
EP0357402B1 (en) 1995-05-31
NO893522L (en) 1990-03-05
KR930002147B1 (en) 1993-03-26
FI894143A0 (en) 1989-09-01
DK433789D0 (en) 1989-09-01

Similar Documents

Publication Publication Date Title
AU621655B2 (en) Sound imaging method and apparatus
US5105462A (en) Sound imaging method and apparatus
US5208860A (en) Sound imaging method and apparatus
US5046097A (en) Sound imaging process
Snow Basic principles of stereophonic sound
US5438623A (en) Multi-channel spatialization system for audio signals
CA2162567C (en) Stereophonic reproduction method and apparatus
US3665105A (en) Method and apparatus for simulating location and movement of sound
US8688249B2 (en) Processing audio input signals
US8638946B1 (en) Method and apparatus for creating spatialized sound
Farina et al. Ambiophonic principles for the recording and reproduction of surround sound for music
JPH1146400A (en) Sound image localization device
US20030169886A1 (en) Method and apparatus for encoding mixed surround sound into a single stereo pair
Mori et al. Precision sound-image-localization technique utilizing multitrack tape masters
US6445798B1 (en) Method of generating three-dimensional sound
Gerzon Dummy head recording
Eargle On the Processing of Two-and Three-Channel Program Material for Four-Channel Playback
Bartlett et al. An improved Stereo Microphone array using boundary technology: theoretical aspects
KR100284457B1 (en) Sound processing method that can record in three dimensions
JP3409364B2 (en) Sound image localization control device
Pompetzki Binaural recording and reproduction for documentation and evaluation
Reilly et al. Category: Technical Papers
McGrath et al. Creation, manipulation and playback of soundfield
Takahashi et al. Precision Sound Image Localization Technique Utilizing Multi-Track Tape Masters
GB2334867A (en) Spatial localisation of sound