WO2010146346A1 - Audio auditioning device - Google Patents

Audio auditioning device Download PDF

Info

Publication number
WO2010146346A1
WO2010146346A1 PCT/GB2010/001165 GB2010001165W WO2010146346A1 WO 2010146346 A1 WO2010146346 A1 WO 2010146346A1 GB 2010001165 W GB2010001165 W GB 2010001165W WO 2010146346 A1 WO2010146346 A1 WO 2010146346A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound
digital signal
signal processor
effect
audio
Prior art date
Application number
PCT/GB2010/001165
Other languages
French (fr)
Inventor
Ben Supper
Mathew Derbyshire
Robert Jenkins
Original Assignee
Focusrite Audio Engineering Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Focusrite Audio Engineering Ltd filed Critical Focusrite Audio Engineering Ltd
Priority to US13/379,907 priority Critical patent/US20120101609A1/en
Priority to AU2010261538A priority patent/AU2010261538A1/en
Priority to EP10747469A priority patent/EP2443845A1/en
Publication of WO2010146346A1 publication Critical patent/WO2010146346A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • G10H1/02Means for controlling the tone frequencies, e.g. attack or decay; Means for producing special musical effects, e.g. vibratos or glissandos
    • G10H1/04Means for controlling the tone frequencies, e.g. attack or decay; Means for producing special musical effects, e.g. vibratos or glissandos by additional modulation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K15/00Acoustics not otherwise provided for
    • G10K15/08Arrangements for producing a reverberation or echo sound
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution

Definitions

  • the present invention relates to an audio processing device.
  • the various environments include (without limitation) home stereo, home multi channel cinema, large cinema, concert hall, car interiors, and radio receivers.
  • the quality control of the listening experience of a particular piece of music is managed by employing a professional mix engineer, under the instructions of a music producer.
  • the engineer balances and equalises the music, and may add effects such as reverberation and echo, in a process known as "Mixing", in which the source music is balanced and equalised within a known environment, such as a professional recording studio, in order to create a sound track with adjusted tonal qualities.
  • the aim is to achieve the desired sound of the music, known as the "Mix”.
  • the finished "Mix” is then auditioned within different environments, to see whether it retains the necessary tonal qualities.
  • This auditioning step allows the music producer to experience the qualitative effect of the various environments upon the sound of the "Mix” and thus make any necessary adjustments to the original "Mix” to compensate for those effects and ensure that the "Mix” has an acceptable sound quality across the range of environments for which it is intended.
  • the overall object of this process is to produce a single "Mix" of the music (or other recording) that can be reproduced within all the anticipated environments to an acceptable level of quality, as determined by the music producer.
  • the present invention therefore provides an audio auditioning device comprising a sound input, a sound output, a digital signal processor, and a library of stored digital signal processor effects, wherein the digital signal processor is adapted to apply a chosen effect from the library to a sound signal provided to the device via the sound input and deliver this to the output.
  • the library includes a plurality of digital signal processor effects representing the effect on a sound signal of reproduction in different environments, and the digital signal processor is adapted to apply the chosen effect in real time.
  • Each effect will (generally) be a combination of a loudspeaker model, a room model and a head model. Each effect can thereby replicate one auditioning environment of the plurality of auditioning environments that can be or need to be tried.
  • the present invention can be used to audition that mix in a range of environments whilst still working from the same computing device and listening via the same headphones.
  • the effects can include a home stereo, a home multi channel cinema, a large cinema, a concert hall, a car interior, and a radio receiver, or the like.
  • Each effect is preferably a combination of a loudspeaker model and a room model, to give a combined effect of listening to a specific type of loudspeaker and a specific room environment. This also permits the loudspeakers and the rooms to be interchanged, giving a wider range of possible audition parameters.
  • Each effect preferably further includes a human head model so that the final audio signal as heard through headphones accurately mimics the sound heard by a human listener in the relevant environment.
  • the models can be derived mathematically, or from measured impulse responses. Mathematical derivation is generally preferred as this furnishes accurate information more easily than a recording, and permits post-hoc customisation of the room. Measurement of impulse responses can also be used, however. This involves sending a known brief signal into the environment concerned and observing the resulting sound pattern. A candidate loudspeaker can be tested this way in an anechoic chamber or in a chamber whose parameters are known (and which can therefore be subtracted), to obtain the characteristics of the loudspeaker. A room can then be tested using a known loudspeaker in order to obtain the characteristics of the room.
  • the digital signal processor preferably applies the effect to the sound signal via both convolution reverberation and Schroeder reverberation. As discussed later, this allows a fast and accurate response with minimal computing overhead.
  • the apparatus may comprise a pair of headphones connectable to the sound output of the audio auditioning device, with each of the digital signal processor effects comprising a combination of an environment-specific effect and an effect corresponding to the headphones.
  • Each of the digital signal processor effects may also comprise an effect corresponding to a human head model.
  • the audio auditioning device can be combined with a computing device which includes a stored sound signal, mixing software adapted to adjust the mix of the stored sound signal, and a sound output connected to the sound input of the audio processing device.
  • the computing device is preferably adapted to retain a sound file for processing by the mixing software.
  • the mixing software is preferably adapted to adjust audio parameters of the sound file and save a new version of the sound file to the computing device.
  • the audio auditioning device can be used to monitor live sound.
  • live sound For example, there are a number of historical spaces (often used for classical music recording) where the recording engineer necessarily shares the room with the artists, and so cannot use loudspeakers to balance the live sound.
  • Figure 1 shows the functional elements of the invention and how they interact
  • Figure 2 shows the physical arrangement of the device and associated items.
  • This audio tool has two unique applications
  • the tool can reproduce the sound of any sound reproduction system within any space.
  • the model works via a combination of four principal components. Three are used to build the simulation: a loudspeaker measurement database, a room model, and a human head model. The fourth is the run-time algorithm, which runs on a DSP and applies the simulation to audio in real time, as shown in figure 1.
  • the loudspeaker measurements are obtained by sampling each loudspeaker in a standard room at two distances and in thirteen directions.
  • a measurement stimulus is chosen so that non-linear distortion from the loudspeaker is reduced during sampling, as this would corrupt the measurement.
  • Acoustic reflections from the (known) measurement room are computed out, so what remains is the anechoic, direction-dependent characteristics of each loudspeaker.
  • frontal responses from both loudspeakers are taken so that any disparities between the two loudspeakers can be included accurately in the model.
  • a short pilot tone is added to the beginning of the stimulus to allow for synchronisation, so that processing and acoustic transmission delays can be eliminated.
  • non-linear distortion effects can also be modelled, based on the size of the loudspeaker.
  • the room model is a mathematical model of a rectangular room or other environment. Included in it are the positions of the loudspeaker and listener, the acoustic characteristics of each surface, and simple objects within the room. What results is a complete set of reflections describing the reverberation of the room, its diffusive properties, the angles of emergence and incidence, and the spectral shaping that affects each reflection.
  • a human head model is employed. This is a database which uses equalisation, distance correction, interpolation, and retiming techniques as set out below. This characterises the manner in which sound incident from any direction around a listener is changed by the outer ears, the acoustic shadowing of the listener's head, and the relative distances between the ears.
  • the spectral shaping of the signal obtained here is therefore somewhat different to the one required when replaying the signal through headphones - the signal would be shaped twice, were the impulses not equalised to account for this.
  • the impulse response database was recorded with the reference loudspeaker at 1.4 metres from the dummy head. This produces angular distortion, because when a loudspeaker is placed at such a close distance, the wavefront reaches each ear at an angle of approximately three degrees owing to the head's physical width. This disparity is audible, so we find the true angle of incidence of each stimulus using trigonometry, and correct for it in further processing.
  • the co-ordinates are transformed from the standard polar system in which they were recorded (azimuth and elevation) into a more psychoacoustically useful system (cone angle and cone elevation: the 'cone angle 1 refers to a conical locus around the aural axis in which interaural timing and level differences are almost identical). Transforming the incident angles into this domain groups cues that are psychoacoustically similar. This aids weighting during the subsequent interpolation process, and the curve fitting of interaural time differences applied in the next step.
  • v. In order to increase the spatial resolution of the data set, we use weighted interpolation based on the conical domain, and a time difference for each position derived using our polynomial curves.
  • the 720 measurements in the database are interpolated to form 8010 measurements, to match the sensitivity of the human auditory system.
  • a combination of the average spectrum of the input data (step iv) and the frontal spectrum of the interpolated data is used to equalise the entire data set. This produces the best compromise between linearity of perceived frequency response (furnished by frontal spectrum equalisation), and perceived realism (furnished by average spectrum equalisation).
  • the loudspeaker can thus be positioned arbitrarily in a virtual environment, and a set of impulse responses generated which closely approximate how a listener would experience the sound in a real environment.
  • a run-time algorithm running on the device then applies these impulse responses to a stream of audio.
  • the algorithm is a hybrid of two existing practices: convolution reverberation and Schroeder reverberation.
  • Convolution reverberation accurately reproduces the direct sound and the precise reflection patterns of the first 60ms of reverberant sound in the simulation. This is responsible for making the room acoustics and distances in the simulation sound convincing.
  • the Schroeder reverberation covers later reflections, and is adjusted to the room model to match its spectral shape, decay time, reflection density, and interaural correlation, so that the transition between the two models is seamless. This overcomes the challenge of producing a very accurate simulation with a short processing delay on an inexpensive processor.
  • FIG. 2 shows the physical arrangement of devices.
  • a computing device 10 such as a laptop, personal computer, or the like holds a sound file that requires mixing.
  • the computing device is also provided with suitable mixing software that allows a user to vary the parameters of the mix and output the mixed sound signal via an audio output 12. This is delivered via a cable 14 to the sound auditioning device 16, and the user can listen to its output via headphones 18 connected to an audio output 20 provided on the device 16.
  • the user can propose various draft mixes and audition them live via the controlled environment that is provided by the headphones 18.
  • Different environments can be auditioned by adjusting the selected effect in the device 16, and the effect of this can be heard in real time.
  • the mix can be adjusted accordingly using the computing device 10 so that a suitable balance is achieved between the needs of different environments, as required by the artist.
  • the sound file can be saved by the computing device 10 for use elsewhere.
  • the saved sound file will not contain effects derived from the device 16.
  • the variations in mix parameters imposed by software on the computing device 10 affect the sound file saved on that computing device, and the DSP effects added to the sound signal are applied to the sound signal after it has been reproduced by the computing device 10 but before it is heard by the user via the headphones 18.
  • the effects therefore form part of the auditioning process but not the mixing process.
  • the DSP device 16 could be integrated into the computing device 10 or into software on that device.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Stereophonic System (AREA)

Abstract

Accurate "Mixing" of a sound signal has hitherto required a recording studio environment. Currently, both professional music producers facing budgetary limitations and amateur music makers without access to such meet a difficulty in producing music which has been correctly "Mixed" and "Auditioned". We therefore propose a "Mixing" and "Mix Audition" tool, which can use standard headphones as the method of reproducing the direct sound, together with a DSP system that can be used with a computer based music production system to simulate specific listening experiences. The present invention therefore provides an audio auditioning device comprising a sound input, a sound output, a digital signal processor, and a library of stored digital signal processor effects, wherein the digital signal processor is adapted to apply a chosen effect from the library to a sound signal provided to the device via the sound input and deliver this to the output, and the library includes a plurality of digital signal processor effects representing the effect on a sound signal of reproduction in different environments. The digital signal processor applies the chosen effect in real time. The effects can include a home stereo, a home multi channel cinema, a large cinema, a concert hall, a car interior, and a radio receiver, or the like. The audio auditioning device can be combined with a computing device which includes a stored sound signal, mixing software adapted to adjust the mix of the stored sound signal, and a sound output connected to the sound input of the audio auditioning device.

Description

Audio Auditioning Device
FIELD OF THE INVENTION
The present invention relates to an audio processing device.
BACKGROUND ART
Music is reproduced to the public in many different environments. In many (or most) of these, the quality of experience is compromised by both the listening space and by the method of reproduction of the direct sound. The various environments include (without limitation) home stereo, home multi channel cinema, large cinema, concert hall, car interiors, and radio receivers.
The quality control of the listening experience of a particular piece of music is managed by employing a professional mix engineer, under the instructions of a music producer. The engineer balances and equalises the music, and may add effects such as reverberation and echo, in a process known as "Mixing", in which the source music is balanced and equalised within a known environment, such as a professional recording studio, in order to create a sound track with adjusted tonal qualities. The aim is to achieve the desired sound of the music, known as the "Mix". The finished "Mix" is then auditioned within different environments, to see whether it retains the necessary tonal qualities. This auditioning step allows the music producer to experience the qualitative effect of the various environments upon the sound of the "Mix" and thus make any necessary adjustments to the original "Mix" to compensate for those effects and ensure that the "Mix" has an acceptable sound quality across the range of environments for which it is intended.
The overall object of this process is to produce a single "Mix" of the music (or other recording) that can be reproduced within all the anticipated environments to an acceptable level of quality, as determined by the music producer.
SUMMARY OF THE INVENTION
The introduction of computer-based music production systems and the free distribution of digital music has eroded the financial value of musical content severely, thus creating both problems for existing traditional music producers and also opportunities for new low cost music producers.
As a result, it is no longer economically viable for many professional music producers to use the traditional method of "Mixing", i.e. within a recording studio environment, to create content and to fully audition the quality of musical content. Conversely, it is now easier for amateur music makers to make musical content using only a computer laptop and suitable music production software. However, such amateur music is often unmixed, or at least un-auditioned, for obvious reasons of cost and practicality.
In this new paradigm, particularly the absence of a professional recording studio environment for mixing, both professional music producers and amateur music makers meet a difficulty in producing music which has been correctly "Mixed" and "Auditioned" in order to provide adequate control of the sound quality.
We therefore propose a "Mixing" and "Mix Audition" tool, which can use standard headphones as the method of reproducing the direct sound, together with a DSP system that can be used with a computer based music production system to simulate specific listening experiences and thereby replicate the auditioning process.
The present invention therefore provides an audio auditioning device comprising a sound input, a sound output, a digital signal processor, and a library of stored digital signal processor effects, wherein the digital signal processor is adapted to apply a chosen effect from the library to a sound signal provided to the device via the sound input and deliver this to the output. The library includes a plurality of digital signal processor effects representing the effect on a sound signal of reproduction in different environments, and the digital signal processor is adapted to apply the chosen effect in real time.
Each effect will (generally) be a combination of a loudspeaker model, a room model and a head model. Each effect can thereby replicate one auditioning environment of the plurality of auditioning environments that can be or need to be tried. Thus, after a proposed mix has been created by the user, the present invention can be used to audition that mix in a range of environments whilst still working from the same computing device and listening via the same headphones.
The effects can include a home stereo, a home multi channel cinema, a large cinema, a concert hall, a car interior, and a radio receiver, or the like.
Each effect is preferably a combination of a loudspeaker model and a room model, to give a combined effect of listening to a specific type of loudspeaker and a specific room environment. This also permits the loudspeakers and the rooms to be interchanged, giving a wider range of possible audition parameters. Each effect preferably further includes a human head model so that the final audio signal as heard through headphones accurately mimics the sound heard by a human listener in the relevant environment.
The models can be derived mathematically, or from measured impulse responses. Mathematical derivation is generally preferred as this furnishes accurate information more easily than a recording, and permits post-hoc customisation of the room. Measurement of impulse responses can also be used, however. This involves sending a known brief signal into the environment concerned and observing the resulting sound pattern. A candidate loudspeaker can be tested this way in an anechoic chamber or in a chamber whose parameters are known (and which can therefore be subtracted), to obtain the characteristics of the loudspeaker. A room can then be tested using a known loudspeaker in order to obtain the characteristics of the room.
The digital signal processor preferably applies the effect to the sound signal via both convolution reverberation and Schroeder reverberation. As discussed later, this allows a fast and accurate response with minimal computing overhead.
The apparatus may comprise a pair of headphones connectable to the sound output of the audio auditioning device, with each of the digital signal processor effects comprising a combination of an environment-specific effect and an effect corresponding to the headphones. Each of the digital signal processor effects may also comprise an effect corresponding to a human head model.
The audio auditioning device can be combined with a computing device which includes a stored sound signal, mixing software adapted to adjust the mix of the stored sound signal, and a sound output connected to the sound input of the audio processing device.
The computing device is preferably adapted to retain a sound file for processing by the mixing software. The mixing software is preferably adapted to adjust audio parameters of the sound file and save a new version of the sound file to the computing device.
Alternatively, the audio auditioning device can be used to monitor live sound. For example, there are a number of historical spaces (often used for classical music recording) where the recording engineer necessarily shares the room with the artists, and so cannot use loudspeakers to balance the live sound. BRIEF DESCRIPTION OF THE DRAWINGS
An embodiment of the present invention will now be described by way of example, with reference to the accompanying figures in which;
Figure 1 shows the functional elements of the invention and how they interact, and
Figure 2 shows the physical arrangement of the device and associated items.
DETAILED DESCRIPTION OF THE EMBODIMENTS
This audio tool has two unique applications
1. A customisable and (potentially) mobile "Mixing" environment.
2. A method of auditioning the "Mix" in different environments.
Our solution creates an accurate environment within which any listening experience can be simulated. The variables of spatial dimensions, the listener's head position within the space and the specific sound reproduction system can be modified to accurately model the different environments.
For those music producers who are either on the move, mixing outside of a studio environment, or do not have a studio of any kind they can reproduce the sound of their own studio or the combination of any other recording studio room and specific studio monitors.
For those music producers who do not have the facilities or budgets to audition musical content in many different environments the tool can reproduce the sound of any sound reproduction system within any space.
The model works via a combination of four principal components. Three are used to build the simulation: a loudspeaker measurement database, a room model, and a human head model. The fourth is the run-time algorithm, which runs on a DSP and applies the simulation to audio in real time, as shown in figure 1.
The loudspeaker measurements are obtained by sampling each loudspeaker in a standard room at two distances and in thirteen directions. A measurement stimulus is chosen so that non-linear distortion from the loudspeaker is reduced during sampling, as this would corrupt the measurement. Acoustic reflections from the (known) measurement room are computed out, so what remains is the anechoic, direction-dependent characteristics of each loudspeaker. When a stereo pair of loudspeakers is available, frontal responses from both loudspeakers are taken so that any disparities between the two loudspeakers can be included accurately in the model.
The impulse for these measurements is generated in the frequency domain, giving rise to a flat, continuous spectrum. By dividing this spectrum into twelve sections and boosting the lower stimuli in inverse proportion to frequencies, a partitioned stimulus can be derived that:
i. Can exploit the dynamic range of the loudspeaker without driving it to its distortion limit at high frequencies;
ii. Spreads the signal in time, reducing the influence of noise from the room and the measuring microphone;
iii. Presents only a small portion of the frequency response at any time, so that the loudspeaker does not warm up causing power compression, while intermodulation distortion caused by the Doppler effect is drastically reduced;
iv. After equalisation to counteract the lower-frequency boosting, will mathematically sum to an impulse response.
A short pilot tone is added to the beginning of the stimulus to allow for synchronisation, so that processing and acoustic transmission delays can be eliminated. If desired, non-linear distortion effects can also be modelled, based on the size of the loudspeaker. The room model is a mathematical model of a rectangular room or other environment. Included in it are the positions of the loudspeaker and listener, the acoustic characteristics of each surface, and simple objects within the room. What results is a complete set of reflections describing the reverberation of the room, its diffusive properties, the angles of emergence and incidence, and the spectral shaping that affects each reflection.
To combine the loudspeaker and room models into something that a listener will be able to hear, a human head model is employed. This is a database which uses equalisation, distance correction, interpolation, and retiming techniques as set out below. This characterises the manner in which sound incident from any direction around a listener is changed by the outer ears, the acoustic shadowing of the listener's head, and the relative distances between the ears.
In relation to the head-related impulse responses, great care is needed as a result of two aspects of the human hearing system. First, sensitivity to inter- aural delays is exquisite. Listeners can hear disparities of 10 microseconds of arrival between the left and right ears, and perceive these as shifts in the image position. Second, to get accurate measurements of the effect of the head, torso, and outer ears on incident sound waves, the measurement microphones must be placed within 'ear canals' of a dummy head.
The spectral shaping of the signal obtained here is therefore somewhat different to the one required when replaying the signal through headphones - the signal would be shaped twice, were the impulses not equalised to account for this.
The method of equalisation and correction is described in stages below.
i. The impulse response database was recorded with the reference loudspeaker at 1.4 metres from the dummy head. This produces angular distortion, because when a loudspeaker is placed at such a close distance, the wavefront reaches each ear at an angle of approximately three degrees owing to the head's physical width. This disparity is audible, so we find the true angle of incidence of each stimulus using trigonometry, and correct for it in further processing.
ii. The co-ordinates are transformed from the standard polar system in which they were recorded (azimuth and elevation) into a more psychoacoustically useful system (cone angle and cone elevation: the 'cone angle1 refers to a conical locus around the aural axis in which interaural timing and level differences are almost identical). Transforming the incident angles into this domain groups cues that are psychoacoustically similar. This aids weighting during the subsequent interpolation process, and the curve fitting of interaural time differences applied in the next step.
iii. We reduce each impulse response to minimum phase, and extract the time difference. The time differences are modelled using a peculiar combination of polynomial curves, so that an appropriate time difference can determined and applied at each point in our output data set.
iv. The average spectrum of the input data set is determined for subsequent equalisation.
v. In order to increase the spatial resolution of the data set, we use weighted interpolation based on the conical domain, and a time difference for each position derived using our polynomial curves. The 720 measurements in the database are interpolated to form 8010 measurements, to match the sensitivity of the human auditory system. i. A combination of the average spectrum of the input data (step iv) and the frontal spectrum of the interpolated data is used to equalise the entire data set. This produces the best compromise between linearity of perceived frequency response (furnished by frontal spectrum equalisation), and perceived realism (furnished by average spectrum equalisation).
The loudspeaker can thus be positioned arbitrarily in a virtual environment, and a set of impulse responses generated which closely approximate how a listener would experience the sound in a real environment.
A run-time algorithm running on the device then applies these impulse responses to a stream of audio. The algorithm is a hybrid of two existing practices: convolution reverberation and Schroeder reverberation. Convolution reverberation accurately reproduces the direct sound and the precise reflection patterns of the first 60ms of reverberant sound in the simulation. This is responsible for making the room acoustics and distances in the simulation sound convincing. The Schroeder reverberation covers later reflections, and is adjusted to the room model to match its spectral shape, decay time, reflection density, and interaural correlation, so that the transition between the two models is seamless. This overcomes the challenge of producing a very accurate simulation with a short processing delay on an inexpensive processor.
Figure 2 shows the physical arrangement of devices. A computing device 10 such as a laptop, personal computer, or the like holds a sound file that requires mixing. The computing device is also provided with suitable mixing software that allows a user to vary the parameters of the mix and output the mixed sound signal via an audio output 12. This is delivered via a cable 14 to the sound auditioning device 16, and the user can listen to its output via headphones 18 connected to an audio output 20 provided on the device 16.
Thus, the user can propose various draft mixes and audition them live via the controlled environment that is provided by the headphones 18. Different environments can be auditioned by adjusting the selected effect in the device 16, and the effect of this can be heard in real time. The mix can be adjusted accordingly using the computing device 10 so that a suitable balance is achieved between the needs of different environments, as required by the artist. Once a set of mix parameters has been chosen, the sound file can be saved by the computing device 10 for use elsewhere.
It should be noted that the saved sound file will not contain effects derived from the device 16. The variations in mix parameters imposed by software on the computing device 10 affect the sound file saved on that computing device, and the DSP effects added to the sound signal are applied to the sound signal after it has been reproduced by the computing device 10 but before it is heard by the user via the headphones 18. The effects therefore form part of the auditioning process but not the mixing process.
In a further development, the DSP device 16 could be integrated into the computing device 10 or into software on that device.
It will of course be understood that many variations may be made to the above-described embodiment without departing from the scope of the present invention.

Claims

1. The combination of a computing device and an audio auditioning device; the audio auditioning device comprising: a sound input, a sound output, a digital signal processor, and a library of stored digital signal processor effects; wherein the digital signal processor is adapted to apply a chosen effect from the library to a sound signal provided to the device via the sound input and deliver this to the output, the library includes a plurality of digital signal processor effects representing the effect on a sound signal of reproduction in different environments; the computing device including a stored sound signal, mixing software adapted to adjust the mix of the stored sound signal, and a sound output connected to the sound input of the audio auditioning device.
2. The combination according to claim 1 in which the computing device is adapted to retain a sound file for processing by the mixing software.
3. The combination according to claim 2 in which the mixing software is adapted to adjust audio parameters of the sound file and save a new version of the sound file to the computing device.
4. An audio auditioning device comprising: a sounά input, a sound output, a digital signal processor, and a library of stored digital signal processor effects; wherein the digital signal processor is adapted to apply a chosen effect from the library to a sound signal provided to the device via the sound input and deliver this to the output, characterised in that the library includes a plurality of digital signal processor effects representing the effect on a sound signal of reproduction in different environments, and the digital signal processor is adapted to apply the chosen effect in real time.
5. Apparatus according to any one of the preceding claims, further comprising a pair of headphones connectable to the sound output of the audio auditioning device, wherein each of the digital signal processor effects comprises a combination of an environment-specific effect and an effect corresponding to the headphones.
6. Apparatus according to claim 5, wherein each of the digital signal processor effects further comprises an effect corresponding to a human head model.
7. Apparatus according to any one of the preceding claims, wherein the effect is selected from the group consisting of a home stereo, a home multi channel cinema, a large cinema, a concert hall, a car interior, and a radio receiver.
8. Apparatus according to any one of the preceding claims, in which each effect is a combination of a loudspeaker model and a room model.
9. Apparatus according to claim 8 in which each effect further includes a human head model.
10. Apparatus according to claim 8 or claim 9 in which the models are derived from impulse responses.
11. Apparatus according to any one of the preceding claims, in which the digital signal processor applies the effect to the sound signal via both convolution reverberation and Schroeder reverberation.
12. An audio auditioning device substantially as herein described with reference to and/or as illustrated in the accompanying figures.
PCT/GB2010/001165 2009-06-16 2010-06-15 Audio auditioning device WO2010146346A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US13/379,907 US20120101609A1 (en) 2009-06-16 2010-06-15 Audio Auditioning Device
AU2010261538A AU2010261538A1 (en) 2009-06-16 2010-06-15 Audio auditioning device
EP10747469A EP2443845A1 (en) 2009-06-16 2010-06-15 Audio auditioning device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB0910315.1 2009-06-16
GB0910315A GB2471089A (en) 2009-06-16 2009-06-16 Audio processing device using a library of virtual environment effects

Publications (1)

Publication Number Publication Date
WO2010146346A1 true WO2010146346A1 (en) 2010-12-23

Family

ID=40940860

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2010/001165 WO2010146346A1 (en) 2009-06-16 2010-06-15 Audio auditioning device

Country Status (5)

Country Link
US (1) US20120101609A1 (en)
EP (1) EP2443845A1 (en)
AU (1) AU2010261538A1 (en)
GB (1) GB2471089A (en)
WO (1) WO2010146346A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8644520B2 (en) * 2010-10-14 2014-02-04 Lockheed Martin Corporation Morphing of aural impulse response signatures to obtain intermediate aural impulse response signals
FR2976759B1 (en) * 2011-06-16 2013-08-09 Jean Luc Haurais METHOD OF PROCESSING AUDIO SIGNAL FOR IMPROVED RESTITUTION
CN104349266A (en) * 2013-08-07 2015-02-11 钟志杰 Indoor digital high-definition cinema and digital music karaoke compatible system
CN104835506B (en) * 2014-02-10 2019-12-03 腾讯科技(深圳)有限公司 The method and apparatus for obtaining the wet sound of reverberation
US10679407B2 (en) 2014-06-27 2020-06-09 The University Of North Carolina At Chapel Hill Methods, systems, and computer readable media for modeling interactive diffuse reflections and higher-order diffraction in virtual environment scenes
US9977644B2 (en) * 2014-07-29 2018-05-22 The University Of North Carolina At Chapel Hill Methods, systems, and computer readable media for conducting interactive sound propagation and rendering for a plurality of sound sources in a virtual environment scene
US10248744B2 (en) 2017-02-16 2019-04-02 The University Of North Carolina At Chapel Hill Methods, systems, and computer readable media for acoustic classification and optimization for multi-modal rendering of real-world scenes
US12089032B1 (en) 2020-07-31 2024-09-10 Apple Inc. Estimating room acoustic material properties

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020133327A1 (en) * 1998-03-31 2002-09-19 Mcgrath David Stanley Acoustic response simulation system
US20030033214A1 (en) * 2001-06-27 2003-02-13 John Mikkelsen Media delivery platform
US20070074621A1 (en) * 2005-10-01 2007-04-05 Samsung Electronics Co., Ltd. Method and apparatus to generate spatial sound
US20070168062A1 (en) * 2006-01-17 2007-07-19 Sigmatel, Inc. Computer audio system and method

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5912976A (en) * 1996-11-07 1999-06-15 Srs Labs, Inc. Multi-channel audio enhancement system for use in recording and playback and methods for providing same
WO1999014983A1 (en) * 1997-09-16 1999-03-25 Lake Dsp Pty. Limited Utilisation of filtering effects in stereo headphone devices to enhance spatialization of source around a listener
AUPQ941600A0 (en) * 2000-08-14 2000-09-07 Lake Technology Limited Audio frequency response processing sytem
JP4059478B2 (en) * 2002-02-28 2008-03-12 パイオニア株式会社 Sound field control method and sound field control system
JP4062959B2 (en) * 2002-04-26 2008-03-19 ヤマハ株式会社 Reverberation imparting device, reverberation imparting method, impulse response generating device, impulse response generating method, reverberation imparting program, impulse response generating program, and recording medium
WO2005036523A1 (en) * 2003-10-09 2005-04-21 Teac America, Inc. Method, apparatus, and system for synthesizing an audio performance using convolution at multiple sample rates
US8363843B2 (en) * 2007-03-01 2013-01-29 Apple Inc. Methods, modules, and computer-readable recording media for providing a multi-channel convolution reverb
US7792674B2 (en) * 2007-03-30 2010-09-07 Smith Micro Software, Inc. System and method for providing virtual spatial sound with an audio visual player
US8150051B2 (en) * 2007-12-12 2012-04-03 Bose Corporation System and method for sound system simulation

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020133327A1 (en) * 1998-03-31 2002-09-19 Mcgrath David Stanley Acoustic response simulation system
US20030033214A1 (en) * 2001-06-27 2003-02-13 John Mikkelsen Media delivery platform
US20070074621A1 (en) * 2005-10-01 2007-04-05 Samsung Electronics Co., Ltd. Method and apparatus to generate spatial sound
US20070168062A1 (en) * 2006-01-17 2007-07-19 Sigmatel, Inc. Computer audio system and method

Also Published As

Publication number Publication date
GB2471089A (en) 2010-12-22
GB0910315D0 (en) 2009-07-29
EP2443845A1 (en) 2012-04-25
AU2010261538A1 (en) 2012-02-02
US20120101609A1 (en) 2012-04-26

Similar Documents

Publication Publication Date Title
US9918179B2 (en) Methods and devices for reproducing surround audio signals
AU2001239516B2 (en) System and method for optimization of three-dimensional audio
US20120101609A1 (en) Audio Auditioning Device
US20150131824A1 (en) Method for high quality efficient 3d sound reproduction
AU2001239516A1 (en) System and method for optimization of three-dimensional audio
JP2007512740A (en) Apparatus and method for generating a low frequency channel
Satongar et al. Measurement and analysis of a spatially sampled binaural room impulse response dataset
CN109923877A (en) The device and method that stereo audio signal is weighted
Song et al. Psychoacoustic evaluation of multichannel reproduced sounds using binaural synthesis and spherical beamforming
Werner et al. Effects of shaping of binaural room impulse responses on localization
US11388540B2 (en) Method for acoustically rendering the size of a sound source
Pelzer et al. 3D reproduction of room auralizations by combining intensity panning, crosstalk cancellation and Ambisonics
Sporer et al. Adjustment of the direct-to-reverberant-energy-ratio to reach externalization within a binaural synthesis system
Grimm et al. Implementation and perceptual evaluation of a simulation method for coupled rooms in higher order ambisonics
Griesinger Accurate reproduction of binaural recordings through individual headphone equalization and time domain crosstalk cancellation
GB2366975A (en) A method of audio signal processing for a loudspeaker located close to an ear
Wendt et al. How binaural room impulse responses influence the externalization of speech
Kuhlen et al. A true spatial sound system for CAVE-like displays using four loudspeakers
Benjamin et al. The effect of head diffraction on stereo localization in the mid-frequency range
Hohnerlein Beamforming-based Acoustic Crosstalk Cancelation for Spatial Audio Presentation
Fontana et al. Evaluation of binaural rendering quality for professional audio.
Satongar Simulation and analysis of spatial audio reproduction and listening area effects
Hiekkanen Paikkariippumaton menetelmä kaiuttimien vertailuun
Williams Comparing the Level of Envelopment and Sound Source Location Accuracy Between Different Versions of a Sound Design Piece Using Individual and Non-Individual Head Related Impulse Responses
Lerke A real time compensation system for

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10747469

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 13379907

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2010747469

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2010261538

Country of ref document: AU

ENP Entry into the national phase

Ref document number: 2010261538

Country of ref document: AU

Date of ref document: 20100615

Kind code of ref document: A