WO2010146346A1 - Audio auditioning device - Google Patents
Audio auditioning device Download PDFInfo
- Publication number
- WO2010146346A1 WO2010146346A1 PCT/GB2010/001165 GB2010001165W WO2010146346A1 WO 2010146346 A1 WO2010146346 A1 WO 2010146346A1 GB 2010001165 W GB2010001165 W GB 2010001165W WO 2010146346 A1 WO2010146346 A1 WO 2010146346A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sound
- digital signal
- signal processor
- effect
- audio
- Prior art date
Links
- 230000000694 effects Effects 0.000 claims abstract description 51
- 230000005236 sound signal Effects 0.000 claims abstract description 21
- 238000012545 processing Methods 0.000 claims description 7
- 238000000034 method Methods 0.000 abstract description 13
- 238000004519 manufacturing process Methods 0.000 abstract description 3
- 210000003128 head Anatomy 0.000 description 12
- 238000005259 measurement Methods 0.000 description 11
- 238000001228 spectrum Methods 0.000 description 7
- 238000004088 simulation Methods 0.000 description 5
- 230000003595 spectral effect Effects 0.000 description 3
- 238000012937 correction Methods 0.000 description 2
- 230000001934 delay Effects 0.000 description 2
- 210000000883 ear external Anatomy 0.000 description 2
- 210000005069 ears Anatomy 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 238000007493 shaping process Methods 0.000 description 2
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000002301 combined effect Effects 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000001010 compromised effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 210000000613 ear canal Anatomy 0.000 description 1
- 238000013178 mathematical model Methods 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 210000005010 torso Anatomy 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
- G10H1/02—Means for controlling the tone frequencies, e.g. attack or decay; Means for producing special musical effects, e.g. vibratos or glissandos
- G10H1/04—Means for controlling the tone frequencies, e.g. attack or decay; Means for producing special musical effects, e.g. vibratos or glissandos by additional modulation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10K—SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
- G10K15/00—Acoustics not otherwise provided for
- G10K15/08—Arrangements for producing a reverberation or echo sound
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
Definitions
- the present invention relates to an audio processing device.
- the various environments include (without limitation) home stereo, home multi channel cinema, large cinema, concert hall, car interiors, and radio receivers.
- the quality control of the listening experience of a particular piece of music is managed by employing a professional mix engineer, under the instructions of a music producer.
- the engineer balances and equalises the music, and may add effects such as reverberation and echo, in a process known as "Mixing", in which the source music is balanced and equalised within a known environment, such as a professional recording studio, in order to create a sound track with adjusted tonal qualities.
- the aim is to achieve the desired sound of the music, known as the "Mix”.
- the finished "Mix” is then auditioned within different environments, to see whether it retains the necessary tonal qualities.
- This auditioning step allows the music producer to experience the qualitative effect of the various environments upon the sound of the "Mix” and thus make any necessary adjustments to the original "Mix” to compensate for those effects and ensure that the "Mix” has an acceptable sound quality across the range of environments for which it is intended.
- the overall object of this process is to produce a single "Mix" of the music (or other recording) that can be reproduced within all the anticipated environments to an acceptable level of quality, as determined by the music producer.
- the present invention therefore provides an audio auditioning device comprising a sound input, a sound output, a digital signal processor, and a library of stored digital signal processor effects, wherein the digital signal processor is adapted to apply a chosen effect from the library to a sound signal provided to the device via the sound input and deliver this to the output.
- the library includes a plurality of digital signal processor effects representing the effect on a sound signal of reproduction in different environments, and the digital signal processor is adapted to apply the chosen effect in real time.
- Each effect will (generally) be a combination of a loudspeaker model, a room model and a head model. Each effect can thereby replicate one auditioning environment of the plurality of auditioning environments that can be or need to be tried.
- the present invention can be used to audition that mix in a range of environments whilst still working from the same computing device and listening via the same headphones.
- the effects can include a home stereo, a home multi channel cinema, a large cinema, a concert hall, a car interior, and a radio receiver, or the like.
- Each effect is preferably a combination of a loudspeaker model and a room model, to give a combined effect of listening to a specific type of loudspeaker and a specific room environment. This also permits the loudspeakers and the rooms to be interchanged, giving a wider range of possible audition parameters.
- Each effect preferably further includes a human head model so that the final audio signal as heard through headphones accurately mimics the sound heard by a human listener in the relevant environment.
- the models can be derived mathematically, or from measured impulse responses. Mathematical derivation is generally preferred as this furnishes accurate information more easily than a recording, and permits post-hoc customisation of the room. Measurement of impulse responses can also be used, however. This involves sending a known brief signal into the environment concerned and observing the resulting sound pattern. A candidate loudspeaker can be tested this way in an anechoic chamber or in a chamber whose parameters are known (and which can therefore be subtracted), to obtain the characteristics of the loudspeaker. A room can then be tested using a known loudspeaker in order to obtain the characteristics of the room.
- the digital signal processor preferably applies the effect to the sound signal via both convolution reverberation and Schroeder reverberation. As discussed later, this allows a fast and accurate response with minimal computing overhead.
- the apparatus may comprise a pair of headphones connectable to the sound output of the audio auditioning device, with each of the digital signal processor effects comprising a combination of an environment-specific effect and an effect corresponding to the headphones.
- Each of the digital signal processor effects may also comprise an effect corresponding to a human head model.
- the audio auditioning device can be combined with a computing device which includes a stored sound signal, mixing software adapted to adjust the mix of the stored sound signal, and a sound output connected to the sound input of the audio processing device.
- the computing device is preferably adapted to retain a sound file for processing by the mixing software.
- the mixing software is preferably adapted to adjust audio parameters of the sound file and save a new version of the sound file to the computing device.
- the audio auditioning device can be used to monitor live sound.
- live sound For example, there are a number of historical spaces (often used for classical music recording) where the recording engineer necessarily shares the room with the artists, and so cannot use loudspeakers to balance the live sound.
- Figure 1 shows the functional elements of the invention and how they interact
- Figure 2 shows the physical arrangement of the device and associated items.
- This audio tool has two unique applications
- the tool can reproduce the sound of any sound reproduction system within any space.
- the model works via a combination of four principal components. Three are used to build the simulation: a loudspeaker measurement database, a room model, and a human head model. The fourth is the run-time algorithm, which runs on a DSP and applies the simulation to audio in real time, as shown in figure 1.
- the loudspeaker measurements are obtained by sampling each loudspeaker in a standard room at two distances and in thirteen directions.
- a measurement stimulus is chosen so that non-linear distortion from the loudspeaker is reduced during sampling, as this would corrupt the measurement.
- Acoustic reflections from the (known) measurement room are computed out, so what remains is the anechoic, direction-dependent characteristics of each loudspeaker.
- frontal responses from both loudspeakers are taken so that any disparities between the two loudspeakers can be included accurately in the model.
- a short pilot tone is added to the beginning of the stimulus to allow for synchronisation, so that processing and acoustic transmission delays can be eliminated.
- non-linear distortion effects can also be modelled, based on the size of the loudspeaker.
- the room model is a mathematical model of a rectangular room or other environment. Included in it are the positions of the loudspeaker and listener, the acoustic characteristics of each surface, and simple objects within the room. What results is a complete set of reflections describing the reverberation of the room, its diffusive properties, the angles of emergence and incidence, and the spectral shaping that affects each reflection.
- a human head model is employed. This is a database which uses equalisation, distance correction, interpolation, and retiming techniques as set out below. This characterises the manner in which sound incident from any direction around a listener is changed by the outer ears, the acoustic shadowing of the listener's head, and the relative distances between the ears.
- the spectral shaping of the signal obtained here is therefore somewhat different to the one required when replaying the signal through headphones - the signal would be shaped twice, were the impulses not equalised to account for this.
- the impulse response database was recorded with the reference loudspeaker at 1.4 metres from the dummy head. This produces angular distortion, because when a loudspeaker is placed at such a close distance, the wavefront reaches each ear at an angle of approximately three degrees owing to the head's physical width. This disparity is audible, so we find the true angle of incidence of each stimulus using trigonometry, and correct for it in further processing.
- the co-ordinates are transformed from the standard polar system in which they were recorded (azimuth and elevation) into a more psychoacoustically useful system (cone angle and cone elevation: the 'cone angle 1 refers to a conical locus around the aural axis in which interaural timing and level differences are almost identical). Transforming the incident angles into this domain groups cues that are psychoacoustically similar. This aids weighting during the subsequent interpolation process, and the curve fitting of interaural time differences applied in the next step.
- v. In order to increase the spatial resolution of the data set, we use weighted interpolation based on the conical domain, and a time difference for each position derived using our polynomial curves.
- the 720 measurements in the database are interpolated to form 8010 measurements, to match the sensitivity of the human auditory system.
- a combination of the average spectrum of the input data (step iv) and the frontal spectrum of the interpolated data is used to equalise the entire data set. This produces the best compromise between linearity of perceived frequency response (furnished by frontal spectrum equalisation), and perceived realism (furnished by average spectrum equalisation).
- the loudspeaker can thus be positioned arbitrarily in a virtual environment, and a set of impulse responses generated which closely approximate how a listener would experience the sound in a real environment.
- a run-time algorithm running on the device then applies these impulse responses to a stream of audio.
- the algorithm is a hybrid of two existing practices: convolution reverberation and Schroeder reverberation.
- Convolution reverberation accurately reproduces the direct sound and the precise reflection patterns of the first 60ms of reverberant sound in the simulation. This is responsible for making the room acoustics and distances in the simulation sound convincing.
- the Schroeder reverberation covers later reflections, and is adjusted to the room model to match its spectral shape, decay time, reflection density, and interaural correlation, so that the transition between the two models is seamless. This overcomes the challenge of producing a very accurate simulation with a short processing delay on an inexpensive processor.
- FIG. 2 shows the physical arrangement of devices.
- a computing device 10 such as a laptop, personal computer, or the like holds a sound file that requires mixing.
- the computing device is also provided with suitable mixing software that allows a user to vary the parameters of the mix and output the mixed sound signal via an audio output 12. This is delivered via a cable 14 to the sound auditioning device 16, and the user can listen to its output via headphones 18 connected to an audio output 20 provided on the device 16.
- the user can propose various draft mixes and audition them live via the controlled environment that is provided by the headphones 18.
- Different environments can be auditioned by adjusting the selected effect in the device 16, and the effect of this can be heard in real time.
- the mix can be adjusted accordingly using the computing device 10 so that a suitable balance is achieved between the needs of different environments, as required by the artist.
- the sound file can be saved by the computing device 10 for use elsewhere.
- the saved sound file will not contain effects derived from the device 16.
- the variations in mix parameters imposed by software on the computing device 10 affect the sound file saved on that computing device, and the DSP effects added to the sound signal are applied to the sound signal after it has been reproduced by the computing device 10 but before it is heard by the user via the headphones 18.
- the effects therefore form part of the auditioning process but not the mixing process.
- the DSP device 16 could be integrated into the computing device 10 or into software on that device.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Stereophonic System (AREA)
Abstract
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/379,907 US20120101609A1 (en) | 2009-06-16 | 2010-06-15 | Audio Auditioning Device |
AU2010261538A AU2010261538A1 (en) | 2009-06-16 | 2010-06-15 | Audio auditioning device |
EP10747469A EP2443845A1 (en) | 2009-06-16 | 2010-06-15 | Audio auditioning device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0910315.1 | 2009-06-16 | ||
GB0910315A GB2471089A (en) | 2009-06-16 | 2009-06-16 | Audio processing device using a library of virtual environment effects |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2010146346A1 true WO2010146346A1 (en) | 2010-12-23 |
Family
ID=40940860
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/GB2010/001165 WO2010146346A1 (en) | 2009-06-16 | 2010-06-15 | Audio auditioning device |
Country Status (5)
Country | Link |
---|---|
US (1) | US20120101609A1 (en) |
EP (1) | EP2443845A1 (en) |
AU (1) | AU2010261538A1 (en) |
GB (1) | GB2471089A (en) |
WO (1) | WO2010146346A1 (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8644520B2 (en) * | 2010-10-14 | 2014-02-04 | Lockheed Martin Corporation | Morphing of aural impulse response signatures to obtain intermediate aural impulse response signals |
FR2976759B1 (en) * | 2011-06-16 | 2013-08-09 | Jean Luc Haurais | METHOD OF PROCESSING AUDIO SIGNAL FOR IMPROVED RESTITUTION |
CN104349266A (en) * | 2013-08-07 | 2015-02-11 | 钟志杰 | Indoor digital high-definition cinema and digital music karaoke compatible system |
CN104835506B (en) * | 2014-02-10 | 2019-12-03 | 腾讯科技(深圳)有限公司 | The method and apparatus for obtaining the wet sound of reverberation |
US10679407B2 (en) | 2014-06-27 | 2020-06-09 | The University Of North Carolina At Chapel Hill | Methods, systems, and computer readable media for modeling interactive diffuse reflections and higher-order diffraction in virtual environment scenes |
US9977644B2 (en) * | 2014-07-29 | 2018-05-22 | The University Of North Carolina At Chapel Hill | Methods, systems, and computer readable media for conducting interactive sound propagation and rendering for a plurality of sound sources in a virtual environment scene |
US10248744B2 (en) | 2017-02-16 | 2019-04-02 | The University Of North Carolina At Chapel Hill | Methods, systems, and computer readable media for acoustic classification and optimization for multi-modal rendering of real-world scenes |
US12089032B1 (en) | 2020-07-31 | 2024-09-10 | Apple Inc. | Estimating room acoustic material properties |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020133327A1 (en) * | 1998-03-31 | 2002-09-19 | Mcgrath David Stanley | Acoustic response simulation system |
US20030033214A1 (en) * | 2001-06-27 | 2003-02-13 | John Mikkelsen | Media delivery platform |
US20070074621A1 (en) * | 2005-10-01 | 2007-04-05 | Samsung Electronics Co., Ltd. | Method and apparatus to generate spatial sound |
US20070168062A1 (en) * | 2006-01-17 | 2007-07-19 | Sigmatel, Inc. | Computer audio system and method |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5912976A (en) * | 1996-11-07 | 1999-06-15 | Srs Labs, Inc. | Multi-channel audio enhancement system for use in recording and playback and methods for providing same |
WO1999014983A1 (en) * | 1997-09-16 | 1999-03-25 | Lake Dsp Pty. Limited | Utilisation of filtering effects in stereo headphone devices to enhance spatialization of source around a listener |
AUPQ941600A0 (en) * | 2000-08-14 | 2000-09-07 | Lake Technology Limited | Audio frequency response processing sytem |
JP4059478B2 (en) * | 2002-02-28 | 2008-03-12 | パイオニア株式会社 | Sound field control method and sound field control system |
JP4062959B2 (en) * | 2002-04-26 | 2008-03-19 | ヤマハ株式会社 | Reverberation imparting device, reverberation imparting method, impulse response generating device, impulse response generating method, reverberation imparting program, impulse response generating program, and recording medium |
WO2005036523A1 (en) * | 2003-10-09 | 2005-04-21 | Teac America, Inc. | Method, apparatus, and system for synthesizing an audio performance using convolution at multiple sample rates |
US8363843B2 (en) * | 2007-03-01 | 2013-01-29 | Apple Inc. | Methods, modules, and computer-readable recording media for providing a multi-channel convolution reverb |
US7792674B2 (en) * | 2007-03-30 | 2010-09-07 | Smith Micro Software, Inc. | System and method for providing virtual spatial sound with an audio visual player |
US8150051B2 (en) * | 2007-12-12 | 2012-04-03 | Bose Corporation | System and method for sound system simulation |
-
2009
- 2009-06-16 GB GB0910315A patent/GB2471089A/en not_active Withdrawn
-
2010
- 2010-06-15 US US13/379,907 patent/US20120101609A1/en not_active Abandoned
- 2010-06-15 EP EP10747469A patent/EP2443845A1/en not_active Withdrawn
- 2010-06-15 AU AU2010261538A patent/AU2010261538A1/en not_active Abandoned
- 2010-06-15 WO PCT/GB2010/001165 patent/WO2010146346A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020133327A1 (en) * | 1998-03-31 | 2002-09-19 | Mcgrath David Stanley | Acoustic response simulation system |
US20030033214A1 (en) * | 2001-06-27 | 2003-02-13 | John Mikkelsen | Media delivery platform |
US20070074621A1 (en) * | 2005-10-01 | 2007-04-05 | Samsung Electronics Co., Ltd. | Method and apparatus to generate spatial sound |
US20070168062A1 (en) * | 2006-01-17 | 2007-07-19 | Sigmatel, Inc. | Computer audio system and method |
Also Published As
Publication number | Publication date |
---|---|
GB2471089A (en) | 2010-12-22 |
GB0910315D0 (en) | 2009-07-29 |
EP2443845A1 (en) | 2012-04-25 |
AU2010261538A1 (en) | 2012-02-02 |
US20120101609A1 (en) | 2012-04-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9918179B2 (en) | Methods and devices for reproducing surround audio signals | |
AU2001239516B2 (en) | System and method for optimization of three-dimensional audio | |
US20120101609A1 (en) | Audio Auditioning Device | |
US20150131824A1 (en) | Method for high quality efficient 3d sound reproduction | |
AU2001239516A1 (en) | System and method for optimization of three-dimensional audio | |
JP2007512740A (en) | Apparatus and method for generating a low frequency channel | |
Satongar et al. | Measurement and analysis of a spatially sampled binaural room impulse response dataset | |
CN109923877A (en) | The device and method that stereo audio signal is weighted | |
Song et al. | Psychoacoustic evaluation of multichannel reproduced sounds using binaural synthesis and spherical beamforming | |
Werner et al. | Effects of shaping of binaural room impulse responses on localization | |
US11388540B2 (en) | Method for acoustically rendering the size of a sound source | |
Pelzer et al. | 3D reproduction of room auralizations by combining intensity panning, crosstalk cancellation and Ambisonics | |
Sporer et al. | Adjustment of the direct-to-reverberant-energy-ratio to reach externalization within a binaural synthesis system | |
Grimm et al. | Implementation and perceptual evaluation of a simulation method for coupled rooms in higher order ambisonics | |
Griesinger | Accurate reproduction of binaural recordings through individual headphone equalization and time domain crosstalk cancellation | |
GB2366975A (en) | A method of audio signal processing for a loudspeaker located close to an ear | |
Wendt et al. | How binaural room impulse responses influence the externalization of speech | |
Kuhlen et al. | A true spatial sound system for CAVE-like displays using four loudspeakers | |
Benjamin et al. | The effect of head diffraction on stereo localization in the mid-frequency range | |
Hohnerlein | Beamforming-based Acoustic Crosstalk Cancelation for Spatial Audio Presentation | |
Fontana et al. | Evaluation of binaural rendering quality for professional audio. | |
Satongar | Simulation and analysis of spatial audio reproduction and listening area effects | |
Hiekkanen | Paikkariippumaton menetelmä kaiuttimien vertailuun | |
Williams | Comparing the Level of Envelopment and Sound Source Location Accuracy Between Different Versions of a Sound Design Piece Using Individual and Non-Individual Head Related Impulse Responses | |
Lerke | A real time compensation system for |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10747469 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13379907 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010747469 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010261538 Country of ref document: AU |
|
ENP | Entry into the national phase |
Ref document number: 2010261538 Country of ref document: AU Date of ref document: 20100615 Kind code of ref document: A |