US20170069305A1 - Method and apparatus for audio processing - Google Patents

Method and apparatus for audio processing Download PDF

Info

Publication number
US20170069305A1
US20170069305A1 US15/226,078 US201615226078A US2017069305A1 US 20170069305 A1 US20170069305 A1 US 20170069305A1 US 201615226078 A US201615226078 A US 201615226078A US 2017069305 A1 US2017069305 A1 US 2017069305A1
Authority
US
United States
Prior art keywords
phase
audio signal
analog
time
sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US15/226,078
Inventor
J. Craig Oxford
D. Michael Shields
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US15/226,078 priority Critical patent/US20170069305A1/en
Publication of US20170069305A1 publication Critical patent/US20170069305A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K15/00Acoustics not otherwise provided for
    • G10K15/08Arrangements for producing a reverberation or echo sound
    • G10K15/10Arrangements for producing a reverberation or echo sound using time-delay networks comprising electromechanical or electro-acoustic devices
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/305Electronic adaptation of stereophonic audio signals to reverberation of the listening space
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/305Electronic adaptation of stereophonic audio signals to reverberation of the listening space
    • H04S7/306For headphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R25/00Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
    • H04R25/50Customised settings for obtaining desired overall acoustical characteristics

Abstract

A method and apparatus for introducing a time-varying time delay or phase shift randomly into the individual reproduction channels of a sound recording, two in the case of binaural presentation. This emulates the temporal aspect of microphone and/or listener motion. The present invention may be applied as a unidirectional process. No preparation of the source material is required. It can be applied to any multichannel audio signal set. It can process analog or digital signals. The process may be used with headphones, loudspeakers, hearing aids or similar assistive hearing devices.

Description

  • This application is a continuation of U.S. patent application Ser. No. 14/589,341, filed Jan. 5, 2015, which is a continuation of U.S. patent application Ser. No. 14/109,223, filed Dec. 17, 2013, issued as U.S. Pat. No. 8,929,560, which is a continuation of U.S. patent application Ser. No. 12/193,036, filed Aug. 17, 2008, issued as U.S. Pat. No. 8,611,557, which claims priority to Provisional Patent Application No. 60/956,584, filed Aug. 17, 2007, entitled “Method and Process for Audio Processing,” and is entitled to those filing dates, in whole or in part, for priority. The complete disclosures, specifications, drawings and attachments of Provisional Patent Application No. 60/956,584 and U.S. patent application Ser. Nos. 12/193,036 and 14/109,223 and 14/589,341 re incorporated herein in their entireties for all purposes by specific reference.
  • FIELD OF INVENTION
  • This invention relates to a method and process of processing audio signals for the purpose of improved recognition of timbre. More particularly, this invention relates to a method and process for temporally modifying audio signals by simulation of missing reverberant cues.
  • BACKGROUND OF INVENTION
  • Timbre is generally defined as the tonal identity of a sound. It is the attribute that distinguishes a sound from other sounds of the same pitch and intensity. While the term is most commonly used in a musical connotation, timbre is important in other ways because it is a fundamental aspect of the importance of a sound in the hierarchy of threat or alarm.
  • In the presentation of music, it can be far more important to quickly identify what the sound is than where it is. This distinction is both intellectual and intuitive; intellectually, timbre is critical to being able to unravel the musical texture in order to understand it. Intuitively, timbre is a fundamental input to the limbic nervous system which is the seat of emotional response. If timbre cannot be quickly perceived, then the musical texture cannot be decoded, nor can an emotional response be elicited. Conscious effort to “understand” the sound impedes the possibility of viscerally reacting to it. The ability to viscerally react to music is an important element of therapeutic effectiveness in music therapy. Basically, improvement in timbre perception allows the conscious thought process to be bypassed.
  • When a recording is made with the microphones or the performers (or both) in motion, upon playback musical timbre can be more quickly identified. It is hypothesized that this is due to an interaction with human hearing which allows a spatial average energy spectrum to be developed by a process which is in lieu of, or possibly in addition to, the usual averaging of reflections by the human neurophysiological system.
  • This effect is particularly apparent in headphone (binaural) reproduction. Presumably this is because in normal (non-headphone) listening to either live or reproduced sound, there are small head motions of the listener constantly occurring. And with loudspeakers, even though listener's head may be able to make small movements, the source of the sound is fixed. This may enable the listener to develop the aforementioned spatial average estimate of the energy spectrum. In headphone listening, however, this mechanism is not available because there is no relative motion possible between the listener's ears and the sound source. There also are several other problems associated with binaural presentation, chief among which is the sensation that the sound image is in the middle of one's head. Also there are questions concerning the basic frequency response as it relates to diffuse-field versus direct field equalization.
  • Accordingly, what is needed is a method to process audio signals to restore or simulate this perceptual mechanism with the use of headphones or loudspeakers.
  • SUMMARY OF THE INVENTION
  • In various embodiments, the present invention introduces temporal variation in the effective path from the musician to the listener to aid in perception of timbre. Modification of the electrical or acoustical phase of a signal is the same as a time variation (i.e., phase is time). In addition, a wave propagating in a medium requires a particular amount of time to travel a particular distance; hence, time also is distance. It follows that phase is (or can be correlated to) distance.
  • In one exemplary embodiment, the present invention introduces a time-varying time delay randomly introduced into the individual reproduction channels, two in the case of binaural presentation. This emulates the temporal aspect of microphone and/or listener motion. The present invention may be applied as a unidirectional process. No preparation of the source material is required. It can be applied to any multichannel audio signal set. It can process analog or digital signals.
  • DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a diagram of a fixed phase shifter circuit in accordance with an exemplary embodiment of the present invention.
  • FIG. 2 is a diagram of a variable phase shifter circuit in accordance with an exemplary embodiment of the present invention.
  • FIG. 3 is a diagram of an analog audio processing system in accordance with an exemplary embodiment of the present invention.
  • FIG. 4 is a diagram of an analog and digital audio processing system in accordance with an exemplary embodiment of the present invention.
  • DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS
  • In one exemplary embodiment, the present invention enhances the perception of timbre, or tonal identity, by temporal processing of a recording. The recording may be a fixed-microphone recording. The recording can be analog or digital. While the enhancement of the perception of timbre may be accomplished by introducing a time-varying time delay, it may also be accomplished by suitable phase shifting.
  • A sound traveling in a medium (e.g., air) has a wavelength which is inversely proportional to its frequency. The velocity of propagation (e.g., distance/unit time) in the medium is constant, therefore a given number of degrees (e.g., phase angle) of wave movement requires an amount of time which is also inversely proportional to frequency. Thus phase and time and distance are related.
  • Whether the time delays are implemented as pure delay or as phase shifting, it is necessary to make a quantitative estimate of the amount of delay which is required. A motion of the microphones of, say, 0.2 m would be represented by a time shift of about 600 microseconds, using the formula T=r/c, where c=speed of sound=354 m/s, and r=distance in m.
  • In one embodiment, the method of the present invention introduces a random time-varying phase shift, which is free of discontinuities, independently into the channels of a stereophonic electrical signal path. For example, a time-varying phase shift is introduced independently and randomly into the two channels of a stereophonic signal path. The method is not necessarily limited to two channels. The result emulates at least one aspect of the continuous movement of the recording microphones mentioned above.
  • At middle frequencies, 1 kHz, 600 usec corresponds to 216 degrees of phase delay. An example of a fixed phase shifting circuit is illustrated in FIG. 1, where R1 through R3 are resistors and C is a capacitor 32. The circuit further comprises an operational amplifier 30. The resistance values may vary. In one exemplary embodiment, the values of R1 and R2 are equal or approximately equal. Such a circuit will produce phase shift of 0-180 degrees or 180-360 degrees depending on how it is configured. A relatively uniform delay of 600 usec requires 2160 degrees at 10 kHz, so a cascade of such phase shifters is required. Experimentally, it is not necessary to preserve a constant delay time at all frequencies. This can lead to a reduction in the number of stages required.
  • In one embodiment, the phase shifter circuit should be variable according to some external control parameter. In FIG. 2, an embodiment of a variable phase shifter circuit is shown in which an external current 40 controls the phase shift by means of a light-emitting diode 42 which impacts a light dependent resistor 44 so that the resistance varies with varying light emission from the LED. A common voltage controlling several such circuit elements in a cascade produces the required controllable phase-shifter.
  • Other higher-order (i.e. quadratic) phase shifters could be used. Even analog charge-coupled delay lines could be used with a time-varying clock.
  • In yet another embodiment, the invention comprises a goniometer, a circuit or device that changes phase continuously, i.e., not in steps. Effectively, the circuit is a phase modulator with two inputs: a modulation input and a signal input. There may be one such goniometer in each signal channel. The modulation input to each goniometer is an independent source of random noise in a control bandwidth chosen to simulate a physically possible movement of the microphones on the order of 0.1 Hz to 1 Hz.
  • FIG. 3 shows one embodiment of an analog audio processor in accordance with the present invention for a two-channel system. The analog audio signals 2, 12 are applied to two corresponding phase modulator/goniometer circuits or devices 4, 14. These goniometers may be voltage-controlled phase shifters as described above. Two random noise or number generators 6, 16 with suitable low- pass filters 8, 18 provide the random control function.
  • In a digital embodiment, the audio signal is first digitized and then passed in each channel though a delay which is phase-continuously varied according to a random law at an appropriate rate. This technique is similar to that used in direct-digital-synthesis oscillators. The signal is then reconverted to analog for presentation via headphones or loudspeakers. It should be understood that variation in the phase or time delays, the rate or law controlling such delays and the exact circuit embodiments may vary.
  • FIG. 4 shows an embodiment for a system that can process digital sound recordings and analog sound recordings. The input can be two analog signals 2, 12 which are converted to digital by digital/analog converters, or a digital input 22 which may be multiplexed. The application of pure delay is straightforward, using goniometer circuits as described above with digital random number generators 26. The delay may be smoothly varied. For example, a DDS clock with continuous phase interpolation can be used to operate a delay memory with the process at a sufficiently high rate that discontinuities will be absorbed in the output reconstruction filters. Output may be digital 29, or may be converted to analog by digital/analog converters 27 in each channel.
  • The control function is a random or pseudo-random time-varying quantity which controls the phase shifters or delay lines. The rate of variation in this embodiment should be in the range of probable motions of the listener or the microphones. Also, the rate of variation should be low enough that any phase-modulation sidebands will lie below the audio range so as to avoid the intrusion of low-frequency noise. In one exemplary embodiment, a control bandwidth of about 10 Hz is chosen. Because the bandwidth is so low, the random control function could be equally well generated by a true random noise source 6, 16, or by a random-number generator, with a suitable low- pass filter 8, 18.
  • In another embodiment, the phase/time variation should be smooth. Step discontinuities may produce audible artifacts. The range of the phase variation is adjustable. The variation should be free of patterns; that is, truly random and not cyclic.
  • Accordingly, the present invention restores the lost perceptual mechanism derived from relative motions between the source and the listener. The quickness of timbre recognition also may lead to an improvement in intelligibility of all signal types. This comports with the principles of quantitative intelligibility measures such as the Speech Transmission Index which deal with preservation of the infrasonic amplitude modulation transfer function.
  • Another area of binaural reproduction is the perception of the location of sounds in both azimuth and elevation. This is important in virtual-reality presentations and in information delivery systems, such as fighter plane cockpits. These systems usually concern themselves with stereotactic detection of head position, eye-motion tracking or other measures of directional attention in order to process audio messages in amplitude and phase to force the auditory image to be congruent with head position or visual attention.
  • The methods and processes of the present invention can be combined with these processes. For example, one way the “in the head” problem in binaural listening can be addressed is by filtering and cross-feeding the left and right signals according to generalized head-related transfer functions (HRTF). The HRTF models the propagation of sound around the head from ear-to-ear for external sound sources. This is another example of a process which is applied to replace a naturally-occurring aspect of hearing when binaural presentation is involved. The HRTF may be dynamically modified with a variable delay as described above.
  • The method and processes of the present invention also may be combined with assistive hearing devices, such as hearing aids, to improve intelligibility of what is heard through improved recognition of timbre.
  • Thus, it should be understood that the embodiments and examples described herein have been chosen and described in order to best illustrate the principles of the invention and its practical applications to thereby enable one of ordinary skill in the art to best utilize the invention in various embodiments and with various modifications as are suited for particular uses contemplated. Even though specific embodiments of this invention have been described, they are not to be taken as exhaustive. There are several variations that will be apparent to those skilled in the art.

Claims (6)

We claim:
1. A method for modifying an audio signal, comprising the steps of:
introducing a time delay into an audio signal input to produce a modified audio signal, wherein said modified audio signal emulates the temporal aspect of relative motion by a source.
2. The method of claim 1, wherein the audio signal input is analog or digital.
3. The method of claim 1, wherein there are multiple audio signals, and a separate time delay is introduced into each signal.
4. The method of claim 1, further comprising the step of outputting the modified audio signal through a sound reproduction device.
5. The method of claim 4, wherein the sound reproduction device comprises one or more of headphones, an in-ear receiver, earbud, or a hearing aid, or combinations thereof
6. The method of claim 1, wherein the modified audio signals are output to at least one loudspeaker.
US15/226,078 2007-08-17 2016-08-02 Method and apparatus for audio processing Abandoned US20170069305A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US15/226,078 US20170069305A1 (en) 2007-08-17 2016-08-02 Method and apparatus for audio processing

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US95658407P 2007-08-17 2007-08-17
US14/589,341 US9407988B2 (en) 2007-08-17 2015-01-05 Method and apparatus for audio processing
US15/226,078 US20170069305A1 (en) 2007-08-17 2016-08-02 Method and apparatus for audio processing

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US14/589,341 Continuation US9407988B2 (en) 2007-08-17 2015-01-05 Method and apparatus for audio processing

Publications (1)

Publication Number Publication Date
US20170069305A1 true US20170069305A1 (en) 2017-03-09

Family

ID=40362980

Family Applications (4)

Application Number Title Priority Date Filing Date
US12/193,036 Expired - Fee Related US8611557B2 (en) 2007-08-17 2008-08-17 Method and apparatus for audio processing
US14/109,223 Expired - Fee Related US8929560B2 (en) 2007-08-17 2013-12-17 Method and apparatus for audio processing
US14/589,341 Expired - Fee Related US9407988B2 (en) 2007-08-17 2015-01-05 Method and apparatus for audio processing
US15/226,078 Abandoned US20170069305A1 (en) 2007-08-17 2016-08-02 Method and apparatus for audio processing

Family Applications Before (3)

Application Number Title Priority Date Filing Date
US12/193,036 Expired - Fee Related US8611557B2 (en) 2007-08-17 2008-08-17 Method and apparatus for audio processing
US14/109,223 Expired - Fee Related US8929560B2 (en) 2007-08-17 2013-12-17 Method and apparatus for audio processing
US14/589,341 Expired - Fee Related US9407988B2 (en) 2007-08-17 2015-01-05 Method and apparatus for audio processing

Country Status (1)

Country Link
US (4) US8611557B2 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102011075006B3 (en) * 2011-04-29 2012-10-31 Siemens Medical Instruments Pte. Ltd. A method of operating a hearing aid with reduced comb filter perception and hearing aid with reduced comb filter perception
JP6024403B2 (en) * 2012-11-13 2016-11-16 ヤマハ株式会社 Electronic music apparatus, parameter setting method, and program for realizing the parameter setting method
US9875756B2 (en) * 2014-12-16 2018-01-23 Psyx Research, Inc. System and method for artifact masking
US10631108B2 (en) * 2016-02-08 2020-04-21 K/S Himpp Hearing augmentation systems and methods
WO2020078521A1 (en) * 2018-10-14 2020-04-23 Al Shalash Taha Kais Taha Enhance the contrast between the peaks and valleys in speech spectrum

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3881057A (en) * 1972-09-06 1975-04-29 Nippon Musical Instruments Mfg Reverberation-imparting apparatus using a bucket brigade device
US5109419A (en) * 1990-05-18 1992-04-28 Lexicon, Inc. Electroacoustic system
US6140822A (en) * 1997-05-29 2000-10-31 Williams; Thomas H. System for signal path characterization with a reference signal using stepped-frequency increments
US20070165838A1 (en) * 2006-01-13 2007-07-19 Microsoft Corporation Selective glitch detection, clock drift compensation, and anti-clipping in audio echo cancellation
US20070253559A1 (en) * 2006-04-19 2007-11-01 Christopher David Vernon Processing audio input signals

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6327366B1 (en) * 1996-05-01 2001-12-04 Phonak Ag Method for the adjustment of a hearing device, apparatus to do it and a hearing device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3881057A (en) * 1972-09-06 1975-04-29 Nippon Musical Instruments Mfg Reverberation-imparting apparatus using a bucket brigade device
US5109419A (en) * 1990-05-18 1992-04-28 Lexicon, Inc. Electroacoustic system
US6140822A (en) * 1997-05-29 2000-10-31 Williams; Thomas H. System for signal path characterization with a reference signal using stepped-frequency increments
US20070165838A1 (en) * 2006-01-13 2007-07-19 Microsoft Corporation Selective glitch detection, clock drift compensation, and anti-clipping in audio echo cancellation
US20070253559A1 (en) * 2006-04-19 2007-11-01 Christopher David Vernon Processing audio input signals

Also Published As

Publication number Publication date
US8929560B2 (en) 2015-01-06
US9407988B2 (en) 2016-08-02
US20140105409A1 (en) 2014-04-17
US20090046871A1 (en) 2009-02-19
US8611557B2 (en) 2013-12-17
US20150271597A1 (en) 2015-09-24

Similar Documents

Publication Publication Date Title
US20170069305A1 (en) Method and apparatus for audio processing
US6937737B2 (en) Multi-channel audio surround sound from front located loudspeakers
US5095507A (en) Method and apparatus for generating incoherent multiples of a monaural input signal for sound image placement
JP5611970B2 (en) Converter and method for converting audio signals
JP4926704B2 (en) Audio stereo processing method, apparatus and system
RU2498526C2 (en) Apparatus for generating multichannel audio signal
MX2013012999A (en) Apparatus and method and computer program for generating a stereo output signal for providing additional output channels.
Møller et al. On the audibility of all-pass phase in electroacoustical transfer functions
WO2018185733A1 (en) Sound spatialization method
KR20160031952A (en) Multiband ducker
JP4791613B2 (en) Audio adjustment device
Lipshitz Stereo microphone techniques: Are the purists wrong?
Klepko 5-channel microphone array with binaural-head for multichannel reproduction
San Martín et al. Influence of recording technology on the determination of binaural psychoacoustic indicators in soundscape investigations
Monroe Digital hearing
Sigismondi Personal monitor systems
US11540049B1 (en) System and method for an audio reproduction device
US20230007434A1 (en) Control apparatus, signal processing method, and speaker apparatus
US20240056735A1 (en) Stereo headphone psychoacoustic sound localization system and method for reconstructing stereo psychoacoustic sound signals using same
US10966041B2 (en) Audio triangular system based on the structure of the stereophonic panning
KR20030022332A (en) Method of generating a left modified and a right modified audio signal for a stereo system
WO1991020165A1 (en) Improved audio processing system and recordings made thereby
JP6699280B2 (en) Sound reproduction device
Santala Perception and auditory modeling of spatially complex sound scenarios
TW202236255A (en) Device and method for controlling a sound generator comprising synthetic generation of the differential signal

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION