WO2003093775A2 - Sound detection and localization system - Google Patents
Sound detection and localization system Download PDFInfo
- Publication number
- WO2003093775A2 WO2003093775A2 PCT/US2003/013685 US0313685W WO03093775A2 WO 2003093775 A2 WO2003093775 A2 WO 2003093775A2 US 0313685 W US0313685 W US 0313685W WO 03093775 A2 WO03093775 A2 WO 03093775A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sound
- sound event
- input channel
- events
- signal
- Prior art date
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 194
- 230000004807 localization Effects 0.000 title claims description 161
- 230000004308 accommodation Effects 0.000 claims abstract description 127
- 230000007246 mechanism Effects 0.000 claims abstract description 69
- 238000000034 method Methods 0.000 claims description 240
- 230000006870 function Effects 0.000 claims description 14
- 230000004044 response Effects 0.000 claims description 10
- 238000003860 storage Methods 0.000 claims description 9
- 239000003990 capacitor Substances 0.000 description 51
- 238000010187 selection method Methods 0.000 description 25
- 230000001052 transient effect Effects 0.000 description 25
- 238000010586 diagram Methods 0.000 description 24
- 238000002408 directed self-assembly Methods 0.000 description 21
- 230000000694 effects Effects 0.000 description 16
- 238000012795 verification Methods 0.000 description 16
- 238000010606 normalization Methods 0.000 description 9
- 238000012935 Averaging Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 6
- 241001481828 Glyptocephalus cynoglossus Species 0.000 description 5
- 230000003321 amplification Effects 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- 230000008859 change Effects 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 230000007704 transition Effects 0.000 description 4
- 241001547070 Eriodes Species 0.000 description 3
- 230000007423 decrease Effects 0.000 description 3
- 230000007774 longterm Effects 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 230000000996 additive effect Effects 0.000 description 2
- 210000000613 ear canal Anatomy 0.000 description 2
- 210000000883 ear external Anatomy 0.000 description 2
- 210000000959 ear middle Anatomy 0.000 description 2
- 210000005069 ears Anatomy 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 230000030808 detection of mechanical stimulus involved in sensory perception of sound Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000002360 explosive Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000002459 sustained effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
- G01S3/802—Systems for determining direction or deviation from predetermined direction
- G01S3/808—Systems for determining direction or deviation from predetermined direction using transducers spaced apart and measuring phase or time difference between signals therefrom, i.e. path-difference systems
- G01S3/8083—Systems for determining direction or deviation from predetermined direction using transducers spaced apart and measuring phase or time difference between signals therefrom, i.e. path-difference systems determining direction of source
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01V—GEOPHYSICS; GRAVITATIONAL MEASUREMENTS; DETECTING MASSES OR OBJECTS; TAGS
- G01V1/00—Seismology; Seismic or acoustic prospecting or detecting
- G01V1/001—Acoustic presence detection
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
Definitions
- the invention relates generally to detecting and localizing sound. More particularly, this invention relates to d etectmg a nd/or 1 ocalizing sounds that include sound events in a complex sound field
- One way in which the individual sounds in a sound field are classified is according to whether a sound has emanated or originated from a particular location. Sounds that can be detected as emanating or originating from a particular direction are referred to as directional sounds, while sounds detected as emanating or originating from no particular direction at all are referred to as non-directional sounds. Another way of classifying sounds is according to whether a sound is a transient or a steady-state sound. Steady-state sounds are those that have a generally constant level of power over time, such as a sustained musical note. Steady- state sounds can be either directional or non-directional sounds. Transient sounds (or
- Transients are sounds that have an initial energy spike, such as a shout or a drum hit. Transients can also be either directional or non-directional sounds.
- An example of a non- directional transient sound is speech in a reverberant space where the direct sound is blocked by an object. In this case if the reverberation time is less than one second the time characteristics of the signal are preserved, but information about its direction is lost.
- Directional transients are referred to in this application collectively as "sound events.”
- Two types of sound events are syllables and impulsive sounds.
- Syllables include phonemes and notes.
- Phonemes are transient sounds that are characteristic of phones in human speech and can be particularly useful in detecting and localizing syllables in human speech. Notes are the individual notes created by a musical instrument.
- Syllables generally have the following characteristics: a finite duration of about at least 50 ms up to about 200 ms, but typically around 150 ms; rise times of about 33 ms; generally occur no more frequently than about once every 0.2 ms to about 0.5 ms; and may have low or high volume (amplitude).
- impulsive sounds are transients of very short duration such as a drum hit or fricatives and explosives in speech.
- Impulsive sounds generally have the following characteristics: a short duration of about 5ms to about 50ms, rise times of about 1ms to aboutlOms, and high volume.
- the sound field need only be generated in one input or "input channel.”
- the sound field needs to be generated in at least two inputs or input channels.
- the archetype for sound localization is natural hearing, where the azimuth of the sound is detected primarily by the arrival time difference between the two input channels represented by the two ears.
- the azimuth of a sound source is determined primarily by the amplitude and phase relationships between the signals generated by two or more input channels.
- the directions of the source of these sounds are described in terms of an angle between each corresponding pair of channels (each a "channel pair"). If sounds are generated in only two channels, the directions of the sounds are given in terms of an angle for that channel pair, generally a left/right angle "lr.” If the sounds are generated in four channels, the directions of the sounds are given in terms of an angle for each channel pair, generally, a left/right angle lr, and a front/back angle "cs.” For example, when the sound field is generated in two channels, the average direction of the sounds in the sound field is given in terms of lr only.
- the value for lr ranges from about -45 degrees to about 45 degrees, with -45 degrees indicating that the sound field originates from the left input channel, 45 degrees indicating that the sound field o riginates from the right input channel and 0 degrees indicating that the sound field originates from a position in the middle precisely between the right and l eft input c hannels (a position often r eferred t o as "center").
- a second directional component is specified. Additionally, even if the sound field is generated in only one channel pair, a second directional component may also be specified because it is often possible to derive an additional channel pair from the one channel pair.
- One known technique for determining these angles is used in reproducing recorded sound.
- this known technique determines the intended direction of sounds by comparing the amplitudes of the signals in one input channel of a input channel pair with the signals in the corresponding input channel of the input channel pair (generally, the 1 eft w ith t he right, and the center with the surround). More specifically, this ratio o f amplitudes is used to determine what is generally referred to an "ordinary steering angle" or
- OSA for each input channel pair.
- the voltage signals in each input channel of an input channel pair are rectified and the logarithms of the rectified voltages are taken.
- a signal is produced that equals the logarithm of the ratio of the voltages in the input channel pair which, when converted back into the magnitude domain, is the ordinary steering angle.
- this determination is often made by a device called a matrix decoder.
- the filter is made slow enough not to distinguish the fluctuations characteristic of the non-directional signals, the filter is generally too slow to distinguish the fluctuations of certain sound events, particularly impulsive sounds. As a result, many sound events are not properly localized. No matter how these filters are designed, they generally work well on only one type of music but not on all. For example, the fast filter will work well on complex popular music, which is full of rapid changes, but will reflect false directional changes (steer too greatly) when a highly-reverberant classical piece is reproduced.
- sounds recorded in a given number of input channels and then reproduced over a different number of channels are stereo and surround.
- Sounds recorded for reproduction in stereo are intended to be perceived as originating only from the front.
- Sounds recorded for reproduction in surround are intended to be perceived as originating from all around, generally with one or two input channels used to reproduce sounds from the rear.
- the techniques used to record sounds intended for reproduction in stereo are generally different from those used to record sounds intended for reproduction in surround.
- sounds recorded for reproduction in surround generally need to be capable of high-quality reproduction in stereo.
- the Dolby Surround® system which records sounds for reproduction in surround, adds a negative phase to the sounds intended for reproduction from behind the listener (the rear). This negative phase is generally undetected by stereo reproduction systems and is transparent to the listener.
- the negative phase is detected by a surround reproduction system that then reproduces the associated sounds in the rear input channels.
- Properties of the human hearing mechanism have been modeled and used to create systems with these features because many of the problems encountered by known sound detection and systems are not experienced by the human hearing mechanism.
- the human hearing mechanism localizes sounds in a sound field by breaking down the sound field into individual sounds, determining the direction of the individual sounds, reassembling related s ounds into s treams (such as the melody 1 ine of a particular instrument or a line of dialogue from a particular speaker), and determining the direction of the stream based on the average d irection of the individual sounds within the stream.
- the human hearing mechanism In breaking down a sound field into individual sounds, the human hearing mechanism has the ability to distinguish transient sounds from other sounds and to distinguish sound events from non-directional transients.
- the human hearing mechanism can distinguish transient from non-transient sounds and sound events from other transient sounds even in the presence of a background signal by relying on the properties of beginning detection and accommodation.
- Beginning detection results from human neurology which is highly evolved to determine the starting points and end points of sounds and makes the human hearing mechanism particularly sensitive to the rise and fall times of sounds.
- Accommodation is the property that allows the human hearing mechanism to distinguish transients from steady-state sounds by gradually ignoring the presence of the steady state sounds.
- the human hearing mechanism relies more heavily on certain frequencies and the apparent direction of the beginnings of sounds.
- the human hearing mechanism relies most heavily on frequencies between about 500 Hz and about 4000 Hz for direction determination.
- This frequency bias is largely a result of the frequency response of the external ear (the pinnae, concha and ear canal) and by the frequency transfer function of the middle ear.
- the human hearing mechanism also relies on the direction indicated in the rise time of a sound more than the direction as indicated in other portions of the sound. This reliance is advantageous because the direction indicated in the rise time of a sound is less likely to be corrupted by sound reflections or reverberations even in a highly reflective environment.
- the human hearing mechanism After detecting the directions of the individual sounds, the human hearing mechanism reassembles related sounds into streams and separately determines the direction of each stream.
- the direction of each stream is generally the average direction of all sounds within the stream. In any case, the process of reassembling related sounds into streams and determining the direction of each stream is performed unconsciously and automatically by the human hearing mechanism.
- the properties of the human hearing mechanism have been modeled to create sound event detection methods which produce a signal indicating the occurrence of sound events in a sound field.
- These sound event detection methods use the property of beginning detection to detect the occurrence of sound events.
- the beginning detection property is modeled to identify the onsets typical of sound events and uses these onsets to generate the trigger signal.
- the sound event detection methods may use the accommodation property to improve the accuracy of the trigger signal.
- the accommodation property is modeled to reduce over time the effect any steady-states sounds may have on the sound field. These effects are reduced in the sound field to produce a differential signal. This differential signal may then be used by the beginning detection model to produce an improved trigger signal.
- the sound event detection methods may also include emphasizing the frequencies important to the human hearing mechanism.
- the properties of the human hearing mechanism have also been used to create sound event detection and localization methods for determining the location of sound events that occur in the presence of a background signal. These sound event detection and localization methods produce at least one steering angle indicating the direction of the sound event whenever a trigger signal indicates that a sound event has occurred ("differential steering angles"). Sound event detection and localization methods use differential signals to determine the differential steering angles by comparing the relative power of the sound event in each input channel of each input channel pair after any steady-state sounds have been removed. By comparing the power in each input channel of an input channel pair instead of the voltage in each input channel of an input channel pair as done in known methods, individual sound events can be identified and separately localized.
- the properties of the human hearing mechanism may also be used to determine the location of all sounds within a sound field.
- sound localization methods have been developed that localize the individual sounds in the presence of background sounds for any combination of sounds more accurately.
- the sound localization methods divide the sound field into sound events and non- sound events and localize the sound events in terms of a true differential steering angle or a true ordinary steering angle and the non-sound events in terms of a filtered ordinary steering angle. These methods essentially treat the non-sound events as a separate sound for which the direction is separately determined.
- the sound localization methods can be implemented for any number and combination of sound event types in a sound field generated in any number of input channel pairs.
- the properties of the human hearing mechanism have been modeled to create electronic circuitry that detects sound events in the presence of a background signal.
- These sound event detectors may be used for a variety of applications for which the detection of specific types of sound events is helpful.
- a sound event detector that detects syllables may be used as part of a phonic detector or as part of a speech recognition or speech coding system.
- a sound event detector that detects syllables may be used in conjunction with an audio amplification device, such as a microphone.
- Sound event detectors like the sound event detection methods, are based on models of the beginning detection p roperty and the accommodation property and generally produce a trigger signal that includes some flag or marker, such as a pulse, that indicates the occurrence of a sound event.
- the properties of the human hearing mechanism have been modeled to create electronic circuitry that detects and localizes sound events in the presence of a background signal.
- These sound event detectors and localizers like the sound event detection and localization methods, determine the direction of sound events in relation to one or more input channel pairs in terms of a differential steering angle or an ordinary steering angle and in some c ases, v erify the accuracy of the steering angles.
- O ne application of a sound event detector and localizer is as a stereo/surround detector.
- a stereo/surround detector determines whether a sound field is intended for reproduction in two input channels or in more than two input channels.
- Sound event detectors may be used for a variety of applications for which the detection of specific types of sound events is helpful.
- a sound event detector that detects syllables may be used as part of a phonic detector or as part of a speech recognition or speech coding system.
- a sound event detector that detects syllables may be used in conjunction with an audio amplification device, such as a microphone.
- Sound event detectors like the sound event detection methods, generally produce a trigger signal that includes some flag or marker, such as a pulse, that indicates the occurrence of a sound event. Also, like the sound event detection methods, the sound event detectors can be implemented to detect any number and combination of sound event types in a sound field that is generated in any number of input channels.
- the sound event detection methods, sound event detection and localization methods, sound localization methods, as well as any of the sound event detectors, sound event detectors and localizers and sound localizers, may be implemented in a sound event device and/or in computable readable software code.
- FIG. 1 is a flow chart of a sound event detection method for a single input channel and single sound event type.
- FIG. 2 is a flow chart of an accommodation method.
- FIG. 3 is a series of time-domain plots of a sample segment of a sound field before, during and after the onsets of transients have been distinguished and the transients have been separated from background sounds.
- FIG. 4 is a flow chart of a beginning detection method for a single input channel.
- FIG. 5 is a flow chart of a sound event detection method for multiple input channels.
- FIG. 6 is a flow chart of a beginning detection method for multiple input channels.
- FIG. 7 is a flow chart of a sound event detection method for a single input channel and multiple sound event types.
- FIG. 8 is a flow chart of a sound event detection method for multiple input channels and multiple sound event types.
- FIG. 9 is a flow chart of a sound event detection and localization method for a single input channel pair and a single sound event type.
- FIG. 10 is a flow chart of a sound event detection and localization method for multiple input channel pairs and a single sound event type.
- FIG. 11 is a flow chart of a sound event detection and localization method for multiple input channel pairs and a single sound event type.
- FIG. 12 is a flow chart of a sound event detection and localization method for a single input channel pair and multiple sound event types.
- FIG. 13 is a flow chart of a sound localization method.
- FIG. 14 is a flow chart of a direction selection method for a single input channel pair and a single sound event type.
- FIG. 15 is a flow chart of a direction selection method for a single channel pair and a single short-duration sound event.
- FIG. 16 is a flow chart of a direction selection method for multiple input channel pairs and a single sound event type.
- FIG. 17 is a flow chart of a differential steering angle determination method.
- FIG. 18 is a flow chart of a direction selection method for multiple input channel pairs and a single short-duration sound event.
- FIG. 1 is block diagram of a sound event device.
- FIG. 20 is a block diagram of a sound event detector for a single input channel and a single sound event type.
- FIG. 21 is a circuit diagram of an accommodation circuit for a single input channel.
- FIG. 22 is a circuit diagram of a trigger generation circuit for a single input channel.
- FIG. 23 is a circuit diagram of a sound event detector for a single input channel and a short-duration sound event.
- FIG. 24 is a block diagram of a sound event detector for multiple input channels and a single sound event type.
- FIG. 25 is a circuit diagram of an alternate trigger generation circuit for multiple input channel pairs.
- FIG. 26 is a block diagram of a sound event detector for a single input channel and multiple sound event types.
- FIG. 27 is a block diagram of a sound event detector for multiple input channels and multiple sound event types.
- FIG. 28 is a block diagram of a sound event detector and localizer for a single input channel pair and a single sound event type.
- FIG. 29 is a circuit diagram of a sound event localization circuit for a single input channel pair and a single sound event type.
- FIG. 30 is a block diagram of a sound event detector and localizer for multiple input channel pairs and a single sound event type.
- FIG. 31 is a circuit diagram of a sound event localization circuit for multiple input channel pairs and a single sound event type.
- FIG. 32 is a block diagram of a sound event detector and localizer for a single input channel pair and multiple sound event types.
- FIG. 33 is a block diagram of a sound event detector and localizer for multiple input channel pairs and multiple sound event types.
- FIG. 34 is a block diagram of a stereo/surround detector for a single sound event type.
- FIG. 35 is a block d iagram of a s tereo/surround detector for m ultiple sound event types.
- FIG. 36 is a block diagram of a sound localizer for a single input channel pair and a single sound event type.
- FIG. 37 is a circuit diagram of a sound localization circuit for a single input channel pair and a single sound event type.
- FIG. 38 is a circuit diagram of a sound localization circuit for a single input channel pair and a short-duration sound event.
- FIG. 39 is a block diagram of a sound localizer for multiple input channel pairs and a single sound event type.
- FIG. 40 is a circuit diagram of a sound localization circuit for multiple input channel pairs and a single sound event type.
- FIG. 41 is a block diagram of a sound localization circuit for multiple input channel pairs and a short-duration sound event.
- FIG. 42 is a block diagram of a sound localizer for a single input channel pair and multiple sound event types.
- FIG. 43 is a block diagram of a sound localizer for multiple input channel pairs and multiple sound event types.
- the human hearing mechanism was studied and used as a model from which to create sound detection and localization systems.
- extremely effective, high-quality methods have been created for the detection of sound events in the presence of steady-state sounds, for the detection and localization of sound events in the presence of steady-state sounds, and for the detection and localization of all sounds in a sound field.
- These methods can also be implemented in software and hardware to create programs, devices and even more complicated systems applicable to a wide range of applications.
- the human hearing mechanism In breaking down a sound field into individual sounds, the human hearing mechanism has the ability to distinguish transient sounds from other sounds and to distinguish s ound events from non-directional transients. Furthermore, the human hearing mechanism can distinguish transient from non-transient sounds and sound events from other transient sounds even in the presence of a background signal by relying on the properties of beginning detection and accommodation. Beginning detection results from human neurology which is highly evolved to determine the starting points and end points of sounds and makes the human hearing mechanism particularly sensitive to the rise and fall times of sounds. Accommodation is the property that allows the human hearing mechanism to distinguish transients from steady-state sounds by gradually ignoring the presence of the steady state sounds. The properties of beginning detection and accommodation and their effect on localization were discovered and demonstrated experimentally.
- the azimuth (as an indication of direction) of a tone can only be detected when the tone starts or stops. If the tone is continuous, it becomes very difficult to determine its direction. Furthermore, if new sounds are added to a steady state tone, the true direction of the new sounds is easy to determine. This demonstrates that some sounds have both a transient and a steady-state component. It also demonstrates that the human hearing mechanism is particularly sensitive to the transient components, which include the beginnings and ends of sounds and uses the beginnings and ends of sounds for localization (the beginning detection property). It also demonstrates that the human hearing mechanism ignores steady-state sounds over time and generally does not rely on them for localization (the accommodation property).
- the rate at which the human hearing mechanism gradually ignores the steady-state sounds is independent of the sound event type being detected. It was also found that the accommodation rate may not be constant and may adjust to acoustic conditions and to the rapidity of speech. However, a reasonable average value for the accommodation rate was found to be about 300 ms.
- the human hearing mechanism relies more heavily on certain frequencies and the apparent d irection o f the beginnings of sounds.
- the human hearing mechanism relies most heavily on frequencies between about 500 Hz and about 4000 Hz for direction determination. This frequency bias is largely a result of the frequency response of the external ear (the pinnae, concha and ear canal) and by the frequency transfer function of the middle ear.
- the human hearing mechanism also relies on the direction indicated in the rise time of a sound more than the direction as indicated in other portions of the sound. This reliance is advantageous because the direction indicated in the rise time of a sound is less likely to be corrupted by sound reflections or reverberations even in a highly reflective environment.
- the human hearing mechanism After detecting the directions of the individual sounds, the human hearing mechanism reassembles related sounds into streams and separately determines the direction of each stream.
- the direction of each stream is generally the average direction of all sounds within the stream.
- sounds are reassembled into a foreground stream and a background stream.
- the foreground stream may consist of dialog and the background stream may consist of environmental sounds being produced around the source of the dialog.
- the process of reassembling related sounds into streams and determining the direction of each stream is performed unconsciously and automatically by the human hearing mechanism.
- Sound event detection methods produce a signal (referred to in this application as a “trigger signal"), which includes a flag or marker, such as a pulse, that indicates the occurrence of a sound event.
- the sound event detection methods can be implemented to detect any number and combination of sound events in any number of input channels. In the following description, the sound event detection methods are discussed in order of increasing complexity, with each subsequent sound event detection method incorporating the steps of the prior methods, except as indicated.
- FIG. 1 An example of a method for detecting sound events in the presence of a background signal that only detects a single sound event type in a sound field that is generated in a single input channel (“sound event detection methods for a single input channel and a single sound event type"), is shown in FIG. 1 and indicated by reference number 100.
- the sound event detection method for a single input channel and a single sound event type 100 includes: emphasizing the d irectionally i mportant frequencies b y modeling frequency bias 104; separating sound events from background sounds by modeling accommodation 106; and detecting sound events by modeling beginning detection 108.
- the directionally important frequencies are emphasized by modeling the frequency bias of the human hearing mechanism 104 using frequency emphasis method.
- the frequency emphasis method includes emphasizing the frequencies between about 500 Hz and about 4 kHz in each input channel of the input channel pair to produce a filtered signal in each input channel. These frequencies are emphasized because they have the most influence on the human hearing mechanism in terms of determining direction. Separating the sound events from the background sounds by modeling accommodation 106 generally includes examining the change in input power in the sound field and is shown in more detail in FIG. 2.
- separating the sound events from the background sounds by modeling accommodation 106 includes: determining the power envelope of the input channel 201 ; determining the power in any steady-state sounds 202, and subtracting the power in any steady-state sounds from the power envelope of the input channel.
- Determining the power envelope of the input channel 201 generally includes squaring the voltage in the input channel.
- a sound field, whether produced live or from a recording, is generally represented as a voltage s ignal in the time domain.
- voltage signals do not have the additive property, which means that the magnitude of a voltage signal resulting from the combination of two voltage signals cannot be determined by simply adding the amplitudes of the two voltage signals.
- power signals do have the additive property. Therefore, by converting the voltage signal into a power signal,
- a power envelope results from which other power signals may be subtracted.
- fluctuations in the power envelope that are not characteristic of the sound event type being detected may be removed. For example, if syllables are being detected, fluctuations with rise times faster than about 33 ms will be removed from the power envelope.
- Determining the power in any steady-state signals 202 in the sound field includes determining the long-term average power of the sound field.
- the long-term average power (referred to in this application as the "accommodation signal") may be determined by integrating the power envelope over a time period equal to the rise time of the sound event type that is being detected. Once a sound event has reached its maximum value (at the end of the rise time of the sound event), the accommodation signal is gradually subtracted from the power envelope 204 at a rate equal to the accommodation rate of the human hearing mechanism (which is about 300 ms) to model the way in which the human hearing mechanism gradually ignores the effects of steady-state sounds in the presence of sound events.
- This difference signal includes positive pulses and other fluctuations. Each pulse in the difference signal indicates the occurrence of a sound event and the other fluctuations are caused by noise such as: reverberation, Gaussian noise, and other signals that may not currently be in the foreground stream.
- FIG. 3 An example of how the accommodation method affects an input signal is shown in FIG. 3.
- the time-domain plot A in FIG. 3 shows a sample segment of a sound field. This segment includes a collection of sound waves of varying frequencies that is characteristic of a syllable 300. The magnitude of this syllable is the voltage in the signal "V.” The voltage V varies with time "t” and has a duration of "D.”
- the time-domain plot B shows a pulse 302, which is segment 300 after it has been converted into a power signal (step 201 in FIG. 2). Pulse 302 has an amplitude that represents the power in the signal ("V 2 ”) and also has duration D.
- the time-domain plot C shows pulse 304, which is pulse 302 after the fluctuations with rise times that are not characteristic of a syllable have been removed.
- the pulse 304 also has an amplitude V and has a rise time "t r " where t r is equal to about 33ms.
- the time-domain plot D shows a pulse 306, which is pulse 304 after the steady-state sounds have been accommodated to by subtracting the accommodation signal.
- the rise-time of pulse 306 follows that of pulse 304. However, after the syllable has reached its maximum value (after t r ) the effects of any simultaneously-occurring steady-state signals are removed from pulse 306 at a rate that is equal to the accommodation rate of the human hearing mechanism (about 300 ms).
- Modeling beginning detection includes distinguishing the sound events from the noise present in the differential signal.
- the differential signal includes a series of rapidly-rising pulses and noise. While each pulse indicates the occu ⁇ ence of a sound event, the noise includes fluctuations that may falsely indicate the occu ⁇ ence of a sound event. Therefore, in order to detect the sound events, the sound events need to be distinguished from the noise.
- Modeling beginning detection 108 is shown in more detail in FIG. 4 and includes deemphasizing the effects of volume; emphasizing the s ound events 404; deemphasizing the noise 406; and detecting the sound events 408.
- steps 404, 406 and 408 can be simultaneously accomplished is by scaling the differential signal by the short-term average power contained in the noise component of the differential signal (the "short-term average high frequency power").
- the short-term average high frequency power is isolated and used to divide the differential signal.
- the short-term average high frequency power can be isolated by filtering the differential signal to obtain the component of the differential signal with frequencies higher than those characteristic of the sound event type being detected, and integrating the high-frequency portion over a short time period. This short time period may be different for different types of music or speech rates.
- a value of about 160 ms was experimentally determined to work well for a wide variety of inputs.
- a second effect is to de-emphasize noise 406 by de-emphasizing the fluctuations that occur more often than is characteristic of the sound event type being detected (for example, syllables generally occur or repeat once about every 200 ms).
- the fluctuations in portions of the differential signal due to noise generally occur more frequently than is typical for a sound event.
- the rapid repeat rate of these noise fluctuations increases the average high-frequency power with which that portion of the differential signal is divided. This de-emphasizes the portion of the differential signal that contains the noise.
- the third effect of normalizing the differential signal is to emphasize the sound events 404 by emphasizing the fluctuations that occur or repeat no more often than is characteristic of the sound event type being detected.
- the portions of the differential signal that contain fluctuations that occur less often than is typical for the sound event type being detected will have a l ower average high frequency p ower (as c ompared to those containing noise) w ith which that portion of the differential signal is divided. This will provide a relative emphasis to the sound events.
- De-emphasizing noise 406 may be further accomplished by removing many of the fluctuations in the differential signal due to noise. Some of the fluctuations due to noise can be identified and removed according to their rise time and other characteristics. For example, fluctuations that have rise-times n ot c haracteristic o f the s ound event type being detected are removed. For example, if the sound event type being detected is syllables, sounds with rise times faster than about 33 ms will be removed. In another example, if the sound event type being detected is impulsive sounds, sounds with rise times faster than about 3.3 ms will be removed.
- the sound events need to be detected 408 from the noise. Because at this point, most of the fluctuations in the improved differential signal due to noise have a low amplitude as compared with the fluctuations caused by the sound events, the sound events are detected 408 by determining which fluctuations have an amplitude that exceeds a threshold using a threshold detection method. In the threshold detection method, the fluctuations that do not exceed the threshold are removed or ignored to produce the trigger signal.
- a threshold detection method the fluctuations that do not exceed the threshold are removed or ignored to produce the trigger signal.
- the sound event detection methods are not perfect in that in some sound events will not be detected, and some fluctuations due to noise will result in a pulse in the trigger signal falsely indicating the occu ⁇ ence of a sound event. However, these occasional e ⁇ ors do not matter. Because the sound event detection methods are modeled after the human hearing mechanism, it produces the same types of e ⁇ ors as does the human hearing mechanism. Therefore, the result will be perceived as completely natural. [42]
- the threshold is chosen so that low amplitude fluctuations that are characteristic of noise or reverberation are not detected.
- the threshold may be a fixed value which is determined experimentally. However, more accurate results are obtained if the threshold varies as a function of the sound field.
- the threshold when the sound field includes many sound events, the threshold will generally be lower than when the sound field includes fewer sound events. This allows greater sensitivity when a greater number of sound events are present in the sound field.
- the threshold can be manually selected from among two or more values experimentally determined according to the characteristics of the sound field. For example, if the sound field is that of modern or popular music, which typically includes many sound events, a lower threshold value can be selected, and alternatively, if the sound field is that of classical music, which typically includes few sound events and may be highly reverberant, a higher threshold value can be selected.
- the threshold value can be chosen as a function of the number of sound events detected during a given time period.
- the threshold value is raised and during time periods where a greater number of sound events are detected, the threshold v alue i s 1 owered.
- any of the sound event detection methods may also include performing only the beginning detection method, which is of particular use when detecting sound events of short duration ("short-duration sound events"), such as impulsive sounds.
- Short-duration sound events such as impulsive sounds
- the sound event detection methods can be simplified to include performing only the b eginning detection method o n the input s ignal ( in d ecibels) d irectly.
- the sound event detection methods that include performing only the beginning detection method may also include removing any fluctuations with rise-times slower than those characteristic of the short-duration sound event being detected before performing the beginning detection method. For example, if impulsive sounds are being detected, fluctuations in the input signal with rise times slower than about 3ms will be removed.
- Methods for detecting a single sound event type in the presence of a background signal may also b e i mplemented when the sound field is generated i n t wo o r more input channels ("sound event detection methods for multiple input channels and a single sound event type").
- the sound event detection methods for multiple input channels and a single sound event type may include performing the sound event detection method for a single input channel and a single sound event type in parallel for each input channel to produce a trigger signal for each input channel indicating the occu ⁇ ence of whatever sound event type is being detected.
- a sound event detection method for multiple input channels and a single sound event may include detecting sound events in pairs of channels, instead of in each channel separately.
- this includes subtracting the accommodated signal in one channel from the accommodated signal in the other channel to create a "difference signal," which is then used to create the trigger signal.
- the difference signal is created by subtracting the accommodation signal of one input channel in an input channel pair from that of the other input channel in the input channel pair.
- the input channel pairs may include a left-right channel pair and or a center-su ⁇ ound input channel pair.
- the term "input channel pair" includes any combination of two input channels and the channels that can be derived from the input channels. The purpose of using more than one input channel to derive trigger signals is to provide a focus on directional signals.
- directional signals can be differentiated from non-directional signals by the phase and amplitude relationships between the input channels. For example, sound fields are often generated in two input channels (ordinary stereo). From these two input channels it is useful to derive four power envelopes and to organize the four power envelopes into two power envelope pairs. For example, if the original two input channels are designated with the conventional "left” and “right” names, the resulting power envelopes can be designated “left power” and “right power,” respectively and together make up a power envelope pair.
- the remaining power envelope pair is made up of power envelopes derived from the sum and the difference of the two input channels, the "left plus right power” and “left minus right power.” This pair is often also refe ⁇ ed to as “center power” or the “su ⁇ ound power.” Non-directional signals almost always will cause all four of these power envelopes to be equal in level. A non-directional transient signal will cause all four of them to rise at the same time. When the difference signal is used to create the trigger signal, a non-directional transient will cause no rise in the d ifference s ignal, because it i s equal in each input channel.
- a directional signal for example a sound event in the left input channel only
- a directional signal will cause a large change in the "left power" envelope, and no co ⁇ esponding increase in the "right power” envelope. Therefore, there will be a large increase in the difference signal created from the left and right accommodated signals, and this change can be used to generate triggers that preferentially distinguish directional signals from non-directional signals.
- FIG. 47 An example of a sound event detection method for multiple input channels and a single sound event 500 that includes detecting sound events in pairs of channels is shown in FIG.
- 5 includes: emphasizing the directionally important frequencies by modeling frequency bias 504; separating the sound events from the background sounds by modeling accommodation in each input channel 506; and detecting sound events in each input channel pair by modeling beginning detection in each input channel pair.
- the directionally important frequencies are emphasized 504 using a frequency emphasis method.
- the sound events are separated from the background signal in each input channel 506 using an accommodation method to produce a differential signal for each input channel.
- the sound events are detected in each input channel pair 508 using an alternate beginning detection method in parallel for each input channel pair in and the differential signals for each channel pair.
- An alternate beginning detection method is shown in more detail in FIG. 6 and may include, determining the difference signal for each input channel pair 601 ; deemphasizing the effects of volume in each input channel pair 602; emphasizing sound events in each input channel pair 604; deemphasizing noise in each input channel pair 606; and detecting sound events in each input channel pair 608. Therefore, the sound event distinction method for m ultiple input channels includes the same steps a s the sound event distinction method but performed on the difference signal of each input channel pair. [49] A difference signal is determined 601 be determining the difference between the differential signals in each input channel of an input channel pair, and rectifying this difference.
- the difference between the input channels in the input channel pair includes both positive and negative pulses representing sound events that have occu ⁇ ed in either input channel of the input channel pair.
- the pulses will be negative or positive depending upon which input channel in the input channel pair reflects the majority of the power in the particular sound event. Therefore, this difference is rectified to produce a difference signal with only positive pulses. While the difference signal still indicates sound events, as previously discussed, it does not indicate sound events common to both input channels. Therefore, the difference signal has a better signal to noise ratio then that of the differential signals of the individual input channels.
- the difference signal does not contain any signals that are equal in both input channels of the input channel pair, which helps to elimination c ertain n on-directional s ignals, s uch a s noise, which are generally equal i n a ll input channels.
- steps 602, 604 and 608 include a rectification step so that the pulses indicating the remaining transients are all positive.
- the result of step 608 is a trigger signal with positive pulses indicating the occu ⁇ ence of a sound event for every input channel pair.
- This sound event detection method may be implemented for multiple input channel pairs by performing the method in parallel for each input channel pair to create a separate trigger signal for each input channel pair.
- the sound event detection methods for multiple channels and a single sound event type may also include detecting a sound event type in multiple channel pairs (collectively, "sound event detection methods for multiple channel pairs and a single sound event type"). These methods generally include the sound event detection methods for multiple channels and a single sound event type that include an alternate beginning detection method implemented for more than one channel pair, or for a single channel pair from which four power envelopes can be derived.
- Methods for detecting one or more sound events in the presence of a background signal may also be implemented so that multiple sound event types, such as syllables and impulsive sounds, are detected in a single input channel ("sound event detection methods for a single input channel and multiple sound event types").
- sound event detection methods for a single input channel and multiple sound event types One example of a sound event detection method for a single input channel and multiple sound event types that detects both syllables and impulsive sounds is shown in FIG. 7 (although this method may be implemented to detect any number and combination of sound event types).
- FIG. 7 The example of a sound event detection method for a single input channel and multiple sound event 700 shown in FIG.
- the directionally important frequencies are emphasized 704 using a frequency bias method.
- the sound events are separated from the background signal 706 using an accommodation method to produce a differential signal.
- the accommodation method may also include removing fluctuations in the power envelope with rise times that not characteristic of syllables (those above about 33 ms).
- syllables are detected 710 using a beginning detection method for single input channel (see FIG. 4) and impulsive sounds are detected using a sound event detection method for short- duration sound events.
- detecting sound events includes deemphasizing the effects of volume, emphasizing sound events and deemphasizing noise using an automatic gain method and by removing certain fluctuations, and detecting sound events using a threshold detection method.
- the short-term high frequency power used to normalize the differential signal is the power in the components of the differential signal above 30 Hz determined over about 167 ms. Additionally, the fluctuations removed are those with a rise time of less than 33 ms.
- the sound event detection method for a single short-duration sound event includes: removing certain fluctuations; deemphasizing t he effects of volume, e mphasizing s ound e vents and deemphasizing noise using an automatic gain method and removing certain fluctuations, and detecting sound events using a threshold detection method.
- the automatic gain method uses the input signals (in decibels) instead of the differential signal.
- the fluctuations that are removed are those with rise times less than about 3 ms.
- Methods for detecting one or more sound events in the presence of a background signal may also be implemented so that multiple types of sound events may be detected for a sound field generated in multiple input channels ("sound event detection methods for multiple input channels and multiple sound events").
- the sound event detection method for multiple input channels and multiple sound events includes performing the sound event detection method for a single input channel and multiple sound event types implemented in parallel for each input channel of the sound field to produce a trigger signal for each sound event type in each input channel.
- the sound event detection method for multiple input channels and multiple sound events 800 may detect each sound event only in each input channel pair, thus producing a trigger signal for each sound event type in each input channel pair.
- FIG. 8 the example shown in FIG.
- this method 800 is used to detect two sound event types (syllables and impulsive sounds) in two input channels (a right input channel and a left input channel) to produce two trigger signals for the input channel pair, the first indicating the occu ⁇ ence of syllables and the second indicating the occu ⁇ ence of impulsive sounds.
- the method may be implemented to detect any number of sound events in any number of input channels to produce for each input channel pair a trigger signal for each sound event type.
- the sound event detection method for multiple input channels and multiple sound event types 800 generally includes: emphasizing directionally important frequencies by modeling the frequency bias in the left input channel 802; emphasizing directionally important frequencies by modeling the frequency bias in the right input channel 804; separating sound events from the background sounds by modeling accommodation in the left input channel 806; separating sound events from the background sounds by modeling accommodation in the left input channel 808; detecting impulsive sounds in the left-right input channel pair by modeling beginning detection 812; and detecting syllables in the left-right input channel pair by modeling beginning detection 814.
- Directionally important frequencies are emphasized separately in both the left and right input channels 802 and 804, respectively, using a frequency bias method.
- syllables may be detected using a beginning detection method to separately d etect syllables i n each input channel to produce a trigger signal indicating the occu ⁇ ence of syllables in each channel.
- syllables may be detected using an alternate beginning detection method to produce a single trigger signal indicating the occu ⁇ ence of syllables for the channel pair.
- a sound event detection method for a single channel and a short- duration sound event is used to produce a trigger signal indicating the occu ⁇ ence of an impulsive sound for each input channel.
- the trigger signals may be combined to create fewer trigger signals.
- the sound event detection methods for multiple channels and multiple sound event types may also include detecting more than one sound event type in multiple channel pairs (collectively, "sound event detection methods for multiple channel pairs and multiple sound event types") These methods generally include the sound event detection methods for multiple channels and multiple sound event types that include an alternate beginning detection method implemented for more than one channel pair, or for a single channel pair from which four power envelopes can be derived.
- the properties of the human hearing mechanism have also been used to create methods for determining the location of sound events that occur in the presence of a background signal (collectively "sound event detection and localization methods").
- the sound event detection methods for multiple input channels not only produce a trigger signal indicating whenever a sound event occurs, but also produce differential signals from which the direction of a sound e vent c an be determined.
- S ound event detection and localization methods use these differential signals to determine the direction of a sound event by comparing the relative power of the sound event in each input channel of each i nput c hannel pair a fter a ny s teady-state s ounds have b een removed.
- the sound event detection and localization methods can be implemented to detect and localize any number and combination of sound events types in a sound field generated in any number of input channel pairs. In the following description, the sound event detection and localization methods are discussed in order of increasing complexity, with each subsequent sound event detection and localization method incorporating the steps of the prior methods, except as indicated.
- FIG. 9 A sound e vent d etection and 1 ocalization m ethod for a s ingle i nput c hannel pair and a single sound event type is shown in FIG. 9.
- This sound event detection and localization method for a single input channel pair and a single sound event type 900 detects a single sound event type and locates the sound events relative to a single input channel pair.
- the location of a sound event as determined by this method 900 is given in terms of an angle between the input channels in the input channel pair (refe ⁇ ed to in this application as a "true differential steering angle" or "true DSA").
- the sound event detection and localization method 900 generally includes: producing a differential signal and detecting sound events in an input channel pair for each input channel in the input channel pair 904; determining the initial sound event direction 906; and isolating the true DSA 908.
- 904 and 906 define a DSA determination method for a single input channel pair and a single sound event type 901.
- a differential signal is produced for each input channel of the input channel pair and sound events are detected in the input channel pair 904 using a sound event detection method for a single input channel pair and a single sound event type or a sound event detection method for a single input channel pair and a single short-duration sound event as appropriate for the sound event type being detected.
- a trigger signal indicating the occu ⁇ ence of a single sound event type is produced for the input channel pair and a differential signal is produced for each input channel in the input channel pair.
- T he i nitial d irection d etermination m ethod determines the d irection o f t he sound event from the differential signal for each input channel in the input channel pair as these differential signals exist when the sound event occurs. More specifically, when the trigger signal indicates that a sound event is occurring, the differential signals for each input channel are converted into decibels to create a decibel differential signal for each input channel. The decibel differential signal for one input channel is then subtracted from the decibel differential signal from the other input channel to create a decibel ratio. This decibel ratio is converted i nto an equivalent angle u sing known methods to create a "differential steering angle" or "DSA" which indicates the location of the sound event between the input channels in the input channel pair.
- DSA differential steering angle
- the true direction of any syllables detected is obtained by isolating the DSA indicated in about the first 20 ms to about the first 30ms of the syllable, which i s then used (held) for about 200 m s.
- the true direction of any impulsive sounds is obtained by isolating the DSA indicated in about the first 3.3 ms to about 5 ms of the impulsive sound, which is then used for about 50 ms.
- the initial direction determination method may be implemented to obtain an even more accurate initial direction.
- This initial direction determination method detects all triggers that occur in a very short segment of the trigger signal (generally, about 3 ms) determines the DSAs associated with those pulses, determines the short-time average of these DSAs and u ses the short-term average D SA to indicate the d irection of a ll the short-term sound events detected in the very short segment for the typical duration of the short-duration sound event type being detected. For example, if impulsive sounds are being detected, the short-time average is then used for the next about 50 ms.
- the idea of using a time average over the rise-time of a sound event as a measure of its direction also works with sounds that have longer durations, but it is useful to use a longer time period for the averaging, as the rise time of these signals is frequently longer.
- Sound event detection and localization methods may also be implemented to detect and localize a single sound event type in multiple input channel pairs ("sound event localization methods for multiple input channels pairs and a single sound event type").
- a typical su ⁇ ound sound system includes at least a left-right input channel pair and a center-su ⁇ ound input channel pair.
- the true DSA for e ach c hannel pair i s determined whenever a trigger signal in any channel pair indicates that a sound event is occu ⁇ ing, however, the DSA is used to indicate the direction of the sound event only if the
- DSA is accurate. If the DSA is not accurate, the direction indicated by a true OSA is used.
- FIG. 10 An example of a sound event localization method for multiple input channel pairs and a single sound event type, is shown in FIG. 10, indicated by reference number 1000 and includes: performing a DSA determination method for a single input channel pair and a single sound event for each input channel pair (collectively a "DSA determination method for multiple input channel pairs a single sound event type") 1051; determining the ordinary steering angle for each channel pair 1003; determining whether the DSA is accurate 1010, where if the DSA is not accurate, isolating and using the true OSA for each input channel pair for the typical duration of the sound event type being detected 1014; where, if the DSA is accurate, isolating the true DSA and using the true DSA for the typical duration of the sound event type being detected 1012.
- the DSA determination method for multiple channel pairs and a single sound event type 1051 includes: detecting a sound event in each input channel pair and producing a differential signal for each channel in the channel pair 1004; and determining the initial DSA for each input channel in the channel pair 1006.
- Detecting a sound event in each input channel pair and producing a differential signal for each channel in the channel pair 1004 includes performing a sound event detection method for a single channel pair and a single sound event type i n parallel for e ach channel p air.
- D etermining the initial DSA for each input channel in the channel pair 1006 includes performing an initial direction determination method in parallel for each channel pair. Determining the OSA for each channel pair 1003 is generally accomplished using known methods.
- Determining whether the DSA is accurate includes determining, at the moment the trigger signal indicates that a sound event is occurring, whether a sound event begins in one c hannel just as a sound event ends in another channel using a "background signal check" and determining whether the values obtained for the DSA are consistent with each other a "consistency check.”
- the sound event detection and localization methods do not yield accurate results when a sound event occurs precisely at the moment that another sound event in a different direction ends.
- the true DSA is isolated and used for the typical duration of the sound event type being detected 1012. For example, if syllables are being detected the DSA is used for the typical duration of a sound event. In another example, if impulsive sounds are being detected, the DSA is used for the typical duration of an impulsive sound. If however, the DSA is not accurate, the true OSA is isolated and used for the typical duration of the sound event type being detected 1014. The true OSA is the OSA occu ⁇ ing during the rise-time of the sound event type being detected.
- the sound event detection and localization methods for multiple channel pairs and a single sound event type may also include methods that reduce the occu ⁇ ence of e ⁇ ors in the true direction of the sound events using an accommodation adjustment method.
- the accommodation adjustment method uses the consistency check to adjust the degree of accommodation in the accommodation signal.
- the accommodation signal represents the steady-state signals present in the sound field as they are gradually ignored by the human hearing mechanism.
- the rate at which these steady-state signals are ignored is refe ⁇ ed to as the degree of accommodation.
- the degree of accommodation which is defined by the accommodation signal ("AccSig").
- the accommodation adjustment method multiplies the accommodation signal AccSig by an adjustment factor Adj, so that AccSig is defined by the following equation :
- Adj is defined by the following equation:
- the sound event detection and localization methods for multiple input channels pairs and a single sound event type may use the number of e ⁇ ors to adjust the threshold as part of e ⁇ or threshold adjustment methods.
- An e ⁇ or threshold adjustment method determines the number of e ⁇ ors determined by the consistency check in a predetermined time period of about several seconds and uses this number to adjust the threshold. As the number of e ⁇ ors increases, the threshold is increased so that fewer sound events are detected. Therefore, a continuous adjustment to the threshold can be provided, which is in proportion to the number of e ⁇ ors detected by the consistency check.
- Sound event detection and localization methods may also be implemented to detect and localize multiple sound event types in a single input channel pair ("sound event localization methods for a single input channel pair and multiple sound event types").
- FIG. 11 shows a sound event detection and localization method 1100 implemented to detect and localize two types of sound events (syllables, and impulsive sounds) to produce a DSA that indicates their whenever they occur.
- this method may be implemented in a similar manner to detect and localize any number and types of sound events.
- the sound event detection and localization method for a single input channel pair and multiple sound event types 1 100 generally includes: producing a differential signal for the input channel pair 1104; detecting syllables in a input channel pair 1106; detecting impulsive sounds in the input channel pair 1107; determining the initial direction 1108; and isolating the true DSA 11 10 for the sound event detected.
- steps 1104, 1106, 1107, and 1108 define a DSA determination method for a single input channel pair and multiple sound event type 1101.
- Producing a differential signal for the channel pair 1 104 includes performing an accommodation method for each input channel.
- Detecting syllables in the input channel pair 1106 includes performing a beginning detection method for a single channel pair and a single sound event type to produce a trigger signal indicating the occu ⁇ ence of syllables in the channel pair.
- Detecting impulsive sounds in the input channel pair 1107 includes performing a s ound event detection m ethod for a s ingle input c hannel and a s ingle short- duration sound event in parallel for each channel in the channel pair (the resulting two trigger signals may be combined to form a single trigger signal indicating the occu ⁇ ence of impulsive sounds in the channel pair).
- the initial direction is then determined 1108 at the moment any trigger signal indicates that a syllable or an impulsive sound is detected.
- an initial direction determination is used to determine the initial direction of the sound event detected using the differential signals of each input channel in the input channel pair.
- the true DSA is isolated 1110 according to the true direction isolation method which uses the differential signal to produce the true DSA.
- the true DSA is then used for the typical duration of the sound event type that was detected. For example, if a syllable was detected, the DSA will be used for the typical duration of a syllable. Conversely, if an impulsive sound was detected, the DSA will be used for the typical duration of an impulsive sound.
- Sound event detection and localization methods may also be implemented to detect and localize multiple sound event types in multiple input channel pairs ("sound event localization methods for multiple input channel pairs and multiple sound event types").
- the sound event localization methods for multiple input channel pairs and multiple sound event types generally involve performing a sound e vent d etection a nd localization method for a single input channel pair and multiple sound event types in a parallel fashion for each input channel pair.
- sound event localization methods for multiple input channel pairs and multiple sound event types may include detecting a sound event, determining the DSA for the sound event, determining whether the DSA is accurate and using the OSA if the DSA is not accurate.
- FIG. 12 An example of a sound event localization methods for multiple input channel pairs and multiple sound event types that uses the OSA to indicate the direction of a sound event if the DSA is not accurate is shown in FIG. 12.
- syllables and impulsive sounds are detected in two channel pairs.
- this method may be implemented to detect any number of wound event types in any number of channels.
- the sound event localization methods for multiple input channel pairs and multiple sound event types 1200 includes: producing a differential signal for the input channel pair 1204; detecting syllables in each input channel pair 1206; detecting impulsive sounds in each input channel pair 1207; determining the initial direction for each channel pair 1208; determining whether the DSA is accurate 1210, where if the DSA is accurate, the true DSA is isolated and used for the typical duration of the sound event type detected 1212; where if the DSA is not accurate, the true OSA is isolated and used for the typical duration of the sound event type detected 1214.
- steps 1204, 1206, 1207, and 1208 define a DSA determination method for multiple input channel pairs and multiple sound event types 1201.
- Producing a differential signal for each channel pair 1204 includes performing an accommodation method in parallel for e ach input channel.
- Detecting syllables in each input channel pair 1206 includes performing a beginning detection method in parallel for a single channel pair and a single sound event type to produce a trigger signal for each channel pair indicating the occu ⁇ ence of syllables in any channel pair.
- Detecting impulsive sounds in each input channel pair 1207 includes performing a sound event detection method for a single input channel and a single short-duration sound event in parallel for each channel in each channel pair (for each channel pair the resulting two trigger signals may be combined to form a single trigger signal i ndicating the occu ⁇ ence of impulsive s ounds in the c hannel pair). Simultaneously, the OSA is determined in each channel pair 1203 using known methods.
- the initial direction is then determined in each channel 1208 at the moment any trigger signal indicates that a syllable or an impulsive sound is detected. At this moment, an initial direction determination is used to determine the initial direction of the sound event detected using the differential signals of each input channel in the input channel pair. It is then determined whether the DSAs a re a ccurate 1210 u sing a b eginning d etection m ethod and/or a consistency check. If the DSAs are found to be accurate, the true DSA is isolated 1210 according to the true direction isolation method which uses the differential signal to produce the true DSA. The true DSA is then used for the t ypical duration o f a syllable.
- the DSA Conversely, if an impulsive sound was detected, the DSA will be used for the typical duration of an impulsive sound. If however, the DSA is not found to be accurate, the true OSA is isolated 1214 to produce the true OSA. The true OSA is then isolated from the OSA during the rise time of the impulsive sound and is used for the typical duration of an impulsive sound. Additionally, the sound event detection and localization methods for multiple channel pairs and a single sound event type may further include an accommodation adjustment method and/or an e ⁇ or threshold adjustment method.
- the direction of any s ound e vents i s used to indicate the direction of the e ntire s ound field for the typical duration of the sound event type that was detected.
- the direction of the subsequent sound event will be used for all input channels as soon as the subsequent sound event occurs and will continue to be used for the typical duration of the sound event type of the subsequent sound event. In generally means that when multiple sound events overlap, the direction of the most recent sound event will be used.
- Stereo/su ⁇ ound detection methods generally determine the number of sound events intended to be reproduced behind the listener and whether this number exceeds a predetermined value.
- the stereo/su ⁇ ound detection methods include performing a sound event detection and localization method for a single input channel pair for the center-su ⁇ ound input channel pair for each sound event type being detected, and determining the number of sound events with an associated true differential steering angle of about 0 degrees to about -45 degrees (indicating a rear direction).
- the su ⁇ ound detection method also determines whether the number of sound events detected for reproduction in the rear exceeds a predetermined value in a defined time period.
- the properties of the human hearing mechanism may also be used to determine the location of all sounds within a sound field.
- B y combining the sound event detection and localization methods with known methods for determining the intended direction of sounds, methods have been developed that localize the individual sounds in the presence of background sounds more accurately for any combination of sounds ("sound localization methods").
- the sound localization methods divide the sound field into sound events and non-sound events and localize the sound events in terms of a true differential steering angle or a true ordinary steering angle and the non-sound events in terms of a filtered ordinary steering angle. These methods essentially treat the non-sound events as a separate sound for which the direction is separately determined.
- the sound localization methods can be implemented to specifically localize any number and combination of sound event types, in addition to localizing the remaining sounds, in a sound field generated in any number of input channel pairs.
- the sound localization methods are discussed in order of increasing complexity, with each subsequent sound localization method incorporating the steps of the prior methods, except as indicated.
- the filtered OSA is used to indicate the direction of the sound field unless a sound event is detected, in which case the true DSA is used.
- the true DSA is used to indicate the direction of sound events only if the DSA is determined to be accurate. In these cases, if the DSA is not found to be accurate, the true OSA is used to indicate the direction of the sound events.
- a sound localization method for detecting a single sound event type in a single input channel pair (“sound localization method for a single input channel pair and a single sound event type”) is shown in FIG.
- 13 generally includes: determining the DSA and the trigger signal 1302; determining the OSA 1306; and determining which direction to use 1304. 1302, 1304 and 1306 are generally performed simultaneously and concu ⁇ ently, for as long as a sound field is sensed.
- Determining the DSA and the trigger signal 1302 is generally accomplished by performing a DSA determination method for a single input channel pair and a single sound event that includes an alternate beginning detection method to produce a single trigger signal for the input channel pair. Even though sound events only occur whenever a pulse is present in the trigger signal, the DSA may be continuously determined. Alternatively, the trigger signal may be continuously determined and the DSA determined only when a pulse is present in the trigger signal.
- the OSA is generally determined 1306 continuously using known methods. Determining which direction to use 1304 basically includes: determining when a sound event occurs, using the true DSA for the duration typical of the sound event type detected and decaying to the OSA at the end of the sound event. However, if at any time a subsequent sound event occurs (even during a sound event), the DSA for the subsequent sound event will be used for the duration typical of the subsequent sound event type.
- a method for determining which direction to use (the "direction selection method for a single input channel pair and a single sound event type") is shown in FIG. 14 and indicated by reference number 1304. It generally includes, determining whether there is an input signal 1402; where if there is an input signal determining whether there is a sound event 1404; where if there is a sound event, generating and selecting the cu ⁇ ent true DSA for the typical duration of the sound event type being detected 1406; determining whether the typical d uration h as e nded 1 408; w here if the typical d uration h as n ot ended, d etermining whether a subsequent sound event is detected 1410, where if a subsequent sound event is not detected, repeating determining whether the typical duration has ended 1408 and whether a subsequent sound event has been detected 1410 until it is determined that either the typical duration has ended in 1408 or that a subsequent sound event has been detected in 1410; where if a subsequent sound event has been detected,
- Determining whether there is an input signal 1402 includes determining whether the input power of the sound field in all input channels ("I ") is greater than about zero. Additionally, it may also include determining whether the input power has dropped in all input channels by more than about 30 dB from a prior sound event. If it has, it can generally be assumed that the input signal has stopped. If there is an input signal, it is then determined whether there is a sound event 1404 by examining the trigger signal. Whenever the trigger signal contains a pulse or other indication of the occu ⁇ ence of a sound event, a sound event exists. Conversely, whenever the trigger signal does not contain a pulse or other indication of the occu ⁇ ence of a sound event no sound event exists.
- a cu ⁇ ent true DSA is generated from the cu ⁇ ent DSA using a true direction isolation method and selected for the typical duration of the sound event type being detected 1408.
- the typical duration of the sound event will be about 50 ms to 200 ms (preferably a bout 150 ms) and if the sound event type being d etected i s a n i mpulsive sound, than the typical duration will be about 50 ms.
- the DSA is selected and used for the typical duration regardless of when the sound event being detected actually ends.
- the input signal is monitored to determine if any subsequent sound events with an accurate DSA are detected (1408 and 1410). If during the typical duration a subsequent sound event is detected, then the cu ⁇ ent DSA will be redefined by the DSA of the subsequent sound event 1414, the redefined cu ⁇ ent DSA will be used to generate a cu ⁇ ent true DSA which will be selected for the typical duration of the sound event type detected 1406, and 1408, 1410 and 1414 will be repeated as appropriate. However, if no subsequent sound event is detected during the typical duration (1408 and 1410), then the entire method repeats, as appropriate, from 1402. [89] In contrast, if it is determined in 1404 that there is no sound event, it is then determined whether there was an immediately preceding sound event 1416.
- a filtered OSA is selected, or continues to be selected 1418 and the process repeats, as appropriate from 1402.
- the filtered OSA is the OSA with the fluctuations having rise- times faster than a specified rise-time removed. For example, fluctuations with rise-times faster than approximately 300 ms may be removed. This prevents the OSA from reflecting rapid directional changes when no sound event is detected. If however, there was an immediately preceding sound event (a sound event for which the typical duration had just ended), the OSA is selected and decayed to from the true DSA of the immediately preceding sound event 1420. The decay helps to provide a smooth transition from DSA of the immediately preceding sound event to the OSA.
- the length of decay needed to provide this smooth transition depends on the sound event type of the immediately preceding sound event. If the immediately preceding sound event was a syllable, the decay will generally be about 300 ms seconds. The process then repeats, as appropriate, from 1402 until it is determined in
- the direction selection method includes determining whether the short-duration sound event has actually ended at the end of the typical duration, and immediately selecting the OSA without any decay if it is determined that the short-duration sound event has not actually ended.
- Such a method (a "direction selection method for a single input channel pair and a short-duration sound event") is shown in FIG. 15 and indicated by reference number 1500.
- the direction selection method for a single input channel pair and a short-duration sound event generally includes, determining whether there is an input signal 1502; where if there is an input signal, determining whether there is a short-duration sound event 1504; where if there is a short-duration sound event, generating and selecting the true DSA for the typical duration of the short-duration sound event detected 1506; determining whether the typical duration has ended 1508; where if the typical duration has not ended, determining whether a subsequent short-duration sound event is detected 1510, where if a subsequent short-duration sound event is not detected, repeating determining whether the typical duration has ended 1508 and whether a subsequent short- duration sound event has been detected until it is determined that either the typical duration has ended in 1508 or that a subsequent short-duration sound event has been detected in step 1510; where if a subsequent short-duration sound event has been detected, defining the cu ⁇ ent DSA with that of the subsequent short-duration sound event 1512 and repeating steps 1506, 1508,
- This d irection s election method for a s ingle i nput channel p air and a s hort- duration sound event is virtually the same as the previously discussed direction selection methods, except that the direction selection method for a single input channel pair and a short-duration sound event is implemented to detect short-duration sound events. Furthermore, at the end of a typical duration of a short-duration sound event (assuming no new sound event has or is occu ⁇ ing), a determination is made as to whether the sound event has actually ended 1 514; and the filtered O SA i s used either i mmediately 1 520 or a fter a decay 1522 depending upon whether the short-duration sound event has actually ended.
- the direction selection method for a single input channel pair and a short-duration sound event is implemented to detect short-duration sound events by using a sound event detection method for a single channel and a single short-duration sound event for each input channel of the channel pair t o p roduce a trigger s ignal ( or two trigger s ignals) that i ndicates whenever a short-duration sound event is detected.
- Whether the immediately preceding short-duration s ound event has actually ended may be determined 1518 by comparing the power envelope and the accommodation signal in each input channel of the channel pair. If the input power envelope I 2 is greater than the accommodation signal AccSig in any input channel of the input channel pair, it is determined that the short-duration sound event has not actually ended. Therefore, the filtered OSA is decayed to from the DSA of the immediately preceding sound event. If however, I is about equal to or less than AccSig in each input channel of the input channel pair, it is determined that the short-duration sound event has actually ended and the filtered OSA is immediately selected 1522. The length of decay needed to provide this smooth transition depends on the sound event type being detected.
- Sound localization methods may also be used to localize the sounds in a sound field when the sound field includes more than one input channel pair ("sound localization methods for multiple input channel pairs and a single sound event type").
- the sound localization methods for multiple input channel pairs and a single sound event include the same basic steps as the sound localization method for a single input channel pair and a single sound event type, which generally include: determining the OSA; determining the DSA and trigger signal; and determining which direction to use.
- the step of determining the DSA and a trigger signal includes determining a D SA and a trigger s ignal for each input channel pair and is accomplished by performing a DSA determination method for multiple input channel pairs and a single sound event that includes an alternate beginning detection method.
- the step of determining which direction to use (the "direction selection method for multiple c hannel pairs and a single sound event type") i ncludes responding to differential signals, trigger signals and DSAs from any of the input channel pairs; and generating and selecting the t rue differential steering angle for all t he input c hannel pairs when a sound event with an accurate DSA is detected in any input channel pair.
- the direction selection method for multiple channel pairs and a single sound event type is shown in more detail in FIG. 16 and is indicated by reference number 1600.
- this direction selection method includes determining whether, at the time any trigger signal indicates the occu ⁇ ence of a sound event, the DSAs are accurate; and if the DSAs are not accurate using the true OSAs for the typical duration of the sound event type being detected instead of the DSAs.
- this direction selection method 1600 includes: determining whether there is an input signal 1602; where if there is an input signal determining whether there is a sound event 1604; where if there is a sound event, determining whether the cu ⁇ ent true DSA is accurate 1606; where if the cu ⁇ ent true DSA is accurate, generating and selecting the cu ⁇ ent true DSA for the typical duration of the sound event type being detected 1608; where if the cu ⁇ ent true DSA is not accurate, generating and selecting the cu ⁇ ent true OSA for the typical duration of the sound event type being detected 1620; once either the true OSA or DSA is generated and selected, determining whether the typical duration has ended 1610; where if the typical duration has not ended, determining whether a subsequent sound event is detected 1612, where if a subsequent sound event is not detected, repeating determining whether the typical duration has ended 1610 and whether a subsequent sound event has been detected 1612 until it is determined that either the typical duration has ended in 1610 or that a subsequent sound event has been detected in 1612;
- OSA with that of the subsequent sound event 1618 and repeating 1610, 1612, 1614, 1616 and 1618 as appropriate; where if it is determined in 1610 that the typical duration has ended, repeating the entire method as appropriate from 1602; where if there is no sound event detected in 1604, determining whether there was an immediately preceding sound event 1622; where if there was no immediately preceding sound event, selecting or continuing to select the filtered OSA 1624; and repeating the entire method as appropriate from 1602; and if there was an immediately preceding sound event, selecting and decaying to the filtered OSA from the true DSA 1626; and repeating the entire method as appropriate from 1602; where the entire method is repeated as appropriate until there is no input signal detected in 1602, where if there is no input signal, stopping the method.
- Determining whether there is an input signal 1602 includes determining whether there i s an input signal in each input channel o f each channel p air.
- D etermining whether there is a sound event 1604 includes performing a sound event detection method for a single channel pair and a single sound event type for each channel pair to produce a trigger signal that indicates the occu ⁇ ence of any sound events for each pair.
- Determining whether the cu ⁇ ent DSA is accurate 1606 includes determining whether the cu ⁇ ent DSA (the DSA determined at the moment any trigger signal indicates that a sound event is occurring) from every input channel pair is accurate using a method for determining DSA accuracy for multiple input channel pairs.
- a method for determining DSA a ccuracy for multiple input channel pairs is shown in more detail in FIG. 17, is indicated by reference number 1700, and includes: determining whether the background signal has dropped by about 3dB or more in at least two input channels 1702; and where if the background signal has not dropped by about
- the consistency check is performed as previously described. If the DSAs pass the consistency check, they are considered accurate.
- the cu ⁇ ent true DSAs are generated and selected for each channel pair 1608 using a true direction isolation method.
- the cu ⁇ ent true OSAs (the OSA determined at the moment any of the trigger signals indicates that the cu ⁇ ent sound event is occurring) are generated and selected for e ach c hannel pair 1 620.
- E ither the cu ⁇ ent true OSA or the cu ⁇ ent true DSA is used for the typical duration of the sound event type being detected, unless a subsequent sound event is detected during the typical duration. As long as it is determined that the typical duration has not ended in 1610, it is determined whether any subsequent sound events are detected 1612.
- a subsequent sound event it is determined whether the true DSAs for the subsequent sound event are accurate 1614 using a method for determining DSA accuracy for multiple input channel pairs. If the DSAs of the subsequent sound event are determined to be a ccurate, the DSAs of the subsequent sound event become the cu ⁇ ent DSAs 1616 to reflect the direction of the subsequent sound event. However, if the DSAs of the subsequent sound event are determined not to be accurate, the OSAs of the subsequent sound become the cu ⁇ ent OSAs 1618 to reflect the direction of the sound event. The process repeats from 1608 or 1620, as appropriate, until the typical duration of any subsequent sound events ends.
- any subsequent sound event ends, it is determined in 1622 whether there was an immediately preceding sound event. If there was an immediately preceding sound event, the filtered OSAs are selected and the direction of the sound field decays from that indicated by the DSAs to that indicated by the OSAs 1626. However, if there was no immediately preceding sound event, the filtered OSA is used, or continues to be used 1624.
- any of the sound localization methods for multiple channel pairs and a single sound event type may further include using the consistency check to reduce the occu ⁇ ence of inaccurate DSAs through use of an accommodation adjustment method and/or an e ⁇ or threshold adjustment method as previously described.
- the direction selection method for multiple input channel pairs and a single sound event type includes determining whether the short-duration sound event has actually ended at the end of the typical duration, and immediately selecting the filtered OSA without any decay if it is determined that the short- duration sound event has not actually ended.
- Such a method (a "direction selection method for multiple input channel pairs and a short-duration sound event") is shown in FIG. 18 and indicated by reference number 1800.
- the direction selection method for a single input channel pair and a short-duration sound event generally includes: determining whether there is an input signal 1802; where if there is an input signal determining whether there is a short- duration sound event 1804; where if there is a short-duration sound event, determining whether the cu ⁇ ent true DSA is accurate 1806; where if the cu ⁇ ent true DSA is accurate, generating and selecting the cu ⁇ ent true DSA for the typical duration of the short-duration sound event being detected 1808; where if the cu ⁇ ent true DSA is not accurate, generating and selecting the cu ⁇ ent true OSA for the typical duration of the short-duration sound event being detected 1820; once either the true OSA or DSA is generated and selected, determining whether the typical duration has ended 1 810; where i f the typical duration has not ended, determining whether a subsequent short-duration sound event is detected 1812, where if a subsequent sound event is not detected, repeating determining whether the typical duration has ended 1 810 and whether a subsequent
- OSA from the true DSA 1826; and repeating the entire method as appropriate from 1802; where if the immediately preceding sound event has not actually ended, selecting and decaying to the filtered OSA 1828 and repeating the entire method as appropriate from 1802; where the entire method is repeated as appropriate until there is no input signal detected in 1802, where if there is no input signal, stopping the method.
- This direction selection method for multiple input channel pairs and a short- duration sound event is virtually the same as the previously discussed direction selection method for multiple input channels and a single sound event type, except that the direction selection method for multiple input channel pairs and a short-duration sound event is implemented to detect short-duration sound events. Furthermore, at the end of a typical duration of a short-duration sound event (assuming no new sound event has or is occurring), a determination is made as to whether the sound event has actually ended 1825; and the filtered OSA is u sed either immediately 1826 or after a decay 1828 depending upon whether the short-duration sound event has actually ended.
- the direction selection method for multiple input channel pairs and a short-duration sound event is implemented to detect short-duration sound events by using a sound event detection method for multiple input channels and a single short-duration sound event for each input channel of the channel p air to produce a trigger signal (or two trigger signals) that indicates whenever a short-duration sound event is detected.
- Whether the immediately preceding sound event has actually ended is determined 1825 by determining whether the input power envelope is greater than the accommodation signal for the short-duration sound event in any input channel. If the input power envelope is greater than the accommodation signal for the short-duration sound event in any input c hannel, i t is determined that the short-duration s ound e vent h as not actually ended.
- a decay is made from the DSAs of the immediately preceding short- duration sound event to the filtered OSA 1826. If however, in each input channel, the power envelope is about equal to the accommodation signal, it is determined that the short-duration sound event has actually ended and the filtered OSA is immediately selected 1828. The length of decay needed to provide this smooth transition depends on the sound event type being detected. For example, if the immediately preceding sound event is an impulsive sound, the decay will take about 5 ms. [101] Sound localization methods may also be used to localize the sounds in a sound field by distinguishing more than one sound event type ("sound localization methods for a single input channel pair and multiple sound event types").
- the sound localization methods for a single input channel pair and multiple sound event types include the same basic steps as t he sound localization method for a single input c hannel pair a nd a single s ound event type, which generally include: determining the OSA; determining the DSA and trigger signal; and determining which direction to use.
- determining the DSA and the trigger signal includes determining the DSA and trigger signal for each sound event type by performing a DSA determination method for a single input channel pair and multiple sound event types that uses an alternate beginning detection method.
- determining which direction to use includes performing a direction selection method for a single input channel pair (either for a single sound event type or a short-duration sound event) in parallel for each sound event type being detected.
- Sound localization methods may also be used to localize the sounds in a sound field with more than one input channel pair by distinguishing more than one sound event type ("sound localization methods for multiple input channel pairs and multiple sound event types").
- the sound localization methods for multiple channel pairs and multiple sound event types include the same basic steps as the sound localization method for multiple input channel pairs and a single sound event type, which generally includes: determining the OSA; determining the DSA and trigger signal for each channel pair; and determining which direction to use according to a direction selection method for multiple input channel pairs and a single sound event type.
- determining the DSA and trigger signal for each channel pair includes determining a DSA and trigger signal for each sound event type in each input channel pair. Determining a DSA and trigger signal for each sound event type in each input channel pair is accomplished by performing a DSA determination method for multiple input channel pairs and multiple sound event types that includes an alternate beginning detection method.
- performing the direction selection method for multiple input channel pairs and multiple sound event types in parallel for each sound event includes, responding to the trigger signals from any sound event type; generating and selecting the true DSA for all the input channel pairs if a sound event of any type with an accurate DSA is detected in any input channel pair; or selecting the true OSA for all the input channel pairs if a sound event of any type with an inaccurate DSA is detected in any channel.
- the sound localization methods for multiple input channel pairs and multiple sound event types may also include using the accommodation adjustment methods and/or the e ⁇ or threshold adjustment methods as previously described.
- the method will determine if the drum hit has actually ended, and if it has, the method will immediate revert to using the direction indicated by the filtered OSA and move the entire sound field back to the center input channel.
- the human hearing mechanism will perceive the drum hit as originating from the rear and the music as continually originating from the front as if the music had never moved. If however, the DSA is u sed for longer than the typical duration of the drum hit or if at the end o f the typical duration a decay is used to revert to the filtered OSA when the drum hit actually ends prior to the end of the typical duration, the entire sound field, including the music will be perceived as having moved to the rear.
- the beginning of the shout will be detected as an impulse, and assuming the cu ⁇ ent impulsive DSA is co ⁇ ect, the impulsive DSA will be selected for the typical d uration of an impulsive sound, however, e ither d uring the typical duration or immediately after, the syllable portion of the shout will be detected and assuming it is co ⁇ ect, the syllable DSA will be selected and used for the typical duration of a syllable. Because at the moment the syllable is detected, the DSA of the previously detected impulsive sound will equal that of the syllable, no change in direction will occur. Therefore, the direction indicated in the sharp onset of the shout will be quickly captured according to its impulsive nature and the direction will be used for time characteristic of its syllabic nature.
- the sound event detection methods, sound event detection and localization m ethods and s ound localization methods and any methods included in any o f these methods may be implemented in a sound event device as shown in FIG. 19 and indicated as reference number 1900.
- the optimization device 1900 generally includes a detection unit 1902 and may also include an interface unit 1904.
- the detection unit 1902 includes a processor 1908 coupled to a memory device 1906.
- the memory device 1908 may be any type of fixed or removable digital storage device and (if needed) a device for reading the digital storage device including, floppy disks and floppy drives, CD-ROM disks and drives, optical disks and drives, hard-drives, RAM, ROM and other such devices for storing digital information.
- the processor 1908 may be any type of apparatus used to process digital information.
- the memory device 1 906 m ay store the s ound field and at least one of the following methods: the sound event detection methods, sound event detection and localization m ethods and sound localization methods and any methods included in any o f these methods (collectively, the "detection and/or localization methods").
- the memory communicates one of the detection and/or localization methods, and if necessary the sound field, via a memory signal 1912 to the processor 1908.
- the processor 1908 then performs the detection and/or localization method.
- the interface unit 1904 generally includes an input device 1914 and an output device 1916.
- the output device 1916 is any type of visual, manual, audio, electronic or electromagnetic device capable of communicating information from a processor or memory to a person or other processor or memory. Examples of output devices include, but are not limited to, monitors, speakers, liquid crystal displays, networks, buses, and interfaces.
- the input device 1914 is any type of visual, manual, mechanical, audio, electronic, or electromagnetic device capable of communicating information from a person or processor or memory to a processor or memory. Examples of input devices include keyboards, microphones, voice recognition systems, trackballs, mice, networks, buses, and interfaces.
- the input and output devices 1914 and 1916 may be included in a single device such as a touch screen, computer, processor or memory coupled to the processor via a network.
- the sound field may be communicated to the memory device 1918 from the input device 1914 through the processor 1920.
- the optimized model parameters may be communicated from the processor 1920 to the output device 1916.
- Sound event detectors may be used for a variety of applications for which the detection of specific types of sound events is helpful.
- a sound event detector that detects syllables may be used as part of a phonic detector or as part of a speech recognition or speech coding system.
- a sound event detector that detects syllables may be used in conjunction with an audio amplification device, such as a microphone. This allows the microphone to remain off until a syllable is detected from a speaker, thus preventing the microphone from amplifying undesired sounds and feedback through the microphone itself when the speaker is silent.
- Sound event detectors like the sound event detection methods, generally produce a trigger signal that includes some flag or marker, such as a pulse, that indicates the occu ⁇ ence of a sound event. Also, like the sound event detection methods, the sound event detectors can be implemented to detect any number and combination of sound event types in a sound field that is generated in any number of input channels. In the following description, the sound event detectors are discussed in order of increasing complexity, with each subsequent sound detector incorporating the elements of the prior sound event detectors, except as indicated.
- FIG. 20 One example o f a sound event detector that detects a s ingle type of sound event in a sound field generated in only one input channel (a "sound event detector for a single input channel and a single sound event type") is shown in FIG. 20 and indicated by reference number 2000.
- the sound event detector and localizer may be implemented to detect any number of sound event types in any number of input channel pairs.
- the entire sound field is generated through a left input channel.
- the term "left" as used in this example does not have any directional meaning because the entire sound field i s contained in a single input c hannel and is u sed s imply for the p urposes o f explanation.
- a sound event detector for a single input channel and a single sound event type 1900 generally includes a frequency bias filter 2001; an accommodation circuit for a single input channel 2002; and a trigger generation circuit for a single input channel 2004.
- the accommodation circuit for a single input channel 2002 uses the sound field to produce a differential signal in the left input channel "Lo" for the sound event type being detected and the trigger generation circuit for a single input channel 2004 uses the accommodation signal Lo to produce a trigger signal "TI" indicating whenever a sound event of the type being detected is detected.
- the frequency bias filter 2001 models the frequency bias of the human hearing mechanism by emphasizing frequencies in the sound field from about 500 Hz to about 4000 Hz.
- the accommodation circuit for a single input channel 1702 separates sound events from any background signals in the sound field by modeling accommodation.
- This circuit 2102 is shown in more detail in FIG. 21 and generally includes: a multiplier 2102; a low-pass filter 2104; and an accommodation signal circuit 2006.
- the multiplier 2002 converts the sound field as generated in the left input channel (the "input signal") into a power signal " Lin 2 .”
- the i nput signal is generally a v oltage signal and c an generally be converted into a power signal by squaring the input signal.
- the resulting power signal Lin 2 includes many fluctuations, some of which indicate sound events, and some of which indicate noise.
- the low-pass filter then removes the fluctuations with rise times faster than about 30 ms from the power signal Lin to produce a filtered power signal L .
- This low-pass filter 2104 may be of any type, such as a filter with a roll-off of 12 dB/octave.
- the accommodation circuit 2106 creates and subtracts the accommodation signal (which represents the long-term average power in the sound field as it is ) to create the differential signal Lo.
- the accommodation circuit 2106 generally includes an operational amplifier 2108; a resistor 2114; a diode 2 112 and a capacitor 2 110.
- the filtered power signal L 2 is coupled to the positive terminal of the operational amplifier 2108 or, alternatively any device c apable o f determining a difference between two signals.
- capacitor 2110 will act as an open circuit resulting in an accommodation signal L that is about equal the filtered power signal L to produce a differential signal Lo about equal to zero.
- the filtered power signal L when the filtered power signal L does contain a sound event, the filtered power signal L will rapidly increase according to the rise time of the sound event. This rapid increase in L 2 will cause a co ⁇ esponding spike in the differential signal Lo.
- capacitor 2110 After the rise-time of the sound event, capacitor 2110 will charge causing the accommodation signal L? to gradually rise according to time constant defined by resistor 2114 and capacitor 21 10. This time constant is generally made equal to the accommodation rate of the human hearing mechanism, determined experimentally to be about 300 ms. L? will continue to rise until the voltage across capacitor 2110 (and thus P ) equals L 2 or until the sound event ends or starts to decay. This increasing L?
- the differential signal Lo therefore, includes a s eries of fluctuations with on-times equal to or less than those characteristic of the sound event type being detected and with fall-times defined by the accommodation signal, and/or the end of the sound event.
- the trigger generation circuit for a single input channel 1904 (shown in FIG. 19) then detects the sound events to produce a trigger signal TI' that includes a pulse whenever a sound event is detected.
- a trigger generation circuit 1904 for a single input channel is shown in more detail in FIG. 22 and includes: a high-pass filter 2202; a normalization circuit 2206 and a low-pass filter 2208.
- the goal of the trigger generation circuit 1904 is to remove as many fluctuations caused by noise as possible and to deemphasize those that are not removed. As explained previously in connection with the sound event detection methods, this is accomplished by removing the fluctuations that have frequencies higher than those characteristic of the sound event being detected and by normalizing the differential signal Lo with the s hort-term high frequency p ower i n t he differential signal.
- the normalization i s accomplished using an automatic gain control circuit which includes the high-pass filter 2202 and the normalization circuit 2206.
- the high-pass filter includes a capacitor/resistor pair that defines the cutoff frequency as that which is characteristic of the sound event being detected.
- a rectifier may be included between the high-pass filter 2202 and the normalization circuit 2206 to rectify any negative pulses or fluctuations.
- the normalization circuit 2206 which includes an integrator 2210 and a divide by circuit 2212, then averages the high-frequency component of Lo over a short time period defined by the integrator 2210. The short time period defined by the integrator may equal about 160 ms, however, this time p eriod may be adjusted as a function of the type of sound field.
- the divide-by circuit 2212 then divides Lo by the averaged HFl to yield the normalized differential signal Nl.
- a rectifier (not shown) may be included between the normalization circuit 2206 and the low-pass filter 2208 to rectify any negative pulses or fluctuations.
- the normalized differential signal Nl is then filtered by a low-pass filter 2208 to remove fluctuations with frequencies higher than are characteristic of the sound event being detected to yield a filtered normalized differential s ignal NT.
- additional n oise can be removed from N 1' by including circuitry that detects and removes fluctuations that occur more often than is characteristic of the sound event being detected and that remove any fluctuations that occur when a decrease in the sound field of at least 10 dB is detected.
- Nl' therefore, includes a series of pulses of varying amplitudes representing the occu ⁇ ence of sound events and fluctuations due to noise.
- a threshold detector 2218 In order to detect the sound events from the noise in the filtered normalized differential signal Nl', a threshold detector 2218 detects only those pulses with an amplitude greater than a threshold. This helps to distinguish pulses indicating sound events from fluctuations due to noise.
- the output of the threshold detector is a trigger signal "TI" that indicates, generally by pulses, the occu ⁇ ence of a sound event in the sole (left) input channel of the sound field.
- the sound event detector for a single input channel and a 5 single sound event type may also include a threshold adjustment circuit. The threshold adjustment circuit adjusts the threshold of the threshold detector in order to adjust the sensitivity of the sound event detector.
- the threshold detector may allow manual adjustment of the threshold and may include a voltage source and a variable resistor coupled to the threshold detector in the trigger generation circuit.
- the resistance of the resistor may be l o manually controlled by a knob or switch or other such device to control the voltage supplied by t he voltage s ource to the t hreshold d etector which i s u sed b y t he t hreshold d etector t o define the threshold.
- the threshold detector provides automatic adjustment of the threshold and includes a counter coupled to the output of the trigger generation circuit and a comparator coupled to the counter and the threshold detector in the trigger generation
- the counter counts the number of sound events that occur in a specified time period and communicates this number to the comparator. This specified time period is generally on the order of about a few seconds.
- the comparator then produces a voltage which is inversely proportional to the number of sound events and communicates this voltage to the threshold detector which uses the voltage to define the threshold. Generally, the threshold is increased
- any of the sound event detectors may not include an accommodation signal circuit, which is of particular use when detecting short-duration sound events, such as impulsive sounds.
- An example of a sound event detector that does not include an accommodation signal circuit which is of particular use when detecting short-duration sound events, such as impulsive sounds.
- An example of a sound event detector that does not include an accommodation signal circuit which is of particular use when detecting short-duration sound events, such as impulsive sounds.
- An example of a sound event detector that does not include an accommodation signal circuit which is of particular use when detecting short-duration sound events, such as impulsive sounds.
- This sound event detector for short-duration sound events 2300 includes: a frequency bias filter 2301; a linear to dB c onverter 2302; a high-pass filter 2303; and a trigger generation circuit for a single channel pair 2304.
- the frequency bias filter 2301 emphasizes the frequencies in the input signal from about 500 Hz to about 4000 Hz to
- the sound event detectors for short-duration sound events may also include a threshold adjustment circuit.
- Sound event detectors may also be implemented when the sound field is generated in two or more input channels.
- a sound event detector that detects a single sound event type in a sound field generated in two input channel may include a sound event detector for a single channel and a single channel pair for each input channel that produces a trigger signal for each input channel.
- the trigger signals may be combined to form a single trigger signal that indicates the occu ⁇ ence of a sound event in any input channel.
- a sound event detector for multiple input channels and a single sound event type may produce only a single trigger signal for each channel pair from a difference signal.
- the entire sound field is either generated through a single input channel pair including a left input channel and a right input channel.
- this method is applicable for any number of input channels or input channel pairs.
- the sound event detector for multiple input channels and a single sound event type 2400 includes: a first accommodation circuit for a single input channel 2402; a second accommodation circuit for a single input channel 2404; and an alternate trigger generation circuit for a single input channel pair 2406.
- the first and second accommodation circuits 2402 and 2404, respectively, are generally identical.
- the first accommodation circuit 2402 uses the left input channel signal ("Lin") to produce a differential signal for the left input channel (the "left differential signal" or "Lo").
- the alternate trigger generation circuit for a single input channel pair 2406 uses both the right and left differential signals to produce a trigger signal that indicates the occu ⁇ ence of sound events in either input channel (the "left-right trigger signal” or "Tlr").
- the trigger generation circuit for a single input channel pair 2406 is shown in more detail in FIG. 25 and includes: an operational amplifier 2501 ; a rectified high-pass filter 2502; a rectified normalization c ircuit 2506; a l ow-pass filter 2508 and a threshold detector 2510. Although similar to the trigger generation circuit for a single input channel (as shown in FIG. 25),
- the trigger generation circuit for a single input channel pair 2406 also includes an operational amplifier 2501 (or other device capable of determining a difference) that creates a signal equal to the difference between the left and right differential signal (the "left-right difference signal” or “Lo-Ro") and uses the left-right difference signal to create a trigger signal for the left-right input channel pair (the "left-right trigger signal” or Tlr").
- the left-right difference signal is obtained by subtracting Ro from Lo, it may alternatively be determined by subtracting Lo from Ro.
- the left-right difference signal Lo- Ro includes a series of pulses and other fluctuations that indicated the occu ⁇ ence of sound events and noise in either input channel of the input channel pair.
- the pulses and fluctuations in the difference signal may have a positive or a negative amplitude depending on whether the power in the transient is greater in the left input channel or the right input channel, respectively.
- a first rectifier 2504 is included in the rectified high-pass filter 2502 and a second rectifier is included in the normalization circuit 2506.
- the rectified high-pass filter 2502 produces a rectified high-pass left-right difference signal ("HFlr"). This rectified high-pass left-right difference signal is used by the normalization circuit 2506 to normalize the difference signal
- Lo-Ro the result of which is rectified by the second rectifier 2507 to produce a normalized left-right signal ( "Nlr”).
- the low-pass filter 2508 removes fluctuations due to noise with rise-times faster than those characteristic of the sound event being detected to produce a filtered normalized left-right signal ("Nlr' ").
- additional noise can be removed from Nlr' by including circuitry that detects and removes fluctuations that occur more often than is characteristic of the sound event being detected and that remove any fluctuations that occur when a decrease in the sound field of at least 10 dB is detected.
- Nlr' therefore, includes a series of pulses of varying amplitudes representing the occu ⁇ ence of sound events and fluctuations due to the remaining noise.
- the threshold detector 2510 detects the sound events as those pulses that have an amplitude greater than a threshold to create the left-right trigger signal Tlr.
- This sound event detector may be repeated in parallel for multiple input channel pairs to produce a trigger signal for each input channel pair.
- this sound event detector for multiple input channels and a single sound event may also include a threshold adjustment circuit for each trigger generation circuit.
- the sound event detectors for multiple input channels and a single sound event that include an alternate trigger generation circuit may also be refe ⁇ ed to as "sound event detectors for a single channel pair and a single sound event type.” Additionally, any of the sound event detectors for multiple input channels (or a single input channel pair) and a single sound event may include a threshold adjustment circuit for each trigger generation circuit.
- Sound event detectors may also be implemented so that more than one type of sound event is detected.
- These "sound event detector for a single input channel and multiple sound event types" generally include a sound event detector for a single input channel and a single s ound event type implemented in parallel for each sound event type being detected to produce a trigger signal for each sound event type being detected.
- An example of such a sound event detector for a single input channel and multiple sound event types is shown in FIG. 26 and indicated by reference number 2600.
- the entire sound field is generated through a left input channel.
- the term "left” as used in this example does not have any directional meaning because the entire sound field is contained in a single input channel and, in fact, the i nput channel c an be given any designation.
- the sound event detector for a single input channel and multiple sound event types 2600 is implemented to detect syllables and impulsive sounds. However, any number or combination of sound events may be detected.
- the sound event detector for a single input channel and multiple sound event types 2600 generally includes: a accommodation circuit for a s ingle input channel 2602; a trigger generation circuit for a single input channel implemented for syllables 2604; and a sound event detector for a single input channel and a single short- duration sound event implemented for impulsive sounds 2606.
- the accommodation circuit for a single input channel 2602 uses the input signal Lin to produce a differential signal.
- the trigger generation signal for a single input channel implemented to detect syllables 2604 uses the differential signal to produce a trigger signal that indicates the occu ⁇ ence of syllables in the sole input channel (left) of the sound field ("Tl(s)").
- the trigger generation signal for a single input channel implemented to detect syllables 2604 includes filters (see FIG. 22) for which the cut-off frequency rise time is about 33 ms.
- the trigger generation circuit for a single input channel implemented to detect impulsive sounds 2606 uses the input signal Lin to produce a trigger signal that indicates the occu ⁇ ence of impulsive sounds in the sole input channel (left) of the sound field ("Tl(i)").
- This trigger generation circuit for a single input channel implemented to detect impulsive sounds 2606 includes a high-pass filter (see 2303 in FIG. 22) for which the cut-off rise-time is about 3 ms.
- the sound event detectors for multiple input channels and a single sound event may include a threshold adjustment circuit for each trigger generation circuit.
- Sound event detectors may also be implemented so that more than one type of sound event is detected in more than one input channel.
- These "sound event detectors for multiple i nput c hannels and m ultiple sound e vent t ypes" m ay produce a trigger signal for each sound event type in each input channel pair.
- trigger signals in each channel pair may be combined in almost any manner to reduce the number of trigger signals.
- An example of such a sound event detector for multiple input channels and multiple sound event types is shown in FIG. 27 and indicated by reference number 2700. In this example, the entire sound field is either detected in or reproduced through a left and a right input channel.
- the method may b e implemented for any number and combination o f input channels.
- the sound event detector for multiple input channels and multiple sound event types is implemented to detect syllables and impulsive sounds. However, any number or combination of sound events may be detected.
- the sound event detector for multiple input channels and multiple sound event types 2700 generally includes, a first accommodation circuit for a signal input channel 2702; a second accommodation circuit for a signal input channel 2706; a first sound event detector for a single channel and a short-duration sound event 2708; an alternate trigger generation circuit for a single input channel pair 2710; and a second sound event detector for a single channel and a short-duration sound event 2712.
- the first and second accommodation circuits for a single channel 2702 and 2703, respectively, are identical to each other.
- the first accommodation circuit for a single channel 2702 produces a differential signal for the left input channel Lo.
- the second accommodation circuit for a single channel 2703 produces a differential signal for right input channel Ro.
- the alternative trigger generation circuit for a single channel pair 2718 uses Lo and Ro to produce a trigger signal that indicates the occu ⁇ ence of syllables in the left-right channel pair Tlr(s).
- the alternative trigger generation circuit for a single channel pair 2718 (shown in more detail in FIG. 25, indicated by reference number 2406) includes filters with a cut-off rise-time defined at about 33 ms.
- Both the first and second sound event detector for a single channel and a single short-duration sound event 2708 and 2712, respectively, include a high- pass filter (see 2303 in FIG. 23) with a cut-off rise-time of about 3 ms.
- the sound event detector for multiple input channels and multiple sound event types includes a sound event detector for a single input channel and a single sound event implemented in parallel for each sound event type in each input channel. This sound event detector produces a trigger signal for each sound event in each input channel.
- the s ound event detector for multiple input c hannels and multiple sound event types may include a sound event detector for a single input channel a multiple sound event types implemented in parallel for each input channel. This sound event detector for multiple input channels and multiple sound event types also produces a trigger signal for each sound event in each input channel.
- any of the sound event detectors for multiple input channels and multiple sound event types may also include a threshold adjustment circuit for each trigger generation circuit.
- Sound event detectors and localizers like the sound event detection and localization methods, determine the direction of sound events i n relation to one or more input channel p airs in terms o f a differential steering angle or an ordinary steering angle and in some cases, verify the accuracy of the steering angles. Also, like the sound event detection and localization methods, the sound event detectors and localizers can be implemented to detect any number and combination o f s ound event types in a sound field generated in any number of i nput channels. In the following description, the sound event detectors and localizers are discussed in order of increasing complexity, with each subsequent sound detector and localizer incorporating the elements of the prior sound event detectors and localizers, except as indicated.
- FIG. 28 One example of a sound event detector and localizer implemented to detect a single sound event type in a single input channel pair is shown in FIG. 28 (a "sound event detector and localizer for a single input channel pair and a single sound event type").
- sound events are detected and localized with respect to a right input channel and a left input channel.
- the sound event detector and l ocalizer for a single input channel pair and a single s ound e vent type 2800 shown in FIG. 28 generally includes: a sound event detector for a single input channel and a single sound event type 2804; and a sound event localization circuit for a single input channel pair and a single sound event type 2806.
- the sound event detector for a single input channel pair and a single sound event type 2804 includes any of the sound event detectors for multiple input channels and a single sound event, which includes a trigger generation circuit for a single input channel pair and is implemented for whatever sound event is being detected.
- the sound event detector for a single input channel pair and a single sound event type 2804 uses the left input signal Lin and a right input signal Rin to produce an differential signal for the left input channel Lo, a differential signal for the right input channel Ro, and a trigger signal indicating the occu ⁇ ence in either input channel of whatever sound event is being detected Tlr.
- the sound event localization circuit for a single input channel pair and a single sound event type 2806 then uses Lo, Ro and Tlr to produce a true differential steering angle indicating the direction of the detected sound events relative to the right and left input channel dlr'.
- the sound event localization circuit for a single i nput channel p air and a single sound event type 2806 is shown in more detail in FIG. 29 and generally includes a DSA circuit 2904, a switch 2 518; a resistor 2906; a c apacitor 2908; and a c ontrol c ircuit 2910.
- the DSA circuit 2904 uses the left accommodation signal Lo and the right accommodation signal Ro to determine the differential steering angle dlr.
- the DSA circuit 2904 includes a first linear to decibel circuit 2912; a second linear to decibel circuit 2914; an operational amplifier 2916 and a decibel to equivalent angle circuit 2918.
- the first and second linear to decibel circuits 2912 and 2914 respectively, convert the left and right accommodation signals, respectively, from a power signal into a decibel signal.
- the operational amplifier 2916 (or alternatively, any circuit that can determine a difference) determines the ratio between the left and right decibel signals by determining the difference between the two signals. This ratio is then converted into an equivalent angle by the decibel to equivalent angle circuit 2918 to produce the differential steering angle dlr.
- the control circuit 2910, the switch 2906 and the capacitor 2908 generally form a sample-and-hold circuit and can therefore be replaced with any device or circuit that performs a similar function.
- the control circuit 2910 causes the switch 2906 to close and the capacitor 2908 to capture dlr during the rise time of the sound event to produce the true DSA ("dlr' ") for the typical duration of the sound event type b eing detected.
- the control c ircuit 2910 receives the trigger signal Tlr and produces a control signal "con” that controls switch 2906.
- Switch 2906 is a two position switch and is generally in position C when no sound events are detected.
- the switch 2906 When in position A, the switch 2906 is closed, when in position B, the switch is open, and when in position C, the switch 2906 is grounded.
- the control circuit 2910 receives an indication from trigger signal Tlr that a sound event is occurring, it communicates to switch 2906 via "con" a command to close (go to position A). In response, the switch 2906 closes.
- the control circuit 2910 communicates to switch 2906 via con a command to open (go to position B). After the typical duration of the sound event type being detected, the control circuit 2910, communicates to switch 2906 a command via con to go to ground (go to position C).
- the left-right differential steering angle dlr is captured by the capacitor 2908 to create the true DSA dlr'.
- the true DSA is held until the end of the typical duration of the sound event, even if the sound event has not actually ended. For example, if the sound event being detected is a syllable, the true DSA will be held for about 50 ms to about 200 ms, preferably after about 150 ms. In another example, if the sound event being detected is an impulsive sound, the true DSA will be held for about 50 ms.
- the capacitor is ground through switch C causing the voltage held by capacitor 2908 and thus, dlr' to go to zero.
- the capacitor 2908 is chosen so that it can sufficiently capture dlr during the rise time of the sound event being detected. For example, if the sound event being detected is a syllable, the capacitor must be able to capture dlr in 20 ms to about 30 ms. In another example, if the sound event being detected is an impulsive sound, the capacitor must be able to capture dlr in about 5ms.
- the sound event detector and localizer for a single input channel pair and a single s ound event t ype is o ptimized for s ound e vents with v ery short durations such as impulsive sounds. In some cases, it is very difficult to obtain an accurate
- this optimized sound event detector and localizer for a single input channel pair and a single sound event type further includes a circuit for determining the average DSA of all sound events detected in an about 3 ms time frame (the "DSA averaging circuit").
- the DSA averaging circuit is generally implemented in the sound event localization circuit 2806 between the DSA circuit 2804 and the switch 2906.
- the sound event detector and localizer may also be implemented to detect a single sound event type in a sound field generated in multiple input channel pairs (a "sound event detector and localizer for multiple input channel pairs and a single sound event type").
- a sound event detector and localizer for multiple input channel pairs and a single sound event type implemented to detect and localize a single sound event in both a right-left input channel pair ("LR input channel pair") and a center-su ⁇ ound input channel pair ("CS input channel pair) is shown in FIG. 30 and designated by reference number 3000.
- This detector and localizer may be implemented for any combination of input channel pairs with the LR input channel pair and the CS input channel pair used for only for the purposes of example.
- the sound event detector and localizer for multiple input channel pairs and a single sound event type 3000 produces a true differential steering angle for the LR input channel pair (dlr') and the CS input channel pair (dcs') and generally includes: a first and second sound event detector for a single input channel pair and a single sound event 3010 and 3012, respectively; and a sound event localization circuit for multiple input channel pairs and a single sound event type 3014.
- the first and second sound event detectors for a single input channel pair and a single sound event type 3010 and 3012 are both implemented to detect the same sound event.
- the first sound event detector for multiple input channels 3010 uses the input signals in the left and right input channels, Lin and Rin, respectively, to produce a left differential signal Lo, a right differential signal Ro, a left power envelope L 2 , a right power envelope R , and a left-right trigger signal Tlr.
- the second sound event detector for multiple input channels 3012 uses the input signals in the center and su ⁇ ound input channels, Cin and Rin, respectively, to produce a center differential signal Co, a su ⁇ ound differential signal So, a c enter power envelope C , a su ⁇ ound power envelope S , and a c enter-su ⁇ ound trigger signal Tcs.
- the sound event localization circuit for multiple input channel pairs and a single sound event type 3014 uses the left differential signal Lo, the right differential signal Ro, and the left-right trigger signal to produce an angle indicating the direction of a detected sound event that equals either a true OSA or a true DSA for the left-right input channel pair
- circuit 3014 uses all the power envelopes, dlr and dcs to verify the accuracy of the DSA.
- the sound event localization circuit for multiple input channel pairs and a single sound event type 2614 is shown in more detail in FIG.
- 31 and generally includes: a first OSA circuit 3102; a first DSA circuit 3104; a second DSA circuit 3106; a second OSA circuit 3108; a verification circuit 3116; a control circuit 3118; a first two-position switch 3110; a first three-position switch 3112; a first capacitor 3114; a second two-position switch
- d/lr' is produced by the first OSA circuit 3102, first DSA circuit 3104, first two-position switch 3110, first three-position switch 3112 and the first capacitor 3114.
- d/cs' is produced by the second OSA circuit 3108, second DSA circuit 3106, second two-position switch 3120, second three-position switch 3122 and the second capacitor
- the first and second OSA circuits 3102 and 3108 convert Lin and Rin and Cin and Sin, respectively, into ordinary steering angles, "lr” and " cs" using known methods.
- the control circuit 3118, the first three-way switch 3112, and the first capacitor 3114 form a first sample and hold circuit, while the control circuit 3118, the second three-way switch 3122, the second capacitor 3124 form a second sample and hold circuit.
- Both the first and second three way switches 3112 and 3122, respectively, are normally in position C when no sound events are detected and controlled by the control circuit so that each switch is closed when a sound event i s d etected ( moved to p osition A ), o pened a 11 he e nd o f t he typical r ise time o f the sound event type being detected (moved to position B) and grounded at the end of the typical duration of the sound event type being detected (move to position C).
- the control circuit 3118 produces a control signal "con” that is communicated to the three-way switches 3112 and 3122.
- the control signal causes the three-way switches 3112 and 3122 to move to (or stay in) position A whenever either trigger signal (Tlr or Tcs) indicates that a sound event is being detected in either input channel pair. Subsequently, con causes the three-way switches 3112 and 3122 to move to position B at the end of the typical duration of the rise-time of the sound event. Then con causes the three-way switches 3112 and 3122 to move to position C at the end of the typical duration of the sound event type being detected.
- Both two-way switches 3110 and 3120 include positions D and E. When a sound event is detected and both two-way switches 31 10 and 3120 are in position D, the DSA for each channel pair are used to indicate the direction of the sound event. However, when a sound event is detected and both two-way switches 31 10 and 3120 are in position E, the OSA for each channel pair is used to indicate the position of the sound event.
- the verification circuit 3 1 16 c ontrols both t wo-way s witches 3110 and 3 120 v ia a verification s ignal " vs" according to w hether t he D SAs a re c o ⁇ ect w hen a s ound e vent i s detected.
- W hen e ither trigger signal indicates that a sound event is being detected, the verification circuit determines whether at least two of the power envelopes (L 2 , R 2 , C 2 , S 2 ) have dropped by at least 3dB or more.
- the verification circuit will communicate to both two-way switches 3110 and 3120 via vs causing them to move or stay in position E. However, if at least two of the power envelopes have not dropped by at least 3dB or more, the verification circuit will communicate to both two-way switches 3110 and 3120 via vs causing the to move or stay in position D.
- the verification circuit of the sound event localization circuit 3116 also includes a circuit for performing a consistency check (a "consistency check circuit").
- the consistency check circuit is coupled to both DSA circuits and uses the differential steering angles produced by each make a further determination of the accuracy of the differential steering angles.
- the consistency check circuit uses a known circuit for determining the sum of the absolute values of dlr and dcs at the moment any trigger signal indicates that a sound event has been detected and then determines whether the sum is less than or equal to 45 degrees.
- the v erification s ignal will communicate to both of the two-way switches 31 10 and 3120 causing then to move to position D.
- additional circuitry may be added to adjust the degree of accommodation (an “accommodation adjustment circuit") and the threshold as a function of the c onsistency check (an “e ⁇ or threshold c ircuit"). Both the accommodation adjustment circuit and the e ⁇ or threshold circuit (not shown) are coupled to the consistency check circuit and include a counter that counts the number of e ⁇ ors detected by the consistency check in a time period of about several seconds.
- the accommodation adjustment circuit is also coupled to the accommodation signal in the transient detection circuits included in the DSA circuits 3104 and 3102 and further includes an accommodation voltage source that is adjusted according to the number of e ⁇ ors counted by the counter. As the number of e ⁇ ors increases, the voltage produced by the accommodation voltage source will increase to reduce the degree of accommodation.
- the e ⁇ or threshold circuit is also coupled to the threshold detector in the trigger generation circuits and further includes an e ⁇ or voltage source that is adjusted according to the number of e ⁇ ors counted by the counter. As the number of e ⁇ ors increases, the voltage produced by the e ⁇ or voltage source will increase to cause the threshold voltage to increase so that fewer sound events are detected.
- the sound event detector and localizer may also be implemented to detect multiple sound event types in a sound field generated in a single input channel pair (a "sound event detector and localizer for a single input channel pair and multiple sound event types").
- a sound event detector and localizer for a single input channel pair and a multiple sound event types generally includes a sound event detector and localizer for a single input channel and a single sound even type implemented in parallel for each sound event type being detected to produce a differential steering angle for each sound event type in the input channel pair.
- the sound event detector and localizer for a single input channel pair and multiple sound event types may include a sound event detector for a single channel pair and a single sound event type implemented for each sound event type being detected and a sound event localization circuit for a single channel pair and a single sound event type that produces a d ifferential steering angle i ndicating the direction of all types of sound e vents being detected.
- An example of such a sound event detector and localizer for a single input channel pair and multiple sound event types that detects syllables and impulsive sounds is shown in FIG. 32.
- the sound event detector and localizer for a single input channel pair and multiple sound event types includes: a sound event detector for a channel pair and a single sound event type 3202; a sound event detector for a single channel pair and a short- duration sound event type 3204; and a sound event localization circuit for a single channel pair and a single sound event type 3206.
- the sound event detector for a single channel pair and a single sound event type 3202 may include the sound event detector for multiple channels and a single sound event type shown in FIG.
- the sound event detector for a single channel pair and a short-duration sound event type 3204 may include the sound event detector for a single channel pair and a single short-duration sound event shown in FIG.
- the sound event localization circuit for a single channel pair and a single sound event type 3206 may include the sound event localization circuit for a single channels pair and multiple sound event types as shown in FIG. 29 implemented to produce a left right differential steering angle indicating the direction of a detected syllable or impulsive sound whenever either trigger signal (Tlr(s) or Tlr(i)) indicates that a sound event is occurring.
- the sound event detector and localizer may also be implemented to detect multiple sound event types in a sound field generated in multiple input channel pairs (a "sound event detector and localizer for multiple input channel pairs and multiple sound event types").
- a sound event detector and localizer for multiple input channel pairs and multiple sound event types that detects syllables and impulsive sounds in both the left- right and center-su ⁇ ound channel pairs is shown in FIG. 33.
- This sound event detector and localizer for multiple input channel pairs and multiple sound event types 3300 includes: a first and a second sound event detector for a single channel pair and a single short-duration sound event 3 302 and 3308, respectively; a first and a s econd sound e vent d etector for a single channel pair and a single sound event type 3304 and 3306, respectively; and a sound event localization circuit for multiple channel pairs and a single sound event type 3310.
- the first sound event detector for a single channel pair and a single sound event type 3 304 m ay include the sound event d etector for multiple channels and a single sound event type shown in FIG. 24 implemented to detect syllables and to produces a left differential signal Lo, a right differential signal Ro and a left-right trigger signal indicating the occu ⁇ ence of syllables "Tlr(s)."
- the second sound event detector for a single channel pair and a single sound event type 3306 may include the sound event detector for multiple c hannels and a s ingle sound event type shown in FIG.
- the first sound event detector for a single channel pair and a short-duration sound event type 3302 may include the sound event detector for a single channel pair and a single short-duration sound event shown in FIG.
- the second sound event detector for a single channel pair and a short-duration sound event type 3308 may include the sound event detector for a single channel pair and a single short-duration sound event shown in FIG.
- the sound event localization circuit for multiple channel pairs and a single sound event type 3310 may include the sound event localization circuit for multiple channel pairs and a single sound event type as shown in FIG.
- any of the sound event detector and localizer for multiple input channel pairs a nd m ultiple s ound e vent t ypes may a dditionally i nclude an a ccommodation adjustment circuit and/or an e ⁇ or threshold circuit.
- Both the accommodation adjustment circuit and the e ⁇ or threshold circuit are coupled to the consistency check circuit and include a counter that counts the number of e ⁇ ors detected by the consistency check in a time period of about several seconds.
- the accommodation adjustment circuit is also coupled to the accommodation signal in the transient detection circuits included in the DSA circuits.
- the e ⁇ or threshold circuit is also coupled to the threshold detector in the trigger generation circuits and further includes an e ⁇ or voltage source that is adjusted according to the number of e ⁇ ors counted by the counter. As the number of e ⁇ ors increases, the voltage produced by the e ⁇ or voltage source will increase to cause the threshold voltage to increase so that fewer sound events are detected.
- a stereo/su ⁇ ound detector determines whether a sound field is intended f or r eproduction in two input c hannels or i n more than two input c hannels.
- FIG. 34 (a "stereo/su ⁇ ound detector for a single sound event type") is shown in FIG. 34 and includes: a sound event detector and localizer for a single input channel pair and a single sound event type 3402 and a detector and c ounter 3404.
- the sound event detector and localizer for a single input channel pair and a single sound event type 3402 uses the signals in the center input channel Cin and the su ⁇ ound input channel Sin to produce a true differential steering angle dcs' that reflects the direction of whatever sound event is being detected.
- the threshold detector and counter 3404 determines the number of times dcs' falls within the range of about 0 degrees to about -45 degrees.
- the threshold detector and counter 3404 produces a signal sursig indicating that the sound field should be reproduced in su ⁇ ound. Conversely, if the number does not exceed a predetermined value, then the detector and counter 3404 produces a signal sursig indicating that the sound field should be reproduced in stereo. In general, if the number of sound events detected during a relatively long time period on the order of about 10 s to about 15 s is on the order of about 2 or 3, the detector and counter 3404 will produce a signal sursig indicating that the sound field should be reproduced in su ⁇ ound. Additionally, the detector and counter may further determine the duration of the sound events and only count those with durations that exceed a predetermined value as sound events that are intended for reproduction in the rear.
- sound events with durations less than about 50 ms will not be counted as sound events that are intended to be reproduced in the rear.
- sound events with durations of about 200 ms to about 300 ms will be counted as sound events that are intended to be reproduced in the rear.
- a stereo/su ⁇ ound detector implemented to detect a single sound event type (a "stereo/su ⁇ ound detector for multiple sound event types") is shown in FIG. 35 and is indicated by reference number 3500.
- the stereo/su ⁇ ound detector 3500 is implemented to count the number of syllables and impulsive functions intended for reproduction in the rear.
- the detector 3500 includes: first and s econd sound event d etector and 1 ocalizers for a s ingle i nput channel p air and a single sound event type 3502 and 3504 and a detector and counter 3506.
- the first sound event detector and localizer for a single input channel pair 3502 uses the signals in the center input channel Cin and the su ⁇ ound input channel Sin to produce a true differential steering angle that reflects the direction of the syllables dcs'(s).
- the second sound event detector and localizer for a single input channel pair 3504 uses the signals in the center input channel Cin and the su ⁇ ound input channel Sin to produce a true differential steering angle that reflects the direction of the impulsive sounds dcs'(i).
- the detector and counter 3506 determines the number of then determines the number of times dcs'(s,i) falls within the range of about 0 degrees to about -45 degrees to produce a signal sursig(s,i) that indicates whether the sound is to be reproduced in stereo or in su ⁇ ound.
- Sound localizers separately detect and localize sound events and non-sound events in a sound field to produce a continuous indication of the direction of the sound field.
- sound localizers may be used in a variety of applications, such as the reproduction of recorded sounds, particularly if the sounds are part of a complex sound field that includes sound events occu ⁇ ing simultaneously with steady-state sounds.
- the sound localizers can be used as part of a matrix decoder to derive the true directions of the sounds from a two input channel mix. Also, the sound localizers can be implemented to detect any number and combination of sound event types in a sound field generated in any number of input channels. In the following description, the sound localizers are discussed in order of increasing complexity, with each subsequent sound localizer incorporating the elements of the prior sound localizers, except as indicated.
- the sound localizer for a single input channel pair and a single sound event type 3600 includes: a sound event detector for a single channel pair and a single sound event type 3602; and a sound localization circuit for a single input channel pair and a single s ound event 3604.
- the sound e vent d etector for a s ingle channel pair and a single sound event type 3602 may include the sound event detector for multiple channels and a single sound event type shown in FIG.
- the sound localization circuit 3604 uses Tlr, Lo, Ro, and the signals in the left and right input channels Lin and Rin, respectively, to produce a steering angle that indicates the direction of the sound field with respect to the left-right input channel pair in terms of an ordinary steering angle and a differential steering angle (generally refe ⁇ ed to in this application as a "comprehensive steering angle” and the comprehensive steering angle with respect to the left-right input channel pair is refe ⁇ ed to as "clr' ").
- the sound localization circuit for a single input channel pair and a single sound event type 3604 is shown in more detail in FIG.
- the sound event localization circuit for a single input channel pair and a single sound event type 3604 uses the left and right differential signals Lo and Ro, respectively, and the left and right input signals Lin and Rin, respectively, to produce a left-right comprehensive steering angle clr'.
- clr' equals follows the OSA when no sound events are detected and follows the DSA whenever a sound event is detected.
- the OSA circuit 3702 uses Lin and Rin to determine the ordinary steering angle lr.
- the DSA circuit 3704 uses the differential signals Lo and Ro to produce the differential steering angle.
- the control circuit 3706, the first switch 3708, the resistor 3710; the second switch 3712, and the capacitor 3714 generally form a sample-and-hold circuit and can therefore be replaced with any device or circuit that performs a similar function.
- the first switch 3708 will be open and the second switch 3712 will be closed. In this state, clr' will follow lr at a rate defined by the resistor 3710 and the capacitor 3714.
- the control circuit 3706 causes the first switch 3708 to close and the capacitor 3712 to capture dlr during the typical rise time of the sound event type being detected so that clr' equals the true DSA for the typical duration of the sound event type being detected. More specifically, the control c ircuit 3716 receives the trigger signal Tlr and produces a control signal "con" that controls the first switch 3708 and the second switch 3712. When Tlr indicates that a sound event is occurring, the control circuit 3706 communicates to the first switch 3708 via "con” a command to close causing the first switch 3708 to close.
- the control circuit 3706 communicates to the first and second switches 3708 and 3712, respectively, via con a command to open causing the first and second switches 3708 and 3712 to open.
- the control circuit 3706 communicates to the second switch 3712 a command via con to go close.
- the left-right differential steering angle dlr is captured by the capacitor 3714 to create the true DSA dlr'.
- Clr' is defined by the true DSA, which is held until the end of the typical duration of the sound event, even if the sound event has not actually ended.
- the true DSA will be held for about 50 ms to about 200 ms, preferably after about 150 ms. In another example, if the sound event being detected is an impulsive sound, the true DSA will be held for about 50 ms.
- the capacitor 3714 At the end of the typical duration of the sound event type being detected, the capacitor 3714 will charge or discharge until it reflects lr at a specified rate.
- the capacitor 3714 and resistor 3710 are chosen so that they define an RC time constant that will cause the specified rate of decay. For example, the RC time constant be equal to about 300 ms.
- the sound localization circuit can be specifically implemented for short-duration sound events.
- a short-duration sound event such as an impulsive sound
- An example of a sound event localization circuit that includes this functionality is shown in FIG. 38.
- the sound localization circuit for a single input channel pair and short-duration sound events 3800 includes: an OSA circuit 3802; a DSA circuit 3 804; a control circuit 3 808; a detector 3810; a first switch 3 812; a s econd switch 3816; a first resistor 3814; a third switch 3813; a second resistor 3815; and a capacitor 3818.
- This sound localization circuit 3800 produces a left-right comprehensive steering angle clr'.
- clr' When no sound events are detected, clr' equals the filtered OSA (which is lr after it is filtered by resistor 3814 and capacitor 3816). However, when a sound event is detected, clr' either equals the true OSA or the true DSA.
- the sound localization circuit for a single channel pair and a single short duration sound event generally behaves in the same way as the sound localization circuit shown in FIG. 37, except that at the end of a sound event, clr' either decays to lr or goes to lr immediately depending on whether the sound event has actually ended.
- the control circuit 3808 After the control circuit 3808 receives a trigger signal indicating that a sound event is occurring and has determined that the typical duration of the sound event being detected has ended, in addition to its other functions as previously described, it communicates with the detector 3810 to establish whether the sound event has actually ended.
- the detector 3810 determines whether a sound event has actually ended by comparing the power envelope with the accommodation signal in each input channel of the channel pair. If the input power envelope is greater than the accommodation signal in any input channel of the input channel pair, the detector 3810 will determine that the s hort-duration sound event has not actually ended. Conversely, if the input power envelope is not greater than the accommodation signal in any input channel of the input channel pair, the detector 3810 will determine that the short- duration sound event has actually ended.
- control circuit 3808 If the control circuit 3808 establishes that the sound event has not actually ended, it will then instruct the second switch 3816 to close via con.
- the second switch 3816 closes, clr' will drift to the filtered OSA at a specified rate. Therefore, capacitor 3818 and resistor 3814 are select so that their RC time constant is about equal to the specified rate (which i s generally about 3 00 m s). F or example, i f i mpulsive sounds are being detected, the RC time constant of capacitor 3818 and resistor 3814 will be about 5ms.
- the control circuit 3808 establishes that the sound event has actually ended, it will then instruct the third switch 3813 to close via con.
- the third switch 3813 closes, clr' immediately goes to lr. Therefore, the second resistor 3815 is chosen so that the RC time constant of the second resistor 3815 and the capacitor 3818 is much lower than that of the first resistor 3814 and the capacitor 3818 (generally about a factor of ten lower). Generally, the third switch will remain closed for a very short time (generally about 3 ms to about 10 ms). After this very short time has ended, the control circuit 3803 will instruct the third s witch 3813 to open and the second s witch 3816 to close s o that c lr' will go to the filtered OSA.
- the sound localizer for a single input channel pair and a short- duration sound event type may further include a DSA averaging circuit.
- the DSA averaging circuit may be implemented in the sound event localization circuit 3800 between the DSA circuit 3804 and the switch 3812.
- the sound localizer may also be implemented to detect a single sound event type in a sound field generated in multiple input channel pairs (a "sound localizer for multiple input channel pairs and a single sound event type").
- a sound localizer for multiple input channel pairs and a single sound event type implemented to detect and localize a single sound event in both an LR input channel pair and a CS input channel pair is shown in FIG. 39 and designated by reference number 3900.
- This sound localizer may be implemented for any combination of input channel pairs with the LR input channel pair and the CS input channel pair u sed i n t his i nstance for explanation p urposes o nly.
- the so und 1 ocalizer for multiple input channel pairs and a single sound event type 3900 produces a comprehensive steering angle for the LR input channel pair ("clr' ") and the CS input channel pair (“ccs' ”) and generally includes: a first and second sound event detector for a single input channel pair and a single sound event 3902 and 3904, respectively; and a sound localization circuit for multiple input channel pairs and a single sound event type 3906.
- the first and second sound event detectors for a single input channel and a single sound event type 3902 and 3904 may include a sound event detector for multiple input channels and a single sound event type, such as that shown in FIG. 24, implemented to detect the same sound event.
- the first sound event detector for a single input channel 3902 uses Lin' and Rin' to produce a left differential signal Lo, a right differential signal Ro, and a left- right trigger signal Tlr.
- the sound localization circuit for multiple input channel pairs and a single sound event type 3906 uses the left differential signal Lo, the right differential signal Ro, and the left-right trigger signal to produce a left-right comprehensive steering angle clr', and uses the center differential signal Co, the su ⁇ ound differential signal So and the center-su ⁇ ound trigger signal Tcs to produce a center-su ⁇ ound comprehensive steering angle ccs'. Additionally, circuit 3906 uses all the power envelopes, dlr and dcs to verify the accuracy of the DSA. This circuit 3906 is shown in more detail in FIG. 40 and generally includes: a first OSA circuit 4002; a first DSA circuit 4004; a second DSA circuit 4006; a second OSA circuit
- clr' is produced by the first OSA circuit 4002, first DSA circuit 4004, first two-position switch 4010, first switch 4012, the first resistor 4014, the second switch 4016, and the first capacitor 3114.
- d/cs' is produced by the second OSA circuit 4008, second DSA circuit 4006, second two-position switch 4030, the third switch 4032, the second resistor 4034 and the second capacitor 4038.
- the first and second OSA circuits 4002 and 4008 convert Lin and Rin and Cin and Sin, respectively, into ordinary steering angles, lr and cs.
- the c ontrol c ircuit 4022, the first s witch 4012, the second switch 4016, and the first capacitor 4018 form a first sample and hold circuit
- the control circuit 4022, the third switch 4032, the fourth switch 4034, the second resistor 4034, and the second capacitor 4038 form a second s ample and hold circuit.
- Both the first and third switches 4012 and 4032 are normally open when no sound events are detected and are controlled by the control circuit so that each switch is closed when a sound event is detected and opened at the end of the typical rise time of the sound event type being detected.
- Both the second and forth switches 4016 and 4036 are normally closed when no sound events are detected and opened after a sound event has been detected and the typical rise-time of the sound event type being detected has ended.
- the control circuit 4022 produces a control signal "con” that is communicated to all the switches 4012, 4016, 4032 and 4036.
- the control signal causes the first a nd second s witches 4012 and 4016 t o c lose ( or stay c losed) w henever e ither t rigger signal (Tlr or Tcs) indicates that a sound event is being detected in either input channel pair. Subsequently, con causes all the switches 4012, 4016, 4032 and 4036 to open at the end of the typical duration of the rise-time of the sound event.
- Both two-way switches 4010 and 4030 include positions D and E.
- the DSA for each channel pair are used to indicate the direction of the sound event.
- the OSA for each channel pair is used to indicate the position of the sound event.
- the verification circuit 4020 c ontrols both two-way switches 4010 and 4020 v ia a verification s ignal " vs" according to w hether t he D S As a re c o ⁇ ect w hen a s ound e vent i s detected.
- W hen e ither trigger signal indicates that a sound event is being detected
- the verification circuit determines whether at least two of the power envelopes (L , R , C , S ) have dropped by at least 3dB or more. If at least two of the power envelopes have dropped by at least 3dB or more from the last sound event, the verification circuit will communicate to both two-way switches 4010 and 4020 via vs causing them to move or stay in position E. However, if at least two of the power envelopes have not dropped by at least 3dB or more, the verification circuit will communicate to both two-way switches 4010 and 4020 via vs causing the to move or stay in position D.
- the verification circuit of the sound event localization circuit 4020 also includes a consistency check circuit.
- the consistency check circuit is coupled to both DSA circuits and uses the differential steering angles produced by each to make a further determination of the accuracy of the differential steering angles as previously described.
- the sound localizer for multiple sound event types and a single sound event type may further include an accommodation adjustment circuit and/or an e ⁇ or threshold circuit.
- the sound localization circuit for multiple channel pairs and a single sound event type can be specifically implemented for short-duration sound events, such as impulsive sounds.
- short-duration sound events such as impulsive sounds.
- FIG. 41 An example of such a sound localization circuit for multiple input channel pairs and short-duration sound events is shown in FIG. 41 and indicated by reference number 4100.
- This sound localization circuit for multiple input channel pairs and short-duration sound events 4100 generally includes: a first OSA circuit 4102; a first DSA circuit 4104; a second DSA circuit 4106; a second OSA circuit 4108; a verification circuit 4120; a control circuit 4122; a detector circuit 4124; a first two-position switch 4110; a first switch 4112; a first resistor 4114; a first capacitor 4118; a second switch
- This sound localization circuit 4100 produces a left- right comprehensive steering angle.
- clr' will equal the filtered OSA (which is lr after being filtered by first resistor 4114 and first capacitor 41 18).
- clr' When there is a sound event detected, clr' will equal either the true OSA or the true DSA of the left-right channel pair.
- This sound localization circuit 4100 also produces a center- su ⁇ ound comprehensive steering ccs'. When no sound events are detected, clr' equals the filtered OSA. When sound events are detected, clr' will equal the true OSA or the true DSA of the center-su ⁇ ound channel pair.
- the sound localization circuit for multiple channel pairs and a single short duration sound event generally behaves in the same way as the sound localization circuit for multiple channel pairs an a single sound event type shown in FIG.
- control circuit 4122 After the control circuit 4122 receives a trigger signal indicating that a sound event is occu ⁇ ing and has determined that the typical duration of the sound event being detected has ended, in addition to its other functions as previously described, it communicates with the detector 4124 to establish whether the sound event has actually ended.
- the detector 4122 determines whether a sound event has actually ended by comparing the power envelope with the accommodation signal in each input channel of the channel pair. If the input power envelope is greater than the accommodation signal in any input channel of the input channel pair, the detector 4122 w ill determine that the short-duration sound event has not actually ended.
- the detector 4124 will determine that the short- duration sound event has actually ended. If the control circuit 4122 establishes that the sound event has not actually ended, it will then instruct the second and fourth switches 4116 and 4136, respectively, to close via con. When the second switch 4116 and the fourth switch
- the first and second capacitors 4118 and 4138, respectively, and the first and second resistors 4114 and 4134, respectively, are select so that their RC time constant is about equal to the specified rate. For example, if impulsive sounds are being detected, the RC time constant of capacitor 4118 and resistor 41 14, as well as the RC time constant of capacitor 4138 and resistor 4134 will be about 5ms.
- control circuit 4122 establishes that the sound event has actually ended, it will then instruct the fifth and sixth switches 41 13 and 4133, respectively, to close via con.
- fifth and sixth switches 4113 and 4133, respectively, close clr' and ccs' both immediately goes to lr and cs, respectively. Therefore, the RC time constant of the third resistor 41 15 and the first capacitor 4118, and the RC time constant of the fourth resistor 4135 and the second capacitor 4138 will both be very low.
- RC time constants may be about at least a factor of ten lower than the RC time constant of the first resistor 4114 and the first c apacitor 41 18 and the RC time constant o f the second resistor 4 134 and the second capacitor 4138.
- the fifth and sixth switches 4113 and 4133, respectively, will remain closed for a short time. This short time may be about 3 ms to about 10 ms.
- the control circuit 4122 will instruct the fifth and sixth switches 4113 and 4135, respectively, to open and the second and fourth switches 4166 and 4136, respectively, to close so that clr' and ccs' will reflect the filtered OSAs.
- the sound localizer for a multiple input channel pair and a short-duration sound event type may further include first and second DSA averaging circuits.
- the first DSA averaging circuit may be implemented in the sound event localization circuit 4100 between the first DSA circuit 4104 and the first two-position switch 41 10.
- the second DSA averaging circuit may be implemented in the sound event localization circuit 4100 between the second DSA circuit 4106 and the second two-position switch 4130.
- the verification circuit of the sound localization circuit for multiple channel pairs and a single short-duration sound event type 4100 also includes a consistency check circuit.
- the consistency check circuit is coupled to both DSA circuits and uses the differential steering angles produced by each to make a further determination of the accuracy of the differential steering angles as previously described.
- the sound localizer for multiple sound event types and a single sound event type may further include an accommodation adjustment circuit and/or an e ⁇ or threshold circuit.
- the sound localizer may also be implemented to detect multiple sound event types in a sound field generated in a single input channel pair (a "sound localizer for a single input channel pair and multiple sound event types").
- a sound event detector and localizer for a single input channel pair and a multiple sound event types implemented to detect and localize syllables and impulsive sounds in a left-right input channel pair is shown in FIG. 42 and indicated by reference number 4200 (however, this localizer may be implemented for any input channel pair to detect any combination of sound event types).
- the sound localizer for a single input channel pair and multiple sound event types 4200 generally includes: a s ound event d etector for a single input c hannel and a single sound e vent type 4202; a sound e vent d etector for a s ingle-channel and a s ingle short-duration sound e vent device 4206; and a sound 1 ocalization circuit for a s ingle input channel pair and multiple sound event types 4204.
- the sound event detector for a single channel pair and a single sound event type 4202 may i nclude the s ound e vent d etector for m ultiple c hannels and a s ingle sound event type shown in FIG. 24 implemented to detect syllables and to produces a left differential signal Lo, a right differential signal Ro and a left-right trigger signal indicating the occu ⁇ ence of syllables Tlr(s).
- the sound event detector for a single channel pair and a short-duration sound event type 4206 may include the sound event detector for a single channel pair and a single short-duration sound event shown in FIG.
- the sound localization circuit for a single channel pair and a single sound event type 4204 may include the sound localization circuit for a single channels pair and multiple sound event types as shown in FIG. 37 implemented to produce a left-right differential steering angle indicating the direction of a detected syllable or impulsive sound whenever either trigger signal (Tlr(s) or Tlr(i)) indicates that a sound event is occurring.
- this sound localizer is implemented to detect and localize syllables and impulsive sounds in a left-right input channel pair, it may be implemented to detect any number of sound event types in any channel pair simply by adding additional sound event detectors for a single channel pair implemented to detect the desired sound event type and having the sound localization circuit for a single channel pair and a single sound event type respond to any of the trigger signals produced by the sound event detectors for a single channel pair.
- the sound localizer may also be implemented to detect multiple sound event types in a sound field generated in multiple input channel pairs (a "sound localizer for multiple input channel pairs and multiple sound event types").
- a sound localizer for multiple input channel pairs and multiple sound event types specifically localizes syllables and impulsive sounds in a left-right input channel pair and a center-su ⁇ ound input channel pair is shown in FIG. 43 (however, the sound localizer for multiple input channel pairs and multiple sound event types may be implemented to specifically localize any combination of sound events in any number of input channel pairs).
- This sound localizer for multiple input channel pairs and multiple sound event types 4300 i excludes: a first and a second sound event detector for a single channel pair and a single short-duration sound event 4308 and 4306, respectively; a first and a second sound event detector for a single channel pair and a single sound event type 4302 and 4304, respectively; and a sound event localization circuit for multiple channel pairs and a single sound event type 4310.
- the first sound event detector for a single channel pair and a single sound event type 4302 m ay include the sound event detector for multiple channels and a s ingle sound event type shown in FIG. 24 implemented to detect syllables and to produces a left differential signal Lo, a right differential signal Ro and a left-right trigger signal indicating the occu ⁇ ence of syllables Tlr(s).
- the second sound event detector for a single channel pair and a single sound event type 4304 may include the sound event detector for multiple channels and a s ingle sound event type shown in FIG.
- the first sound event detector for a single channel pair and a short-duration sound event type 4308 may include the sound event detector for a single channel pair and a single short-duration sound 5 event shown in FIG. 23 implemented to detect impulsive sounds and to produce a left-right trigger signal (as a combination of a left trigger signal and a right trigger signal) indicating the occu ⁇ ence of impulsive sounds Tlr(i).
- the second sound event detector for a single channel pair and a short-duration sound event type 4306 may include the sound event detector for a single channel pair and a single short-duration sound event shown in FIG. 23 l o implemented to detect impulsive sounds and to produce a center-su ⁇ ound trigger signal (as a combination of a center trigger signal and a su ⁇ ound trigger signal) indicating the occu ⁇ ence of impulsive sounds Tcs(i).
- the sound localization circuit for multiple channel pairs and a single sound event type 4310 may include the sound localization circuit for multiple channel pairs and a single sound event type as shown in FIG. 40 implemented to produce left-right
- the 20 implemented to detect and localize syllables and impulsive sounds in left-right and center- su ⁇ ound input channel pairs, it may be implemented to detect any number of sound event types in number of channel pair simply by adding additional sound event detectors for a single channel pair implemented to detect the desired sound event types in the desired channel pairs and having the sound localization circuit for multiple channel pairs and a single
- 25 sound event type 4310 respond to any o f the trigger signals produced by the sound event detectors for a single channel pair.
- any of the sound localizers for multiple input channel pairs and multiple sound event types may additionally include an accommodation a djustment circuit and/or an e ⁇ or threshold circuit. Both the accommodation adjustment circuit and the e ⁇ or
- threshold circuit (not shown) are coupled to the consistency check circuit and include a counter that counts the number of e ⁇ ors detected by the consistency check in a time period of about several seconds.
- the accommodation adjustment circuit is also coupled to the accommodation s ignal in the transient detection c ircuits included i n t he DSA circuits.
- t he e ⁇ or t hreshold c ircuit is also coupled to the threshold d etector i n t he trigger generation circuits and further includes an e ⁇ or voltage source that is adjusted according to the number of e ⁇ ors counted by the counter. As the number of e ⁇ ors increases, the voltage produced by the e ⁇ or voltage source will increase to cause the threshold voltage to increase so that fewer sound events are detected.
- Implementations of the sound event detection methods, sound event detection and localization methods and sound localization methods and any methods included in any of these methods include computer readable software code. These algorithms may be implemented together or independently. Such code may be stored on a processor, a memory device or on any other computer readable storage medium. Alternatively, the software code may be encoded in a computer readable electronic or optical signal. The code may be object code or any other code describing or controlling the functionality described in this application.
- the computer readable storage medium may be a magnetic storage disk such as a floppy disk, an optical disk such as a CD-ROM, semiconductor memory or any other physical object storing program code or associated data.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Remote Sensing (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Physics & Mathematics (AREA)
- Radar, Positioning & Navigation (AREA)
- Signal Processing (AREA)
- Geology (AREA)
- Geophysics (AREA)
- Environmental & Geological Engineering (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- General Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Abstract
Description
Claims
Priority Applications (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP03747642.1A EP1585947B1 (en) | 2002-05-03 | 2003-05-02 | Sound detection and localization system |
KR1020047017705A KR101047194B1 (en) | 2002-05-03 | 2003-05-02 | Sound Detection and Positioning System |
CA2483609A CA2483609C (en) | 2002-05-03 | 2003-05-02 | Sound detection and localization system |
CN038145073A CN1830009B (en) | 2002-05-03 | 2003-05-02 | Sound detection and localization system |
AU2003265935A AU2003265935A1 (en) | 2002-05-03 | 2003-05-02 | Sound detection and localization system |
JP2004501891A JP4744874B2 (en) | 2002-05-03 | 2003-05-02 | Sound detection and specific system |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US37755802P | 2002-05-03 | 2002-05-03 | |
US60/377,558 | 2002-05-03 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2003093775A2 true WO2003093775A2 (en) | 2003-11-13 |
WO2003093775A3 WO2003093775A3 (en) | 2006-03-30 |
Family
ID=29401529
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2003/013685 WO2003093775A2 (en) | 2002-05-03 | 2003-05-02 | Sound detection and localization system |
Country Status (8)
Country | Link |
---|---|
US (4) | US20040005065A1 (en) |
EP (1) | EP1585947B1 (en) |
JP (2) | JP4744874B2 (en) |
KR (1) | KR101047194B1 (en) |
CN (1) | CN1830009B (en) |
AU (1) | AU2003265935A1 (en) |
CA (2) | CA2483609C (en) |
WO (1) | WO2003093775A2 (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7667647B2 (en) | 1999-03-05 | 2010-02-23 | Era Systems Corporation | Extension of aircraft tracking and positive identification from movement areas into non-movement areas |
US7739167B2 (en) | 1999-03-05 | 2010-06-15 | Era Systems Corporation | Automated management of airport revenues |
US7777675B2 (en) | 1999-03-05 | 2010-08-17 | Era Systems Corporation | Deployable passive broadband aircraft tracking |
US7782256B2 (en) | 1999-03-05 | 2010-08-24 | Era Systems Corporation | Enhanced passive coherent location techniques to track and identify UAVs, UCAVs, MAVs, and other objects |
US7889133B2 (en) | 1999-03-05 | 2011-02-15 | Itt Manufacturing Enterprises, Inc. | Multilateration enhancements for noise and operations management |
US7908077B2 (en) | 2003-06-10 | 2011-03-15 | Itt Manufacturing Enterprises, Inc. | Land use compatibility planning software |
US7965227B2 (en) | 2006-05-08 | 2011-06-21 | Era Systems, Inc. | Aircraft tracking using low cost tagging as a discriminator |
US8072382B2 (en) | 1999-03-05 | 2011-12-06 | Sra International, Inc. | Method and apparatus for ADS-B validation, active and passive multilateration, and elliptical surveillance |
US8203486B1 (en) | 1999-03-05 | 2012-06-19 | Omnipol A.S. | Transmitter independent techniques to extend the performance of passive coherent location |
US8446321B2 (en) | 1999-03-05 | 2013-05-21 | Omnipol A.S. | Deployable intelligence and tracking system for homeland security and search and rescue |
WO2017095559A1 (en) * | 2015-12-01 | 2017-06-08 | Qualcomm Incorporated | Determining audio event based on location information |
Families Citing this family (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7451006B2 (en) * | 2001-05-07 | 2008-11-11 | Harman International Industries, Incorporated | Sound processing system using distortion limiting techniques |
US6804565B2 (en) * | 2001-05-07 | 2004-10-12 | Harman International Industries, Incorporated | Data-driven software architecture for digital sound processing and equalization |
KR100548899B1 (en) * | 2001-05-11 | 2006-02-02 | 교세라 가부시키가이샤 | Portable communication terminal and wireless communication system therefor |
US7443987B2 (en) * | 2002-05-03 | 2008-10-28 | Harman International Industries, Incorporated | Discrete surround audio system for home and automotive listening |
US20040005065A1 (en) * | 2002-05-03 | 2004-01-08 | Griesinger David H. | Sound event detection system |
US20050108024A1 (en) * | 2003-11-13 | 2005-05-19 | Fawcett John Jr. | Systems and methods for retrieving data |
CA2592099A1 (en) | 2004-12-22 | 2006-06-29 | Nucleonics, Inc. | Conserved hbv and hcv sequences useful for gene silencing |
US8036402B2 (en) * | 2005-12-15 | 2011-10-11 | Harman International Industries, Incorporated | Distortion compensation |
EP1989853B1 (en) * | 2006-02-23 | 2016-12-14 | Togewa Holding AG | Switching system and corresponding method for unicast or multicast end-to-end data and/or multimedia stream transmissions between network nodes |
JP4786384B2 (en) * | 2006-03-27 | 2011-10-05 | 株式会社東芝 | Audio processing apparatus, audio processing method, and audio processing program |
JP4867516B2 (en) * | 2006-08-01 | 2012-02-01 | ヤマハ株式会社 | Audio conference system |
DE602007009784D1 (en) * | 2007-01-16 | 2010-11-25 | Harman Becker Automotive Sys | Apparatus and method for tracking surround headphones using audio signals below the masked threshold of hearing |
US9015051B2 (en) * | 2007-03-21 | 2015-04-21 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Reconstruction of audio channels with direction parameters indicating direction of origin |
US8908873B2 (en) * | 2007-03-21 | 2014-12-09 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Method and apparatus for conversion between multi-channel audio formats |
CN100505837C (en) * | 2007-05-10 | 2009-06-24 | 华为技术有限公司 | System and method for controlling image collector for target positioning |
US8050414B2 (en) * | 2008-10-16 | 2011-11-01 | Gas Technology Institute | Robust pipe-strike pulse detector |
US8045738B2 (en) * | 2008-10-31 | 2011-10-25 | Zounds Hearing, Inc. | System for managing feedback |
WO2011139772A1 (en) * | 2010-04-27 | 2011-11-10 | James Fairey | Sound wave modification |
TWI403304B (en) | 2010-08-27 | 2013-08-01 | Ind Tech Res Inst | Method and mobile device for awareness of linguistic ability |
US9111526B2 (en) | 2010-10-25 | 2015-08-18 | Qualcomm Incorporated | Systems, method, apparatus, and computer-readable media for decomposition of a multichannel music signal |
CN102592597B (en) * | 2011-01-17 | 2014-08-13 | 鸿富锦精密工业(深圳)有限公司 | Electronic device and audio data copyright protection method |
US9143571B2 (en) * | 2011-03-04 | 2015-09-22 | Qualcomm Incorporated | Method and apparatus for identifying mobile devices in similar sound environment |
EP2495581B1 (en) * | 2011-03-04 | 2017-03-22 | BlackBerry Limited | Human audible localization for sound emitting devices |
JP5994470B2 (en) * | 2012-08-08 | 2016-09-21 | 株式会社Jvcケンウッド | Sound source direction detecting device, sound source direction detecting method, sound source direction detecting program |
DE102013207149A1 (en) * | 2013-04-19 | 2014-11-06 | Siemens Medical Instruments Pte. Ltd. | Controlling the effect size of a binaural directional microphone |
KR102195897B1 (en) * | 2013-06-05 | 2020-12-28 | 삼성전자주식회사 | Apparatus for dectecting aucoustic event, operating method thereof, and computer-readable recording medium having embodied thereon a program which when executed by a computer perorms the method |
US9747899B2 (en) | 2013-06-27 | 2017-08-29 | Amazon Technologies, Inc. | Detecting self-generated wake expressions |
CN103531202B (en) * | 2013-10-14 | 2015-10-28 | 无锡儒安科技有限公司 | Distributed Detection sound event also chooses the method for similar events point |
US9672727B1 (en) * | 2013-11-05 | 2017-06-06 | Alarm.Com Incorporated | Handling duress input |
US9478229B2 (en) | 2013-12-10 | 2016-10-25 | Massachusetts Institute Of Technology | Methods and apparatus for recording impulsive sounds |
US10134416B2 (en) * | 2015-05-11 | 2018-11-20 | Microsoft Technology Licensing, Llc | Privacy-preserving energy-efficient speakers for personal sound |
US9977645B2 (en) * | 2015-10-01 | 2018-05-22 | Moodelizer Ab | Dynamic modification of audio content |
EP3434024B1 (en) | 2016-04-21 | 2023-08-02 | Hewlett-Packard Development Company, L.P. | Electronic device microphone listening modes |
WO2017192200A1 (en) * | 2016-05-05 | 2017-11-09 | The Research Foundation For The State Unversity Of New York | Compositions for treating periodontitis and dental calculus accumulation |
US10264999B2 (en) | 2016-09-07 | 2019-04-23 | Massachusetts Institute Of Technology | High fidelity systems, apparatus, and methods for collecting noise exposure data |
EP3324407A1 (en) | 2016-11-17 | 2018-05-23 | Fraunhofer Gesellschaft zur Förderung der Angewand | Apparatus and method for decomposing an audio signal using a ratio as a separation characteristic |
EP3324406A1 (en) * | 2016-11-17 | 2018-05-23 | Fraunhofer Gesellschaft zur Förderung der Angewand | Apparatus and method for decomposing an audio signal using a variable threshold |
CN106814670A (en) * | 2017-03-22 | 2017-06-09 | 重庆高略联信智能技术有限公司 | A kind of river sand mining intelligent supervision method and system |
CN108806711A (en) * | 2018-08-07 | 2018-11-13 | 吴思 | A kind of extracting method and device |
US10811032B2 (en) * | 2018-12-19 | 2020-10-20 | Cirrus Logic, Inc. | Data aided method for robust direction of arrival (DOA) estimation in the presence of spatially-coherent noise interferers |
TWI728632B (en) * | 2019-12-31 | 2021-05-21 | 財團法人工業技術研究院 | Positioning method for specific sound source |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1995002288A1 (en) | 1993-07-07 | 1995-01-19 | Picturetel Corporation | Reduction of background noise for speech enhancement |
EP0682436A2 (en) | 1994-05-09 | 1995-11-15 | AT&T Corp. | Voice actuated switching system |
EP0690655A2 (en) | 1994-06-30 | 1996-01-03 | AT&T Corp. | Direction finder |
Family Cites Families (118)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3845572A (en) | 1972-08-02 | 1974-11-05 | Singer Co | Modular vehicle trainer sound system having a plurality of separately controllable sound generators and a polyphonic speaker array |
US4251688A (en) | 1979-01-15 | 1981-02-17 | Ana Maria Furner | Audio-digital processing system for demultiplexing stereophonic/quadriphonic input audio signals into 4-to-72 output audio signals |
JPS56132804A (en) | 1980-03-22 | 1981-10-17 | Sharp Corp | Operational tone quality control circuit |
JPS60107998A (en) * | 1983-11-16 | 1985-06-13 | Nissan Motor Co Ltd | Acoustic device for automobile |
US4704728A (en) * | 1984-12-31 | 1987-11-03 | Peter Scheiber | Signal re-distribution, decoding and processing in accordance with amplitude, phase, and other characteristics |
US4941177A (en) | 1985-03-07 | 1990-07-10 | Dolby Laboratories Licensing Corporation | Variable matrix decoder |
US4799260A (en) | 1985-03-07 | 1989-01-17 | Dolby Laboratories Licensing Corporation | Variable matrix decoder |
US5046098A (en) * | 1985-03-07 | 1991-09-03 | Dolby Laboratories Licensing Corporation | Variable matrix decoder with three output channels |
JPS621441U (en) * | 1985-06-20 | 1987-01-07 | ||
US4759066A (en) * | 1987-05-27 | 1988-07-19 | Polk Investment Corporation | Sound system with isolation of dimensional sub-speakers |
US4972482A (en) | 1987-09-18 | 1990-11-20 | Sanyo Electric Co., Ltd. | Fm stereo demodulator |
US4829299A (en) | 1987-09-25 | 1989-05-09 | Dolby Laboratories Licensing Corporation | Adaptive-filter single-bit digital encoder and decoder and adaptation control circuit responsive to bit-stream loading |
US5189703A (en) | 1988-01-06 | 1993-02-23 | Lucasarts Entertainment Company | Timbre correction units for use in sound systems |
US4862502A (en) | 1988-01-06 | 1989-08-29 | Lexicon, Inc. | Sound reproduction |
US4932059A (en) * | 1988-01-11 | 1990-06-05 | Fosgate Inc. | Variable matrix decoder for periphonic reproduction of sound |
JPH0256600A (en) * | 1988-08-23 | 1990-02-26 | Ricoh Co Ltd | Speech dialing system |
JPH0623119Y2 (en) * | 1989-01-24 | 1994-06-15 | パイオニア株式会社 | Surround stereo playback device |
US5146507A (en) | 1989-02-23 | 1992-09-08 | Yamaha Corporation | Audio reproduction characteristics control device |
US5109419A (en) | 1990-05-18 | 1992-04-28 | Lexicon, Inc. | Electroacoustic system |
US5504819A (en) | 1990-06-08 | 1996-04-02 | Harman International Industries, Inc. | Surround sound processor with improved control voltage generator |
US5172415A (en) | 1990-06-08 | 1992-12-15 | Fosgate James W | Surround processor |
US5428687A (en) * | 1990-06-08 | 1995-06-27 | James W. Fosgate | Control voltage generator multiplier and one-shot for integrated surround sound processor |
US5295189A (en) | 1990-06-08 | 1994-03-15 | Fosgate James W | Control voltage generator for surround sound processor |
US5625696A (en) * | 1990-06-08 | 1997-04-29 | Harman International Industries, Inc. | Six-axis surround sound processor with improved matrix and cancellation control |
US5666424A (en) * | 1990-06-08 | 1997-09-09 | Harman International Industries, Inc. | Six-axis surround sound processor with automatic balancing and calibration |
US5339363A (en) * | 1990-06-08 | 1994-08-16 | Fosgate James W | Apparatus for enhancing monophonic audio signals using phase shifters |
KR920004817Y1 (en) | 1990-08-14 | 1992-07-20 | 삼성전자 주식회사 | Common receiver device of audio mutilateral type |
JP3118023B2 (en) * | 1990-08-15 | 2000-12-18 | 株式会社リコー | Voice section detection method and voice recognition device |
US5119422A (en) | 1990-10-01 | 1992-06-02 | Price David A | Optimal sonic separator and multi-channel forward imaging system |
US5274740A (en) | 1991-01-08 | 1993-12-28 | Dolby Laboratories Licensing Corporation | Decoder for variable number of channel presentation of multidimensional sound fields |
CA2077662C (en) | 1991-01-08 | 2001-04-17 | Mark Franklin Davis | Encoder/decoder for multidimensional sound fields |
US5136650A (en) | 1991-01-09 | 1992-08-04 | Lexicon, Inc. | Sound reproduction |
KR970000147B1 (en) | 1991-01-31 | 1997-01-04 | 삼성전자 주식회사 | Multi-channel sound recording and reproducing system |
US5594800A (en) | 1991-02-15 | 1997-01-14 | Trifield Productions Limited | Sound reproduction system having a matrix converter |
JPH06276599A (en) * | 1991-07-26 | 1994-09-30 | Gijutsu Kenkyu Kumiai Iryo Fukushi Kiki Kenkyusho | Impulsive sound suppressing device |
US5161197A (en) | 1991-11-04 | 1992-11-03 | Lexicon, Inc. | Acoustic analysis |
US5199075A (en) | 1991-11-14 | 1993-03-30 | Fosgate James W | Surround sound loudspeakers and processor |
JPH05191899A (en) | 1992-01-16 | 1993-07-30 | Pioneer Electron Corp | Stereo sound device |
JPH08502867A (en) * | 1992-10-29 | 1996-03-26 | ウィスコンシン アラムニ リサーチ ファンデーション | Method and device for producing directional sound |
US5333201A (en) | 1992-11-12 | 1994-07-26 | Rocktron Corporation | Multi dimensional sound circuit |
US5319713A (en) | 1992-11-12 | 1994-06-07 | Rocktron Corporation | Multi dimensional sound circuit |
US5357574A (en) | 1992-12-14 | 1994-10-18 | Ford Motor Company | Coherent signal generation in digital radio receiver |
ES2149235T3 (en) | 1993-01-22 | 2000-11-01 | Koninkl Philips Electronics Nv | DIGITAL TRANSMISSION IN 3 CHANNELS OF STEREOPHONIC SIGNALS LEFT AND RIGHT AND A CENTRAL SIGNAL. |
CA2112171C (en) * | 1993-02-25 | 2003-10-21 | Bradley Anderson Ballard | Dsp-based vehicle equalization design system |
US5748749A (en) * | 1993-03-24 | 1998-05-05 | Noise Cancellation Technologies, Inc. | Active noise cancelling muffler |
ES2165370T3 (en) * | 1993-06-22 | 2002-03-16 | Thomson Brandt Gmbh | METHOD FOR OBTAINING A MULTICHANNEL DECODING MATRIX. |
US5463424A (en) * | 1993-08-03 | 1995-10-31 | Dolby Laboratories Licensing Corporation | Multi-channel transmitter/receiver system providing matrix-decoding compatible signals |
US5386473A (en) * | 1994-01-21 | 1995-01-31 | Harrison; Robert W. | Passive surround sound circuit |
US5497425A (en) | 1994-03-07 | 1996-03-05 | Rapoport; Robert J. | Multi channel surround sound simulation device |
US5602923A (en) | 1994-03-07 | 1997-02-11 | Sony Corporation | Theater sound system with upper surround channels |
US6144747A (en) | 1997-04-02 | 2000-11-07 | Sonics Associates, Inc. | Head mounted surround sound system |
US5626643A (en) * | 1994-09-26 | 1997-05-06 | Owens-Corning Fiberglas Technology Inc. | Contact drying of fibers to form composite strands |
US5638452A (en) | 1995-04-21 | 1997-06-10 | Rocktron Corporation | Expandable multi-dimensional sound circuit |
US5761313A (en) | 1995-06-30 | 1998-06-02 | Philips Electronics North America Corp. | Circuit for improving the stereo image separation of a stereo signal |
KR0128064Y1 (en) * | 1995-08-18 | 1998-11-02 | 김광호 | Surround sound signal regenative apparatus with sub-woofer signal synthesizing function |
JP2956545B2 (en) | 1995-08-28 | 1999-10-04 | ヤマハ株式会社 | Sound field control device |
US5708719A (en) | 1995-09-07 | 1998-01-13 | Rep Investment Limited Liability Company | In-home theater surround sound speaker system |
US6118876A (en) * | 1995-09-07 | 2000-09-12 | Rep Investment Limited Liability Company | Surround sound speaker system for improved spatial effects |
US5930370A (en) | 1995-09-07 | 1999-07-27 | Rep Investment Limited Liability | In-home theater surround sound speaker system |
KR0174084B1 (en) | 1995-09-25 | 1999-04-01 | 이준 | Inverse Converter of MPEG-2 Multichannel Audio Decoder |
US5798818A (en) | 1995-10-17 | 1998-08-25 | Sony Corporation | Configurable cinema sound system |
US5642423A (en) | 1995-11-22 | 1997-06-24 | Sony Corporation | Digital surround sound processor |
US5956674A (en) | 1995-12-01 | 1999-09-21 | Digital Theater Systems, Inc. | Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels |
US5771295A (en) | 1995-12-26 | 1998-06-23 | Rocktron Corporation | 5-2-5 matrix system |
US5841993A (en) | 1996-01-02 | 1998-11-24 | Ho; Lawrence | Surround sound system for personal computer for interfacing surround sound with personal computer |
US5727068A (en) | 1996-03-01 | 1998-03-10 | Cinema Group, Ltd. | Matrix decoding method and apparatus |
EP0808076B1 (en) * | 1996-05-17 | 2007-11-21 | Micronas GmbH | Surround sound system |
US5850455A (en) | 1996-06-18 | 1998-12-15 | Extreme Audio Reality, Inc. | Discrete dynamic positioning of audio signals in a 360° environment |
US6697491B1 (en) * | 1996-07-19 | 2004-02-24 | Harman International Industries, Incorporated | 5-2-5 matrix encoder and decoder system |
US5870480A (en) * | 1996-07-19 | 1999-02-09 | Lexicon | Multichannel active matrix encoder and decoder with maximum lateral separation |
US5796844A (en) * | 1996-07-19 | 1998-08-18 | Lexicon | Multichannel active matrix sound reproduction with maximum lateral separation |
FI105522B (en) | 1996-08-06 | 2000-08-31 | Sample Rate Systems Oy | Arrangement for home theater or other audio equipment |
US6144474A (en) | 1996-10-21 | 2000-11-07 | Fujitsu Limited | Optical transmission system including optical repeaters with selectively enabled gain equalizers contained therein and including an add/drop apparatus with a plurality of individually selectable filters |
DE19651308C2 (en) | 1996-12-10 | 1998-10-22 | Becker Gmbh | Audio sound system for a motor vehicle |
US6711266B1 (en) * | 1997-02-07 | 2004-03-23 | Bose Corporation | Surround sound channel encoding and decoding |
US6038324A (en) | 1997-02-21 | 2000-03-14 | Ambourn; Paul R. | Automotive surround sound circuit background of the invention |
US5862228A (en) | 1997-02-21 | 1999-01-19 | Dolby Laboratories Licensing Corporation | Audio matrix encoding |
JP3663461B2 (en) | 1997-03-13 | 2005-06-22 | スリーエス テック カンパニー リミテッド | Frequency selective spatial improvement system |
US6973200B1 (en) * | 1997-04-22 | 2005-12-06 | Canon Kabushiki Kaisha | Image processing apparatus, image processing method, and storage medium |
US6198826B1 (en) | 1997-05-19 | 2001-03-06 | Qsound Labs, Inc. | Qsound surround synthesis from stereo |
JP4478220B2 (en) * | 1997-05-29 | 2010-06-09 | ソニー株式会社 | Sound field correction circuit |
US5983087A (en) | 1997-06-26 | 1999-11-09 | Delco Electronics Corporation | Distributed digital signal processing for vehicle audio systems |
US6108584A (en) * | 1997-07-09 | 2000-08-22 | Sony Corporation | Multichannel digital audio decoding method and apparatus |
US6141597A (en) | 1997-09-08 | 2000-10-31 | Picturetel Corporation | Audio processor |
JP3906533B2 (en) | 1997-11-04 | 2007-04-18 | ヤマハ株式会社 | Pseudo stereo circuit |
US6683962B1 (en) * | 1997-12-23 | 2004-01-27 | Harman International Industries, Incorporated | Method and system for driving speakers with a 90 degree phase shift |
US6624873B1 (en) * | 1998-05-05 | 2003-09-23 | Dolby Laboratories Licensing Corporation | Matrix-encoded surround-sound channels in a discrete digital sound format |
JP4151110B2 (en) | 1998-05-14 | 2008-09-17 | ソニー株式会社 | Audio signal processing apparatus and audio signal reproduction apparatus |
EP0980064A1 (en) * | 1998-06-26 | 2000-02-16 | Ascom AG | Method for carrying an automatic judgement of the transmission quality of audio signals |
JP3781902B2 (en) * | 1998-07-01 | 2006-06-07 | 株式会社リコー | Sound image localization control device and sound image localization control method |
JP2000032434A (en) * | 1998-07-08 | 2000-01-28 | Victor Co Of Japan Ltd | Image-pickup device |
JP3484988B2 (en) | 1998-09-22 | 2004-01-06 | ヤマハ株式会社 | Performance information editing method and recording medium storing performance information editing program |
FI113935B (en) | 1998-09-25 | 2004-06-30 | Nokia Corp | Method for Calibrating the Sound Level in a Multichannel Audio System and a Multichannel Audio System |
US6453047B1 (en) | 1998-09-28 | 2002-09-17 | Creative Technology Ltd | Matrix encoding system with improved behavior frequency |
US6590983B1 (en) | 1998-10-13 | 2003-07-08 | Srs Labs, Inc. | Apparatus and method for synthesizing pseudo-stereophonic outputs from a monophonic input |
GB2342830B (en) * | 1998-10-15 | 2002-10-30 | Central Research Lab Ltd | A method of synthesising a three dimensional sound-field |
US6711536B2 (en) * | 1998-10-20 | 2004-03-23 | Canon Kabushiki Kaisha | Speech processing apparatus and method |
US6556685B1 (en) | 1998-11-06 | 2003-04-29 | Harman Music Group | Companding noise reduction system with simultaneous encode and decode |
US6442277B1 (en) | 1998-12-22 | 2002-08-27 | Texas Instruments Incorporated | Method and apparatus for loudspeaker presentation for positional 3D sound |
US6694027B1 (en) * | 1999-03-09 | 2004-02-17 | Smart Devices, Inc. | Discrete multi-channel/5-2-5 matrix system |
JP4610087B2 (en) | 1999-04-07 | 2011-01-12 | ドルビー・ラボラトリーズ・ライセンシング・コーポレーション | Matrix improvement to lossless encoding / decoding |
US6539357B1 (en) | 1999-04-29 | 2003-03-25 | Agere Systems Inc. | Technique for parametric coding of a signal containing information |
JP2001028799A (en) | 1999-05-10 | 2001-01-30 | Sony Corp | Onboard sound reproduction device |
US6442278B1 (en) | 1999-06-15 | 2002-08-27 | Hearing Enhancement Company, Llc | Voice-to-remaining audio (VRA) interactive center channel downmix |
CN100429960C (en) | 2000-07-19 | 2008-10-29 | 皇家菲利浦电子有限公司 | Multi-channel stereo converter for deriving a stereo surround and/or audio centre signal |
US7236838B2 (en) * | 2000-08-29 | 2007-06-26 | Matsushita Electric Industrial Co., Ltd. | Signal processing apparatus, signal processing method, program and recording medium |
JP4264686B2 (en) | 2000-09-14 | 2009-05-20 | ソニー株式会社 | In-vehicle sound reproduction device |
US7457422B2 (en) * | 2000-11-29 | 2008-11-25 | Ford Global Technologies, Llc | Method and implementation for detecting and characterizing audible transients in noise |
DE10110422A1 (en) * | 2001-03-05 | 2002-09-19 | Harman Becker Automotive Sys | Method for controlling a multi-channel sound reproduction system and multi-channel sound reproduction system |
US6996239B2 (en) * | 2001-05-03 | 2006-02-07 | Harman International Industries, Inc. | System for transitioning from stereo to simulated surround sound |
US7451006B2 (en) * | 2001-05-07 | 2008-11-11 | Harman International Industries, Incorporated | Sound processing system using distortion limiting techniques |
US6804565B2 (en) * | 2001-05-07 | 2004-10-12 | Harman International Industries, Incorporated | Data-driven software architecture for digital sound processing and equalization |
US7177432B2 (en) * | 2001-05-07 | 2007-02-13 | Harman International Industries, Incorporated | Sound processing system with degraded signal optimization |
US7447321B2 (en) * | 2001-05-07 | 2008-11-04 | Harman International Industries, Incorporated | Sound processing system for configuration of audio signals in a vehicle |
US20040086130A1 (en) * | 2002-05-03 | 2004-05-06 | Eid Bradley F. | Multi-channel sound processing systems |
US20040005065A1 (en) * | 2002-05-03 | 2004-01-08 | Griesinger David H. | Sound event detection system |
KR100878004B1 (en) * | 2003-06-02 | 2009-01-12 | 후지쓰 텐 가부시키가이샤 | Acoustic field adjusting apparatus |
US20050063551A1 (en) * | 2003-09-18 | 2005-03-24 | Yiou-Wen Cheng | Multi-channel surround sound expansion method |
-
2003
- 2003-05-02 US US10/428,451 patent/US20040005065A1/en not_active Abandoned
- 2003-05-02 US US10/428,366 patent/US7492908B2/en active Active
- 2003-05-02 AU AU2003265935A patent/AU2003265935A1/en not_active Abandoned
- 2003-05-02 US US10/428,405 patent/US7567676B2/en active Active
- 2003-05-02 CN CN038145073A patent/CN1830009B/en not_active Expired - Lifetime
- 2003-05-02 JP JP2004501891A patent/JP4744874B2/en not_active Expired - Lifetime
- 2003-05-02 CA CA2483609A patent/CA2483609C/en not_active Expired - Lifetime
- 2003-05-02 KR KR1020047017705A patent/KR101047194B1/en active IP Right Grant
- 2003-05-02 WO PCT/US2003/013685 patent/WO2003093775A2/en active Application Filing
- 2003-05-02 EP EP03747642.1A patent/EP1585947B1/en not_active Expired - Lifetime
- 2003-05-02 CA CA2773294A patent/CA2773294C/en not_active Expired - Lifetime
-
2004
- 2004-03-26 US US10/810,989 patent/US7499553B2/en not_active Expired - Lifetime
-
2010
- 2010-09-16 JP JP2010208638A patent/JP2011022602A/en not_active Withdrawn
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1995002288A1 (en) | 1993-07-07 | 1995-01-19 | Picturetel Corporation | Reduction of background noise for speech enhancement |
EP0682436A2 (en) | 1994-05-09 | 1995-11-15 | AT&T Corp. | Voice actuated switching system |
EP0690655A2 (en) | 1994-06-30 | 1996-01-03 | AT&T Corp. | Direction finder |
Non-Patent Citations (1)
Title |
---|
See also references of EP1585947A4 |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7667647B2 (en) | 1999-03-05 | 2010-02-23 | Era Systems Corporation | Extension of aircraft tracking and positive identification from movement areas into non-movement areas |
US7739167B2 (en) | 1999-03-05 | 2010-06-15 | Era Systems Corporation | Automated management of airport revenues |
US7777675B2 (en) | 1999-03-05 | 2010-08-17 | Era Systems Corporation | Deployable passive broadband aircraft tracking |
US7782256B2 (en) | 1999-03-05 | 2010-08-24 | Era Systems Corporation | Enhanced passive coherent location techniques to track and identify UAVs, UCAVs, MAVs, and other objects |
US7889133B2 (en) | 1999-03-05 | 2011-02-15 | Itt Manufacturing Enterprises, Inc. | Multilateration enhancements for noise and operations management |
US8072382B2 (en) | 1999-03-05 | 2011-12-06 | Sra International, Inc. | Method and apparatus for ADS-B validation, active and passive multilateration, and elliptical surveillance |
US8203486B1 (en) | 1999-03-05 | 2012-06-19 | Omnipol A.S. | Transmitter independent techniques to extend the performance of passive coherent location |
US8446321B2 (en) | 1999-03-05 | 2013-05-21 | Omnipol A.S. | Deployable intelligence and tracking system for homeland security and search and rescue |
US7908077B2 (en) | 2003-06-10 | 2011-03-15 | Itt Manufacturing Enterprises, Inc. | Land use compatibility planning software |
US7965227B2 (en) | 2006-05-08 | 2011-06-21 | Era Systems, Inc. | Aircraft tracking using low cost tagging as a discriminator |
WO2017095559A1 (en) * | 2015-12-01 | 2017-06-08 | Qualcomm Incorporated | Determining audio event based on location information |
US10134422B2 (en) | 2015-12-01 | 2018-11-20 | Qualcomm Incorporated | Determining audio event based on location information |
Also Published As
Publication number | Publication date |
---|---|
AU2003265935A1 (en) | 2003-11-17 |
US7492908B2 (en) | 2009-02-17 |
US7499553B2 (en) | 2009-03-03 |
CA2773294C (en) | 2013-03-12 |
JP4744874B2 (en) | 2011-08-10 |
AU2003265935A8 (en) | 2003-11-17 |
US7567676B2 (en) | 2009-07-28 |
CN1830009A (en) | 2006-09-06 |
US20040005064A1 (en) | 2004-01-08 |
US20040179697A1 (en) | 2004-09-16 |
CA2483609C (en) | 2012-09-18 |
KR20040105252A (en) | 2004-12-14 |
WO2003093775A3 (en) | 2006-03-30 |
CN1830009B (en) | 2010-05-05 |
JP2005539413A (en) | 2005-12-22 |
US20040022392A1 (en) | 2004-02-05 |
CA2773294A1 (en) | 2003-11-13 |
CA2483609A1 (en) | 2003-11-13 |
US20040005065A1 (en) | 2004-01-08 |
JP2011022602A (en) | 2011-02-03 |
KR101047194B1 (en) | 2011-07-06 |
EP1585947A2 (en) | 2005-10-19 |
EP1585947A4 (en) | 2011-04-27 |
EP1585947B1 (en) | 2020-01-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2773294C (en) | Sound detection and localization system | |
JP4952698B2 (en) | Audio processing apparatus, audio processing method and program | |
Ratnam et al. | Blind estimation of reverberation time | |
US5867581A (en) | Hearing aid | |
Dietz et al. | Auditory model based direction estimation of concurrent speakers from binaural signals | |
US20100185308A1 (en) | Sound Signal Processing Device And Playback Device | |
JPS61500329A (en) | Signal processing device for determining the angular position of the acoustic source | |
US5572593A (en) | Method and apparatus for detecting and extending temporal gaps in speech signal and appliances using the same | |
CN113287169A (en) | Apparatus, method and computer program for blind source separation and remixing | |
KR101073632B1 (en) | A zero-crossing-based multiple source localization apparatus in reverberant environments | |
JP3097376B2 (en) | Howling suppression device | |
JPH0736487A (en) | Speech signal processor | |
JPH0424692A (en) | Voice section detection system | |
Kolotzek et al. | Fast processing explains the effect of sound reflection on binaural unmasking | |
Morgan et al. | Automated evaluation of acoustic talker direction finder algorithms in the varechoic chamber | |
Vesa | Estimation of reverberation time from binaural signals without using controlled excitation | |
Sirisawat et al. | Source sound determination in horizontal plane using human ears shape microphones | |
Henderson | Estimating azimuth from speech in a natural auditory environment | |
JPH06175676A (en) | Voice detector | |
JPH0644724B2 (en) | Howling detector | |
JPH04340598A (en) | Voice recognition device | |
JPH01255897A (en) | Voice detection | |
KR20150072959A (en) | Method and apparatus for processing sound signal | |
JPH0632538B2 (en) | Howling detector | |
JPH04152396A (en) | Voice segmenting device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2483609 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2003747642 Country of ref document: EP Ref document number: 2004501891 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020047017705 Country of ref document: KR |
|
WWP | Wipo information: published in national office |
Ref document number: 1020047017705 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 20038145073 Country of ref document: CN |
|
WWP | Wipo information: published in national office |
Ref document number: 2003747642 Country of ref document: EP |