US3057960A - Normalized sound control system - Google Patents

Normalized sound control system Download PDF

Info

Publication number
US3057960A
US3057960A US95087A US9508761A US3057960A US 3057960 A US3057960 A US 3057960A US 95087 A US95087 A US 95087A US 9508761 A US9508761 A US 9508761A US 3057960 A US3057960 A US 3057960A
Authority
US
United States
Prior art keywords
signal
signals
cross
correlation
normalizing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US95087A
Inventor
James F Kaiser
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
Bell Telephone Laboratories Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bell Telephone Laboratories Inc filed Critical Bell Telephone Laboratories Inc
Priority to US95087A priority Critical patent/US3057960A/en
Application granted granted Critical
Publication of US3057960A publication Critical patent/US3057960A/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R27/00Public address systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2201/00Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
    • H04R2201/40Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
    • H04R2201/403Linear arrays of transducers

Definitions

  • the cocktail party effect is a psychoaconstic mechanism by which a human listener is able to concentrate on speech from a specific talker, despite a surrounding noise or speech babble whose sound intensity is approximately equal to that of the speech of the preferred talker.
  • two spatially separated microphones are placed in a sound field containing both speech from a preferred talker and background noise and speech of equal or higher intensity from other talkers.
  • the output signals of the two microphones are combined t form a composite signal, and the amplitude of the composite signal is modulated or varied by a control signal.
  • the properties of the control signal are selected to reinforce speech sounds from the preferred talker and to suppress noise and speech sounds from other talkers. This selective reinforcement and suppression in the David system improves the intelligibility of speech from the preferred talker and decreases the intelligibility of speech from other talkers, thereby simulating the ability of the human listener to concentrate on speech from a preferred talker in spite of background noise of equal or higher intensity.
  • the control signal is derived by cross-correlating the two microphone signals; that is, the two microphone signals are multiplied together and the product of the two signals is averaged over an appropriate time interval.
  • the two microphone signals are multiplied together and the product of the two signals is averaged over an appropriate time interval.
  • the magnitude of the cross-correlation signal thus provides an accurate measure of the presence or absence of speech from the preferred talker, and modulation of the composite signal by the cross-correlation signal serves to reinforce the preferred talkers speech sounds, which are coherent with the cross-correlation signal, and to supp-ress the background noise and speech sounds, which are incoherent with the cross-correctional signal. Varying the amplitude of the composite signal in this fashion increases the intelligibility of speech from the preferred talker and decreases the intelligibility of noise and speech from background talkers.
  • the David system achieves a significant improvement in the intelligibility of speech from a particular talker
  • the improvement in intelligibility is accompanied by an audible distortion in intensity. This distortion is particularly noticeable during changes in the intensity of speech sounds from the preferred talker.
  • the magnitude of the cross-correlation signal is a function of more than one independent variable.
  • the most important independent variable is the degree of time coincidence between speech sounds arriving at the two microphones, since it is by adjusting this variable that speech sounds from the preferred talker are brought into time coincidence, thereby creating large magnitudes in the cross-correlation signal which ⁇ serve to reinforce speech from the preferred taiker.
  • a second independent variable is the average intensity of the sound energy arriving at the two microphones, and this variable also increases the magnitude of the cross-correlation signal. It has been determined by applicant that this additional source of increased magnitude produces an excessive reinforcement of speech from the preferred talker which is audible as the distortion in intensity referred to above.
  • the present invention eliminates ydisto-rtion from the David system by normalizing the cross-correlation signal to render it insensitive to the average intensity of the sound energy reaching the two microphones, without disturbing the dependence of the cross-correlation signal upon the degree of time coincidence l'between speech sounds ⁇ arriving at the two microphones.
  • normalizing comprises dividing the cross-correlation signal by a normalizing signal that is also a function of the average intensity of the sound energy arriving at the microphones, thereby effectively canceling the dependence of the cross-correlation signal upon average sound intensity. Modulation of the composite signal by the normalized cross-correlation signal of the present invention enhances the intelligibility of speech sounds from a pre ferred talker without distorting the intensity of the preferred speech sounds.
  • This invention provides two alternative systems for deriving a normalizing signal.
  • the normalizing signal is derived by obtaining the product of the average magnitudes of the two microphone signals.
  • the second system the largest value of each microphone signal over several pitch periods is obtained, and the normalizing signal is derived either by multiplying together the largest values of the two microphone signals or by squaring the larger of the two largest values.
  • FIG. 1A is a schematic diagram of the cross-correlation sound control system described in the aforementioned David application;
  • FIG. 1B is a block diagram of a sound control system embodying the normalizing apparatus of this invention.
  • FIG. 2 is a schematic diagram of a preferred embodiment of the normalizing apparatus of this invention.
  • FIG. 3 is a schematic diagram of normalizing apparatus alternative to that of FIG. 2.
  • FIG. 1A there is shown a cross-correlation version of the sound control system described in the aforementioned David application.
  • Spatially separated microphones 1, 2 are placed in a sound field containing both speech sounds from a preferred source and background noise from other sources, and each microphone generates an electrical signal in response to the sound energy it receives.
  • the microphone signals are combined in adder 3 to form a composite signal whose amplitude is modulated or adjusted in a variable gain amplifier 5.
  • Amplifier 5 is controlled by a cross-correlation signal derived from the microphone signals by signal processing circuits 10, 11 and cross-correlation circuit 12.
  • Delay element 4 synchronizes the composite signal with the control signal, and reproducer 6 converts the modulated composite signal into audible sound waves in which the intelligibility of speech sounds from the preferred source is substantially improved.
  • Signal processing circuits 10, 11 are of similar construction, comprising, respectively, delay elements 100, 110, equalizers 101, A111, rectitiers 102, 112, and low-pass filters 103, 113.
  • the delay elements bring into time coincidence the portions of each microphone signal attributable to speech sounds from the preferred talker; the equalizers pre-emphasize selected frequencies; and the rectiers, which are preferably of the half-wave variety, together with the low-pass lters serve to make the polarity of the microphone signals unidirectional and to pass only the prominent features of the unidirectional signals.
  • the processed microphone signals from circuits 10, 11 are applied to cross-correlation circuit 12, where the two signals are multiplied together in multiplier 120 and the product of the two signals is averaged by lowpass filter I121 to form a cross-correlation control signal.
  • circuit 12 The cross-correlation operation performed by circuit 12 may be expressed as follows:
  • f1(x), f2(x) denote the microphone signals
  • the independent variable t represents the amount of time by which one of the microphone signals is delayed with respect to the other microphone signal in order to create time coincidence
  • the superposed bar denotes the time averaging performed by low-pass filter 121 upon the product of the delayed microphone signals
  • p(l) is the cross-correlation signal derived by circuit 12.
  • Equation l the magnitude of the cross correlation of the two functions, as expressed in Equation l, depends primarily upon the independent variable t,
  • the cross-correlation function is a measure of the I time coherence between related portions of the two functions; for example, see Y. W. Lee, T. P. Cheatham, Ir., I. B. Wiesner, Application of Correlation Analysis to the Detection of Periodic Signals in Noise, volume 38, Proceedings of the I.R.E., page 1165 (1950).
  • the magnitude of the cross-correlation function increases as the time coherence of the two functions increases, starting from a mininmum value when the related portions of the two functions are most incoherent, and reaching a maximum value when the related portions are exactly time coincident.
  • the independent variable t is adjusted via delay elements 100, 110 to bring into time coincidence the portions of both microphone signals which are attributable to speech sounds from the preferred source.
  • This adjustment of the variable t causes the magnitude of the cross-correlation signal to be relatively large when speech sounds from the preferred source are present, and to be relatively small, with rare exceptions caused by chance coincidences, at all other times.
  • Varying the amplitude of the composite signal from adder 3 in amplifier 5 in response to the cross-correlation signal from circuit 12 improves the intelligibility of speech from the prel ferred talker by selectively reinforcing those portions of the composite signal due to preferred speech sounds and suppressing those portions due to sounds from other sources.
  • Equation l Analysis of Equation l reveals, however, that the magnitude of the cross correlation of the two microphone signals is also dependent upon the average intensity of the sounds arriving at the microphones, as well as upon the degree of time coherence between the sounds. This is demonstrated, for example, by increasing the average sound intensity at each of the microphones 1, 2 by constant factors K1, K2, respectively; then from Equation 1, the magnitude of the cross-correlation function is increased by a factor lil-K2, that is,
  • Equation 2 the dependence of the crosscorrelation signal upon the product of the sound intensities, an expression in Equation 2, produces in amplitier 5 a reinforcement of the composite signal in excess of the reinforcement required for improvement of intelligibility, the excess reinforcement causing the audible distortion of intensity previously discussed.
  • the present invention eliminates this distortion at its source: the cross-correlation signal is normalized before applying it to the control terminal of amplifier 5 in order to remove its dependence upon the product of the average sound intensities at the two microphones.
  • FIG. 1B the normalizing apparatus of this invention is shown incorporated in the sound control system of FiG. 1A.
  • the processed microphone signals from circuits 1t), 11 are applied to normalizing circuit 13 as well as to cross-correlation circuit 12.
  • Normalizing circuit 13 which is described in detail below, derives from the microphone signals a nomalizing signal, N, which is a nonlinear function of the average magnitudes of the microphone signals.
  • divider 14 By applying the normalizing signal to the divisor terminal of divider 14, which may be of any suitable construction, and by applying the cross-correlation signal to the dividend terminal of divider 14, there is obtained at the output terminal of divider 14 a normalized cross-correlation signal that is primarily dependent upon the degree of time coherence between related portions of the microphone signals and that is relatively insensitive to the product of the average sound intensities at the microphones.
  • FIG. 2 illustrates a preferred embodiment of the principles of this invention, in which the microphone signals from signal processing circuits 10, 11 are applied to ampliers 20, 22, respectively, which are constructed in a conventional manner to provide low impedance driving sources for peak detectors 21, 23.
  • Peak detectors 21, 23 comprise, respectively, diodes 210, 230, and RC circuits consisting of capacitors 211, 231, and resistors 212, 232.
  • the peak detectors have a charging time constant on the order of microseconds and a discharge or holding time constant of about 100 milliseconds, in order to derive from each microphone signal a unidirectional or single polarity signal whose peaks coincide with the commencement of the periods of voiced sounds, and whose average amplitude is approximately equal to the average magnitude of each of the microphone signals, denoted respectively, as shown in FIG. 2. Since the average magnitude of each of the microphone signals is proportional Ito the average sound intensity at each microphone, by multiplying together the output signals of peak detectors 21, 23 in multiplier 24, there is developed at the output terminal of multiplier 24 a normalizing signal, N, approximately proportional to the product of the average sound intensity at each microphone,
  • Threshold circuit 25 prevents the normalizing signal from falling below a predetermined nonzero value, and comprises a voltage divider network consisting of resistors 250, 251, with a diode 252 biased by an adjustable energy source 253 connected in parallel with resistor 251. -Diode 252 and source 253 are shown in FIG. 2 connected for a negative polarity normalizing signal; however, -for a positive polarity normalizing signal, the polarities of diode l252 and source 253 are reversed.
  • FIG. 3 illustrates alternative apparatus for deriving a normalizing signal, which is somewhat faster-acting than the apparatus shown in FIG. 2.
  • the processed microphone signals from circuits 10, y11 of FIG. lB are applied to amplifiers 30, 31 and then passed to tapped delay lines 32, 33, which are terminated in matching impedances 321, 33-1 to prevent reflection.
  • the taps of each delay line 32, 33 are connected to a common output point P0, P1, through diodes 32a through 32u and 33a through 3311, respectively, and the signal formed at each of the common output points is the largest of all the signals appearing simultaenously at the taps of each delay line.
  • a normalizing signal -either by multiplying together the two signals from .the common output points or by squaring the larger of the two signals.
  • 'Ihe two largest signals formed at each common output point are passed through amplifiers 34, 35, and depending upon the positions in which switches 36, 37 are set, the two largest signals -are applied either to the input terminals of diodes 38, 39 or to the input terminals of multiplier 40.
  • Diodes 38, 39 are connected to a common output point Pz and the signal developed at this point is the larger of the two largest signals from delay lines 32, 33.
  • the combination that comprises a source of a pair of information bearing signals derived from the output signals of a pair of spatially separated sound detecting devices, means for cross-correlating said pair of information bearing signals, means for normalizing the output signal of said crosscorrelating means to eliminate its dependence upon the average amplitudes of said information bearing signals, and means under the control of said normalized crosscorrelation signal for adjusting the amplitude of the sum of the output signals of said sound detecting devices.
  • the combination that comprises a pair of spatially separated microphones, means for additively combining the output signals of said microphones, means for individually processing the output signals of said microphones to emphasize selected features of said signals, means for averaging the product of said processed microphone signals to develop a crosscorrelation signal, means for deriving a normalizing signal from said processed microphone signals, means for dividing said cross-correlation signal by said normalizing signal to obtain a control signal, and means under the influence of said control signal for varying the amplitude of said additively combined microphone signals.
  • said means for deriving a normalizing signal comprises two parallel subpaths each of which is provided with an input terminal, an output terminal, yand an amplifier and a peak detector connected in series between said input and output terminals, means for applying each of said processed microphone signals to the input terminal of one of said subpaths, a multiplier provided with two input terminals and an output terminal, means for connecting the output terminal of each of said subpaths to one of the input terminals of said multiplier, and threshold means connected between the output terminal of said multiplier and said dividing means.
  • said threshold means comprises a voltage divider including a first resistor and a second resistor, a diode connected in series with an adjustable energy source, and means for connecting said series connected diode and energy source in parallel with said second resistor.
  • apparatus for reducing distortion which comprises a source of a pair of information bearing signals representative of the sound energy at two spatially separated points in a sound field, a source of a cross-correlation signal derived from said pair of information bearing signals whose magnitude is a function of the time coherence between said signals and of the average amplitudes of said signals, means for deriving from each of said information 'bearing signals a signal proportional to the largest value over several pitch periods of each of said information bearing signals, means for combining said largest value signals to form a normalizing signal, and means for dividing said cross-correlation signal by said normalizing signal.
  • said means for deriving from each of said information bearing signals a signal proportional to the largest value over several pitch periods of each of said information bearing signals comprises two parallel signal paths, one for each of said information bearing signals, wherein each of said signals paths includes a first amplifying means provided with an input terminal and an output terminal, a delay line provided with an input terminal and a plurality of taps, means for connecting the output terminal of said first amplifying means to the input terminal of said delay line, a plurality of diodes in one-to-one correspondence with the taps of said delay line, wherein each of said diodes is provided with an input terminal and an output terminal, means for connecting each tap of said delay line to the input terminal of one of said diodes, means for connecting the output terminals of said diodes to a common output point, a second amplifying means provided with an input terminal and an output terminal, means for connecting said common output point to the input terminal of said second amplifying means, and means for connecting the output terminal of said signals
  • Apparatus as defined in claim 6 wherein said means for combining said average magnitude signals to form a normalizing signal comprises means for multiplying together said average magnitude signals.
  • said means for combining said average magnitude signals to form a normalizing signal comprises two diodes, one for each of said average magnitude signals, wherein each of said diodes is provided with an input terminal and an output terminal, means for applying each average magnitude signal to the input terminal of one of said diodes, means for connecting the output terminals of said diodes to a common output point, squaring means provided with an input terminal and an output terminal, means for connecting said common output point to the input terminal of said squaring means, and means for connecting the output terminal of said squaring means to said dividing means.

Description

Oct. 9, 1962 J. F. KAISER NORMALIZED SOUND CONTROL SYSTEM 3 Sheets-Sheet l Oct. 9, 1962 J. F. KAISER NORMALIZEO SOUND CONTROL SYSTEM 5 Sheets-Sheet 2 Filed March 15, 1961 QE @Ema ln Oct. 9, 1962 J. F. KAISER NORMALIZED SOUND CONTROL SYSTEM 5 Sheets-Sheet 5 Filed March 13, 1961 INVENTOR J. KA /SER BV ATTORNEY m. 6fm
United States Patent Office 3,057,960 Patented Oct. 9, 1962 3,057,960 NORMAMZED SUND CONTROL SYSTEM James F. Kaiser, Summit, NJ., assigner to Bell Telephone Laboratories, incorporated, New York, NX., a corporation of New York Filed Mar. 13, 1961, Ser. No. 95,087 9 Claims. (Cl. 179-1) This invention relates to sound control systems for improving the intelligibility of speech from a talker located in a noisy environment.
The sound control system described by E. E. David, Jr. in a copending application filed December 15, 1959, Serial No. 859,684, improves the inteiligibility of speech from a talker located in a noisy environment by simulating in part of the so-called cocktail party effect. The cocktail party effect is a psychoaconstic mechanism by which a human listener is able to concentrate on speech from a specific talker, despite a surrounding noise or speech babble whose sound intensity is approximately equal to that of the speech of the preferred talker. In the David system, two spatially separated microphones are placed in a sound field containing both speech from a preferred talker and background noise and speech of equal or higher intensity from other talkers. The output signals of the two microphones are combined t form a composite signal, and the amplitude of the composite signal is modulated or varied by a control signal. The properties of the control signal are selected to reinforce speech sounds from the preferred talker and to suppress noise and speech sounds from other talkers. This selective reinforcement and suppression in the David system improves the intelligibility of speech from the preferred talker and decreases the intelligibility of speech from other talkers, thereby simulating the ability of the human listener to concentrate on speech from a preferred talker in spite of background noise of equal or higher intensity.
In one embodiment of the David invention, the control signal is derived by cross-correlating the two microphone signals; that is, the two microphone signals are multiplied together and the product of the two signals is averaged over an appropriate time interval. By suitably delaying one or the other of the two microphone signals before cross-correlating them, those portions of the two microphone signals which correspond to speech sounds from the preferred talker are brought into time coincidence, thereby causing the magnitude of the crosscorrelation control signal to be relatively large when speech sounds from the preferred talker are present, and to be relatively small when speech sounds from the preferred talker are not present. The magnitude of the cross-correlation signal thus provides an accurate measure of the presence or absence of speech from the preferred talker, and modulation of the composite signal by the cross-correlation signal serves to reinforce the preferred talkers speech sounds, which are coherent with the cross-correlation signal, and to supp-ress the background noise and speech sounds, which are incoherent with the cross-correctional signal. Varying the amplitude of the composite signal in this fashion increases the intelligibility of speech from the preferred talker and decreases the intelligibility of noise and speech from background talkers.
Although the David system achieves a significant improvement in the intelligibility of speech from a particular talker, in the embodiment that utilizes the cross correlation of he microphone signals as a control signal the improvement in intelligibility is accompanied by an audible distortion in intensity. This distortion is particularly noticeable during changes in the intensity of speech sounds from the preferred talker.
Investigation into the source of distortion in the crosscorrelation version of the David system revealed that the magnitude of the cross-correlation signal is a function of more than one independent variable. The most important independent variable is the degree of time coincidence between speech sounds arriving at the two microphones, since it is by adjusting this variable that speech sounds from the preferred talker are brought into time coincidence, thereby creating large magnitudes in the cross-correlation signal which `serve to reinforce speech from the preferred taiker. A second independent variable is the average intensity of the sound energy arriving at the two microphones, and this variable also increases the magnitude of the cross-correlation signal. It has been determined by applicant that this additional source of increased magnitude produces an excessive reinforcement of speech from the preferred talker which is audible as the distortion in intensity referred to above.
It is the principal object of the present invention to improve the intelligibility of speech sounds from a preferred talker located in a noisy environment without distorting the intensity of the preferred speech sounds.
it is a specific object of this invention to eliminate distortion from the cross-correlation version of the David system by removing the dependence of the magnitude of the cross-correlation control signal upon the average intensity of the sound energy reaching the two microphones.
The present invention eliminates ydisto-rtion from the David system by normalizing the cross-correlation signal to render it insensitive to the average intensity of the sound energy reaching the two microphones, without disturbing the dependence of the cross-correlation signal upon the degree of time coincidence l'between speech sounds `arriving at the two microphones. In this invention, normalizing comprises dividing the cross-correlation signal by a normalizing signal that is also a function of the average intensity of the sound energy arriving at the microphones, thereby effectively canceling the dependence of the cross-correlation signal upon average sound intensity. Modulation of the composite signal by the normalized cross-correlation signal of the present invention enhances the intelligibility of speech sounds from a pre ferred talker without distorting the intensity of the preferred speech sounds.
This invention provides two alternative systems for deriving a normalizing signal. In the iirst system, the normalizing signal is derived by obtaining the product of the average magnitudes of the two microphone signals. In the second system, the largest value of each microphone signal over several pitch periods is obtained, and the normalizing signal is derived either by multiplying together the largest values of the two microphone signals or by squaring the larger of the two largest values.
The invention will be fully understood from the following detailed description of illustrative embodiments thereof taken in connection with the appended drawings, in which:
FIG. 1A is a schematic diagram of the cross-correlation sound control system described in the aforementioned David application;
FIG. 1B is a block diagram of a sound control system embodying the normalizing apparatus of this invention;
FIG. 2 is a schematic diagram of a preferred embodiment of the normalizing apparatus of this invention; and
FIG. 3 is a schematic diagram of normalizing apparatus alternative to that of FIG. 2.
Theoretical Considerations Referring'to FIG. 1A, there is shown a cross-correlation version of the sound control system described in the aforementioned David application. Spatially separated microphones 1, 2 are placed in a sound field containing both speech sounds from a preferred source and background noise from other sources, and each microphone generates an electrical signal in response to the sound energy it receives. The microphone signals are combined in adder 3 to form a composite signal whose amplitude is modulated or adjusted in a variable gain amplifier 5. Amplifier 5 is controlled by a cross-correlation signal derived from the microphone signals by signal processing circuits 10, 11 and cross-correlation circuit 12. Delay element 4 synchronizes the composite signal with the control signal, and reproducer 6 converts the modulated composite signal into audible sound waves in which the intelligibility of speech sounds from the preferred source is substantially improved.
Signal processing circuits 10, 11 are of similar construction, comprising, respectively, delay elements 100, 110, equalizers 101, A111, rectitiers 102, 112, and low-pass filters 103, 113. The delay elements bring into time coincidence the portions of each microphone signal attributable to speech sounds from the preferred talker; the equalizers pre-emphasize selected frequencies; and the rectiers, which are preferably of the half-wave variety, together with the low-pass lters serve to make the polarity of the microphone signals unidirectional and to pass only the prominent features of the unidirectional signals. The processed microphone signals from circuits 10, 11 are applied to cross-correlation circuit 12, where the two signals are multiplied together in multiplier 120 and the product of the two signals is averaged by lowpass filter I121 to form a cross-correlation control signal.
The cross-correlation operation performed by circuit 12 may be expressed as follows:
where f1(x), f2(x) denote the microphone signals, the independent variable t represents the amount of time by which one of the microphone signals is delayed with respect to the other microphone signal in order to create time coincidence, the superposed bar denotes the time averaging performed by low-pass filter 121 upon the product of the delayed microphone signals, and p(l) is the cross-correlation signal derived by circuit 12.
It is well known that for two functions that are in some manner related to each other, the magnitude of the cross correlation of the two functions, as expressed in Equation l, depends primarily upon the independent variable t,
that is, the cross-correlation function is a measure of the I time coherence between related portions of the two functions; for example, see Y. W. Lee, T. P. Cheatham, Ir., I. B. Wiesner, Application of Correlation Analysis to the Detection of Periodic Signals in Noise, volume 38, Proceedings of the I.R.E., page 1165 (1950). Thus the magnitude of the cross-correlation function increases as the time coherence of the two functions increases, starting from a mininmum value when the related portions of the two functions are most incoherent, and reaching a maximum value when the related portions are exactly time coincident.
`ln the case of the cross-correlation control signal derived from the two microphone signals by the apparatus of FIG. 1A, the independent variable t is adjusted via delay elements 100, 110 to bring into time coincidence the portions of both microphone signals which are attributable to speech sounds from the preferred source. This adjustment of the variable t causes the magnitude of the cross-correlation signal to be relatively large when speech sounds from the preferred source are present, and to be relatively small, with rare exceptions caused by chance coincidences, at all other times. Varying the amplitude of the composite signal from adder 3 in amplifier 5 in response to the cross-correlation signal from circuit 12 improves the intelligibility of speech from the prel ferred talker by selectively reinforcing those portions of the composite signal due to preferred speech sounds and suppressing those portions due to sounds from other sources.
Analysis of Equation l reveals, however, that the magnitude of the cross correlation of the two microphone signals is also dependent upon the average intensity of the sounds arriving at the microphones, as well as upon the degree of time coherence between the sounds. This is demonstrated, for example, by increasing the average sound intensity at each of the microphones 1, 2 by constant factors K1, K2, respectively; then from Equation 1, the magnitude of the cross-correlation function is increased by a factor lil-K2, that is,
ata :Iolite-Ionen) :K1-1cm 2) It has been determined that the dependence of the crosscorrelation signal upon the product of the sound intensities, an expression in Equation 2, produces in amplitier 5 a reinforcement of the composite signal in excess of the reinforcement required for improvement of intelligibility, the excess reinforcement causing the audible distortion of intensity previously discussed. The present invention eliminates this distortion at its source: the cross-correlation signal is normalized before applying it to the control terminal of amplifier 5 in order to remove its dependence upon the product of the average sound intensities at the two microphones. The dependence of the normalized cross-correlation signal upon the degree of time coherence between the microphone signals is preserved, however, and modulation of the composite signal in response to the normalized cross-correlation signal improves the intelligibility of speech sounds from a preferred source Without distorting the intensity of the preferred speech sounds.
Com plete System Referring now to FIG. 1B, the normalizing apparatus of this invention is shown incorporated in the sound control system of FiG. 1A. The processed microphone signals from circuits 1t), 11 are applied to normalizing circuit 13 as well as to cross-correlation circuit 12. Normalizing circuit 13, which is described in detail below, derives from the microphone signals a nomalizing signal, N, which is a nonlinear function of the average magnitudes of the microphone signals. By applying the normalizing signal to the divisor terminal of divider 14, which may be of any suitable construction, and by applying the cross-correlation signal to the dividend terminal of divider 14, there is obtained at the output terminal of divider 14 a normalized cross-correlation signal that is primarily dependent upon the degree of time coherence between related portions of the microphone signals and that is relatively insensitive to the product of the average sound intensities at the microphones.
Preferred Normalzing Circuit FIG. 2 illustrates a preferred embodiment of the principles of this invention, in which the microphone signals from signal processing circuits 10, 11 are applied to ampliers 20, 22, respectively, which are constructed in a conventional manner to provide low impedance driving sources for peak detectors 21, 23. Peak detectors 21, 23 comprise, respectively, diodes 210, 230, and RC circuits consisting of capacitors 211, 231, and resistors 212, 232. The peak detectors have a charging time constant on the order of microseconds and a discharge or holding time constant of about 100 milliseconds, in order to derive from each microphone signal a unidirectional or single polarity signal whose peaks coincide with the commencement of the periods of voiced sounds, and whose average amplitude is approximately equal to the average magnitude of each of the microphone signals, denoted respectively, as shown in FIG. 2. Since the average magnitude of each of the microphone signals is proportional Ito the average sound intensity at each microphone, by multiplying together the output signals of peak detectors 21, 23 in multiplier 24, there is developed at the output terminal of multiplier 24 a normalizing signal, N, approximately proportional to the product of the average sound intensity at each microphone,
N=|f1|'|f2| (3) By dividing the cross-correlation signal p, by the normalizing signal N, as shown in divider 14 of FIG. 1B, there is obtained a normalized cross-correlation signal I; where It is observed in Equation 4 that an increase in sound intensity at each microphone by factors K1, K2, does not affect the magnitude of the normalized cross-correlation signal, since the factor K1K2 appears in both the numerator and the denominator of AEquation 4.
In order to prevent the denominator of Equation 4 from passing to Zero at some time and thereby producing an erroneous normalized cross-correlation signal at the output terminal of divider 14, the normalizing signal is passed through threshold circuit 25. Threshold circuit 25 prevents the normalizing signal from falling below a predetermined nonzero value, and comprises a voltage divider network consisting of resistors 250, 251, with a diode 252 biased by an adjustable energy source 253 connected in parallel with resistor 251. -Diode 252 and source 253 are shown in FIG. 2 connected for a negative polarity normalizing signal; however, -for a positive polarity normalizing signal, the polarities of diode l252 and source 253 are reversed.
Alternative Normalizng Circuit FIG. 3 illustrates alternative apparatus for deriving a normalizing signal, which is somewhat faster-acting than the apparatus shown in FIG. 2. The processed microphone signals from circuits 10, y11 of FIG. lB are applied to amplifiers 30, 31 and then passed to tapped delay lines 32, 33, which are terminated in matching impedances 321, 33-1 to prevent reflection. The taps of each delay line 32, 33 are connected to a common output point P0, P1, through diodes 32a through 32u and 33a through 3311, respectively, and the signal formed at each of the common output points is the largest of all the signals appearing simultaenously at the taps of each delay line. By making the total delay of the delay lines equal to several pitch periods, that is, about l milliseconds delay with n=l equally spaced taps, the largest value of each microphone signal over several pitch periods is obtained at each of the common output points.
From the largest signal obtained at each of the common output points there is derived a normalizing signal, -either by multiplying together the two signals from .the common output points or by squaring the larger of the two signals. 'Ihe two largest signals formed at each common output point are passed through amplifiers 34, 35, and depending upon the positions in which switches 36, 37 are set, the two largest signals -are applied either to the input terminals of diodes 38, 39 or to the input terminals of multiplier 40. Diodes 38, 39 are connected to a common output point Pz and the signal developed at this point is the larger of the two largest signals from delay lines 32, 33. The magnitude of the larger of the two signals passed by diodes 38, 39 is squared in squaring circuit -41 to form -a normalizing signal. When switches 36, 37 connect the two common output points P0, P1 of delay lines 32, 33 to the input terminals of multiplier 40, however, the product signal developed at the output terminal of multiplier 40 from the largest signal `at each common output point is the normalizing signal.
It is to be understood that the above-described arrangements are merely illustrative of applications of the principles of the invention. Numerous other arrangements may be devised by those skilled in the art without departing from the spirit and scope of the invention.
What is claimed is:
l. In a system for the control of sound, the combination that comprises a source of a pair of information bearing signals derived from the output signals of a pair of spatially separated sound detecting devices, means for cross-correlating said pair of information bearing signals, means for normalizing the output signal of said crosscorrelating means to eliminate its dependence upon the average amplitudes of said information bearing signals, and means under the control of said normalized crosscorrelation signal for adjusting the amplitude of the sum of the output signals of said sound detecting devices.
2. In a sound control system, the combination that comprises a pair of spatially separated microphones, means for additively combining the output signals of said microphones, means for individually processing the output signals of said microphones to emphasize selected features of said signals, means for averaging the product of said processed microphone signals to develop a crosscorrelation signal, means for deriving a normalizing signal from said processed microphone signals, means for dividing said cross-correlation signal by said normalizing signal to obtain a control signal, and means under the influence of said control signal for varying the amplitude of said additively combined microphone signals.
3. Apparatus as deiined in claim 2 wherein said means for deriving a normalizing signal comprises two parallel subpaths each of which is provided with an input terminal, an output terminal, yand an amplifier and a peak detector connected in series between said input and output terminals, means for applying each of said processed microphone signals to the input terminal of one of said subpaths, a multiplier provided with two input terminals and an output terminal, means for connecting the output terminal of each of said subpaths to one of the input terminals of said multiplier, and threshold means connected between the output terminal of said multiplier and said dividing means.
4. Apparatus as deiined in claim 3 wherein said peak detector comprises a diode and an RC circuit.
5. Apparatus las defined in claim 3 wherein said threshold means comprises a voltage divider including a first resistor and a second resistor, a diode connected in series with an adjustable energy source, and means for connecting said series connected diode and energy source in parallel with said second resistor.
6. In combination with a cross-correlation sound control system, apparatus for reducing distortion which comprises a source of a pair of information bearing signals representative of the sound energy at two spatially separated points in a sound field, a source of a cross-correlation signal derived from said pair of information bearing signals whose magnitude is a function of the time coherence between said signals and of the average amplitudes of said signals, means for deriving from each of said information 'bearing signals a signal proportional to the largest value over several pitch periods of each of said information bearing signals, means for combining said largest value signals to form a normalizing signal, and means for dividing said cross-correlation signal by said normalizing signal.
7. Apparatus as defined in claim `6 wherein said means for deriving from each of said information bearing signals a signal proportional to the largest value over several pitch periods of each of said information bearing signals comprises two parallel signal paths, one for each of said information bearing signals, wherein each of said signals paths includes a first amplifying means provided with an input terminal and an output terminal, a delay line provided with an input terminal and a plurality of taps, means for connecting the output terminal of said first amplifying means to the input terminal of said delay line, a plurality of diodes in one-to-one correspondence with the taps of said delay line, wherein each of said diodes is provided with an input terminal and an output terminal, means for connecting each tap of said delay line to the input terminal of one of said diodes, means for connecting the output terminals of said diodes to a common output point, a second amplifying means provided with an input terminal and an output terminal, means for connecting said common output point to the input terminal of said second amplifying means, and means for connecting the output terminal of said second amplifying means to said combining means.
8. Apparatus as defined in claim 6 wherein said means for combining said average magnitude signals to form a normalizing signal comprises means for multiplying together said average magnitude signals.
9. Apparatus as dened in claim 6 wherein said means for combining said average magnitude signals to form a normalizing signal comprises two diodes, one for each of said average magnitude signals, wherein each of said diodes is provided with an input terminal and an output terminal, means for applying each average magnitude signal to the input terminal of one of said diodes, means for connecting the output terminals of said diodes to a common output point, squaring means provided with an input terminal and an output terminal, means for connecting said common output point to the input terminal of said squaring means, and means for connecting the output terminal of said squaring means to said dividing means.
No references cited.
US95087A 1961-03-13 1961-03-13 Normalized sound control system Expired - Lifetime US3057960A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US95087A US3057960A (en) 1961-03-13 1961-03-13 Normalized sound control system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US95087A US3057960A (en) 1961-03-13 1961-03-13 Normalized sound control system

Publications (1)

Publication Number Publication Date
US3057960A true US3057960A (en) 1962-10-09

Family

ID=22249463

Family Applications (1)

Application Number Title Priority Date Filing Date
US95087A Expired - Lifetime US3057960A (en) 1961-03-13 1961-03-13 Normalized sound control system

Country Status (1)

Country Link
US (1) US3057960A (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3440350A (en) * 1966-08-01 1969-04-22 Bell Telephone Labor Inc Reception of signals transmitted in a reverberant environment
US3497622A (en) * 1966-10-21 1970-02-24 Zenith Radio Corp Automatic gain control
US3502989A (en) * 1966-05-03 1970-03-24 Itt Receiver employing correlation techniques
US3794766A (en) * 1973-02-08 1974-02-26 Bell Telephone Labor Inc Delay equalizing circuit for an audio system using multiple microphones
US3947636A (en) * 1974-08-12 1976-03-30 Edgar Albert D Transient noise filter employing crosscorrelation to detect noise and autocorrelation to replace the noisey segment
US4131760A (en) * 1977-12-07 1978-12-26 Bell Telephone Laboratories, Incorporated Multiple microphone dereverberation system
US4181818A (en) * 1976-11-05 1980-01-01 Gentex Corporation Personal amplifier system
US4185167A (en) * 1976-06-28 1980-01-22 Acoustical Design Incorporated Sound masking package
US4236040A (en) * 1978-08-17 1980-11-25 General Motors Corporation Sound intensity meter
NL8303589A (en) * 1982-10-18 1984-05-16 Nippon Telegraph & Telephone MICROPHONE CONFIGURATION AND METHOD FOR DERIVING A DESIRED SIGNAL.
US4956867A (en) * 1989-04-20 1990-09-11 Massachusetts Institute Of Technology Adaptive beamforming for noise reduction
US20020176594A1 (en) * 2001-03-02 2002-11-28 Volker Hohmann Method for the operation of a hearing aid device or hearing device system as well as hearing aid device or hearing device system
WO2003015458A2 (en) * 2001-08-10 2003-02-20 Rasmussen Digital Aps Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment
WO2003015457A2 (en) * 2001-08-10 2003-02-20 Rasmussen Digital Aps Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment
US6549630B1 (en) * 2000-02-04 2003-04-15 Plantronics, Inc. Signal expander with discrimination between close and distant acoustic source
US20070047743A1 (en) * 2005-08-26 2007-03-01 Step Communications Corporation, A Nevada Corporation Method and apparatus for improving noise discrimination using enhanced phase difference value
US20070047742A1 (en) * 2005-08-26 2007-03-01 Step Communications Corporation, A Nevada Corporation Method and system for enhancing regional sensitivity noise discrimination
US20070050441A1 (en) * 2005-08-26 2007-03-01 Step Communications Corporation,A Nevada Corporati Method and apparatus for improving noise discrimination using attenuation factor
US20090234618A1 (en) * 2005-08-26 2009-09-17 Step Labs, Inc. Method & Apparatus For Accommodating Device And/Or Signal Mismatch In A Sensor Array
US20100109951A1 (en) * 2005-08-26 2010-05-06 Dolby Laboratories, Inc. Beam former using phase difference enhancement
US20100158269A1 (en) * 2008-12-22 2010-06-24 Vimicro Corporation Method and apparatus for reducing wind noise
US20110029288A1 (en) * 2005-08-26 2011-02-03 Dolby Laboratories Licensing Corporation Method And Apparatus For Improving Noise Discrimination In Multiple Sensor Pairs
CN104885152A (en) * 2012-12-28 2015-09-02 共荣工程株式会社 Sound-source separation method, device, and program

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
None *

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3502989A (en) * 1966-05-03 1970-03-24 Itt Receiver employing correlation techniques
US3440350A (en) * 1966-08-01 1969-04-22 Bell Telephone Labor Inc Reception of signals transmitted in a reverberant environment
US3497622A (en) * 1966-10-21 1970-02-24 Zenith Radio Corp Automatic gain control
US3794766A (en) * 1973-02-08 1974-02-26 Bell Telephone Labor Inc Delay equalizing circuit for an audio system using multiple microphones
US3947636A (en) * 1974-08-12 1976-03-30 Edgar Albert D Transient noise filter employing crosscorrelation to detect noise and autocorrelation to replace the noisey segment
US4185167A (en) * 1976-06-28 1980-01-22 Acoustical Design Incorporated Sound masking package
US4181818A (en) * 1976-11-05 1980-01-01 Gentex Corporation Personal amplifier system
US4131760A (en) * 1977-12-07 1978-12-26 Bell Telephone Laboratories, Incorporated Multiple microphone dereverberation system
US4236040A (en) * 1978-08-17 1980-11-25 General Motors Corporation Sound intensity meter
NL8303589A (en) * 1982-10-18 1984-05-16 Nippon Telegraph & Telephone MICROPHONE CONFIGURATION AND METHOD FOR DERIVING A DESIRED SIGNAL.
US4956867A (en) * 1989-04-20 1990-09-11 Massachusetts Institute Of Technology Adaptive beamforming for noise reduction
US6549630B1 (en) * 2000-02-04 2003-04-15 Plantronics, Inc. Signal expander with discrimination between close and distant acoustic source
US20020176594A1 (en) * 2001-03-02 2002-11-28 Volker Hohmann Method for the operation of a hearing aid device or hearing device system as well as hearing aid device or hearing device system
US7013015B2 (en) * 2001-03-02 2006-03-14 Siemens Audiologische Technik Gmbh Method for the operation of a hearing aid device or hearing device system as well as hearing aid device or hearing device system
WO2003015458A2 (en) * 2001-08-10 2003-02-20 Rasmussen Digital Aps Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment
WO2003015457A2 (en) * 2001-08-10 2003-02-20 Rasmussen Digital Aps Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment
WO2003015458A3 (en) * 2001-08-10 2003-10-23 Rasmussen Digital Aps Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment
WO2003015457A3 (en) * 2001-08-10 2004-03-11 Rasmussen Digital Aps Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment
US7274794B1 (en) 2001-08-10 2007-09-25 Sonic Innovations, Inc. Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment
AU2002331235B2 (en) * 2001-08-10 2008-08-14 Rasmussen Digital Aps Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment
US20070047743A1 (en) * 2005-08-26 2007-03-01 Step Communications Corporation, A Nevada Corporation Method and apparatus for improving noise discrimination using enhanced phase difference value
US20110029288A1 (en) * 2005-08-26 2011-02-03 Dolby Laboratories Licensing Corporation Method And Apparatus For Improving Noise Discrimination In Multiple Sensor Pairs
EP1917837A2 (en) * 2005-08-26 2008-05-07 Step Communications Corporation Method and apparatus for improving noise discrimination using attenuation factor
US20070047742A1 (en) * 2005-08-26 2007-03-01 Step Communications Corporation, A Nevada Corporation Method and system for enhancing regional sensitivity noise discrimination
US20090234618A1 (en) * 2005-08-26 2009-09-17 Step Labs, Inc. Method & Apparatus For Accommodating Device And/Or Signal Mismatch In A Sensor Array
US20100109951A1 (en) * 2005-08-26 2010-05-06 Dolby Laboratories, Inc. Beam former using phase difference enhancement
USRE47535E1 (en) 2005-08-26 2019-07-23 Dolby Laboratories Licensing Corporation Method and apparatus for accommodating device and/or signal mismatch in a sensor array
US20070050441A1 (en) * 2005-08-26 2007-03-01 Step Communications Corporation,A Nevada Corporati Method and apparatus for improving noise discrimination using attenuation factor
EP1917837A4 (en) * 2005-08-26 2011-03-02 Dolby Lab Licensing Corp Method and apparatus for improving noise discrimination using attenuation factor
US8111192B2 (en) 2005-08-26 2012-02-07 Dolby Laboratories Licensing Corporation Beam former using phase difference enhancement
US8155927B2 (en) 2005-08-26 2012-04-10 Dolby Laboratories Licensing Corporation Method and apparatus for improving noise discrimination in multiple sensor pairs
US8155926B2 (en) 2005-08-26 2012-04-10 Dolby Laboratories Licensing Corporation Method and apparatus for accommodating device and/or signal mismatch in a sensor array
US20100158269A1 (en) * 2008-12-22 2010-06-24 Vimicro Corporation Method and apparatus for reducing wind noise
CN104885152A (en) * 2012-12-28 2015-09-02 共荣工程株式会社 Sound-source separation method, device, and program
EP2940686A4 (en) * 2012-12-28 2016-08-03 Kyoei Engineering Co Ltd Sound-source separation method, device, and program
CN104885152B (en) * 2012-12-28 2019-04-26 共荣工程株式会社 Sound source separating method, device and storage medium

Similar Documents

Publication Publication Date Title
US3057960A (en) Normalized sound control system
GB1458663A (en) Microphone circuits
US3518578A (en) Signal compression and expansion system
KR850004369A (en) Audio signal processing device and method
US3732371A (en) Wide dynamic range noise masking compandor
GB1341468A (en) Multi frequency voice receiver
US3278866A (en) Selective transmission
US6301364B1 (en) Tagging echoes with low frequency noise
US3701028A (en) Reduction of harmonic distortion
US3213199A (en) System for masking information
GB1309700A (en) Speech redognition apparatus
US2799734A (en) Speech brighteners
US2208160A (en) Communication system
FR2305909A1 (en) Microphones and associated equipment - include one unshielded microphone and one masked microphone, and electronics system to minimise noise background
US3746791A (en) Speech synthesizer utilizing white noise
GB1008565A (en) Improvements in or relating to voiced sound detection circuits
US4415768A (en) Tuning apparatus and method
GB1031145A (en) Improvements in or relating to audio-frequency amplifiers
US2449971A (en) Apparatus for reducing noise in microphone circuits
US4266094A (en) Electronic speech processing system
US3548100A (en) Formant frequency extractor
ES445885A1 (en) Installation of electronic listening. (Machine-translation by Google Translate, not legally binding)
US1762768A (en) Transmission system
GB1054796A (en)
FR2290814A1 (en) Acoustic feedback coupling disturbance suppression arrangement - has successive circuit stages operating on desired and undesired components