US3784747A - Speech suppression by predictive filtering - Google Patents

Speech suppression by predictive filtering

Info

Publication number
US3784747A
Authority
US
United States
Prior art keywords
signal
speech
undesired
waveform
parameter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US00204509A
Inventor
D Berkley
O Mitchell
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
Bell Telephone Laboratories Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bell Telephone Laboratories Inc
Application granted
Publication of US3784747A
Anticipated expiration
Expired - Lifetime

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M9/00: Arrangements for interconnection not involving centralised switching
    • H04M9/08: Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
    • H04M9/082: Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using echo cancellers
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04B: TRANSMISSION
    • H04B3/00: Line transmission systems
    • H04B3/02: Details
    • H04B3/20: Reducing echo effects or singing; Opening or closing transmitting path; Conditioning for transmission in one direction or the other
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M9/00: Arrangements for interconnection not involving centralised switching
    • H04M9/08: Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
    • H04M9/087: Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using different frequency bands for transmitting and receiving paths; using phase shifting arrangements
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04R: LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00: Circuits for transducers, loudspeakers or microphones
    • H04R3/02: Circuits for transducers, loudspeakers or microphones for preventing acoustic reaction, i.e. acoustic oscillatory feedback
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00: Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02: Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208: Noise filtering
    • G10L2021/02082: Noise filtering the noise being echo, reverberation of the speech
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00: Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02: Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208: Noise filtering
    • G10L2021/02087: Noise filtering the noise being separate speech, e.g. cocktail party

Definitions

  • ABSTRACT Speech signal energy from an undesired source is suppressed by extracting from the undesired signal a delay parameter and a gain parameter. These parameters control a delay and gain network through which both the desired and undesired signals are routed.
  • the invention is applied to several hands-free telephony situations and to the suppression of one of two speakers in a room.
  • a general object of the invention is to reduce the energy from an undesired speech source in a composite signal containing desired speech.
  • Another object of the invention is to suppress an undesired speech signal in an electronic communications channel.
  • a specific object of the invention is to render a desired speech signal relatively more intelligible despite the presence of undesired speech energy.
  • a particular inventive object is to achieve the foregoing objects in the hands-free telephony situation.
  • Another specific object of the invention is to avoid voice switching functions and thus enable full-time duplex operation of a hands-free telephone channel.
  • Yet another inventive object is to distinguish one talker from another nearby talker and to suppress speech signals from one of them.
  • the invention is grounded in the general recognition that an unwanted speech signal can be rejected on the basis of its speech parameters.
  • the basic concept contemplated by the present invention is to extract, from the undesired signal, a gain parameter and a delay parameter. These parameters control a delay and gain network through which both desired speech signals and the undesired signal are routed.
  • the delay is approximately equal to the current duration of the pitch period of the undesired speech.
  • the gain is calculated, in accordance with one of several possible formulas, so as to bring the delayed unwanted signal to the amplitude level of the present value of the unwanted signal.
  • the gain is set equal to 1.
  • the network output is then subtractively applied to the unwanted signal or to any composite signal containing the unwanted signal. The process may be carried out in analog or digital fashion.
  • the process is carried out by sampling techniques where the signal is sampled at a rate of, for example, 6 kHz that results in 30 to 60 samples per pitch period.
  • the number of samples in a pitch period will vary in accordance with the pitch frequency.
  • speech from the loudspeaker of a hands-free telephone set impinging either directly or reverberatively on the set's microphone can be largely removed from the microphone output.
  • the reverberant signal as well as the direct signal is suppressed because the unwanted speech parameters do not vary rapidly during voicing.
  • speech from, for example, two talkers in the same room is detected by a multiplicity of microphones, and the speech of one talker is suppressed using speech parameters determined by combining the outputs of the microphones.
  • the rearrangement of the Atal process constitutes in one aspect a filter; and more specifically, a comb filter with minima at the pitch frequency (and harmonics thereof) of the undesired speech.
  • This distinguishes the predictive filter of the present invention from a conventional echo canceler which merely replicates a reverberant signal and subtractively applies the replica to the composite signal.
  • FIG. 1 is a communications network schematic block diagram containing a hands-free telephone and an inventive embodiment
  • FIG. 2 is a schematic block diagram of the inventive predictive filter
  • FIG. 3 is a schematic block diagram further delineating the inventive predictor
  • FIGS. 4-6 are graphs depicting various characteristics of the predictor
  • FIGS. 7 and 8 are two further embodiments of the invention in a communications network containing hands-free telephones.
  • FIGS. 9 and 10 are schematic diagrams of the invention as applied to suppression of speech from talkers in a room.
  • a hands-free telephone loudspeaker 1 and microphone 2 present in a reverberative enclosure 3 are shown in FIG. 1 connected to the speech processor of the present invention.
  • the desired speech signal input to microphone 2 is from source 4, the near-end talker, whose signal denoted a travels mainly the direct path 5 and also reverberative paths not shown.
  • Loudspeaker 1 which broadcasts the far-end talker signal, is a source of undesired input to microphone 2 either via the direct path denoted 6 or reverberative paths illustrated by path 7.
  • the far-end talker direct path speech signal is denoted c.
  • the speech processing network 8 in FIG. 1 consists of what will be called a predictive filter 9 connected in the microphone 2 output circuit.
  • filter 9 consists of two parallel legs.
  • the first leg is a predictor 11 which may be a network consisting of a delay network 12 and an amplifier 13.
  • the second leg is a direct shunt path. Both legs are connected to a subtractor 10.
  • the predictor 11 is controlled, in a manner to be described, by a parameter extractor 14 connected in the loudspeaker 1 circuit.
  • a low pass filter 15, advantageously 3 kHz, and a 6 kHz sampler 16 are serially connected in the output circuit of microphone 2.
  • a low pass filter 17 and a sampler 18 are in shunt relation to the loudspeaker 1 input circuit and serially connected to parameter extractor 14.
  • a waveform representing the far-end talker signal c is illustrated in FIG. 4. Because a speech signal is redundant (i.e., the signal changes little in shape and length of pitch period from one pitch period to the next), the present form or value of signal c can be estimated by a linear prediction based on a past value of signal c.
  • the signal c of FIG. 4 is shown made up of speech in consecutive pitch periods. Inherently, the speech signals in adjacent pitch periods of signal c are of unequal amplitude.
  • a gain denoted b can be calculated (in a manner to be described) that, when applied to the sampled signal of one pitch period, will cause the latter to approximate the sampled signal in the next pitch period.
  • if the amplified signal of the earlier period is subtractively combined with the signal value of the next period, the result is the substantial filtering out of the signal c.
  • if a composite signal a + c containing signal c is amplified during one period and subtractively combined with the composite signal a + c during the next period, the same result obtains.
  • $W_n$ (the amplitude of sample n reaching subtractor 10 via the direct path in predictive filter 9) is subtractively combined with $W_{n-k}$ (the amplitude of the delayed sample), where k is the number of samples in a pitch period.
  • the time window over which the parameters are evaluated is of the order of the pitch period to ensure that sufficient energy is present.
  • a time window of 30 samples at a sampling rate of 6 kHz will include between one-half and all the samples in a given pitch period.
  • input speech samples from sampler 18 are stored as frames of signals.
  • the store content is then fed to an arithmetic unit which is part of parameter extractor 14, wherein for 30 samples, computational values of correlation $X_j$ are computed as follows:
  • N can advantageously be in the range 30-60 samples.
  • the computed values of $X_j$ are then inspected in a peak locating network, also part of extractor 14, to determine the largest value of $X_j$. The value of j is found such that $X_j$ is the maximum of all values of X.
  • This particular value of j is the delay parameter, k, which is supplied to predictive filter 9 as one parameter. It is seen that k is a variable delay and that the maximum value of $X_j$ is $X_k$.
  • the delay parameter k for a typical voiced segment is shown in FIG. 6.
  • the gain parameter b is calculated by computing circuitry also in parameter extractor 14, that solves:
  • the gain parameter b likewise is supplied to predictive filter 9.
  • delay parameter k and gain parameter b is but one of several systems by which, from an analysis of the speech energy content in adjacent or substantially adjacent signal segments, parameters may be calculated that when applied to a past signal segment will render the latter closely similar to the shape of the present signal segment.
  • incoming speech to loudspeaker 1 is continuously analyzed to extract therefrom an optimum delay parameter and a gain factor. These parameters are periodically updated, for example every 5 ms. When no incoming signal to loudspeaker 1 is present, the delay and gain are zero. With an incoming signal, the calculated present signal value output of predictor 11 is subtracted from the undelayed, unamplified signal sample representing signals a + c.
  • the filter depicted in FIG. 3 and described above has a transfer function, in Z transform notation, of $H(Z) = 1 - bZ^{-k}$.
  • The magnitude of the frequency response of a typical embodiment of filter 9 using predictor 11 is shown in FIG. 5, where T is the sampling period.
  • the frequency response for gain parameter b = 1 is shown by the solid curves and for a gain parameter b other than 1 by the broken line. Since speech is dynamic during voicing, the parameters b and k have to be optimized as stated above and readjusted periodically, for example every 5 ms.
  • the parameters b and k calculated do not vary smoothly with time.
  • the optimum delay occasionally doubles during voiced segments. Also, during unvoiced segments, the optimum delay varies rapidly over a wide range while the correlation remains relatively low. However, the gains calculated are not negligible during these unvoiced portions. Desired speech a, uncorrelated with the undesired signal c which is to be rejected, is degraded when passed through a filter with these rapidly varying filter parameters, while under such conditions no additional suppression of the unwanted source is accomplished.
  • the predictive filter 9 will be effective in removing part of the reverberant signal as well as the direct sound. Specifically, that part of the reverberant signal that has parameters not greatly different from the filter parameters will be reduced in amplitude.
  • the far-end echo picked up by the microphone 2 from loudspeaker 1 is first reduced in amplitude during voiced segments by a speech processor 8 in the manner described previously.
  • Gain and delay parameters b and k of the far-end speech are measured on the received loudspeaker signal, and the far-end echo component of the microphone signal is reduced by filtering.
  • the remaining far-end signal at the output of the speech processor 8 is then removed by the center-clipping echo suppressor.
  • parameter delay circuit 19A which is serially connected between the output of parameter control 19 and predictive filter 9.
  • Parameter delay circuit 19A advantageously is provided with a delay duration adjustment circuit 19B with which the delay duration may be set to correspond to the transit time which characterizes each given hands-free telephone.
  • A combination of a predictive filter with a center-clipping echo suppressor of the type taught in D. A. Berkley-O. M. M. Mitchell-J. R. Pierce U.S. Pat. No. 3,699,271, which is hereby incorporated by reference, is shown in FIGS. 7 and 8. This combination is a possible replacement for voice switching presently used for echo and feedback suppression.
  • FIG. 7 shows a network denoted 50 for eliminating the echo of the far-end talker in a 4-wire hands-free tel-
  • the received signal is used to set the clipping levels by means of clipping control 22 so as just to remove the echo.
  • the output of D/A converter 10A is fed to filter bank 40 which comprises plural contiguous band filters in the voice frequency range.
  • in center clipper 41, the signal in each subband from filter bank 40 is center clipped at a level determined by clipping control 22, which measures in effect the energy level in the received signal within each of the subbands.
  • the output of clipper 41 is filtered in bank 42 which is similar to bank 40.
  • the clipping control 22 is advantageously controlled also by the parameter extractor 14. Since the echo is reduced by the predictive filter 9 during voicing, the clipping levels can be reduced by substantially the same amount during voicing. Consequently in FIG. 7, a control signal is shown (dashed line) between the parameter extractor 14 and the clipping control 22, which causes an attenuation of the input to clipping level control 22 that is equal to the suppression achieved by speech processing network 8. It will be recognized that optimum performance of clipping level control 22 will be realized by inserting a delay in its input path to compensate for the already mentioned signal transit time between loudspeaker 1 and microphone 2. With the clipping levels thus reduced during voiced segments, there will be less mutilation of the near-end speech by the center-clipping process.
  • FIG. 8 shows a circuit for eliminating both the far-end echo (echo of far-end talker caused by acoustic coupling through room acoustics) and near-end echo (echo of near-end talker caused by imperfect hybrid junction) in a 2-wire hands-free telephone.
  • the far-end echo is eliminated by network 50 as described above for FIG. 7.
  • the near-end echo is eliminated by a similar circuit denoted 51 introduced on the receive side of the local 4-wire network as shown.
  • An alternative method of adjusting the clipping level control by the parameter extractor 14 via the parameter delay 19A is shown in circuit 51.
  • a second predictive filter designated 9a is used in circuit 51 to attenuate the clipping level control signal during voiced segments.
  • the clipping levels follow the signal at the input to the narrow band center clipper, i.e., at the output of the predictive filter 9a.
  • FIG. 9 shows the desired speech source 23 and an undesired source 24 both of whose speech signals form the input to microphones 25 and 26.
  • the undesired source 24 is positioned so that the time delays for direct sound transmission to microphones 25 and 26 are equal.
  • the output of microphone 25 is [text missing in source] predictive filter 9 [text missing in source] as microphone 26, enter the parameter extractor 30, wherein an arithmetic unit within the extractor calculates the computational values [expressions missing in source]. A peak picking network within the extractor then selects the peaks from $X_j$ and $Y_j$, and a comparator finds the largest value peak which occurs in both sequences $X_j$ and $Y_j$ for the same value of j. This value of j is the delay parameter k for the undesired speech supplied to the predictive filter 9.
  • An alternative method of extracting the parameters is shown in FIG. 10.
  • Two additional microphones 27 and 28 are positioned so that time delays from desired speaker 24 for direct sound transmission to microphones 27 and 28 are equal to the time delays to microphones 25 and 26.
  • the outputs of all microphones 25-28 are processed by a non-linear processor 31 as described in O. M. M. Mitchell-C. A. Ross-R. L. Wallace, Jr. U.S. Pat. No. 3,644,671, which is hereby incorporated by reference.
  • the output of processor 31 contains the undesired signal and an attenuated and disturbed component of the desired signal. (The outputs may alternatively be added to merely attenuate the desired signal.)
  • the output of the non-linear processor 31 enters the speech processing network 8.
  • the output of microphone 25 is processed by speech processing network 8 which filters out the undesired talker 24 in the manner already described.
  • the presence in the output of the nonlinear processor 31 of a small amount of the desired talker does not significantly affect the delay parameter k but will cause a small error in the evaluation of X and b.
  • Speech processing apparatus for suppressing voiced segments of an undesired speech signal while leaving a desired speech signal intelligible, comprising:
  • Apparatus in accordance with claim 1 further comprising means for deriving from the waveform of said undesired speech signal a gain parameter specifying the amount by which the amplitudes of corresponding values of said undesired speech signal in a past said interval must be respectively adjusted so as to produce a substantial duplicate of the undesired said speech signal of a present said interval; and which further comprises means controlled by said gain parameter for amplifying said delayed composite speech waveform prior to its being subtractively applied to said summer.
  • a communications network comprising:
  • a hands-free telephone station including a direct acoustic coupling path between the station loudspeaker and microphone, a second remote telephone station, and transmission means interconnecting said stations;
  • a communications network pursuant to claim 4 wherein said deriving means comprises:
  • [defining expression garbled in source and not recoverable] further comprising means for rendering said gain parameter equal to zero in the absence of voiced segments of the signal in said loudspeaker path from said remote station.
  • a communications network pursuant to claim 8, further comprising means for adjustably delaying arrival of said delay and gain parameters at said second path by an amount that compensates for the transit time delay over said direct acoustic coupling path of speech from said remote station.
  • [fragment of a separate claim, interleaved in the source] where [...] is the speech signal received by said first microphone and [...] the speech signal received by said [...]; means for selecting the largest value peak from the composite peak values of said parameters [...] for the same value of the term j;
  • a communications network pursuant to claim 4, further comprising: filter bank means connected to the output of said summer and comprising plural contiguous sub-bands; and
  • [fragment of a separate claim, interleaved in the source] means for applying the desired and the undesired said signals from one of said microphones to a summer directly over a first path and alternately over a second path through a network including delay means;

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Telephone Function (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

Speech signal energy from an undesired source is suppressed by extracting from the undesired signal a delay parameter and a gain parameter. These parameters control a delay and gain network through which both the desired and undesired signals are routed. The delayed, amplified signal, when applied to a subtractor along with the undelayed signal, suppresses the undesired signal. The invention is applied to several hands-free telephony situations and to the suppression of one of two speakers in a room.

Description

United States Patent
Berkley et al.
Jan. 8, 1974

SPEECH SUPPRESSION BY PREDICTIVE FILTERING
[75] Inventors: David Arthur Berkley, New York, N.Y.; Olga Mary Mracek Mitchell, Summit, N.J.
[73] Assignee: Bell Telephone Laboratories, Incorporated, Murray Hill, N.J.
[22] Filed: Dec. 3, 1971
[21] Appl. No.: 204,509
[52] U.S. Cl.: 179/1 HF, 179/1 P, 179/1 FS
[51] Int. Cl.: H04m 1/20, H04b 15/00
[58] Field of Search: 179/1 P, 1 VC, 1 HF, 179/1 F, 1 FS, 1 SA, 15.55 R, 100.2 K, 81 B, 170.8; 324/473-476
[56] References Cited
UNITED STATES PATENTS
3,177,489  4/1965  Saltzberg  325/476
3,631,520  12/1971  Atal  179/1 SA
3,133,990  5/1964  [name illegible in source]  179/1 P
3,644,674  2/1972  Mitchell  179/1 P
3,601,549  8/1971  Mitchell  179/1 FS
3,603,744  [date missing in source]  Krasin  179/81 B
Primary Examiner: William C. Cooper
Assistant Examiner: Jon Bradford Leaheey
Attorney: [name illegible in source]
11 Claims, 10 Drawing Figures

[Drawing sheets 1 and 2: FIGS. 1-4; the block labels are only partly legible in the source]
[FIG. 5: Spectral magnitude vs. frequency for predictive filter of FIG. 3]
[FIG. 6: Delay parameter k for a typical voiced segment, plotted against time (msec)]

SPEECH SUPPRESSION BY PREDICTIVE FILTERING

FIELD OF THE INVENTION
This invention relates to speech signal processing, and in particular to reducing the energy content of that part of a composite speech signal attributable to an undesired source.
BACKGROUND OF THE INVENTION
In telephony and elsewhere, it often happens that speech from a source the listener wishes to hear is seriously impaired in intelligibility by speech from a second, undesired source. Numerous expedients to reduce the effects of the second source have been proposed. These involve relative enhancement of the desired speech signal, rendering the undesired signal relatively unintelligible, or reducing the energy of the undesired signal. Regardless of the approach, the result typically has been that the desired signal is more intelligible than it would be in the absence of the processing. The "hands-free" telephone well exemplifies this problem of conflicting speech sources, because its electroacoustic speaker constitutes a potential source of undesired signal at the microphone of the same station.
Accordingly, a general object of the invention is to reduce the energy from an undesired speech source in a composite signal containing desired speech.
Another object of the invention is to suppress an undesired speech signal in an electronic communications channel.
A specific object of the invention is to render a desired speech signal relatively more intelligible despite the presence of undesired speech energy.
A particular inventive object is to achieve the foregoing objects in the hands-free telephony situation.
Another specific object of the invention is to avoid voice switching functions and thus enable full-time duplex operation of a hands-free telephone channel.
Yet another inventive object is to distinguish one talker from another nearby talker and to suppress speech signals from one of them.
SUMMARY OF THE INVENTION
The invention is grounded in the general recognition that an unwanted speech signal can be rejected on the basis of its speech parameters.
A discussion of certain speech parameters is found in the patent application of B. S. Atal, Ser. No. 753,408, filed Aug. 19, 1968, now Pat. No. 3,631,520, and assigned to applicants' assignee. A predictive coding technique for reducing transmission bandwidth needs is therein disclosed by Atal, in which an estimate of the present value of a speech sample is made based on a known corresponding past value. From these data, a difference or error signal is generated and transmitted to a remote receiving station along with certain predictor parameters. At the remote receiving station the entire signal is reconstituted from the error signal, using the predictor parameters.
It has been realized that the generic process as represented by the Atal disclosure can be rearranged so as to substantially eliminate a given undesired voice signal.
The basic concept contemplated by the present invention is to extract, from the undesired signal, a gain parameter and a delay parameter. These parameters control a delay and gain network through which both desired speech signals and the undesired signal are routed. The delay is approximately equal to the current duration of the pitch period of the undesired speech. The gain is calculated, in accordance with one of several possible formulas, so as to bring the delayed unwanted signal to the amplitude level of the present value of the unwanted signal. Alternatively, in a technically less complex embodiment, the gain is set equal to 1. In either case, the network output is then subtractively applied to the unwanted signal or to any composite signal containing the unwanted signal. The process may be carried out in analog or digital fashion.
Advantageously however, the process is carried out by sampling techniques where the signal is sampled at a rate of, for example, 6 kHz that results in 30 to 60 samples per pitch period. The number of samples in a pitch period will vary in accordance with the pitch frequency.
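The relation between sampling rate, pitch frequency, and samples per pitch period can be checked with a little arithmetic. The sketch below is illustrative only: the 6 kHz rate comes from the text, while the function name and the three pitch frequencies are assumptions chosen to span the 30 to 60 sample range.

```python
def samples_per_pitch_period(sample_rate_hz: float, pitch_hz: float) -> int:
    """Number of samples spanned by one pitch period, rounded to the nearest sample."""
    return round(sample_rate_hz / pitch_hz)


SAMPLE_RATE_HZ = 6000.0  # sampling rate quoted in the text

# Illustrative pitch frequencies (assumed, not taken from the patent).
for pitch_hz in (100.0, 150.0, 200.0):
    k = samples_per_pitch_period(SAMPLE_RATE_HZ, pitch_hz)
    print(f"pitch {pitch_hz:.0f} Hz -> {k} samples per pitch period")
```

Under these assumptions, the quoted range of 30 to 60 samples corresponds to pitch frequencies between 200 Hz and 100 Hz.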
In one embodiment pursuant to the invention, speech from the loudspeaker of a hands-free telephone set impinging either directly or reverberatively on the set's microphone can be largely removed from the microphone output. The reverberant signal as well as the direct signal is suppressed because the unwanted speech parameters do not vary rapidly during voicing.
In another embodiment pursuant to the invention, speech from, for example, two talkers in the same room is detected by a multiplicity of microphones, and the speech of one talker is suppressed using speech parameters determined by combining the outputs of the microphones.
It will be apparent that the rearrangement of the Atal process constitutes in one aspect a filter; and more specifically, a comb filter with minima at the pitch frequency (and harmonics thereof) of the undesired speech. This distinguishes the predictive filter of the present invention from a conventional echo canceler which merely replicates a reverberant signal and subtractively applies the replica to the composite signal.
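The comb-filter behavior can be illustrated numerically. The sketch below assumes that the delay-and-subtract structure corresponds to a transfer function of the form H(Z) = 1 - b Z^(-k), evaluated on the unit circle; the function name and the example values (k = 40 samples at 6 kHz, i.e. a 150 Hz pitch) are assumptions for illustration.

```python
import cmath
import math


def comb_magnitude(freq_hz: float, b: float, k: int, sample_rate_hz: float) -> float:
    """Magnitude of H(Z) = 1 - b * Z**(-k) at Z = exp(j * 2 * pi * f * T), T = 1 / sample rate."""
    omega_t = 2.0 * math.pi * freq_hz / sample_rate_hz
    return abs(1.0 - b * cmath.exp(-1j * omega_t * k))


SAMPLE_RATE_HZ = 6000.0
K = 40  # assumed delay in samples, i.e. a pitch frequency of 6000 / 40 = 150 Hz

for freq_hz in (75.0, 150.0, 225.0, 300.0):
    unity = comb_magnitude(freq_hz, 1.0, K, SAMPLE_RATE_HZ)
    reduced = comb_magnitude(freq_hz, 0.8, K, SAMPLE_RATE_HZ)
    print(f"{freq_hz:5.0f} Hz: |H| = {unity:.3f} (b = 1.0), {reduced:.3f} (b = 0.8)")
```

With unity gain the response falls essentially to zero at the pitch frequency and its harmonics and peaks midway between them; with a gain below 1 the minima stay at the same frequencies but are shallower.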
The invention and its further objects, features, and advantages will be readily discerned in detail from a reading of the description to follow of illustrative embodiments.
BRIEF DESCRIPTION OF THE DRAWING
FIG. 1 is a communications network schematic block diagram containing a hands-free telephone and an inventive embodiment;
FIG. 2 is a schematic block diagram of the inventive predictive filter;
FIG. 3 is a schematic block diagram further delineating the inventive predictor;
FIGS. 4-6 are graphs depicting various characteristics of the predictor;
FIGS. 7 and 8 are two further embodiments of the invention in a communications network containing hands-free telephones; and
FIGS. 9 and 10 are schematic diagrams of the invention as applied to suppression of speech from talkers in a room.
DETAILED DESCRIPTION OF INVENTIVE EMBODIMENTS
Hands-free Telephone Situations
In the first inventive embodiment, a hands-free telephone loudspeaker 1 and microphone 2 present in a reverberative enclosure 3 are shown in FIG. 1 connected to the speech processor of the present invention. Usually, the desired speech signal input to microphone 2 is from source 4, the near-end talker, whose signal, denoted a, travels mainly the direct path 5 and also reverberative paths not shown. Loudspeaker 1, which broadcasts the far-end talker signal, is a source of undesired input to microphone 2 either via the direct path denoted 6 or via reverberative paths illustrated by path 7. The far-end talker direct path speech signal is denoted c.
The speech processing network 8 in FIG. 1 consists of what will be called a predictive filter 9 connected in the microphone 2 output circuit. As seen in FIGS. 2 and 3, filter 9 consists of two parallel legs. The first leg is a predictor 11, which may be a network consisting of a delay network 12 and an amplifier 13. The second leg is a direct shunt path. Both legs are connected to a subtractor 10. The predictor 11 is controlled, in a manner to be described, by a parameter extractor 14 connected in the loudspeaker 1 circuit.
Pursuant to one embodiment, the invention is carried out digitally. A low pass filter 15, advantageously 3 kHz, and a 6 kHz sampler 16 are serially connected in the output circuit of microphone 2. Similarly, a low pass filter 17 and a sampler 18 are in shunt relation to the loudspeaker 1 input circuit and serially connected to parameter extractor 14.
A waveform representing the far-end talker signal c is illustrated in FIG. 4. Because a speech signal is redundant, i.e., the signal changes little in shape and length of pitch period from one pitch period to the next, the present form or value of signal c can be estimated by a linear prediction based on a past value of signal c.
The signal c of FIG. 4 is shown made up of speech in consecutive pitch periods I1, I2, etc. Inherently, the speech signals in adjacent pitch periods of signal c are of unequal amplitude. Thus, a gain denoted b can be calculated (in a manner to be described) that, when applied to the sampled signal of pitch period I1, will cause the latter to approximate the sampled signal in the next pitch period I2. If the amplified signal of period I1 is then subtractively combined with the signal of period I2, the result is the substantial filtering out of the signal c. In like manner, if a composite signal a + c containing signal c is amplified during period I1 and subtractively combined with the composite signal a + c during period I2, the same result obtains.
Thus, in mathematical terms, W_n (the amplitude of sample n reaching subtractor 10 via the direct path in predictive filter 9) is subtractively combined with W_{n-k} (the amplitude of the delayed sample), where k is the number of samples in a pitch period. Advantageously, the time window over which the parameters are evaluated is of the order of the pitch period to ensure that sufficient energy is present. A time window of 30 samples at a sampling rate of 6 kHz (5 ms) will include between one-half and all the samples in a given pitch period.
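The two-leg structure just described can be sketched in a few lines. This is a minimal illustration, not the patented circuit: the function name `predictive_filter`, the zero padding before the first sample, and the synthetic test waveform are all assumptions made for the example.

```python
def predictive_filter(samples, k, b):
    """Return y[n] = w[n] - b * w[n - k] for each sample.

    The direct leg passes w[n]; the predictor leg delays by k samples and
    scales by b. Samples before the start of the list are taken as 0.0
    (an assumption for this sketch).
    """
    out = []
    for n, w in enumerate(samples):
        delayed = samples[n - k] if n >= k else 0.0
        out.append(w - b * delayed)
    return out


# Synthetic "voiced" waveform: the same 4-sample shape repeated,
# scaled by 0.9 from one pitch period to the next.
period = [1.0, 0.5, -0.5, -1.0]
c = [0.9 ** (n // 4) * period[n % 4] for n in range(16)]

residual = predictive_filter(c, k=4, b=0.9)
```

With the delay equal to the pitch period and the gain equal to the period-to-period amplitude ratio, every sample after the first period cancels, which is the "substantial filtering out" the text refers to. Real speech only approximates this repetition, so the residual is small rather than zero.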
Since speech is only quasi-stationary during voicing, the gain parameter b and delay parameter k have to be periodically calculated. This is accomplished in the digital parameter extractor 14 pursuant to the teaching of the aforementioned Atal patent application Ser. No. 753,408.
As taught therein, input speech samples from sampler 18 are stored as frames of signals. The store content is then fed to an arithmetic unit which is part of parameter extractor 14, wherein, for 30 samples, computational values of correlation X_j are computed as follows:
where N can advantageously be in the range 30-60 samples.
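The expression for X_j does not survive in this copy of the text. One form consistent with the surrounding description (a correlation whose maximum is later compared against a threshold of 0.85, and which is therefore presumably bounded by 1) is the normalized correlation between the present N samples and the N samples j earlier. The following is a reconstruction offered as an assumption, not a verified transcription of the patent's equation:

```latex
X_j \;=\; \frac{\displaystyle\sum_{n=1}^{N} W_n\, W_{n-j}}
               {\left[\displaystyle\sum_{n=1}^{N} W_n^{2}\right]^{1/2}
                \left[\displaystyle\sum_{n=1}^{N} W_{n-j}^{2}\right]^{1/2}}
```

Under this assumed form, the Cauchy-Schwarz inequality gives |X_j| <= 1, with equality only when the delayed segment is a scaled copy of the present one.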
The computed values of X_j are then inspected in a peak locating network, also part of extractor 14, to determine the largest value of X_j. The value of j is found such that X_j is the maximum of all values of X. This particular value of j is the delay parameter, k, which is supplied to predictive filter 9 as one parameter. It is seen that k is a variable delay and that the maximum value of X_j is X_k. The delay parameter k for a typical voiced segment is shown in FIG. 6.
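The peak search can be illustrated as follows. This sketch assumes a normalized correlation for X_j (the patent's own expression is not reproduced in this text); the function names, the lag search range, and the synthetic 150 Hz test signal are hypothetical choices for the example.

```python
import math


def normalized_correlation(w, n0, j, N):
    """Assumed form of X_j over the N samples ending at index n0."""
    num = energy_now = energy_past = 0.0
    for n in range(n0 - N + 1, n0 + 1):
        num += w[n] * w[n - j]
        energy_now += w[n] ** 2
        energy_past += w[n - j] ** 2
    den = math.sqrt(energy_now * energy_past)
    return num / den if den > 0.0 else 0.0


def find_delay(w, n0, N, j_min, j_max):
    """Return (k, X_k): the lag with the largest X_j and that maximum."""
    best_j, best_x = j_min, -1.0
    for j in range(j_min, j_max + 1):
        x = normalized_correlation(w, n0, j, N)
        if x > best_x:
            best_j, best_x = j, x
    return best_j, best_x


# Synthetic voiced signal: 40-sample pitch period (150 Hz at 6 kHz sampling).
w = [math.sin(2 * math.pi * n / 40) + 0.5 * math.sin(4 * math.pi * n / 40)
     for n in range(200)]
k, x_k = find_delay(w, n0=199, N=30, j_min=20, j_max=60)
```

The search range must cover the expected pitch periods, and the signal history must extend at least `N + j_max` samples back from `n0`.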
The gain parameter b is calculated by computing circuitry also in parameter extractor 14, that solves:
The gain parameter b likewise is supplied to predictive filter 9.
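The expression solved for b is likewise missing from this text. A common choice, assumed here rather than confirmed by the source, is the least-squares gain: the sum of products W_n W_{n-k} divided by the energy of the delayed samples, which minimizes the energy of W_n - b W_{n-k} over the window. The function name and test waveform below are illustrative only.

```python
def prediction_gain(w, n0, k, N):
    """Least-squares gain b over the N samples ending at index n0.

    Assumed form: b = sum(w[n] * w[n-k]) / sum(w[n-k] ** 2).
    Returns 0.0 when the delayed segment has no energy.
    """
    num = den = 0.0
    for n in range(n0 - N + 1, n0 + 1):
        num += w[n] * w[n - k]
        den += w[n - k] ** 2
    return num / den if den > 0.0 else 0.0


# Same 4-sample shape repeated, scaled by 0.9 each pitch period.
period = [1.0, 0.5, -0.5, -1.0]
c = [0.9 ** (n // 4) * period[n % 4] for n in range(16)]
b = prediction_gain(c, n0=15, k=4, N=8)
```

For this synthetic waveform the computed gain recovers the 0.9 period-to-period amplitude ratio.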
The described calculation of delay parameter k and gain parameter b is but one of several systems by which, from an analysis of the speech energy content in adjacent or substantially adjacent signal segments, parameters may be calculated that when applied to a past signal segment will render the latter closely similar to the shape of the present signal segment.
Thus, incoming speech to loudspeaker 1 is continuously analyzed to extract therefrom an optimum delay parameter and a gain factor. These parameters are periodically updated, for example every 5 ms. When no incoming signal to loudspeaker 1 is present, the delay and gain are zero. With incoming signal, the calculated present signal value output of predictor 11 is subtracted from the undelayed, unamplified signal sample representing signals a + c.
Reconversion to analog form of the signal in the microphone 2 output circuit is achieved in D/A converter 10A.
The filter depicted in FIG. 3 and described above has, in Z transform notation, the transfer function
H(Z) = 1 - bZ^{-k}
The magnitude of the frequency response of a typical embodiment of filter 9 using predictor 11 is shown in FIG. 5, where T is the sampling period.
The frequency response for gain parameter b < 1 is shown by the solid curves and for gain parameter b = 1 by the broken line. Since speech is dynamic during voicing, the parameters b and k have to be optimized as stated above and readjusted periodically, for example every 5 ms.
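The comb shape of that response follows directly from a filter that subtracts a delayed, scaled copy of its input. As a worked sketch, taking the transfer function as H(Z) = 1 - bZ^{-k} and evaluating on the unit circle (the symbol i for the imaginary unit is chosen here to avoid a clash with the lag index j):

```latex
H\!\left(e^{i\omega T}\right) = 1 - b\,e^{-i\omega kT}
\quad\Longrightarrow\quad
\left|H\!\left(e^{i\omega T}\right)\right|^{2} = 1 + b^{2} - 2b\cos(\omega kT)
```

The magnitude is smallest, |1 - b|, where \omega kT is a multiple of 2\pi, i.e. at frequencies f = m/(kT), which are the pitch frequency and its harmonics. It is largest, 1 + b, midway between them. For b = 1 the nulls are complete; for b = 0.9 the attenuation at each harmonic is 20\log_{10}(0.1) = -20 dB.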
Filtering of the input speech by the calculated parameters results in suppression of the undesired signal c of up to 30 dB during voiced segments, and an average suppression of about 14 dB of the undesired signal c.
The parameters b and k calculated do not vary smoothly with time. The optimum delay occasionally doubles during voiced segments. Also, during unvoiced segments, the optimum delay varies rapidly over a wide range while the correlation remains relatively low. However, the gains calculated are not negligible during these unvoiced portions. Desired speech a, uncorrelated with the undesired signal c which is to be rejected, is degraded when passed through a filter with these rapidly varying filter parameters, while under such conditions no additional suppression of the unwanted source is accomplished.
To avoid this difficulty, logic is introduced, pursuant to one facet of the invention, to prevent undesirable variation of the filter parameters b and k. It was determined that not much suppression was obtained when the correlation X_k was less than 0.85. Consequently, parameter control circuit 19 (FIG. 1) sets gain b equal to zero for X_k < 0.85. This choice of threshold is a compromise between a value as great as possible and one low enough that all of the voiced segments of speech are suppressed. The resulting suppression during voicing is unchanged while degradation of a second speech signal is reduced. FIG. 6 shows the variation of delay parameter k during a typical voiced segment.
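The gating rule reduces to a single comparison per parameter update. The sketch below is illustrative: the 0.85 threshold comes from the text, while the function name and the example frame values are hypothetical.

```python
CORRELATION_THRESHOLD = 0.85  # threshold value stated in the text


def gate_gain(b, x_k, threshold=CORRELATION_THRESHOLD):
    """Return b unchanged when the peak correlation meets the threshold.

    Below the threshold the gain is forced to zero, which turns the
    predictive filter into a straight-through path for that update.
    """
    return b if x_k >= threshold else 0.0


# Hypothetical (b, X_k) pairs for four successive 5 ms updates.
frames = [(0.92, 0.95), (0.88, 0.91), (0.60, 0.40), (0.75, 0.84)]
gated = [gate_gain(b, x_k) for b, x_k in frames]
```

Here the first two updates look voiced and keep their gains, while the last two fall below the threshold and are zeroed, so the desired speech passes through unfiltered during those intervals.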
Since the parameters b and k vary relatively slowly during voiced segments, the predictive filter 9 will be effective in removing part of the reverberant signal as well as the direct sound. Specifically, that part of the reverberant signal that has parameters not greatly different from the filter parameters will be reduced in amplitude.
The foregoing discussion of the invention as applied to hands-free telephony has assumed no separation between loudspeaker 1 and microphone 2. In practice, however, a significant transit time for the signal c to travel path 6 to microphone 2 is required. It is therefore necessary to compensate in speech processing network 8 for the loudspeaker-microphone transit time.
This is achieved by parameter delay circuit 19A, which is serially connected between the output of parameter control 19 and predictive filter 9. Parameter delay circuit 19A advantageously is provided with a delay duration adjustment circuit 19B with which the delay duration may be set to correspond to the transit time which characterizes each given hands-free telephone.
A combination of a predictive filter with a center-clipping echo suppressor of the type taught in D. A. Berkley-O. M. M. Mitchell-J. R. Pierce U.S. Pat. No. 3,699,271, which is hereby incorporated by reference, is shown in FIGS. 7 and 8. This combination is a possible replacement for voice switching presently used for echo and feedback suppression.
FIG. 7 shows a network denoted 50 for eliminating the echo of the far-end talker in a 4-wire hands-free telephone. Like numerals denote items which correspond to counterparts in FIGS. 1-3. The far-end echo picked up by the microphone 2 from loudspeaker 1 is first reduced in amplitude during voiced segments by a speech processor 8 in the manner described previously. Gain and delay parameters b and k of the far-end speech are measured on the received loudspeaker signal, and the far-end echo component of the microphone signal is reduced by filtering. The remaining far-end signal at the output of the speech processor 8 is then removed by the center-clipping echo suppressor.
As taught in D. A. Berkley et al. U.S. Pat. No. 3,699,271, the received signal is used to set the clipping levels by means of clipping control 22 so as just to remove the echo. The output of D/A converter 10A is fed to filter bank 40, which comprises plural contiguous band filters in the voice frequency range. In center clipper 41 the signal in each subband from filter bank 40 is center clipped at a level determined by clipping control 22, which measures in effect the energy level in the received signal within each of the subbands. The output of clipper 41 is filtered in bank 42, which is similar to bank 40.
In this embodiment, the clipping control 22 is advantageously controlled also by the parameter extractor 14. Since the echo is reduced by the predictive filter 9 during voicing, the clipping levels can be reduced by substantially the same amount during voicing. Consequently, in FIG. 7, a control signal is shown (dashed line) between the parameter extractor 14 and the clipping control 22, which causes an attenuation of the input to clipping level control 22 that is equal to the suppression achieved by speech processing network 8. It will be recognized that optimum performance of clipping level control 22 will be realized by inserting a delay in its input path to compensate for the already mentioned signal transit time between loudspeaker 1 and microphone 2. With the clipping levels thus reduced during voiced segments, there will be less mutilation of the near-end speech by the center-clipping process.
FIG. 8 shows a circuit for eliminating both the far-end echo (echo of far-end talker caused by acoustic coupling through room acoustics) and near-end echo (echo of near-end talker caused by imperfect hybrid junction) in a 2-wire hands-free telephone. The far-end echo is eliminated by network 50 as described above for FIG. 7. The near-end echo is eliminated by a similar circuit denoted 51 introduced on the receive side of the local 4-wire network as shown.
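The interaction between the predictive filter and the clipping level can be sketched as follows. The clipper shape used here (samples within the level are zeroed, larger samples are shifted toward zero by the level) is one common form of center clipping and is an assumption; the referenced patent may define it differently. The function names and numbers are illustrative only.

```python
def center_clip(samples, level):
    """Zero samples with magnitude <= level; shift the rest toward zero."""
    out = []
    for s in samples:
        if s > level:
            out.append(s - level)
        elif s < -level:
            out.append(s + level)
        else:
            out.append(0.0)
    return out


def reduced_level(level, suppression_db):
    """Lower a clipping level by the suppression already achieved, in dB."""
    return level * 10.0 ** (-suppression_db / 20.0)


subband = [0.05, -0.3, 0.5]
clipped_full = center_clip(subband, 0.1)
# With 20 dB of echo already removed upstream, the level can drop tenfold.
clipped_reduced = center_clip(subband, reduced_level(0.1, 20.0))
```

With the lower level, the small 0.05 sample survives instead of being zeroed, which is the sense in which reduced clipping levels cause less mutilation of near-end speech.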
An alternative method of adjusting the clipping level control by the parameter extractor 14 via the parameter delay 19A is shown in circuit 51. A second predictive filter designated 9a is used in circuit 51 to attenuate the clipping level control signal during voiced segments. Thus the clipping levels follow the signal at the input to the narrow band center clipper, i.e., at the output of the predictive filter 9a.
Suppression of One of Two Room Speakers
A further embodiment allows the suppression of the speech signal from one of two talkers in a room. FIG. 9 shows the desired speech source 23 and an undesired source 24, both of whose speech signals form the input to microphones 25 and 26. The undesired source 24 is positioned so that the time delays for direct sound transmission to microphones 25 and 26 are equal. In the output of microphone 25 is predictive filter 9. The outputs of microphone 25, as well as microphone 26, enter the parameter extractor 30, wherein an arithmetic unit within the extractor calculates the computational values X_j and Y_j. A peak picking network within the extractor then selects the peaks from X_j and Y_j, and a comparator finds the largest value peak which occurs in both sequences X_j and Y_j for the same value of j. This value of j is the delay parameter k for the undesired speech supplied to the predictive filter 9.
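The peak-matching step can be sketched under stated assumptions. The text does not reproduce how X_j and Y_j are computed, so this example starts from two already-computed sequences indexed by lag j. Treating a "peak" as a simple local maximum, and ranking shared peaks by the smaller of the two values, are both assumptions made for illustration.

```python
def local_peaks(values):
    """Return the set of interior indices that are local maxima."""
    return {j for j in range(1, len(values) - 1)
            if values[j - 1] < values[j] >= values[j + 1]}


def common_peak_delay(x, y):
    """Return the lag j that is a peak in both sequences, or None.

    Among shared peaks, the one whose weaker value min(x[j], y[j]) is
    largest is chosen (an assumed reading of "largest value peak").
    """
    shared = local_peaks(x) & local_peaks(y)
    if not shared:
        return None
    return max(shared, key=lambda j: min(x[j], y[j]))


# Hypothetical correlation values indexed by lag j = 0..6.
x = [0.1, 0.5, 0.2, 0.9, 0.3, 0.7, 0.1]
y = [0.2, 0.1, 0.3, 0.8, 0.2, 0.9, 0.4]
k = common_peak_delay(x, y)
```

Lag 1 is a peak only in the first sequence and is discarded; lags 3 and 5 are peaks in both, and lag 3 is selected because its weaker value (0.8) exceeds that of lag 5 (0.7).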
An alternative method of extracting the parameters is shown in FIG. 10. Two additional microphones 27 and 28 are positioned so that time delays from desired speaker 23 for direct sound transmission to microphones 27 and 28 are equal to the time delays to microphones 25 and 26. The outputs of all microphones 25-28 are processed by a non-linear processor 31 as described in O. M. M. Mitchell-C. A. Ross-R. L. Wallace, Jr. U.S. Pat. No. 3,644,674, which is hereby incorporated by reference. The output of processor 31 contains the undesired signal and an attenuated and disturbed component of the desired signal. (The outputs may alternatively be added to merely attenuate the desired signal.) The output of the non-linear processor 31 enters the speech processing network 8. The output of microphone 25 is processed by speech processing network 8, which filters out the undesired talker 24 in the manner already described. The presence in the output of the non-linear processor 31 of a small amount of the desired talker does not significantly affect the delay parameter k but will cause a small error in the evaluation of X and b.
It is to be understood that the embodiments described herein are merely illustrative of the principles of the invention. Various modifications may be made thereto by persons skilled in the art without departing from the spirit and scope of the invention.
What is claimed is:
1. Speech processing apparatus for suppressing voiced segments of an undesired speech signal while leaving a desired speech signal intelligible, comprising:
means for deriving an electronic waveform representing the undesired speech signal;
means for deriving an electronic waveform representing a composite signal containing a reverberant version of the undesired speech signal and the desired signal;
means for deriving from the waveform of said undesired speech signal a delay parameter determined from the signal values during an interval embracing a substantial portion of a pitch period of said undesired speech signal;
means for applying said composite speech signal waveform to a summer over a first path;
means for delaying in a second path said composite speech signal waveform by an amount of said delay parameter; and means for subtractively applying to said summer the delayed said composite speech waveform.
2. Apparatus pursuant to claim 1, further comprising means responsive to the absence of voiced segments of said undesired speech signal for interrupting said second path.
3. Apparatus in accordance with claim 1 further comprising means for deriving from the waveform of said undesired speech signal a gain parameter specifying the amount by which the amplitudes of corresponding values of said undesired speech signal in a past said interval must be respectively adjusted so as to produce a substantial duplicate of the undesired said speech signal of a present said interval; and which further comprises means controlled by said gain parameter for amplifying said delayed composite speech waveform prior to its being subtractively applied to said summer.
4. A communications network comprising:
a hands-free telephone station including a direct acoustic coupling path between the station loudspeaker and microphone, a second remote telephone station, and transmission means interconnecting said stations;
means for deriving, from the incoming signal waveform to said loudspeaker from said second station, a delay parameter representing the duration of an interval embracing a substantial portion of the present pitch period of speech from the remote station; and a gain parameter specifying the amount by which the waveform in a past said interval must be changed in amplitude to substantially correspond to the undesired speech waveform of the present interval;
means for applying the composite signal, consisting of the desired near-end talker signal and the acoustically coupled far-end talker signal from said loudspeaker, in said microphone output to a summer over a first path;
means disposed in a second path for delaying said composite signal by the amount of said delay parameter and for amplifying the delayed composite signal an amount determined by said gain parameter; and
means for subtractively applying the delayed, amplified composite signal to said summer.
5. A communications network pursuant to claim 4 wherein said deriving means comprises:
a signal sampler connected to the circuit of said loudspeaker and operating at a set sampling rate; and
means for computing values of a term X_j in accordance with the relationship [equation not legible in this copy] where W_n is the amplitude of a sample n reaching said sampler, and means for finding that value of j such that X_j is the maximum of all values of X_j, the found value of j constituting said delay parameter.
6. A communications network pursuant to claim 5 wherein said gain parameter deriving means comprises means for computing the value b in accordance with the relationship [equation not legible in this copy] where W_n is the amplitude of a sample n reaching said sampler, and k is a delay parameter for a voiced segment.
7. A communications network pursuant to claim 6, further comprising means for rendering said gain parameter equal to zero in the absence of voiced segments of the signal in said loudspeaker path from said remote station.
8. A communications network pursuant to claim 7, further comprising means for setting said gain parameter equal to zero in response to values of X_k, corresponding to the maximum computed values of X_j, which are less than a critical predetermined value.
9. A communications network pursuant to claim 8, further comprising means for adjustably delaying arrival of said delay and gain parameters at said second path by an amount that compensates for the transit time delay over said direct acoustic coupling path of speech from said remote station.
10. A communications network pursuant to claim 4, further comprising:
filter bank means connected to the output of said summer and comprising plural contiguous subbands;
means for producing, from said remote station speech signal, control signals representative of the incoming speech energy level in each said subband;
means for center-clipping the output of each said filter bank subband a varying amount in response to the concurrent said energy level value; and
means connecting said control signal producing means and said deriving means responsive to voiced portions of signal from said remote station for reducing all said clipping levels.
11. Speech processing apparatus for suppressing speech from one of two talkers in a room comprising:
first and second microphones located equidistant from the first, desired said talker but at unequal distances from the second, undesired said talker;
means for deriving from the two said microphone outputs a first parameter X_j in accordance with the relationship [equation not legible in this copy] and a second parameter Y_j, respectively, calculated in accordance with the relationship [equation not legible in this copy] where W_n^(1) is the speech signal received by said first microphone and W_n^(2) the speech signal received by said second microphone;
means for selecting the largest value peak from the composite peak values of said parameters X_j and Y_j for the same value of the term j;
means for applying the desired and the undesired said signals from one of said microphones to a summer directly over a first path and alternately over a second path through a network including delay means; and
means for adjusting said delay means as a function of the value of the term j.

Claims (11)

1. Speech processing apparatus for suppressing voiced segments of an undesired speech signal while leaving a desired speech signal intelligible, comprising: means for deriving an electronic waveform representing the undesired speech signal; means for deriving an electronic waveform representing a composite signal containing a reverberant version of the undesired speech signal and the desired signal; means for deriving from the waveform of said undesired speech signal a delay parameter determined from the signal values during an interval embracing a substantial portion of a pitch period of said undesired speech signal; means for applying said composite speech signal waveform to a summer over a first path; means for delaying in a second path said composite speech signal waveform by an amount of said delay parameter; and means for subtractively applying to said summer the delayed said composite speech waveform.
2. Apparatus pursuant to claim 1, further comprising means responsive to the absence of voiced segments of said undesired speech signal for interrupting said second path.
3. Apparatus in accordance with claim 1 further comprising means for deriving from the waveform of said undesired speech signal a gain parameter specifying the amount by which the amplitudes of corresponding values of said undesired speech signal in a past said interval must be respectively adjusted so as to produce a substantial duplicate of the undesired said speech signal of a present said interval; and which further comprises means controlled by said gain parameter for amplifying said delayed composite speech waveform prior to its being subtractively applied to said summer.
4. A communications network comprising: a hands-free telephone station including a direct acoustic coupling path between the station loudspeaker and microphone, a second remote telephone station, and transmission means interconnecting said stations; means for deriving, from the incoming signal waveform to said loudspeaker from said second station, a delay parameter representing the duration of an interval embracing a substantial portion of the present pitch period of speech from the remote station; and a gain parameter specifying the amount by which the waveform in a past said interval must be changed in amplitude to substantially correspond to the undesired speech waveform of the present interval; means for applying the composite signal, consisting of the desired near-end talker signal and the acoustically coupled far-end talker signal from said loudspeaker, in said microphone output to a summer over a first path; means disposed in a second path for delaying said composite signal by the amount of said delay parameter and for amplifying the delayed composite signal an amount determined by said gain parameter; and means for subtractively applying the delayed, amplified composite signal to said summer.
5. A communications network pursuant to claim 4 wherein said deriving means comprises: a signal sampler connected to the circuit of said loudspeaker and operating at a set sampling rate; and means for computing values of a term Xj in accordance with the relationship
6. A communications network pursuant to claim 5 wherein said gain parameter deriving means comprises means for computing the value b in accordance with the relationship
7. A communications network pursuant to claim 6, further comprising means for rendering said gain parameter equal to zero in the absence of voiced segments of the signal in said loudspeaker path from said remote station.
8. A communications network pursuant to claim 7, further comprising means for setting said gain parameter equal to zero in response to values of Xk, corresponding to the maximum computed values of Xj which are less than a critical predetermined value.
9. A communications network pursuant to claim 8, further comprising means for adjustably delaying arrival of said delay and gain parameters at said second path by an amount that compensates for the transit time delay over said direct acoustic coupling path of speech from said remote station.
10. A communications network pursuant to claim 4, further comprising: filter bank means connected to the output of said summer and comprising plural contiguous subbands; means for producing, from said remote station speech signal, control signals representative of the incoming speech energy level in each said subband; means for center-clipping the output of each said filter bank subband a varying amount in response to the concurrent said energy level value; and means connecting said control signal producing means and said deriving means responsive to voiced portions of signal from said remote station for reducing all said clipping levels.
11. Speech processing apparatus for suppressing speech from one of two talkers in a room comprising: first and second microphones located equidistant from the first, desired said talker but at unequal distances from the second, undesired said talker; means for deriving from the two said microphone outputs a first parameter Xj in accordance with the relationship
US00204509A 1971-12-03 1971-12-03 Speech suppression by predictive filtering Expired - Lifetime US3784747A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US20450971A 1971-12-03 1971-12-03

Publications (1)

Publication Number Publication Date
US3784747A true US3784747A (en) 1974-01-08

Family

ID=22758195

Family Applications (1)

Application Number Title Priority Date Filing Date
US00204509A Expired - Lifetime US3784747A (en) 1971-12-03 1971-12-03 Speech suppression by predictive filtering

Country Status (5)

Country Link
US (1) US3784747A (en)
JP (1) JPS55760B2 (en)
CA (1) CA952439A (en)
DE (1) DE2207141C3 (en)
GB (1) GB1369711A (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3922488A (en) * 1972-12-15 1975-11-25 Ard Anstalt Feedback-cancelling electro-acoustic transducer apparatus
US4024358A (en) * 1975-10-31 1977-05-17 Communications Satellite Corporation (Comsat) Adaptive echo canceller using differential pulse code modulation encoding
US4031338A (en) * 1976-02-10 1977-06-21 Communications Satellite Corporation (Comsat) Echo suppressor using frequency-selective center clipping
US4122303A (en) * 1976-12-10 1978-10-24 Sound Attenuators Limited Improvements in and relating to active sound attenuation
US4166924A (en) * 1977-05-12 1979-09-04 Bell Telephone Laboratories, Incorporated Removing reverberative echo components in speech signals
FR2451676A1 (en) * 1979-03-12 1980-10-10 Soumagne Joel ECHO DETECTOR IN PARTICULAR FOR SPEECH INTERPOLATION COMMUNICATION SYSTEM
US4360708A (en) * 1978-03-30 1982-11-23 Nippon Electric Co., Ltd. Speech processor having speech analyzer and synthesizer
EP0106640A1 (en) * 1982-10-15 1984-04-25 British Telecommunications Noise control circuit
US4473906A (en) * 1980-12-05 1984-09-25 Lord Corporation Active acoustic attenuator
US4591670A (en) * 1982-09-30 1986-05-27 Nec Corporation Echo canceller and echo suppressor for frequency divisional attenuation of acoustic echoes
EP0204718A1 (en) * 1984-12-14 1986-12-17 Motorola Inc Full duplex speakerphone for radio and landline telephones.
US4670903A (en) * 1981-06-30 1987-06-02 Nippon Electric Co., Ltd. Echo canceller for attenuating acoustic echo signals on a frequency divisional manner
US4819263A (en) * 1986-06-30 1989-04-04 Cellular Communications Corporation Apparatus and method for hands free telephonic communication
US4825384A (en) * 1981-08-27 1989-04-25 Canon Kabushiki Kaisha Speech recognizer
EP0472356A1 (en) * 1990-08-16 1992-02-26 Fujitsu Ten Limited Speech recognition apparatus for a vehicle, using a microphone arrangement to determine the seat from which a command is generated
US5619566A (en) * 1993-08-27 1997-04-08 Motorola, Inc. Voice activity detector for an echo suppressor and an echo suppressor
EP0881814A1 (en) * 1997-05-28 1998-12-02 Deutsche Telekom AG Method for determining the step size Alpha for adjusting the convergence speed in the NLMS algorithm
WO2001035118A1 (en) * 1999-11-05 2001-05-17 Wavemakers Research, Inc. Method to determine whether an acoustic source is near or far from a pair of microphones
US6249581B1 (en) * 1997-08-01 2001-06-19 Bitwave Pte. Ltd. Spectrum-based adaptive canceller of acoustic echoes arising in hands-free audio
US6442275B1 (en) 1998-09-17 2002-08-27 Lucent Technologies Inc. Echo canceler including subband echo suppressor
US20030219112A1 (en) * 2002-05-22 2003-11-27 Boland Simon Daniel Apparatus and method for echo control
US20030219087A1 (en) * 2002-05-22 2003-11-27 Boland Simon Daniel Apparatus and method for time-alignment of two signals
EP1739654A2 (en) 1999-09-08 2007-01-03 Volkswagen AG Method for operating a multiple microphone system in a motor vehicle and multiple microphone system itself
US7734034B1 (en) 2005-06-21 2010-06-08 Avaya Inc. Remote party speaker phone detection

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997028917A1 (en) * 1996-02-06 1997-08-14 Shinagawa Refractories Co., Ltd. Immersion nozzle replacement apparatus
US11423921B2 (en) 2018-06-11 2022-08-23 Sony Corporation Signal processing device, signal processing method, and program
CN112203188B (en) * 2020-07-24 2021-10-01 北京工业大学 Automatic volume adjusting method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3133990A (en) * 1962-04-27 1964-05-19 Altec Lansing Corp Automatic level-adjustment circuit
US3177489A (en) * 1960-01-11 1965-04-06 Thompson Ramo Wooldridge Inc Interference suppression systems
US3601549A (en) * 1969-11-25 1971-08-24 Bell Telephone Labor Inc Switching circuit for cancelling the direct sound transmission from the loudspeaker to the microphone in a loudspeaking telephone set
US3603744A (en) * 1965-09-29 1971-09-07 Superior Continental Corp Line tap unit for telephone system
US3631520A (en) * 1968-08-19 1971-12-28 Bell Telephone Labor Inc Predictive coding of speech signals
US3644674A (en) * 1969-06-30 1972-02-22 Bell Telephone Labor Inc Ambient noise suppressor

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3177489A (en) * 1960-01-11 1965-04-06 Thompson Ramo Wooldridge Inc Interference suppression systems
US3133990A (en) * 1962-04-27 1964-05-19 Altec Lansing Corp Automatic level-adjustment circuit
US3603744A (en) * 1965-09-29 1971-09-07 Superior Continental Corp Line tap unit for telephone system
US3631520A (en) * 1968-08-19 1971-12-28 Bell Telephone Labor Inc Predictive coding of speech signals
US3644674A (en) * 1969-06-30 1972-02-22 Bell Telephone Labor Inc Ambient noise suppressor
US3601549A (en) * 1969-11-25 1971-08-24 Bell Telephone Labor Inc Switching circuit for cancelling the direct sound transmission from the loudspeaker to the microphone in a loudspeaking telephone set

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3922488A (en) * 1972-12-15 1975-11-25 Ard Anstalt Feedback-cancelling electro-acoustic transducer apparatus
US4024358A (en) * 1975-10-31 1977-05-17 Communications Satellite Corporation (Comsat) Adaptive echo canceller using differential pulse code modulation encoding
US4031338A (en) * 1976-02-10 1977-06-21 Communications Satellite Corporation (Comsat) Echo suppressor using frequency-selective center clipping
US4122303A (en) * 1976-12-10 1978-10-24 Sound Attenuators Limited Improvements in and relating to active sound attenuation
US4166924A (en) * 1977-05-12 1979-09-04 Bell Telephone Laboratories, Incorporated Removing reverberative echo components in speech signals
US4360708A (en) * 1978-03-30 1982-11-23 Nippon Electric Co., Ltd. Speech processor having speech analyzer and synthesizer
FR2451676A1 (en) * 1979-03-12 1980-10-10 Soumagne Joel ECHO DETECTOR IN PARTICULAR FOR SPEECH INTERPOLATION COMMUNICATION SYSTEM
US4473906A (en) * 1980-12-05 1984-09-25 Lord Corporation Active acoustic attenuator
US4670903A (en) * 1981-06-30 1987-06-02 Nippon Electric Co., Ltd. Echo canceller for attenuating acoustic echo signals on a frequency divisional manner
US4825384A (en) * 1981-08-27 1989-04-25 Canon Kabushiki Kaisha Speech recognizer
US4591670A (en) * 1982-09-30 1986-05-27 Nec Corporation Echo canceller and echo suppressor for frequency divisional attenuation of acoustic echoes
EP0106640A1 (en) * 1982-10-15 1984-04-25 British Telecommunications Noise control circuit
EP0204718A4 (en) * 1984-12-14 1988-03-30 Motorola Inc Full duplex speakerphone for radio and landline telephones.
EP0204718A1 (en) * 1984-12-14 1986-12-17 Motorola Inc Full duplex speakerphone for radio and landline telephones.
US4819263A (en) * 1986-06-30 1989-04-04 Cellular Communications Corporation Apparatus and method for hands free telephonic communication
EP0472356A1 (en) * 1990-08-16 1992-02-26 Fujitsu Ten Limited Speech recognition apparatus for a vehicle, using a microphone arrangement to determine the seat from which a command is generated
US5619566A (en) * 1993-08-27 1997-04-08 Motorola, Inc. Voice activity detector for an echo suppressor and an echo suppressor
EP0881814A1 (en) * 1997-05-28 1998-12-02 Deutsche Telekom AG Method for determining the step size Alpha for adjusting the convergence speed in the NLMS algorithm
US6249581B1 (en) * 1997-08-01 2001-06-19 Bitwave Pte. Ltd. Spectrum-based adaptive canceller of acoustic echoes arising in hands-free audio
US6442275B1 (en) 1998-09-17 2002-08-27 Lucent Technologies Inc. Echo canceler including subband echo suppressor
EP1739654A2 (en) 1999-09-08 2007-01-03 Volkswagen AG Method for operating a multiple microphone system in a motor vehicle and multiple microphone system itself
WO2001035118A1 (en) * 1999-11-05 2001-05-17 Wavemakers Research, Inc. Method to determine whether an acoustic source is near or far from a pair of microphones
US20030219112A1 (en) * 2002-05-22 2003-11-27 Boland Simon Daniel Apparatus and method for echo control
US20030219087A1 (en) * 2002-05-22 2003-11-27 Boland Simon Daniel Apparatus and method for time-alignment of two signals
US7027593B2 (en) * 2002-05-22 2006-04-11 Avaya Technology Corp. Apparatus and method for echo control
US7043014B2 (en) * 2002-05-22 2006-05-09 Avaya Technology Corp. Apparatus and method for time-alignment of two signals
US7734034B1 (en) 2005-06-21 2010-06-08 Avaya Inc. Remote party speaker phone detection

Also Published As

Publication number Publication date
GB1369711A (en) 1974-10-09
DE2207141B2 (en) 1980-10-09
CA952439A (en) 1974-08-06
JPS55760B2 (en) 1980-01-09
JPS4865813A (en) 1973-09-10
DE2207141C3 (en) 1981-07-30
DE2207141A1 (en) 1973-08-02

Similar Documents

Publication Publication Date Title
US3784747A (en) Speech suppression by predictive filtering
Allen et al. Multimicrophone signal‐processing technique to remove room reverberation from speech signals
CN1689072B (en) Method and system for processing subband signals using adaptive filters
EP1602223B1 (en) Echo canceller with reduced requirement for processing power
US6904146B2 (en) Full duplex echo cancelling circuit
JP3228940B2 (en) Method and apparatus for reducing residual far-end echo in voice communication networks
RU2109408C1 (en) Line echo suppressor
US8306215B2 (en) Echo canceller for eliminating echo without being affected by noise
KR100482396B1 (en) Device for suppressing interference component of input signal
US5390244A (en) Method and apparatus for periodic signal detection
US20080170706A1 (en) Method And Device For Removing Echo In A Multi-Channel Audio Signal
EP1715669A1 (en) A method for removing echo in an audio signal
EP0853844B1 (en) Echo cancelling system for digital telephony applications
JPS583430A (en) Suppressing device of detouring signal
CN110956975B (en) Echo cancellation method and device
KR20130040194A (en) Method and device for suppressing residual echoes
JP3507020B2 (en) Echo suppression method, echo suppression device, and echo suppression program storage medium
KR100470523B1 (en) Process and Apparatus for Eliminating Loudspeaker Interference from Microphone Signals
US6834108B1 (en) Method for improving acoustic noise attenuation in hand-free devices
US3585311A (en) Speech processor using contiguous multiband center-clipping
US20080152156A1 (en) Robust Method of Echo Suppressor
US7711107B1 (en) Perceptual masking of residual echo
Surin et al. An adaptive noise decorrelation technique for stereophonic acoustic echo cancellation
Cecchi et al. Multichannel double-talk detector based on fundamental frequency estimation
KR100272131B1 (en) Adaptive reverbation cancelling apparatus