US3784747A - Speech suppression by predictive filtering - Google Patents
Speech suppression by predictive filtering Download PDFInfo
- Publication number
- US3784747A US3784747A US00204509A US3784747DA US3784747A US 3784747 A US3784747 A US 3784747A US 00204509 A US00204509 A US 00204509A US 3784747D A US3784747D A US 3784747DA US 3784747 A US3784747 A US 3784747A
- Authority
- US
- United States
- Prior art keywords
- signal
- speech
- undesired
- waveform
- parameter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000001629 suppression Effects 0.000 title abstract description 15
- 238000001914 filtration Methods 0.000 title description 4
- 230000003111 delayed effect Effects 0.000 claims abstract description 12
- 239000002131 composite material Substances 0.000 claims description 25
- 238000012545 processing Methods 0.000 claims description 11
- 230000004044 response Effects 0.000 claims description 6
- 230000005540 biological transmission Effects 0.000 claims description 5
- 230000008878 coupling Effects 0.000 claims description 5
- 238000010168 coupling process Methods 0.000 claims description 5
- 238000005859 coupling reaction Methods 0.000 claims description 5
- 238000005070 sampling Methods 0.000 claims description 5
- 238000009877 rendering Methods 0.000 claims description 3
- 238000000034 method Methods 0.000 description 9
- 230000008569 process Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 208000019300 CLIPPERS Diseases 0.000 description 3
- 208000021930 chronic lymphocytic inflammation with pontine perivascular enhancement responsive to steroids Diseases 0.000 description 3
- 230000001934 delay Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 101150071609 floA gene Proteins 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M9/00—Arrangements for interconnection not involving centralised switching
- H04M9/08—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
- H04M9/082—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using echo cancellers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B3/00—Line transmission systems
- H04B3/02—Details
- H04B3/20—Reducing echo effects or singing; Opening or closing transmitting path; Conditioning for transmission in one direction or the other
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M9/00—Arrangements for interconnection not involving centralised switching
- H04M9/08—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
- H04M9/087—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using different frequency bands for transmitting and receiving paths ; using phase shifting arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/02—Circuits for transducers, loudspeakers or microphones for preventing acoustic reaction, i.e. acoustic oscillatory feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02087—Noise filtering the noise being separate speech, e.g. cocktail party
Definitions
- ABSTRACT Speech signal energy from an undesired source is suppressed by extracting from the undesired signal a delay parameter and a gain parameter. These parameters control a delay and gain network through which both the desired and undesired signals are routed.
- the invention is applied to several handsfree telephony situations and to the suppression of one of two speakers in a room.
- a general object of the invention is to reduce the energy from an undesired speech source in a composite signal containing desired speech.
- Another object of the invention is tosuppress an undesired speech signal in an electronic communications channel.
- a specific object of the invention is to render a desired speech signal relatively more intelligible despite the presence of undesired speech energy.
- a particular inventive object is to achieve the foregoing objects in the hands-free telephony situation.
- Another specific object of the invention is to avoid voice switching functions and thus enable full-time duplex operation of a hands-free telephone channel.
- Yet another inventive object is to distinguish one talker from another nearby talker and to suppress speech signals from one of them.
- the invention is grounded in the general recognition that an unwanted speech signal can be rejected on the basis of its speech parameters.
- the basic concept contemplated by the present invention is to extract, from the undesired signal, a gain parameter and a delay parameter. These parameters control a delay and gain network through which both desired speech signals and the undesired signal are routed.
- the delay is approximately equal to the current duration of the pitch period of the undesired speech.
- the gain is calculated, in accordance with one of several possible formulas, so as to bring the delayed unwanted signal to the amplitude level of the present value of the unwanted signal.
- the gain is set equal to l.
- the network output is then subtractively applied to the unwanted signal or to any composite signal containing the unwanted signal. The process may be carried out in analog or digital fashion.
- the process is carried out by sampling techniques where the signal is sampled at a rate of, for example, 6 kHz that results in 30 to 60 samples per pitch period.
- the number of samples in a pitch period will vary in accordance with the pitch frequency.
- speech from the loudspeaker of a hands-free telephone set impinging either directly or reverberatively on the sets microphone can be largely removed from the microphone output.
- the reverberant signal as well as the direct signal is suppressed because the unwanted speech parameters do not vary rapidly during voicing.
- speech from, for example, two talkers in the same room is detected by a multiplicity of microphones, and the speech of one talker is suppressed using speech parameters determined by combining the outputs of the microphones.
- the rearrangement of the Atal process constitutes in one aspect a filter; and more specifically, a comb filter with minima at the pitch frequency (and harmonics hereof) of the undesired speech.
- a comb filter with minima at the pitch frequency (and harmonics hereof) of the undesired speech This distinguishes the predictive filter of the present invention from a conventional echo canceler which merely replicates a reverberant signal and subtractively applies the replica to the composite signal.
- FIG. 1 is a communications network schematic block diagram containing a hands-free telephone and an inventive embodiment
- FIG. 2 is a schematic block diagram of the inventive predictive filter
- FIG. 3 is a schematic block diagram further delineating the inventive predictor
- FIGS. 4-6 are graphs depicting various characteristics of the predictor
- FIGS. 7 and 8 are two further embodiments of the invention in a communications network containing hands-free telephones.
- FIGS. 9 and 10 are schematic diagrams of the invention as applied to suppression of speech from talkers in a room.
- a hands-free telephone loudspeaker 1 and microphone 2 present in a reverberative enclosure 3 are shown in FIG. 1 connected to the speech processor of the present invention.
- the desired speech signal input to microphone 2 is from source 4, the near-end talker, whose signal denoted a travels mainly the direct path 5 and also reverberative paths not shown.
- Loudspeaker 1 which broadcasts the far-end talker signal, is a source of undesired input to microphone 2 either via the direct path denoted 6 or reverberative paths illustrated by path 7.
- the far-end talker direct path speech signal is denoted
- the speech processing network 8 in FIG. 1 consists of what will be called a predictive filter 9 connected in the microphone 2 output circuit.
- filter 9 consists of two parallel legs.
- the first leg is a predictor 11 which may be a network consisting of adelay network 12 and an amplifier 13.
- the second leg is a direct shunt path. Both legs are connected to a subtractor 10.
- the predictor 11 is controlled in a manner to be described, by a parameter extractor 14 connected in the loudspeaker l circuit.
- a low pass filter l5 advantageously 3 kHz and a 6 kHz sampler 16 are serially connected in the output circuit of microphone 2.
- a low pass filter 17 and a sampler 18 are in shunt relation to the loudspeaker ll input circuit and serially connected to parameter extractor l4.
- a waveform representing the far-end talker signal 0 is illustrated in FIG. 4. Because a speech signal is redundant-Le, the signal changes little in shape and length of pitch period from one pitch period to the next -the present form or value of signal c can be estimated by a linear prediction based on a past value of signal c.
- the signal 0 of FIG. 4 is shown made up of speech in consecutive pitch periods I, I 1 etc. lnherently, the speech signals in adjacent pitch periods of signal c are of unequal amplitude.
- a gain denoted b can be calculated (in a manner to be described) that when applied to the sampled signal of the pitch period 1 will cause the latter to approximate the sampled signal in the next pitch period 1
- the amplified signal of period I is subtractively combined with the signal value of period the result is the substantial filtering out of the signal 0.
- a composite signal a c containingsignal c is amplified during period 1 and subtractively combined with the composite signal a 0 during period 1 the same result obtains.
- W (the amplitude of sample n reaching subtractor via the direct path in predictive filter 9) is subtractively combined with W,, (the amplitude of the delayed sample) where k is the number of samples in a pitch period.
- the time window over which the parameters are evaluated is of the order of the pitch period to ensure that sufficient energy is present.
- a time window of 30 samples at a sampling rate of 6 kHz will include between one-half and all the samples in a given pitch period.
- input speech samples from sampler 18 are stored as frames of signals.
- the store content is then fed to an arithmetic unit which is part of parameter extractor 14, wherein for 30 samples, computational values of correlation X, are computed as follows:
- N can advantageously be in the range 30-60 samples.
- the computed values of X are then inspected in a peak locating network also part of extractor 14, to de termine the largest value of X, The value ofj is found such that X, is the maximum of all values of X.
- This particular value ofj is the delay parameter, k, which is supplied to predictive filter 9 as one parameter. It is seen that k is a variable delay and that the maximum value of X, is X
- the delay parameter k for a typical voiced segment is shown in FIG. 6.
- the gain parameter b is calculated by computing circuitry also in parameter extractor 14, that solves:
- the gain parameter b likewise is supplied to predictive filter 9.
- delay parameter k and gain parameter b is but one of several systems by which, from an analysis of the speech energy content in adjacent or substantially adjacent signal segments, parameters may be calculated that when applied to a past signal segment will render the latter closely similar to the shape of the present signal segment.
- incoming speech to loudspeaker 1 is continuously analyzed to extract therefrom an optimum delay parameter, and a gain factor. These parameters are periodically updated as for example, every 5 ms. When no incoming signal to loudspeaker l is present, the delay and gain are zero. With incoming signal, the calculated present signal value output of predictor 11 is subtracted from the undelayed, unamplified signal sample representing signals a c.
- the filter depicted in FIG. 3 and described above has a transfer function in Z transform notation.
- H(Z) l bZ' The magnitude of the frequency response of a typical embodiment of filter 9 is shown in FIG. 5 using predictor 11 where T is the sampling period.
- the frequency response for gain parameter b l are shown by the solid curves and for gain parameter bzl by the broken line. Since speech is dynamic during voicing, the parameters b and k have to be optimized as stated above, and readjusted periodically as, for example, every 5 ms.
- the parameters b and k calculated do not vary smoothly with time.
- the optimum delay occasionally doubles during voiced segments. Also, during unvoiced segments, the optimum delayvaries rapidly over a wide range while the correlation remains relatively low. However, the gains calculated are not negligible during these unvoiced portions. Desired speech a, uncorrelated with the undesired signal 0 which is to be rejected, is degraded when passed through a filter with these rapidly varying filter parameters, while under such conditions no additional suppression of the unwanted source is accomplished.
- the predictive filter 9 will be effective in removing part of the reverberant signal as well as the direct sound. Specifically, that part of the reverbcrant signal that has parameters not greatly different from the filter parameters will be reduced in amplitude.
- the far-end echo picked up by the microphone 2 from loudspeaker 1 is first reduced in amplitude during voiced segments by a speech processor 8 in the manner described previously.
- Gain and delay parameters b and k of the far-end speech are measured on the received loudspeaker signal, and the far-end echo component of the microphone signal is reduced by filtering.
- the remaining far-end signal at the output of the speech processor 8 is then removed V by the center-clipping echo suppressor.
- parameter delay circuit 19A which is serially connected between the output of parameter control 19 and predictive filter 9.
- Parameter delay circuit 19A advantageously is provided with a delay duration adjustment circuit 198 with which the delay duration may be set to correspond to the transit time which characterizes each given hands-free telephone.
- FIGS. 7 and 8 A combination of a predictive filter with a centerclipping echo suppressor of the type taught in D. A Berkley-O. M. M. Mitchell-J. R. Pierce U.S. Pat. No. 3,699,271 which is hereby incorporated by reference, is shown in FIGS. 7 and 8. This combination is a possible replacement for voice switching presently used for echo and feedback suppression.
- FIG. 7 shows a network denoted for eliminating the echo ofthe far-end talker in a 4-wire hands-free tel-
- the received signal is used to set the clipping levels by means of clipping control 22 so as just to remove the echo.
- the output of D/A converter 10A is fed to filter bank 40 which comprises plural contiguous band filters in the voice frequency range.
- filter bank 40 which comprises plural contiguous band filters in the voice frequency range.
- center clipper 41 the signal in each subband from filter 41 is center clipped at a level determined by clipping control 22 which measures in effect the'energy level in the received signal within each of the subbands.
- the output of clipper 41 is filtered in bank 42 which is similar to bank 40.
- the clipping control 22 is advantageously controlled also by the parameter extractor 14. Since the echo is reduced by the predictive filter 9 during voicing, the clipping levels can be reduced by substantially the same amount during voicing. Consequently in FIG. 7, a control signal is shown (dashed line) between the parameter extractor l4 and the clipping control 22, which causes an attenuation of the input to clipping level control 22 that is equal to the suppression achieved by speech processing network 8. It will be recognized that optimum performance of clipping level control 22 will be realized by inserting a delay in its input path to compensate for the already mentioned signal transit time between loudspeaker 1 and microphone 2. With the clipping levels thus reduced during voiced segments, there will be less mutilation of the near-end speech by the center-clipping process.
- FIG. 8 shows a circuit for eliminating both the farend echo (echo of far-end talker caused by acoustic coupling through room acoustics) and near-end echo (echo of near-end talker caused by imperfect hybrid junction) in a 2-wire hands-free telephone.
- the far-end echo is eliminated by network 50 as described above for FIG. 7.
- the near-end echo is eliminated by a similar circuit denoted 51 introduced on the receive side of the local 4-wire network as shown.
- circuit 51 An alternative method of adjusting the clipping level control by the parameter extractor 14 via the parameter delay 19A is shown in circuit 51.
- a second predictive filter designated 9a is used in circuit 51 to attenuate the clipping level control signal during voiced segments.
- the clipping levels follow the signal at the input to the narrow band center clipper, i.e., at the output of the predictive filter 9a.
- FIG. 9 shows the desired speech source 23 and an undesired source 24 both of whose speech signals form the input to microphones 25 and 26.
- the undesired source 24 is positioned so that the time delays for direct sound transmission to microphones 25 and 26 are equal.
- the output of microphone 25 is predictive filter 9 as microphone 26, enter the parameter extractor 30 wherein an arithmetic unit within the extractor calculates the computational values and A peak picking network within the extractor then selects the peaks from X, and Y, and a comparator finds the largest value peak which occurs in both sequence X, and Y for the same value ofj. This value ofj is the delay parameter k for the undesired speech supplied to the predictive filter 9.
- FIG. 10 An alternative method of extracting the parameters is shown in FIG. 10.
- Two additional microphones 27 and 28 are positioned so that time delays from desired speaker 24 for direct sound transmission to microphones 27 and 28 are equal to the time delays to microphones 25 and 26.
- the outputs of all microphones 25-28 are processed by a non-linear processor 31 as described in O. M. M. Mitchell-C. A. Ross-R. L. Wallace, Jr. U.S. Pat. No. 3,644,671, which is hereby incorporated by reference.
- the output of processor 31 contains the undesired signal and an attenuated and disturbed component of the desired signal. (The outputs may alternatively be added to merely attenuate the desired signal.)
- the output of the non-linear processor 31 enters the speech processing network 8.
- the output of microphone 25 is processed by speech processing network 8 which filters out the undesired talker 24 in the manner already described.
- speech processing network 8 which filters out the undesired talker 24 in the manner already described.
- the presence in the output of the nonlinear processor 31 of a small amount of the desired talker does not significantly affect the delay parameter k but will cause a small error in the evaluation of X and b.
- Speech processing apparatus for suppressing voiced segments of an undesired speech signal while leaving a desired speech signal intelligible, comprising:
- Apparatus in accordance with claim 1 further comprising means for deriving from the waveform of said undesired speech signal a gain parameter specifying the amount by which the amplitudes of corresponding values of said undesired speech signal in a past said interval must be respectively adjusted so as to produce a substantial duplicate of the undesired said speech signal of a present said interval; and which further comprises means controlled by said gain parameter for amplifying said delayed composite speech waveform prior to its being subtractively applied to said summer.
- a communications network comprising:
- a hands-free telephone station including a direct acoustic coupling path between the station loudspeaker and microphone, a second remote telephone station, and transmission means interconnecting said stations;
- a communications network pursuant to claim 4 wherein said deriving means comprises:
- X1 2 2 it) in) 71 I! ll further comprising means for rendering said gain parameter equal to zero in the absence of voiced segments of the signal in said loudspeaker path from said remote station.
- a communications network pursuant to claim 8, whefe is the speechigflal receive? by Said fi further comprising means for adjustably delaying armlcrolphone and W" the Speech slgnal recelved rival of said delay and gain parameters at said second by Sam P path by an amount that compensates for the transit meahs for Selectmg the largest Value P from the time delay over said direct acoustic coupling path of composite P ,Vahles Ofsaid Parameters 1 and 1 speech from said remote station. for the Same Value of the term j;
- a communications network pursuant to claim 4, means for pp y the desired and the undesired Said f th i i signals from one of said microphones to a summer filter bank means connected to the output of said irectly over a first path and alternately over a secsummer and comprising plural contiguous sub- 0nd path through a network including delay means; band s; and
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Telephone Function (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
Speech signal energy from an undesired source is suppressed by extracting from the undesired signal a delay parameter and a gain parameter. These parameters control a delay and gain network through which both the desired and undesired signals are routed. The delayed, amplified signal, when applied to a subtractor along with the undelayed signal, suppresses the undesired signal. The invention is applied to several hands-free telephony situations and to the suppression of one of two speakers in a room.
Description
Tlnite States atent ,Tan.0,1974
Berkley et a1.
1 1 SPEECH SUPPRESSION BY PREDICTIVE FlLTERllNG [75] Inventors: David Arthur Berkley, New York,
N.Y.; Olga Mary Mracek Mitchell, Summit, NJ.
[73] Assignee: Bell Telephone Laboratories,
Incorporated, Murray Hill, NJ.
[22] Filed: Dec. 3, 1971 {21] Appl. No.: 204,509
[52] 11.5. C1 179/1 HF, 179/1 P, 179/1 FS [51] lnt. Cl. H04m l/20, H04b 15/00 [58] Field 01 Search 179/1 P, 1 VC, 1 HF,
179/1 F, l.FS, 1 SA, 15.55 R, 100.2 K, 81 B, 170.8; 324/473-476 [56] References Cited UNITED STATES PATENTS 3,177,489 4/1965 Saltzberg 325/476 3,631,520 12/1971 Atal 179/1 SA aw SPEECH 111 SIGNAL Q 3,133,990 5/1964 S6618) 179/1 P 3,644,674 2/1972 Mitchell 179/1 P 3,601,549 8/1971 M1IC11811 179/1 FS 3,603,744 Krasin 179/81 B Primary Examinen-William C. Cooper Assistant Examiner-Jon Bradford Leaheey A ttorngy -firavesn [57] ABSTRACT Speech signal energy from an undesired source is suppressed by extracting from the undesired signal a delay parameter and a gain parameter. These parameters control a delay and gain network through which both the desired and undesired signals are routed. The
- delayed, amplified signal, when applied to a subtractor along with the undelayed signal, suppresses the undesired signal. The invention is applied to several handsfree telephony situations and to the suppression of one of two speakers in a room.
F I DELAY l3 II NETWORK I (h SAMPLES) I 2 19A 1 (5) I? w coNv RTER I PATENTEBJAN 8W4 SHEET 2 [1F 4 SIGNAL;
FIG. 5
SPECTRAL MAGNITUDE vs. FREQUENCY FOR PREDICTIVE FILTER OF FIG. 3
montzoiz PRLTHZ l FREQUENCY FIG. 6
DELAY PARAMETER R FOR A TYPICAL VOICED SEGMENT GEE/5 1 TIME (msEg) SPEECH SUPPRESSION BY PREDTCTIVIE FTILTEIRING FIELD OF THE INVENTION This invention relates to speech signal processing, and in particular to reducing the energy content of that part of a composite speech signal attributable to an undesired source.
BACKGROUND OF THE INVENTION In telephony and elsewhere,,it often happens that speech from a source the listener wishes to hear is seriously impaired in intelligibility by speech from a second, undesired source. Numerous expedients to reduce the effects of the second source have been proposed. These involve relative enhancement of the desired speech signal, rendering the undesired signal relatively unintelligible or reducing the energy of the undesired signal. Regardless of the approach, the result typically has been that the desired signal is more intelligible than it would be in the absence of the processing. The hands-free" telephone well exemplifies this problem of'conflicting speech sources, because its electroacoustic speaker constitutes a potential source of undesired signal at the microphone of the same station.
Accordingly, a general object of the invention is to reduce the energy from an undesired speech source in a composite signal containing desired speech.
Another object of the invention is tosuppress an undesired speech signal in an electronic communications channel.
A specific object of the invention is to render a desired speech signal relatively more intelligible despite the presence of undesired speech energy.
A particular inventive object is to achieve the foregoing objects in the hands-free telephony situation.
Another specific object of the invention is to avoid voice switching functions and thus enable full-time duplex operation of a hands-free telephone channel.
Yet another inventive object is to distinguish one talker from another nearby talker and to suppress speech signals from one of them.
SUMMARY OF THE INVENTION The invention is grounded in the general recognition that an unwanted speech signal can be rejected on the basis of its speech parameters.
A discussion of certain speech parameters is found in the patent application of B. S. Atal, Ser. No. 753,408, filed Aug. 19, 1968 now Pat. No. 3,631,520 and assigned to applicants assignee. A predictive coding technique for reducing transmission bandwidth needs is therein disclosed by Atal in which an estimate of the present value of a speech sample is made based on a known corresponding past value. From these data, a difference or error signal is generated, and transmitted to a remote receiving station along with certain predictor parameters. At the remote receiving station the entire signal is reconstituted from the error signal, using the predictor parameters. 7 It has been realized that the generic process as represented by the Atal disclosure can be rearranged'soas to substantially eliminate a given undesired voice signal.
The basic concept contemplated by the present invention is to extract, from the undesired signal, a gain parameter and a delay parameter. These parameters control a delay and gain network through which both desired speech signals and the undesired signal are routed. The delay is approximately equal to the current duration of the pitch period of the undesired speech. The gain is calculated, in accordance with one of several possible formulas, so as to bring the delayed unwanted signal to the amplitude level of the present value of the unwanted signal. Alternatively, in a technically less complex embodiment, the gain is set equal to l. In either case, the network output is then subtractively applied to the unwanted signal or to any composite signal containing the unwanted signal. The process may be carried out in analog or digital fashion.
Advantageously however, the process is carried out by sampling techniques where the signal is sampled at a rate of, for example, 6 kHz that results in 30 to 60 samples per pitch period. The number of samples in a pitch period will vary in accordance with the pitch frequency.
In one embodiment pursuant to the invention, speech from the loudspeaker of a hands-free telephone set impinging either directly or reverberatively on the sets microphone, can be largely removed from the microphone output. The reverberant signal as well as the direct signal is suppressed because the unwanted speech parameters do not vary rapidly during voicing.
In another embodiment pursuant to the invention, speech from, for example, two talkers in the same room is detected by a multiplicity of microphones, and the speech of one talker is suppressed using speech parameters determined by combining the outputs of the microphones.
It will be apparent that the rearrangement of the Atal process constitutes in one aspect a filter; and more specifically, a comb filter with minima at the pitch frequency (and harmonics hereof) of the undesired speech. This distinguishes the predictive filter of the present invention from a conventional echo canceler which merely replicates a reverberant signal and subtractively applies the replica to the composite signal.
The invention and its further objects, features, and advantages will be readily discerned in detail from a reading of the description to follow of illustrative embodiments.
BRIEF DESCRIPTION OF THE DRAWING FIG. 1 is a communications network schematic block diagram containing a hands-free telephone and an inventive embodiment;
FIG. 2 is a schematic block diagram of the inventive predictive filter;
FIG. 3 is a schematic block diagram further delineating the inventive predictor;
FIGS. 4-6 are graphs depicting various characteristics of the predictor;
FIGS. 7 and 8 are two further embodiments of the invention in a communications network containing hands-free telephones; and
FIGS. 9 and 10 are schematic diagrams of the invention as applied to suppression of speech from talkers in a room.
DETAILED DESCRIPTION OF INVENTIVE EMBODIMENTS Hands-free Telephone Situations In the first inventive embodiment, a hands-free telephone loudspeaker 1 and microphone 2 present in a reverberative enclosure 3 are shown in FIG. 1 connected to the speech processor of the present invention. Usually, the desired speech signal input to microphone 2 is from source 4, the near-end talker, whose signal denoted a travels mainly the direct path 5 and also reverberative paths not shown. Loudspeaker 1 which broadcasts the far-end talker signal, is a source of undesired input to microphone 2 either via the direct path denoted 6 or reverberative paths illustrated by path 7. The far-end talker direct path speech signal is denoted The speech processing network 8 in FIG. 1 consists of what will be called a predictive filter 9 connected in the microphone 2 output circuit. As seen in FIGS. 2 and 3, filter 9 consists of two parallel legs. The first leg is a predictor 11 which may be a network consisting of adelay network 12 and an amplifier 13. The second leg is a direct shunt path. Both legs are connected to a subtractor 10. The predictor 11 is controlled in a manner to be described, by a parameter extractor 14 connected in the loudspeaker l circuit.
Pursuant to one embodiment, the invention is carried out digitally. A low pass filter l5 advantageously 3 kHz and a 6 kHz sampler 16 are serially connected in the output circuit of microphone 2. Similarly, a low pass filter 17 and a sampler 18 are in shunt relation to the loudspeaker ll input circuit and serially connected to parameter extractor l4.
A waveform representing the far-end talker signal 0 is illustrated in FIG. 4. Because a speech signal is redundant-Le, the signal changes little in shape and length of pitch period from one pitch period to the next -the present form or value of signal c can be estimated by a linear prediction based on a past value of signal c.
The signal 0 of FIG. 4 is shown made up of speech in consecutive pitch periods I, I 1 etc. lnherently, the speech signals in adjacent pitch periods of signal c are of unequal amplitude. Thus, a gain denoted b can be calculated (in a manner to be described) that when applied to the sampled signal of the pitch period 1 will cause the latter to approximate the sampled signal in the next pitch period 1 If then the amplified signal of period I, is subtractively combined with the signal value of period the result is the substantial filtering out of the signal 0. In like manner, if a composite signal a c containingsignal c is amplified during period 1 and subtractively combined with the composite signal a 0 during period 1 the same result obtains.
Thus, in mathematical terms, W, (the amplitude of sample n reaching subtractor via the direct path in predictive filter 9) is subtractively combined with W,, (the amplitude of the delayed sample) where k is the number of samples in a pitch period. Advantageously, the time window over which the parameters are evaluated is of the order of the pitch period to ensure that sufficient energy is present. A time window of 30 samples at a sampling rate of 6 kHz will include between one-half and all the samples in a given pitch period.
Since speech is only quasi-stationary during voicing, the gain parameter b and delay parameter k have to be periodically calculated. This is accomplished in the digital parameter extractor 14 pursuant to the teaching of the aforementioned Atal patent application Ser. No. 753,408.
As taught therein, input speech samples from sampler 18 are stored as frames of signals. The store content is then fed to an arithmetic unit which is part of parameter extractor 14, wherein for 30 samples, computational values of correlation X, are computed as follows:
where N can advantageously be in the range 30-60 samples.
The computed values of X, are then inspected in a peak locating network also part of extractor 14, to de termine the largest value of X, The value ofj is found such that X, is the maximum of all values of X. This particular value ofj is the delay parameter, k, which is supplied to predictive filter 9 as one parameter. It is seen that k is a variable delay and that the maximum value of X, is X The delay parameter k for a typical voiced segment is shown in FIG. 6.
The gain parameter b is calculated by computing circuitry also in parameter extractor 14, that solves:
The gain parameter b likewise is supplied to predictive filter 9.
The described calculation of delay parameter k and gain parameter b is but one of several systems by which, from an analysis of the speech energy content in adjacent or substantially adjacent signal segments, parameters may be calculated that when applied to a past signal segment will render the latter closely similar to the shape of the present signal segment.
Thus, incoming speech to loudspeaker 1 is continuously analyzed to extract therefrom an optimum delay parameter, and a gain factor. These parameters are periodically updated as for example, every 5 ms. When no incoming signal to loudspeaker l is present, the delay and gain are zero. With incoming signal, the calculated present signal value output of predictor 11 is subtracted from the undelayed, unamplified signal sample representing signals a c.
Reconversion to analog form of the signal in the microphone 2 output circuit is achieved in D/A converter 10A.
The filter depicted in FIG. 3 and described above has a transfer function in Z transform notation.
H(Z) l bZ' The magnitude of the frequency response of a typical embodiment of filter 9 is shown in FIG. 5 using predictor 11 where T is the sampling period.
The frequency response for gain parameter b l are shown by the solid curves and for gain parameter bzl by the broken line. Since speech is dynamic during voicing, the parameters b and k have to be optimized as stated above, and readjusted periodically as, for example, every 5 ms.
Filtering of the input speech by the calculated parameters results in suppression during voicing of up to 30 dB during voiced segments of the undesired signal c,
and an average suppression of about 14 dB of the undesired signal 0.
The parameters b and k calculated do not vary smoothly with time. The optimum delay occasionally doubles during voiced segments. Also, during unvoiced segments, the optimum delayvaries rapidly over a wide range while the correlation remains relatively low. However, the gains calculated are not negligible during these unvoiced portions. Desired speech a, uncorrelated with the undesired signal 0 which is to be rejected, is degraded when passed through a filter with these rapidly varying filter parameters, while under such conditions no additional suppression of the unwanted source is accomplished.
To avoid this difficulty, logic is introduced pursuant to one facet of the invention, to prevent undesirable variation of the filter parameters b and k. It was determined that not much suppression was obtained when the correlation X was less than 0.85. Consequently gain b is set equal to zero for X 0.85. This is achieved by parameter control circuit 19 (FIG. l) which sets b equal to zero for X 0.85. This choice of X is a compromise between one as great as possible and one low enough so that all of the voiced segments of speech are suppressed. The resulting suppression during voicing is unchanged while degradation ofa second speech is reduced. FIG. 6 shows the variation of delay parameter k during a typical voiced segment.
Since the parameters b and k vary relatively slowly during voiced segments, the predictive filter 9 will be effective in removing part of the reverberant signal as well as the direct sound. Specifically, that part of the reverbcrant signal that has parameters not greatly different from the filter parameters will be reduced in amplitude.
The foregoing discussion of the invention as applied to hands-free telephony has assumed no separation between loudspeaker l and microphone 2. In practice, however, a significant transit time for the signal 0 to travel path 6 to microphone 2 is required. It is therefore necesssary to compensate in speech processing network 8 for the loudspeaker-microphone transit time.
ephone. Like numerals denote items which correspond to counterparts in FIGS. 1-3. The far-end echo picked up by the microphone 2 from loudspeaker 1 is first reduced in amplitude during voiced segments by a speech processor 8 in the manner described previously. Gain and delay parameters b and k of the far-end speech are measured on the received loudspeaker signal, and the far-end echo component of the microphone signal is reduced by filtering. The remaining far-end signal at the output of the speech processor 8 is then removed V by the center-clipping echo suppressor.
This is achieved by parameter delay circuit 19A which is serially connected between the output of parameter control 19 and predictive filter 9. Parameter delay circuit 19A advantageously is provided with a delay duration adjustment circuit 198 with which the delay duration may be set to correspond to the transit time which characterizes each given hands-free telephone.
A combination of a predictive filter with a centerclipping echo suppressor of the type taught in D. A Berkley-O. M. M. Mitchell-J. R. Pierce U.S. Pat. No. 3,699,271 which is hereby incorporated by reference, is shown in FIGS. 7 and 8. This combination is a possible replacement for voice switching presently used for echo and feedback suppression.
FIG. 7 shows a network denoted for eliminating the echo ofthe far-end talker in a 4-wire hands-free tel- As taught in D. A. Berkley et al. U.S. Pat. No. 3,699,271, the received signal is used to set the clipping levels by means of clipping control 22 so as just to remove the echo. The output of D/A converter 10A is fed to filter bank 40 which comprises plural contiguous band filters in the voice frequency range. In center clipper 41 the signal in each subband from filter 41 is center clipped at a level determined by clipping control 22 which measures in effect the'energy level in the received signal within each of the subbands. The output of clipper 41 is filtered in bank 42 which is similar to bank 40.
In this embodiment, the clipping control 22 is advantageously controlled also by the parameter extractor 14. Since the echo is reduced by the predictive filter 9 during voicing, the clipping levels can be reduced by substantially the same amount during voicing. Consequently in FIG. 7, a control signal is shown (dashed line) between the parameter extractor l4 and the clipping control 22, which causes an attenuation of the input to clipping level control 22 that is equal to the suppression achieved by speech processing network 8. It will be recognized that optimum performance of clipping level control 22 will be realized by inserting a delay in its input path to compensate for the already mentioned signal transit time between loudspeaker 1 and microphone 2. With the clipping levels thus reduced during voiced segments, there will be less mutilation of the near-end speech by the center-clipping process.
FIG. 8 shows a circuit for eliminating both the farend echo (echo of far-end talker caused by acoustic coupling through room acoustics) and near-end echo (echo of near-end talker caused by imperfect hybrid junction) in a 2-wire hands-free telephone. The far-end echo is eliminated by network 50 as described above for FIG. 7. The near-end echo is eliminated by a similar circuit denoted 51 introduced on the receive side of the local 4-wire network as shown.
An alternative method of adjusting the clipping level control by the parameter extractor 14 via the parameter delay 19A is shown in circuit 51. A second predictive filter designated 9a is used in circuit 51 to attenuate the clipping level control signal during voiced segments. Thus the clipping levels follow the signal at the input to the narrow band center clipper, i.e., at the output of the predictive filter 9a.
Suppression of One of Two Room Speakers A further embodiment allows the suppression of the speech signal from one of two talkers in a room. FIG. 9 shows the desired speech source 23 and an undesired source 24 both of whose speech signals form the input to microphones 25 and 26. The undesired source 24 is positioned so that the time delays for direct sound transmission to microphones 25 and 26 are equal. In the output of microphone 25 is predictive filter 9 as microphone 26, enter the parameter extractor 30 wherein an arithmetic unit within the extractor calculates the computational values and A peak picking network within the extractor then selects the peaks from X, and Y, and a comparator finds the largest value peak which occurs in both sequence X, and Y for the same value ofj. This value ofj is the delay parameter k for the undesired speech supplied to the predictive filter 9.
An alternative method of extracting the parameters is shown in FIG. 10. Two additional microphones 27 and 28 are positioned so that time delays from desired speaker 24 for direct sound transmission to microphones 27 and 28 are equal to the time delays to microphones 25 and 26. The outputs of all microphones 25-28 are processed by a non-linear processor 31 as described in O. M. M. Mitchell-C. A. Ross-R. L. Wallace, Jr. U.S. Pat. No. 3,644,671, which is hereby incorporated by reference. The output of processor 31 contains the undesired signal and an attenuated and disturbed component of the desired signal. (The outputs may alternatively be added to merely attenuate the desired signal.) The output of the non-linear processor 31 enters the speech processing network 8. The output of microphone 25 is processed by speech processing network 8 which filters out the undesired talker 24 in the manner already described. The presence in the output of the nonlinear processor 31 of a small amount of the desired talker does not significantly affect the delay parameter k but will cause a small error in the evaluation of X and b.
[t is to be understood that the embodiments described herein are merely illustrative of the principles of the invention. Various modifications may be made thereto by persons skilled in the art without departing from the spirit and scope of the invention.
What is claimed is:
1. Speech processing apparatus for suppressing voiced segments of an undesired speech signal while leaving a desired speech signal intelligible, comprising:
means forderiving an electronic waveform representing the undesired speech signal;
means for deriving an electronic waveform representing a composite signal containing a reverberant version of the undesired speech signal and the desired signal;
means for deriving from the waveform of said undesired speech signal a delay parameter determined from the signal values during an interval embracing a substantial portion ofa pitch period of said undesired speech signal;
means for applying said composite speech signal waveform to a summer over a first path;
means for delaying in a second path said composite speech signal waveform by an amount of said delay parameter; and means for subtractively applying to said summer the delayed said composite speech waveform.
2. Apparatus pursuant to claim 1, further comprising means responsive to the absence of voiced segments of said undesired speech signal for interrupting said second path.
3. Apparatus in accordance with claim 1 further comprising means for deriving from the waveform of said undesired speech signal a gain parameter specifying the amount by which the amplitudes of corresponding values of said undesired speech signal in a past said interval must be respectively adjusted so as to produce a substantial duplicate of the undesired said speech signal of a present said interval; and which further comprises means controlled by said gain parameter for amplifying said delayed composite speech waveform prior to its being subtractively applied to said summer.
4. A communications network comprising:
a hands-free telephone station including a direct acoustic coupling path between the station loudspeaker and microphone, a second remote telephone station, and transmission means interconnecting said stations;
means for derivingfrom the incoming signal waveform to said loudspeaker from said second stationa delay parameter representing the duration of an interval embracing a substantial portion of the present pitch period of speech from the remote station; and a gain parameter specifying the amount by which the waveform in a past said interval must be changed in amplitude to substantially correspond to the undesired speech waveform of the present interval;
means for applying the composite signal-consisting of the desired near-end talker signal and the acoustically coupled far-end talker signal from said loudspeaker-in said microphone output to a summer over a first path;
means disposed in a second path for delaying said composite signal by the amount of said delay parameter and for amplifying the delayed composite signal an amount determined by said gain parameter; and
means for subtractively applying the delayed, amplified composite signal to said summer.
5. A communications network pursuant to claim 4 wherein said deriving means comprises:
a signal sampler connected to the circuit of said loudspeaker and operating at a set sampling rate; and
means for computing values of a term X, in accordance with the relationship where W is the amplitude of a sample n reaching said sampler, and means for finding that value ofj such that X,- is the maximum of all values of X,, the
9 110 found value of j constituting said delay parameter. means for center-clipping the output of each said fil- 6. A communications network pursuant to claim ter bank subband a varying amount in response to wherein said gain parameter deriving means comprises the concurrent said energy level value; and means for computing the value b in accordance with means connecting said control signal producing the relationship 5 means and said deriving means responsive to voiced portions of signal from said remote station for reducing all said clipping levels.
7 11. Speech processing apparatus for suppressing 2 n n-k speech from one of two talkers in a room comprising: b=% 10 first and second microphones located equidistant 2 2 from the first, desired said talker but at unequal distances from the second, undesired said talker;
means for deriving from the two said microphone outputs a first parameter X, in accordance with the 1 r h where W, is the amplitude of a sample n reaching said re a Ions sampler, and k is a delay parameter for a voiced segment. N l l/2 I 1/2 7. A communications network pursuant to claim 6, X1: 2 2 it) in) 71 I! ll further comprising means for rendering said gain parameter equal to zero in the absence of voiced segments of the signal in said loudspeaker path from said remote station.
8. A communications network-pursuant to claim 7, 1/2
further comprising means for setting said gain parame- N I WT?) 1/2 W2 and a second parameter Y5, respectively, calculated in accordance with the relationship ter equal to zero in response to values of X corresponding to the maximum computed values of X, which are less than a critical predetermined value.
9. A communications network pursuant to claim 8, whefe is the speechigflal receive? by Said fi further comprising means for adjustably delaying armlcrolphone and W" the Speech slgnal recelved rival of said delay and gain parameters at said second by Sam P path by an amount that compensates for the transit meahs for Selectmg the largest Value P from the time delay over said direct acoustic coupling path of composite P , Vahles Ofsaid Parameters 1 and 1 speech from said remote station. for the Same Value of the term j;
10. A communications network pursuant to claim 4, means for pp y the desired and the undesired Said f th i i signals from one of said microphones to a summer filter bank means connected to the output of said irectly over a first path and alternately over a secsummer and comprising plural contiguous sub- 0nd path through a network including delay means; band s; and
means for producing-from said remote station 40 means for adjusting said delay means as a function of speech signalcontrol signals representative of the the value of the term j. incoming speech energy level in each said subband;
Claims (11)
1. Speech processing apparatus for suppressing voiced segments of an undesired speech signal while leaving a desired speech signal intelligible, comprising: means for deriving an electronic waveform representing the undesired speech signal; means for deriving an electronic waveform representing a composite signal containing a reverberant version of the undesired speech signal and the desired signal; means for deriving from the waveform of said undesired speech signal a delay parameter determined from the signal values during an interval embracing a substantial portion of a pitch period of said undesired speech signal; means for applying said composite speech signal waveform to a summer over a first path; means for delaying in a second path said composite speech signal waveform by an amount of said delay parameter; and means for subtractively applying to said summer the delayed said composite speech waveform.
2. Apparatus pursuant to claim 1, further comprising means responsive to the absence of voiced segments of said undesired speech signal for interrupting said second path.
3. Apparatus in accordance with claim 1 further comprising means for deriving from the waveform of said undesired speech signal a gain parameter specifying the amount by which the amplitudes of corresponding values of said undesired speech signal in a past said interval must be respectively adjusted so as to produce a substantial duplicate of the undesired said speech signal of a present said interval; and which further comprises means controlled by said gain parameter for amplifying said delayed composite speech waveform prior to its being subtractively applied to said summer.
4. A communications network comprising: a hands-free telephone station including a direct acoustic coupling path between the station loudspeaker and microphone, a second remote telephone station, and transmission means interconnecting said stations; means for deriving-from the incoming signal waveform to said loudspeaker from said second station-a delay parameter representing the duration of an interval embracing a substantial portion of the present pitch period of speech from the remote station; and a gain parameter specifying the amount by which the waveform in a past said interval must be changed in amplitude to substantially correspond to the undesired speech waveform of the present interval; means for applying the composite signal-consisting of the desired near-end talker signal and the acoustically coupled far-end talker signal from said loudspeaker-in said microphone output to a summer over a first path; means disposed in a second path for delaying said composite signal by the amount of said delay parameter and for amplifying the delayed composite signal an amount determined by said gain parameter; and means for subtractively applying the delayed, amplified composite signal to said summer.
5. A communications network pursuant to claim 4 wherein said deriving means comprises: a signal sampler connected to the circuit of said loudspeaker and operating at a set sampling rate; and means for computing values of a term Xj in accordance with the relationship
6. A communications network pursuant to claim 5 wherein said gain parameter deriving means comprises means for computing the value b in accordance with the relationship
7. A communications network pursuant to claim 6, further comprising means for rendering said gain parameter equal to zero in the absence of voiced segments of the signal in said loudspeaker path from said remote station.
8. A communications network pursuant to claim 7, further comprising means for setting said gain parameter equal to zero in response to values of Xk, corresponding to the maximum computed values of Xj which are less than a critical predetermined value.
9. A communications network pursuant to claim 8, further comprising means for adjustably delaying arrival of said delay and gain parameters at said second path by an amount that compensates for the transit time delay over said direct acoustic coupling path of speech from said remote station.
10. A communications network pursuant to claim 4, further comprising: filter bank means connected to the output of said summer and comprising plural contiguous subbands; means for producing-from said remote station speech signal-control signals representative of the incoming speech energy level in each said subband; means for center-clipping the output of each said filter bank subband a varying amOunt in response to the concurrent said energy level value; and means connecting said control signal producing means and said deriving means responsive to voiced portions of signal from said remote station for reducing all said clipping levels.
11. Speech processing apparatus for suppressing speech from one of two talkers in a room comprising: first and second microphones located equidistant from the first, desired said talker but at unequal distances from the second, undesired said talker; means for deriving from the two said microphone outputs a first parameter Xj in accordance with the relationship
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US20450971A | 1971-12-03 | 1971-12-03 |
Publications (1)
Publication Number | Publication Date |
---|---|
US3784747A true US3784747A (en) | 1974-01-08 |
Family
ID=22758195
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US00204509A Expired - Lifetime US3784747A (en) | 1971-12-03 | 1971-12-03 | Speech suppression by predictive filtering |
Country Status (5)
Country | Link |
---|---|
US (1) | US3784747A (en) |
JP (1) | JPS55760B2 (en) |
CA (1) | CA952439A (en) |
DE (1) | DE2207141C3 (en) |
GB (1) | GB1369711A (en) |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3922488A (en) * | 1972-12-15 | 1975-11-25 | Ard Anstalt | Feedback-cancelling electro-acoustic transducer apparatus |
US4024358A (en) * | 1975-10-31 | 1977-05-17 | Communications Satellite Corporation (Comsat) | Adaptive echo canceller using differential pulse code modulation encoding |
US4031338A (en) * | 1976-02-10 | 1977-06-21 | Communications Satellite Corporation (Comsat) | Echo suppressor using frequency-selective center clipping |
US4122303A (en) * | 1976-12-10 | 1978-10-24 | Sound Attenuators Limited | Improvements in and relating to active sound attenuation |
US4166924A (en) * | 1977-05-12 | 1979-09-04 | Bell Telephone Laboratories, Incorporated | Removing reverberative echo components in speech signals |
FR2451676A1 (en) * | 1979-03-12 | 1980-10-10 | Soumagne Joel | ECHO DETECTOR IN PARTICULAR FOR SPEECH INTERPOLATION COMMUNICATION SYSTEM |
US4360708A (en) * | 1978-03-30 | 1982-11-23 | Nippon Electric Co., Ltd. | Speech processor having speech analyzer and synthesizer |
EP0106640A1 (en) * | 1982-10-15 | 1984-04-25 | British Telecommunications | Noise control circuit |
US4473906A (en) * | 1980-12-05 | 1984-09-25 | Lord Corporation | Active acoustic attenuator |
US4591670A (en) * | 1982-09-30 | 1986-05-27 | Nec Corporation | Echo canceller and echo suppressor for frequency divisional attenuation of acoustic echoes |
EP0204718A1 (en) * | 1984-12-14 | 1986-12-17 | Motorola Inc | Full duplex speakerphone for radio and landline telephones. |
US4670903A (en) * | 1981-06-30 | 1987-06-02 | Nippon Electric Co., Ltd. | Echo canceller for attenuating acoustic echo signals on a frequency divisional manner |
US4819263A (en) * | 1986-06-30 | 1989-04-04 | Cellular Communications Corporation | Apparatus and method for hands free telephonic communication |
US4825384A (en) * | 1981-08-27 | 1989-04-25 | Canon Kabushiki Kaisha | Speech recognizer |
EP0472356A1 (en) * | 1990-08-16 | 1992-02-26 | Fujitsu Ten Limited | Speech recognition apparatus for a vehicle, using a microphone arrangement to determine the seat from which a command is generated |
US5619566A (en) * | 1993-08-27 | 1997-04-08 | Motorola, Inc. | Voice activity detector for an echo suppressor and an echo suppressor |
EP0881814A1 (en) * | 1997-05-28 | 1998-12-02 | Deutsche Telekom AG | Method for determining the step size Alpha for adjusting the convergence speed in the NLMS algorithm |
WO2001035118A1 (en) * | 1999-11-05 | 2001-05-17 | Wavemakers Research, Inc. | Method to determine whether an acoustic source is near or far from a pair of microphones |
US6249581B1 (en) * | 1997-08-01 | 2001-06-19 | Bitwave Pte. Ltd. | Spectrum-based adaptive canceller of acoustic echoes arising in hands-free audio |
US6442275B1 (en) | 1998-09-17 | 2002-08-27 | Lucent Technologies Inc. | Echo canceler including subband echo suppressor |
US20030219112A1 (en) * | 2002-05-22 | 2003-11-27 | Boland Simon Daniel | Apparatus and method for echo control |
US20030219087A1 (en) * | 2002-05-22 | 2003-11-27 | Boland Simon Daniel | Apparatus and method for time-alignment of two signals |
EP1739654A2 (en) | 1999-09-08 | 2007-01-03 | Volkswagen AG | Method for operating a multiple microphone system in a motor vehicle and multiple microphone system itself |
US7734034B1 (en) | 2005-06-21 | 2010-06-08 | Avaya Inc. | Remote party speaker phone detection |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997028917A1 (en) * | 1996-02-06 | 1997-08-14 | Shinagawa Refractories Co., Ltd. | Immersion nozzle replacement apparatus |
US11423921B2 (en) | 2018-06-11 | 2022-08-23 | Sony Corporation | Signal processing device, signal processing method, and program |
CN112203188B (en) * | 2020-07-24 | 2021-10-01 | 北京工业大学 | Automatic volume adjusting method |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3133990A (en) * | 1962-04-27 | 1964-05-19 | Altec Lansing Corp | Automatic level-adjustment circuit |
US3177489A (en) * | 1960-01-11 | 1965-04-06 | Thompson Ramo Wooldridge Inc | Interference suppression systems |
US3601549A (en) * | 1969-11-25 | 1971-08-24 | Bell Telephone Labor Inc | Switching circuit for cancelling the direct sound transmission from the loudspeaker to the microphone in a loudspeaking telephone set |
US3603744A (en) * | 1965-09-29 | 1971-09-07 | Superior Continental Corp | Line tap unit for telephone system |
US3631520A (en) * | 1968-08-19 | 1971-12-28 | Bell Telephone Labor Inc | Predictive coding of speech signals |
US3644674A (en) * | 1969-06-30 | 1972-02-22 | Bell Telephone Labor Inc | Ambient noise suppressor |
-
1971
- 1971-12-03 US US00204509A patent/US3784747A/en not_active Expired - Lifetime
-
1972
- 1972-01-27 GB GB382872A patent/GB1369711A/en not_active Expired
- 1972-02-18 JP JP1655372A patent/JPS55760B2/ja not_active Expired
- 1972-03-29 CA CA138,445A patent/CA952439A/en not_active Expired
- 1972-12-16 DE DE2207141A patent/DE2207141C3/en not_active Expired
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3177489A (en) * | 1960-01-11 | 1965-04-06 | Thompson Ramo Wooldridge Inc | Interference suppression systems |
US3133990A (en) * | 1962-04-27 | 1964-05-19 | Altec Lansing Corp | Automatic level-adjustment circuit |
US3603744A (en) * | 1965-09-29 | 1971-09-07 | Superior Continental Corp | Line tap unit for telephone system |
US3631520A (en) * | 1968-08-19 | 1971-12-28 | Bell Telephone Labor Inc | Predictive coding of speech signals |
US3644674A (en) * | 1969-06-30 | 1972-02-22 | Bell Telephone Labor Inc | Ambient noise suppressor |
US3601549A (en) * | 1969-11-25 | 1971-08-24 | Bell Telephone Labor Inc | Switching circuit for cancelling the direct sound transmission from the loudspeaker to the microphone in a loudspeaking telephone set |
Cited By (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3922488A (en) * | 1972-12-15 | 1975-11-25 | Ard Anstalt | Feedback-cancelling electro-acoustic transducer apparatus |
US4024358A (en) * | 1975-10-31 | 1977-05-17 | Communications Satellite Corporation (Comsat) | Adaptive echo canceller using differential pulse code modulation encoding |
US4031338A (en) * | 1976-02-10 | 1977-06-21 | Communications Satellite Corporation (Comsat) | Echo suppressor using frequency-selective center clipping |
US4122303A (en) * | 1976-12-10 | 1978-10-24 | Sound Attenuators Limited | Improvements in and relating to active sound attenuation |
US4166924A (en) * | 1977-05-12 | 1979-09-04 | Bell Telephone Laboratories, Incorporated | Removing reverberative echo components in speech signals |
US4360708A (en) * | 1978-03-30 | 1982-11-23 | Nippon Electric Co., Ltd. | Speech processor having speech analyzer and synthesizer |
FR2451676A1 (en) * | 1979-03-12 | 1980-10-10 | Soumagne Joel | ECHO DETECTOR IN PARTICULAR FOR SPEECH INTERPOLATION COMMUNICATION SYSTEM |
US4473906A (en) * | 1980-12-05 | 1984-09-25 | Lord Corporation | Active acoustic attenuator |
US4670903A (en) * | 1981-06-30 | 1987-06-02 | Nippon Electric Co., Ltd. | Echo canceller for attenuating acoustic echo signals on a frequency divisional manner |
US4825384A (en) * | 1981-08-27 | 1989-04-25 | Canon Kabushiki Kaisha | Speech recognizer |
US4591670A (en) * | 1982-09-30 | 1986-05-27 | Nec Corporation | Echo canceller and echo suppressor for frequency divisional attenuation of acoustic echoes |
EP0106640A1 (en) * | 1982-10-15 | 1984-04-25 | British Telecommunications | Noise control circuit |
EP0204718A4 (en) * | 1984-12-14 | 1988-03-30 | Motorola Inc | Full duplex speakerphone for radio and landline telephones. |
EP0204718A1 (en) * | 1984-12-14 | 1986-12-17 | Motorola Inc | Full duplex speakerphone for radio and landline telephones. |
US4819263A (en) * | 1986-06-30 | 1989-04-04 | Cellular Communications Corporation | Apparatus and method for hands free telephonic communication |
EP0472356A1 (en) * | 1990-08-16 | 1992-02-26 | Fujitsu Ten Limited | Speech recognition apparatus for a vehicle, using a microphone arrangement to determine the seat from which a command is generated |
US5619566A (en) * | 1993-08-27 | 1997-04-08 | Motorola, Inc. | Voice activity detector for an echo suppressor and an echo suppressor |
EP0881814A1 (en) * | 1997-05-28 | 1998-12-02 | Deutsche Telekom AG | Method for determining the step size Alpha for adjusting the convergence speed in the NLMS algorithm |
US6249581B1 (en) * | 1997-08-01 | 2001-06-19 | Bitwave Pte. Ltd. | Spectrum-based adaptive canceller of acoustic echoes arising in hands-free audio |
US6442275B1 (en) | 1998-09-17 | 2002-08-27 | Lucent Technologies Inc. | Echo canceler including subband echo suppressor |
EP1739654A2 (en) | 1999-09-08 | 2007-01-03 | Volkswagen AG | Method for operating a multiple microphone system in a motor vehicle and multiple microphone system itself |
WO2001035118A1 (en) * | 1999-11-05 | 2001-05-17 | Wavemakers Research, Inc. | Method to determine whether an acoustic source is near or far from a pair of microphones |
US20030219112A1 (en) * | 2002-05-22 | 2003-11-27 | Boland Simon Daniel | Apparatus and method for echo control |
US20030219087A1 (en) * | 2002-05-22 | 2003-11-27 | Boland Simon Daniel | Apparatus and method for time-alignment of two signals |
US7027593B2 (en) * | 2002-05-22 | 2006-04-11 | Avaya Technology Corp. | Apparatus and method for echo control |
US7043014B2 (en) * | 2002-05-22 | 2006-05-09 | Avaya Technology Corp. | Apparatus and method for time-alignment of two signals |
US7734034B1 (en) | 2005-06-21 | 2010-06-08 | Avaya Inc. | Remote party speaker phone detection |
Also Published As
Publication number | Publication date |
---|---|
GB1369711A (en) | 1974-10-09 |
DE2207141B2 (en) | 1980-10-09 |
CA952439A (en) | 1974-08-06 |
JPS55760B2 (en) | 1980-01-09 |
JPS4865813A (en) | 1973-09-10 |
DE2207141C3 (en) | 1981-07-30 |
DE2207141A1 (en) | 1973-08-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US3784747A (en) | Speech suppression by predictive filtering | |
Allen et al. | Multimicrophone signal‐processing technique to remove room reverberation from speech signals | |
CN1689072B (en) | Method and system for processing subband signals using adaptive filters | |
EP1602223B1 (en) | Echo canceller with reduced requirement for processing power | |
US6904146B2 (en) | Full duplex echo cancelling circuit | |
JP3228940B2 (en) | Method and apparatus for reducing residual far-end echo in voice communication networks | |
RU2109408C1 (en) | Line echo suppressor | |
US8306215B2 (en) | Echo canceller for eliminating echo without being affected by noise | |
KR100482396B1 (en) | Device for suppressing interference component of input signal | |
US5390244A (en) | Method and apparatus for periodic signal detection | |
US20080170706A1 (en) | Method And Device For Removing Echo In A Multi-Channel Audio Signal | |
EP1715669A1 (en) | A method for removing echo in an audio signal | |
EP0853844B1 (en) | Echo cancelling system for digital telephony applications | |
JPS583430A (en) | Suppressing device of detouring signal | |
CN110956975B (en) | Echo cancellation method and device | |
KR20130040194A (en) | Method and device for suppressing residual echoes | |
JP3507020B2 (en) | Echo suppression method, echo suppression device, and echo suppression program storage medium | |
KR100470523B1 (en) | Process and Apparatus for Eliminating Loudspeaker Interference from Microphone Signals | |
US6834108B1 (en) | Method for improving acoustic noise attenuation in hand-free devices | |
US3585311A (en) | Speech processor using contiguous multiband center-clipping | |
US20080152156A1 (en) | Robust Method of Echo Suppressor | |
US7711107B1 (en) | Perceptual masking of residual echo | |
Surin et al. | An adaptive noise decorrelation technique for stereophonic acoustic echo cancellation | |
Cecchi et al. | Multichannel double-talk detector based on fundamental frequency estimation | |
KR100272131B1 (en) | Adaptive reverbation cancelling apparatus |