US5276765A - Voice activity detection - Google Patents
Voice activity detection Download PDFInfo
- Publication number
- US5276765A US5276765A US07/952,147 US95214792A US5276765A US 5276765 A US5276765 A US 5276765A US 95214792 A US95214792 A US 95214792A US 5276765 A US5276765 A US 5276765A
- Authority
- US
- United States
- Prior art keywords
- signal
- input signal
- speech
- measure
- electrical
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000000694 effects Effects 0.000 title claims abstract description 38
- 238000001514 detection method Methods 0.000 title claims description 22
- 238000001228 spectrum Methods 0.000 claims abstract description 32
- 230000003595 spectral effect Effects 0.000 claims abstract description 31
- 230000004044 response Effects 0.000 claims abstract description 19
- 238000001914 filtration Methods 0.000 claims description 7
- 230000003044 adaptive effect Effects 0.000 claims description 6
- 238000005259 measurement Methods 0.000 claims description 2
- 238000000034 method Methods 0.000 claims 2
- 238000011524 similarity measure Methods 0.000 claims 1
- 206010019133 Hangover Diseases 0.000 description 3
- 101000822695 Clostridium perfringens (strain 13 / Type A) Small, acid-soluble spore protein C1 Proteins 0.000 description 1
- 101000655262 Clostridium perfringens (strain 13 / Type A) Small, acid-soluble spore protein C2 Proteins 0.000 description 1
- 101000655256 Paraclostridium bifermentans Small, acid-soluble spore protein alpha Proteins 0.000 description 1
- 101000655264 Paraclostridium bifermentans Small, acid-soluble spore protein beta Proteins 0.000 description 1
- 206010036086 Polymenorrhoea Diseases 0.000 description 1
- 101710096660 Probable acetoacetate decarboxylase 2 Proteins 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000011045 prefiltration Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
Definitions
- a voice activity detector is a device which is supplied with a signal with the object of detecting periods of speech, or periods containing only noise.
- the present invention is not limited thereto, one application of particular interest for such detectors is in mobile radio telephone systems where the knowledge as to the presence or otherwise of speech can be used and exploited by a speech coder to improve the efficient utilisation of radio spectrum, and where also the noise level (from a vehicle-mounted unit) is likely to be high.
- voice activity detection is to locate a measure which differs appreciably between speech and non-speech periods.
- apparatus which includes a speech coder
- a number of parameters are readily available from one or other stage of the coder, and it is therefore desirable to economise on processing needed by utilising some such parameter.
- the main noise sources occur in known defined areas of the frequency spectrum. For example, in a moving car much of the noise (e.g., engine noise) is concentrated in the low frequency regions of the spectrum. Where such knowledge of the spectral position of noise is available, it is desirable to base the decision as to whether speech is present or absent upon measurements taken from that portion of the spectrum which contains relatively little noise. It would, of course, be possible in practice to pre-filter the signal before analysing to detect speech activity, but where the voice activity detector follows the output of a speech coder, prefiltering would distort the voice signal to be coded.
- a voice activity detection apparatus comprising means for receiving an input signal, means for periodically adaptively generating an estimate of the noise signal component of the input signal, means for periodically forming a measure M of the spectral similarity between a portion of the input signal and the noise signal component, means for comparing a parameter derived from the measure M with a threshold value T, and means for producing an output to indicate the presence or absence of speech in dependence upon whether or not that value is exceeded.
- the measure is the Itakura-Saito Distortion Measure.
- FIG. 1 is a block diagram of a first embodiment of the invention
- FIG. 2 shows a second embodiment of the invention
- FIG. 3 shows a third, preferred embodiment of the invention.
- the zero order autocorrelation coefficient is the sum of each term squared, which may be normalized i.e. divided by the total number of terms (for constant frame lengths it is easier to omit the division); that of the filtered signal is thus ##EQU2## and this is therefore a measure of the power of the notional filtered signal s'--in other words, of that part of the signal s which falls within the passband of the notional filter.
- R' 0 can be obtained from a combination of the autocorrelation coefficients R i , weighted by the bracketed constants which determine the frequency band to which the value of R' 0 is responsive.
- the bracketed terms are the autocorrelation coefficients of the impulse response of the notional filter, so that the expression above may be simplified to ##EQU4## where N is the filter order and H i are the (un-normalised) autocorrelation coefficients of the impulse response of the filter.
- the effect on the signal autocorrelation coefficients of filtering a signal may be simulated by producing a weighted sum of the autocorrelation coefficients of the (unfiltered) signal, using the impulse response that the required filter would have had.
- This filtering operation may alternatively be viewed as a form of spectrum comparison, with the signal spectrum being matched against a reference spectrum (the inverse of the response of the notional filter). Since the notional filter in this application is selected so as to approximate the inverse of the noise spectrum, this operation may be viewed as a spectral comparison between speech and noise spectra, and the zeroth autocorrelation coefficient thus generated (i.e. the energy of the inverse filtered signal) as a measure of dissimilarity between the spectra.
- the Itakura-Saito distortion measure is used in LPC to assess the match between the predictor filter and the input spectrum, and in one form is expressed as ##EQU5## where A 0 etc are the autocorrelation coefficients of the LPC parameter set.
- the LPC coefficients are the taps of an FIR filter having the inverse spectral response of the input signal so that the LPC coefficient set is the impulse response of the inverse LPC filter, it will be apparent that the Itakura-Saito Distortion Measure is an fact merely a form of equation 1, wherein the filter response H is the inverse of the spectral shape of an all-pole model of the input signal.
- a signal from a microphone is received at an input 1 and converted to digital samples s at a suitable sampling rate by an analogue to digital converter 2.
- An LPC analysis unit 3 (in a known type of LPC coder) then derives, for successive frames of n (e.g. 160) samples, a set of N (e.g. 8 or 12) LPC filter coefficients L i which are transmitted to represent the input speech.
- the speech signal s also enters a correlator unit 4 (normally part of the LPC coder 3 since the autocorrelation vector R i of the speech is also usually produced as a step in the LPC analysis although it will be appreciated that a separate correlator could be provided).
- the correlator 4 produces the autocorrelation vector R i , including the zero order correlation coefficient R 0 and at least 2 further autocorrelation coefficients R 1 , R 2 , R 3 . These are then supplied to a multiplier unit 5.
- a second input 11 is connected to a second microphone located distant from the speaker so as to receive only background noise.
- the input from this microphone is converted to a digital input sample train by AD converter 12 and LPC analysed by a second LPC analyser 13.
- the "noise" LPC coefficients produced from analyser 13 are passed to correlator unit 14, and the autocorrelation vector thus produced is multiplied term by term with the autocorrelation coefficients R i of the input signal from the speech microphone in multiplier 5 and the weighted coefficients thus produced are combined in adder 6 according to Equation 1, so as to apply a filter having the inverse shape of the noise spectrum from the noise-only microphone (which in practice is the same as the shape of the noise spectrum in the signal-plus-noise microphone) and thus filter out most of the noise.
- the resulting measure M is thresholded by thresholder 7 to produce a logic output 8 indicating the presence or absence of speech; if M is high, speech is deemed to be present.
- This embodiment does, however, require two microphones and two LPC analysers, which adds to the expense and complexity of the equipment necessary.
- another embodiment uses a corresponding measure formed using the autocorrelations from the noise microphone 11 and the LPC coefficients from the main microphone 1, so that an extra autocorrelator rather than an LPC analyser is necessary.
- a buffer 15 which stores a set of LPC coefficients (or the autocorrelation vector of the set) derived from the microphone input 1 in a period identified as being a "non speech" (i.e. noise only) period. These coefficients are then used to derive a measure using equation 1, which also of course corresponds to the Itakura-Saito Distortion Measure, except that a single stored frame of LPC coefficients corresponding to an approximation of the inverse noise spectrum is used, rather than the present frame of LPC coefficients.
- the LPC coefficient vector L i output by analyser 3 is also routed to a correlator 14, which produces the autocorrelation vector of the LPC coefficient vector.
- the buffer memory 15 is controlled by the speech/non-speech output of thresholder 7, in such a way that during "speech" frames the buffer retains the "noise” autocorrelation coefficients, but during "noise” frames a new set of LPC coefficients may be used to update the buffer, for example by a multiple switch 16, via which outputs of the correlator 14, carrying each autocorrelation coefficient, are connected to the buffer 15. It will be appreciated that correlator 14 could be positioned after buffer 15. Further, the speech/no-speech decision for coefficient update need not be from output 8, but could be (and preferably is) otherwise derived.
- the LPC coefficients stored in the buffer are updated from time to time, so that the apparatus is thus capable of tracking changes in the noise spectrum. It will be appreciated that such updating of the buffer may be necessary only occasionally, or may occur only once at the start of operation of the detector, if (as is often the case) the noise spectrum is relatively stationary over time, but in a mobile radio environment frequent updating is preferred.
- the system initially employs equation 1 with coefficient terms corresponding to a simple fixed high pass filter, and then subsequently starts to adapt by switching over to using "noise period" LPC coefficients. If, for some reason, speech detection fails, the system may return to using the simple high pass filter.
- LPC analysis unit 13 is simply replaced by an adaptive filter (for example a transversal FIR or lattice filter), connected so as to whiten the noise input by modelling the inverse filter, and its coefficients are supplied as before to autocorrelator 14.
- an adaptive filter for example a transversal FIR or lattice filter
- LPC analysis means 3 is replaced by such an adaptive filter, and buffer means 15 is omitted, but switch 16 operates to prevent the adaptive filter from adapting its coefficients during speech periods.
- the LPC coefficient vector is simply the impulse response of an FIR filter which has a response approximating the inverse spectral shape of the input signal.
- the Itakura-Saito Distortion Measure between adjacent frames is formed, this is in fact equal to the power of the signal, as filtered by the LPC filter of the previous frame. So if spectra of adjacent frames differ little, a correspondingly small amount of the spectral power of a frame will escape filtering and the measure will be low.
- a large interframe spectral difference produces a high Itakura-Saito Distortion Measure, so that the measure reflects the spectral similarity of adjacent frames.
- the Itakura-Saito Distortion Measure between adjacent frames of a noisy signal containing intermittent speech is higher during periods of speech than periods of noise; the degree of variation (as illustrated by the standard deviation) is also higher, and less intermittently variable.
- the standard deviation of the standard deviation of M is also a reliable measure; the effect of taking each standard deviation is essentially to smooth the measure.
- the measured parameter used to decide whether speech is present is preferably the standard deviation of the Itakura-Saito Distortion Measure, but other measures of variance and other spectral distortion measures (based for example on FFT analysis) could be employed.
- an adaptive threshold in voice activity detection. Such thresholds must not be adjusted during speech periods or the speech signal will be thresholded out. It is accordingly necessary to control the threshold adapter using a speech/non-speech control signal, and it is preferable that this control signal should be independent of the output of the threshold adapter.
- the threshold T is adaptively adjusted so as to keep the threshold level just above the level of the measure M when noise only is present. Since the measure will in general vary randomly when noise is present, the threshold is varied by determining an average level over a number of blocks, and setting the threshold at a level proportional to this average. In a noisy environment this is not usually sufficient, however, and so an assessment of the degree of variation of the parameter over several blocks is also taken into account.
- the threshold value T is therefore preferably calculated according to
- M' is the average value of the measure over a number of consecutive frames
- d is the standard deviation of the measure over those frames
- K is a constant (which may typically be 2).
- an input 1 receives a signal which is sampled and digitised by analogue to digital converter (ADC) 2, and supplied to the input of an inverse filter analyser 3, which in practice is part of a speech coder with which the voice activity detector is to work, and which generates coefficients L i (typically 8) of a filter corresponding to the inverse of the input signal spectrum.
- ADC analogue to digital converter
- the digitised signal is also supplied to an autocorrelator 4, (which is part of analyser 3) which generates the autocorrelation vector R i of the input signal (or at least as many low order terms as there are LPC coefficients). Operation of these parts of the apparatus is as described in FIGS. 1 and 2.
- the autocorrelation coefficients R i are then averaged over several successive speech frames (typically 5-20 ms long) to improve their reliability. This may be achieved by storing each set of autocorrelations coefficients output by autocorrelator 4 in a buffer 4a, and employing an averager 4b to produce a weighted sum of the current autocorrelation coefficients R i and those from previous frames stored in and supplied from buffer 4a.
- the averaged autocorrelation coefficients Ra i thus derived are supplied to weighting and adding means 5,6 which receives also the autocorrelation vector A i of stored noise-period inverse filter coefficients L i from an autocorrelator 14 via buffer 15, and forms from Ra i and A i a measure M preferably defined as: ##EQU7##
- This measure is then thresholded by thesholder 7 against a threshold level, and the logical result provides an indication of the presence or absence of speech at output 8.
- the inverse filter coefficients L i correspond to a fair estimate of the inverse of the noise spectrum, it is desirable to update these coefficients during periods of noise (and, of course, not to update during periods of speech). It is, however, preferable that the speech/non-speech decision on which the updating is based does not depend upon the result of the updating, or else a single wrongly identified frame of signal may result in the voice activity detector subsequently going "out of lock" and wrongly identifying following frames.
- a control signal generating circuit 20 effectively a separate voice activity detector, which forms an independent control signal indicating the presence or absence of speech to control inverse filter analyser 3 (or buffer 15) so that the inverse filter autocorrelation coefficients A i used to form the measure M are only updated during "noise" periods.
- the control signal generator circuit 20 includes LPC analyser 21 (which again may be part of a speech coder and, specifically, may be performed by analyser 3), which produces a set of LPC coefficients M i corresponding to the input signal and an autocorrelator 21a (which may be performed by autocorrelator 3a) which derives the autocorrelation coefficients B.sub. i of M i .
- a measure of the spectral similarity between the input speech frame and the preceding speech frame is thus calculated; this may be the Itakura-Saito distortion measure between R i of the present frame and B i of the preceding frame, as disclosed above, or it may instead be derived by calculating the Itakura-Saito distortion measure for R i and B i of the present frame, and subtracting (in subtractor 25) the corresponding measure for the previous frame stored in buffer 24, to generate a spectral difference signal (in either case, the measure is preferably energy-normalised by dividing by R o ).
- the buffer 24 is then, of course, updated.
- a voiced speech detection circuit comprising a pitch analyser 27 (which in practice may operate as part of a speech coder, and in particular may measure the long term predictor lag value produced in a multipulse LPC coder).
- the pitch analyser 27 produces a logic signal which is "true” when voiced speech is detected, and this signal, together with the threshold measure derived from thresholder 26 (which will generally be “true” when unvoiced speech is present) are supplied to the inputs of a NOR gate 28 to generate a signal which is “false” when speech is present and “true” when noise is present.
- This signal is supplied to buffer 15 (or to inverse filter analyser 3) so that inverse filter coefficients L i are only updated during noise periods.
- Threshold adapter 29 is also connected to receive the non-speech signal control output of control signal generator circuit 20. The output of the threshold adapter 29 is supplied to thresholder 7. The threshold adapter operates to increment or decrement the threshold in steps which are a proportion of the instant threshold value, until the threshold approximates the noise power level (which may conveniently be derived from, for example, weighting and adding circuits 22, 23). When the input signal is very low, it may be desirable that the threshold is automatically set to a fixed, low, level since at the low signal levels the effect of signal quantisation produced by ADC 2 can produce unreliable results.
- hangover generating means 30 which operates to measure the duration of indications of speech after thresholder 7 and, when the presence of speech has been indicated for a period in excess of a predetermined time constant, the output is held high for a short "hangover" period. In this way, clipping of the middle of low-level speech bursts is avoided, and appropriate selection of the time constant prevents triggering of the hangover generator 30 by short spikes of noise which are falsely indicated as speech.
- DSP Digital Signal Processing
- the voice detection apparatus may be implemented as part of an LPC codec.
- autocorrelation coefficients of the signal or related measures partial correlation, or "parcor", coefficients
- the voice detection may take place distantly from the codec.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
T=M'+K·d
Claims (23)
M=R.sub.O A.sub.O +2ΣR.sub.i A.sub.i.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US07/952,147 US5276765A (en) | 1988-03-11 | 1989-03-10 | Voice activity detection |
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB888805795A GB8805795D0 (en) | 1988-03-11 | 1988-03-11 | Voice activity detector |
GB8805795 | 1988-03-11 | ||
GB8813346 | 1988-06-06 | ||
GB888813346A GB8813346D0 (en) | 1988-06-06 | 1988-06-06 | Voice activity detection |
GB8820105 | 1988-08-24 | ||
GB888820105A GB8820105D0 (en) | 1988-08-24 | 1988-08-24 | Voice activity detection |
US07/952,147 US5276765A (en) | 1988-03-11 | 1989-03-10 | Voice activity detection |
US55544590A | 1990-08-15 | 1990-08-15 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US55544590A Continuation | 1988-03-11 | 1990-08-15 |
Publications (1)
Publication Number | Publication Date |
---|---|
US5276765A true US5276765A (en) | 1994-01-04 |
Family
ID=27516796
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US07/952,147 Expired - Lifetime US5276765A (en) | 1988-03-11 | 1989-03-10 | Voice activity detection |
Country Status (1)
Country | Link |
---|---|
US (1) | US5276765A (en) |
Cited By (105)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5490231A (en) * | 1990-05-28 | 1996-02-06 | Matsushita Electric Industrial Co., Ltd. | Noise signal prediction system |
US5572623A (en) * | 1992-10-21 | 1996-11-05 | Sextant Avionique | Method of speech detection |
US5579432A (en) * | 1993-05-26 | 1996-11-26 | Telefonaktiebolaget Lm Ericsson | Discriminating between stationary and non-stationary signals |
US5619566A (en) * | 1993-08-27 | 1997-04-08 | Motorola, Inc. | Voice activity detector for an echo suppressor and an echo suppressor |
EP0768770A1 (en) * | 1995-10-13 | 1997-04-16 | France Telecom | Method and arrangement for the creation of comfort noise in a digital transmission system |
US5633982A (en) * | 1993-12-20 | 1997-05-27 | Hughes Electronics | Removal of swirl artifacts from celp-based speech coders |
WO1997022117A1 (en) * | 1995-12-12 | 1997-06-19 | Nokia Mobile Phones Limited | Method and device for voice activity detection and a communication device |
US5657422A (en) * | 1994-01-28 | 1997-08-12 | Lucent Technologies Inc. | Voice activity detection driven noise remediator |
US5732141A (en) * | 1994-11-22 | 1998-03-24 | Alcatel Mobile Phones | Detecting voice activity |
US5749067A (en) * | 1993-09-14 | 1998-05-05 | British Telecommunications Public Limited Company | Voice activity detector |
US5754554A (en) * | 1994-10-28 | 1998-05-19 | Nec Corporation | Telephone apparatus for multiplexing digital speech samples and data signals using variable rate speech coding |
US5774849A (en) * | 1996-01-22 | 1998-06-30 | Rockwell International Corporation | Method and apparatus for generating frame voicing decisions of an incoming speech signal |
WO1998048407A2 (en) * | 1997-04-18 | 1998-10-29 | Nokia Networks Oy | Speech detection in a telecommunication system |
US5864793A (en) * | 1996-08-06 | 1999-01-26 | Cirrus Logic, Inc. | Persistence and dynamic threshold based intermittent signal detector |
US5970441A (en) * | 1997-08-25 | 1999-10-19 | Telefonaktiebolaget Lm Ericsson | Detection of periodicity information from an audio signal |
US5974375A (en) * | 1996-12-02 | 1999-10-26 | Oki Electric Industry Co., Ltd. | Coding device and decoding device of speech signal, coding method and decoding method |
US5978760A (en) * | 1996-01-29 | 1999-11-02 | Texas Instruments Incorporated | Method and system for improved discontinuous speech transmission |
EP0969692A1 (en) * | 1997-03-06 | 2000-01-05 | Asahi Kasei Kogyo Kabushiki Kaisha | Device and method for processing speech |
USD419160S (en) * | 1998-05-14 | 2000-01-18 | Northrop Grumman Corporation | Personal communications unit docking station |
US6023674A (en) * | 1998-01-23 | 2000-02-08 | Telefonaktiebolaget L M Ericsson | Non-parametric voice activity detection |
USD421002S (en) * | 1998-05-15 | 2000-02-22 | Northrop Grumman Corporation | Personal communications unit handset |
US6041243A (en) * | 1998-05-15 | 2000-03-21 | Northrop Grumman Corporation | Personal communications unit |
US6134524A (en) * | 1997-10-24 | 2000-10-17 | Nortel Networks Corporation | Method and apparatus to detect and delimit foreground speech |
US6141426A (en) * | 1998-05-15 | 2000-10-31 | Northrop Grumman Corporation | Voice operated switch for use in high noise environments |
US6169730B1 (en) | 1998-05-15 | 2001-01-02 | Northrop Grumman Corporation | Wireless communications protocol |
US6182035B1 (en) | 1998-03-26 | 2001-01-30 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for detecting voice activity |
US6205423B1 (en) * | 1998-01-13 | 2001-03-20 | Conexant Systems, Inc. | Method for coding speech containing noise-like speech periods and/or having background noise |
US6223062B1 (en) | 1998-05-15 | 2001-04-24 | Northrop Grumann Corporation | Communications interface adapter |
US6243573B1 (en) | 1998-05-15 | 2001-06-05 | Northrop Grumman Corporation | Personal communications system |
US6285979B1 (en) * | 1998-03-27 | 2001-09-04 | Avr Communications Ltd. | Phoneme analyzer |
US20010027391A1 (en) * | 1996-11-07 | 2001-10-04 | Matsushita Electric Industrial Co., Ltd. | Excitation vector generator, speech coder and speech decoder |
US6304559B1 (en) | 1998-05-15 | 2001-10-16 | Northrop Grumman Corporation | Wireless communications protocol |
US6304216B1 (en) | 1999-03-30 | 2001-10-16 | Conexant Systems, Inc. | Signal detector employing correlation analysis of non-uniform and disjoint sample segments |
US6327471B1 (en) | 1998-02-19 | 2001-12-04 | Conexant Systems, Inc. | Method and an apparatus for positioning system assisted cellular radiotelephone handoff and dropoff |
US6348744B1 (en) | 1998-04-14 | 2002-02-19 | Conexant Systems, Inc. | Integrated power management module |
US20020046022A1 (en) * | 2000-10-13 | 2002-04-18 | At&T Corp. | Systems and methods for dynamic re-configurable speech recognition |
US20020046026A1 (en) * | 2000-09-12 | 2002-04-18 | Pioneer Corporation | Voice recognition system |
US20020049592A1 (en) * | 2000-09-12 | 2002-04-25 | Pioneer Corporation | Voice recognition system |
US6381568B1 (en) | 1999-05-05 | 2002-04-30 | The United States Of America As Represented By The National Security Agency | Method of transmitting speech using discontinuous transmission and comfort noise |
US6393396B1 (en) * | 1998-07-29 | 2002-05-21 | Canon Kabushiki Kaisha | Method and apparatus for distinguishing speech from noise |
US6424938B1 (en) * | 1998-11-23 | 2002-07-23 | Telefonaktiebolaget L M Ericsson | Complex signal activity detection for improved speech/noise classification of an audio signal |
US6427134B1 (en) * | 1996-07-03 | 2002-07-30 | British Telecommunications Public Limited Company | Voice activity detector for calculating spectral irregularity measure on the basis of spectral difference measurements |
US6448925B1 (en) | 1999-02-04 | 2002-09-10 | Conexant Systems, Inc. | Jamming detection and blanking for GPS receivers |
US6453291B1 (en) * | 1999-02-04 | 2002-09-17 | Motorola, Inc. | Apparatus and method for voice activity detection in a communication system |
US6453285B1 (en) * | 1998-08-21 | 2002-09-17 | Polycom, Inc. | Speech activity detector for use in noise reduction system, and methods therefor |
US6496145B2 (en) | 1999-03-30 | 2002-12-17 | Sirf Technology, Inc. | Signal detector employing coherent integration |
US6519277B2 (en) | 1999-05-25 | 2003-02-11 | Sirf Technology, Inc. | Accelerated selection of a base station in a wireless communication system |
US6526378B1 (en) * | 1997-12-08 | 2003-02-25 | Mitsubishi Denki Kabushiki Kaisha | Method and apparatus for processing sound signal |
US6531982B1 (en) | 1997-09-30 | 2003-03-11 | Sirf Technology, Inc. | Field unit for use in a GPS system |
US6556967B1 (en) | 1999-03-12 | 2003-04-29 | The United States Of America As Represented By The National Security Agency | Voice activity detector |
US20030120487A1 (en) * | 2001-12-20 | 2003-06-26 | Hitachi, Ltd. | Dynamic adjustment of noise separation in data handling, particularly voice activation |
US6606349B1 (en) | 1999-02-04 | 2003-08-12 | Sirf Technology, Inc. | Spread spectrum receiver performance improvement |
US6618701B2 (en) * | 1999-04-19 | 2003-09-09 | Motorola, Inc. | Method and system for noise suppression using external voice activity detection |
KR100399057B1 (en) * | 2001-08-07 | 2003-09-26 | 한국전자통신연구원 | Apparatus for Voice Activity Detection in Mobile Communication System and Method Thereof |
WO2003048711A3 (en) * | 2001-12-05 | 2004-02-12 | France Telecom | Speech detection system in an audio signal in noisy surrounding |
US6693953B2 (en) | 1998-09-30 | 2004-02-17 | Skyworks Solutions, Inc. | Adaptive wireless communication receiver |
US20040042626A1 (en) * | 2002-08-30 | 2004-03-04 | Balan Radu Victor | Multichannel voice detection in adverse environments |
US6708146B1 (en) | 1997-01-03 | 2004-03-16 | Telecommunications Research Laboratories | Voiceband signal classifier |
US6714158B1 (en) | 2000-04-18 | 2004-03-30 | Sirf Technology, Inc. | Method and system for data detection in a global positioning system satellite receiver |
US20040064314A1 (en) * | 2002-09-27 | 2004-04-01 | Aubert Nicolas De Saint | Methods and apparatus for speech end-point detection |
US20040078200A1 (en) * | 2002-10-17 | 2004-04-22 | Clarity, Llc | Noise reduction in subbanded speech signals |
US6741873B1 (en) * | 2000-07-05 | 2004-05-25 | Motorola, Inc. | Background noise adaptable speaker phone for use in a mobile communication device |
US6778136B2 (en) | 2001-12-13 | 2004-08-17 | Sirf Technology, Inc. | Fast acquisition of GPS signal |
US6788655B1 (en) | 2000-04-18 | 2004-09-07 | Sirf Technology, Inc. | Personal communications device with ratio counter |
US20050025222A1 (en) * | 1998-09-01 | 2005-02-03 | Underbrink Paul A. | System and method for despreading in a spread spectrum matched filter |
US20050044471A1 (en) * | 2001-11-15 | 2005-02-24 | Chia Pei Yen | Error concealment apparatus and method |
US20050154583A1 (en) * | 2003-12-25 | 2005-07-14 | Nobuhiko Naka | Apparatus and method for voice activity detection |
US20050171769A1 (en) * | 2004-01-28 | 2005-08-04 | Ntt Docomo, Inc. | Apparatus and method for voice activity detection |
US6931055B1 (en) | 2000-04-18 | 2005-08-16 | Sirf Technology, Inc. | Signal detector employing a doppler phase correction system |
US20050209762A1 (en) * | 2004-03-18 | 2005-09-22 | Ford Global Technologies, Llc | Method and apparatus for controlling a vehicle using an object detection system and brake-steer |
US6952440B1 (en) | 2000-04-18 | 2005-10-04 | Sirf Technology, Inc. | Signal detector employing a Doppler phase correction system |
US20050246166A1 (en) * | 2004-04-28 | 2005-11-03 | International Business Machines Corporation | Componentized voice server with selectable internal and external speech detectors |
US20060025992A1 (en) * | 2004-07-27 | 2006-02-02 | Yoon-Hark Oh | Apparatus and method of eliminating noise from a recording device |
US20060053007A1 (en) * | 2004-08-30 | 2006-03-09 | Nokia Corporation | Detection of voice activity in an audio signal |
US20060200350A1 (en) * | 2004-12-22 | 2006-09-07 | David Attwater | Multi dimensional confidence |
US20060217973A1 (en) * | 2005-03-24 | 2006-09-28 | Mindspeed Technologies, Inc. | Adaptive voice mode extension for a voice activity detector |
US20070033042A1 (en) * | 2005-08-03 | 2007-02-08 | International Business Machines Corporation | Speech detection fusing multi-class acoustic-phonetic, and energy features |
US20070043563A1 (en) * | 2005-08-22 | 2007-02-22 | International Business Machines Corporation | Methods and apparatus for buffering data for use in accordance with a speech recognition system |
US20070150264A1 (en) * | 1999-09-20 | 2007-06-28 | Onur Tackin | Voice And Data Exchange Over A Packet Based Network With Voice Detection |
WO2007091956A2 (en) | 2006-02-10 | 2007-08-16 | Telefonaktiebolaget Lm Ericsson (Publ) | A voice detector and a method for suppressing sub-bands in a voice detector |
US20070233479A1 (en) * | 2002-05-30 | 2007-10-04 | Burnett Gregory C | Detecting voiced and unvoiced speech using both acoustic and nonacoustic sensors |
US20080133226A1 (en) * | 2006-09-21 | 2008-06-05 | Spreadtrum Communications Corporation | Methods and apparatus for voice activity detection |
US20090271190A1 (en) * | 2008-04-25 | 2009-10-29 | Nokia Corporation | Method and Apparatus for Voice Activity Determination |
US20090316918A1 (en) * | 2008-04-25 | 2009-12-24 | Nokia Corporation | Electronic Device Speech Enhancement |
US7711038B1 (en) | 1998-09-01 | 2010-05-04 | Sirf Technology, Inc. | System and method for despreading in a spread spectrum matched filter |
US20100322366A1 (en) * | 2006-12-06 | 2010-12-23 | Electronics And Telecommunications Research Institute | Method for detecting frame synchronization and structure in dvb-s2 system |
WO2010151183A1 (en) * | 2009-06-23 | 2010-12-29 | Telefonaktiebolaget L M Ericsson (Publ) | Method and an arrangement for a mobile telecommunications network |
US7885314B1 (en) | 2000-05-02 | 2011-02-08 | Kenneth Scott Walley | Cancellation system and method for a wireless positioning system |
US20110051953A1 (en) * | 2008-04-25 | 2011-03-03 | Nokia Corporation | Calibrating multiple microphones |
US20110066439A1 (en) * | 2008-06-02 | 2011-03-17 | Kengo Nakao | Dimension measurement system |
WO2011044842A1 (en) * | 2009-10-15 | 2011-04-21 | 华为技术有限公司 | Method,device and coder for voice activity detection |
US20110125497A1 (en) * | 2009-11-20 | 2011-05-26 | Takahiro Unno | Method and System for Voice Activity Detection |
DE102006032967B4 (en) * | 2005-07-28 | 2012-04-19 | S. Siedle & Söhne Telefon- und Telegrafenwerke OHG | House plant and method for operating a house plant |
US20120197642A1 (en) * | 2009-10-15 | 2012-08-02 | Huawei Technologies Co., Ltd. | Signal processing method, device, and system |
US20140119461A1 (en) * | 2011-08-25 | 2014-05-01 | Mitsubishi Electric Corporation | Signal transmission device |
US8870791B2 (en) | 2006-03-23 | 2014-10-28 | Michael E. Sabatino | Apparatus for acquiring, processing and transmitting physiological sounds |
US8942383B2 (en) | 2001-05-30 | 2015-01-27 | Aliphcom | Wind suppression/replacement component for use with electronic systems |
US9066186B2 (en) | 2003-01-30 | 2015-06-23 | Aliphcom | Light-based detection for acoustic applications |
US9099094B2 (en) | 2003-03-27 | 2015-08-04 | Aliphcom | Microphone array with rear venting |
US9196261B2 (en) | 2000-07-19 | 2015-11-24 | Aliphcom | Voice activity detector (VAD)—based multiple-microphone acoustic noise suppression |
US20160260443A1 (en) * | 2010-12-24 | 2016-09-08 | Huawei Technologies Co., Ltd. | Method and apparatus for detecting a voice activity in an input audio signal |
US10225649B2 (en) | 2000-07-19 | 2019-03-05 | Gregory C. Burnett | Microphone array with rear venting |
US10389657B1 (en) * | 1999-11-05 | 2019-08-20 | Open Invention Network, Llc. | System and method for voice transmission over network protocols |
US11122357B2 (en) | 2007-06-13 | 2021-09-14 | Jawbone Innovations, Llc | Forming virtual microphone arrays using dual omnidirectional microphone array (DOMA) |
US11361784B2 (en) | 2009-10-19 | 2022-06-14 | Telefonaktiebolaget Lm Ericsson (Publ) | Detector and method for voice activity detection |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4227046A (en) * | 1977-02-25 | 1980-10-07 | Hitachi, Ltd. | Pre-processing system for speech recognition |
US4283601A (en) * | 1978-05-12 | 1981-08-11 | Hitachi, Ltd. | Preprocessing method and device for speech recognition device |
US4338738A (en) * | 1980-01-10 | 1982-07-13 | Lamb Owen L | Slide previewer and tray loader |
US4672669A (en) * | 1983-06-07 | 1987-06-09 | International Business Machines Corp. | Voice activity detection process and means for implementing said process |
US4696039A (en) * | 1983-10-13 | 1987-09-22 | Texas Instruments Incorporated | Speech analysis/synthesis system with silence suppression |
US4731846A (en) * | 1983-04-13 | 1988-03-15 | Texas Instruments Incorporated | Voice messaging system with pitch tracking based on adaptively filtered LPC residual signal |
-
1989
- 1989-03-10 US US07/952,147 patent/US5276765A/en not_active Expired - Lifetime
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4227046A (en) * | 1977-02-25 | 1980-10-07 | Hitachi, Ltd. | Pre-processing system for speech recognition |
US4283601A (en) * | 1978-05-12 | 1981-08-11 | Hitachi, Ltd. | Preprocessing method and device for speech recognition device |
US4338738A (en) * | 1980-01-10 | 1982-07-13 | Lamb Owen L | Slide previewer and tray loader |
US4731846A (en) * | 1983-04-13 | 1988-03-15 | Texas Instruments Incorporated | Voice messaging system with pitch tracking based on adaptively filtered LPC residual signal |
US4672669A (en) * | 1983-06-07 | 1987-06-09 | International Business Machines Corp. | Voice activity detection process and means for implementing said process |
US4696039A (en) * | 1983-10-13 | 1987-09-22 | Texas Instruments Incorporated | Speech analysis/synthesis system with silence suppression |
Non-Patent Citations (6)
Title |
---|
McAulay, "Optimum Speech Classification and Its Application to Adaptive Noise Cancellation", 1977 IEEE ICASSP, Hartford, CN, May 9-11, 1977, pp. 425-428. |
McAulay, Optimum Speech Classification and Its Application to Adaptive Noise Cancellation , 1977 IEEE ICASSP, Hartford, CN, May 9 11, 1977, pp. 425 428. * |
Rabiner et al., "Application of an LPC Distance Measure to the Voiced-Unvoiced-Silence Detection Problem", IEEE Trans. on ASSP, vol. ASSP-25, No. 4, Aug. 1977, pp. 338-343. |
Rabiner et al., Application of an LPC Distance Measure to the Voiced Unvoiced Silence Detection Problem , IEEE Trans. on ASSP, vol. ASSP 25, No. 4, Aug. 1977, pp. 338 343. * |
Un, "Improving LPC Analysis of Noisy Speech by Autocorrelation Subtraction Method", ICASSP '81, Atlanta, GA, Mar. 30, 31, Apr. 1981, pp. 1082-1085. |
Un, Improving LPC Analysis of Noisy Speech by Autocorrelation Subtraction Method , ICASSP 81, Atlanta, GA, Mar. 30, 31, Apr. 1981, pp. 1082 1085. * |
Cited By (181)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5490231A (en) * | 1990-05-28 | 1996-02-06 | Matsushita Electric Industrial Co., Ltd. | Noise signal prediction system |
US5572623A (en) * | 1992-10-21 | 1996-11-05 | Sextant Avionique | Method of speech detection |
US5579432A (en) * | 1993-05-26 | 1996-11-26 | Telefonaktiebolaget Lm Ericsson | Discriminating between stationary and non-stationary signals |
US5619566A (en) * | 1993-08-27 | 1997-04-08 | Motorola, Inc. | Voice activity detector for an echo suppressor and an echo suppressor |
US6061647A (en) * | 1993-09-14 | 2000-05-09 | British Telecommunications Public Limited Company | Voice activity detector |
US5749067A (en) * | 1993-09-14 | 1998-05-05 | British Telecommunications Public Limited Company | Voice activity detector |
US5633982A (en) * | 1993-12-20 | 1997-05-27 | Hughes Electronics | Removal of swirl artifacts from celp-based speech coders |
US5657422A (en) * | 1994-01-28 | 1997-08-12 | Lucent Technologies Inc. | Voice activity detection driven noise remediator |
US5754554A (en) * | 1994-10-28 | 1998-05-19 | Nec Corporation | Telephone apparatus for multiplexing digital speech samples and data signals using variable rate speech coding |
US5732141A (en) * | 1994-11-22 | 1998-03-24 | Alcatel Mobile Phones | Detecting voice activity |
AU698712B2 (en) * | 1994-11-22 | 1998-11-05 | Societe Anonyme Dite : Alcatel Mobile Phones | Detecting voice activity |
US5812965A (en) * | 1995-10-13 | 1998-09-22 | France Telecom | Process and device for creating comfort noise in a digital speech transmission system |
FR2739995A1 (en) * | 1995-10-13 | 1997-04-18 | Massaloux Dominique | METHOD AND DEVICE FOR CREATING A COMFORT NOISE IN A DIGITAL SPEECH TRANSMISSION SYSTEM |
EP0768770A1 (en) * | 1995-10-13 | 1997-04-16 | France Telecom | Method and arrangement for the creation of comfort noise in a digital transmission system |
US5963901A (en) * | 1995-12-12 | 1999-10-05 | Nokia Mobile Phones Ltd. | Method and device for voice activity detection and a communication device |
EP0784311A1 (en) | 1995-12-12 | 1997-07-16 | Nokia Mobile Phones Ltd. | Method and device for voice activity detection and a communication device |
WO1997022117A1 (en) * | 1995-12-12 | 1997-06-19 | Nokia Mobile Phones Limited | Method and device for voice activity detection and a communication device |
US5774849A (en) * | 1996-01-22 | 1998-06-30 | Rockwell International Corporation | Method and apparatus for generating frame voicing decisions of an incoming speech signal |
US5978760A (en) * | 1996-01-29 | 1999-11-02 | Texas Instruments Incorporated | Method and system for improved discontinuous speech transmission |
US6427134B1 (en) * | 1996-07-03 | 2002-07-30 | British Telecommunications Public Limited Company | Voice activity detector for calculating spectral irregularity measure on the basis of spectral difference measurements |
US5864793A (en) * | 1996-08-06 | 1999-01-26 | Cirrus Logic, Inc. | Persistence and dynamic threshold based intermittent signal detector |
US20010027391A1 (en) * | 1996-11-07 | 2001-10-04 | Matsushita Electric Industrial Co., Ltd. | Excitation vector generator, speech coder and speech decoder |
US6799160B2 (en) * | 1996-11-07 | 2004-09-28 | Matsushita Electric Industrial Co., Ltd. | Noise canceller |
US20100256975A1 (en) * | 1996-11-07 | 2010-10-07 | Panasonic Corporation | Speech coder and speech decoder |
US8036887B2 (en) | 1996-11-07 | 2011-10-11 | Panasonic Corporation | CELP speech decoder modifying an input vector with a fixed waveform to transform a waveform of the input vector |
US20050203736A1 (en) * | 1996-11-07 | 2005-09-15 | Matsushita Electric Industrial Co., Ltd. | Excitation vector generator, speech coder and speech decoder |
US7587316B2 (en) | 1996-11-07 | 2009-09-08 | Panasonic Corporation | Noise canceller |
US5974375A (en) * | 1996-12-02 | 1999-10-26 | Oki Electric Industry Co., Ltd. | Coding device and decoding device of speech signal, coding method and decoding method |
US6708146B1 (en) | 1997-01-03 | 2004-03-16 | Telecommunications Research Laboratories | Voiceband signal classifier |
US7440891B1 (en) | 1997-03-06 | 2008-10-21 | Asahi Kasei Kabushiki Kaisha | Speech processing method and apparatus for improving speech quality and speech recognition performance |
EP0969692A4 (en) * | 1997-03-06 | 2005-03-09 | Asahi Chemical Ind | Device and method for processing speech |
CN100512510C (en) * | 1997-03-06 | 2009-07-08 | 旭化成株式会社 | Device and method for processing speech |
EP0969692A1 (en) * | 1997-03-06 | 2000-01-05 | Asahi Kasei Kogyo Kabushiki Kaisha | Device and method for processing speech |
WO1998048407A3 (en) * | 1997-04-18 | 1999-02-11 | Nokia Telecommunications Oy | Speech detection in a telecommunication system |
WO1998048407A2 (en) * | 1997-04-18 | 1998-10-29 | Nokia Networks Oy | Speech detection in a telecommunication system |
US5970441A (en) * | 1997-08-25 | 1999-10-19 | Telefonaktiebolaget Lm Ericsson | Detection of periodicity information from an audio signal |
US6531982B1 (en) | 1997-09-30 | 2003-03-11 | Sirf Technology, Inc. | Field unit for use in a GPS system |
US6134524A (en) * | 1997-10-24 | 2000-10-17 | Nortel Networks Corporation | Method and apparatus to detect and delimit foreground speech |
US6526378B1 (en) * | 1997-12-08 | 2003-02-25 | Mitsubishi Denki Kabushiki Kaisha | Method and apparatus for processing sound signal |
US6205423B1 (en) * | 1998-01-13 | 2001-03-20 | Conexant Systems, Inc. | Method for coding speech containing noise-like speech periods and/or having background noise |
US6023674A (en) * | 1998-01-23 | 2000-02-08 | Telefonaktiebolaget L M Ericsson | Non-parametric voice activity detection |
US6327471B1 (en) | 1998-02-19 | 2001-12-04 | Conexant Systems, Inc. | Method and an apparatus for positioning system assisted cellular radiotelephone handoff and dropoff |
US6182035B1 (en) | 1998-03-26 | 2001-01-30 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for detecting voice activity |
US6285979B1 (en) * | 1998-03-27 | 2001-09-04 | Avr Communications Ltd. | Phoneme analyzer |
US6348744B1 (en) | 1998-04-14 | 2002-02-19 | Conexant Systems, Inc. | Integrated power management module |
USD419160S (en) * | 1998-05-14 | 2000-01-18 | Northrop Grumman Corporation | Personal communications unit docking station |
US6480723B1 (en) | 1998-05-15 | 2002-11-12 | Northrop Grumman Corporation | Communications interface adapter |
US6141426A (en) * | 1998-05-15 | 2000-10-31 | Northrop Grumman Corporation | Voice operated switch for use in high noise environments |
US6169730B1 (en) | 1998-05-15 | 2001-01-02 | Northrop Grumman Corporation | Wireless communications protocol |
USD421002S (en) * | 1998-05-15 | 2000-02-22 | Northrop Grumman Corporation | Personal communications unit handset |
US6041243A (en) * | 1998-05-15 | 2000-03-21 | Northrop Grumman Corporation | Personal communications unit |
US6243573B1 (en) | 1998-05-15 | 2001-06-05 | Northrop Grumman Corporation | Personal communications system |
US6223062B1 (en) | 1998-05-15 | 2001-04-24 | Northrop Grumann Corporation | Communications interface adapter |
US6304559B1 (en) | 1998-05-15 | 2001-10-16 | Northrop Grumman Corporation | Wireless communications protocol |
US6393396B1 (en) * | 1998-07-29 | 2002-05-21 | Canon Kabushiki Kaisha | Method and apparatus for distinguishing speech from noise |
US6453285B1 (en) * | 1998-08-21 | 2002-09-17 | Polycom, Inc. | Speech activity detector for use in noise reduction system, and methods therefor |
US7545854B1 (en) | 1998-09-01 | 2009-06-09 | Sirf Technology, Inc. | Doppler corrected spread spectrum matched filter |
US20050025222A1 (en) * | 1998-09-01 | 2005-02-03 | Underbrink Paul A. | System and method for despreading in a spread spectrum matched filter |
US7711038B1 (en) | 1998-09-01 | 2010-05-04 | Sirf Technology, Inc. | System and method for despreading in a spread spectrum matched filter |
US7852905B2 (en) | 1998-09-01 | 2010-12-14 | Sirf Technology, Inc. | System and method for despreading in a spread spectrum matched filter |
US6693953B2 (en) | 1998-09-30 | 2004-02-17 | Skyworks Solutions, Inc. | Adaptive wireless communication receiver |
US6424938B1 (en) * | 1998-11-23 | 2002-07-23 | Telefonaktiebolaget L M Ericsson | Complex signal activity detection for improved speech/noise classification of an audio signal |
US6606349B1 (en) | 1999-02-04 | 2003-08-12 | Sirf Technology, Inc. | Spread spectrum receiver performance improvement |
US6453291B1 (en) * | 1999-02-04 | 2002-09-17 | Motorola, Inc. | Apparatus and method for voice activity detection in a communication system |
US6448925B1 (en) | 1999-02-04 | 2002-09-10 | Conexant Systems, Inc. | Jamming detection and blanking for GPS receivers |
US6556967B1 (en) | 1999-03-12 | 2003-04-29 | The United States Of America As Represented By The National Security Agency | Voice activity detector |
US6577271B1 (en) | 1999-03-30 | 2003-06-10 | Sirf Technology, Inc | Signal detector employing coherent integration |
US6636178B2 (en) | 1999-03-30 | 2003-10-21 | Sirf Technology, Inc. | Signal detector employing correlation analysis of non-uniform and disjoint sample segments |
US7002516B2 (en) | 1999-03-30 | 2006-02-21 | Sirf Technology, Inc. | Signal detector employing correlation analysis of non-uniform and disjoint sample segments |
US6496145B2 (en) | 1999-03-30 | 2002-12-17 | Sirf Technology, Inc. | Signal detector employing coherent integration |
US6304216B1 (en) | 1999-03-30 | 2001-10-16 | Conexant Systems, Inc. | Signal detector employing correlation analysis of non-uniform and disjoint sample segments |
US20050035905A1 (en) * | 1999-03-30 | 2005-02-17 | Gronemeyer Steven A. | Signal detector employing correlation analysis of non-uniform and disjoint sample segments |
US6618701B2 (en) * | 1999-04-19 | 2003-09-09 | Motorola, Inc. | Method and system for noise suppression using external voice activity detection |
US6381568B1 (en) | 1999-05-05 | 2002-04-30 | The United States Of America As Represented By The National Security Agency | Method of transmitting speech using discontinuous transmission and comfort noise |
US6519277B2 (en) | 1999-05-25 | 2003-02-11 | Sirf Technology, Inc. | Accelerated selection of a base station in a wireless communication system |
US20070150264A1 (en) * | 1999-09-20 | 2007-06-28 | Onur Tackin | Voice And Data Exchange Over A Packet Based Network With Voice Detection |
US7653536B2 (en) * | 1999-09-20 | 2010-01-26 | Broadcom Corporation | Voice and data exchange over a packet based network with voice detection |
US10389657B1 (en) * | 1999-11-05 | 2019-08-20 | Open Invention Network, Llc. | System and method for voice transmission over network protocols |
US7269511B2 (en) | 2000-04-18 | 2007-09-11 | Sirf Technology, Inc. | Method and system for data detection in a global positioning system satellite receiver |
US20040172195A1 (en) * | 2000-04-18 | 2004-09-02 | Underbrink Paul A. | Method and system for data detection in a global positioning system satellite receiver |
US20050264446A1 (en) * | 2000-04-18 | 2005-12-01 | Underbrink Paul A | Method and system for data detection in a global positioning system satellite receiver |
US6931055B1 (en) | 2000-04-18 | 2005-08-16 | Sirf Technology, Inc. | Signal detector employing a doppler phase correction system |
US6714158B1 (en) | 2000-04-18 | 2004-03-30 | Sirf Technology, Inc. | Method and system for data detection in a global positioning system satellite receiver |
US6952440B1 (en) | 2000-04-18 | 2005-10-04 | Sirf Technology, Inc. | Signal detector employing a Doppler phase correction system |
US6961660B2 (en) | 2000-04-18 | 2005-11-01 | Sirf Technology, Inc. | Method and system for data detection in a global positioning system satellite receiver |
US6788655B1 (en) | 2000-04-18 | 2004-09-07 | Sirf Technology, Inc. | Personal communications device with ratio counter |
US7885314B1 (en) | 2000-05-02 | 2011-02-08 | Kenneth Scott Walley | Cancellation system and method for a wireless positioning system |
US6741873B1 (en) * | 2000-07-05 | 2004-05-25 | Motorola, Inc. | Background noise adaptable speaker phone for use in a mobile communication device |
US9196261B2 (en) | 2000-07-19 | 2015-11-24 | Aliphcom | Voice activity detector (VAD)—based multiple-microphone acoustic noise suppression |
US10225649B2 (en) | 2000-07-19 | 2019-03-05 | Gregory C. Burnett | Microphone array with rear venting |
US20050091053A1 (en) * | 2000-09-12 | 2005-04-28 | Pioneer Corporation | Voice recognition system |
US7035798B2 (en) * | 2000-09-12 | 2006-04-25 | Pioneer Corporation | Speech recognition system including speech section detecting section |
US20020049592A1 (en) * | 2000-09-12 | 2002-04-25 | Pioneer Corporation | Voice recognition system |
US20020046026A1 (en) * | 2000-09-12 | 2002-04-18 | Pioneer Corporation | Voice recognition system |
US20080221887A1 (en) * | 2000-10-13 | 2008-09-11 | At&T Corp. | Systems and methods for dynamic re-configurable speech recognition |
US7457750B2 (en) * | 2000-10-13 | 2008-11-25 | At&T Corp. | Systems and methods for dynamic re-configurable speech recognition |
US9536524B2 (en) | 2000-10-13 | 2017-01-03 | At&T Intellectual Property Ii, L.P. | Systems and methods for dynamic re-configurable speech recognition |
US20020046022A1 (en) * | 2000-10-13 | 2002-04-18 | At&T Corp. | Systems and methods for dynamic re-configurable speech recognition |
US8719017B2 (en) | 2000-10-13 | 2014-05-06 | At&T Intellectual Property Ii, L.P. | Systems and methods for dynamic re-configurable speech recognition |
US8942383B2 (en) | 2001-05-30 | 2015-01-27 | Aliphcom | Wind suppression/replacement component for use with electronic systems |
KR100399057B1 (en) * | 2001-08-07 | 2003-09-26 | 한국전자통신연구원 | Apparatus for Voice Activity Detection in Mobile Communication System and Method Thereof |
US20050044471A1 (en) * | 2001-11-15 | 2005-02-24 | Chia Pei Yen | Error concealment apparatus and method |
US7359856B2 (en) | 2001-12-05 | 2008-04-15 | France Telecom | Speech detection system in an audio signal in noisy surrounding |
US20050143978A1 (en) * | 2001-12-05 | 2005-06-30 | France Telecom | Speech detection system in an audio signal in noisy surrounding |
WO2003048711A3 (en) * | 2001-12-05 | 2004-02-12 | France Telecom | Speech detection system in an audio signal in noisy surrounding |
US7999733B2 (en) | 2001-12-13 | 2011-08-16 | Sirf Technology Inc. | Fast reacquisition of a GPS signal |
US6778136B2 (en) | 2001-12-13 | 2004-08-17 | Sirf Technology, Inc. | Fast acquisition of GPS signal |
US20080198069A1 (en) * | 2001-12-13 | 2008-08-21 | Gronemeyer Steven A | Fast Reacquisition of a GPS Signal |
US20030120487A1 (en) * | 2001-12-20 | 2003-06-26 | Hitachi, Ltd. | Dynamic adjustment of noise separation in data handling, particularly voice activation |
US7146314B2 (en) | 2001-12-20 | 2006-12-05 | Renesas Technology Corporation | Dynamic adjustment of noise separation in data handling, particularly voice activation |
US20070233479A1 (en) * | 2002-05-30 | 2007-10-04 | Burnett Gregory C | Detecting voiced and unvoiced speech using both acoustic and nonacoustic sensors |
US20040042626A1 (en) * | 2002-08-30 | 2004-03-04 | Balan Radu Victor | Multichannel voice detection in adverse environments |
US7146315B2 (en) * | 2002-08-30 | 2006-12-05 | Siemens Corporate Research, Inc. | Multichannel voice detection in adverse environments |
US20040064314A1 (en) * | 2002-09-27 | 2004-04-01 | Aubert Nicolas De Saint | Methods and apparatus for speech end-point detection |
US20040078200A1 (en) * | 2002-10-17 | 2004-04-22 | Clarity, Llc | Noise reduction in subbanded speech signals |
US7146316B2 (en) | 2002-10-17 | 2006-12-05 | Clarity Technologies, Inc. | Noise reduction in subbanded speech signals |
US9066186B2 (en) | 2003-01-30 | 2015-06-23 | Aliphcom | Light-based detection for acoustic applications |
US9099094B2 (en) | 2003-03-27 | 2015-08-04 | Aliphcom | Microphone array with rear venting |
US20050154583A1 (en) * | 2003-12-25 | 2005-07-14 | Nobuhiko Naka | Apparatus and method for voice activity detection |
US8442817B2 (en) * | 2003-12-25 | 2013-05-14 | Ntt Docomo, Inc. | Apparatus and method for voice activity detection |
US20050171769A1 (en) * | 2004-01-28 | 2005-08-04 | Ntt Docomo, Inc. | Apparatus and method for voice activity detection |
US20050209762A1 (en) * | 2004-03-18 | 2005-09-22 | Ford Global Technologies, Llc | Method and apparatus for controlling a vehicle using an object detection system and brake-steer |
US20050246166A1 (en) * | 2004-04-28 | 2005-11-03 | International Business Machines Corporation | Componentized voice server with selectable internal and external speech detectors |
US7925510B2 (en) * | 2004-04-28 | 2011-04-12 | Nuance Communications, Inc. | Componentized voice server with selectable internal and external speech detectors |
US20060025992A1 (en) * | 2004-07-27 | 2006-02-02 | Yoon-Hark Oh | Apparatus and method of eliminating noise from a recording device |
US20060053007A1 (en) * | 2004-08-30 | 2006-03-09 | Nokia Corporation | Detection of voice activity in an audio signal |
US8131553B2 (en) | 2004-12-22 | 2012-03-06 | David Attwater | Turn-taking model |
US20080004881A1 (en) * | 2004-12-22 | 2008-01-03 | David Attwater | Turn-taking model |
US20100324896A1 (en) * | 2004-12-22 | 2010-12-23 | Enterprise Integration Group, Inc. | Turn-taking confidence |
US20060206330A1 (en) * | 2004-12-22 | 2006-09-14 | David Attwater | Mode confidence |
US20060206329A1 (en) * | 2004-12-22 | 2006-09-14 | David Attwater | Turn-taking confidence |
US7809569B2 (en) | 2004-12-22 | 2010-10-05 | Enterprise Integration Group, Inc. | Turn-taking confidence |
US7970615B2 (en) | 2004-12-22 | 2011-06-28 | Enterprise Integration Group, Inc. | Turn-taking confidence |
US20100017212A1 (en) * | 2004-12-22 | 2010-01-21 | David Attwater | Turn-taking model |
US20060200350A1 (en) * | 2004-12-22 | 2006-09-07 | David Attwater | Multi dimensional confidence |
US20060217973A1 (en) * | 2005-03-24 | 2006-09-28 | Mindspeed Technologies, Inc. | Adaptive voice mode extension for a voice activity detector |
US7983906B2 (en) * | 2005-03-24 | 2011-07-19 | Mindspeed Technologies, Inc. | Adaptive voice mode extension for a voice activity detector |
DE102006032967B4 (en) * | 2005-07-28 | 2012-04-19 | S. Siedle & Söhne Telefon- und Telegrafenwerke OHG | House plant and method for operating a house plant |
US20070033042A1 (en) * | 2005-08-03 | 2007-02-08 | International Business Machines Corporation | Speech detection fusing multi-class acoustic-phonetic, and energy features |
US8781832B2 (en) | 2005-08-22 | 2014-07-15 | Nuance Communications, Inc. | Methods and apparatus for buffering data for use in accordance with a speech recognition system |
US20070043563A1 (en) * | 2005-08-22 | 2007-02-22 | International Business Machines Corporation | Methods and apparatus for buffering data for use in accordance with a speech recognition system |
US20080172228A1 (en) * | 2005-08-22 | 2008-07-17 | International Business Machines Corporation | Methods and Apparatus for Buffering Data for Use in Accordance with a Speech Recognition System |
US7962340B2 (en) | 2005-08-22 | 2011-06-14 | Nuance Communications, Inc. | Methods and apparatus for buffering data for use in accordance with a speech recognition system |
US8204754B2 (en) * | 2006-02-10 | 2012-06-19 | Telefonaktiebolaget L M Ericsson (Publ) | System and method for an improved voice detector |
US20090055173A1 (en) * | 2006-02-10 | 2009-02-26 | Martin Sehlstedt | Sub band vad |
US9646621B2 (en) | 2006-02-10 | 2017-05-09 | Telefonaktiebolaget Lm Ericsson (Publ) | Voice detector and a method for suppressing sub-bands in a voice detector |
US8977556B2 (en) * | 2006-02-10 | 2015-03-10 | Telefonaktiebolaget Lm Ericsson (Publ) | Voice detector and a method for suppressing sub-bands in a voice detector |
WO2007091956A2 (en) | 2006-02-10 | 2007-08-16 | Telefonaktiebolaget Lm Ericsson (Publ) | A voice detector and a method for suppressing sub-bands in a voice detector |
US20120185248A1 (en) * | 2006-02-10 | 2012-07-19 | Telefonaktiebolaget Lm Ericsson (Publ) | Voice detector and a method for suppressing sub-bands in a voice detector |
US8920343B2 (en) | 2006-03-23 | 2014-12-30 | Michael Edward Sabatino | Apparatus for acquiring and processing of physiological auditory signals |
US11357471B2 (en) | 2006-03-23 | 2022-06-14 | Michael E. Sabatino | Acquiring and processing acoustic energy emitted by at least one organ in a biological system |
US8870791B2 (en) | 2006-03-23 | 2014-10-28 | Michael E. Sabatino | Apparatus for acquiring, processing and transmitting physiological sounds |
US20080133226A1 (en) * | 2006-09-21 | 2008-06-05 | Spreadtrum Communications Corporation | Methods and apparatus for voice activity detection |
US7921008B2 (en) * | 2006-09-21 | 2011-04-05 | Spreadtrum Communications, Inc. | Methods and apparatus for voice activity detection |
US20100322366A1 (en) * | 2006-12-06 | 2010-12-23 | Electronics And Telecommunications Research Institute | Method for detecting frame synchronization and structure in dvb-s2 system |
US8422604B2 (en) * | 2006-12-06 | 2013-04-16 | Electronics And Telecommunications Research Institute | Method for detecting frame synchronization and structure in DVB-S2 system |
US11122357B2 (en) | 2007-06-13 | 2021-09-14 | Jawbone Innovations, Llc | Forming virtual microphone arrays using dual omnidirectional microphone array (DOMA) |
US8244528B2 (en) | 2008-04-25 | 2012-08-14 | Nokia Corporation | Method and apparatus for voice activity determination |
US20110051953A1 (en) * | 2008-04-25 | 2011-03-03 | Nokia Corporation | Calibrating multiple microphones |
US8682662B2 (en) | 2008-04-25 | 2014-03-25 | Nokia Corporation | Method and apparatus for voice activity determination |
US8611556B2 (en) | 2008-04-25 | 2013-12-17 | Nokia Corporation | Calibrating multiple microphones |
US8275136B2 (en) | 2008-04-25 | 2012-09-25 | Nokia Corporation | Electronic device speech enhancement |
US20090271190A1 (en) * | 2008-04-25 | 2009-10-29 | Nokia Corporation | Method and Apparatus for Voice Activity Determination |
US20090316918A1 (en) * | 2008-04-25 | 2009-12-24 | Nokia Corporation | Electronic Device Speech Enhancement |
US8121844B2 (en) * | 2008-06-02 | 2012-02-21 | Nippon Steel Corporation | Dimension measurement system |
US20110066439A1 (en) * | 2008-06-02 | 2011-03-17 | Kengo Nakao | Dimension measurement system |
CN102460190A (en) * | 2009-06-23 | 2012-05-16 | 瑞典爱立信有限公司 | Method and an arrangement for a mobile telecommunications network |
WO2010151183A1 (en) * | 2009-06-23 | 2010-12-29 | Telefonaktiebolaget L M Ericsson (Publ) | Method and an arrangement for a mobile telecommunications network |
US7996215B1 (en) | 2009-10-15 | 2011-08-09 | Huawei Technologies Co., Ltd. | Method and apparatus for voice activity detection, and encoder |
US20110184734A1 (en) * | 2009-10-15 | 2011-07-28 | Huawei Technologies Co., Ltd. | Method and apparatus for voice activity detection, and encoder |
US20120197642A1 (en) * | 2009-10-15 | 2012-08-02 | Huawei Technologies Co., Ltd. | Signal processing method, device, and system |
WO2011044842A1 (en) * | 2009-10-15 | 2011-04-21 | 华为技术有限公司 | Method,device and coder for voice activity detection |
US11361784B2 (en) | 2009-10-19 | 2022-06-14 | Telefonaktiebolaget Lm Ericsson (Publ) | Detector and method for voice activity detection |
US20110125497A1 (en) * | 2009-11-20 | 2011-05-26 | Takahiro Unno | Method and System for Voice Activity Detection |
US10134417B2 (en) | 2010-12-24 | 2018-11-20 | Huawei Technologies Co., Ltd. | Method and apparatus for detecting a voice activity in an input audio signal |
US9761246B2 (en) * | 2010-12-24 | 2017-09-12 | Huawei Technologies Co., Ltd. | Method and apparatus for detecting a voice activity in an input audio signal |
US10796712B2 (en) | 2010-12-24 | 2020-10-06 | Huawei Technologies Co., Ltd. | Method and apparatus for detecting a voice activity in an input audio signal |
US20160260443A1 (en) * | 2010-12-24 | 2016-09-08 | Huawei Technologies Co., Ltd. | Method and apparatus for detecting a voice activity in an input audio signal |
US11430461B2 (en) | 2010-12-24 | 2022-08-30 | Huawei Technologies Co., Ltd. | Method and apparatus for detecting a voice activity in an input audio signal |
US20140119461A1 (en) * | 2011-08-25 | 2014-05-01 | Mitsubishi Electric Corporation | Signal transmission device |
US9160483B2 (en) * | 2011-08-25 | 2015-10-13 | Mitsubishi Electric Corporation | Signal transmission device with data length changer |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5276765A (en) | Voice activity detection | |
EP0335521B1 (en) | Voice activity detection | |
US4630304A (en) | Automatic background noise estimator for a noise suppression system | |
US3740476A (en) | Speech signal pitch detector using prediction error data | |
US5091948A (en) | Speaker recognition with glottal pulse-shapes | |
US5579435A (en) | Discriminating between stationary and non-stationary signals | |
US5749067A (en) | Voice activity detector | |
CA1123955A (en) | Speech analysis and synthesis apparatus | |
US5970441A (en) | Detection of periodicity information from an audio signal | |
GB1533337A (en) | Speech analysis and synthesis system | |
US5579432A (en) | Discriminating between stationary and non-stationary signals | |
JP3105465B2 (en) | Voice section detection method | |
US5632004A (en) | Method and apparatus for encoding/decoding of background sounds | |
US4219695A (en) | Noise estimation system for use in speech analysis | |
FI111572B (en) | Procedure for processing speech in the presence of acoustic interference | |
JPH08221097A (en) | Detection method of audio component | |
US4972490A (en) | Distance measurement control of a multiple detector system | |
Vahatalo et al. | Voice activity detection for GSM adaptive multi-rate codec | |
JPH01502858A (en) | Apparatus and method for detecting the presence of fundamental frequencies in audio frames | |
Cole et al. | A real-time floating point variable frame rate LPC vocoder | |
Prasad et al. | A 2.4 Kilobits Per Second Linear Prediction Vocoder | |
Cohen et al. | Spectral Enha | |
NZ286953A (en) | Speech encoder/decoder: discriminating between speech and background sound | |
JPH0827637B2 (en) | Voice / silence judgment circuit |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
FPAY | Fee payment |
Year of fee payment: 4 |
|
FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
FPAY | Fee payment |
Year of fee payment: 8 |
|
AS | Assignment |
Owner name: LG ELECTRONICS INC., KOREA, REPUBLIC OF Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BRITISH TELECOMMUNICATIONS PUBLIC LIMITED COMPANY;REEL/FRAME:014609/0658 Effective date: 20030310 |
|
FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
FPAY | Fee payment |
Year of fee payment: 12 |