US3740476A - Speech signal pitch detector using prediction error data - Google Patents
Speech signal pitch detector using prediction error data Download PDFInfo
- Publication number
- US3740476A US3740476A US00161173A US3740476DA US3740476A US 3740476 A US3740476 A US 3740476A US 00161173 A US00161173 A US 00161173A US 3740476D A US3740476D A US 3740476DA US 3740476 A US3740476 A US 3740476A
- Authority
- US
- United States
- Prior art keywords
- signal
- speech
- speech signal
- samples
- developing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
Definitions
- the prediction error 325 33 A nal is devoid of all formant structure, so that there is no chance of confusing pitch signal peaks with formant 5 7 References Cited peaks.
- a voiced-unvoiced decision is obtained from the 'UNlTED STATES PATENTS ratio of the mean-squared value of the speechsignal 'to the mean-squared value of the prediction errorsignal.
- This invention is concerned with the analysis of complex signals, and particularly with the determination of the fundamental frequency, or period, of a complex periodic signal, such as a voiced speech signal. Its principal objectives are to simplify the measurement of pitch frequency and to improve the reliability of the measure.
- SUMMARY OF THE INVENTION Analysis of a complex speech signal to determine its pitch frequency is, in accordance with the invention, based on an analysis of the error between a predicted value of the speech signal based on its past sample values and its actual value at that moment.
- the time interval represented by the number of samples used to ob tain the predicted value is typically 1 msec. Due to the short memory used in the prediction process, the predicted signal values represent, in large measure, the formant structure of the speech signal.
- the pitch analysis arrangement of the invention is particularly effective because, in developing a difference signal, i.e., the prediction error signal, the formant structure of the signal is removed from the input signal.
- a feature of the invention is the additional use of prediction error samples to develop a voiced-unvoiced signal indication.
- a voicing decision is based on the ratio of the meansquared value of input signal samples to the meansquared value of corresponding prediction error samples.
- FIG. 1 is a block schematic diagram of a speech signal analysis system which illustrates the principles of the invention.
- FIG. 2 is an illustration of the waveform of a segment of a voiced speech signal, the positions of detected pitch pulses in the voiced speech signal, as shown by vertical lines, and a segment of unvoiced speech.
- FIG. 1 A signal analysis arrangement which illustrates the principles of the invention is illustrated in FIG. 1.
- Speech signals supplied from any desired source are delivered to the analyzer and passed through low-pass filter 10.
- Filter 10 typically has a cutoff frequency in the neighborhood of 5 kHz.
- the resultant signal is then sampled at a frequency of approximately 10 kHz in sampler 11 under control of signals from clock 12.
- Speech samples, s,,, thus derived are supplied to storage unit 13 which maintains them in order, typically in blocks of 200 samples, i.e., s s S200.
- Blocks or frames of samples are periodically keyed out of storage unit 13, for example, under control of a signal from clock 12, and delivered to adaptive predictor l4, prediction parameter computer 15, and to subtractor network 16.
- Adaptive predictor 14 operates on supplied signal samples' to predict the present value of each sample on the basis of a weighted summation of a number of prior sample values. The prediction operation is carried out on a sample-by-sample basis and predictor 14 is periodically supplied with a new frame of samples from storage unit 13.
- An adaptive predictor suitable for use in the system of this invention is described in detail in a copending application of B. S. Atal, Ser. No. 753,408, filed Aug. 19, 1968, now U.S. Pat. No. 3,631,520.
- predictor I4 is controlled to adapt it to the current signal condition. It has been found sufficient to readjust the values of the parame' ters used to control the predictor at intervals comparable to those of a pitch period of the signal. Since the exact pitch interval is not available (although the pitch output signal of the system may be used in a feedback arrangement to approximate the interval of a later pitch period), readjustment of the parameter values at intervals corresponding approximately to the time of 200 samples is entirely satisfactory. This corresponds to a time interval of approximately 20 msec.
- Prediction parameter computer thus operates on applied speech samples from unit 13 to develop a sequence of parameter signals a a a a,,, which are used periodically to adjust predictor 14.
- Parameter values a are selected to minimize the mean-squared prediction error of the system.
- Sample values developed by predictor 14 are subtracted in network 16 fromthe actual value of corresponding signal samples delivered from storage unit 13 to the subtractor.
- the resultant difference signal represents the error in predicting the value of the signal. It is accordingly called a prediction error signal.
- appropriate delay is provided, for example, in the readout of samples from storage unit 13 or in their delivery to subtractor 16, to allow time for all predictor operations to be completed. Suffice it to say that all of the described operations are carried on in synchronism in a conventional manner.
- the values of signal samples are predicted largely on the basis of their formant constituency. Predicted signals, therefore, represent essentially the formant structure of the input signal. Since the predicted signal values are subtracted from actual signal values, the prediction error signal at the output of subtractor network 16 is essentially devoid of all formant information. Yet, the prediction error signal has been found to preserve, and indeed to denote, the pitch character of the applied signal.
- Prediction error signals from subtractor 16 are passed through low-pass filter 17.
- Filter 17 is constructed with a relatively low cutoff frequency since the fundamental pitch of the applied signal generally is in the lower portion of the band. Elimination of higher frequency portions aids in isolating the pitch signal.
- the positions of individual pitch pulses in the applied signal is determined by locating the samples for which the prediction error is large.
- Samples delivered from filter 17 thus have amplitudes that are proportional to the difference between the applied signal sample and the predicted signal. It is necessary, therefore, only to seek the fundamental frequency of the prediction (error) signal.
- This may be done using any desired fundamental frequency detector 18 of any desired construction.
- a suitable detector includes a half-wave rectifier 19, employed to retain positive peaks only of the signal in order to simplify later operations.
- the rectified signal is delivered to peak picking network 20, which seeks the largest sample in each frame of signals.
- peak picking arrangements are well known to those skilled in the art and are frequently used in pitch detection arrangements, particularly those of the cepstrum type.
- Peak signals thus developed are passed through threshold detector 21', adjusted to a level selected to prevent minor peaks from reaching the output of the analyzer.
- the threshold is adjusted to accommodate the true fundamental frequency peaks determined, for example, from experience.
- the resulting sequence of pitch pulses is indicative of the fundamental frequency or period of the applied speech signal and may be used in any desired fashion.
- the fundamental frequency detector may include an autocorrelator followed by a peak picker and a threshold detector.
- FIG. 2 illustrates a typical interval of a speech signal.
- a voiced speech segment is shown in line A.
- Line B illustrates the sequence of pulses derived from fundamental frequency detector 18 as the output signal of the analyzer system.
- Line C of the figure illustrates a typical unvoiced segment of speech.
- the voiced-unvoiced decision is based on the ratio of the mean-squared value of speech samples to the meansquared value of prediction error samples. It has been found that this ratio is considerably smaller for unvoiced speech sounds than for voiced speech sounds, typically by a factor of approximately 10.
- speech samples from sampler 1 1 are delivered to mean-squared network 22 and prediction error samples from subtractor 16 are delivered to mean-squared network 23.
- Networks for deriving a signal proportional to the mean value of sequence of samples are well known in the art and are frequently used in acoustic signal processing apparatus.
- a typical network includes an arrangement for developing a signal proportional to the square of each signal sample, an adding network for summing a sequence of squared signal values, and a divider network for developing a signal proportional to the average,or mean value, of the summed squared signals.
- Two signals proportional, respectively, to the meansquared value of speech samples and the mean-squared value of prediction error samples are delivered to divider network 24 which produces as its output the quotient of the two signal values.
- the quotient signal is thereupon delivered to threshold detector 25, which is arranged to develop a first signal for quotient values greater than 10, as an indication of a voiced signal interval, and a second signal for quotients less than 10, as an indication of an unvoiced'signal interval.
- Output signals from detector 25 maybe used in any desired fashion to indicate the voicing character of the input signal.
- the fundamental frequency determination arrangement of the invention greatly enhances the reliability with which two important characteristics of a speech signal are determined. This increased reliability is due primarily to the virtual absence of formant structure in the signal at the time the pitch measurement is made.
- the fundamental frequency detector of the invention is particularly applicable to use in a speech transmission system or a speech analysis system in which a linear prediction arrangement is used. In such cases, it is evident that the prediction error signal delivered to subtractor 16 may be derived from the predictor used in coding the speech signals.
- the voicing decision signal may be used in conjunction with other criteria, such as the spectral balance of low frequencies related to high frequencies to make the voicedunvoiced decision more reliable.
- a signal analyzer for determining the fundamental period of a speech signal whichcomprises,
- adaptive predictor means supplied with samples of said speech signal for predicting the present value of each sample on the basis of a weighted summation of a number of prior sample values of said speech signal
- a signal analyzer as defined in claim 1, wherein said means for determining the fundamental frequency of said difference signal comprises,
- a signal analyzer as defined in claim 1, wherein said means for determining the fundamental frequency of said difference signal comprises, t
- Apparatus for determining the fundamental period of a speech signal which comprises,
- Apparatus for determining the fundamental period of a speech signal which comprises,
- adaptive predictor means supplied with samples of said speech signal for developing an estimate of the momentary value of said speech signal from previously supplied samples, means for developing a prediction error signal from the difference between said predicted signal estimate and the corresponding momentary value of samples of said speech signal, means for identifying prediction error samples whose magnitudes are above a prescribed threshold, and means for utilizing the frequency of occurrence of said identified error samples as a measure of the fundamental period of said speech signal.
- predictor means supplied with samples of a speech signal for developing an estimate of the momentary value of said signal from previously supplied samples, means for developing prediction error signal samples from the difference between samples of said signal estimate and the corresponding momentary value of samples of said speech signal, means for identifying prediction error samples whose magnitudes are above a prescribed threshold, means for developing a first signal proportional to the mean-squared value of said speech samples, means for developing a second signal proportional to the mean-squared value of corresponding ones of said error samples, means for developing a signal proportional to the ratio of said first to said second mean-squared signals, means for utilizing the frequency of occurrence of said identified threshold error samples as a measure of the fundamental period of said speech signal, and means for utilizing said ratio of first and second mean-squared signals as a measure of the voicing characteristic of said speech signal.
- values of said ratio of mean-squared signals equal to or greater than a prescribed threshold are used to classify said speech signal as voiced, and wherein values of said ratio of mean-squared signals less than said threshold are used to classify said speech signal as unvoiced.
- means for developing a signal representative of the formant structure of an applied speech signal means for removing said formant representative signal from said speech signal to produce a signal essentially devoid of all formant information, means for measuring the period of said formant devoid signal, and means for determining the voicing character of said speech signal on the basis of the power in said speech signal and the power in said formant devoid signal.
Abstract
Pitch periods in a complex speech signal are determined by evaluating the error in predicting the value of a sample of the signal on the basis of past sample values, and by locating samples for which the prediction error is large. Advantageously, the prediction error signal is devoid of all formant structure, so that there is no chance of confusing pitch signal peaks with formant peaks. A voiced-unvoiced decision is obtained from the ratio of the mean-squared value of the speech signal to the meansquared value of the prediction error signal.
Description
United States Patent 1 Atal [ June 19, 1973 PREDICTION PARAMETER COMPUTER [54] SPEECH SIGNAL PITCH DETECTOR USING 2,732,424 1/1956 Oliver 179/15.55 R PREDICTION ERROR DATA 3,026,375 3/1962 Graham 179/1 SA 3,420,955 l/l969 Noll 179/l SA [75] Inventor: Bishnu Saroop Atal, Murray Hill,
Primary Examiner-Kathleen H. Claffy [73] Assignee: Bell Telephone Laboratories, Assistant BradfordiL'eaheey lncorporated, Mun-3y Hill, J Attorney-R. J. Guenther and William L. Keefauver [22] Filed: July 9, 1971 ABSTRACT lzll PP 161,173 Pitch periods in a complex speech signal are determined by evaluating the error in predicting the value of ['52] US. Cl 179/1 SA a Sample of the Signal on the basis of P Sample [51] Int. Cl. G101 1/04 and y locating Samples for which the Prediction [58] Field of Search 179/1 SA, 15.55 R; error is large Advantageously, the prediction error 325 33 A nal is devoid of all formant structure, so that there is no chance of confusing pitch signal peaks with formant 5 7 References Cited peaks. A voiced-unvoiced decision is obtained from the 'UNlTED STATES PATENTS ratio of the mean-squared value of the speechsignal 'to the mean-squared value of the prediction errorsignal. 3,437,757 4/1969 Coker 179/1 SA 3,405,237 10/1968 David '179/1 SA 8 Claims, 2 Drawing Figures THRESHOLD 25 DETECTOR I VOICED UNVOICED J,24 23 SIGNAL MEAN- MEAN- SQUARE DIVIDER SQUARE I NETWORK NETWORK '10 n 13 LR PlTCH PULSES OUT FILTER SAMPLER m l8 SPEEC l6 Iv-l. ,19 20' 2| H l2 SUBTRACTOR L.F? PEAK THRE SlGlldALS NETWORK FILTER I RECT' PlCKER l" DETEET O R I CLOCK [M l -J ADAPTIVE PREDICTOR SPEECH SIGNAL PITCH DETECTOR USING PREDICTION ERROR DATA This invention is concerned with the analysis of complex signals, and particularly with the determination of the fundamental frequency, or period, of a complex periodic signal, such as a voiced speech signal. Its principal objectives are to simplify the measurement of pitch frequency and to improve the reliability of the measure.
BACKGROUND OF THE INVENTION A number of arrangements for reducing the channel capacity required for the transmission of complex signals, such as speech signals, have been proposed. One of the best known of these is the vocoder. More recently, techniques for removing inherent signal redundancy through the use of linear prediction techniques have been described. In all of these arrangements, a speech wave is analyzed to determine its significant characteristics, and coded information concerning these characteristics is transmitted instead of the speech signal itself. At a receiver station a synthetic speech signal is developed from the coded information.
In general, a different set of coded signal information is employed in each type of bandwidth compression system. However, virtually all employ one characteristic of the speech signal, namely, its pitch frequency. This characteristic denotes the fundamental frequency at which the vocal cords vibrate during the production of different voiced speech sounds. Most speech bandwidth compression systems also employ coded information to identify a speech signal as voiced or unvoiced. Some combine the two forms of information so that the pitch signal inherently specifies the voicing condition.
FIELD OF THE INVENTION.
A number of different proposals for automatically measuring andencoding the pitch characteristic of a speech signal are known and used in the art. Some rely on simple filtering, some on signal correlation, some on formant detection and tracking, and others on a transformation of the logarithm of the spectrum of a speech signal, the so-called cepstrum of the signal. All of these arrangements, however, operate on the speech signal itself and in one way or another strive to find peak values in the signal, or in a modification of it, which identify the pitch characteristic. Unfortunately, peaks due to formants, particularly the first formant of a speech signal, are often stronger than a peak developed to indicate pitch. If the two peaks are close together, it is difficult to determine which is which. Consequently, even the most sophisticated pitch detectors are subject to error and do not always correctly characterize the pitch frequency of a signal.
It is thus another object of this invention to capitalize on a unique property of a voiced speech signal to develop a measure of the pitch frequency of the signal that is unambiguous and which is entirely independent of the formant character of the speech signal.
SUMMARY OF THE INVENTION Analysis of a complex speech signal to determine its pitch frequency is, in accordance with the invention, based on an analysis of the error between a predicted value of the speech signal based on its past sample values and its actual value at that moment. The time interval represented by the number of samples used to ob tain the predicted value is typically 1 msec. Due to the short memory used in the prediction process, the predicted signal values represent, in large measure, the formant structure of the speech signal. The pitch analysis arrangement of the invention is particularly effective because, in developing a difference signal, i.e., the prediction error signal, the formant structure of the signal is removed from the input signal. Yet, since-the pitch period in speech signals ranges typically from 3 msec to 20 msec, the prediction of the pitch structure, based on 1 msec of past speech, is completely negligible. Thus, pitch information is retained in the prediction error signal. Consequently, there is little or no interference from the formant structure and a peak picking operation is effective in developing a measure of the pitch character of the input signal.
A feature of the invention is the additional use of prediction error samples to develop a voiced-unvoiced signal indication. In accordance with the invention, a voicing decision is based on the ratio of the meansquared value of input signal samples to the meansquared value of corresponding prediction error samples.
This invention will be more fully understood from the following detailed description of an illustrative embodiment of it taken together with the attached drawings.
BRIEF DESCRIPTIONOF THE DRAWINGS FIG. 1 is a block schematic diagram of a speech signal analysis system which illustrates the principles of the invention, and
FIG. 2 is an illustration of the waveform of a segment of a voiced speech signal, the positions of detected pitch pulses in the voiced speech signal, as shown by vertical lines, and a segment of unvoiced speech.
DETAILED DESCRIPTION A signal analysis arrangement which illustrates the principles of the invention is illustrated in FIG. 1. Speech signals supplied from any desired source are delivered to the analyzer and passed through low-pass filter 10. Filter 10 typically has a cutoff frequency in the neighborhood of 5 kHz. The resultant signal is then sampled at a frequency of approximately 10 kHz in sampler 11 under control of signals from clock 12.
Speech samples, s,,, thus derived are supplied to storage unit 13 which maintains them in order, typically in blocks of 200 samples, i.e., s s S200. Blocks or frames of samples are periodically keyed out of storage unit 13, for example, under control of a signal from clock 12, and delivered to adaptive predictor l4, prediction parameter computer 15, and to subtractor network 16.
Adaptive predictor 14 operates on supplied signal samples' to predict the present value of each sample on the basis of a weighted summation of a number of prior sample values. The prediction operation is carried out on a sample-by-sample basis and predictor 14 is periodically supplied with a new frame of samples from storage unit 13. An adaptive predictor suitable for use in the system of this invention is described in detail in a copending application of B. S. Atal, Ser. No. 753,408, filed Aug. 19, 1968, now U.S. Pat. No. 3,631,520.
To accommodate the constantly changing character of the input speech signal, predictor I4 is controlled to adapt it to the current signal condition. It has been found sufficient to readjust the values of the parame' ters used to control the predictor at intervals comparable to those of a pitch period of the signal. Since the exact pitch interval is not available (although the pitch output signal of the system may be used in a feedback arrangement to approximate the interval of a later pitch period), readjustment of the parameter values at intervals corresponding approximately to the time of 200 samples is entirely satisfactory. This corresponds to a time interval of approximately 20 msec.
Prediction parameter computer thus operates on applied speech samples from unit 13 to develop a sequence of parameter signals a a a a,,, which are used periodically to adjust predictor 14. Parameter values a are selected to minimize the mean-squared prediction error of the system. An extensive discussion of the relation of parameter signals a to the input signal, their development, and the manner in which they are used to control the predictor is explained in detail in the above-mentioned copending patent application. Parameter signals from computer 15 are developed well in advance of the time that a block of signals is processed in predictor 14 because of the delay inherent in the prediction operation. Typically, parameter control signals are developed within an interval corresponding to the'time of approximately 60 samples.
Sample values developed by predictor 14 are subtracted in network 16 fromthe actual value of corresponding signal samples delivered from storage unit 13 to the subtractor. The resultant difference signal represents the error in predicting the value of the signal. It is accordingly called a prediction error signal. Evidently, appropriate delay is provided, for example, in the readout of samples from storage unit 13 or in their delivery to subtractor 16, to allow time for all predictor operations to be completed. Suffice it to say that all of the described operations are carried on in synchronism in a conventional manner.
It is of importance to recognize that the values of signal samples are predicted largely on the basis of their formant constituency. Predicted signals, therefore, represent essentially the formant structure of the input signal. Since the predicted signal values are subtracted from actual signal values, the prediction error signal at the output of subtractor network 16 is essentially devoid of all formant information. Yet, the prediction error signal has been found to preserve, and indeed to denote, the pitch character of the applied signal.
Prediction error signals from subtractor 16 are passed through low-pass filter 17. Filter 17 is constructed with a relatively low cutoff frequency since the fundamental pitch of the applied signal generally is in the lower portion of the band. Elimination of higher frequency portions aids in isolating the pitch signal.
In accordance with the invention, the positions of individual pitch pulses in the applied signal is determined by locating the samples for which the prediction error is large. Samples delivered from filter 17 thus have amplitudes that are proportional to the difference between the applied signal sample and the predicted signal. It is necessary, therefore, only to seek the fundamental frequency of the prediction (error) signal. This may be done using any desired fundamental frequency detector 18 of any desired construction. A suitable detector includes a half-wave rectifier 19, employed to retain positive peaks only of the signal in order to simplify later operations. The rectified signal is delivered to peak picking network 20, which seeks the largest sample in each frame of signals. Such peak picking arrangements are well known to those skilled in the art and are frequently used in pitch detection arrangements, particularly those of the cepstrum type. Peak signals thus developed are passed through threshold detector 21', adjusted to a level selected to prevent minor peaks from reaching the output of the analyzer. The threshold is adjusted to accommodate the true fundamental frequency peaks determined, for example, from experience. The resulting sequence of pitch pulses is indicative of the fundamental frequency or period of the applied speech signal and may be used in any desired fashion.
Alternatively, as previously described in the art, the fundamental frequency detector may include an autocorrelator followed by a peak picker and a threshold detector.
FIG. 2 illustrates a typical interval of a speech signal. A voiced speech segment is shown in line A. Line B illustrates the sequence of pulses derived from fundamental frequency detector 18 as the output signal of the analyzer system. Line C of the figure illustrates a typical unvoiced segment of speech.
To assure that a clear distinction between voiced and unvoiced signal segments is available, it is in accordance with the invention to produce a voiced-unvoiced decision signal. In accordance with the invention, the voiced-unvoiced decision is based on the ratio of the mean-squared value of speech samples to the meansquared value of prediction error samples. It has been found that this ratio is considerably smaller for unvoiced speech sounds than for voiced speech sounds, typically by a factor of approximately 10.
Accordingly, speech samples from sampler 1 1 are delivered to mean-squared network 22 and prediction error samples from subtractor 16 are delivered to mean-squared network 23. Networks for deriving a signal proportional to the mean value of sequence of samples are well known in the art and are frequently used in acoustic signal processing apparatus. A typical network includes an arrangement for developing a signal proportional to the square of each signal sample, an adding network for summing a sequence of squared signal values, and a divider network for developing a signal proportional to the average,or mean value, of the summed squared signals.
Two signals proportional, respectively, to the meansquared value of speech samples and the mean-squared value of prediction error samples are delivered to divider network 24 which produces as its output the quotient of the two signal values. The quotient signal is thereupon delivered to threshold detector 25, which is arranged to develop a first signal for quotient values greater than 10, as an indication of a voiced signal interval, and a second signal for quotients less than 10, as an indication of an unvoiced'signal interval. Output signals from detector 25 maybe used in any desired fashion to indicate the voicing character of the input signal.
It will be evident to those skilled in the art that the fundamental frequency determination arrangement of the invention, together with the voicing decision arrangement, greatly enhances the reliability with which two important characteristics of a speech signal are determined. This increased reliability is due primarily to the virtual absence of formant structure in the signal at the time the pitch measurement is made. Furthermore, it will be apparent that the fundamental frequency detector of the invention is particularly applicable to use in a speech transmission system or a speech analysis system in which a linear prediction arrangement is used. In such cases, it is evident that the prediction error signal delivered to subtractor 16 may be derived from the predictor used in coding the speech signals.
Furthermore, it will be apparent that the voicing decision signal may be used in conjunction with other criteria, such as the spectral balance of low frequencies related to high frequencies to make the voicedunvoiced decision more reliable.
What is claimed is:
l. A signal analyzer for determining the fundamental period of a speech signal, whichcomprises,
adaptive predictor means supplied with samples of said speech signal for predicting the present value of each sample on the basis of a weighted summation of a number of prior sample values of said speech signal,
means for subtracting said predicted speech value from the actual speech value to develop a difference signal, and
means for determining the fundamental frequency of said difference signal as an indication of the fundamental period of said speech signal.
2. A signal analyzer as defined in claim 1, wherein said means for determining the fundamental frequency of said difference signal comprises,
means for determining the frequency of occurrence of difference signal maxima above a prescribed threshold.
3. A signal analyzer as defined in claim 1, wherein said means for determining the fundamental frequency of said difference signal comprises, t
means for autocorrelating said difference signal for developing an autocorrelation signal representative of the periodic character of said difference signal, and
means for detecting the location of the peak value of said autocorrelation signal.
4. Apparatus for determining the fundamental period of a speech signal, which comprises,
means for developing an estimate of the present value of a speech signal on the basis of past values of said speech signal,
means for developing a signal representative of the difference between said signal estimate and the true present value of said speech signal, and
means for determining the fundamental frequency of said difference signal to develop a signal representative of the fundamental period of said speech signal.
5. Apparatus for determining the fundamental period of a speech signal, which comprises,
adaptive predictor means supplied with samples of said speech signal for developing an estimate of the momentary value of said speech signal from previously supplied samples, means for developing a prediction error signal from the difference between said predicted signal estimate and the corresponding momentary value of samples of said speech signal, means for identifying prediction error samples whose magnitudes are above a prescribed threshold, and means for utilizing the frequency of occurrence of said identified error samples as a measure of the fundamental period of said speech signal. 6. Apparatus for analyzing the character of a speech signal, which comprises, in combination,
predictor means supplied with samples of a speech signal for developing an estimate of the momentary value of said signal from previously supplied samples, means for developing prediction error signal samples from the difference between samples of said signal estimate and the corresponding momentary value of samples of said speech signal, means for identifying prediction error samples whose magnitudes are above a prescribed threshold, means for developing a first signal proportional to the mean-squared value of said speech samples, means for developing a second signal proportional to the mean-squared value of corresponding ones of said error samples, means for developing a signal proportional to the ratio of said first to said second mean-squared signals, means for utilizing the frequency of occurrence of said identified threshold error samples as a measure of the fundamental period of said speech signal, and means for utilizing said ratio of first and second mean-squared signals as a measure of the voicing characteristic of said speech signal. 7. Apparatus for analyzing the character of a speech signal as defined in claim 6, wherein,
values of said ratio of mean-squared signals equal to or greater than a prescribed threshold are used to classify said speech signal as voiced, and wherein values of said ratio of mean-squared signals less than said threshold are used to classify said speech signal as unvoiced. 8. In a pitch analysis arrangement for speech signals, the combination of,
means for developing a signal representative of the formant structure of an applied speech signal, means for removing said formant representative signal from said speech signal to produce a signal essentially devoid of all formant information, means for measuring the period of said formant devoid signal, and means for determining the voicing character of said speech signal on the basis of the power in said speech signal and the power in said formant devoid signal.
Claims (8)
1. A signal analyzer for determining the fundamental period of a speech signal, which comprises, adaptive predictor means supplied with samples of said speech signal for predicting the present value of each sample on the basis of a weighted summation of a number of prior sample values of said speech signal, means for subtracting said predicted speech value from the actual speech value to develop a difference signal, and means for determining the fundamental frequency of said difference signal as an indication of the fundamental period of said speech signal.
2. A signal analyzer as defined in claim 1, wherein said means for determining the fundamental frequency of said difference signal comprises, means for determining the frequency of occurrence of difference signal maxima above a prescribed threshold.
3. A signal analyzer as defined in claim 1, wherein said means for determining the fundamental frequency of said difference signal comprises, means for autocorrelating said difference signal for developing an autocorrelation signal representative of the periodic character of said difference signal, and means for detecting the location of the peak value of said autocorrelation signal.
4. Apparatus for determining the fundamental period of a speech signal, which comprises, means for developing an estimate of the present value of a speech signal on the basis of past values of said speech signal, means for developing a signal representative of the difference between said signal estimate and the true present value of said speech signal, and means for determining the fundamental frequency of said difference signal to develop a signal representative of the fundamental period of said speech signal.
5. Apparatus for determining the fundamental period of a speech signal, which comprises, adaptive predictor means supplied with samples of said speech signal for developing an estimate of the momentary value of said speech signal from previously supplied samples, means for developing a prediction error signal from the difference between said predicted signal estimate and the corresponding momentary value of samples of said speech signal, means for identifying prediction error samples whose magnitudes are above a prescribed threshold, and means for utilizing the frequency of occurrence of said identified error samples as a measure of the fundamental period of said speech signal.
6. Apparatus for analyzing the character of a speech signal, which comprises, in combination, predictor means supplied with samples of a speech signal for developing an estimate of the momentary value of said signal from previously supplied samples, means for developing prediction error signal samples from the difference between samples of said signal estimate and the corresponding momentary value of samples of said speech signal, means for identifying prediction error samples whose magnitudes are above a prescribed threshold, means for developing a first signal proportional to the mean-squared value of said speech samples, means for developing a second signal proportional to the mean-squared value of corresponding ones of said error samples, means for developing a signal proportional to the ratio of said first to said second mean-squared signals, means for utilizing the frequency of occurrence of said identified threshold error samples as a measure of the fundamental period of said speech signal, and means for utilizing said ratio of first and second mean-squared signals as a measure of the voicing characteristic of said speech signal.
7. Apparatus for analyzing the character of a speech signal as defined in claim 6, wherein, values of said ratio of mean-squared signals equal to or greater than a prescribed threshold are used to classify said speech signal as voiced, and wherein values of said ratio of mean-squared signals less than said threshold are used to classify said speech signal as unvoiced.
8. In a pitch analysis arrangement for speech signals, the combination of, means for developing a signal representative of the formant structure of an applied speech signal, means for removing said formant representative signal from said speech signal to produce a signal essentially devoid of all formant information, means for measuring tHe period of said formant devoid signal, and means for determining the voicing character of said speech signal on the basis of the power in said speech signal and the power in said formant devoid signal.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16117371A | 1971-07-09 | 1971-07-09 |
Publications (1)
Publication Number | Publication Date |
---|---|
US3740476A true US3740476A (en) | 1973-06-19 |
Family
ID=22580131
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US00161173A Expired - Lifetime US3740476A (en) | 1971-07-09 | 1971-07-09 | Speech signal pitch detector using prediction error data |
Country Status (6)
Country | Link |
---|---|
US (1) | US3740476A (en) |
JP (2) | JPS5524118B1 (en) |
CA (1) | CA967285A (en) |
DE (1) | DE2233872C2 (en) |
FR (1) | FR2145501B1 (en) |
NL (1) | NL7209311A (en) |
Cited By (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3979557A (en) * | 1974-07-03 | 1976-09-07 | International Telephone And Telegraph Corporation | Speech processor system for pitch period extraction using prediction filters |
US4038495A (en) * | 1975-11-14 | 1977-07-26 | Rockwell International Corporation | Speech analyzer/synthesizer using recursive filters |
US4070709A (en) * | 1976-10-13 | 1978-01-24 | The United States Of America As Represented By The Secretary Of The Air Force | Piecewise linear predictive coding system |
US4074069A (en) * | 1975-06-18 | 1978-02-14 | Nippon Telegraph & Telephone Public Corporation | Method and apparatus for judging voiced and unvoiced conditions of speech signal |
US4081605A (en) * | 1975-08-22 | 1978-03-28 | Nippon Telegraph And Telephone Public Corporation | Speech signal fundamental period extractor |
US4133976A (en) * | 1978-04-07 | 1979-01-09 | Bell Telephone Laboratories, Incorporated | Predictive speech signal coding with reduced noise effects |
US4164626A (en) * | 1978-05-05 | 1979-08-14 | Motorola, Inc. | Pitch detector and method thereof |
US4280387A (en) * | 1979-02-26 | 1981-07-28 | Norlin Music, Inc. | Frequency following circuit |
US4282406A (en) * | 1979-02-28 | 1981-08-04 | Kokusai Denshin Denwa Kabushiki Kaisha | Adaptive pitch detection system for voice signal |
US4383135A (en) * | 1980-01-23 | 1983-05-10 | Scott Instruments Corporation | Method and apparatus for speech recognition |
US4472832A (en) * | 1981-12-01 | 1984-09-18 | At&T Bell Laboratories | Digital speech coder |
US4561102A (en) * | 1982-09-20 | 1985-12-24 | At&T Bell Laboratories | Pitch detector for speech analysis |
US4653098A (en) * | 1982-02-15 | 1987-03-24 | Hitachi, Ltd. | Method and apparatus for extracting speech pitch |
USRE32580E (en) * | 1981-12-01 | 1988-01-19 | American Telephone And Telegraph Company, At&T Bell Laboratories | Digital speech coder |
US4827517A (en) * | 1985-12-26 | 1989-05-02 | American Telephone And Telegraph Company, At&T Bell Laboratories | Digital speech processor using arbitrary excitation coding |
US4879748A (en) * | 1985-08-28 | 1989-11-07 | American Telephone And Telegraph Company | Parallel processing pitch detector |
US5010574A (en) * | 1989-06-13 | 1991-04-23 | At&T Bell Laboratories | Vector quantizer search arrangement |
USRE34247E (en) * | 1985-12-26 | 1993-05-11 | At&T Bell Laboratories | Digital speech processor using arbitrary excitation coding |
US5233659A (en) * | 1991-01-14 | 1993-08-03 | Telefonaktiebolaget L M Ericsson | Method of quantizing line spectral frequencies when calculating filter parameters in a speech coder |
US5353372A (en) * | 1992-01-27 | 1994-10-04 | The Board Of Trustees Of The Leland Stanford Junior University | Accurate pitch measurement and tracking system and method |
US5471527A (en) * | 1993-12-02 | 1995-11-28 | Dsc Communications Corporation | Voice enhancement system and method |
US5586126A (en) * | 1993-12-30 | 1996-12-17 | Yoder; John | Sample amplitude error detection and correction apparatus and method for use with a low information content signal |
US5657358A (en) * | 1985-03-20 | 1997-08-12 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or plurality of RF channels |
US5717819A (en) * | 1995-04-28 | 1998-02-10 | Motorola, Inc. | Methods and apparatus for encoding/decoding speech signals at low bit rates |
US5852604A (en) * | 1993-09-30 | 1998-12-22 | Interdigital Technology Corporation | Modularly clustered radiotelephone system |
US5937376A (en) * | 1995-04-12 | 1999-08-10 | Telefonaktiebolaget Lm Ericsson | Method of coding an excitation pulse parameter sequence |
US6140568A (en) * | 1997-11-06 | 2000-10-31 | Innovative Music Systems, Inc. | System and method for automatically detecting a set of fundamental frequencies simultaneously present in an audio signal |
US20030088401A1 (en) * | 2001-10-26 | 2003-05-08 | Terez Dmitry Edward | Methods and apparatus for pitch determination |
US20050273323A1 (en) * | 2004-06-03 | 2005-12-08 | Nintendo Co., Ltd. | Command processing apparatus |
US20060178876A1 (en) * | 2003-03-26 | 2006-08-10 | Kabushiki Kaisha Kenwood | Speech signal compression device speech signal compression method and program |
US20110213614A1 (en) * | 2008-09-19 | 2011-09-01 | Newsouth Innovations Pty Limited | Method of analysing an audio signal |
US11443761B2 (en) | 2018-09-01 | 2022-09-13 | Indian Institute Of Technology Bombay | Real-time pitch tracking by detection of glottal excitation epochs in speech signal using Hilbert envelope |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE2649259C2 (en) * | 1976-10-29 | 1983-06-09 | Felten & Guilleaume Fernmeldeanlagen GmbH, 8500 Nürnberg | Method for the automatic detection of disturbed telephone speech |
JPS5922602U (en) * | 1982-08-04 | 1984-02-13 | 長山 勉 | Hair washing stand for infants, etc. in the bathroom |
JPS6050901U (en) * | 1983-09-16 | 1985-04-10 | ▲いざさ▼ 秀之 | Air mattress for infants |
FR2670313A1 (en) * | 1990-12-11 | 1992-06-12 | Thomson Csf | METHOD AND DEVICE FOR EVALUATING THE PERIODICITY AND VOICE SIGNAL VOICE IN VOCODERS AT VERY LOW SPEED. |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US2732424A (en) * | 1956-01-24 | oliver | ||
US3026375A (en) * | 1958-05-09 | 1962-03-20 | Bell Telephone Labor Inc | Transmission of quantized signals |
US3405237A (en) * | 1965-06-01 | 1968-10-08 | Bell Telephone Labor Inc | Apparatus for determining the periodicity and aperiodicity of a complex wave |
US3420955A (en) * | 1965-11-19 | 1969-01-07 | Bell Telephone Labor Inc | Automatic peak selector |
US3437757A (en) * | 1966-06-15 | 1969-04-08 | Bell Telephone Labor Inc | Speech analysis system |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DD49355A (en) * | ||||
CA844193A (en) * | 1970-06-09 | Western Electric Company, Incorporated | Predictive coding of speech signals | |
US2221523A (en) * | 1938-03-17 | 1940-11-12 | Ora L Railsback | Pitch determining apparatus |
US2927969A (en) * | 1954-10-20 | 1960-03-08 | Bell Telephone Labor Inc | Determination of pitch frequency of complex wave |
US2908761A (en) * | 1954-10-20 | 1959-10-13 | Bell Telephone Labor Inc | Voice pitch determination |
DE1572520A1 (en) * | 1967-06-08 | 1970-02-19 | Telefunken Patent | Method for recognizing speech sounds |
GB1180288A (en) * | 1967-06-23 | 1970-02-04 | Standard Telephones Cables Ltd | Analysing Complex Signal Waveforms |
US3631520A (en) * | 1968-08-19 | 1971-12-28 | Bell Telephone Labor Inc | Predictive coding of speech signals |
DE2062589C3 (en) * | 1970-12-18 | 1981-03-12 | Siemens AG, 1000 Berlin und 8000 München | Method for determining the fundamental frequency of an at least temporarily periodic signal |
-
1971
- 1971-07-09 US US00161173A patent/US3740476A/en not_active Expired - Lifetime
-
1972
- 1972-01-27 CA CA133,301A patent/CA967285A/en not_active Expired
- 1972-07-03 NL NL7209311A patent/NL7209311A/xx not_active Application Discontinuation
- 1972-07-06 FR FR7224483A patent/FR2145501B1/fr not_active Expired
- 1972-07-08 JP JP6793072A patent/JPS5524118B1/ja active Pending
- 1972-07-10 DE DE2233872A patent/DE2233872C2/en not_active Expired
-
1981
- 1981-06-17 JP JP56093676A patent/JPS5774800A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US2732424A (en) * | 1956-01-24 | oliver | ||
US3026375A (en) * | 1958-05-09 | 1962-03-20 | Bell Telephone Labor Inc | Transmission of quantized signals |
US3405237A (en) * | 1965-06-01 | 1968-10-08 | Bell Telephone Labor Inc | Apparatus for determining the periodicity and aperiodicity of a complex wave |
US3420955A (en) * | 1965-11-19 | 1969-01-07 | Bell Telephone Labor Inc | Automatic peak selector |
US3437757A (en) * | 1966-06-15 | 1969-04-08 | Bell Telephone Labor Inc | Speech analysis system |
Cited By (53)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3979557A (en) * | 1974-07-03 | 1976-09-07 | International Telephone And Telegraph Corporation | Speech processor system for pitch period extraction using prediction filters |
US4074069A (en) * | 1975-06-18 | 1978-02-14 | Nippon Telegraph & Telephone Public Corporation | Method and apparatus for judging voiced and unvoiced conditions of speech signal |
US4081605A (en) * | 1975-08-22 | 1978-03-28 | Nippon Telegraph And Telephone Public Corporation | Speech signal fundamental period extractor |
US4038495A (en) * | 1975-11-14 | 1977-07-26 | Rockwell International Corporation | Speech analyzer/synthesizer using recursive filters |
US4070709A (en) * | 1976-10-13 | 1978-01-24 | The United States Of America As Represented By The Secretary Of The Air Force | Piecewise linear predictive coding system |
WO1979000901A1 (en) * | 1978-04-07 | 1979-11-15 | Western Electric Co | Predictive speech signal coding with reduced noise effects |
US4133976A (en) * | 1978-04-07 | 1979-01-09 | Bell Telephone Laboratories, Incorporated | Predictive speech signal coding with reduced noise effects |
US4164626A (en) * | 1978-05-05 | 1979-08-14 | Motorola, Inc. | Pitch detector and method thereof |
US4280387A (en) * | 1979-02-26 | 1981-07-28 | Norlin Music, Inc. | Frequency following circuit |
US4282406A (en) * | 1979-02-28 | 1981-08-04 | Kokusai Denshin Denwa Kabushiki Kaisha | Adaptive pitch detection system for voice signal |
US4383135A (en) * | 1980-01-23 | 1983-05-10 | Scott Instruments Corporation | Method and apparatus for speech recognition |
US4472832A (en) * | 1981-12-01 | 1984-09-18 | At&T Bell Laboratories | Digital speech coder |
USRE32580E (en) * | 1981-12-01 | 1988-01-19 | American Telephone And Telegraph Company, At&T Bell Laboratories | Digital speech coder |
US4653098A (en) * | 1982-02-15 | 1987-03-24 | Hitachi, Ltd. | Method and apparatus for extracting speech pitch |
US4561102A (en) * | 1982-09-20 | 1985-12-24 | At&T Bell Laboratories | Pitch detector for speech analysis |
US6842440B2 (en) | 1985-03-20 | 2005-01-11 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US6954470B2 (en) | 1985-03-20 | 2005-10-11 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US20050025101A1 (en) * | 1985-03-20 | 2005-02-03 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US20050025094A1 (en) * | 1985-03-20 | 2005-02-03 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US20050018636A1 (en) * | 1985-03-20 | 2005-01-27 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US6771667B2 (en) | 1985-03-20 | 2004-08-03 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US6393002B1 (en) | 1985-03-20 | 2002-05-21 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US6282180B1 (en) | 1985-03-20 | 2001-08-28 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US5657358A (en) * | 1985-03-20 | 1997-08-12 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or plurality of RF channels |
US5687194A (en) * | 1985-03-20 | 1997-11-11 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US6014374A (en) * | 1985-03-20 | 2000-01-11 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US5734678A (en) * | 1985-03-20 | 1998-03-31 | Interdigital Technology Corporation | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
US4879748A (en) * | 1985-08-28 | 1989-11-07 | American Telephone And Telegraph Company | Parallel processing pitch detector |
US4827517A (en) * | 1985-12-26 | 1989-05-02 | American Telephone And Telegraph Company, At&T Bell Laboratories | Digital speech processor using arbitrary excitation coding |
USRE34247E (en) * | 1985-12-26 | 1993-05-11 | At&T Bell Laboratories | Digital speech processor using arbitrary excitation coding |
US5010574A (en) * | 1989-06-13 | 1991-04-23 | At&T Bell Laboratories | Vector quantizer search arrangement |
US5233659A (en) * | 1991-01-14 | 1993-08-03 | Telefonaktiebolaget L M Ericsson | Method of quantizing line spectral frequencies when calculating filter parameters in a speech coder |
US5353372A (en) * | 1992-01-27 | 1994-10-04 | The Board Of Trustees Of The Leland Stanford Junior University | Accurate pitch measurement and tracking system and method |
US5852604A (en) * | 1993-09-30 | 1998-12-22 | Interdigital Technology Corporation | Modularly clustered radiotelephone system |
US6496488B1 (en) | 1993-09-30 | 2002-12-17 | Interdigital Technology Corporation | Modularly clustered radiotelephone system |
US20030076802A1 (en) * | 1993-09-30 | 2003-04-24 | Interdigital Technology Corporation | Modularly clustered radiotelephone system |
US6208630B1 (en) | 1993-09-30 | 2001-03-27 | Interdigital Technology Corporation | Modulary clustered radiotelephone system |
US7245596B2 (en) | 1993-09-30 | 2007-07-17 | Interdigital Technology Corporation | Modularly clustered radiotelephone system |
US20070274258A1 (en) * | 1993-09-30 | 2007-11-29 | Interdigital Technology Corporation | Radiotelephone apparatus and method |
US5471527A (en) * | 1993-12-02 | 1995-11-28 | Dsc Communications Corporation | Voice enhancement system and method |
US5586126A (en) * | 1993-12-30 | 1996-12-17 | Yoder; John | Sample amplitude error detection and correction apparatus and method for use with a low information content signal |
US6064956A (en) * | 1995-04-12 | 2000-05-16 | Telefonaktiebolaget Lm Ericsson | Method to determine the excitation pulse positions within a speech frame |
US5937376A (en) * | 1995-04-12 | 1999-08-10 | Telefonaktiebolaget Lm Ericsson | Method of coding an excitation pulse parameter sequence |
US5717819A (en) * | 1995-04-28 | 1998-02-10 | Motorola, Inc. | Methods and apparatus for encoding/decoding speech signals at low bit rates |
US6140568A (en) * | 1997-11-06 | 2000-10-31 | Innovative Music Systems, Inc. | System and method for automatically detecting a set of fundamental frequencies simultaneously present in an audio signal |
US20030088401A1 (en) * | 2001-10-26 | 2003-05-08 | Terez Dmitry Edward | Methods and apparatus for pitch determination |
US7124075B2 (en) | 2001-10-26 | 2006-10-17 | Dmitry Edward Terez | Methods and apparatus for pitch determination |
US20060178876A1 (en) * | 2003-03-26 | 2006-08-10 | Kabushiki Kaisha Kenwood | Speech signal compression device speech signal compression method and program |
US20050273323A1 (en) * | 2004-06-03 | 2005-12-08 | Nintendo Co., Ltd. | Command processing apparatus |
US8447605B2 (en) * | 2004-06-03 | 2013-05-21 | Nintendo Co., Ltd. | Input voice command recognition processing apparatus |
US20110213614A1 (en) * | 2008-09-19 | 2011-09-01 | Newsouth Innovations Pty Limited | Method of analysing an audio signal |
US8990081B2 (en) * | 2008-09-19 | 2015-03-24 | Newsouth Innovations Pty Limited | Method of analysing an audio signal |
US11443761B2 (en) | 2018-09-01 | 2022-09-13 | Indian Institute Of Technology Bombay | Real-time pitch tracking by detection of glottal excitation epochs in speech signal using Hilbert envelope |
Also Published As
Publication number | Publication date |
---|---|
FR2145501B1 (en) | 1976-08-13 |
NL7209311A (en) | 1973-01-11 |
CA967285A (en) | 1975-05-06 |
FR2145501A1 (en) | 1973-02-23 |
DE2233872C2 (en) | 1983-11-03 |
DE2233872A1 (en) | 1973-01-18 |
JPS5524118B1 (en) | 1980-06-26 |
JPS5774800A (en) | 1982-05-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US3740476A (en) | Speech signal pitch detector using prediction error data | |
EP0335521B1 (en) | Voice activity detection | |
US5276765A (en) | Voice activity detection | |
US4074069A (en) | Method and apparatus for judging voiced and unvoiced conditions of speech signal | |
US7155386B2 (en) | Adaptive correlation window for open-loop pitch | |
US4672669A (en) | Voice activity detection process and means for implementing said process | |
Tanyer et al. | Voice activity detection in nonstationary noise | |
KR100363309B1 (en) | Voice Activity Detector | |
US5548680A (en) | Method and device for speech signal pitch period estimation and classification in digital speech coders | |
US3649765A (en) | Speech analyzer-synthesizer system employing improved formant extractor | |
Rabiner et al. | Application of an LPC distance measure to the voiced-unvoiced-silence detection problem | |
GB1533337A (en) | Speech analysis and synthesis system | |
EP0074822A1 (en) | Recognition of speech or speech-like sounds | |
US4665548A (en) | Speech analysis syllabic segmenter | |
US4864307A (en) | Method and device for the automatic recognition of targets from "Doppler" ec | |
US5579432A (en) | Discriminating between stationary and non-stationary signals | |
US4219695A (en) | Noise estimation system for use in speech analysis | |
US5046100A (en) | Adaptive multivariate estimating apparatus | |
US4972490A (en) | Distance measurement control of a multiple detector system | |
McKenna | Automatic glottal closed-phase location and analysis by Kalman filtering | |
US3420955A (en) | Automatic peak selector | |
Pencak et al. | The NP speech activity detection algorithm | |
EP0310636B1 (en) | Distance measurement control of a multiple detector system | |
AU1222688A (en) | An adaptive multivariate estimating apparatus | |
US3493684A (en) | Vocoder employing composite spectrum-channel and pitch analyzer |