US3190960A - Speech bandwidth compression systems - Google Patents

Speech bandwidth compression systems Download PDF

Info

Publication number
US3190960A
US3190960A US162045A US16204561A US3190960A US 3190960 A US3190960 A US 3190960A US 162045 A US162045 A US 162045A US 16204561 A US16204561 A US 16204561A US 3190960 A US3190960 A US 3190960A
Authority
US
United States
Prior art keywords
output
formant
signal
input
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US162045A
Inventor
Lawrence Walter
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National Research Development Corp UK
Original Assignee
National Research Development Corp UK
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National Research Development Corp UK filed Critical National Research Development Corp UK
Application granted granted Critical
Publication of US3190960A publication Critical patent/US3190960A/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)

Abstract

936,124. Vocoders. MINISTER OF AVIATION. Dec. 18, 1961 [Dec. 30, 1960], No. 44730/60. Class 40 (4). In a formant tracker for an analysis synthesis telephone transmission system an adjustable network operates on the incoming speech and a signal is derived which is rectified and fed back to adjust the network continuously so that the network output is the original speech operated on by substantially the inverse of at least some of the formant forming filters of the vocal tract, whereby the signal derived is constrained to be representative of the corresponding formant parameter of the input speech signal. Fig. 1 shows a block diagram of a preferred embodiment for deriving tracking signals at O1 and 02, corresponding to the two lower formants of the speech input at I1, the tracking signals being amplified and fed back via I2 and I3 to control the characteristics of the network. It is shown mathematically that, assuming the speech signal can be represented as where p is the Heaviside operator, w 1 and w 2 are the two lower formant angular frequencies, and P 1 (t) is a function dependent on the excitation and other characteristics of the vocal tract, the output of adder S1, Fig. 1, will be (1+ap<SP>2</SP>)S(t), where a is the signal applied via input 12 to multiplier M1, (if a = 1/x 1 <SP>2</SP> this x 1 <SP>2</SP> function (1+ap<SP>2</SP>) is the inverse of the function which represents the effect of the formant angular frequency w 1 on the original speech and thus (1+ap<SP>2</SP>)S(t) will represent the speech signal with the component due to the formant w 1 removed. Similarly, the output of adder S2[= (1+ap<SP>2</SP>)] will represent the speech signal with the component due to formant w 2 removed. The output of adder S3 will be (1+abp<SP>4</SP>+ap<SP>2</SP>+ bp<SP>2</SP>)S(t) = (1+ap<SP>2</SP>)(1+bp<SP>2</SP>)S(t) and if a and b are correct (i.e. = 1/w 1 <SP>2</SP> and 1/w 2 <SP>2</SP> respectively) will contain no components due to either formants w<SP>1</SP> 1 or w 2 , but if one of the formants say w 1 , is incorrectly tracked there will be an output from this adder S3 which will be in phase with the output from one of the adders, i.e. S2, and these signals applied to the inputs of phase-sensitive detector P2 will produce an output dependent in magnitude and direction on the difference 1 between a and +- which output, after integraw 1 <SP>2</SP> tion and amplification, is applied to input I2 to adjust the network to eliminate the formant corresponding to the angular frequency w 1 from the outputs of adders S3 and S1. A circuit is described with respect to Fig. 2 (not shown), incorporating compensation for the limitation of frequency response of practical differentiating networks.

Description

June 22, 1965 w. LAWRENCE SPEECH BANDWIDTH COMPRESSION SYSTEMS 5 Sheets-Sheet 1 Filed Dec. 26, 1961 WHLTEK LAu/R'A/Cl Inventor @M By M 7 W .A tomeys' June 22, 1965 w. LAWRENCE 3,190,960
SPEECH BANDWIDTH COMPRESSION SYSTEMS Filed Dec. 26, 1961 3 Sheets-Sheet 2 CATFDDE FIG 2 CHHODE FOLLOI/ER Cl C: -{E/ R2 R Al W LTER Z. A WREA/(.
Inventor IZIW/I A ar 0mm W. LAWRENCE SPEECH BANDWIDTH COMPRESSION SYSTEMS,
June 22, 1965 3 Sheets-Sheet 3 Filed Dec. 26. 1961 52 5823 u Emzum u s: m
MMY 0E ln/fll- TER L A RE W Inventor By gwv/ W attorney;
United States Patent M 3,190,960 SPEECH BANDWIDTH COWRESSION SYSTEMS Walter Lawrence, Bournemouth, England, assignor to National Research Development Corporation, London, England Filed Dec. 26, 1961, Ser. No. 162,045 Claims priority, application Great Britain, Dec. 30, 1960, 44,730/ 60 7 Claims. (Cl. 179-1) The present invention relates to speech analysing systems. Such analysing systems may be used in speech bandwidth compression apparatus for the transmission of speech over radio links and the like.
The present invention is based on a theory of speech formation in which it is postulated that the sounds produced in speech are synthesised basically from the production by the vocal chords of a fundamental pitch frequency. This pitch frequency is produced by the opening and closing of the vocal chords, each such opening resulting in a pitch pulse. The pitch pulses emitted from the vocal chords pass through the vocal tract and are subjected there to the action of formant-forming filters in the form of resonators, the resonant frequencies of which are varied as the sounds are articulated. The action of these formant resonators on the pitch pulses may be simulated by passing pitch pulses through a series of resonators or damped oscillators, the oscillation frequencies of which may be varied to produce the required speech. The variable frequencies at which these resonators, or damped oscillators, are operated are called the formant frequencies. In practice, a reasonable likeness to speech may be synthesised by the action on pitch pulses of inter alia a number of formant resonators, or damped oscillators, in series.
An object of the present invention is to produce a formant tracker capable of extracting from speech, formant parameter signals indicative of formant frequencies present in the speech.
It is a further object of the present invention to provide a formant tracker which includes an adjustable network for operating on an input speech signal and tracking means for extracting from a primary output of the network rectified derivatives of the speech signal and for feeding the rectified derivatives back to adjust the network continuously so that the primary output is the speech signal operated upon by substantially the inverse of at least some of the formant-forming filters of the vocal tract, whereby the rectified derivatives are constrained to be representative of the corresponding formant parameters of the input speech signal.
In this specification, the term formant parameter is used in the general sense of a parameter which is related in some known manner to a corresponding formant frequency.
A still further object of the invention is to provide a formant tracker for tracking the frequencies of formants formed by formant-forming resonators in a vocal tract during the production of an input speech signal, including an adjustable network means for simulating the inverse of a number n of the said formant-forming resonators over discrete ranges of frequency and having a primary output representative of the said input speech signal operated upon by the electrical analogue of substantially the inverse of the said It formant-forming resonators and having n secondary outputs, each secondary output being representative of the said input speech signal operated upon by the electrical analogue of substantially the inverse of a different number (n1) of the said n formant-forming resonators, n phase-sensitive demodulators connected in parallel to the said primary output and each connected to a different one of the said 11 sec- 3,190,966 Patented June 22, 1965 ondary outputs, and means for feeding back output signals from the said phase-sensitive demodulators to adjust the electrical network so that the said output signals are representative of the formant frequencies of the said n formant-forming resonators.
An embodiment of the present invention will now be described, by way of example, with reference to the accompanying drawings in which:
FIGURE 1 is a block schematic diagram of a formant tracker illustrating the basic principles upon which the embodiment of the invention is founded and FIGURE 2 is a more detailed circuit diagram of the formant tracker shown in FIGURE 1.
Before the drawings are referred to, it must first be explained that the theory underlying the embodiment is based on the assumption that speech is formed by the action on each pitch pulse of a number of serial operations. For simplicity in explaining the invention, only the action of formant-forming filters or resonators on the pitch pulses will be hereinafter considered in detail. The pro duction of the speech signal may be expressed approximately by the operational equation:
where p is the Heaviside operator, a derivative operator representing d/dt. Where S(t) is a function representing the final spech signal, F(t) is a function representing the pitch pulse, H( p) is an operational function representing the action of all the formant resonators in series and N( p) is an operational function of all the other factors operating in the vocal tract.
It has been found that a good approximation to the action of the formant resonators may be expressed in the where h (p):1/(1+p /w 11 (1)) being a transfer function representing the action of one formant resonator at a resonant angular frequency to, which varies with time.
It is found that a reasonable approximation to the function H( p) may be obtained by taking the first two lower frequency formants represented in the series h,( p). In that case,
Where w, and 0 represent the variable frequencies of the first two formants.
Clearly, if the first three formants are taken, then and so on if more formants are taken.
From Equations 1 and 3:
where P 0) (P) a i(P) Similarly, if three formants are taken, then from Equations 1 and 4 where P 0)=F(t).N(p).l'I (h (p)), and so on if more formants are taken.
In the following description, the function S(t) is used to denote the electrical equivalent of the acoustic signal.
The embodiment about to be described with reference to FIGURES 1 and 2 is a formant tracker which extracts from an electrical signal S(t) two formant parameter signals which are related to the two formant angular frequencies al and m of Equation 5.
FIGURE 1 is a block schematic diagram illustrating the principles on which a circuit network for tracking two formant parameters m and w; is based. FIGURE 1 S(t) is subjected to a double differentiation process.
- 3 shows an input terminal 11 to which is applied a speech signal S(t). This speech signal is passed from the input terminal to a double diiferentiator D1 in which the signal y this means the signal S(t) is reversed in phase and given an amplitude weighting which is proportional to the square of the frequency. 7 That is to say that in the doubledifferentiator, the speech signal S(t) is subjected to a.
transfer function proportional to p so that the output of the double difierentiator D1 is a signal proportional to p .S(t).
The output of the double diiferentiator D1 is applied to one input of a multiplier M1 in which it is multiplied by a voltage a applied to the other input of the multiplier from an input terminal 12. It follows that the output of the multiplier M1 is a signal proportional to a.p S (t). This is added in an adder S1 to the original signal so that the output of the adder S1 is a signal proportional to (l+a.p ).S(t). The output of the double differentiator D1 is also applied to one input of a multiplier M2 to the other input of which a voltage bis applied via an input terminal 13. The output of the multiplier M2 is av signal proportional to b.p .S(t). This is added in an adder S2 to the original signal so that the output of the adder S2 is a signal proportional to (l+b.p ).S(t).
The output of the double differentiator D1 is also applied to a second double differentiator D2, similar to the double differentiator Di, so that the output of the double diiferentiator D2 is a signal proportional to p 3 (t). This signal is applied to one input of a multi plier M3, to the other input of which is applied the voltage a via the terminal 12. The output of the multiplier M3 is, therefore, a signal proportional to a.p .S(t). This signal is applied to one input of a further multiplier M4 to the other input of which is applied the voltage b via the input I3. The output of the multiplier M4 is, therefore, a signal proportional to a.b.p .S (t). The out puts of the multipliers M1, M2 and M4 and the original signal are all applied to a four-input adder S3 so that the output of this adder is proportional to )P -p l 7 It will be seen from Equation 5, that if the assumptions made therein are correct, and a in Expression (7) is 'made equal to l/w and b is made equal to 1/w then the speech signal S(t) will have been operated upon by the inverse of the formant-forming filters in the vocal tract which have resonant frequencies of o and 40 respectively and the output of the adder S3 (referred to as the primary output of the network) will be asignal represented by the function P of Equation above. The correct values for the voltages a and b are obtained by the following means. The -outputs of theadders S1 (referred to as a secondary output of the network) and S3 are applied to separate inputs of a phase-sensitive demodulator Pi, the output of which is applied to an integrator amplifier N1. The output voltage at an output terminal 01 of the integrator amplifier N1 is applied tothe input terminal I3 and, as will be explained hereinafter, is then a voltage which is a function of o Similarly, the outi which are now being considered. The output of the adder S1 is a signal proportional to (l+ap )S(t). Now the component a.p .S(t) of this signal is in antiphase with the component 8(1) and has an amplitude which is proportional to a and to the inverse of the square of the angular frequency 01 it follows that at some value of a(:l/w the component of the signal S(t) due to one formant frequency, f say, will be eliminated from the output of the adder S1. This value of a will have no inverse linear relationship to the square of the formant frequency. Also, when this value of a is obtained, the only components remaining in the output signals of the adder S1 are those due to the'other formant frequency, f say, and the function P 0). Under the same conditions, the output of the adder S3 .will contain a component at the formant frequency f This component at the formant frequency f will be in phase or antiphase with the corresponding component at the output of the adder S1 according to whether b is less than or greater than '1/w respectively. Furthermore, it will have an amplitude which will increase as b 1/ (0 increases Which will be zero when b=1/w In the phase-sensitive demodulator, the components at the formant frequency f are multiplied together. It follows that, the voltage at the output terminal '01 of the integrator amplifier N1 will vary about some datum value according to whether b is greater than or less than 1/w Similarly, the voltage at the output terminal 02 of the integrator amplifier N2 will vary about some datum value according to whether'a is greater than or less than 1/ L0 If now, the output terminal 01 is connected to the input terminal 13 and the output terminal 02 is connected to the input terminal 12 through amplifiers A1 and A2 respectively (as shown in FIGURE 1), then the system will act to constrain b to be equal to 1/w and a to be equal to 10: The values of the voltages a and b, as presented at outputs O3 and 04 respectively, will then be representative of the required two formant parameters.
The block schematic circuit network described with reference to FIGURE 1 is, unfortunately, difficult to realise in practice because pure differentiators cannot be simulated by circuits. Consequently circuits have to be used which have transfer functions similar to the function puts of the adders S2 (also referred to as a secondary output of the network) and S3 are applied to separate inputs of a phase-sensitive demodulator P2, the output of which is applied to an integrator amplifier N2. The output voltage at an output terminal 02 of the integrator amplifier N2 is applied to the input terminal 12 and is then a voltage which is a function of m The function performed by the phase-sensitive demodulator P1 will now be explained. In the production of the speech signal S(t) each formant resonator was excited into damped oscillation on the occurrence of a pitch pulse F(t). 'Thus, the resultant speech signal S(t) consists of the original pitch pulse overlaid with, inter alia, damped oscillations at the formant frequencies, the main two'of of differentiation and then due allowance made for their imperfections. Thus, for example, a practical differentiating circuit does not provide an output in which the amplitude weighting (that is to say, the amplification factor) is proportional to frequency no matter how high the frequency, but has an upper frequency limit, or cutoff frequency, above which the amplification factor increases less with frequency and eventually decreases with further increases of frequency. Signal paths containing more than one differentiation circuit will, therefore, suf fer from a greater effective attenuation. of the signal at the higher frequencies than signal paths containing fewer differentiating circuits. This effect may be offset to some extent by introducing attenuation into those signal paths having fewer than the maximum number of differentiating circuits in any one signal path, to simulate the attenuafentiator comprising essentially a capacitor C2, a high gain phase-inverting amplifier A1 and a feedback resistor R2. The feedback resistor R2 is shunted by a smallvalued capacitor C3 which accentuates slightly the fall off in amplification discussed above. As hereinbefore stated, the amplifier A1 is a phase-inverting amplifier (that is to say, its output is in anti-phase to its input) so that the output of the double dilferentiator D1 is in antiphase to that which one would expect from an ideal double difierentiator. However, this is compensated for in the remainder of the circuit.
The transfer function of the double differentiator D1 may be written as (p/w (l+p/w where w is the angular frequency, corresponding to say 1 kc./s., at which the amplification is unity and w is the angular frequency, corresponding to say 3 kc./s., at which the output is attenuated by 3 db in power.
The output from the double differentiator D1 is applied through a cathode follower 2 to a second double differentiator D2 having the same transfer function as that of the double difi'erentiator D1. It follows that the total transfer function involved in this signal path is The output of the double diiferentiator D1 is also passed, via the cathode follower 2, to a resistance/ capacity filter circuit F1 having a transfer function of This filter circuit is designated to have a substantially level frequency response at the lower frequencies but to attenuate the signal progressively at higher frequencies, the attenuation being of the order of 3 db in power at a frequency of 3 kc./ s. The transfer function of the double differentiator D1 and the filter F1 together is, therefore,
The output of the cathode follower 1 is applied to a filter F2, a cathode follower 3 and a filter F3 in series, each of these filters being identical to the filters F1, so that the transfer function of these two filters F1 and F2 together is 1/ (1+p/ w This transfer function will be referred to hereinafter by the symbol A. The outputs of the filters F3 and F1 and the double difierentiator D2 are applied to cathode followers 4, and 6 respectively. From the foregoing, it will be seen that the outputs of the cathode followers 4, 5 and 6 will be proportional to A.S(t), (A.p /w )S(t) and (A.p /w )S(t) respectively.
The cathode follower 5 is connected at its output to a multiplier M1 the other input to which is the input terminal 12 to which a voltage a is applied. The multiplier M1 is shown in this drawing as being a phase-inverting amplifier to which the output of the cathode follower 5 is applied through a resistor R3. The amplifier A2 is shown as having a feedback resistor R4 which is variable effectively to vary the gain of the amplifier under control of the voltage a applied to the terminal I2. This is a purely symbolic representation of the working of the multiplier which may be a pulsed attenuator multiplier of a type similar to that described in the book Electronic Analogue Computers, 2nd edition, by Korn and Korn (published by the McGraw Hill Book Company) as pages 269 and 270. However, preferably in this case the pulsing is done in the feedback path of the amplifier A2 so that the resistance R4 is effectively varied by the duty cycle of the pulsing which is controlled by the voltage a applied to the input 12. In order to reduce ripple on the output of the multiplier M1 due to the chopping frequency, a capacitor C4 shunts the resistor R4 and the amplifier A2 is followed by a ripple filter R5, C5. The output of the ripple filter is applied to a cathode follower 7. The output of the cathode follower '7 is a signal substantially proportional to (n.A.p /w ).S(t). The minus sign has now disappeared because the amplifier A2 is a phase-inverting amplifier.
The output of the cathode follower 5 is also connected to one input of a multiplier M2 having a similar construction to the multiplier M1. The other input to the multiplier M2 is the input terminal 13 to which is applied a voltage b. The output of the multiplier M2 is applied 6. to a cathode follower 8 at the output of which appears a signal substantially proportional to (b.A.p /w ).S(t).
A multiplier M3, a cathode follower 9, a multiplier M4 and a cathode follower 10 are connected in series to the output of the cathode follower 6. The multipliers M3 and M4 are similar in construction to the multiplier M1 and have their second inputs connected to the input terminals I2 and I3 respectively. The signal at the output of the cathode follower 10 is, therefore, substantially proportional to (a.b.A.p /w ).S(t).
The outputs (S and T of FIGURE 2) of the cathode followers 4 and 7 are connected to the input of an analogue adder S1 (in the form of a summing amplifier) through equal resistors R6 and R7 respectively. The adder S1 comprises a phase-inverting amplifier A3 having a feedback resistor R8. The resistor R8 is shunted by a capacitor C6 to reduce unwanted higher frequency components which might interfere with the action of a phase-sensitive demodulator P1 which receives the output of the adder S1. The output of the adder S1 is a secondary output signal proportional to (1+a.p /w )A.S(t) which is applied to one input of the phase-sensitive demodulator P1.
An adder S2, similar to the adder S1 has the outputs (S and U of FIGURE 2) of the cathode followers 4 and 8 connected to its input through equal resistors R9 and R10 respectively. The output of this adder is, therefore, a secondary output signal proportional to which is applied to one input of a phase-sensitive demodulator P2.
The outputs (S, T, U, V of FIGURE 2) of the cathode followers 4, 7, 8 and 10 are connected to the input of an adder S3 through equal resistors R11, R12, R13 and R14 respectively. The adder S3 is of similar construction to the adder S1. The output of the adder S3 is, therefore, a primary output signal proportional to which is equal to [(l-[-ap /w (1+b.p /w )]A.S(t). This signal is applied to the other inputs of the phasesensitive demodulators P1 and P2. The signals applied to the phase-sensitive demodulators P1 and P2 are, except for the changes in sign and the upper frequency limits imposed by the imperfections of the circuits, the same as those applied to the phase-sensitive demodulators P1 and P2 of FIGURE 1. The output of the phase-sensitive demodulator P1 is applied to the input of an integrator N1. This integrator consists of a phase-inverting amplifier A4 having a feedback capacitor C7, and a resistor R15 through which the integrator input is app ied to the amplifier A4. The output of the, integrator N1 is applied through its output terminal 01 and an amplifier (not shown) to the input terminal 13 so as to control the formant parameter voltage b. The voltage at the output terminal 01 will then be proportional to the required formant parameter e1 A further integrator N2, similar in construction to the integrator N1, receives the output of the phase-sensitive modulator P2. The output terminal 02 of the integrator N2 is fed back through an amplifier (not shown) to the input terminal 12 to control the formant parameter voltage a. The voltage at the output terminal 02 will then be proportional to. the required formant parameter m When the apparatus is initially set up to carry out formant tracking, preset controls (not shown) preferably in the multipliers M1, M2, M3 and M4 and the phasesensitive detectors P1 and P2 must be set up to ensure that the required formants are tracked by the apparatus. That is to say, the apparatus is so preset that the values of the formant parameter voltages a and I) lie in their expected ranges.
It will be realised by those versed in the art that the invention is not limited to the specific apparatus described with reference to FIGURE 1 or FIGURE 2 for operating upon the input speech signal S(t) with the various transfer functions. Thus, for example, the signal proportional to (l-i-qp (l+bp )S(t) obtained at the primary output of the adder S3 may be produced instead by operating on the output signal proportional to (1+ap )S(t) at the output of the adder S1 by apparatus, such as a double differentiator and a multiplier in series, having a transfer function bp and adding the resultant signal, in an adder, to the output of the adder S1.
Although the embodiment hereinbefore described with reference to FIGURES 1 and 2 is constructed to track two formant frequencies, clearly the invention is not limited,
to the tracking of only two formant frequencies: but may be applied to the tracking of any number of formant parameters. Any number, n, of formant frequencies may be tracked by operating on an input speech signal S(t).
by an adjustable network'having a primary output which is the input signal operated upon by a transfer function which is the inverse of all of the n formant-forming filters of the vocal tract and n secondary outputs each of which is the input signal operated upon by a transfer function which is the inverse of all except one (different in each case) of the n formant forming filters. Thus for example, by similar means to those already described with reference to FIGURES l and 2, functions similar to I claim:
1. A formant tracker including:
' (a) a first double ditferentiator, I
(b) a first multiplier connected to an output of the first double differentiator for multiplying the output signal of the first double differentiator by a first electrical signal, 7
(c) a first adder having inputs connected to the input of the first double diiferentiator and to the output of the first multiplier,
(d) a second multiplier connected to an output of the first double diiferentiator for multiplying the output signal of the first double differentiator by a second electrical signal,
(e) a second adder having inputs connected to the input of the first double ditferentiator and to the output of the second multiplier,
(f) a second double diiferentiator having its input connected to the output of the first double differentiator,
(g) a third multiplier connected to an output of the second double diiferentiator for multiplying the output signal of the second double dilferentiator by the said first electrical signal,
(h) a fourth multiplier connected to an output of the third multiplier for multiplying the output signal of the second multiplier by the said second electrical signal,
(i) a third adder connected to. the input of the first double dilferentiator and to the outputs of the said first, second and fourth multipliers,
outputs of the first and third adders,
(k) a first integrator connected to the output of the first phase-sensitive demodulator,
(l) a first amplifier connected to the output of the first integrator to produce the said second electrical signal,
(In) a second phase-sensitive demodulator connected to the outputs of the second and third adders,
(n) a second integrator connected to the output of the second phase-sensitive demodulator, and
(o) a second amplifier connected to the output of the second integrator to produce the said first electrical signal.
2. A formant tracker for tracking the frequencies of formants formed by formant-forming resonators in a vocal tract during the production of an input speech signal, including an adjustable-electrical network means having a primary output, means for extracting from the said primary output formant-parameter signals derived from the said input speech signal, and means for feeding back the said formant-parameter signals to adjust the said electrical network means continuously so that a signal at the primary output is the speech signal operated upon by the electrical analogue of substantially the inverse of at least some of the said formant-forming resonators.
3. A formant, tracker for tracking the frequencies of formants formed by formant-forming resonators in a vocal tract-during the production of an input speech signal, including an adjustable electrical network means for simulating the inverse of at least some of the said formant-forming resonators over discrete ranges of frequencies, the said electrical network means having a primary output, means for extracting from the said primary output formant-parameter signals derived from the input speech signals, and means for feeding back the said formant-parameter signals to adjust the said electrical network means continuously so that a signal at the primary output is the said input speech signal operated upon by the electrical analogue of substantially the inverse of at least some of the said formant-forming resonators.
4. A formant tracker for tracking the frequencies of formants formed by formant-forming resonators in a vocal tract during the production of an input speech signal, including an adjustable electrical network means for simulating the inverse of at least some of the said formant-forming resonators over discrete ranges of frequency, the 'said electrical network means having a primary output representative of the said input speech signal operated upon by the electrical analogue of substantially the inverse of the said at least some of the formant-forming resonators and having secondary outputs equal in number to the said at least some of the formant-forming resonators, each secondary output being representative of the input speech signal operated upon by the electrical analogue of the inverse of all except a corresponding one of the said at least some formant-forming resonators, a plurality of phase-sensitive demodulators all connected to the said primary output and each connected to a separate one of the said secondary outputs, and means for feeding back output signals from the phase-sensitive demodulators to adjust the electrical network means so that the said output signals are representative of the formant frequencies of the said at least some formant-forming resonators.
5. A formant tracker as claimed in claim 4 and wherein the said adjustable network includes a first operational means connected to an input of the said adiustable network for converting an input signal S(t) to substantially the form (1+ap ).S(t) at a'secondary output thereof, a second operational means connected to the said input for'convertirig the said input signal to substantially the form (l+bp ).S(t) at a secondary output thereof, where a and b are variable electrical'quantities and p is the Heaviside operator, and third operational means for providing a primary output from the said adjustable network which is of substantially the form (1+ap (1-|-bp ).S(t).
6. A' formant tracker as claimed in claim 5 including a first phase-sensitive demodulator, means for applying the said secondary output of the said first operational means and the said primary output to the first phasesensitive demodulator, means for feeding back the output of the first phase-sensitive demodulator to the said adjustable network to control the electrical quantity b, a second phase-sensitive demodulator, means for applying the said secondary output of the said second operational means and the said primary output to the second phasesensitive demodulator and means for feeding back the output of the second phase-sensitive demodulator to the said adjustable network to control the electrical quantity a.
'7. A formant tracker for tracking the frequencies of formants formed by formant-forming resonators in a vocal tract during the production of an input speech Signal, including an adjustable electrical network means for simulating the inverse of a number n of the said formant-forming resonators over discrete ranges of frequency, the said electrical network means having a pri- 20 mary output representative of the said input speech signal operated upon the electrical analogue of substantially the inverse of the said 12 formant-forming resonators and having it secondary outputs, each secondary output being representative of the said input speech signal operated upon by the electrical analogue of substantially the inverse of (IL-1) formant-forming resonators, n phasesensitive demodulators connected in parallel to the said primary output and each connected to a different one of the said It secondary outputs, and means for feeding back output signals from the said phase-sensitive demodulators for adjusting the electrical network means so that the said output signals are representative of the formantfrequencies of the said 11 formant-forming resonators.
References Cited by the Examiner UNITED STATES PATENTS 2,996,579 8/61 Slaymaker 1791 OTHER REFERENCES A Survey of Speech Bandwidth Compression Techniques, by S. I. Campanella: IRE Transactions on Audio, September-October 1958, pages 109-110.
ROBERT H. ROSE, Primary Examiner.

Claims (1)

  1. 2. A FORMANT TRACKER FOR TRACKING THE FREQUENCIES OF FORMANTS FORMED BY FORMANT-FORMING RESONATORS IN A VOCAL TRACK DURING THE PRODUCTION OF AN INPUT SPEECH SIGNAL, INCLUDING AN ADJUSTABLE ELECTRICAL NETWORK MEANS HAVING A PRIMARY OUTPUT, MEANS FOR EXTRACTING FROM THE SAID PRIMARY OUTPUT FORMANT-PARAMETER SIGNALS DERIVED FROM THE SAID INPUT SPEECH SIGNAL, AND MEANS FOR FEEDING BACK THE SAID FORMANT-PARAMETER SIGNALS TO ADJUST THE SAID ELECTRICAL NETWORK MEANS CONTINUOUSLY SO THAT A SIGNAL AT THE PRIMARY OUTPUT IS THE SPEECH SIGNAL OPERATED UPON BY THE ELECTRICAL ANALOGUE OF SUBSTANTIALLY THE INVERSE OF AT LEAST SOME OF THE SAID FORMANT-FORMING RESONATORS.
US162045A 1960-12-30 1961-12-26 Speech bandwidth compression systems Expired - Lifetime US3190960A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB44730/60A GB936124A (en) 1960-12-30 1960-12-30 Speech bandwidth compression systems

Publications (1)

Publication Number Publication Date
US3190960A true US3190960A (en) 1965-06-22

Family

ID=10434513

Family Applications (1)

Application Number Title Priority Date Filing Date
US162045A Expired - Lifetime US3190960A (en) 1960-12-30 1961-12-26 Speech bandwidth compression systems

Country Status (4)

Country Link
US (1) US3190960A (en)
FR (1) FR1314613A (en)
GB (1) GB936124A (en)
NL (1) NL272790A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3327057A (en) * 1963-11-08 1967-06-20 Bell Telephone Labor Inc Speech analysis
US3483162A (en) * 1967-04-25 1969-12-09 Sperry Rand Corp Photochromic schiff base copolymers
US3808370A (en) * 1972-08-09 1974-04-30 Rockland Systems Corp System using adaptive filter for determining characteristics of an input

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US2996579A (en) * 1960-01-13 1961-08-15 Gen Dynamics Corp Feedback vocoder

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US2996579A (en) * 1960-01-13 1961-08-15 Gen Dynamics Corp Feedback vocoder

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3327057A (en) * 1963-11-08 1967-06-20 Bell Telephone Labor Inc Speech analysis
US3483162A (en) * 1967-04-25 1969-12-09 Sperry Rand Corp Photochromic schiff base copolymers
US3808370A (en) * 1972-08-09 1974-04-30 Rockland Systems Corp System using adaptive filter for determining characteristics of an input

Also Published As

Publication number Publication date
FR1314613A (en) 1963-01-11
GB936124A (en) 1963-09-04
NL272790A (en) 1964-08-25

Similar Documents

Publication Publication Date Title
US3360610A (en) Bandwidth compression utilizing magnitude and phase coded signals representative of the input signal
US4480333A (en) Method and apparatus for active sound control
EP0037493B1 (en) Voice analyzing apparatus
EP0159546A1 (en) Digital graphic equalizer
GB1334250A (en) Self adaptive filter and control circuit therefor
JP2791036B2 (en) Audio processing device
US3737783A (en) Signal-to-noise ratio improving device for receiving systems having two wave collectors
US3190960A (en) Speech bandwidth compression systems
US3786363A (en) Voltage-controlled low-pass filter
GB2069280A (en) Process of testing for a sound control system
US4064444A (en) Method and self-adapting damper system having energy balance
US3109066A (en) Sound control system
US3063034A (en) Method and apparatus for data processing
US3124745A (en) System
US4539536A (en) Parameter-setting approach to obtain high performance CTD transversal filters from devices with significant CTI and FPN
Lago et al. The design of sampled-data feedback systems
US3573374A (en) Formant vocoder utilizing resonator damping
US3393307A (en) Electronic multiplier/divider
Cron et al. Theoretical and experimental study of underwater sound reverberation
RU2039371C1 (en) System of automatic control over non-stationary object
JP2603065B2 (en) Prediction filter
JPH01286622A (en) Method and apparatus for processing signal
US3423530A (en) Speech synthesizer having q multiplier
DeBrunner et al. IIR Filter Sensitivity Predicts Filter Wordlength
RU2130635C1 (en) Adaptive control device with alternating structure