GB1485803A - Method and apparatus for the analysis and synthesis of speech - Google Patents

Method and apparatus for the analysis and synthesis of speech

Info

Publication number
GB1485803A
GB1485803A GB41575/74A GB4157574A GB1485803A GB 1485803 A GB1485803 A GB 1485803A GB 41575/74 A GB41575/74 A GB 41575/74A GB 4157574 A GB4157574 A GB 4157574A GB 1485803 A GB1485803 A GB 1485803A
Authority
GB
United Kingdom
Prior art keywords
model
vocal tract
speech
fed
output
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
GB41575/74A
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Gretag AG
Original Assignee
Gretag AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gretag AG filed Critical Gretag AG
Publication of GB1485803A publication Critical patent/GB1485803A/en
Expired legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

1485803 Analysis synthesis communication system GRETAG AG 24 Sept 1974 [22 July 1974] 41575/74 Heading H4R In an analysis synthesis telephone system the analyser includes a synthesizer which is identical to the synthesizer used at the receiver of the system, and is adjusted in accordance with the transmitted parameters, the output of the synthesizer is compared with stored samples of the original speech, and the transmitted parameters are adjusted to minimize the comparator output. As described the speech input from 1 after filtering in filter 2, having a cut-off frequency in the region 3-5 kHz, is converted to digital form in the A. to D converter 3 operating at a sampling frequency of 6 to 10 kHz and a short sample of speech, of 10 to 30 msecs. length is stored in unit 5. Simultaneously the pitch detector 4 determines, in conventional manner, whether the speech is voiced or unvoiced and, if voiced, the pitch period. The pitch detector output is fed to coder 10 for transmission to the synthesizer S, and also to a generator 6 which generates pitch pulses at the appropriate frequency if the speech is voiced, or a pseudo-random pulse stream if the speech is unvoiced. The output from generator 6 is fed to a vocal tract model 7, which is a linear digital filter simulating the transfer characteristic of the vocal tract, and the output from the model is compared at 8 with the incoming speech to provide an error signal which is used to adjust the parameters of the vocal tract model to reduce the error signal from the comparator. When the error signal has been reduced to a predetermined level the parameters set in the model 7 are coded in the coder 10 and transmitted. At the receiver a similar pulse/noise generator 6<SP>1</SP> and vocal tract model 7<SP>1</SP> to that in the transmitter are used to synthesize a signal suitable for conversion back to analogue form and reproduction on speaker 13. Since similar filters, generators, and vocal tract models, are used for transmission and reception it is suggested that only one filter, one generator, and one vocal tract model, are used with switching to change their function between transmission and reception. Vocal tract model, Fig. 2a, which includes a number of storage locations U 1 to U 8 , which forms the state vector U n of the model on the nth cycle, 8 linear combinations are formed from these values in the feedback matrix 22, the nth sample of the excitation sequence multiplied by the input vectors components b 1 to b 8 in multipliers 23 is added to each of the linear combinations from the matrix 22 in adders 26 to provide the new state vector U n + 1 for the n + 1th cycle. The output sequence y n is calculated as a linear combination of the values in the store 21 which are multiplied by the output vector components C 1 to C 8 in multipliers 24 and added, in 27, to the input pulse x n multiplied by a scalar quantity d to obtain the output sequence y n . The components of matrix A and vectors b and c and scalar quantity d can be divided into two groups, those that are invariable, and normally have simple values such as 0; 1, or - 1, and those that are changed by the optimization process. Parameter computer, Fig. 3a (not shown), includes a first primary model (29) a unit (30), and 8 primary part models (31 to 38). The primary model (29), is identical to the vocal tract model and can therefore be used as such, it is fed with the excitation function x n and yields the synthetic speech signal y n and the partial derivatives which is equal to the corresponding component Ui of the state vector U, and the derivative which is equal to the corresponding term of the excitation sequence x n in respect of the transition coefficient d. The unit 30 (not shown) is a dual model with respect to the vocal tract model and from it can be derived the partial derivatives In addition the components of the state vector U 1 to U 8 , formed by a matrix which is the transpose of the matrix 22 of the vocal tract model, are fed to primary part models 31 to 38 and the state vectors of the primary part modes each provide The partial derivatives of y n are fed to computer stages, Fig. 4 (not shown), to derive the partial derivatives of the error signal, in accordance with the selected error dimension, which partial derivatives are fed back to the parameter computer, Fig. 3a, to increment the parameters to enable y n to more closely approach the incoming speech signal S n . In an alternative arrangement the parameter computer includes the primary model (39), equivalent to the vocal tract model, and a dual part model (40) (as in 29 and 30 in Fig. 3a) but the state vector components U 1 to U 8 of the primary model (39) are fed to primary dual part models (41 to 48) to obtain the partial derivatives with respect to the matrix coefficients.
GB41575/74A 1974-07-22 1974-09-24 Method and apparatus for the analysis and synthesis of speech Expired GB1485803A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CH1006674A CH581878A5 (en) 1974-07-22 1974-07-22

Publications (1)

Publication Number Publication Date
GB1485803A true GB1485803A (en) 1977-09-14

Family

ID=4358956

Family Applications (1)

Application Number Title Priority Date Filing Date
GB41575/74A Expired GB1485803A (en) 1974-07-22 1974-09-24 Method and apparatus for the analysis and synthesis of speech

Country Status (4)

Country Link
US (1) US3909533A (en)
CA (1) CA1039407A (en)
CH (1) CH581878A5 (en)
GB (1) GB1485803A (en)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5154714A (en) * 1974-10-16 1976-05-14 Nippon Telegraph & Telephone Tajuonseidensohoshiki
US4058676A (en) * 1975-07-07 1977-11-15 International Communication Sciences Speech analysis and synthesis system
DE2536585C3 (en) * 1975-08-16 1981-04-02 Philips Patentverwaltung Gmbh, 2000 Hamburg Arrangement for statistical signal analysis
US4051331A (en) * 1976-03-29 1977-09-27 Brigham Young University Speech coding hearing aid system utilizing formant frequency transformation
IT1083533B (en) * 1977-06-20 1985-05-21 Cselt Centro Studi Lab Telecom PROCEDURE AND DEVICE FOR THE GENERATION OF A VOICE TYPE SIGNAL FOR THE PERFORMANCE OF OBJECTIVE MEASUREMENTS OF EQUIPMENT PERFORMANCE PART OF VOICE SIGNAL TRANSMISSION SYSTEMS
US4406626A (en) * 1979-07-31 1983-09-27 Anderson Weston A Electronic teaching aid
JPS56500980A (en) * 1979-07-31 1981-07-16
US4319083A (en) * 1980-02-04 1982-03-09 Texas Instruments Incorporated Integrated speech synthesis circuit with internal and external excitation capabilities
JPS58162470A (en) * 1982-03-24 1983-09-27 三菱電機株式会社 Register for calling of elevator
US4520499A (en) * 1982-06-25 1985-05-28 Milton Bradley Company Combination speech synthesis and recognition apparatus
US5127055A (en) * 1988-12-30 1992-06-30 Kurzweil Applied Intelligence, Inc. Speech recognition apparatus & method having dynamic reference pattern adaptation
US4972474A (en) * 1989-05-01 1990-11-20 Cylink Corporation Integer encryptor
WO1992011627A2 (en) * 1990-12-21 1992-07-09 British Telecommunications Public Limited Company Speech coding
JP2810252B2 (en) * 1991-05-22 1998-10-15 シャープ株式会社 Audio playback device
FI96247C (en) * 1993-02-12 1996-05-27 Nokia Telecommunications Oy Procedure for converting speech
US5471527A (en) 1993-12-02 1995-11-28 Dsc Communications Corporation Voice enhancement system and method
US5797120A (en) * 1996-09-04 1998-08-18 Advanced Micro Devices, Inc. System and method for generating re-configurable band limited noise using modulation
US20040210440A1 (en) * 2002-11-01 2004-10-21 Khosrow Lashkari Efficient implementation for joint optimization of excitation and model parameters with a general excitation function
US20130229530A1 (en) * 2012-03-02 2013-09-05 Apple Inc. Spectral calibration of imaging devices

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3631520A (en) * 1968-08-19 1971-12-28 Bell Telephone Labor Inc Predictive coding of speech signals
US3624302A (en) * 1969-10-29 1971-11-30 Bell Telephone Labor Inc Speech analysis and synthesis by the use of the linear prediction of a speech wave

Also Published As

Publication number Publication date
US3909533A (en) 1975-09-30
CH581878A5 (en) 1976-11-15
CA1039407A (en) 1978-09-26

Similar Documents

Publication Publication Date Title
GB1485803A (en) Method and apparatus for the analysis and synthesis of speech
US4220819A (en) Residual excited predictive speech coding system
CA1046642A (en) Phase vocoder speech synthesis system
US5007095A (en) System for synthesizing speech having fluctuation
CA2160749C (en) Speech coding apparatus, speech decoding apparatus, speech coding and decoding method and a phase amplitude characteristic extracting apparatus for carrying out the method
US6006174A (en) Multiple impulse excitation speech encoder and decoder
US4389540A (en) Adaptive linear prediction filters
GB2060321A (en) Speech synthesizer
US4742550A (en) 4800 BPS interoperable relp system
KR920022683A (en) Sampling frequency converter
US5048088A (en) Linear predictive speech analysis-synthesis apparatus
CA2154881A1 (en) A system and method for compression and decompression of audio signals
US4700393A (en) Speech synthesizer with variable speed of speech
US5369730A (en) Speech synthesizer
US4845753A (en) Pitch detecting device
US4601052A (en) Voice analysis composing method
US4908863A (en) Multi-pulse coding system
Rabiner et al. A hardware realization of a digital formant speech synthesizer
JPS57197600A (en) Phonemic piece editting type voice synthesization system
DE3478065D1 (en) Method and apparatus for coding digital signals
JPS59195283A (en) Electronic musical instrument
CA2070603C (en) Speech coders based on analysis-by-synthesis techniques
JP2615991B2 (en) Linear predictive speech analysis and synthesis device
SU1316030A1 (en) Method and apparatus for analyzing and synthesizing speech
JP3097324B2 (en) Digital sound data output device

Legal Events

Date Code Title Description
PS Patent sealed [section 19, patents act 1949]
746 Register noted 'licences of right' (sect. 46/1977)
PCNP Patent ceased through non-payment of renewal fee