EP0347338A3 - Method and apparatus for speech analysis, synthesis and coding - Google Patents
Method and apparatus for speech analysis, synthesis and coding Download PDFInfo
- Publication number
- EP0347338A3 EP0347338A3 EP19890420197 EP89420197A EP0347338A3 EP 0347338 A3 EP0347338 A3 EP 0347338A3 EP 19890420197 EP19890420197 EP 19890420197 EP 89420197 A EP89420197 A EP 89420197A EP 0347338 A3 EP0347338 A3 EP 0347338A3
- Authority
- EP
- European Patent Office
- Prior art keywords
- tube
- sections
- synthesis
- speech analysis
- formants
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000015572 biosynthetic process Effects 0.000 title abstract 2
- 238000003786 synthesis reaction Methods 0.000 title abstract 2
- 238000000034 method Methods 0.000 title 1
- 238000004088 simulation Methods 0.000 abstract 2
- 210000001260 vocal cord Anatomy 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Electrophonic Musical Instruments (AREA)
- Prostheses (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
Il est décrit un moyen d'analyse et de synthèse de la parole utilisant la simulation du comportement acoustique d'un tube divisé en tronçons de section variable. Les variations de section des différents tronçons d'un tube permettent d'engendrer des phonèmes lorsqu'une source de débit et pression d'air est placée analogue aux cordes vocales humaines. Par simulation on peut engendrer ces phonèmes sous forme de signaux électriques fournis à un haut-parleur. L'invention porte sur le choix des longueurs des tronçons de tube et lie ce choix à la finesse de l'approximation qu'on veut faire. Pour une approximation à trois formants (les formants sont les fréquences de résonance du tube), on divise le tube en huit tronçons de longueurs successives L/10, L/15, 2L/15, 3L/15, 3L/15/ 2L/15, L/15 et L/10 ; L est la longueur totale du tube. There is described a means of speech analysis and synthesis using the simulation of the acoustic behavior of a tube divided into sections of variable section. The variations in section of the different sections of a tube make it possible to generate phonemes when a source of air flow and air pressure is placed analogous to human vocal cords. By simulation, these phonemes can be generated in the form of electrical signals supplied to a loudspeaker. The invention relates to the choice of the lengths of the tube sections and links this choice to the fineness of the approximation that we want to make. For an approximation to three formants (the formants are the resonance frequencies of the tube), the tube is divided into eight sections of successive lengths L / 10, L / 15, 2L / 15, 3L / 15, 3L / 15 / 2L / 15, L / 15 and L / 10; L is the total length of the tube.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR8808255 | 1988-06-14 | ||
FR8808255A FR2632725B1 (en) | 1988-06-14 | 1988-06-14 | METHOD AND DEVICE FOR ANALYSIS, SYNTHESIS, SPEECH CODING |
Publications (2)
Publication Number | Publication Date |
---|---|
EP0347338A2 EP0347338A2 (en) | 1989-12-20 |
EP0347338A3 true EP0347338A3 (en) | 1992-01-29 |
Family
ID=9367486
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP19890420197 Withdrawn EP0347338A3 (en) | 1988-06-14 | 1989-06-08 | Method and apparatus for speech analysis, synthesis and coding |
Country Status (3)
Country | Link |
---|---|
US (1) | US5121434A (en) |
EP (1) | EP0347338A3 (en) |
FR (1) | FR2632725B1 (en) |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FI91925C (en) * | 1991-04-30 | 1994-08-25 | Nokia Telecommunications Oy | Procedure for identifying a speaker |
US5522013A (en) * | 1991-04-30 | 1996-05-28 | Nokia Telecommunications Oy | Method for speaker recognition using a lossless tube model of the speaker's |
FI96246C (en) * | 1993-02-04 | 1996-05-27 | Nokia Telecommunications Oy | Procedure for sending and receiving coded speech |
FI96247C (en) * | 1993-02-12 | 1996-05-27 | Nokia Telecommunications Oy | Procedure for converting speech |
US5640490A (en) * | 1994-11-14 | 1997-06-17 | Fonix Corporation | User independent, real-time speech recognition system and method |
US5971613A (en) | 1997-04-11 | 1999-10-26 | Kapak Corp. | Bag constructions having inwardly directed side seal portions |
US6823305B2 (en) * | 2000-12-21 | 2004-11-23 | International Business Machines Corporation | Apparatus and method for speaker normalization based on biometrics |
JP2003255993A (en) * | 2002-03-04 | 2003-09-10 | Ntt Docomo Inc | System, method, and program for speech recognition, and system, method, and program for speech synthesis |
US8210851B2 (en) * | 2004-01-13 | 2012-07-03 | Posit Science Corporation | Method for modulating listener attention toward synthetic formant transition cues in speech stimuli for training |
US20050175972A1 (en) * | 2004-01-13 | 2005-08-11 | Neuroscience Solutions Corporation | Method for enhancing memory and cognition in aging adults |
US20060177805A1 (en) * | 2004-01-13 | 2006-08-10 | Posit Science Corporation | Method for enhancing memory and cognition in aging adults |
US20060051727A1 (en) * | 2004-01-13 | 2006-03-09 | Posit Science Corporation | Method for enhancing memory and cognition in aging adults |
US20070111173A1 (en) * | 2004-01-13 | 2007-05-17 | Posit Science Corporation | Method for modulating listener attention toward synthetic formant transition cues in speech stimuli for training |
US20070065789A1 (en) * | 2004-01-13 | 2007-03-22 | Posit Science Corporation | Method for enhancing memory and cognition in aging adults |
US20060073452A1 (en) * | 2004-01-13 | 2006-04-06 | Posit Science Corporation | Method for enhancing memory and cognition in aging adults |
US20060105307A1 (en) * | 2004-01-13 | 2006-05-18 | Posit Science Corporation | Method for enhancing memory and cognition in aging adults |
US20070134635A1 (en) * | 2005-12-13 | 2007-06-14 | Posit Science Corporation | Cognitive training using formant frequency sweeps |
JP5178607B2 (en) * | 2009-03-31 | 2013-04-10 | 株式会社バンダイナムコゲームス | Program, information storage medium, mouth shape control method, and mouth shape control device |
WO2012003602A1 (en) * | 2010-07-09 | 2012-01-12 | 西安交通大学 | Method for reconstructing electronic larynx speech and system thereof |
US9308446B1 (en) | 2013-03-07 | 2016-04-12 | Posit Science Corporation | Neuroplasticity games for social cognition disorders |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3280266A (en) * | 1963-05-15 | 1966-10-18 | Bell Telephone Labor Inc | Synthesis of artificial speech |
US3472964A (en) * | 1965-12-29 | 1969-10-14 | Texas Instruments Inc | Vocal response synthesizer |
SU681447A1 (en) * | 1975-04-15 | 1979-08-25 | Институт математики СО АН СССР | Speech imitator |
FI66268C (en) * | 1980-12-16 | 1984-09-10 | Euroka Oy | MOENSTER OCH FILTERKOPPLING FOER AOTERGIVNING AV AKUSTISK LJUDVAEG ANVAENDNINGAR AV MOENSTRET OCH MOENSTRET TILLAEMPANDETALSYNTETISATOR |
-
1988
- 1988-06-14 FR FR8808255A patent/FR2632725B1/en not_active Expired - Fee Related
-
1989
- 1989-06-08 EP EP19890420197 patent/EP0347338A3/en not_active Withdrawn
- 1989-06-14 US US07/365,566 patent/US5121434A/en not_active Expired - Fee Related
Non-Patent Citations (5)
Title |
---|
ICASSP'86 (IEEE-IECEJ-ASJ INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING), Tokyo, 7-11 avril 1986, vol. 3, pages 2011-2014, IEEE, New York, US; W. FRANK et al.: "Improved vocal tract models for speech synthesis" * |
IRE TRANSACTIONS ON CIRCUIT THEORY, vol. CT-3, no. 4, décembre 1956, pages 232-244, New York, US; E.E. DAVID, Jr.: "Signal theory in speech transmission" * |
J.L. FLANAGAN: "Speech Analysis Synthesis and Perception", 1965, pages 166-171, Springer-Verlag, Berlin, DE * |
SPEECH COMMUNICATION, vol. 7, no. 3, octobre 1988, pages 257-286, Amsterdam, NL; M. MRAYATI et al.: "Distinctive regions and modes: a new theory of speech production" * |
THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, vol. 22, no. 6, novembre 1950, pages 740-753, New York, US; H.K. DUNN: "The calculation of vowel resonances, and an electrical vocal tract" * |
Also Published As
Publication number | Publication date |
---|---|
EP0347338A2 (en) | 1989-12-20 |
FR2632725A1 (en) | 1989-12-15 |
US5121434A (en) | 1992-06-09 |
FR2632725B1 (en) | 1990-09-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0347338A3 (en) | Method and apparatus for speech analysis, synthesis and coding | |
Lindqvist-Gauffin et al. | Acoustic properties of the nasal tract | |
US20030097254A1 (en) | Ultra-narrow bandwidth voice coding | |
Gold et al. | The channel vocoder | |
EP0732687A3 (en) | Apparatus for expanding speech bandwidth | |
DE60122751D1 (en) | METHOD AND DEVICE FOR OBJECTIVE EVALUATION OF LANGUAGE QUALITY WITHOUT REFERENCE SIGNAL | |
Quatieri et al. | Phase coherence in speech reconstruction for enhancement and coding applications | |
US2150364A (en) | Signaling system | |
Norton et al. | Improved LAboratory Prototype ELectrolarynx (LAPEL): using inverse filtering of the frequency response function of the human throat | |
Rodriguez et al. | A fuzzy information space approach to speech signal non‐linear analysis | |
Cassidy et al. | Auditory Display of Hyperspectral Colon Tissue Images Using Vocal Synthesis Models. | |
Malathi et al. | Speech enhancement via smart larynx of variable frequency for laryngectomee patient for Tamil language syllables using RADWT algorithm | |
KR100484666B1 (en) | Voice Color Converter using Transforming Vocal Tract Characteristic and Method | |
McCutcheon et al. | Effects of palatal morphology on/s, z/articulation | |
Sondhi | Transmission line inversion and synthesis from the point of view of transient response | |
Dunn et al. | Complex zeros of a triangular approximation to the glottal wave | |
Rael et al. | A computationally efficient articulatory synthesizer | |
Tyrrell et al. | Transputer-based human hearing simulation | |
Nataraja | An Objective Method of Locating Optimum Pitch | |
Rosenau | Research Project: What are the Physical and Psychoacoustic Criteria for the Quality of Good Singing Voices? | |
Whalen | Articulatory synthesis: Advances and prospects | |
Maybury et al. | Auditory models for speech analysis | |
Erogul et al. | Multiresolutional modification of speech signals for listeners with hearing impairment | |
Vemula et al. | Estimation of vocal tract shape from input/output measurements | |
BE512313A (en) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): DE FR GB IT NL |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): DE FR GB IT NL |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 19920730 |