BR9612624A - Sintetizador de fala tendo base de dados de elemento acústico - Google Patents

Sintetizador de fala tendo base de dados de elemento acústico

Info

Publication number
BR9612624A
BR9612624A BR9612624-8A BR9612624A BR9612624A BR 9612624 A BR9612624 A BR 9612624A BR 9612624 A BR9612624 A BR 9612624A BR 9612624 A BR9612624 A BR 9612624A
Authority
BR
Brazil
Prior art keywords
database
phonetic
sequences
trajectories
acoustic element
Prior art date
Application number
BR9612624-8A
Other languages
English (en)
Inventor
Bernd Moebius
Joseph Philip Olive
Michael Abraham Tanenblatt
Jan Pieter Van Santen
Original Assignee
Lucent Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lucent Technologies Inc filed Critical Lucent Technologies Inc
Publication of BR9612624A publication Critical patent/BR9612624A/pt

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

<B>SINTETIZADOR DE FALA TENDO BASE DE DADOS DE ELEMENTO ACúSTICO<D> Um método de síntese de fala emprega uma base de dados de elemento acústico que é estabelecida a partir de seq³ências fonéticas ocorridas em um intervalo de um sinal de fala. ao estabelecer a base de dados, trajetórias são determinadas (220) para cada uma das seq³ências fonéticas contendo um segmento fonético que corresponde a um fonema particular (210). Uma região de tolerância é então identificada baseada em uma concentração de trajetórias que correspondem às seq³ências de fonemas diferentes (230). Os elementos acústicos para a base de dados (260) são formados por porções das seq³ências fonéticas ao identificar pontos de corte (250) nas seq³ências fonéticas que correspondem aos pontos de tempo ao longo das trajetórias respectivas próximas à região de tolerância (240). Desta maneira, é possível concatenar os elementos acústicos tendo um fonema de junção comum, de modo que descontinuidades perceptíveis nos fonemas de junção sejam minimizadas. Métodos computacionalmente simples e rápidos para determinar a região de tolerância são também expostos.
BR9612624-8A 1995-08-16 1996-08-02 Sintetizador de fala tendo base de dados de elemento acústico BR9612624A (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/515,887 US5751907A (en) 1995-08-16 1995-08-16 Speech synthesizer having an acoustic element database
PCT/US1996/012628 WO1997007500A1 (en) 1995-08-16 1996-08-02 Speech synthesizer having an acoustic element database

Publications (1)

Publication Number Publication Date
BR9612624A true BR9612624A (pt) 2000-05-23

Family

ID=24053185

Family Applications (1)

Application Number Title Priority Date Filing Date
BR9612624-8A BR9612624A (pt) 1995-08-16 1996-08-02 Sintetizador de fala tendo base de dados de elemento acústico

Country Status (10)

Country Link
US (1) US5751907A (pt)
EP (1) EP0845139B1 (pt)
JP (1) JP3340748B2 (pt)
AU (1) AU6645096A (pt)
BR (1) BR9612624A (pt)
CA (1) CA2222582C (pt)
DE (1) DE69627865T2 (pt)
MX (1) MX9801086A (pt)
TW (1) TW305990B (pt)
WO (1) WO1997007500A1 (pt)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7251314B2 (en) * 1994-10-18 2007-07-31 Lucent Technologies Voice message transfer between a sender and a receiver
JP3349905B2 (ja) * 1996-12-10 2002-11-25 松下電器産業株式会社 音声合成方法および装置
JP2000075878A (ja) * 1998-08-31 2000-03-14 Canon Inc 音声合成装置およびその方法ならびに記憶媒体
US6202049B1 (en) 1999-03-09 2001-03-13 Matsushita Electric Industrial Co., Ltd. Identification of unit overlap regions for concatenative speech synthesis system
US6178402B1 (en) * 1999-04-29 2001-01-23 Motorola, Inc. Method, apparatus and system for generating acoustic parameters in a text-to-speech system using a neural network
US7369994B1 (en) 1999-04-30 2008-05-06 At&T Corp. Methods and apparatus for rapid acoustic unit selection from a large speech corpus
US6618699B1 (en) 1999-08-30 2003-09-09 Lucent Technologies Inc. Formant tracking based on phoneme information
US7149690B2 (en) 1999-09-09 2006-12-12 Lucent Technologies Inc. Method and apparatus for interactive language instruction
US6725190B1 (en) * 1999-11-02 2004-04-20 International Business Machines Corporation Method and system for speech reconstruction from speech recognition features, pitch and voicing with resampled basis functions providing reconstruction of the spectral envelope
US7725307B2 (en) 1999-11-12 2010-05-25 Phoenix Solutions, Inc. Query engine for processing voice based queries including semantic decoding
US7392185B2 (en) 1999-11-12 2008-06-24 Phoenix Solutions, Inc. Speech based learning/training system using semantic decoding
US7050977B1 (en) * 1999-11-12 2006-05-23 Phoenix Solutions, Inc. Speech-enabled server for internet website and method
US9076448B2 (en) * 1999-11-12 2015-07-07 Nuance Communications, Inc. Distributed real time speech recognition system
US7400712B2 (en) * 2001-01-18 2008-07-15 Lucent Technologies Inc. Network provided information using text-to-speech and speech recognition and text or speech activated network control sequences for complimentary feature access
US6625576B2 (en) 2001-01-29 2003-09-23 Lucent Technologies Inc. Method and apparatus for performing text-to-speech conversion in a client/server environment
US7010488B2 (en) * 2002-05-09 2006-03-07 Oregon Health & Science University System and method for compressing concatenative acoustic inventories for speech synthesis
US20040030555A1 (en) * 2002-08-12 2004-02-12 Oregon Health & Science University System and method for concatenating acoustic contours for speech synthesis
US7542903B2 (en) * 2004-02-18 2009-06-02 Fuji Xerox Co., Ltd. Systems and methods for determining predictive models of discourse functions
US20050187772A1 (en) * 2004-02-25 2005-08-25 Fuji Xerox Co., Ltd. Systems and methods for synthesizing speech using discourse function level prosodic features
JP4878538B2 (ja) * 2006-10-24 2012-02-15 株式会社日立製作所 音声合成装置
US8103506B1 (en) * 2007-09-20 2012-01-24 United Services Automobile Association Free text matching system and method
JP2011180416A (ja) * 2010-03-02 2011-09-15 Denso Corp 音声合成装置、音声合成方法およびカーナビゲーションシステム

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3704345A (en) * 1971-03-19 1972-11-28 Bell Telephone Labor Inc Conversion of printed text into synthetic speech
BG24190A1 (en) * 1976-09-08 1978-01-10 Antonov Method of synthesis of speech and device for effecting same
US4692941A (en) * 1984-04-10 1987-09-08 First Byte Real-time text-to-speech conversion system
US4831654A (en) * 1985-09-09 1989-05-16 Wang Laboratories, Inc. Apparatus for making and editing dictionary entries in a text to speech conversion system
EP0243479A4 (en) * 1985-10-30 1989-12-13 Central Inst Deaf LANGUAGE PROCESSING ARRANGEMENT AND METHOD.
US4820059A (en) * 1985-10-30 1989-04-11 Central Institute For The Deaf Speech processing apparatus and methods
US4829580A (en) * 1986-03-26 1989-05-09 Telephone And Telegraph Company, At&T Bell Laboratories Text analysis system with letter sequence recognition and speech stress assignment arrangement
GB2207027B (en) * 1987-07-15 1992-01-08 Matsushita Electric Works Ltd Voice encoding and composing system
US4979216A (en) * 1989-02-17 1990-12-18 Malsheen Bathsheba J Text to speech synthesis system and method using context dependent vowel allophones
JPH031200A (ja) * 1989-05-29 1991-01-07 Nec Corp 規則型音声合成装置
US5235669A (en) * 1990-06-29 1993-08-10 At&T Laboratories Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec
US5283833A (en) * 1991-09-19 1994-02-01 At&T Bell Laboratories Method and apparatus for speech processing using morphology and rhyming
JPH05181491A (ja) * 1991-12-30 1993-07-23 Sony Corp 音声合成装置
US5490234A (en) * 1993-01-21 1996-02-06 Apple Computer, Inc. Waveform blending technique for text-to-speech system

Also Published As

Publication number Publication date
WO1997007500A1 (en) 1997-02-27
TW305990B (pt) 1997-05-21
EP0845139A1 (en) 1998-06-03
EP0845139B1 (en) 2003-05-02
CA2222582C (en) 2001-09-11
DE69627865T2 (de) 2004-02-19
US5751907A (en) 1998-05-12
EP0845139A4 (en) 1999-10-20
AU6645096A (en) 1997-03-12
CA2222582A1 (en) 1997-02-27
DE69627865D1 (de) 2003-06-05
MX9801086A (es) 1998-04-30
JP2000509157A (ja) 2000-07-18
JP3340748B2 (ja) 2002-11-05

Similar Documents

Publication Publication Date Title
BR9612624A (pt) Sintetizador de fala tendo base de dados de elemento acústico
Ney et al. Improvements in beam search for 10000-word continuous speech recognition
CA2313526A1 (en) Apparatus and methods for detecting emotions
DE69513314D1 (de) Vorrichtung zur Erzeugung von Applaus für Karaoke-Singstimmen
EP0387602A3 (en) Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system
JPS57158900A (en) Text voice synthesizer
FI955025A (fi) Menetelmä ja laitteisto transienttitilanteiden havaitsemiseksi ja kehittämiseksi kuultavissa signaaleissa
Wightman et al. The aligner: Text-to-speech alignment using Markov models
Gordon Induction of rate-dependent processing by coarse-grained aspects of speech
SE9402284D0 (sv) Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk
Larreur et al. Linguistic and prosodic processing for a text-to-speech synthesis system.
Post French tonal structures
EP1010170A4 (en) METHOD AND SYSTEM FOR AUTOMATIC EVALUATION OF INDEPENDENT TEXT PRONUNCIATION FOR LANGUAGE LEARNING
JP2940835B2 (ja) ピッチ周波数差分特徴量抽出法
Xu et al. Intonation perception of low-pass filtered speech in Mandarin and Cantonese
Campbell Durational cues to prominence and grouping
Gustafson Transcribing names with foreign origin in the ONOMASTICA project
Tatham et al. Syllable reconstruction in concatenated waveform speech synthesis
Dilley et al. Ambiguity in prominence perception in spoken utterances of American English
Carlson et al. Segmental intelligibility of synthetic and natural speech in real and nonsense words.
IT1179093B (it) Procedimento e dispositivo per il riconoscimento senza addestramento preventivo di parole connesse appartenenti a piccoli vocabolari
Daly-Kelly Linguistic and acoustic characteristics of pause intervals in spontaneous speech.
Dent Voice onset time of spontaneously spoken Spanish voiceless stops
Miller Properties of feature detectors for VOT
Mora et al. Intonation features as a form of dialec tal distinction in venezuelan spanish

Legal Events

Date Code Title Description
FA10 Dismissal: dismissal - article 33 of industrial property law
B15K Others concerning applications: alteration of classification

Ipc: G10L 13/02 (2013.01)