BR9612624A - Sintetizador de fala tendo base de dados de elemento acústico - Google Patents
Sintetizador de fala tendo base de dados de elemento acústicoInfo
- Publication number
- BR9612624A BR9612624A BR9612624-8A BR9612624A BR9612624A BR 9612624 A BR9612624 A BR 9612624A BR 9612624 A BR9612624 A BR 9612624A BR 9612624 A BR9612624 A BR 9612624A
- Authority
- BR
- Brazil
- Prior art keywords
- database
- phonetic
- sequences
- trajectories
- acoustic element
- Prior art date
Links
- 238000000034 method Methods 0.000 abstract 1
- 238000001308 synthesis method Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
<B>SINTETIZADOR DE FALA TENDO BASE DE DADOS DE ELEMENTO ACúSTICO<D> Um método de síntese de fala emprega uma base de dados de elemento acústico que é estabelecida a partir de seq³ências fonéticas ocorridas em um intervalo de um sinal de fala. ao estabelecer a base de dados, trajetórias são determinadas (220) para cada uma das seq³ências fonéticas contendo um segmento fonético que corresponde a um fonema particular (210). Uma região de tolerância é então identificada baseada em uma concentração de trajetórias que correspondem às seq³ências de fonemas diferentes (230). Os elementos acústicos para a base de dados (260) são formados por porções das seq³ências fonéticas ao identificar pontos de corte (250) nas seq³ências fonéticas que correspondem aos pontos de tempo ao longo das trajetórias respectivas próximas à região de tolerância (240). Desta maneira, é possível concatenar os elementos acústicos tendo um fonema de junção comum, de modo que descontinuidades perceptíveis nos fonemas de junção sejam minimizadas. Métodos computacionalmente simples e rápidos para determinar a região de tolerância são também expostos.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/515,887 US5751907A (en) | 1995-08-16 | 1995-08-16 | Speech synthesizer having an acoustic element database |
PCT/US1996/012628 WO1997007500A1 (en) | 1995-08-16 | 1996-08-02 | Speech synthesizer having an acoustic element database |
Publications (1)
Publication Number | Publication Date |
---|---|
BR9612624A true BR9612624A (pt) | 2000-05-23 |
Family
ID=24053185
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR9612624-8A BR9612624A (pt) | 1995-08-16 | 1996-08-02 | Sintetizador de fala tendo base de dados de elemento acústico |
Country Status (10)
Country | Link |
---|---|
US (1) | US5751907A (pt) |
EP (1) | EP0845139B1 (pt) |
JP (1) | JP3340748B2 (pt) |
AU (1) | AU6645096A (pt) |
BR (1) | BR9612624A (pt) |
CA (1) | CA2222582C (pt) |
DE (1) | DE69627865T2 (pt) |
MX (1) | MX9801086A (pt) |
TW (1) | TW305990B (pt) |
WO (1) | WO1997007500A1 (pt) |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7251314B2 (en) * | 1994-10-18 | 2007-07-31 | Lucent Technologies | Voice message transfer between a sender and a receiver |
JP3349905B2 (ja) * | 1996-12-10 | 2002-11-25 | 松下電器産業株式会社 | 音声合成方法および装置 |
JP2000075878A (ja) * | 1998-08-31 | 2000-03-14 | Canon Inc | 音声合成装置およびその方法ならびに記憶媒体 |
US6202049B1 (en) | 1999-03-09 | 2001-03-13 | Matsushita Electric Industrial Co., Ltd. | Identification of unit overlap regions for concatenative speech synthesis system |
US6178402B1 (en) * | 1999-04-29 | 2001-01-23 | Motorola, Inc. | Method, apparatus and system for generating acoustic parameters in a text-to-speech system using a neural network |
US7369994B1 (en) | 1999-04-30 | 2008-05-06 | At&T Corp. | Methods and apparatus for rapid acoustic unit selection from a large speech corpus |
US6618699B1 (en) | 1999-08-30 | 2003-09-09 | Lucent Technologies Inc. | Formant tracking based on phoneme information |
US7149690B2 (en) | 1999-09-09 | 2006-12-12 | Lucent Technologies Inc. | Method and apparatus for interactive language instruction |
US6725190B1 (en) * | 1999-11-02 | 2004-04-20 | International Business Machines Corporation | Method and system for speech reconstruction from speech recognition features, pitch and voicing with resampled basis functions providing reconstruction of the spectral envelope |
US7725307B2 (en) | 1999-11-12 | 2010-05-25 | Phoenix Solutions, Inc. | Query engine for processing voice based queries including semantic decoding |
US7392185B2 (en) | 1999-11-12 | 2008-06-24 | Phoenix Solutions, Inc. | Speech based learning/training system using semantic decoding |
US7050977B1 (en) * | 1999-11-12 | 2006-05-23 | Phoenix Solutions, Inc. | Speech-enabled server for internet website and method |
US9076448B2 (en) * | 1999-11-12 | 2015-07-07 | Nuance Communications, Inc. | Distributed real time speech recognition system |
US7400712B2 (en) * | 2001-01-18 | 2008-07-15 | Lucent Technologies Inc. | Network provided information using text-to-speech and speech recognition and text or speech activated network control sequences for complimentary feature access |
US6625576B2 (en) | 2001-01-29 | 2003-09-23 | Lucent Technologies Inc. | Method and apparatus for performing text-to-speech conversion in a client/server environment |
US7010488B2 (en) * | 2002-05-09 | 2006-03-07 | Oregon Health & Science University | System and method for compressing concatenative acoustic inventories for speech synthesis |
US20040030555A1 (en) * | 2002-08-12 | 2004-02-12 | Oregon Health & Science University | System and method for concatenating acoustic contours for speech synthesis |
US7542903B2 (en) * | 2004-02-18 | 2009-06-02 | Fuji Xerox Co., Ltd. | Systems and methods for determining predictive models of discourse functions |
US20050187772A1 (en) * | 2004-02-25 | 2005-08-25 | Fuji Xerox Co., Ltd. | Systems and methods for synthesizing speech using discourse function level prosodic features |
JP4878538B2 (ja) * | 2006-10-24 | 2012-02-15 | 株式会社日立製作所 | 音声合成装置 |
US8103506B1 (en) * | 2007-09-20 | 2012-01-24 | United Services Automobile Association | Free text matching system and method |
JP2011180416A (ja) * | 2010-03-02 | 2011-09-15 | Denso Corp | 音声合成装置、音声合成方法およびカーナビゲーションシステム |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3704345A (en) * | 1971-03-19 | 1972-11-28 | Bell Telephone Labor Inc | Conversion of printed text into synthetic speech |
BG24190A1 (en) * | 1976-09-08 | 1978-01-10 | Antonov | Method of synthesis of speech and device for effecting same |
US4692941A (en) * | 1984-04-10 | 1987-09-08 | First Byte | Real-time text-to-speech conversion system |
US4831654A (en) * | 1985-09-09 | 1989-05-16 | Wang Laboratories, Inc. | Apparatus for making and editing dictionary entries in a text to speech conversion system |
EP0243479A4 (en) * | 1985-10-30 | 1989-12-13 | Central Inst Deaf | LANGUAGE PROCESSING ARRANGEMENT AND METHOD. |
US4820059A (en) * | 1985-10-30 | 1989-04-11 | Central Institute For The Deaf | Speech processing apparatus and methods |
US4829580A (en) * | 1986-03-26 | 1989-05-09 | Telephone And Telegraph Company, At&T Bell Laboratories | Text analysis system with letter sequence recognition and speech stress assignment arrangement |
GB2207027B (en) * | 1987-07-15 | 1992-01-08 | Matsushita Electric Works Ltd | Voice encoding and composing system |
US4979216A (en) * | 1989-02-17 | 1990-12-18 | Malsheen Bathsheba J | Text to speech synthesis system and method using context dependent vowel allophones |
JPH031200A (ja) * | 1989-05-29 | 1991-01-07 | Nec Corp | 規則型音声合成装置 |
US5235669A (en) * | 1990-06-29 | 1993-08-10 | At&T Laboratories | Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec |
US5283833A (en) * | 1991-09-19 | 1994-02-01 | At&T Bell Laboratories | Method and apparatus for speech processing using morphology and rhyming |
JPH05181491A (ja) * | 1991-12-30 | 1993-07-23 | Sony Corp | 音声合成装置 |
US5490234A (en) * | 1993-01-21 | 1996-02-06 | Apple Computer, Inc. | Waveform blending technique for text-to-speech system |
-
1995
- 1995-08-16 US US08/515,887 patent/US5751907A/en not_active Expired - Lifetime
-
1996
- 1996-08-02 CA CA002222582A patent/CA2222582C/en not_active Expired - Fee Related
- 1996-08-02 BR BR9612624-8A patent/BR9612624A/pt not_active Application Discontinuation
- 1996-08-02 AU AU66450/96A patent/AU6645096A/en not_active Abandoned
- 1996-08-02 WO PCT/US1996/012628 patent/WO1997007500A1/en active IP Right Grant
- 1996-08-02 MX MX9801086A patent/MX9801086A/es not_active IP Right Cessation
- 1996-08-02 DE DE69627865T patent/DE69627865T2/de not_active Expired - Lifetime
- 1996-08-02 JP JP50931697A patent/JP3340748B2/ja not_active Expired - Fee Related
- 1996-08-02 EP EP96926228A patent/EP0845139B1/en not_active Expired - Lifetime
- 1996-08-13 TW TW085109787A patent/TW305990B/zh not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
WO1997007500A1 (en) | 1997-02-27 |
TW305990B (pt) | 1997-05-21 |
EP0845139A1 (en) | 1998-06-03 |
EP0845139B1 (en) | 2003-05-02 |
CA2222582C (en) | 2001-09-11 |
DE69627865T2 (de) | 2004-02-19 |
US5751907A (en) | 1998-05-12 |
EP0845139A4 (en) | 1999-10-20 |
AU6645096A (en) | 1997-03-12 |
CA2222582A1 (en) | 1997-02-27 |
DE69627865D1 (de) | 2003-06-05 |
MX9801086A (es) | 1998-04-30 |
JP2000509157A (ja) | 2000-07-18 |
JP3340748B2 (ja) | 2002-11-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR9612624A (pt) | Sintetizador de fala tendo base de dados de elemento acústico | |
Ney et al. | Improvements in beam search for 10000-word continuous speech recognition | |
CA2313526A1 (en) | Apparatus and methods for detecting emotions | |
DE69513314D1 (de) | Vorrichtung zur Erzeugung von Applaus für Karaoke-Singstimmen | |
EP0387602A3 (en) | Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system | |
JPS57158900A (en) | Text voice synthesizer | |
FI955025A (fi) | Menetelmä ja laitteisto transienttitilanteiden havaitsemiseksi ja kehittämiseksi kuultavissa signaaleissa | |
Wightman et al. | The aligner: Text-to-speech alignment using Markov models | |
Gordon | Induction of rate-dependent processing by coarse-grained aspects of speech | |
SE9402284D0 (sv) | Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk | |
Larreur et al. | Linguistic and prosodic processing for a text-to-speech synthesis system. | |
Post | French tonal structures | |
EP1010170A4 (en) | METHOD AND SYSTEM FOR AUTOMATIC EVALUATION OF INDEPENDENT TEXT PRONUNCIATION FOR LANGUAGE LEARNING | |
JP2940835B2 (ja) | ピッチ周波数差分特徴量抽出法 | |
Xu et al. | Intonation perception of low-pass filtered speech in Mandarin and Cantonese | |
Campbell | Durational cues to prominence and grouping | |
Gustafson | Transcribing names with foreign origin in the ONOMASTICA project | |
Tatham et al. | Syllable reconstruction in concatenated waveform speech synthesis | |
Dilley et al. | Ambiguity in prominence perception in spoken utterances of American English | |
Carlson et al. | Segmental intelligibility of synthetic and natural speech in real and nonsense words. | |
IT1179093B (it) | Procedimento e dispositivo per il riconoscimento senza addestramento preventivo di parole connesse appartenenti a piccoli vocabolari | |
Daly-Kelly | Linguistic and acoustic characteristics of pause intervals in spontaneous speech. | |
Dent | Voice onset time of spontaneously spoken Spanish voiceless stops | |
Miller | Properties of feature detectors for VOT | |
Mora et al. | Intonation features as a form of dialec tal distinction in venezuelan spanish |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FA10 | Dismissal: dismissal - article 33 of industrial property law | ||
B15K | Others concerning applications: alteration of classification |
Ipc: G10L 13/02 (2013.01) |