MX9801086A - Speech synthesizer having an acoustic element database - Google Patents

Speech synthesizer having an acoustic element database

Info

Publication number
MX9801086A
Authority
MX
Mexico
Prior art keywords
phonetic
database
sequences
trajectories
tolerance region
Prior art date
Application number
MX9801086A
Other languages
English (en)
Inventor
Bernd Moebius
Joseph Philip Olive
Michael Abraham Tanenblatt
Jan Pieter Van Santen
Original Assignee
Lucent Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lucent Technologies Inc
Publication of MX9801086A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to a speech synthesis method that employs an acoustic element database established from phonetic sequences occurring in an interval of a speech signal. In establishing the database, trajectories are determined for each of the phonetic sequences that contain a phonetic segment corresponding to a particular phoneme. A tolerance region is then identified based on a concentration of trajectories that correspond to different phoneme sequences. The acoustic elements for the database are formed from portions of the phonetic sequences by identifying cut points in the phonetic sequences that correspond to time points along the respective trajectories proximate to the tolerance region. In this way, acoustic elements that share common junction phonemes can be concatenated such that perceptible discontinuities at the junction phonemes are minimized. Computationally simple and fast methods for determining the tolerance region are also described.
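The tolerance-region idea in the abstract can be illustrated with a minimal sketch. It is not the patented procedure: it assumes each trajectory is a list of points in a two-dimensional space (for example, the first two formant frequencies in Hz), approximates the tolerance region by the cell of a uniform grid crossed by the most distinct phonetic sequences, and takes the cut point to be the sample nearest that cell's center. The function names, the grid approach, and the sample data are all hypothetical.

```python
from collections import defaultdict
from math import dist, floor


def find_tolerance_cell(trajectories, cell_size):
    """Return (center, members) for the grid cell crossed by the most
    distinct trajectories. `trajectories` maps a sequence label to a list
    of points; the grid is an assumed stand-in for the tolerance region."""
    hits = defaultdict(set)
    for label, points in trajectories.items():
        for point in points:
            cell = tuple(floor(x / cell_size) for x in point)
            hits[cell].add(label)  # count each sequence once per cell
    best = max(hits, key=lambda cell: len(hits[cell]))
    center = tuple((i + 0.5) * cell_size for i in best)
    return center, hits[best]


def cut_points(trajectories, center):
    """For each trajectory, return the index of the sample closest to the
    region center; that index is the assumed cut point in time."""
    return {
        label: min(range(len(points)), key=lambda t: dist(points[t], center))
        for label, points in trajectories.items()
    }


# Hypothetical (F1, F2) trajectories for sequences sharing the phoneme /i/.
example = {
    "b-i": [(300, 1800), (290, 2100), (280, 2250), (270, 2300)],
    "i-t": [(275, 2280), (300, 2200), (350, 1900)],
    "d-i": [(400, 1700), (320, 2050), (285, 2260)],
}
region_center, members = find_tolerance_cell(example, cell_size=100)
cuts = cut_points(example, region_center)
```

Cutting every sequence near the same region means that two elements joined at the shared phoneme meet at similar acoustic values, which is the property the abstract relies on to reduce audible discontinuities.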
MX9801086A 1995-08-16 1996-08-02 Speech synthesizer having an acoustic element database. MX9801086A (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/515,887 US5751907A (en) 1995-08-16 1995-08-16 Speech synthesizer having an acoustic element database
PCT/US1996/012628 WO1997007500A1 (en) 1995-08-16 1996-08-02 Speech synthesizer having an acoustic element database

Publications (1)

Publication Number Publication Date
MX9801086A (es) 1998-04-30

Family

ID=24053185

Family Applications (1)

Application Number Title Priority Date Filing Date
MX9801086A MX9801086A (es) 1995-08-16 1996-08-02 Speech synthesizer having an acoustic element database.

Country Status (10)

Country Link
US (1) US5751907A (es)
EP (1) EP0845139B1 (es)
JP (1) JP3340748B2 (es)
AU (1) AU6645096A (es)
BR (1) BR9612624A (es)
CA (1) CA2222582C (es)
DE (1) DE69627865T2 (es)
MX (1) MX9801086A (es)
TW (1) TW305990B (es)
WO (1) WO1997007500A1 (es)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7251314B2 (en) * 1994-10-18 2007-07-31 Lucent Technologies Voice message transfer between a sender and a receiver
JP3349905B2 (ja) * 1996-12-10 2002-11-25 松下電器産業株式会社 音声合成方法および装置
JP2000075878A (ja) * 1998-08-31 2000-03-14 Canon Inc 音声合成装置およびその方法ならびに記憶媒体
US6202049B1 (en) 1999-03-09 2001-03-13 Matsushita Electric Industrial Co., Ltd. Identification of unit overlap regions for concatenative speech synthesis system
US6178402B1 (en) * 1999-04-29 2001-01-23 Motorola, Inc. Method, apparatus and system for generating acoustic parameters in a text-to-speech system using a neural network
US7369994B1 (en) 1999-04-30 2008-05-06 At&T Corp. Methods and apparatus for rapid acoustic unit selection from a large speech corpus
US6618699B1 (en) 1999-08-30 2003-09-09 Lucent Technologies Inc. Formant tracking based on phoneme information
US7149690B2 (en) 1999-09-09 2006-12-12 Lucent Technologies Inc. Method and apparatus for interactive language instruction
US6725190B1 (en) * 1999-11-02 2004-04-20 International Business Machines Corporation Method and system for speech reconstruction from speech recognition features, pitch and voicing with resampled basis functions providing reconstruction of the spectral envelope
US7725307B2 (en) 1999-11-12 2010-05-25 Phoenix Solutions, Inc. Query engine for processing voice based queries including semantic decoding
US9076448B2 (en) * 1999-11-12 2015-07-07 Nuance Communications, Inc. Distributed real time speech recognition system
US7050977B1 (en) 1999-11-12 2006-05-23 Phoenix Solutions, Inc. Speech-enabled server for internet website and method
US7392185B2 (en) 1999-11-12 2008-06-24 Phoenix Solutions, Inc. Speech based learning/training system using semantic decoding
US7400712B2 (en) * 2001-01-18 2008-07-15 Lucent Technologies Inc. Network provided information using text-to-speech and speech recognition and text or speech activated network control sequences for complimentary feature access
US6625576B2 (en) 2001-01-29 2003-09-23 Lucent Technologies Inc. Method and apparatus for performing text-to-speech conversion in a client/server environment
US7010488B2 (en) * 2002-05-09 2006-03-07 Oregon Health & Science University System and method for compressing concatenative acoustic inventories for speech synthesis
US20040030555A1 (en) * 2002-08-12 2004-02-12 Oregon Health & Science University System and method for concatenating acoustic contours for speech synthesis
US7542903B2 (en) * 2004-02-18 2009-06-02 Fuji Xerox Co., Ltd. Systems and methods for determining predictive models of discourse functions
US20050187772A1 (en) * 2004-02-25 2005-08-25 Fuji Xerox Co., Ltd. Systems and methods for synthesizing speech using discourse function level prosodic features
JP4878538B2 (ja) * 2006-10-24 2012-02-15 株式会社日立製作所 音声合成装置
US8103506B1 (en) * 2007-09-20 2012-01-24 United Services Automobile Association Free text matching system and method
JP2011180416A (ja) * 2010-03-02 2011-09-15 Denso Corp 音声合成装置、音声合成方法およびカーナビゲーションシステム

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3704345A (en) * 1971-03-19 1972-11-28 Bell Telephone Labor Inc Conversion of printed text into synthetic speech
BG24190A1 (en) * 1976-09-08 1978-01-10 Antonov Method of synthesis of speech and device for effecting same
US4692941A (en) * 1984-04-10 1987-09-08 First Byte Real-time text-to-speech conversion system
US4831654A (en) * 1985-09-09 1989-05-16 Wang Laboratories, Inc. Apparatus for making and editing dictionary entries in a text to speech conversion system
US4820059A (en) * 1985-10-30 1989-04-11 Central Institute For The Deaf Speech processing apparatus and methods
JPS63501603A (ja) * 1985-10-30 1988-06-16 セントラル インステイチユ−ト フオ ザ デフ スピ−チ処理装置および方法
US4829580A (en) * 1986-03-26 1989-05-09 Telephone And Telegraph Company, At&T Bell Laboratories Text analysis system with letter sequence recognition and speech stress assignment arrangement
GB2207027B (en) * 1987-07-15 1992-01-08 Matsushita Electric Works Ltd Voice encoding and composing system
US4979216A (en) * 1989-02-17 1990-12-18 Malsheen Bathsheba J Text to speech synthesis system and method using context dependent vowel allophones
JPH031200A (ja) * 1989-05-29 1991-01-07 Nec Corp 規則型音声合成装置
US5235669A (en) * 1990-06-29 1993-08-10 At&T Laboratories Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec
US5283833A (en) * 1991-09-19 1994-02-01 At&T Bell Laboratories Method and apparatus for speech processing using morphology and rhyming
JPH05181491A (ja) * 1991-12-30 1993-07-23 Sony Corp 音声合成装置
US5490234A (en) * 1993-01-21 1996-02-06 Apple Computer, Inc. Waveform blending technique for text-to-speech system

Also Published As

Publication number Publication date
EP0845139A4 (en) 1999-10-20
EP0845139A1 (en) 1998-06-03
CA2222582A1 (en) 1997-02-27
JP2000509157A (ja) 2000-07-18
US5751907A (en) 1998-05-12
BR9612624A (pt) 2000-05-23
DE69627865T2 (de) 2004-02-19
WO1997007500A1 (en) 1997-02-27
DE69627865D1 (de) 2003-06-05
JP3340748B2 (ja) 2002-11-05
EP0845139B1 (en) 2003-05-02
CA2222582C (en) 2001-09-11
TW305990B (es) 1997-05-21
AU6645096A (en) 1997-03-12

Similar Documents

Publication Publication Date Title
MX9801086A (es) Speech synthesizer having an acoustic element database.
Ney et al. Improvements in beam search for 10000-word continuous speech recognition
US6349277B1 (en) Method and system for analyzing voices
EP0688010B1 (en) Speech synthesis method and speech synthesizer
GB9512284D0 (en) Speech Synthesiser
DE69513314D1 (de) Vorrichtung zur Erzeugung von Applaus für Karaoke-Singstimmen
EP1168306A3 (en) Method and apparatus for improving the intelligibility of digitally compressed speech
DE69629763D1 (de) Verfahren und Vorrichtung zur Ermittlung von Triphone Hidden Markov Modellen (HMM)
DE69417445D1 (de) Verfahren und system zur detektion und erzeugung von übergangsbedingungen in tonsignalen
EP0762386A3 (en) Method and apparatus for CELP coding an audio signal while distinguishing speech periods and non-speech periods
JP2000505914A (ja) 音声認識装置において、隠れマルコフ音声モデルを多言語で適用するための方法
TW326070B (en) The estimation method of the impulse gain for coding vocoder
US4847905A (en) Method of encoding speech signals using a multipulse excitation signal having amplitude-corrected pulses
Keiler et al. Efficient linear prediction for digital audio effects
JP2940835B2 (ja) ピッチ周波数差分特徴量抽出法
Colotte et al. Automatic enhancement of speech intelligibility
JPS5925237B2 (ja) 音声分析合成方式の音声区間判定方法
KR0155805B1 (ko) 부프레임별 유/무성 대역 정보를 이용한 음성합성 방법
Strube et al. Synthesis of unrestricted German speech from interpolated log-area-ratio coded transitions
JP3263136B2 (ja) 信号のピッチ同期位置抽出方式及び信号合成方式
JPS61128300A (ja) ピツチ抽出装置
JPH02232700A (ja) 音声合成装置
Morton Naturalness in synthetic speech
Resch et al. Time synchronization of speech.
Lee et al. Context‐dependent acoustic subword modeling for connected digit recognition

Legal Events

Date Code Title Description
FG Grant or registration
MM Annulment or lapse due to non-payment of fees