ATE261171T1 - APPARATUS AND METHOD FOR GENERATING AND EVALUATING MULTIPLE PRONUNCIATION VARIANTS OF A SPELLED WORD USING DECISION TREES

Info

Publication number
ATE261171T1
Authority
AT
Austria
Prior art keywords
spelled word
pronunciations
generating
decision trees
mixed
Application number
AT99303390T
Other languages
German (de)
Inventor
Roland Kuhn
Jean-Claude Junqua
Matteo Contolini
Original Assignee
Matsushita Electric Ind Co Ltd
Priority date
1998-04-29
Filing date
1999-04-29
Publication date
2004-03-15
Priority claimed from US09/067,764 (US6016471A)
Priority claimed from US09/069,308 (US6230131B1)
Priority claimed from US09/070,300 (US6029132A)
Application filed by Matsushita Electric Ind Co Ltd
Application granted
Publication of ATE261171T1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04 Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/08 Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination

Abstract

The mixed decision tree includes a network of yes-no questions about adjacent letters in a spelled word sequence and also about adjacent phonemes in the phoneme sequence corresponding to the spelled word sequence. Leaf nodes of the mixed decision tree provide information about which phonetic transcriptions are most probable. Using the mixed trees, scores are developed for each of a plurality of possible pronunciations, and these scores can be used to select the best pronunciation as well as to rank pronunciations in order of probability. The pronunciations generated by the system can be used in speech synthesis and speech recognition applications as well as lexicography applications.
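The scoring idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Several things here are assumptions made only for illustration: one tree per letter, a one-to-one alignment between letters and phonemes (with "-" standing for a silent letter), summed log probabilities as the score, and all names, phoneme labels, and probabilities in the toy trees.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Union

# A question looks at the word, the candidate phonemes, and a position.
Question = Callable[[str, Sequence[str], int], bool]

@dataclass
class Leaf:
    probs: Dict[str, float]  # phoneme -> probability (toy values)

@dataclass
class Node:
    question: Question
    yes: "Tree"
    no: "Tree"

Tree = Union[Leaf, Node]

def letter_at(offset: int, letters: str) -> Question:
    """Yes-no question about a neighbouring letter."""
    def ask(word: str, phones: Sequence[str], i: int) -> bool:
        j = i + offset
        return 0 <= j < len(word) and word[j] in letters
    return ask

def phoneme_at(offset: int, phonemes: set) -> Question:
    """Yes-no question about a neighbouring phoneme of the candidate."""
    def ask(word: str, phones: Sequence[str], i: int) -> bool:
        j = i + offset
        return 0 <= j < len(phones) and phones[j] in phonemes
    return ask

def find_leaf(tree: Tree, word: str, phones: Sequence[str], i: int) -> Leaf:
    # Walk from the root, answering one question per internal node.
    while isinstance(tree, Node):
        tree = tree.yes if tree.question(word, phones, i) else tree.no
    return tree

def score(trees: Dict[str, Tree], word: str, phones: Sequence[str],
          floor: float = 1e-6) -> float:
    """Sum of log probabilities read from the leaf reached at each letter."""
    total = 0.0
    for i, letter in enumerate(word):
        leaf = find_leaf(trees[letter], word, phones, i)
        total += math.log(leaf.probs.get(phones[i], floor))
    return total

def rank(trees: Dict[str, Tree], word: str,
         candidates: List[List[str]]) -> List[List[str]]:
    """Order candidate pronunciations from most to least probable."""
    return sorted(candidates, key=lambda p: score(trees, word, p), reverse=True)

# Hypothetical toy trees for the letters of "ice".
TOY_TREES: Dict[str, Tree] = {
    "i": Leaf({"ay": 0.7, "ih": 0.3}),
    "c": Node(letter_at(+1, "eiy"),            # letter question
              Leaf({"s": 0.9, "k": 0.1}),
              Leaf({"k": 0.9, "s": 0.1})),
    "e": Node(phoneme_at(-1, {"s"}),           # phoneme question
              Leaf({"-": 0.8, "iy": 0.2}),
              Leaf({"eh": 0.6, "-": 0.4})),
}

candidates = [["ih", "k", "eh"], ["ay", "s", "-"], ["ay", "k", "-"]]
ranked = rank(TOY_TREES, "ice", candidates)
best = ranked[0]  # ["ay", "s", "-"] under these toy probabilities
```

The tree for "e" asks about the preceding phoneme rather than a letter, which is what makes the toy tree "mixed" in the sense the abstract describes. The full ranked list, not only the top entry, is kept so that alternative pronunciations remain available to a recognizer or lexicographer.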
AT99303390T 1998-04-29 1999-04-29 APPARATUS AND METHOD FOR GENERATING AND EVALUATING MULTIPLE PRONUNCIATION VARIANTS OF A SPELLED WORD USING DECISION TREES ATE261171T1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US09/067,764 US6016471A (en) 1998-04-29 1998-04-29 Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word
US09/069,308 US6230131B1 (en) 1998-04-29 1998-04-29 Method for generating spelling-to-pronunciation decision tree
US09/070,300 US6029132A (en) 1998-04-30 1998-04-30 Method for letter-to-sound in text-to-speech synthesis

Publications (1)

Publication Number Publication Date
ATE261171T1 (en) 2004-03-15

Family

ID=27371225

Family Applications (1)

Application Number Title Priority Date Filing Date
AT99303390T ATE261171T1 (en) APPARATUS AND METHOD FOR GENERATING AND EVALUATING MULTIPLE PRONUNCIATION VARIANTS OF A SPELLED WORD USING DECISION TREES 1998-04-29 1999-04-29

Country Status (7)

Country Link
EP (1) EP0953970B1 (en)
JP (1) JP3481497B2 (en)
KR (1) KR100509797B1 (en)
CN (1) CN1118770C (en)
AT (1) ATE261171T1 (en)
DE (1) DE69915162D1 (en)
TW (1) TW422967B (en)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE50003260D1 (en) * 1999-03-08 2003-09-18 Siemens Ag METHOD AND ARRANGEMENT FOR DETERMINING A REPRESENTATIVE SOUND
AU1767600A (en) * 1999-12-23 2001-07-09 Intel Corporation Speech recognizer with a lexical tree based n-gram language model
US6684187B1 (en) 2000-06-30 2004-01-27 At&T Corp. Method and system for preselection of suitable units for concatenative speech
US6505158B1 (en) 2000-07-05 2003-01-07 At&T Corp. Synthesis-based pre-selection of suitable units for concatenative speech
AU2000276394A1 (en) * 2000-09-30 2002-04-15 Intel Corporation Method and system for generating and searching an optimal maximum likelihood decision tree for hidden markov model (hmm) based speech recognition
CN100423911C (en) * 2000-10-13 2008-10-08 索尼公司 Robot device and behavior control method for robot device
US6845358B2 (en) * 2001-01-05 2005-01-18 Matsushita Electric Industrial Co., Ltd. Prosody template matching for text-to-speech systems
US20040078191A1 (en) * 2002-10-22 2004-04-22 Nokia Corporation Scalable neural network-based language identification from written text
US7146319B2 (en) * 2003-03-31 2006-12-05 Novauris Technologies Ltd. Phonetically based speech recognition system and method
FI118062B (en) * 2003-04-30 2007-06-15 Nokia Corp Decision tree with a sparse memory
EP1638080B1 (en) * 2004-08-11 2007-10-03 International Business Machines Corporation A text-to-speech system and method
US7558389B2 (en) * 2004-10-01 2009-07-07 At&T Intellectual Property Ii, L.P. Method and system of generating a speech signal with overlayed random frequency signal
GB2428853A (en) 2005-07-22 2007-02-07 Novauris Technologies Ltd Speech recognition application specific dictionary
US20090291419A1 (en) * 2005-08-01 2009-11-26 Kazuaki Uekawa System of sound representation and pronunciation techniques for english and other european languages
JP4769223B2 (en) * 2007-04-26 2011-09-07 旭化成株式会社 Text phonetic symbol conversion dictionary creation device, recognition vocabulary dictionary creation device, and speech recognition device
CN101452701B (en) * 2007-12-05 2011-09-07 株式会社东芝 Confidence degree estimation method and device based on inverse model
KR101250897B1 (en) * 2009-08-14 2013-04-04 한국전자통신연구원 Apparatus for word entry searching in a portable electronic dictionary and method thereof
US20110238412A1 (en) * 2010-03-26 2011-09-29 Antoine Ezzat Method for Constructing Pronunciation Dictionaries
US8494850B2 (en) * 2011-06-30 2013-07-23 Google Inc. Speech recognition using variable-length context
US9336771B2 (en) 2012-11-01 2016-05-10 Google Inc. Speech recognition using non-parametric models
US9384303B2 (en) 2013-06-10 2016-07-05 Google Inc. Evaluation of substitution contexts
US9741339B2 (en) * 2013-06-28 2017-08-22 Google Inc. Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores
JP6234134B2 (en) * 2013-09-25 2017-11-22 三菱電機株式会社 Speech synthesizer
US9858922B2 (en) 2014-06-23 2018-01-02 Google Inc. Caching speech recognition scores
US9299347B1 (en) 2014-10-22 2016-03-29 Google Inc. Speech recognition using associative mapping
CN107767858B (en) * 2017-09-08 2021-05-04 科大讯飞股份有限公司 Pronunciation dictionary generating method and device, storage medium and electronic equipment
CN109376358B (en) * 2018-10-25 2021-07-16 陈逸天 Word learning method and device based on historical spelling experience and electronic equipment
KR102605159B1 (en) * 2020-02-11 2023-11-23 주식회사 케이티 Server, method and computer program for providing voice recognition service
US20240013790A1 (en) * 2021-05-28 2024-01-11 Microsoft Technology Licensing, Llc Method and system of detecting and improving real-time mispronunciation of words

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4852173A (en) * 1987-10-29 1989-07-25 International Business Machines Corporation Design and construction of a binary-tree system for language modelling
EP0562138A1 (en) * 1992-03-25 1993-09-29 International Business Machines Corporation Method and apparatus for the automatic generation of Markov models of new words to be added to a speech recognition vocabulary
KR100355393B1 (en) * 1995-06-30 2002-12-26 삼성전자 주식회사 Phoneme length deciding method in voice synthesis and method of learning phoneme length decision tree
JP3627299B2 (en) * 1995-07-19 2005-03-09 ソニー株式会社 Speech recognition method and apparatus
US5758024A (en) * 1996-06-25 1998-05-26 Microsoft Corporation Method and system for encoding pronunciation prefix trees

Also Published As

Publication number Publication date
CN1233803A (en) 1999-11-03
DE69915162D1 (en) 2004-04-08
EP0953970B1 (en) 2004-03-03
KR19990083555A (en) 1999-11-25
KR100509797B1 (en) 2005-08-23
EP0953970A3 (en) 2000-01-19
JPH11344990A (en) 1999-12-14
EP0953970A2 (en) 1999-11-03
TW422967B (en) 2001-02-21
CN1118770C (en) 2003-08-20
JP3481497B2 (en) 2003-12-22

Similar Documents

Publication Publication Date Title
ATE261171T1 (en) APPARATUS AND METHOD FOR GENERATING AND EVALUATING MULTIPLE PRONUNCIATION VARIANTS OF A SPELLED WORD USING DECISION TREES
US6016471A (en) Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word
US6233553B1 (en) Method and system for automatically determining phonetic transcriptions associated with spelled words
ES2261355T3 (en) PROSODY TEMPLATE MATCHING FOR TEXT-TO-SPEECH SYSTEMS.
US6029132A (en) Method for letter-to-sound in text-to-speech synthesis
ES2243200T3 (en) GENERATION AND SYNTHESIS OF PROSODY TEMPLATES.
ES2233002T3 (en) SPEECH RECOGNITION SYSTEM WITH LEXICON UPDATED BY INTRODUCTION OF SPELLED WORDS.
DE69010941T2 (en) Method and device for the automatic determination of phonological rules for a system for recognizing continuous speech.
Olaszy et al. Profivox—A Hungarian text-to-speech system for telecommunications applications
EP0867858A3 (en) Pronunciation generation in speech recognition
EP0874353A3 (en) Pronunciation generation in speech recognition
Rao English spelling and pronunciation: a brief study
Matoušek et al. Building of a speech corpus optimised for unit selection TTS synthesis
Van Bezooijen et al. Evaluating text-to-speech systems: Some methodological aspects
Hansakunbuntheung et al. Thai tagged speech corpus for speech synthesis
Filipsson et al. LUKAS-a preliminary report on a new Swedish speech synthesis
Maia et al. An HMM-based Brazilian Portuguese speech synthesizer and its characteristics
Silverman On customizing prosody in speech synthesis: Names and addresses as a case in point
Pitrelli et al. Expressive speech synthesis using American English ToBI: questions and contrastive emphasis
Schaden A Database for the Analysis of Cross-Lingual Pronunciation Variants of European City Names.
Gustafson Transcribing names with foreign origin in the ONOMASTICA project
Aroonmanakun et al. Automatic Thai transcriptions of English words
Aroonmanakun et al. Generating Thai Transcriptions for English Words
Al-Saiyd et al. Unit selection model in Arabic speech synthesis
Nath et al. A Grapheme to Phoneme Based Text to Speech Conversion Technique in Unicode Language

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties