ATE261171T1 - APPARATUS AND METHOD FOR GENERATING AND EVALUating MULTIPLE PRONUNCIATION VARIANTS OF A Spelled Word USING DECISION TREES - Google Patents
APPARATUS AND METHOD FOR GENERATING AND EVALUating MULTIPLE PRONUNCIATION VARIANTS OF A Spelled Word USING DECISION TREESInfo
- Publication number
- ATE261171T1 ATE261171T1 AT99303390T AT99303390T ATE261171T1 AT E261171 T1 ATE261171 T1 AT E261171T1 AT 99303390 T AT99303390 T AT 99303390T AT 99303390 T AT99303390 T AT 99303390T AT E261171 T1 ATE261171 T1 AT E261171T1
- Authority
- AT
- Austria
- Prior art keywords
- spelled word
- pronunciations
- generating
- decision trees
- mixed
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
Abstract
The mixed decision tree includes a network of yes-no questions about adjacent letters in a spelled word sequence and also about adjacent phonemes in the phoneme sequence corresponding to the spelled word sequence. Leaf nodes of the mixed decision tree provide information about which phonetic transcriptions are most probable. Using the mixed trees, scores are developed for each of a plurality of possible pronunciations, and these scores can be used to select the best pronunciation as well as to rank pronunciations in order of probability. The pronunciations generated by the system can be used in speech synthesis and speech recognition applications as well as lexicography applications. <IMAGE>
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/067,764 US6016471A (en) | 1998-04-29 | 1998-04-29 | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word |
US09/069,308 US6230131B1 (en) | 1998-04-29 | 1998-04-29 | Method for generating spelling-to-pronunciation decision tree |
US09/070,300 US6029132A (en) | 1998-04-30 | 1998-04-30 | Method for letter-to-sound in text-to-speech synthesis |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE261171T1 true ATE261171T1 (en) | 2004-03-15 |
Family
ID=27371225
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT99303390T ATE261171T1 (en) | 1998-04-29 | 1999-04-29 | APPARATUS AND METHOD FOR GENERATING AND EVALUating MULTIPLE PRONUNCIATION VARIANTS OF A Spelled Word USING DECISION TREES |
Country Status (7)
Country | Link |
---|---|
EP (1) | EP0953970B1 (en) |
JP (1) | JP3481497B2 (en) |
KR (1) | KR100509797B1 (en) |
CN (1) | CN1118770C (en) |
AT (1) | ATE261171T1 (en) |
DE (1) | DE69915162D1 (en) |
TW (1) | TW422967B (en) |
Families Citing this family (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE50003260D1 (en) * | 1999-03-08 | 2003-09-18 | Siemens Ag | METHOD AND ARRANGEMENT FOR DETERMINING A REPRESENTATIVE LOUD |
AU1767600A (en) * | 1999-12-23 | 2001-07-09 | Intel Corporation | Speech recognizer with a lexical tree based n-gram language model |
US6684187B1 (en) | 2000-06-30 | 2004-01-27 | At&T Corp. | Method and system for preselection of suitable units for concatenative speech |
US6505158B1 (en) | 2000-07-05 | 2003-01-07 | At&T Corp. | Synthesis-based pre-selection of suitable units for concatenative speech |
AU2000276394A1 (en) * | 2000-09-30 | 2002-04-15 | Intel Corporation | Method and system for generating and searching an optimal maximum likelihood decision tree for hidden markov model (hmm) based speech recognition |
CN100423911C (en) * | 2000-10-13 | 2008-10-08 | 索尼公司 | Robot device and behavior control method for robot device |
US6845358B2 (en) * | 2001-01-05 | 2005-01-18 | Matsushita Electric Industrial Co., Ltd. | Prosody template matching for text-to-speech systems |
US20040078191A1 (en) * | 2002-10-22 | 2004-04-22 | Nokia Corporation | Scalable neural network-based language identification from written text |
US7146319B2 (en) * | 2003-03-31 | 2006-12-05 | Novauris Technologies Ltd. | Phonetically based speech recognition system and method |
FI118062B (en) * | 2003-04-30 | 2007-06-15 | Nokia Corp | Decision tree with a sparse memory |
EP1638080B1 (en) * | 2004-08-11 | 2007-10-03 | International Business Machines Corporation | A text-to-speech system and method |
US7558389B2 (en) * | 2004-10-01 | 2009-07-07 | At&T Intellectual Property Ii, L.P. | Method and system of generating a speech signal with overlayed random frequency signal |
GB2428853A (en) | 2005-07-22 | 2007-02-07 | Novauris Technologies Ltd | Speech recognition application specific dictionary |
US20090291419A1 (en) * | 2005-08-01 | 2009-11-26 | Kazuaki Uekawa | System of sound representaion and pronunciation techniques for english and other european languages |
JP4769223B2 (en) * | 2007-04-26 | 2011-09-07 | 旭化成株式会社 | Text phonetic symbol conversion dictionary creation device, recognition vocabulary dictionary creation device, and speech recognition device |
CN101452701B (en) * | 2007-12-05 | 2011-09-07 | 株式会社东芝 | Confidence degree estimation method and device based on inverse model |
KR101250897B1 (en) * | 2009-08-14 | 2013-04-04 | 한국전자통신연구원 | Apparatus for word entry searching in a portable electronic dictionary and method thereof |
US20110238412A1 (en) * | 2010-03-26 | 2011-09-29 | Antoine Ezzat | Method for Constructing Pronunciation Dictionaries |
US8494850B2 (en) * | 2011-06-30 | 2013-07-23 | Google Inc. | Speech recognition using variable-length context |
US9336771B2 (en) | 2012-11-01 | 2016-05-10 | Google Inc. | Speech recognition using non-parametric models |
US9384303B2 (en) | 2013-06-10 | 2016-07-05 | Google Inc. | Evaluation of substitution contexts |
US9741339B2 (en) * | 2013-06-28 | 2017-08-22 | Google Inc. | Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores |
JP6234134B2 (en) * | 2013-09-25 | 2017-11-22 | 三菱電機株式会社 | Speech synthesizer |
US9858922B2 (en) | 2014-06-23 | 2018-01-02 | Google Inc. | Caching speech recognition scores |
US9299347B1 (en) | 2014-10-22 | 2016-03-29 | Google Inc. | Speech recognition using associative mapping |
CN107767858B (en) * | 2017-09-08 | 2021-05-04 | 科大讯飞股份有限公司 | Pronunciation dictionary generating method and device, storage medium and electronic equipment |
CN109376358B (en) * | 2018-10-25 | 2021-07-16 | 陈逸天 | Word learning method and device based on historical spelling experience and electronic equipment |
KR102605159B1 (en) * | 2020-02-11 | 2023-11-23 | 주식회사 케이티 | Server, method and computer program for providing voice recognition service |
US20240013790A1 (en) * | 2021-05-28 | 2024-01-11 | Microsoft Technology Licensing, Llc | Method and system of detecting and improving real-time mispronunciation of words |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4852173A (en) * | 1987-10-29 | 1989-07-25 | International Business Machines Corporation | Design and construction of a binary-tree system for language modelling |
EP0562138A1 (en) * | 1992-03-25 | 1993-09-29 | International Business Machines Corporation | Method and apparatus for the automatic generation of Markov models of new words to be added to a speech recognition vocabulary |
KR100355393B1 (en) * | 1995-06-30 | 2002-12-26 | 삼성전자 주식회사 | Phoneme length deciding method in voice synthesis and method of learning phoneme length decision tree |
JP3627299B2 (en) * | 1995-07-19 | 2005-03-09 | ソニー株式会社 | Speech recognition method and apparatus |
US5758024A (en) * | 1996-06-25 | 1998-05-26 | Microsoft Corporation | Method and system for encoding pronunciation prefix trees |
-
1999
- 1999-04-28 JP JP12171099A patent/JP3481497B2/en not_active Expired - Fee Related
- 1999-04-28 KR KR10-1999-0015176A patent/KR100509797B1/en not_active IP Right Cessation
- 1999-04-28 TW TW088106840A patent/TW422967B/en not_active IP Right Cessation
- 1999-04-29 AT AT99303390T patent/ATE261171T1/en not_active IP Right Cessation
- 1999-04-29 EP EP99303390A patent/EP0953970B1/en not_active Expired - Lifetime
- 1999-04-29 CN CN99106310A patent/CN1118770C/en not_active Expired - Lifetime
- 1999-04-29 DE DE69915162T patent/DE69915162D1/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
CN1233803A (en) | 1999-11-03 |
DE69915162D1 (en) | 2004-04-08 |
EP0953970B1 (en) | 2004-03-03 |
KR19990083555A (en) | 1999-11-25 |
KR100509797B1 (en) | 2005-08-23 |
EP0953970A3 (en) | 2000-01-19 |
JPH11344990A (en) | 1999-12-14 |
EP0953970A2 (en) | 1999-11-03 |
TW422967B (en) | 2001-02-21 |
CN1118770C (en) | 2003-08-20 |
JP3481497B2 (en) | 2003-12-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE261171T1 (en) | APPARATUS AND METHOD FOR GENERATING AND EVALUating MULTIPLE PRONUNCIATION VARIANTS OF A Spelled Word USING DECISION TREES | |
US6016471A (en) | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word | |
US6233553B1 (en) | Method and system for automatically determining phonetic transcriptions associated with spelled words | |
ES2261355T3 (en) | CORRESPONDENCE OF PROSODIC TEMPLATES FOR TEXT CONVERSION SYSTEMS IN SPEECH. | |
US6029132A (en) | Method for letter-to-sound in text-to-speech synthesis | |
ES2243200T3 (en) | GENERATION AND SYNTHESIS OF PROSODY TEMPLATES. | |
ES2233002T3 (en) | SPEECH RECOGNITION SYSTEM WITH UPDATED LEXIC BY INTRODUCTION OF SPELLED WORDS. | |
DE69010941T2 (en) | Method and device for the automatic determination of phonological rules for a system for recognizing continuous speech. | |
Olaszy et al. | Profivox—A Hungarian text-to-speech system for telecommunications applications | |
EP0867858A3 (en) | Pronunciation generation in speech recognition | |
EP0874353A3 (en) | Pronunciation generation in speech recognition | |
Rao | English spelling and pronunciation: a brief study | |
Matoušek et al. | Building of a speech corpus optimised for unit selection TTS synthesis | |
Van Bezooijen et al. | Evaluating text-to-speech systems: Some methodological aspects | |
Hansakunbuntheung et al. | Thai tagged speech corpus for speech synthesis | |
Filipsson et al. | LUKAS-a preliminary report on a new Swedish speech synthesis | |
Maia et al. | An HMM-based Brazilian Portuguese speech synthesizer and its characteristics | |
Silverman | On customizing prosody in speech synthesis: Names and addresses as a case in point | |
Pitrelli et al. | Expressive speech synthesis using American English ToBI: questions and contrastive emphasis | |
Schaden | A Database for the Analysis of Cross-Lingual Pronunciation Variants of European City Names. | |
Gustafson | Transcribing names with foreign origin in the ONOMASTICA project | |
Aroonmanakun et al. | Automatic Thai transcriptions of English words | |
Aroonmanakun et al. | Generating Thai Transcriptions for English Words | |
Al-Saiyd et al. | Unit selection model in Arabic speech synthesis | |
Nath et al. | A Grapheme to Phoneme Based Text to Speech Conversion Technique in Unicode Language |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |