DE69915162D1 - Vorrichtung und Verfahren zur Erzeugung und Bewertung von mehrfachen Ausprachevarianten eines buchstabierten Worts unter Verwendung von Entscheidungsbäumen - Google Patents
Vorrichtung und Verfahren zur Erzeugung und Bewertung von mehrfachen Ausprachevarianten eines buchstabierten Worts unter Verwendung von EntscheidungsbäumenInfo
- Publication number
- DE69915162D1 DE69915162D1 DE69915162T DE69915162T DE69915162D1 DE 69915162 D1 DE69915162 D1 DE 69915162D1 DE 69915162 T DE69915162 T DE 69915162T DE 69915162 T DE69915162 T DE 69915162T DE 69915162 D1 DE69915162 D1 DE 69915162D1
- Authority
- DE
- Germany
- Prior art keywords
- spelled word
- pronunciations
- generating
- decision trees
- mixed
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000003066 decision tree Methods 0.000 title abstract 3
- 238000000034 method Methods 0.000 title 1
- 230000015572 biosynthetic process Effects 0.000 abstract 1
- 238000003786 synthesis reaction Methods 0.000 abstract 1
- 230000035897 transcription Effects 0.000 abstract 1
- 238000013518 transcription Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Electrically Operated Instructional Devices (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/067,764 US6016471A (en) | 1998-04-29 | 1998-04-29 | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word |
US09/069,308 US6230131B1 (en) | 1998-04-29 | 1998-04-29 | Method for generating spelling-to-pronunciation decision tree |
US09/070,300 US6029132A (en) | 1998-04-30 | 1998-04-30 | Method for letter-to-sound in text-to-speech synthesis |
Publications (1)
Publication Number | Publication Date |
---|---|
DE69915162D1 true DE69915162D1 (de) | 2004-04-08 |
Family
ID=27371225
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69915162T Expired - Lifetime DE69915162D1 (de) | 1998-04-29 | 1999-04-29 | Vorrichtung und Verfahren zur Erzeugung und Bewertung von mehrfachen Ausprachevarianten eines buchstabierten Worts unter Verwendung von Entscheidungsbäumen |
Country Status (7)
Country | Link |
---|---|
EP (1) | EP0953970B1 (ko) |
JP (1) | JP3481497B2 (ko) |
KR (1) | KR100509797B1 (ko) |
CN (1) | CN1118770C (ko) |
AT (1) | ATE261171T1 (ko) |
DE (1) | DE69915162D1 (ko) |
TW (1) | TW422967B (ko) |
Families Citing this family (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000054254A1 (de) * | 1999-03-08 | 2000-09-14 | Siemens Aktiengesellschaft | Verfahren und anordnung zur bestimmung eines repräsentativen lautes |
AU1767600A (en) * | 1999-12-23 | 2001-07-09 | Intel Corporation | Speech recognizer with a lexical tree based n-gram language model |
US6684187B1 (en) | 2000-06-30 | 2004-01-27 | At&T Corp. | Method and system for preselection of suitable units for concatenative speech |
US6505158B1 (en) | 2000-07-05 | 2003-01-07 | At&T Corp. | Synthesis-based pre-selection of suitable units for concatenative speech |
AU2000276394A1 (en) * | 2000-09-30 | 2002-04-15 | Intel Corporation | Method and system for generating and searching an optimal maximum likelihood decision tree for hidden markov model (hmm) based speech recognition |
US6718232B2 (en) * | 2000-10-13 | 2004-04-06 | Sony Corporation | Robot device and behavior control method for robot device |
US6845358B2 (en) | 2001-01-05 | 2005-01-18 | Matsushita Electric Industrial Co., Ltd. | Prosody template matching for text-to-speech systems |
US20040078191A1 (en) * | 2002-10-22 | 2004-04-22 | Nokia Corporation | Scalable neural network-based language identification from written text |
US7146319B2 (en) * | 2003-03-31 | 2006-12-05 | Novauris Technologies Ltd. | Phonetically based speech recognition system and method |
FI118062B (fi) * | 2003-04-30 | 2007-06-15 | Nokia Corp | Pienimuistinen päätöspuu |
EP1638080B1 (en) * | 2004-08-11 | 2007-10-03 | International Business Machines Corporation | A text-to-speech system and method |
US7558389B2 (en) * | 2004-10-01 | 2009-07-07 | At&T Intellectual Property Ii, L.P. | Method and system of generating a speech signal with overlayed random frequency signal |
GB2428853A (en) | 2005-07-22 | 2007-02-07 | Novauris Technologies Ltd | Speech recognition application specific dictionary |
JP2009525492A (ja) * | 2005-08-01 | 2009-07-09 | 一秋 上川 | 英語音、および他のヨーロッパ言語音の表現方法と発音テクニックのシステム |
JP4769223B2 (ja) * | 2007-04-26 | 2011-09-07 | 旭化成株式会社 | テキスト発音記号変換辞書作成装置、認識語彙辞書作成装置、及び音声認識装置 |
CN101452701B (zh) * | 2007-12-05 | 2011-09-07 | 株式会社东芝 | 基于反模型的置信度估计方法及装置 |
KR101250897B1 (ko) * | 2009-08-14 | 2013-04-04 | 한국전자통신연구원 | 전자사전에서 음성인식을 이용한 단어 탐색 장치 및 그 방법 |
US20110238412A1 (en) * | 2010-03-26 | 2011-09-29 | Antoine Ezzat | Method for Constructing Pronunciation Dictionaries |
EP2851895A3 (en) * | 2011-06-30 | 2015-05-06 | Google, Inc. | Speech recognition using variable-length context |
US9336771B2 (en) | 2012-11-01 | 2016-05-10 | Google Inc. | Speech recognition using non-parametric models |
US9384303B2 (en) | 2013-06-10 | 2016-07-05 | Google Inc. | Evaluation of substitution contexts |
US9741339B2 (en) * | 2013-06-28 | 2017-08-22 | Google Inc. | Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores |
JP6234134B2 (ja) * | 2013-09-25 | 2017-11-22 | 三菱電機株式会社 | 音声合成装置 |
US9858922B2 (en) | 2014-06-23 | 2018-01-02 | Google Inc. | Caching speech recognition scores |
US9299347B1 (en) | 2014-10-22 | 2016-03-29 | Google Inc. | Speech recognition using associative mapping |
CN107767858B (zh) * | 2017-09-08 | 2021-05-04 | 科大讯飞股份有限公司 | 发音词典生成方法及装置、存储介质、电子设备 |
CN109376358B (zh) * | 2018-10-25 | 2021-07-16 | 陈逸天 | 一种借用历史拼读经验的单词学习方法、装置和电子设备 |
KR102605159B1 (ko) * | 2020-02-11 | 2023-11-23 | 주식회사 케이티 | 음성 인식 서비스를 제공하는 서버, 방법 및 컴퓨터 프로그램 |
WO2022246782A1 (en) * | 2021-05-28 | 2022-12-01 | Microsoft Technology Licensing, Llc | Method and system of detecting and improving real-time mispronunciation of words |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4852173A (en) * | 1987-10-29 | 1989-07-25 | International Business Machines Corporation | Design and construction of a binary-tree system for language modelling |
EP0562138A1 (en) * | 1992-03-25 | 1993-09-29 | International Business Machines Corporation | Method and apparatus for the automatic generation of Markov models of new words to be added to a speech recognition vocabulary |
KR100355393B1 (ko) * | 1995-06-30 | 2002-12-26 | 삼성전자 주식회사 | 음성합성에있어서의음소길이결정방법및음소길이결정트리의학습방법 |
JP3627299B2 (ja) * | 1995-07-19 | 2005-03-09 | ソニー株式会社 | 音声認識方法及び装置 |
US5758024A (en) * | 1996-06-25 | 1998-05-26 | Microsoft Corporation | Method and system for encoding pronunciation prefix trees |
-
1999
- 1999-04-28 JP JP12171099A patent/JP3481497B2/ja not_active Expired - Fee Related
- 1999-04-28 KR KR10-1999-0015176A patent/KR100509797B1/ko not_active IP Right Cessation
- 1999-04-28 TW TW088106840A patent/TW422967B/zh not_active IP Right Cessation
- 1999-04-29 DE DE69915162T patent/DE69915162D1/de not_active Expired - Lifetime
- 1999-04-29 EP EP99303390A patent/EP0953970B1/en not_active Expired - Lifetime
- 1999-04-29 AT AT99303390T patent/ATE261171T1/de not_active IP Right Cessation
- 1999-04-29 CN CN99106310A patent/CN1118770C/zh not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
JP3481497B2 (ja) | 2003-12-22 |
EP0953970B1 (en) | 2004-03-03 |
KR100509797B1 (ko) | 2005-08-23 |
EP0953970A2 (en) | 1999-11-03 |
JPH11344990A (ja) | 1999-12-14 |
CN1233803A (zh) | 1999-11-03 |
EP0953970A3 (en) | 2000-01-19 |
ATE261171T1 (de) | 2004-03-15 |
TW422967B (en) | 2001-02-21 |
KR19990083555A (ko) | 1999-11-25 |
CN1118770C (zh) | 2003-08-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE261171T1 (de) | Vorrichtung und verfahren zur erzeugung und bewertung von mehrfachen ausprachevarianten eines buchstabierten worts unter verwendung von entscheidungsbäumen | |
US6016471A (en) | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word | |
US6233553B1 (en) | Method and system for automatically determining phonetic transcriptions associated with spelled words | |
ES2261355T3 (es) | Correspondencia de plantillas prosodicas para sistemas de conversion de texto en habla. | |
US6029132A (en) | Method for letter-to-sound in text-to-speech synthesis | |
ES2243200T3 (es) | Generacion y sintesis de plantillas de prosodia. | |
ES2233002T3 (es) | Sistema de reconocimiento de habla con lexico actualizable mediante introduccion de palabras deletreadas. | |
DE69010941D1 (de) | Verfahren und Einrichtung zur automatischen Bestimmung von phonologischen Regeln für ein System zur Erkennung kontinuierlicher Sprache. | |
Olaszy et al. | Profivox—A Hungarian text-to-speech system for telecommunications applications | |
EP0867858A3 (en) | Pronunciation generation in speech recognition | |
EP0874353A3 (en) | Pronunciation generation in speech recognition | |
Rao | English spelling and pronunciation: a brief study | |
Matoušek et al. | Building of a speech corpus optimised for unit selection TTS synthesis | |
Van Bezooijen et al. | Evaluating text-to-speech systems: Some methodological aspects | |
Hansakunbuntheung et al. | Thai tagged speech corpus for speech synthesis | |
Filipsson et al. | LUKAS-a preliminary report on a new Swedish speech synthesis | |
Maia et al. | An HMM-based Brazilian Portuguese speech synthesizer and its characteristics | |
Silverman | On customizing prosody in speech synthesis: Names and addresses as a case in point | |
Pitrelli et al. | Expressive speech synthesis using American English ToBI: questions and contrastive emphasis | |
Schaden | A Database for the Analysis of Cross-Lingual Pronunciation Variants of European City Names. | |
Gustafson | Transcribing names with foreign origin in the ONOMASTICA project | |
Aroonmanakun et al. | Automatic Thai transcriptions of English words | |
Aroonmanakun et al. | Generating Thai Transcriptions for English Words | |
Al-Saiyd et al. | Unit selection model in Arabic speech synthesis | |
Nath et al. | A Grapheme to Phoneme Based Text to Speech Conversion Technique in Unicode Language |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8332 | No legal effect for de |