CN1118770C - 利用判定树生成拼写单词的发音和对其评分的方法和设备 - Google Patents
利用判定树生成拼写单词的发音和对其评分的方法和设备 Download PDFInfo
- Publication number
- CN1118770C CN1118770C CN99106310A CN99106310A CN1118770C CN 1118770 C CN1118770 C CN 1118770C CN 99106310 A CN99106310 A CN 99106310A CN 99106310 A CN99106310 A CN 99106310A CN 1118770 C CN1118770 C CN 1118770C
- Authority
- CN
- China
- Prior art keywords
- pronunciation
- phoneme
- decision tree
- sequence
- letter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000003066 decision tree Methods 0.000 title claims abstract description 88
- 238000000034 method Methods 0.000 title claims description 42
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 14
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 11
- 238000012549 training Methods 0.000 claims description 21
- 239000011159 matrix material Substances 0.000 claims description 8
- 238000009795 derivation Methods 0.000 claims description 3
- 238000000547 structure data Methods 0.000 claims description 2
- 230000035897 transcription Effects 0.000 abstract 1
- 238000013518 transcription Methods 0.000 abstract 1
- 230000008569 process Effects 0.000 description 15
- 239000000203 mixture Substances 0.000 description 14
- 238000010586 diagram Methods 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 4
- 241001269238 Data Species 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 238000013138 pruning Methods 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 239000004615 ingredient Substances 0.000 description 2
- 238000007689 inspection Methods 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 230000033764 rhythmic process Effects 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 241000208140 Acer Species 0.000 description 1
- 241000214155 Anacrusis Species 0.000 description 1
- 208000019300 CLIPPERS Diseases 0.000 description 1
- 240000004859 Gamochaeta purpurea Species 0.000 description 1
- 208000009989 Posterior Leukoencephalopathy Syndrome Diseases 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 208000021930 chronic lymphocytic inflammation with pontine perivascular enhancement responsive to steroids Diseases 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 210000001747 pupil Anatomy 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000033772 system development Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Electrically Operated Instructional Devices (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (9)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US067,764 | 1993-05-26 | ||
US069,308 | 1998-04-29 | ||
US09/067,764 US6016471A (en) | 1998-04-29 | 1998-04-29 | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word |
US067764 | 1998-04-29 | ||
US09/069,308 US6230131B1 (en) | 1998-04-29 | 1998-04-29 | Method for generating spelling-to-pronunciation decision tree |
US069308 | 1998-04-29 | ||
US070,300 | 1998-04-30 | ||
US09/070,300 US6029132A (en) | 1998-04-30 | 1998-04-30 | Method for letter-to-sound in text-to-speech synthesis |
US070300 | 1998-04-30 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1233803A CN1233803A (zh) | 1999-11-03 |
CN1118770C true CN1118770C (zh) | 2003-08-20 |
Family
ID=27371225
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN99106310A Expired - Lifetime CN1118770C (zh) | 1998-04-29 | 1999-04-29 | 利用判定树生成拼写单词的发音和对其评分的方法和设备 |
Country Status (7)
Country | Link |
---|---|
EP (1) | EP0953970B1 (ko) |
JP (1) | JP3481497B2 (ko) |
KR (1) | KR100509797B1 (ko) |
CN (1) | CN1118770C (ko) |
AT (1) | ATE261171T1 (ko) |
DE (1) | DE69915162D1 (ko) |
TW (1) | TW422967B (ko) |
Families Citing this family (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000054254A1 (de) * | 1999-03-08 | 2000-09-14 | Siemens Aktiengesellschaft | Verfahren und anordnung zur bestimmung eines repräsentativen lautes |
AU1767600A (en) * | 1999-12-23 | 2001-07-09 | Intel Corporation | Speech recognizer with a lexical tree based n-gram language model |
US6684187B1 (en) | 2000-06-30 | 2004-01-27 | At&T Corp. | Method and system for preselection of suitable units for concatenative speech |
US6505158B1 (en) | 2000-07-05 | 2003-01-07 | At&T Corp. | Synthesis-based pre-selection of suitable units for concatenative speech |
AU2000276394A1 (en) * | 2000-09-30 | 2002-04-15 | Intel Corporation | Method and system for generating and searching an optimal maximum likelihood decision tree for hidden markov model (hmm) based speech recognition |
US6718232B2 (en) * | 2000-10-13 | 2004-04-06 | Sony Corporation | Robot device and behavior control method for robot device |
US6845358B2 (en) | 2001-01-05 | 2005-01-18 | Matsushita Electric Industrial Co., Ltd. | Prosody template matching for text-to-speech systems |
US20040078191A1 (en) * | 2002-10-22 | 2004-04-22 | Nokia Corporation | Scalable neural network-based language identification from written text |
US7146319B2 (en) * | 2003-03-31 | 2006-12-05 | Novauris Technologies Ltd. | Phonetically based speech recognition system and method |
FI118062B (fi) * | 2003-04-30 | 2007-06-15 | Nokia Corp | Pienimuistinen päätöspuu |
EP1638080B1 (en) * | 2004-08-11 | 2007-10-03 | International Business Machines Corporation | A text-to-speech system and method |
US7558389B2 (en) * | 2004-10-01 | 2009-07-07 | At&T Intellectual Property Ii, L.P. | Method and system of generating a speech signal with overlayed random frequency signal |
GB2428853A (en) | 2005-07-22 | 2007-02-07 | Novauris Technologies Ltd | Speech recognition application specific dictionary |
JP2009525492A (ja) * | 2005-08-01 | 2009-07-09 | 一秋 上川 | 英語音、および他のヨーロッパ言語音の表現方法と発音テクニックのシステム |
JP4769223B2 (ja) * | 2007-04-26 | 2011-09-07 | 旭化成株式会社 | テキスト発音記号変換辞書作成装置、認識語彙辞書作成装置、及び音声認識装置 |
CN101452701B (zh) * | 2007-12-05 | 2011-09-07 | 株式会社东芝 | 基于反模型的置信度估计方法及装置 |
KR101250897B1 (ko) * | 2009-08-14 | 2013-04-04 | 한국전자통신연구원 | 전자사전에서 음성인식을 이용한 단어 탐색 장치 및 그 방법 |
US20110238412A1 (en) * | 2010-03-26 | 2011-09-29 | Antoine Ezzat | Method for Constructing Pronunciation Dictionaries |
EP2851895A3 (en) * | 2011-06-30 | 2015-05-06 | Google, Inc. | Speech recognition using variable-length context |
US9336771B2 (en) | 2012-11-01 | 2016-05-10 | Google Inc. | Speech recognition using non-parametric models |
US9384303B2 (en) | 2013-06-10 | 2016-07-05 | Google Inc. | Evaluation of substitution contexts |
US9741339B2 (en) * | 2013-06-28 | 2017-08-22 | Google Inc. | Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores |
JP6234134B2 (ja) * | 2013-09-25 | 2017-11-22 | 三菱電機株式会社 | 音声合成装置 |
US9858922B2 (en) | 2014-06-23 | 2018-01-02 | Google Inc. | Caching speech recognition scores |
US9299347B1 (en) | 2014-10-22 | 2016-03-29 | Google Inc. | Speech recognition using associative mapping |
CN107767858B (zh) * | 2017-09-08 | 2021-05-04 | 科大讯飞股份有限公司 | 发音词典生成方法及装置、存储介质、电子设备 |
CN109376358B (zh) * | 2018-10-25 | 2021-07-16 | 陈逸天 | 一种借用历史拼读经验的单词学习方法、装置和电子设备 |
KR102605159B1 (ko) * | 2020-02-11 | 2023-11-23 | 주식회사 케이티 | 음성 인식 서비스를 제공하는 서버, 방법 및 컴퓨터 프로그램 |
WO2022246782A1 (en) * | 2021-05-28 | 2022-12-01 | Microsoft Technology Licensing, Llc | Method and system of detecting and improving real-time mispronunciation of words |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4852173A (en) * | 1987-10-29 | 1989-07-25 | International Business Machines Corporation | Design and construction of a binary-tree system for language modelling |
EP0562138A1 (en) * | 1992-03-25 | 1993-09-29 | International Business Machines Corporation | Method and apparatus for the automatic generation of Markov models of new words to be added to a speech recognition vocabulary |
KR100355393B1 (ko) * | 1995-06-30 | 2002-12-26 | 삼성전자 주식회사 | 음성합성에있어서의음소길이결정방법및음소길이결정트리의학습방법 |
JP3627299B2 (ja) * | 1995-07-19 | 2005-03-09 | ソニー株式会社 | 音声認識方法及び装置 |
US5758024A (en) * | 1996-06-25 | 1998-05-26 | Microsoft Corporation | Method and system for encoding pronunciation prefix trees |
-
1999
- 1999-04-28 JP JP12171099A patent/JP3481497B2/ja not_active Expired - Fee Related
- 1999-04-28 KR KR10-1999-0015176A patent/KR100509797B1/ko not_active IP Right Cessation
- 1999-04-28 TW TW088106840A patent/TW422967B/zh not_active IP Right Cessation
- 1999-04-29 DE DE69915162T patent/DE69915162D1/de not_active Expired - Lifetime
- 1999-04-29 EP EP99303390A patent/EP0953970B1/en not_active Expired - Lifetime
- 1999-04-29 AT AT99303390T patent/ATE261171T1/de not_active IP Right Cessation
- 1999-04-29 CN CN99106310A patent/CN1118770C/zh not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
JP3481497B2 (ja) | 2003-12-22 |
EP0953970B1 (en) | 2004-03-03 |
KR100509797B1 (ko) | 2005-08-23 |
EP0953970A2 (en) | 1999-11-03 |
JPH11344990A (ja) | 1999-12-14 |
CN1233803A (zh) | 1999-11-03 |
EP0953970A3 (en) | 2000-01-19 |
ATE261171T1 (de) | 2004-03-15 |
TW422967B (en) | 2001-02-21 |
KR19990083555A (ko) | 1999-11-25 |
DE69915162D1 (de) | 2004-04-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1118770C (zh) | 利用判定树生成拼写单词的发音和对其评分的方法和设备 | |
Galves et al. | Context tree selection and linguistic rhythm retrieval from written texts | |
US6029132A (en) | Method for letter-to-sound in text-to-speech synthesis | |
US6363342B2 (en) | System for developing word-pronunciation pairs | |
Kondrak | Algorithms for language reconstruction | |
CN100492350C (zh) | 以无模式输入将一种文本形式转换成另一种文本形式的语言输入体系结构 | |
Littell et al. | Indigenous language technologies in Canada: Assessment, challenges, and successes | |
US6233553B1 (en) | Method and system for automatically determining phonetic transcriptions associated with spelled words | |
WO2000038083A1 (en) | Method and apparatus for performing full bi-directional translation between a source language and a linked alternative language | |
CN101551947A (zh) | 辅助口语语言学习的计算机系统 | |
CN101650942A (zh) | 基于韵律短语的韵律结构生成方法 | |
Jaya | Sentence patterns of narrative text English textbook in Indonesia | |
Grabe et al. | The IViE Corpus | |
Breadmore et al. | Literacy development: Evidence review | |
de Silva et al. | Singlish to sinhala transliteration using rule-based approach | |
Zupan et al. | How to tag non-standard language: Normalisation versus domain adaptation for slovene historical and user-generated texts | |
Beaufort et al. | Automation of dictation exercises. A working combination of CALL and NLP. | |
Popescu-Belis et al. | GPoeT: a language model trained for rhyme generation on synthetic data | |
Asahiah | Development of a Standard Yorùbá digital text automatic diacritic restoration system | |
Keenan | Large vocabulary syntactic analysis for text recognition | |
Akinwonmi | Development of a prosodic read speech syllabic corpus of the Yoruba language | |
Abdelkader et al. | How Existing NLP Tools of Arabic Language Can Serve Hadith Processing | |
Van Nam et al. | Building a spelling checker for documents in Khmer language | |
Eeds | Holistic assessment of coding ability | |
Baumann et al. | Free Verse Prosodies: Identifying and Classifying Spoken Poetry Using Literary and Computational Perspectives (Rhythmicalizer) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: MATSUSHITA ELECTRIC (AMERICA) INTELLECTUAL PROPERT Free format text: FORMER OWNER: MATSUSHITA ELECTRIC INDUSTRIAL CO, LTD. Effective date: 20140714 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20140714 Address after: California, USA Patentee after: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA Address before: Osaka Japan Patentee before: Matsushita Electric Industrial Co.,Ltd. |
|
CX01 | Expiry of patent term |
Granted publication date: 20030820 |
|
CX01 | Expiry of patent term |