EP1668628A4 - Method for synthesizing speech - Google Patents
Method for synthesizing speechInfo
- Publication number
- EP1668628A4 EP1668628A4 EP04784355A EP04784355A EP1668628A4 EP 1668628 A4 EP1668628 A4 EP 1668628A4 EP 04784355 A EP04784355 A EP 04784355A EP 04784355 A EP04784355 A EP 04784355A EP 1668628 A4 EP1668628 A4 EP 1668628A4
- Authority
- EP
- European Patent Office
- Prior art keywords
- synthesizing speech
- speech
- synthesizing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000002194 synthesizing effect Effects 0.000 title 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
- G10L13/07—Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB031326986A CN1260704C (en) | 2003-09-29 | 2003-09-29 | Method for voice synthesizing |
PCT/US2004/030467 WO2005034082A1 (en) | 2003-09-29 | 2004-09-17 | Method for synthesizing speech |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1668628A1 EP1668628A1 (en) | 2006-06-14 |
EP1668628A4 true EP1668628A4 (en) | 2007-01-10 |
Family
ID=34398359
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP04784355A Withdrawn EP1668628A4 (en) | 2003-09-29 | 2004-09-17 | Method for synthesizing speech |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1668628A4 (en) |
KR (1) | KR100769033B1 (en) |
CN (1) | CN1260704C (en) |
MX (1) | MXPA06003431A (en) |
WO (1) | WO2005034082A1 (en) |
Families Citing this family (60)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
TWI421857B (en) * | 2009-12-29 | 2014-01-01 | Ind Tech Res Inst | Apparatus and method for generating a threshold for utterance verification and speech recognition system and utterance verification system |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
KR20140008870A (en) * | 2012-07-12 | 2014-01-22 | 삼성전자주식회사 | Method for providing contents information and broadcasting receiving apparatus thereof |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
CN105989833B (en) * | 2015-02-28 | 2019-11-15 | 讯飞智元信息科技有限公司 | Multilingual mixed this making character fonts of Chinese language method and system |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
CN106157948B (en) * | 2015-04-22 | 2019-10-18 | 科大讯飞股份有限公司 | A kind of fundamental frequency modeling method and system |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
CN105096934B (en) * | 2015-06-30 | 2019-02-12 | 百度在线网络技术(北京)有限公司 | Construct method, phoneme synthesizing method, device and the equipment in phonetic feature library |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179588B1 (en) | 2016-06-09 | 2019-02-22 | Apple Inc. | Intelligent automated assistant in a home environment |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
CN106534528A (en) * | 2016-11-04 | 2017-03-22 | 广东欧珀移动通信有限公司 | Processing method and device of text information and mobile terminal |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK179549B1 (en) | 2017-05-16 | 2019-02-12 | Apple Inc. | Far-field extension for digital assistant services |
CN107481713B (en) * | 2017-07-17 | 2020-06-02 | 清华大学 | Mixed language voice synthesis method and device |
CN109948124B (en) * | 2019-03-15 | 2022-12-23 | 腾讯科技(深圳)有限公司 | Voice file segmentation method and device and computer equipment |
CN110942765B (en) * | 2019-11-11 | 2022-05-27 | 珠海格力电器股份有限公司 | Method, device, server and storage medium for constructing corpus |
CN111128116B (en) * | 2019-12-20 | 2021-07-23 | 珠海格力电器股份有限公司 | Voice processing method and device, computing equipment and storage medium |
KR20210109222A (en) | 2020-02-27 | 2021-09-06 | 주식회사 케이티 | Device, method and computer program for synthesizing voice |
US20210350788A1 (en) * | 2020-05-06 | 2021-11-11 | Samsung Electronics Co., Ltd. | Electronic device for generating speech signal corresponding to at least one text and operating method of the electronic device |
CN112530406A (en) * | 2020-11-30 | 2021-03-19 | 深圳市优必选科技股份有限公司 | Voice synthesis method, voice synthesis device and intelligent equipment |
CN113393829B (en) * | 2021-06-16 | 2023-08-29 | 哈尔滨工业大学(深圳) | Chinese speech synthesis method integrating rhythm and personal information |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5970454A (en) * | 1993-12-16 | 1999-10-19 | British Telecommunications Public Limited Company | Synthesizing speech by converting phonemes to digital waveforms |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6449622A (en) * | 1987-08-19 | 1989-02-27 | Jsp Corp | Resin foaming particle containing crosslinked polyolefin-based resin and manufacture thereof |
US5704007A (en) * | 1994-03-11 | 1997-12-30 | Apple Computer, Inc. | Utilization of multiple voice sources in a speech synthesizer |
US6134528A (en) * | 1997-06-13 | 2000-10-17 | Motorola, Inc. | Method device and article of manufacture for neural-network based generation of postlexical pronunciations from lexical pronunciations |
KR100259777B1 (en) * | 1997-10-24 | 2000-06-15 | 정선종 | Optimal synthesis unit selection method in text-to-speech system |
US7283964B1 (en) * | 1999-05-21 | 2007-10-16 | Winbond Electronics Corporation | Method and apparatus for voice controlled devices with improved phrase storage, use, conversion, transfer, and recognition |
DE60215296T2 (en) * | 2002-03-15 | 2007-04-05 | Sony France S.A. | Method and apparatus for the speech synthesis program, recording medium, method and apparatus for generating a forced information and robotic device |
JP2003295882A (en) * | 2002-04-02 | 2003-10-15 | Canon Inc | Text structure for speech synthesis, speech synthesizing method, speech synthesizer and computer program therefor |
KR100883649B1 (en) * | 2002-04-04 | 2009-02-18 | 삼성전자주식회사 | Text to speech conversion apparatus and method thereof |
GB2388286A (en) * | 2002-05-01 | 2003-11-05 | Seiko Epson Corp | Enhanced speech data for use in a text to speech system |
CN1320482C (en) * | 2003-09-29 | 2007-06-06 | 摩托罗拉公司 | Natural voice pause in identification text strings |
-
2003
- 2003-09-29 CN CNB031326986A patent/CN1260704C/en not_active Expired - Lifetime
-
2004
- 2004-09-17 WO PCT/US2004/030467 patent/WO2005034082A1/en active Application Filing
- 2004-09-17 KR KR1020067006170A patent/KR100769033B1/en active IP Right Grant
- 2004-09-17 MX MXPA06003431A patent/MXPA06003431A/en not_active Application Discontinuation
- 2004-09-17 EP EP04784355A patent/EP1668628A4/en not_active Withdrawn
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5970454A (en) * | 1993-12-16 | 1999-10-19 | British Telecommunications Public Limited Company | Synthesizing speech by converting phonemes to digital waveforms |
Non-Patent Citations (6)
Title |
---|
HELEN M MENG ET AL: "CU VOCAL: CORPUS-BASED SYLLABLE CONCATENATION FOR CHINESE SPEECH SYNTHESIS ACROSS DOMAINS AND DIALECTS", ICSLP 2002 : 7TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING. DENVER, COLORADO, SEPT. 16 - 20, 2002, vol. 4 OF 4, 16 September 2002 (2002-09-16), pages 2373 - 2376, XP007011576, ISBN: 1-876346-40-X * |
HIROKAWA T ET AL: "HIGH QUALITY SPEECH SYNTHESIS SYSTEM BASED ON WAVEFORM CONCATENATION OF PHONEME SEGMENT", IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS, COMMUNICATIONS AND COMPUTER SCIENCES, ENGINEERING SCIENCES SOCIETY, TOKYO, JP, vol. 76A, no. 11, 1 November 1993 (1993-11-01), pages 1964 - 1970, XP000420615, ISSN: 0916-8508 * |
REN-HUA WANG ET AL.: "A CORPUS-BASED CHINESE SPEECH SYNTHESIS WITH CONTEXTUAL DEPENDENT UNIT SELECTION", IEEE INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING (ICSLP), vol. 2, 16 October 2000 (2000-10-16), pages 391 - 394, XP007010255 * |
See also references of WO2005034082A1 * |
WEIBIN ZHU ET AL: "Corpus building for data-driven tts systems", SPEECH SYNTHESIS, 2002. PROCEEDINGS OF 2002 IEEE WORKSHOP ON 11-13 SEPT. 2002, PISCATAWAY, NJ, USA,IEEE, 11 September 2002 (2002-09-11), pages 199 - 202, XP010653645, ISBN: 0-7803-7395-2 * |
WOEI-LUEN PERNG ET AL: "Image Talk: a real time synthetic talking head using one single image with Chinese text-to-speech capability", COMPUTER GRAPHICS AND APPLICATIONS, 1998. PACIFIC GRAPHICS '98. SIXTH PACIFIC CONFERENCE ON SINGAPORE 26-29 OCT. 1998, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, 26 October 1998 (1998-10-26), pages 140 - 148, XP010315487, ISBN: 0-8186-8620-0 * |
Also Published As
Publication number | Publication date |
---|---|
CN1260704C (en) | 2006-06-21 |
MXPA06003431A (en) | 2006-06-20 |
KR100769033B1 (en) | 2007-10-22 |
EP1668628A1 (en) | 2006-06-14 |
KR20060066121A (en) | 2006-06-15 |
WO2005034082A1 (en) | 2005-04-14 |
CN1604182A (en) | 2005-04-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1668628A4 (en) | Method for synthesizing speech | |
GB0304799D0 (en) | Novel method | |
EP1678958A4 (en) | Vocoder selection method | |
PL1697390T3 (en) | Method for producing organoacylphosphites | |
EP1675596A4 (en) | Method | |
EP1732394A4 (en) | Method for dehydro-roasting | |
PL1641731T3 (en) | Method for producing 1-octene from crack-c4 | |
GB0301117D0 (en) | Method | |
GB0427872D0 (en) | Apparatus & method | |
AU2003269047A8 (en) | Method for making biochips | |
HK1133870A1 (en) | Method for producing 4-pentafluoride-sulfanyl-benzoylguanidines | |
EP1546186A4 (en) | Method for synthesizing peptides | |
EP1556396A4 (en) | Method for producing 2-deoxy-l-ribose | |
GB0307329D0 (en) | Method | |
AU2003257058A1 (en) | Method for making alkyhalosilanes | |
GB0304632D0 (en) | Method | |
EP1612264A4 (en) | Organ-forming method | |
EP1694837A4 (en) | Method | |
HU0300167D0 (en) | Process for alginat obtaining from alginit | |
AU2002367923A8 (en) | Method for preparing 2-alkoxyphenoxyethanamines from 2-alkoxyphenoxyethylacetamides | |
GB0303536D0 (en) | Method | |
HU0300941D0 (en) | Method for producing dialkyl-3-oxo-glutarates | |
PL359838A1 (en) | Method for producing nano-pathes | |
SI1685097T1 (en) | Method for producing 4-pentafluoride-sulfanyl-benzoylguanidines | |
GB0311456D0 (en) | Novel method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20060323 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): DE FR GB IT |
|
DAX | Request for extension of the european patent (deleted) | ||
RBV | Designated contracting states (corrected) |
Designated state(s): DE FR GB IT |
|
A4 | Supplementary search report drawn up and despatched |
Effective date: 20061208 |
|
17Q | First examination report despatched |
Effective date: 20070907 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20080118 |
|
P01 | Opt-out of the competence of the unified patent court (upc) registered |
Effective date: 20230520 |