DE69932819D1 - Intelligente text-sprache-umsetzung - Google Patents
Intelligente text-sprache-umsetzungInfo
- Publication number
- DE69932819D1 DE69932819D1 DE69932819T DE69932819T DE69932819D1 DE 69932819 D1 DE69932819 D1 DE 69932819D1 DE 69932819 T DE69932819 T DE 69932819T DE 69932819 T DE69932819 T DE 69932819T DE 69932819 D1 DE69932819 D1 DE 69932819D1
- Authority
- DE
- Germany
- Prior art keywords
- text
- input text
- speech
- semantics
- speech output
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000000694 effects Effects 0.000 abstract 3
- 238000000034 method Methods 0.000 abstract 2
- 238000009877 rendering Methods 0.000 abstract 2
- 230000009466 transformation Effects 0.000 abstract 2
- 230000001131 transforming effect Effects 0.000 abstract 2
- 230000015572 biosynthetic process Effects 0.000 abstract 1
- 238000003786 synthesis reaction Methods 0.000 abstract 1
- 230000002194 synthesizing effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Surgical Instruments (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Telephone Function (AREA)
- Mobile Radio Communication Systems (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US98669 | 1998-06-17 | ||
US09/098,669 US6446040B1 (en) | 1998-06-17 | 1998-06-17 | Intelligent text-to-speech synthesis |
PCT/US1999/013329 WO1999066496A1 (en) | 1998-06-17 | 1999-06-14 | Intelligent text-to-speech synthesis |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69932819D1 true DE69932819D1 (de) | 2006-09-28 |
DE69932819T2 DE69932819T2 (de) | 2007-08-16 |
Family
ID=22270397
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69932819T Expired - Lifetime DE69932819T2 (de) | 1998-06-17 | 1999-06-14 | Intelligente text-sprache-umsetzung |
Country Status (9)
Country | Link |
---|---|
US (1) | US6446040B1 (de) |
EP (1) | EP1086450B1 (de) |
JP (1) | JP2002518711A (de) |
KR (1) | KR100759581B1 (de) |
AT (1) | ATE336775T1 (de) |
AU (1) | AU4681699A (de) |
BR (1) | BR9911315B1 (de) |
DE (1) | DE69932819T2 (de) |
WO (1) | WO1999066496A1 (de) |
Families Citing this family (73)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE19908137A1 (de) * | 1998-10-16 | 2000-06-15 | Volkswagen Ag | Verfahren und Vorrichtung zur automatischen Steuerung mindestens eines Gerätes per Sprachdialog |
JP2001014306A (ja) * | 1999-06-30 | 2001-01-19 | Sony Corp | 電子文書処理方法及び電子文書処理装置並びに電子文書処理プログラムが記録された記録媒体 |
US6912691B1 (en) * | 1999-09-03 | 2005-06-28 | Cisco Technology, Inc. | Delivering voice portal services using an XML voice-enabled web server |
US6578000B1 (en) * | 1999-09-03 | 2003-06-10 | Cisco Technology, Inc. | Browser-based arrangement for developing voice enabled web applications using extensible markup language documents |
US7801766B2 (en) | 2000-03-31 | 2010-09-21 | You Technology Brand Services, Inc. | Method, system, and computer readable medium for facilitating a transaction between a customer, a merchant and an associate |
US6308154B1 (en) * | 2000-04-13 | 2001-10-23 | Rockwell Electronic Commerce Corp. | Method of natural language communication using a mark-up language |
US6823311B2 (en) * | 2000-06-29 | 2004-11-23 | Fujitsu Limited | Data processing system for vocalizing web content |
US6510413B1 (en) * | 2000-06-29 | 2003-01-21 | Intel Corporation | Distributed synthetic speech generation |
US6963831B1 (en) * | 2000-10-25 | 2005-11-08 | International Business Machines Corporation | Including statistical NLU models within a statistical parser |
JP2002221980A (ja) * | 2001-01-25 | 2002-08-09 | Oki Electric Ind Co Ltd | テキスト音声変換装置 |
KR20030002999A (ko) * | 2001-06-30 | 2003-01-09 | 주식회사 케이티 | 스크립트 생성기법을 이용한 음성인식 시스템 시험장치 및그 방법 |
KR100450319B1 (ko) * | 2001-12-24 | 2004-10-01 | 한국전자통신연구원 | 가상 환경에서 참여자간의 의사전달 장치 및 방법 |
JP4150198B2 (ja) * | 2002-03-15 | 2008-09-17 | ソニー株式会社 | 音声合成方法、音声合成装置、プログラム及び記録媒体、並びにロボット装置 |
US20030187658A1 (en) * | 2002-03-29 | 2003-10-02 | Jari Selin | Method for text-to-speech service utilizing a uniform resource identifier |
US7577568B2 (en) * | 2003-06-10 | 2009-08-18 | At&T Intellctual Property Ii, L.P. | Methods and system for creating voice files using a VoiceXML application |
US20040260551A1 (en) * | 2003-06-19 | 2004-12-23 | International Business Machines Corporation | System and method for configuring voice readers using semantic analysis |
US7530015B2 (en) * | 2003-06-25 | 2009-05-05 | Microsoft Corporation | XSD inference |
US8826137B2 (en) | 2003-08-14 | 2014-09-02 | Freedom Scientific, Inc. | Screen reader having concurrent communication of non-textual information |
US8886538B2 (en) * | 2003-09-26 | 2014-11-11 | Nuance Communications, Inc. | Systems and methods for text-to-speech synthesis using spoken example |
US7805307B2 (en) | 2003-09-30 | 2010-09-28 | Sharp Laboratories Of America, Inc. | Text to speech conversion system |
US8489769B2 (en) * | 2003-10-02 | 2013-07-16 | Accenture Global Services Limited | Intelligent collaborative expression in support of socialization of devices |
GB0327991D0 (en) * | 2003-12-03 | 2004-01-07 | Ibm | Interactive voice response method and apparatus |
US20050177369A1 (en) * | 2004-02-11 | 2005-08-11 | Kirill Stoimenov | Method and system for intuitive text-to-speech synthesis customization |
CA2557079A1 (en) * | 2004-03-05 | 2005-09-22 | Lessac Technologies, Inc. | Prosodic speech text codes and their use in computerized speech systems |
US7472065B2 (en) * | 2004-06-04 | 2008-12-30 | International Business Machines Corporation | Generating paralinguistic phenomena via markup in text-to-speech synthesis |
CN101044549A (zh) * | 2004-10-18 | 2007-09-26 | 皇家飞利浦电子股份有限公司 | 向用户通知媒体内容项目的类别的数据处理设备和方法 |
WO2006128480A1 (en) * | 2005-05-31 | 2006-12-07 | Telecom Italia S.P.A. | Method and system for providing speech synthsis on user terminals over a communications network |
US8977636B2 (en) | 2005-08-19 | 2015-03-10 | International Business Machines Corporation | Synthesizing aggregate data of disparate data types into data of a uniform data type |
US7958131B2 (en) | 2005-08-19 | 2011-06-07 | International Business Machines Corporation | Method for data management and data rendering for disparate data types |
JP4640046B2 (ja) | 2005-08-30 | 2011-03-02 | 株式会社日立製作所 | デジタルコンテンツ再生装置 |
US8266220B2 (en) | 2005-09-14 | 2012-09-11 | International Business Machines Corporation | Email management and rendering |
US8577682B2 (en) * | 2005-10-27 | 2013-11-05 | Nuance Communications, Inc. | System and method to use text-to-speech to prompt whether text-to-speech output should be added during installation of a program on a computer system normally controlled through a user interactive display |
US8694319B2 (en) * | 2005-11-03 | 2014-04-08 | International Business Machines Corporation | Dynamic prosody adjustment for voice-rendering synthesized data |
US8326629B2 (en) * | 2005-11-22 | 2012-12-04 | Nuance Communications, Inc. | Dynamically changing voice attributes during speech synthesis based upon parameter differentiation for dialog contexts |
US8600753B1 (en) * | 2005-12-30 | 2013-12-03 | At&T Intellectual Property Ii, L.P. | Method and apparatus for combining text to speech and recorded prompts |
US8271107B2 (en) | 2006-01-13 | 2012-09-18 | International Business Machines Corporation | Controlling audio operation for data management and data rendering |
US8209180B2 (en) * | 2006-02-08 | 2012-06-26 | Nec Corporation | Speech synthesizing device, speech synthesizing method, and program |
US9135339B2 (en) | 2006-02-13 | 2015-09-15 | International Business Machines Corporation | Invoking an audio hyperlink |
US9087507B2 (en) * | 2006-09-15 | 2015-07-21 | Yahoo! Inc. | Aural skimming and scrolling |
GB2443027B (en) * | 2006-10-19 | 2009-04-01 | Sony Comp Entertainment Europe | Apparatus and method of audio processing |
DE102006056286B4 (de) * | 2006-11-29 | 2014-09-11 | Audi Ag | Verfahren zur Wiedergabe von Textinformationen durch Sprache in einem Fahrzeug |
US8438032B2 (en) * | 2007-01-09 | 2013-05-07 | Nuance Communications, Inc. | System for tuning synthesized speech |
WO2008102413A1 (ja) * | 2007-02-22 | 2008-08-28 | Fujitsu Limited | 音楽再生装置および音楽再生方法 |
US8725513B2 (en) * | 2007-04-12 | 2014-05-13 | Nuance Communications, Inc. | Providing expressive user interaction with a multimodal application |
US20090083035A1 (en) * | 2007-09-25 | 2009-03-26 | Ritchie Winson Huang | Text pre-processing for text-to-speech generation |
US20090157407A1 (en) * | 2007-12-12 | 2009-06-18 | Nokia Corporation | Methods, Apparatuses, and Computer Program Products for Semantic Media Conversion From Source Files to Audio/Video Files |
KR20090085376A (ko) * | 2008-02-04 | 2009-08-07 | 삼성전자주식회사 | 문자 메시지의 음성 합성을 이용한 서비스 방법 및 장치 |
JP2009265279A (ja) | 2008-04-23 | 2009-11-12 | Sony Ericsson Mobilecommunications Japan Inc | 音声合成装置、音声合成方法、音声合成プログラム、携帯情報端末、および音声合成システム |
US8265936B2 (en) * | 2008-06-03 | 2012-09-11 | International Business Machines Corporation | Methods and system for creating and editing an XML-based speech synthesis document |
CN101605307A (zh) * | 2008-06-12 | 2009-12-16 | 深圳富泰宏精密工业有限公司 | 文本短信语音播放系统及方法 |
US8165881B2 (en) * | 2008-08-29 | 2012-04-24 | Honda Motor Co., Ltd. | System and method for variable text-to-speech with minimized distraction to operator of an automotive vehicle |
US20100057465A1 (en) * | 2008-09-03 | 2010-03-04 | David Michael Kirsch | Variable text-to-speech for automotive application |
US8219899B2 (en) * | 2008-09-22 | 2012-07-10 | International Business Machines Corporation | Verbal description method and system |
US8990087B1 (en) * | 2008-09-30 | 2015-03-24 | Amazon Technologies, Inc. | Providing text to speech from digital content on an electronic device |
TWI405184B (zh) * | 2009-11-19 | 2013-08-11 | Univ Nat Cheng Kung | 嵌入式作業系統平台之隨讀隨聽電子書手持裝置 |
US8447610B2 (en) | 2010-02-12 | 2013-05-21 | Nuance Communications, Inc. | Method and apparatus for generating synthetic speech with contrastive stress |
US8949128B2 (en) * | 2010-02-12 | 2015-02-03 | Nuance Communications, Inc. | Method and apparatus for providing speech output for speech-enabled applications |
US8571870B2 (en) * | 2010-02-12 | 2013-10-29 | Nuance Communications, Inc. | Method and apparatus for generating synthetic speech with contrastive stress |
US9032042B2 (en) | 2011-06-27 | 2015-05-12 | Microsoft Technology Licensing, Llc | Audio presentation of condensed spatial contextual information |
US8958569B2 (en) | 2011-12-17 | 2015-02-17 | Microsoft Technology Licensing, Llc | Selective spatial audio communication |
TWI574254B (zh) * | 2012-01-20 | 2017-03-11 | 華碩電腦股份有限公司 | 用於電子系統的語音合成方法及裝置 |
US8862985B2 (en) | 2012-06-08 | 2014-10-14 | Freedom Scientific, Inc. | Screen reader with customizable web page output |
US9575960B1 (en) * | 2012-09-17 | 2017-02-21 | Amazon Technologies, Inc. | Auditory enhancement using word analysis |
US8856007B1 (en) | 2012-10-09 | 2014-10-07 | Google Inc. | Use text to speech techniques to improve understanding when announcing search results |
US10540957B2 (en) * | 2014-12-15 | 2020-01-21 | Baidu Usa Llc | Systems and methods for speech transcription |
US10176798B2 (en) | 2015-08-28 | 2019-01-08 | Intel Corporation | Facilitating dynamic and intelligent conversion of text into real user speech |
RU2632424C2 (ru) | 2015-09-29 | 2017-10-04 | Общество С Ограниченной Ответственностью "Яндекс" | Способ и сервер для синтеза речи по тексту |
CN105632484B (zh) * | 2016-02-19 | 2019-04-09 | 云知声(上海)智能科技有限公司 | 语音合成数据库停顿信息自动标注方法及系统 |
GB201810621D0 (en) * | 2018-06-28 | 2018-08-15 | Univ London Queen Mary | Generation of audio data |
CN112334973B (zh) * | 2018-07-19 | 2024-04-26 | 杜比国际公司 | 用于创建基于对象的音频内容的方法和系统 |
US11195511B2 (en) * | 2018-07-19 | 2021-12-07 | Dolby Laboratories Licensing Corporation | Method and system for creating object-based audio content |
CN111429877B (zh) * | 2020-03-03 | 2023-04-07 | 云知声智能科技股份有限公司 | 歌曲处理方法及装置 |
US12008289B2 (en) | 2021-07-07 | 2024-06-11 | Honeywell International Inc. | Methods and systems for transcription playback with variable emphasis |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5029214A (en) | 1986-08-11 | 1991-07-02 | Hollander James F | Electronic speech control apparatus and methods |
JPH0529214A (ja) * | 1991-07-18 | 1993-02-05 | Sharp Corp | 半導体基板の製造方法 |
EP0542628B1 (de) * | 1991-11-12 | 2001-10-10 | Fujitsu Limited | Vorrichtung zur Sprachsynthese |
CA2119397C (en) * | 1993-03-19 | 2007-10-02 | Kim E.A. Silverman | Improved automated voice synthesis employing enhanced prosodic treatment of text, spelling of text and rate of annunciation |
US5561736A (en) * | 1993-06-04 | 1996-10-01 | International Business Machines Corporation | Three dimensional speech synthesis |
US5572625A (en) * | 1993-10-22 | 1996-11-05 | Cornell Research Foundation, Inc. | Method for generating audio renderings of digitized works having highly technical content |
JP2770747B2 (ja) | 1994-08-18 | 1998-07-02 | 日本電気株式会社 | 音声合成装置 |
US5634084A (en) | 1995-01-20 | 1997-05-27 | Centigram Communications Corporation | Abbreviation and acronym/initialism expansion procedures for a text to speech reader |
US5761640A (en) | 1995-12-18 | 1998-06-02 | Nynex Science & Technology, Inc. | Name and address processor |
US5850629A (en) * | 1996-09-09 | 1998-12-15 | Matsushita Electric Industrial Co., Ltd. | User interface controller for text-to-speech synthesizer |
EP0841625A1 (de) | 1996-11-08 | 1998-05-13 | Softmark Limited | Eingabe- und Ausgabekommunikation in einem Datenverarbeitungssystem |
US5860604A (en) * | 1996-11-19 | 1999-01-19 | Doug Slenk | Motorized fertilizer spreader |
US6226614B1 (en) * | 1997-05-21 | 2001-05-01 | Nippon Telegraph And Telephone Corporation | Method and apparatus for editing/creating synthetic speech message and recording medium with the method recorded thereon |
US6081780A (en) * | 1998-04-28 | 2000-06-27 | International Business Machines Corporation | TTS and prosody based authoring system |
US6246672B1 (en) * | 1998-04-28 | 2001-06-12 | International Business Machines Corp. | Singlecast interactive radio system |
-
1998
- 1998-06-17 US US09/098,669 patent/US6446040B1/en not_active Expired - Lifetime
-
1999
- 1999-06-14 AU AU46816/99A patent/AU4681699A/en not_active Abandoned
- 1999-06-14 AT AT99930238T patent/ATE336775T1/de not_active IP Right Cessation
- 1999-06-14 DE DE69932819T patent/DE69932819T2/de not_active Expired - Lifetime
- 1999-06-14 WO PCT/US1999/013329 patent/WO1999066496A1/en active IP Right Grant
- 1999-06-14 JP JP2000555243A patent/JP2002518711A/ja active Pending
- 1999-06-14 EP EP99930238A patent/EP1086450B1/de not_active Expired - Lifetime
- 1999-06-14 KR KR1020007014411A patent/KR100759581B1/ko not_active IP Right Cessation
- 1999-06-14 BR BRPI9911315-5A patent/BR9911315B1/pt not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
JP2002518711A (ja) | 2002-06-25 |
DE69932819T2 (de) | 2007-08-16 |
AU4681699A (en) | 2000-01-05 |
WO1999066496A8 (en) | 2006-11-02 |
EP1086450B1 (de) | 2006-08-16 |
BR9911315B1 (pt) | 2012-12-25 |
ATE336775T1 (de) | 2006-09-15 |
US6446040B1 (en) | 2002-09-03 |
BR9911315A (pt) | 2002-01-15 |
KR100759581B1 (ko) | 2007-09-18 |
KR20010071517A (ko) | 2001-07-28 |
WO1999066496A1 (en) | 1999-12-23 |
EP1086450A1 (de) | 2001-03-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69932819D1 (de) | Intelligente text-sprache-umsetzung | |
US7716052B2 (en) | Method, apparatus and computer program providing a multi-speaker database for concatenative text-to-speech synthesis | |
Olive et al. | Acoustics of American English speech: A dynamic approach | |
AU1362199A (en) | System and method for auditorially representing pages of sgml data | |
TW347619B (en) | A communication system and method using a speaker dependent time-scaling technique a method for time-scale modification of speech using a modified version of the Waveform Similarity based Overlap-Add technique (WSOLA). | |
WO2003065349A3 (en) | Text to speech | |
O'Malley | Text-to-speech conversion technology | |
TR200102364T2 (tr) | Otomatikleştirilmiş transkripsiyon sistemi ve iki konuşma dönüştürme seferini ve bilgisayar-yardımlı düzeltme kullanan yöntem. | |
DE69427083D1 (de) | Spracherkennungssystem für mehrere sprachen | |
GB2610709A (en) | Synthetic speech processing | |
ATE363120T1 (de) | Audio-dialogsystem und sprachgesteuertes browsing-verfahren | |
CN112185341A (zh) | 基于语音合成的配音方法、装置、设备和存储介质 | |
JP3518898B2 (ja) | 音声合成装置 | |
KR20050080671A (ko) | 티티에스 시스템의 이모티콘 처리 방법 | |
CN117597728A (zh) | 使用未完全训练的文本到语音模型的个性化和动态的文本到语音声音克隆 | |
CN112530399A (zh) | 一种语音数据的扩充方法、系统、电子设备及存储介质 | |
JP3282151B2 (ja) | 音声制御方式 | |
TW283774B (en) | Intelligently vocal chinese input method and chinese dictation machine | |
Makino et al. | Separation of speech signal-to realize multiple talker speech recognition | |
JPH08190397A (ja) | 音声出力装置 | |
Novitasari et al. | Improving Intelligibility of Synthesized Speech in Noisy Condition with Dynamically Adaptive Machine Speech Chain | |
Komal Singh et al. | Speech synthesis. | |
KR890010668A (ko) | 음성 응답 및 안내장치의 문장 구현방식 | |
JPH03127259A (ja) | パソコン・ワープロの文書を音声化 | |
JPS59180728A (ja) | 音声出力編集方式 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |