EP1248251A2 - Verfahren und System zur automatischen Umsetzung von Textnachrichten in Sprachnachrichten - Google Patents
Verfahren und System zur automatischen Umsetzung von Textnachrichten in Sprachnachrichten Download PDFInfo
- Publication number
- EP1248251A2 EP1248251A2 EP02003909A EP02003909A EP1248251A2 EP 1248251 A2 EP1248251 A2 EP 1248251A2 EP 02003909 A EP02003909 A EP 02003909A EP 02003909 A EP02003909 A EP 02003909A EP 1248251 A2 EP1248251 A2 EP 1248251A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- speech
- voice
- messages
- text
- profile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Definitions
- the present invention relates to a method as well as a system that can write any machine readable Text messages, such as emails or fax messages, via a suitable acoustic reproduction system, for example, via a cell phone, based on a acoustically outputs previously generated voice profile.
- a suitable acoustic reproduction system for example, via a cell phone, based on a acoustically outputs previously generated voice profile.
- the present invention is therefore based on the object a speech reproduction of machine-readable texts with to achieve synthetically generated voices so that a Alienation when listening to the generated voice is avoided.
- the user's voice sample data can be analyzed and created a language profile based on this analysis becomes. Based on the created language profile any text message data approximated, so good recognizable to output with the user's voice become.
- the sender is identified on the basis of voice if the text message data matches the Voices can be assigned accordingly.
- creating the language profile by comparing a written reference text with one generated by acoustic articulation of a speaker Reference text are made.
- a system for implementing Text messages in voice messages claimed.
- This has a speech analyzer based on a Analysis of voice sample data a voice profile for entered Voice sample data generated.
- This system also includes a speech generator based on the speech profile any text message in synthetic Implements voice sample data.
- the figure shows schematically a technique for automatic Conversion of text messages into voice messages.
- a method or a system is shown schematically in the figure for the automatic conversion of text messages into voice messages shown.
- One from any person spoken text 1 is replaced by a step S1 Analyzer 2 analyzed. This usually happens because that the acoustic signals are registered analogously and converted into digital voice files by an A / D converter become.
- Step S3 based on the analysis of the digital Language files creates a voice profile 3 of this person become.
- the spoken text 1 can be any Free text or a reference text 8, the one step S2 as part of the analysis with the written form of the Reference text 8 is compared.
- Based on the language profile 3 can be in the following any text message 5 via a speech generator 4 translate into synthetic voice message data 6 (step S5 and step S6).
- the text message 5 can then in a step S7 according to the created language profile 3 be output acoustically.
- a speech generator 4 for a synthetically generated language can be set so that any texts 5 with the voice of this speaker acoustically can be spent. Because of the possible Narrator with a natural and above all familiar Voice becomes strange when you hear the speech avoided. Of course, it is also conceivable that Speech generator speech samples of different people and thus multiple language profiles are available. So that's one Different speakers can be selected.
- an author sends one Recipient an email message.
- the destination address is the Author the recipient's phone number.
- the used Unified Message System determines that as a recipient no E-mail connection, but a telephone connection selected was and therefore puts the entered text in a Voice message around. A language profile is used for this, which was previously created based on a speech sample by this author has been. With this, the voice of the synthetically produced So far the natural voice of the author approximated that the recipient uses the synthetic voice as a recognizes the familiar voice of the sending person.
- the Unified Message System now initiates the construction of a Connection to the telephone line of the receiver and gives the Voice message with the author's voice.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
Claims (6)
- Verfahren zur automatischen Umsetzung von Text-Nachrichten (5) in Sprach-Nachrichten (6), mit den folgenden Schritten:dadurch gekennzeichnet, dass das Sprachprofil (3) nach Analyse (S1 von Sprachprobedaten (1) eines Benutzers auf Grundlage der vorgenommenen Analyse (S1) erstellt wird, um den Text angenähert mit der Stimme des Benutzers auszugeben.Erstellen (S3) eines Sprachprofils (3) undUmsetzen (4) von eingegebenen Text-Nachrichtendaten (5) in synthetische Sprach-Nachrichtendaten (6) auf Grundlage des Sprachprofils (3),
- Verfahren nach Anspruch 1,
dadurch gekennzeichnet, dass das Erstellen des Sprachprofils (3) auf Grundlage eines Vergleichs (S2) von Referenz-Textdaten (8) mit Referenz-Sprachprobedaten (1) erfolgt, wobei die Referenz-Sprachprobedaten (1) durch akustische Wiedergabe der Referenz-Textdaten (8) durch einen Sprecher erzeugt werden. - System zur Umsetzung von Text-Nachrichten (5) in Sprach-Nachrichten (6),mit einem Sprachanalysator (2), der auf Grundlage einer Analyse (S1) von Sprachprobedaten (1) ein Sprachprofil (3) für eingegebene Sprachprobedaten (1) erzeugt, undmit einem Sprachgenerator (4), der auf Grundlage des Sprachprofils (3) eine beliebige Text-Nachricht (5) in synthetische Sprachprobedaten (6) umsetzt.
- System nach Anspruch 3,
dadurch gekennzeichnet, dass der Sprachgenerator (4) dazu ausgelegt ist, das Sprachprofil (3) auf Grundlage eines Vergleichs eines schriftlichen Referenz-Textes (8) mit der von einem Benutzer gesprochenen Form (1) dieses Referenz-Textes (8) zu erzeugen. - System nach Anspruch 3 oder 4,
dadurch gekennzeichnet, dass in Multimediaumgebungen der Sprachanteil von Sprachnachrichten (1) automatisch analysiert wird (S1) und zur akustischen Wiedergabe (7) von Textnachrichten (5) verwendet wird. - Mobiltelephon, aufweisend ein System nach Anspruch 3, 4 oder 5,
dadurch gekennzeichnet, dass die Text-Nachrichten (5) Dokumente in einer Multimediaumgebung, beispielsweise E-Mail-Texte, sind, die auf dem Mobiltelephon in der Sprache gemäß dem zuvor erzeugten Sprachprofil (3) akustisch ausgegeben werden.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DE10117367A DE10117367B4 (de) | 2001-04-06 | 2001-04-06 | Verfahren und System zur automatischen Umsetzung von Text-Nachrichten in Sprach-Nachrichten |
| DE10117367 | 2001-04-06 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| EP1248251A2 true EP1248251A2 (de) | 2002-10-09 |
| EP1248251A3 EP1248251A3 (de) | 2009-10-07 |
Family
ID=7680748
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP02003909A Withdrawn EP1248251A3 (de) | 2001-04-06 | 2002-02-21 | Verfahren und System zur automatischen Umsetzung von Textnachrichten in Sprachnachrichten |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20020169610A1 (de) |
| EP (1) | EP1248251A3 (de) |
| DE (1) | DE10117367B4 (de) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB2383502B (en) * | 2001-11-02 | 2005-11-02 | Nec Corp | Voice synthesis system and method,and portable terminal and server therefor |
| WO2011083362A1 (en) * | 2010-01-05 | 2011-07-14 | Sony Ericsson Mobile Communications Ab | Personalized text-to-speech synthesis and personalized speech feature extraction |
Families Citing this family (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2004090746A1 (en) * | 2003-04-14 | 2004-10-21 | Koninklijke Philips Electronics N.V. | System and method for performing automatic dubbing on an audio-visual stream |
| US8005677B2 (en) * | 2003-05-09 | 2011-08-23 | Cisco Technology, Inc. | Source-dependent text-to-speech system |
| US8068588B2 (en) * | 2007-06-26 | 2011-11-29 | Microsoft Corporation | Unified rules for voice and messaging |
| US8285548B2 (en) | 2008-03-10 | 2012-10-09 | Lg Electronics Inc. | Communication device processing text message to transform it into speech |
| KR101566379B1 (ko) * | 2009-05-07 | 2015-11-13 | 삼성전자주식회사 | 입력 신호 종류 별 사용자 기능 활성화 방법 및 이를 지원하는 휴대 단말기 |
| US20130079029A1 (en) * | 2011-09-28 | 2013-03-28 | Royce A. Levien | Multi-modality communication network auto-activation |
| US9762524B2 (en) | 2011-09-28 | 2017-09-12 | Elwha Llc | Multi-modality communication participation |
| US9503550B2 (en) | 2011-09-28 | 2016-11-22 | Elwha Llc | Multi-modality communication modification |
| US9699632B2 (en) | 2011-09-28 | 2017-07-04 | Elwha Llc | Multi-modality communication with interceptive conversion |
| US9906927B2 (en) | 2011-09-28 | 2018-02-27 | Elwha Llc | Multi-modality communication initiation |
| US9002937B2 (en) | 2011-09-28 | 2015-04-07 | Elwha Llc | Multi-party multi-modality communication |
| US9477943B2 (en) | 2011-09-28 | 2016-10-25 | Elwha Llc | Multi-modality communication |
| US9788349B2 (en) * | 2011-09-28 | 2017-10-10 | Elwha Llc | Multi-modality communication auto-activation |
| US10424288B2 (en) | 2017-03-31 | 2019-09-24 | Wipro Limited | System and method for rendering textual messages using customized natural voice |
| CN111369966A (zh) * | 2018-12-06 | 2020-07-03 | 阿里巴巴集团控股有限公司 | 一种用于个性化语音合成的方法和装置 |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4707858A (en) * | 1983-05-02 | 1987-11-17 | Motorola, Inc. | Utilizing word-to-digital conversion |
| JPH05260082A (ja) * | 1992-03-13 | 1993-10-08 | Toshiba Corp | テキスト読み上げ装置 |
| US5774841A (en) * | 1995-09-20 | 1998-06-30 | The United States Of America As Represented By The Adminstrator Of The National Aeronautics And Space Administration | Real-time reconfigurable adaptive speech recognition command and control apparatus and method |
| US6035273A (en) * | 1996-06-26 | 2000-03-07 | Lucent Technologies, Inc. | Speaker-specific speech-to-text/text-to-speech communication system with hypertext-indicated speech parameter changes |
| JP3287281B2 (ja) * | 1997-07-31 | 2002-06-04 | トヨタ自動車株式会社 | メッセージ処理装置 |
| US6216104B1 (en) * | 1998-02-20 | 2001-04-10 | Philips Electronics North America Corporation | Computer-based patient record and message delivery system |
| US6081780A (en) * | 1998-04-28 | 2000-06-27 | International Business Machines Corporation | TTS and prosody based authoring system |
| DE19841683A1 (de) * | 1998-09-11 | 2000-05-11 | Hans Kull | Vorrichtung und Verfahren zur digitalen Sprachbearbeitung |
| US6243676B1 (en) * | 1998-12-23 | 2001-06-05 | Openwave Systems Inc. | Searching and retrieving multimedia information |
| US20020072900A1 (en) * | 1999-11-23 | 2002-06-13 | Keough Steven J. | System and method of templating specific human voices |
| US6801931B1 (en) * | 2000-07-20 | 2004-10-05 | Ericsson Inc. | System and method for personalizing electronic mail messages by rendering the messages in the voice of a predetermined speaker |
| US6978239B2 (en) * | 2000-12-04 | 2005-12-20 | Microsoft Corporation | Method and apparatus for speech synthesis without prosody modification |
-
2001
- 2001-04-06 DE DE10117367A patent/DE10117367B4/de not_active Expired - Fee Related
-
2002
- 2002-02-21 EP EP02003909A patent/EP1248251A3/de not_active Withdrawn
- 2002-04-05 US US10/117,291 patent/US20020169610A1/en not_active Abandoned
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB2383502B (en) * | 2001-11-02 | 2005-11-02 | Nec Corp | Voice synthesis system and method,and portable terminal and server therefor |
| US7313522B2 (en) | 2001-11-02 | 2007-12-25 | Nec Corporation | Voice synthesis system and method that performs voice synthesis of text data provided by a portable terminal |
| WO2011083362A1 (en) * | 2010-01-05 | 2011-07-14 | Sony Ericsson Mobile Communications Ab | Personalized text-to-speech synthesis and personalized speech feature extraction |
| CN102117614B (zh) * | 2010-01-05 | 2013-01-02 | 索尼爱立信移动通讯有限公司 | 个性化文本语音合成和个性化语音特征提取 |
| US8655659B2 (en) | 2010-01-05 | 2014-02-18 | Sony Corporation | Personalized text-to-speech synthesis and personalized speech feature extraction |
Also Published As
| Publication number | Publication date |
|---|---|
| EP1248251A3 (de) | 2009-10-07 |
| DE10117367B4 (de) | 2005-08-18 |
| DE10117367A1 (de) | 2002-10-17 |
| US20020169610A1 (en) | 2002-11-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP1248251A2 (de) | Verfahren und System zur automatischen Umsetzung von Textnachrichten in Sprachnachrichten | |
| DE60222093T2 (de) | Verfahren, modul, vorrichtung und server zur spracherkennung | |
| EP0644680A2 (de) | Verfahren und Vorrichtung zum Erstellen und Bearbeiten von Textdokumenten | |
| DE3416238A1 (de) | Extremschmalband-uebertragungssystem | |
| DE102004050785A1 (de) | Verfahren und Anordnung zur Bearbeitung von Nachrichten im Rahmen eines Integrated Messaging Systems | |
| DE69413912T2 (de) | Sprachumsetzungsverfahren | |
| EP1051701B1 (de) | Verfahren zum übermitteln von sprachdaten | |
| DE60020504T2 (de) | Anpassung eines spracherkenners an korrigierte texte | |
| EP1134726A1 (de) | Verfahren zur Erkennung von Sprachäusserungen nicht-muttersprachlicher Sprecher in einem Sprachverarbeitungssystem | |
| EP2047668B1 (de) | Verfahren, sprachdialogsystem und telekommunikationsendgerät zur multilingualen sprachausgabe | |
| EP1282897A1 (de) | Verfahren zum erzeugen einer sprachdatenbank für einen zielwortschatz zum trainieren eines spracherkennungssystems | |
| WO2002049003A1 (de) | Verfahren und system zum umsetzen von text in sprache | |
| DE19811879C1 (de) | Einrichtung und Verfahren zum Erkennen von Sprache | |
| EP1169841B1 (de) | Erstellen eines referenzmodell-verzeichnisses für ein sprachgesteuertes kommunikationsgerät | |
| DE69419846T2 (de) | Sende- und empfangsverfahren für kodierte sprache | |
| DE10033104C2 (de) | Verfahren zum Erzeugen einer Statistik von Phondauern und Verfahren zum Ermitteln der Dauer einzelner Phone für die Sprachsynthese | |
| EP0984427B1 (de) | Verfahren zum akustischen Ausgeben von Text | |
| DE69910412T2 (de) | Sprachgesteuerte navigation für einen elektronischen post leser | |
| DE102019135799A1 (de) | Verfahren zum Verbessern von Sprachverständlichkeit einer elektronischen Sprechverbindung und Headset zur Durchführung des Verfahrens | |
| DE102016002496A1 (de) | Verfahren und System zum Wiedergeben einer Textnachricht | |
| DE10163277C2 (de) | Verfahren zum Versenden einer Nachricht an eine Rufnummer, sowie Vorrichtung hierfür | |
| WO2004047466A2 (de) | Verfahren zur wiedergabe von gesendeten textnachrichten | |
| DE60025158T2 (de) | Verfahren zur Geschwindigkeitsmodifikation von Sprachsignalen, Verwendung des Verfahrens, und Anordnung zur Durchführung des Verfahrens | |
| DE10056762B4 (de) | Verfahren zum Erstellen elektronischer Nachrichten | |
| EP4375990A2 (de) | Verfahren zum training einer sprechererkennungseinheit eines hörgeräts sowie kombination aus einem solchen hörgerät und einem kommunikationsgerät |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
|
| AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
| PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
| AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
|
| AX | Request for extension of the european patent |
Extension state: AL LT LV MK RO SI |
|
| 17P | Request for examination filed |
Effective date: 20100329 |
|
| AKX | Designation fees paid |
Designated state(s): DE FR GB IT SE |
|
| 17Q | First examination report despatched |
Effective date: 20101201 |
|
| RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: SIEMENS ENTERPRISE COMMUNICATIONS GMBH & CO. KG |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
| 18D | Application deemed to be withdrawn |
Effective date: 20110615 |