WO2003071520A1 - Synthese de la parole commandee par des parametres - Google Patents

Synthese de la parole commandee par des parametres Download PDF

Info

Publication number
WO2003071520A1
WO2003071520A1 PCT/DE2003/000049 DE0300049W WO03071520A1 WO 2003071520 A1 WO2003071520 A1 WO 2003071520A1 DE 0300049 W DE0300049 W DE 0300049W WO 03071520 A1 WO03071520 A1 WO 03071520A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
message
text file
implemented
text
Prior art date
Application number
PCT/DE2003/000049
Other languages
German (de)
English (en)
Inventor
Marian Trinkel
Uwe Nettelroth
Original Assignee
Deutsche Telekom Ag
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Deutsche Telekom Ag filed Critical Deutsche Telekom Ag
Publication of WO2003071520A1 publication Critical patent/WO2003071520A1/fr

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser

Definitions

  • the present invention relates to a method for automatically converting a message created by a sender as a text file into a voice message which can be output by a recipient via a voice output device, in particular a loudspeaker, a conversion program implemented on a computer having a voice generator (“voice”) for generating the
  • voice voice generator
  • the invention also relates to a system for implementing the method.
  • SMS Short Message System
  • a terminal for example a cell phone or a computer
  • This message is then integrated into the network by one Computer converted to a voice message (Text to Speach) using a voice, so the recipient no longer has to read the SMS message as is known, but is addressed directly and personally by the synthetic voice with the corresponding content of the message , who have names like "Dagmar” or "Detlef", who present the message to the addressee.
  • a disadvantage of the methods used hitherto is that the conversion uses only the voice which is usually only available and the voice message has only the characteristic coloration assigned to the voice.
  • the available synthetic voices simulate the human voice quite well in terms of emphasis, but they lack it
  • the object of the invention is now to provide a method which can be implemented with simple and inexpensive means and which is individual Variation of the expression also possible within a message. It is also an object of the invention to provide a system for implementing the method.
  • the essential basic idea of the invention is to give the sender of a text message the possibility of influencing the conversion of the message with regard to desired nuances in the emphasis when presenting the message content by identifying the text file.
  • one or more control commands are assigned to the text file, which are recognized as such by the computer and then associated with the sender's wish to give his voice message a special characteristic.
  • the sender assigns the text file to at least one control command which is recognized by the conversion program, the program modifying the characteristic of the voice speaking the voice message, in particular with regard to its tone color and / or its melody, in accordance with the control command.
  • the assignment can be done by prefixing, appending or inserting the control command into the text file, which usually has a header and subsequent data.
  • the control command can in particular be a specific component of the text file, in particular a sentence, a text sequence, a word or a
  • the obvious advantage is that the meaning of the content can be modified via a changing characteristic and that the message gets a certain undertone. It is thus possible to utter a sad message in a correspondingly quiet and overcast manner, or to give the voice a sarcastic undertone in the case of "good" news.
  • the pronunciation and, in particular, the gender of the voice can be adapted to the circumstances.
  • the advantage for example, that the medium of SMS, which was formerly attractive for young people, is given a further appeal by the flexibility.
  • the sender can use the invention to convey exactly what he actually wants to express. According to the invention, a synthetic reading voice is given another human touch.
  • the variability within a message can be achieved either by using different available voices, the choice between the individual voices being made on the basis of the control commands.
  • a control character “$” can mean that the female voice “Dagmar” is used, while “ ⁇ $” means that the text should be read by "Detlef".
  • a variation can, however, also be achieved by varying the characteristics of the only available "neutral” voice by changing the accessible setting parameters, such as timbre, pitch, emphasis, voice stretching or volume.
  • the character “$” a feminine and the character "c?” a male touch of the "neutral” voice.
  • the control commands are advantageously implemented at those points in the text file where a change in the characteristic is desired. In this way, multiple voices can be used within one message, which can lead to an attractive and unique way of expression.
  • an advantageous area of application of the invention is the short message system (SMS).
  • SMS short message system
  • the voice message is then sent as text via the SMS and, after conversion, is output via the loudspeaker of a telephone or a computer.
  • a similar field of application is offered by e-mails that are sent over the Internet and are output after the conversion via the loudspeaker of a telephone or a computer.
  • the new service brings a new game excitement and increased pleasure for the users.
  • the invention provides a new feature for natural communication between man and machine. So each sender can get his own sound design.
  • a linguistic model can be implemented in an advanced form of configuration using implemented control commands and thus help the voice to a higher degree of naturalness.
  • a control command according to the invention can be assigned to each syllable or each letter.
  • the invention is advantageously implemented with a system that has a computer implemented in a communication network on which a program for speech synthesis is implemented.
  • This so-called “voice” converts a message as a text file into a spoken text and sends the message over a voice line to a terminal also implemented in the network.
  • the spoken text is output via a loudspeaker of the terminal.
  • a module is implemented in the program, that recognizes a control command implemented in the text file, the module recognizing the characteristics of the voice speaking the voice message, modified in particular with regard to their timbre or melody, in accordance with the control command.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)

Abstract

L'invention concerne un procédé permettant de convertir automatiquement un message, créé par un expéditeur sous forme de fichier texte, en un message vocal pouvant être diffusé à l'attention d'un destinataire par l'intermédiaire d'un dispositif de sortie vocale, en particulier un haut-parleur, un programme de conversion installé sur un ordinateur commandant un générateur de parole ("voix") pour produire le message vocal d'après le fichier texte. L'invention se caractérise en ce que l'expéditeur associe au fichier texte au moins une instruction de commande qui est reconnue par le programme de conversion, ledit programme modifiant d'après cette instruction de commande les caractéristiques de la voix formulant le message vocal, en particulier du point de vue de son timbre et/ou de sa mélodie.
PCT/DE2003/000049 2002-02-19 2003-01-10 Synthese de la parole commandee par des parametres WO2003071520A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE2002107875 DE10207875A1 (de) 2002-02-19 2002-02-19 Parametergesteuerte Sprachsynthese
DE10207875.0 2002-02-19

Publications (1)

Publication Number Publication Date
WO2003071520A1 true WO2003071520A1 (fr) 2003-08-28

Family

ID=27635279

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/DE2003/000049 WO2003071520A1 (fr) 2002-02-19 2003-01-10 Synthese de la parole commandee par des parametres

Country Status (2)

Country Link
DE (1) DE10207875A1 (fr)
WO (1) WO2003071520A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1528483A1 (fr) * 2003-10-30 2005-05-04 Nec Corporation Appareil et procédé pour afficher un message textuel avec des informations sur le contenu émotionel du message

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102004012208A1 (de) 2004-03-12 2005-09-29 Siemens Ag Individualisierung von Sprachausgabe durch Anpassen einer Synthesestimme an eine Zielstimme
US8249873B2 (en) 2005-08-12 2012-08-21 Avaya Inc. Tonal correction of speech

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08247779A (ja) * 1995-03-09 1996-09-27 Honda Motor Co Ltd 音声出力装置
EP0901000A2 (fr) * 1997-07-31 1999-03-10 Toyota Jidosha Kabushiki Kaisha Système de traitement de messages et méthode pour le traitement de messages
WO2000023982A1 (fr) * 1998-10-16 2000-04-27 Volkswagen Aktiengesellschaft Procede et dispositif permettant de sortir des informations et/ou des messages par langue
EP1168297A1 (fr) * 2000-06-30 2002-01-02 Nokia Mobile Phones Ltd. Synthèse de la parole
WO2002049003A1 (fr) * 2000-12-14 2002-06-20 Siemens Aktiengesellschaft Procede et dispositif permettant de convertir du texte en paroles

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US49594A (en) * 1865-08-22 Improvement in rotary engines
GB2291571A (en) * 1994-07-19 1996-01-24 Ibm Text to speech system; acoustic processor requests linguistic processor output
US5905972A (en) * 1996-09-30 1999-05-18 Microsoft Corporation Prosodic databases holding fundamental frequency templates for use in speech synthesis
US6081780A (en) * 1998-04-28 2000-06-27 International Business Machines Corporation TTS and prosody based authoring system
DE19841683A1 (de) * 1998-09-11 2000-05-11 Hans Kull Vorrichtung und Verfahren zur digitalen Sprachbearbeitung
DE19939947C2 (de) * 1999-08-23 2002-01-24 Data Software Ag G Digitales Sprachsyntheseverfahren mit Intonationsnachbildung
DE10018134A1 (de) * 2000-04-12 2001-10-18 Siemens Ag Verfahren und Vorrichtung zum Bestimmen prosodischer Markierungen

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08247779A (ja) * 1995-03-09 1996-09-27 Honda Motor Co Ltd 音声出力装置
EP0901000A2 (fr) * 1997-07-31 1999-03-10 Toyota Jidosha Kabushiki Kaisha Système de traitement de messages et méthode pour le traitement de messages
WO2000023982A1 (fr) * 1998-10-16 2000-04-27 Volkswagen Aktiengesellschaft Procede et dispositif permettant de sortir des informations et/ou des messages par langue
EP1168297A1 (fr) * 2000-06-30 2002-01-02 Nokia Mobile Phones Ltd. Synthèse de la parole
WO2002049003A1 (fr) * 2000-12-14 2002-06-20 Siemens Aktiengesellschaft Procede et dispositif permettant de convertir du texte en paroles

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
PATENT ABSTRACTS OF JAPAN vol. 1997, no. 01 31 January 1997 (1997-01-31) *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1528483A1 (fr) * 2003-10-30 2005-05-04 Nec Corporation Appareil et procédé pour afficher un message textuel avec des informations sur le contenu émotionel du message
US7570814B2 (en) 2003-10-30 2009-08-04 Nec Corporation Data processing device, data processing method, and electronic device

Also Published As

Publication number Publication date
DE10207875A1 (de) 2003-08-28

Similar Documents

Publication Publication Date Title
DE69821673T2 (de) Verfahren und Vorrichtung zum Editieren synthetischer Sprachnachrichten, sowie Speichermittel mit dem Verfahren
EP1336955B1 (fr) Procédé pour la synthèse de parole naturelle dans un système de dialogue par ordinateur
Oktapiani et al. Women’s language features found in female character’s utterances in the Devil Wears Prada movie
DE69521244T2 (de) System zur Text-Sprache-Umsetzung
Schröder Dimensional emotion representation as a basis for speech synthesis with non-extreme emotions
Campbell et al. No laughing matter.
JPH11202884A (ja) 合成音声メッセージ編集作成方法、その装置及びその方法を記録した記録媒体
Kohler Communicative Functions and Linguistic Forms in Speech Interaction: Volume 156
CN111414733B (zh) 一种数据处理方法、装置及电子设备
Leistra-Jones Hans von Bülow and the Confessionalization of Kunstreligion
WO2003071520A1 (fr) Synthese de la parole commandee par des parametres
EP0058130B1 (fr) Procédé pour la synthèse de la parole avec un vocabulaire illimité et dispositif pour la mise en oeuvre dudit procédé
DE69910412T2 (de) Sprachgesteuerte navigation für einen elektronischen post leser
EP1110203A1 (fr) Procede et dispositif de traitement numerique de la voix
Häusl ‘So I prayed to the God of heaven’(Neh 2: 4): Praying and Prayers in the Books of Ezra and Nehemiah
JP3578961B2 (ja) 音声合成方法及び装置
Landy In Defense of Jakobson
Shenishen RESHAPING THE WORLD: THE BREATHING PICTURES IN THE POETRY OF CUMMINGS AND MAYAKOVSKY
Hoegaerts Fairness and fluency: the political audibility of ‘newcomers’ in Victorian debating clubs and public meetings, 1870–1910
Jokinen et al. DUMAS-Adaptation and Robust Information Processing for Mobile Speech Interfaces
AU2021105875A4 (en) Nethra Jyothi
Wendland Exploring the Continuum of Modern Bible Translating: A Comparative Overview of Motives, Methods, Media, and Models.”
DE10048069A1 (de) Elektronische Textübertragungsvorrichtung
Wersényi Evaluation of auditory representations for selected applications of a graphical user interface
Moe Breathing, Parsing, Praying

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): JP US

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT SE SI SK TR

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP