EP1344211B1 - Device and method for differentiated speech output - Google Patents
Device and method for differentiated speech output (Vorrichtung und Verfahren zur differenzierten Sprachausgabe)
- Publication number
- EP1344211B1 (application number EP01991746A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- speech
- output
- parameters
- speech output
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
Definitions
- The present invention relates to a device and a method for differentiated speech output, to systems for use with the speech output device, and to combinations of a speech output device with at least two systems, particularly for use in a vehicle.
- EP-A-0 901 000 describes a device for processing messages with receiving means for receiving sent messages, a memory for storing a plurality of different articulations (tones of voice), and assignment means for assigning one of these articulations to at least one received message. A different articulation is assigned to another received message, and output means output the first message with the first articulation and the second message with the second articulation.
- The object of the invention is in particular to provide a central speech output device for a plurality of systems, in which a single speech generator with a small parameter memory is driven by the systems.
- The invention has the advantage that speech output for different systems is possible with a single speech output device or speech synthesis device, and each system can be identified by audible differences in voice characteristics.
- A parameter set is assigned to each system and is used by the speech synthesis device for speech output from that system.
- For example, a first parameter set is provided for an on-board computer, a second for a navigation system, a third for traffic information, a fourth for a TTS (text-to-speech) system such as e-mail, and one or more further parameter sets for additional systems.
- The speech synthesizer generates the speech output with, for example, a soft female voice, e.g. for the voice output of a navigation system, or a hard male bass voice, e.g. for the voice output of traffic reports.
- A method and device for full speech synthesis is used, preferably a formant synthesizer.
- The control parameters for the synthesizer are divided into classes.
- A first class of dynamic parameters controls articulation, i.e. the movement of the vocal tract during speaking.
- A second class of static parameters controls speaker-characteristic features, such as the generator fundamental frequency and fixed formants, which for a child, a woman, or a man are determined by the different geometric dimensions of the vocal tract.
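The dependence of formants on vocal tract geometry can be illustrated with the textbook uniform-tube approximation, in which a tube of length L closed at one end resonates at F_n = (2n - 1) * c / (4 * L). This is a minimal sketch and not the patent's method: the speed of sound, the tract lengths, and the function name `tube_formants` are assumptions chosen for illustration.

```python
SPEED_OF_SOUND_M_S = 350.0  # assumed speed of sound in warm, humid air


def tube_formants(tract_length_m: float, count: int = 3) -> list[float]:
    """Resonances of a uniform tube closed at one end: F_n = (2n - 1) * c / (4 * L)."""
    return [
        (2 * n - 1) * SPEED_OF_SOUND_M_S / (4.0 * tract_length_m)
        for n in range(1, count + 1)
    ]


# Illustrative vocal tract lengths in metres (assumptions, not values from the patent).
TRACT_LENGTHS_M = {"child": 0.125, "woman": 0.15, "man": 0.175}

for speaker, length in TRACT_LENGTHS_M.items():
    print(speaker, [round(f) for f in tube_formants(length)])
```

A shorter tract shifts every resonance upward, which is one reason a static parameter set with higher fixed formants and a higher fundamental frequency is heard as a child's or woman's voice rather than a man's.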
- The device or method according to the invention can be used in particular in the systems of a vehicle.
- Each system has two ways to control the speech output.
- The first is to send control commands for articulation via a first output; the sequences of control parameters for words, sentences and sentence sequences are stored in the system.
- The second is a second output that switches the parameter set, which determines the speaker characteristic.
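The two control paths can be sketched as a small interface: one call carries the selection signal that switches the static parameter set, the other carries the dynamic articulation commands. All class names, field names, and numeric values here are hypothetical; the patent does not specify a data format, and the synthesizer itself is replaced by a log.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass(frozen=True)
class StaticParameterSet:
    """Speaker characteristics held in the speech output unit's memory."""
    name: str
    fundamental_hz: float
    formants_hz: tuple


@dataclass
class SpeechOutputUnit:
    """One shared synthesizer front end with one parameter set per system."""
    parameter_sets: dict
    active: Optional[StaticParameterSet] = None
    log: list = field(default_factory=list)

    def select(self, system_id: str) -> None:
        # Second output of a system: selection signal that switches the parameter set.
        self.active = self.parameter_sets[system_id]

    def articulate(self, control_sequence: list) -> None:
        # First output of a system: dynamic articulation commands stored in that system.
        if self.active is None:
            raise RuntimeError("no parameter set selected")
        for command in control_sequence:
            # Stand-in for driving the synthesizer with the active voice.
            self.log.append(f"{self.active.name}:{command}")


unit = SpeechOutputUnit({
    "navigation": StaticParameterSet("soft_female", 210.0, (580.0, 1750.0, 2900.0)),
    "traffic": StaticParameterSet("hard_male_bass", 95.0, (500.0, 1500.0, 2500.0)),
})
unit.select("navigation")
unit.articulate(["turn", "left"])
unit.select("traffic")
unit.articulate(["congestion", "ahead"])
print(unit.log)
```

Keeping the static sets in the output unit and only the articulation sequences in each system mirrors the split described above: the shared memory stays small, and a system changes voice by sending a selection signal rather than a full parameter set.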
- The generator and formant parameters can additionally be changed dynamically.
- This allows audible differences in prosody to be achieved, such as the duration and/or stress of syllable segments and/or the sentence melody.
- Prosodic modulation that depends, for example, on the traffic situation can be used for the speech output of announcements.
- The urgency of information can be expressed by modulating the voice.
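A minimal sketch of urgency-dependent prosody follows. The direction and size of the changes (higher pitch, shorter syllables, stronger stress as urgency rises) and the names `Prosody` and `modulate` are assumptions made for illustration; the text above only states that prosody can be modulated.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Prosody:
    fundamental_hz: float       # generator fundamental frequency
    syllable_duration_s: float  # nominal duration of a syllable segment
    stress_gain: float          # relative emphasis on stressed syllables


def modulate(base: Prosody, urgency: float) -> Prosody:
    """Raise pitch and stress and shorten syllables as urgency goes from 0 to 1."""
    u = min(max(urgency, 0.0), 1.0)  # clamp to the supported range
    return replace(
        base,
        fundamental_hz=base.fundamental_hz * (1.0 + 0.2 * u),
        syllable_duration_s=base.syllable_duration_s * (1.0 - 0.25 * u),
        stress_gain=base.stress_gain * (1.0 + 0.5 * u),
    )


calm = Prosody(fundamental_hz=100.0, syllable_duration_s=0.2, stress_gain=1.0)
print(modulate(calm, 0.0))
print(modulate(calm, 1.0))
```

Because the modulation is applied on top of a base voice, the same announcement keeps its system-specific voice while still sounding more or less urgent.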
- The invention has the advantage that, for example in a vehicle, only a single speech generator with a small parameter memory is needed, and it can be driven by several information sources.
- The information sources can be given different voice characteristics.
- Emotional expression in the voice can also be conveyed according to the invention.
- Predefined parameter templates make it easy to change the voice characteristics.
- The method is also suitable for converting free text into speech (text to speech), e.g. for reading e-mail aloud.
- Fig. 1 shows a schematic diagram of a preferred embodiment of the invention for differentiated speech output with multiple systems.
- The preferred embodiment illustrated in Fig. 1 comprises a speech output unit 1 with a speech synthesis device 10, which in the example is a vocal tract synthesis module based on full synthesis of speech.
- As the speech synthesis device 10, a formant synthesizer such as KLATTALK can be used, for example.
- The speech synthesis device 10 is connected to an amplifier 12 whose output 14 provides an audio signal that is output as speech through a loudspeaker (not shown).
- N parameter sets 21, 22 to 2N are assigned to the speech synthesis device 10; in the example shown they are stored in a memory 20 of the speech output unit 1.
- N systems 31, 32 to 3N are shown, each connected to the speech output unit 1 via a data connection such as individual lines, a bus system or data channels.
- Each system can output speech via the speech output unit.
- The example shows an on-board computer 31 with an associated parameter set 21, a navigation system 32 with an associated parameter set 22, a traffic information system 33 with an associated parameter set 23, and an e-mail system 34, such as a TTS system, with an associated parameter set 24.
- Additional systems 3N may be provided with a respective assigned parameter set 2N.
- For example, a parameter set 23 may be provided with which a hard male bass voice is used for the speech output.
- The voice outputs may be issued sequentially in the order in which the output requests are received from the systems.
- Higher-priority information, e.g. traffic information about a danger such as a wrong-way driver, may be output first.
- Information of the highest priority, e.g. information from the on-board computer about a vehicle malfunction or the onset of slippery road conditions, may be output immediately, interrupting a speech output in progress. The interrupted speech output can then be completed or repeated.
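The ordering rules above can be sketched with a priority queue. The three priority levels, the class `OutputScheduler`, and the choice to repeat (rather than resume) an interrupted output are assumptions for illustration only.

```python
import heapq
from itertools import count

ROUTINE, ELEVATED, HIGHEST = 2, 1, 0  # smaller number = more urgent (assumed levels)


class OutputScheduler:
    def __init__(self):
        self._queue = []
        self._order = count()  # tie-breaker keeps arrival order within a priority
        self.current = None    # (priority, seq, system, text) now being spoken
        self.spoken = []       # completed (system, text) outputs

    def request(self, system, text, priority=ROUTINE):
        item = (priority, next(self._order), system, text)
        interrupts = (
            priority == HIGHEST
            and self.current is not None
            and self.current[0] != HIGHEST
        )
        if interrupts:
            # Re-queue the running output so it is repeated after the interruption.
            heapq.heappush(self._queue, self.current)
            self.current = item
        else:
            heapq.heappush(self._queue, item)

    def start_next(self):
        """Start the most urgent pending output if nothing is playing."""
        if self.current is None and self._queue:
            self.current = heapq.heappop(self._queue)
        return self.current

    def finish_current(self):
        """Mark the running output as completed."""
        if self.current is not None:
            self.spoken.append(self.current[2:])
            self.current = None


s = OutputScheduler()
s.request("navigation", "turn left in 200 m")
s.request("traffic", "wrong-way driver on your route", ELEVATED)
s.start_next()  # the traffic message starts first because of its priority
s.request("on_board_computer", "engine malfunction", HIGHEST)  # interrupts it
s.finish_current()
while s.start_next():
    s.finish_current()
print(s.spoken)
```

The interrupted traffic message keeps its original sequence number when it is re-queued, so it is repeated ahead of any later request of the same priority.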
- The invention has the advantage that systems with an acoustic display provide the driver with information from various systems without distracting him from the driving task, as visual displays do.
- The use of a single speech synthesis device that can be used by various on-board computers can save costs. Compared with speech generation methods previously used in, for example, navigation systems, the storage space requirement can be reduced.
- The invention is used particularly advantageously in motor vehicles.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Navigation (AREA)
- Traffic Control Systems (AREA)
Abstract
Claims (10)
- Device (1) for differentiated speech output which can be connected to a first system (31) and at least one further system (32, 33 to 3N), the speech output of the first system (31) being associated with a first voice characteristic and the further speech output of the further system (32, 33 to 3N) being associated with a further voice characteristic that differs audibly from the first characteristic,
characterized by
a speech synthesis device (10) containing control parameters that comprise a first class of dynamic parameters and a second class of static parameters,
the dynamic parameters controlling the articulation as a function of the movement of the vocal tract and the static parameters controlling the voice characteristics,
the static parameters for the systems being stored as associated parameter sets in a memory (20) of the speech output device, an associated parameter set being used by the speech synthesis device (10) for the speech output depending on a selection signal from a system, and the dynamic parameters being stored in each system as a function of the sequence of words, sentences or sentence sequences.
- Device according to claim 1, in which the static parameters comprise a generator fundamental frequency and/or fixed formants which preferably correspond to the different geometric dimensions of the vocal tract for a child, female or male speaker.
- Device according to claim 2, in which the generator and/or formant parameters can be modified by different devices, preferably achieving audible differences in prosody such as the duration and/or stress of syllable segments and/or the sentence melody.
- Device according to any one of claims 1 to 3, in which the speech synthesis device (10) is a formant synthesizer by means of which the voice characteristics can be influenced.
- Device according to claim 4, in which the formant synthesizer is suitable for separately generating voiced and unvoiced sounds and in which, in particular by means of further parameters, additional resonators or attenuators can be switched in and/or the dynamic parameters for articulation can be influenced.
- Device according to any one of claims 1 to 5, in which the speech synthesis device (10) is connected to an amplifier (12) and the speech is output via an audio output (14) of the amplifier (12).
- System according to any one of claims 1 to 6, comprising: a first output for delivering dynamic parameters, and a second output for delivering a selection signal for switching a parameter set in the speech output device (10).
- System for use with a device according to any one of claims 1 to 6, comprising: an output for delivering dynamic parameters and static parameters, preferably as a parameter set, to the speech output device (10).
- Combination of a device according to any one of claims 1 to 6, comprising: at least a first and a further system such as an on-board computer (31), a navigation system (32), a traffic information system (33), an e-mail system (34) or an information system (3N), preferably for use in a vehicle.
- Method for differentiated speech output using a device according to any one of claims 1 to 6.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE10063503 | 2000-12-20 | ||
DE10063503A DE10063503A1 (de) | 2000-12-20 | 2000-12-20 | Device and method for differentiated speech output |
PCT/EP2001/013488 WO2002050815A1 (fr) | 2000-12-20 | 2001-11-21 | Device and method for differentiated speech output |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1344211A1 (fr) | 2003-09-17 |
EP1344211B1 (fr) | 2011-02-16 |
Family
ID=7667936
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP01991746A (EP1344211B1 (fr), Expired - Lifetime) | Device and method for differentiated speech output | 2000-12-20 | 2001-11-21 |
Country Status (6)
Country | Link |
---|---|
US (1) | US7698139B2 (fr) |
EP (1) | EP1344211B1 (fr) |
JP (1) | JP2004516515A (fr) |
DE (2) | DE10063503A1 (fr) |
ES (1) | ES2357700T3 (fr) |
WO (1) | WO2002050815A1 (fr) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2412046A (en) * | 2004-03-11 | 2005-09-14 | Seiko Epson Corp | Semiconductor device having a TTS system to which is applied a voice parameter set |
DE102005063077B4 (de) * | 2005-12-29 | 2011-05-05 | Airbus Operations Gmbh | Recording of digital cockpit-ground communication on a crash-protected voice recorder |
DE602007004604D1 (de) * | 2006-06-02 | 2010-03-18 | Koninkl Philips Electronics Nv | Speech differentiation |
DE102008019071A1 (de) * | 2008-04-15 | 2009-10-29 | Continental Automotive Gmbh | Method, driver information system and driver assistance system for outputting information |
JP7133149B2 (ja) * | 2018-11-27 | 2022-09-08 | トヨタ自動車株式会社 | Automated driving device, car navigation device and driving assistance system |
JP7336862B2 (ja) * | 2019-03-28 | 2023-09-01 | 株式会社ホンダアクセス | Vehicle navigation device |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5667470A (en) * | 1979-11-07 | 1981-06-06 | Canon Inc | Voice desk-top calculator |
US5559927A (en) * | 1992-08-19 | 1996-09-24 | Clynes; Manfred | Computer system producing emotionally-expressive speech messages |
US5561736A (en) * | 1993-06-04 | 1996-10-01 | International Business Machines Corporation | Three dimensional speech synthesis |
JPH08328573A (ja) * | 1995-05-29 | 1996-12-13 | Sanyo Electric Co Ltd | Karaoke device, audio playback device, and recording medium used therewith |
US5924068A (en) * | 1997-02-04 | 1999-07-13 | Matsushita Electric Industrial Co. Ltd. | Electronic news reception apparatus that selectively retains sections and searches by keyword or index for text to speech conversion |
JP3287281B2 (ja) * | 1997-07-31 | 2002-06-04 | トヨタ自動車株式会社 | Message processing device |
JP3502247B2 (ja) * | 1997-10-28 | 2004-03-02 | ヤマハ株式会社 | Voice conversion device |
DE19908137A1 (de) * | 1998-10-16 | 2000-06-15 | Volkswagen Ag | Method and device for automatically controlling at least one appliance by voice dialogue |
US20020087655A1 (en) * | 1999-01-27 | 2002-07-04 | Thomas E. Bridgman | Information system for mobile users |
GB9925297D0 (en) * | 1999-10-27 | 1999-12-29 | Ibm | Voice processing system |
US6181996B1 (en) * | 1999-11-18 | 2001-01-30 | International Business Machines Corporation | System for controlling vehicle information user interfaces |
US6539354B1 (en) * | 2000-03-24 | 2003-03-25 | Fluent Speech Technologies, Inc. | Methods and devices for producing and using synthetic visual speech based on natural coarticulation |
2000
- 2000-12-20: DE, application DE10063503A, publication DE10063503A1 (de), status: not active (Ceased)
2001
- 2001-11-21: DE, application DE50115798T, publication DE50115798D1 (de), status: not active (Expired - Lifetime)
- 2001-11-21: ES, application ES01991746T, publication ES2357700T3 (es), status: not active (Expired - Lifetime)
- 2001-11-21: JP, application JP2002551833A, publication JP2004516515A (ja), status: active (Pending)
- 2001-11-21: WO, application PCT/EP2001/013488, publication WO2002050815A1 (fr), status: active (Application Filing)
- 2001-11-21: EP, application EP01991746A, publication EP1344211B1 (fr), status: not active (Expired - Lifetime)
2003
- 2003-06-20: US, application US10/465,839, publication US7698139B2 (en), status: not active (Expired - Lifetime)
Also Published As
Publication number | Publication date |
---|---|
DE10063503A1 (de) | 2002-07-04 |
ES2357700T3 (es) | 2011-04-28 |
DE50115798D1 (de) | 2011-03-31 |
EP1344211A1 (fr) | 2003-09-17 |
WO2002050815A1 (fr) | 2002-06-27 |
US20030225575A1 (en) | 2003-12-04 |
US7698139B2 (en) | 2010-04-13 |
JP2004516515A (ja) | 2004-06-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60020773T2 (de) | Graphical user interface and method for changing pronunciations in speech synthesis and recognition systems | |
DE602005002706T2 (de) | Method and system for text-to-speech conversion | |
DE60035001T2 (de) | Speech synthesis with prosody patterns | |
DE69821673T2 (de) | Method and device for editing synthetic speech messages, and storage medium containing the method | |
DE60112512T2 (de) | Coding of expression in speech synthesis | |
DE19610019C2 (de) | Digital speech synthesis method | |
DE69925932T2 (de) | Speech synthesis by concatenation of speech waveforms | |
DE69909716T2 (de) | Formant speech synthesizer using concatenation of half-syllables with independent cross-fading in the filter coefficient and source domains | |
US6405169B1 (en) | Speech synthesis apparatus | |
DE60004420T2 (de) | Detection of regions of overlapping elements for a concatenative speech synthesis system | |
DE2115258A1 (de) | Speech synthesis by concatenation of words encoded in formant form | |
DE10042944A1 (de) | Grapheme-phoneme conversion | |
DE69627865T2 (de) | Speech synthesizer with a database of acoustic elements | |
EP1105867B1 (fr) | Method and device for concatenating audio segments taking coarticulation into account | |
EP1282897B1 (fr) | Method for producing a speech database for a target vocabulary for training a speech recognition system | |
EP1344211B1 (fr) | Device and method for differentiated speech output | |
EP0058130B1 (fr) | Method for speech synthesis with an unlimited vocabulary, and device for carrying out said method | |
EP1110203B1 (fr) | Method and device for digital voice processing | |
EP0725382A2 (fr) | Method and device for providing digitally coded road traffic information by synthetically generated speech | |
EP1554715B1 (fr) | Method for computer-assisted speech synthesis of an analog speech signal from electronic text, speech synthesis device and telecommunication apparatus | |
JPH09179576A (ja) | Speech synthesis method | |
EP2592623B1 (fr) | Technique for outputting an acoustic signal by means of a navigation system | |
DE19837661C2 (de) | Method and device for coarticulation-appropriate concatenation of audio segments | |
EP1170723A2 (fr) | Method for calculating phoneme duration statistics and method for determining the duration of individual phonemes for speech synthesis | |
JP2577372B2 (ja) | Speech synthesis device and method |
Legal Events
Code | Title | Description |
---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase | Free format text: ORIGINAL CODE: 0009012 |
17P | Request for examination filed | Effective date: 20030425 |
AK | Designated contracting states | Kind code of ref document: A1; Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
RBV | Designated contracting states (corrected) | Designated state(s): DE ES FR GB IT SE |
17Q | First examination report despatched | Effective date: 20070808 |
GRAP | Despatch of communication of intention to grant a patent | Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
GRAS | Grant fee paid | Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
GRAA | (expected) grant | Free format text: ORIGINAL CODE: 0009210 |
AK | Designated contracting states | Kind code of ref document: B1; Designated state(s): DE ES FR GB IT SE |
REG | Reference to a national code | Ref country code: GB; Ref legal event code: FG4D; Free format text: NOT ENGLISH |
REF | Corresponds to: | Ref document number: 50115798; Country of ref document: DE; Date of ref document: 20110331; Kind code of ref document: P |
REG | Reference to a national code | Ref country code: DE; Ref legal event code: R096; Ref document number: 50115798; Country of ref document: DE; Effective date: 20110331 |
REG | Reference to a national code | Ref country code: ES; Ref legal event code: FG2A; Ref document number: 2357700; Country of ref document: ES; Kind code of ref document: T3; Effective date: 20110428 |
REG | Reference to a national code | Ref country code: SE; Ref legal event code: TRGR |
PLBE | No opposition filed within time limit | Free format text: ORIGINAL CODE: 0009261 |
STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
26N | No opposition filed | Effective date: 20111117 |
REG | Reference to a national code | Ref country code: DE; Ref legal event code: R097; Ref document number: 50115798; Country of ref document: DE; Effective date: 20111117 |
REG | Reference to a national code | Ref country code: FR; Ref legal event code: PLFP; Year of fee payment: 15 |
REG | Reference to a national code | Ref country code: FR; Ref legal event code: PLFP; Year of fee payment: 16 |
REG | Reference to a national code | Ref country code: FR; Ref legal event code: PLFP; Year of fee payment: 17 |
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] | Ref country code: DE; Payment date: 20201126; Year of fee payment: 20. Ref country code: GB; Payment date: 20201123; Year of fee payment: 20. Ref country code: ES; Payment date: 20201214; Year of fee payment: 20. Ref country code: FR; Payment date: 20201119; Year of fee payment: 20. Ref country code: IT; Payment date: 20201130; Year of fee payment: 20. Ref country code: SE; Payment date: 20201123; Year of fee payment: 20 |
REG | Reference to a national code | Ref country code: DE; Ref legal event code: R071; Ref document number: 50115798; Country of ref document: DE |
REG | Reference to a national code | Ref country code: GB; Ref legal event code: PE20; Expiry date: 20211120 |
REG | Reference to a national code | Ref country code: SE; Ref legal event code: EUG |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: GB; Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION; Effective date: 20211120 |
REG | Reference to a national code | Ref country code: ES; Ref legal event code: FD2A; Effective date: 20220228 |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: ES; Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION; Effective date: 20211122 |
P01 | Opt-out of the competence of the unified patent court (upc) registered | Effective date: 20230502 |