EP1110203B1 - Method and device for digital voice processing (Procede et dispositif de traitement numerique de la voix) - Google Patents
- Publication number
- EP1110203B1 (application EP99947314A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- prosody
- generating
- speaker
- speech
- generated
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
Definitions
- The present invention relates to an apparatus and a method for digital speech processing or speech generation.
- Current systems for digital voice output have so far been used in environments in which a synthetic voice is acceptable or even desired.
- The present invention relates to a system that enables natural-sounding speech to be generated synthetically.
- The commands embedded in the text stream can also include information about the characteristics of the speaker (i.e. parameters of the speaker model).
- EP 0762384 describes a system in which these speaker characteristics can be entered on screen via a graphical user interface.
- The speech synthesis uses auxiliary information that is stored in a database (e.g. as a "waveform sequence" in EP 0831460).
- Joining the individual sequences leads to distortions and acoustic artifacts unless measures are taken to suppress them.
- This problem (referred to as "segmental quality") is considered largely solved today (see e.g. Volker Kraft: Linking natural language building blocks for speech synthesis: requirements, techniques and evaluation. Fortschr.-Ber. VDI Reihe 10 Nr. 468, VDI-Verlag 1997). Nevertheless, modern speech synthesis systems still have a number of other problems.
- One problem in digital speech output is, for example, multi-language capability.
- Another problem is improving the prosodic quality, i.e. the quality of the intonation; compare for example "Volker Kraft: Linking natural language building blocks for speech synthesis: requirements, techniques and evaluation. Fortschr.-Ber. VDI Reihe 10 Nr. 468, VDI-Verlag 1997".
- The difficulty arises because the intonation can be reconstructed only inadequately from the orthographic input information. It also depends on higher levels such as semantics and pragmatics, as well as on the speaking situation and the type of speaker.
- DE-A-196 10 019 discloses a method and an apparatus for digital speech processing with a sentence melody generation device for generating a sentence melody for a text. It also makes it possible to display the speech signals in the time and frequency domain and, by marking the time signal, to change the fundamental frequencies of certain segments.
- The applications range from producing simple texts for multimedia applications to film dubbing, radio plays and audio books.
- A further object of the present invention is therefore to provide such options for intervention.
- The object of the invention is achieved in that the sentence melody generated for a text can be modified by means of an editor.
- Particular embodiments of the invention allow, in addition to editing the sentence melody, the editing of other characteristics of the synthetically generated speech.
- The starting point is the written text. However, to achieve sufficient (especially prosodic) quality and to achieve dramatic effects, the user is given extensive options for intervention in a preferred embodiment.
- The user acts as a director who defines the speakers on the system and prescribes their speech rhythm and melody, pronunciation and emphasis.
- The present invention also includes generating a phonetic transcription for a written text, as well as providing the possibility of modifying the generated phonetic transcription or of generating the phonetic transcription on the basis of modifiable rules.
- In this way, for example, a particular accent of a speaker can be generated.
- The invention comprises a dictionary device in which the words of one or more languages are stored along with their pronunciation. In the latter case, this enables multilingual capability, i.e. the processing of texts in different languages.
- The speech processing includes speaker models that are either predefined or can be defined or modified by the user. This allows the characteristics of different speakers to be realized, be they male or female voices, or different accents of a speaker, such as a Bavarian, Swabian or North German accent.
- The device consists of a dictionary in which all words are stored together with their pronunciation in phonetic transcription (where phonetic transcription is mentioned below, this means any phonetic notation, such as the SAMPA notation, cf. e.g. "Multilingual speech input/output assessment, methodology and standardization, standard computer-compatible transcription, pp 29-31, in Esprit Project 2589 (SAM) Fin. Report SAM-UCC-037", or the International Phonetic Alphabet known from language teaching aids, cf. e.g. "The Principles of the International Phonetic Association: A description of the International Phonetic Alphabet and the Manner of Using it. International Phonetic Association, Dept, Phonetics, Univ.
- The invention can be realized either as a hybrid of software and hardware or entirely in software.
- The generated digital speech signals can be output via a dedicated device for digital audio or via a PC sound card.
- Figure 1 shows a block diagram of a device for digital speech generation according to an embodiment of the present invention.
- It consists of several individual components, which can be realized by means of one or more digital computing systems, and whose operation and interaction are described in more detail below.
- The dictionary 100 consists of simple tables (one for each language) in which the words of a language are stored along with their pronunciation.
- The tables can be extended as required with additional words and their pronunciation.
- For special purposes, e.g. for creating accents, additional tables with different phonetic entries can also be created within one language. Each of the different speakers is assigned one table of the dictionary.
- The translator 110 generates the phonetic transcription by replacing the words of the entered text with their phonetic counterparts from the dictionary. If the speaker model contains modifiers, which are described in more detail later, the translator uses them to modify the pronunciation.
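The word-by-word lookup performed by the translator can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the table contents, the SAMPA-like strings, the function name `translate` and the representation of modifiers as a symbol-to-symbol mapping are all hypothetical and do not come from the patent.

```python
# Hypothetical per-language dictionary table: word -> phonetic transcription.
# The SAMPA-like strings are chosen purely for illustration.
DICTIONARY_DE = {
    "guten": "g u: t @ n",
    "tag": "t a: k",
}


def translate(text, dictionary, modifiers=None):
    """Replace each word by its phonetic counterpart from the dictionary.

    `modifiers` is an assumed, simplified form of speaker-model modifiers:
    a mapping from one phonetic symbol to its replacement.
    Unknown words are kept in angle brackets so the user can fix them.
    """
    modifiers = modifiers or {}
    result = []
    for word in text.lower().split():
        phones = dictionary.get(word)
        if phones is None:
            result.append(f"<{word}>")
            continue
        symbols = [modifiers.get(s, s) for s in phones.split()]
        result.append(" ".join(symbols))
    return " | ".join(result)
```

Calling `translate("Guten Tag", DICTIONARY_DE, {"a:": "O:"})` would apply the hypothetical accent modifier to the second word only, which mirrors the idea of one dictionary table being adapted per speaker.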
- Such heuristics are, e.g., the model by Fujisaki (1992) or other acoustic methods, as well as perceptual models, e.g. that of d'Alessandro and Mertens (1995).
- These, as well as older linguistic models, are described e.g. in "Thierry Dutoit: An Introduction to Text-to-Speech Synthesis, Kluwer 1997".
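To make the reference to the Fujisaki model more concrete, the following Python sketch evaluates a fundamental-frequency contour in the commonly cited form of that model: a base frequency plus phrase and accent components summed in the log-frequency domain. The patent only names the model; the function names, the argument layout and the default constants (`alpha`, `beta`, `gamma`) are illustrative assumptions.

```python
import math


def phrase_response(t, alpha=3.0):
    """Phrase component impulse response: alpha^2 * t * exp(-alpha * t) for t >= 0."""
    return alpha * alpha * t * math.exp(-alpha * t) if t >= 0 else 0.0


def accent_response(t, beta=20.0, gamma=0.9):
    """Accent component step response, clipped at the ceiling gamma."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def fujisaki_f0(t, base_hz, phrases, accents):
    """Return F0 in Hz at time t (seconds).

    phrases: list of (onset_time, amplitude) phrase commands
    accents: list of (onset_time, offset_time, amplitude) accent commands
    """
    log_f0 = math.log(base_hz)
    for t0, ap in phrases:
        log_f0 += ap * phrase_response(t - t0)
    for t1, t2, aa in accents:
        log_f0 += aa * (accent_response(t - t1) - accent_response(t - t2))
    return math.exp(log_f0)
```

Sampling `fujisaki_f0` at regular time steps would give a generated sentence melody that an editor could then display and let the user adjust.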
- The user thus has a tool with which pronunciation, intonation, emphasis, tempo, volume, pauses, etc. can be entered and changed.
- The user assigns to the text sections to be processed a speaker model 130, whose structure and mode of operation are explained in more detail later.
- The translator responds to this assignment by adapting the phonetics and, if necessary, the prosody to the speaker model and regenerating them.
- The phonetics are displayed to the user in phonetic transcription, and the prosody e.g. in a symbolism borrowed from music (musical notation).
- The user can then change these specifications, listen to individual sections of text, refine the entries again, and so on.
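One way to display prosody in a music-like symbolism is to map each fundamental-frequency value to the nearest note of the equal-tempered scale. The sketch below assumes that the sentence melody is available as one F0 value per syllable; the function names and this per-syllable representation are hypothetical, and the patent does not prescribe this particular mapping.

```python
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def hz_to_note(freq_hz, a4_hz=440.0):
    """Map a fundamental frequency to the nearest equal-tempered note name."""
    midi = round(69 + 12 * math.log2(freq_hz / a4_hz))  # MIDI note 69 is A4
    octave = midi // 12 - 1
    return f"{NOTE_NAMES[midi % 12]}{octave}"


def contour_to_notes(f0_contour_hz):
    """Convert a list of per-syllable F0 values (assumed input) into note names."""
    return [hz_to_note(f) for f in f0_contour_hz]
```

An editor could render these note names on a staff and convert an edited note back to a frequency with the inverse formula.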
- Speaker models 130 are, for example, parameterizations for speech production.
- The function of the vocal cords is represented by a pulse sequence, of which only the frequency (pitch) can be changed.
- The other characteristics of the vocal tract (oral cavity, nasal cavity) are realized with digital filters.
- Their parameters are stored in the speaker model. Standard models are stored (child, young lady, old man, etc.).
- The user can derive additional models from them by suitably choosing or changing the parameters and saving the model.
- The parameters stored here are used during speech generation, which is explained in more detail later, together with the prosody information for intonation.
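The pulse-sequence source and the digital filters described above correspond to a classic source-filter arrangement. The following Python sketch is a minimal illustration, assuming that each vocal-tract resonance is modelled by a standard two-pole resonator; the speaker-model layout, the parameter values and all function names are hypothetical and not taken from the patent.

```python
import math


def pulse_train(pitch_hz, duration_s, sample_rate=16000):
    """Source signal: unit pulses spaced one pitch period apart."""
    period = round(sample_rate / pitch_hz)
    n = round(duration_s * sample_rate)
    return [1.0 if i % period == 0 else 0.0 for i in range(n)]


def resonator(signal, center_hz, bandwidth_hz, sample_rate=16000):
    """Two-pole digital resonator standing in for one vocal-tract resonance."""
    r = math.exp(-math.pi * bandwidth_hz / sample_rate)  # pole radius < 1
    theta = 2.0 * math.pi * center_hz / sample_rate      # pole angle
    a1, a2 = 2.0 * r * math.cos(theta), -r * r
    y1 = y2 = 0.0
    out = []
    for x in signal:
        y = x + a1 * y1 + a2 * y2
        out.append(y)
        y1, y2 = y, y1
    return out


# Hypothetical speaker model: a pitch plus (center, bandwidth) filter parameters.
SPEAKER_MODEL = {"pitch_hz": 120.0, "resonances": [(700.0, 130.0), (1220.0, 70.0)]}


def synthesize(model, duration_s=0.05, sample_rate=16000):
    """Run the pulse train through each resonator of the speaker model in turn."""
    signal = pulse_train(model["pitch_hz"], duration_s, sample_rate)
    for center, bandwidth in model["resonances"]:
        signal = resonator(signal, center, bandwidth, sample_rate)
    return signal
```

Deriving a new speaker in this sketch amounts to copying the dictionary and changing the pitch or the filter parameters, which is the kind of operation the text attributes to the user.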
- A speaker model can, for example, concern the rules according to which the translator creates the phonetic transcription; different speaker models can follow different rules. However, it can also correspond to a certain set of filter parameters, so that the speech signals are processed in accordance with the speaker characteristics specified thereby. Of course, any combination of these two aspects of a speaker model is conceivable.
- The task of the speech generation unit 140 is to generate, from the given text together with the additional phonetic and prosodic information created by the translator and edited by the user, a numerical data stream that represents digital speech signals.
- This data stream can then be converted by an output device 150, such as a digital audio device or a sound card in a PC, into analog sound signals in order to output the text.
- A conventional text-to-speech conversion method can be used for speech generation, except that the pronunciation and the sentence melody have already been created. In general, a distinction is made between rule-based and concatenation-based synthesizers.
- Concatenation-based synthesizers are simpler to handle. They work with a database that stores all possible pairs of sounds. These can easily be concatenated, although high-quality systems have high computing-time requirements. Such systems are described in "Thierry Dutoit: An Introduction to Text-to-Speech Synthesis, Kluwer 1997" and in "Volker Kraft: Linking natural language building blocks for speech synthesis: requirements, techniques and evaluation. Fortschr.-Ber. VDI Reihe 10 Nr. 468, VDI-Verlag 1997".
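The idea of chaining stored sound pairs can be sketched as follows. The example assumes a database that maps each pair of adjacent phonetic symbols to a list of samples and joins the units with a short linear cross-fade; the silence symbol `_`, the function names and the cross-fade itself are illustrative assumptions rather than the method of any cited system.

```python
def to_diphones(phonemes):
    """Split a phoneme sequence into overlapping pairs, padded with silence '_'."""
    padded = ["_"] + list(phonemes) + ["_"]
    return list(zip(padded, padded[1:]))


def concatenate(phonemes, database, overlap=2):
    """Chain stored sound-pair waveforms with a short linear cross-fade."""
    output = []
    for key in to_diphones(phonemes):
        unit = database[key]  # a KeyError signals a pair missing from the database
        n = min(overlap, len(output), len(unit))
        start = len(output) - n
        for i in range(n):
            w = (i + 1) / (n + 1)  # fade-in weight of the new unit
            output[start + i] = (1 - w) * output[start + i] + w * unit[i]
        output.extend(unit[n:])
    return output
```

The cross-fade is one simple measure against the discontinuities that arise at unit boundaries; real systems use considerably more elaborate smoothing.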
- Digital filters, e.g. a bandpass filter for a telephone effect
- Reverberation generators, etc.
- Sounds stored in an archive 170 can be used.
- The archive 170 stores sounds such as road noise, railway noise, children crying, ocean waves, background music, etc.
- The archive can be extended with the user's own sounds.
- The archive can simply be a collection of files with digitized sounds, but it can also be a database in which the sounds are stored as blobs (binary large objects).
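A database-backed archive with sounds stored as blobs can be sketched with Python's standard `sqlite3` module. The choice of SQLite, the table name `sounds`, its columns and the function names are assumptions made for illustration; the patent does not name a particular database.

```python
import sqlite3


def open_archive(path=":memory:"):
    """Create (or open) a sound archive backed by an SQLite database."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sounds (name TEXT PRIMARY KEY, data BLOB NOT NULL)"
    )
    return conn


def store_sound(conn, name, pcm_bytes):
    """Insert or replace a digitized sound under the given name."""
    conn.execute(
        "INSERT OR REPLACE INTO sounds (name, data) VALUES (?, ?)", (name, pcm_bytes)
    )
    conn.commit()


def load_sound(conn, name):
    """Return the stored bytes, or None if the archive has no such sound."""
    row = conn.execute("SELECT data FROM sounds WHERE name = ?", (name,)).fetchone()
    return None if row is None else bytes(row[0])
```

Extending the archive with the user's own sounds then reduces to another `store_sound` call with the digitized audio bytes.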
- In the mixing device 180, the generated speech signals are combined with the background sounds.
- The volume of all signals can be adjusted before they are combined. It is also possible to apply effects to each signal individually or to all of them together.
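A software mixing stage of this kind can be sketched as a gain-weighted sum of sample lists. The representation of signals as lists of floats in the range [-1.0, 1.0], the hard clipping and the function name `mix` are assumptions made for this illustration.

```python
def mix(tracks, gains):
    """Sum several sample lists after scaling each one by its own gain.

    Shorter tracks are treated as silent once they end, and the result
    is clipped to the range [-1.0, 1.0].
    """
    if len(tracks) != len(gains):
        raise ValueError("one gain per track is required")
    length = max((len(t) for t in tracks), default=0)
    mixed = []
    for i in range(length):
        sample = sum(g * t[i] for t, g in zip(tracks, gains) if i < len(t))
        mixed.append(max(-1.0, min(1.0, sample)))
    return mixed
```

Per-signal effects would be applied to an individual track before calling `mix`, and effects on the whole composition to its return value.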
- The resulting signal can be sent to a suitable device for digital audio 150, such as a PC sound card, and thus checked acoustically or output.
- A storage device (not shown) is provided to store the signal so that it can later be transferred in a suitable manner to the target medium.
- A device conventionally implemented in hardware can be used as the mixing device, or it can be implemented in software and integrated into the overall program.
- The output device 150 can be replaced by another computer that is coupled to the mixing device 180 via a network connection.
- In this way, the generated speech signal can be transmitted via a computer network such as the Internet to another computer.
- The speech signal generated by the speech generator 140 can also be transmitted directly to the output device 150 without the detour via the mixing device 180. Further comparable modifications will be readily apparent to the person skilled in the art.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Claims (21)
- Device for digital speech processing, comprising: a sentence melody generation device for generating a sentence melody for a text, characterized by an editing device for displaying and modifying the generated sentence melody.
- Device according to claim 1, further comprising: a translation device for translating the text into a phonetic transcription, the editing device further comprising: a device for displaying or modifying the generated phonetic transcription.
- Device according to claim 1 or 2, in which the sentence melody generation device and/or the translation device generate(s) the sentence melody and/or the phonetic transcription on the basis of or as a function of a particular speaker model.
- Device according to any one of claims 1 to 3, further comprising a device for selecting and/or modifying one or more speaker models.
- Device according to claim 4, in which the device for modifying speaker models comprises: a device for modifying elements of the phonetic transcription in order to generate accents.
- Device for generating a digital voice, comprising a device for digital voice processing according to any one of claims 1 to 5; and a device for generating voice signals on the basis of the phonetic transcription and/or the sentence melody, modified where applicable by means of the editing device.
- Device according to claim 6, in which the voice signal generation device further comprises: a speaker model processing device for generating voice signals on the basis of or as a function of a particular speaker model.
- Device according to claim 7, in which the speaker model processing device has one or more of the following features: a digital filter system, a device for accommodating a set of filter parameters that represents a particular speaker model.
- Device according to claim 7 or 8, in which the speaker model processing device further comprises: a device for selecting and/or modifying a speaker model.
- Device according to any one of claims 6 to 9, further comprising: an effects device for generating sound effects.
- Device according to claim 10, in which the effects device has one or more of the following features: a digital filter system for modifying the generated voice signals and/or a reverberation generator for generating a reverberation effect.
- Device according to any one of claims 6 to 11, further comprising: an archive device for storing sounds, and a mixing system for mixing the generated voice signals with sounds stored in the archive device.
- Device according to any one of the preceding claims, further comprising: a graphical user interface for editing the generated phonetic transcription and/or sentence melody.
- Device according to any one of the preceding claims, further comprising: a device for modifying the speech rhythm and/or the pronunciation and/or the emphasis.
- Device according to any one of the preceding claims, further comprising: a display device that displays the sentence melody by means of a pictographic notation.
- Device according to any one of the preceding claims, further comprising: a dictionary device in which the words of one or more languages are stored together with their pronunciation.
- Device according to claim 16, in which different phonetic entries are stored in the dictionary device for at least one dictionary entry.
- Device according to any one of claims 6 to 17, further comprising: a device for converting the digital voice signals into acoustic signals.
- Method for digital voice processing, comprising the following steps: generating a sentence melody for a text, the method being characterized by displaying the generated sentence melody; editing the generated and displayed sentence melody.
- Method according to claim 19, further comprising the following step: using an apparatus according to any one of claims 1 to 18 to generate a digital voice.
- Computer program comprising: a means, in particular a data carrier, for storing and/or transmitting digital data readable by a computer, characterized in that the stored and/or transmitted data have the following feature: a sequence of computer-executable instructions which cause the computer to carry out a method according to claim 19 or 20.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE19841683A DE19841683A1 (de) | 1998-09-11 | 1998-09-11 | Vorrichtung und Verfahren zur digitalen Sprachbearbeitung |
DE19841683 | 1998-09-11 | ||
PCT/EP1999/006712 WO2000016310A1 (fr) | 1998-09-11 | 1999-09-10 | Procede et dispositif de traitement numerique de la voix |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1110203A1 (fr) | 2001-06-27 |
EP1110203B1 (fr) | 2002-08-14 |
Family
ID=7880683
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP99947314A Expired - Lifetime EP1110203B1 (fr) | 1998-09-11 | 1999-09-10 | Procede et dispositif de traitement numerique de la voix |
Country Status (7)
Country | Link |
---|---|
EP (1) | EP1110203B1 (fr) |
JP (1) | JP2002525663A (fr) |
AT (1) | ATE222393T1 (fr) |
AU (1) | AU769036B2 (fr) |
CA (1) | CA2343071A1 (fr) |
DE (2) | DE19841683A1 (fr) |
WO (1) | WO2000016310A1 (fr) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE10117367B4 (de) * | 2001-04-06 | 2005-08-18 | Siemens Ag | Verfahren und System zur automatischen Umsetzung von Text-Nachrichten in Sprach-Nachrichten |
JP2002318593A (ja) * | 2001-04-20 | 2002-10-31 | Sony Corp | 言語処理装置および言語処理方法、並びにプログラムおよび記録媒体 |
AT6920U1 (de) | 2002-02-14 | 2004-05-25 | Sail Labs Technology Ag | Verfahren zur erzeugung natürlicher sprache in computer-dialogsystemen |
DE10207875A1 (de) * | 2002-02-19 | 2003-08-28 | Deutsche Telekom Ag | Parametergesteuerte Sprachsynthese |
CA2557079A1 (fr) | 2004-03-05 | 2005-09-22 | Lessac Technologies, Inc. | Codes pour la synthese de la parole en texte, utilisation de ces derniers dans des systemes de parole informatises |
DE102004012208A1 (de) * | 2004-03-12 | 2005-09-29 | Siemens Ag | Individualisierung von Sprachausgabe durch Anpassen einer Synthesestimme an eine Zielstimme |
DE102008044635A1 (de) | 2008-07-22 | 2010-02-04 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Bereitstellen einer Fernsehsequenz |
US10424288B2 (en) | 2017-03-31 | 2019-09-24 | Wipro Limited | System and method for rendering textual messages using customized natural voice |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5695295A (en) * | 1979-12-28 | 1981-08-01 | Sharp Kk | Voice sysnthesis and control circuit |
FR2494017B1 (fr) * | 1980-11-07 | 1985-10-25 | Thomson Csf | Procede de detection de la frequence de melodie dans un signal de parole et dispositif destine a la mise en oeuvre de ce procede |
JPS58102298A (ja) * | 1981-12-14 | 1983-06-17 | キヤノン株式会社 | 電子機器 |
US4623761A (en) * | 1984-04-18 | 1986-11-18 | Golden Enterprises, Incorporated | Telephone operator voice storage and retrieval system |
US5559927A (en) * | 1992-08-19 | 1996-09-24 | Clynes; Manfred | Computer system producing emotionally-expressive speech messages |
AU3400395A (en) * | 1994-09-12 | 1996-03-29 | Atr Human Information Processing Research Laboratories Co.,Ltd. | Sound characteristic convertor, sound/label associating apparatus and method to form them |
US5956685A (en) * | 1994-09-12 | 1999-09-21 | Arcadia, Inc. | Sound characteristic converter, sound-label association apparatus and method therefor |
DE19503419A1 (de) * | 1995-02-03 | 1996-08-08 | Bosch Gmbh Robert | Verfahren und Einrichtung zur Ausgabe von digital codierten Verkehrsmeldungen mittels synthetisch erzeugter Sprache |
JPH08263094A (ja) * | 1995-03-10 | 1996-10-11 | Winbond Electron Corp | メロディを混合した音声を発生する合成器 |
EP0762384A2 (fr) * | 1995-09-01 | 1997-03-12 | AT&T IPM Corp. | Procédé et dispositif de modification de caractéristiques de voix pour parole synthétisée |
DE19610019C2 (de) * | 1996-03-14 | 1999-10-28 | Data Software Gmbh G | Digitales Sprachsyntheseverfahren |
US6226614B1 (en) * | 1997-05-21 | 2001-05-01 | Nippon Telegraph And Telephone Corporation | Method and apparatus for editing/creating synthetic speech message and recording medium with the method recorded thereon |
JP3616250B2 (ja) * | 1997-05-21 | 2005-02-02 | 日本電信電話株式会社 | 合成音声メッセージ作成方法、その装置及びその方法を記録した記録媒体 |
1998
- 1998-09-11 DE DE19841683A patent/DE19841683A1/de not_active Withdrawn
1999
- 1999-09-10 AU AU60813/99A patent/AU769036B2/en not_active Ceased
- 1999-09-10 EP EP99947314A patent/EP1110203B1/fr not_active Expired - Lifetime
- 1999-09-10 CA CA002343071A patent/CA2343071A1/fr not_active Abandoned
- 1999-09-10 AT AT99947314T patent/ATE222393T1/de not_active IP Right Cessation
- 1999-09-10 DE DE59902365T patent/DE59902365D1/de not_active Expired - Fee Related
- 1999-09-10 JP JP2000570766A patent/JP2002525663A/ja not_active Withdrawn
- 1999-09-10 WO PCT/EP1999/006712 patent/WO2000016310A1/fr active IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
DE19841683A1 (de) | 2000-05-11 |
AU6081399A (en) | 2000-04-03 |
CA2343071A1 (fr) | 2000-03-23 |
AU769036B2 (en) | 2004-01-15 |
ATE222393T1 (de) | 2002-08-15 |
DE59902365D1 (de) | 2002-09-19 |
WO2000016310A1 (fr) | 2000-03-23 |
EP1110203A1 (fr) | 2001-06-27 |
JP2002525663A (ja) | 2002-08-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0886853B1 (fr) | Procede de synthese vocale a base de microsegments | |
DE60216069T2 (de) | Sprache-zu-sprache erzeugungssystem und verfahren | |
DE60112512T2 (de) | Kodierung von Ausdruck in Sprachsynthese | |
DE69821673T2 (de) | Verfahren und Vorrichtung zum Editieren synthetischer Sprachnachrichten, sowie Speichermittel mit dem Verfahren | |
Jilka | The contribution of intonation to the perception of foreign accent | |
DE69506037T2 (de) | Audioausgabeeinheit und Methode | |
DE69028072T2 (de) | Verfahren und Einrichtung zur Sprachsynthese | |
DE60035001T2 (de) | Sprachsynthese mit Prosodie-Mustern | |
EP1105867B1 (fr) | Procede et dispositif permettant de concatener des segments audio en tenant compte de la coarticulation | |
EP3010014B1 (fr) | Procede d'interpretation de reconnaissance vocale automatique | |
EP1110203B1 (fr) | Procede et dispositif de traitement numerique de la voix | |
Schröder | Can emotions be synthesized without controlling voice quality | |
EP0058130B1 (fr) | Procédé pour la synthèse de la parole avec un vocabulaire illimité et dispositif pour la mise en oeuvre dudit procédé | |
EP1344211B1 (fr) | Vorrichtung und verfahren zur differenzierten sprachausgabe | |
DE60305944T2 (de) | Verfahren zur synthese eines stationären klangsignals | |
DE60311482T2 (de) | Verfahren zur steuerung der dauer bei der sprachsynthese | |
JP2577372B2 (ja) | 音声合成装置および方法 | |
Pearson et al. | Combining concatenation and formant synthesis for improved intelligibility and naturalness in text-to-speech systems | |
DE69329375T2 (de) | Verfahren zur Realisierung von Tonkurven für Sprachnachrichten und Verfahren zur Sprachsynthese und Einrichtung zu seiner Anwendung | |
DE19837661C2 (de) | Verfahren und Vorrichtung zur koartikulationsgerechten Konkatenation von Audiosegmenten | |
EP3144929A1 (fr) | Génération synthétique d'un signal vocale ayant un son naturel | |
WO2023222287A1 (fr) | Synthétiseur vocal et procédé de synthèse vocale | |
EP1212748A1 (fr) | Procede numerique de synthese de la parole avec simulation des intonations | |
Murray | Emotion in concatenated speech | |
Vanderslice et al. | Synthetic Intonation. |
Legal Events
Code | Title | Description |
---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase | Free format text: ORIGINAL CODE: 0009012 |
17P | Request for examination filed | Effective date: 20010322 |
AK | Designated contracting states | Kind code of ref document: A1; Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
GRAG | Despatch of communication of intention to grant | Free format text: ORIGINAL CODE: EPIDOS AGRA |
17Q | First examination report despatched | Effective date: 20011015 |
GRAG | Despatch of communication of intention to grant | Free format text: ORIGINAL CODE: EPIDOS AGRA |
GRAH | Despatch of communication of intention to grant a patent | Free format text: ORIGINAL CODE: EPIDOS IGRA |
GRAH | Despatch of communication of intention to grant a patent | Free format text: ORIGINAL CODE: EPIDOS IGRA |
GRAA | (expected) grant | Free format text: ORIGINAL CODE: 0009210 |
AK | Designated contracting states | Kind code of ref document: B1; Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: NL; Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; Effective date: 20020814. Ref country code: IT; Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.; Effective date: 20020814. Ref country code: GR; Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; Effective date: 20020814. Ref country code: FI; Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; Effective date: 20020814 |
REF | Corresponds to: | Ref document number: 222393; Country of ref document: AT; Date of ref document: 20020815; Kind code of ref document: T |
REG | Reference to a national code | Ref country code: GB; Ref legal event code: FG4D; Free format text: NOT ENGLISH |
REG | Reference to a national code | Ref country code: CH; Ref legal event code: EP |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: LU; Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES; Effective date: 20020910 |
REG | Reference to a national code | Ref country code: IE; Ref legal event code: FG4D; Free format text: GERMAN |
REF | Corresponds to: | Ref document number: 59902365; Country of ref document: DE; Date of ref document: 20020919 |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: CY; Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; Effective date: 20020930. Ref country code: BE; Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES; Effective date: 20020930 |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: SE; Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; Effective date: 20021114. Ref country code: DK; Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; Effective date: 20021114 |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: PT; Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; Effective date: 20021202 |
GBT | Gb: translation of ep patent filed (gb section 77(6)(a)/1977) | Effective date: 20021223 |
NLV1 | Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act | |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: ES; Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; Effective date: 20030228 |
ET | Fr: translation filed | |
BERE | Be: lapsed | Owner name: *KULL HANS; Effective date: 20020930 |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: MC; Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES; Effective date: 20030401 |
REG | Reference to a national code | Ref country code: CH; Ref legal event code: NV; Representative's name: SCHMAUDER & PARTNER AG PATENTANWALTSBUERO |
PLBE | No opposition filed within time limit | Free format text: ORIGINAL CODE: 0009261 |
STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] | Ref country code: IE; Payment date: 20030730; Year of fee payment: 5 |
26N | No opposition filed | Effective date: 20030515 |
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] | Ref country code: GB; Payment date: 20030821; Year of fee payment: 5 |
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] | Ref country code: FR; Payment date: 20030918; Year of fee payment: 5 |
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] | Ref country code: AT; Payment date: 20030922; Year of fee payment: 5 |
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] | Ref country code: CH; Payment date: 20030923; Year of fee payment: 5 |
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] | Ref country code: DE; Payment date: 20030930; Year of fee payment: 5 |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: IE; Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES; Effective date: 20040910. Ref country code: GB; Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES; Effective date: 20040910. Ref country code: AT; Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES; Effective date: 20040910 |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: LI; Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES; Effective date: 20040930. Ref country code: CH; Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES; Effective date: 20040930 |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: DE; Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES; Effective date: 20050401 |
GBPC | Gb: european patent ceased through non-payment of renewal fee | Effective date: 20040910 |
REG | Reference to a national code | Ref country code: CH; Ref legal event code: PL |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: FR; Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES; Effective date: 20050531 |
REG | Reference to a national code | Ref country code: IE; Ref legal event code: MM4A |
REG | Reference to a national code | Ref country code: FR; Ref legal event code: ST |