ES2173389T3 - Procedimiento y dispositivo para la sintesis de señales vocales. - Google Patents
Procedimiento y dispositivo para la sintesis de señales vocales.Info
- Publication number
- ES2173389T3 ES2173389T3 ES97305349T ES97305349T ES2173389T3 ES 2173389 T3 ES2173389 T3 ES 2173389T3 ES 97305349 T ES97305349 T ES 97305349T ES 97305349 T ES97305349 T ES 97305349T ES 2173389 T3 ES2173389 T3 ES 2173389T3
- Authority
- ES
- Spain
- Prior art keywords
- speech
- trozo
- type
- talk
- transcription
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000015572 biosynthetic process Effects 0.000 title abstract 3
- 238000003786 synthesis reaction Methods 0.000 title abstract 3
- 238000000034 method Methods 0.000 title abstract 2
- 230000001755 vocal effect Effects 0.000 title 1
- 238000013518 transcription Methods 0.000 abstract 3
- 230000035897 transcription Effects 0.000 abstract 3
- 230000015556 catabolic process Effects 0.000 abstract 1
- 238000006731 degradation reaction Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
- G10L13/07—Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Telephonic Communication Services (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
UN APARATO DE SINTESIS DEL HABLA QUE DEFORMA Y CONECTA LOS TROZOS DEL HABLA PARA SINTETIZAR EL HABLA, TIENE UNA BASE DE DATOS, DE FORMAS DE ONDA DEL HABLA, PARA LOS DATOS DE ALMACENAMIENTO DE UN TIPO DE ACENTO DE UN TROZO DEL HABLA DE UNA PALABRA O UNA SILABA PRONUNCIADA CON UN ACENTO DEL TIPO 0 Y UN ACENTO DE TIPO 1, LOS DATOS DE LA TRANSCRIPCION FONEMICA DEL TROZO DEL HABLA Y LOS DATOS DE UNA POSICION, EN LA CUAL EL TROZO DEL HABLA SE PUEDE SEGMENTAR, UNA MEMORIA TAMPON DE ENTRADA PARA ALMACENAR UNA CADENA DE CARACTERES DE TRANSCRIPCION FONEMICA Y DE LA PROSODIA DEL HABLA QUE SE VA A SINTETIZAR, UNA UNIDAD SELECTORA DE LA UNIDAD DE SINTESIS, PARA RECUPERAR LOS TROZOS DEL HABLA CANDIDATOS DE LA BASE DE DATOS DE FORMAS DE ONDA DEL HABLA, BASANDOSE EN LA CADENA DE CARACTERES DE LA TRANSCRIPCION FONEMICA DE LA MEMORIA TAMPON DE ENTRADA, Y UNA UNIDAD SELECTORA DE TROZOS DEL HABLA UTILIZADOS, PARA DETERMINAR QUE SE USE PRACTICAMENTE UN TROZO DEL HABLA, ENTRE LOS CANDIDATOS RECUPERADOS, DE ACUERDO CON UN TIPO DE ACENTO DEL HABLA QUE SE VA A SINTETIZAR Y DE ACUERDO CON UNA POSICION EN EL HABLA, EN LA CUAL, SE USA EL TROZO DEL HABLA, POR TANTO, SE PREVIENE LA DEGRADACION DE LA CALIDAD DEL SONIDO CUANDO SE PROCESA EL TROZO DEL HABLA.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP8196635A JPH1039895A (ja) | 1996-07-25 | 1996-07-25 | 音声合成方法および装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
ES2173389T3 true ES2173389T3 (es) | 2002-10-16 |
Family
ID=16361051
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ES97305349T Expired - Lifetime ES2173389T3 (es) | 1996-07-25 | 1997-07-17 | Procedimiento y dispositivo para la sintesis de señales vocales. |
Country Status (6)
Country | Link |
---|---|
US (1) | US6035272A (es) |
EP (1) | EP0821344B1 (es) |
JP (1) | JPH1039895A (es) |
CN (1) | CN1175052A (es) |
DE (1) | DE69710525T2 (es) |
ES (1) | ES2173389T3 (es) |
Families Citing this family (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3587048B2 (ja) * | 1998-03-02 | 2004-11-10 | 株式会社日立製作所 | 韻律制御方法及び音声合成装置 |
JP3180764B2 (ja) * | 1998-06-05 | 2001-06-25 | 日本電気株式会社 | 音声合成装置 |
JP3644263B2 (ja) * | 1998-07-31 | 2005-04-27 | ヤマハ株式会社 | 波形形成装置及び方法 |
US6601030B2 (en) * | 1998-10-28 | 2003-07-29 | At&T Corp. | Method and system for recorded word concatenation |
JP3361066B2 (ja) * | 1998-11-30 | 2003-01-07 | 松下電器産業株式会社 | 音声合成方法および装置 |
EP1163663A2 (en) | 1999-03-15 | 2001-12-19 | BRITISH TELECOMMUNICATIONS public limited company | Speech synthesis |
US7369994B1 (en) | 1999-04-30 | 2008-05-06 | At&T Corp. | Methods and apparatus for rapid acoustic unit selection from a large speech corpus |
JP3361291B2 (ja) * | 1999-07-23 | 2003-01-07 | コナミ株式会社 | 音声合成方法、音声合成装置及び音声合成プログラムを記録したコンピュータ読み取り可能な媒体 |
DE19942171A1 (de) * | 1999-09-03 | 2001-03-15 | Siemens Ag | Verfahren zur Satzendebestimmung in der automatischen Sprachverarbeitung |
JP2001100776A (ja) * | 1999-09-30 | 2001-04-13 | Arcadia:Kk | 音声合成装置 |
GB0029022D0 (en) * | 2000-11-29 | 2001-01-10 | Hewlett Packard Co | Locality-dependent presentation |
US20040030555A1 (en) * | 2002-08-12 | 2004-02-12 | Oregon Health & Science University | System and method for concatenating acoustic contours for speech synthesis |
DE04735990T1 (de) * | 2003-06-05 | 2006-10-05 | Kabushiki Kaisha Kenwood, Hachiouji | Sprachsynthesevorrichtung, sprachsyntheseverfahren und programm |
US7577568B2 (en) * | 2003-06-10 | 2009-08-18 | At&T Intellctual Property Ii, L.P. | Methods and system for creating voice files using a VoiceXML application |
JP4080989B2 (ja) * | 2003-11-28 | 2008-04-23 | 株式会社東芝 | 音声合成方法、音声合成装置および音声合成プログラム |
US8666746B2 (en) * | 2004-05-13 | 2014-03-04 | At&T Intellectual Property Ii, L.P. | System and method for generating customized text-to-speech voices |
CN1787072B (zh) * | 2004-12-07 | 2010-06-16 | 北京捷通华声语音技术有限公司 | 基于韵律模型和参数选音的语音合成方法 |
JP4551803B2 (ja) * | 2005-03-29 | 2010-09-29 | 株式会社東芝 | 音声合成装置及びそのプログラム |
US20070038455A1 (en) * | 2005-08-09 | 2007-02-15 | Murzina Marina V | Accent detection and correction system |
US7924986B2 (en) * | 2006-01-27 | 2011-04-12 | Accenture Global Services Limited | IVR system manager |
US20080027725A1 (en) * | 2006-07-26 | 2008-01-31 | Microsoft Corporation | Automatic Accent Detection With Limited Manually Labeled Data |
CN101261831B (zh) * | 2007-03-05 | 2011-11-16 | 凌阳科技股份有限公司 | 一种音标分解与合成方法 |
US8321222B2 (en) * | 2007-08-14 | 2012-11-27 | Nuance Communications, Inc. | Synthesis by generation and concatenation of multi-form segments |
FR2993088B1 (fr) * | 2012-07-06 | 2014-07-18 | Continental Automotive France | Procede et systeme de synthese vocale |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2761552B2 (ja) * | 1988-05-11 | 1998-06-04 | 日本電信電話株式会社 | 音声合成方法 |
DE69028072T2 (de) * | 1989-11-06 | 1997-01-09 | Canon Kk | Verfahren und Einrichtung zur Sprachsynthese |
JP3070127B2 (ja) * | 1991-05-07 | 2000-07-24 | 株式会社明電舎 | 音声合成装置のアクセント成分制御方式 |
JP3083640B2 (ja) * | 1992-05-28 | 2000-09-04 | 株式会社東芝 | 音声合成方法および装置 |
JPH06250691A (ja) * | 1993-02-25 | 1994-09-09 | N T T Data Tsushin Kk | 音声合成装置 |
JPH07152392A (ja) * | 1993-11-30 | 1995-06-16 | Fujitsu Ltd | 音声合成装置 |
JP3450411B2 (ja) * | 1994-03-22 | 2003-09-22 | キヤノン株式会社 | 音声情報処理方法及び装置 |
JPH07319497A (ja) * | 1994-05-23 | 1995-12-08 | N T T Data Tsushin Kk | 音声合成装置 |
JPH086591A (ja) * | 1994-06-15 | 1996-01-12 | Sony Corp | 音声出力装置 |
JPH0863190A (ja) * | 1994-08-17 | 1996-03-08 | Meidensha Corp | 音声合成装置の文末制御方法 |
JP3085631B2 (ja) * | 1994-10-19 | 2000-09-11 | 日本アイ・ビー・エム株式会社 | 音声合成方法及びシステム |
SE514684C2 (sv) * | 1995-06-16 | 2001-04-02 | Telia Ab | Metod vid tal-till-textomvandling |
-
1996
- 1996-07-25 JP JP8196635A patent/JPH1039895A/ja active Pending
-
1997
- 1997-07-17 DE DE69710525T patent/DE69710525T2/de not_active Expired - Fee Related
- 1997-07-17 ES ES97305349T patent/ES2173389T3/es not_active Expired - Lifetime
- 1997-07-17 EP EP97305349A patent/EP0821344B1/en not_active Expired - Lifetime
- 1997-07-21 US US08/897,830 patent/US6035272A/en not_active Expired - Fee Related
- 1997-07-25 CN CN97115567.4A patent/CN1175052A/zh active Pending
Also Published As
Publication number | Publication date |
---|---|
JPH1039895A (ja) | 1998-02-13 |
DE69710525T2 (de) | 2002-07-18 |
US6035272A (en) | 2000-03-07 |
EP0821344A2 (en) | 1998-01-28 |
EP0821344A3 (en) | 1998-11-18 |
CN1175052A (zh) | 1998-03-04 |
EP0821344B1 (en) | 2002-02-20 |
DE69710525D1 (de) | 2002-03-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ES2173389T3 (es) | Procedimiento y dispositivo para la sintesis de señales vocales. | |
US5384893A (en) | Method and apparatus for speech synthesis based on prosodic analysis | |
GB1380502A (en) | Systems for the synthesis of speech from alphanumeric data | |
WO2004034377A3 (en) | Apparatus, methods and programming for speech synthesis via bit manipulations of compressed data base | |
ES2142332T3 (es) | Reconocedor automatico de voz. | |
Noonan | A tale of two passives in Irish | |
ATE374421T1 (de) | Segmentierungsverfahren zur erweiterung des aktiven vokabulars von spracherkennern | |
ES2153021T3 (es) | Procedimiento y disposicion para la conversion del habla a texto. | |
ES2047029T3 (es) | Sistema y tecnica de reconocimiento de voz. | |
Bagshaw | Phonemic transcription by analogy in text-to-speech synthesis: Novel word pronunciation and lexicon compression | |
MY141708A (en) | Hmm-based text-to-phoneme parser and method for training same | |
WO2004114253A3 (en) | Method of teaching reading | |
JPS5774799A (en) | Word voice notifying system | |
Isenberg et al. | A top‐down effect on the identification of function words | |
Sherwood | Fast text-to-speech algorithms for Esperanto, Spanish, Italian, Russian and English | |
Filipsson et al. | LUKAS-a preliminary report on a new Swedish speech synthesis | |
Sitaram et al. | Universal grapheme-based speech synthesis. | |
Bettega et al. | A Musandam Arabic Text from Lima (Oman) | |
Suñer | Spanish adverbs: support for the phonological cycle? | |
Bright | Phonological rules in literary and colloquial Kannada | |
JPS4949241B1 (es) | ||
Ziółko et al. | Statistics of diphones and triphones presence on the word boundaries in the Polish language. Applications to ASR | |
Viechnicki | The problem of voiced stops in Modern Greek: A non-linear approach | |
Huang et al. | A Chinese text-to-speech synthesis system based on an initial-final model | |
JPS58195900A (ja) | 音声入力式日本語文書処理装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG2A | Definitive protection |
Ref document number: 821344 Country of ref document: ES |