ES2106669A1 - Metodo relativo a la sinteis del habla y disposicion correspondiente. - Google Patents

Metodo relativo a la sinteis del habla y disposicion correspondiente.

Info

Publication number
ES2106669A1
ES2106669A1 ES09402427A ES9402427A ES2106669A1 ES 2106669 A1 ES2106669 A1 ES 2106669A1 ES 09402427 A ES09402427 A ES 09402427A ES 9402427 A ES9402427 A ES 9402427A ES 2106669 A1 ES2106669 A1 ES 2106669A1
Authority
ES
Spain
Prior art keywords
phoneme
phonemes
elements
carrying
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
ES09402427A
Other languages
English (en)
Other versions
ES2106669B1 (es
Inventor
Tomas Svensson
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Telia AB
Original Assignee
Telia AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telia AB filed Critical Telia AB
Publication of ES2106669A1 publication Critical patent/ES2106669A1/es
Application granted granted Critical
Publication of ES2106669B1 publication Critical patent/ES2106669B1/es
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Machine Translation (AREA)
  • Processing Or Creating Images (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Electric Clocks (AREA)
  • Document Processing Apparatus (AREA)

Abstract

METODO RELATIVO A LA SINTESIS DEL HABLA Y DISPOSICION CORRESPONDIENTE; SE REFIEREN A LA TRANSFORMACION DE FONEMAS EN UN TIEMPO MAS CORTO O MAS LARGO QUE EL FONEMA EXISTENTE. LA TRANSFORMACION TIENE LUGAR DE MANERA ASIMETRICA DE FORMA QUE UN FONEMA BASICO ES DIVIDIDO EN UN NUMERO DE PUNTOS, SIENDO DICHOS PUNTOS IDENTIFICADOS RESPECTO A LOS ELEMENTOS QUE LLEVAN INFORMACION DEL FONEMA. ESTO PROPORCIONA UNA PONDERACION DEL FONEMA ENTRE ELEMENTOS QUE LLEVAN INFORMACION Y ELEMENTOS QUE LLEVAN MENOS INFORMACION. LAS PARTES CUYOS ELEMENTOS LLEVAN MENOS INFORMACION SON TRANSFORMADAS EN UN INTERVALO DE TIEMPO MAYOR O MENOR. LOS ELEMENTOS DEL FONEMA QUE REPRESENTAN PARTES QUE LLEVAN INFORMACION SON TRANSFORMADOS SIN MODIFICAR EN EL TIEMPO. ESTO PROPORCIONA UNA TRANSFORMACION DEL FONEMA QUE CONSERVA SU CARACTER ORIGINAL EN TODO LO ESENCIAL.IDENTIFICADAS LAS PARTES DEL FONEMA QUE LLEVAN MENOS INFORMACION, LA INVENCION TAMBIEN PROPORCIONA UNA INDICACION DE DONDE PUEDEN ADAPTARSE DIFERENTES FONEMAS DENTRODE OTROS PARA LA CREACION DE HABLA ARTIFICIAL.
ES09402427A 1993-11-25 1994-11-25 Metodo relativo a la sintesis del habla y disposicion correspondiente. Expired - Lifetime ES2106669B1 (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
SE9303902A SE516521C2 (sv) 1993-11-25 1993-11-25 Anordning och förfarande vid talsyntes

Publications (2)

Publication Number Publication Date
ES2106669A1 true ES2106669A1 (es) 1997-11-01
ES2106669B1 ES2106669B1 (es) 1998-06-01

Family

ID=20391875

Family Applications (1)

Application Number Title Priority Date Filing Date
ES09402427A Expired - Lifetime ES2106669B1 (es) 1993-11-25 1994-11-25 Metodo relativo a la sintesis del habla y disposicion correspondiente.

Country Status (10)

Country Link
US (1) US5729657A (es)
AU (1) AU676389B2 (es)
CH (1) CH689883A5 (es)
DE (1) DE4441906C2 (es)
ES (1) ES2106669B1 (es)
FR (1) FR2713006B1 (es)
GB (1) GB2284328B (es)
IT (1) IT1276336B1 (es)
NL (1) NL194481C (es)
SE (1) SE516521C2 (es)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ES2118424T3 (es) * 1993-08-04 1998-09-16 British Telecomm Sintesis de voz mediante la conversion de fonemas en formas de onda digitales.
US7089184B2 (en) * 2001-03-22 2006-08-08 Nurv Center Technologies, Inc. Speech recognition for recognizing speaker-independent, continuous speech
EP1543503B1 (en) * 2002-09-17 2007-01-24 Koninklijke Philips Electronics N.V. Method for controlling duration in speech synthesis
JP4455633B2 (ja) * 2007-09-10 2010-04-21 株式会社東芝 基本周波数パターン生成装置、基本周波数パターン生成方法及びプログラム
JP6047922B2 (ja) * 2011-06-01 2016-12-21 ヤマハ株式会社 音声合成装置および音声合成方法
JP6992612B2 (ja) * 2018-03-09 2022-01-13 ヤマハ株式会社 音声処理方法および音声処理装置

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4406001A (en) * 1980-08-18 1983-09-20 The Variable Speech Control Company ("Vsc") Time compression/expansion with synchronized individual pitch correction of separate components
US4435832A (en) * 1979-10-01 1984-03-06 Hitachi, Ltd. Speech synthesizer having speech time stretch and compression functions
US4864620A (en) * 1987-12-21 1989-09-05 The Dsp Group, Inc. Method for performing time-scale modification of speech information or speech signals
EP0392049A1 (de) * 1989-04-12 1990-10-17 Siemens Aktiengesellschaft Verfahren zur Dehnung oder Raffung eines Zeitsignals
US5216744A (en) * 1991-03-21 1993-06-01 Dictaphone Corporation Time scale modification of speech signals

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3158685A (en) * 1961-05-04 1964-11-24 Bell Telephone Labor Inc Synthesis of speech from code signals
FR1602936A (es) * 1968-12-31 1971-02-22
US3704345A (en) * 1971-03-19 1972-11-28 Bell Telephone Labor Inc Conversion of printed text into synthetic speech
US4214125A (en) * 1977-01-21 1980-07-22 Forrest S. Mozer Method and apparatus for speech synthesizing
JPS55147697A (en) * 1979-05-07 1980-11-17 Sharp Kk Sound synthesizer
US4435831A (en) * 1981-12-28 1984-03-06 Mozer Forrest Shrago Method and apparatus for time domain compression and synthesis of unvoiced audible signals
US4700301A (en) * 1983-11-02 1987-10-13 Dyke Howard L Method of automatically steering agricultural type vehicles
US4692941A (en) * 1984-04-10 1987-09-08 First Byte Real-time text-to-speech conversion system
US4701937A (en) * 1985-05-13 1987-10-20 Industrial Technology Research Institute Republic Of China Signal storage and replay system
JPH0632020B2 (ja) * 1986-03-25 1994-04-27 インタ−ナシヨナル ビジネス マシ−ンズ コ−ポレ−シヨン 音声合成方法および装置
US4802221A (en) * 1986-07-21 1989-01-31 Ncr Corporation Digital system and method for compressing speech signals for storage and transmission
US4833718A (en) * 1986-11-18 1989-05-23 First Byte Compression of stored waveforms for artificial speech
US5189702A (en) * 1987-02-16 1993-02-23 Canon Kabushiki Kaisha Voice processing apparatus for varying the speed with which a voice signal is reproduced
JPS63285598A (ja) * 1987-05-18 1988-11-22 ケイディディ株式会社 音素接続形パラメ−タ規則合成方式
FR2636163B1 (fr) * 1988-09-02 1991-07-05 Hamon Christian Procede et dispositif de synthese de la parole par addition-recouvrement de formes d'onde
JP3278863B2 (ja) * 1991-06-05 2002-04-30 株式会社日立製作所 音声合成装置
US5175769A (en) * 1991-07-23 1992-12-29 Rolm Systems Method for time-scale modification of signals
DE69228211T2 (de) * 1991-08-09 1999-07-08 Koninklijke Philips Electronics N.V., Eindhoven Verfahren und Apparat zur Handhabung von Höhe und Dauer eines physikalischen Audiosignals

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4435832A (en) * 1979-10-01 1984-03-06 Hitachi, Ltd. Speech synthesizer having speech time stretch and compression functions
US4406001A (en) * 1980-08-18 1983-09-20 The Variable Speech Control Company ("Vsc") Time compression/expansion with synchronized individual pitch correction of separate components
US4864620A (en) * 1987-12-21 1989-09-05 The Dsp Group, Inc. Method for performing time-scale modification of speech information or speech signals
EP0392049A1 (de) * 1989-04-12 1990-10-17 Siemens Aktiengesellschaft Verfahren zur Dehnung oder Raffung eines Zeitsignals
US5216744A (en) * 1991-03-21 1993-06-01 Dictaphone Corporation Time scale modification of speech signals

Also Published As

Publication number Publication date
FR2713006A1 (fr) 1995-06-02
GB2284328B (en) 1998-01-28
DE4441906C2 (de) 2003-02-13
US5729657A (en) 1998-03-17
ITRM940763A1 (it) 1996-05-23
NL9401964A (nl) 1995-06-16
NL194481B (nl) 2002-01-02
GB9423236D0 (en) 1995-01-04
SE9303902D0 (sv) 1993-11-25
ES2106669B1 (es) 1998-06-01
IT1276336B1 (it) 1997-10-28
DE4441906A1 (de) 1995-06-01
GB2284328A (en) 1995-05-31
CH689883A5 (de) 1999-12-31
AU7885694A (en) 1995-06-01
AU676389B2 (en) 1997-03-06
FR2713006B1 (fr) 1998-03-20
ITRM940763A0 (it) 1994-11-23
SE9303902L (sv) 1995-05-26
NL194481C (nl) 2002-05-03
SE516521C2 (sv) 2002-01-22

Similar Documents

Publication Publication Date Title
AU1191899A (en) System and method for representing complex information auditorially
TW347619B (en) A communication system and method using a speaker dependent time-scaling technique a method for time-scale modification of speech using a modified version of the Waveform Similarity based Overlap-Add technique (WSOLA).
EP1005018A3 (en) Speech synthesis employing prosody templates
EP0168650A3 (en) Method of designing a logic circuitry
WO1998044643A3 (en) Audio interface for document based information resource navigation and method therefor
EP0825586A3 (en) Lexical tree pre-filtering in speech recognition
HK40496A (en) Word recognition in a speech recognition system using data reduced word templates
CA2052769A1 (en) Midi file translation
EP1675101A3 (en) Singing voice-synthesizing method and apparatus and storage medium
PL335150A1 (en) Method of and apparatus for recognising spoken information
TW357313B (en) Methods and apparatus for handwriting recognition
WO2003065349A3 (en) Text to speech
EP0770989A3 (en) Speech encoding method and apparatus
MX9505299A (es) Sistemas, metodos y articulos de fabricacion para realizar la hipotesizacion de n-cadenas optimas de alta resolucion.
CA2210887A1 (en) Method and apparatus for speech recognition adapted to an individual speaker
DE69813180D1 (de) Kontextabhängige phonemnetzwerke zur kodierung von sprachinformation
EP1071074A3 (en) Speech synthesis employing prosody templates
DE3275779D1 (en) Recognition of speech or speech-like sounds
MY141708A (en) Hmm-based text-to-phoneme parser and method for training same
GB9017697D0 (en) Pattern comparison in pattern recognition
ES2106669A1 (es) Metodo relativo a la sinteis del habla y disposicion correspondiente.
BR9204112A (pt) Processo e aparelho para o ensino de linguas
EP0865026A3 (de) Effizientes Verfahren zur Geschwindigkeitsmodifikation von Sprachsignalen
AU584130B2 (en) Apparatus and method for identifying spoken words
Holm et al. Generating prosody by superposing multi-parametric overlapping contours.

Legal Events

Date Code Title Description
EC2A Search report published

Date of ref document: 19971101

Kind code of ref document: A1

Effective date: 19971101