DE69022237T2 - Sprachsyntheseeinrichtung nach dem phonetischen Hidden-Markov-Modell. - Google Patents

Sprachsyntheseeinrichtung nach dem phonetischen Hidden-Markov-Modell.

Info

Publication number
DE69022237T2
DE69022237T2 DE69022237T DE69022237T DE69022237T2 DE 69022237 T2 DE69022237 T2 DE 69022237T2 DE 69022237 T DE69022237 T DE 69022237T DE 69022237 T DE69022237 T DE 69022237T DE 69022237 T2 DE69022237 T2 DE 69022237T2
Authority
DE
Germany
Prior art keywords
hidden markov
device based
speech
speech synthesis
markov model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
DE69022237T
Other languages
English (en)
Other versions
DE69022237D1 (de
Inventor
Massimo Giustiniani
Piero Pierucci
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of DE69022237D1 publication Critical patent/DE69022237D1/de
Application granted granted Critical
Publication of DE69022237T2 publication Critical patent/DE69022237T2/de
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
DE69022237T 1990-10-16 1990-10-16 Sprachsyntheseeinrichtung nach dem phonetischen Hidden-Markov-Modell. Expired - Fee Related DE69022237T2 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP90119789A EP0481107B1 (de) 1990-10-16 1990-10-16 Sprachsyntheseeinrichtung nach dem phonetischen Hidden-Markov-Modell

Publications (2)

Publication Number Publication Date
DE69022237D1 DE69022237D1 (de) 1995-10-12
DE69022237T2 true DE69022237T2 (de) 1996-05-02

Family

ID=8204620

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69022237T Expired - Fee Related DE69022237T2 (de) 1990-10-16 1990-10-16 Sprachsyntheseeinrichtung nach dem phonetischen Hidden-Markov-Modell.

Country Status (4)

Country Link
US (1) US5230037A (de)
EP (1) EP0481107B1 (de)
JP (1) JP2826215B2 (de)
DE (1) DE69022237T2 (de)

Families Citing this family (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR940002854B1 (ko) * 1991-11-06 1994-04-04 한국전기통신공사 음성 합성시스팀의 음성단편 코딩 및 그의 피치조절 방법과 그의 유성음 합성장치
US5606645A (en) * 1992-02-28 1997-02-25 Kabushiki Kaisha Toshiba Speech pattern recognition apparatus utilizing multiple independent sequences of phonetic segments
US5383120A (en) * 1992-03-02 1995-01-17 General Electric Company Method for tagging collocations in text
EP0577488B9 (de) * 1992-06-29 2007-10-03 Nippon Telegraph And Telephone Corporation Verfahren und Vorrichtung zur Sprachkodierung
WO1995010832A1 (en) * 1993-10-15 1995-04-20 At & T Corp. A method for training a system, the resulting apparatus, and method of use thereof
JP3450411B2 (ja) * 1994-03-22 2003-09-22 キヤノン株式会社 音声情報処理方法及び装置
GB2290684A (en) * 1994-06-22 1996-01-03 Ibm Speech synthesis using hidden Markov model to determine speech unit durations
US5633983A (en) * 1994-09-13 1997-05-27 Lucent Technologies Inc. Systems and methods for performing phonemic synthesis
US5497337A (en) * 1994-10-21 1996-03-05 International Business Machines Corporation Method for designing high-Q inductors in silicon technology without expensive metalization
GB2296846A (en) * 1995-01-07 1996-07-10 Ibm Synthesising speech from text
US5719996A (en) * 1995-06-30 1998-02-17 Motorola, Inc. Speech recognition in selective call systems
US6038533A (en) * 1995-07-07 2000-03-14 Lucent Technologies Inc. System and method for selecting training text
US5822731A (en) * 1995-09-15 1998-10-13 Infonautics Corporation Adjusting a hidden Markov model tagger for sentence fragments
US5832441A (en) * 1996-09-16 1998-11-03 International Business Machines Corporation Creating speech models
JPH10153998A (ja) * 1996-09-24 1998-06-09 Nippon Telegr & Teleph Corp <Ntt> 補助情報利用型音声合成方法、この方法を実施する手順を記録した記録媒体、およびこの方法を実施する装置
US5950162A (en) * 1996-10-30 1999-09-07 Motorola, Inc. Method, device and system for generating segment durations in a text-to-speech system
US5933805A (en) * 1996-12-13 1999-08-03 Intel Corporation Retaining prosody during speech analysis for later playback
JPH10260692A (ja) * 1997-03-18 1998-09-29 Toshiba Corp 音声の認識合成符号化/復号化方法及び音声符号化/復号化システム
JP3033514B2 (ja) * 1997-03-31 2000-04-17 日本電気株式会社 大語彙音声認識方法及び装置
EP1016077B1 (de) * 1997-09-17 2001-05-16 Siemens Aktiengesellschaft Verfahren zur bestimmung einer wahrscheinlichkeit für das auftreten einer folge von mindestens zwei wörtern bei einer spracherkennung
WO2000030069A2 (en) * 1998-11-13 2000-05-25 Lernout & Hauspie Speech Products N.V. Speech synthesis using concatenation of speech waveforms
US6185533B1 (en) * 1999-03-15 2001-02-06 Matsushita Electric Industrial Co., Ltd. Generation and synthesis of prosody templates
DE19915648A1 (de) * 1999-04-07 2000-10-12 Rohde & Schwarz Verfahren zum Bewerten der Sprachqualität von Telefonverbindungen
US6178402B1 (en) 1999-04-29 2001-01-23 Motorola, Inc. Method, apparatus and system for generating acoustic parameters in a text-to-speech system using a neural network
US7219056B2 (en) * 2000-04-20 2007-05-15 International Business Machines Corporation Determining and using acoustic confusability, acoustic perplexity and synthetic acoustic word error rate
US6999926B2 (en) * 2000-11-16 2006-02-14 International Business Machines Corporation Unsupervised incremental adaptation using maximum likelihood spectral transformation
US7124082B2 (en) * 2002-10-11 2006-10-17 Twisted Innovations Phonetic speech-to-text-to-speech system and method
US7593845B2 (en) * 2003-10-06 2009-09-22 Microsoflt Corporation Method and apparatus for identifying semantic structures from text
US7412377B2 (en) 2003-12-19 2008-08-12 International Business Machines Corporation Voice model for speech processing based on ordered average ranks of spectral features
JP2006047866A (ja) * 2004-08-06 2006-02-16 Canon Inc 電子辞書装置およびその制御方法
CN1755796A (zh) * 2004-09-30 2006-04-05 国际商业机器公司 文本到语音转换中基于统计技术的距离定义方法和系统
US7684988B2 (en) * 2004-10-15 2010-03-23 Microsoft Corporation Testing and tuning of automatic speech recognition systems using synthetic inputs generated from its acoustic models
US20060136215A1 (en) * 2004-12-21 2006-06-22 Jong Jin Kim Method of speaking rate conversion in text-to-speech system
US20070213987A1 (en) * 2006-03-08 2007-09-13 Voxonic, Inc. Codebook-less speech conversion method and system
JP2008263543A (ja) * 2007-04-13 2008-10-30 Funai Electric Co Ltd 記録再生装置
US8321222B2 (en) * 2007-08-14 2012-11-27 Nuance Communications, Inc. Synthesis by generation and concatenation of multi-form segments
US20090248647A1 (en) * 2008-03-25 2009-10-01 Omer Ziv System and method for the quality assessment of queries
CN102047321A (zh) * 2008-05-30 2011-05-04 诺基亚公司 用于提供改进的语音合成的方法、设备和计算机程序产品
DK2242045T3 (da) * 2009-04-16 2012-09-24 Univ Mons Talesyntese og kodningsfremgangsmåder
US9299338B2 (en) * 2010-11-08 2016-03-29 Nec Corporation Feature sequence generating device, feature sequence generating method, and feature sequence generating program
WO2012134877A2 (en) * 2011-03-25 2012-10-04 Educational Testing Service Computer-implemented systems and methods evaluating prosodic features of speech
CN103531196B (zh) * 2013-10-15 2016-04-13 中国科学院自动化研究所 一种波形拼接语音合成的选音方法
WO2016011189A1 (en) * 2014-07-15 2016-01-21 The Regents Of The University Of California Frequency-multiplexed speech-sound stimuli for hierarchical neural characterization of speech processing
US10002543B2 (en) * 2014-11-04 2018-06-19 Knotbird LLC System and methods for transforming language into interactive elements
JP6672114B2 (ja) * 2016-09-13 2020-03-25 本田技研工業株式会社 会話メンバー最適化装置、会話メンバー最適化方法およびプログラム
CN109087630B (zh) * 2018-08-29 2020-09-15 深圳追一科技有限公司 语音识别的方法及相关装置

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4882759A (en) * 1986-04-18 1989-11-21 International Business Machines Corporation Synthesizing word baseforms used in speech recognition
US4852180A (en) * 1987-04-03 1989-07-25 American Telephone And Telegraph Company, At&T Bell Laboratories Speech recognition by acoustic/phonetic system and technique
JPH01159698A (ja) * 1987-12-16 1989-06-22 Matsushita Electric Ind Co Ltd パターン認識用モデル作成装置
JP2545914B2 (ja) * 1988-02-09 1996-10-23 日本電気株式会社 音声認識方法
US5033087A (en) * 1989-03-14 1991-07-16 International Business Machines Corp. Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system

Also Published As

Publication number Publication date
DE69022237D1 (de) 1995-10-12
US5230037A (en) 1993-07-20
EP0481107A1 (de) 1992-04-22
EP0481107B1 (de) 1995-09-06
JPH04313034A (ja) 1992-11-05
JP2826215B2 (ja) 1998-11-18

Similar Documents

Publication Publication Date Title
DE69022237T2 (de) Sprachsyntheseeinrichtung nach dem phonetischen Hidden-Markov-Modell.
MX9703138A (es) Reconocimiento de lenguaje.
CA2069675A1 (en) Flexible vocabulary recognition
JPH08512150A (ja) ニューラル・ネットワークを利用してテキストを可聴信号に変換する方法および装置
WO1996023298A3 (en) System amd method for generating and using context dependent sub-syllable models to recognize a tonal language
EP0059880A3 (de) System zur Synthese der Sprache aus einem Text
WO2003019528A1 (fr) Procede de production d&#39;intonation, dispositif de synthese de signaux vocaux fonctionnant selon ledit procede et serveur vocal
O'Malley Text-to-speech conversion technology
JPS57158900A (en) Text voice synthesizer
Kayte et al. Hidden Markov model based speech synthesis: A review
Lee et al. Voice response systems
JP3437064B2 (ja) 音声合成装置
EP0982684A4 (de) Bewegende blder generierende vorrichtung und bildkontrollnetzwerk-lernvorrichtung
JPH07200554A (ja) 文章読み上げ装置
Ladefoged The phonetic specification of the languages of the world
JPS5854400A (ja) 音声出力編集方式
KR100429978B1 (ko) 음성합성시스템의음질저하방지장치
JPH0667685A (ja) 音声合成装置
JPH02247696A (ja) テキスト音声合成装置
O'Shaughnessy Fundamental frequency by rule for a text-to-speech system
Heggtveit An overview of text-to-speech synthesis
Chowdhury Concatenative Text-to-speech synthesis: A study on standard colloquial bengali
GB1224137A (en) Speech synthesis system
JP2581130B2 (ja) 音韻継続時間長決定装置
Dorffner et al. GRAPHON-the Vienna speech systhesis system for arbitrary German text

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8339 Ceased/non-payment of the annual fee