CN1146863C - Speech synthesis method and apparatus - Google Patents

Speech synthesis method and apparatus

Info

Publication number
CN1146863C
Authority
CN
China
Prior art keywords
waveform
pitch
speech
segments
synthetic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB951190490A
Other languages
English (en)
Chinese (zh)
Other versions
CN1131785A (zh)
Inventor
釜井孝浩
松井谦二
原纪代
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP6302471A (JPH08160991A)
Priority claimed from JP7220963A (JP2987089B2)
Application filed by Matsushita Electric Industrial Co Ltd
Publication of CN1131785A
Application granted
Publication of CN1146863C
Anticipated expiration
Expired - Fee Related

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electrophonic Musical Instruments (AREA)
CNB951190490A 1994-12-06 1995-12-06 Speech synthesis method and apparatus Expired - Fee Related CN1146863C (zh)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
JP6302471A JPH08160991A (ja) 1994-12-06 1994-12-06 Speech segment creation method, speech synthesis method, and apparatus
JP302,471/1994 1994-12-06
JP302,471/94 1994-12-06
JP220,963/95 1995-08-30
JP220,963/1995 1995-08-30
JP7220963A JP2987089B2 (ja) 1995-08-30 1995-08-30 Speech segment creation method, speech synthesis method, and apparatus therefor

Related Child Applications (1)

Application Number Priority Date Filing Date Title
CNB2003101028665A Division CN1294555C (zh) 1994-12-06 1995-12-06 Speech segment production method

Publications (2)

Publication Number Publication Date
CN1131785A (zh) 1996-09-25
CN1146863C (zh) 2004-04-21

Family

ID=26523998

Family Applications (2)

Application Number Priority Date Filing Date Title
CNB951190490A Expired - Fee Related CN1146863C (zh) 1994-12-06 1995-12-06 Speech synthesis method and apparatus
CNB2003101028665A Expired - Fee Related CN1294555C (zh) 1994-12-06 1995-12-06 Speech segment production method

Family Applications After (1)

Application Number Priority Date Filing Date Title
CNB2003101028665A Expired - Fee Related CN1294555C (zh) 1994-12-06 1995-12-06 Speech segment production method

Country Status (3)

Country Link
US (1) US5864812A (ko)
KR (1) KR100385603B1 (ko)
CN (2) CN1146863C (ko)

Families Citing this family (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6240384B1 (en) * 1995-12-04 2001-05-29 Kabushiki Kaisha Toshiba Speech synthesis method
DE19610019C2 (de) * 1996-03-14 1999-10-28 Data Software Gmbh G Digital speech synthesis method
JP3349905B2 (ja) * 1996-12-10 2002-11-25 松下電器産業株式会社 Speech synthesis method and apparatus
US6490562B1 (en) 1997-04-09 2002-12-03 Matsushita Electric Industrial Co., Ltd. Method and system for analyzing voices
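JP3902860B2 (ja) * 1998-03-09 2007-04-11 キヤノン株式会社 Speech synthesis control apparatus, control method therefor, and computer-readable memory
JP3430985B2 (ja) * 1999-08-05 2003-07-28 ヤマハ株式会社 Synthesized sound generating apparatus
JP3450237B2 (ja) * 1999-10-06 2003-09-22 株式会社アルカディア Speech synthesis apparatus and method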
GB9925297D0 (en) * 1999-10-27 1999-12-29 Ibm Voice processing system
JP2001265375A (ja) * 2000-03-17 2001-09-28 Oki Electric Ind Co Ltd Rule-based speech synthesis apparatus
JP3728172B2 (ja) * 2000-03-31 2005-12-21 キヤノン株式会社 Speech synthesis method and apparatus
US6662162B2 (en) * 2000-08-28 2003-12-09 Maureen Casper Method of rating motor dysfunction by assessing speech prosody
US7251601B2 (en) * 2001-03-26 2007-07-31 Kabushiki Kaisha Toshiba Speech synthesis method and speech synthesizer
ATE336774T1 (de) * 2001-05-28 2006-09-15 Texas Instruments Inc Programmable melody generator
WO2003019530A1 (fr) * 2001-08-31 2003-03-06 Kenwood Corporation Device and method for generating a pitch-assigned waveform signal; program
US6681208B2 (en) * 2001-09-25 2004-01-20 Motorola, Inc. Text-to-speech native coding in a communication system
US7483832B2 (en) * 2001-12-10 2009-01-27 At&T Intellectual Property I, L.P. Method and system for customizing voice translation of text to speech
US20060069567A1 (en) * 2001-12-10 2006-03-30 Tischer Steven N Methods, systems, and products for translating text to speech
US7065485B1 (en) * 2002-01-09 2006-06-20 At&T Corp Enhancing speech intelligibility using variable-rate time-scale modification
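JP2003255993A (ja) * 2002-03-04 2003-09-10 Ntt Docomo Inc Speech recognition system, speech recognition method, speech recognition program, speech synthesis system, speech synthesis method, speech synthesis program
JP2003295880A (ja) * 2002-03-28 2003-10-15 Fujitsu Ltd Speech synthesis system connecting recorded speech and synthesized speech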
GB2392592B (en) * 2002-08-27 2004-07-07 20 20 Speech Ltd Speech synthesis apparatus and method
US20040073428A1 (en) * 2002-10-10 2004-04-15 Igor Zlokarnik Apparatus, methods, and programming for speech synthesis via bit manipulations of compressed database
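JP4407305B2 (ja) * 2003-02-17 2010-02-03 株式会社ケンウッド Pitch waveform signal dividing apparatus, speech signal compression apparatus, speech synthesis apparatus, pitch waveform signal dividing method, speech signal compression method, speech synthesis method, recording medium, and program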
EP1471499B1 (en) * 2003-04-25 2014-10-01 Alcatel Lucent Method of distributed speech synthesis
WO2004097792A1 (ja) * 2003-04-28 2004-11-11 Fujitsu Limited Speech synthesis system
DE04735990T1 (de) * 2003-06-05 2006-10-05 Kabushiki Kaisha Kenwood, Hachiouji Speech synthesis apparatus, speech synthesis method, and program
US7363221B2 (en) * 2003-08-19 2008-04-22 Microsoft Corporation Method of noise reduction using instantaneous signal-to-noise ratio as the principal quantity for optimal estimation
JP4483450B2 (ja) * 2004-07-22 2010-06-16 株式会社デンソー Voice guidance apparatus, voice guidance method, and navigation apparatus
US20060259303A1 (en) * 2005-05-12 2006-11-16 Raimo Bakis Systems and methods for pitch smoothing for text-to-speech synthesis
CN101542593B (zh) * 2007-03-12 2013-04-17 富士通株式会社 Speech waveform interpolation apparatus and method
US7953600B2 (en) * 2007-04-24 2011-05-31 Novaspeech Llc System and method for hybrid speech synthesis
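CN101589430B (zh) * 2007-08-10 2012-07-18 松下电器产业株式会社 Sound separation apparatus, sound synthesis apparatus, and voice quality conversion apparatus
JP5141688B2 (ja) * 2007-09-06 2013-02-13 富士通株式会社 Sound signal generation method, sound signal generation apparatus, and computer program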
US20090177473A1 (en) * 2008-01-07 2009-07-09 Aaron Andrew S Applying vocal characteristics from a target speaker to a source speaker for synthetic speech
US9031834B2 (en) 2009-09-04 2015-05-12 Nuance Communications, Inc. Speech enhancement techniques on the power spectrum
US9053095B2 (en) * 2010-10-31 2015-06-09 Speech Morphing, Inc. Speech morphing communication system
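JP5983604B2 (ja) * 2011-05-25 2016-08-31 日本電気株式会社 Segment information generation apparatus, speech synthesis apparatus, speech synthesis method, and speech synthesis program
CN105895076B (zh) * 2015-01-26 2019-11-15 科大讯飞股份有限公司 Speech synthesis method and system
JP6728755B2 (ja) * 2015-03-25 2020-07-22 ヤマハ株式会社 Singing sound production apparatus
JP6996095B2 (ja) 2017-03-17 2022-01-17 株式会社リコー Information display apparatus, biological signal measurement system, and program
CN107799122B (zh) * 2017-09-08 2020-10-23 中国科学院深圳先进技术研究院 Highly biomimetic speech processing filter and speech recognition device
JP7181173B2 (ja) * 2019-09-13 2022-11-30 株式会社スクウェア・エニックス Program, information processing apparatus, information processing system, and method
CN112786001B (zh) * 2019-11-11 2024-04-09 北京地平线机器人技术研发有限公司 Speech synthesis model training method, speech synthesis method, and apparatus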

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4685135A (en) * 1981-03-05 1987-08-04 Texas Instruments Incorporated Text-to-speech synthesis system
US4586193A (en) * 1982-12-08 1986-04-29 Harris Corporation Formant-based speech synthesizer
US5208897A (en) * 1990-08-21 1993-05-04 Emerson & Stern Associates, Inc. Method and apparatus for speech recognition based on subsyllable spellings
US5400434A (en) * 1990-09-04 1995-03-21 Matsushita Electric Industrial Co., Ltd. Voice source for synthetic speech system
KR940002854B1 (ko) * 1991-11-06 1994-04-04 한국전기통신공사 Speech segment coding and pitch control method for a speech synthesis system, and voiced sound synthesis apparatus therefor
EP0583559B1 (en) * 1992-07-31 2004-02-25 International Business Machines Corporation Finding token sequences in a database of token strings
CN1092195A (zh) * 1993-03-13 1994-09-14 北京联想计算机集团公司 Method for synthesizing speech and music and producing sound on a PC
US5704007A (en) * 1994-03-11 1997-12-30 Apple Computer, Inc. Utilization of multiple voice sources in a speech synthesizer

Also Published As

Publication number Publication date
KR100385603B1 (ko) 2003-08-21
CN1131785A (zh) 1996-09-25
CN1495703A (zh) 2004-05-12
KR960025314A (ko) 1996-07-20
US5864812A (en) 1999-01-26
CN1294555C (zh) 2007-01-10

Similar Documents

Publication Publication Date Title
CN1146863C (zh) Speech synthesis method and apparatus
US9761219B2 (en) System and method for distributed text-to-speech synthesis and intelligibility
Isewon et al. Design and implementation of text to speech conversion for visually impaired people
WO2020146873A1 (en) System and method for direct speech translation system
Yang et al. Uniaudio: An audio foundation model toward universal audio generation
US8019605B2 (en) Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets
JP2018146803A (ja) Speech synthesis apparatus and program
EP1872361A1 (en) Hybrid speech synthesizer, method and use
CN1622195A (zh) Speech synthesis method and speech synthesis system
US20080120093A1 (en) System for creating dictionary for speech synthesis, semiconductor integrated circuit device, and method for manufacturing semiconductor integrated circuit device
US20100312564A1 (en) Local and remote feedback loop for speech synthesis
US9607610B2 (en) Devices and methods for noise modulation in a universal vocoder synthesizer
US9009050B2 (en) System and method for cloud-based text-to-speech web services
Urbain et al. Arousal-driven synthesis of laughter
CN116457870A (zh) Parallel Tacotron: non-autoregressive and controllable TTS
Wu et al. Deep speech synthesis from articulatory representations
WO2023035261A1 (en) An end-to-end neural system for multi-speaker and multi-lingual speech synthesis
US11600261B2 (en) System and method for cross-speaker style transfer in text-to-speech and training data generation
US11960852B2 (en) Robust direct speech-to-speech translation
Kulkarni et al. Improving transfer of expressivity for end-to-end multispeaker text-to-speech synthesis
Guo et al. MSMC-TTS: Multi-stage multi-codebook VQ-VAE based neural TTS
Szekrényes Prosotool, a method for automatic annotation of fundamental frequency
RU2460154C1 (ru) Method for automated text processing and computer device for implementing the method
Lorenzo-Trueba et al. Simple4all proposals for the albayzin evaluations in speech synthesis
CN113421571B (zh) Voice conversion method and apparatus, electronic device, and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee