CN1146863C - Speech synthesis method and apparatus
- Publication number
- CN1146863C (application CNB951190490A / CN95119049A)
- Authority
- CN
- China
- Prior art keywords
- waveform
- pitch
- speech
- segments
- synthetic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Electrophonic Musical Instruments (AREA)
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP6302471A JPH08160991A (ja) | 1994-12-06 | 1994-12-06 | Speech segment creation method, speech synthesis method, and apparatus |
JP302,471/1994 | 1994-12-06 | ||
JP302,471/94 | 1994-12-06 | ||
JP220,963/95 | 1995-08-30 | ||
JP220,963/1995 | 1995-08-30 | ||
JP7220963A JP2987089B2 (ja) | 1995-08-30 | 1995-08-30 | Speech segment creation method, speech synthesis method, and apparatus therefor |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2003101028665A Division CN1294555C (zh) | 1994-12-06 | 1995-12-06 | Method for producing speech segments |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1131785A (zh) | 1996-09-25 |
CN1146863C (zh) | 2004-04-21 |
Family
ID=26523998
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB951190490A Expired - Fee Related CN1146863C (zh) | 1994-12-06 | 1995-12-06 | Speech synthesis method and apparatus |
CNB2003101028665A Expired - Fee Related CN1294555C (zh) | 1994-12-06 | 1995-12-06 | Method for producing speech segments |
Family Applications After (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2003101028665A Expired - Fee Related CN1294555C (zh) | 1994-12-06 | 1995-12-06 | Method for producing speech segments |
Country Status (3)
Country | Link |
---|---|
US (1) | US5864812A (ko) |
KR (1) | KR100385603B1 (ko) |
CN (2) | CN1146863C (ko) |
Families Citing this family (43)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6240384B1 (en) * | 1995-12-04 | 2001-05-29 | Kabushiki Kaisha Toshiba | Speech synthesis method |
DE19610019C2 (de) * | 1996-03-14 | 1999-10-28 | Data Software Gmbh G | Digitales Sprachsyntheseverfahren |
JP3349905B2 (ja) * | 1996-12-10 | 2002-11-25 | 松下電器産業株式会社 | 音声合成方法および装置 |
US6490562B1 (en) | 1997-04-09 | 2002-12-03 | Matsushita Electric Industrial Co., Ltd. | Method and system for analyzing voices |
JP3902860B2 (ja) * | 1998-03-09 | 2007-04-11 | キヤノン株式会社 | 音声合成制御装置及びその制御方法、コンピュータ可読メモリ |
JP3430985B2 (ja) * | 1999-08-05 | 2003-07-28 | ヤマハ株式会社 | 合成音生成装置 |
JP3450237B2 (ja) * | 1999-10-06 | 2003-09-22 | 株式会社アルカディア | 音声合成装置および方法 |
GB9925297D0 (en) * | 1999-10-27 | 1999-12-29 | Ibm | Voice processing system |
JP2001265375A (ja) * | 2000-03-17 | 2001-09-28 | Oki Electric Ind Co Ltd | 規則音声合成装置 |
JP3728172B2 (ja) * | 2000-03-31 | 2005-12-21 | キヤノン株式会社 | 音声合成方法および装置 |
US6662162B2 (en) * | 2000-08-28 | 2003-12-09 | Maureen Casper | Method of rating motor dysfunction by assessing speech prosody |
US7251601B2 (en) * | 2001-03-26 | 2007-07-31 | Kabushiki Kaisha Toshiba | Speech synthesis method and speech synthesizer |
ATE336774T1 (de) * | 2001-05-28 | 2006-09-15 | Texas Instruments Inc | Programmierbarer melodienerzeuger |
WO2003019530A1 (fr) * | 2001-08-31 | 2003-03-06 | Kenwood Corporation | Dispositif et procede de generation d'un signal a forme d'onde affecte d'un pas ; programme |
US6681208B2 (en) * | 2001-09-25 | 2004-01-20 | Motorola, Inc. | Text-to-speech native coding in a communication system |
US7483832B2 (en) * | 2001-12-10 | 2009-01-27 | At&T Intellectual Property I, L.P. | Method and system for customizing voice translation of text to speech |
US20060069567A1 (en) * | 2001-12-10 | 2006-03-30 | Tischer Steven N | Methods, systems, and products for translating text to speech |
US7065485B1 (en) * | 2002-01-09 | 2006-06-20 | At&T Corp | Enhancing speech intelligibility using variable-rate time-scale modification |
JP2003255993A (ja) * | 2002-03-04 | 2003-09-10 | Ntt Docomo Inc | 音声認識システム、音声認識方法、音声認識プログラム、音声合成システム、音声合成方法、音声合成プログラム |
JP2003295880A (ja) * | 2002-03-28 | 2003-10-15 | Fujitsu Ltd | 録音音声と合成音声を接続する音声合成システム |
GB2392592B (en) * | 2002-08-27 | 2004-07-07 | 20 20 Speech Ltd | Speech synthesis apparatus and method |
US20040073428A1 (en) * | 2002-10-10 | 2004-04-15 | Igor Zlokarnik | Apparatus, methods, and programming for speech synthesis via bit manipulations of compressed database |
JP4407305B2 (ja) * | 2003-02-17 | 2010-02-03 | 株式会社ケンウッド | ピッチ波形信号分割装置、音声信号圧縮装置、音声合成装置、ピッチ波形信号分割方法、音声信号圧縮方法、音声合成方法、記録媒体及びプログラム |
EP1471499B1 (en) * | 2003-04-25 | 2014-10-01 | Alcatel Lucent | Method of distributed speech synthesis |
WO2004097792A1 (ja) * | 2003-04-28 | 2004-11-11 | Fujitsu Limited | 音声合成システム |
DE04735990T1 (de) * | 2003-06-05 | 2006-10-05 | Kabushiki Kaisha Kenwood, Hachiouji | Sprachsynthesevorrichtung, sprachsyntheseverfahren und programm |
US7363221B2 (en) * | 2003-08-19 | 2008-04-22 | Microsoft Corporation | Method of noise reduction using instantaneous signal-to-noise ratio as the principal quantity for optimal estimation |
JP4483450B2 (ja) * | 2004-07-22 | 2010-06-16 | 株式会社デンソー | 音声案内装置、音声案内方法およびナビゲーション装置 |
US20060259303A1 (en) * | 2005-05-12 | 2006-11-16 | Raimo Bakis | Systems and methods for pitch smoothing for text-to-speech synthesis |
CN101542593B (zh) * | 2007-03-12 | 2013-04-17 | 富士通株式会社 | 语音波形内插装置及方法 |
US7953600B2 (en) * | 2007-04-24 | 2011-05-31 | Novaspeech Llc | System and method for hybrid speech synthesis |
CN101589430B (zh) * | 2007-08-10 | 2012-07-18 | 松下电器产业株式会社 | 声音分离装置、声音合成装置及音质变换装置 |
JP5141688B2 (ja) * | 2007-09-06 | 2013-02-13 | 富士通株式会社 | 音信号生成方法、音信号生成装置及びコンピュータプログラム |
US20090177473A1 (en) * | 2008-01-07 | 2009-07-09 | Aaron Andrew S | Applying vocal characteristics from a target speaker to a source speaker for synthetic speech |
US9031834B2 (en) | 2009-09-04 | 2015-05-12 | Nuance Communications, Inc. | Speech enhancement techniques on the power spectrum |
US9053095B2 (en) * | 2010-10-31 | 2015-06-09 | Speech Morphing, Inc. | Speech morphing communication system |
JP5983604B2 (ja) * | 2011-05-25 | 2016-08-31 | 日本電気株式会社 | 素片情報生成装置、音声合成装置、音声合成方法および音声合成プログラム |
CN105895076B (zh) * | 2015-01-26 | 2019-11-15 | 科大讯飞股份有限公司 | 一种语音合成方法及系统 |
JP6728755B2 (ja) * | 2015-03-25 | 2020-07-22 | ヤマハ株式会社 | 歌唱音発音装置 |
JP6996095B2 (ja) | 2017-03-17 | 2022-01-17 | 株式会社リコー | 情報表示装置、生体信号計測システムおよびプログラム |
CN107799122B (zh) * | 2017-09-08 | 2020-10-23 | 中国科学院深圳先进技术研究院 | 一种高生物拟真性语音处理滤波器与语音识别设备 |
JP7181173B2 (ja) * | 2019-09-13 | 2022-11-30 | 株式会社スクウェア・エニックス | プログラム、情報処理装置、情報処理システム及び方法 |
CN112786001B (zh) * | 2019-11-11 | 2024-04-09 | 北京地平线机器人技术研发有限公司 | 语音合成模型训练方法、语音合成方法和装置 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4685135A (en) * | 1981-03-05 | 1987-08-04 | Texas Instruments Incorporated | Text-to-speech synthesis system |
US4586193A (en) * | 1982-12-08 | 1986-04-29 | Harris Corporation | Formant-based speech synthesizer |
US5208897A (en) * | 1990-08-21 | 1993-05-04 | Emerson & Stern Associates, Inc. | Method and apparatus for speech recognition based on subsyllable spellings |
US5400434A (en) * | 1990-09-04 | 1995-03-21 | Matsushita Electric Industrial Co., Ltd. | Voice source for synthetic speech system |
KR940002854B1 (ko) * | 1991-11-06 | 1994-04-04 | 한국전기통신공사 | 음성 합성시스팀의 음성단편 코딩 및 그의 피치조절 방법과 그의 유성음 합성장치 |
EP0583559B1 (en) * | 1992-07-31 | 2004-02-25 | International Business Machines Corporation | Finding token sequences in a database of token strings |
CN1092195A (zh) * | 1993-03-13 | 1994-09-14 | 北京联想计算机集团公司 | Pc机合成语音音乐及发声的方法 |
US5704007A (en) * | 1994-03-11 | 1997-12-30 | Apple Computer, Inc. | Utilization of multiple voice sources in a speech synthesizer |
-
1995
- 1995-11-30 US US08/565,401 patent/US5864812A/en not_active Expired - Fee Related
- 1995-12-05 KR KR1019950046901A patent/KR100385603B1/ko not_active IP Right Cessation
- 1995-12-06 CN CNB951190490A patent/CN1146863C/zh not_active Expired - Fee Related
- 1995-12-06 CN CNB2003101028665A patent/CN1294555C/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
KR100385603B1 (ko) | 2003-08-21 |
CN1131785A (zh) | 1996-09-25 |
CN1495703A (zh) | 2004-05-12 |
KR960025314A (ko) | 1996-07-20 |
US5864812A (en) | 1999-01-26 |
CN1294555C (zh) | 2007-01-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1146863C (zh) | Speech synthesis method and apparatus | |
US9761219B2 (en) | System and method for distributed text-to-speech synthesis and intelligibility | |
Isewon et al. | Design and implementation of text to speech conversion for visually impaired people | |
WO2020146873A1 (en) | System and method for direct speech translation system | |
Yang et al. | Uniaudio: An audio foundation model toward universal audio generation | |
US8019605B2 (en) | Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets | |
JP2018146803A (ja) | 音声合成装置及びプログラム | |
EP1872361A1 (en) | Hybrid speech synthesizer, method and use | |
CN1622195A (zh) | 语音合成方法和语音合成系统 | |
US20080120093A1 (en) | System for creating dictionary for speech synthesis, semiconductor integrated circuit device, and method for manufacturing semiconductor integrated circuit device | |
US20100312564A1 (en) | Local and remote feedback loop for speech synthesis | |
US9607610B2 (en) | Devices and methods for noise modulation in a universal vocoder synthesizer | |
US9009050B2 (en) | System and method for cloud-based text-to-speech web services | |
Urbain et al. | Arousal-driven synthesis of laughter | |
CN116457870A (zh) | 并行化Tacotron:非自回归且可控的TTS | |
Wu et al. | Deep speech synthesis from articulatory representations | |
WO2023035261A1 (en) | An end-to-end neural system for multi-speaker and multi-lingual speech synthesis | |
US11600261B2 (en) | System and method for cross-speaker style transfer in text-to-speech and training data generation | |
US11960852B2 (en) | Robust direct speech-to-speech translation | |
Kulkarni et al. | Improving transfer of expressivity for end-to-end multispeaker text-to-speech synthesis | |
Guo et al. | MSMC-TTS: Multi-stage multi-codebook VQ-VAE based neural TTS | |
Szekrényes | Prosotool, a method for automatic annotation of fundamental frequency | |
RU2460154C1 (ru) | Способ автоматизированной обработки текста и компьютерное устройство для реализации этого способа | |
Lorenzo-Trueba et al. | Simple4all proposals for the albayzin evaluations in speech synthesis | |
CN113421571B (zh) | 一种语音转换方法、装置、电子设备和存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |