DE60205421T2 - Verfahren und Vorrichtung zur Sprachsynthese - Google Patents
Verfahren und Vorrichtung zur Sprachsynthese Download PDFInfo
- Publication number
- DE60205421T2 DE60205421T2 DE60205421T DE60205421T DE60205421T2 DE 60205421 T2 DE60205421 T2 DE 60205421T2 DE 60205421 T DE60205421 T DE 60205421T DE 60205421 T DE60205421 T DE 60205421T DE 60205421 T2 DE60205421 T2 DE 60205421T2
- Authority
- DE
- Germany
- Prior art keywords
- formant
- pitch
- speech
- waveforms
- functions
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000000034 method Methods 0.000 title claims description 18
- 238000003786 synthesis reaction Methods 0.000 title description 25
- 230000015572 biosynthetic process Effects 0.000 title description 24
- 230000006870 function Effects 0.000 claims description 105
- 238000009499 grossing Methods 0.000 claims description 14
- 238000001308 synthesis method Methods 0.000 claims description 13
- 230000001131 transforming effect Effects 0.000 claims 3
- 230000009466 transformation Effects 0.000 claims 1
- 239000011295 pitch Substances 0.000 description 82
- 238000001228 spectrum Methods 0.000 description 23
- 238000010586 diagram Methods 0.000 description 7
- 238000012545 processing Methods 0.000 description 3
- 239000003550 marker Substances 0.000 description 2
- 230000033764 rhythmic process Effects 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 230000029305 taxis Effects 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 210000001260 vocal cord Anatomy 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
- 230000005428 wave function Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Electrophonic Musical Instruments (AREA)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2001087041 | 2001-03-26 | ||
JP2001087041 | 2001-03-26 | ||
JP2002077096 | 2002-03-19 | ||
JP2002077096A JP3732793B2 (ja) | 2001-03-26 | 2002-03-19 | 音声合成方法、音声合成装置及び記録媒体 |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60205421D1 DE60205421D1 (de) | 2005-09-15 |
DE60205421T2 true DE60205421T2 (de) | 2006-04-20 |
Family
ID=26612017
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE60205421T Expired - Lifetime DE60205421T2 (de) | 2001-03-26 | 2002-03-26 | Verfahren und Vorrichtung zur Sprachsynthese |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1246163B1 (ko) |
JP (1) | JP3732793B2 (ko) |
KR (1) | KR100457414B1 (ko) |
CN (1) | CN1185619C (ko) |
DE (1) | DE60205421T2 (ko) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004025626A1 (en) * | 2002-09-10 | 2004-03-25 | Leslie Doherty | Phoneme to speech converter |
JP2004294816A (ja) * | 2003-03-27 | 2004-10-21 | Yamaha Corp | 携帯端末装置 |
JP4214842B2 (ja) | 2003-06-13 | 2009-01-28 | ソニー株式会社 | 音声合成装置及び音声合成方法 |
JP2005004105A (ja) * | 2003-06-13 | 2005-01-06 | Sony Corp | 信号生成装置及び信号生成方法 |
JP2005234337A (ja) * | 2004-02-20 | 2005-09-02 | Yamaha Corp | 音声合成装置、音声合成方法、及び音声合成プログラム |
JP4469883B2 (ja) | 2007-08-17 | 2010-06-02 | 株式会社東芝 | 音声合成方法及びその装置 |
JP5275102B2 (ja) | 2009-03-25 | 2013-08-28 | 株式会社東芝 | 音声合成装置及び音声合成方法 |
JP5631915B2 (ja) | 2012-03-29 | 2014-11-26 | 株式会社東芝 | 音声合成装置、音声合成方法、音声合成プログラムならびに学習装置 |
JP6499305B2 (ja) * | 2015-09-16 | 2019-04-10 | 株式会社東芝 | 音声合成装置、音声合成方法、音声合成プログラム、音声合成モデル学習装置、音声合成モデル学習方法及び音声合成モデル学習プログラム |
JP6728843B2 (ja) * | 2016-03-24 | 2020-07-22 | カシオ計算機株式会社 | 電子楽器、楽音発生装置、楽音発生方法及びプログラム |
CN108257613B (zh) * | 2017-12-05 | 2021-12-10 | 北京小唱科技有限公司 | 修正音频内容音高偏差的方法及装置 |
CN108597527B (zh) * | 2018-04-19 | 2020-01-24 | 北京微播视界科技有限公司 | 多声道音频处理方法、装置、计算机可读存储介质和终端 |
CN110189743B (zh) * | 2019-05-06 | 2024-03-08 | 平安科技(深圳)有限公司 | 波形拼接中的拼接点平滑方法、装置及存储介质 |
-
2002
- 2002-03-19 JP JP2002077096A patent/JP3732793B2/ja not_active Expired - Fee Related
- 2002-03-25 KR KR10-2002-0016033A patent/KR100457414B1/ko not_active IP Right Cessation
- 2002-03-26 CN CNB021080496A patent/CN1185619C/zh not_active Expired - Fee Related
- 2002-03-26 DE DE60205421T patent/DE60205421T2/de not_active Expired - Lifetime
- 2002-03-26 EP EP02252159A patent/EP1246163B1/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
EP1246163B1 (en) | 2005-08-10 |
KR100457414B1 (ko) | 2004-11-16 |
JP2002358090A (ja) | 2002-12-13 |
KR20020076144A (ko) | 2002-10-09 |
EP1246163A2 (en) | 2002-10-02 |
CN1185619C (zh) | 2005-01-19 |
DE60205421D1 (de) | 2005-09-15 |
EP1246163A3 (en) | 2003-08-13 |
JP3732793B2 (ja) | 2006-01-11 |
CN1378199A (zh) | 2002-11-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60112512T2 (de) | Kodierung von Ausdruck in Sprachsynthese | |
DE4237563C2 (de) | Verfahren zum Synthetisieren von Sprache | |
Peterson et al. | Segmentation techniques in speech synthesis | |
Moulines et al. | Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones | |
DE60213653T2 (de) | Verfahren und system zur echtzeit-sprachsynthese | |
DE69909716T2 (de) | Formant Sprachsynthetisierer unter Verwendung von Verkettung von Halbsilben mit unabhängiger Überblendung im Filterkoeffizienten- und Quellenbereich | |
DE60205421T2 (de) | Verfahren und Vorrichtung zur Sprachsynthese | |
DE60313539T2 (de) | Vorrichtung und Verfahren zur Synthese einer singenden Stimme und Programm zur Realisierung des Verfahrens | |
US8280738B2 (en) | Voice quality conversion apparatus, pitch conversion apparatus, and voice quality conversion method | |
DE60216651T2 (de) | Vorrichtung zur Sprachsynthese | |
DE19610019C2 (de) | Digitales Sprachsyntheseverfahren | |
DE2115258A1 (de) | Sprachsynthese durch Verkettung von in Formant Form codierten Wortern | |
DE69926462T2 (de) | Bestimmung des von einer phasenänderung herrührenden rauschanteils für die audiokodierung | |
EP1105867B1 (de) | Verfahren und vorrichtungen zur koartikulationsgerechten konkatenation von audiosegmenten | |
US7251601B2 (en) | Speech synthesis method and speech synthesizer | |
DE60202161T2 (de) | Verfahren, Vorrichtung und Programm zur Analyse und Synthese von Sprache | |
DE4033350B4 (de) | Verfahren und Vorrichtung für die Sprachverarbeitung | |
EP0058130B1 (de) | Verfahren zur Synthese von Sprache mit unbegrenztem Wortschatz und Schaltungsanordnung zur Durchführung des Verfahrens | |
DE69815062T2 (de) | Verfahren und gerät zur audiorepräsentation von nach dem lpc prinzip kodierter sprache durch hinzufügen von rauschsignalen | |
DE60305944T2 (de) | Verfahren zur synthese eines stationären klangsignals | |
DE60316678T2 (de) | Verfahren zum synthetisieren von sprache | |
Saitou et al. | Analysis of acoustic features affecting" singing-ness" and its application to singing-voice synthesis from speaking-voice. | |
WO2000016310A1 (de) | Vorrichtung und verfahren zur digitalen sprachbearbeitung | |
JP3727885B2 (ja) | 音声素片生成方法と装置及びプログラム、並びに音声合成方法と装置 | |
DE60131521T2 (de) | Verfahren und Vorrichtung zur Steuerung des Betriebs eines Geräts bzw. eines Systems sowie System mit einer solchen Vorrichtung und Computerprogramm zur Ausführung des Verfahrens |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |