JP2006119655A5 - Google Patents

Publication number
JP2006119655A5
Authority
JP
Japan
Prior art keywords
pitch
storage means
voice
speech
index
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2005336272A
Other languages
Japanese (ja)
Other versions
JP4353174B2 (en)
JP2006119655A (en)
Filing date
Publication date
Application filed
Priority to JP2005336272A
Priority claimed from JP2005336272A (JP4353174B2)
Publication of JP2006119655A
Publication of JP2006119655A5
Application granted
Publication of JP4353174B2
Anticipated expiration
Expired - Fee Related (current legal status)

Claims (4)

1. A speech synthesizer comprising:
storage means for storing feature quantities of speech at a specific time, indexed by phoneme and pitch;
template storage means for storing templates representing temporal changes in pitch and in the speech feature quantities, indexed by phoneme and pitch;
input means for inputting speech information for speech synthesis, including at least a pitch and a phoneme;
reading means for reading the speech feature quantities and the template from the storage means and the template storage means, respectively, according to the input speech information; and
speech synthesis means for applying the read template to the read speech feature quantities and to the pitch included in the input speech information, and for synthesizing speech based on the speech feature quantities and pitch after the application,
wherein, when the pitch included in the input speech information exceeds the value of the highest index in the storage means, the reading means obtains a pitch difference by subtracting the pitch of the highest index stored in the storage means from the pitch included in the input speech information, and outputs to the speech synthesis means a feature quantity obtained by adding the pitch difference to the feature quantity read from the storage means using the highest pitch index.
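The reading step of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the dictionary layout, the function name `read_feature_high`, the use of a single scalar feature, and the nearest-index lookup for in-range pitches are all assumptions made for illustration. Only the out-of-range branch (read at the highest pitch index, then add the pitch difference) follows the claim wording.

```python
from typing import Dict, Tuple

# Hypothetical store: (phoneme, pitch index) -> one scalar feature value.
FeatureDB = Dict[Tuple[str, float], float]


def read_feature_high(db: FeatureDB, phoneme: str, pitch: float) -> float:
    """Read a feature value, extrapolating above the highest stored pitch."""
    pitches = sorted(p for (ph, p) in db if ph == phoneme)
    if not pitches:
        raise KeyError(f"no entries for phoneme {phoneme!r}")
    highest = pitches[-1]
    if pitch > highest:
        # Claim 1: input pitch minus the highest stored pitch index,
        # added to the feature read at that highest index.
        pitch_diff = pitch - highest
        return db[(phoneme, highest)] + pitch_diff
    # Assumption: the claim does not specify in-range behavior;
    # this sketch simply picks the nearest stored pitch index.
    nearest = min(pitches, key=lambda p: abs(p - pitch))
    return db[(phoneme, nearest)]
```

With a store holding `("a", 100.0)` and `("a", 200.0)`, a request at pitch 250.0 reads the entry at 200.0 and adds the difference of 50.0. The pitch unit is left open here because the claim does not state one.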
2. A speech synthesizer comprising:
storage means for storing feature quantities of speech at a specific time, indexed by phoneme and pitch;
template storage means for storing templates representing temporal changes in pitch and in the speech feature quantities, indexed by phoneme and pitch;
input means for inputting speech information for speech synthesis, including at least a pitch and a phoneme;
reading means for reading the speech feature quantities and the template from the storage means and the template storage means, respectively, according to the input speech information; and
speech synthesis means for applying the read template to the read speech feature quantities and to the pitch included in the input speech information, and for synthesizing speech based on the speech feature quantities and pitch after the application,
wherein, when the pitch included in the input speech information is below the value of the lowest index in the storage means, the reading means obtains a pitch difference by subtracting the pitch of the lowest index stored in the storage means from the pitch included in the input speech information, and outputs to the speech synthesis means a feature quantity obtained by adding a specified proportion of the pitch difference to the feature quantity read from the storage means using the lowest pitch index.
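The low-pitch case of claim 2 differs from claim 1 in that only a specified proportion of the pitch difference is added. The following Python sketch shows that arithmetic under stated assumptions: the store layout, the function name `read_feature_low`, the parameter name `ratio`, and the nearest-index fallback for in-range pitches are hypothetical and do not come from the patent text.

```python
from typing import Dict, Tuple

# Hypothetical store: (phoneme, pitch index) -> one scalar feature value.
FeatureDB = Dict[Tuple[str, float], float]


def read_feature_low(db: FeatureDB, phoneme: str, pitch: float,
                     ratio: float) -> float:
    """Read a feature value, extrapolating below the lowest stored pitch."""
    pitches = sorted(p for (ph, p) in db if ph == phoneme)
    if not pitches:
        raise KeyError(f"no entries for phoneme {phoneme!r}")
    lowest = pitches[0]
    if pitch < lowest:
        # Claim 2: input pitch minus the lowest stored pitch index.
        # The difference is negative here, so the feature value is lowered.
        pitch_diff = pitch - lowest
        return db[(phoneme, lowest)] + ratio * pitch_diff
    # Assumption: in-range lookup is not specified by the claim;
    # this sketch picks the nearest stored pitch index.
    nearest = min(pitches, key=lambda p: abs(p - pitch))
    return db[(phoneme, nearest)]
```

For a lowest stored pitch of 100.0 with feature value 10.0, a request at pitch 80.0 with `ratio=0.5` gives 10.0 + 0.5 × (−20.0) = 0.0. A ratio of 1.0 would mirror the full-difference rule of claim 1.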
3. The speech synthesizer according to claim 2, wherein the speech feature quantities stored in the storage means include an excitation resonance, and
the reading means corrects the excitation resonance so that its bandwidth becomes narrower as the pitch included in the input speech information becomes lower, and outputs the corrected result.
4. The speech synthesizer according to claim 2, wherein the speech feature quantities stored in the storage means include a formant, and
the reading means corrects the formant so that its amplitude becomes larger as the pitch included in the input speech information becomes lower, and outputs the corrected result.
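The two dependent claims only state the direction of each correction: a lower pitch gives a narrower excitation resonance bandwidth and a larger formant amplitude. The Python sketch below picks a linear rule to show one way that monotonic behavior could be realized. The `Features` class, the slope parameters, the bandwidth floor, and the units are all assumptions for illustration and are not specified in the claims.

```python
from dataclasses import dataclass


@dataclass
class Features:
    resonance_bandwidth: float  # excitation resonance bandwidth (unit assumed)
    formant_amplitude: float    # formant amplitude (unit assumed)


def correct_low_pitch(stored: Features, pitch: float, lowest: float,
                      bw_slope: float = 0.5, amp_slope: float = 0.25,
                      min_bandwidth: float = 1.0) -> Features:
    """Correct features read at the lowest pitch index for a lower input pitch."""
    if pitch >= lowest:
        # No correction when the input pitch is within the stored range.
        return stored
    drop = lowest - pitch  # how far the input falls below the lowest index
    # Narrower bandwidth for lower pitch (hypothetical linear rule),
    # clamped so the bandwidth stays positive.
    bandwidth = max(min_bandwidth, stored.resonance_bandwidth - bw_slope * drop)
    # Larger formant amplitude for lower pitch (hypothetical linear rule).
    amplitude = stored.formant_amplitude + amp_slope * drop
    return Features(bandwidth, amplitude)
```

A linear rule is only one choice that satisfies the claim wording. Any mapping that is monotonic in the pitch drop would match the stated behavior equally well.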
JP2005336272A 2005-11-21 2005-11-21 Speech synthesizer Expired - Fee Related JP4353174B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2005336272A JP4353174B2 (en) 2005-11-21 2005-11-21 Speech synthesizer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2005336272A JP4353174B2 (en) 2005-11-21 2005-11-21 Speech synthesizer

Related Parent Applications (1)

Application Number Priority Date Filing Date Title
JP2001067258A Division JP3838039B2 (en) 2001-03-09 2001-03-09 Speech synthesizer

Publications (3)

Publication Number Publication Date
JP2006119655A JP2006119655A (en) 2006-05-11
JP2006119655A5 JP2006119655A5 (en) 2007-04-12
JP4353174B2 JP4353174B2 (en) 2009-10-28

Family

ID=36537516

Family Applications (1)

Application Number Priority Date Filing Date Title
JP2005336272A Expired - Fee Related JP4353174B2 (en) 2005-11-21 2005-11-21 Speech synthesizer

Country Status (1)

Country Link
JP (1) JP4353174B2 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5471138B2 (en) * 2009-08-06 2014-04-16 大日本印刷株式会社 Phoneme code converter and speech synthesizer
GB2480108B (en) * 2010-05-07 2012-08-29 Toshiba Res Europ Ltd A speech processing method an apparatus

Similar Documents

Publication Publication Date Title
JP5471858B2 (en) Database generating apparatus for singing synthesis and pitch curve generating apparatus
US9666201B2 (en) Bandwidth extension method and apparatus using high frequency excitation signal and high frequency energy
CN101578659B (en) Voice tone converting device and voice tone converting method
JP6317387B2 (en) Weight function determination method
CN101496101B (en) Systems, methods, and apparatus for gain factor limiting
US20140114663A1 (en) Guided speaker adaptive speech synthesis system and method and computer program product
CN105830153A (en) High-band signal modeling
EP1902441A1 (en) Supporting a concatenative text-to-speech synthesis
KR20170092696A (en) Scaling for gain shape circuitry
CN105593933A (en) Gain shape estimation for improved tracking of high-band temporal characteristics
KR20160128871A (en) User-customizable voice revision method of converting voice by parameter modification and voice revision device implementing the same
JP6821970B2 (en) Speech synthesizer and speech synthesizer
JP2004144850A5 (en)
JP2006119655A5 (en)
CN2706830Y (en) Sound source apparatus
JP2002268658A5 (en)
JP2013164609A (en) Singing synthesizing database generation device, and pitch curve generation device
Agiomyrgiannakis et al. ARX-LF-based source-filter methods for voice modification and transformation
JP4648878B2 (en) Style designation type speech synthesis method, style designation type speech synthesis apparatus, program thereof, and storage medium thereof
CN101192408A (en) Method and device for selecting conductivity coefficient vector quantization
WO2007030233A3 (en) Speech dialog method and device
CN102231275A (en) Embedded speech synthesis method based on weighted mixed excitation
JP2009271315A (en) Cellular phone capable of reproducing sound from two-dimensional code, and printed matter with two-dimensional code including sound two-dimensional code being displayed thereon
JP6011039B2 (en) Speech synthesis apparatus and speech synthesis method
JP2010224418A (en) Voice synthesizer, method, and program