JP2006119655A5 - - Google Patents
Download PDFInfo
- Publication number
- JP2006119655A5 JP2006119655A5 JP2005336272A JP2005336272A JP2006119655A5 JP 2006119655 A5 JP2006119655 A5 JP 2006119655A5 JP 2005336272 A JP2005336272 A JP 2005336272A JP 2005336272 A JP2005336272 A JP 2005336272A JP 2006119655 A5 JP2006119655 A5 JP 2006119655A5
- Authority
- JP
- Japan
- Prior art keywords
- pitch
- storage means
- voice
- speech
- index
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Claims (4)
ピッチと音声の特徴量の時間変化を表すテンプレートを音韻とピッチをインデックスとして記憶するテンプレート記憶手段と、
少なくともピッチ及び音韻を含む音声合成のための音声情報を入力する入力手段と、
前記音声の特徴量とテンプレートを前記入力された音声情報により前記記憶手段及び前記テンプレート記憶手段からそれぞれ読み出す読み出し手段と、
前記読み出された音声の特徴量および前記入力された音声情報に含まれるピッチに前記読み出されたテンプレートを適用し、該適用後の音声の特徴量及びピッチに基づき音声を合成する音声合成手段とを有する音声合成装置において、
前記読み出し手段は、前記入力された音声情報に含まれるピッチが前記記憶手段における最も高いインデックスの値を超える場合に、前記入力された音声情報に含まれるピッチから前記記憶手段に記憶される最も高いインデックスのピッチを引いたピッチ差を求め、前記最も高いピッチインデックスにより前記記憶手段から読み出した特徴量に該ピッチ差を加算した特徴量を前記音声合成手段に出力することを特徴とする音声合成装置。 Storage means for storing a feature amount of speech at a specific time as an index of phoneme and pitch;
Template storage means for storing a template representing a temporal change in the feature amount of the pitch and the voice as an index of the phoneme and the pitch;
Input means for inputting speech information for speech synthesis including at least pitch and phoneme;
Read means for reading out the feature amount and template of the voice from the storage means and the template storage means respectively by the inputted voice information;
Voice synthesis means for applying the read template to the pitch included in the read voice feature quantity and the input voice information, and synthesizing voice based on the voice feature quantity and pitch after the application In a speech synthesizer having
When the pitch included in the input voice information exceeds the value of the highest index in the storage means, the reading means is the highest stored in the storage means from the pitch included in the input voice information A speech synthesizer characterized in that a pitch difference obtained by subtracting a pitch of an index is obtained, and a feature amount obtained by adding the pitch difference to a feature amount read from the storage means by the highest pitch index is output to the speech synthesizer. .
ピッチと音声の特徴量の時間変化を表すテンプレートを音韻とピッチをインデックスとして記憶するテンプレート記憶手段と、
少なくともピッチ及び音韻を含む音声合成のための音声情報を入力する入力手段と、
前記音声の特徴量とテンプレートを前記入力された音声情報により前記記憶手段及び前記テンプレート記憶手段からそれぞれ読み出す読み出し手段と、
前記読み出された音声の特徴量および前記入力された音声情報に含まれるピッチに前記読み出されたテンプレートを適用し、該適用後の音声の特徴量及びピッチに基づき音声を合成する音声合成手段とを有する音声合成装置において、
前記読み出し手段は、前記入力された音声情報に含まれるピッチが前記記憶手段における最も低いインデックスの値を下回る場合に、該入力された音声情報に含まれるピッチから前記記憶手段に記憶される最も低いインデックスのピッチを引いたピッチ差を求め、前記最も低いピッチインデックスにより前記記憶手段から読み出した特徴量に該ピッチ差の指定割合を加算した特徴量を前記音声合成手段に出力することを特徴とする音声合成装置。 Storage means for storing a feature amount of speech at a specific time as an index of phoneme and pitch;
Template storage means for storing a template representing a temporal change in the feature amount of the pitch and the voice as an index of the phoneme and the pitch;
Input means for inputting speech information for speech synthesis including at least pitch and phoneme;
Read means for reading out the feature amount and template of the voice from the storage means and the template storage means respectively by the inputted voice information;
Voice synthesis means for applying the read template to the pitch included in the read voice feature quantity and the input voice information, and synthesizing voice based on the voice feature quantity and pitch after the application In a speech synthesizer having
When the pitch included in the input voice information is lower than the lowest index value in the storage means, the reading means is the lowest stored in the storage means from the pitch included in the input voice information. A pitch difference obtained by subtracting the pitch of the index is obtained, and a feature amount obtained by adding a specified ratio of the pitch difference to the feature amount read from the storage unit with the lowest pitch index is output to the speech synthesis unit. Speech synthesizer.
前記読み出し手段は、前記入力された音声情報に含まれるピッチが低いほど、前記励起レゾナンスのバンド幅が狭くなるように補正して出力することを特徴とする請求項2記載の音声合成装置。 The audio feature quantity stored in the storage means includes excitation resonance,
3. The speech synthesizer according to claim 2, wherein the reading unit corrects and outputs the excitation resonance so that the bandwidth of the excitation resonance becomes narrower as the pitch included in the inputted speech information is lower.
前記読み出し手段は、前記入力された音声情報に含まれるピッチが低いほど、前記フォルマントのアンプリチュードが大きくなるように補正して出力することを特徴とする請求項2記載の音声合成装置。 The audio feature quantity stored in the storage means includes a formant,
3. The speech synthesizer according to claim 2, wherein the reading means corrects and outputs the formant so that the amplitude of the formant increases as the pitch included in the input speech information decreases.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2005336272A JP4353174B2 (en) | 2005-11-21 | 2005-11-21 | Speech synthesizer |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2005336272A JP4353174B2 (en) | 2005-11-21 | 2005-11-21 | Speech synthesizer |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2001067258A Division JP3838039B2 (en) | 2001-03-09 | 2001-03-09 | Speech synthesizer |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2006119655A JP2006119655A (en) | 2006-05-11 |
JP2006119655A5 true JP2006119655A5 (en) | 2007-04-12 |
JP4353174B2 JP4353174B2 (en) | 2009-10-28 |
Family
ID=36537516
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2005336272A Expired - Fee Related JP4353174B2 (en) | 2005-11-21 | 2005-11-21 | Speech synthesizer |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP4353174B2 (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5471138B2 (en) * | 2009-08-06 | 2014-04-16 | 大日本印刷株式会社 | Phoneme code converter and speech synthesizer |
GB2480108B (en) * | 2010-05-07 | 2012-08-29 | Toshiba Res Europ Ltd | A speech processing method an apparatus |
-
2005
- 2005-11-21 JP JP2005336272A patent/JP4353174B2/en not_active Expired - Fee Related
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5471858B2 (en) | Database generating apparatus for singing synthesis and pitch curve generating apparatus | |
US9666201B2 (en) | Bandwidth extension method and apparatus using high frequency excitation signal and high frequency energy | |
CN101578659B (en) | Voice tone converting device and voice tone converting method | |
JP6317387B2 (en) | Weight function determination method | |
CN101496101B (en) | Systems, methods, and apparatus for gain factor limiting | |
US20140114663A1 (en) | Guided speaker adaptive speech synthesis system and method and computer program product | |
CN105830153A (en) | High-band signal modeling | |
EP1902441A1 (en) | Supporting a concatenative text-to-speech synthesis | |
KR20170092696A (en) | Scaling for gain shape circuitry | |
CN105593933A (en) | Gain shape estimation for improved tracking of high-band temporal characteristics | |
KR20160128871A (en) | User-customizable voice revision method of converting voice by parameter modification and voice revision device implementing the same | |
JP6821970B2 (en) | Speech synthesizer and speech synthesizer | |
JP2004144850A5 (en) | ||
JP2006119655A5 (en) | ||
CN2706830Y (en) | Sound source apparatus | |
JP2002268658A5 (en) | ||
JP2013164609A (en) | Singing synthesizing database generation device, and pitch curve generation device | |
Agiomyrgiannakis et al. | ARX-LF-based source-filter methods for voice modification and transformation | |
JP4648878B2 (en) | Style designation type speech synthesis method, style designation type speech synthesis apparatus, program thereof, and storage medium thereof | |
CN101192408A (en) | Method and device for selecting conductivity coefficient vector quantization | |
WO2007030233A3 (en) | Speech dialog method and device | |
CN102231275A (en) | Embedded speech synthesis method based on weighted mixed excitation | |
JP2009271315A (en) | Cellular phone capable of reproducing sound from two-dimensional code, and printed matter with two-dimensional code including sound two-dimensional code being displayed thereon | |
JP6011039B2 (en) | Speech synthesis apparatus and speech synthesis method | |
JP2010224418A (en) | Voice synthesizer, method, and program |