JP2014109667A5

JP2014109667A5 -

Info

Publication number: JP2014109667A5
Application number: JP2012263574A
Authority: JP
Filing date: 2012-11-30
Publication date: 2015-10-08
Anticipated expiration: 2032-11-30

Claims

A speech synthesizer that synthesizes a speech waveform in a plurality of divided frequency bands based on input time-series sound source control information and spectrum characteristic information,
To simulate the spectral characteristics of the output target speech in one or more sub-bands, a sine wave synthesizing unit for outputting a synthesized sine wave synthesis component summing a plurality of sine wave whose amplitude is adjusted,
A speech synthesizer comprising: a subband synthesis unit that synthesizes the output sine wave synthesis component in the one or more subbands into a single speech waveform.

In the one or more subbands, a subband division waveform vector generation unit that generates one subband division waveform vector by combining the subband division excitation waveform vector derived from the excitation waveform and the sine wave synthesis component is further provided,
The speech synthesis apparatus according to claim 1, wherein the subband synthesis unit synthesizes the generated subband division waveform vector in the one or more subbands into a single speech waveform .

The sine wave synthesis unit has a sampling rate equal to or higher than the sampling rate when the output sine wave synthesis component is down-sampled so that the sub-wave synthesis unit can completely or approximately restore the original waveform. The speech synthesizer according to claim 1 or 2 , wherein the sine wave synthesis component is generated in a subband of the first and second subbands.

The sinusoidal synthesis unit according to claim 3 part of the band of the subband lower side from claim 1, characterized in that to generate the sinusoidal synthesis components as the one or more sub-band Voice synthesizer.

The sinusoidal synthesis unit, speech synthesis apparatus according to any one of claims 1 to 3, characterized in that to generate the sinusoidal synthesis components in correspondence with an impulse sound source in the one or more subbands.

A speech synthesis method for synthesizing speech waveforms in a plurality of divided frequency bands based on input time-series sound source control information and spectrum characteristic information,
To simulate the spectral characteristics of the output target speech in one or more sub-band, the steps of the sinusoidal synthesis components output was synthesized adding the plurality of sine wave whose amplitude is adjusted,
Synthesizing the output sine wave synthesis component in the one or more subbands into a single speech waveform.

A speech synthesis program for synthesizing speech waveforms in a plurality of divided frequency bands based on input time-series sound source control information and spectrum characteristic information,
Processing to output a sine wave synthesis component obtained by adding and synthesizing a plurality of amplitude-adjusted sine waves so as to simulate the spectral characteristics of the output target speech in one or more subbands;
A speech synthesis program that causes a computer to execute a process of synthesizing the output sine wave synthesis component in the one or more subbands into a single speech waveform.