JP2014109667A5 - - Google Patents

Download PDF

Info

Publication number
JP2014109667A5
JP2014109667A5 JP2012263574A JP2012263574A JP2014109667A5 JP 2014109667 A5 JP2014109667 A5 JP 2014109667A5 JP 2012263574 A JP2012263574 A JP 2012263574A JP 2012263574 A JP2012263574 A JP 2012263574A JP 2014109667 A5 JP2014109667 A5 JP 2014109667A5
Authority
JP
Japan
Prior art keywords
speech
synthesis
sine wave
subbands
waveform
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2012263574A
Other languages
Japanese (ja)
Other versions
JP2014109667A (en
JP6284298B2 (en
Filing date
Publication date
Application filed filed Critical
Priority to JP2012263574A priority Critical patent/JP6284298B2/en
Priority claimed from JP2012263574A external-priority patent/JP6284298B2/en
Publication of JP2014109667A publication Critical patent/JP2014109667A/en
Publication of JP2014109667A5 publication Critical patent/JP2014109667A5/ja
Application granted granted Critical
Publication of JP6284298B2 publication Critical patent/JP6284298B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Claims (7)

入力された時系列の音源制御情報およびスペクトル特性情報を基に、分割された複数の周波数帯域で音声波形を合成する音声合成装置であって、
1以上のサブバンドにおいて出力目標音声のスペクトル特性を模擬するように、振幅調整された複数の正弦波を足し合わせて合成した正弦波合成成分を出力する正弦波合成部
前記1以上のサブバンドにおいて前記出力された正弦波合成成分を単一の音声波形に合成するサブバンド合成部と、を備えることを特徴とする音声合成装置。
A speech synthesizer that synthesizes a speech waveform in a plurality of divided frequency bands based on input time-series sound source control information and spectrum characteristic information,
To simulate the spectral characteristics of the output target speech in one or more sub-bands, a sine wave synthesizing unit for outputting a synthesized sine wave synthesis component summing a plurality of sine wave whose amplitude is adjusted,
A speech synthesizer comprising: a subband synthesis unit that synthesizes the output sine wave synthesis component in the one or more subbands into a single speech waveform.
前記1以上のサブバンドにおいて、音源波形に由来するサブバンド分割音源波形ベクトルおよび前記正弦波合成成分を組み合わせて1つのサブバンド分割波形ベクトルを生成するサブバンド分割波形ベクトル生成部を更に備え、
前記サブバンド合成部は、前記1以上のサブバンドにおいて前記生成されたサブバンド分割波形ベクトルを単一の音声波形に合成することを特徴とする請求項1記載の音声合成装置
In the one or more subbands, a subband division waveform vector generation unit that generates one subband division waveform vector by combining the subband division excitation waveform vector derived from the excitation waveform and the sine wave synthesis component is further provided,
The speech synthesis apparatus according to claim 1, wherein the subband synthesis unit synthesizes the generated subband division waveform vector in the one or more subbands into a single speech waveform .
前記正弦波合成部は、前記出力する正弦波合成成分が前記サブバンド合成部において原波形を完全もしくは近似的に復元できるようにダウンサンプリングされた際のサンプリングレートと等しいサンプリングレートで、前記1以上のサブバンドにおいて前記正弦波合成成分を生成することを特徴とする請求項1または請求項2記載の音声合成装置。 The sine wave synthesis unit has a sampling rate equal to or higher than the sampling rate when the output sine wave synthesis component is down-sampled so that the sub-wave synthesis unit can completely or approximately restore the original waveform. The speech synthesizer according to claim 1 or 2 , wherein the sine wave synthesis component is generated in a subband of the first and second subbands. 前記正弦波合成部は、低い側の一部の帯域のサブバンドを前記1以上のサブバンドとして前記正弦波合成成分を生成することを特徴とする請求項1から請求項3のいずれかに記載の音声合成装置。 The sinusoidal synthesis unit according to claim 3 part of the band of the subband lower side from claim 1, characterized in that to generate the sinusoidal synthesis components as the one or more sub-band Voice synthesizer. 前記正弦波合成部は、前記1以上のサブバンドにおいてインパルス音源に対応させて前記正弦波合成成分を生成することを特徴とする請求項1から請求項3のいずれかに記載の音声合成装置。 The sinusoidal synthesis unit, speech synthesis apparatus according to any one of claims 1 to 3, characterized in that to generate the sinusoidal synthesis components in correspondence with an impulse sound source in the one or more subbands. 入力された時系列の音源制御情報およびスペクトル特性情報を基に、分割された複数の周波数帯域で音声波形を合成する音声合成方法であって、
1以上のサブバンドにおいて出力目標音声のスペクトル特性を模擬するように、振幅調整された複数の正弦波を足し合わせて合成した正弦波合成成分出力するステップ
前記1以上のサブバンドにおいて前記出力された正弦波合成成分を単一の音声波形に合成するステップと、を含むことを特徴とする音声合成方法。
A speech synthesis method for synthesizing speech waveforms in a plurality of divided frequency bands based on input time-series sound source control information and spectrum characteristic information,
To simulate the spectral characteristics of the output target speech in one or more sub-band, the steps of the sinusoidal synthesis components output was synthesized adding the plurality of sine wave whose amplitude is adjusted,
Synthesizing the output sine wave synthesis component in the one or more subbands into a single speech waveform.
入力された時系列の音源制御情報およびスペクトル特性情報を基に、分割された複数の周波数帯域で音声波形を合成する音声合成プログラムであって、
1以上のサブバンドにおいて出力目標音声のスペクトル特性を模擬するように、振幅調整された複数の正弦波を足し合わせて合成した正弦波合成成分を出力する処理と、
前記1以上のサブバンドにおいて前記出力された正弦波合成成分を単一の音声波形に合成する処理と、をコンピュータに実行させることを特徴とする音声合成プログラム。
A speech synthesis program for synthesizing speech waveforms in a plurality of divided frequency bands based on input time-series sound source control information and spectrum characteristic information,
Processing to output a sine wave synthesis component obtained by adding and synthesizing a plurality of amplitude-adjusted sine waves so as to simulate the spectral characteristics of the output target speech in one or more subbands;
A speech synthesis program that causes a computer to execute a process of synthesizing the output sine wave synthesis component in the one or more subbands into a single speech waveform.
JP2012263574A 2012-11-30 2012-11-30 Speech synthesis apparatus, speech synthesis method, and speech synthesis program Active JP6284298B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2012263574A JP6284298B2 (en) 2012-11-30 2012-11-30 Speech synthesis apparatus, speech synthesis method, and speech synthesis program

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2012263574A JP6284298B2 (en) 2012-11-30 2012-11-30 Speech synthesis apparatus, speech synthesis method, and speech synthesis program

Related Child Applications (1)

Application Number Title Priority Date Filing Date
JP2017131338A Division JP6410890B2 (en) 2017-07-04 2017-07-04 Speech synthesis apparatus, speech synthesis method, and speech synthesis program

Publications (3)

Publication Number Publication Date
JP2014109667A JP2014109667A (en) 2014-06-12
JP2014109667A5 true JP2014109667A5 (en) 2015-10-08
JP6284298B2 JP6284298B2 (en) 2018-02-28

Family

ID=51030335

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2012263574A Active JP6284298B2 (en) 2012-11-30 2012-11-30 Speech synthesis apparatus, speech synthesis method, and speech synthesis program

Country Status (1)

Country Link
JP (1) JP6284298B2 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2022055464A (en) * 2020-09-29 2022-04-08 Kddi株式会社 Speech analyzing device, method, and program
CN112863477B (en) * 2020-12-31 2023-06-27 出门问问(苏州)信息科技有限公司 Speech synthesis method, device and storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3557662B2 (en) * 1994-08-30 2004-08-25 ソニー株式会社 Speech encoding method and speech decoding method, and speech encoding device and speech decoding device
JP3659053B2 (en) * 1998-04-23 2005-06-15 ヤマハ株式会社 Waveform data generation method, recording medium recording waveform data generation program, and waveform data generation apparatus
JP4019824B2 (en) * 2002-07-08 2007-12-12 ソニー株式会社 Waveform generating apparatus and method, and decoding apparatus
EP1851760B1 (en) * 2005-02-10 2015-10-07 Koninklijke Philips N.V. Sound synthesis
JP5743137B2 (en) * 2011-01-14 2015-07-01 ソニー株式会社 Signal processing apparatus and method, and program

Similar Documents

Publication Publication Date Title
CA2976864C (en) Apparatus and method for processing an audio signal to obtain a processed audio signal using a target time-domain envelope
CA2778205C (en) Apparatus and method for generating a high frequency audio signal using adaptive oversampling
RU2018130424A (en) HARMONIC TRANSFORMATION IMPROVED BY CROSS-BAND PRODUCTION
RU2012141098A (en) PROCESSING SOUND SIGNALS DURING HIGH FREQUENCY RECONSTRUCTION
JP2012145895A5 (en)
TWI456566B (en) Apparatus and method for modifying an audio signal using envelope shaping
JP2008519491A5 (en)
JP2015526769A (en) Apparatus and method for reproducing audio signal, apparatus and method for generating encoded audio signal, computer program, and encoded audio signal
JP2006243178A5 (en)
TWI457011B (en) Apparatus and method for spatially selective sound acquisition by acoustic triangulation
TW200742477A (en) Method for virtual bass synthesis
CN102985970A (en) Improved magnitude response and temporal alignment in phase vocoder based bandwidth extension for audio signals
WO2009048239A3 (en) Encoding and decoding method using variable subband analysis and apparatus thereof
JP2019101093A5 (en) Speech synthesis method, speech synthesis system and program
TW200734888A (en) Visualization system of acoustic source energy distribution and the method thereof
MY172710A (en) Apparatus and method for generating a frequency enhancement signal using an energy limitation operation
JP2014109667A5 (en)
JP5454330B2 (en) Sound processor
JP6831767B2 (en) Speech recognition methods, devices and programs
JP2014109667A (en) Speech synthesizer, speech synthesis method, and speech synthesis program
Lee et al. Virtual bass system based on a multiband harmonic generation
Alyushin et al. The Technology of Figurative Analysis in the Problems of Speech Information Digital Processing
JP6191238B2 (en) Sound processing apparatus and sound processing method
Hideki et al. Realtime conversion of growl-type voice qualities based on modulation and approximate time-varying filtering driven by a non-linear oscillator: Formulation
van der Vorm Transform coding of audio impulse responses