CN107039033A

CN107039033A - Speech synthesis device

Info

Publication number: CN107039033A
Application number: CN201710248786.2A
Authority: CN
Inventors: 王军; 陈翠琴; 尹利平
Original assignee: Hainan vocational technical college
Current assignee: Hainan vocational technical college
Priority date: 2017-04-17
Filing date: 2017-04-17
Publication date: 2017-08-11

Abstract

The present invention relates to a speech synthesis device comprising a voice building module, a receiving module, a tone processing module, a model correction module, and a synthesis module. The tone processing module generates, for the received text to be synthesized, tone information for influencing the synthesized speech according to state information indicating an emotional state. The synthesis module then produces synthesized speech data carrying the tone, so that the synthesized speech sounds more natural and the user experience is improved.

Description

A speech synthesis device

Technical field

The present invention relates to the field of speech synthesis, and in particular to a speech synthesis device.

Background art

Speech synthesis, also known as text-to-speech (TTS) technology, converts arbitrary text into standard, fluent speech and reads it aloud in real time, with the aim of making the synthesized speech as intelligible and natural as possible. It is equivalent to fitting the machine with an artificial face.

Synthesized speech should convey information in a natural, emotionally expressive way, ideally with strong prosodic expressiveness. Speech can be synthesized in distinctive styles, such as an emotionally rich novel-reading style, a storytelling style, and informal styles such as a humorous style, which increases the variety of synthesized speech and meets people's different needs.

In existing speech synthesis systems, the input text goes through a series of processing steps such as text preprocessing and word segmentation, and then enters a prosodic hierarchy prediction module. An acoustic model then generates the target acoustic parameter sequence, from which speech is finally synthesized. In parametric synthesis systems, speech is generated by a vocoder. Because this approach does not need to splice original sound fragments, it can achieve a small footprint and is therefore widely used on embedded devices.
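The conventional parametric pipeline described above can be illustrated with a minimal Python sketch. Every stage here is a hypothetical placeholder (whitespace splitting for word segmentation, a single phrase break for prosody prediction, a sine-wave generator for the vocoder); none of these names or behaviours come from the patent, and a real system would use trained models at each stage.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class AcousticFrame:
    f0_hz: float   # fundamental frequency of the frame
    energy: float  # linear amplitude of the frame


def preprocess(text: str) -> str:
    """Text preprocessing placeholder: collapse runs of whitespace."""
    return " ".join(text.split())


def segment(text: str) -> List[str]:
    """Word segmentation placeholder: split on single spaces."""
    return text.split(" ") if text else []


def predict_prosody(words: List[str]) -> List[int]:
    """Prosodic hierarchy placeholder: 1 marks a phrase break after the last word."""
    return [1 if i == len(words) - 1 else 0 for i in range(len(words))]


def acoustic_model(words: List[str], breaks: List[int]) -> List[AcousticFrame]:
    """Acoustic model placeholder: one frame per word, lower pitch at a break."""
    return [AcousticFrame(100.0 if b else 120.0, 1.0) for _, b in zip(words, breaks)]


def vocoder(frames: List[AcousticFrame], sample_rate: int = 8000,
            frame_len: int = 80) -> List[float]:
    """Toy vocoder: render each frame as a sine wave at its f0.

    No recorded sound fragments are spliced; samples are generated
    purely from the acoustic parameters.
    """
    samples = []
    for n, frame in enumerate(frames):
        for k in range(frame_len):
            t = (n * frame_len + k) / sample_rate
            samples.append(frame.energy * math.sin(2 * math.pi * frame.f0_hz * t))
    return samples


def synthesize(text: str) -> List[float]:
    """Run the stages in the order given in the description."""
    words = segment(preprocess(text))
    frames = acoustic_model(words, predict_prosody(words))
    return vocoder(frames)
```

Calling `synthesize("hello   world")` yields one 80-sample frame per word. The point of the sketch is only the ordering of the stages and the fact that the waveform is computed from parameters rather than from stored audio.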

At present, synthesized speech is mainly adjusted with rule-based methods. Such methods cannot take fine details of speech, such as tone information, into account, so the synthesized speech has low naturalness and the user experience suffers.

Summary of the invention

To solve the above technical problem, the present invention provides a speech synthesis device.

The present invention is implemented by the following technical solution. A speech synthesis device comprises:

A voice building module, for building a speech synthesis model in advance from a large amount of collected speech data;

A receiving module, for receiving the user's text to be synthesized;

A tone processing module, for generating, for the received text to be synthesized, tone information for influencing the synthesized speech according to state information indicating an emotional state;

A model correction module, for correcting the speech synthesis model according to the tone information of the synthesized speech;

A synthesis module, for performing speech synthesis according to the speech synthesis model with tone information as corrected by the model correction module, to obtain synthesized speech data carrying the tone.
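The data flow between the five modules listed above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the model is reduced to a single average pitch, the "collected speech data" is a short list of pitch values, and the emotion-to-pitch table, class names, and function names are all hypothetical.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple


@dataclass(frozen=True)
class SynthesisModel:
    base_pitch_hz: float      # average pitch learned from the corpus
    pitch_scale: float = 1.0  # set by the model correction module


@dataclass(frozen=True)
class ToneInfo:
    pitch_scale: float        # multiplicative change applied to pitch


# Hypothetical mapping from emotional state to tone information.
EMOTION_TO_PITCH_SCALE = {"neutral": 1.0, "happy": 1.2, "sad": 0.85}


def build_model(corpus_pitches_hz: List[float]) -> SynthesisModel:
    """Voice building module: fit a toy model to collected speech data."""
    return SynthesisModel(sum(corpus_pitches_hz) / len(corpus_pitches_hz))


def receive(text: str) -> str:
    """Receiving module: accept the user's text to be synthesized."""
    return text.strip()


def process_tone(text: str, emotional_state: str) -> ToneInfo:
    """Tone processing module: derive tone information from the state.

    The text is accepted but unused in this toy version; unknown states
    fall back to a neutral scale of 1.0.
    """
    return ToneInfo(EMOTION_TO_PITCH_SCALE.get(emotional_state, 1.0))


def correct_model(model: SynthesisModel, tone: ToneInfo) -> SynthesisModel:
    """Model correction module: adjust the model using the tone information."""
    return replace(model, pitch_scale=tone.pitch_scale)


def synthesize(model: SynthesisModel, text: str) -> List[Tuple[str, float]]:
    """Synthesis module: emit one (word, pitch in Hz) pair per word."""
    pitch = model.base_pitch_hz * model.pitch_scale
    return [(word, pitch) for word in text.split()]


def run_device(text: str, emotional_state: str) -> List[Tuple[str, float]]:
    """Wire the modules together in the order given in the description."""
    model = build_model([110.0, 120.0, 130.0])
    received = receive(text)
    tone = process_tone(received, emotional_state)
    return synthesize(correct_model(model, tone), received)
```

The model is built once, ahead of time, and the correction step returns a new model object rather than mutating the original, so the same base model could be reused for texts with different emotional states.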

Preferably, the tone processing module includes a tone parameter generation unit and a tone information conversion unit. The tone parameter generation unit generates tone influence parameters according to the state information indicating the emotional state, and the tone information conversion unit converts the tone influence parameters generated by the tone parameter generation unit into tone influence information.
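The two-stage structure of the tone processing module can be sketched as a lookup followed by a conversion. The parameter names `intensity` and `variability`, the lookup table, and the conversion formulas below are illustrative assumptions; the description does not specify what the tone influence parameters are or how they are converted.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class ToneInfluenceInfo:
    volume_balance: float         # overall gain for the synthesized speech
    amplitude_fluctuation: float  # depth of amplitude variation, 0.0 to 1.0


# Hypothetical table: emotional state -> tone influence parameters.
STATE_TO_PARAMETERS: Dict[str, Dict[str, float]] = {
    "neutral": {"intensity": 0.0, "variability": 0.0},
    "happy": {"intensity": 0.4, "variability": 0.3},
    "sad": {"intensity": -0.4, "variability": 0.1},
}


def generate_tone_parameters(state: str) -> Dict[str, float]:
    """Tone parameter generation unit: look up parameters for the state.

    Unknown states fall back to the neutral entry.
    """
    return dict(STATE_TO_PARAMETERS.get(state, STATE_TO_PARAMETERS["neutral"]))


def convert_to_tone_info(params: Dict[str, float]) -> ToneInfluenceInfo:
    """Tone information conversion unit: map parameters to control values."""
    volume = 1.0 + 0.5 * params["intensity"]
    fluctuation = min(1.0, max(0.0, params["variability"]))  # clamp to [0, 1]
    return ToneInfluenceInfo(volume, fluctuation)
```

Keeping generation and conversion separate means the state-to-parameter table can change without touching the code that produces the values consumed downstream.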

Preferably, the model correction module includes:

A tone acquiring unit, for obtaining tone data corresponding to the tone information generated by the tone processing module;

A tone recognition unit, for performing tone recognition on the tone data to obtain tone recognition text;

An acoustic feature extraction unit, for extracting acoustic features of the text to be synthesized;

A voice correction unit, for correcting the speech synthesis model according to the tone recognition text, to obtain the corrected speech synthesis model.

Preferably, the model correction module further includes a preprocessing unit, for removing noise from the user's text to be synthesized received by the receiving module.
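One possible reading of the units of the model correction module is sketched below. The description does not define the form of the tone data, the tone recognition text, or the acoustic features, so everything concrete here is an assumption made for illustration: tone data is a short pitch track, the recognition text is one of the labels "rising", "falling", or "level", noise is taken to mean non-printable characters, and the acoustic features are toy values computed from the text.

```python
from dataclasses import dataclass, replace
from typing import Dict, List


@dataclass(frozen=True)
class SynthesisModel:
    base_pitch_hz: float
    contour: str = "level"  # tone label applied during synthesis


def remove_noise(text: str) -> str:
    """Preprocessing unit: replace non-printable characters, collapse spaces."""
    cleaned = "".join(ch if ch.isprintable() else " " for ch in text)
    return " ".join(cleaned.split())


def acquire_tone_data(tone_info: Dict[str, float], frames: int = 5) -> List[float]:
    """Tone acquiring unit: expand tone information into a pitch track in Hz."""
    start, slope = tone_info["start_hz"], tone_info["slope_hz_per_frame"]
    return [start + slope * i for i in range(frames)]


def recognize_tone(tone_data: List[float], tolerance_hz: float = 1.0) -> str:
    """Tone recognition unit: label the pitch track as text."""
    delta = tone_data[-1] - tone_data[0]
    if delta > tolerance_hz:
        return "rising"
    if delta < -tolerance_hz:
        return "falling"
    return "level"


def extract_acoustic_features(text: str) -> Dict[str, float]:
    """Acoustic feature extraction unit: toy features computed from the text."""
    return {"word_count": float(len(text.split())),
            "is_question": 1.0 if text.endswith("?") else 0.0}


def correct_model(model: SynthesisModel, tone_text: str) -> SynthesisModel:
    """Voice correction unit: apply the recognized tone text to the model."""
    return replace(model, contour=tone_text)
```

A typical call sequence would clean the text with `remove_noise`, build a pitch track with `acquire_tone_data`, label it with `recognize_tone`, and pass the label to `correct_model`. How the extracted acoustic features feed into the correction is not stated in the description, so the sketch computes them but leaves them unused.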

Preferably, the tone influence information is a characteristic parameter extracted from the waveform data of speech.

Preferably, the tone influence information is a control parameter for controlling speech synthesis.

Preferably, the control parameter is used to control the volume balance and the amplitude fluctuation of the synthesized speech.
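To make the two control parameters concrete, the following sketch applies them to a waveform. It assumes that volume balance acts as an overall gain and that amplitude fluctuation is the depth of a slow sinusoidal amplitude modulation; the description names the two quantities but does not define them, and the function name, the 4 Hz modulation rate, and the sample rate are hypothetical.

```python
import math
from typing import List


def apply_control(samples: List[float], volume_balance: float,
                  amplitude_fluctuation: float, sample_rate: int = 8000,
                  fluctuation_rate_hz: float = 4.0) -> List[float]:
    """Scale a waveform by a gain and a slow sinusoidal amplitude envelope."""
    if not 0.0 <= amplitude_fluctuation <= 1.0:
        raise ValueError("amplitude_fluctuation must be in [0, 1]")
    out = []
    for n, s in enumerate(samples):
        # The envelope oscillates between 1 - depth and 1 + depth.
        envelope = 1.0 + amplitude_fluctuation * math.sin(
            2 * math.pi * fluctuation_rate_hz * n / sample_rate)
        out.append(volume_balance * envelope * s)
    return out
```

With `amplitude_fluctuation` set to 0.0 the function reduces to a plain gain, which makes the two parameters easy to test independently.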

The beneficial effects of the present invention are as follows. After receiving the user's text to be synthesized, the device generates tone information for influencing the synthesized speech according to state information indicating an emotional state, and then corrects the speech synthesis model according to that tone information. The resulting synthesized speech data carries the tone, so the synthesized speech sounds more natural and the user experience is improved.

Brief description of the drawings

Fig. 1 is a structural block diagram of the speech synthesis device of the present invention.

Detailed description of the embodiments

To make the objects, technical solutions, and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawing.

As shown in Fig. 1, the present invention provides a speech synthesis device, comprising:

A receiving module, for receiving the user's text to be synthesized;

Preferably, the model correction module includes:

The above disclosure is merely a preferred embodiment of the present invention and certainly cannot be used to limit the scope of the claims of the present invention. Equivalent variations made in accordance with the claims of the present invention therefore still fall within the scope covered by the present invention.

Claims

1. A speech synthesis device, characterized by comprising:

A receiving module, for receiving the user's text to be synthesized;

A tone processing module, for generating, for the received text to be synthesized, tone information for influencing the synthesized speech according to state information indicating an emotional state;

A synthesis module, for performing speech synthesis according to the speech synthesis model with tone information as corrected by the model correction module, to obtain synthesized speech data carrying the tone.

2. The speech synthesis device according to claim 1, characterized in that the tone processing module includes a tone parameter generation unit and a tone information conversion unit, wherein the tone parameter generation unit generates tone influence parameters according to the state information indicating the emotional state, and the tone information conversion unit converts the tone influence parameters generated by the tone parameter generation unit into tone influence information.

3. The speech synthesis device according to claim 1, characterized in that the model correction module includes:

A tone acquiring unit, for obtaining tone data corresponding to the tone information generated by the tone processing module;

A voice correction unit, for correcting the speech synthesis model according to the tone recognition text, to obtain the corrected speech synthesis model.

4. The speech synthesis device according to claim 3, characterized in that the model correction module further includes a preprocessing unit, for removing noise from the user's text to be synthesized received by the receiving module.

5. The speech synthesis device according to claim 2, characterized in that the tone influence information is a characteristic parameter extracted from the waveform data of speech.

6. The speech synthesis device according to claim 2, characterized in that the tone influence information is a control parameter for controlling speech synthesis.

7. The speech synthesis device according to claim 6, characterized in that the control parameter is used to control the volume balance and the amplitude fluctuation of the synthesized speech.