CN104464716A - Voice broadcasting system and method - Google Patents

Voice broadcasting system and method Download PDF

Info

Publication number
CN104464716A
CN104464716A CN201410670671.9A CN201410670671A CN104464716A CN 104464716 A CN104464716 A CN 104464716A CN 201410670671 A CN201410670671 A CN 201410670671A CN 104464716 A CN104464716 A CN 104464716A
Authority
CN
China
Prior art keywords
voice
characteristic
module
word message
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410670671.9A
Other languages
Chinese (zh)
Other versions
CN104464716B (en
Inventor
王程程
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unisound Intelligent Technology Co Ltd
Xiamen Yunzhixin Intelligent Technology Co Ltd
Original Assignee
Beijing Yunzhisheng Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Yunzhisheng Information Technology Co Ltd filed Critical Beijing Yunzhisheng Information Technology Co Ltd
Priority to CN201410670671.9A priority Critical patent/CN104464716B/en
Publication of CN104464716A publication Critical patent/CN104464716A/en
Application granted granted Critical
Publication of CN104464716B publication Critical patent/CN104464716B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

The invention relates to a voice broadcasting system and method. Sample voice matched with a written message broadcaster role is recorded and sent to a voice storage module through a first network communication module and a second network communication module; stored voice data are acquired, voice feature parameters are extracted from the acquired voice data, model training is conducted on the voice feature parameters, and then a feature voice model is acquired; written messages needing to be broadcast by a user through voice are collected, and the collected written messages are sent to a feature voice synthesis module through the first network communication module and the second network communication module; the feature voice model and the written messages are acquired, feature voice with broadcaster voice features and written message content is synthesized, and feature voice data are stored in the voice storage module; the feature voice is broadcast. By the adoption of the system and method, voice with written message sender voice features can be broadcast, individualization is high, and the voice can be easily accepted by a hearer.

Description

A kind of voice broadcasting system and method
Technical field
The present invention relates to speech synthesis technique field, particularly a kind of voice broadcasting system and method.
Background technology
In daily life; we often can run into the situation cannot reading SMS because being busy with working in hand; such as: drive, beat keyboard; for this situation; can only waiting in hand works stop time to leaf through mobile phone short message; and for very important SMS, opportunity may be missed because not watching response in time, thus bring loss.
Existing by by note word synthetic speech in prior art, thus carry out voice broadcast missed call, massage voice reading unread short messages.Phonetic synthesis, also known as literary periodicals (Text to Speech) technology, produced the technology of artificial voice by the method for machinery, electronics, it computing machine oneself is produced or the Word message of outside input change into the mankind can listen understand, technology that fluent natural language exports.
But, a kind of sound model that the unified employing of speech sound feature that existing voice broadcasting modes is reported is extracted in advance, the speech sound of synthesis is single, and the voice played out can not realize having identical intonation, the rhythm with the sender of text message, cause that the voice reported out are stiff, emotional expression is insufficient, lacking individuality, not easily accept by hearer.Therefore, a kind of language play back system with Word message sender characteristic voice is badly in need of.
Summary of the invention
Technical matters to be solved by this invention provides one to have Word message sender characteristic voice voice broadcasting system and method for the deficiencies in the prior art.
The technical scheme that the present invention solves the problems of the technologies described above is as follows: a kind of voice broadcasting system, comprises client and server system; Described client comprises characteristic sound and records module, Word message acquisition module, first network communication module and characteristic sound playing module, and described server system comprises voice storage module, characteristic sound training module, second network communication module and characteristic sound synthesis module;
Described characteristic sound records module, and described sample voice for recording the sample voice with Word message report person role match, and is sent to voice storage module through first network communication module and second network communication module by it;
Described first network communication module, it is for receiving and dispatching the transmission data between client and server system;
Described second network communication module, it is for receiving and dispatching the transmission data between client and server system;
Described voice storage module, it records the sample voice data of module acquires and the characteristic speech data of characteristic sound synthesis module synthesis for storing characteristic sound;
Described characteristic sound training module, it extracts sound characteristic parameter in the sample voice that stores from voice storage module, and carries out model training, obtains characteristic speech model, and described characteristic speech model is sent to characteristic voice synthetic module;
Described Word message acquisition module, it needs to carry out with voice the Word message reported for gathering user, and the Word message collected is sent to characteristic sound synthesis module through first network communication module and second network communication module;
Described characteristic sound synthesis module, it, for according to described characteristic speech model and described Word message, synthesizes the characteristic voice with report person's characteristic voice and Word message content, and described characteristic speech data is stored to voice storage module;
Described characteristic sound playing module, it is for playing the characteristic voice of characteristic sound synthesis module synthesis.
The invention has the beneficial effects as follows: can on all kinds of mobile terminal, mobile phone, panel computer such as, realize all kinds of Word message of voice broadcast, Word message comprises: the text message of the instant message software receipt such as newsletter archive information, e-book, SMS and QQ Fetion, micro-letter, footpath between fields, footpath between fields.When user uses reciting news text message of the present invention, e-book, a kind of voice tone color in voice storage module can be selected to play according to oneself hobby; When the information that user uses the present invention to report with other people word communication, the voice that the present invention reports have the voice of Word message sender characteristic voice, comprehensively meet the tone color demand of user to speaker dependent, personalized strong, easily accept by hearer, make the better experience effect of the acquisition of user.
On the basis of technique scheme, the present invention can also do following improvement.
Further, described characteristic sound is recorded module and is comprised voice collecting unit and address list binding unit,
Described voice collecting unit, it is for gathering the raw tone of report person, and the raw tone collected is sent to address list to bind unit;
Described address list binding unit, the sample voice data of having bound for the raw tone of report person and report person's Role Information being bound, and are sent to voice storage module through first network communication module and second network communication module by it.
Further, described first network communication module comprises voice transmitting element, Word message transmitting element and characteristic voice transmitting element; Described second network communication module comprises voice receiving unit, Word message receiving element and characteristic voice receiving unit;
Described voice transmitting element, it records the sample voice data of module output for receiving characteristic sound, and described sample voice data are sent to voice receiving unit;
Described voice receiving unit, its speech data exported for receiving described voice transmitting element, and described speech data is sent to voice storage module;
Described Word message transmitting element, its Word message exported for receiving Word message acquisition module, and described Word message is sent to Word message receiving element;
Described Word message receiving element, its Word message exported for receiving described Word message transmitting element, and described Word message is sent to characteristic sound synthesis module;
Described characteristic voice transmitting element, its characteristic speech data exported for receiving voice storage module, and described characteristic speech data is sent to characteristic voice receiving unit.
Described characteristic voice receiving unit, its characteristic speech data exported for receiving characteristic voice transmitting element, and described characteristic speech data is sent to characteristic sound playing module.
Further, described voice storage module comprises sample voice storage unit and characteristic voice memory unit;
Described sample voice storage unit, it is for receiving and storing the sample voice data that described characteristic sound records module acquires;
Described characteristic voice memory unit, its for receive and store described characteristic sound synthesis module synthesis characteristic speech data.
Further, described characteristic sound training module comprises voice annotation unit, parameter extraction unit, model training unit and model storage unit;
Described voice annotation unit, it for obtaining the sample voice of report person, and carries out voice annotation to it;
Described parameter extraction unit, it, for the sample voice marked, carries out the extraction of acoustical characteristic parameters;
Described model training unit, it obtains the characteristic speech model of report person for carrying out model training to acoustical characteristic parameters, and described characteristic speech model is stored to model storage unit;
Described model storage unit, described model for receiving and storing the characteristic speech model of report person, and is sent to characteristic sound synthesis module by it.
Further, described characteristic sound synthesis module comprises text-processing unit, parameter prediction unit and phonetic synthesis unit;
Described text-processing unit, its Word message exported for being received Word message acquisition module by first network communication module and second network communication module, and the mark described Word message being translated into phonetic synthesis unit can identify;
Described parameter prediction unit, it extracts parameters,acoustic corresponding to current text information for the characteristic speech model exported according to mark and the characteristic sound training module of Word message;
Described phonetic synthesis unit, it, for carrying out phonetic synthesis according to the parameters,acoustic corresponding with text message, exports the characteristic voice with report person pronunciation characteristic consistent with current text information.
In order to solve the technical problem, the present invention also provides a kind of voice broadcast method, comprises the following steps,
S101: record the sample voice with Word message report person role match, and described sample voice is sent to voice storage module through first network communication module and second network communication module;
S102: obtain voice data, extracts sound characteristic parameter, and carries out model training to described sound characteristic parameter, obtain characteristic speech model from the speech data obtained;
S103: gathering user needs to carry out with voice the Word message reported, and the Word message collected is sent to characteristic sound synthesis module through first network communication module and second network communication module;
S104: obtain described characteristic speech model and described Word message, synthesis has the characteristic voice of report person's characteristic voice and Word message content, and described characteristic speech data is stored to voice storage module;
S105: play characteristic voice.
The invention has the beneficial effects as follows: can on all kinds of mobile terminal, mobile phone, panel computer, notebook, vehicle-mounted computer such as, realize all kinds of Word message of voice broadcast, Word message comprises: the text message of the instant message software receipt such as newsletter archive information, e-book, SMS and QQ Fetion, micro-letter, footpath between fields, footpath between fields.When user uses reciting news text message of the present invention, e-book, a kind of voice tone color in voice storage module can be selected to play according to oneself hobby; When the information that user uses the present invention to report with other people word communication, the voice that the present invention reports have the voice of Word message sender characteristic voice, comprehensively meet the tone color demand of user to speaker dependent, personalized strong, easily accept by hearer, make the better experience effect of the acquisition of user.
On the basis of technique scheme, the present invention can also do following improvement.
Further, step S101 is specially:
S101a: the raw tone gathering report person, and the raw tone collected sent to address list to bind unit;
S101b: the raw tone of report person and report person's Role Information are bound, and the speech data bound is sent to voice storage module through first network communication module and second network communication module.
Further, described step S102 is specially,
S102a: the sample voice obtaining report person, and voice annotation is carried out to it;
S102b: obtain the sample voice marked, extraction acoustical characteristic parameters is carried out to it;
S102c: obtain acoustical characteristic parameters, model training is carried out to it, obtains the characteristic speech model of report person;
S102d: obtain and store the characteristic speech model of report person;
Further, described step S104 is specially,
S104a: obtain the Word message gathered, be translated into the mark that phonetic synthesis unit can identify;
S104b: the characteristic speech model of the mark and the output of characteristic sound training module that obtain Word message extracts parameters,acoustic corresponding to current text information;
S104c: obtain the parameters,acoustic corresponding with text message, carry out phonetic synthesis according to described parameters,acoustic, exports the characteristic voice with report person pronunciation characteristic consistent with current text information.
Accompanying drawing explanation
Fig. 1 is voice broadcasting system composition structural drawing;
Fig. 2 is voice broadcasting system inner module composition structural drawing;
Fig. 3 is voice broadcast method process flow diagram.
Embodiment
Be described principle of the present invention and feature below in conjunction with accompanying drawing, example, only for explaining the present invention, is not intended to limit scope of the present invention.
Fig. 1 is voice broadcasting system composition structural drawing, and as shown in Figure 1, a kind of voice broadcasting system, comprises client and server system; Client can be installed on all kinds of mobile terminal, and mobile terminal comprises mobile phone, panel computer, notebook, vehicle-mounted computer etc.Client starts when user needs to carry out voice broadcast.
Client comprises characteristic sound and records module, Word message acquisition module, first network communication module and characteristic sound playing module; Server system comprises voice storage module, characteristic sound training module, second network communication module and characteristic sound synthesis module;
Characteristic sound records module, for recording the sample voice with Word message report person role match, and sample voice is sent to voice storage module through first network communication module and second network communication module;
Characteristic sound refers to the sound pronunciation feature with particular person, can the voice of fuzzy diagnosis speaker identity according to this pronunciation characteristic.According to the speech samples recording different people in the present invention, so that synthesize the voice with speaker characteristic voice according to speech samples.In addition, time for particular person recording characteristic sound, the characteristic sound of the identity Role Information of characteristic people and recording can be bound, be characteristic phonetic symbol and note the role of speaker, Role Information can be the name of speaker, this name can be stored in cell phone address book list, also can be the pet name in the buddy list of the Instant Messenger (IM) software such as the QQ pet name, the Fetion pet name, micro-letter pet name, footpath between fields, the footpath between fields pet name.
First network communication module and second network communication module, for receiving and dispatching the transmission data between client and server system; Network in described first network communication module and second network communication module can be wide area network or LAN (Local Area Network).
Voice storage module, records the sample voice data of module acquires and the characteristic speech data of characteristic sound synthesis module synthesis for storing characteristic sound; The voice storage module i.e. database of a storaged voice, stores the sample voice of particular person and the characteristic voice of later stage synthesis in this database.Wherein, sample voice stores with the form of multiple recording short sentence, the voice that user can record frequent contact are as required stored in this database as characteristic speech samples, each contact person's initial speech sample length can be half an hour by one hour, due to synthesis characteristic sound effect along with database expand better, be pursue the sound effect that more emulates in the later stage, by increasing the mode expanding data storehouse of sample voice duration.
Characteristic sound training module, it extracts sound characteristic parameter in the sample voice that stores from voice storage module, and carries out model training, obtains characteristic speech model, and described characteristic speech model is sent to characteristic voice synthetic module;
Word message acquisition module, it needs to carry out with voice the Word message reported for gathering user, and the Word message collected is sent to characteristic sound synthesis module through first network communication module and second network communication module; The Word message gathered, can be newsletter archive information, e-book, also can be the Word message of the instant message software receipt such as SMS and QQ Fetion, micro-letter, footpath between fields, footpath between fields, client be gathered by card format and is collected Word message, as the input of subsequent voice synthesis system.
Characteristic sound synthesis module, it, for according to described characteristic speech model and described Word message, synthesizes the characteristic voice with report person's characteristic voice and Word message content, and described characteristic speech data is stored to voice storage module;
Characteristic sound playing module, it is for playing the characteristic voice of characteristic sound synthesis module synthesis.
The present invention can on all kinds of mobile terminal, mobile phone, panel computer such as, realize all kinds of Word message of voice broadcast, Word message comprises: the text message of the instant message software receipt such as newsletter archive information, e-book, SMS and QQ Fetion, micro-letter, footpath between fields, footpath between fields.When user uses reciting news text message of the present invention, e-book, a kind of voice tone color in voice storage module can be selected to play according to oneself hobby; When the information that user uses the present invention to report with other people word communication, the voice that the present invention reports have the voice of Word message sender characteristic voice, comprehensively meet the tone color demand of user to speaker dependent, personalized strong, easily accept by hearer, make the better experience effect of the acquisition of user.
Fig. 2 is voice broadcasting system inner module composition structural drawing, as shown in Figure 2, characteristic sound recording module comprises voice collecting unit and address list binding unit, voice collecting unit, it is for gathering the raw tone of report person, and the raw tone collected is sent to address list to bind unit; Address list binding unit, the sample voice data of having bound for the raw tone of report person and report person's Role Information being bound, and are sent to voice storage module through first network communication module and second network communication module by it.
First network communication module comprises voice transmitting element, Word message transmitting element and characteristic voice transmitting element; Second network communication module comprises voice receiving unit, Word message receiving element and characteristic voice receiving unit; Voice transmitting element, it records the sample voice data of module output for receiving characteristic sound, and described sample voice data are sent to voice receiving unit; Voice receiving unit, its speech data exported for receiving described voice transmitting element, and described speech data is sent to voice storage module; Word message transmitting element, its Word message exported for receiving Word message acquisition module, and described Word message is sent to Word message receiving element; Word message receiving element, its Word message exported for receiving described Word message transmitting element, and described Word message is sent to characteristic sound synthesis module; Characteristic voice transmitting element, its characteristic speech data exported for receiving voice storage module, and described characteristic speech data is sent to characteristic voice receiving unit.Characteristic voice receiving unit, its characteristic speech data exported for receiving characteristic voice transmitting element, and described characteristic speech data is sent to characteristic sound playing module.
Voice storage module comprises sample voice storage unit and characteristic voice memory unit; Sample voice storage unit, it is for receiving and storing the sample voice data that described characteristic sound records module acquires; Characteristic voice memory unit, its for receive and store described characteristic sound synthesis module synthesis characteristic speech data.
Characteristic sound training module comprises voice annotation unit, parameter extraction unit, model training unit and model storage unit; Voice annotation unit, it for obtaining the sample voice of report person, and carries out voice annotation to it; The content of mark comprises: the syllable phoneme cutting of speech data and mark, stress and prosodic labeling, character/word border and part-of-speech tagging, identifies the background noise mark of voice.Parameter extraction unit, it, for the sample voice marked, carries out the extraction of acoustical characteristic parameters; Acoustical characteristic parameters comprises fundamental frequency and spectrum signature parameter.Model training unit, it obtains the characteristic speech model of report person for carrying out model training to acoustical characteristic parameters, and described characteristic speech model is stored to model storage unit; Model storage unit, described model for receiving and storing the characteristic speech model of report person, and is sent to characteristic sound synthesis module by it.
Characteristic sound synthesis module comprises text-processing unit, parameter prediction unit and phonetic synthesis unit; Text-processing unit, its Word message exported for being received Word message acquisition module by first network communication module and second network communication module, and the mark described Word message being translated into phonetic synthesis unit can identify; Parameter prediction unit, it extracts parameters,acoustic corresponding to current text information for the characteristic speech model exported according to mark and the characteristic sound training module of Word message; When having synthesis demand after gathering Word message, if stored the sound model (gathering the sound of this word information transmitter namely as characteristic speech samples) of Word message sender in model storage unit, then call the characteristic speech model of this sender, as one of the input of parameter prediction unit.If do not store the sound model of Word message sender in model storage unit, so then one of the input as parameter prediction unit of characteristic speech model can be exported by Real-time Collection speech samples.
Phonetic synthesis unit, it, for carrying out phonetic synthesis according to the parameters,acoustic corresponding with text message, exports the characteristic voice with report person pronunciation characteristic consistent with current text information.
Fig. 3 is voice broadcast method process flow diagram, and as shown in Figure 3, voice broadcast method comprises the following steps,
S101: record the sample voice with Word message report person role match, and described sample voice is sent to voice storage module through first network communication module and second network communication module;
S102: obtain voice data, extracts sound characteristic parameter, and carries out model training to described sound characteristic parameter, obtain characteristic speech model from the speech data obtained;
S103: gathering user needs to carry out with voice the Word message reported, and the Word message collected is sent to characteristic sound synthesis module through first network communication module and second network communication module;
S104: obtain described characteristic speech model and described Word message, synthesis has the characteristic voice of report person's characteristic voice and Word message content, and described characteristic speech data is stored to voice storage module;
S105: play characteristic voice.
Step S101 is specially:
S101a: the raw tone gathering report person, and the raw tone collected sent to address list to bind unit;
S101b: the raw tone of report person and report person's Role Information are bound, and the speech data bound is sent to voice storage module through first network communication module and second network communication module.
Step S102 is specially,
S102a: the sample voice obtaining report person, and voice annotation is carried out to it;
S102b: obtain the sample voice marked, extraction acoustical characteristic parameters is carried out to it;
S102c: obtain acoustical characteristic parameters, model training is carried out to it, obtains the characteristic speech model of report person;
S102d: obtain and store the characteristic speech model of report person;
Step S104 is specially,
S104a: obtain the Word message gathered, be translated into the mark that phonetic synthesis unit can identify;
S104b: the characteristic speech model of the mark and the output of characteristic sound training module that obtain Word message extracts parameters,acoustic corresponding to current text information;
S104c: obtain the parameters,acoustic corresponding with text message, carry out phonetic synthesis according to described parameters,acoustic, exports the characteristic voice with report person pronunciation characteristic consistent with current text information.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, within the spirit and principles in the present invention all, any amendment done, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (10)

1. a voice broadcasting system, it is characterized in that, comprise client and server system, described client comprises characteristic sound and records module, Word message acquisition module, first network communication module and characteristic sound playing module, and described server system comprises voice storage module, characteristic sound training module, second network communication module and characteristic sound synthesis module;
Described characteristic sound records module, and described sample voice for recording the sample voice with Word message report person role match, and is sent to voice storage module through first network communication module and second network communication module by it;
Described first network communication module, it is for receiving and dispatching the transmission data between client and server system;
Described second network communication module, it is for receiving and dispatching the transmission data between client and server system;
Described voice storage module, it records the sample voice data of module acquires and the characteristic speech data of characteristic sound synthesis module synthesis for storing characteristic sound;
Described characteristic sound training module, it extracts sound characteristic parameter in the sample voice that stores from voice storage module, and carries out model training, obtains characteristic speech model, and described characteristic speech model is sent to characteristic voice synthetic module;
Described Word message acquisition module, it needs to carry out with voice the Word message reported for gathering user, and the Word message collected is sent to characteristic sound synthesis module through first network communication module and second network communication module;
Described characteristic sound synthesis module, it, for according to described characteristic speech model and described Word message, synthesizes the characteristic voice with report person's characteristic voice and Word message content, and described characteristic speech data is stored to voice storage module;
Described characteristic sound playing module, it is for playing the characteristic voice of characteristic sound synthesis module synthesis.
2. a kind of voice broadcasting system according to claim 1, is characterized in that, described characteristic sound is recorded module and comprised voice collecting unit and address list binding unit,
Described voice collecting unit, it is for gathering the raw tone of report person, and the raw tone collected is sent to address list to bind unit;
Described address list binding unit, the sample voice data of having bound for the raw tone of report person and report person's Role Information being bound, and are sent to voice storage module through first network communication module and second network communication module by it.
3. a kind of voice broadcasting system according to claim 1, it is characterized in that, described first network communication module comprises voice transmitting element, Word message transmitting element and characteristic voice transmitting element; Described second network communication module comprises voice receiving unit, Word message receiving element and characteristic voice receiving unit;
Described voice transmitting element, it records the sample voice data of module output for receiving characteristic sound, and described sample voice data are sent to voice receiving unit;
Described voice receiving unit, its speech data exported for receiving described voice transmitting element, and described speech data is sent to voice storage module;
Described Word message transmitting element, its Word message exported for receiving Word message acquisition module, and described Word message is sent to Word message receiving element;
Described Word message receiving element, its Word message exported for receiving described Word message transmitting element, and described Word message is sent to characteristic sound synthesis module;
Described characteristic voice transmitting element, its characteristic speech data exported for receiving voice storage module, and described characteristic speech data is sent to characteristic voice receiving unit.
Described characteristic voice receiving unit, its characteristic speech data exported for receiving characteristic voice transmitting element, and described characteristic speech data is sent to characteristic sound playing module.
4. a kind of voice broadcasting system according to claim 1, it is characterized in that, described voice storage module comprises sample voice storage unit and characteristic voice memory unit,
Described sample voice storage unit, it is for receiving and storing the sample voice data that described characteristic sound records module acquires;
Described characteristic voice memory unit, its for receive and store described characteristic sound synthesis module synthesis characteristic speech data.
5. a kind of voice broadcasting system according to claim 1, it is characterized in that, described characteristic sound training module comprises voice annotation unit, parameter extraction unit, model training unit and model storage unit;
Described voice annotation unit, it for obtaining the sample voice of report person, and carries out voice annotation to it;
Described parameter extraction unit, it, for the sample voice marked, carries out the extraction of acoustical characteristic parameters;
Described model training unit, it obtains the characteristic speech model of report person for carrying out model training to acoustical characteristic parameters, and described characteristic speech model is stored to model storage unit;
Described model storage unit, described model for receiving and storing the characteristic speech model of report person, and is sent to characteristic sound synthesis module by it.
6. a kind of voice broadcasting system according to claim 1, it is characterized in that, described characteristic sound synthesis module comprises text-processing unit, parameter prediction unit and phonetic synthesis unit;
Described text-processing unit, its Word message exported for being received Word message acquisition module by first network communication module and second network communication module, and the mark described Word message being translated into phonetic synthesis unit can identify;
Described parameter prediction unit, it extracts parameters,acoustic corresponding to current text information for the characteristic speech model exported according to mark and the characteristic sound training module of Word message;
Described phonetic synthesis unit, it, for carrying out phonetic synthesis according to the parameters,acoustic corresponding with text message, exports the characteristic voice with report person pronunciation characteristic consistent with current text information.
7. a voice broadcast method, is characterized in that, described voice broadcast method comprises the following steps,
S101: record the sample voice with Word message report person role match, and described sample voice is sent to voice storage module through first network communication module and second network communication module;
S102: obtain voice data, extracts sound characteristic parameter, and carries out model training to described sound characteristic parameter, obtain characteristic speech model from the speech data obtained;
S103: gathering user needs to carry out with voice the Word message reported, and the Word message collected is sent to characteristic sound synthesis module through first network communication module and second network communication module;
S104: obtain described characteristic speech model and described Word message, synthesis has the characteristic voice of report person's characteristic voice and Word message content, and described characteristic speech data is stored to voice storage module;
S105: play characteristic voice.
8. a kind of voice broadcast method according to claim 7, it is characterized in that, step S101 is specially:
S101a: the raw tone gathering report person, and the raw tone collected sent to address list to bind unit;
S101b: the raw tone of report person and report person's Role Information are bound, and the speech data bound is sent to voice storage module through first network communication module and second network communication module.
9. a kind of voice broadcast method according to claim 7, it is characterized in that, described step S102 is specially,
S102a: the sample voice obtaining report person, and voice annotation is carried out to it;
S102b: obtain the sample voice marked, extraction acoustical characteristic parameters is carried out to it;
S102c: obtain acoustical characteristic parameters, model training is carried out to it, obtains the characteristic speech model of report person;
S102d: obtain and store the characteristic speech model of report person.
10. a kind of voice broadcast method according to claim 7, it is characterized in that, described step S104 is specially,
S104a: obtain the Word message gathered, be translated into the mark that phonetic synthesis unit can identify;
S104b: the characteristic speech model of the mark and the output of characteristic sound training module that obtain Word message extracts parameters,acoustic corresponding to current text information;
S104c: obtain the parameters,acoustic corresponding with text message, carry out phonetic synthesis according to described parameters,acoustic, exports the characteristic voice with report person pronunciation characteristic consistent with current text information.
CN201410670671.9A 2014-11-20 2014-11-20 A kind of voice broadcasting system and method Active CN104464716B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410670671.9A CN104464716B (en) 2014-11-20 2014-11-20 A kind of voice broadcasting system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410670671.9A CN104464716B (en) 2014-11-20 2014-11-20 A kind of voice broadcasting system and method

Publications (2)

Publication Number Publication Date
CN104464716A true CN104464716A (en) 2015-03-25
CN104464716B CN104464716B (en) 2018-01-12

Family

ID=52910670

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410670671.9A Active CN104464716B (en) 2014-11-20 2014-11-20 A kind of voice broadcasting system and method

Country Status (1)

Country Link
CN (1) CN104464716B (en)

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105208194A (en) * 2015-08-17 2015-12-30 努比亚技术有限公司 Voice broadcast device and method
CN105304081A (en) * 2015-11-09 2016-02-03 上海语知义信息技术有限公司 Smart household voice broadcasting system and voice broadcasting method
CN105427855A (en) * 2015-11-09 2016-03-23 上海语知义信息技术有限公司 Voice broadcast system and voice broadcast method of intelligent software
CN106205602A (en) * 2015-05-06 2016-12-07 上海汽车集团股份有限公司 Speech playing method and system
CN106571136A (en) * 2016-10-28 2017-04-19 努比亚技术有限公司 Voice output device and method
CN106686135A (en) * 2017-02-22 2017-05-17 北京南师信息技术有限公司 Medicine dispensation information voice prompting system
WO2017114048A1 (en) * 2015-12-28 2017-07-06 努比亚技术有限公司 Mobile terminal and method for identifying contact
CN107154263A (en) * 2017-05-25 2017-09-12 宇龙计算机通信科技(深圳)有限公司 Sound processing method, device and electronic equipment
CN107452400A (en) * 2017-07-24 2017-12-08 珠海市魅族科技有限公司 Voice broadcast method and device, computer installation and computer-readable recording medium
CN107516509A (en) * 2017-08-29 2017-12-26 苏州奇梦者网络科技有限公司 Voice base construction method and system for news report phonetic synthesis
CN109935225A (en) * 2017-12-15 2019-06-25 富泰华工业(深圳)有限公司 Character information processor and method, computer storage medium and mobile terminal
CN109979430A (en) * 2017-12-28 2019-07-05 深圳市优必选科技有限公司 Robot story telling method and device, robot and storage medium
CN110097878A (en) * 2018-01-30 2019-08-06 阿拉的(深圳)人工智能有限公司 Polygonal color phonetic prompt method, cloud device, prompt system and storage medium
CN110459201A (en) * 2019-08-22 2019-11-15 云知声智能科技股份有限公司 A kind of phoneme synthesizing method generating new tone color
CN110751940A (en) * 2019-09-16 2020-02-04 百度在线网络技术(北京)有限公司 Method, device, equipment and computer storage medium for generating voice packet
CN110856023A (en) * 2019-11-15 2020-02-28 四川长虹电器股份有限公司 System and method for realizing customized broadcast of smart television based on TTS
CN110867177A (en) * 2018-08-16 2020-03-06 林其禹 Voice playing system with selectable timbre, playing method thereof and readable recording medium
CN111261139A (en) * 2018-11-30 2020-06-09 上海擎感智能科技有限公司 Character personification broadcasting method and system
CN111276126A (en) * 2020-02-20 2020-06-12 湖南芒果听见科技有限公司 Method and terminal for synthesizing time-administration key voice
CN111276123A (en) * 2018-11-16 2020-06-12 阿拉的(深圳)人工智能有限公司 Method and device for voice broadcasting message, computer equipment and storage medium
CN108847215B (en) * 2018-08-29 2020-07-17 北京云知声信息技术有限公司 Method and device for voice synthesis based on user timbre
CN111627417A (en) * 2019-02-26 2020-09-04 北京地平线机器人技术研发有限公司 Method and device for playing voice and electronic equipment
CN111681638A (en) * 2020-04-20 2020-09-18 深圳奥尼电子股份有限公司 Vehicle-mounted intelligent voice control method and system
CN111739536A (en) * 2020-05-09 2020-10-02 北京捷通华声科技股份有限公司 Audio processing method and device
CN111798829A (en) * 2020-06-30 2020-10-20 中国联合网络通信集团有限公司 Method, system, computer equipment and storage medium for reading text information by voice
WO2021081744A1 (en) * 2019-10-29 2021-05-06 深圳市欢太科技有限公司 Voice information processing method, apparatus, and device, and storage medium
CN113223493A (en) * 2020-01-20 2021-08-06 Tcl集团股份有限公司 Voice nursing method, device, system and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080291325A1 (en) * 2007-05-24 2008-11-27 Microsoft Corporation Personality-Based Device
CN102117614A (en) * 2010-01-05 2011-07-06 索尼爱立信移动通讯有限公司 Personalized text-to-speech synthesis and personalized speech feature extraction
CN102568472A (en) * 2010-12-15 2012-07-11 盛乐信息技术(上海)有限公司 Voice synthesis system with speaker selection and realization method thereof
CN103117057A (en) * 2012-12-27 2013-05-22 安徽科大讯飞信息科技股份有限公司 Application method of special human voice synthesis technique in mobile phone cartoon dubbing
CN103310784A (en) * 2012-03-14 2013-09-18 株式会社东芝 A text to speech method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080291325A1 (en) * 2007-05-24 2008-11-27 Microsoft Corporation Personality-Based Device
CN102117614A (en) * 2010-01-05 2011-07-06 索尼爱立信移动通讯有限公司 Personalized text-to-speech synthesis and personalized speech feature extraction
CN102568472A (en) * 2010-12-15 2012-07-11 盛乐信息技术(上海)有限公司 Voice synthesis system with speaker selection and realization method thereof
CN103310784A (en) * 2012-03-14 2013-09-18 株式会社东芝 A text to speech method and system
CN103117057A (en) * 2012-12-27 2013-05-22 安徽科大讯飞信息科技股份有限公司 Application method of special human voice synthesis technique in mobile phone cartoon dubbing

Cited By (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106205602A (en) * 2015-05-06 2016-12-07 上海汽车集团股份有限公司 Speech playing method and system
CN105208194A (en) * 2015-08-17 2015-12-30 努比亚技术有限公司 Voice broadcast device and method
CN105304081A (en) * 2015-11-09 2016-02-03 上海语知义信息技术有限公司 Smart household voice broadcasting system and voice broadcasting method
CN105427855A (en) * 2015-11-09 2016-03-23 上海语知义信息技术有限公司 Voice broadcast system and voice broadcast method of intelligent software
WO2017114048A1 (en) * 2015-12-28 2017-07-06 努比亚技术有限公司 Mobile terminal and method for identifying contact
CN106571136A (en) * 2016-10-28 2017-04-19 努比亚技术有限公司 Voice output device and method
CN106686135A (en) * 2017-02-22 2017-05-17 北京南师信息技术有限公司 Medicine dispensation information voice prompting system
CN107154263A (en) * 2017-05-25 2017-09-12 宇龙计算机通信科技(深圳)有限公司 Sound processing method, device and electronic equipment
CN107452400A (en) * 2017-07-24 2017-12-08 珠海市魅族科技有限公司 Voice broadcast method and device, computer installation and computer-readable recording medium
CN107516509A (en) * 2017-08-29 2017-12-26 苏州奇梦者网络科技有限公司 Voice base construction method and system for news report phonetic synthesis
CN107516509B (en) * 2017-08-29 2021-12-28 苏州奇梦者网络科技有限公司 Voice database construction method and system for news broadcast voice synthesis
CN109935225A (en) * 2017-12-15 2019-06-25 富泰华工业(深圳)有限公司 Character information processor and method, computer storage medium and mobile terminal
CN109979430A (en) * 2017-12-28 2019-07-05 深圳市优必选科技有限公司 Robot story telling method and device, robot and storage medium
CN109979430B (en) * 2017-12-28 2021-04-20 深圳市优必选科技有限公司 Robot story telling method and device, robot and storage medium
CN110097878A (en) * 2018-01-30 2019-08-06 阿拉的(深圳)人工智能有限公司 Polygonal color phonetic prompt method, cloud device, prompt system and storage medium
CN110867177A (en) * 2018-08-16 2020-03-06 林其禹 Voice playing system with selectable timbre, playing method thereof and readable recording medium
CN108847215B (en) * 2018-08-29 2020-07-17 北京云知声信息技术有限公司 Method and device for voice synthesis based on user timbre
CN111276123B (en) * 2018-11-16 2023-01-24 阿拉的(深圳)人工智能有限公司 Method and device for voice broadcasting message, computer equipment and storage medium
CN111276123A (en) * 2018-11-16 2020-06-12 阿拉的(深圳)人工智能有限公司 Method and device for voice broadcasting message, computer equipment and storage medium
CN111261139A (en) * 2018-11-30 2020-06-09 上海擎感智能科技有限公司 Character personification broadcasting method and system
CN111261139B (en) * 2018-11-30 2023-12-26 上海擎感智能科技有限公司 Literal personification broadcasting method and system
CN111627417A (en) * 2019-02-26 2020-09-04 北京地平线机器人技术研发有限公司 Method and device for playing voice and electronic equipment
CN111627417B (en) * 2019-02-26 2023-08-08 北京地平线机器人技术研发有限公司 Voice playing method and device and electronic equipment
CN110459201A (en) * 2019-08-22 2019-11-15 云知声智能科技股份有限公司 A kind of phoneme synthesizing method generating new tone color
CN110459201B (en) * 2019-08-22 2022-01-07 云知声智能科技股份有限公司 Speech synthesis method for generating new tone
CN110751940A (en) * 2019-09-16 2020-02-04 百度在线网络技术(北京)有限公司 Method, device, equipment and computer storage medium for generating voice packet
WO2021081744A1 (en) * 2019-10-29 2021-05-06 深圳市欢太科技有限公司 Voice information processing method, apparatus, and device, and storage medium
CN110856023A (en) * 2019-11-15 2020-02-28 四川长虹电器股份有限公司 System and method for realizing customized broadcast of smart television based on TTS
CN113223493A (en) * 2020-01-20 2021-08-06 Tcl集团股份有限公司 Voice nursing method, device, system and storage medium
CN113223493B (en) * 2020-01-20 2024-09-20 Tcl科技集团股份有限公司 Voice nursing method, device, system and storage medium
CN111276126A (en) * 2020-02-20 2020-06-12 湖南芒果听见科技有限公司 Method and terminal for synthesizing time-administration key voice
CN111681638A (en) * 2020-04-20 2020-09-18 深圳奥尼电子股份有限公司 Vehicle-mounted intelligent voice control method and system
CN111739536A (en) * 2020-05-09 2020-10-02 北京捷通华声科技股份有限公司 Audio processing method and device
CN111798829A (en) * 2020-06-30 2020-10-20 中国联合网络通信集团有限公司 Method, system, computer equipment and storage medium for reading text information by voice

Also Published As

Publication number Publication date
CN104464716B (en) 2018-01-12

Similar Documents

Publication Publication Date Title
CN104464716B (en) A kind of voice broadcasting system and method
US9318100B2 (en) Supplementing audio recorded in a media file
US9196241B2 (en) Asynchronous communications using messages recorded on handheld devices
CN1946065B (en) Method and system for remarking instant messaging by audible signal
CN101042752B (en) Method and sytem used for email administration
CN101030368B (en) Method and system for communicating across channels simultaneously with emotion preservation
US8594995B2 (en) Multilingual asynchronous communications of speech messages recorded in digital media files
US8249857B2 (en) Multilingual administration of enterprise data with user selected target language translation
CN109147800A (en) Answer method and device
US9892095B2 (en) Reconciliation of transcripts
WO2009075428A1 (en) Apparatus for and method of generating a multimedia email
CN103888581A (en) Communication terminal and method for recording communication information thereof
CN108074570A (en) Surface trimming, transmission, the audio recognition method preserved
US20140019137A1 (en) Method, system and server for speech synthesis
CN109710799B (en) Voice interaction method, medium, device and computing equipment
WO2018076664A1 (en) Voice broadcasting method and device
CN109492126B (en) Intelligent interaction method and device
CN109346057A (en) A kind of speech processing system of intelligence toy for children
US20080162559A1 (en) Asynchronous communications regarding the subject matter of a media file stored on a handheld recording device
CN104142936A (en) Audio and video match method and audio and video match device
GB2516942A (en) Text to Speech Conversion
CN110650250A (en) Method, system, device and storage medium for processing voice conversation
CN109637541A (en) The method and electronic equipment of voice conversion text
CN110740212B (en) Call answering method and device based on intelligent voice technology and electronic equipment
CN109213466B (en) Court trial information display method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: 100191, Beijing, Huayuan Road, Haidian District No. 2 peony technology building, block A, 5, A503

Patentee after: Yunzhisheng Intelligent Technology Co., Ltd.

Address before: 100191, Beijing, Huayuan Road, Haidian District No. 2 peony technology building, block A, 5, A503

Patentee before: Beijing Yunzhisheng Information Technology Co., Ltd.

TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20200403

Address after: No. 101, 1st Floor, 1st Building, Xisanqi Building Materials City, Haidian District, Beijing, 100000

Co-patentee after: Xiamen yunzhixin Intelligent Technology Co., Ltd

Patentee after: Yunzhisheng Intelligent Technology Co., Ltd.

Address before: 100191, Beijing, Huayuan Road, Haidian District No. 2 peony technology building, block A, 5, A503

Patentee before: Yunzhisheng Intelligent Technology Co., Ltd.