CN202587038U

CN202587038U - Voice data processing platform and system thereof

Info

Publication number: CN202587038U
Application number: CN 201220151755
Authority: CN
Inventors: 沈嘉鑫; 许军; 邵颖; 王钢
Original assignee: SHANGHAI CHEYIN NETWORK TECHNOLOGY Co Ltd
Current assignee: Shanghai car sound intelligent technology Co., Ltd.
Priority date: 2012-04-11
Filing date: 2012-04-11
Publication date: 2012-12-05
Anticipated expiration: 2022-04-11

Abstract

The utility model provides a voice data processing platform and a system thereof. The system comprises the voice data processing platform and a client. The system comprises a voice recognition module arranged in the voice data processing platform or the client. The voice data processing platform comprise a local side communication module for communicating with the client, a data processing module which is connected with the local side terminal communication module to process a received text which is recognized by the voice recognition module, a voice synthesis module which is used for processing a processing result of the data processing module, generating personalized voice and sending the personalized voice to the client. The client comprises a client communication module for connecting the local side communication module and a playing module which is connected with the client communication module to play voice sent by the voice data processing platform. According to the utility model, TTS technology can be employed to generate the personalized voice, and a user experience is improved.

Description

Language data process platform and system

Technical field

The utility model relates to the mobile communication technology field, is meant a kind of language data process platform and system especially.

Background technology

Along with the development and the development of electronic technology of mobile communication technology, more and more client devices (for example portable terminal) have had data processing function and data communication facility.Existing language data process platform generally all comprises: be used for the local side communication module with the client device communication, the sound identification module that is used to carry out speech recognition.After can the voice that receive from client device being discerned like this, handle accordingly.This mode can greatly facilitate user's use.But for existing language data process platform, can only be through single voice to the client device broadcast information, this mode causes the user experience sense very poor.

The utility model content

To above-mentioned defective and the problem that existing client device exists, the purpose of the utility model embodiment is to propose a kind of language data process platform and system that can reduce client device cost and use complexity.

In order to achieve the above object, the utility model embodiment has proposed a kind of voice data processing system, comprises language data process platform and client; Said system comprises the sound identification module that is arranged at language data process platform or client;

Said language data process platform also comprises:

Be used for carrying out the local side communication module of communication with client;

Data processing module connects said local side communication module and handles with the text that the sound identification module that receives is identified;

The phonetic synthesis module is used for the result of said data processing module is generated personalized speech and sends to client;

Said client comprises:

Be used to connect the client communication module of said local side communication module;

Playing module connects said client communication module and plays with the voice that said language data process platform is sent.

The utility model embodiment has also proposed a kind of language data process platform, comprising:

The utility model embodiment has proposed a kind of language data process platform, system, can adopt the TTS technology to generate personalized speech, thus the user's experience sense that improves.

Description of drawings

In order to be illustrated more clearly in the utility model embodiment or technical scheme of the prior art; To do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art below; Obviously, the accompanying drawing in describing below only is some embodiment of the utility model, for those of ordinary skills; Under the prerequisite of not paying creative work property, can also obtain other accompanying drawing according to these accompanying drawings.

Fig. 1 is the structural representation of a kind of system of the utility model embodiment;

Fig. 2 is the structural representation of the another kind of system of the utility model embodiment;

Fig. 3 is the structural representation of the language data process platform of the utility model embodiment.

Embodiment

The accompanying drawing that will combine the utility model below carries out clear, intactly description to the technical scheme of the utility model, and obviously, described embodiment only is the utility model part embodiment, rather than whole embodiment.Based on the embodiment in the utility model, those of ordinary skills are not making the every other embodiment that is obtained under the creative work prerequisite, all belong to the scope of the utility model protection.

Phonetic synthesis (Text To Speech) is called for short the TTS technology, relates to a plurality of subject technologies such as acoustics, linguistics, Digital Signal Processing, multimedia technology, is a cutting edge technology in Chinese information processing field.Phonetic synthesis is exactly a process that text is converted into voice output; The work of this process mainly is that the text of importing is decomposed into phoneme by word or speech; And want the symbol of special processing to analyze to the numeral in the text, monetary unit, word deforming and punctuate etc., and phoneme is generated DAB come out with loudspeaker plays then or save as to play with multimedia software after the audio files.The application's inventive point is, utilizes later TTS technology to realize the personalized speech broadcast, can translate at the language data process platform simultaneously.

Embodiment 1

The utility model embodiment has proposed a kind of voice data processing system, and its structure is as shown in Figure 1, comprising: language data process platform 1 and client 2;

Said language data process platform 1 comprises: local side communication module 11, sound identification module 12, data processing module 13, TTS module 14 (being the phonetic synthesis module);

Said local side communication module 11 is used to connect client 2 to carry out communication with client 2.Said sound identification module 12 connects local side communication module 11 and data processing module 13, discern with the voice that said client 2 is sent, and the text after will discerning sends to said data processing module 13.Said data processing module 13 is used to receive the text that said sound identification module identifies, and said text is proceeded to handle.TTS module 14, connecting said data processing module 13 is the voice of personalization with the text-converted after will handling, and sends to client 2 through local side communication module 11.Wherein, client can insert this language data process platform 1 in advance, and the sound-type of oneself liking is set.Language data process platform 1 is confirmed the personalized speech and the storage of each user preferences according to the unique identification of this setting and this client.What hear when the user inserts at every turn like this all is the voice of oneself liking, and improves user's experience sense.

The applicant needs explanation at this, and above-mentioned each module is prior art, and the inventive point of the utility model is above-mentioned each module is concentrated in together a language data process platform and the system of being connected to form.

Said client 2 comprises: client communication module 21 and playing module 22.Client communication module 11 is used to connect local side communication module 21, and playing module 22 connects said client communication module 21 and plays with the voice that said language data process platform 1 is sent.

Wherein, said data processing module comprises machine translation unit and/or navigation elements.Said machine translation unit is used for text is carried out sending to the TTS module behind the multilingual translation; Said navigation elements is used for according to sending to the TTS module behind the text generation navigation information.

Client can directly send to the language data process platform with voice like this, carry out speech recognition by the language data process platform then after, handle accordingly.This processing can include but not limited to: carry out multilingual translation, navigate.Certainly, for better service is provided, this language data process platform can be provided with a plurality of various unit to accomplish different services.Multilingual translation, navigation be a concrete mode realizing of the utility model just, but not the qualification that the utility model is made.Simultaneously, convert voice through language data process platform 1 into through TTS module 14 after, can also the text of correspondence also be adopted mail/short message way send to client 2.

Further, said client comprises that the signal conveys module is to carry signal to instruct plant equipment to fix action to plant equipment.Wherein, client can be carried out signal conveys to the plant equipment that connects through wireless or bluetooth, with the fixedly action output (intelligent robotic toy) of instruction plant equipment.Language data process platform 1 is directly controlled plant equipment through client.

Embodiment 2

Another embodiment of the utility model has also proposed a kind of voice data processing system, and the difference of itself and first embodiment is that sound identification module is arranged on client.Its structure is as shown in Figure 2, comprising: language data process platform 1 and client 2;

Said language data process platform 1 comprises: local side communication module 11, data processing module 13, TTS module 14;

Said data processing module 13 connects local side communication module 11, proceeds to handle with the text that the sound identification module 23 with client 2 identifies.TTS module 14, connecting said data processing module 13 is the voice of personalization with the text-converted after will handling, and sends to client 2 through local side communication module 11.Wherein, client can insert this language data process platform 1 in advance, and the sound-type of oneself liking is set.Language data process platform 1 is confirmed the personalized speech and the storage of each user preferences according to the unique identification of this setting and this client.What hear when the user inserts at every turn like this all is the voice of oneself liking, and improves user's experience sense.

Said client 2 comprises: client communication module 21 and playing module 22, sound identification module 23.Client communication module 21 is used to connect local side communication module 11, and playing module 22 connects said client communication module 21 and plays with the voice that said language data process platform 1 is sent.Client 2 after at first discerning through sound identification module 23, sends to language data process platform 1 with the text after the identification through client communication module 21 after receiving user's voice.Language data process platform 1 carries out after the handled sending to client 2 through the mode of voice again.

Embodiment 3

The utility model the 3rd embodiment has proposed a kind of language data process platform, and its structure is as shown in Figure 3, comprising: local side communication module 11, data processing module 13, TTS module 14;

Wherein, said language data process platform 1 also comprises:

Sound identification module 12, said sound identification module 12 connect said local side communication module 11 and data processing module 13 respectively, discern with the voice that said client is sent, and the text after will discerning send to said data processing module 13.

The above; Be merely the embodiment of the utility model; But the protection range of the utility model is not limited thereto; Any technical staff who is familiar with the present technique field can expect changing or replacement in the technical scope that the utility model discloses easily, all should be encompassed within the protection range of the utility model.Therefore, the protection range of the utility model should be as the criterion by said protection range with claim.

Claims

1. a voice data processing system is characterized in that, comprising: language data process platform and client; Said system comprises the sound identification module that is arranged at language data process platform or client;

Said language data process platform also comprises:

Said client comprises:

2. voice data processing system according to claim 1 is characterized in that said data processing module comprises machine translation unit and/or navigation elements;

Said machine translation unit is used for text is carried out sending to the phonetic synthesis module behind the multilingual translation;

Said navigation elements is used for according to sending to the phonetic synthesis module behind the text generation navigation information.

3. voice data processing system according to claim 1 and 2; It is characterized in that; Said sound identification module is arranged at said client, and said sound identification module connects said client communication module and sends to said language data process platform with the voice after will discerning.

4. voice data processing system according to claim 1 and 2; It is characterized in that; Said sound identification module is arranged at said language data process platform; Said sound identification module connects said local side communication module and data processing module respectively, discern with the voice that said client is sent, and the text after will discerning sends to said data processing module.

5. voice data processing system according to claim 1 is characterized in that, said client comprises that the signal conveys module is to carry signal to instruct plant equipment to fix action to plant equipment.

6. a language data process platform is characterized in that, comprising:

The phonetic synthesis module is used for the result of said data processing module is generated personalized speech and sends to client.

7. language data process platform according to claim 6 is characterized in that said data processing module comprises machine translation unit and/or navigation elements;

8. according to claim 6 or 7 described language data process platforms, it is characterized in that said language data process platform also comprises:

Sound identification module, said sound identification module connect said local side communication module and data processing module respectively, discern with the voice that said client is sent, and the text after will discerning send to said data processing module.