CN202587038U - Voice data processing platform and system thereof - Google Patents

Voice data processing platform and system thereof Download PDF

Info

Publication number
CN202587038U
CN202587038U CN 201220151755 CN201220151755U CN202587038U CN 202587038 U CN202587038 U CN 202587038U CN 201220151755 CN201220151755 CN 201220151755 CN 201220151755 U CN201220151755 U CN 201220151755U CN 202587038 U CN202587038 U CN 202587038U
Authority
CN
China
Prior art keywords
module
client
data processing
voice
communication module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 201220151755
Other languages
Chinese (zh)
Inventor
沈嘉鑫
许军
邵颖
王钢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai car sound intelligent technology Co., Ltd.
Original Assignee
SHANGHAI CHEYIN NETWORK TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI CHEYIN NETWORK TECHNOLOGY Co Ltd filed Critical SHANGHAI CHEYIN NETWORK TECHNOLOGY Co Ltd
Priority to CN 201220151755 priority Critical patent/CN202587038U/en
Application granted granted Critical
Publication of CN202587038U publication Critical patent/CN202587038U/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Machine Translation (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The utility model provides a voice data processing platform and a system thereof. The system comprises the voice data processing platform and a client. The system comprises a voice recognition module arranged in the voice data processing platform or the client. The voice data processing platform comprise a local side communication module for communicating with the client, a data processing module which is connected with the local side terminal communication module to process a received text which is recognized by the voice recognition module, a voice synthesis module which is used for processing a processing result of the data processing module, generating personalized voice and sending the personalized voice to the client. The client comprises a client communication module for connecting the local side communication module and a playing module which is connected with the client communication module to play voice sent by the voice data processing platform. According to the utility model, TTS technology can be employed to generate the personalized voice, and a user experience is improved.

Description

Language data process platform and system
Technical field
The utility model relates to the mobile communication technology field, is meant a kind of language data process platform and system especially.
Background technology
Along with the development and the development of electronic technology of mobile communication technology, more and more client devices (for example portable terminal) have had data processing function and data communication facility.Existing language data process platform generally all comprises: be used for the local side communication module with the client device communication, the sound identification module that is used to carry out speech recognition.After can the voice that receive from client device being discerned like this, handle accordingly.This mode can greatly facilitate user's use.But for existing language data process platform, can only be through single voice to the client device broadcast information, this mode causes the user experience sense very poor.
The utility model content
To above-mentioned defective and the problem that existing client device exists, the purpose of the utility model embodiment is to propose a kind of language data process platform and system that can reduce client device cost and use complexity.
In order to achieve the above object, the utility model embodiment has proposed a kind of voice data processing system, comprises language data process platform and client; Said system comprises the sound identification module that is arranged at language data process platform or client;
Said language data process platform also comprises:
Be used for carrying out the local side communication module of communication with client;
Data processing module connects said local side communication module and handles with the text that the sound identification module that receives is identified;
The phonetic synthesis module is used for the result of said data processing module is generated personalized speech and sends to client;
Said client comprises:
Be used to connect the client communication module of said local side communication module;
Playing module connects said client communication module and plays with the voice that said language data process platform is sent.
The utility model embodiment has also proposed a kind of language data process platform, comprising:
Be used for carrying out the local side communication module of communication with client;
Data processing module connects said local side communication module and handles with the text that the sound identification module that receives is identified;
The phonetic synthesis module is used for the result of said data processing module is generated personalized speech and sends to client;
The utility model embodiment has proposed a kind of language data process platform, system, can adopt the TTS technology to generate personalized speech, thus the user's experience sense that improves.
Description of drawings
In order to be illustrated more clearly in the utility model embodiment or technical scheme of the prior art; To do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art below; Obviously, the accompanying drawing in describing below only is some embodiment of the utility model, for those of ordinary skills; Under the prerequisite of not paying creative work property, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the structural representation of a kind of system of the utility model embodiment;
Fig. 2 is the structural representation of the another kind of system of the utility model embodiment;
Fig. 3 is the structural representation of the language data process platform of the utility model embodiment.
Embodiment
The accompanying drawing that will combine the utility model below carries out clear, intactly description to the technical scheme of the utility model, and obviously, described embodiment only is the utility model part embodiment, rather than whole embodiment.Based on the embodiment in the utility model, those of ordinary skills are not making the every other embodiment that is obtained under the creative work prerequisite, all belong to the scope of the utility model protection.
Phonetic synthesis (Text To Speech) is called for short the TTS technology, relates to a plurality of subject technologies such as acoustics, linguistics, Digital Signal Processing, multimedia technology, is a cutting edge technology in Chinese information processing field.Phonetic synthesis is exactly a process that text is converted into voice output; The work of this process mainly is that the text of importing is decomposed into phoneme by word or speech; And want the symbol of special processing to analyze to the numeral in the text, monetary unit, word deforming and punctuate etc., and phoneme is generated DAB come out with loudspeaker plays then or save as to play with multimedia software after the audio files.The application's inventive point is, utilizes later TTS technology to realize the personalized speech broadcast, can translate at the language data process platform simultaneously.
Embodiment 1
The utility model embodiment has proposed a kind of voice data processing system, and its structure is as shown in Figure 1, comprising: language data process platform 1 and client 2;
Said language data process platform 1 comprises: local side communication module 11, sound identification module 12, data processing module 13, TTS module 14 (being the phonetic synthesis module);
Said local side communication module 11 is used to connect client 2 to carry out communication with client 2.Said sound identification module 12 connects local side communication module 11 and data processing module 13, discern with the voice that said client 2 is sent, and the text after will discerning sends to said data processing module 13.Said data processing module 13 is used to receive the text that said sound identification module identifies, and said text is proceeded to handle.TTS module 14, connecting said data processing module 13 is the voice of personalization with the text-converted after will handling, and sends to client 2 through local side communication module 11.Wherein, client can insert this language data process platform 1 in advance, and the sound-type of oneself liking is set.Language data process platform 1 is confirmed the personalized speech and the storage of each user preferences according to the unique identification of this setting and this client.What hear when the user inserts at every turn like this all is the voice of oneself liking, and improves user's experience sense.
The applicant needs explanation at this, and above-mentioned each module is prior art, and the inventive point of the utility model is above-mentioned each module is concentrated in together a language data process platform and the system of being connected to form.
Said client 2 comprises: client communication module 21 and playing module 22.Client communication module 11 is used to connect local side communication module 21, and playing module 22 connects said client communication module 21 and plays with the voice that said language data process platform 1 is sent.
Wherein, said data processing module comprises machine translation unit and/or navigation elements.Said machine translation unit is used for text is carried out sending to the TTS module behind the multilingual translation; Said navigation elements is used for according to sending to the TTS module behind the text generation navigation information.
Client can directly send to the language data process platform with voice like this, carry out speech recognition by the language data process platform then after, handle accordingly.This processing can include but not limited to: carry out multilingual translation, navigate.Certainly, for better service is provided, this language data process platform can be provided with a plurality of various unit to accomplish different services.Multilingual translation, navigation be a concrete mode realizing of the utility model just, but not the qualification that the utility model is made.Simultaneously, convert voice through language data process platform 1 into through TTS module 14 after, can also the text of correspondence also be adopted mail/short message way send to client 2.
Further, said client comprises that the signal conveys module is to carry signal to instruct plant equipment to fix action to plant equipment.Wherein, client can be carried out signal conveys to the plant equipment that connects through wireless or bluetooth, with the fixedly action output (intelligent robotic toy) of instruction plant equipment.Language data process platform 1 is directly controlled plant equipment through client.
Embodiment 2
Another embodiment of the utility model has also proposed a kind of voice data processing system, and the difference of itself and first embodiment is that sound identification module is arranged on client.Its structure is as shown in Figure 2, comprising: language data process platform 1 and client 2;
Said language data process platform 1 comprises: local side communication module 11, data processing module 13, TTS module 14;
Said data processing module 13 connects local side communication module 11, proceeds to handle with the text that the sound identification module 23 with client 2 identifies.TTS module 14, connecting said data processing module 13 is the voice of personalization with the text-converted after will handling, and sends to client 2 through local side communication module 11.Wherein, client can insert this language data process platform 1 in advance, and the sound-type of oneself liking is set.Language data process platform 1 is confirmed the personalized speech and the storage of each user preferences according to the unique identification of this setting and this client.What hear when the user inserts at every turn like this all is the voice of oneself liking, and improves user's experience sense.
Said client 2 comprises: client communication module 21 and playing module 22, sound identification module 23.Client communication module 21 is used to connect local side communication module 11, and playing module 22 connects said client communication module 21 and plays with the voice that said language data process platform 1 is sent.Client 2 after at first discerning through sound identification module 23, sends to language data process platform 1 with the text after the identification through client communication module 21 after receiving user's voice.Language data process platform 1 carries out after the handled sending to client 2 through the mode of voice again.
Wherein, said data processing module comprises machine translation unit and/or navigation elements.Said machine translation unit is used for text is carried out sending to the TTS module behind the multilingual translation; Said navigation elements is used for according to sending to the TTS module behind the text generation navigation information.
Further, said client comprises that the signal conveys module is to carry signal to instruct plant equipment to fix action to plant equipment.Wherein, client can be carried out signal conveys to the plant equipment that connects through wireless or bluetooth, with the fixedly action output (intelligent robotic toy) of instruction plant equipment.Language data process platform 1 is directly controlled plant equipment through client.
Embodiment 3
The utility model the 3rd embodiment has proposed a kind of language data process platform, and its structure is as shown in Figure 3, comprising: local side communication module 11, data processing module 13, TTS module 14;
Said data processing module 13 connects local side communication module 11, proceeds to handle with the text that the sound identification module 23 with client 2 identifies.TTS module 14, connecting said data processing module 13 is the voice of personalization with the text-converted after will handling, and sends to client 2 through local side communication module 11.Wherein, client can insert this language data process platform 1 in advance, and the sound-type of oneself liking is set.Language data process platform 1 is confirmed the personalized speech and the storage of each user preferences according to the unique identification of this setting and this client.What hear when the user inserts at every turn like this all is the voice of oneself liking, and improves user's experience sense.
Wherein, said language data process platform 1 also comprises:
Sound identification module 12, said sound identification module 12 connect said local side communication module 11 and data processing module 13 respectively, discern with the voice that said client is sent, and the text after will discerning send to said data processing module 13.
The above; Be merely the embodiment of the utility model; But the protection range of the utility model is not limited thereto; Any technical staff who is familiar with the present technique field can expect changing or replacement in the technical scope that the utility model discloses easily, all should be encompassed within the protection range of the utility model.Therefore, the protection range of the utility model should be as the criterion by said protection range with claim.

Claims (8)

1. a voice data processing system is characterized in that, comprising: language data process platform and client; Said system comprises the sound identification module that is arranged at language data process platform or client;
Said language data process platform also comprises:
Be used for carrying out the local side communication module of communication with client;
Data processing module connects said local side communication module and handles with the text that the sound identification module that receives is identified;
The phonetic synthesis module is used for the result of said data processing module is generated personalized speech and sends to client;
Said client comprises:
Be used to connect the client communication module of said local side communication module;
Playing module connects said client communication module and plays with the voice that said language data process platform is sent.
2. voice data processing system according to claim 1 is characterized in that said data processing module comprises machine translation unit and/or navigation elements;
Said machine translation unit is used for text is carried out sending to the phonetic synthesis module behind the multilingual translation;
Said navigation elements is used for according to sending to the phonetic synthesis module behind the text generation navigation information.
3. voice data processing system according to claim 1 and 2; It is characterized in that; Said sound identification module is arranged at said client, and said sound identification module connects said client communication module and sends to said language data process platform with the voice after will discerning.
4. voice data processing system according to claim 1 and 2; It is characterized in that; Said sound identification module is arranged at said language data process platform; Said sound identification module connects said local side communication module and data processing module respectively, discern with the voice that said client is sent, and the text after will discerning sends to said data processing module.
5. voice data processing system according to claim 1 is characterized in that, said client comprises that the signal conveys module is to carry signal to instruct plant equipment to fix action to plant equipment.
6. a language data process platform is characterized in that, comprising:
Be used for carrying out the local side communication module of communication with client;
Data processing module connects said local side communication module and handles with the text that the sound identification module that receives is identified;
The phonetic synthesis module is used for the result of said data processing module is generated personalized speech and sends to client.
7. language data process platform according to claim 6 is characterized in that said data processing module comprises machine translation unit and/or navigation elements;
Said machine translation unit is used for text is carried out sending to the phonetic synthesis module behind the multilingual translation;
Said navigation elements is used for according to sending to the phonetic synthesis module behind the text generation navigation information.
8. according to claim 6 or 7 described language data process platforms, it is characterized in that said language data process platform also comprises:
Sound identification module, said sound identification module connect said local side communication module and data processing module respectively, discern with the voice that said client is sent, and the text after will discerning send to said data processing module.
CN 201220151755 2012-04-11 2012-04-11 Voice data processing platform and system thereof Expired - Fee Related CN202587038U (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201220151755 CN202587038U (en) 2012-04-11 2012-04-11 Voice data processing platform and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201220151755 CN202587038U (en) 2012-04-11 2012-04-11 Voice data processing platform and system thereof

Publications (1)

Publication Number Publication Date
CN202587038U true CN202587038U (en) 2012-12-05

Family

ID=47256394

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201220151755 Expired - Fee Related CN202587038U (en) 2012-04-11 2012-04-11 Voice data processing platform and system thereof

Country Status (1)

Country Link
CN (1) CN202587038U (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106470199A (en) * 2015-08-21 2017-03-01 石家庄市善理通益科技有限公司 The processing method of speech data, device and intercom system
CN113160827A (en) * 2021-04-07 2021-07-23 深圳鱼亮科技有限公司 Voice transcription system and method based on multi-language model

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106470199A (en) * 2015-08-21 2017-03-01 石家庄市善理通益科技有限公司 The processing method of speech data, device and intercom system
CN113160827A (en) * 2021-04-07 2021-07-23 深圳鱼亮科技有限公司 Voice transcription system and method based on multi-language model

Similar Documents

Publication Publication Date Title
CN106409283B (en) Man-machine mixed interaction system and method based on audio
KR101703214B1 (en) Method for changing contents of character data into transmitter's voice and outputting the transmiter's voice
CN103561217A (en) Method and terminal for generating captions
CN102006373A (en) Vehicle-mounted service system and method based on voice command control
US20110270601A1 (en) Universal translator
CN103187079A (en) Vehicle-mounted information system
AU2001247708A1 (en) Web-based speech recognition with scripting and semantic objects
CN105117391A (en) Translating languages
CN104380373A (en) Systems and methods for name pronunciation
CN104078044A (en) Mobile terminal and sound recording search method and device of mobile terminal
CN104320533A (en) Conversion method and system for mobile equipment
US20100211389A1 (en) System of communication employing both voice and text
CN202216698U (en) Navigation voice and music voice switching system
CN101340676A (en) Method, apparatus and mobile terminal implementing simultaneous interpretation
CN106412032A (en) Remote audio character transmission method and system
CN103152480A (en) Method and device for arrival prompt by mobile terminal
CN106537497A (en) Information management system and information management method
CN103491406A (en) Android intelligent television system based on voice recognition
CN101846525B (en) Navigation information processing and acquiring methods and device
CN104078038A (en) Page content aloud-reading method and device
CN202587038U (en) Voice data processing platform and system thereof
CN105280206A (en) Audio playing method and device
CN102056093A (en) Method for converting text message into voice message
CN102571882A (en) Network-based voice reminding method and system
CN105957528A (en) Audio processing method and apparatus

Legal Events

Date Code Title Description
C14 Grant of patent or utility model
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: 200335 Shanghai city Changning District Admiralty Road No. 999 Building 1 floor 904-906 room 9

Patentee after: Shanghai car sound intelligent technology Co., Ltd.

Address before: 200233 Room 305, building 4, No. 396, Guilin road, Xuhui District, Shanghai

Patentee before: Shanghai Cheyin Network Technology Co., Ltd.

CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20121205

Termination date: 20210411