A system and method for voice control of a video playback device through a mobile communication terminal
Technical field
The present invention relates to the field of network communication and, more particularly, to a system and method for voice control of a video playback device through a mobile communication terminal.
Background art
With the development of technology, the video playback device has evolved from a device with the single function of viewing into a multi-functional multimedia information terminal that can interact with the user. Existing video playback devices can connect to the Internet and, as a node in the Internet, send and receive e-mail, browse web pages, play games, and so on.
To interact with the user, a video playback device needs to receive the user's instructions. In the prior art, the conventional infrared (IR) remote controller supplied with the video playback device is mainly used to perform some simple control of the device; it cannot handle many complex forms of control, such as text input control and voice input control.
The functions of mobile communication terminals are also improving day by day, from the initial sending and receiving of short messages and making and receiving of phone calls to today's video viewing, e-mail, web browsing, interactive gaming, GPS navigation, and so on. However, for ease of carrying, the display screen of a mobile communication terminal is limited in size, so the user's visual experience and the effect of watching video are still inferior to those of a conventional video playback device, which gives the user an immersive feeling. In everyday life, almost everyone has a mobile communication terminal, and input control on a mobile communication terminal is also very convenient.
Therefore, because the conventional IR remote controller is relatively simple in function and cannot satisfy the complex input control of a video playback device, a method and system for interaction between a mobile communication terminal and a video playback device are needed, so that complex control modes such as voice input can be realized through the mobile communication terminal, the video playback device can be used flexibly, and its functions can be exploited to the fullest. Although the user can at present also enter text through the soft keyboard of the mobile communication terminal and thereby control the video playback device, this is less efficient and less convenient than voice control.
At the same time, the mobile communication terminal can replace the conventional IR remote controller: it not only provides the functions of the conventional IR remote controller but also offers complex input control, which is both convenient for the user and saves resources.
Summary of the invention
The object of the present invention is to provide a system and method for controlling a video playback device through a mobile communication terminal, solving the problem of voice input control of the video playback device, so that the user can conveniently control the video playback device through his or her own mobile communication terminal in place of the conventional IR remote controller.
The present invention provides a system for controlling a video playback device through a mobile communication terminal. The system comprises the mobile communication terminal, the video playback device, and a speech recognition server, and is characterized in that: the mobile communication terminal and the speech recognition server are connected to each other through a network; the video playback device and the mobile communication terminal are connected to each other through a network; and the speech recognition server is used to perform speech recognition on input voice information;
wherein the mobile communication terminal comprises a voice receiving and processing module and a communication module connected to the voice receiving and processing module; the communication module and the speech recognition server are connected through a network, and the communication module and the video playback device are connected through a network; and the voice receiving and processing module is used to receive the voice information and the voice mode input by the user.
Preferably, the voice receiving and processing module receives, through the communication module, the speech recognition result produced by the speech recognition server for the voice information, and performs logical processing on the received speech recognition result.
Preferably, the voice mode comprises a text input mode and a voice control mode.
Preferably, when the voice mode is the text input mode, the video playback device displays the logically processed speech recognition result.
Preferably, when the voice mode is the voice control mode, the video playback device converts the logically processed speech recognition result into command information and matches it against a command information library used for starting application programs; after a successful match, the application program corresponding to the command information is started.
The present invention also provides a method of controlling a video playback device through a mobile communication terminal using the above system, characterized in that it comprises:
(1) the voice receiving and processing module of the mobile communication terminal receives the voice input information and the voice mode input by the user;
(2) the voice receiving and processing module sends the received voice input information and voice mode to the communication module; the communication module sends the voice input information to the speech recognition server through the network, and the speech recognition server performs speech recognition on the received voice input information;
(3) the speech recognition server sends the recognition result to the communication module through the network; the communication module sends the received recognition result to the voice receiving and processing module, and the voice receiving and processing module performs logical processing on the received recognition result;
(4) the voice receiving and processing module sends the logically processed recognition result to the communication module, and the communication module sends the logically processed recognition result and the received voice mode to the video playback device through the network;
(5) the video playback device processes the logically processed recognition result according to the received voice mode.
Preferably, the recognition result is text, and the logical processing comprises removing useless punctuation marks from the text.
Preferably, the voice mode comprises a text input mode and a voice control mode.
Preferably, when the voice mode is the text input mode, the video playback device displays the logically processed recognition result.
Preferably, when the voice mode is the voice control mode, the video playback device converts the logically processed recognition result into command information and matches it against a command information library used for starting application programs; after a successful match, the application program corresponding to the command information is started.
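The terminal-side part of the method, steps (1) to (4), can be illustrated with a minimal Python sketch. It is not the implementation of the invention: the names VoiceMode, ControlMessage and run_terminal_flow are hypothetical, and the speech recognition server, the logical processing and the network link to the video playback device are passed in as plain callables, on the assumption that they are provided elsewhere.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable


class VoiceMode(Enum):
    """The two voice modes named in the text (identifiers are hypothetical)."""
    TEXT_INPUT = "text_input"
    VOICE_CONTROL = "voice_control"


@dataclass
class ControlMessage:
    """What the terminal sends to the video playback device in step (4)."""
    text: str
    mode: VoiceMode


def run_terminal_flow(
    audio: bytes,
    mode: VoiceMode,
    recognize: Callable[[bytes], str],
    process: Callable[[str], str],
    send_to_device: Callable[[ControlMessage], None],
) -> ControlMessage:
    """Run steps (1)-(4) for one utterance already captured on the terminal."""
    # Steps (2)-(3): round trip to the speech recognition server (assumed callable).
    recognized_text = recognize(audio)
    # Step (3): logical processing of the recognition result on the terminal.
    processed_text = process(recognized_text)
    # Step (4): send the processed result together with the voice mode.
    message = ControlMessage(text=processed_text, mode=mode)
    send_to_device(message)
    return message
```

Passing the recognizer and the transport as parameters keeps the sketch independent of any particular speech service or network protocol, neither of which the text specifies.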
Compared with the prior art, the advantage of the present invention is that complex control modes such as voice input are realized through the mobile communication terminal, so that the video playback device can be used flexibly and its functions can be exploited to the fullest.
Description of the drawings
To make the present invention easier to understand, specific embodiments of the present invention are now described with reference to the accompanying drawings.
Fig. 1 is a logical structure diagram of a preferred embodiment of the system for controlling a video playback device through a mobile communication terminal according to the present invention.
Fig. 2 is a flowchart of a preferred embodiment of the method for controlling a video playback device through a mobile communication terminal according to the present invention.
Embodiments
The present invention is described in further detail below with reference to the accompanying drawings and embodiments.
The object of the present invention is to provide a system and method for controlling a video playback device through a mobile communication terminal, solving the problem of voice input control of the video playback device.
Fig. 1 is a logical structure diagram of a preferred embodiment of the system for controlling a video playback device through a mobile communication terminal according to the present invention. Fig. 2 is a flowchart of a preferred embodiment of the method for controlling a video playback device through a mobile communication terminal according to the present invention.
As shown in the figures, the voice receiving and processing module of the mobile communication terminal receives the voice input information and the voice mode input by the user. The voice receiving and processing module sends the received voice input information and voice mode to the communication module, and the communication module sends the voice input information to the speech recognition server through the network. The speech recognition server performs speech recognition on the received voice input information and sends the recognition result, for example text, to the communication module through the network; the communication module sends the received recognition result to the voice receiving and processing module.
The voice receiving and processing module performs logical processing on the received recognition result and sends the logically processed recognition result to the communication module. The communication module in the mobile communication terminal sends the logically processed recognition result and the received voice mode to the video playback device through the network, and the video playback device processes the logically processed recognition result according to the received voice mode.
The voice mode comprises a text input mode and a voice control mode. When the voice mode is the text input mode, the video playback device displays the logically processed recognition result. When the voice mode is the voice control mode, the video playback device converts the logically processed recognition result into command information and matches it against the command information library used for starting application programs; after a successful match, the application program corresponding to the command information is started. The logical processing comprises removing useless punctuation marks from the received recognized text.
To achieve the object of the present invention, one preferred embodiment provided by the present invention is as follows:
First, the voice receiving and processing module of the user's mobile communication terminal receives the voice input information and the voice mode from the user. The voice mode comprises a voice control mode and a text input mode. In this preferred embodiment, the user's voice mode is the text input mode.
Second, the voice receiving and processing module sends the voice input information to the communication module, the communication module sends it to the speech recognition server through the network, and the speech recognition server performs speech recognition on the received voice input information, for example by converting the voice input information into text. For example, the user inputs the voice "gantry Fei Jia" through the voice receiving and processing module; the voice receiving and processing module sends the input voice "gantry Fei Jia" to the communication module; the communication module sends it to the speech recognition server through the network; and after speech recognition processing, the speech recognition server converts it into the recognizable text "gantry Fei Jia".
Third, the speech recognition server sends the processed recognizable text to the communication module of the user's mobile communication terminal, and the communication module sends it to the voice receiving and processing module. The voice receiving and processing module receives the recognition result and performs logical processing on it. For example, useless punctuation marks such as "," are removed from the received text.
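The logical processing in the third step can be sketched as follows. This is a minimal illustration under one assumption that the text does not state: every character in a Unicode punctuation category is treated as "useless". The function name remove_punctuation is hypothetical.

```python
import unicodedata


def remove_punctuation(text: str) -> str:
    """Drop punctuation from recognized text and tidy the whitespace.

    Assumption: any character whose Unicode category starts with "P"
    counts as useless punctuation; a real system might keep some marks.
    """
    kept = "".join(
        ch for ch in text if not unicodedata.category(ch).startswith("P")
    )
    # Collapse runs of whitespace left behind by removed characters.
    return " ".join(kept.split())
```

Using Unicode categories rather than a fixed ASCII list means full-width punctuation returned by a recognizer for Chinese speech is removed as well.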
Fourth, the voice receiving and processing module in the mobile communication terminal sends the logically processed recognition result and the user's voice mode, namely the text input mode, to the communication module, and the communication module transmits them to the video playback device through the network.
Fifth, the video playback device processes the received logically processed recognition result according to the user's voice mode. Because the user's voice mode is the text input mode, the video playback device displays the received recognizable text in a text box. For example, the video playback device displays the received text input "gantry Fei Jia" in the corresponding text box.
Another preferred embodiment provided by the present invention is as follows:
First, the voice receiving and processing module of the user's mobile communication terminal receives the voice input information and the voice mode from the user. The voice mode comprises a voice control mode and a text input mode. In this preferred embodiment, the user's voice mode is the voice control mode.
Second, the voice receiving and processing module sends the voice input information to the communication module, the communication module sends it to the speech recognition server through the network, and the speech recognition server performs speech recognition on the received voice input information, for example by converting the voice input information into text. For example, the user inputs the voice "open homepage" through the voice receiving and processing module; the voice receiving and processing module sends the input voice "open homepage" to the communication module; the communication module sends it to the speech recognition server through the network; and after speech recognition processing, the speech recognition server converts it into the recognizable text "open homepage".
Third, the speech recognition server sends the processed recognizable text to the communication module of the user's mobile communication terminal, and the communication module sends it to the voice receiving and processing module. The voice receiving and processing module receives the recognition result and performs logical processing on it. For example, useless punctuation marks such as "," are removed from the received text.
Fourth, the voice receiving and processing module in the mobile communication terminal sends the logically processed recognition result and the user's voice mode, namely the voice control mode, to the communication module, and the communication module transmits them to the video playback device through the network.
Fifth, the video playback device processes the received logically processed recognition result according to the user's voice mode. Because the user's voice mode is the voice control mode, the video playback device converts the received recognizable text into command information and matches it against the command information library used for starting application programs; after a successful match, the application program corresponding to the command information is started. For example, the video playback device matches the received command information "open homepage" against the command information library used for starting application programs and, after a successful match, starts the application program corresponding to "open homepage", that is, the corresponding homepage browsing program.
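The device-side handling in the fifth step of both embodiments can be sketched as a dispatch on the voice mode. The class VideoPlaybackDevice, the mode strings, and the returned status strings are hypothetical; the text does not say what happens when no command matches, so returning a "no match" status is an assumption, as is matching case-insensitively.

```python
from typing import Callable, Dict, List


class VideoPlaybackDevice:
    """Hypothetical device-side handler for step five."""

    def __init__(self, command_library: Dict[str, Callable[[], str]]) -> None:
        # Maps command information to a launcher for the matching application.
        self.command_library = command_library
        # Stands in for the on-screen text box used in text input mode.
        self.text_box: List[str] = []

    def handle(self, text: str, mode: str) -> str:
        if mode == "text_input":
            # Text input mode: show the recognized text in the text box.
            self.text_box.append(text)
            return f"displayed: {text}"
        if mode == "voice_control":
            # Voice control mode: convert the text to command information
            # and look it up in the command information library.
            command = text.strip().lower()
            launcher = self.command_library.get(command)
            if launcher is None:
                return f"no match: {command}"  # assumed behavior
            return launcher()
        raise ValueError(f"unknown voice mode: {mode}")
```

For example, a device built with the library {"open homepage": start_browser} would start the homepage browsing program when it receives "open homepage" in voice control mode, and would only display the same words when it receives them in text input mode.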
The matching between the command information library and voice commands may use the system's default settings, or an operation interface may be provided so that the user can customize the matching relationship between voice commands and the command information library. The user can define the correspondence between voice commands and the command information library through the text input mode.
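One way to combine system defaults with user-defined matches is to layer a user table over a default table, as in the sketch below. The names CommandLibrary, DEFAULT_COMMANDS, define and match, and the application identifiers, are hypothetical, and letting a user entry take precedence over a default with the same phrase is an assumption.

```python
from collections import ChainMap
from typing import Dict, Optional

# System default matches (illustrative entries only).
DEFAULT_COMMANDS: Dict[str, str] = {"open homepage": "homepage_browser"}


class CommandLibrary:
    """Default command matches overlaid with user-defined ones."""

    def __init__(self) -> None:
        self._custom: Dict[str, str] = {}
        # ChainMap searches the user table first, then the defaults.
        self._lookup = ChainMap(self._custom, DEFAULT_COMMANDS)

    def define(self, phrase: str, app_id: str) -> None:
        """Record a user-defined match, e.g. one typed in text input mode."""
        self._custom[phrase.strip().lower()] = app_id

    def match(self, phrase: str) -> Optional[str]:
        """Return the application for a voice command, or None if unmatched."""
        return self._lookup.get(phrase.strip().lower())
```

Keeping the user table separate leaves the defaults untouched, so a customization could be removed without losing the original match.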
The foregoing detailed description has set forth various embodiments of the system and/or process by way of embodiments and/or schematic diagrams. Insofar as these schematic diagrams and/or embodiments contain one or more functions and/or operations, those skilled in the art will understand that each function and/or operation in these schematic diagrams or embodiments can be implemented, individually and/or collectively, by a wide range of hardware, software, firmware, or virtually any combination thereof.
It should be understood that the methods described herein may be implemented in hardware or software or, where appropriate, in a combination of both. Thus, the method of the present invention may take the form of program code (that is, instructions) embodied in tangible media such as floppy disks, CD-ROMs, hard disk drives, or any other machine-readable storage medium. Where the program code is executed on a programmable computer, the computing device generally includes a processor, a storage medium readable by the processor (including volatile memory and/or storage elements), at least one input device, and at least one output device. One or more programs may implement or utilize the processes described in connection with the present invention, for example through the use of an API, reusable controls, or the like. Such programs are preferably implemented in a high-level procedural or object-oriented programming language to communicate with a computer system. However, the programs can be implemented in assembly or machine language if desired. In any case, the language may be a compiled or interpreted language, and may be combined with hardware implementations.
It should be noted that the scope of the technical solution of the system and method for controlling a video playback device through a mobile communication terminal of the present invention includes any combination of the parts described above.
Although the present invention has been particularly shown and described with reference to its preferred embodiments, those skilled in the art will understand that various changes in form and detail may be made without departing from the scope of the present invention as set forth in the appended claims. The foregoing has been described in detail in connection with specific embodiments of the present invention, but it does not limit the present invention. Any simple modification made to the above embodiments in accordance with the technical spirit of the present invention still falls within the scope of the technical solution of the present invention.