One carries out voice-operated system and method by mobile communication terminal to video play device
Technical field
The present invention relates to network communication field, more particularly, relate to one, by mobile communication terminal, voice-operated system and method is carried out to video play device.
Background technology
Along with the development of technology, by independent watching, function development becomes multi-functional multi-media information terminal to video play device, can carry out interaction with user.Existing video play device can be linked in internet, as the node of in internet, carries out receiving and dispatching mail, browsing page, participation game etc.
Video play device and user carry out the interactive instruction just needing to receive user, in prior art, mainly use the subsidiary traditional Infrared remote controller of video play device to carry out some to video play device simply to control, for a lot of complicated control form, such as Text Input control, phonetic entry control etc. are still helpless.
The function of mobile communication terminal is also becoming better and approaching perfection day by day, from initial transmitting-receiving note, receive phone, mail and browsing page, game interactive, GPS navigation etc. is received in watching video till now, participation network, but conveniently carry, its display screen is restricted, and the visual experience of user and the effect of viewing video are still not as traditional video play device, and traditional video play device is to a kind of sensation on the spot in person of user.In actual life, mobile communication terminal is staff one almost, and also very convenient for the input control of mobile communication terminal.
Therefore, relatively simple in order to solve traditional Infrared remote controller function, the input control to video play device complexity can not be met, need a kind of mobile communication terminal and video play device to be carried out mutual method and system, the control mode of the complexity such as such as phonetic entry is realized by mobile communication terminal, use video play device flexibly, play the maximum function of video play device.Although current user also can realize Text Input by the soft keyboard of mobile communication terminal, and then realize the control to video play device, be not as efficient as Voice command mode, convenient.
Meanwhile, mobile communication terminal can also substitute traditional Infrared remote controller, not only achieves the function of traditional Infrared remote controller, also has complicated input control function, is not only user-friendly to, and can economizes on resources.
Summary of the invention
The object of this invention is to provide a kind of system and method video play device controlled by mobile communication terminal, solve the problem of video play device being carried out to phonetic entry control.The control making user can realize video play device easily by the mobile communication terminal of oneself, replaces traditional Infrared remote controller.
The invention provides a kind of system video play device controlled by mobile communication terminal, described system comprises described mobile communication terminal, described video play device, speech recognition server, is characterized in that: described mobile communication terminal and described speech recognition server are interconnected by network; Described video play device and described mobile communication terminal are interconnected by network; Described speech recognition server is used for carrying out speech recognition to the voice messaging of input;
Wherein, described mobile communication terminal comprise phonetic incepting processing module and with the interconnective communication module of described phonetic incepting processing module, described communication module and described speech recognition server are interconnected by network, and described communication module and described video play device are interconnected by network; Described phonetic incepting processing module is for receiving described voice messaging and the speech pattern of user's input.
Preferably, described phonetic incepting processing module can receive the voice identification result of described speech recognition server to described voice messaging by described communication module, and carries out logical process to the result of the described speech recognition received.
Preferably, described speech pattern comprises text entry mode and Voice command pattern.
Preferably, when described speech pattern is described text entry mode, the voice identification result of described logical process shows by described video play device.
Preferably, when described speech pattern is described Voice command pattern, the result of the speech recognition of described logical process is converted into command information by described video play device, mate with the command information storehouse for starting application program, after the match is successful, start the corresponding application program of described command information.
The present invention also provides a kind of method controlled video play device by mobile communication terminal using said system, it is characterized in that, comprising:
(1) the described phonetic incepting processing module of described mobile communication terminal receives the described voice messaging of user's input and described speech pattern;
(2) the described voice messaging received and described speech pattern are sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to described speech recognition server by described network by described communication module, and described speech recognition server carries out described speech recognition to the described voice messaging received;
(3) recognition result is sent to described communication module by described network by described speech recognition server, the described recognition result received is sent to described phonetic incepting processing module by described communication module, and described phonetic incepting processing module carries out logical process to the described recognition result received;
(4) recognition result after described logical process is sent to described communication module by described phonetic incepting processing module, and by described communication module, the recognition result after described logical process and the described speech pattern received is sent to described video play device by described network;
(5) described video play device according to described in the speech pattern that receives, the recognition result after described logical process is processed accordingly.
Preferably, described recognition result is word, and described logical process comprises removes useless punctuation mark by described word.
Preferably, described speech pattern comprises text entry mode and Voice command pattern.
Preferably, when described speech pattern is described text entry mode, the recognition result of described logical process shows by described video play device.
Preferably, when described speech pattern is described Voice command pattern, the recognition result of described logical process is converted into command information by described video play device, mate with the command information storehouse for starting application program, after the match is successful, start the corresponding application program of described command information.
Compared with prior art, the invention has the advantages that the control mode being realized the complexity such as phonetic entry by mobile communication terminal, use video play device flexibly, play the maximum function of video play device.
Accompanying drawing explanation
Being convenient to make the present invention understand, describing specific embodiments of the invention by reference to the accompanying drawings now.
Fig. 1 is by the building-block of logic of mobile communication terminal to a preferred embodiment of the system that video play device controls according to of the present invention.
Fig. 2 is by the flow chart of mobile communication terminal to a preferred embodiment of the method that video play device controls according to of the present invention.
Embodiment
Below in conjunction with the drawings and specific embodiments, the present invention is described in further detail.
The object of this invention is to provide a kind of system and method video play device controlled by mobile communication terminal, solve the problem of video play device being carried out to phonetic entry control.
Fig. 1 is by the building-block of logic of mobile communication terminal to a preferred embodiment of the system that video play device controls according to of the present invention.Fig. 2 is by the flow chart of mobile communication terminal to a preferred embodiment of the method that video play device controls according to of the present invention.As shown in the figure, the phonetic incepting processing module of mobile communication terminal receives voice messaging and the speech pattern of user's input; The voice messaging received and speech pattern are sent to communication module by phonetic incepting processing module, and the described voice messaging received and described speech pattern are sent to speech recognition server by network by communication module; Speech recognition server carries out speech recognition to the voice messaging received, and speech recognition server is by the result after identification, and as word, send to communication module by network, the recognition result received is sent to phonetic incepting processing module by communication module; Phonetic incepting processing module carries out logical process to the recognition result received, and the recognition result after logical process is sent to communication module by phonetic incepting processing module; Recognition result after logical process and the speech pattern received are sent to video play device by network by the communication module in mobile communication terminal; Video play device, according to the speech pattern received, processes accordingly to the recognition result after logical process; Wherein, speech pattern comprises text entry mode and Voice command pattern.When speech pattern is text entry mode, the recognition result of logical process shows by video play device; When speech pattern is Voice command pattern, the recognition result of logical process is converted into command information by video play device, mates with the command information storehouse for starting application program, after the match is successful, and the corresponding application program of starting command information.Described logical process comprises removes useless punctuation mark by the identifiable design word received.
In order to realize object of the present invention, a preferred embodiment provided by the invention is as follows:
First, the phonetic incepting processing module of the mobile communication terminal of user receives from the voice messaging of user and the speech pattern of user.Wherein, described speech pattern comprises Voice command pattern and text entry mode.In this preferred embodiment, the speech pattern of described user is text entry mode.
Second, described voice messaging is sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to speech recognition server by network by described communication module, described speech recognition server carries out speech recognition to the described voice messaging received, such as, described voice messaging is converted to word.Such as, user is by described phonetic incepting processing module input voice " gantry Fei Jia ", described input voice " gantry Fei Jia " are sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to described speech recognition server by network by described communication module, after described speech recognition server carries out voice recognition processing, convert discernible word " gantry Fei Jia " to.
3rd, identifiable design word after process is sent to the described communication module of described user's mobile communication terminal by described speech recognition server, described communication module sends it to described phonetic incepting processing module, described phonetic incepting processing module receives recognition result, carries out logical process to the described recognition result received.Such as, the Word message received is removed useless punctuation mark, as ", " or ", " etc.
4th, described phonetic incepting processing module in described mobile communication terminal is by through the described recognition result of logical process and the speech pattern of described user, i.e. text entry mode, sends to described communication module, and they are transferred to video play device by network by described communication module.
5th, described video play device, according to the speech pattern of the recognition result after the described logical process received and described user, processes accordingly.The speech pattern of described user is text entry mode, described video play device by the described identifiable design text importing that receives in text box.Such as, the text entry information received " gantry Fei Jia " is shown in corresponding text box by described video play device.
Another preferred embodiment provided by the invention is as follows:
First, the phonetic incepting processing module of the mobile communication terminal of user receives from the voice messaging of user and the speech pattern of user.Wherein, described speech pattern comprises Voice command pattern and text entry mode.In this preferred embodiment, the speech pattern of described user is Voice command pattern.
Second, described voice messaging is sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to speech recognition server by network by described communication module, described speech recognition server carries out voice recognition processing to the described voice messaging received, such as, described voice messaging is converted to word.Such as, user " opens homepage " by described phonetic incepting processing module input voice, described input voice " are opened homepage " and are sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to described speech recognition server by network by described communication module, after described speech recognition server carries out voice recognition processing, convert discernible word to and " open homepage ".
3rd, identifiable design word after described speech recognition server connects process sends to the described communication module of described user's mobile communication terminal, described communication module sends it to described phonetic incepting processing module, described phonetic incepting processing module receives recognition result, carries out logical process to the described recognition result received.Such as, the Word message received is removed useless punctuation mark, as ", " or ", " etc.
4th, described phonetic incepting processing module in described mobile communication terminal is by through the described recognition result of logical process and the speech pattern of described user, i.e. Voice command pattern, sends to described communication module, and they are transferred to video play device by network by described communication module.
5th, described video play device, according to the speech pattern of the recognition result after the described logical process received and described user, processes accordingly.If the speech pattern of described user is Voice command pattern, so described video play device transfers the described identifiable design word received to command information, mating with the command information storehouse for starting application program, after the match is successful, starting the corresponding application program of described command information.Such as, the command information received " is opened homepage " with the command information storehouse for starting application program and is mated by described video play device, after the match is successful, starts " opening homepage " corresponding application program, namely starts corresponding homepage browse program.
The matching way of described command information storehouse and voice command can be the matching way that system default is arranged, and also can provide operation interface, for the matching relationship in User Defined voice command and command information storehouse.User is by the corresponding relation in the self-defined voice command of text entry mode and command information storehouse.
Foregoing detailed description illustrates the various embodiments of system and/or process by embodiment and/or schematic diagram.With regard to these schematic diagrames and/or comprise with regard to one or more function and/or operation, it will be understood by those skilled in the art that each function in these schematic diagrames or embodiment and/or operation can by various hardware, software, firmware or in fact its combination in any come to realize individually and/or jointly.
Should be appreciated that, method described herein can combined with hardware or software, or the combination both combining in due course realizes.Therefore, method of the present invention, can adopt program code in tangible mediums such as being included in such as floppy disk, CD-ROM, hard disk drive or any other machinable medium (namely, instruction) form, wherein, when program code performs on programmable computers, computing equipment generally includes processor, this processor readable storage medium (comprising volatile memory and/or memory element), at least one input equipment and at least one output equipment.One or more program can such as, and by using API, reusable control etc. realize or utilize the process described in conjunction with the present invention.Such program preferably realizes with high level procedural or Object-Oriented Programming Language, to communicate with computer system.But if needed, this program can realize by assembler language or machine language.In any case, language can be compiler language or interpretative code, and combines with hardware implementing.
It should be noted that, of the present inventionly comprise combination in any between each part mentioned above by the category of mobile communication terminal to the technical scheme of the system and method that video play device controls.
Although illustrate and describe the present invention with reference to its preferred embodiment particularly, those skilled in the art will appreciate that the various change that can make in form and details and do not depart from the scope of the present invention described in appended claims.More than be described in detail in conjunction with specific embodiments of the invention, but be not limitation of the present invention.Every according to technical spirit of the present invention to any simple modification made for any of the above embodiments, all still belong to the scope of technical solution of the present invention.