CN102664009B - System and method for implementing voice control over video playing device through mobile communication terminal - Google Patents

System and method for implementing voice control over video playing device through mobile communication terminal Download PDF

Info

Publication number
CN102664009B
CN102664009B CN201210136934.9A CN201210136934A CN102664009B CN 102664009 B CN102664009 B CN 102664009B CN 201210136934 A CN201210136934 A CN 201210136934A CN 102664009 B CN102664009 B CN 102664009B
Authority
CN
China
Prior art keywords
play device
video play
mobile communication
communication terminal
pattern
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210136934.9A
Other languages
Chinese (zh)
Other versions
CN102664009A (en
Inventor
于庭龙
柳润峰
杨福海
曾亮东
路曼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Leshi Zhixin Electronic Technology Tianjin Co Ltd
Original Assignee
Leshi Zhixin Electronic Technology Tianjin Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Leshi Zhixin Electronic Technology Tianjin Co Ltd filed Critical Leshi Zhixin Electronic Technology Tianjin Co Ltd
Priority to CN201210136934.9A priority Critical patent/CN102664009B/en
Publication of CN102664009A publication Critical patent/CN102664009A/en
Application granted granted Critical
Publication of CN102664009B publication Critical patent/CN102664009B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention aims at providing a system and a method for controlling a video playing device through a mobile communication terminal, which solves the problem for implementing voice input control over the video playing device. Due to the adoption of the system and the method, the video playing device can be easily controlled by users through their own mobile communication terminals, and a traditional infrared remote controller is substituted. Compared with the prior art, the system and the method have advantages that complicated control methods such as voice input can be realized through the mobile communication terminal, the video playing device can be flexibly utilized, and the function of the video playing device can be maximally played.

Description

One carries out voice-operated system and method by mobile communication terminal to video play device
Technical field
The present invention relates to network communication field, more particularly, relate to one, by mobile communication terminal, voice-operated system and method is carried out to video play device.
Background technology
Along with the development of technology, by independent watching, function development becomes multi-functional multi-media information terminal to video play device, can carry out interaction with user.Existing video play device can be linked in internet, as the node of in internet, carries out receiving and dispatching mail, browsing page, participation game etc.
Video play device and user carry out the interactive instruction just needing to receive user, in prior art, mainly use the subsidiary traditional Infrared remote controller of video play device to carry out some to video play device simply to control, for a lot of complicated control form, such as Text Input control, phonetic entry control etc. are still helpless.
The function of mobile communication terminal is also becoming better and approaching perfection day by day, from initial transmitting-receiving note, receive phone, mail and browsing page, game interactive, GPS navigation etc. is received in watching video till now, participation network, but conveniently carry, its display screen is restricted, and the visual experience of user and the effect of viewing video are still not as traditional video play device, and traditional video play device is to a kind of sensation on the spot in person of user.In actual life, mobile communication terminal is staff one almost, and also very convenient for the input control of mobile communication terminal.
Therefore, relatively simple in order to solve traditional Infrared remote controller function, the input control to video play device complexity can not be met, need a kind of mobile communication terminal and video play device to be carried out mutual method and system, the control mode of the complexity such as such as phonetic entry is realized by mobile communication terminal, use video play device flexibly, play the maximum function of video play device.Although current user also can realize Text Input by the soft keyboard of mobile communication terminal, and then realize the control to video play device, be not as efficient as Voice command mode, convenient.
Meanwhile, mobile communication terminal can also substitute traditional Infrared remote controller, not only achieves the function of traditional Infrared remote controller, also has complicated input control function, is not only user-friendly to, and can economizes on resources.
Summary of the invention
The object of this invention is to provide a kind of system and method video play device controlled by mobile communication terminal, solve the problem of video play device being carried out to phonetic entry control.The control making user can realize video play device easily by the mobile communication terminal of oneself, replaces traditional Infrared remote controller.
The invention provides a kind of system video play device controlled by mobile communication terminal, described system comprises described mobile communication terminal, described video play device, speech recognition server, is characterized in that: described mobile communication terminal and described speech recognition server are interconnected by network; Described video play device and described mobile communication terminal are interconnected by network; Described speech recognition server is used for carrying out speech recognition to the voice messaging of input;
Wherein, described mobile communication terminal comprise phonetic incepting processing module and with the interconnective communication module of described phonetic incepting processing module, described communication module and described speech recognition server are interconnected by network, and described communication module and described video play device are interconnected by network; Described phonetic incepting processing module is for receiving described voice messaging and the speech pattern of user's input.
Preferably, described phonetic incepting processing module can receive the voice identification result of described speech recognition server to described voice messaging by described communication module, and carries out logical process to the result of the described speech recognition received.
Preferably, described speech pattern comprises text entry mode and Voice command pattern.
Preferably, when described speech pattern is described text entry mode, the voice identification result of described logical process shows by described video play device.
Preferably, when described speech pattern is described Voice command pattern, the result of the speech recognition of described logical process is converted into command information by described video play device, mate with the command information storehouse for starting application program, after the match is successful, start the corresponding application program of described command information.
The present invention also provides a kind of method controlled video play device by mobile communication terminal using said system, it is characterized in that, comprising:
(1) the described phonetic incepting processing module of described mobile communication terminal receives the described voice messaging of user's input and described speech pattern;
(2) the described voice messaging received and described speech pattern are sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to described speech recognition server by described network by described communication module, and described speech recognition server carries out described speech recognition to the described voice messaging received;
(3) recognition result is sent to described communication module by described network by described speech recognition server, the described recognition result received is sent to described phonetic incepting processing module by described communication module, and described phonetic incepting processing module carries out logical process to the described recognition result received;
(4) recognition result after described logical process is sent to described communication module by described phonetic incepting processing module, and by described communication module, the recognition result after described logical process and the described speech pattern received is sent to described video play device by described network;
(5) described video play device according to described in the speech pattern that receives, the recognition result after described logical process is processed accordingly.
Preferably, described recognition result is word, and described logical process comprises removes useless punctuation mark by described word.
Preferably, described speech pattern comprises text entry mode and Voice command pattern.
Preferably, when described speech pattern is described text entry mode, the recognition result of described logical process shows by described video play device.
Preferably, when described speech pattern is described Voice command pattern, the recognition result of described logical process is converted into command information by described video play device, mate with the command information storehouse for starting application program, after the match is successful, start the corresponding application program of described command information.
Compared with prior art, the invention has the advantages that the control mode being realized the complexity such as phonetic entry by mobile communication terminal, use video play device flexibly, play the maximum function of video play device.
Accompanying drawing explanation
Being convenient to make the present invention understand, describing specific embodiments of the invention by reference to the accompanying drawings now.
Fig. 1 is by the building-block of logic of mobile communication terminal to a preferred embodiment of the system that video play device controls according to of the present invention.
Fig. 2 is by the flow chart of mobile communication terminal to a preferred embodiment of the method that video play device controls according to of the present invention.
Embodiment
Below in conjunction with the drawings and specific embodiments, the present invention is described in further detail.
The object of this invention is to provide a kind of system and method video play device controlled by mobile communication terminal, solve the problem of video play device being carried out to phonetic entry control.
Fig. 1 is by the building-block of logic of mobile communication terminal to a preferred embodiment of the system that video play device controls according to of the present invention.Fig. 2 is by the flow chart of mobile communication terminal to a preferred embodiment of the method that video play device controls according to of the present invention.As shown in the figure, the phonetic incepting processing module of mobile communication terminal receives voice messaging and the speech pattern of user's input; The voice messaging received and speech pattern are sent to communication module by phonetic incepting processing module, and the described voice messaging received and described speech pattern are sent to speech recognition server by network by communication module; Speech recognition server carries out speech recognition to the voice messaging received, and speech recognition server is by the result after identification, and as word, send to communication module by network, the recognition result received is sent to phonetic incepting processing module by communication module; Phonetic incepting processing module carries out logical process to the recognition result received, and the recognition result after logical process is sent to communication module by phonetic incepting processing module; Recognition result after logical process and the speech pattern received are sent to video play device by network by the communication module in mobile communication terminal; Video play device, according to the speech pattern received, processes accordingly to the recognition result after logical process; Wherein, speech pattern comprises text entry mode and Voice command pattern.When speech pattern is text entry mode, the recognition result of logical process shows by video play device; When speech pattern is Voice command pattern, the recognition result of logical process is converted into command information by video play device, mates with the command information storehouse for starting application program, after the match is successful, and the corresponding application program of starting command information.Described logical process comprises removes useless punctuation mark by the identifiable design word received.
In order to realize object of the present invention, a preferred embodiment provided by the invention is as follows:
First, the phonetic incepting processing module of the mobile communication terminal of user receives from the voice messaging of user and the speech pattern of user.Wherein, described speech pattern comprises Voice command pattern and text entry mode.In this preferred embodiment, the speech pattern of described user is text entry mode.
Second, described voice messaging is sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to speech recognition server by network by described communication module, described speech recognition server carries out speech recognition to the described voice messaging received, such as, described voice messaging is converted to word.Such as, user is by described phonetic incepting processing module input voice " gantry Fei Jia ", described input voice " gantry Fei Jia " are sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to described speech recognition server by network by described communication module, after described speech recognition server carries out voice recognition processing, convert discernible word " gantry Fei Jia " to.
3rd, identifiable design word after process is sent to the described communication module of described user's mobile communication terminal by described speech recognition server, described communication module sends it to described phonetic incepting processing module, described phonetic incepting processing module receives recognition result, carries out logical process to the described recognition result received.Such as, the Word message received is removed useless punctuation mark, as ", " or ", " etc.
4th, described phonetic incepting processing module in described mobile communication terminal is by through the described recognition result of logical process and the speech pattern of described user, i.e. text entry mode, sends to described communication module, and they are transferred to video play device by network by described communication module.
5th, described video play device, according to the speech pattern of the recognition result after the described logical process received and described user, processes accordingly.The speech pattern of described user is text entry mode, described video play device by the described identifiable design text importing that receives in text box.Such as, the text entry information received " gantry Fei Jia " is shown in corresponding text box by described video play device.
Another preferred embodiment provided by the invention is as follows:
First, the phonetic incepting processing module of the mobile communication terminal of user receives from the voice messaging of user and the speech pattern of user.Wherein, described speech pattern comprises Voice command pattern and text entry mode.In this preferred embodiment, the speech pattern of described user is Voice command pattern.
Second, described voice messaging is sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to speech recognition server by network by described communication module, described speech recognition server carries out voice recognition processing to the described voice messaging received, such as, described voice messaging is converted to word.Such as, user " opens homepage " by described phonetic incepting processing module input voice, described input voice " are opened homepage " and are sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to described speech recognition server by network by described communication module, after described speech recognition server carries out voice recognition processing, convert discernible word to and " open homepage ".
3rd, identifiable design word after described speech recognition server connects process sends to the described communication module of described user's mobile communication terminal, described communication module sends it to described phonetic incepting processing module, described phonetic incepting processing module receives recognition result, carries out logical process to the described recognition result received.Such as, the Word message received is removed useless punctuation mark, as ", " or ", " etc.
4th, described phonetic incepting processing module in described mobile communication terminal is by through the described recognition result of logical process and the speech pattern of described user, i.e. Voice command pattern, sends to described communication module, and they are transferred to video play device by network by described communication module.
5th, described video play device, according to the speech pattern of the recognition result after the described logical process received and described user, processes accordingly.If the speech pattern of described user is Voice command pattern, so described video play device transfers the described identifiable design word received to command information, mating with the command information storehouse for starting application program, after the match is successful, starting the corresponding application program of described command information.Such as, the command information received " is opened homepage " with the command information storehouse for starting application program and is mated by described video play device, after the match is successful, starts " opening homepage " corresponding application program, namely starts corresponding homepage browse program.
The matching way of described command information storehouse and voice command can be the matching way that system default is arranged, and also can provide operation interface, for the matching relationship in User Defined voice command and command information storehouse.User is by the corresponding relation in the self-defined voice command of text entry mode and command information storehouse.
Foregoing detailed description illustrates the various embodiments of system and/or process by embodiment and/or schematic diagram.With regard to these schematic diagrames and/or comprise with regard to one or more function and/or operation, it will be understood by those skilled in the art that each function in these schematic diagrames or embodiment and/or operation can by various hardware, software, firmware or in fact its combination in any come to realize individually and/or jointly.
Should be appreciated that, method described herein can combined with hardware or software, or the combination both combining in due course realizes.Therefore, method of the present invention, can adopt program code in tangible mediums such as being included in such as floppy disk, CD-ROM, hard disk drive or any other machinable medium (namely, instruction) form, wherein, when program code performs on programmable computers, computing equipment generally includes processor, this processor readable storage medium (comprising volatile memory and/or memory element), at least one input equipment and at least one output equipment.One or more program can such as, and by using API, reusable control etc. realize or utilize the process described in conjunction with the present invention.Such program preferably realizes with high level procedural or Object-Oriented Programming Language, to communicate with computer system.But if needed, this program can realize by assembler language or machine language.In any case, language can be compiler language or interpretative code, and combines with hardware implementing.
It should be noted that, of the present inventionly comprise combination in any between each part mentioned above by the category of mobile communication terminal to the technical scheme of the system and method that video play device controls.
Although illustrate and describe the present invention with reference to its preferred embodiment particularly, those skilled in the art will appreciate that the various change that can make in form and details and do not depart from the scope of the present invention described in appended claims.More than be described in detail in conjunction with specific embodiments of the invention, but be not limitation of the present invention.Every according to technical spirit of the present invention to any simple modification made for any of the above embodiments, all still belong to the scope of technical solution of the present invention.

Claims (6)

1. by the system that mobile communication terminal controls video play device, described system comprises described mobile communication terminal, described video play device, and speech recognition server, is characterized in that,
Described mobile communication terminal and described speech recognition server are interconnected by network;
Described video play device and described mobile communication terminal are interconnected by network;
Described speech recognition server is used for carrying out speech recognition to the voice messaging of input;
Wherein, described mobile communication terminal comprise phonetic incepting processing module and with the interconnective communication module of described phonetic incepting processing module, described communication module and described speech recognition server are interconnected by network, and described communication module and described video play device are interconnected by network;
Described phonetic incepting processing module is for receiving described voice messaging and the speech pattern of user's input;
Described phonetic incepting processing module can receive the voice identification result of described speech recognition server to described voice messaging by described communication module, and carries out logical process to the result of the described speech recognition received;
Described speech pattern comprises text entry mode and Voice command pattern;
When described speech pattern is described text entry mode, the voice identification result of described logical process shows by described video play device;
When described speech pattern is described Voice command pattern, the result of the speech recognition of described logical process is converted into command information by described video play device, mating with the command information storehouse for starting application program, after the match is successful, starting the corresponding application program of described command information.
2. use method video play device controlled by mobile communication terminal of the system described in claim 1, it is characterized in that, comprising:
(1) the described phonetic incepting processing module of described mobile communication terminal receives the described voice messaging of user's input and described speech pattern;
(2) the described voice messaging received and described speech pattern are sent to described communication module by described phonetic incepting processing module, the described voice messaging received and described speech pattern are sent to described speech recognition server by described network by described communication module, and described speech recognition server carries out described speech recognition to the described voice messaging received;
(3) recognition result is sent to described communication module by described network by described speech recognition server, the described recognition result received is sent to described phonetic incepting processing module by described communication module, and described phonetic incepting processing module carries out logical process to the described recognition result received;
(4) recognition result after described logical process is sent to described communication module by described phonetic incepting processing module, and by described communication module, the recognition result after described logical process and the described speech pattern received is sent to described video play device by described network;
(5) described video play device according to described in the speech pattern that receives, the recognition result after described logical process is processed accordingly.
3. the method controlled video play device by mobile communication terminal as claimed in claim 2, it is characterized in that, described recognition result is word, and described logical process comprises removes useless punctuation mark by described word.
4. the method controlled video play device by mobile communication terminal as claimed in claim 2, it is characterized in that, described speech pattern comprises text entry mode and Voice command pattern.
5. the method by mobile communication terminal, video play device controlled as claimed in claim 4, it is characterized in that, when described speech pattern is described text entry mode, the recognition result of described logical process shows by described video play device.
6. the method by mobile communication terminal, video play device controlled as claimed in claim 4, it is characterized in that, when described speech pattern is described Voice command pattern, the recognition result of described logical process is converted into command information by described video play device, mate with the command information storehouse for starting application program, after the match is successful, start the corresponding application program of described command information.
CN201210136934.9A 2012-05-07 2012-05-07 System and method for implementing voice control over video playing device through mobile communication terminal Expired - Fee Related CN102664009B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210136934.9A CN102664009B (en) 2012-05-07 2012-05-07 System and method for implementing voice control over video playing device through mobile communication terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210136934.9A CN102664009B (en) 2012-05-07 2012-05-07 System and method for implementing voice control over video playing device through mobile communication terminal

Publications (2)

Publication Number Publication Date
CN102664009A CN102664009A (en) 2012-09-12
CN102664009B true CN102664009B (en) 2015-01-14

Family

ID=46773475

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210136934.9A Expired - Fee Related CN102664009B (en) 2012-05-07 2012-05-07 System and method for implementing voice control over video playing device through mobile communication terminal

Country Status (1)

Country Link
CN (1) CN102664009B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102905185B (en) * 2012-10-26 2016-02-10 四川长虹电器股份有限公司 The method of full voice control HTML5 video playback
CN104122979A (en) * 2013-04-25 2014-10-29 深圳市快播科技有限公司 Method and device for control over large screen through voice
CN104461597A (en) * 2013-09-24 2015-03-25 腾讯科技(深圳)有限公司 Starting control method and device for application program
CN104811777A (en) * 2014-01-23 2015-07-29 阿里巴巴集团控股有限公司 Smart television voice processing method, smart television voice processing system and smart television
CN104935615B (en) * 2014-03-19 2019-12-03 重庆深蜀科技有限公司 Realize the system and method for voice control household appliance
CN104036779B (en) * 2014-06-24 2017-12-26 湖南大学 A kind of wireless speech control method and system for mobile platform
CN105869623A (en) * 2015-12-07 2016-08-17 乐视网信息技术(北京)股份有限公司 Video playing method and device based on speech recognition
US10409550B2 (en) * 2016-03-04 2019-09-10 Ricoh Company, Ltd. Voice control of interactive whiteboard appliances
CN110265033A (en) * 2019-06-21 2019-09-20 四川长虹电器股份有限公司 The system and method for expansion equipment voice interactive function

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101937693A (en) * 2010-08-17 2011-01-05 深圳市子栋科技有限公司 Video and audio playing method and system based on voice command
CN102158664A (en) * 2011-03-31 2011-08-17 四川长虹电器股份有限公司 Method for performing voice control on television by utilizing mobile terminal
CN202168152U (en) * 2011-07-21 2012-03-14 德信互动科技(北京)有限公司 Television control system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8207936B2 (en) * 2006-06-30 2012-06-26 Sony Ericsson Mobile Communications Ab Voice remote control

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101937693A (en) * 2010-08-17 2011-01-05 深圳市子栋科技有限公司 Video and audio playing method and system based on voice command
CN102158664A (en) * 2011-03-31 2011-08-17 四川长虹电器股份有限公司 Method for performing voice control on television by utilizing mobile terminal
CN202168152U (en) * 2011-07-21 2012-03-14 德信互动科技(北京)有限公司 Television control system

Also Published As

Publication number Publication date
CN102664009A (en) 2012-09-12

Similar Documents

Publication Publication Date Title
CN102664009B (en) System and method for implementing voice control over video playing device through mobile communication terminal
US10194015B2 (en) Systems and methods for facilitating conversations
CN107632706B (en) Application data processing method and system of multi-modal virtual human
US11336594B2 (en) Information processing system and information processing method
CN110400251A (en) Method for processing video frequency, device, terminal device and storage medium
CN109104586B (en) Special effect adding method and device, video call equipment and storage medium
CN103544290A (en) Method and system for displaying individualized recommendation pages through fingerprint identification
CN109448709A (en) A kind of terminal throws the control method and terminal of screen
CN101809651A (en) The mobile wireless display of the incarnation of speech to speech translation and simulating human attribute is provided
CN107808191A (en) The output intent and system of the multi-modal interaction of visual human
US20200081975A1 (en) System and method for dynamic trend clustering
CN104461446B (en) Software running method and system based on interactive voice
CN109885277A (en) Human-computer interaction device, mthods, systems and devices
WO2017091411A1 (en) Synchronizing a server-side keyboard layout with a client-side keyboard layout in a virtual session
CN106919559A (en) Machine translation method and machine translation system
CN109656655A (en) It is a kind of for executing the method, equipment and storage medium of interactive instruction
CN106572131B (en) The method and system that media data is shared in Internet of Things
CN105721904B (en) The method of the content output of display device and control display device
CN106250007B (en) A kind of system and method realizing branching selection and playing
CN105893735B (en) Medical information remote co-screen assistance method and terminal
WO2015023138A1 (en) System and method for providing speech recognition-based messaging interpretation service
CN106293572A (en) Online information multi-screen sharing method, device and system
CN103294193A (en) Multi-terminal interaction method, device and system
CN103873557A (en) Information issuing system with human-computer interaction function and realization method thereof
CN108833256A (en) A kind of instant communication method and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: LESHI ZHIXIN ELECTRONIC TECHNOLOGY (TIANJIN) CO.,

Free format text: FORMER OWNER: LETV INFORMATION TECHNOLOGY (BEIJING) CO., LTD.

Effective date: 20130507

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100026 CHAOYANG, BEIJING TO: 300467 TANGGU, TIANJIN

TA01 Transfer of patent application right

Effective date of registration: 20130507

Address after: 300467, Tianjin District, Tianjin City, Tanggu animation road 126 No. 201-427 animation building B1 district two

Applicant after: LESHI ZHIXIN ELECTRONIC SCIENCE & TECHNOLOGY (TIANJIN) CO., LTD.

Address before: 100026 Beijing City Guanghua Road Chaoyang District Oriental Media Center No. 4 C block 8 layer

Applicant before: LeTV Information Technology (Beijing) Co., Ltd.

C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20150114

Termination date: 20170507

CF01 Termination of patent right due to non-payment of annual fee