Embodiment
For making the purpose, technical solutions and advantages of the present invention clearer, the present invention is described in further detail below in conjunction with accompanying drawing.
In embodiment of the present invention, more single for the structure of webpage, the user submits to content to increase, but the characteristics that entrance is single, some voice control command are provided, when the element-specific in the webpage and this voice control command coupling, just trigger operation corresponding on this element.
Fig. 1 is the sound control method schematic flow sheet according to the web page operation of embodiment of the present invention.
As shown in Figure 1, the method comprises:
Step 101: the speech text territory is set and corresponding to the control command territory in this speech text territory in HTML (Hypertext Markup Language) (HTML) label (tag) of webpage, and in the control command territory, comprises the webpage control command.
, can expand the HTML standard here, for some labels increase speech text territory and control command territory.Label is the fundamental element among the HTML, and web page element is corresponding label in the HTML standard.Web page element is the base unit of webpage, and for example the button in the webpage is exactly a kind of web page element.
In embodiment of the present invention, the speech text territory keeps corresponding with control command, and comprises the webpage control command in the control command territory.
Such as, embodiment of the present invention can arrange speech text territory and corresponding control command territory in the html tag that Input label, Div label, Table label, Tbody label, Tfoot label or Caption label etc. are commonly used.
Such as: can be in the html tag of webpage, send out microblogging speech text territory for the webpage control command setting of sending out microblogging and corresponding to the control command territory in this microblogging speech text territory for particular type; , the webpage control command of relaying microblogging relays microblogging speech text territory for arranging and corresponding to the control command territory in this relay microblogging speech text territory for particular type; For the webpage control command of comment microblogging comment microblogging speech text territory is set and corresponding to the control command territory in this comment microblogging speech text territory for particular type; For comment and the webpage control command of relaying microblogging comment is set and relays microblogging speech text territory and corresponding to this comment and relay the control command territory in microblogging speech text territory for particular type.
Although more than specifically enumerated the more extendible concrete html tags of embodiment of the present invention, it will be appreciated by those of skill in the art that this enumerating only is exemplary, and be not limited to the protection domain of embodiment of the present invention.
And, in embodiment of the present invention, can set in advance by the mode of self-defining function the particular content of the webpage control command that in the control command territory, comprises.
Exemplarily, can be with speech text territory called after voicetext; Control command territory called after voicecmd; And the mode by function definition arranges forwardweibo for relaying the concrete function name of microblogging operational order.
Take the Input label as example, the embodiment of the present invention specific implementation can be as follows:
Onclick=' forwardweibo ' voicecmd=" forwardweibo " voicetext=" please relay "<input type=" button " class=" inputBtn sendBtn " value=" relay " title=" relay " 〉
Wherein, voicecmd and voicetext are the territory that embodiment of the present invention increased newly.Specifically describing in voicetext has text " please relay ", and specifically describe in voicecmd the concrete function name forwardweibo that relays the microblogging operational order is arranged.
Step 102: from voice command, identify key word, in the html tag of described webpage, retrieve the speech text territory that is complementary with this key word, and the webpage control command that comprises in the control command territory of execution corresponding to described speech text territory.
Browser need to be applied to speech recognition technology herein.
Speech recognition is also referred to as automatic speech recognition (ASR, Automatic Speech Recognition), and its target is that the vocabulary content in the human speech is converted to computer-readable input, for example button, binary coding or character string.
Based on concrete applied environment, the webpage control command of browser support can comprise following at least one: send out microblogging; Relay microblogging; The comment microblogging; Comment and relay microblogging; Send mail; Send personal letter; Or upload annex, etc.
When embodiment of the present invention being applied to when utilizing voice to send out microblogging in browser, the method specifically comprises:
At first from voice command, identify " sending out microblogging " key word, then browser retrieves the speech text territory (namely sending out microblogging speech text territory) that is complementary with " send out microblogging " key word in the html tag of webpage, and from corresponding to parsing microblogging function command the control command territory in this speech text territory; Then move this microblogging function command, in webpage, to send microblogging.
When embodiment of the present invention being applied to when utilizing voice to relay microblogging in browser, the method specifically comprises:
At first from voice command, identify " relay microblogging " key word, in the html tag of webpage, retrieve the speech text territory (namely relaying microblogging speech text territory) that is complementary with " relay microblogging " key word, and from the control command territory corresponding to this speech text territory, parse relay microblogging function command; Then move this relay microblogging function command, in webpage, to relay microblogging.
When embodiment of the present invention being applied to when utilizing voice to comment on microblogging in browser, the method specifically comprises:
At first from voice command, identify " comment microblogging " key word, in the html tag of webpage, retrieve the speech text territory (namely commenting on microblogging microblogging speech text territory) that is complementary with " comment microblogging " key word, and from the control command territory corresponding to the speech text territory, parse comment microblogging function command; Then move this comment microblogging function command, in webpage, to comment on microblogging.
Utilize voice in browser, to comment on and when relaying microblogging, the method specifically comprises when embodiment of the present invention is applied to:
At first from voice command, identify " comment and relay microblogging " key word, in the html tag of webpage, retrieve the speech text territory (i.e. comment and relay microblogging speech text territory) that is complementary with " comment and relay microblogging " key word, and from the control command territory corresponding to this speech text territory, parse comment and relay the microblogging order; Then move this comment and relay the microblogging function command, with comment in webpage and relay microblogging.
Although more than specifically enumerated some embodiments of webpage control command, it will be appreciated by those of skill in the art that this enumerating only is exemplary, and be not limited to the protection domain of embodiment of the present invention.
In one embodiment, browser identifies the concrete sound identification of key word from the voice command that the user sends method can have three kinds: based on the method for channel model and voice knowledge, the method for template matches and the method for utilizing artificial neural network, embodiment of the present invention preferably adopts the method for template matches.Template matches development comparative maturity has reached the practical stage at present.In template matching method, be through four steps: feature extraction, template training, template classification, judgement.Technology commonly used has three kinds: dynamic time warping (DTW), theoretical, vector quantization (VQ) technology of hidden Markov (HMM).
Exemplarily: when the user browses a page, and when having inputted some literal (perhaps not input characters), send voice command and " please relay " (namely saying " please relay " these 3 words), browser begins to search in webpage so, find with key word and " please relay " voicetext territory in the input element that is complementary, and definite voicecmd territory corresponding with the voicetext territory, then can carry out ' forwardweibo ' operation according to the value of voicecmd, namely carry out concrete relay microblogging operational order.
Preferably, input-output apparatus control command territory can be set in html tag further, and in input-output apparatus control command territory, comprise the webpage control command.Like this, when receiving the operation of input-output apparatus, can need not to carry out speech recognition, but directly carry out the webpage control command that comprises in this input-output apparatus control command territory.
Such as, take the Input label as example, can increase input-output apparatus control command territory (such as being onclick) newly, and onclick=' forwardweibo ', like this when mouse is clicked button corresponding to label, can directly carry out ' forwardweibo ' operation, namely directly carry out and relay the microblogging operation.
In webpage, can provide a plurality of several operating interactive points for the user, thereby be convenient to user's control.Such as, be applied as example with microblogging, the microblogging of sending out can be arranged, relay microblogging, comment on microblogging or a plurality of operating interactive points such as comment and relay microblogging.
Use after the embodiment of the present invention, just can be by the control of voice realization to these operations.
Exemplarily, Fig. 2 inputs synoptic diagram according to the microblogging of sending out of embodiment of the present invention; Fig. 3 is the relay input synoptic diagram according to embodiment of the present invention; Fig. 4 is the comment input synoptic diagram according to embodiment of the present invention.
Based on above-mentioned analysis, embodiment of the present invention has also proposed a kind of speech control system of web page operation.
Fig. 5 is the speech control system structural representation according to the web page operation of embodiment of the present invention.
As shown in Figure 5, this system comprises webpage setting unit 501 and browser 502.Wherein:
Webpage setting unit 501 is used for html tag at webpage and the speech text territory is set and corresponding to the control command territory in this speech text territory, comprises the webpage control command in the control command territory.
Such as, can be in the html tag of webpage, send out microblogging speech text territory for the webpage control command setting of sending out microblogging and corresponding to the control command territory in this microblogging speech text territory for particular type; , the webpage control command of relaying microblogging relays microblogging speech text territory for arranging and corresponding to the control command territory in this relay microblogging speech text territory for particular type; For the webpage control command of comment microblogging comment microblogging speech text territory is set and corresponding to the control command territory in this comment microblogging speech text territory for particular type; For comment and the webpage control command of relaying microblogging comment is set and relays microblogging speech text territory and corresponding to this comment and relay the control command territory in microblogging speech text territory for particular type.
Browser 502 is used for identifying key word from voice command, retrieves the speech text territory that is complementary with this key word in the html tag of webpage, and the webpage control command that comprises in the control command territory of execution corresponding to the speech text territory.
In one embodiment, input-output apparatus control command territory can be set in html tag further, and in input-output apparatus control command territory, comprise the webpage control command.Like this, when receiving the operation of input-output apparatus, can need not to carry out speech recognition, but directly carry out the webpage control command that comprises in this input-output apparatus control command territory.
Such as, take the Input label as example, can increase input-output apparatus control command territory (such as being onclick) newly, and onclick=' forwardweibo ', like this when mouse is clicked button corresponding to label, can directly carry out ' forwardweibo ' operation, namely carry out and relay the microblogging operation.
Particularly:
Webpage setting unit 501 is further used for arranging input-output apparatus control command territory in this html tag, comprise the webpage control command in described input-output apparatus control command territory;
Browser 502 is further used for when receiving the operation of input-output apparatus, carries out the webpage control command that comprises in this input-output apparatus control command territory.
Preferably, webpage setting unit 501 can also be further used for by the mode of self-defining function the webpage control command being set.
And embodiment of the present invention can arrange speech text territory and corresponding control command territory in the html tag that Input label, Div label, Table label, Tbody label, Tfoot label or Caption label etc. are commonly used.
Based on concrete applied environment, the webpage control command of browser support can comprise following at least one: send out microblogging; Relay microblogging; The comment microblogging; Comment and relay microblogging; Send mail; Send personal letter; Or upload annex, etc.
In one embodiment, the webpage control command is for sending out microblogging.At this moment, browser 502, be used for from voice command identification microblogging key word, in the html tag of webpage, retrieve and the speech text territory (namely sending out microblogging speech text territory) of sending out the microblogging key word and being complementary, and from corresponding to parsing microblogging function command the control command territory in described speech text territory; And move this microblogging function command, in webpage, to send microblogging.
In one embodiment, the webpage control command is for relaying microblogging.At this moment, browser 502, be used for identifying relay microblogging key word from voice command, in the html tag of webpage, retrieve and the speech text territory (namely relaying microblogging speech text territory) of relaying the microblogging key word and being complementary, and from the control command territory corresponding to the speech text territory, parse and relay the microblogging function command; And move this relay microblogging function command, in webpage, to relay microblogging.
In one embodiment, the webpage control command is the comment microblogging.At this moment, browser 502, be used for identifying comment microblogging key word from voice command, in the html tag of webpage, retrieve and the speech text territory (namely commenting on microblogging speech text territory) of commenting on the microblogging key word and being complementary, and from the control command territory corresponding to described speech text territory, parse comment microblogging function command; And move this comment microblogging function command, in webpage, to comment on microblogging.
In one embodiment, the webpage control command is comment and relay microblogging; At this moment, browser 502, be used for identifying comment and relaying microblogging key word (i.e. comment and relay microblogging speech text territory) from voice command, in the html tag of webpage, retrieve and the speech text territory of commenting on and relaying the microblogging key word and being complementary, and from the control command territory corresponding to the speech text territory, parse comment and relay the microblogging order; And move this comment and relay the microblogging function command, with comment in webpage and relay microblogging.
But although more than specifically enumerated some embodiments of webpage control command and extension tag, it will be appreciated by those of skill in the art that this enumerating only is exemplary, and be not limited to the protection domain of embodiment of the present invention.
Can find out from technique scheme, in embodiment of the present invention, the speech text territory at first is set and corresponding to the control command territory in this speech text territory in the html tag of webpage, and in the control command territory, include the webpage control command; Then from voice command, identify key word, in the html tag of webpage, retrieve the speech text territory that is complementary with this key word, and the webpage control command that comprises in the control command territory of execution corresponding to the speech text territory.This shows, use after the embodiment of the present invention, by expansion html tag and voiced keyword identification, realized the web page operation voice control for the web page contents element.And the control mode of embodiment of the present invention is for specific webpage, rather than general order, so embodiment of the present invention has significantly improved the operation versatility.
In addition, the present invention can select label to expand arbitrarily in numerous labels of HTML, and therefore concrete application form of the present invention is very various, also helps various selection of developer.
The above is preferred embodiment of the present invention only, is not for limiting protection scope of the present invention.Within the spirit and principles in the present invention all, any modification of doing, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.