CN112770157B - Voice control method, device, equipment and medium for WEB front-end interface of television - Google Patents
Voice control method, device, equipment and medium for WEB front-end interface of television Download PDFInfo
- Publication number
- CN112770157B CN112770157B CN202011502454.0A CN202011502454A CN112770157B CN 112770157 B CN112770157 B CN 112770157B CN 202011502454 A CN202011502454 A CN 202011502454A CN 112770157 B CN112770157 B CN 112770157B
- Authority
- CN
- China
- Prior art keywords
- voice
- web front
- television
- event
- page
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/422—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
- H04N21/42203—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/443—OS processes, e.g. booting an STB, implementing a Java virtual machine in an STB or power management in an STB
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/478—Supplemental services, e.g. displaying phone caller identification, shopping application
- H04N21/4782—Web browsing, e.g. WebTV
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/8543—Content authoring using a description language, e.g. Multimedia and Hypermedia information coding Expert Group [MHEG], eXtensible Markup Language [XML]
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Software Systems (AREA)
- Computer Security & Cryptography (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The invention discloses a voice control method, a device, equipment and a medium for a television WEB front end interface, wherein the method comprises the following steps: when a voice command is acquired, judging whether the current application type of the television is Web front-end page application or not; when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and a corresponding voice event is determined; judging whether the voice event belongs to a Web front-end page event or not; when the voice event belongs to a Web front-end page event, generating a semantic instruction; and controlling the matched web front-end interface to respond according to the semantic instruction. The invention aims to solve the problems that in the prior art, a designated node in a Web front-end interface in a television terminal cannot be controlled through voice, only the interface can be slid through a remote controller for control, the operation experience is very poor, and the operation is very inconvenient.
Description
Technical Field
The invention relates to the technical field of voice control, in particular to a voice control method and device for a television WEB front-end interface, terminal equipment and a storage medium.
Background
With the wide popularization of the internet and the development of artificial intelligence technology, intelligent voice televisions have become the mainstream of the market. On the other hand, the development of the voice recognition technology enables terminal devices such as intelligent voice televisions and the like to be rapidly popularized. The intelligent voice television is mainly embodied in the mode that corresponding client application can be controlled, for example, a voice command of 'i want to watch a movie' can open a client of a type of 'love art', and the like, namely the corresponding television client application can be controlled through voice.
However, in the prior art, a designated node in a Web front-end interface in a television terminal cannot be controlled through voice, and only sliding of the interface through a remote controller is required for control, so that the operation experience is very poor.
Thus, there is a need for improvements and enhancements in the art.
Disclosure of Invention
The technical problem to be solved by the present invention is to provide a voice control method, apparatus, terminal device and storage medium for a WEB front end interface of a television, aiming at solving the problems that in the prior art, a designated node in the WEB front end interface in a television terminal cannot be controlled by voice, and only the interface can be slid by a remote controller for control, so that the operation experience is very poor and the operation is very inconvenient.
In order to solve the technical problems, the technical scheme adopted by the invention is as follows:
a voice control method for a television WEB front-end interface comprises the following steps:
when a voice command is acquired, judging whether the current application type of the television is Web front-end page application or not;
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and a corresponding voice event is determined;
judging whether the voice event belongs to a Web front-end page event or not;
when the voice event belongs to a Web front-end page event, generating a semantic instruction; and controlling the matched web front-end interface to respond according to the semantic instruction.
The voice control method of the television WEB front end interface comprises the following steps of controlling the matched WEB front end interface to respond according to the semantic instruction:
and converting the Web front-end page instruction according to the matching result of the semantic instruction and the current page form, controlling the current Web page, obtaining a corresponding page anchor point and acquiring a corresponding application response.
The voice control method for the television WEB front-end interface comprises the following steps of judging whether the current application type of the television is the Web front-end page application or not when the voice command is acquired:
when a voice command is acquired, detecting the webpage attribute of the current television application;
judging whether the current television application is a Web front-end page application or not according to the fact whether the current television application has the html page attribute or not;
when the current television application has the html webpage attribute, judging that the current television application is a Web front-end page application;
and when the current television application does not have the html webpage attribute, judging that the current television application is not the Web front-end page application, and performing default conventional voice recognition to control the television.
The voice control method of the television WEB front-end interface is characterized in that when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and the step of determining the corresponding voice event comprises the following steps:
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis; and determining the corresponding voice event.
The voice control method for the television WEB front-end interface comprises the following steps of judging whether the voice event belongs to a Web front-end page event or not:
when the corresponding voice event is determined according to the analysis result;
and analyzing the voice event, and judging whether the voice event belongs to the type of a click or sliding web front end page event.
The voice control method of the television WEB front-end interface comprises the steps that when the voice event belongs to a Web front-end page event, a semantic instruction is generated; controlling the response of the matched web front-end interface according to the semantic instruction comprises:
when the voice event is judged to belong to a Web front-end page event, generating a corresponding semantic instruction;
matching page nodes according to the semantic instruction, and screening optimal nodes;
judging whether the screened optimal nodes are visual or not;
when the screened optimal node is not currently visualized, sliding the page, and then executing page jump to obtain service;
and when the screened optimal node is visualized at present, directly executing page jump to obtain service.
The voice control method of the television WEB front-end interface comprises the steps that when the voice event belongs to a Web front-end page event, a semantic instruction is generated; controlling the response of the matched web front-end interface according to the semantic instruction comprises:
performing rough matching on the corresponding semantic information and text data in the front-end webpage according to the instruction to obtain a matched front-end webpage candidate area, and then performing fine matching from the upper layer to the lower layer according to nodes in the html structure hierarchical tree corresponding to the semantic level instruction information and the current front-end interface to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
screening out an optimal node from the matching nodes, and taking the matching node with the highest matching degree with the current semantic information on two aspects of attributes and contents as the optimal node;
and if the optimal node is in the current visual area, selecting the node, executing the corresponding bound method to obtain the service, if the optimal node is not in the visual area, executing a page sliding method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding bound method to obtain the service.
A voice control apparatus for a WEB front end interface of a television, wherein the apparatus comprises:
the front-end page judging module is used for judging whether the current application type of the television is Web front-end page application or not when the voice command is acquired;
the semantic analysis module is used for identifying and performing semantic analysis on the acquired voice instruction when the current application type of the television is Web front-end page application, and determining a corresponding voice event;
the front-end page event judging module is used for judging whether the voice event belongs to a Web front-end page event or not;
the front-end page event response control module is used for generating a semantic instruction when the voice event belongs to a Web front-end page event; and controlling the matched web front-end interface to respond according to the semantic instruction.
A terminal device comprises a memory, a processor and a voice control program of a television WEB front end interface, wherein the voice control program of the television WEB front end interface is stored in the memory and can run on the processor, and when the processor executes the voice control program of the television WEB front end interface, the voice control method of the television WEB front end interface is realized.
A computer readable storage medium, wherein a voice control program of a television WEB front end interface is stored on the computer readable storage medium, and when the voice control program of the television WEB front end interface is executed by a processor, the steps of any one voice control method of the television WEB front end interface are realized.
Has the advantages that: compared with the prior art, the voice control method of the television WEB front-end interface is provided, the voice instruction is subjected to semantic analysis through the method of controlling the television Web front-end interface through voice, the obtained semantic instruction information is matched with the Web front-end page, the conversion from the voice instruction to the television Web front-end page instruction is realized, a television remote controller is replaced to obtain a corresponding page node and obtain a corresponding application response, the condition that the voice instruction in the application of the television front-end interface cannot be identified is reduced, the use comfort and the operation experience of the intelligent television are greatly improved, and convenience is provided for users.
Drawings
Fig. 1 is a flowchart of a specific implementation of a voice control method for a television WEB front-end interface according to embodiment 1 of the present invention.
Fig. 2 is a schematic flow chart of a voice control method for a television WEB front-end interface according to embodiment 2 of the present invention.
Fig. 3 is a schematic diagram of a WEB front-end page of a television according to the voice control method for the WEB front-end interface of the television according to the embodiment of the present invention.
Fig. 4 is a schematic block diagram of a voice control apparatus of a tv WEB front end interface according to an embodiment of the present invention.
Fig. 5 is a schematic block diagram of a voice control apparatus of a tv WEB front end interface according to another embodiment of the present invention.
Fig. 6 is a schematic block diagram of an internal structure of a terminal device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and effects of the present invention clearer and clearer, the present invention is further described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
With the wide popularization of the internet and the development of artificial intelligence technology, intelligent voice televisions have become the mainstream of the market. On the other hand, the development of voice recognition technology enables terminal devices such as smart voice televisions to be rapidly popularized. The intelligent voice television is mainly embodied in the mode that corresponding client application can be controlled, for example, a voice command of 'i want to watch a movie' can open a client of a type of 'love art' and the like, namely, the corresponding television client application can be controlled through voice.
However, in the prior art, a designated node in a Web front-end interface in a television terminal cannot be controlled through voice, and only sliding of the interface through a remote controller is required for control, so that the operation experience is very poor.
Namely the defects of the prior art: for the application of the front-end page of the medium-long WEB of the television, the voice instruction cannot acquire the anchor point of the front-end page of the WEB of the television, so that corresponding response cannot be obtained, and the use comfort of a user is greatly influenced. For example, a certain television has applications formed by certain longer front-end pages, and a corresponding anchor point cannot be found by a voice command of a user, and the corresponding anchor point can only be operated by a key of a remote controller; on the other hand, the voice command of the user is not recognized or is wrongly recognized in the client application, so that the operation experience of the user is greatly influenced, the operation is very complicated, and inconvenience is brought to the use of the user.
In order to solve the problems in the prior art, the embodiment provides a voice control method for a WEB front-end interface of a television, and the invention provides a method for controlling a television by using voice, which is operated in Linux and Android, can perform voice recognition to semantic understanding through connection between the internet and a remote server, and finally makes an execution decision on a current WEB front-end page of equipment instead of a television remote controller according to a voice control attribute. The method mainly solves the problem that in the existing television, a focus is obtained through a remote controller for selective control of a Web type page of Html, so that the application experience of the Web page in the television is very poor. For example: a longer WEB front-end page exists in the television application, a user can obtain corresponding page information only by performing pull-down operation for multiple times through a remote controller, interested node information is opened through the remote controller, and the television interaction experience is greatly influenced; the application scene of the invention is not limited to the intelligent television, and the invention can also be used for other voice intelligent equipment with a screen.
According to the characteristics of the existing intelligent voice television, the voice command is analyzed to be matched with the current Web front-end page, the page anchor point corresponding to the voice command is decided, the corresponding service is obtained by positioning, the television front-end page capable of sliding up and down through the remote controller is replaced, and meanwhile wrong voice recognition is prevented from entering the client application, so that better user experience is obtained. For example, a user browses a movie film evaluation website at a television end, does not need to search by up and down operations of a remote controller, and controls a current Web page according to a matching result of a current page form and a voice instruction type, namely: the user can speak which movie comment to turn on through voice instructions.
By the method for controlling the television Web front-end interface through the voice, the conversion from the voice instruction to the television Web front-end page instruction can be completed, a television remote controller is replaced to obtain the corresponding page anchor point and obtain the corresponding application response, the processing process of voice recognition and voice control is optimized, the condition that the voice instruction in the front-end interface cannot be recognized is reduced, the accuracy of voice control is improved, and the use comfort and the operation experience of the intelligent television are greatly improved.
Exemplary method
The voice control method for the television WEB front-end interface of this embodiment may be applied to a terminal device, and specifically as shown in fig. 1, the voice control method for the television WEB front-end interface includes the following steps:
step S100, when a voice command is acquired, judging whether the current application type of the television is Web front-end page application;
in the embodiment of the invention, when the smart television acquires the voice command of the user, whether the current application type of the television is Web front-end page application or not is judged firstly.
For example, when a voice command is acquired, detecting the webpage attribute of the current television application; judging whether the current television application is a Web front-end page application or not according to the fact whether the current television application has the html page attribute or not;
when the current television application has the html webpage attribute, determining that the current television application is the Web front-end page application;
and when the current television application does not have the html webpage attribute, judging that the current television application is not the Web front-end page application, and performing default conventional voice recognition to control the television.
For example, if the current smart terminal page is in a browser front-end page similar to fig. 3 (the figure is a part of the current page, and part of the page is not visualized), the html web page attribute is possessed.
S200, when the current application type of the television is Web front-end page application, identifying and performing semantic analysis on the acquired voice instruction to determine a corresponding voice event;
in the embodiment of the invention, when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and the corresponding voice event is determined.
Specifically, when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis; and determining the corresponding voice event.
For example: a user says ' small dimension and small dimension ', music is opened ', the user is judged to be a conventional voice control operation through preliminary voice recognition and semantic analysis, and a voice event is a common APP application.
For example, when the user says ' small dimension and small dimension ', opens the home site of the automobile ', the recognition and semantic analysis are carried out; and determining that the corresponding voice event is the voice event of the home station of the automobile needing to be opened.
Step S300, judging whether the voice event belongs to a Web front end page event or not;
in the embodiment of the invention, the smart television can judge whether the voice event belongs to the Web front-end page event.
Specifically, when the corresponding voice event is determined according to the analysis result; the smart television analyzes the voice event and judges whether the voice event belongs to the type of a click or sliding web front-end page event.
For example: a user says ' small dimension and small dimension ' and opens music ', the music is judged to be a regular voice control operation through preliminary voice recognition and semantic analysis, and a corresponding music client is directly opened; if the user says ' small dimension and small dimension ' and opens the home site of the automobile ', the type of the web front-end page event is preliminarily judged, text data corresponding to the standard is normalized and generated as ' open ', ' automobile home ' ] and the next operation is carried out.
Step S400, when the voice event belongs to a Web front end page event, generating a semantic instruction; and controlling the matched web front-end interface to respond according to the semantic instruction.
In the embodiment of the invention, when the voice event belongs to a Web front-end page event, a semantic instruction is generated, the Web front-end page instruction is converted according to the matching result of the semantic instruction and the current page form, the current Web page is controlled, a corresponding page anchor point is obtained, and a corresponding application response is obtained.
Specifically, the method comprises the following steps: when the voice event is judged to belong to a Web front-end page event, generating a corresponding semantic instruction; matching page nodes according to the semantic instruction, and screening optimal nodes;
judging whether the screened optimal nodes are visual or not; when the screened optimal node is not currently visualized, sliding the page, and then executing page jump to obtain service;
and when the screened optimal node is visualized at present, directly executing page jump to obtain service.
Preferably, the intelligent system intelligently performs rough matching on the semantic information corresponding to the instruction and text data in the front-end webpage to obtain a matched front-end webpage candidate area, and then performs fine matching from the upper layer to the lower layer on the basis of nodes in the html structure hierarchical tree corresponding to the semantic level instruction information and the current front-end interface to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
screening out an optimal node from the matching nodes, and taking the matching node with the highest matching degree with the current semantic information on two aspects of attributes and contents as the optimal node;
and if the optimal node is in the current visual area, selecting the node, executing the corresponding bound method to obtain the service, if the optimal node is not in the visual area, executing a page sliding method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding bound method to obtain the service.
The invention is explained in further detail below by means of specific application examples:
as shown in fig. 2, the voice control method for a tv WEB front-end interface according to the embodiment of the present invention includes the following steps:
the intelligent television equipment is input through voice awakening and enters voice recognition;
step S10, after the voice instruction of the user is obtained, judging whether the current television application type is Web front-end page application: judging whether the current television application is a Web front-end application according to whether the current television application has the html webpage attribute, and if so, entering the step S11; otherwise, performing default conventional voice recognition to control the television, and controlling the client application by the voice.
For example, fig. 3 is a schematic diagram of a Web front-end page of a television, and if a current intelligent terminal page is in a browser front-end page similar to fig. 3 (the figure is a part of the current page, and a part of the current page is not visualized), and has an html Web page attribute, the next operation is performed.
And S11, performing semantic analysis according to the acquired user voice instruction, preliminarily judging whether the type of the web front-end page event belongs to a single click or sliding according to an analysis result, filtering invalid data if the type of the web front-end page event belongs to the single click or sliding, standardizing semantic information of the generated instruction, and entering S12 to perform the next operation, otherwise, judging that the type of the web front-end page event belongs to a conventional voice control operation, and controlling the client application by voice.
For example: a user says ' small dimension and small dimension ', opens music ', judges the music as a conventional voice control operation through preliminary voice recognition and semantic analysis, and directly opens a corresponding music client; if the user says ' small dimension and opens the home site of the automobile ', the user is preliminarily judged as the web front-end page event type, and text data ' open ', ' home of the automobile ' corresponding to the standard ' are normalized and the next operation is carried out.
Step S12, matching the acquired instruction information with a Web front end page, namely: firstly, carrying out rough matching on semantic information corresponding to an instruction and text data in a front-end webpage to obtain a matched front-end webpage candidate area, and then carrying out fine matching from the upper layer to the lower layer on nodes in an html structure hierarchical tree corresponding to a current front-end interface according to semantic level instruction information to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
for example: standard text data [ "open", "car owner" ] is matched with normalized text data [ "browse web page", "fine product recommendation", "my collection", "setup and tool", ], [ "HAO website", "online news", "phoenix net", "sina microblog", "UC cloud service", "tv application", "voice happy table", "panning", "HAO website", "Baidu", "car owner", "search fox video", "sofa manager", "super cool", "weather", "tiger flapping sports" }, rough matching and screening to the "car owner" node.
S13, screening out an optimal node from the matched nodes: judging whether the current semantic information exists in the current matching node, namely further judging whether the semantic information is met on three levels of the attribute, the content and the method bound by the matching node, wherein the judging method comprises the following steps:
taking a matching node with the highest matching degree with the current semantic information on two levels of the attribute and the content as an optimal node, and then performing an operation step S14;
for example: the current matching node is only 'car home', so that the current matching node is also the optimal node at the moment,
and S14, if the optimal node is in the current visual area, selecting the node, executing the corresponding bound method to obtain the service, if the optimal node is not in the visual area, executing the page sliding method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding bound method to obtain the service.
For example: if the current 'car home' node is in the current page, the binding method is executed to jump to a new page to obtain service, if the 'car home' node is not in the current page, the binding method is firstly slid to a corresponding area of the node, and then the corresponding binding method is executed to jump to the page to obtain service corresponding to the 'car home'.
Therefore, the method for controlling the television Web front-end interface through the voice carries out semantic analysis on the voice instruction, matches the acquired semantic instruction information with the Web front-end page, realizes the conversion from the voice instruction to the television Web front-end page instruction, replaces a television remote controller to obtain the corresponding page node and acquire the corresponding application response, reduces the condition that the voice instruction in the television front-end interface application cannot be identified, and greatly improves the use comfort and the operation experience of the intelligent television.
Exemplary device
As shown in fig. 4, an embodiment of the present invention further provides a voice control apparatus for a WEB front end interface of a television, including three modules: the system comprises an application type analysis module, an event analysis and matching module and an event response module. The application type analysis module, the event analysis and matching module and the event response module can be connected in sequence.
The application analysis module is used for distinguishing the type of the current page and judging whether the current page is an application formed by a web front-end interface;
the event analysis and matching module mainly comprises two links of semantic analysis and matching analysis and is used for generating semantic information after voice recognition and analysis and then judging whether the acquired voice event acts on the type of the web page;
the event response module comprises an event verification link and an event response link and is used for responding to the voice event matched with the web front-end interface after verification is passed and obtaining service.
As shown in fig. 5, another embodiment of the present invention provides a voice control apparatus for a WEB front end interface of a television, including:
the front-end page judging module 10 is configured to, when the voice command is obtained, judge whether the current application type of the television is a Web front-end page application;
the semantic analysis module 20 is configured to, when the current application type of the television is a Web front-end page application, perform recognition and semantic analysis on the acquired voice instruction to determine a corresponding voice event;
a front-end page event determining module 30, configured to determine whether the voice event belongs to a Web front-end page event;
a front-end page event response control module 40, configured to generate a semantic instruction when the voice event belongs to a Web front-end page event; and controlling the matched web front-end interface to respond according to the semantic instruction.
Based on the above embodiments, the present invention further provides a terminal device, and a schematic block diagram thereof may be as shown in fig. 6. The terminal equipment comprises a processor, a memory, a network interface, a display screen and a voice recognition module which are connected through a system bus. Wherein the processor of the terminal device is configured to provide computing and control capabilities. The memory of the terminal equipment comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The network interface of the terminal device is used for connecting and communicating with an external terminal through a network. The computer program is executed by a processor to realize a voice control method of a television WEB front-end interface. The display screen of the terminal equipment can be a liquid crystal display screen or an electronic ink display screen, and the voice recognition module of the terminal equipment is arranged in the terminal equipment in advance and used for recognizing the voice of a user.
It will be understood by those skilled in the art that the block diagram of fig. 6 is only a block diagram of a part of the structure related to the solution of the present invention, and does not constitute a limitation to the terminal device to which the solution of the present invention is applied, and a specific terminal device may include more or less components than those shown in the figure, or combine some components, or have a different arrangement of components.
In one embodiment, a terminal device is provided, where the terminal device includes a memory, a processor, and a voice control program of a tv WEB front end interface stored in the memory and executable on the processor, and when the processor executes the voice control program of the tv WEB front end interface, the following operation instructions are implemented:
when a voice command is acquired, judging whether the current application type of the television is Web front-end page application or not;
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and a corresponding voice event is determined;
judging whether the voice event belongs to a Web front-end page event or not;
when the voice event belongs to a Web front-end page event, generating a semantic instruction; controlling the matched web front-end interface to respond according to the semantic instruction; as described above.
Wherein the step of controlling the response of the matched web front end interface according to the semantic instruction comprises:
and converting the Web front-end page instruction according to the matching result of the semantic instruction and the current page form, controlling the current Web page, obtaining a corresponding page anchor point and acquiring a corresponding application response.
The step of judging whether the current application type of the television is the Web front-end page application or not when the voice command is acquired comprises the following steps:
when a voice command is acquired, detecting the webpage attribute of the current television application;
judging whether the current television application is a Web front-end page application or not according to the fact whether the current television application has the html page attribute or not;
when the current television application has the html webpage attribute, judging that the current television application is a Web front-end page application;
and when the current television application does not have the html webpage attribute, judging that the current television application is not the Web front-end page application, and performing default conventional voice recognition to control the television.
When the current application type of the television is Web front-end page application, the steps of identifying and performing semantic analysis on the acquired voice instruction and determining the corresponding voice event comprise:
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis; and determining a corresponding voice event.
The step of judging whether the voice event belongs to a Web front end page event comprises the following steps:
when the corresponding voice event is determined according to the analysis result;
and analyzing the voice event, and judging whether the voice event belongs to the type of a click or sliding web front end page event.
When the voice event belongs to a Web front-end page event, generating a semantic instruction; controlling the response of the matched web front-end interface according to the semantic instruction comprises:
when the voice event is judged to belong to a Web front-end page event, generating a corresponding semantic instruction;
matching page nodes according to the semantic instruction, and screening optimal nodes;
judging whether the screened optimal nodes are visual or not;
when the screened optimal node is not in the current visualization state, the page is slid, and then page skipping is executed to obtain service;
and when the screened optimal node is visualized at present, directly executing page jump to obtain service.
When the voice event belongs to a Web front-end page event, generating a semantic instruction; controlling the response of the matched web front-end interface according to the semantic instruction comprises:
performing rough matching on the corresponding semantic information and text data in the front-end webpage according to the instruction to obtain a matched front-end webpage candidate area, and then performing fine matching from the upper layer to the lower layer according to nodes in the html structure hierarchical tree corresponding to the semantic level instruction information and the current front-end interface to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
screening out an optimal node from the matching nodes, and taking the matching node with the highest matching degree with the current semantic information on two aspects of attributes and contents as the optimal node;
and if the optimal node is in the current visual area, selecting the node, executing the corresponding bound method to obtain the service, if the optimal node is not in the visual area, executing a page sliding method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding bound method to obtain the service.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above may be implemented by hardware instructions of a computer program, which may be stored in a non-volatile computer-readable storage medium, and when executed, may include the processes of the embodiments of the methods described above. Any reference to memory, storage, databases, or other media used in embodiments provided herein may include non-volatile and/or volatile memory. Non-volatile memory can include read-only memory (ROM), programmable ROM (PROM), electrically Programmable ROM (EPROM), electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double Data Rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous Link DRAM (SLDRAM), rambus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
In summary, the present invention discloses a voice control method, apparatus, terminal device and storage medium for a WEB front end interface of a television, and the method includes: when a voice command is acquired, judging whether the current application type of the television is Web front-end page application or not; when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and a corresponding voice event is determined; judging whether the voice event belongs to a Web front-end page event or not; when the voice event belongs to a Web front-end page event, generating a semantic instruction; and controlling the matched web front-end interface to respond according to the semantic instruction. The invention aims to solve the problems that in the prior art, a designated node in a Web front-end interface in a television terminal cannot be controlled through voice, only the interface can be slid through a remote controller for control, the operation experience is very poor, and the operation is very inconvenient.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, and not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.
Claims (9)
1. A voice control method for a television WEB front-end interface is characterized by comprising the following steps:
when a voice command is acquired, judging whether the current application type of the television is Web front-end page application or not;
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and a corresponding voice event is determined;
judging whether the voice event belongs to a Web front-end page event or not;
when the voice event belongs to a Web front-end page event, generating a semantic instruction; controlling the matched Web front-end interface to respond according to the semantic instruction;
normalizing the voice event which is preliminarily judged as the Web front-end page event type into text data corresponding to the standard and carrying out the next operation;
when the voice event belongs to a Web front-end page event, generating a semantic instruction; controlling the response of the matched Web front-end interface according to the semantic instruction, wherein the step comprises the following steps:
performing rough matching on the corresponding semantic information and text data in the front-end webpage according to the instruction to obtain a matched front-end webpage candidate area, and then performing fine matching from the upper layer to the lower layer according to nodes in the html structure hierarchical tree corresponding to the semantic level instruction information and the current front-end interface to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
screening out an optimal node from the matching nodes, and taking the matching node with the highest matching degree with the current semantic information on two aspects of attributes and contents as the optimal node;
and if the optimal node is in the current visual area, selecting the node, executing the corresponding bound method to obtain the service, if the optimal node is not in the visual area, executing a page sliding method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding bound method to obtain the service.
2. The voice control method for the WEB front end interface of the television as claimed in claim 1, wherein the step of controlling the response of the matched WEB front end interface according to the semantic instruction comprises:
and converting the Web front-end page instruction according to the matching result of the semantic instruction and the current page form, controlling the current Web page, obtaining a corresponding page anchor point and acquiring a corresponding application response.
3. The voice control method for the WEB front-end interface of the television according to claim 1, wherein the step of determining whether the current application type of the television is the WEB front-end page application when the voice command is obtained comprises:
when a voice command is acquired, detecting the webpage attribute of the current television application;
judging whether the current television application is a Web front-end page application or not according to the html page attribute;
when the current television application has the html webpage attribute, determining that the current television application is the Web front-end page application;
and when the current television application does not have the html webpage attribute, judging that the current television application is not the Web front-end page application, and performing default conventional voice recognition to control the television.
4. The voice control method for the WEB front-end interface of the television as claimed in claim 1, wherein the step of identifying and semantically analyzing the obtained voice command and determining the corresponding voice event when the current application type of the television is the WEB front-end page application comprises:
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis; and determining a corresponding voice event.
5. The method for controlling the voice of the WEB front-end interface of the television set as claimed in claim 1, wherein the step of determining whether the voice event belongs to the WEB front-end page event comprises:
when the corresponding voice event is determined according to the analysis result;
and analyzing the voice event, and judging whether the voice event belongs to the type of a single click or sliding Web front end page event.
6. The voice control method for the WEB front end interface of the television according to claim 1, wherein when the voice event belongs to a WEB front end page event, a semantic instruction is generated; controlling the response of the matched Web front-end interface according to the semantic instruction, wherein the step comprises the following steps:
when the voice event is judged to belong to a Web front-end page event, generating a corresponding semantic instruction;
matching page nodes according to the semantic instruction, and screening optimal nodes;
judging whether the screened optimal nodes are visual or not;
when the screened optimal node is not in the current visualization state, the page is slid, and then page skipping is executed to obtain service;
and when the screened optimal node is visualized at present, directly executing page jump to obtain service.
7. A voice control device for a television WEB front-end interface, the device comprising:
the front-end page judging module is used for judging whether the current application type of the television is Web front-end page application or not when the voice command is acquired;
the semantic analysis module is used for identifying and performing semantic analysis on the acquired voice instruction when the current application type of the television is Web front-end page application, and determining a corresponding voice event;
the front-end page event judging module is used for judging whether the voice event belongs to a Web front-end page event or not;
the front end page event judging module is also used for standardizing the voice event which is preliminarily judged as the Web front end page event type into text data corresponding to the standard and carrying out the next operation;
The front-end page event response control module is used for generating a semantic instruction when the voice event belongs to a Web front-end page event; controlling the matched Web front-end interface to respond according to the semantic instruction;
the front-end page event response control module is also used for generating a semantic instruction when the voice event belongs to a Web front-end page event; controlling the response of the matched Web front-end interface according to the semantic instruction, wherein the step of controlling the response of the matched Web front-end interface comprises the following steps:
performing rough matching on the corresponding semantic information and text data in the front-end webpage according to the instruction to obtain a matched front-end webpage candidate area, and then performing fine matching from the upper layer to the lower layer according to nodes in the html structure hierarchical tree corresponding to the semantic level instruction information and the current front-end interface to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
screening out an optimal node from the matching nodes, and taking the matching node with the highest matching degree with the current semantic information on two aspects of attributes and contents as the optimal node;
and if the optimal node is in the current visual area, selecting the node, executing the corresponding bound method to obtain the service, if the optimal node is not in the visual area, executing a page sliding method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding bound method to obtain the service.
8. A terminal device, wherein the terminal device comprises a memory, a processor, and a voice control program of a tv WEB front end interface stored in the memory and capable of running on the processor, and when the processor executes the voice control program of the tv WEB front end interface, the steps of the voice control method of the tv WEB front end interface according to any one of claims 1 to 6 are implemented.
9. A computer readable storage medium having stored thereon a voice control program for a television WEB front end interface, the voice control program for a television WEB front end interface when executed by a processor implementing the steps of the method for voice control of a television WEB front end interface as claimed in any one of claims 1 to 6.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011502454.0A CN112770157B (en) | 2020-12-17 | 2020-12-17 | Voice control method, device, equipment and medium for WEB front-end interface of television |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011502454.0A CN112770157B (en) | 2020-12-17 | 2020-12-17 | Voice control method, device, equipment and medium for WEB front-end interface of television |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112770157A CN112770157A (en) | 2021-05-07 |
CN112770157B true CN112770157B (en) | 2023-03-28 |
Family
ID=75694453
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011502454.0A Active CN112770157B (en) | 2020-12-17 | 2020-12-17 | Voice control method, device, equipment and medium for WEB front-end interface of television |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112770157B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113470647A (en) * | 2021-07-22 | 2021-10-01 | 深圳市天威视讯股份有限公司 | Processing method and system through voice control |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017092312A1 (en) * | 2015-12-01 | 2017-06-08 | 乐视控股(北京)有限公司 | Method of browsing webpage on browser and device |
CN106980614A (en) * | 2016-01-15 | 2017-07-25 | 中国科学院声学研究所 | A kind of Web page speech control implementation method extended based on JavaScript |
CN109766073A (en) * | 2019-01-25 | 2019-05-17 | 四川长虹电器股份有限公司 | The method that voice operating web page contents navigate in TV browser |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10157612B2 (en) * | 2012-08-02 | 2018-12-18 | Nuance Communications, Inc. | Methods and apparatus for voice-enabling a web application |
CN105161106A (en) * | 2015-08-20 | 2015-12-16 | 深圳Tcl数字技术有限公司 | Voice control method of intelligent terminal, voice control device and television system |
CN105551488A (en) * | 2015-12-15 | 2016-05-04 | 深圳Tcl数字技术有限公司 | Voice control method and system |
CN110444209B (en) * | 2019-08-13 | 2022-04-12 | 思必驰科技股份有限公司 | Voice interaction method, device and system for embedded web page of intelligent vehicle machine |
-
2020
- 2020-12-17 CN CN202011502454.0A patent/CN112770157B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017092312A1 (en) * | 2015-12-01 | 2017-06-08 | 乐视控股(北京)有限公司 | Method of browsing webpage on browser and device |
CN106980614A (en) * | 2016-01-15 | 2017-07-25 | 中国科学院声学研究所 | A kind of Web page speech control implementation method extended based on JavaScript |
CN109766073A (en) * | 2019-01-25 | 2019-05-17 | 四川长虹电器股份有限公司 | The method that voice operating web page contents navigate in TV browser |
Non-Patent Citations (1)
Title |
---|
郭家清 ; 白宇 ; 蔡东风 ; 刘纪元 ; .基于语音标签的语音浏览器.沈阳航空工业学院学报.2007,(02),全文. * |
Also Published As
Publication number | Publication date |
---|---|
CN112770157A (en) | 2021-05-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10831345B2 (en) | Establishing user specified interaction modes in a question answering dialogue | |
KR101909807B1 (en) | Method and apparatus for inputting information | |
CN107578776B (en) | Voice interaction awakening method and device and computer readable storage medium | |
US8126930B2 (en) | Micro-bucket testing for page optimization | |
US9622016B2 (en) | Invisiblemask: a tangible mechanism to enhance mobile device smartness | |
US20070203869A1 (en) | Adaptive semantic platform architecture | |
WO2018045646A1 (en) | Artificial intelligence-based method and device for human-machine interaction | |
CN105786969A (en) | Information display method and apparatus | |
CN110992937B (en) | Language off-line identification method, terminal and readable storage medium | |
US20090282037A1 (en) | Method and system for providing convenient dictionary services | |
CN103392346A (en) | Personalization of information content by monitoring network traffic | |
CN112770157B (en) | Voice control method, device, equipment and medium for WEB front-end interface of television | |
WO2018183017A1 (en) | Automatically generating documents | |
WO2023280569A1 (en) | Dynamic web page classification in web data collection | |
US20160299972A1 (en) | Providing app store search results | |
CN113343108A (en) | Recommendation information processing method, device, equipment and storage medium | |
JP4962416B2 (en) | Speech recognition system | |
TW201435627A (en) | System and method for optimizing search results | |
CN112447173A (en) | Voice interaction method and device and computer storage medium | |
CN113806667B (en) | Method and system for supporting webpage classification | |
CN114969544A (en) | Hot data-based recommended content generation method, device, equipment and medium | |
CN112052377B (en) | Resource recommendation method, device, server and storage medium | |
TW202219793A (en) | Web page analyzing method and web page analyzing platform using the same | |
CN111666522A (en) | Information processing method, device, equipment and storage medium | |
US20090327233A1 (en) | Method of selecting objects in web pages |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |