CN112770157A - Voice control method, device, equipment and medium for WEB front-end interface of television - Google Patents

Voice control method, device, equipment and medium for WEB front-end interface of television Download PDF

Info

Publication number
CN112770157A
CN112770157A CN202011502454.0A CN202011502454A CN112770157A CN 112770157 A CN112770157 A CN 112770157A CN 202011502454 A CN202011502454 A CN 202011502454A CN 112770157 A CN112770157 A CN 112770157A
Authority
CN
China
Prior art keywords
voice
web front
television
page
event
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011502454.0A
Other languages
Chinese (zh)
Other versions
CN112770157B (en
Inventor
孙爽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Skyworth Information Technology Research Institute Co ltd
Shenzhen Skyworth RGB Electronics Co Ltd
Original Assignee
Nanjing Skyworth Information Technology Research Institute Co ltd
Shenzhen Skyworth RGB Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Skyworth Information Technology Research Institute Co ltd, Shenzhen Skyworth RGB Electronics Co Ltd filed Critical Nanjing Skyworth Information Technology Research Institute Co ltd
Priority to CN202011502454.0A priority Critical patent/CN112770157B/en
Publication of CN112770157A publication Critical patent/CN112770157A/en
Application granted granted Critical
Publication of CN112770157B publication Critical patent/CN112770157B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/443OS processes, e.g. booting an STB, implementing a Java virtual machine in an STB or power management in an STB
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4782Web browsing, e.g. WebTV
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/8543Content authoring using a description language, e.g. Multimedia and Hypermedia information coding Expert Group [MHEG], eXtensible Markup Language [XML]

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Software Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention discloses a voice control method, a device, equipment and a medium for a television WEB front-end interface, wherein the method comprises the following steps: when a voice command is acquired, judging whether the current application type of the television is Web front-end page application or not; when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and a corresponding voice event is determined; judging whether the voice event belongs to a Web front-end page event or not; when the voice event belongs to a Web front-end page event, generating a semantic instruction; and controlling the matched web front-end interface to respond according to the semantic instruction. The invention aims to solve the problems that in the prior art, a designated node in a Web front-end interface in a television terminal cannot be controlled through voice, only the interface can be slid through a remote controller for control, the operation experience is very poor, and the operation is very inconvenient.

Description

Voice control method, device, equipment and medium for WEB front-end interface of television
Technical Field
The invention relates to the technical field of voice control, in particular to a voice control method and device for a television WEB front-end interface, terminal equipment and a storage medium.
Background
With the wide popularization of the internet and the development of artificial intelligence technology, intelligent voice televisions have become the mainstream of the market. On the other hand, the development of voice recognition technology enables terminal devices such as smart voice televisions to be rapidly popularized. The intelligent voice television is mainly embodied in the mode that corresponding client application can be controlled, for example, a voice command of 'i want to watch a movie' can open a client of a type of 'love art' and the like, namely, the corresponding television client application can be controlled through voice.
However, in the prior art, a designated node in a Web front-end interface in a television terminal cannot be controlled through voice, and only sliding of the interface through a remote controller is required for control, so that the operation experience is very poor.
Thus, there is a need for improvements and enhancements in the art.
Disclosure of Invention
The technical problem to be solved by the present invention is to provide a method, an apparatus, a terminal device and a storage medium for voice control of a WEB front-end interface of a television, aiming at solving the problems that in the prior art, a designated node in the WEB front-end interface of a television terminal cannot be controlled by voice, only a remote controller can be used to slide an interface for control, the operation experience is very poor, and the operation is very inconvenient.
In order to solve the technical problems, the technical scheme adopted by the invention is as follows:
a voice control method for a television WEB front-end interface comprises the following steps:
when a voice command is acquired, judging whether the current application type of the television is Web front-end page application or not;
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and a corresponding voice event is determined;
judging whether the voice event belongs to a Web front-end page event or not;
when the voice event belongs to a Web front-end page event, generating a semantic instruction; and controlling the matched web front-end interface to respond according to the semantic instruction.
The voice control method of the television WEB front end interface comprises the following steps of controlling the matched WEB front end interface to respond according to the semantic instruction:
and converting the Web front-end page instruction according to the matching result of the semantic instruction and the current page form, controlling the current Web page, obtaining a corresponding page anchor point and acquiring a corresponding application response.
The voice control method for the television WEB front-end interface comprises the following steps of judging whether the current application type of the television is the Web front-end page application or not when the voice command is acquired:
when a voice command is acquired, detecting the webpage attribute of the current television application;
judging whether the current television application is a Web front-end page application or not according to the fact whether the current television application has the html page attribute or not;
when the current television application has the html webpage attribute, determining that the current television application is the Web front-end page application;
and when the current television application does not have the html webpage attribute, judging that the current television application is not the Web front-end page application, and performing default conventional voice recognition to control the television.
The voice control method of the television WEB front-end interface is characterized in that when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and the step of determining the corresponding voice event comprises the following steps:
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis; and determining a corresponding voice event.
The voice control method of the television WEB front-end interface comprises the following steps of:
when the corresponding voice event is determined according to the analysis result;
and analyzing the voice event, and judging whether the voice event belongs to the type of a page event at the front end of a click or slide web.
The voice control method of the television WEB front-end interface comprises the steps that when the voice event belongs to a Web front-end page event, a semantic instruction is generated; controlling the response of the matched web front-end interface according to the semantic instruction comprises:
when the voice event is judged to belong to a Web front-end page event, generating a corresponding semantic instruction;
matching page nodes according to the semantic instruction, and screening optimal nodes;
judging whether the screened optimal nodes are visual or not;
when the screened optimal node is not in the current visualization state, the page is slid, and then page skipping is executed to obtain service;
and when the screened optimal node is visualized at present, directly executing page jump to obtain service.
The voice control method of the television WEB front-end interface comprises the steps that when the voice event belongs to a Web front-end page event, a semantic instruction is generated; controlling the response of the matched web front-end interface according to the semantic instruction comprises:
performing rough matching on the corresponding semantic information and text data in the front-end webpage according to the instruction to obtain a matched front-end webpage candidate area, and then performing fine matching from the upper layer to the lower layer according to nodes in the html structure hierarchical tree corresponding to the semantic level instruction information and the current front-end interface to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
screening out an optimal node from the matching nodes, and taking the matching node with the highest matching degree with the current semantic information on two aspects of attributes and contents as the optimal node;
and if the optimal node is in the current visual area, selecting the node, executing the corresponding bound method to obtain the service, if the optimal node is not in the visual area, executing a page sliding method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding bound method to obtain the service.
A voice control apparatus for a WEB front end interface of a television, wherein the apparatus comprises:
the front-end page judging module is used for judging whether the current application type of the television is Web front-end page application or not when the voice command is acquired;
the semantic analysis module is used for identifying and performing semantic analysis on the acquired voice instruction when the current application type of the television is Web front-end page application, and determining a corresponding voice event;
the front-end page event judging module is used for judging whether the voice event belongs to a Web front-end page event or not;
the front-end page event response control module is used for generating a semantic instruction when the voice event belongs to a Web front-end page event; and controlling the matched web front-end interface to respond according to the semantic instruction.
A terminal device comprises a memory, a processor and a voice control program of a television WEB front end interface, wherein the voice control program of the television WEB front end interface is stored in the memory and can run on the processor, and when the processor executes the voice control program of the television WEB front end interface, the voice control method of the television WEB front end interface is realized.
A computer readable storage medium, wherein a voice control program of a television WEB front end interface is stored on the computer readable storage medium, and when the voice control program of the television WEB front end interface is executed by a processor, the steps of any one voice control method of the television WEB front end interface are realized.
Has the advantages that: compared with the prior art, the voice control method of the television WEB front-end interface is provided, the voice instruction is subjected to semantic analysis through the method of controlling the television Web front-end interface through voice, the obtained semantic instruction information is matched with the Web front-end page, the conversion from the voice instruction to the television Web front-end page instruction is realized, a television remote controller is replaced to obtain a corresponding page node and obtain a corresponding application response, the condition that the voice instruction in the application of the television front-end interface cannot be identified is reduced, the use comfort and the operation experience of the intelligent television are greatly improved, and convenience is provided for users.
Drawings
Fig. 1 is a flowchart of a specific implementation of a voice control method for a television WEB front-end interface according to embodiment 1 of the present invention.
Fig. 2 is a schematic flow chart of a voice control method for a television WEB front-end interface according to embodiment 2 of the present invention.
Fig. 3 is a schematic diagram of a WEB front-end page of a television according to the voice control method for the WEB front-end interface of the television according to the embodiment of the present invention.
Fig. 4 is a schematic block diagram of a voice control apparatus of a tv WEB front end interface according to an embodiment of the present invention.
Fig. 5 is a schematic block diagram of a voice control apparatus of a tv WEB front end interface according to another embodiment of the present invention.
Fig. 6 is a schematic block diagram of an internal structure of a terminal device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and effects of the present invention clearer and clearer, the present invention is further described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
With the wide popularization of the internet and the development of artificial intelligence technology, intelligent voice televisions have become the mainstream of the market. On the other hand, the development of voice recognition technology enables terminal devices such as smart voice televisions to be rapidly popularized. The intelligent voice television is mainly embodied in the mode that corresponding client application can be controlled, for example, a voice command of 'i want to watch a movie' can open a client of a type of 'love art' and the like, namely, the corresponding television client application can be controlled through voice.
However, in the prior art, a designated node in a Web front-end interface in a television terminal cannot be controlled through voice, and only sliding of the interface through a remote controller is required for control, so that the operation experience is very poor.
Namely the defects of the prior art: for the application of the long WEB front end page in the television, the voice instruction cannot acquire the anchor point of the WEB front end page of the television, so that corresponding response cannot be obtained, and the use comfort of a user is greatly influenced. For example, a certain television has applications formed by certain longer front-end pages, and a corresponding anchor point cannot be found by a voice command of a user, and the corresponding anchor point can only be operated by a key of a remote controller; on the other hand, the voice instruction of the user is not recognized or is recognized to the client application by mistake, so that the operation experience of the user is greatly influenced, the operation is very complicated, and inconvenience is brought to the use of the user.
In order to solve the problems in the prior art, the embodiment provides a voice control method for a WEB front-end interface of a television, and the invention provides a method for controlling a television by using voice, which is operated in Linux and Android, can perform voice recognition to semantic understanding through connection between the internet and a remote server, and finally makes an execution decision on a current WEB front-end page of equipment instead of a television remote controller according to a voice control attribute. The method mainly solves the problem that in the existing television, for the Web type page of the Html, the focus is obtained through a remote controller for selection control, so that the WEB page application experience in the television is very poor. For example: a longer WEB front-end page exists in the television application, a user can obtain corresponding page information only by performing pull-down operation for multiple times through a remote controller, and interested node information is opened through the remote controller, so that the television interaction experience is greatly influenced; the application scene of the invention is not limited to the intelligent television, and the invention can also be used for other voice intelligent equipment with a screen.
According to the characteristics of the existing intelligent voice television, the voice command is analyzed to be matched with the current Web front-end page, the page anchor point corresponding to the voice command is decided, the corresponding service is obtained by positioning, the television front-end page capable of sliding up and down through the remote controller is replaced, and meanwhile wrong voice recognition is prevented from entering the client application, so that better user experience is obtained. For example, a user browses a movie film evaluation website at a television end, does not need to search by up and down operations of a remote controller, and controls a current Web page according to a matching result of a current page form and a voice instruction type, namely: the user can speak which movie comment to turn on through voice instructions.
By the method for controlling the television Web front-end interface through the voice, the conversion from the voice instruction to the television Web front-end page instruction can be completed, a television remote controller is replaced to obtain the corresponding page anchor point and obtain the corresponding application response, the processing process of voice recognition and voice control is optimized, the condition that the voice instruction in the front-end interface cannot be recognized is reduced, the accuracy of voice control is improved, and the use comfort and the operation experience of the intelligent television are greatly improved.
Exemplary method
The voice control method for the television WEB front-end interface of this embodiment may be applied to a terminal device, and specifically as shown in fig. 1, the voice control method for the television WEB front-end interface includes the following steps:
step S100, when a voice command is acquired, judging whether the current application type of the television is Web front-end page application;
in the embodiment of the invention, when the smart television acquires the voice command of the user, whether the current application type of the television is Web front-end page application or not is judged firstly.
For example, when a voice command is acquired, detecting the webpage attribute of the current television application; judging whether the current television application is a Web front-end page application or not according to the fact whether the current television application has the html page attribute or not;
when the current television application has the html webpage attribute, determining that the current television application is the Web front-end page application;
and when the current television application does not have the html webpage attribute, judging that the current television application is not the Web front-end page application, and performing default conventional voice recognition to control the television.
For example, if the current smart terminal page is in a browser front-end page similar to fig. 3 (the figure is a part of the current page, and part of the page is not visualized), the html web page attribute is possessed.
S200, when the current application type of the television is Web front-end page application, identifying and performing semantic analysis on the acquired voice instruction to determine a corresponding voice event;
in the embodiment of the invention, when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and the corresponding voice event is determined.
Specifically, when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis; and determining a corresponding voice event.
For example: a user says ' small dimension and small dimension ', music is opened ', the user is judged to be a conventional voice control operation through preliminary voice recognition and semantic analysis, and a voice event is a common APP application.
For example, when the user says ' small dimension and small dimension ', opens the home site of the automobile ', the recognition and semantic analysis are carried out; and determining that the corresponding voice event is the voice event of the home station of the automobile needing to be opened.
Step S300, judging whether the voice event belongs to a Web front end page event or not;
in the embodiment of the invention, the smart television can judge whether the voice event belongs to the Web front-end page event.
Specifically, when the corresponding voice event is determined according to the analysis result; the smart television analyzes the voice event and judges whether the voice event belongs to the type of a click or sliding web front-end page event.
For example: a user says ' small dimension and small dimension ', opens music ', judges the music as a conventional voice control operation through preliminary voice recognition and semantic analysis, and directly opens a corresponding music client; if the user says ' small dimension and small dimension ' and opens the home site of the automobile ', the type of the web front-end page event is preliminarily judged, text data corresponding to the standard is normalized and generated as ' open ', ' automobile home ' ] and the next operation is carried out.
Step S400, when the voice event belongs to a Web front end page event, generating a semantic instruction; and controlling the matched web front-end interface to respond according to the semantic instruction.
In the embodiment of the invention, when the voice event belongs to a Web front-end page event, a semantic instruction is generated, the Web front-end page instruction is converted according to the matching result of the semantic instruction and the current page form, the current Web page is controlled, a corresponding page anchor point is obtained, and a corresponding application response is obtained.
Specifically, the method comprises the following steps: when the voice event is judged to belong to a Web front-end page event, generating a corresponding semantic instruction; matching page nodes according to the semantic instruction, and screening optimal nodes;
judging whether the screened optimal nodes are visual or not; when the screened optimal node is not in the current visualization state, the page is slid, and then page skipping is executed to obtain service;
and when the screened optimal node is visualized at present, directly executing page jump to obtain service.
Preferably, the intelligent system intelligently performs rough matching on the semantic information corresponding to the instruction and text data in the front-end webpage to obtain a matched front-end webpage candidate area, and then performs fine matching from the upper layer to the lower layer on the basis of nodes in the html structure hierarchical tree corresponding to the semantic level instruction information and the current front-end interface to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
screening out an optimal node from the matching nodes, and taking the matching node with the highest matching degree with the current semantic information on two aspects of attributes and contents as the optimal node;
and if the optimal node is in the current visual area, selecting the node, executing the corresponding bound method to obtain the service, if the optimal node is not in the visual area, executing a page sliding method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding bound method to obtain the service.
The invention is further illustrated in detail by the following specific application examples:
as shown in fig. 2, the voice control method for a tv WEB front-end interface according to the embodiment of the present invention includes the following steps:
the intelligent television equipment is input through voice awakening and enters voice recognition;
step S10, after acquiring the user voice instruction, determining whether the current tv application type is a Web front-end page application: judging whether the current television application is the Web front-end application according to the html webpage attribute, and if so, entering step S11; otherwise, performing default conventional voice recognition to control the television, and controlling the client application by the voice.
For example, fig. 3 is a schematic diagram of a front-end page of a Web of a television, and if a current smart terminal page is in a front-end page of a browser similar to that in fig. 3 (the figure is a part of the current page, and a part of the current page is not visualized), the current smart terminal page has an html Web page attribute, and then the next operation is performed.
And S11, performing semantic analysis according to the acquired user voice instruction, preliminarily judging whether the type of the web front-end page event belongs to a single click or sliding type according to the analysis result, if so, filtering invalid data, standardizing semantic information of the generated instruction, and entering S12 to perform the next operation, otherwise, judging that the operation is a conventional voice control operation, and controlling the client application by voice.
For example: a user says ' small dimension and small dimension ', opens music ', judges the music as a conventional voice control operation through preliminary voice recognition and semantic analysis, and directly opens a corresponding music client; if the user says ' small dimension and small dimension ' and opens the home site of the automobile ', the type of the web front-end page event is preliminarily judged, text data corresponding to the standard is normalized and generated as ' open ', ' automobile home ' ] and the next operation is carried out.
Step S12, matching the Web front end page according to the acquired instruction information, that is: firstly, carrying out rough matching on semantic information corresponding to an instruction and text data in a front-end webpage to obtain a matched front-end webpage candidate area, and then carrying out fine matching from the upper layer to the lower layer on nodes in an html structure hierarchical tree corresponding to a current front-end interface according to semantic level instruction information to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
for example: standard text data [ "open", "car owner" ] is matched with normalized text data [ "browse web page", "fine product recommendation", "my collection", "setup and tool", ], [ "HAO website", "online news", "phoenix net", "sina microblog", "UC cloud service", "tv application", "voice happy table", "panning", "HAO website", "Baidu", "car owner", "search fox video", "sofa manager", "super cool", "weather", "tiger flapping sports" }, rough matching and screening to the "car owner" node.
Step S13, screening out the optimal node from the matching nodes: judging whether the current semantic information exists in the current matching node, namely further judging whether the semantic information is met on three levels of the attribute, the content and the method bound by the matching node, wherein the judging method comprises the following steps:
taking the matching node with the highest matching degree with the current semantic information on the two levels of the attribute and the content as the optimal node, and then performing operation S14;
for example: the current matching node is only 'car home', so that the current matching node is also the optimal node at the moment,
and step S14, if the optimal node is in the current visual area, selecting the node, executing the corresponding method for binding to obtain service, if the optimal node is not in the visual area, executing the sliding page method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding method for binding to obtain service.
For example: if the current 'automobile home' node is in the current page, the binding method is executed to jump to a new page to obtain service, if the 'automobile home' node is not in the current page, the node slides to the corresponding area of the node, and then the corresponding binding method is executed to jump to the page corresponding to the 'automobile home' to obtain service.
Therefore, the method for controlling the television Web front-end interface through the voice carries out semantic analysis on the voice instruction, matches the acquired semantic instruction information with the Web front-end page, realizes the conversion from the voice instruction to the television Web front-end page instruction, replaces a television remote controller to obtain the corresponding page node and acquire the corresponding application response, reduces the condition that the voice instruction in the television front-end interface application cannot be identified, and greatly improves the use comfort and the operation experience of the intelligent television.
Exemplary device
As shown in fig. 4, an embodiment of the present invention further provides a voice control apparatus for a WEB front end interface of a television, including three modules: the system comprises an application type analysis module, an event analysis and matching module and an event response module. The application type analysis module, the event analysis and matching module and the event response module can be connected in sequence.
The application analysis module is used for distinguishing the type of the current page and judging whether the current page is an application formed by a web front-end interface;
the event analysis and matching module mainly comprises two links of semantic analysis and matching analysis and is used for generating semantic information after voice recognition and analysis and then judging whether the acquired voice event acts on the type of the web page;
the event response module comprises an event verification link and an event response link and is used for responding to the voice event matched with the web front-end interface after verification is passed and obtaining service.
As shown in fig. 5, another embodiment of the present invention provides a voice control apparatus for a WEB front end interface of a television, including:
the front-end page judging module 10 is configured to, when the voice command is obtained, judge whether the current application type of the television is a Web front-end page application;
the semantic analysis module 20 is configured to, when the current application type of the television is a Web front-end page application, perform recognition and semantic analysis on the acquired voice instruction to determine a corresponding voice event;
a front-end page event determining module 30, configured to determine whether the voice event belongs to a Web front-end page event;
a front-end page event response control module 40, configured to generate a semantic instruction when the voice event belongs to a Web front-end page event; and controlling the matched web front-end interface to respond according to the semantic instruction.
Based on the above embodiments, the present invention further provides a terminal device, and a schematic block diagram thereof may be as shown in fig. 6. The terminal equipment comprises a processor, a memory, a network interface, a display screen and a voice recognition module which are connected through a system bus. Wherein the processor of the terminal device is configured to provide computing and control capabilities. The memory of the terminal equipment comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The network interface of the terminal device is used for connecting and communicating with an external terminal through a network. The computer program is executed by a processor to realize a voice control method of a television WEB front-end interface. The display screen of the terminal equipment can be a liquid crystal display screen or an electronic ink display screen, and the voice recognition module of the terminal equipment is arranged in the terminal equipment in advance and used for recognizing the voice of a user.
It will be understood by those skilled in the art that the block diagram of fig. 6 is only a block diagram of a part of the structure related to the solution of the present invention, and does not constitute a limitation to the terminal device to which the solution of the present invention is applied, and a specific terminal device may include more or less components than those shown in the figure, or combine some components, or have a different arrangement of components.
In one embodiment, a terminal device is provided, where the terminal device includes a memory, a processor, and a voice control program of a tv WEB front end interface stored in the memory and executable on the processor, and when the processor executes the voice control program of the tv WEB front end interface, the following operation instructions are implemented:
when a voice command is acquired, judging whether the current application type of the television is Web front-end page application or not;
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and a corresponding voice event is determined;
judging whether the voice event belongs to a Web front-end page event or not;
when the voice event belongs to a Web front-end page event, generating a semantic instruction; controlling the matched web front-end interface to respond according to the semantic instruction; as described above.
Wherein the step of controlling the response of the matched web front end interface according to the semantic instruction comprises:
and converting the Web front-end page instruction according to the matching result of the semantic instruction and the current page form, controlling the current Web page, obtaining a corresponding page anchor point and acquiring a corresponding application response.
The step of judging whether the current application type of the television is the Web front-end page application or not when the voice command is acquired comprises the following steps:
when a voice command is acquired, detecting the webpage attribute of the current television application;
judging whether the current television application is a Web front-end page application or not according to the fact whether the current television application has the html page attribute or not;
when the current television application has the html webpage attribute, determining that the current television application is the Web front-end page application;
and when the current television application does not have the html webpage attribute, judging that the current television application is not the Web front-end page application, and performing default conventional voice recognition to control the television.
When the current application type of the television is Web front-end page application, the steps of identifying and performing semantic analysis on the acquired voice instruction and determining the corresponding voice event comprise:
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis; and determining a corresponding voice event.
The step of judging whether the voice event belongs to a Web front end page event comprises the following steps:
when the corresponding voice event is determined according to the analysis result;
and analyzing the voice event, and judging whether the voice event belongs to the type of a page event at the front end of a click or slide web.
When the voice event belongs to a Web front-end page event, generating a semantic instruction; controlling the response of the matched web front-end interface according to the semantic instruction comprises:
when the voice event is judged to belong to a Web front-end page event, generating a corresponding semantic instruction;
matching page nodes according to the semantic instruction, and screening optimal nodes;
judging whether the screened optimal nodes are visual or not;
when the screened optimal node is not in the current visualization state, the page is slid, and then page skipping is executed to obtain service;
and when the screened optimal node is visualized at present, directly executing page jump to obtain service.
When the voice event belongs to a Web front-end page event, generating a semantic instruction; controlling the response of the matched web front-end interface according to the semantic instruction comprises:
performing rough matching on the corresponding semantic information and text data in the front-end webpage according to the instruction to obtain a matched front-end webpage candidate area, and then performing fine matching from the upper layer to the lower layer according to nodes in the html structure hierarchical tree corresponding to the semantic level instruction information and the current front-end interface to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
screening out an optimal node from the matching nodes, and taking the matching node with the highest matching degree with the current semantic information on two aspects of attributes and contents as the optimal node;
and if the optimal node is in the current visual area, selecting the node, executing the corresponding bound method to obtain the service, if the optimal node is not in the visual area, executing a page sliding method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding bound method to obtain the service.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, databases, or other media used in embodiments provided herein may include non-volatile and/or volatile memory. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
In summary, the present invention discloses a voice control method, apparatus, terminal device and storage medium for a WEB front end interface of a television, and the method includes: when a voice command is acquired, judging whether the current application type of the television is Web front-end page application or not; when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and a corresponding voice event is determined; judging whether the voice event belongs to a Web front-end page event or not; when the voice event belongs to a Web front-end page event, generating a semantic instruction; and controlling the matched web front-end interface to respond according to the semantic instruction. The invention aims to solve the problems that in the prior art, a designated node in a Web front-end interface in a television terminal cannot be controlled through voice, only the interface can be slid through a remote controller for control, the operation experience is very poor, and the operation is very inconvenient.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A voice control method for a television WEB front-end interface is characterized by comprising the following steps:
when a voice command is acquired, judging whether the current application type of the television is Web front-end page application or not;
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis, and a corresponding voice event is determined;
judging whether the voice event belongs to a Web front-end page event or not;
when the voice event belongs to a Web front-end page event, generating a semantic instruction; and controlling the matched web front-end interface to respond according to the semantic instruction.
2. The voice control method for the television WEB front end interface according to claim 1, wherein the step of controlling the response of the matched WEB front end interface according to the semantic instruction comprises:
and converting the Web front-end page instruction according to the matching result of the semantic instruction and the current page form, controlling the current Web page, obtaining a corresponding page anchor point and acquiring a corresponding application response.
3. The voice control method for the WEB front-end interface of the television according to claim 1, wherein the step of determining whether the current application type of the television is the WEB front-end page application when the voice command is obtained comprises:
when a voice command is acquired, detecting the webpage attribute of the current television application;
judging whether the current television application is a Web front-end page application or not according to the fact whether the current television application has the html page attribute or not;
when the current television application has the html webpage attribute, determining that the current television application is the Web front-end page application;
and when the current television application does not have the html webpage attribute, judging that the current television application is not the Web front-end page application, and performing default conventional voice recognition to control the television.
4. The voice control method for the WEB front-end interface of the television as claimed in claim 1, wherein the step of identifying and semantically analyzing the obtained voice command and determining the corresponding voice event when the current application type of the television is the WEB front-end page application comprises:
when the current application type of the television is Web front-end page application, the obtained voice instruction is identified and subjected to semantic analysis; and determining a corresponding voice event.
5. The method for controlling the voice of the WEB front end interface of the television according to claim 1, wherein the step of determining whether the voice event belongs to a WEB front end page event comprises:
when the corresponding voice event is determined according to the analysis result;
and analyzing the voice event, and judging whether the voice event belongs to the type of a page event at the front end of a click or slide web.
6. The voice control method for the WEB front end interface of the television according to claim 1, wherein when the voice event belongs to a WEB front end page event, a semantic instruction is generated; controlling the response of the matched web front-end interface according to the semantic instruction comprises:
when the voice event is judged to belong to a Web front-end page event, generating a corresponding semantic instruction;
matching page nodes according to the semantic instruction, and screening optimal nodes;
judging whether the screened optimal nodes are visual or not;
when the screened optimal node is not in the current visualization state, the page is slid, and then page skipping is executed to obtain service;
and when the screened optimal node is visualized at present, directly executing page jump to obtain service.
7. The voice control method for the WEB front end interface of the television according to claim 1, wherein when the voice event belongs to a WEB front end page event, a semantic instruction is generated; controlling the response of the matched web front-end interface according to the semantic instruction comprises:
performing rough matching on the corresponding semantic information and text data in the front-end webpage according to the instruction to obtain a matched front-end webpage candidate area, and then performing fine matching from the upper layer to the lower layer according to nodes in the html structure hierarchical tree corresponding to the semantic level instruction information and the current front-end interface to obtain candidate nodes; and taking intersection of the obtained matching nodes and the candidate region to screen out the matching nodes;
screening out an optimal node from the matching nodes, and taking the matching node with the highest matching degree with the current semantic information on two aspects of attributes and contents as the optimal node;
and if the optimal node is in the current visual area, selecting the node, executing the corresponding bound method to obtain the service, if the optimal node is not in the visual area, executing a page sliding method, sliding the page to the corresponding area of the node, and then selecting the node to execute the corresponding bound method to obtain the service.
8. A voice control device for a television WEB front-end interface, the device comprising:
the front-end page judging module is used for judging whether the current application type of the television is Web front-end page application or not when the voice command is acquired;
the semantic analysis module is used for identifying and performing semantic analysis on the acquired voice instruction when the current application type of the television is Web front-end page application, and determining a corresponding voice event;
the front-end page event judging module is used for judging whether the voice event belongs to a Web front-end page event or not;
the front-end page event response control module is used for generating a semantic instruction when the voice event belongs to a Web front-end page event; and controlling the matched web front-end interface to respond according to the semantic instruction.
9. A terminal device, comprising a memory, a processor and a voice control program of a tv WEB front end interface stored in the memory and operable on the processor, wherein the processor implements the voice control program of the tv WEB front end interface according to any one of claims 1 to 7.
10. A computer readable storage medium having stored thereon a voice control program for a television WEB front end interface, the voice control program for a television WEB front end interface when executed by a processor implementing the steps of the method for voice control of a television WEB front end interface as claimed in any one of claims 1 to 7.
CN202011502454.0A 2020-12-17 2020-12-17 Voice control method, device, equipment and medium for WEB front-end interface of television Active CN112770157B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011502454.0A CN112770157B (en) 2020-12-17 2020-12-17 Voice control method, device, equipment and medium for WEB front-end interface of television

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011502454.0A CN112770157B (en) 2020-12-17 2020-12-17 Voice control method, device, equipment and medium for WEB front-end interface of television

Publications (2)

Publication Number Publication Date
CN112770157A true CN112770157A (en) 2021-05-07
CN112770157B CN112770157B (en) 2023-03-28

Family

ID=75694453

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011502454.0A Active CN112770157B (en) 2020-12-17 2020-12-17 Voice control method, device, equipment and medium for WEB front-end interface of television

Country Status (1)

Country Link
CN (1) CN112770157B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113470647A (en) * 2021-07-22 2021-10-01 深圳市天威视讯股份有限公司 Processing method and system through voice control

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140039898A1 (en) * 2012-08-02 2014-02-06 Nuance Communications, Inc. Methods and apparatus for voiced-enabling a web application
CN105161106A (en) * 2015-08-20 2015-12-16 深圳Tcl数字技术有限公司 Voice control method of intelligent terminal, voice control device and television system
WO2017092312A1 (en) * 2015-12-01 2017-06-08 乐视控股(北京)有限公司 Method of browsing webpage on browser and device
WO2017101266A1 (en) * 2015-12-15 2017-06-22 深圳Tcl数字技术有限公司 Voice control method and system
CN106980614A (en) * 2016-01-15 2017-07-25 中国科学院声学研究所 A kind of Web page speech control implementation method extended based on JavaScript
CN109766073A (en) * 2019-01-25 2019-05-17 四川长虹电器股份有限公司 The method that voice operating web page contents navigate in TV browser
CN110444209A (en) * 2019-08-13 2019-11-12 苏州思必驰信息科技有限公司 Voice interactive method, the apparatus and system of web page are embedded towards intelligent vehicle device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140039898A1 (en) * 2012-08-02 2014-02-06 Nuance Communications, Inc. Methods and apparatus for voiced-enabling a web application
CN105161106A (en) * 2015-08-20 2015-12-16 深圳Tcl数字技术有限公司 Voice control method of intelligent terminal, voice control device and television system
WO2017092312A1 (en) * 2015-12-01 2017-06-08 乐视控股(北京)有限公司 Method of browsing webpage on browser and device
WO2017101266A1 (en) * 2015-12-15 2017-06-22 深圳Tcl数字技术有限公司 Voice control method and system
CN106980614A (en) * 2016-01-15 2017-07-25 中国科学院声学研究所 A kind of Web page speech control implementation method extended based on JavaScript
CN109766073A (en) * 2019-01-25 2019-05-17 四川长虹电器股份有限公司 The method that voice operating web page contents navigate in TV browser
CN110444209A (en) * 2019-08-13 2019-11-12 苏州思必驰信息科技有限公司 Voice interactive method, the apparatus and system of web page are embedded towards intelligent vehicle device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
郭家清;白宇;蔡东风;刘纪元;: "基于语音标签的语音浏览器" *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113470647A (en) * 2021-07-22 2021-10-01 深圳市天威视讯股份有限公司 Processing method and system through voice control

Also Published As

Publication number Publication date
CN112770157B (en) 2023-03-28

Similar Documents

Publication Publication Date Title
US10831345B2 (en) Establishing user specified interaction modes in a question answering dialogue
KR101909807B1 (en) Method and apparatus for inputting information
US9622016B2 (en) Invisiblemask: a tangible mechanism to enhance mobile device smartness
US20180322215A1 (en) Web page access method and apparatus
US20040267739A1 (en) Web browser with multilevel functions
US11762923B1 (en) Displaying stylized text snippets with search engine results
US10936645B2 (en) Method and apparatus for generating to-be-played multimedia content
JP2006285982A (en) Data mining technology which improves linkage network for search engine
EP3602330B1 (en) Automatically generating documents
CN105786969A (en) Information display method and apparatus
CN110992937B (en) Language off-line identification method, terminal and readable storage medium
US8370131B2 (en) Method and system for providing convenient dictionary services
CN113806588B (en) Method and device for searching video
US20230018387A1 (en) Dynamic web page classification in web data collection
CN112770157B (en) Voice control method, device, equipment and medium for WEB front-end interface of television
US20160299972A1 (en) Providing app store search results
KR20190033821A (en) Folder Recommending Method and Apparatus Thereof
TW201435627A (en) System and method for optimizing search results
CN113448649B (en) Redis-based home page data loading server and method
KR20030051577A (en) Display method for research result in internet site
CN114969544A (en) Hot data-based recommended content generation method, device, equipment and medium
CN112052377B (en) Resource recommendation method, device, server and storage medium
TW202219793A (en) Web page analyzing method and web page analyzing platform using the same
US20090327233A1 (en) Method of selecting objects in web pages
JPH1021222A (en) Machine translation method and terminology dictionary selecting method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant