CN113849150A

CN113849150A - Method for realizing voice-controlled switching of front-end label pages based on transfer learning

Info

Publication number: CN113849150A
Application number: CN202110980211.6A
Authority: CN
Inventors: 刘昊阳; 胡环宇
Original assignee: Beijing Tongtong Yilian Technology Co ltd
Current assignee: Beijing Tongtong Yilian Technology Co ltd
Priority date: 2021-08-25
Filing date: 2021-08-25
Publication date: 2021-12-28

Abstract

The invention relates to a method for realizing voice-controlled switching of front-end label pages based on transfer learning. The model verification test result of the method for realizing the front-end label page voice control switching based on the transfer learning shows that based on comparison of products of the same type, the method realizes the front-end label page switching operation by using the voice control technology through the transfer learning method, so that a user can directly control page switching through a voice command without being beside equipment such as a computer and the like, the use of the user is facilitated, the use freedom of the front-end label page is improved, the working efficiency of a part of scenes is improved, and the user experience is enriched.

Description

Method for realizing voice-controlled switching of front-end label pages based on transfer learning

Technical Field

The invention belongs to the technical field of computers, and relates to a method for realizing voice-controlled switching of front-end tab pages based on transfer learning in the computer technology.

Background

With the rapid development of internet and computer technologies, various electronic products are applied to people's life, work and study, such as: multimedia teaching, corporate PPT lectures, electronic books, and the like. However, the current switching scheme of the front-end tab mainly adopts the modes of clicking a sub-page tab by a mouse, touching the sub-page tab by a finger or a touch device, triggering page switching by a keyboard key, setting timing automatic switching and the like. The prior art means needs to use a tangible input device, realizes the switching of the label pages in a direct contact mode, and is not intelligent if logic such as timing and the like is used for controlling the switching of the label pages. When a part of scenes such as a large meeting place is used for speaking, people are required to control page switching beside the equipment, and the degree of freedom is relatively low.

Disclosure of Invention

The invention relates to a method for realizing voice-controlled switching of front-end tab pages based on transfer learning, which comprises the following steps in the embodiments of the methods:

A) receiving a voice control switch opening instruction sent by an application client and received by an application server; opening the label name and the URL corresponding to the label name;

B) receiving a voice instruction sent by voice receiving equipment to the application client; recognizing and converting the voice command into a correct label name;

C) matching the correct label name with the migration learning training result data to generate a successfully matched label name;

D) finding and selecting the opened label name corresponding to the successfully matched label name and the URL corresponding to the opened label name;

sending a command for triggering the selected label name and a URL corresponding to the selected label name to the application server;

E) the application server receives and triggers the selected label name and a URL instruction corresponding to the label name and sends the label name and the URL instruction to an application client;

F) the application client receives and triggers the selected label name and a URL instruction corresponding to the selected label name; and the application program executes label switching based on the selected label name and the URL instruction corresponding to the selected label name.

In another aspect, the present invention further relates to an apparatus for implementing voice-controlled front-end tab page switching based on transfer learning, including at least one processor and a memory, which stores instructions that, when executed by the at least one processor, implement the steps of the method according to any one of the above.

The method has the beneficial effect that the method for realizing the voice control switching of the front-end label page based on the transfer learning is provided. The model verification test result of the method for realizing the front-end label page voice control switching based on the transfer learning shows that based on comparison of products of the same type, the method realizes the front-end label page switching operation by using the voice control technology through the transfer learning method, so that a user can directly control page switching through a voice command without being beside equipment such as a computer and the like, the use of the user is facilitated, the use freedom of the front-end label page is improved, the working efficiency of a part of scenes is improved, and the user experience is enriched.

Drawings

FIG. 1 is a logic diagram of a method for implementing voice-controlled switching of front-end tab pages based on transfer learning;

FIG. 2 is a logic diagram of a preparation work of a method for implementing voice-controlled switching of front-end tab pages based on transfer learning;

FIG. 3 is a diagram of a hardware device architecture.

Detailed Description

The technical features of the different embodiments of the invention may be combined in any way in conformity with the gist of the invention, and therefore any specific embodiment should not be understood as limiting the scope of protection of the invention.

In some embodiments of the method for implementing voice-controlled switching of the front-end tab page based on the transfer learning, the method comprises the following steps:

The term "URL" generally refers to the address of a uniform resource locator system, such as a file, picture in a computer program;

the term "migration learning" is a machine learning method that reuses the model developed for task A as an initial point in the process of developing the model for task B.

In some embodiments of the method for implementing voice-controlled switching of the front-end tab page based on the transfer learning, the opened tab name and the URL corresponding to the opened tab name are the received tab name of the application program to be used by the application client and sent from the application server and the URL corresponding to the received tab name.

In some embodiments of the method for implementing voice-controlled switching of the front-end tab page based on the transfer learning, the transfer learning training result data is obtained by training voice training data generated by collecting voice training data in a database through one or more channels of browser downloading and other device importing by a collection device by using a transfer learning device calling a training method.

In some embodiments of the method for realizing voice-controlled switching of the front-end tab page based on the transfer learning, an instance of the transfer learner is created by using voice training data through a collectExample method of the speed command, and the instance of the transfer learner is used for carrying out method calls of all subsequent processes.

In some embodiments of the method for implementing voice-controlled switching of front-end tab pages based on migration learning, the migration learner includes one or more of the following functions:

creating a training button in the transfer learner, clicking the training button to call a train training method for training voice training data for multiple times;

creating a monitoring button opening and calling a listen method in the migration learner to monitor the opened label name corresponding to the voice instruction sent by the voice receiving equipment connected with the application client and the URL corresponding to the label name;

and creating a stop listening button in the migration learner to call a stopListening method to stop listening.

In some embodiments of the method for implementing voice-controlled switching of front-end tab pages based on transfer learning, the voice training data range includes one or more names of a tab and a button; the voice training data is stored in a wma format with excellent compression ratio and tone quality;

and storing the migration learning training result data in a binary file, converting the content in the binary file into a JavaScript Array Buffer type, and loading the JavaScript Array Buffer type into a migration learning device.

The term "wma format" is a new audio format from microsoft corporation, named in common with the MP3 format.

In some embodiments of the method for implementing front-end tab voice control switching based on transfer learning, after the application client receives the URL instruction corresponding to the selected tab name triggered by the trigger, the application program calls a corresponding event processing function based on the selected tab name and the URL instruction corresponding to the selected tab name, executes the selected interaction, and may switch a page or other behaviors with the selection of the tab.

In some embodiments of the method for implementing front-end tab voice-controlled switching based on transfer learning, the application client is connected with the application server through the internet, and the application client is connected with the receiving device through a wire; the application server is connected with the switching server in a wired or wireless mode.

In some embodiments of the method for implementing voice-controlled switching of the front-end tab page based on the transfer learning, the application client includes an instruction receiving part and one or more structures in a voice-controlled switch; the application program is loaded and used at the application client;

the switching server comprises one or more modules of a voice receiving module, a voice analyzing module and a voice processing module.

The invention is further illustrated by the following more specific examples:

referring to fig. 1, a logic diagram of a method for implementing voice-controlled switching of front-end tab pages based on transfer learning according to this embodiment is shown; the method comprises the following steps:

Referring to fig. 2, a logic diagram of preparation work of the method for implementing voice-controlled switching of front-end tab pages based on transfer learning according to the present embodiment is shown; wherein: the method comprises the following steps:

s2.1) receiving the voice training data imported by the collecting device; calling a train method in a migration training algorithm to train the voice training data for multiple times to generate migration learning training result data;

s2.2) receiving a label switching request sent by the application server; sending a label name collection and URL (uniform resource locator) instruction corresponding to the label name to the application server;

s2.3) after receiving a label name collection instruction and a URL instruction corresponding to the label name, the application server collects the label name and the corresponding URL of an application program to be used by an application client;

s2.4) receiving the label name and the corresponding URL of the application program to be used by the application client sent by the application server.

Referring to fig. 3, a functional diagram of a hardware device structure according to the present embodiment; wherein, the function relation among the structures is:

s1.1) the voice processing unit receives a voice control switch opening instruction sent by a voice control switch in an application client;

s1.2) a voice receiving module receives a voice instruction sent by voice receiving equipment connected with the application client; the voice receiving module sends the voice instruction to a voice analysis module;

s1.3) the voice analysis module identifies voice data of the voice command sent from the voice receiving module and converts the voice command into a correct label name;

sending said correct tag name to said voice processing unit;

s1.4) the voice processing unit matches the correct label name sent from the voice analysis module with the transfer learning training result data;

s1.5) finding the opened label name corresponding to the successfully matched label name and the URL corresponding to the opened label name in a transfer learner and selecting the successfully matched label name;

s1.6) the voice processing unit sends a trigger instruction to an instruction receiving part of the application client;

s1.7) the front-end page application program of the application client executes front-end page label switching based on the started selected label name corresponding to the successfully matched label name and the URL corresponding to the label name;

s1.8) the voice processing unit receives a voice control switch closing instruction sent by a voice control switch in the application client.

Implementations and functional operations of the subject matter described in this specification can be implemented in: digital electronic circuitry, tangibly embodied computer software or firmware, computer hardware, including the structures disclosed in this specification and their structural equivalents, or combinations of more than one of the foregoing. Embodiments of the subject matter described in this specification can be implemented as one or more computer programs, i.e., one or more modules of computer program instructions encoded on one or more tangible, non-transitory program carriers, for execution by, or to control the operation of, data processing apparatus.

A computer program (which may also be referred to or described as a program, software application, module, software module, script, or code) can be written in any form of programming language, including compiled or interpreted languages, or declarative or procedural languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program may, but need not, correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data, e.g., one or more scripts stored in: in a markup language document; in a single file dedicated to the relevant program; or in multiple coordinated files, such as files that store one or more modules, sub programs, or portions of code. A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.

The processes and logic flows described in this specification can be performed by one or more programmable computers executing one or more computer programs to perform functions by operating on input data and generating output.

Implementations of the subject matter described in this specification can be implemented in a computing system that includes a back-end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front-end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back-end, middleware, or front-end components. The components in the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. Examples of communication networks include a local area network ("LAN") and a wide area network ("WAN"), e.g., the Internet. The computing system may include a client and a server. A client and server are generally remote from each other and typically interact through a communication network. The relationship of user ends and servers arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.

While this specification contains many specific implementation details, these should not be construed as limitations on the scope of any inventions or of what may be claimed, but rather as descriptions of features that may embody particular implementations of particular inventions. Certain features that are described in this specification in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination. Moreover, although features may be described above as acting in combination and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a subcombination or variation of a subcombination.

Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as: such operations are required to be performed in the particular order shown, or in sequential order, or all illustrated operations may be performed, in order to achieve desirable results. In certain situations, multitasking and parallel processing may be advantageous. Moreover, the separation of various system modules and components in the embodiments described above should not be understood as requiring such separation in all embodiments, and it should be understood that the program components and systems can generally be integrated together in a single software product or packaged into multiple software products.

Particular embodiments of the subject matter have been described. Other implementations are within the scope of the following claims. For example, the activities recited in the claims can be performed in a different order and still achieve desirable results. As one example, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In certain implementations, multitasking and parallel processing may be advantageous.

Claims

1. A method for realizing voice-controlled switching of front-end tab pages based on transfer learning is characterized by comprising the following steps:

2. The method according to claim 1, wherein the tag name and the URL corresponding to the tag name that are activated in step a) are the tag name and the URL corresponding to the tag name of the application program that is received and is to be used by the application client and sent from the application server.

3. The method according to claim 1, wherein the data of the result of the transfer learning training in step C) is obtained by training the speech training data generated by collecting the speech training data collected from the database by the collection device through one or more channels selected from the group consisting of a browser download and other device import, by using a transfer learning method invoked by the transfer learning device.

4. The method for implementing voice-controlled switching of front-end tab pages based on transfer learning according to claim 3, wherein the transfer learner includes one or more of the following functions:

creating a training button in the transfer learner, clicking the training button to call a training method for training voice training data for multiple times;

creating a monitoring button opening and calling a monitoring method in the migration learner to monitor the opened label name corresponding to the voice instruction sent by the voice receiving equipment connected with the application client and the URL corresponding to the label name;

and creating a listening off button in the migration learner to call a listening off method.

5. The method for realizing the voice-controlled switching of the front-end tab page based on the transfer learning of claim 3 is characterized in that the voice training data range comprises one or more names of a tab and a button; the voice training data is stored in a wma format with a compression ratio and excellent sound quality.

6. The method for realizing voice-controlled switching of the front-end tab page based on the transfer learning of claim 1, wherein the data of the transfer learning training result in the step C) is stored in a binary file, and then the content in the binary file is converted into a JavaScript Array Buffer type and loaded into the transfer learner.

7. The method for realizing voice-controlled switching of the front-end tab page based on the transfer learning of claim 1, wherein the functional relationship among the structures based on the steps A) to E) is as follows:

sending said correct tag name to said voice processing unit;

8. The method for implementing voice-controlled switching of the front-end tab page based on the transfer learning of claim 7, wherein the application client is connected with the application server through the internet, and the application client is connected with the receiving device through a wired connection; the application server is connected with the switching server in a wired or wireless mode.

9. The method for implementing voice-controlled switching of front-end tab pages based on transfer learning of claim 8, wherein the application client comprises an instruction receiving part, one or more structures in a voice-controlled switch; the application program is loaded and used at the application client;

10. A device for realizing voice-controlled switching of front-end label pages based on transfer learning comprises at least one processor; and a memory storing instructions that, when executed by the at least one processor, perform the method according to any one of claims 1-9.