CN113849150A - Method for realizing voice-controlled switching of front-end label pages based on transfer learning - Google Patents

Method for realizing voice-controlled switching of front-end label pages based on transfer learning Download PDF

Info

Publication number
CN113849150A
CN113849150A CN202110980211.6A CN202110980211A CN113849150A CN 113849150 A CN113849150 A CN 113849150A CN 202110980211 A CN202110980211 A CN 202110980211A CN 113849150 A CN113849150 A CN 113849150A
Authority
CN
China
Prior art keywords
voice
label name
transfer learning
name
application client
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110980211.6A
Other languages
Chinese (zh)
Inventor
刘昊阳
胡环宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Tongtong Yilian Technology Co ltd
Original Assignee
Beijing Tongtong Yilian Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Tongtong Yilian Technology Co ltd filed Critical Beijing Tongtong Yilian Technology Co ltd
Priority to CN202110980211.6A priority Critical patent/CN113849150A/en
Publication of CN113849150A publication Critical patent/CN113849150A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning

Abstract

The invention relates to a method for realizing voice-controlled switching of front-end label pages based on transfer learning. The model verification test result of the method for realizing the front-end label page voice control switching based on the transfer learning shows that based on comparison of products of the same type, the method realizes the front-end label page switching operation by using the voice control technology through the transfer learning method, so that a user can directly control page switching through a voice command without being beside equipment such as a computer and the like, the use of the user is facilitated, the use freedom of the front-end label page is improved, the working efficiency of a part of scenes is improved, and the user experience is enriched.

Description

Method for realizing voice-controlled switching of front-end label pages based on transfer learning
Technical Field
The invention belongs to the technical field of computers, and relates to a method for realizing voice-controlled switching of front-end tab pages based on transfer learning in the computer technology.
Background
With the rapid development of internet and computer technologies, various electronic products are applied to people's life, work and study, such as: multimedia teaching, corporate PPT lectures, electronic books, and the like. However, the current switching scheme of the front-end tab mainly adopts the modes of clicking a sub-page tab by a mouse, touching the sub-page tab by a finger or a touch device, triggering page switching by a keyboard key, setting timing automatic switching and the like. The prior art means needs to use a tangible input device, realizes the switching of the label pages in a direct contact mode, and is not intelligent if logic such as timing and the like is used for controlling the switching of the label pages. When a part of scenes such as a large meeting place is used for speaking, people are required to control page switching beside the equipment, and the degree of freedom is relatively low.
Disclosure of Invention
The invention relates to a method for realizing voice-controlled switching of front-end tab pages based on transfer learning, which comprises the following steps in the embodiments of the methods:
A) receiving a voice control switch opening instruction sent by an application client and received by an application server; opening the label name and the URL corresponding to the label name;
B) receiving a voice instruction sent by voice receiving equipment to the application client; recognizing and converting the voice command into a correct label name;
C) matching the correct label name with the migration learning training result data to generate a successfully matched label name;
D) finding and selecting the opened label name corresponding to the successfully matched label name and the URL corresponding to the opened label name;
sending a command for triggering the selected label name and a URL corresponding to the selected label name to the application server;
E) the application server receives and triggers the selected label name and a URL instruction corresponding to the label name and sends the label name and the URL instruction to an application client;
F) the application client receives and triggers the selected label name and a URL instruction corresponding to the selected label name; and the application program executes label switching based on the selected label name and the URL instruction corresponding to the selected label name.
In another aspect, the present invention further relates to an apparatus for implementing voice-controlled front-end tab page switching based on transfer learning, including at least one processor and a memory, which stores instructions that, when executed by the at least one processor, implement the steps of the method according to any one of the above.
The method has the beneficial effect that the method for realizing the voice control switching of the front-end label page based on the transfer learning is provided. The model verification test result of the method for realizing the front-end label page voice control switching based on the transfer learning shows that based on comparison of products of the same type, the method realizes the front-end label page switching operation by using the voice control technology through the transfer learning method, so that a user can directly control page switching through a voice command without being beside equipment such as a computer and the like, the use of the user is facilitated, the use freedom of the front-end label page is improved, the working efficiency of a part of scenes is improved, and the user experience is enriched.
Drawings
FIG. 1 is a logic diagram of a method for implementing voice-controlled switching of front-end tab pages based on transfer learning;
FIG. 2 is a logic diagram of a preparation work of a method for implementing voice-controlled switching of front-end tab pages based on transfer learning;
FIG. 3 is a diagram of a hardware device architecture.
Detailed Description
The technical features of the different embodiments of the invention may be combined in any way in conformity with the gist of the invention, and therefore any specific embodiment should not be understood as limiting the scope of protection of the invention.
In some embodiments of the method for implementing voice-controlled switching of the front-end tab page based on the transfer learning, the method comprises the following steps:
A) receiving a voice control switch opening instruction sent by an application client and received by an application server; opening the label name and the URL corresponding to the label name;
B) receiving a voice instruction sent by voice receiving equipment to the application client; recognizing and converting the voice command into a correct label name;
C) matching the correct label name with the migration learning training result data to generate a successfully matched label name;
D) finding and selecting the opened label name corresponding to the successfully matched label name and the URL corresponding to the opened label name;
sending a command for triggering the selected label name and a URL corresponding to the selected label name to the application server;
E) the application server receives and triggers the selected label name and a URL instruction corresponding to the label name and sends the label name and the URL instruction to an application client;
F) the application client receives and triggers the selected label name and a URL instruction corresponding to the selected label name; and the application program executes label switching based on the selected label name and the URL instruction corresponding to the selected label name.
The term "URL" generally refers to the address of a uniform resource locator system, such as a file, picture in a computer program;
the term "migration learning" is a machine learning method that reuses the model developed for task A as an initial point in the process of developing the model for task B.
In some embodiments of the method for implementing voice-controlled switching of the front-end tab page based on the transfer learning, the opened tab name and the URL corresponding to the opened tab name are the received tab name of the application program to be used by the application client and sent from the application server and the URL corresponding to the received tab name.
In some embodiments of the method for implementing voice-controlled switching of the front-end tab page based on the transfer learning, the transfer learning training result data is obtained by training voice training data generated by collecting voice training data in a database through one or more channels of browser downloading and other device importing by a collection device by using a transfer learning device calling a training method.
In some embodiments of the method for realizing voice-controlled switching of the front-end tab page based on the transfer learning, an instance of the transfer learner is created by using voice training data through a collectExample method of the speed command, and the instance of the transfer learner is used for carrying out method calls of all subsequent processes.
In some embodiments of the method for implementing voice-controlled switching of front-end tab pages based on migration learning, the migration learner includes one or more of the following functions:
creating a training button in the transfer learner, clicking the training button to call a train training method for training voice training data for multiple times;
creating a monitoring button opening and calling a listen method in the migration learner to monitor the opened label name corresponding to the voice instruction sent by the voice receiving equipment connected with the application client and the URL corresponding to the label name;
and creating a stop listening button in the migration learner to call a stopListening method to stop listening.
In some embodiments of the method for implementing voice-controlled switching of front-end tab pages based on transfer learning, the voice training data range includes one or more names of a tab and a button; the voice training data is stored in a wma format with excellent compression ratio and tone quality;
and storing the migration learning training result data in a binary file, converting the content in the binary file into a JavaScript Array Buffer type, and loading the JavaScript Array Buffer type into a migration learning device.
The term "wma format" is a new audio format from microsoft corporation, named in common with the MP3 format.
In some embodiments of the method for implementing front-end tab voice control switching based on transfer learning, after the application client receives the URL instruction corresponding to the selected tab name triggered by the trigger, the application program calls a corresponding event processing function based on the selected tab name and the URL instruction corresponding to the selected tab name, executes the selected interaction, and may switch a page or other behaviors with the selection of the tab.
In some embodiments of the method for implementing front-end tab voice-controlled switching based on transfer learning, the application client is connected with the application server through the internet, and the application client is connected with the receiving device through a wire; the application server is connected with the switching server in a wired or wireless mode.
In some embodiments of the method for implementing voice-controlled switching of the front-end tab page based on the transfer learning, the application client includes an instruction receiving part and one or more structures in a voice-controlled switch; the application program is loaded and used at the application client;
the switching server comprises one or more modules of a voice receiving module, a voice analyzing module and a voice processing module.
The invention is further illustrated by the following more specific examples:
referring to fig. 1, a logic diagram of a method for implementing voice-controlled switching of front-end tab pages based on transfer learning according to this embodiment is shown; the method comprises the following steps:
A) receiving a voice control switch opening instruction sent by an application client and received by an application server; opening the label name and the URL corresponding to the label name;
B) receiving a voice instruction sent by voice receiving equipment to the application client; recognizing and converting the voice command into a correct label name;
C) matching the correct label name with the migration learning training result data to generate a successfully matched label name;
D) finding and selecting the opened label name corresponding to the successfully matched label name and the URL corresponding to the opened label name;
sending a command for triggering the selected label name and a URL corresponding to the selected label name to the application server;
E) the application server receives and triggers the selected label name and a URL instruction corresponding to the label name and sends the label name and the URL instruction to an application client;
F) the application client receives and triggers the selected label name and a URL instruction corresponding to the selected label name; and the application program executes label switching based on the selected label name and the URL instruction corresponding to the selected label name.
Referring to fig. 2, a logic diagram of preparation work of the method for implementing voice-controlled switching of front-end tab pages based on transfer learning according to the present embodiment is shown; wherein: the method comprises the following steps:
s2.1) receiving the voice training data imported by the collecting device; calling a train method in a migration training algorithm to train the voice training data for multiple times to generate migration learning training result data;
s2.2) receiving a label switching request sent by the application server; sending a label name collection and URL (uniform resource locator) instruction corresponding to the label name to the application server;
s2.3) after receiving a label name collection instruction and a URL instruction corresponding to the label name, the application server collects the label name and the corresponding URL of an application program to be used by an application client;
s2.4) receiving the label name and the corresponding URL of the application program to be used by the application client sent by the application server.
Referring to fig. 3, a functional diagram of a hardware device structure according to the present embodiment; wherein, the function relation among the structures is:
s1.1) the voice processing unit receives a voice control switch opening instruction sent by a voice control switch in an application client;
s1.2) a voice receiving module receives a voice instruction sent by voice receiving equipment connected with the application client; the voice receiving module sends the voice instruction to a voice analysis module;
s1.3) the voice analysis module identifies voice data of the voice command sent from the voice receiving module and converts the voice command into a correct label name;
sending said correct tag name to said voice processing unit;
s1.4) the voice processing unit matches the correct label name sent from the voice analysis module with the transfer learning training result data;
s1.5) finding the opened label name corresponding to the successfully matched label name and the URL corresponding to the opened label name in a transfer learner and selecting the successfully matched label name;
s1.6) the voice processing unit sends a trigger instruction to an instruction receiving part of the application client;
s1.7) the front-end page application program of the application client executes front-end page label switching based on the started selected label name corresponding to the successfully matched label name and the URL corresponding to the label name;
s1.8) the voice processing unit receives a voice control switch closing instruction sent by a voice control switch in the application client.
Implementations and functional operations of the subject matter described in this specification can be implemented in: digital electronic circuitry, tangibly embodied computer software or firmware, computer hardware, including the structures disclosed in this specification and their structural equivalents, or combinations of more than one of the foregoing. Embodiments of the subject matter described in this specification can be implemented as one or more computer programs, i.e., one or more modules of computer program instructions encoded on one or more tangible, non-transitory program carriers, for execution by, or to control the operation of, data processing apparatus.
A computer program (which may also be referred to or described as a program, software application, module, software module, script, or code) can be written in any form of programming language, including compiled or interpreted languages, or declarative or procedural languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program may, but need not, correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data, e.g., one or more scripts stored in: in a markup language document; in a single file dedicated to the relevant program; or in multiple coordinated files, such as files that store one or more modules, sub programs, or portions of code. A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.
The processes and logic flows described in this specification can be performed by one or more programmable computers executing one or more computer programs to perform functions by operating on input data and generating output.
Implementations of the subject matter described in this specification can be implemented in a computing system that includes a back-end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front-end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back-end, middleware, or front-end components. The components in the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. Examples of communication networks include a local area network ("LAN") and a wide area network ("WAN"), e.g., the Internet. The computing system may include a client and a server. A client and server are generally remote from each other and typically interact through a communication network. The relationship of user ends and servers arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
While this specification contains many specific implementation details, these should not be construed as limitations on the scope of any inventions or of what may be claimed, but rather as descriptions of features that may embody particular implementations of particular inventions. Certain features that are described in this specification in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination. Moreover, although features may be described above as acting in combination and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a subcombination or variation of a subcombination.
Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as: such operations are required to be performed in the particular order shown, or in sequential order, or all illustrated operations may be performed, in order to achieve desirable results. In certain situations, multitasking and parallel processing may be advantageous. Moreover, the separation of various system modules and components in the embodiments described above should not be understood as requiring such separation in all embodiments, and it should be understood that the program components and systems can generally be integrated together in a single software product or packaged into multiple software products.
Particular embodiments of the subject matter have been described. Other implementations are within the scope of the following claims. For example, the activities recited in the claims can be performed in a different order and still achieve desirable results. As one example, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In certain implementations, multitasking and parallel processing may be advantageous.

Claims (10)

1. A method for realizing voice-controlled switching of front-end tab pages based on transfer learning is characterized by comprising the following steps:
A) receiving a voice control switch opening instruction sent by an application client and received by an application server; opening the label name and the URL corresponding to the label name;
B) receiving a voice instruction sent by voice receiving equipment to the application client; recognizing and converting the voice command into a correct label name;
C) matching the correct label name with the migration learning training result data to generate a successfully matched label name;
D) finding and selecting the opened label name corresponding to the successfully matched label name and the URL corresponding to the opened label name;
sending a command for triggering the selected label name and a URL corresponding to the selected label name to the application server;
E) the application server receives and triggers the selected label name and a URL instruction corresponding to the label name and sends the label name and the URL instruction to an application client;
F) the application client receives and triggers the selected label name and a URL instruction corresponding to the selected label name; and the application program executes label switching based on the selected label name and the URL instruction corresponding to the selected label name.
2. The method according to claim 1, wherein the tag name and the URL corresponding to the tag name that are activated in step a) are the tag name and the URL corresponding to the tag name of the application program that is received and is to be used by the application client and sent from the application server.
3. The method according to claim 1, wherein the data of the result of the transfer learning training in step C) is obtained by training the speech training data generated by collecting the speech training data collected from the database by the collection device through one or more channels selected from the group consisting of a browser download and other device import, by using a transfer learning method invoked by the transfer learning device.
4. The method for implementing voice-controlled switching of front-end tab pages based on transfer learning according to claim 3, wherein the transfer learner includes one or more of the following functions:
creating a training button in the transfer learner, clicking the training button to call a training method for training voice training data for multiple times;
creating a monitoring button opening and calling a monitoring method in the migration learner to monitor the opened label name corresponding to the voice instruction sent by the voice receiving equipment connected with the application client and the URL corresponding to the label name;
and creating a listening off button in the migration learner to call a listening off method.
5. The method for realizing the voice-controlled switching of the front-end tab page based on the transfer learning of claim 3 is characterized in that the voice training data range comprises one or more names of a tab and a button; the voice training data is stored in a wma format with a compression ratio and excellent sound quality.
6. The method for realizing voice-controlled switching of the front-end tab page based on the transfer learning of claim 1, wherein the data of the transfer learning training result in the step C) is stored in a binary file, and then the content in the binary file is converted into a JavaScript Array Buffer type and loaded into the transfer learner.
7. The method for realizing voice-controlled switching of the front-end tab page based on the transfer learning of claim 1, wherein the functional relationship among the structures based on the steps A) to E) is as follows:
s1.1) the voice processing unit receives a voice control switch opening instruction sent by a voice control switch in an application client;
s1.2) a voice receiving module receives a voice instruction sent by voice receiving equipment connected with the application client; the voice receiving module sends the voice instruction to a voice analysis module;
s1.3) the voice analysis module identifies voice data of the voice command sent from the voice receiving module and converts the voice command into a correct label name;
sending said correct tag name to said voice processing unit;
s1.4) the voice processing unit matches the correct label name sent from the voice analysis module with the transfer learning training result data;
s1.5) finding the opened label name corresponding to the successfully matched label name and the URL corresponding to the opened label name in a transfer learner and selecting the successfully matched label name;
s1.6) the voice processing unit sends a trigger instruction to an instruction receiving part of the application client;
s1.7) the front-end page application program of the application client executes front-end page label switching based on the started selected label name corresponding to the successfully matched label name and the URL corresponding to the label name;
s1.8) the voice processing unit receives a voice control switch closing instruction sent by a voice control switch in the application client.
8. The method for implementing voice-controlled switching of the front-end tab page based on the transfer learning of claim 7, wherein the application client is connected with the application server through the internet, and the application client is connected with the receiving device through a wired connection; the application server is connected with the switching server in a wired or wireless mode.
9. The method for implementing voice-controlled switching of front-end tab pages based on transfer learning of claim 8, wherein the application client comprises an instruction receiving part, one or more structures in a voice-controlled switch; the application program is loaded and used at the application client;
the switching server comprises one or more modules of a voice receiving module, a voice analyzing module and a voice processing module.
10. A device for realizing voice-controlled switching of front-end label pages based on transfer learning comprises at least one processor; and a memory storing instructions that, when executed by the at least one processor, perform the method according to any one of claims 1-9.
CN202110980211.6A 2021-08-25 2021-08-25 Method for realizing voice-controlled switching of front-end label pages based on transfer learning Pending CN113849150A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110980211.6A CN113849150A (en) 2021-08-25 2021-08-25 Method for realizing voice-controlled switching of front-end label pages based on transfer learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110980211.6A CN113849150A (en) 2021-08-25 2021-08-25 Method for realizing voice-controlled switching of front-end label pages based on transfer learning

Publications (1)

Publication Number Publication Date
CN113849150A true CN113849150A (en) 2021-12-28

Family

ID=78976207

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110980211.6A Pending CN113849150A (en) 2021-08-25 2021-08-25 Method for realizing voice-controlled switching of front-end label pages based on transfer learning

Country Status (1)

Country Link
CN (1) CN113849150A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104021155A (en) * 2014-05-21 2014-09-03 小米科技有限责任公司 Webpage display method and device
CN110147216A (en) * 2019-04-16 2019-08-20 深圳壹账通智能科技有限公司 Page switching method, device, computer equipment and the storage medium of application program
CN110600014A (en) * 2019-09-19 2019-12-20 深圳酷派技术有限公司 Model training method and device, storage medium and electronic equipment
CN112399222A (en) * 2020-11-10 2021-02-23 深圳创维-Rgb电子有限公司 Voice instruction learning method and device for smart television, smart television and medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104021155A (en) * 2014-05-21 2014-09-03 小米科技有限责任公司 Webpage display method and device
CN110147216A (en) * 2019-04-16 2019-08-20 深圳壹账通智能科技有限公司 Page switching method, device, computer equipment and the storage medium of application program
CN110600014A (en) * 2019-09-19 2019-12-20 深圳酷派技术有限公司 Model training method and device, storage medium and electronic equipment
CN112399222A (en) * 2020-11-10 2021-02-23 深圳创维-Rgb电子有限公司 Voice instruction learning method and device for smart television, smart television and medium

Similar Documents

Publication Publication Date Title
CN105391730B (en) A kind of information feedback method, apparatus and system
US6173259B1 (en) Speech to text conversion
CN109514586B (en) Method and system for realizing intelligent customer service robot
US20050138219A1 (en) Managing application interactions using distributed modality components
EP3731161A1 (en) Model application method and system, and model management method and server
US10824664B2 (en) Method and apparatus for providing text push information responsive to a voice query request
CA2480509A1 (en) Closed-loop command and response system for automatic communications between interacting computer systems over an audio communications channel
CN101341532A (en) Sharing voice application processing via markup
US8032825B2 (en) Dynamically creating multimodal markup documents
CN110992955A (en) Voice operation method, device, equipment and storage medium of intelligent equipment
CN111679886A (en) Heterogeneous computing resource scheduling method, system, electronic device and storage medium
JPH08195763A (en) Voice communications channel of network
CN109271503A (en) Intelligent answer method, apparatus, equipment and storage medium
CN116755844A (en) Data processing method, device and equipment of simulation engine and storage medium
CN109782997A (en) A kind of data processing method, device and storage medium
CN111009245A (en) Instruction execution method, system and storage medium
JP2019091416A (en) Method and device for constructing artificial intelligence application
CN108597499B (en) Voice processing method and voice processing device
CN111563182A (en) Voice conference record storage processing method and device
JP2010160788A (en) Method, system, and computer program for dynamically improving performance of interactive voice response system using complex event processor
CN111722893A (en) Method and device for interaction of graphical user interface of electronic equipment and terminal equipment
CN105677730B (en) Method and device for reading webpage resources and electronic equipment
CN109147792A (en) A kind of voice resume system
CN113849150A (en) Method for realizing voice-controlled switching of front-end label pages based on transfer learning
CN113192510A (en) Method, system and medium for implementing voice age and/or gender identification service

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination