CN107171874A - A kind of speech engine switching method, mobile terminal and server - Google Patents

A kind of speech engine switching method, mobile terminal and server Download PDF

Info

Publication number
CN107171874A
CN107171874A CN201710601801.7A CN201710601801A CN107171874A CN 107171874 A CN107171874 A CN 107171874A CN 201710601801 A CN201710601801 A CN 201710601801A CN 107171874 A CN107171874 A CN 107171874A
Authority
CN
China
Prior art keywords
engine
mobile terminal
voice
server
voice engine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710601801.7A
Other languages
Chinese (zh)
Other versions
CN107171874B (en
Inventor
胡雪情
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vivo Mobile Communication Co Ltd
Original Assignee
Vivo Mobile Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vivo Mobile Communication Co Ltd filed Critical Vivo Mobile Communication Co Ltd
Priority to CN201710601801.7A priority Critical patent/CN107171874B/en
Publication of CN107171874A publication Critical patent/CN107171874A/en
Application granted granted Critical
Publication of CN107171874B publication Critical patent/CN107171874B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0668Management of faults, events, alarms or notifications using network fault recovery by dynamic selection of recovery network elements, e.g. replacement by the most appropriate element after failure
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/60Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
    • H04L67/61Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources taking into account QoS or priority requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/60Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
    • H04L67/63Routing a service request depending on the request content or context
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention provides a kind of speech engine switching method, mobile terminal and server, it is related to mobile communication technology field, this method is applied to mobile terminal, and this method includes:Speech engine handover request is sent to server;The configured information of the currently available target voice engine of the reception server feedback;According to configured information, the current speech engine of mobile terminal is switched into target voice engine, and voice service is carried out using target voice engine.The present invention is solved when speech engine occurs abnormal, the problem of voice service interruption or voice assistant can not be used.

Description

Voice engine switching method, mobile terminal and server
Technical Field
The present invention relates to the field of mobile communications technologies, and in particular, to a speech engine switching method, a mobile terminal, and a server.
Background
The voice assistant is an application program which can realize inquiry and operation through voice interaction, and the convenience of operating the mobile phone in different scenes can be greatly improved through the application program. The speech engine is the core part of a speech assistant type application. Typically, when one speech engine fails or there is an abnormality in the speech service, the entire speech assistant will experience an abnormality, and in order to avoid this situation, multiple speech engines need to be prepared simultaneously to cope with the emergency.
However, the current voice assistant has a certain limitation, and usually only supports built-in speech engines, but not switching of multiple speech engines, which may result in interruption of speech services or unavailability of the voice assistant when the speech services corresponding to the built-in speech engines are abnormal.
Disclosure of Invention
The invention provides a voice engine switching method, a mobile terminal and a server, and aims to solve the problems that when a built-in voice engine is abnormal, a voice service is interrupted or a voice assistant cannot be used in the conventional voice assistant application program.
In one aspect, an embodiment of the present invention provides a speech engine switching method, which is applied to a mobile terminal, and includes:
sending a voice engine switching request to a server;
receiving indication information of a currently available target speech engine fed back by a server;
and switching the current voice engine of the mobile terminal to a target voice engine according to the indication information, and performing voice service by using the target voice engine.
On the other hand, the embodiment of the invention also provides a speech engine switching method, which is applied to a server and comprises the following steps:
receiving a voice engine switching request of a mobile terminal;
inquiring a currently available voice engine and a service quality parameter of the voice engine, wherein the service quality parameter is a numerical value determined according to the current congestion degree and the connection power of the voice engine;
selecting a target speech engine from currently available speech engines according to the quality of service parameters;
and sending the indication information of the target voice engine to the mobile terminal.
In another aspect, an embodiment of the present invention further provides a mobile terminal, including:
the request sending module is used for sending a voice engine switching request to the server;
the information receiving module is used for receiving indication information of a currently available target speech engine fed back by the server;
and the engine switching module is used for switching the current voice engine of the mobile terminal to the target voice engine according to the indication information and using the target voice engine to perform voice service.
In another aspect, an embodiment of the present invention further provides a server, including:
the request receiving module is used for receiving a voice engine switching request of the mobile terminal;
the parameter query module is used for querying a currently available voice engine and service quality parameters of the voice engine, wherein the service quality parameters are numerical values determined according to the current congestion degree and the connection power of the voice engine;
the engine selection module is used for selecting a target speech engine from the currently available speech engines according to the service quality parameters;
and the information sending module is used for sending the indication information of the target voice engine to the mobile terminal.
In another aspect, an embodiment of the present invention further provides a mobile terminal, including: the speech engine switching method comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor, wherein the processor realizes the steps of the speech engine switching method when executing the computer program.
In still another aspect, an embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when being executed by a processor, the computer program implements the steps in the speech engine switching method.
In another aspect, an embodiment of the present invention further provides a server, including: the speech engine switching method comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the computer program realizes the steps of the speech engine switching method when being executed by the processor.
Therefore, according to the voice engine switching method, the mobile terminal and the server provided by the invention, when the current voice engine service of the mobile terminal is abnormal, the voice engine switching request is sent to the server, the indication information which is fed back by the server and carries the target voice engine is received, and the voice engine is switched to the target voice engine according to the indication information, so that the current voice service of the mobile terminal can be continuously carried out, and the condition that the voice service is interrupted or a voice assistant application program cannot be used due to the abnormal voice engine service is avoided.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments of the present invention will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without inventive labor.
FIG. 1 is a flow chart illustrating steps of a speech engine switching method according to an embodiment of the present invention;
fig. 2 is a block diagram of a mobile terminal according to an embodiment of the present invention;
FIG. 3 is a flowchart illustrating steps of a speech engine switching method according to another embodiment of the present invention;
FIG. 4 shows a block diagram of a server provided by a further embodiment of the invention;
FIG. 5 shows one of the flow charts of the steps of a specific example of the present invention;
FIG. 6 shows a second flowchart illustrating the steps of an embodiment of the present invention;
fig. 7 is a block diagram of a mobile terminal according to still another embodiment of the present invention;
fig. 8 is a block diagram of a mobile terminal according to still another embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, an embodiment of the present invention provides a speech engine switching method, applied to a mobile terminal, the method including:
step 101, sending a speech engine switching request to a server.
The server can be a cloud server, and the server performs data interaction with the mobile terminal, receives a voice switching request of the mobile terminal, and the like. When the voice engine service of the mobile terminal is abnormal, the mobile terminal sends a voice engine switching request to the server.
And 102, receiving indication information of the currently available target speech engine fed back by the server.
The server inquires a current available voice engine of the mobile terminal according to a voice engine switching request sent by the mobile terminal, and feeds back indication information containing a target voice engine to the mobile terminal, and the mobile terminal receives the information fed back by the server and acquires the indication information containing the target voice engine.
And 103, switching the current voice engine of the mobile terminal to a target voice engine according to the indication information, and performing voice service by using the target voice engine.
The target voice engine is determined according to the indication information fed back by the server, the current voice engine which is serving for the voice assistant application program of the mobile terminal is switched to the target voice engine, and voice service is continued through the target voice engine, so that service interruption is avoided.
Therefore, when the current voice engine service of the mobile terminal is abnormal, the voice engine switching request is sent to the server, the indication information which is fed back by the server and carries the target voice engine is received, and the target voice engine is switched according to the indication information, so that the current voice service of the mobile terminal can be continued, and the voice service interruption or the voice assistant application program can not be used due to the abnormal voice engine service is avoided. The server side can be provided with a plurality of voice engines serving the mobile terminal, and when switching is needed, a plurality of choices are provided for the mobile terminal, so that the use experience of a user is improved.
Preferably, step 101 comprises:
when the current voice engine service failure is detected, sending a voice engine switching request to a server;
or,
and sending a voice engine switching request to the server according to the preset switching frequency.
Specifically, when the mobile terminal detects that the voice engine service fails, a voice engine switching request can be sent to the server, so that the mobile terminal can switch a new voice engine in time, and the voice service interruption or the voice assistant application program can not be used is avoided.
Or the mobile terminal can send a voice engine switching request to the server at regular time according to the preset switching frequency, and the voice engine is switched at regular time, so that the voice engine is ensured to be in a normal working state, and the service quality is improved.
Preferably, before step 103, the method further comprises:
judging whether the mobile terminal has a data packet of a target voice engine:
and if the judgment result is negative, downloading and installing the data packet of the target voice engine.
After the target speech engine is determined, whether the mobile terminal has a data packet of the engine needs to be judged: if so, the target speech engine can be directly used for speech service, such as the functions of speech recognition, speech broadcasting and speech content feedback; and if the judgment result is negative, downloading and installing the data packet of the target voice engine.
Preferably, the step of detecting when the current speech engine service fails comprises:
and receiving a message that the connection between the current voice engine and the server of the current voice engine fails or a message that the connection is overtime.
When the mobile terminal receives a message that the connection between the current speech engine and the server of the engine fails, or the connection is overtime and other abnormal connections, and the service of the speech engine fails before confirmation, a speech engine switching request is sent to the server in time, so that the influence on the normal use of a user is avoided.
In the embodiment of the invention, when the current voice engine service of the mobile terminal is abnormal, the voice engine switching request is sent to the server, the indication information which is fed back by the server and carries the target voice engine is received, and the switching is carried out to the target voice engine according to the indication information, so that the current voice service of the mobile terminal can be continued, and the voice service interruption or the voice assistant application program can not be used due to the abnormal voice engine service. And the mobile terminal can automatically send a voice engine switching request to the server according to the preset switching frequency, and regularly switch the voice engine so as to ensure that the voice engine is in a normal working state and improve the service quality. The server side can also be provided with a plurality of voice engines serving the mobile terminal, and when switching is needed, a plurality of choices are provided for the mobile terminal, so that the use experience of a user is improved. The invention solves the problem that the voice service is interrupted or the voice assistant can not be used when the voice engine is abnormal in the current voice assistant application program.
Referring to fig. 2, an embodiment of the present invention provides a mobile terminal 200, including:
a request sending module 201, configured to send a speech engine switching request to a server.
The server may be a cloud server, and the server performs data interaction with the mobile terminal 200, receives a voice switching request of the mobile terminal 200, and the like. When the voice engine service of the mobile terminal 200 is abnormal, the mobile terminal 200 transmits a voice engine switching request to the server.
And the information receiving module 202 is configured to receive indication information of a currently available target speech engine fed back by the server.
The server queries a currently available speech engine of the mobile terminal 200 according to a speech engine switching request sent by the mobile terminal 200, and feeds back indication information containing a target speech engine to the mobile terminal 200, and the mobile terminal 200 receives the information fed back by the server and acquires the indication information containing the target speech engine.
And an engine switching module 203, configured to switch the current speech engine of the mobile terminal 200 to a target speech engine according to the indication information, and perform a speech service using the target speech engine.
The target speech engine is determined according to the indication information fed back by the server, and the current speech engine which is serving by the speech assistant application program of the mobile terminal 200 is switched to the target speech engine, and the speech service is continued through the target speech engine, so that service interruption is avoided.
Optionally, the request sending module 201 includes:
the first sending submodule is used for sending a voice engine switching request to the server when the current voice engine service is detected to fail;
or,
and the second sending submodule is used for sending a voice engine switching request to the server according to the preset switching frequency.
Optionally, the mobile terminal 200 further includes:
a judging module, configured to judge whether the mobile terminal 200 has a data packet of the target speech engine:
and if the judgment result is negative, downloading and installing the data packet of the target voice engine.
Optionally, the first sending submodule is configured to:
and receiving a message that the connection between the current voice engine and the server of the current voice engine fails or a message that the connection is overtime.
In the above embodiment of the present invention, when the current voice engine service of the mobile terminal 200 is abnormal, the voice engine switching request is sent to the server, the indication information carrying the target voice engine fed back by the server is received, and the mobile terminal is switched to the target voice engine according to the indication information, so that the current voice service of the mobile terminal 200 can be continued, and it is not necessary that the voice service is interrupted or a voice assistant application program cannot be used due to the abnormal voice engine service. And the mobile terminal 200 can automatically send a voice engine switching request to the server according to the preset switching frequency, and switch the voice engine at regular time, so as to ensure that the voice engine is in a normal working state and improve the service quality. The server side can also be provided with a plurality of voice engines for serving the mobile terminal 200, and when switching is needed, a plurality of choices are provided for the mobile terminal 200, so that the use experience of a user is improved.
Referring to fig. 3, another embodiment of the present invention provides a speech engine switching method, applied to a server, including:
step 301, receiving a voice engine switching request of the mobile terminal.
The server can be a cloud server, and the server performs data interaction with the mobile terminal, receives a voice switching request of the mobile terminal, and the like.
Step 302, inquiring a currently available voice engine and a service quality parameter of the voice engine, wherein the service quality parameter is a value determined according to the current congestion degree and the connection power of the voice engine.
The server inquires a currently available voice engine and service quality parameters of the currently available voice engine according to a voice engine switching request sent by the mobile terminal, wherein the service quality parameters are reference parameters for evaluating the service quality currently provided by the voice engine and are determined at least according to the current congestion degree and the connection power of the voice engine. Generally, the congestion level and the connection power have a large impact on the quality of service of the speech engine.
Step 303, selecting a target speech engine from the currently available speech engines according to the quality of service parameter.
And selecting the voice engine with the optimal service quality parameter as a target voice engine and sending the target voice engine to the mobile terminal.
And step 304, sending the indication information of the target voice engine to the mobile terminal.
And the indication information containing the target speech engine is fed back to the mobile terminal, so that the mobile terminal receives the information fed back by the server and switches the speech engine according to the indication information.
Preferably, the connection success rate is the connection power between the voice engine and the server of the voice engine;
the service quality parameter is a comprehensive weight value obtained by weighting and summing the current congestion degree and the connection power according to a preset weight value.
Specifically, the connection success rate is the connection power between the speech engine and the server of the speech engine, and after the speech engine is successfully connected with the server thereof, the speech engine can provide the speech service.
The service quality parameter is a comprehensive weight value obtained by weighting and summing according to the current congestion degree and the connection success rate, and a target speech engine selected by the server is ensured to be a speech engine with better service quality.
In the embodiment of the invention, when a voice engine switching request sent by the mobile terminal is received, the currently available voice engine and the service quality parameters of the voice engine are inquired, the voice engine with better service quality is used as the target voice engine, and the indication information carrying the target voice engine is sent to the mobile terminal, so that the mobile terminal is switched to the target voice engine according to the indication information, the current voice service of the mobile terminal can be continued, and the voice service interruption or the voice assistant application program can not be used due to the abnormal service of the voice engine is avoided. The server side can be provided with a plurality of voice engines serving the mobile terminal, and when switching is needed, a plurality of choices are provided for the mobile terminal, so that the use experience of a user is improved.
Referring to fig. 4, an embodiment of the present invention provides a server 400, including:
a request receiving module 401, configured to receive a voice engine switching request of the mobile terminal.
The server 400 may be a cloud server, and the server 400 performs data interaction with the mobile terminal, receives a voice switching request of the mobile terminal, and so on.
A parameter query module 402, configured to query a currently available speech engine and a quality of service parameter of the speech engine, where the quality of service parameter is a value determined according to a current congestion level and a connection power of the speech engine.
The server 400 queries a currently available speech engine and service quality parameters of the currently available speech engine according to a speech engine switching request sent by the mobile terminal, wherein the service quality parameters are reference parameters for evaluating the service quality currently provided by the speech engine and are determined at least according to the current congestion degree and the connection power of the speech engine. Generally, the congestion level and the connection power have a large impact on the quality of service of the speech engine.
An engine selection module 403, configured to select a target speech engine from currently available speech engines according to the quality of service parameter.
And selecting the voice engine with the optimal service quality parameter as a target voice engine and sending the target voice engine to the mobile terminal.
And an information sending module 404, configured to send the indication information of the target speech engine to the mobile terminal.
Wherein, the indication information containing the target speech engine is fed back to the mobile terminal, so that the mobile terminal receives the information fed back by the server 400 and switches the speech engine according to the indication information.
Preferably, the connection success rate is the connection power between the speech engine and the server 400 of the speech engine;
the service quality parameter is a comprehensive weight value obtained by weighting and summing the current congestion degree and the connection power according to a preset weight value.
In the embodiment of the invention, when a voice engine switching request sent by the mobile terminal is received, the currently available voice engine and the service quality parameters of the voice engine are inquired, the voice engine with better service quality is used as the target voice engine, and the indication information carrying the target voice engine is sent to the mobile terminal, so that the mobile terminal is switched to the target voice engine according to the indication information, the current voice service of the mobile terminal can be continued, and the voice service interruption or the voice assistant application program can not be used due to the abnormal service of the voice engine is avoided. The server 400 can be provided with a plurality of speech engines serving the mobile terminal, and when switching is needed, a plurality of choices are provided for the mobile terminal, so that the use experience of a user is improved.
Optionally, as a specific example, referring to fig. 5, the switching of the speech engine of the mobile terminal mainly includes the following steps:
step 501: the voice assistant-like application determines the nature of the voice service.
During the process of voice service, the voice assistant application program can preset the rule of acquiring data to determine the type of the voice service, such as json data, key value pairs of key and value.
For example, [ "source _ type": "meishi", "more _ url": http:// baidu. com/"];
wherein source _ type is key and value is meishi;
when the value obtained through the key is meishi, obtaining a corresponding value, namely a url address, from the data of which the key value is more _ url, and displaying the gourmet information for the user;
other vertical services can be set, such as chat, the value corresponding to the source _ type may be chat, and the value corresponding to the key value in the jason field is displayed.
Step 502: and when the voice service is detected to be abnormal, or through a preset switching frequency, sending a voice engine switching request to the server.
Step 503: and receiving indication information of the currently available target voice engine fed back by the server, determining the target voice engine according to the indication information, and judging whether a data packet of the engine exists or not.
If not, executing step 504 and then executing step 505; if so, step 305 is performed directly.
Step 504: the data package for the target speech engine is downloaded and installed at the application store or elsewhere.
And 505, switching the current voice engine of the mobile terminal to a target voice engine, and performing voice service by using the target voice engine.
Wherein, carrying out voice service mainly includes: speech recognition, speech reporting and feedback of speech recognition content.
Alternatively, the data may be returned according to the rules specified by the voice assistant for obtaining data, such as a gourmet service, for example, what the user is nice to eat nearby, and the key value of the returned jason data is the same as the voice assistant:
[“source_type”:“meishi”,”more_url”:”http://****”];
and the jason data of chatting is:
[ "source _ type": chat "," more _ url "", "" show _ text "], and"! "].
Referring to fig. 6, the server side mainly executes the following processes:
step 601, receiving a voice engine switching request of the mobile terminal.
Step 602, inquiring a currently available speech engine of the mobile terminal and a service quality parameter of the speech engine.
Step 603, according to the quality of service parameters, one available speech engine type is designated from the currently available speech engines for the voice assistant to query.
And step 604, sending the indication information of the target voice engine to the mobile terminal.
In one aspect, an embodiment of the present invention further provides a mobile terminal, including: the memory, the processor and the computer program stored in the memory and capable of running on the processor, when executed by the processor, implement the processes of the above-mentioned speech engine switching method embodiment, and can achieve the same technical effects, and are not described herein again to avoid repetition.
On the other hand, an embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the computer program implements each process in the speech engine switching method, and can achieve the same technical effect, and in order to avoid repetition, details are not repeated here. The computer-readable storage medium may be a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk.
In another aspect, an embodiment of the present invention further provides a server, including: the speech engine switching method comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the computer program realizes the steps of the speech engine switching method when being executed by the processor.
Referring to fig. 7, still another embodiment of the present invention provides a mobile terminal 700. The mobile terminal 700 shown in fig. 7 includes: at least one processor 701, memory 702, at least one network interface 704, and other user interfaces 703. The various components in the mobile terminal 700 are coupled together by a bus system 705. It is understood that the bus system 705 is used to enable communications among the components. The bus system 705 includes a power bus, a control bus, and a status signal bus in addition to a data bus. But for clarity of illustration the various busses are labeled in figure 7 as the bus system 705.
The user interface 703 may include, among other things, a display, a keyboard, or a pointing device (e.g., a mouse, trackball, touch pad, or touch screen, among others.
It is to be understood that the memory 702 in embodiments of the present invention may be either volatile memory or nonvolatile memory, or may include both volatile and nonvolatile memory. The non-volatile Memory may be a Read-Only Memory (ROM), a Programmable ROM (PROM), an Erasable PROM (EPROM), an Electrically Erasable PROM (EEPROM), or a flash Memory. Volatile Memory can be Random Access Memory (RAM), which acts as external cache Memory. By way of illustration and not limitation, many forms of RAM are available, such as Static random access memory (Static RAM, SRAM), Dynamic Random Access Memory (DRAM), Synchronous Dynamic random access memory (Synchronous DRAM, SDRAM), Double Data rate Synchronous Dynamic random access memory (ddr SDRAM), Enhanced Synchronous SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), and direct memory bus RAM (DRRAM). The memory 702 of the systems and methods described herein is intended to comprise, without being limited to, these and any other suitable types of memory.
In some embodiments, memory 702 stores the following elements, executable modules or data structures, or a subset thereof, or an expanded set thereof: an operating system 7021 and application programs 7022.
The operating system 7021 includes various system programs, such as a framework layer, a core library layer, a driver layer, and the like, for implementing various basic services and processing hardware-based tasks. The application program 7022 includes various application programs, such as a Media Player (Media Player), a Browser (Browser), and the like, for implementing various application services, and a program for implementing the method according to the embodiment of the present invention may be included in the application program 7022.
In the embodiment of the present invention, the processor 701 is configured to, by calling a program or an instruction stored in the memory 702, specifically, a program or an instruction stored in the application 7022: sending a voice engine switching request to a server; receiving indication information of a currently available target speech engine fed back by a server; and switching the current voice engine of the mobile terminal to a target voice engine according to the indication information, and performing voice service by using the target voice engine.
The method disclosed in the above embodiments of the present invention may be applied to the processor 701, or implemented by the processor 701. The processor 701 may be an integrated circuit chip having signal processing capabilities. In implementation, the steps of the above method may be implemented by integrated logic circuits of hardware or instructions in the form of software in the processor 701. The Processor 701 may be a general-purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete Gate or transistor logic device, or discrete hardware components. The various methods, steps and logic blocks disclosed in the embodiments of the present invention may be implemented or performed. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the method disclosed in connection with the embodiments of the present invention may be directly implemented by a hardware decoding processor, or implemented by a combination of hardware and software modules in the decoding processor. The software module may be located in ram, flash memory, rom, prom, or eprom, registers, etc. storage media as is well known in the art. The storage medium is located in the memory 702, and the processor 701 reads the information in the memory 702 and performs the steps of the above method in combination with the hardware thereof.
It is to be understood that the embodiments described herein may be implemented in hardware, software, firmware, middleware, microcode, or any combination thereof. For a hardware implementation, the processing units may be implemented within one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), general purpose processors, controllers, micro-controllers, microprocessors, other electronic units configured to perform the functions described herein, or a combination thereof.
For a software implementation, the techniques described herein may be implemented with modules (e.g., procedures, functions, and so on) that perform the functions described herein. The software codes may be stored in a memory and executed by a processor. The memory may be implemented within the processor or external to the processor.
Optionally, as another embodiment, the processor 701 is further configured to: when the current voice engine service failure is detected, sending a voice engine switching request to a server; or sending a voice engine switching request to the server according to a preset switching frequency.
Optionally, as another embodiment, the processor 701 is further configured to: judging whether the mobile terminal has a data packet of a target voice engine: and if the judgment result is negative, downloading and installing the data packet of the target voice engine.
Optionally, as another embodiment, the processor 701 is further configured to: and receiving a message that the connection between the current voice engine and the server of the current voice engine fails or a message that the connection is overtime.
The mobile terminal 700 can implement the processes implemented by the mobile terminal in the foregoing embodiments, and details are not repeated here to avoid repetition.
According to the mobile terminal 700 of the embodiment of the invention, when the current voice engine service of the mobile terminal is abnormal, the processor 701 sends the voice engine switching request to the server, receives the indication information carrying the target voice engine fed back by the server, and switches to the target voice engine according to the indication information, so that the current voice service of the mobile terminal can be continued, and the voice service interruption or the voice assistant application program can not be used due to the abnormal voice engine service is avoided. And the mobile terminal can automatically send a voice engine switching request to the server according to the preset switching frequency, and regularly switch the voice engine so as to ensure that the voice engine is in a normal working state and improve the service quality. The server side can also be provided with a plurality of voice engines serving the mobile terminal, and when switching is needed, a plurality of choices are provided for the mobile terminal, so that the use experience of a user is improved.
Referring to fig. 8, yet another embodiment of the present invention provides a mobile terminal 800. Specifically, the mobile terminal 800 in fig. 8 may be a mobile phone, a tablet computer, a Personal Digital Assistant (PDA), or a vehicle-mounted computer.
The mobile terminal 800 in fig. 8 includes a Radio Frequency (RF) circuit 810, a memory 820, an input unit 830, a display unit 840, a processor 850, a Wi-fi (wireless fidelity) module 860, an audio circuit 870, and a power supply 880.
The input unit 830 may be used, among other things, to receive numeric or character information input by a user and to generate signal inputs related to user settings and function control of the mobile terminal 800.
Specifically, in the embodiment of the present invention, the input unit 830 may include a touch panel 831. The touch panel 831, also referred to as a touch screen, can collect touch operations performed by a user on or near the touch panel 831 (e.g., operations performed by the user on the touch panel 831 using a finger, a stylus, or any other suitable object or accessory), and drive the corresponding connection device according to a preset program. Alternatively, the touch panel 831 may include two portions, i.e., a touch detection device and a touch controller. The touch detection device detects the touch direction of a user, detects a signal brought by touch operation and transmits the signal to the touch controller; the touch controller receives touch information from the touch sensing device, converts it to touch point coordinates, and sends the touch point coordinates to the processor 850, and can receive and execute commands sent from the processor 850. In addition, the touch panel 831 may be implemented by various types such as a resistive type, a capacitive type, an infrared ray, and a surface acoustic wave. In addition to the touch panel 831, the input unit 830 may include other input devices 832, and the other input devices 832 may include, but are not limited to, one or more of a physical keyboard, function keys (e.g., volume control keys, switch keys, etc.), a trackball, a mouse, a joystick, and the like.
Among other things, the display unit 840 may be used to display information input by the user or information provided to the user and various menu interfaces of the mobile terminal 800. The display unit 840 may include a display panel 841, and the display panel 841 may be alternatively configured in the form of an LCD or an Organic Light-Emitting Diode (OLED), or the like.
It should be noted that the touch panel 831 can overlay the display panel 841 to form a touch display screen, which, when detecting a touch operation thereon or nearby, is transmitted to the processor 850 to determine the type of touch event, and then the processor 850 provides a corresponding visual output on the touch display screen according to the type of touch event.
The touch display screen comprises an application program interface display area and a common control display area. The arrangement modes of the application program interface display area and the common control display area are not limited, and can be an arrangement mode which can distinguish two display areas, such as vertical arrangement, left-right arrangement and the like. The application interface display area may be used to display an interface of an application. Each interface may contain at least one interface element such as an icon and/or widget desktop control for an application. The application interface display area may also be an empty interface that does not contain any content. The common control display area is used for displaying controls with high utilization rate, such as application icons like setting buttons, interface numbers, scroll bars, phone book icons and the like.
In an embodiment of the present invention, the processor 850, by calling software programs and/or modules stored in the first memory 821 and/or data in the second memory 822, is configured to: sending a voice engine switching request to a server; receiving indication information of a currently available target speech engine fed back by a server; and switching the current voice engine of the mobile terminal to a target voice engine according to the indication information, and performing voice service by using the target voice engine.
Optionally, as another embodiment, the processor 850 is further configured to: when the current voice engine service failure is detected, sending a voice engine switching request to a server; or sending a voice engine switching request to the server according to a preset switching frequency.
Optionally, as another embodiment, the processor 850 is further configured to: judging whether the mobile terminal has a data packet of a target voice engine: and if the judgment result is negative, downloading and installing the data packet of the target voice engine.
Optionally, as another embodiment, the processor 850 is further configured to: and receiving a message that the connection between the current voice engine and the server of the current voice engine fails or a message that the connection is overtime.
The mobile terminal 800 can implement each process implemented by the mobile terminal in the foregoing embodiments, and details are not repeated here to avoid repetition.
According to the mobile terminal 800 of the embodiment of the present invention, when the current voice engine service of the mobile terminal is abnormal, the processor 850 sends a voice engine switching request to the server, receives the indication information carrying the target voice engine fed back by the server, and switches to the target voice engine according to the indication information, so that the current voice service of the mobile terminal can be continued, and it is not necessary that the voice service is interrupted or a voice assistant application program cannot be used due to the abnormal voice engine service. And the mobile terminal can automatically send a voice engine switching request to the server according to the preset switching frequency, and regularly switch the voice engine so as to ensure that the voice engine is in a normal working state and improve the service quality. The server side can also be provided with a plurality of voice engines serving the mobile terminal, and when switching is needed, a plurality of choices are provided for the mobile terminal, so that the use experience of a user is improved.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: u disk, removable hard disk, ROM, RAM, magnetic disk, optical disk, and other various media capable of storing program codes.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (16)

1. A speech engine switching method is applied to a mobile terminal and is characterized by comprising the following steps:
sending a voice engine switching request to a server;
receiving indication information of a currently available target speech engine fed back by the server;
and switching the current voice engine of the mobile terminal to a target voice engine according to the indication information, and using the target voice engine to perform voice service.
2. The method of claim 1, wherein the step of sending a speech engine switch request to the server comprises:
when the current voice engine service failure is detected, sending a voice engine switching request to a server;
or,
and sending a voice engine switching request to the server according to the preset switching frequency.
3. The method of claim 1, wherein prior to the step of switching the current speech engine of the mobile terminal to the target speech engine, the method further comprises:
judging whether the mobile terminal has the data packet of the target voice engine:
and if the judgment result is negative, downloading and installing the data packet of the target voice engine.
4. The method of claim 2, wherein the step of detecting a current speech engine service failure comprises:
and receiving a message that the connection between the current voice engine and the server of the current voice engine fails or a message that the connection is overtime.
5. A speech engine switching method is applied to a server and is characterized by comprising the following steps:
receiving a voice engine switching request of a mobile terminal;
inquiring a currently available voice engine and a service quality parameter of the voice engine, wherein the service quality parameter is a numerical value determined according to the current congestion degree and the connection power of the voice engine;
selecting a target speech engine from the currently available speech engines according to the service quality parameters;
and sending the indication information of the target voice engine to the mobile terminal.
6. The method of claim 5, wherein the connection success rate is a connection power between the speech engine and a server of the speech engine;
the service quality parameter is a comprehensive weight value obtained by weighting and summing the current congestion degree and the connection success rate according to a preset weight value.
7. A mobile terminal, comprising:
the request sending module is used for sending a voice engine switching request to the server;
the information receiving module is used for receiving the indication information of the currently available target speech engine fed back by the server;
and the engine switching module is used for switching the current voice engine of the mobile terminal to a target voice engine according to the indication information and using the target voice engine to perform voice service.
8. The mobile terminal of claim 7, wherein the request sending module comprises:
the first sending submodule is used for sending a voice engine switching request to the server when the current voice engine service is detected to fail;
or,
and the second sending submodule is used for sending a voice engine switching request to the server according to the preset switching frequency.
9. The mobile terminal of claim 7, wherein the mobile terminal further comprises:
a judging module, configured to judge whether the mobile terminal has a data packet of the target speech engine:
and if the judgment result is negative, downloading and installing the data packet of the target voice engine.
10. The mobile terminal of claim 8, wherein the first sending submodule is configured to:
and receiving a message that the connection between the current voice engine and the server of the current voice engine fails or a message that the connection is overtime.
11. A server, comprising:
the request receiving module is used for receiving a voice engine switching request of the mobile terminal;
the parameter query module is used for querying a currently available voice engine and a service quality parameter of the voice engine, wherein the service quality parameter is a numerical value determined according to the current congestion degree of the voice engine and the connection power;
the engine selection module is used for selecting a target speech engine from the currently available speech engines according to the service quality parameters;
and the information sending module is used for sending the indication information of the target voice engine to the mobile terminal.
12. The server of claim 11, wherein the connection success rate is a connection power between the speech engine and the server of the speech engine;
the service quality parameter is a comprehensive weight value obtained by weighting and summing the current congestion degree and the connection success rate according to a preset weight value.
13. A mobile terminal, comprising: memory, processor and computer program stored on the memory and executable on the processor, which when executed by the processor implements the steps in the speech engine switching method according to any of claims 1 to 4.
14. A computer-readable storage medium, having stored thereon a computer program which, when executed by a processor, implements the steps in the speech engine switching method of any one of claims 1 to 4.
15. A server, comprising: memory, processor and computer program stored on the memory and executable on the processor, which computer program, when executed by the processor, carries out the steps in the speech engine switching method according to claim 5 or 6.
16. A computer-readable storage medium, having stored thereon a computer program which, when being executed by a processor, carries out the steps of the speech engine switching method according to claim 5 or 6.
CN201710601801.7A 2017-07-21 2017-07-21 Voice engine switching method, mobile terminal and server Active CN107171874B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710601801.7A CN107171874B (en) 2017-07-21 2017-07-21 Voice engine switching method, mobile terminal and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710601801.7A CN107171874B (en) 2017-07-21 2017-07-21 Voice engine switching method, mobile terminal and server

Publications (2)

Publication Number Publication Date
CN107171874A true CN107171874A (en) 2017-09-15
CN107171874B CN107171874B (en) 2020-02-18

Family

ID=59817349

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710601801.7A Active CN107171874B (en) 2017-07-21 2017-07-21 Voice engine switching method, mobile terminal and server

Country Status (1)

Country Link
CN (1) CN107171874B (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107945797A (en) * 2017-12-07 2018-04-20 携程旅游信息技术(上海)有限公司 Monitoring system based on speech recognition
CN108831448A (en) * 2018-03-22 2018-11-16 北京小米移动软件有限公司 The method, apparatus and storage medium of voice control smart machine
CN108847229A (en) * 2018-05-23 2018-11-20 上海爱优威软件开发有限公司 A kind of information interacting method and terminal based on voice assistant
CN108881377A (en) * 2018-04-28 2018-11-23 上海车音智能科技有限公司 A kind of application service invocation method, terminal device and server
CN109243464A (en) * 2018-07-04 2019-01-18 平安科技(深圳)有限公司 Speech recognition tools dispositions method, device, computer equipment and storage medium
CN109859755A (en) * 2019-03-13 2019-06-07 深圳市同行者科技有限公司 A kind of audio recognition method, storage medium and terminal
CN111105793A (en) * 2019-12-03 2020-05-05 北京蓦然认知科技有限公司 Voice interaction method and device based on interaction engine cluster
CN111128125A (en) * 2019-12-30 2020-05-08 深圳市优必选科技股份有限公司 Voice service configuration system and voice service configuration method and device thereof
CN111179934A (en) * 2018-11-12 2020-05-19 奇酷互联网络科技(深圳)有限公司 Method of selecting a speech engine, mobile terminal and computer-readable storage medium
CN111614701A (en) * 2019-02-22 2020-09-01 杭州海康威视数字技术股份有限公司 Distributed cluster and container state switching method and device
CN111862972A (en) * 2020-07-08 2020-10-30 北京梧桐车联科技有限责任公司 Voice interaction service method, device, equipment and storage medium
CN113449141A (en) * 2021-06-08 2021-09-28 阿波罗智联(北京)科技有限公司 Voice broadcasting method and device, electronic equipment and storage medium
CN113591000A (en) * 2021-07-13 2021-11-02 青岛海信移动通信技术股份有限公司 Browser engine switching method, device and equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090043583A1 (en) * 2007-08-08 2009-02-12 International Business Machines Corporation Dynamic modification of voice selection based on user specific factors
CN103325371A (en) * 2013-06-05 2013-09-25 杭州网豆数字技术有限公司 Voice recognition system and method based on cloud
CN106373565A (en) * 2016-08-24 2017-02-01 安徽咪鼠科技有限公司 Method for automatically switching speech recognition engines
CN106411585A (en) * 2016-09-21 2017-02-15 深圳市智物联网络有限公司 Server switching method, adapter, server and scheduling device
CN106710594A (en) * 2016-11-17 2017-05-24 北京中科汇联科技股份有限公司 Intelligent speech interaction system based on cloud end

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090043583A1 (en) * 2007-08-08 2009-02-12 International Business Machines Corporation Dynamic modification of voice selection based on user specific factors
CN103325371A (en) * 2013-06-05 2013-09-25 杭州网豆数字技术有限公司 Voice recognition system and method based on cloud
CN106373565A (en) * 2016-08-24 2017-02-01 安徽咪鼠科技有限公司 Method for automatically switching speech recognition engines
CN106411585A (en) * 2016-09-21 2017-02-15 深圳市智物联网络有限公司 Server switching method, adapter, server and scheduling device
CN106710594A (en) * 2016-11-17 2017-05-24 北京中科汇联科技股份有限公司 Intelligent speech interaction system based on cloud end

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107945797A (en) * 2017-12-07 2018-04-20 携程旅游信息技术(上海)有限公司 Monitoring system based on speech recognition
CN107945797B (en) * 2017-12-07 2021-12-31 携程旅游信息技术(上海)有限公司 Monitoring system based on speech recognition
CN108831448A (en) * 2018-03-22 2018-11-16 北京小米移动软件有限公司 The method, apparatus and storage medium of voice control smart machine
CN108881377A (en) * 2018-04-28 2018-11-23 上海车音智能科技有限公司 A kind of application service invocation method, terminal device and server
CN108881377B (en) * 2018-04-28 2021-12-03 上海车音智能科技有限公司 Application service calling method, terminal equipment and server
CN108847229A (en) * 2018-05-23 2018-11-20 上海爱优威软件开发有限公司 A kind of information interacting method and terminal based on voice assistant
CN109243464A (en) * 2018-07-04 2019-01-18 平安科技(深圳)有限公司 Speech recognition tools dispositions method, device, computer equipment and storage medium
CN111179934A (en) * 2018-11-12 2020-05-19 奇酷互联网络科技(深圳)有限公司 Method of selecting a speech engine, mobile terminal and computer-readable storage medium
CN111614701A (en) * 2019-02-22 2020-09-01 杭州海康威视数字技术股份有限公司 Distributed cluster and container state switching method and device
CN111614701B (en) * 2019-02-22 2022-09-02 杭州海康威视数字技术股份有限公司 Distributed cluster and container state switching method and device
CN109859755A (en) * 2019-03-13 2019-06-07 深圳市同行者科技有限公司 A kind of audio recognition method, storage medium and terminal
CN111105793A (en) * 2019-12-03 2020-05-05 北京蓦然认知科技有限公司 Voice interaction method and device based on interaction engine cluster
CN111105793B (en) * 2019-12-03 2022-09-06 杭州蓦然认知科技有限公司 Voice interaction method and device based on interaction engine cluster
CN111128125A (en) * 2019-12-30 2020-05-08 深圳市优必选科技股份有限公司 Voice service configuration system and voice service configuration method and device thereof
CN111862972A (en) * 2020-07-08 2020-10-30 北京梧桐车联科技有限责任公司 Voice interaction service method, device, equipment and storage medium
CN111862972B (en) * 2020-07-08 2023-11-14 北京梧桐车联科技有限责任公司 Voice interaction service method, device, equipment and storage medium
CN113449141A (en) * 2021-06-08 2021-09-28 阿波罗智联(北京)科技有限公司 Voice broadcasting method and device, electronic equipment and storage medium
CN113591000A (en) * 2021-07-13 2021-11-02 青岛海信移动通信技术股份有限公司 Browser engine switching method, device and equipment
CN113591000B (en) * 2021-07-13 2024-04-19 青岛海信移动通信技术有限公司 Browser engine switching method, device and equipment

Also Published As

Publication number Publication date
CN107171874B (en) 2020-02-18

Similar Documents

Publication Publication Date Title
CN107171874B (en) Voice engine switching method, mobile terminal and server
CN107466077B (en) Network switching method and mobile terminal
CN115357178B (en) Control method applied to screen-throwing scene and related equipment
CN107526492B (en) Interface display method of application program and mobile terminal
CN106304276B (en) A kind of wireless network processing method and mobile terminal
CN106383645A (en) Music playing control method and mobile terminal
US20190274184A1 (en) Configuration method of discountinuous reception parameter, mobile terminal, and base station
US20140189532A1 (en) Editing text-based communications
CN107426765B (en) Measurement report generation method and mobile terminal
CN106453925B (en) Information transmission method and mobile terminal
CN107172287B (en) Multi-card communication method and mobile terminal
US20130019199A1 (en) Apparatus and method for executing shortcut function in a portable terminal
CN107040965B (en) Flow control method and device and mobile terminal
CN106095250B (en) Application icon layout method and mobile terminal
CN106453953B (en) Wireless access point connection method and mobile terminal
CN107333325B (en) Network searching method and mobile terminal
EP3472699B1 (en) Method and electronic device for managing functionality of applications
CN109683739B (en) Method and device for adjusting operation interface, storage medium and electronic equipment
EP3509353A1 (en) Wifi connection method and device
US20120017241A1 (en) Handheld device and text input method
CN106027373A (en) Information management method and device
CN106250146B (en) Application demand processing method and device
CN112749074B (en) Test case recommending method and device
CN107728898B (en) Information processing method and mobile terminal
CN112218281B (en) Scanning gun connection method, device, terminal and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant