CN111147530A - System architecture, multi-voice platform switching method, intelligent terminal and storage medium - Google Patents

Info

Publication number
CN111147530A
Authority
CN
China
Prior art keywords
voice
intelligent terminal
voice platform
platform
interface
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811204114.2A
Other languages
Chinese (zh)
Inventor
邱振青
吴海全
张恩勤
曹磊
师瑞文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Grandsun Electronics Co Ltd
Original Assignee
Shenzhen Grandsun Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Grandsun Electronics Co Ltd
Priority to CN201811204114.2A
Publication of CN111147530A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/34 Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/02 Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/50 Network services
    • H04L67/51 Discovery or management thereof, e.g. service location protocol [SLP] or web services
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Abstract

The application relates to the technical field of network architecture and provides a system architecture supporting multiple voice platforms, a switching method for multiple voice platforms, an intelligent terminal, and a computer-readable storage medium. The switching method comprises the following steps: in a state of being connected to a first voice platform, controlling the currently connected first voice platform to enter a monitoring mode; in the monitoring mode of the first voice platform, if an instruction for switching to a second voice platform is monitored, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform, the second voice platform being different from the first voice platform; and establishing a connection with the second voice platform through the converted interface. The application allows one intelligent terminal to be adapted to a plurality of voice platforms, so that the intelligent terminal can switch among them.

Description

System architecture, multi-voice platform switching method, intelligent terminal and storage medium
Technical Field
The present application relates to the field of network architecture technologies, and in particular, to a system architecture supporting multiple voice platforms, a switching method for multiple voice platforms, an intelligent terminal, and a computer-readable storage medium.
Background
The shift to cloud computing is a significant change facing the industry, and the emergence of various cloud platforms is one of the most important parts of this shift. As the name implies, a cloud platform allows developers to run their programs in the "cloud", to use services provided in the "cloud", or both.
Because cloud platforms can run programs and provide services, various intelligent terminals based on cloud platforms have emerged, for example, smart speakers based on a voice platform. However, an intelligent terminal based on a cloud platform currently works only with its own corresponding cloud platform; when a service provided by another cloud platform is needed, the user has to change to a different intelligent terminal that matches that other cloud platform.
Disclosure of Invention
In view of this, the embodiments of the present application provide a system architecture supporting multiple voice platforms, a switching method for multiple voice platforms, an intelligent terminal, and a computer-readable storage medium, so as to solve the problem that the current intelligent terminal cannot adapt to multiple voice platforms.
A first aspect of the embodiments of the present application provides a system architecture supporting multi-voice platform switching, including:
a voice service layer, an adaptation layer, and a framework layer of an intelligent terminal;
the voice service layer comprises a plurality of voice service controls, each of which corresponds to one voice platform and communicates with that voice platform, so that the voice platform provides voice services for the intelligent terminal;
the adaptation layer is used for converting an interface of the framework layer of the intelligent terminal into an interface adapted to a corresponding voice platform, so as to establish a communication connection with the voice service control corresponding to that voice platform;
and the framework layer of the intelligent terminal is used for transmitting the requirements of the intelligent terminal to the corresponding voice platform through the interface, and for receiving, through the interface, the voice service that the voice platform provides in response to those requirements.
A second aspect of the embodiments of the present application provides a switching method for a multi-voice platform. The switching method is applied to an intelligent terminal based on the system architecture supporting multi-voice platform switching provided in the first aspect, and includes:
in a state of being connected to a first voice platform, controlling the currently connected first voice platform to enter a monitoring mode;
in the monitoring mode of the first voice platform, if an instruction for switching to a second voice platform is monitored, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform, wherein the second voice platform is different from the first voice platform;
and establishing a connection with the second voice platform through the converted interface.
A third aspect of the embodiments of the present application provides an intelligent terminal, including:
a monitoring module, used for controlling the currently connected first voice platform to enter a monitoring mode in a state of being connected to the first voice platform;
a conversion module, used for controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adapted to a second voice platform if an instruction for switching to the second voice platform is monitored in the monitoring mode of the first voice platform, wherein the second voice platform is different from the first voice platform;
and a switching module, used for establishing a connection with the second voice platform through the converted interface.
A fourth aspect of the embodiments of the present application provides an intelligent terminal, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the steps of the method provided in the second aspect of the embodiments of the present application when executing the computer program.
A fifth aspect of embodiments of the present application provides a computer-readable storage medium storing a computer program which, when executed by one or more processors, performs the steps of the method provided by the second aspect of embodiments of the present application.
A sixth aspect of embodiments of the present application provides a computer program product comprising a computer program which, when executed by one or more processors, performs the steps of the method provided by the second aspect of embodiments of the present application.
An embodiment of the present application provides a system architecture that includes a voice service layer, an adaptation layer, and a framework layer of an intelligent terminal. The voice service layer comprises a plurality of voice service controls, each of which corresponds to one voice platform and communicates with that voice platform, so that the voice platform provides voice services for the intelligent terminal. The adaptation layer converts an interface of the framework layer of the intelligent terminal into an interface adapted to the corresponding voice platform, so as to establish a communication connection with the voice service control corresponding to that voice platform. The framework layer of the intelligent terminal transmits the requirements of the intelligent terminal to the corresponding voice platform and receives the voice service that the voice platform provides in response. Because the adaptation layer can convert the interface provided by the framework layer of the intelligent terminal into an interface adapted to the corresponding voice platform, the intelligent terminal can switch among a plurality of voice platforms.
The embodiments of the application also provide a switching method for a multi-voice platform, comprising the following steps: in a state of being connected to a first voice platform, controlling the currently connected first voice platform to enter a monitoring mode; in the monitoring mode of the first voice platform, if an instruction for switching to a second voice platform is monitored, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform, wherein the second voice platform is different from the first voice platform; and establishing a connection with the second voice platform through the converted interface. Because the interface provided by the framework layer of the intelligent terminal is converted into an interface adapted to the second voice platform once the switching instruction is monitored, the intelligent terminal can switch among a plurality of voice platforms.
Drawings
In order to illustrate the technical solutions in the embodiments of the present application more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings in the following description show only some embodiments of the present application, and those skilled in the art can obtain other drawings from them without inventive effort.
FIG. 1 is a schematic diagram of a system architecture supporting multiple voice platforms according to an embodiment of the present application;
FIG. 2 is a schematic flowchart of a switching method for a multi-voice platform according to an embodiment of the present application;
FIG. 3 is a schematic block diagram of an intelligent terminal provided in an embodiment of the present application;
FIG. 4 is a schematic block diagram of another intelligent terminal provided in an embodiment of the present application.
Detailed Description
In the following description, for purposes of explanation and not limitation, specific details are set forth, such as particular system structures, techniques, etc. in order to provide a thorough understanding of the embodiments of the present application. It will be apparent, however, to one skilled in the art that the present application may be practiced in other embodiments that depart from these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present application with unnecessary detail.
It will be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
It is also to be understood that the terminology used in the description of the present application herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the application. As used in the specification of the present application and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" as used in this specification and the appended claims refers to and includes any and all possible combinations of one or more of the associated listed items.
As used in this specification and the appended claims, the term "if" may be interpreted contextually as "when", "upon" or "in response to a determination" or "in response to a detection". Similarly, the phrase "if it is determined" or "if a [ described condition or event ] is detected" may be interpreted contextually to mean "upon determining" or "in response to determining" or "upon detecting [ described condition or event ]" or "in response to detecting [ described condition or event ]".
In order to explain the technical solution described in the present application, the following description will be given by way of specific examples.
FIG. 1 is a schematic diagram of a system architecture supporting multiple voice platforms according to an embodiment of the present application. FIG. 1 shows the system architecture as applied to a smart speaker, where the clouds represent different voice platforms and the layer in which the cloud services are located is the voice service layer. As shown in the figure, the system architecture includes:
a voice service layer, an adaptation layer, and a framework layer (Framework) of the intelligent terminal;
the voice service layer comprises a plurality of voice service controls, each of which corresponds to one voice platform and communicates with that voice platform, so that the voice platform provides voice services for the intelligent terminal;
the adaptation layer is used for converting an interface of the framework layer of the intelligent terminal into an interface adapted to a corresponding voice platform, so as to establish a communication connection with the voice service control corresponding to that voice platform;
and the framework layer of the intelligent terminal is used for transmitting the requirements of the intelligent terminal to the corresponding voice platform through the interface, and for receiving, through the interface, the voice service that the voice platform provides in response to those requirements.
In a practical application scenario of the embodiments of the present application, a voice platform layer is further provided. At least two voice platforms (for example, cloud A, cloud B, cloud C and cloud D) are distributed on the voice platform layer and provide voice services for the intelligent terminal. The voice platform layer is the sum of each voice vendor's cloud software and service clusters, and carries functions such as computing, storage, services, data and security. The voice platforms of the voice platform layer can provide various voice services for the intelligent terminal. Taking the smart speaker as an example, the voice services provided by the matching voice platform include: voice recognition services, multimedia content, account security, payment, device and user behavior analysis, Internet of Things (IoT) smart home control, device software upgrade services, and so on, which are key to smart speaker software technology.
Each voice platform corresponds to one voice service (cloud service A, cloud service B, cloud service C and cloud service D). The voice services corresponding to the voice platforms in the voice platform layer together form the voice service layer. A voice service is the software executor of its voice platform on the intelligent terminal; it can be a process or a service that the system loads and starts. It is responsible for communicating with its voice platform and for calling the services provided by the system to fulfil the service requirements of the intelligent terminal. Taking the smart speaker as an example, the voice service communicates with the voice platform to provide a recording service, a text-to-speech (TTS) service, a wake-up word service, a streaming media playing service, a security management service, and the like for the smart speaker.
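The one-to-one relationship between voice platforms and on-device voice services can be sketched as a small registry. This is only an illustrative sketch: the class names `VoiceService` and `VoiceServiceLayer`, the `start`/`stop` methods, and the policy of running a single service at a time are assumptions made for the example and are not specified by the application.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class VoiceService:
    """Hypothetical on-device executor for exactly one voice platform."""
    platform: str
    running: bool = False

    def start(self) -> None:
        # A real service would open a session with its cloud platform here.
        self.running = True

    def stop(self) -> None:
        self.running = False


@dataclass
class VoiceServiceLayer:
    """Holds one voice service per voice platform (assumed structure)."""
    services: Dict[str, VoiceService] = field(default_factory=dict)

    def register(self, platform: str) -> VoiceService:
        if platform in self.services:
            raise ValueError(f"a service for {platform!r} is already registered")
        service = VoiceService(platform)
        self.services[platform] = service
        return service

    def activate(self, platform: str) -> VoiceService:
        """Start the service for `platform` and stop all others (assumed policy)."""
        if platform not in self.services:
            raise KeyError(platform)
        for name, service in self.services.items():
            if name == platform:
                service.start()
            else:
                service.stop()
        return self.services[platform]


layer = VoiceServiceLayer()
for name in ("cloud A", "cloud B", "cloud C", "cloud D"):
    layer.register(name)
layer.activate("cloud B")
```

Keeping the services in a dictionary keyed by platform name makes the "each voice platform corresponds to one voice service" constraint explicit: a second registration for the same platform is rejected.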
The adaptation layer is the key to compatibility with each voice platform. It converts the interface of the framework layer of the intelligent terminal into an interface adapted to the Software Development Kit (SDK) of each voice platform, thereby shielding the differences between the voice platforms. In practical application, when a user wants the smart speaker to connect to cloud A, the user can issue an instruction to connect to cloud A. The adaptation layer in the smart speaker then converts the interface provided by the framework layer in the smart speaker into an interface adapted to cloud A, and a communication connection with cloud A is established through the converted interface and the cloud service A corresponding to cloud A. Once the communication connection is established, the smart speaker can use the various services provided by cloud A through the converted interface and cloud service A.
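One common way to realize this kind of interface conversion is the adapter pattern, sketched below. The sketch is an assumption about how an adaptation layer could be written, not the implementation described by the application: the SDK classes `CloudASdk` and `CloudBSdk`, their method names and return formats, and the `recognize` interface are all invented stand-ins for vendor SDKs that differ from one another.

```python
from abc import ABC, abstractmethod


class CloudASdk:
    """Invented stand-in for vendor A's SDK: returns a dict."""

    def asr(self, pcm: bytes) -> dict:
        return {"text": f"A:{len(pcm)}"}


class CloudBSdk:
    """Invented stand-in for vendor B's SDK: different name and signature."""

    def speech_to_text(self, data: bytes, lang: str) -> str:
        return f"B:{len(data)}:{lang}"


class VoicePlatformAdapter(ABC):
    """Uniform interface that the framework layer calls (hypothetical)."""

    @abstractmethod
    def recognize(self, audio: bytes) -> str:
        """Return the recognized text for the given audio."""


class CloudAAdapter(VoicePlatformAdapter):
    def __init__(self) -> None:
        self._sdk = CloudASdk()

    def recognize(self, audio: bytes) -> str:
        # Convert the framework-layer call into vendor A's call shape.
        return self._sdk.asr(audio)["text"]


class CloudBAdapter(VoicePlatformAdapter):
    def __init__(self) -> None:
        self._sdk = CloudBSdk()

    def recognize(self, audio: bytes) -> str:
        # Vendor B needs an extra language argument; the adapter supplies it.
        return self._sdk.speech_to_text(audio, lang="zh")


# The adaptation layer picks the adapter that matches the requested platform.
ADAPTERS = {"cloud A": CloudAAdapter, "cloud B": CloudBAdapter}


def adapt(platform: str) -> VoicePlatformAdapter:
    return ADAPTERS[platform]()
```

With this arrangement the framework layer only ever calls `recognize`, so the SDK differences stay hidden behind the adapters, which is the "shielding" role the text assigns to the adaptation layer.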
The framework layer of the intelligent terminal implements the general requirements of the intelligent terminal. Taking the smart speaker as an example, the framework layer contains: a wake-up word component, a TTS component, an audio component, a cloud-based system version upgrade component, a WiFi/BT management and usage service, an audio media playing and recording service, and the like. Many common services and control algorithms for intelligent terminals can be implemented at this layer. This layer also provides an interface through which the requirements of the intelligent terminal are transmitted to the corresponding voice platform and through which the voice service that the voice platform provides in response is received.
In this embodiment of the present application, the system architecture can enable the intelligent terminal to switch between different voice platforms, and of course, for the intelligent terminal, the system architecture further needs to include:
an operating system layer and a hardware layer. The operating system layer comprises an operating system running in the intelligent terminal and is used for providing a software execution environment;
the hardware layer comprises hardware in the intelligent terminal and is used for providing a carrier for software running in the intelligent terminal.
The operating system layer is an operating system corresponding to each intelligent terminal, is a manager of each hardware in the intelligent terminal, and is responsible for providing a reliable, safe and concurrent software execution environment and providing some basic function libraries.
Each intelligent terminal also needs hardware support. The hardware corresponding to the intelligent terminals forms the hardware layer; the hardware is the carrier of all the software of the intelligent terminal and the object that the user sees and operates. The reliability and stability of the hardware are crucial to the functionality of the intelligent terminal. Taking the smart speaker as an example, this layer includes a CPU, a memory, an external memory, a power module, a microphone array, a speaker array, a wireless communication module, and the like.
As another embodiment of the present application, the system architecture further includes a common software component foundation.
The common software component foundation makes interaction and communication between and within the layers of the system architecture more flexible and clear, and improves development efficiency. It comprises general-purpose middleware, general-purpose network communication protocols (UDP, TCP, HTTP, etc.), and general-purpose software programming languages.
The embodiments of the application thus provide a system architecture supporting multi-voice platform switching. The adaptation layer converts the interface provided by the framework layer of the intelligent terminal into an interface adapted to the corresponding voice platform, so that a communication connection can be established with that voice platform and the intelligent terminal can use the voice services provided by different voice platforms.
Fig. 2 is a schematic flowchart of a switching method of a multi-voice platform according to an embodiment of the present application, and is applied to an intelligent terminal, where as shown in the figure, the method may include the following steps:
Step S201, in a state of being connected to the first voice platform, controlling the currently connected first voice platform to enter a monitoring mode.
In the embodiment of the present application, if the intelligent terminal is to switch among multiple voice platforms based on the system architecture provided in the embodiment shown in FIG. 1, the software flow must be designed, during the development stage of the smart speaker, so that it can be configured for and switched to a specific voice platform. For example, attribute values or switches of compile-time configuration items are defined in the product software development stage, and build scripts then automatically trim or modify software behavior in the software version compilation stage. Each software layer processes the configuration items as required, so that the software as a whole is trimmed and optimized. Even after the product is released, an updated system version can be loaded through an upgrade to enable switching among a plurality of voice platforms.
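As a rough illustration of such configuration items, a build script could read a set of switches and decide which voice platforms are compiled into a given software version. The application does not name any configuration items, so every key below (`VOICE_PLATFORM_CLOUD_A`, `DEFAULT_VOICE_PLATFORM`, and so on) and both helper functions are hypothetical.

```python
from typing import Dict, List, Union

ConfigValue = Union[bool, str]
PLATFORM_PREFIX = "VOICE_PLATFORM_"

# Hypothetical build-configuration items for one product variant.
BUILD_CONFIG: Dict[str, ConfigValue] = {
    "VOICE_PLATFORM_CLOUD_A": True,
    "VOICE_PLATFORM_CLOUD_B": True,
    "VOICE_PLATFORM_CLOUD_C": False,  # trimmed from this build
    "DEFAULT_VOICE_PLATFORM": "CLOUD_A",
}


def enabled_platforms(config: Dict[str, ConfigValue]) -> List[str]:
    """Return the platforms whose switch is turned on, in sorted order."""
    return sorted(
        key[len(PLATFORM_PREFIX):]
        for key, value in config.items()
        if key.startswith(PLATFORM_PREFIX) and value is True
    )


def default_platform(config: Dict[str, ConfigValue]) -> str:
    """Return the default platform, rejecting one that was trimmed out."""
    default = config.get("DEFAULT_VOICE_PLATFORM")
    if default not in enabled_platforms(config):
        raise ValueError(f"default platform {default!r} is not enabled in this build")
    return str(default)
```

Validating the default against the enabled set is a design choice of this sketch: it catches, at build time, a configuration whose default platform has been trimmed from the software.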
In the embodiment of the application, the intelligent terminal can switch to another voice platform while connected to one voice platform. For example, while connected to the first voice platform, the intelligent terminal can use the voice services provided by the first voice platform. To ensure that the intelligent terminal can switch to other voice platforms while connected to the first voice platform, the first voice platform can be set to a monitoring mode. After the connection with the first voice platform is established, the first voice platform provides voice services and, in the monitoring mode, an instruction for switching to another voice platform can be monitored.
Step S202, in the monitoring mode of the first voice platform, if an instruction for switching to a second voice platform is monitored, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform, wherein the second voice platform is different from the first voice platform.
In the embodiment of the application, an instruction for switching to another voice platform can be monitored in the monitoring mode of the first voice platform. If an instruction for switching to the second voice platform is monitored, the interface provided by the framework layer of the intelligent terminal is controlled to be converted into an interface adapted to the second voice platform. This conversion is performed by the adaptation layer in the system architecture provided in the embodiment shown in FIG. 1. The second voice platform can be any voice platform, other than the first voice platform, among the voice platforms distributed on the voice platform layer.
For example, assume that cloud A, cloud B, cloud C and cloud D are distributed on the voice platform layer. If the intelligent terminal is currently connected to cloud B, cloud B is the first voice platform, and each of cloud A, cloud C and cloud D can serve as the second voice platform. If an instruction for switching to cloud A is monitored, the interface provided by the framework layer of the intelligent terminal is controlled to be converted into an interface adapted to cloud A; if an instruction for switching to cloud C is monitored, the interface is converted into an interface adapted to cloud C; and if an instruction for switching to cloud D is monitored, the interface is converted into an interface adapted to cloud D.
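The flow in this example can be sketched as a small state holder. The sketch assumes that the terminal tracks three pieces of state (the connected platform, whether monitoring mode is on, and which platform the interface is currently adapted to); the class `IntelligentTerminal` and its method names are illustrative only, and keeping monitoring mode on after a switch is likewise an assumption.

```python
from typing import Iterable


class IntelligentTerminal:
    """Illustrative model of the switching flow; not the patented implementation."""

    def __init__(self, platforms: Iterable[str], connected: str) -> None:
        self.platforms = set(platforms)
        if connected not in self.platforms:
            raise ValueError(f"unknown platform: {connected!r}")
        self.connected = connected          # the first voice platform
        self.adapted_interface = connected  # platform the interface is adapted to
        self.monitoring = False

    def enter_monitoring_mode(self) -> None:
        """Step S201: put the currently connected platform into monitoring mode."""
        self.monitoring = True

    def on_switch_instruction(self, target: str) -> bool:
        """Step S202 and the connection step: switch to `target` if it is valid."""
        if not self.monitoring:
            return False  # instructions are only monitored in monitoring mode
        if target not in self.platforms or target == self.connected:
            return False  # the second platform must exist and differ from the first
        self.adapted_interface = target  # adaptation layer converts the interface
        self.connected = target          # connect through the converted interface
        return True


terminal = IntelligentTerminal(["cloud A", "cloud B", "cloud C", "cloud D"], "cloud B")
terminal.enter_monitoring_mode()
switched = terminal.on_switch_instruction("cloud C")
```

In this run the terminal starts on cloud B, so cloud A, cloud C and cloud D are all valid targets, while a request for cloud B itself or for an unknown platform is ignored.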
As another embodiment of the present application, controlling the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform if an instruction for switching to the second voice platform is monitored includes:
receiving voice information of a user and analyzing the voice information;
if the analyzed voice information is an instruction for switching to the second voice platform, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform;
or,
if an instruction for switching to the second voice platform sent by a key on the intelligent terminal is monitored, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform;
or,
if an instruction for switching to the second voice platform sent by a mobile terminal interconnected with the intelligent terminal is monitored, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform;
or,
if an instruction is detected that matches an instruction, configured in advance in the intelligent terminal, for starting the second voice platform, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform.
In this embodiment of the application, while the intelligent terminal is connected to the first voice platform and the first voice platform is in the monitoring mode, if the user utters the voice information "please switch to cloud C" in the working environment of the intelligent terminal, the voice information is analyzed. If the analysis indicates a switch to cloud C, the interface provided by the framework layer of the intelligent terminal is controlled to be converted into an interface adapted to cloud C. The voice information may be "please switch to cloud C", "use cloud C", or the like; analyzing it yields an instruction for switching to cloud C.
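Assuming the voice information has already been transcribed to text, the analysis step could be as simple as a pattern match, sketched below. The application does not say how the analysis is performed; the regular expression, the function name `parse_switch_instruction`, and the restriction to clouds A through D are assumptions for illustration.

```python
import re
from typing import Optional

# Matches phrases such as "please switch to cloud C" or "use cloud C".
SWITCH_PATTERN = re.compile(r"\b(?:switch to|use)\s+(cloud [a-d])\b", re.IGNORECASE)


def parse_switch_instruction(text: str) -> Optional[str]:
    """Return the target platform named in `text`, or None if there is none."""
    match = SWITCH_PATTERN.search(text)
    if match is None:
        return None
    # Normalize the platform letter to upper case, e.g. "cloud c" -> "cloud C".
    return "cloud " + match.group(1)[-1].upper()
```

For example, both `parse_switch_instruction("please switch to cloud C")` and `parse_switch_instruction("use cloud c")` yield `"cloud C"`, while an unrelated utterance yields `None` and triggers no switch.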
Certainly, in practical application, the intelligent terminal may further be provided with a physical key, the virtual key is displayed through the visual interface, and the instruction for switching to the second voice platform is sent through the physical key, the combination of the virtual key and the virtual key, and the combination of the physical key and the virtual key. For example, a physical button "switch" is set on the smart terminal, after the user clicks the physical button "switch", a plurality of selectable voice platforms are displayed through the visual interface, and the user can send an instruction for switching to the second voice platform (cloud D) by clicking a virtual button corresponding to one of the cloud D in the visual interface.
Certainly, in practical applications, the intelligent terminal may be connected to a wireless local area network, the mobile phone of the user may also be connected to the wireless local area network, application software corresponding to the intelligent terminal is installed in the mobile phone of the user, and the user may send an instruction to switch to the second voice platform through the application software in the mobile phone. The intelligent terminal can be a remote control device which is matched with the intelligent terminal and is not a mobile phone, and an instruction for switching to the second voice platform is sent out through the remote control device.
A default voice platform may be preset when the intelligent terminal leaves the factory. For example, the default voice platform is set to cloud A, so cloud A is the voice platform matched with the intelligent terminal in advance. If the voice platform currently connected to the intelligent terminal is cloud B, then cloud B is the first voice platform; when the voice platform matched with the intelligent terminal in advance is detected to be the second voice platform (cloud A), an instruction for switching to the second voice platform is sent. For example, a button may be provided, and the user may press the button to make the intelligent terminal search for the pre-matched voice platform.
As another embodiment of the present application, if an instruction to switch to the second voice platform is monitored, controlling the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform includes:
acquiring the network environment in which the intelligent terminal is located, and determining a voice platform matched with the intelligent terminal based on the network environment;
if the voice platform matched with the intelligent terminal is the second voice platform, controlling the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform;
or,
acquiring the voice category received in the working environment in which the intelligent terminal is located, and determining a voice platform matched with the intelligent terminal based on the voice category;
and if the voice platform matched with the intelligent terminal is the second voice platform, controlling the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform.
The foregoing embodiments have illustrated several ways of monitoring an instruction to switch to the second voice platform: voice information, a key, an interconnected mobile terminal, and a default second voice platform. All of these, however, require the user's participation. In practical applications, different network environments may instead be configured to match different voice platforms. For example, when the network to which the intelligent terminal is connected is detected to be a network in China, a voice platform provided by a Chinese service provider may be matched; when the network is detected to be a network in the United States, a voice platform provided by a service provider there may be matched. Thus, if the network connected to the intelligent terminal is a network in China, the second voice platform may be determined to be a voice platform provided by a Chinese service provider, or a voice platform provided by a service provider of another country for Chinese users, for example, airy cloud or 360 cloud. When several candidates match, one of them is selected, for example, airy cloud or 360 cloud.
In practical applications, the voice platform matched with the intelligent terminal may also be determined from the voice category received in the working environment in which the intelligent terminal is located. For example, if the voice information received while the intelligent terminal is in use is the Chinese greeting "你好", the voice category can be determined to be Chinese, and the terminal can switch to a voice platform provided for Chinese users; if the received voice information is "hello", the voice category can be determined to be English, and the terminal can switch to a voice platform provided for English users.
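Both automatic matching strategies can be sketched as simple lookups. The following is only an illustration under stated assumptions: the region codes, the platform identifiers `platform_cn` and `platform_us`, and the character-range language check are all hypothetical stand-ins, since the application does not specify how the network environment or the voice category is detected.

```python
from typing import Optional

# Hypothetical mappings; the application names no concrete identifiers.
PLATFORM_BY_REGION = {"CN": "platform_cn", "US": "platform_us"}
PLATFORM_BY_VOICE_CATEGORY = {"zh": "platform_cn", "en": "platform_us"}


def match_by_network(region_code: str) -> Optional[str]:
    """Match a voice platform from the region of the connected network."""
    return PLATFORM_BY_REGION.get(region_code)


def detect_voice_category(transcript: str) -> Optional[str]:
    """Crude stand-in for language identification on transcribed speech."""
    if any("\u4e00" <= ch <= "\u9fff" for ch in transcript):
        return "zh"  # contains CJK unified ideographs
    if any(ch.isascii() and ch.isalpha() for ch in transcript):
        return "en"  # contains ASCII letters and no CJK characters
    return None


def match_by_voice(transcript: str) -> Optional[str]:
    """Match a voice platform from the voice category of an utterance."""
    category = detect_voice_category(transcript)
    if category is None:
        return None
    return PLATFORM_BY_VOICE_CATEGORY.get(category)


def should_switch(current_platform: str, matched_platform: Optional[str]) -> bool:
    """Switch only when a match exists and differs from the first voice platform."""
    return matched_platform is not None and matched_platform != current_platform
```

In this sketch the switch is triggered only when the matched platform differs from the currently connected one, which corresponds to the condition that the matched platform is the second voice platform.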
Step S203: establishing a connection with the second voice platform through the converted interface.
In this embodiment of the present application, once the interface provided by the framework layer of the intelligent terminal has been converted into the interface adapted to the second voice platform, the intelligent terminal can connect to the second voice platform through the converted interface, and the second voice platform then provides voice services for the intelligent terminal.
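The relationship between interface conversion and connection can be sketched with a small adapter table. This is a minimal illustration, not the application's implementation: the request fields, the payload shapes for "cloud A" and "cloud B", and the class name `FrameworkLayer` are assumptions, and the "connection" is reduced to recording which platform is active.

```python
from typing import Callable, Dict, Tuple

Request = Dict[str, str]
Adapter = Callable[[Request], Dict[str, str]]


def adapt_for_cloud_a(request: Request) -> Dict[str, str]:
    # Hypothetical payload shape expected by "cloud A".
    return {"query": request["text"]}


def adapt_for_cloud_b(request: Request) -> Dict[str, str]:
    # Hypothetical payload shape expected by "cloud B".
    return {"utterance": request["text"], "locale": request.get("locale", "en-US")}


# Adaptation layer: one converter per supported voice platform.
ADAPTERS: Dict[str, Adapter] = {
    "cloud A": adapt_for_cloud_a,
    "cloud B": adapt_for_cloud_b,
}


class FrameworkLayer:
    """Holds the interface currently exposed to the connected voice platform."""

    def __init__(self, first_platform: str) -> None:
        self._adapter = ADAPTERS[first_platform]
        self.connected_platform = first_platform

    def switch(self, second_platform: str) -> None:
        if second_platform not in ADAPTERS:
            raise ValueError(f"no adapter for {second_platform!r}")
        # Convert the interface into one adapted to the second platform.
        self._adapter = ADAPTERS[second_platform]
        # Stand-in for establishing the connection through the converted interface.
        self.connected_platform = second_platform

    def send(self, request: Request) -> Tuple[str, Dict[str, str]]:
        """Return the target platform and the request in that platform's format."""
        return self.connected_platform, self._adapter(request)
```

The same framework-layer request produces a different payload after `switch`, which is the point of the conversion: the application code above the framework layer does not change when the voice platform does.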
As another embodiment of the present application, after the connection is established with the second voice platform through the converted interface, the second voice platform is controlled to enter a monitoring mode, and the first voice platform is controlled to enter a sleep mode.
In this embodiment of the present application, so that the intelligent terminal can later switch from the second voice platform to yet another voice platform, the second voice platform must be placed in the monitoring mode once the connection with it is established; at the same time, the first voice platform is placed in the sleep mode.
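The mode bookkeeping described here amounts to keeping exactly one platform in the monitoring mode. The sketch below assumes that platforms which have never been connected start in the sleep mode; the names `Mode` and `PlatformModes` are hypothetical.

```python
from enum import Enum
from typing import Dict, Iterable


class Mode(Enum):
    MONITORING = "monitoring"
    SLEEP = "sleep"


class PlatformModes:
    """Tracks which voice platform is monitoring and which are asleep."""

    def __init__(self, platforms: Iterable[str], connected: str) -> None:
        # Assumption: every platform other than the connected one starts asleep.
        self.modes: Dict[str, Mode] = {name: Mode.SLEEP for name in platforms}
        if connected not in self.modes:
            raise KeyError(connected)
        self.modes[connected] = Mode.MONITORING
        self.connected = connected

    def switch_to(self, target: str) -> None:
        if target not in self.modes:
            raise KeyError(target)
        if target == self.connected:
            return  # already connected; nothing to change
        previous = self.connected
        # The newly connected platform starts monitoring for the next switch.
        self.modes[target] = Mode.MONITORING
        # The previously connected platform goes to sleep.
        self.modes[previous] = Mode.SLEEP
        self.connected = target
```

Because the newly connected platform is always left in the monitoring mode, a further switch instruction can be handled in the same way as the first one.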
This embodiment of the present application describes, from the side of the intelligent terminal and based on the system architecture provided in the embodiment shown in fig. 1, how a method for switching between different voice platforms is implemented: while the intelligent terminal is operating, it is controlled to switch from one voice platform to another, so that the intelligent terminal can switch among multiple voice platforms.
It should be understood that, the sequence numbers of the steps in the foregoing embodiments do not imply an execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiments of the present application.
Fig. 3 is a schematic block diagram of an intelligent terminal provided in an embodiment of the present application, and for convenience of description, only a part related to the embodiment of the present application is shown.
The intelligent terminal 3 may be a software unit, a hardware unit, or a combined software and hardware unit built into an existing intelligent terminal such as a smart speaker or a smart watch; it may be integrated into such an existing intelligent terminal as an independent add-on; or it may exist as an independent intelligent terminal.
The intelligent terminal 3 includes:
the monitoring module 31 is configured to control the currently connected first voice platform to enter a monitoring mode in a state of being connected to the first voice platform;
a conversion module 32, configured to, in a monitoring mode of the first voice platform, if an instruction to switch to a second voice platform is monitored, control an interface provided by a framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform, where the second voice platform is different from the first voice platform;
and a switching module 33, configured to establish a connection with the second voice platform through the converted interface.
Optionally, the intelligent terminal 3 further includes:
a control module 34, configured to control the second voice platform to enter a monitoring mode, and control the first voice platform to enter a sleep mode.
Optionally, the conversion module 32 is further configured to:
receive voice information of a user and analyze the voice information;
if the analyzed voice information is an instruction for switching to the second voice platform, control the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform;
or,
if an instruction for switching to the second voice platform sent by a key on the intelligent terminal is monitored, control the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform;
or,
if an instruction for switching to the second voice platform sent by a mobile terminal interconnected with the intelligent terminal is monitored, control the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform;
or,
if a detected instruction matches the instruction, preconfigured in the intelligent terminal, for starting the second voice platform, control the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform.
Optionally, the conversion module 32 is further configured to:
acquire the network environment in which the intelligent terminal is located, and determine a voice platform matched with the intelligent terminal based on the network environment;
if the voice platform matched with the intelligent terminal is the second voice platform, control the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform;
or,
acquire the voice category received in the working environment in which the intelligent terminal is located, and determine a voice platform matched with the intelligent terminal based on the voice category;
and if the voice platform matched with the intelligent terminal is the second voice platform, control the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second voice platform.
It should be clearly understood by those skilled in the art that, for convenience and simplicity of description, the foregoing division of the functional units and modules is merely used as an example, and in practical applications, the foregoing function distribution may be performed by different functional units and modules as needed, that is, the internal structure of the intelligent terminal is divided into different functional units or modules to perform all or part of the above-described functions. Each functional unit and module in the embodiments may be integrated in one processing unit, or each unit may exist alone physically, or two or more units are integrated in one unit, and the integrated unit may be implemented in a form of hardware, or in a form of software functional unit. In addition, specific names of the functional units and modules are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present application. The specific working processes of the units and modules in the above-mentioned apparatus may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
Fig. 4 is a schematic block diagram of another intelligent terminal provided in the embodiment of the present application. As shown in fig. 4, the intelligent terminal 4 of this embodiment includes: one or more processors 40, a memory 41, and a computer program 42 stored in the memory 41 and executable on the processors 40. The processor 40 executes the computer program 42 to implement the steps of the above-mentioned embodiments of the switching method for multiple voice platforms, such as the steps S201 to S203 shown in fig. 2. Alternatively, the processor 40, when executing the computer program 42, implements the functions of the modules/units in the intelligent terminal embodiment, such as the functions of the modules 31 to 33 shown in fig. 3.
Illustratively, the computer program 42 may be partitioned into one or more modules/units that are stored in the memory 41 and executed by the processor 40 to carry out the present application. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions, which describe the execution process of the computer program 42 in the intelligent terminal 4. For example, the computer program 42 may be partitioned into a monitoring module, a conversion module, and a switching module.
The monitoring module is configured to control the currently connected first cloud platform to enter a monitoring mode in a state of being connected to the first cloud platform;
the conversion module is configured to, in the monitoring mode of the first cloud platform, if an instruction for switching to a second cloud platform is monitored, control the interface provided by the framework layer of the intelligent terminal to be converted into an interface adapted to the second cloud platform, where the second cloud platform is a platform in the cloud platform layer other than the first cloud platform;
and the switching module is configured to establish a connection with the second cloud platform through the converted interface.
Other modules or units can refer to the description of the embodiment shown in fig. 3, and are not described again here.
The intelligent terminal includes, but is not limited to, a processor 40 and a memory 41. Those skilled in the art will appreciate that fig. 4 is only one example of an intelligent terminal 4 and does not constitute a limitation of the intelligent terminal 4, which may include more or fewer components than those shown, combine some components, or use different components. For example, the intelligent terminal may also include input devices, output devices, network access devices, buses, and the like.
The processor 40 may be a Central Processing Unit (CPU), another general-purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field-Programmable Gate Array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, or the like. A general-purpose processor may be a microprocessor or any conventional processor.
The memory 41 may be an internal storage unit of the intelligent terminal 4, such as a hard disk or a memory of the intelligent terminal 4. The memory 41 may also be an external storage device of the intelligent terminal 4, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like, which are equipped on the intelligent terminal 4. Further, the memory 41 may also include both an internal storage unit and an external storage device of the smart terminal 4. The memory 41 is used for storing the computer program and other programs and data required by the intelligent terminal. The memory 41 may also be used to temporarily store data that has been output or is to be output.
In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and reference may be made to the related descriptions of other embodiments for parts that are not described or illustrated in a certain embodiment.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
In the embodiments provided in the present application, it should be understood that the disclosed intelligent terminal and method may be implemented in other ways. For example, the above-described embodiments of the intelligent terminal are merely illustrative, and for example, the division of the modules or units is only one logical function division, and there may be other divisions when actually implementing, for example, a plurality of units or components may be combined or may be integrated into another system, or some features may be omitted or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated modules/units, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer-readable storage medium. Based on such understanding, all or part of the flow in the methods of the embodiments described above can be realized by a computer program, which can be stored in a computer-readable storage medium and which, when executed by a processor, realizes the steps of the method embodiments described above. The computer program comprises computer program code, which may be in the form of source code, object code, an executable file, or some intermediate form. The computer-readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a USB flash drive, a removable hard disk, a magnetic disk, an optical disk, a computer memory, a Read-Only Memory (ROM), a Random Access Memory (RAM), an electrical carrier signal, a telecommunications signal, a software distribution medium, and the like. It should be noted that the content of the computer-readable medium may be increased or decreased as required by legislation and patent practice in a given jurisdiction; for example, in some jurisdictions, in accordance with legislation and patent practice, computer-readable media do not include electrical carrier signals and telecommunications signals.
The above-mentioned embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; such modifications and substitutions do not substantially depart from the spirit and scope of the embodiments of the present application and are intended to be included within the scope of the present application.

Claims (10)

1. A system architecture for supporting multiple voice platforms, comprising:
the system comprises a voice service layer, an adaptation layer and a framework layer of an intelligent terminal;
the voice service layer comprises a plurality of voice service controls, and the voice service controls respectively correspond to the voice platforms and are communicated with the corresponding voice platforms, so that the voice platforms provide voice services for the intelligent terminal;
the adaptation layer is used for converting an interface of a framework layer of the intelligent terminal into an interface adapted to a corresponding voice platform so as to establish communication connection with a voice service control corresponding to the voice platform;
and the framework layer of the intelligent terminal is used for transmitting the requirements of the intelligent terminal to the corresponding voice platform through an interface and receiving the corresponding voice service which is provided by the voice platform and corresponds to the requirements through the interface.
2. The system architecture of claim 1, further comprising:
and the general software component is used for providing a uniform software foundation for communication among all layers of the system architecture.
3. The system architecture that supports multi-voice platform switching according to claim 2, wherein the generic software component comprises: general middleware, general network communication protocols, and general software programming languages.
The system architecture further comprises: an operating system layer and a hardware layer;
the operating system layer comprises an operating system running in the intelligent terminal and is used for providing a software execution environment;
the hardware layer comprises hardware in the intelligent terminal and is used for providing a carrier for software running in the intelligent terminal.
4. A switching method of a multi-voice platform is applied to an intelligent terminal and comprises the following steps:
under the state of being connected with a first voice platform, controlling the first voice platform which is currently connected to enter a monitoring mode;
in the monitoring mode of the first voice platform, if an instruction for switching to a second voice platform is monitored, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adaptive to the second voice platform, wherein the second voice platform is different from the first voice platform;
and establishing connection with the second voice platform through the converted interface.
5. The method for switching between multiple voice platforms according to claim 4, wherein after establishing a connection with the second voice platform through the converted interface, the switching method further comprises:
and controlling the second voice platform to enter a monitoring mode, and controlling the first voice platform to enter a sleep mode.
6. The method for switching between multiple voice platforms according to claim 4, wherein the controlling the interface provided by the framework layer of the smart terminal to be converted into the interface adapted to the second voice platform if the instruction for switching to the second voice platform is monitored comprises:
receiving voice information of a user and analyzing the voice information;
if the analyzed voice information is an instruction for switching to the second voice platform, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adaptive to the second voice platform;
or,
if an instruction for switching to the second voice platform sent by a key on the intelligent terminal is monitored, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adaptive to the second voice platform;
or,
if an instruction for switching to the second voice platform sent by a mobile terminal interconnected with the intelligent terminal is monitored, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adaptive to the second voice platform;
or,
and if an instruction for switching to the second voice platform is monitored, wherein the instruction is sent by a server interconnected with the intelligent terminal, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adaptive to the second voice platform.
7. The method for switching between multiple voice platforms according to claim 4, wherein the controlling the interface provided by the framework layer of the smart terminal to be converted into the interface adapted to the second voice platform if the instruction for switching to the second voice platform is monitored comprises:
acquiring a network environment where the intelligent terminal is located, and determining a voice platform matched with the intelligent terminal based on the network environment;
if the voice platform matched with the intelligent terminal is the second voice platform, controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adaptive to the second voice platform;
or,
acquiring the voice category received in the working environment where the intelligent terminal is located, and determining a voice platform matched with the intelligent terminal based on the voice category;
and if the voice platform matched with the intelligent terminal is the second voice platform, controlling an interface provided by a frame layer of the intelligent terminal to be converted into an interface adaptive to the second voice platform.
8. An intelligent terminal, comprising:
the monitoring module is used for controlling the currently connected first voice platform to enter a monitoring mode in a state of being connected with the first voice platform;
the conversion module is used for controlling an interface provided by a framework layer of the intelligent terminal to be converted into an interface adaptive to a second voice platform if an instruction for switching to the second voice platform is monitored in the monitoring mode of the first voice platform, wherein the second voice platform is different from the first voice platform;
and the switching module is used for establishing connection with the second voice platform through the converted interface.
9. An intelligent terminal comprising a memory, a processor and a computer program stored in the memory and executable on the processor, characterized in that the steps of the method according to any of claims 4 to 7 are implemented when the computer program is executed by the processor.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which, when executed by one or more processors, implements the steps of the method according to any one of claims 4 to 7.
CN201811204114.2A 2018-10-16 2018-10-16 System architecture, multi-voice platform switching method, intelligent terminal and storage medium Pending CN111147530A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811204114.2A CN111147530A (en) 2018-10-16 2018-10-16 System architecture, multi-voice platform switching method, intelligent terminal and storage medium

Publications (1)

Publication Number Publication Date
CN111147530A true CN111147530A (en) 2020-05-12

Family

ID=70515091

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811204114.2A Pending CN111147530A (en) 2018-10-16 2018-10-16 System architecture, multi-voice platform switching method, intelligent terminal and storage medium

Country Status (1)

Country Link
CN (1) CN111147530A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102845128A (en) * 2010-04-28 2012-12-26 惠普发展公司,有限责任合伙企业 Techniques to provide integrated voice service management
US20150295731A1 (en) * 2014-04-15 2015-10-15 Cisco Technology, Inc. Programmable infrastructure gateway for enabling hybrid cloud services in a network environment
CN105677561A (en) * 2015-12-31 2016-06-15 广州华多网络科技有限公司 Mobile product multi-platform testing method and system
CN107632720A (en) * 2017-03-08 2018-01-26 深圳市中易腾达科技股份有限公司 A kind of multifunction speech keyboard and application system
CN107993657A (en) * 2017-12-08 2018-05-04 广东思派康电子科技有限公司 A kind of switching method based on multiple voice assistant platforms
CN108011961A (en) * 2017-12-07 2018-05-08 携程旅游信息技术(上海)有限公司 Voice interactive system based on mixing cloud service

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111917726A (en) * 2020-07-01 2020-11-10 中国建设银行股份有限公司 Adaptation layer, voice communication system and control method thereof
CN111917726B (en) * 2020-07-01 2022-03-15 中国建设银行股份有限公司 Adaptation layer, voice communication system and control method thereof
CN112581957A (en) * 2020-12-04 2021-03-30 浪潮电子信息产业股份有限公司 Computer voice control method, system and related device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination