WO2020233074A1 - 移动终端的控制方法、装置、移动终端及可读存储介质 - Google Patents

移动终端的控制方法、装置、移动终端及可读存储介质 Download PDF

Info

Publication number
WO2020233074A1
WO2020233074A1 PCT/CN2019/122033 CN2019122033W WO2020233074A1 WO 2020233074 A1 WO2020233074 A1 WO 2020233074A1 CN 2019122033 W CN2019122033 W CN 2019122033W WO 2020233074 A1 WO2020233074 A1 WO 2020233074A1
Authority
WO
WIPO (PCT)
Prior art keywords
party
target
application
function
target application
Prior art date
Application number
PCT/CN2019/122033
Other languages
English (en)
French (fr)
Inventor
付铮
Original Assignee
深圳壹账通智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳壹账通智能科技有限公司 filed Critical 深圳壹账通智能科技有限公司
Publication of WO2020233074A1 publication Critical patent/WO2020233074A1/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • H04M2201/405Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition involving speaker-dependent recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2250/00Details of telephonic subscriber devices
    • H04M2250/74Details of telephonic subscriber devices with voice recognition means

Definitions

  • This application relates to the field of artificial intelligence technology, and in particular to a method and device for controlling a mobile terminal, a mobile terminal and a readable storage medium.
  • the voice control function has become an important function of the mobile terminal; when the user is inconvenient to control the mobile terminal by manual operation, the user can send relevant voice commands to the mobile terminal by voice.
  • the mobile terminal is allowed to perform corresponding task processing, thereby providing convenience for users.
  • the existing voice control function has certain shortcomings.
  • the current voice control function is generally a function of the native system of the terminal. Therefore, when performing voice control, the mobile terminal generally provides corresponding functions through the system's own functional components. Service, for example, when the user asks the terminal to play music through voice, the terminal plays the music through the player function that comes with the system, that is, the voice control function does not integrate well with third-party applications (app); if the user wants To control a third-party application by voice, the user needs to start the third-party application manually, and then start the voice function provided by the third-party application itself, in order to realize the voice control function, which brings users inconvenient.
  • the main purpose of this application is to provide a mobile terminal control method, device, mobile terminal, and readable storage medium, aiming to solve the technical problem of low efficiency of existing voice control third-party applications.
  • the present application provides a control method of a mobile terminal, the control method of the mobile terminal is applied to a mobile terminal, and the control method of the mobile terminal includes:
  • the real-time displacement speed is greater than the preset speed threshold, acquiring a range image within a preset range through a camera of the mobile terminal, and determining whether there is a user image of the preset target user in the range image;
  • voice control mode if voice information is received, perform voiceprint analysis on the voice information to determine whether the voice information comes from a preset target user;
  • the corresponding third-party target invocation rule is determined according to the application type of the third-party target application, and the third-party target application is invoked based on the third-party target invocation rule to start the target function of the third-party target application.
  • control device for a mobile terminal includes:
  • the speed detection module is used to detect the real-time displacement speed of the mobile terminal and determine whether the real-time displacement speed is greater than a preset speed threshold;
  • An image judgment module configured to obtain a range image within a preset range through the camera of the mobile terminal if the real-time displacement speed is greater than the preset speed threshold, and determine whether the preset range image exists in the range image User image of the target user;
  • a mode entry module configured to enter the voice control mode if there is a user image of the preset target user in the range image
  • the voice analysis module is configured to perform voiceprint analysis on the voice information if voice information is received when in the voice control mode, and determine whether the voice information comes from a preset target user;
  • An information determining module configured to determine the corresponding third-party target application and target function according to the voice information if it is determined that the voice information comes from the preset target user;
  • the application invocation module is used to determine the corresponding third-party target invocation rule according to the application type of the third-party target application, and call the third-party target application based on the third-party target invocation rule to start the third-party target application Target function.
  • the present application also provides a mobile terminal, wherein the mobile terminal includes a processor, a memory, and computer-readable instructions stored on the memory and executable by the processor, wherein When the computer-readable instructions are executed by the processor, the steps of the above-mentioned mobile terminal control method are realized.
  • the present application also provides a readable storage medium having computer-readable instructions stored on the storage medium, and when the computer-readable instructions are executed by a processor, the above-mentioned mobile terminal Control method steps.
  • FIG. 1 is a schematic diagram of the hardware structure of a mobile terminal involved in a solution of an embodiment of the application
  • FIG. 2 is a schematic flowchart of a first embodiment of a method for controlling a mobile terminal according to this application.
  • the mobile terminal control method involved in the embodiments of the present application is mainly applied to a mobile terminal, and the mobile terminal may be a mobile phone, a tablet computer, a palmtop computer, a wearable device, and other devices with data processing functions.
  • FIG. 1 is a schematic diagram of the hardware structure of the mobile terminal involved in the solution of the embodiment of the application.
  • the mobile terminal may include a processor 1001 (for example, a central processing unit) Processing Unit, CPU), communication bus 1002, user interface 1003, network interface 1004, memory 1005.
  • processor 1001 for example, a central processing unit
  • CPU Central Processing Unit
  • the communication bus 1002 is used to realize the connection and communication between these components;
  • the user interface 1003 may include a display (Display), an input unit such as a keyboard (Keyboard);
  • the network interface 1004 may optionally include a standard wired interface, a wireless interface (Such as wireless fidelity WIreless-FIdelity, WI-FI interface);
  • the memory 1005 can be a high-speed random access memory (random access memory, RAM), or stable memory (non-volatile memory), such as a disk memory.
  • the memory 1005 may optionally be a storage device independent of the aforementioned processor 1001.
  • the memory 1005 as a computer-readable storage medium may include an operating system, a network communication module, and computer-readable instructions; the network communication module is mainly used to connect to a database and communicate data with the database; and the processor 1001 may call the storage in the memory 1005 And execute the control method of the mobile terminal provided in the embodiment of the present application.
  • the embodiment of the present application provides a method for controlling a mobile terminal.
  • FIG. 2 is a schematic flowchart of a first embodiment of a method for controlling a mobile terminal according to this application.
  • control method of the mobile terminal is applied to the mobile terminal, and the control method of the mobile terminal includes the following steps:
  • Step S10 When in the voice control mode, if voice information is received, perform voiceprint analysis on the voice information to determine whether the voice information comes from a preset target user;
  • the control method of the mobile terminal of this embodiment is applied to a mobile terminal, which may be a mobile phone, a tablet computer, a palmtop computer, a wearable device, etc.; for convenience of description, a mobile phone is used as an example for description in this embodiment.
  • a mobile terminal for the control method of the mobile terminal in this embodiment, it can be realized by means of a voice control application, that is, the voice control application can be pre-installed in the user's mobile phone, and the voice control application is used to realize the self-contained application and non-terminal system.
  • the centralized voice control of the third-party application of this voice control application avoids the user from manually starting a single third-party application before starting the voice function provided by the third-party application itself, thereby simplifying the third-party application
  • the single voice control operation process improves the efficiency of voice control third-party applications.
  • the voice control function can also be integrated in the mobile phone system itself.
  • the user’s mobile phone is also equipped with a microphone (or other sound signal collection device) to collect and receive the voice information sent by the user; of course, the mobile phone can also be wired or wireless with an external microphone (such as a headset and other equipment). ) Connection, the user can perform voice control through the external microphone.
  • the application interface includes a mode setting item for the user to choose to turn on or off the voice control mode; when the user chooses to turn on the voice control mode through the mode setting item of the voice control application
  • the mobile phone enters the voice control mode, and monitors whether the voice information is received through the microphone on the mobile phone.
  • the mobile phone receives voice information, it will perform voiceprint analysis (via voice control application) on the voice information to determine whether the voice information comes from a preset target user, that is, to determine whether the voice is a preset target
  • the user sends out; for the preset target user, it can be the owner of the phone, or another user who has the authority to perform voice control on the mobile phone.
  • the mobile phone can perform operations such as the next voice semantic analysis, that is, step S20; and if the voice information does not originate from the preset target user, the voice information can be considered It is made by a user without voice control authority, or environmental noise, at this time, the phone will not feedback the voice information; through the above method, it avoids unconscious misoperation caused by other people or environmental voices, and is also beneficial to improve the phone’s voice Control accuracy and safety.
  • voiceprint recognition on voice information can be implemented based on a pre-trained voiceprint recognition model, or a voiceprint recognition SDK (Software Development Kit) provided by other third parties. That is, the voiceprint recognition model or voiceprint recognition SDK is integrated in the local voice control application of the mobile phone, thereby improving the efficiency of voiceprint recognition.
  • a voiceprint recognition SDK Software Development Kit
  • the preset target user can record his sample voice into the voice control application of the mobile phone through the microphone of the mobile phone in advance, so that the mobile phone pre-stores the sample voice of the preset target user; for the sample voice, the mobile phone can pass the pre-passed
  • the voiceprint recognition model is obtained by machine learning or other methods to extract the sample features; and when the mobile phone receives the voice information, it will also extract the voice features from the voiceprint recognition model, and then compare the two. When the similarity between the two reaches a certain threshold, it is considered that the voice information and the sample voice originate from the same user, that is, the voice information originates from a preset target user.
  • the voiceprint recognition is realized through the voiceprint recognition SDK provided by a third party, the recognition process is similar to the above-mentioned recognition process through the voiceprint recognition model, and will not be repeated here.
  • the preset target user can be one; it can also be two or more (here "above” includes the number, the same below), that is, there can be multiple different users who have voice control of the mobile phone permission.
  • the above-mentioned process of voiceprint recognition of voice information can also be implemented through a cloud server, thereby reducing the consumption of mobile phone resources for voiceprint recognition, and also conducive to reducing the storage space occupied by voice control applications on mobile phones.
  • the preset target user can record his sample voice into the mobile phone through the mobile phone microphone in advance.
  • the mobile phone will send it to the cloud server corresponding to the voice control application, and the cloud server will store it; and the mobile phone is receiving
  • the voice information will be sent to the cloud server. Since the cloud server compares the two and returns the comparison result to the mobile phone, the mobile phone can judge whether the voice information comes from a pre-determination based on the comparison result.
  • Set target users can record his sample voice into the mobile phone through the mobile phone microphone in advance.
  • the mobile phone will send it to the cloud server corresponding to the voice control application, and the cloud server will store it; and the mobile phone is receiving When the voice information is received, the voice information will be sent to the cloud server. Since the cloud server compares the
  • Step S20 If it is determined that the voice information comes from the preset target user, determine the corresponding third-party target application and target function according to the voice information;
  • the mobile phone when the mobile phone determines that the voice information comes from a preset target user, the mobile phone will perform semantic recognition on the voice information, determine the third-party target application and target function corresponding to the voice information, that is, determine that the preset target user wants Launched third-party target applications and target functions.
  • the function of semantic recognition of speech information can also be realized by a semantic recognition model obtained through relevant machine learning, or by means of a semantic recognition SDK provided by a third party.
  • the preset voice message sent by the target user is "open the D application and navigate to X location"
  • the D application is a navigation application or map application provided by a third party
  • the mobile phone when the mobile phone receives the voice information, it can first recognize The operation keyword "open”, and the operation object "D application” corresponding to the operation keyword is determined as a third-party target application, and for “navigation” it is a function keyword, and the corresponding target can be determined according to the function key Function ("X location" is the specific function content or function object).
  • the third-party target application and target function can be directly determined from the voice information; or the target can be determined from the voice information first. Function, and then determine a third-party target application that can achieve the target function according to the target function. For example, when the voice information received by the mobile terminal is "Navigate to X location" or "Play G song", the target function is determined first, and the third-party target application is determined according to the target function.
  • the step of determining the corresponding third-party target application and target function according to the voice information includes:
  • the mobile phone When the mobile phone receives voice information, it first parses the voice information and extracts the corresponding functional keywords from the voice information.
  • the process of extracting this functional keyword can be achieved through the semantic recognition model as described above, or with the help of a semantic recognition SDK provided by a third party; it can also be the function word voice of several functional keywords pre-stored in the mobile phone.
  • Voice information compare the voice information with the function word voice, and determine whether there is a segment in the voice information that matches the function word voice (the similarity reaches a certain threshold); if it exists, the segment is the voice corresponding to the function keyword Segment, and further determine the corresponding functional keywords.
  • the functional keyword is “Navigation”; in “Play G songs”, the functional keyword is “Play” and so on.
  • the form of functional keywords may also be in the form of "verb + object” or other forms. For example, the entire paragraph of "play G song” is used as a functional keyword. Wait.
  • the corresponding target function is determined according to the function keyword, and the corresponding third-party target application is determined according to the target function.
  • the mobile phone When the mobile phone obtains the function keyword, it can determine the service that the voice message wants to start/execute according to the target keyword, that is, determine the target function; at this time, the mobile phone will use the target function from the installed third-party applications Determine the third-party target application that supports the target function.
  • the target function can be known as the navigation function according to the function keyword; according to the target function, the third-party target application can be further determined as the third-party D navigation application installed in the mobile phone.
  • the process of determining the third-party target application by the mobile phone according to the target function may be to first query and obtain the information of the third-party application installed in the current mobile phone, and then determine whether there is any third-party application that can support the installed third-party application based on the third-party application information.
  • the third-party optional application of the target function may be to first query and obtain the information of the third-party application installed in the current mobile phone, and then determine whether there is any third-party application that can support the installed third-party application based on the third-party application information.
  • the third-party optional application of the target function.
  • the third-party applications installed in the mobile phone include third-party D navigation applications, third-party T map applications, and third-party Y music applications; among these installed third-party applications, third-party D navigation applications and third-party T map applications Both can support (achieve) the target function (navigation), that is, there is a third-party optional application that can support the target function in the installed third-party application; at this time, the mobile phone can determine in the third-party optional application Third-party target application.
  • the mobile phone needs to perform network query through the network (including mobile data network, WIFI network, etc.), and download and install through the network to support the The third-party network application with the target function is then determined as the third-party target application, so as to ensure that the user can provide the required function and service.
  • network including mobile data network, WIFI network, etc.
  • the mobile phone will determine the third-party target application from these third-party optional applications; at this time, the mobile phone will determine the The number of third-party optional applications; if there is only one third-party optional application, the only third-party optional application can be directly determined as the third-party target application; and if the number of third-party optional applications is two If there are more than one, the mobile phone will determine a third-party target application from it according to certain rules.
  • the mobile phone can obtain the respective use frequency of these third-party optional applications (such as the number of uses in the last seven days), and use the highest frequency
  • the third-party optional applications of are determined as third-party target applications, so that the third-party target applications that are launched later can fit the user’s usage habits; of course, it can also be to obtain the latest update time (or install Time), and determine the third-party optional application with the latest update time as the third-party target application, so that the third-party target application that is launched subsequently can provide users with the latest functional services.
  • Step S30 Determine a corresponding third-party target invocation rule according to the application type of the third-party target application, and invoke the third-party target application based on the third-party target invocation rule to start the target function of the third-party target application.
  • the mobile phone when the mobile phone determines the third-party target application and target function, the mobile phone will call the third-party target application through certain third-party application calling rules, and start the target function of the third-party target application, and then according to the target
  • the execution result of the function is output accordingly, such as displaying the navigation route, playing music, etc.
  • the third-party call interface API Application Programming Interface, application programming interface
  • the third-party call interface API can also be realized in the way of automatic simulation of manual operation, of course, it can also be in other ways.
  • the mobile phone determines the third-party target application and target function, it can first determine the corresponding third-party target calling rule according to the third-party target application, and then call the third-party target application based on the third-party target calling rule. For example, the mobile phone can first determine whether the third-party target application provides a third-party invocation interface; if so, the third-party invocation interface is preferentially invoked; otherwise, the invocation can be realized by automatically simulating manual operations. For another example, the mobile phone may pre-set the priority invocation methods of different third-party applications, and then prioritize the invocation according to the set method when making third-party invocations.
  • the third-party application is required to provide the third-party calling interface and the calling interface specification;
  • the calling interface specification includes a related identifier template to construct a
  • the uniform resource identifier URI Uniform Resource Identifier, which is used to identify the name of a certain Internet resource, allows users to interact with any (including local and Internet) resources through a specific protocol)
  • URI Uniform Resource Identifier
  • the mobile phone When calling through a third-party calling interface, the mobile phone will first obtain the calling interface specification of the third-party target application, and obtain the corresponding identifier template according to the calling interface specification; then the mobile phone will according to the specific content of the target function and the calling
  • the interface specification fills in the content of the identifier template to construct the corresponding target identifier, for example, according to "Navigate to X location" and call interface specification to generate the corresponding functional character, and then fill the functional character to the identifier
  • the target identifier is obtained; the mobile phone can then input the target identifier into the third-party calling interface of the third-party target application to call the third-party target application and execute the target function of the third-party target application, and according to this The execution result of the target function is output accordingly.
  • Calling through the third-party calling interface can reduce the related function requirements of the voice control application (or the voice control function of the mobile phone).
  • the voice control application (or the mobile phone) does not need to pay attention to how the target function is implemented, but only needs to call the specification according to the interface
  • the corresponding call result can be obtained and the user can be provided with functional services without redeveloping, which reduces the implementation cost of voice control.
  • the mobile phone can first start the third-party target application, and then display the target application interface of the third-party target application on On the display. After displaying the target application interface, the mobile phone will recognize the target application interface and determine the function trigger area corresponding to the target function in the target application interface.
  • the corresponding recognition script ie recognition specification
  • the corresponding recognition script can be preset according to the typesetting mode of the target application interface, so that when the target application interface is displayed, the relevant page elements are identified according to the recognition script. To determine the function trigger area.
  • OCR Optical Character Recognition
  • a screenshot is taken when the target application interface is displayed, and related keywords are identified through OCR technology, so as to determine the corresponding function trigger area according to the keywords.
  • OCR Optical Character Recognition
  • the function trigger type of the function trigger area will also be determined, for example, by inputting relevant command characters and clicking the corresponding button to trigger the corresponding function instruction, or directly clicking a button to trigger the corresponding function instruction.
  • the phone When determining the function trigger type, the phone will call the corresponding operation control (such as input control, click control, etc.) according to the function trigger type, and perform related simulation operations in the function trigger area through the operation control to start the first
  • the target function of the three-party target application such as inputting a character in the input bar of the function trigger area through the input control, or clicking a function button in the function trigger area by clicking the space simulation; and then outputting accordingly according to the execution result.
  • Invoking third-party applications through the above-mentioned automated simulation of manual operations can achieve compatibility with different third-party applications to a certain extent.
  • Third-party applications can also be implemented in third-party applications without relying on interfaces for data import and export. , Or the seamless connection between the system and third-party applications, which is conducive to improving the stability of third-party calls made by mobile terminals and improving user experience.
  • the voice control applications installed in the mobile phone do not necessarily require all third-party Application related data (such as third-party application call interface specifications, automated simulation operation scripts, simulation use cases, etc.) are stored locally; that is, when the mobile phone provides voice control services through the voice control application, when determining the third-party target application and The target function can be sent to the third-party target application and target function to the voice application server, and the voice application server generates the relevant target identifier or automated simulation operation script and simulation use case based on the third target application and target function.
  • third-party Application related data such as third-party application call interface specifications, automated simulation operation scripts, simulation use cases, etc.
  • the mobile terminal in this embodiment When the mobile terminal in this embodiment is in the voice control mode, if it receives voice information, it performs voiceprint analysis on the voice information to determine whether the voice information comes from a preset target user; If the information comes from the preset target user, the corresponding third-party target application and target function are determined according to the voice information; the corresponding third-party target calling rule is determined according to the application type of the third-party target application, and based on the The third-party target invocation rule calls the third-party target application to start the target function of the third-party target application.
  • this embodiment can provide voice intelligent services when the user is inconvenient to manually operate the mobile terminal, so that the user can control the mobile terminal by voice, which provides convenience for the user; at the same time, in the voice control process, the mobile terminal also It can call third-party applications, provide users with corresponding functional services through third-party applications, expand the function coverage of voice control, and realize centralized voice for non-terminal system's own applications and third-party applications that are not voice control applications Control, avoid the user to start a single third-party application manually, and then start the voice function provided by the third-party application itself, thereby simplifying the operation process of single voice control for third-party applications and improving voice control
  • the efficiency of third-party applications further improves user experience; in addition, when third-party applications are called, they can be implemented through interface calls or simulated manual operations, which improves the compatibility between different applications to a certain extent and reduces the impact on the system. Or the modification of third-party applications will help improve the stability of the mobile terminal.
  • the method further includes:
  • Step S40 detecting the real-time displacement speed of the mobile terminal, and judging whether the real-time displacement speed is greater than a preset speed threshold;
  • the entry (starting) of the voice control mode of the mobile phone can also be a series of sensors (or devices) of the mobile phone to detect the surrounding environment.
  • the current environment is judged to be inconvenient for the user to manually operate the mobile phone according to the detection data , That is, automatically start the voice control application and enter the voice control mode, without the user's manual settings, so as to provide users with convenience.
  • it may be detected whether the user is in a driving state, and if so, the voice control mode is automatically entered.
  • the mobile phone can detect its real-time displacement speed through GPS or other equipment, and determine whether the real-time displacement speed is greater than a preset speed threshold; the preset speed threshold can be set according to the actual situation, for example, set to 10km/ h etc. If the real-time displacement speed of the mobile phone is greater than the preset speed threshold, it can be considered that the mobile phone is currently on the vehicle, and step S50 is entered at this time; and if the real-time displacement speed of the mobile phone is less than or equal to the preset speed threshold, the current mode is maintained constant.
  • Step S50 If the real-time displacement speed is greater than the preset speed threshold, obtain a range image within a preset range through the camera of the mobile terminal, and determine whether the preset target user is present in the range image.
  • the mobile phone will obtain the range image within the preset range through the camera; when the range image is obtained again, the range image can be identified to determine the range Whether there is a user image of the preset target user in the image; if there is a user image of the preset target user in the image in the range, it can be considered that the preset target user is currently using a mobile phone on a running vehicle, and then step S60 is entered; and If there is no user image of the preset target user in the image in the range, the current mode remains unchanged.
  • Step S60 if there is a user image of the preset target user in the range image, enter the voice control mode.
  • the preset target user is currently using a mobile phone on a running vehicle, and the mobile phone will automatically start the voice control application and enter the voice control Mode, the user can operate the mobile phone by voice, which provides convenience for the user.
  • the user may also use the mobile phone on the running subway, bus, or taxi.
  • the mobile phone can send out related voice inquiry messages at this time, such as "It is detected that you are using the mobile phone on a running vehicle, do you enter the voice mode", and then collect the user's reply voice, if the user answers "Yes” within the preset time ", it enters the voice control mode; if the user answers "No" within the preset time or the user's reply voice is not collected within the preset time, the current mode will remain unchanged.
  • the accuracy of environmental judgment can be further improved, thereby improving user experience.
  • an embodiment of the present application also provides a control device for a mobile terminal, and the control device for the mobile terminal includes:
  • the voice analysis module is configured to perform voiceprint analysis on the voice information if voice information is received when in the voice control mode, and determine whether the voice information comes from a preset target user;
  • An information determining module configured to determine the corresponding third-party target application and target function according to the voice information if it is determined that the voice information comes from the preset target user;
  • the application invocation module is used to determine the corresponding third-party target invocation rule according to the application type of the third-party target application, and call the third-party target application based on the third-party target invocation rule to start the third-party target application Target function.
  • each virtual function module of the control device of the above mobile terminal is stored in the memory 1005 of the mobile terminal shown in FIG. 1, and is used to implement all the functions of computer-readable instructions; when each module is executed by the processor 1001, the mobile terminal can be implemented The function of voice control.
  • the application calling module includes:
  • the template obtaining unit is configured to obtain the calling interface specification of the third-party target application, and obtain the corresponding identifier template according to the calling interface specification;
  • a template filling unit configured to fill the identifier template with content according to the target function and the calling interface specification, and construct a corresponding target identifier
  • the identifier input unit is used to input the target identifier into a third-party calling interface of the third-party target application to call the third-party target application and start the target function of the third-party target application.
  • the application calling module includes:
  • An interface display unit configured to start the third-party target application and display the target application interface of the third-party target application
  • An interface recognition unit configured to recognize the target application interface, and determine the function trigger area corresponding to the target function and the function trigger type of the function trigger area in the target application interface;
  • the operation simulation unit is configured to call a corresponding operation control according to the function trigger type, and perform a simulation operation in the function trigger area through the operation component to start the target function of the third-party target application.
  • control device of the mobile terminal further includes:
  • the speed detection module is used to detect the real-time displacement speed of the mobile terminal and determine whether the real-time displacement speed is greater than a preset speed threshold;
  • An image judgment module configured to obtain a range image within a preset range through the camera of the mobile terminal if the real-time displacement speed is greater than the preset speed threshold, and determine whether the preset range image exists in the range image User image of the target user;
  • the mode entry module is configured to enter the voice control mode if there is a user image of the preset target user in the range image.
  • the information determining module 20 includes:
  • the information analysis unit is used to analyze the voice information, and extract corresponding functional keywords from the voice information;
  • the application determining unit is configured to determine the corresponding target function according to the function keyword, and determine the corresponding third-party target application according to the target function.
  • the application determining unit includes:
  • the application query subunit is used to query the installed third-party applications in the mobile terminal, and determine whether there are third-party optional applications that support the target function among the installed third-party applications;
  • the first determining subunit is configured to determine the third-party target application in the third-party optional application if the third-party optional application exists in the installed third-party application;
  • the second determining subunit is configured to, if the third-party optional application does not exist in the installed third-party application, download and install the third-party network application that supports the target function through the network, and configure the third-party The network application is determined as a third-party target application.
  • the first determining subunit is specifically configured to determine the number of applications of the third-party optional application if the third-party optional application exists in the installed third-party application; If the number of selected applications is more than two, the third-party target application is determined in the third-party optional applications according to the respective use frequencies of the third-party optional applications.
  • each module in the above-mentioned mobile terminal control device corresponds to each step in the above-mentioned mobile terminal control method embodiment, and its functions and realization processes are not repeated here.
  • embodiments of the present application also provide a readable storage medium, and the computer-readable storage medium may be a non-volatile readable storage medium.
  • the readable storage medium of the present application stores computer readable instructions, and when the computer readable instructions are executed by a processor, the steps of the control method of the mobile terminal as described above are realized.

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)

Abstract

本申请涉及人工智能技术领域, 提供一种移动终端的控制方法、装置、移动终端及可读存储介质. 移动终端在处于语音控制模式时, 若接收到语音信息,则对所述语音信息进行声纹分析, 判断所述语音信息是否来源于预设目标用户; 若判断所述语音信息来源于所述预设目标用户, 则根据所述语音信息确定对应的第三方目标应用和目标功能; 根据所述第三方目标应用的应用类型确定对应的第三方目标调用规则, 并基于所述第三方目标调用规则调用所述第三方目标应用, 并启动所述第三方目标应用的目标功能. 本申请可基于人工智能的方式实现移动终端的语音控制功能, 并解决现有语音控制第三方应用效率低的技术问题, 为用户提供了方便.

Description

移动终端的控制方法、装置、移动终端及可读存储介质
本申请要求于2019年5月21日提交中国专利局、申请号为201910433466.3、发明名称为“移动终端的控制方法、装置、移动终端及可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及人工智能技术领域,尤其涉及一种移动终端的控制方法、装置、移动终端及可读存储介质。
背景技术
随着终端技术的不断发展,语音控制功能已成为移动终端的一项重要功能;用户在不方便通过手动操作的方式控制移动终端时,可以通过语音的方式向移动终端发出相关的语音指令,以使得移动终端进行相应的任务处理,从而为用户提供了方便。
但是,现有语音控制功能具有一定的缺陷,目前的语音控制功能一般是终端的原生系统所自带的功能,因此在进行语音控制时,移动终端一般是通过系统自带的功能组件提供相应的服务,例如当用户通过语音的方式要求终端播放音乐时,终端是通过系统自带的播放器功能播放音乐,也即该语音控制功能并不能很好地融合第三方应用(app);若用户希望通过语音方式控制第三方应用,则需要用户先通过手动操作的方式启动该第三方应用后,再启动该第三方应用本身所提供的语音功能,才能实现语音控制功能,这就为用户带来了不便。
发明内容
本申请的主要目的在于提供一种移动终端的控制方法、装置、移动终端及可读存储介质,旨在解决现有语音控制第三方应用效率低的技术问题。
为实现上述目的,本申请提供一种移动终端的控制方法,所述移动终端的控制方法应用于移动终端,所述移动终端的控制方法包括:
检测所述移动终端的实时位移速度,并判断所述实时位移速度是否大于预设速度阈值;
若所述实时位移速度大于所述预设速度阈值,则通过所述移动终端的摄像头获取预设范围内的范围图像,并判断所述范围图像中是否存在所述预设目标用户的用户图像;
若所述范围图像中存在所述预设目标用户的用户图像,则进入所述语音控制模式;
在处于语音控制模式时,若接收到语音信息,则对所述语音信息进行声纹分析,判断所述语音信息是否来源于预设目标用户;
若判断所述语音信息来源于所述预设目标用户,则根据所述语音信息确定对应的第三方目标应用和目标功能;
根据所述第三方目标应用的应用类型确定对应的第三方目标调用规则,并基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能。
此外,为实现上述目的,本申请还提供一种移动终端的控制装置,所述移动终端的控制装置包括:
速度检测模块,用于检测所述移动终端的实时位移速度,并判断所述实时位移速度是否大于预设速度阈值;
图像判断模块,用于若所述实时位移速度大于所述预设速度阈值,则通过所述移动终端的摄像头获取预设范围内的范围图像,并判断所述范围图像中是否存在所述预设目标用户的用户图像;
模式进入模块,用于若所述范围图像中存在所述预设目标用户的用户图像,则进入所述语音控制模式;
语音分析模块,用于在处于语音控制模式时,若接收到语音信息,则对所述语音信息进行声纹分析,判断所述语音信息是否来源于预设目标用户;
信息确定模块,用于若判断所述语音信息来源于所述预设目标用户,则根据所述语音信息确定对应的第三方目标应用和目标功能;
应用调用模块,用于根据所述第三方目标应用的应用类型确定对应的第三方目标调用规则,并基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能。
此外,为实现上述目的,本申请还提供一种移动终端,其中,所述移动终端包括处理器、存储器、以及存储在所述存储器上并可被所述处理器执行的计算机可读指令,其中所述计算机可读指令被所述处理器执行时,实现上述的移动终端的控制方法的步骤。
此外,为实现上述目的,本申请还提供一种可读存储介质,所述存储介质上存储有计算机可读指令,其中所述计算机可读指令被处理器执行时,实现如上述的移动终端的控制方法的步骤。
本申请的一个或多个实施例的细节在下面的附图和描述中提出。本申请的其他特征和优点将从说明书、附图以及权利要求书变得明显。
附图说明
图1为本申请实施例方案中涉及的移动终端的硬件结构示意图;
图2为本申请移动终端的控制方法第一实施例的流程示意图。
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。
具体实施方式
应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。
本申请实施例涉及的移动终端的控制方法主要应用于移动终端,该移动终端可以是手机、平板电脑、掌上电脑、可穿戴设备等具有数据处理功能的设备。
参照图1,图1为本申请实施例方案中涉及的移动终端的硬件结构示意图。本申请实施例中,该移动终端可以包括处理器1001(例如中央处理器Central Processing Unit,CPU),通信总线1002,用户接口1003,网络接口1004,存储器1005。其中,通信总线1002用于实现这些组件之间的连接通信;用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard);网络接口1004可选的可以包括标准的有线接口、无线接口(如无线保真WIreless-FIdelity,WI-FI接口);存储器1005可以是高速随机存取存储器(random access memory,RAM),也可以是稳定的存储器(non-volatile memory),例如磁盘存储器,存储器1005可选的还可以是独立于前述处理器1001的存储装置。本领域技术人员可以理解,图1中示出的硬件结构并不构成对本申请的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。作为一种计算机可读存储介质的存储器1005可以包括操作系统、网络通信模块以及计算机可读指令;网络通信模块主要用于连接数据库,与数据库进行数据通信;而处理器1001可以调用存储器1005中存储的计算机可读指令,并执行本申请实施例提供的移动终端的控制方法。
本申请实施例提供了一种移动终端的控制方法。
参照图2,图2为本申请移动终端的控制方法第一实施例的流程示意图。
本实施例中,所述移动终端的控制方法应用于移动终端,所述移动终端的控制方法包括以下步骤:
步骤S10,在处于语音控制模式时,若接收到语音信息,则对所述语音信息进行声纹分析,判断所述语音信息是否来源于预设目标用户;
本实施例的移动终端的控制方法应用于移动终端,该移动终端可以是手机、平板电脑、掌上电脑、可穿戴设备等;为描述方便,本实施例中以手机为例进行说明。对于本实施例中的移动终端的控制方法,可以是借助一语音控制应用实现,也即用户的手机中可预先安装该语音控制应用,通过该语音控制应用实现对非终端系统自带应用和非本语音控制应用的第三方应用的集中式语音控制,避免了用户先通过手动操作的方式启动单一第三方应用后,再启动该第三方应用本身所提供的语音功能,从而简化了对于第三方应用的单一语音控制的操作流程,提高了语音控制第三方应用效率。当然在实际中,也可以是在手机系统本身集成该语音控制功能。此外,用户的手机还设置有麦克风(或是其它的声音信号采集装置),用以采集接收用户发出的语音信息;当然手机也可以是以有线或无线的方式与一外接麦克风(如耳麦等设备)连接,用户通过该外接麦克风进行语音控制。进一步的,对于本实施例中的语音控制应用,其应用界面中包括一模式设置项,以供用户选择开启或关闭语音控制模式;当用户通过语音控制应用的该模式设置项选择开启语音控制模式时,手机即进入语音控制模式,并通过手机上的麦克风监听是否接收到语音信息。当手机将接收到语音信息时,将对该语音信息进行声纹分析(通过语音控制应用进行),判断该语音信息是否来源于预设目标用户,也即判断该语音是否为某一预设目标用户发出;对于该预设目标用户,可以是机主,又或者是其它有权限对手机进行语音控制的用户。如果该语音信息确来源于该预设目标用户,则手机可进行下一步语音语义分析等操作,即进入步骤S20;而如果该语音信息并不是来源于预设目标用户,则可认为该语音信息是由无语音控制权限的用户发出、又或者是环境噪音,此时手机不会对该语音信息进行反馈;通过上述方式,避免了旁人或环境语音引起的无意识误操作、还有利于提高手机语音控制的准确性和安全性。
进一步,对于上述对语音信息进行声纹识别的过程,可以是根据预先训练好的声纹识别模型、又或是其它第三方提供的声纹识别SDK(软件开发工具包,Software Development Kit)实现,也即手机本地的语音控制应用中集成有该声纹识别模型或声纹识别SDK,从而提高声纹识别的效率。具体的,预设目标用户可预先通过手机麦克风往手机的语音控制应用中录入自己的样本语音,以使手机预先存储有预设目标用户的样本语音;对于该样本语音,手机可通过该预先通过机器学习或其它方式得到声纹识别模型提取出其中的样本特征;而手机在接收到语音信息时,也将通过该声纹识别模型提取出其中的语音特征,然后将两者进行比对,当两者的相似度达到一定阈值时,即认为该语音信息与样本语音来源于同一用户,也即该语音信息来源于预设目标用户。而如果是通过第三方提供的声纹识别SDK实现声纹识别,其识别过程与上述通过声纹识别模型识别过程类似,此处不再赘述。值得说明的是,对于该预设目标用户可以是一位;也可以是两位以上(此处“以上”包括本数,下同),也即可以是有多位不同的用户对手机具有语音控制的权限。
再进一步的,对于上述对语音信息进行声纹识别的过程,也可以是通过云端的服务器实现,从而降低声纹识别的手机资源消耗,还有利于减小语音控制应用对手机存储空间的占用量。具体的,预设目标用户可预先通过手机麦克风往手机录入自己的样本语音,对于该样本语音,手机会将其发送至语音控制应用对应的云服务器,由该云服务器进行存储;而手机在接收到语音信息时,会将该语音信息发送至云服务器,由于云服务器对两者进行比对,并将比对结果返回到手机,手机即可根据该比对结果判断该语音信息是否来源于预设目标用户。
步骤S20,若判断所述语音信息来源于所述预设目标用户,则根据所述语音信息确定对应的第三方目标应用和目标功能;
本实施例中,当手机判断语音信息来源于预设目标用户时,手机将对该语音信息进行语义识别,确定该语音信息对应的第三方目标应用和目标功能,也即确定预设目标用户希望启动的第三方目标应用和目标功能。其中,对语音信息进行语义识别的功能,也可以是通过相关机器学习得到的语义识别模型、或者是借助第三方提供的语义识别SDK实现的。例如,预设目标用户发出的语音信息为“打开D应用,导航至X地点”(D应用为一第三方提供的导航应用或地图应用);手机在接收到该语音信息时,可先识别出其中的操作关键字“打开”,并将该操作关键字对应的操作对象“D应用”确定为第三方目标应用,而对于“导航”则为功能关键字,根据该功能关键可确定对应的目标功能(“X地点”为具体的功能内容或功能对象)。
值得说明的是,在根据语音信息确定对应第三方目标应用和目标功能的过程中,可以是直接从语音信息中确定出第三方目标应用和目标功能;还可以是先从语音信息中确定出目标功能,再根据该目标功能确定出能实现该目标功能的第三方目标应用。例如,当移动终端接收到的语音信息为“导航至X地点”、“播放G歌曲”时,即是先确定目标功能,在根据该目标功能确定第三方目标应用。具体的,所述根据所述语音信息确定对应的第三方目标应用和目标功能的步骤包括:
对所述语音信息进行解析,并从所述语音信息中提取得到对应的功能关键词;
手机在接收到语音信息时,首先将对该语音信息进行解析,并从该语音信息中提取得到对应的功能关键词。对于该功能关键词的提取过程,可以是如上述通过语义识别模型、或者是借助第三方提供的语义识别SDK实现的;还可以是先在手机里预存若干功能关键词的功能词语音,当得到语音信息,将该语音信息与功能词语音进行比对,判断该语音信息中是否存在与功能词语音匹配(相似度达到一定阈值)的片段;若存在,则该片段为功能关键词对应的语音片段,并进一步确定对应的功能关键词。例如“导航至X地点”中,功能关键词为“导航”;又例如“播放G歌曲”中,功能关键词为“播放”等。当然,在实际中,功能关键词的形式除了上述“纯动词”的形式外,还可能是“动词+对象”的形式或是其它的形式,例如将“播放G歌曲”整段作为功能关键词等。
根据所述功能关键词确定对应的目标功能,并根据所述目标功能确定对应的第三方目标应用。
手机在得到功能关键词时,即可根据该目标关键词确定该语音信息所希望启动/执行的服务,也即确定目标功能;此时手机将会根据该目标功能从已安装的第三方应用中确定支持该目标功能的第三方目标应用。例如,对于功能关键词“导航”,根据该功能关键词可知目标功能为导航功能;根据该目标功能可进一步确定出第三方目标应用为手机中已安装的第三方D导航应用。
进一步的,手机根据目标功能确定第三方目标应用的过程,可以是先查询获取当前手机中已经安装的第三方应用信息,然后根据这些第三方应用信息判断已安装的第三方应用中是否存在能够支持该目标功能的第三方可选应用。例如对于手机中已安装的第三方应用包括第三方D导航应用、第三方T地图应用、第三方Y音乐应用;在这些已安装的第三方应用中,第三方D导航应用和第三方T地图应用均可以支持(可实现)该目标功能(导航),也即已安装的第三方应用中存在能够支持该目标功能第三方可选应用;此时手机即可在该第三方可选应用中确定出第三方目标应用。而如果已安装的第三方应用中不存在能够支持该目标功能的第三方可选应用,则手机需要通过网络(包括移动数据网络、WIFI网络等)进行网络查询,并通过网络下载安装能够支持该目标功能的第三方网络应用,然后将该第三方网络应用确定为第三方目标应用,从而保证能够为用户提供其所需要的功能服务。
再进一步的,在已安装的第三方应用中存在能支持该目标功能第三方可选应用的情况下,手机将从这些第三方可选应用中确定第三方目标应用;此时手机将会确定该第三方可选应用的应用数量;如果第三方可选应用仅为一个,那可直接将该唯一的第三方可选应用确定为第三方目标应用;而如果第三方可选应用的应用数量在两个以上,则手机将会根据一定的规则从中确定出一个第三方目标应用。例如,手机的第三方可选应用包括第三方D导航应用和第三方T地图应用,则手机可获取这些第三方可选应用各自的使用频率(如最近七天的使用次数),并将使用频率最高的第三方可选应用确定为第三方目标应用,从而使得后续启动的第三方目标应用能够贴合用户的使用习惯;当然,还可以是获取这些第三方可选应用各自的最近更新时间(或安装时间),并将最近更新时间最新的第三方可选应用确定为第三方目标应用,从而使得后续启动的第三方目标应用能够为用户提供最新的功能服务。
步骤S30,根据所述第三方目标应用的应用类型确定对应的第三方目标调用规则,并基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能。
本实施例中,当手机确定第三方目标应用和目标功能时,手机将会通过一定的第三方应用调用规则调用该第三方目标应用,并启动该第三方目标应用的目标功能,然后根据该目标功能的执行结果进行相应的输出,例如显示导航路线、播放音乐等。而对于该第三方目标应用及该目标功能的启动,可以是通过该第三方目标应用本身提供的第三方调用接口API(Application Programming Interface、应用程序编程接口)进行调用,也可以是以自动化模拟人工操作的方式实现,当然还可以是通过其它的方式。手机在确定第三方目标应用和目标功能时,首先可根据该第三方目标应用确定对应的第三方目标调用规则,然后再基于该第三方目标调用规则调用该第三方目标应用。例如,手机可以先判断该第三方目标应用是否有对外提供第三方调用接口;若有,则优先以第三方调用接口调用的方式进行调用;否则,则可通过自动化模拟人工操作的方式实现调用。又例如,手机可以先预先设置不同第三方应用的优先调用方式,在进行第三方调用时优先根据设置的方式进行调用。
具体的,对于该第三方调用接口调用的方式,要求第三方应用中提供有第三方调用接口和该调用接口规范;该调用接口规范中包括有相关的标识符模板,用以构造出满足该第三方调用接口入参规范的统一资源标识符URI(Uniform Resource Identifier,一个用于标识某一互联网资源名称的字符串,允许用户对任何(包括本地和互联网)的资源通过特定的协议进行交互操作),还包括有该标识符模板的相关填充规则,即如何填充该标识符模板,各字符串的相关含义等。当通过第三方调用接口进行调用时,手机首先会获取该第三方目标应用的调用接口规范,并根据该调用接口规范获取到对应的标识符模板;然后手机会根据目标功能的具体内容以及该调用接口规范对标识符模板进行内容填充,构造得到对应的目标标识符,例如根据“导航至X地点”和调用接口规范中的字符串规定生成对应的功能字符,再将该功能字符填充至标识符模板中,从而得到目标标识符;然后手机可将该目标标识符输入至第三方目标应用的第三方调用接口,以调用所述第三方目标应用和执行第三方目标应用的目标功能,并根据该目标功能的执行结果进行相应的输出。通过该第三方调用接口进行调用,可减少语音控制应用(或是手机的语音控制功能)本身的相关功能要求,该语音控制应用(或手机)无需关注目标功能如何实现,只需要根据接口调用规范进行构建相应的统一资源标识符并将其输入至第三方调用接口,即可得到相应的调用结果并为用户提供功能服务,无需进行重新开发,降低了语音控制的实现成本。
而当通过自动化模拟人工操作(自动化模拟用例)的方式实现第三方目标应用和目标功能的启动时,可以是手机先启动该第三方目标应用,然后将该第三方目标应用的目标应用界面显示在显示屏上。在显示该目标应用界面后,手机将会对所述目标应用界面进行识别,并在所述目标应用界面中确定目标功能对应的功能触发区域。对于该功能触发区域的识别过程,可以是预先根据目标应用界面的排版模式预先设置对应的识别脚本(即识别规范),从而在显示目标应用界面时,根据该识别脚本识别出相关的页面要素,从而确定功能触发区域。当然在识别的过程中,还可以是结合光学字符识别(Optical Character Recognition,OCR)技术(或其它技术)进行,即在显示目标应用界面时进行截图,并通过OCR技术识别出相关的关键字,从而根据关键字确定对应的功能触发区域。在确定功能触发区域的同时,还将要确定该功能触发区域的功能触发类型,例如通过输入相关命令字符并点击相应按键的方式触发相应功能指令,还是通过直接点击某个按键触发相应功能指令等。在确定功能触发类型时,手机将会根据该功能触发类型调用对应的操作控件(如输入控件、点击控件等),并通过该操作控件在该功能触发区域进行相关的模拟操作,以启动该第三方目标应用的目标功能,例如通过输入控件在功能触发区域的输入栏模拟输入某个字符、通过点击空间模拟点击功能触发区域的某个功能按键等;然后可根据执行结果进行相应地输出。通过上述自动化模拟人工操作的方式实现第三方应用调用,可在一定程度上实现对不同第三方应用的兼容性,在不依赖接口进行数据导入导出的情况下也可实现第三方应用于第三方应用、或是系统与第三方应用之间的无缝衔接,有利于提高移动终端进行第三方调用的稳定性,提高了用户体验。
值得说明的是,对于上述的第三方调用接口调用或是自动化模拟人工操作进行调用的方式,由于市面上的第三方应用种类较多,因此手机中安装的语音控制应用不一定要将所有第三方应用的相关资料(如第三方应用的调用接口规范、自动化模拟操作脚本、模拟用例等)均存储在本地;也即手机在通过该语音控制应用提供语音控制服务时,当确定第三方目标应用和目标功能了,可以是将该第三方目标应用和目标功能发送至语音应用服务器,由该语音应用服务器根据该第三目标应用和目标功能构建生成相关的目标标识符或自动化模拟操作脚本、模拟用例,再将该目标标识符或自动化模拟操作脚本、模拟用例返回至手机,以供手机将该目标标识符输入至第三方目标应用的第三方调用接口,或通过自动化模拟操作脚本、模拟用例模拟人工操作,从而实现第三方目标应用和目标功能的启动。
本实施例中的移动终端,在处于语音控制模式时,若接收到语音信息,则对所述语音信息进行声纹分析,判断所述语音信息是否来源于预设目标用户;若判断所述语音信息来源于所述预设目标用户,则根据所述语音信息确定对应的第三方目标应用和目标功能;根据所述第三方目标应用的应用类型确定对应的第三方目标调用规则,并基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能。通过以上方式,本实施例可在用户不方便手动操作移动终端时提供语音智能服务,使得用户可通过语音的方式控制移动终端,为用户提供了方便;同时,在语音控制过程中,移动终端还可以对第三方应用进行调用,通过第三方应用为用户提供相应的功能服务,扩展了语音控制的功能覆盖面,实现对非终端系统自带应用和非本语音控制应用的第三方应用的集中式语音控制,避免了用户先通过手动操作的方式启动单一第三方应用后,再启动该第三方应用本身所提供的语音功能,从而简化了对于第三方应用的单一语音控制的操作流程,提高了语音控制第三方应用效率,进一步提高了用户体验;此外,在进行第三方应用调用时,可通过接口调用或模拟人工操作的方式实现,在一定程度上提高不同应用之间的兼容性,减小对系统或第三方应用的改动,有利于提高移动终端运行的稳定性。
基于上述图2所示实施例,提出本申请移动终端的控制方法第二实施例的流程示意图。本实施例中,所述步骤S10之前,还包括:
步骤S40,检测所述移动终端的实时位移速度,并判断所述实时位移速度是否大于预设速度阈值;
本实施例中,对于手机语音控制模式的进入(启动),还可以是通过手机一系列的传感器(或装置)对周边的环境检测,当根据检测数据判断当前环境为用户不方便手动操作手机时,即自动启动该语音控制应用并进入语音控制模式,无需用户手动进行设置,从而为用户提供方便。例如,本实施例中可以是检测用户是否在驾驶状态,若是,则自动进入语音控制模式。具体的,手机可通过GPS或是其它设备对检测自身的实时位移速度,并判断该实时位移速度是否大于一预设速度阈值;该预设速度阈值可以根据实际情况进行设置,例如设置为10km/h等。若手机的实时位移速度大于该预设速度阈值,则可认为手机当前正位于交通工具上,此时进入步骤S50;而若手机的实时位移速度小于或等于该预设速度阈值,则保持当前模式不变。
步骤S50,若所述实时位移速度大于所述预设速度阈值,则通过所述移动终端的摄像头获取预设范围内的范围图像,并判断所述范围图像中是否存在所述预设目标用户的用户图像;
本实施例中,若手机的实时位移速度大于该预设速度阈值,则手机将通过摄像头获取预设范围内的范围图像;再得到该范围图像时,可对该范围图像进行识别,判断该范围图像中是否存在预设目标用户的用户图像;若该范围图像中存在预设目标用户的用户图像,则可认为预设目标用户当前处于运行的交通工具上使用手机,此时进入步骤S60;而若该范围图像中不存在预设目标用户的用户图像,则保持当前模式不变。
步骤S60,若所述范围图像中存在所述预设目标用户的用户图像,则进入所述语音控制模式。
本实施例中,若该范围图像中存在预设目标用户的用户图像,则可认为预设目标用户当前处于运行的交通工具上使用手机,此时手机将自动启动该语音控制应用并进入语音控制模式,用户可通过语音的方式对手机进行操作,为用户了提供方便。
当然,在实际中,用户也可能是正在运行的地铁、公交车、出租车上使用手机,此时虽然用户当前处于运行的交通工具上使用手机,但并不影响用户手动操作,对此,手机还可设置其它的判定规则,以进一步确定是否需要进入语音控制模式。例如手机此时可发出相关的语音询问信息,如“检测到您处于运行的交通工具上使用手机,请问是否进入语音模式”,然后采集用户的回复语音,若用户在预设时间内回答“是”,则进入语音控制模式;若用户回答在预设时间内回答“否”或是未在预设时间内采集到用户的回复语音,则保持当前模式不变。通过以上方式,可进一步提高环境判断的准确性,从而提高用户的体验。
此外,本申请实施例还提供一种移动终端的控制装置,所述移动终端的控制装置包括:
语音分析模块,用于在处于语音控制模式时,若接收到语音信息,则对所述语音信息进行声纹分析,判断所述语音信息是否来源于预设目标用户;
信息确定模块,用于若判断所述语音信息来源于所述预设目标用户,则根据所述语音信息确定对应的第三方目标应用和目标功能;
应用调用模块,用于根据所述第三方目标应用的应用类型确定对应的第三方目标调用规则,并基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能。
其中,上述移动终端的控制装置的各虚拟功能模块存储于图1所示移动终端的存储器1005中,用于实现计算机可读指令的所有功能;各模块被处理器1001执行时,可实现移动终端的语音控制的功能。
进一步的,所述应用调用模块包括:
模板获取单元,用于获取所述第三方目标应用的调用接口规范,并根据所述调用接口规范获取对应的标识符模板;
模板填充单元,用于根据所述目标功能和所述调用接口规范对所述标识符模板进行内容填充,构造得到对应的目标标识符;
标识符输入单元,用于将所述目标标识符输入至所述第三方目标应用的第三方调用接口,以调用所述第三方目标应用,并启动所述第三方目标应用的目标功能。
进一步的,所述应用调用模块包括:
界面显示单元,用于启动所述第三方目标应用,并显示所述第三方目标应用的目标应用界面;
界面识别单元,用于对所述目标应用界面进行识别,并在所述目标应用界面中确定所述目标功能对应的功能触发区域和所述功能触发区域的功能触发类型;
操作模拟单元,用于根据所述功能触发类型调用对应的操作控件,并通过所述操作组件在所述功能触发区域进行模拟操作,以启动所述第三方目标应用的目标功能。
进一步的,所述移动终端的控制装置还包括:
速度检测模块,用于检测所述移动终端的实时位移速度,并判断所述实时位移速度是否大于预设速度阈值;
图像判断模块,用于若所述实时位移速度大于所述预设速度阈值,则通过所述移动终端的摄像头获取预设范围内的范围图像,并判断所述范围图像中是否存在所述预设目标用户的用户图像;
模式进入模块,用于若所述范围图像中存在所述预设目标用户的用户图像,则进入所述语音控制模式。
进一步的,所述信息确定模块20包括:
信息解析单元,用于对所述语音信息进行解析,并从所述语音信息中提取得到对应的功能关键词;
应用确定单元,用于根据所述功能关键词确定对应的目标功能,并根据所述目标功能确定对应的第三方目标应用。
进一步的,所述应用确定单元包括:
应用查询子单元,用于查询所述移动终端中已安装的第三方应用,并判断所述已安装的第三方应用中是否存在支持所述目标功能的第三方可选应用;
第一确定子单元,用于若所述已安装的第三方应用中存在所述第三方可选应用,则在所述第三方可选应用中确定第三方目标应用;
第二确定子单元,用于若所述已安装的第三方应用中不存在所述第三方可选应用,则通过网络下载安装支持所述目标功能的第三方网络应用,并将所述第三方网络应用确定为第三方目标应用。
进一步的,所述第一确定子单元,具体用于若所述已安装的第三方应用中存在所述第三方可选应用,则确定所述第三方可选应用的应用数量;若第三方可选应用的应用数量为两个以上,则根据所述第三方可选应用各自的使用频率在所述第三方可选应用中确定第三方目标应用。
其中,上述移动终端的控制装置中各个模块的功能实现与上述移动终端的控制方法实施例中各步骤相对应,其功能和实现过程在此处不再一一赘述。
此外,本申请实施例还提供一种可读存储介质,所述计算机可读存储介质可以为非易失性可读存储介质。
本申请可读存储介质上存储有计算机可读指令,其中所述计算机可读指令被处理器执行时,实现如上述的移动终端的控制方法的步骤。
其中,计算机可读指令被执行时所实现的方法可参照本申请移动终端的控制方法的各个实施例,此处不再赘述。
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在如上所述的一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。

Claims (20)

  1. 一种移动终端的控制方法,其中,所述移动终端的控制方法应用于移动终端,所述移动终端的控制方法包括:
    检测所述移动终端的实时位移速度,并判断所述实时位移速度是否大于预设速度阈值;
    若所述实时位移速度大于所述预设速度阈值,则通过所述移动终端的摄像头获取预设范围内的范围图像,并判断所述范围图像中是否存在所述预设目标用户的用户图像;
    若所述范围图像中存在所述预设目标用户的用户图像,则进入所述语音控制模式;
    在处于语音控制模式时,若接收到语音信息,则对所述语音信息进行声纹分析,判断所述语音信息是否来源于预设目标用户;
    若判断所述语音信息来源于所述预设目标用户,则根据所述语音信息确定对应的第三方目标应用和目标功能;
    根据所述第三方目标应用的应用类型确定对应的第三方目标调用规则,并基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能。
  2. 如权利要求1所述的移动终端的控制方法,其中,所述基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能的步骤包括:
    获取所述第三方目标应用的调用接口规范,并根据所述调用接口规范获取对应的标识符模板;
    根据所述目标功能和所述调用接口规范对所述标识符模板进行内容填充,构造得到对应的目标标识符;
    将所述目标标识符输入至所述第三方目标应用的第三方调用接口,以调用所述第三方目标应用,并启动所述第三方目标应用的目标功能。
  3. 如权利要求1所述的移动终端的控制方法,其中,所述基于所述第三方目标调用规则调用所述第三方目标应用,并启动所述第三方目标应用的目标功能的步骤包括:
    启动所述第三方目标应用,并显示所述第三方目标应用的目标应用界面;
    对所述目标应用界面进行识别,并在所述目标应用界面中确定所述目标功能对应的功能触发区域和所述功能触发区域的功能触发类型;
    根据所述功能触发类型调用对应的操作控件,并通过所述操作组件在所述功能触发区域进行模拟操作,以启动所述第三方目标应用的目标功能。
  4. 如权利要求1所述的移动终端的控制方法,其中,所述根据所述语音信息确定对应的第三方目标应用和目标功能的步骤包括:
    对所述语音信息进行解析,并从所述语音信息中提取得到对应的功能关键词;
    根据所述功能关键词确定对应的目标功能,并根据所述目标功能确定对应的第三方目标应用。
  5. 如权利要求4所述的移动终端的控制方法,其中,所述根据所述目标功能确定对应的第三方目标应用的步骤包括:
    查询所述移动终端中已安装的第三方应用,并判断所述已安装的第三方应用中是否存在支持所述目标功能的第三方可选应用;
    若所述已安装的第三方应用中存在所述第三方可选应用,则在所述第三方可选应用中确定第三方目标应用;
    若所述已安装的第三方应用中不存在所述第三方可选应用,则通过网络下载安装支持所述目标功能的第三方网络应用,并将所述第三方网络应用确定为第三方目标应用。
  6. 如权利要求5所述的移动终端的控制方法,其中,所述若所述已安装的第三方应用中存在所述第三方可选应用,则在所述第三方可选应用中确定第三方目标应用的步骤包括:
    若所述已安装的第三方应用中存在所述第三方可选应用,则确定所述第三方可选应用的应用数量;
    若第三方可选应用的应用数量为两个以上,则根据所述第三方可选应用各自的使用频率在所述第三方可选应用中确定第三方目标应用。
  7. 一种移动终端的控制装置,其中,所述移动终端的控制装置包括:
    速度检测模块,用于检测所述移动终端的实时位移速度,并判断所述实时位移速度是否大于预设速度阈值;
    图像判断模块,用于若所述实时位移速度大于所述预设速度阈值,则通过所述移动终端的摄像头获取预设范围内的范围图像,并判断所述范围图像中是否存在所述预设目标用户的用户图像;
    模式进入模块,用于若所述范围图像中存在所述预设目标用户的用户图像,则进入所述语音控制模式;
    语音分析模块,用于在处于语音控制模式时,若接收到语音信息,则对所述语音信息进行声纹分析,判断所述语音信息是否来源于预设目标用户;
    信息确定模块,用于若判断所述语音信息来源于所述预设目标用户,则根据所述语音信息确定对应的第三方目标应用和目标功能;
    应用调用模块,用于根据所述第三方目标应用的应用类型确定对应的第三方目标调用规则,并基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能。
  8. 如权利要求7所述的移动终端的控制装置,其中,所述应用调用模块包括:
    模板获取单元,用于获取所述第三方目标应用的调用接口规范,并根据所述调用接口规范获取对应的标识符模板;
    模板填充单元,用于根据所述目标功能和所述调用接口规范对所述标识符模板进行内容填充,构造得到对应的目标标识符;
    标识符输入单元,用于将所述目标标识符输入至所述第三方目标应用的第三方调用接口,以调用所述第三方目标应用,并启动所述第三方目标应用的目标功能。
  9. 如权利要求7所述的移动终端的控制装置,其中,所述应用调用模块包括:
    界面显示单元,用于启动所述第三方目标应用,并显示所述第三方目标应用的目标应用界面;
    界面识别单元,用于对所述目标应用界面进行识别,并在所述目标应用界面中确定所述目标功能对应的功能触发区域和所述功能触发区域的功能触发类型;
    操作模拟单元,用于根据所述功能触发类型调用对应的操作控件,并通过所述操作组件在所述功能触发区域进行模拟操作,以启动所述第三方目标应用的目标功能。
  10. 如权利要求7所述的移动终端的控制装置,其中,所述信息确定模块包括:
    信息解析单元,用于对所述语音信息进行解析,并从所述语音信息中提取得到对应的功能关键词;
    应用确定单元,用于根据所述功能关键词确定对应的目标功能,并根据所述目标功能确定对应的第三方目标应用。
  11. 如权利要求10所述的移动终端的控制装置,其中,所述应用确定单元包括:
    应用查询子单元,用于查询所述移动终端中已安装的第三方应用,并判断所述已安装的第三方应用中是否存在支持所述目标功能的第三方可选应用;
    第一确定子单元,用于若所述已安装的第三方应用中存在所述第三方可选应用,则在所述第三方可选应用中确定第三方目标应用;
    第二确定子单元,用于若所述已安装的第三方应用中不存在所述第三方可选应用,则通过网络下载安装支持所述目标功能的第三方网络应用,并将所述第三方网络应用确定为第三方目标应用。
  12. 一种移动终端,其中,所述移动终端包括处理器、存储器、以及存储在所述存储器上并可被所述处理器执行的计算机可读指令,其中所述计算机可读指令被所述处理器执行时,实现以下步骤:
    检测所述移动终端的实时位移速度,并判断所述实时位移速度是否大于预设速度阈值;
    若所述实时位移速度大于所述预设速度阈值,则通过所述移动终端的摄像头获取预设范围内的范围图像,并判断所述范围图像中是否存在所述预设目标用户的用户图像;
    若所述范围图像中存在所述预设目标用户的用户图像,则进入所述语音控制模式;
    在处于语音控制模式时,若接收到语音信息,则对所述语音信息进行声纹分析,判断所述语音信息是否来源于预设目标用户;
    若判断所述语音信息来源于所述预设目标用户,则根据所述语音信息确定对应的第三方目标应用和目标功能;
    根据所述第三方目标应用的应用类型确定对应的第三方目标调用规则,并基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能。
  13. 如权利要求12所述的移动终端,其中,所述基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能的步骤包括:
    获取所述第三方目标应用的调用接口规范,并根据所述调用接口规范获取对应的标识符模板;
    根据所述目标功能和所述调用接口规范对所述标识符模板进行内容填充,构造得到对应的目标标识符;
    将所述目标标识符输入至所述第三方目标应用的第三方调用接口,以调用所述第三方目标应用,并启动所述第三方目标应用的目标功能。
  14. 如权利要求12所述的移动终端,其中,所述基于所述第三方目标调用规则调用所述第三方目标应用,并启动所述第三方目标应用的目标功能的步骤包括:
    启动所述第三方目标应用,并显示所述第三方目标应用的目标应用界面;
    对所述目标应用界面进行识别,并在所述目标应用界面中确定所述目标功能对应的功能触发区域和所述功能触发区域的功能触发类型;
    根据所述功能触发类型调用对应的操作控件,并通过所述操作组件在所述功能触发区域进行模拟操作,以启动所述第三方目标应用的目标功能。
  15. 如权利要求12所述的移动终端,其中,所述根据所述语音信息确定对应的第三方目标应用和目标功能的步骤包括:
    对所述语音信息进行解析,并从所述语音信息中提取得到对应的功能关键词;
    根据所述功能关键词确定对应的目标功能,并根据所述目标功能确定对应的第三方目标应用。
  16. 如权利要求15所述的移动终端,其中,所述根据所述目标功能确定对应的第三方目标应用的步骤包括:
    查询所述移动终端中已安装的第三方应用,并判断所述已安装的第三方应用中是否存在支持所述目标功能的第三方可选应用;
    若所述已安装的第三方应用中存在所述第三方可选应用,则在所述第三方可选应用中确定第三方目标应用;
    若所述已安装的第三方应用中不存在所述第三方可选应用,则通过网络下载安装支持所述目标功能的第三方网络应用,并将所述第三方网络应用确定为第三方目标应用。
  17. 一种可读存储介质,其中,所述存储介质上存储有计算机可读指令,其中所述计算机可读指令被处理器执行时,实现以下步骤:
    检测移动终端的实时位移速度,并判断所述实时位移速度是否大于预设速度阈值;
    若所述实时位移速度大于所述预设速度阈值,则通过所述移动终端的摄像头获取预设范围内的范围图像,并判断所述范围图像中是否存在所述预设目标用户的用户图像;
    若所述范围图像中存在所述预设目标用户的用户图像,则进入所述语音控制模式;
    在处于语音控制模式时,若接收到语音信息,则对所述语音信息进行声纹分析,判断所述语音信息是否来源于预设目标用户;
    若判断所述语音信息来源于所述预设目标用户,则根据所述语音信息确定对应的第三方目标应用和目标功能;
    根据所述第三方目标应用的应用类型确定对应的第三方目标调用规则,并基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能。
  18. 如权利要17所述的可读存储介质,其中,所述基于所述第三方目标调用规则调用所述第三方目标应用,启动所述第三方目标应用的目标功能的步骤包括:
    获取所述第三方目标应用的调用接口规范,并根据所述调用接口规范获取对应的标识符模板;
    根据所述目标功能和所述调用接口规范对所述标识符模板进行内容填充,构造得到对应的目标标识符;
    将所述目标标识符输入至所述第三方目标应用的第三方调用接口,以调用所述第三方目标应用,并启动所述第三方目标应用的目标功能。
  19. 如权利要17所述的可读存储介质,其中,所述基于所述第三方目标调用规则调用所述第三方目标应用,并启动所述第三方目标应用的目标功能的步骤包括:
    启动所述第三方目标应用,并显示所述第三方目标应用的目标应用界面;
    对所述目标应用界面进行识别,并在所述目标应用界面中确定所述目标功能对应的功能触发区域和所述功能触发区域的功能触发类型;
    根据所述功能触发类型调用对应的操作控件,并通过所述操作组件在所述功能触发区域进行模拟操作,以启动所述第三方目标应用的目标功能。
  20. 如权利要17所述的可读存储介质,其中,所述根据所述语音信息确定对应的第三方目标应用和目标功能的步骤包括:
    对所述语音信息进行解析,并从所述语音信息中提取得到对应的功能关键词;
    根据所述功能关键词确定对应的目标功能,并根据所述目标功能确定对应的第三方目标应用。
PCT/CN2019/122033 2019-05-21 2019-11-29 移动终端的控制方法、装置、移动终端及可读存储介质 WO2020233074A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910433466.3A CN110310648A (zh) 2019-05-21 2019-05-21 移动终端的控制方法、装置、移动终端及可读存储介质
CN201910433466.3 2019-05-21

Publications (1)

Publication Number Publication Date
WO2020233074A1 true WO2020233074A1 (zh) 2020-11-26

Family

ID=68075516

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/122033 WO2020233074A1 (zh) 2019-05-21 2019-11-29 移动终端的控制方法、装置、移动终端及可读存储介质

Country Status (2)

Country Link
CN (1) CN110310648A (zh)
WO (1) WO2020233074A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112863514A (zh) * 2021-03-15 2021-05-28 湖北亿咖通科技有限公司 一种语音应用的控制方法和电子设备

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110310648A (zh) * 2019-05-21 2019-10-08 深圳壹账通智能科技有限公司 移动终端的控制方法、装置、移动终端及可读存储介质
CN110865844B (zh) * 2019-11-28 2021-09-28 安徽江淮汽车集团股份有限公司 基于车联网平台的应用配置系统及方法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN204116902U (zh) * 2014-02-10 2015-01-21 美的集团股份有限公司 对家用电器语音控制的语音控制端及控制终端
CN104298904A (zh) * 2014-09-30 2015-01-21 北京金山安全软件有限公司 移动终端的语音识别功能控制方法、装置和移动终端
WO2015078155A1 (en) * 2013-11-28 2015-06-04 Tencent Technology (Shenzhen) Company Limited A method and mobile terminal for speech communication
CN107621882A (zh) * 2017-09-30 2018-01-23 咪咕互动娱乐有限公司 一种控制模式的切换方法、装置及存储介质
CN110310648A (zh) * 2019-05-21 2019-10-08 深圳壹账通智能科技有限公司 移动终端的控制方法、装置、移动终端及可读存储介质

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103915095B (zh) * 2013-01-06 2017-05-31 华为技术有限公司 语音识别的方法、交互设备、服务器和系统
CN105430433B (zh) * 2015-10-29 2019-02-19 小米科技有限责任公司 信息处理方法及装置
CN107644509A (zh) * 2017-09-04 2018-01-30 深圳支点电子智能科技有限公司 智能手表和相关产品
CN107911335B (zh) * 2017-09-26 2021-02-09 五八有限公司 校验统一资源标识符uri的方法、装置和系统
CN108597512A (zh) * 2018-04-27 2018-09-28 努比亚技术有限公司 移动终端控制方法、移动终端及计算机可读存储介质
CN109656512A (zh) * 2018-12-20 2019-04-19 Oppo广东移动通信有限公司 基于语音助手的交互方法、装置、存储介质及终端

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015078155A1 (en) * 2013-11-28 2015-06-04 Tencent Technology (Shenzhen) Company Limited A method and mobile terminal for speech communication
CN204116902U (zh) * 2014-02-10 2015-01-21 美的集团股份有限公司 对家用电器语音控制的语音控制端及控制终端
CN104298904A (zh) * 2014-09-30 2015-01-21 北京金山安全软件有限公司 移动终端的语音识别功能控制方法、装置和移动终端
CN107621882A (zh) * 2017-09-30 2018-01-23 咪咕互动娱乐有限公司 一种控制模式的切换方法、装置及存储介质
CN110310648A (zh) * 2019-05-21 2019-10-08 深圳壹账通智能科技有限公司 移动终端的控制方法、装置、移动终端及可读存储介质

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
邓阳 (DENG, YANG): "基于Android平台的语音控制系统的设计与实现 (Design and implementation of voice control system based on Android platform)", 中国优秀硕士学位论文全文数据库信息科技辑 (INFORMATION & TECHNOLOGY, CHINA MASTER’S THESES FULL-TEXT DATABASE), no. 01, 15 January 2018 (2018-01-15), XP55756531, DOI: 20200213153411Y *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112863514A (zh) * 2021-03-15 2021-05-28 湖北亿咖通科技有限公司 一种语音应用的控制方法和电子设备
CN112863514B (zh) * 2021-03-15 2024-03-15 亿咖通(湖北)技术有限公司 一种语音应用的控制方法和电子设备

Also Published As

Publication number Publication date
CN110310648A (zh) 2019-10-08

Similar Documents

Publication Publication Date Title
WO2020233074A1 (zh) 移动终端的控制方法、装置、移动终端及可读存储介质
EP3300074B1 (en) Information processing apparatus
WO2021034038A1 (en) Method and system for context association and personalization using a wake-word in virtual personal assistants
RU2592062C1 (ru) Система и способ управления внешним устройством, соединенным с устройством
WO2015005679A1 (ko) 음성 인식 방법, 장치 및 시스템
EP2761400A1 (en) User interface method and device
WO2015053541A1 (ko) 전자 장치에서 연관 정보 표시 방법 및 장치
WO2011162445A1 (ko) 온톨로지 기반 개인화 서비스 시스템 및 방법
WO2014119975A1 (en) Method and system for sharing part of web page
WO2020107761A1 (zh) 广告文案处理方法、装置、设备及计算机可读存储介质
WO2013077589A1 (ko) 음성인식 부가 서비스 제공 방법 및 이에 적용되는 장치
WO2021251539A1 (ko) 인공신경망을 이용한 대화형 메시지 구현 방법 및 그 장치
WO2021060728A1 (ko) 사용자 발화를 처리하는 전자 장치 및 그 작동 방법
WO2020253115A1 (zh) 基于语音识别的产品推荐方法、装置、设备和存储介质
WO2020062640A1 (zh) 终端应用动态文案的语言切换方法、服务器及存储介质
KR20200011198A (ko) 대화형 메시지 구현 방법, 장치 및 프로그램
WO2021107208A1 (ko) 챗봇 채널연계 통합을 위한 챗봇 통합 에이전트 플랫폼 시스템 및 그 서비스 방법
KR20190115405A (ko) 검색 방법 및 이 방법을 적용하는 전자 장치
US20030182129A1 (en) Dialog system and dialog control system
WO2021017332A1 (zh) 语音控制报错方法、电器及计算机可读存储介质
CN111667824A (zh) 智能体装置、智能体装置的控制方法及存储介质
WO2019031621A1 (ko) 통화 중 감정을 인식하여 인식된 감정을 활용하는 방법 및 시스템
WO2014014229A1 (ko) 검색 기능이 부여된 대표전화 정보제공시스템 및 그 방법
WO2015037871A1 (ko) 텍스트 인식을 이용한 음성재생 서비스 제공 시스템, 서버 및 단말
WO2020149621A1 (ko) 영어 말하기 평가 시스템 및 방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19929788

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 03.03.2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19929788

Country of ref document: EP

Kind code of ref document: A1