CN107277225B - Method and device for controlling intelligent equipment through voice and intelligent equipment - Google Patents

Method and device for controlling intelligent equipment through voice and intelligent equipment Download PDF

Info

Publication number
CN107277225B
CN107277225B CN201710309069.6A CN201710309069A CN107277225B CN 107277225 B CN107277225 B CN 107277225B CN 201710309069 A CN201710309069 A CN 201710309069A CN 107277225 B CN107277225 B CN 107277225B
Authority
CN
China
Prior art keywords
voice operation
operation prompt
voice
contextual model
prompt information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710309069.6A
Other languages
Chinese (zh)
Other versions
CN107277225A (en
Inventor
李良
葛均辉
王熙
刘义平
于鸿洋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201710309069.6A priority Critical patent/CN107277225B/en
Publication of CN107277225A publication Critical patent/CN107277225A/en
Priority to PCT/CN2018/085442 priority patent/WO2018202073A1/en
Application granted granted Critical
Publication of CN107277225B publication Critical patent/CN107277225B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72436User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for text messaging, e.g. SMS or e-mail
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72454User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to context-related or environment-related conditions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72469User interfaces specially adapted for cordless or mobile telephones for operating the device by selecting functions from two or more displayed items, e.g. menus or icons
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2250/00Details of telephonic subscriber devices
    • H04M2250/74Details of telephonic subscriber devices with voice recognition means

Abstract

The invention discloses a method and a device for controlling intelligent equipment through voice and the intelligent equipment. The method comprises the following steps: when a user interface of the intelligent equipment is loaded or switched, acquiring a current contextual model of the intelligent equipment; judging whether the current contextual model supports voice operation; and if so, selecting at least one piece of voice operation prompt information which can be supported by the current contextual model, and displaying at least part of the selected voice operation prompt information at a specified position on the screen of the intelligent equipment. According to the technical scheme, whether voice operation prompting is needed to be carried out on the user can be reasonably judged under the condition that resources are consumed less, so that the user can clearly know whether the voice operation can be used or not and how to realize the voice operation, and the voice operation can be clear at a glance through display on a screen.

Description

Method and device for controlling intelligent equipment through voice and intelligent equipment
Technical Field
The invention relates to the field of intelligent equipment, in particular to a method and a device for controlling intelligent equipment through voice and the intelligent equipment.
Background
The intelligent equipment is more convenient to operate and control by adopting the voice command during normal use, so that voice operation is supported mostly. Even so, however, situations arise in which voice operation is not supported in many contextual models. For a user, it is a tedious matter to turn over a specification to search for a voice keyword supported by each contextual model, and therefore how to perform proper voice operation prompting on the user is a problem to be solved.
Disclosure of Invention
In view of the above, the present invention has been made to provide a method, an apparatus and a smart device for voice controlling a smart device that overcome or at least partially solve the above problems.
According to an aspect of the present invention, there is provided a method of voice controlling a smart device, comprising:
when a user interface of the intelligent equipment is loaded or switched, acquiring a current contextual model of the intelligent equipment;
judging whether the current contextual model supports voice operation;
and if so, selecting at least one piece of voice operation prompt information which can be supported by the current contextual model, and displaying at least part of the selected voice operation prompt information at a specified position on the screen of the intelligent equipment.
Optionally, the intelligent device is a vehicle-mounted intelligent device, a mobile terminal or a computer device.
Optionally, the current contextual model includes: an interface of an application running in the foreground, and/or a function of an application that can be invoked at the present time.
Optionally, the method further comprises:
voice operation prompt information which can be supported by the contextual model is set for each contextual model of the intelligent equipment and is stored in a specified configuration file;
the judging whether the current contextual model supports voice operation or not comprises the following steps: and searching voice operation prompt information which can be supported by the contextual model in the configuration file, if the voice operation prompt information can be searched, judging that the current contextual model supports voice operation, and if the voice operation prompt information cannot be searched, judging that the current contextual model does not support voice operation.
Optionally, the selecting at least one piece of voice operation prompt information that can be supported by the current contextual model includes:
determining the display quantity of voice operation prompt information to be displayed;
and randomly selecting the voice operation prompt messages with the number equal to the determined display number from the searched voice operation prompt messages which can be supported by the current contextual model, and/or weighting and randomly selecting the voice operation prompt messages with the number equal to the determined display number according to the weight set for each voice operation prompt message in the configuration file.
Optionally, the method further comprises:
recording voice operation prompt information used by a user;
and adjusting the weight set for each voice operation prompt message in the configuration file according to the use record of the voice operation prompt message.
Optionally, the voice operation prompt message is a prompt message containing a voice keyword;
the recording of the voice operation prompt information used by the user comprises the following steps: and when the user speaks a voice keyword, adding one to the number of the corresponding voice operation prompt messages.
Optionally, the adjusting the weight set for each voice operation prompt message in the configuration file according to the usage record of the voice operation prompt message includes:
when the using quantity of a voice operation prompt message reaches a preset value, the weight of the voice operation prompt message is correspondingly increased or reduced to the weight corresponding to the preset value.
Optionally, the method further comprises:
and loading the configuration file when the intelligent equipment is started.
Optionally, the method further comprises:
and displaying the voice operation prompt information in a floating window mode, and displaying a trigger switch for opening/closing the floating window in an operating system of the intelligent equipment.
According to another aspect of the present invention, there is provided an apparatus for controlling a smart device by voice, comprising:
the contextual model acquisition unit is suitable for acquiring the current contextual model of the intelligent equipment when the user interface of the intelligent equipment is loaded or switched;
the judging unit is suitable for judging whether the current contextual model supports voice operation or not;
and the display unit is suitable for selecting at least one piece of voice operation prompt information which can be supported by the current contextual model when the current contextual model supports voice operation, and displaying at least part of the selected voice operation prompt information at a specified position on the screen of the intelligent equipment.
Optionally, the intelligent device is a vehicle-mounted intelligent device, a mobile terminal or a computer device.
Optionally, the current contextual model includes: an interface of an application running in the foreground, and/or a function of an application that can be invoked at the present time.
Optionally, the apparatus further comprises:
the configuration unit is suitable for respectively setting voice operation prompt information which can be supported by the contextual model for each contextual model of the intelligent equipment and storing the voice operation prompt information in a specified configuration file;
the judging unit is suitable for searching the voice operation prompt information which can be supported by the contextual model in the configuration file, if the voice operation prompt information can be searched, the current contextual model is judged to support the voice operation, and if the voice operation prompt information cannot be searched, the current contextual model is judged not to support the voice operation.
Optionally, the display unit is adapted to determine a display number of voice operation prompt messages to be displayed, randomly select, from the found voice operation prompt messages that can be supported by the current contextual model, a number of voice operation prompt messages equal to the determined display number, and/or randomly select, according to a weight set for each voice operation prompt message in the configuration file, a number of voice operation prompt messages equal to the determined display number in a weighted manner.
Optionally, the apparatus further comprises:
the recording unit is suitable for recording voice operation prompt information used by a user;
and the configuration unit is suitable for adjusting the weight set for each voice operation prompt message in the configuration file according to the use record of the voice operation prompt message.
Optionally, the voice operation prompt message is a prompt message containing a voice keyword;
the recording unit is suitable for adding one to the number of the corresponding voice operation prompt messages when the user speaks a voice keyword.
Optionally, the configuration unit is adapted to, when the number of users using one voice operation prompt message reaches a preset value, correspondingly increase or decrease the weight of the voice operation prompt message to a weight corresponding to the preset value.
Optionally, the configuration unit is adapted to load the configuration file when the smart device is started.
Optionally, the display unit is adapted to display the voice operation prompt message in a floating window manner, and is adapted to display a trigger switch for turning on/off the floating window in an operating system of the smart device.
In accordance with still another aspect of the present invention, there is provided a smart device, wherein the device includes: a voice acquisition unit, a voice recognition unit, and a device as in any above;
the voice acquisition unit is suitable for acquiring voice data and sending the acquired voice data to an application to be received and/or the voice recognition unit;
the voice recognition unit is suitable for converting the voice data collected by the voice receiving unit into character data and sending the converted character data to an application to be received.
Optionally, the intelligent device is a vehicle-mounted intelligent device, a mobile terminal or a computer device.
According to the technical scheme, when the user interface of the intelligent device is loaded or switched, the current contextual model of the intelligent device is acquired, then judgment is carried out once, if the contextual model supports voice operation, at least one piece of voice operation prompt information which can be supported by the current contextual model is selected, and at least part of the selected voice operation prompt information is displayed at the appointed position on the screen of the intelligent device. According to the technical scheme, whether voice operation prompting is needed to be carried out on the user can be reasonably judged under the condition that resources are consumed less, so that the user can clearly know whether the voice operation can be used or not and how to realize the voice operation, and the voice operation can be clear at a glance through display on a screen.
The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
FIG. 1 illustrates a flow diagram of a method for voice-controlling a smart device in accordance with one embodiment of the present invention;
FIG. 2-a shows a schematic interface diagram showing a plurality of voice operated prompts displayed in a floating window of a smart rearview mirror screen;
2-b illustrate an interface diagram showing a plurality of voice operated prompt messages displayed in a floating window of another smart device screen;
FIG. 3 is a schematic diagram of an apparatus for voice-controlling a smart device according to an embodiment of the present invention;
fig. 4 shows a schematic structural diagram of a smart device according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
Fig. 1 is a flow chart illustrating a method for controlling an intelligent device by voice according to an embodiment of the present invention, and as shown in fig. 1, the method includes:
step S110, when the user interface of the intelligent device is loaded or switched, the current contextual model of the intelligent device is obtained.
The scene mode may include: an interface of an application running in the foreground, and/or a function of an application that can be invoked at present.
For example, when a user opens a new application, the application starts to run in the foreground, the interface is activated, the interface of the application is obtained, and whether the interface supports voice operation is determined. For another example, when a music playing application program is running, a user opens a new application program, and then obtains two contextual modes, namely an interface of the new application program running in the foreground and a music playing function which can be called up in the background. Although the interface of the new application program running in the foreground does not support the voice operation, the music playing function supports the voice operation (such as pausing or song cutting), and the current contextual model is still judged to support the voice operation. That is, step S120, it is determined whether the current contextual model supports voice operation.
And S130, if so, selecting at least one piece of voice operation prompt information which can be supported by the current contextual model, and displaying at least part of the selected voice operation prompt information at a specified position on the screen of the intelligent device.
Specifically, the intelligent device may be a vehicle-mounted intelligent device, a mobile terminal, or a computer device, such as an emerging vehicle-mounted intelligent device like an intelligent rearview mirror, or such as a mobile phone, which is an intelligent device that has been widely used for voice control. For example, FIG. 2-a shows an interface diagram showing a plurality of voice operated prompts displayed in a floating window of a smart rearview mirror screen. As shown in fig. 2-a, the current profile is the main interface of cool me music, as well as the talk function and navigation function running in the background (not shown because not running in the foreground). At the moment, a user sees that a floating window is arranged on the right side of the intelligent rearview mirror screen, four pieces of voice operation prompt information are shown, and the user can send out voice instructions conforming to the format. 2-b illustrate an interface diagram showing a plurality of voice operated prompt messages displayed in a floating window of another smart device screen. As shown in fig. 2-b, since the selected voice operation prompt messages are more and can not be displayed in the screen, the user can manually slide to view the other voice operation prompt messages which can not be displayed.
It can be seen that, in the method shown in fig. 1, when the user interface of the smart device is loaded or switched, the current contextual model of the smart device is obtained, then a determination is made once, if the contextual model supports voice operation, at least one piece of voice operation prompt information that can be supported by the current contextual model is selected, and at least part of the selected voice operation prompt information is displayed at a specified position on the screen of the smart device. According to the technical scheme, whether voice operation prompting is needed to be carried out on the user can be reasonably judged under the condition that resources are consumed less, so that the user can clearly know whether the voice operation can be used or not and how to realize the voice operation, and the voice operation can be clear at a glance through display on a screen.
In one embodiment of the present invention, the method shown in fig. 1 further comprises: respectively setting voice operation prompt information which can be supported by the contextual model for each contextual model of the intelligent equipment, and storing the voice operation prompt information in a specified configuration file; judging whether the current contextual model supports voice operation comprises the following steps: and searching voice operation prompt information which can be supported by the contextual model in the configuration file, if the voice operation prompt information can be searched, judging that the current contextual model supports voice operation, and if the voice operation prompt information cannot be searched, judging that the current contextual model does not support voice operation.
Taking an interface of a map navigation application program running in a foreground as an example, the application program may have a multi-level interface, and voice operations supported by each interface are not identical, so that each interface may also be used as a type of contextual model, and corresponding voice operation prompt information is stored in a configuration file.
In an embodiment of the present invention, the selecting at least one piece of voice operation prompt information that can be supported by the current contextual model includes: determining the display quantity of voice operation prompt information to be displayed; and randomly selecting the voice operation prompt messages with the number equal to the determined display number from the voice operation prompt messages which can be supported by the found current contextual model, and/or weighting and randomly selecting the voice operation prompt messages with the number equal to the determined display number according to the weight set for each voice operation prompt message in the configuration file.
And if the display quantity of the voice operation prompt information displayed in the floating window is larger than the quantity of the searched voice operation prompt information corresponding to the current contextual model, displaying all the searched voice operation prompt information corresponding to the current contextual model. Regarding random selection, all voice operation prompt messages can be treated without distinction, and weighting treatment can be carried out. For example, if a new function is added to the application after the update, and the function also supports voice operation, the user may not be able to perform the operation skillfully, and a prompt is required at this time, and the weight corresponding to the voice operation prompt information corresponding to the function may be set higher.
As the user uses, many common voice operations have become skilled, and the user may no longer need to prompt for such voice operations; or the user uses the voice operation prompt information many times, but the voice operation is difficult to remember, but the voice operation is common, so that the user needs to adjust the voice operation prompt information according to the use habit of the user when the user selects the voice operation prompt information. Thus in one embodiment of the invention, the method further comprises: recording voice operation prompt information used by a user; and adjusting the preset weight for each voice operation prompt message in the configuration file according to the use record of the voice operation prompt message.
Specifically, the voice operation prompt information is prompt information containing a voice keyword; the recording of the voice operation prompt information used by the user comprises the following steps: and when the user speaks a voice keyword, adding one to the number of the corresponding voice operation prompt messages.
For example, "call to", "close", "navigate to", and "open music" shown in fig. 2-a are four voice keywords, and the user can record correspondingly every time he speaks one voice keyword.
In an embodiment of the present invention, the adjusting the preset weight for each voice operation prompt message in the configuration file according to the usage record of the voice operation prompt message includes: when the using quantity of a piece of voice operation prompt information reaches a preset value, the weight of the voice operation prompt information is correspondingly increased or reduced to a weight value corresponding to the preset value.
According to the method, the configuration file can be loaded when the intelligent equipment is started, so that the correct operation of the functions is ensured.
In an embodiment of the present invention, the method further includes: and displaying the voice operation prompt information in a floating window mode, and displaying a trigger switch for opening/closing the floating window in an operating system of the intelligent equipment.
And displaying a trigger switch for opening/closing the floating window in a setting menu item of the operating system, counting the times of voice operation prompt information used by a user when the user closes the trigger switch, reporting the times to a server, and calculating a preset value serving as a trigger condition for adjusting the weight of the voice prompt information according to the use condition of each user serving as a big data sample.
Fig. 3 is a schematic structural diagram illustrating an apparatus for voice-controlling a smart device according to an embodiment of the present invention, and as shown in fig. 3, the apparatus 300 for voice-controlling a smart device includes:
the contextual model obtaining unit 310 is adapted to obtain a current contextual model of the smart device when the user interface of the smart device is loaded or switched.
The scene mode may include: an interface of an application running in the foreground, and/or a function of an application that can be invoked at present.
For example, when a user opens a new application, the application starts to run in the foreground, the interface is activated, the interface of the application is obtained, and whether the interface supports voice operation is determined. For another example, when a music playing application program is running, a user opens a new application program, and then obtains two contextual modes, namely an interface of the new application program running in the foreground and a music playing function which can be called up in the background. Although the interface of the new application program running in the foreground does not support the voice operation, the music playing function supports the voice operation (such as pausing or song cutting), and the current contextual model is still judged to support the voice operation. I.e. the determining unit 320, is adapted to determine whether the current contextual model supports voice operation.
The display unit 330 is adapted to select at least one piece of voice operation prompt information that can be supported by the current contextual model when the current contextual model supports voice operation, and display at least part of the selected voice operation prompt information at a specified position on the screen of the smart device.
The effect of displaying at least part of the voice operated prompt message of the selection in the floating window may also be viewed with reference to fig. 2-a and 2-b.
It can be seen that, in the apparatus shown in fig. 3, through the mutual cooperation of the units, when the user interface of the smart device is loaded or switched, the current contextual model of the smart device is obtained, and then a judgment is performed once, if the contextual model supports voice operation, at least one piece of voice operation prompt information that can be supported by the current contextual model is selected, and at least part of the selected voice operation prompt information is displayed at a specified position on the screen of the smart device. According to the technical scheme, whether voice operation prompting is needed to be carried out on the user can be reasonably judged under the condition that resources are consumed less, so that the user can clearly know whether the voice operation can be used or not and how to realize the voice operation, and the voice operation can be clear at a glance through display on a screen.
In an embodiment of the present invention, the apparatus further includes: the configuration unit 340 is adapted to set voice operation prompt information that can be supported by the contextual model for each contextual model of the intelligent device, and store the voice operation prompt information in a specified configuration file; the determining unit 320 is adapted to search the voice operation prompt information that can be supported by the contextual model in the configuration file, determine that the current contextual model supports the voice operation if the voice operation prompt information can be found, and determine that the current contextual model does not support the voice operation if the voice operation prompt information cannot be found.
Taking an interface of a map navigation application program running in a foreground as an example, the application program may have a multi-level interface, and voice operations supported by each interface are not identical, so that each interface may also be used as a type of contextual model, and corresponding voice operation prompt information is stored in a configuration file.
In an embodiment of the present invention, in the above apparatus, the display unit 330 is adapted to determine a display number of the voice operation prompt messages to be displayed, randomly select, from the searched voice operation prompt messages that can be supported by the current contextual model, a number of voice operation prompt messages equal to the determined display number, and/or weight-randomly select, according to the weight set in the configuration file for each voice operation prompt message, a number of voice operation prompt messages equal to the determined display number.
And if the display quantity of the voice operation prompt information displayed in the floating window is larger than the quantity of the searched voice operation prompt information corresponding to the current contextual model, displaying all the searched voice operation prompt information corresponding to the current contextual model. Regarding random selection, all voice operation prompt messages can be treated without distinction, and weighting treatment can be carried out. For example, if a new function is added to the application after the update, and the function also supports voice operation, the user may not be able to perform the operation skillfully, and a prompt is required at this time, and the weight corresponding to the voice operation prompt information corresponding to the function may be set higher.
As the user uses, many common voice operations have become skilled, and the user may no longer need to prompt for such voice operations; or the user uses the voice operation prompt information many times, but the voice operation is difficult to remember, but the voice operation is common, so that the user needs to adjust the voice operation prompt information according to the use habit of the user when the user selects the voice operation prompt information. Thus in one embodiment of the invention, the above apparatus further comprises: a recording unit 350 adapted to record voice operation prompt information used by a user; the configuration unit 340 is further adapted to adjust the weight preset for each voice operation prompt message in the configuration file according to the usage record of the voice operation prompt message.
Specifically, in an embodiment of the present invention, in the apparatus, the voice operation prompt message is a prompt message containing a voice keyword; the recording unit 350 is adapted to add one to the number of usage of the corresponding voice operation prompt message when the user speaks a voice keyword.
For example, "call to", "close", "navigate to", and "open music" shown in fig. 2-a are four voice keywords, and the user can record correspondingly every time he speaks one voice keyword.
In an embodiment of the present invention, in the above apparatus, the configuration unit 340 is adapted to, when the number of users using a piece of voice operation prompt information reaches a preset value, correspondingly increase or decrease the weight of the voice operation prompt information to a weight value corresponding to the preset value.
In an embodiment of the present invention, in the apparatus, the configuration unit 340 is adapted to load a configuration file when the smart device is started, so as to ensure correct operation of the above functions.
In an embodiment of the present invention, in the above apparatus, the display unit 330 is further adapted to display the voice operation prompt information in a floating window manner, and display a trigger switch for opening/closing the floating window in an operating system of the smart device.
And displaying a trigger switch for opening/closing the floating window in a setting menu item of the operating system, counting the times of voice operation prompt information used by a user when the user closes the trigger switch, reporting the times to a server, and calculating a preset value serving as a trigger condition for adjusting the weight of the voice prompt information according to the use condition of each user serving as a big data sample.
Fig. 4 is a schematic structural diagram of a smart device according to an embodiment of the present invention, and as shown in fig. 4, the smart device includes: a voice acquisition unit 410, a voice recognition unit 420, and an apparatus 300 for voice controlling a smart device as in any of the above embodiments.
The speech acquisition unit 410 is adapted to acquire speech data and send the acquired speech data to the application to be received and/or the speech recognition unit 420. Since many applications, such as instant chat applications, can directly utilize the collected speech data, a speech recognition unit is not required for recognition.
The voice recognition unit 420 is adapted to convert the voice data collected by the voice receiving unit 410 into text data, and send the converted text data to an application to be received. In particular, it may be implemented using a speech recognition library.
In the above embodiments, the smart device may be an in-vehicle smart device, a mobile terminal, or a computer device. For example, a mobile phone, a car recorder, an intelligent rearview mirror, etc. can be used as the intelligent device in the above embodiments.
In summary, according to the technical scheme of the present invention, when the user interface of the smart device is loaded or switched, the current contextual model of the smart device is obtained, and then a determination is made once, if the contextual model supports voice operation, at least one piece of voice operation prompt information that can be supported by the current contextual model is selected, and at least part of the selected voice operation prompt information is displayed at a designated position on the screen of the smart device. According to the technical scheme, whether voice operation prompting is needed to be carried out on the user can be reasonably judged under the condition that resources are consumed less, so that the user can clearly know whether the voice operation can be used or not and how to realize the voice operation, and the voice operation can be clear at a glance through display on a screen.
It should be noted that:
the algorithms and displays presented herein are not inherently related to any particular computer, virtual machine, or other apparatus. Various general purpose devices may be used with the teachings herein. The required structure for constructing such a device will be apparent from the description above. Moreover, the present invention is not directed to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any descriptions of specific languages are provided above to disclose the best mode of the invention.
In the description provided herein, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. However, the disclosed method should not be interpreted as reflecting an intention that: that the invention as claimed requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
Those skilled in the art will appreciate that the modules in the device in an embodiment may be adaptively changed and disposed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and furthermore they may be divided into a plurality of sub-modules or sub-units or sub-components. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where at least some of such features and/or processes or elements are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
Furthermore, those skilled in the art will appreciate that while some embodiments described herein include some features included in other embodiments, rather than other features, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
The various component embodiments of the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. It will be appreciated by those skilled in the art that a microprocessor or Digital Signal Processor (DSP) may be used in practice to implement some or all of the functions of the apparatus for voice-controlling a smart device and some or all of the components of the smart device in accordance with embodiments of the present invention. The present invention may also be embodied as apparatus or device programs (e.g., computer programs and computer program products) for performing a portion or all of the methods described herein. Such programs implementing the present invention may be stored on computer-readable media or may be in the form of one or more signals. Such a signal may be downloaded from an internet website or provided on a carrier signal or in any other form.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The usage of the words first, second and third, etcetera do not indicate any ordering. These words may be interpreted as names.
The embodiment of the invention discloses A1 and a method for controlling intelligent equipment by voice, wherein the method comprises the following steps:
when a user interface of the intelligent equipment is loaded or switched, acquiring a current contextual model of the intelligent equipment;
judging whether the current contextual model supports voice operation;
and if so, selecting at least one piece of voice operation prompt information which can be supported by the current contextual model, and displaying at least part of the selected voice operation prompt information at a specified position on the screen of the intelligent equipment.
A2, the method of A1, wherein the smart device is an in-vehicle smart device, a mobile terminal or a computer device.
A3, the method as in a1, wherein the current scene mode includes: an interface of an application running in the foreground, and/or a function of an application that can be invoked at the present time.
A4, the method of a1, wherein the method further comprises:
voice operation prompt information which can be supported by the contextual model is set for each contextual model of the intelligent equipment and is stored in a specified configuration file;
the judging whether the current contextual model supports voice operation or not comprises the following steps: and searching voice operation prompt information which can be supported by the contextual model in the configuration file, if the voice operation prompt information can be searched, judging that the current contextual model supports voice operation, and if the voice operation prompt information cannot be searched, judging that the current contextual model does not support voice operation.
A5, the method as in a4, wherein the selecting at least one piece of voice operation prompting information that can be supported by the current contextual model comprises:
determining the display quantity of voice operation prompt information to be displayed;
randomly selecting the voice operation prompt messages with the number equal to the determined display number from the voice operation prompt messages which can be supported by the found current contextual model; and/or according to the weight set for each voice operation prompt message in the configuration file, randomly selecting the voice operation prompt messages with the number equal to the determined display number in a weighted mode.
A6, the method of a5, wherein the method further comprises:
recording voice operation prompt information used by a user;
and adjusting the weight set for each voice operation prompt message in the configuration file according to the use record of the voice operation prompt message.
A7, the method according to A6, wherein the voice operation prompt message is a prompt message containing a voice keyword;
the recording of the voice operation prompt information used by the user comprises the following steps: and when the user speaks a voice keyword, adding one to the number of the corresponding voice operation prompt messages.
A8, the method as in A6, wherein the adjusting the weight set for each voice operation prompt message in the configuration file according to the record of the use of the voice operation prompt message comprises:
when the using quantity of a voice operation prompt message reaches a preset value, the weight of the voice operation prompt message is correspondingly increased or reduced to the weight corresponding to the preset value.
A9, the method of any one of a4-A8, wherein the method further comprises:
and loading the configuration file when the intelligent equipment is started.
A10, the method of a1, wherein the method further comprises:
and displaying the voice operation prompt information in a floating window mode, and displaying a trigger switch for opening/closing the floating window in an operating system of the intelligent equipment.
The embodiment of the invention also discloses B11 and a device for controlling the intelligent equipment by voice, wherein the device comprises:
the contextual model acquisition unit is suitable for acquiring the current contextual model of the intelligent equipment when the user interface of the intelligent equipment is loaded or switched;
the judging unit is suitable for judging whether the current contextual model supports voice operation or not;
and the display unit is suitable for selecting at least one piece of voice operation prompt information which can be supported by the current contextual model when the current contextual model supports voice operation, and displaying at least part of the selected voice operation prompt information at a specified position on the screen of the intelligent equipment.
B12, the apparatus according to B11, wherein the intelligent device is an in-vehicle intelligent device, a mobile terminal or a computer device.
B13, the apparatus as in B11, wherein the current scene mode comprises: an interface of an application running in the foreground, and/or a function of an application that can be invoked at the present time.
B14, the apparatus of B11, wherein the apparatus further comprises:
the configuration unit is suitable for respectively setting voice operation prompt information which can be supported by the contextual model for each contextual model of the intelligent equipment and storing the voice operation prompt information in a specified configuration file;
the judging unit is suitable for searching the voice operation prompt information which can be supported by the contextual model in the configuration file, if the voice operation prompt information can be searched, the current contextual model is judged to support the voice operation, and if the voice operation prompt information cannot be searched, the current contextual model is judged not to support the voice operation.
B15, the device of B14, wherein,
the display unit is suitable for determining the display quantity of the voice operation prompt messages to be displayed, randomly selecting the voice operation prompt messages with the quantity equal to the determined display quantity from the searched voice operation prompt messages which can be supported by the current contextual model, and/or randomly selecting the voice operation prompt messages with the quantity equal to the determined display quantity in a weighting mode according to the weight set for each voice operation prompt message in the configuration file.
B16, the apparatus of B15, wherein the apparatus further comprises:
the recording unit is suitable for recording voice operation prompt information used by a user;
and the configuration unit is suitable for adjusting the weight set for each voice operation prompt message in the configuration file according to the use record of the voice operation prompt message.
B17, the device as B16, wherein the voice operation prompt message is a prompt message containing a voice keyword;
the recording unit is suitable for adding one to the number of the corresponding voice operation prompt messages when the user speaks a voice keyword.
B18, the device of B16, wherein,
the configuration unit is suitable for correspondingly increasing or reducing the weight of the voice operation prompt information to the weight corresponding to the preset value when the using number of the voice operation prompt information reaches the preset value.
B19, the device of any one of B14-B18, wherein,
the configuration unit is suitable for loading the configuration file when the intelligent equipment is started.
B20, the device of B11, wherein,
the display unit is suitable for displaying the voice operation prompt information in a floating window mode and displaying a trigger switch for opening/closing the floating window in an operating system of the intelligent equipment.
The embodiment of the invention also discloses C21 and intelligent equipment, wherein the equipment comprises: a voice acquisition unit, a voice recognition unit, and an apparatus as described in any of B11-B20;
the voice acquisition unit is suitable for acquiring voice data and sending the acquired voice data to an application to be received and/or the voice recognition unit;
the voice recognition unit is suitable for converting the voice data collected by the voice receiving unit into character data and sending the converted character data to an application to be received.
C22, the smart device according to C21, wherein the smart device is an in-vehicle smart device, a mobile terminal or a computer device.

Claims (20)

1. A method of voice controlling a smart device, wherein the method comprises:
when a user interface of the intelligent equipment is loaded or switched, acquiring a current contextual model of the intelligent equipment;
judging whether the current contextual model supports voice operation;
if so, selecting at least one piece of voice operation prompt information which can be supported by the current contextual model, and displaying at least part of the selected voice operation prompt information at a specified position on the screen of the intelligent equipment;
voice operation prompt information which can be supported by the contextual model is set for each contextual model of the intelligent equipment and is stored in a specified configuration file;
the judging whether the current contextual model supports voice operation or not comprises the following steps: searching voice operation prompt information which can be supported by the contextual model in the configuration file, if the voice operation prompt information can be searched, judging that the current contextual model supports voice operation, and if the voice operation prompt information cannot be searched, judging that the current contextual model does not support voice operation;
wherein the selecting at least one voice operation prompt message which can be supported by the current contextual model comprises:
determining the display quantity of voice operation prompt information to be displayed;
and randomly selecting the voice operation prompt messages with the number equal to the determined display number from the voice operation prompt messages which can be supported by the searched current contextual model.
2. The method of claim 1, wherein the smart device is an in-vehicle smart device, a mobile terminal, or a computer device.
3. The method of claim 1, wherein the current contextual model comprises: an interface of an application running in the foreground, and/or a function of an application that can be invoked at the present time.
4. The method of claim 1, wherein the selecting at least one voice operation prompt message that can be supported by the current contextual model further comprises:
and according to the weight set for each voice operation prompt message in the configuration file, randomly selecting the voice operation prompt messages with the number equal to the determined display number in a weighted mode.
5. The method of claim 4, wherein the method further comprises:
recording voice operation prompt information used by a user;
and adjusting the weight set for each voice operation prompt message in the configuration file according to the use record of the voice operation prompt message.
6. The method of claim 5, wherein the voice operation prompt message is a prompt message containing a voice keyword;
the recording of the voice operation prompt information used by the user comprises the following steps: and when the user speaks a voice keyword, adding one to the number of the corresponding voice operation prompt messages.
7. The method of claim 5, wherein the adjusting the weight set for each voice operation prompt message in the configuration file according to the usage record of the voice operation prompt message comprises:
when the using quantity of a voice operation prompt message reaches a preset value, the weight of the voice operation prompt message is correspondingly increased or reduced to the weight corresponding to the preset value.
8. The method of any one of claims 1-7, wherein the method further comprises:
and loading the configuration file when the intelligent equipment is started.
9. The method of claim 1, wherein the method further comprises:
and displaying the voice operation prompt information in a floating window mode, and displaying a trigger switch for opening/closing the floating window in an operating system of the intelligent equipment.
10. An apparatus for voice-controlling a smart device, wherein the apparatus comprises:
the contextual model acquisition unit is suitable for acquiring the current contextual model of the intelligent equipment when the user interface of the intelligent equipment is loaded or switched;
the judging unit is suitable for judging whether the current contextual model supports voice operation or not;
the display unit is suitable for selecting at least one piece of voice operation prompt information which can be supported by the current contextual model when the current contextual model supports voice operation, and displaying at least part of the selected voice operation prompt information at a specified position on the screen of the intelligent equipment;
the configuration unit is suitable for respectively setting voice operation prompt information which can be supported by the contextual model for each contextual model of the intelligent equipment and storing the voice operation prompt information in a specified configuration file;
the judging unit is suitable for searching voice operation prompt information which can be supported by the contextual model in the configuration file, if the voice operation prompt information can be searched, judging that the current contextual model supports voice operation, and if the voice operation prompt information cannot be searched, judging that the current contextual model does not support voice operation;
the display unit is suitable for determining the display quantity of the voice operation prompt messages to be displayed, and randomly selecting the voice operation prompt messages with the quantity equal to the determined display quantity from the searched voice operation prompt messages which can be supported by the current contextual model.
11. The apparatus of claim 10, wherein the smart device is an in-vehicle smart device, a mobile terminal, or a computer device.
12. The apparatus of claim 10, wherein the current contextual model comprises: an interface of an application running in the foreground, and/or a function of an application that can be invoked at the present time.
13. The apparatus of claim 10, wherein,
and the display unit is also suitable for weighting and randomly selecting the voice operation prompt messages with the number equal to the determined display number according to the weight set for each voice operation prompt message in the configuration file.
14. The apparatus of claim 13, wherein the apparatus further comprises:
the recording unit is suitable for recording voice operation prompt information used by a user;
and the configuration unit is suitable for adjusting the weight set for each voice operation prompt message in the configuration file according to the use record of the voice operation prompt message.
15. The apparatus according to claim 14, wherein the voice operation prompt message is a prompt message containing a voice keyword;
the recording unit is suitable for adding one to the number of the corresponding voice operation prompt messages when the user speaks a voice keyword.
16. The apparatus of claim 14, wherein,
the configuration unit is suitable for correspondingly increasing or reducing the weight of the voice operation prompt information to the weight corresponding to the preset value when the using number of the voice operation prompt information reaches the preset value.
17. The apparatus of any one of claims 10-16,
the configuration unit is suitable for loading the configuration file when the intelligent equipment is started.
18. The apparatus of claim 10, wherein,
the display unit is suitable for displaying the voice operation prompt information in a floating window mode and displaying a trigger switch for opening/closing the floating window in an operating system of the intelligent equipment.
19. A smart device, wherein the device comprises: a speech acquisition unit, a speech recognition unit, and an apparatus according to any of claims 10-18;
the voice acquisition unit is suitable for acquiring voice data and sending the acquired voice data to an application to be received and/or the voice recognition unit;
the voice recognition unit is suitable for converting the voice data collected by the voice receiving unit into character data and sending the converted character data to an application to be received.
20. The smart device of claim 19, wherein the smart device is an in-vehicle smart device, a mobile terminal, or a computer device.
CN201710309069.6A 2017-05-04 2017-05-04 Method and device for controlling intelligent equipment through voice and intelligent equipment Active CN107277225B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201710309069.6A CN107277225B (en) 2017-05-04 2017-05-04 Method and device for controlling intelligent equipment through voice and intelligent equipment
PCT/CN2018/085442 WO2018202073A1 (en) 2017-05-04 2018-05-03 Method and apparatus for voice control over intelligent device, and intelligent device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710309069.6A CN107277225B (en) 2017-05-04 2017-05-04 Method and device for controlling intelligent equipment through voice and intelligent equipment

Publications (2)

Publication Number Publication Date
CN107277225A CN107277225A (en) 2017-10-20
CN107277225B true CN107277225B (en) 2020-04-24

Family

ID=60074305

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710309069.6A Active CN107277225B (en) 2017-05-04 2017-05-04 Method and device for controlling intelligent equipment through voice and intelligent equipment

Country Status (2)

Country Link
CN (1) CN107277225B (en)
WO (1) WO2018202073A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107277225B (en) * 2017-05-04 2020-04-24 北京奇虎科技有限公司 Method and device for controlling intelligent equipment through voice and intelligent equipment
CN108965968B (en) * 2018-07-25 2021-04-30 聚好看科技股份有限公司 Method and device for displaying operation prompt of smart television and computer storage medium
CN113794800B (en) 2018-11-23 2022-08-26 华为技术有限公司 Voice control method and electronic equipment
WO2020111348A1 (en) * 2018-11-30 2020-06-04 엘지전자 주식회사 Vehicle control device and vehicle control method
CN109346081A (en) * 2018-12-20 2019-02-15 广州河东科技有限公司 A kind of sound control method, device, equipment and storage medium
CN111414145A (en) * 2019-01-04 2020-07-14 上海擎感智能科技有限公司 Voice function use prompting method and device
CN111552794B (en) * 2020-05-13 2023-09-19 海信电子科技(武汉)有限公司 Prompt generation method, device, equipment and storage medium
CN112887805B (en) * 2021-01-12 2023-01-20 南京创维信息技术研究院有限公司 Voice function prompting method, device, equipment and medium
CN114115790A (en) * 2021-11-12 2022-03-01 上汽通用五菱汽车股份有限公司 Voice conversation prompting method, device, equipment and computer readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1764896A (en) * 2003-04-07 2006-04-26 诺基亚有限公司 Method and apparatus for providing permission voice input in electronic equipment with user interface
CN102883041A (en) * 2012-08-02 2013-01-16 聚熵信息技术(上海)有限公司 Voice control device and method for mobile terminal
CN103200329A (en) * 2013-04-10 2013-07-10 威盛电子股份有限公司 Voice control method, mobile terminal device and voice control system
CN106297791A (en) * 2016-08-25 2017-01-04 Tcl集团股份有限公司 A kind of omnidistance voice realization method and system
CN106601242A (en) * 2015-10-16 2017-04-26 中兴通讯股份有限公司 Executing method and device of operation event and terminal

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090165145A1 (en) * 2007-12-21 2009-06-25 Nokia Corporation Changing modes in a device
DE112014006614B4 (en) * 2014-04-22 2018-04-12 Mitsubishi Electric Corporation User interface system, user interface controller, user interface control method, and user interface control program
CN105975511A (en) * 2016-04-27 2016-09-28 乐视控股(北京)有限公司 Intelligent dialogue method and apparatus
CN107277225B (en) * 2017-05-04 2020-04-24 北京奇虎科技有限公司 Method and device for controlling intelligent equipment through voice and intelligent equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1764896A (en) * 2003-04-07 2006-04-26 诺基亚有限公司 Method and apparatus for providing permission voice input in electronic equipment with user interface
CN102883041A (en) * 2012-08-02 2013-01-16 聚熵信息技术(上海)有限公司 Voice control device and method for mobile terminal
CN103200329A (en) * 2013-04-10 2013-07-10 威盛电子股份有限公司 Voice control method, mobile terminal device and voice control system
CN106601242A (en) * 2015-10-16 2017-04-26 中兴通讯股份有限公司 Executing method and device of operation event and terminal
CN106297791A (en) * 2016-08-25 2017-01-04 Tcl集团股份有限公司 A kind of omnidistance voice realization method and system

Also Published As

Publication number Publication date
CN107277225A (en) 2017-10-20
WO2018202073A1 (en) 2018-11-08

Similar Documents

Publication Publication Date Title
CN107277225B (en) Method and device for controlling intelligent equipment through voice and intelligent equipment
US10970096B2 (en) Suggesting actions based on machine learning
US10055681B2 (en) Mapping actions and objects to tasks
US10739958B2 (en) Method and device for executing application using icon associated with application metadata
KR102069322B1 (en) Method for operating program and an electronic device thereof
EP2631789A1 (en) Apparatus and method for grouping application program folders in electronic device
EP2634682A2 (en) Apparatus and method for creating a shortcut menu
CN110377365A (en) The method and apparatus for showing small routine
CN110462647B (en) Electronic device and method for executing functions of electronic device
US9302676B2 (en) Methods and systems for simulating a smart device user interface on a vehicle head unit
WO2019152115A1 (en) Methods to present the context of virtual assistant conversation
CN112684967A (en) Method for displaying subtitles and electronic equipment
CN108834122A (en) A kind of bluetooth recognition method, apparatus, equipment and storage medium
CN111104547A (en) Method and device for processing data in vehicle
CN116029719A (en) Payment service recommendation method, electronic device and storage medium
EP1954016A1 (en) Portable terminal and display method and medium therefor
EP1777926A1 (en) Method and mobile terminal for performing multiple tasks without conflict
CN108595141A (en) Pronunciation inputting method and device, computer installation and computer readable storage medium
CN109947522B (en) Information display method, device, terminal, server and storage medium
CN104391742B (en) Optimizing application method and apparatus
US20170024442A1 (en) Electronic device and method of acquiring user information in electronic device
CN105898053B (en) A kind of communications records processing equipment, method and mobile terminal
CN108062952B (en) Voice control method, device and system
CN105843504B (en) A kind of window regulation method and electronic equipment
US20170270974A1 (en) Method for Controlling Multimedia and Electronic Device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant