US11217256B2 - Voice interaction method, device and terminal - Google Patents

Voice interaction method, device and terminal Download PDF

Info

Publication number
US11217256B2
US11217256B2 US16/563,488 US201916563488A US11217256B2 US 11217256 B2 US11217256 B2 US 11217256B2 US 201916563488 A US201916563488 A US 201916563488A US 11217256 B2 US11217256 B2 US 11217256B2
Authority
US
United States
Prior art keywords
content
voice
interactive mode
wake
request
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US16/563,488
Other versions
US20200194007A1 (en
Inventor
Bingbing Fan
Hao Liang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Shanghai Xiaodu Technology Co Ltd
Original Assignee
Baidu Online Network Technology Beijing Co Ltd
Shanghai Xiaodu Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Baidu Online Network Technology Beijing Co Ltd, Shanghai Xiaodu Technology Co Ltd filed Critical Baidu Online Network Technology Beijing Co Ltd
Assigned to BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD. reassignment BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: Fan, Bingbing, LIANG, Hao
Publication of US20200194007A1 publication Critical patent/US20200194007A1/en
Assigned to SHANGHAI XIAODU TECHNOLOGY CO. LTD., BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD. reassignment SHANGHAI XIAODU TECHNOLOGY CO. LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Application granted granted Critical
Publication of US11217256B2 publication Critical patent/US11217256B2/en
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/26Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • H04M3/4936Speech interaction details
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Definitions

  • the present application relates to the field of intelligent interaction technology, and in particular, to a voice interaction method, device and terminal.
  • a near-field voice interactive mode is usually utilized as a way of interacting, for example, a Bluetooth voice interactor is utilized to interact with a smart TV.
  • a Bluetooth voice interactor is utilized to interact with a smart TV.
  • a user has to provide a wake-up prompt first, then speak out a search requirement.
  • a user usually has to provide supplemental information multiple times to find out the content he wants to watch, and the user has to speak out wake-up words repeatedly every time he interacts, which is very inconvenient and leads to low search efficiency.
  • a voice interaction method, device and terminal are provided according to embodiments of the present application, so as to at least solve the above technical problems in the existing technology.
  • a voice interaction method includes receiving a wake-up prompt, activating an interactive mode according to the wake-up prompt, displaying a dialog prompt identification in the interactive mode, obtaining a vocal request, wherein the vocal request is input in response to the dialog prompt identification, and displaying a requested content according to the vocal request.
  • the method further includes displaying the dialog prompt identification again in the interactive mode, obtaining an updated vocal request, and displaying an updated requested content according to the updated vocal request.
  • the dialog prompt identification includes a search prompt indicator
  • the search prompt indicator includes a general search start prompt word and a preset dialogue timer.
  • the dialog prompt identification includes a content guide
  • the content guide is used to prompt a user to provide requested content relevant with the content guide.
  • the method prior to activating the interactive mode according to the wake-up prompt, the method further includes determining whether a content of the wake-up prompt is associated with a preset interaction scenario, and activating the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
  • the method further includes determining whether a content of the vocal request is associated with a search request in the preset interaction scenario, and exiting the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
  • a voice interaction device configured to an embodiment of the present application.
  • the device includes a wake-up prompt receiving module configured to receive a wake-up prompt, an interactive mode activating module configured to activate an interactive mode according to the wake-up prompt, a prompt identification displaying module configured to display a dialog prompt identification in the interactive mode, a vocal request obtaining module configured to obtain a vocal request, wherein the vocal request is input in response to the dialog prompt identification, and a requested content displaying module configured to display a requested content according to the vocal request.
  • the device further includes an interaction scenario determination module configured to determine whether a content of the wake-up prompt is associated with a preset interaction scenario, and to activate the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
  • the device further includes a requirement determination module configured to determine whether a content of the vocal request is associated with a search request in the preset interaction scenario, and to exit the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
  • a voice interaction terminal is provided according to an embodiment of the present application.
  • the functions may be implemented by using hardware or by corresponding software executed by hardware.
  • the hardware or software includes one or more modules corresponding to the functions described above.
  • the voice interaction terminal structurally includes a processor and a memory, wherein the memory is configured to store programs which support the voice interaction terminal in executing the voice interaction method in the first aspect.
  • the processor is configured to execute the programs stored in the memory.
  • the voice interaction terminal may further include a communication interface through which the voice interaction terminal communicates with other devices or communication networks.
  • a non-transitory computer readable storage medium for storing computer software instructions used for a voice interaction device.
  • the computer readable storage medium can include programs involved in executing the voice interaction method described above in the first aspect.
  • One of the above technical solutions has the following advantages or beneficial effects:
  • a user can continuously provide vocal requests in an interactive mode, without waking up the interactive mode repeatedly, thereby improving user experience.
  • FIG. 1 is a flowchart showing a voice interaction method according to an embodiment.
  • FIG. 2 is a schematic diagram showing another voice interaction method according to an embodiment.
  • FIG. 3 is a block diagram showing a voice interaction device according to an embodiment.
  • FIG. 4 is a block diagram showing another voice interaction device according to an embodiment.
  • FIG. 5 is a schematic diagram showing a voice interaction terminal according to an embodiment.
  • a voice interaction method may include receiving a wake-up prompt at S 10 , activating an interactive mode according to the wake-up prompt at S 20 , displaying a dialog prompt identification in the interactive mode at S 30 , obtaining a vocal request, wherein the vocal request is input in response to the dialog prompt identification at S 40 , and displaying a requested content according to the vocal request at S 50 .
  • This embodiment is applicable to smart home appliances, such as smart TVs and smart air conditioners, and the like.
  • a wake-up prompt for example, “Xiaodu, Xiaodu, turn on the TV” is received.
  • the content of the wake-up prompt may be parsed. If the parsing results are messy codes, which means no clear content of a wake-up prompt is obtained, the interactive mode cannot be activated.
  • a user is prompted to speak out again a wake-up word for waking up the smart home appliance.
  • the way of prompting to re-obtain a wake-up prompt can also be adaptively designed according to hardware of a smart home appliance. Taking a smart TV as an example again, if an indicator light turns blue and flickers, with its brightness being gradually weakened to none, it is a prompt that a wake-up prompt should be re-obtained so that the smart TV may be waken up.
  • the interactive mode may be activated, and an interface of entering the interactive mode may be displayed on the TV screen.
  • This interface can be adaptively designed as needed.
  • a dialog prompt identification is displayed then in the interactive mode, and its function is to remind a user to notice dialogue timer, to provide a requested content, to provide a search start prompt word and the like.
  • the dialog prompt identification can be implemented in various manners and can be adaptively designed according to requirements. Furthermore, the designed position of the dialog prompt identification can also be adaptively adjusted, all of which fall into the protection scope of the present implementation.
  • the dialog prompt identification is designed as a dynamic circle displayed in the interface, which represents dialogue timer for a user.
  • the dialog prompt identification may also be an animation designed as two cartoon figures interacting with a TV, which indicates that a user is prompted to provide a requested content.
  • the dialog prompt identification may further be designed as a trumpet-shaped logo with keywords such as “Xiaodu”, which reminds a user to speak out the key words before starting a search.
  • a user can interact with a smart home appliance by providing vocal requests.
  • a vocal request provided by a user is “Xiaodu, Xiaodu, I want to watch a movie”.
  • a voice recognition is performed, and the keyword “movie” may be obtained.
  • the smart TV can perform a search by using the keyword “movie” or keywords which are related to “movie”, such as “hot movie”.
  • the search results of searching for “hot movie” are displayed in the interface, and a dialog prompt identification may also be displayed at the same time. If a further search is not required, the interactive mode is directly exited. If a further search is required, another vocal request can be continuously provided, until the requested content is found out, and then the interactive mode can be exited automatically.
  • the voice interaction method according to the embodiment includes, but is not limited to, a method for interacting with a smart TV, it can also be applied to other smart home appliances such as a smart air conditioner.
  • the process of interacting therewith is similar to the method mentioned above.
  • no further details are provided herein again, all of which fall into the protection scope of this implementation.
  • the method may further include displaying the dialog prompt identification again in the interactive mode, obtaining an updated vocal request, and displaying an updated requested content according to the updated vocal request.
  • a user can provide a further vocal request after seeing a dialog prompt identification.
  • the vocal request provided by a user can be “Hong Kong movie”.
  • a voice recognition is performed, and the keyword “Hong Kong” may be obtained.
  • the smart TV can perform a search by using the keyword “Hong Kong” or keywords which are related to “Hong Kong”.
  • the search results of searching for “Hong Kong, hot movie” are then displayed in the interface of the smart TV, and a dialog prompt identification is displayed again.
  • a vocal request of “gangster movie” is further provided by the user.
  • a voice recognition is performed, and the keyword “gangster” may be obtained.
  • the smart TV can perform a search by using the keyword “gangster” or keywords which are related to “gangster”, such as “police”, “gang” or “bandit”.
  • the searching results of searching for “Hong Kong, hot, gangster movie” are then displayed in the interface of the smart TV, a dialog prompt identification is displayed again.
  • a vocal request of “performed by Liu XX” is further provided by the user.
  • a voice recognition is performed, and the keyword “Liu XX” may be obtained.
  • the smart TV can perform a search by using the keyword “Liu XX”.
  • the searching results of searching for “Hong Kong, hot, gangster movie, Liu XX” are then displayed in the interface of the smart TV, a dialog prompt identification is displayed again.
  • a vocal request of “next page” is provided by the user.
  • a voice recognition is performed.
  • the next page of the searching results is then displayed in the interface of the smart TV, and a dialog prompt identification is displayed at the same time again.
  • a further vocal request of “play the first one” is provided by the user, the corresponding program is played, and the interactive mode is exited automatically.
  • the dialog prompt identification may include a search prompt indicator
  • the search prompt indicator may include a general search start prompt word and a preset dialogue timer.
  • the general search start prompt word includes a search start prompt word, such as “Xiaodu, Xiaodu”.
  • the search start prompt word can be displayed all the time during the entire searching process or can be displayed only at the beginning of the searching process.
  • the search prompt indicator may also include a preset dialogue timer, such as a time progress bar. A vocal request should be provided by a user within the preset dialogue timer. If no vocal request is provided by a user within the preset dialogue timer, the interactive mode is exited. Alternatively, if the search prompt indicator disappears, the current interactive mode is exited automatically, thereby avoiding mis-operation.
  • the duration of the dialogue timer may be set to 1 minute or several minutes in advance, for example. An adaptive adjustment of the duration of the preset dialogue timer may be made according to different product types, all of which fall into the protection scope of this implementation.
  • the dialog prompt identification may include a content guide, and the content guide is used to prompt a user to provide a requested content relevant with the content guide.
  • the content guide is used to prompt a user to provide a relevant requested content after obtaining a vocal request each time during an interaction process.
  • a smart TV when a number of movies regarding “Hong Kong, hot, gangster, Liu XX” are searched out and displayed in the interface of the smart TV, the top ranked hot movie “Infernal Affairs” may be displayed in the content guide, which is used to prompt the user to directly provide a vocal request of “Xiaodu, Xiaodu, I want to see Infernal Affairs.”
  • the user may also provide a vocal request of “Xiaodu, Xiaodu, I want to watch the third film, Chill” according to the content displayed on the current page.
  • the method may further include determining whether a content of the wake-up prompt is associated with a preset interaction scenario, and activating the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
  • the wake-up words for different kinds of smart home appliances need to be associated with preset interaction scenarios.
  • the wake-up word for a smart TV may be “turn on the TV”
  • the wake-up word for a smart air conditioner may be “turn on the air conditioner”.
  • using the wake-up word “turn on the air conditioner” will fail to be associated with the preset interaction scenario of the smart TV. Therefore, to avoid startup errors, it is important and necessary to determine whether a content of the wake-up prompt is associated with a preset interaction scenario.
  • the method may further include determining whether a content of the vocal request is associated with a search request in the preset interaction scenario, and exiting the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
  • relevant search requirements associated with preset interaction scenarios may be stored.
  • the stored relevant search requirements may be “Xiaodu, Xiaodu, I want to watch a movie”, “Hong Kong movie”, “I want to watch news broadcast” and the like.
  • the stored relevant search requirements may be “Xiaodu, Xiaodu, hot air” and the like.
  • a voice interaction device may include a wake-up prompt receiving module 10 configured to receive a wake-up prompt, an interactive mode activating module 20 configured to activate an interactive mode according to the wake-up prompt, a prompt identification displaying module 30 configured to display a dialog prompt identification in the interactive mode, a vocal request obtaining module 40 configured to obtain a vocal request, wherein the vocal request is input in response to the dialog prompt identification, and a requested content displaying module 50 configured to display a requested content according to the vocal request.
  • a wake-up prompt receiving module 10 configured to receive a wake-up prompt
  • an interactive mode activating module 20 configured to activate an interactive mode according to the wake-up prompt
  • a prompt identification displaying module 30 configured to display a dialog prompt identification in the interactive mode
  • a vocal request obtaining module 40 configured to obtain a vocal request, wherein the vocal request is input in response to the dialog prompt identification
  • a requested content displaying module 50 configured to display a requested content according to the vocal request.
  • the device further includes an interaction scenario determination module 21 configured to determine whether a content of the wake-up prompt is associated with a preset interaction scenario, and to activate the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
  • the device further includes a requirement determination module 60 configured to determine whether a content of the vocal request is associated with a search request in the preset interaction scenario, and to exit the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
  • a requirement determination module 60 configured to determine whether a content of the vocal request is associated with a search request in the preset interaction scenario, and to exit the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
  • a user in a variety of interaction scenarios, such as in a scenario of interacting with a smart home appliance, a user can continuously provide vocal requests in an interactive mode, without waking up the interactive mode repeatedly, thereby improving user experience.
  • a voice interaction terminal is provided according to an embodiment of the present application.
  • the voice interaction terminal includes a memory 400 , a processor 500 , wherein a computer program that can run on the processor 500 is stored in the memory 400 .
  • the processor 500 executes the computer program to implement the voice interaction method according to the foregoing embodiments.
  • the number of either the memory 400 or the processor 500 may be one or more.
  • the terminal may further include a communication interface 600 configured to enable the memory 400 and the processor 500 to communicate with an external device.
  • the memory 400 may include a high-speed RAM memory and may also include a non-volatile memory, such as at least one magnetic disk memory.
  • the bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnected (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like.
  • ISA Industry Standard Architecture
  • PCI Peripheral Component Interconnected
  • EISA Extended Industry Standard Architecture
  • the bus may be categorized into an address bus, a data bus, a control bus, and the like. For ease of illustration, only one bold line is shown in FIG. 5 to represent the bus, but it does not mean that there is only one bus or one type of bus.
  • the memory 400 , the processor 500 , and the communication interface 600 may implement mutual communication through an internal interface.
  • a computer-readable storage medium having computer programs stored thereon. When executed by a processor, the programs implement the voice interaction method described in the Embodiment I.
  • the description of the terms “one embodiment,” “some embodiments,” “an example,” “a specific example,” or “some examples” and the like means the specific features, structures, materials, or characteristics described in connection with the embodiment or example are included in at least one embodiment or example of the present application. Furthermore, the specific features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more of the embodiments or examples. In addition, different embodiments or examples described in this specification and features of different embodiments or examples may be incorporated and combined by those skilled in the art without mutual contradiction.
  • first and second are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of indicated technical features. Thus, features defining “first” and “second” may explicitly or implicitly include at least one of the features. In the description of the present application, “a plurality of” means two or more, unless expressly limited otherwise.
  • Logic and/or steps, which are represented in the flowcharts or otherwise described herein, for example, may be thought of as a sequencing listing of executable instructions for implementing logic functions, which may be embodied in any computer-readable medium, for use by or in connection with an instruction execution system, device, or device (such as a computer-based system, a processor-included system, or other system that fetch instructions from an instruction execution system, device, or device and execute the instructions).
  • a “computer-readable medium” may be any device that may contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, device, or device.
  • the computer-readable media include the following: electrical connections (electronic devices) having one or more wires, a portable computer disk cartridge (magnetic device), random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber devices, and portable read only memory (CDROM).
  • the computer-readable medium may even be paper or other suitable medium upon which the program may be printed, as it may be read, for example, by optical scanning of the paper or other medium, followed by editing, interpretation or, where appropriate, process otherwise to electronically obtain the program, which is then stored in a computer memory.
  • each of the functional units in the embodiments of the present application may be integrated in one processing module, or each of the units may exist alone physically, or two or more units may be integrated in one module.
  • the above-mentioned integrated module may be implemented in the form of hardware or in the form of software functional module.
  • the integrated module When the integrated module is implemented in the form of a software functional module and is sold or used as an independent product, the integrated module may also be stored in a computer-readable storage medium.
  • the storage medium may be a read only memory, a magnetic disk, an optical disk, or the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

A voice interaction method, device, and terminal. The method includes receiving a wake-up prompt, activating an interactive mode according to the wake-up prompt, displaying a dialog prompt identification in the interactive mode, obtaining a vocal request, wherein the vocal request is input in response to the dialog prompt identification, and displaying a requested content according to the vocal request. In a variety of interaction scenarios, such as in a scenario of interacting with a smart home appliance, a user can continuously provide vocal requests in an interactive mode, without waking up the interactive mode repeatedly, thereby improving user experience.

Description

CROSS-REFERENCE TO RELATED APPLICATION
This application claims priority to Chinese Patent Application No. 201811519317.0, filed on Dec. 12, 2018, which is hereby incorporated by reference in its entirety.
TECHNICAL FIELD
The present application relates to the field of intelligent interaction technology, and in particular, to a voice interaction method, device and terminal.
BACKGROUNDS
In the field of smart home appliances such as smart televisions (TV), a near-field voice interactive mode is usually utilized as a way of interacting, for example, a Bluetooth voice interactor is utilized to interact with a smart TV. Although certain convenience is provided to a user in this way, it is still required to manually perform a Bluetooth connection, which means that user's hands cannot be truly liberated. Currently, an optimized interaction method is to control smart home appliances by using a far-field interactive mode, which is also suitable for smart TVs or far-field TV box devices.
However, in the current voice interaction technology, a user has to provide a wake-up prompt first, then speak out a search requirement. Especially when searching for video resources for playing, a user usually has to provide supplemental information multiple times to find out the content he wants to watch, and the user has to speak out wake-up words repeatedly every time he interacts, which is very inconvenient and leads to low search efficiency.
SUMMARY
A voice interaction method, device and terminal are provided according to embodiments of the present application, so as to at least solve the above technical problems in the existing technology.
In a first aspect, a voice interaction method is provided according to an embodiment of the present application. The method includes receiving a wake-up prompt, activating an interactive mode according to the wake-up prompt, displaying a dialog prompt identification in the interactive mode, obtaining a vocal request, wherein the vocal request is input in response to the dialog prompt identification, and displaying a requested content according to the vocal request.
In an implementation, after displaying the requested content, the method further includes displaying the dialog prompt identification again in the interactive mode, obtaining an updated vocal request, and displaying an updated requested content according to the updated vocal request.
In an implementation, the dialog prompt identification includes a search prompt indicator, and the search prompt indicator includes a general search start prompt word and a preset dialogue timer.
In an implementation, the dialog prompt identification includes a content guide, and the content guide is used to prompt a user to provide requested content relevant with the content guide.
In an implementation, prior to activating the interactive mode according to the wake-up prompt, the method further includes determining whether a content of the wake-up prompt is associated with a preset interaction scenario, and activating the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
In an implementation, after obtaining the vocal request, the method further includes determining whether a content of the vocal request is associated with a search request in the preset interaction scenario, and exiting the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
In a second aspect, a voice interaction device is provided according to an embodiment of the present application. The device includes a wake-up prompt receiving module configured to receive a wake-up prompt, an interactive mode activating module configured to activate an interactive mode according to the wake-up prompt, a prompt identification displaying module configured to display a dialog prompt identification in the interactive mode, a vocal request obtaining module configured to obtain a vocal request, wherein the vocal request is input in response to the dialog prompt identification, and a requested content displaying module configured to display a requested content according to the vocal request.
In an implementation, the device further includes an interaction scenario determination module configured to determine whether a content of the wake-up prompt is associated with a preset interaction scenario, and to activate the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
In an implementation, the device further includes a requirement determination module configured to determine whether a content of the vocal request is associated with a search request in the preset interaction scenario, and to exit the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
In a third aspect, a voice interaction terminal is provided according to an embodiment of the present application. The functions may be implemented by using hardware or by corresponding software executed by hardware. The hardware or software includes one or more modules corresponding to the functions described above.
In a possible design, the voice interaction terminal structurally includes a processor and a memory, wherein the memory is configured to store programs which support the voice interaction terminal in executing the voice interaction method in the first aspect. The processor is configured to execute the programs stored in the memory. The voice interaction terminal may further include a communication interface through which the voice interaction terminal communicates with other devices or communication networks.
In a fourth aspect, a non-transitory computer readable storage medium for storing computer software instructions used for a voice interaction device is provided. The computer readable storage medium can include programs involved in executing the voice interaction method described above in the first aspect.
One of the above technical solutions has the following advantages or beneficial effects: In a variety of interaction scenarios, such as in a scenario of interacting with a smart home appliance, a user can continuously provide vocal requests in an interactive mode, without waking up the interactive mode repeatedly, thereby improving user experience.
The above summary is provided only for illustration and is not intended to be limiting in any way. In addition to the illustrative aspects, embodiments, and features described above, further aspects, embodiments, and features of the present application will be readily understood from the following detailed description with reference to the accompanying drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
In the drawings, unless otherwise specified, identical or similar parts or elements are denoted by identical reference numerals throughout the drawings. The drawings are not necessarily drawn to scale. It should be understood these drawings merely illustrate some embodiments of the present application and should not be construed as limiting the scope of the present application.
FIG. 1 is a flowchart showing a voice interaction method according to an embodiment.
FIG. 2 is a schematic diagram showing another voice interaction method according to an embodiment.
FIG. 3 is a block diagram showing a voice interaction device according to an embodiment.
FIG. 4 is a block diagram showing another voice interaction device according to an embodiment.
FIG. 5 is a schematic diagram showing a voice interaction terminal according to an embodiment.
DETAILED DESCRIPTION OF THE EMBODIMENTS
Hereafter, only certain exemplary embodiments are briefly described. As can be appreciated by those skilled in the art, the described embodiments may be modified in different ways, without departing from the spirit or scope of the present application. Accordingly, the drawings and the description should be considered as illustrative in nature instead of being restrictive.
Embodiment I
As shown in FIG. 1, in an implementation, a voice interaction method may include receiving a wake-up prompt at S10, activating an interactive mode according to the wake-up prompt at S20, displaying a dialog prompt identification in the interactive mode at S30, obtaining a vocal request, wherein the vocal request is input in response to the dialog prompt identification at S40, and displaying a requested content according to the vocal request at S50.
This embodiment is applicable to smart home appliances, such as smart TVs and smart air conditioners, and the like. Taking a smart TV as an example for illustration, firstly, a wake-up prompt, for example, “Xiaodu, Xiaodu, turn on the TV” is received. Then, the content of the wake-up prompt may be parsed. If the parsing results are messy codes, which means no clear content of a wake-up prompt is obtained, the interactive mode cannot be activated. Then, a user is prompted to speak out again a wake-up word for waking up the smart home appliance. The way of prompting to re-obtain a wake-up prompt can also be adaptively designed according to hardware of a smart home appliance. Taking a smart TV as an example again, if an indicator light turns blue and flickers, with its brightness being gradually weakened to none, it is a prompt that a wake-up prompt should be re-obtained so that the smart TV may be waken up.
When it is determined that the parsing result of a wake-up prompt includes clear wake-up words, the interactive mode may be activated, and an interface of entering the interactive mode may be displayed on the TV screen. This interface can be adaptively designed as needed. A dialog prompt identification is displayed then in the interactive mode, and its function is to remind a user to notice dialogue timer, to provide a requested content, to provide a search start prompt word and the like. The dialog prompt identification can be implemented in various manners and can be adaptively designed according to requirements. Furthermore, the designed position of the dialog prompt identification can also be adaptively adjusted, all of which fall into the protection scope of the present implementation. For example, the dialog prompt identification is designed as a dynamic circle displayed in the interface, which represents dialogue timer for a user. The dialog prompt identification may also be an animation designed as two cartoon figures interacting with a TV, which indicates that a user is prompted to provide a requested content. The dialog prompt identification may further be designed as a trumpet-shaped logo with keywords such as “Xiaodu”, which reminds a user to speak out the key words before starting a search.
After seeing the dialog prompt identification, a user can interact with a smart home appliance by providing vocal requests. For example, a vocal request provided by a user is “Xiaodu, Xiaodu, I want to watch a movie”. After receiving the vocal request, a voice recognition is performed, and the keyword “movie” may be obtained. In this case, the smart TV can perform a search by using the keyword “movie” or keywords which are related to “movie”, such as “hot movie”. After the search is completed, the search results of searching for “hot movie” are displayed in the interface, and a dialog prompt identification may also be displayed at the same time. If a further search is not required, the interactive mode is directly exited. If a further search is required, another vocal request can be continuously provided, until the requested content is found out, and then the interactive mode can be exited automatically.
Certainly, the voice interaction method according to the embodiment includes, but is not limited to, a method for interacting with a smart TV, it can also be applied to other smart home appliances such as a smart air conditioner. The process of interacting therewith is similar to the method mentioned above. Thus, no further details are provided herein again, all of which fall into the protection scope of this implementation.
In an implementation, after displaying the requested content, the method may further include displaying the dialog prompt identification again in the interactive mode, obtaining an updated vocal request, and displaying an updated requested content according to the updated vocal request.
In an example, after the requested content with regard to the keyword “movie” is displayed, a user can provide a further vocal request after seeing a dialog prompt identification. For example, the vocal request provided by a user can be “Hong Kong movie”. After receiving the vocal request of “Hong Kong movie”, a voice recognition is performed, and the keyword “Hong Kong” may be obtained. In this case, the smart TV can perform a search by using the keyword “Hong Kong” or keywords which are related to “Hong Kong”. After the search is completed, the search results of searching for “Hong Kong, hot movie” are then displayed in the interface of the smart TV, and a dialog prompt identification is displayed again.
For example, a vocal request of “gangster movie” is further provided by the user. After receiving the vocal request, a voice recognition is performed, and the keyword “gangster” may be obtained. In this case, the smart TV can perform a search by using the keyword “gangster” or keywords which are related to “gangster”, such as “police”, “gang” or “bandit”. After the search is completed, the searching results of searching for “Hong Kong, hot, gangster movie” are then displayed in the interface of the smart TV, a dialog prompt identification is displayed again.
Continuously, for example, a vocal request of “performed by Liu XX” is further provided by the user. After receiving the vocal request, a voice recognition is performed, and the keyword “Liu XX” may be obtained. In this case, the smart TV can perform a search by using the keyword “Liu XX”. After the search is completed, the searching results of searching for “Hong Kong, hot, gangster movie, Liu XX” are then displayed in the interface of the smart TV, a dialog prompt identification is displayed again.
Continuously, for another example, a vocal request of “next page” is provided by the user. After receiving the vocal request, a voice recognition is performed. In this case, according to the recognition results, the next page of the searching results is then displayed in the interface of the smart TV, and a dialog prompt identification is displayed at the same time again. After a further vocal request of “play the first one” is provided by the user, the corresponding program is played, and the interactive mode is exited automatically.
In an implementation, the dialog prompt identification may include a search prompt indicator, and the search prompt indicator may include a general search start prompt word and a preset dialogue timer.
The general search start prompt word includes a search start prompt word, such as “Xiaodu, Xiaodu”. The search start prompt word can be displayed all the time during the entire searching process or can be displayed only at the beginning of the searching process. The search prompt indicator may also include a preset dialogue timer, such as a time progress bar. A vocal request should be provided by a user within the preset dialogue timer. If no vocal request is provided by a user within the preset dialogue timer, the interactive mode is exited. Alternatively, if the search prompt indicator disappears, the current interactive mode is exited automatically, thereby avoiding mis-operation. The duration of the dialogue timer may be set to 1 minute or several minutes in advance, for example. An adaptive adjustment of the duration of the preset dialogue timer may be made according to different product types, all of which fall into the protection scope of this implementation.
In an implementation, the dialog prompt identification may include a content guide, and the content guide is used to prompt a user to provide a requested content relevant with the content guide.
The content guide is used to prompt a user to provide a relevant requested content after obtaining a vocal request each time during an interaction process. Taking a smart TV as an example, when a number of movies regarding “Hong Kong, hot, gangster, Liu XX” are searched out and displayed in the interface of the smart TV, the top ranked hot movie “Infernal Affairs” may be displayed in the content guide, which is used to prompt the user to directly provide a vocal request of “Xiaodu, Xiaodu, I want to see Infernal Affairs.” Alternatively, the user may also provide a vocal request of “Xiaodu, Xiaodu, I want to watch the third film, Chill” according to the content displayed on the current page.
As shown in FIG. 2, in an implementation, prior to S20, the method may further include determining whether a content of the wake-up prompt is associated with a preset interaction scenario, and activating the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
The wake-up words for different kinds of smart home appliances need to be associated with preset interaction scenarios. For example, the wake-up word for a smart TV may be “turn on the TV”, and the wake-up word for a smart air conditioner may be “turn on the air conditioner”. For a smart TV, using the wake-up word “turn on the air conditioner” will fail to be associated with the preset interaction scenario of the smart TV. Therefore, to avoid startup errors, it is important and necessary to determine whether a content of the wake-up prompt is associated with a preset interaction scenario.
As shown in FIG. 2, in an implementation, after S40, the method may further include determining whether a content of the vocal request is associated with a search request in the preset interaction scenario, and exiting the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
In an example, in the interactive mode of various scenarios, relevant search requirements associated with preset interaction scenarios may be stored. For example, in a preset interaction scenario of a smart TV, the stored relevant search requirements may be “Xiaodu, Xiaodu, I want to watch a movie”, “Hong Kong movie”, “I want to watch news broadcast” and the like. In a preset interaction scenarios of a smart air conditioner, the stored relevant search requirements may be “Xiaodu, Xiaodu, hot air” and the like. If, in the interactive mode of a smart TV, a user says search requirements which are not associated with the search requests in the preset interaction scenario of the smart TV, such as “Please set the temperature to 10 degrees” or “Please turn to the intermediate wind”, the current interactive mode is exited automatically.
Embodiment II
As shown in FIG. 3, in an implementation, a voice interaction device may include a wake-up prompt receiving module 10 configured to receive a wake-up prompt, an interactive mode activating module 20 configured to activate an interactive mode according to the wake-up prompt, a prompt identification displaying module 30 configured to display a dialog prompt identification in the interactive mode, a vocal request obtaining module 40 configured to obtain a vocal request, wherein the vocal request is input in response to the dialog prompt identification, and a requested content displaying module 50 configured to display a requested content according to the vocal request.
As shown in FIG. 4, in an embodiment, the device further includes an interaction scenario determination module 21 configured to determine whether a content of the wake-up prompt is associated with a preset interaction scenario, and to activate the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
As shown in FIG. 4, in an embodiment, the device further includes a requirement determination module 60 configured to determine whether a content of the vocal request is associated with a search request in the preset interaction scenario, and to exit the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
According to the embodiments, in a variety of interaction scenarios, such as in a scenario of interacting with a smart home appliance, a user can continuously provide vocal requests in an interactive mode, without waking up the interactive mode repeatedly, thereby improving user experience.
Embodiment III
As shown in FIG. 5, a voice interaction terminal is provided according to an embodiment of the present application. The voice interaction terminal includes a memory 400, a processor 500, wherein a computer program that can run on the processor 500 is stored in the memory 400. The processor 500 executes the computer program to implement the voice interaction method according to the foregoing embodiments. The number of either the memory 400 or the processor 500 may be one or more. The terminal may further include a communication interface 600 configured to enable the memory 400 and the processor 500 to communicate with an external device.
The memory 400 may include a high-speed RAM memory and may also include a non-volatile memory, such as at least one magnetic disk memory.
If the memory 400, the processor 500, and the communication interface 600 are implemented independently, the memory 400, the processor 500, and the communication interface 600 may be connected to each other via a bus to realize mutual communication. The bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnected (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The bus may be categorized into an address bus, a data bus, a control bus, and the like. For ease of illustration, only one bold line is shown in FIG. 5 to represent the bus, but it does not mean that there is only one bus or one type of bus.
Optionally, in a specific implementation, if the memory 400, the processor 500, and the communication interface 600 are integrated on one chip, the memory 400, the processor 500, and the communication interface 600 may implement mutual communication through an internal interface.
Embodiment IV
According to an embodiment, it is provided a computer-readable storage medium having computer programs stored thereon. When executed by a processor, the programs implement the voice interaction method described in the Embodiment I.
In the description of the specification, the description of the terms “one embodiment,” “some embodiments,” “an example,” “a specific example,” or “some examples” and the like means the specific features, structures, materials, or characteristics described in connection with the embodiment or example are included in at least one embodiment or example of the present application. Furthermore, the specific features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more of the embodiments or examples. In addition, different embodiments or examples described in this specification and features of different embodiments or examples may be incorporated and combined by those skilled in the art without mutual contradiction.
In addition, the terms “first” and “second” are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of indicated technical features. Thus, features defining “first” and “second” may explicitly or implicitly include at least one of the features. In the description of the present application, “a plurality of” means two or more, unless expressly limited otherwise.
Any process or method descriptions described in flowcharts or otherwise herein may be understood as representing modules, segments or portions of code that include one or more executable instructions for implementing the steps of a particular logic function or process. The scope of the preferred embodiments of the present application includes additional implementations where the functions may not be performed in the order shown or discussed, including according to the functions involved, in substantially simultaneous or in reverse order, which should be understood by those skilled in the art to which the embodiment of the present application belongs.
Logic and/or steps, which are represented in the flowcharts or otherwise described herein, for example, may be thought of as a sequencing listing of executable instructions for implementing logic functions, which may be embodied in any computer-readable medium, for use by or in connection with an instruction execution system, device, or device (such as a computer-based system, a processor-included system, or other system that fetch instructions from an instruction execution system, device, or device and execute the instructions). For the purposes of this specification, a “computer-readable medium” may be any device that may contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, device, or device. More specific examples (not a non-exhaustive list) of the computer-readable media include the following: electrical connections (electronic devices) having one or more wires, a portable computer disk cartridge (magnetic device), random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber devices, and portable read only memory (CDROM). In addition, the computer-readable medium may even be paper or other suitable medium upon which the program may be printed, as it may be read, for example, by optical scanning of the paper or other medium, followed by editing, interpretation or, where appropriate, process otherwise to electronically obtain the program, which is then stored in a computer memory.
It should be understood various portions of the present application may be implemented by hardware, software, firmware, or a combination thereof. In the above embodiments, multiple steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, they may be implemented using any one or a combination of the following techniques well known in the art: discrete logic circuits having a logic gate circuit for implementing logic functions on data signals, application specific integrated circuits with suitable combinational logic gate circuits, programmable gate arrays (PGA), field programmable gate arrays (FPGAs), and the like.
Those skilled in the art may understand that all or some of the steps carried in the methods in the foregoing embodiments may be implemented by a program instructing relevant hardware. The program may be stored in a computer-readable storage medium, and when executed, one of the steps of the method embodiment or a combination thereof is included.
In addition, each of the functional units in the embodiments of the present application may be integrated in one processing module, or each of the units may exist alone physically, or two or more units may be integrated in one module. The above-mentioned integrated module may be implemented in the form of hardware or in the form of software functional module. When the integrated module is implemented in the form of a software functional module and is sold or used as an independent product, the integrated module may also be stored in a computer-readable storage medium. The storage medium may be a read only memory, a magnetic disk, an optical disk, or the like.
The foregoing descriptions are merely specific embodiments of the present application, but not intended to limit the protection scope of the present application. Those skilled in the art may easily conceive of various changes or modifications within the technical scope disclosed herein, all these should be covered within the protection scope of the present application. Therefore, the protection scope of the present application should be subject to the protection scope of the claims.

Claims (13)

What is claimed is:
1. A voice interaction method comprising:
receiving a wake-up voice;
activating an interactive mode of a smart home appliance based on content of the wake-up voice;
displaying a dialog prompt identification in the interactive mode of the smart home appliance;
obtaining a vocal request, wherein the vocal request is input in response to the dialog prompt identification;
displaying requested content according to the vocal request; and
after displaying the requested content and before deactivating the interactive mode of the smart home appliance:
displaying the dialog prompt identification again in the interactive mode;
obtaining an updated vocal request; and
displaying an updated requested content according to the updated vocal request.
2. The voice interaction method according to claim 1, wherein the dialog prompt identification comprises a search prompt indicator, and the search prompt indicator comprises a general search start prompt word and a preset dialogue timer.
3. The voice interaction method according to claim 1, wherein the dialog prompt identification comprises a content guide, and the content guide is used to prompt a user to provide a requested content relevant with the content guide.
4. The voice interaction method according to claim 1, wherein prior to activating the interactive mode based on the content of the wake-up voice, the method further comprises:
determining whether a content of the wake-up voice is associated with a preset interaction scenario; and
activating the interactive mode according to the wake-up voice when the content of the wake-up voice is associated with the preset interaction scenario.
5. The voice interaction method according to claim 4, wherein after obtaining the vocal request, the method further comprises:
determining whether a content of the vocal request is associated with a search request in the preset interaction scenario; and
exiting the interactive mode when the content of the vocal request is not associated with the search request in the preset interaction scenario.
6. A voice interaction device comprising:
one or more processors; and
a memory for storing one or more programs, wherein the one or more programs are executed by the one or more processors to enable the one or more processors to:
receive a wake-up voice;
activate an interactive mode of a smart home appliance according to the wake-up voice;
display a dialog prompt identification in the interactive mode of the smart home appliance;
obtain a vocal request, wherein the vocal request is input in response to the dialog prompt identification;
display a requested content according to the vocal request; and
after displaying the requested content and before deactivating the interactive mode of the smart home appliance:
display the dialog prompt identification again in the interactive mode;
obtain an updated vocal request; and
display an updated requested content according to the updated vocal request.
7. The voice interaction device according to claim 6, wherein the one or more programs are executed by the one or more processors to enable the one or more processors to:
determine whether a content of the wake-up voice is associated with a preset interaction scenario, and activate the interactive mode according to the wake-up voice, in a case that the content of the wake-up voice is associated with the preset interaction scenario.
8. The voice interaction device according to claim 7, wherein the one or more programs are executed by the one or more processors to enable the one or more processors to:
determine whether a content of the vocal request is associated with a search request in the preset interaction scenario, and exit the interactive mode, in a case that the content of the vocal request is not is associated with the search request in the preset interaction scenario.
9. A non-transitory computer readable storage medium, in which a computer program is stored, wherein the computer program, when executed by a processor, causes a smart home appliance to:
receive a wake-up voice;
activate an interactive mode of the smart home appliance based on content of the wake-up voice;
display a dialog prompt identification in the interactive mode of the smart home appliance;
obtain a vocal request, wherein the vocal request is input in response to the dialog prompt identification;
display requested content according to the vocal request; and
after displaying the requested content and before deactivating the interactive mode of the smart home appliance:
display the dialog prompt identification again in the interactive mode;
obtain an updated vocal request; and
display an updated requested content according to the updated vocal request.
10. The computer readable storage medium of claim 9, wherein the dialog prompt identification comprises a search prompt indicator, and the search prompt indicator comprises a general search start prompt word and a preset dialogue timer.
11. The computer readable storage medium of claim 9, wherein the dialog prompt identification comprises a content guide, and the content guide is used to prompt a user to provide a requested content relevant with the content guide.
12. The computer readable storage medium of claim 9, wherein prior to activating the interactive mode based on the content of the wake-up voice, the computer program causes the smart home appliance to:
determine whether the content of the wake-up voice is associated with a preset interaction scenario; and
activate the interactive mode according to the wake-up voice when the content of the wake-up voice is associated with the preset interaction scenario.
13. The computer readable storage medium of claim 12, wherein after obtaining the vocal request, the computer program causes the smart home appliance to:
determine whether a content of the vocal request is associated with a search request in the preset interaction scenario; and
exit the interactive mode when the content of the vocal request is not associated with the search request in the preset interaction scenario.
US16/563,488 2018-12-12 2019-09-06 Voice interaction method, device and terminal Active 2040-02-14 US11217256B2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811519317.0A CN109410944B (en) 2018-12-12 2018-12-12 Voice interaction method, device and terminal
CN201811519317.0 2018-12-12

Publications (2)

Publication Number Publication Date
US20200194007A1 US20200194007A1 (en) 2020-06-18
US11217256B2 true US11217256B2 (en) 2022-01-04

Family

ID=65458732

Family Applications (1)

Application Number Title Priority Date Filing Date
US16/563,488 Active 2040-02-14 US11217256B2 (en) 2018-12-12 2019-09-06 Voice interaction method, device and terminal

Country Status (2)

Country Link
US (1) US11217256B2 (en)
CN (1) CN109410944B (en)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11676582B2 (en) * 2019-02-27 2023-06-13 Google Llc Detecting conversations with computing devices
CN112334979B (en) * 2019-02-27 2024-07-12 谷歌有限责任公司 Detecting ongoing conversations by computing device
CN110619873A (en) * 2019-08-16 2019-12-27 北京小米移动软件有限公司 Audio processing method, device and storage medium
CN110751948A (en) * 2019-10-18 2020-02-04 珠海格力电器股份有限公司 Voice recognition method, device, storage medium and voice equipment
CN110689891A (en) * 2019-11-20 2020-01-14 广东奥园奥买家电子商务有限公司 Voice interaction method and device based on public display device
CN111192581A (en) * 2020-01-07 2020-05-22 百度在线网络技术(北京)有限公司 Voice wake-up method, device and storage medium
CN113641408B (en) * 2020-04-23 2024-12-13 百度在线网络技术(北京)有限公司 Method and device for generating quick entry
US12547372B2 (en) 2021-03-15 2026-02-10 VIDAA USA, Inc. Display apparatus and display method
CN112860331B (en) * 2021-03-19 2023-11-10 Vidaa美国公司 A display device and voice interaction prompt method
CN115775560B (en) * 2021-03-16 2025-05-27 海信视像科技股份有限公司 A wake-up response prompting method and display device
CN113241069B (en) * 2021-04-15 2023-12-12 王维坤 A method to improve the success rate of voice interaction
CN113297359B (en) * 2021-04-23 2023-11-28 阿里巴巴新加坡控股有限公司 Methods and devices for interactive information
CN113301394B (en) * 2021-04-30 2023-07-11 当趣网络科技(杭州)有限公司 Voice control method combined with user grade
CN113990310A (en) * 2021-10-15 2022-01-28 深圳集智数字科技有限公司 Method and device for controlling screen content through voice and electronic equipment
CN115424623B (en) * 2022-03-23 2025-02-21 北京罗克维尔斯科技有限公司 Voice interaction method, device, equipment and computer readable storage medium
CN115240674B (en) * 2022-07-21 2025-06-06 海信视像科技股份有限公司 Wake-up-free voice control method for terminal device, terminal device and server
CN118136016B (en) * 2024-04-10 2025-02-18 广州小鹏汽车科技有限公司 Voice interaction method, server and computer-readable storage medium

Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030078784A1 (en) * 2001-10-03 2003-04-24 Adam Jordan Global speech user interface
US20110022393A1 (en) * 2007-11-12 2011-01-27 Waeller Christoph Multimode user interface of a driver assistance system for inputting and presentation of information
US20130006643A1 (en) * 2010-01-13 2013-01-03 Aram Lindahl Devices and Methods for Identifying a Prompt Corresponding to a Voice Input in a Sequence of Prompts
US20140244269A1 (en) * 2013-02-28 2014-08-28 Sony Mobile Communications Ab Device and method for activating with voice input
US20140278435A1 (en) * 2013-03-12 2014-09-18 Nuance Communications, Inc. Methods and apparatus for detecting a voice command
CN104575504A (en) 2014-12-24 2015-04-29 上海师范大学 Method for personalized television voice wake-up by voiceprint and voice identification
US20150312351A1 (en) * 2014-04-24 2015-10-29 Alcatel Lucent Method, device and system for device trigger in iot
US20160155443A1 (en) * 2014-11-28 2016-06-02 Microsoft Technology Licensing, Llc Device arbitration for listening devices
CN106230689A (en) 2016-07-25 2016-12-14 北京奇虎科技有限公司 Method, device and the server that a kind of voice messaging is mutual
CN107680589A (en) 2017-09-05 2018-02-09 百度在线网络技术(北京)有限公司 Voice messaging exchange method, device and its equipment
CN107885810A (en) 2017-01-24 2018-04-06 问众智能信息科技(北京)有限公司 The method and apparatus that result for vehicle intelligent equipment interactive voice is shown
CN108132805A (en) 2017-12-20 2018-06-08 深圳Tcl新技术有限公司 Voice interactive method, device and computer readable storage medium
CN108170785A (en) 2017-12-26 2018-06-15 深圳Tcl新技术有限公司 Bootstrap technique, device and the computer readable storage medium of terminal searching operation
CN108259981A (en) 2018-04-11 2018-07-06 深圳市茁壮网络股份有限公司 A kind of television channel change control method, mobile terminal and set-top box
CN108366281A (en) 2018-02-05 2018-08-03 山东浪潮商用系统有限公司 A kind of full voice exchange method applied to set-top box

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070267253A1 (en) * 2006-05-04 2007-11-22 Ryan Tauer Method and apparatus for recording books
CN102938864A (en) * 2012-11-27 2013-02-20 四川长虹电器股份有限公司 Method for realizing television channel switching based on customized voice
CN104290097B (en) * 2014-08-19 2016-03-30 白劲实 The social robot system of a kind of learning type intellectual family and method

Patent Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030078784A1 (en) * 2001-10-03 2003-04-24 Adam Jordan Global speech user interface
US20110022393A1 (en) * 2007-11-12 2011-01-27 Waeller Christoph Multimode user interface of a driver assistance system for inputting and presentation of information
US20130006643A1 (en) * 2010-01-13 2013-01-03 Aram Lindahl Devices and Methods for Identifying a Prompt Corresponding to a Voice Input in a Sequence of Prompts
US20140244269A1 (en) * 2013-02-28 2014-08-28 Sony Mobile Communications Ab Device and method for activating with voice input
US20140278435A1 (en) * 2013-03-12 2014-09-18 Nuance Communications, Inc. Methods and apparatus for detecting a voice command
US20150312351A1 (en) * 2014-04-24 2015-10-29 Alcatel Lucent Method, device and system for device trigger in iot
US20160155443A1 (en) * 2014-11-28 2016-06-02 Microsoft Technology Licensing, Llc Device arbitration for listening devices
CN104575504A (en) 2014-12-24 2015-04-29 上海师范大学 Method for personalized television voice wake-up by voiceprint and voice identification
CN106230689A (en) 2016-07-25 2016-12-14 北京奇虎科技有限公司 Method, device and the server that a kind of voice messaging is mutual
CN107885810A (en) 2017-01-24 2018-04-06 问众智能信息科技(北京)有限公司 The method and apparatus that result for vehicle intelligent equipment interactive voice is shown
CN107680589A (en) 2017-09-05 2018-02-09 百度在线网络技术(北京)有限公司 Voice messaging exchange method, device and its equipment
CN108132805A (en) 2017-12-20 2018-06-08 深圳Tcl新技术有限公司 Voice interactive method, device and computer readable storage medium
CN108170785A (en) 2017-12-26 2018-06-15 深圳Tcl新技术有限公司 Bootstrap technique, device and the computer readable storage medium of terminal searching operation
CN108366281A (en) 2018-02-05 2018-08-03 山东浪潮商用系统有限公司 A kind of full voice exchange method applied to set-top box
CN108259981A (en) 2018-04-11 2018-07-06 深圳市茁壮网络股份有限公司 A kind of television channel change control method, mobile terminal and set-top box

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Office Action for Chinese Application No. 201811519317.0 dated Dec. 30, 2019 (13 pages).
Search Report for Chinese Application No. 201811519317.0, dated Dec. 20, 2019 (6 pages).

Also Published As

Publication number Publication date
CN109410944A (en) 2019-03-01
CN109410944B (en) 2020-06-09
US20200194007A1 (en) 2020-06-18

Similar Documents

Publication Publication Date Title
US11217256B2 (en) Voice interaction method, device and terminal
US11086596B2 (en) Electronic device, server and control method thereof
US10783364B2 (en) Method, apparatus and device for waking up voice interaction function based on gesture, and computer readable medium
US20190149872A1 (en) Information exchanging method and device, audio terminal and computer-readable storage medium
CN104170397B (en) A method and computer storage medium for presenting search results on an electronic device
US11205431B2 (en) Method, apparatus and device for presenting state of voice interaction device, and storage medium
US20130073293A1 (en) Electronic device and method for controlling the same
CN109754788B (en) Voice control method, device, equipment and storage medium
CN107277225B (en) Method and device for controlling intelligent equipment through voice and intelligent equipment
US11574632B2 (en) In-cloud wake-up method and system, terminal and computer-readable storage medium
CN103686359A (en) Startup advertisement playing method and device thereof
RU2582070C1 (en) Method of controlling external input and broadcast receiver
US10802851B2 (en) Display apparatus and controlling method thereof
US20260019678A1 (en) Video processing method, apparatus, device, and storage medium
US20260023583A1 (en) Display apparatus and controlling method thereof
TWI587253B (en) Method and apparatus for providing notice of availability of audio description
CN109725869B (en) Continuous interaction control method and device
CN109275005A (en) Combined key remote control method, device, equipment and storage medium
CN112464075A (en) Application recommendation method and device of intelligent sound box and electronic equipment
US20170346941A1 (en) Electronic device and usage control method
US11582514B2 (en) Source apparatus and control method therefor
CN112203125A (en) Voice broadcasting method and device, video playing device and storage medium
WO2020220649A1 (en) Same window switching method for online list and local list, and computing device
CN110874201A (en) Interaction method, device, storage medium and operating system
US20080300883A1 (en) Projection Apparatus with Speech Indication and Control Method Thereof

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

AS Assignment

Owner name: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:FAN, BINGBING;LIANG, HAO;REEL/FRAME:050828/0959

Effective date: 20181224

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

AS Assignment

Owner name: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.;REEL/FRAME:056811/0772

Effective date: 20210527

Owner name: SHANGHAI XIAODU TECHNOLOGY CO. LTD., CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.;REEL/FRAME:056811/0772

Effective date: 20210527

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STCF Information on status: patent grant

Free format text: PATENTED CASE

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4