US11217256B2 - Voice interaction method, device and terminal - Google Patents
- Publication number
- US11217256B2 (application US16/563,488)
- Authority
- US
- United States
- Prior art keywords
- content
- voice
- interactive mode
- wake
- request
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/22—Interactive procedures; Man-machine interfaces
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/487—Arrangements for providing information services, e.g. recorded voice services or time announcements
- H04M3/493—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
- H04M3/4936—Speech interaction details
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Definitions
- the present application relates to the field of intelligent interaction technology, and in particular, to a voice interaction method, device and terminal.
- a near-field voice interactive mode is usually utilized as a way of interacting, for example, a Bluetooth voice interactor is utilized to interact with a smart TV.
- a user has to provide a wake-up prompt first, then speak out a search requirement.
- a user usually has to provide supplemental information multiple times to find out the content he wants to watch, and the user has to speak out wake-up words repeatedly every time he interacts, which is very inconvenient and leads to low search efficiency.
- a voice interaction method, device and terminal are provided according to embodiments of the present application, so as to at least solve the above technical problems in the existing technology.
- a voice interaction method includes receiving a wake-up prompt, activating an interactive mode according to the wake-up prompt, displaying a dialog prompt identification in the interactive mode, obtaining a vocal request, wherein the vocal request is input in response to the dialog prompt identification, and displaying a requested content according to the vocal request.
- the method further includes displaying the dialog prompt identification again in the interactive mode, obtaining an updated vocal request, and displaying an updated requested content according to the updated vocal request.
- the dialog prompt identification includes a search prompt indicator
- the search prompt indicator includes a general search start prompt word and a preset dialogue timer.
- the dialog prompt identification includes a content guide
- the content guide is used to prompt a user to provide requested content relevant to the content guide.
- prior to activating the interactive mode according to the wake-up prompt, the method further includes determining whether a content of the wake-up prompt is associated with a preset interaction scenario, and activating the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
- the method further includes determining whether a content of the vocal request is associated with a search request in the preset interaction scenario, and exiting the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
- a voice interaction device is provided according to an embodiment of the present application.
- the device includes a wake-up prompt receiving module configured to receive a wake-up prompt, an interactive mode activating module configured to activate an interactive mode according to the wake-up prompt, a prompt identification displaying module configured to display a dialog prompt identification in the interactive mode, a vocal request obtaining module configured to obtain a vocal request, wherein the vocal request is input in response to the dialog prompt identification, and a requested content displaying module configured to display a requested content according to the vocal request.
- the device further includes an interaction scenario determination module configured to determine whether a content of the wake-up prompt is associated with a preset interaction scenario, and to activate the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
- the device further includes a requirement determination module configured to determine whether a content of the vocal request is associated with a search request in the preset interaction scenario, and to exit the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
- a voice interaction terminal is provided according to an embodiment of the present application.
- the functions may be implemented by using hardware or by corresponding software executed by hardware.
- the hardware or software includes one or more modules corresponding to the functions described above.
- the voice interaction terminal structurally includes a processor and a memory, wherein the memory is configured to store programs which support the voice interaction terminal in executing the voice interaction method in the first aspect.
- the processor is configured to execute the programs stored in the memory.
- the voice interaction terminal may further include a communication interface through which the voice interaction terminal communicates with other devices or communication networks.
- a non-transitory computer readable storage medium is provided for storing computer software instructions used by a voice interaction device.
- the computer readable storage medium can include programs involved in executing the voice interaction method described above in the first aspect.
- One of the above technical solutions has the following advantages or beneficial effects:
- a user can continuously provide vocal requests in an interactive mode, without waking up the interactive mode repeatedly, thereby improving user experience.
- FIG. 1 is a flowchart showing a voice interaction method according to an embodiment.
- FIG. 2 is a schematic diagram showing another voice interaction method according to an embodiment.
- FIG. 3 is a block diagram showing a voice interaction device according to an embodiment.
- FIG. 4 is a block diagram showing another voice interaction device according to an embodiment.
- FIG. 5 is a schematic diagram showing a voice interaction terminal according to an embodiment.
- a voice interaction method may include receiving a wake-up prompt at S 10 , activating an interactive mode according to the wake-up prompt at S 20 , displaying a dialog prompt identification in the interactive mode at S 30 , obtaining a vocal request, wherein the vocal request is input in response to the dialog prompt identification at S 40 , and displaying a requested content according to the vocal request at S 50 .
- This embodiment is applicable to smart home appliances, such as smart TVs and smart air conditioners, and the like.
- a wake-up prompt, for example “Xiaodu, Xiaodu, turn on the TV”, is received.
- the content of the wake-up prompt may be parsed. If parsing yields only garbled text, meaning that no clear wake-up prompt content was obtained, the interactive mode cannot be activated.
- a user is prompted to speak the wake-up word again in order to wake up the smart home appliance.
- the way of prompting the user to repeat the wake-up prompt can also be adapted to the hardware of the smart home appliance. Taking a smart TV as an example again, an indicator light that turns blue and flickers while gradually dimming until it goes out signals that the wake-up prompt must be provided again so that the smart TV can be woken up.
- the interactive mode may be activated, and an interface of entering the interactive mode may be displayed on the TV screen.
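The wake-up handling described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `WakeResult` type, the `handle_wake_up` function and the feedback strings are hypothetical names, and the sketch assumes that speech recognition has already produced a text transcript (or nothing usable).

```python
from dataclasses import dataclass
from typing import Optional

WAKE_WORD = "xiaodu, xiaodu"  # example wake-up word taken from the text


@dataclass
class WakeResult:
    activated: bool          # True when the interactive mode may be entered
    command: Optional[str]   # text that followed the wake-up word, if any
    feedback: str            # what the appliance should present to the user


def handle_wake_up(transcript: Optional[str]) -> WakeResult:
    """Decide whether a recognized utterance activates the interactive mode.

    `transcript` is None or unusable text when parsing produced no clear content.
    """
    text = (transcript or "").strip().lower()
    if not text.startswith(WAKE_WORD):
        # No clear wake-up content: stay inactive and ask the user to retry.
        return WakeResult(False, None, "please repeat the wake-up word")
    # Keep whatever followed the wake-up word, e.g. "turn on the tv".
    command = text[len(WAKE_WORD):].strip(" ,.") or None
    return WakeResult(True, command, "show the interactive mode interface")
```

On a real appliance the retry feedback could equally be the flickering indicator light mentioned above; the string here only stands in for that signal.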
- This interface can be adaptively designed as needed.
- a dialog prompt identification is then displayed in the interactive mode. Its function is to remind the user of the dialogue timer, to prompt the user to provide a requested content, to present a search start prompt word, and the like.
- the dialog prompt identification can be implemented in various manners and can be adaptively designed according to requirements. Furthermore, the position of the dialog prompt identification can also be adaptively adjusted, all of which fall into the protection scope of the present implementation.
- the dialog prompt identification is designed as a dynamic circle displayed in the interface, which represents the dialogue timer to the user.
- the dialog prompt identification may also be an animation designed as two cartoon figures interacting with a TV, which indicates that a user is prompted to provide a requested content.
- the dialog prompt identification may further be designed as a trumpet-shaped logo with keywords such as “Xiaodu”, which reminds a user to speak the keywords before starting a search.
- a user can interact with a smart home appliance by providing vocal requests.
- a vocal request provided by a user is “Xiaodu, Xiaodu, I want to watch a movie”.
- voice recognition is performed, and the keyword “movie” may be obtained.
- the smart TV can perform a search by using the keyword “movie” or keywords which are related to “movie”, such as “hot movie”.
- the search results for “hot movie” are displayed in the interface, and a dialog prompt identification may be displayed at the same time. If no further search is required, the interactive mode is exited directly. If a further search is required, further vocal requests can be provided until the requested content is found, after which the interactive mode can be exited automatically.
- the voice interaction method according to the embodiment includes, but is not limited to, a method for interacting with a smart TV; it can also be applied to other smart home appliances such as a smart air conditioner.
- the process of interacting therewith is similar to the method mentioned above.
- the details are not repeated herein, and all such applications fall into the protection scope of this implementation.
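As a rough sketch of how steps S10 to S50 fit together, the function below runs one interactive session over a sequence of already-transcribed utterances. The `run_session` name, the event strings and the injected `search` callable are assumptions made for illustration; the source does not prescribe any particular code structure.

```python
from typing import Callable, Iterable, Iterator, List


def run_session(
    utterances: Iterable[str],
    search: Callable[[str], List[str]],
    wake_word: str = "xiaodu, xiaodu",
) -> List[str]:
    """Run one interactive session and return the events shown to the user."""
    stream: Iterator[str] = iter(utterances)
    events: List[str] = []

    # S10: receive a wake-up prompt.
    wake_up = next(stream, "")
    # S20: activate the interactive mode only for a recognizable wake-up prompt.
    if not wake_up.strip().lower().startswith(wake_word):
        events.append("re-prompt: say the wake-up word again")
        return events
    events.append("interactive mode activated")

    while True:
        # S30: display the dialog prompt identification.
        events.append("dialog prompt displayed")
        # S40: obtain a vocal request given in response to the prompt.
        request = next(stream, None)
        if request is None:
            # No further request: leave the interactive mode.
            events.append("interactive mode exited")
            return events
        # S50: display the requested content.
        events.append("results: " + ", ".join(search(request)))
```

Because the loop returns to S30 after every displayed result, the user is woken up once and can then issue any number of requests, which is the behavior the embodiment aims for.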
- the method may further include displaying the dialog prompt identification again in the interactive mode, obtaining an updated vocal request, and displaying an updated requested content according to the updated vocal request.
- a user can provide a further vocal request after seeing a dialog prompt identification.
- the vocal request provided by a user can be “Hong Kong movie”.
- voice recognition is performed, and the keyword “Hong Kong” may be obtained.
- the smart TV can perform a search by using the keyword “Hong Kong” or keywords which are related to “Hong Kong”.
- the search results for “Hong Kong, hot movie” are then displayed in the interface of the smart TV, and a dialog prompt identification is displayed again.
- a vocal request of “gangster movie” is further provided by the user.
- voice recognition is performed, and the keyword “gangster” may be obtained.
- the smart TV can perform a search by using the keyword “gangster” or keywords which are related to “gangster”, such as “police”, “gang” or “bandit”.
- the search results for “Hong Kong, hot, gangster movie” are then displayed in the interface of the smart TV, and a dialog prompt identification is displayed again.
- a vocal request of “performed by Liu XX” is further provided by the user.
- voice recognition is performed, and the keyword “Liu XX” may be obtained.
- the smart TV can perform a search by using the keyword “Liu XX”.
- the search results for “Hong Kong, hot, gangster movie, Liu XX” are then displayed in the interface of the smart TV, and a dialog prompt identification is displayed again.
- a vocal request of “next page” is provided by the user.
- voice recognition is performed.
- the next page of the search results is then displayed in the interface of the smart TV, and a dialog prompt identification is displayed again at the same time.
- a further vocal request of “play the first one” is provided by the user, the corresponding program is played, and the interactive mode is exited automatically.
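The multi-turn narrowing in this walkthrough can be illustrated with a small sketch in which each new keyword is added to those from earlier turns. The catalog contents, the tag-based matching and the `RefiningSearch` class are hypothetical; the source does not say how the smart TV actually combines keywords.

```python
from typing import Dict, List, Set

# Hypothetical catalog: title -> descriptive tags (placeholder data only).
CATALOG: Dict[str, Set[str]] = {
    "Title A": {"movie", "hot", "hong kong", "gangster", "liu xx"},
    "Title B": {"movie", "hot", "hong kong", "gangster"},
    "Title C": {"movie", "hot", "hong kong"},
    "Title D": {"movie", "hot"},
    "Title E": {"movie"},
}


class RefiningSearch:
    """Keeps the keywords from earlier turns and narrows the results each turn."""

    def __init__(self, catalog: Dict[str, Set[str]]) -> None:
        self.catalog = catalog
        self.keywords: List[str] = []

    def refine(self, keyword: str) -> List[str]:
        # Remember the new keyword, then keep only titles matching every keyword so far.
        self.keywords.append(keyword.lower())
        return sorted(
            title
            for title, tags in self.catalog.items()
            if all(k in tags for k in self.keywords)
        )
```

Calling `refine("hot")`, then `refine("Hong Kong")`, `refine("gangster")` and `refine("Liu XX")` on one `RefiningSearch` instance mirrors the successive requests above, with the result list shrinking at each turn.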
- the dialog prompt identification may include a search prompt indicator
- the search prompt indicator may include a general search start prompt word and a preset dialogue timer.
- the general search start prompt word includes a search start prompt word, such as “Xiaodu, Xiaodu”.
- the search start prompt word can be displayed all the time during the entire searching process or can be displayed only at the beginning of the searching process.
- the search prompt indicator may also include a preset dialogue timer, such as a time progress bar. A vocal request should be provided by the user before the preset dialogue timer runs out. If no vocal request is provided within that time, the interactive mode is exited. Alternatively, if the search prompt indicator disappears, the current interactive mode is exited automatically, thereby avoiding mis-operation.
- the duration of the dialogue timer may be set to 1 minute or several minutes in advance, for example. An adaptive adjustment of the duration of the preset dialogue timer may be made according to different product types, all of which fall into the protection scope of this implementation.
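A preset dialogue timer of this kind might be modeled as below. The `DialogueTimer` class and its method names are assumptions for illustration; the 60-second default follows the "1 minute" example in the text, and the clock is injectable so that the behavior can be checked without waiting.

```python
import time
from typing import Callable


class DialogueTimer:
    """Counts down the time in which the user may give the next vocal request."""

    def __init__(
        self,
        duration_s: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._duration = duration_s
        self._clock = clock
        self._started = clock()

    def restart(self) -> None:
        # Called each time the dialog prompt identification is displayed again.
        self._started = self._clock()

    def remaining(self) -> float:
        # Seconds left, never negative; could drive a time progress bar.
        return max(0.0, self._duration - (self._clock() - self._started))

    def expired(self) -> bool:
        # When this becomes True, the interactive mode would be exited.
        return self.remaining() == 0.0
```

Using a monotonic clock by default avoids surprises if the wall-clock time of the appliance is adjusted during a session.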
- the dialog prompt identification may include a content guide, and the content guide is used to prompt a user to provide a requested content relevant to the content guide.
- the content guide is used to prompt a user to provide a relevant requested content after obtaining a vocal request each time during an interaction process.
- taking a smart TV as an example, when a number of movies matching “Hong Kong, hot, gangster, Liu XX” are found and displayed in the interface of the smart TV, the top-ranked hot movie “Infernal Affairs” may be displayed in the content guide, which prompts the user to directly provide a vocal request of “Xiaodu, Xiaodu, I want to see Infernal Affairs.”
- the user may also provide a vocal request of “Xiaodu, Xiaodu, I want to watch the third film, Chill” according to the content displayed on the current page.
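One way to derive a content guide from the current results is to suggest an utterance built from the top-ranked item. The sketch below assumes that each result carries a numeric ranking score; that score, the `content_guide` function and the exact wording of the suggestion are hypothetical and not taken from the source.

```python
from typing import List, Optional, Tuple


def content_guide(
    results: List[Tuple[str, float]],
    wake_word: str = "Xiaodu, Xiaodu",
) -> Optional[str]:
    """Return a suggested utterance built from the highest-scoring result.

    `results` holds (title, score) pairs; returns None when there is nothing to suggest.
    """
    if not results:
        return None
    # Pick the title with the highest score as the guide content.
    top_title, _ = max(results, key=lambda item: item[1])
    return f'Try saying: "{wake_word}, I want to see {top_title}"'
```

The same function could be called after every turn, so that the guide always reflects the page of results currently shown.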
- the method may further include determining whether a content of the wake-up prompt is associated with a preset interaction scenario, and activating the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
- the wake-up words for different kinds of smart home appliances need to be associated with preset interaction scenarios.
- the wake-up word for a smart TV may be “turn on the TV”
- the wake-up word for a smart air conditioner may be “turn on the air conditioner”.
- with a smart TV, the wake-up word “turn on the air conditioner” is not associated with the preset interaction scenario of the smart TV, so the interactive mode is not activated. Therefore, to avoid startup errors, it is necessary to determine whether the content of the wake-up prompt is associated with a preset interaction scenario.
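The scenario check can be sketched as a lookup of wake-up phrases per appliance. The `SCENARIOS` table uses the two example phrases from the text, while the appliance keys, the `should_activate` function and the substring matching rule are assumptions made for illustration.

```python
from typing import Dict, Set

# Wake-up phrases per appliance, using the two examples given in the text.
SCENARIOS: Dict[str, Set[str]] = {
    "smart_tv": {"turn on the tv"},
    "smart_air_conditioner": {"turn on the air conditioner"},
}


def should_activate(appliance: str, wake_up_content: str) -> bool:
    """Return True when the wake-up content matches the appliance's preset scenario."""
    phrases = SCENARIOS.get(appliance, set())
    text = wake_up_content.lower()
    # A simple substring match; a real system would likely use richer intent matching.
    return any(phrase in text for phrase in phrases)
```

For instance, `should_activate("smart_tv", "turn on the air conditioner")` is False, so a smart TV would stay inactive for a prompt meant for the air conditioner.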
- the method may further include determining whether a content of the vocal request is associated with a search request in the preset interaction scenario, and exiting the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
- relevant search requirements associated with preset interaction scenarios may be stored.
- the stored relevant search requirements may be “Xiaodu, Xiaodu, I want to watch a movie”, “Hong Kong movie”, “I want to watch news broadcast” and the like.
- the stored relevant search requirements may be “Xiaodu, Xiaodu, hot air” and the like.
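A minimal sketch of the exit decision follows, assuming the stored search requirements are reduced to a few keywords per scenario. The `STORED_KEYWORDS` table, the `Mode` enum and the `next_mode` function are hypothetical; only the example requests come from the text.

```python
from enum import Enum
from typing import Dict, Set


class Mode(Enum):
    INTERACTIVE = "interactive"
    EXITED = "exited"


# Keywords distilled from the example stored requirements in the text.
STORED_KEYWORDS: Dict[str, Set[str]] = {
    "smart_tv": {"movie", "news"},
    "smart_air_conditioner": {"hot air"},
}


def next_mode(appliance: str, vocal_request: str) -> Mode:
    """Stay in the interactive mode only if the request fits the preset scenario."""
    keywords = STORED_KEYWORDS.get(appliance, set())
    text = vocal_request.lower()
    if any(keyword in text for keyword in keywords):
        return Mode.INTERACTIVE
    # Unrelated request: exit the interactive mode.
    return Mode.EXITED
```

With these tables, a request for "hot air" keeps a smart air conditioner in the interactive mode but causes a smart TV to exit it.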
- a voice interaction device may include a wake-up prompt receiving module 10 configured to receive a wake-up prompt, an interactive mode activating module 20 configured to activate an interactive mode according to the wake-up prompt, a prompt identification displaying module 30 configured to display a dialog prompt identification in the interactive mode, a vocal request obtaining module 40 configured to obtain a vocal request, wherein the vocal request is input in response to the dialog prompt identification, and a requested content displaying module 50 configured to display a requested content according to the vocal request.
- the device further includes an interaction scenario determination module 21 configured to determine whether a content of the wake-up prompt is associated with a preset interaction scenario, and to activate the interactive mode according to the wake-up prompt, in a case that the content of the wake-up prompt is associated with the preset interaction scenario.
- the device further includes a requirement determination module 60 configured to determine whether a content of the vocal request is associated with a search request in the preset interaction scenario, and to exit the interactive mode, in a case that the content of the vocal request is not associated with the search request in the preset interaction scenario.
- in a variety of interaction scenarios, such as interacting with a smart home appliance, a user can continuously provide vocal requests in an interactive mode without waking up the interactive mode repeatedly, thereby improving user experience.
- a voice interaction terminal is provided according to an embodiment of the present application.
- the voice interaction terminal includes a memory 400 , a processor 500 , wherein a computer program that can run on the processor 500 is stored in the memory 400 .
- the processor 500 executes the computer program to implement the voice interaction method according to the foregoing embodiments.
- the number of either the memory 400 or the processor 500 may be one or more.
- the terminal may further include a communication interface 600 configured to enable the memory 400 and the processor 500 to communicate with an external device.
- the memory 400 may include a high-speed RAM memory and may also include a non-volatile memory, such as at least one magnetic disk memory.
- the bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like.
- the bus may be categorized into an address bus, a data bus, a control bus, and the like. For ease of illustration, only one bold line is shown in FIG. 5 to represent the bus, but it does not mean that there is only one bus or one type of bus.
- the memory 400 , the processor 500 , and the communication interface 600 may implement mutual communication through an internal interface.
- a computer-readable storage medium is provided, having computer programs stored thereon. When executed by a processor, the programs implement the voice interaction method described in Embodiment I.
- the description of the terms “one embodiment,” “some embodiments,” “an example,” “a specific example,” or “some examples” and the like means the specific features, structures, materials, or characteristics described in connection with the embodiment or example are included in at least one embodiment or example of the present application. Furthermore, the specific features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more of the embodiments or examples. In addition, different embodiments or examples described in this specification and features of different embodiments or examples may be incorporated and combined by those skilled in the art without mutual contradiction.
- the terms “first” and “second” are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of indicated technical features. Thus, features defining “first” and “second” may explicitly or implicitly include at least one of the features. In the description of the present application, “a plurality of” means two or more, unless expressly limited otherwise.
- Logic and/or steps, which are represented in the flowcharts or otherwise described herein, may be thought of, for example, as a sequenced listing of executable instructions for implementing logic functions, which may be embodied in any computer-readable medium, for use by or in connection with an instruction execution system or device (such as a computer-based system, a processor-included system, or another system that fetches instructions from an instruction execution system or device and executes the instructions).
- a “computer-readable medium” may be any device that may contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system or device.
- examples of the computer-readable media include the following: electrical connections (electronic devices) having one or more wires, a portable computer disk cartridge (magnetic device), random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber devices, and portable compact disc read only memory (CD-ROM).
- the computer-readable medium may even be paper or another suitable medium upon which the program may be printed, as it may be read, for example, by optical scanning of the paper or other medium, followed by editing, interpretation or, where appropriate, other processing to electronically obtain the program, which is then stored in a computer memory.
- each of the functional units in the embodiments of the present application may be integrated in one processing module, or each of the units may exist alone physically, or two or more units may be integrated in one module.
- the above-mentioned integrated module may be implemented in the form of hardware or in the form of software functional module.
- when the integrated module is implemented in the form of a software functional module and is sold or used as an independent product, the integrated module may also be stored in a computer-readable storage medium.
- the storage medium may be a read only memory, a magnetic disk, an optical disk, or the like.
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Signal Processing (AREA)
- User Interface Of Digital Computer (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811519317.0A CN109410944B (en) | 2018-12-12 | 2018-12-12 | Voice interaction method, device and terminal |
| CN201811519317.0 | 2018-12-12 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20200194007A1 (en) | 2020-06-18 |
| US11217256B2 (en) | 2022-01-04 |
Family
ID=65458732
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/563,488, active, expires 2040-02-14, US11217256B2 (en) | 2018-12-12 | 2019-09-06 | Voice interaction method, device and terminal |
Country Status (2)
| Country | Link |
|---|---|
| US (1) | US11217256B2 (en) |
| CN (1) | CN109410944B (en) |
Families Citing this family (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11676582B2 (en) * | 2019-02-27 | 2023-06-13 | Google Llc | Detecting conversations with computing devices |
| CN112334979B (en) * | 2019-02-27 | 2024-07-12 | 谷歌有限责任公司 | Detecting ongoing conversations by computing device |
| CN110619873A (en) * | 2019-08-16 | 2019-12-27 | 北京小米移动软件有限公司 | Audio processing method, device and storage medium |
| CN110751948A (en) * | 2019-10-18 | 2020-02-04 | 珠海格力电器股份有限公司 | Voice recognition method, device, storage medium and voice equipment |
| CN110689891A (en) * | 2019-11-20 | 2020-01-14 | 广东奥园奥买家电子商务有限公司 | Voice interaction method and device based on public display device |
| CN111192581A (en) * | 2020-01-07 | 2020-05-22 | 百度在线网络技术(北京)有限公司 | Voice wake-up method, device and storage medium |
| CN113641408B (en) * | 2020-04-23 | 2024-12-13 | 百度在线网络技术(北京)有限公司 | Method and device for generating quick entry |
| US12547372B2 (en) | 2021-03-15 | 2026-02-10 | VIDAA USA, Inc. | Display apparatus and display method |
| CN112860331B (en) * | 2021-03-19 | 2023-11-10 | Vidaa美国公司 | A display device and voice interaction prompt method |
| CN115775560B (en) * | 2021-03-16 | 2025-05-27 | 海信视像科技股份有限公司 | A wake-up response prompting method and display device |
| CN113241069B (en) * | 2021-04-15 | 2023-12-12 | 王维坤 | A method to improve the success rate of voice interaction |
| CN113297359B (en) * | 2021-04-23 | 2023-11-28 | 阿里巴巴新加坡控股有限公司 | Methods and devices for interactive information |
| CN113301394B (en) * | 2021-04-30 | 2023-07-11 | 当趣网络科技(杭州)有限公司 | Voice control method combined with user grade |
| CN113990310A (en) * | 2021-10-15 | 2022-01-28 | 深圳集智数字科技有限公司 | Method and device for controlling screen content through voice and electronic equipment |
| CN115424623B (en) * | 2022-03-23 | 2025-02-21 | 北京罗克维尔斯科技有限公司 | Voice interaction method, device, equipment and computer readable storage medium |
| CN115240674B (en) * | 2022-07-21 | 2025-06-06 | 海信视像科技股份有限公司 | Wake-up-free voice control method for terminal device, terminal device and server |
| CN118136016B (en) * | 2024-04-10 | 2025-02-18 | 广州小鹏汽车科技有限公司 | Voice interaction method, server and computer-readable storage medium |
Citations (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030078784A1 (en) * | 2001-10-03 | 2003-04-24 | Adam Jordan | Global speech user interface |
| US20110022393A1 (en) * | 2007-11-12 | 2011-01-27 | Waeller Christoph | Multimode user interface of a driver assistance system for inputting and presentation of information |
| US20130006643A1 (en) * | 2010-01-13 | 2013-01-03 | Aram Lindahl | Devices and Methods for Identifying a Prompt Corresponding to a Voice Input in a Sequence of Prompts |
| US20140244269A1 (en) * | 2013-02-28 | 2014-08-28 | Sony Mobile Communications Ab | Device and method for activating with voice input |
| US20140278435A1 (en) * | 2013-03-12 | 2014-09-18 | Nuance Communications, Inc. | Methods and apparatus for detecting a voice command |
| CN104575504A (en) | 2014-12-24 | 2015-04-29 | 上海师范大学 | Method for personalized television voice wake-up by voiceprint and voice identification |
| US20150312351A1 (en) * | 2014-04-24 | 2015-10-29 | Alcatel Lucent | Method, device and system for device trigger in iot |
| US20160155443A1 (en) * | 2014-11-28 | 2016-06-02 | Microsoft Technology Licensing, Llc | Device arbitration for listening devices |
| CN106230689A (en) | 2016-07-25 | 2016-12-14 | 北京奇虎科技有限公司 | Method, device and the server that a kind of voice messaging is mutual |
| CN107680589A (en) | 2017-09-05 | 2018-02-09 | 百度在线网络技术(北京)有限公司 | Voice messaging exchange method, device and its equipment |
| CN107885810A (en) | 2017-01-24 | 2018-04-06 | 问众智能信息科技(北京)有限公司 | The method and apparatus that result for vehicle intelligent equipment interactive voice is shown |
| CN108132805A (en) | 2017-12-20 | 2018-06-08 | 深圳Tcl新技术有限公司 | Voice interactive method, device and computer readable storage medium |
| CN108170785A (en) | 2017-12-26 | 2018-06-15 | 深圳Tcl新技术有限公司 | Bootstrap technique, device and the computer readable storage medium of terminal searching operation |
| CN108259981A (en) | 2018-04-11 | 2018-07-06 | 深圳市茁壮网络股份有限公司 | A kind of television channel change control method, mobile terminal and set-top box |
| CN108366281A (en) | 2018-02-05 | 2018-08-03 | 山东浪潮商用系统有限公司 | A kind of full voice exchange method applied to set-top box |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070267253A1 (en) * | 2006-05-04 | 2007-11-22 | Ryan Tauer | Method and apparatus for recording books |
| CN102938864A (en) * | 2012-11-27 | 2013-02-20 | 四川长虹电器股份有限公司 | Method for realizing television channel switching based on customized voice |
| CN104290097B (en) * | 2014-08-19 | 2016-03-30 | 白劲实 | Learning-type intelligent family social robot system and method |
- 2018-12-12: CN application CN201811519317.0A, patent CN109410944B (active)
- 2019-09-06: US application US16/563,488, patent US11217256B2 (active)
Non-Patent Citations (2)
| Title |
|---|
| Office Action for Chinese Application No. 201811519317.0 dated Dec. 30, 2019 (13 pages). |
| Search Report for Chinese Application No. 201811519317.0, dated Dec. 20, 2019 (6 pages). |
Also Published As
| Publication number | Publication date |
|---|---|
| CN109410944A (en) | 2019-03-01 |
| CN109410944B (en) | 2020-06-09 |
| US20200194007A1 (en) | 2020-06-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11217256B2 (en) | | Voice interaction method, device and terminal |
| US11086596B2 (en) | | Electronic device, server and control method thereof |
| US10783364B2 (en) | | Method, apparatus and device for waking up voice interaction function based on gesture, and computer readable medium |
| US20190149872A1 (en) | | Information exchanging method and device, audio terminal and computer-readable storage medium |
| CN104170397B (en) | | A method and computer storage medium for presenting search results on an electronic device |
| US11205431B2 (en) | | Method, apparatus and device for presenting state of voice interaction device, and storage medium |
| US20130073293A1 (en) | | Electronic device and method for controlling the same |
| CN109754788B (en) | | Voice control method, device, equipment and storage medium |
| CN107277225B (en) | | Method and device for controlling intelligent equipment through voice and intelligent equipment |
| US11574632B2 (en) | | In-cloud wake-up method and system, terminal and computer-readable storage medium |
| CN103686359A (en) | | Startup advertisement playing method and device thereof |
| RU2582070C1 (en) | | Method of controlling external input and broadcast receiver |
| US10802851B2 (en) | | Display apparatus and controlling method thereof |
| US20260019678A1 (en) | | Video processing method, apparatus, device, and storage medium |
| US20260023583A1 (en) | | Display apparatus and controlling method thereof |
| TWI587253B (en) | | Method and apparatus for providing notice of availability of audio description |
| CN109725869B (en) | | Continuous interaction control method and device |
| CN109275005A (en) | | Combined key remote control method, device, equipment and storage medium |
| CN112464075A (en) | | Application recommendation method and device of intelligent sound box and electronic equipment |
| US20170346941A1 (en) | | Electronic device and usage control method |
| US11582514B2 (en) | | Source apparatus and control method therefor |
| CN112203125A (en) | | Voice broadcasting method and device, video playing device and storage medium |
| WO2020220649A1 (en) | | Same window switching method for online list and local list, and computing device |
| CN110874201A (en) | | Interaction method, device, storage medium and operating system |
| US20080300883A1 (en) | | Projection Apparatus with Speech Indication and Control Method Thereof |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | FEPP | Fee payment procedure | Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
| | AS | Assignment | Owner name: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., CHINA; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:FAN, BINGBING;LIANG, HAO;REEL/FRAME:050828/0959; Effective date: 20181224 |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: NON FINAL ACTION MAILED |
| | AS | Assignment | Owner name: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., CHINA; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.;REEL/FRAME:056811/0772; Effective date: 20210527. Owner name: SHANGHAI XIAODU TECHNOLOGY CO. LTD., CHINA; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.;REEL/FRAME:056811/0772; Effective date: 20210527 |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: FINAL REJECTION MAILED |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
| | STCF | Information on status: patent grant | Free format text: PATENTED CASE |
| | MAFP | Maintenance fee payment | Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY; Year of fee payment: 4 |