JP6873188B2 - 開発者音声アクションシステム - Google Patents
開発者音声アクションシステム Download PDFInfo
- Publication number
- JP6873188B2 JP6873188B2 JP2019101151A JP2019101151A JP6873188B2 JP 6873188 B2 JP6873188 B2 JP 6873188B2 JP 2019101151 A JP2019101151 A JP 2019101151A JP 2019101151 A JP2019101151 A JP 2019101151A JP 6873188 B2 JP6873188 B2 JP 6873188B2
- Authority
- JP
- Japan
- Prior art keywords
- application
- intent
- user
- grammar
- utterance
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000009471 action Effects 0.000 title claims description 97
- 238000013518 transcription Methods 0.000 claims description 35
- 230000035897 transcription Effects 0.000 claims description 35
- 238000000034 method Methods 0.000 claims description 24
- 238000012545 processing Methods 0.000 claims description 10
- 230000004044 response Effects 0.000 claims description 10
- 238000012790 confirmation Methods 0.000 claims 1
- 230000000694 effects Effects 0.000 description 62
- 238000005352 clarification Methods 0.000 description 41
- 238000010200 validation analysis Methods 0.000 description 31
- 230000001960 triggered effect Effects 0.000 description 15
- 238000004590 computer program Methods 0.000 description 11
- 230000008569 process Effects 0.000 description 11
- 238000004891 communication Methods 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 230000018109 developmental process Effects 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- 230000003993 interaction Effects 0.000 description 2
- 230000001902 propagating effect Effects 0.000 description 2
- 238000013515 script Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000000844 transformation Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 241001672694 Citrus reticulata Species 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/22—Interactive procedures; Man-machine interfaces
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0638—Interactive procedures
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Machine Translation (AREA)
- Telephonic Communication Services (AREA)
- User Interface Of Digital Computer (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Description
101 アプリケーション開発者
102 コンピューティングデバイス
105 ネットワーク
110 取得エンジン
120 検証エンジン
125 検証基準
130 文法誘導エンジン
135 誘導規則
140 文脈インテントデータベース
200 音声アクションサービスシステム
201 ユーザ
202 クライアントデバイス
204 オーディオデータ
205 ネットワーク
210 音声認識エンジン
220 マッチャ
230 明確化エンジン
235 ユーザパーソナル化データベース
240 文脈インテントデータベース
245 OSインテントデータベース
250 実行エンジン
Claims (14)
- 音声アクションサービスシステムによって、ユーザのコンピューティングデバイスにおいて提供される発話を受信するステップであって、前記発話は音声コマンドトリガフレーズを含む、ステップと、
前記音声アクションサービスシステムによって、前記音声コマンドトリガフレーズに関連するインテントを判定するために前記発話を処理するステップと、
前記音声アクションサービスシステムによって、各々が前記インテントを満足することができる少なくとも第1のアプリケーションおよび第2のアプリケーションを識別するステップであって、少なくとも前記第1のアプリケーションおよび前記第2のアプリケーションを識別するステップは、少なくとも前記第1のアプリケーションおよび前記第2のアプリケーションが1つまたは複数のデータベース中の前記インテントに関連していると判定するステップに基づく、ステップと、
前記音声アクションサービスシステムによって、前記第2のアプリケーションよりも前記第1のアプリケーションを選択するステップであって、前記第2のアプリケーションよりも前記第1のアプリケーションを選択するステップは、前記ユーザおよび他のユーザによる前記第1のアプリケーションの最近の使用状況に少なくとも部分的に基づく、ステップと、
前記音声アクションサービスシステムによって、前記ユーザの前記コンピューティングデバイスに、前記発話に応じて、前記選択した第1のアプリケーションの指示を提供するステップと
を含む、コンピュータ実施方法。 - 前記ユーザによる前記第1のアプリケーションの最近の使用状況に少なくとも部分的に基づいて、前記第2のアプリケーションよりも前記第1のアプリケーションを選択するステップは、前記第1のアプリケーションが、前記ユーザによって、前記音声コマンドトリガフレーズに応じて、最近選択されていたと判定するステップを含む、請求項1に記載のコンピュータ実施方法。
- 前記第2のアプリケーションよりも前記第1のアプリケーションを選択するステップはさらに、前記第1のアプリケーションと前記音声コマンドトリガフレーズまたは前記インテントのうちの少なくとも1つとの間の関係スコアの強度に少なくとも部分的に基づく、請求項1に記載のコンピュータ実施方法。
- 前記第2のアプリケーションよりも前記第1のアプリケーションを選択するステップはさらに、前記発話を受信した際に前記ユーザの前記コンピューティングデバイス上で前記第1のアプリケーションが実行されていることに少なくとも部分的に基づく、請求項1に記載のコンピュータ実施方法。
- 前記選択した第1のアプリケーションの前記指示を提供するステップは、前記選択した第1のアプリケーションの聴覚的指示を提供するステップを含む、請求項1に記載のコンピュータ実施方法。
- 前記音声アクションサービスシステムによって、前記ユーザの前記コンピューティングデバイスにおいて追加の発話を受信するステップであって、前記追加の発話は前記選択した第1のアプリケーションの確認を含む、ステップと、
前記追加の発話を受信するステップに応じて、前記インテントを満足するために前記第1のアプリケーションを実行するステップと
をさらに含む、請求項1に記載のコンピュータ実施方法。 - 前記インテントを判定するために前記発話を処理するステップは、
前記音声アクションサービスシステムによって、前記発話のトランスクリプションを取得するために前記発話に対して音声認識を行うステップと、
前記音声アクションサービスシステムによって、前記トランスクリプションの少なくとも一部が前記音声コマンドトリガフレーズを含むとともに前記音声コマンドトリガフレーズが前記インテントと一致していると判定するステップと
を含む、請求項1に記載のコンピュータ実施方法。 - 少なくとも1つのプロセッサと、
命令を含む少なくとも1つのメモリとを含み、前記命令は、実行されると、前記少なくとも1つのプロセッサに、
ユーザのコンピューティングデバイスにおいて提供される発話を受信することであって、前記発話は音声コマンドトリガフレーズを含む、ことと、
前記発話が前記音声コマンドトリガフレーズを含んでいると判定するために前記発話を処理することと、
少なくとも第1のアプリケーションおよび第2のアプリケーションを識別することであって、少なくとも前記第1のアプリケーションおよび前記第2のアプリケーションを識別することは、少なくとも前記第1のアプリケーションおよび前記第2のアプリケーションが1つまたは複数のデータベース中の前記音声コマンドトリガフレーズにマッピングされていると判定することに基づく、ことと、
前記第2のアプリケーションよりも前記第1のアプリケーションを選択することであって、前記第2のアプリケーションよりも前記第1のアプリケーションを選択することは、前記ユーザおよび他のユーザによる前記第1のアプリケーションの最近の使用状況に少なくとも部分的に基づく、ことと、
前記ユーザの前記コンピューティングデバイスに、前記発話に応じて、前記選択した第1のアプリケーションの指示を提供することと
をさせる、システム。 - 前記ユーザによる前記第1のアプリケーションの最近の使用状況に少なくとも部分的に基づいて、前記第2のアプリケーションよりも前記第1のアプリケーションを選択するための前記命令は、前記第1のアプリケーションが、前記ユーザによって、前記音声コマンドトリガフレーズに応じて、最近選択されていたと判定するための命令を含む、請求項8に記載のシステム。
- 前記第2のアプリケーションよりも前記第1のアプリケーションを選択するための前記命令は、前記第1のアプリケーションと前記音声コマンドトリガフレーズとの間の関係スコアの強度に少なくとも部分的に基づいて、前記第2のアプリケーションよりも前記第1のアプリケーションを選択するための命令をさらに含む、請求項8に記載のシステム。
- 前記第2のアプリケーションよりも前記第1のアプリケーションを選択するための前記命令は、前記発話を受信した際に前記ユーザの前記コンピューティングデバイス上で前記第1のアプリケーションが実行されていることに少なくとも部分的に基づいて、前記第2のアプリケーションよりも前記第1のアプリケーションを選択するための命令をさらに含む、請求項8に記載のシステム。
- 前記選択した第1のアプリケーションの前記指示を提供するための前記命令は、前記選択した第1のアプリケーションの聴覚的指示を提供するための命令をさらに含む、請求項8に記載のシステム。
- 前記ユーザの前記コンピューティングデバイスにおいて追加の発話を受信することであって、前記追加の発話は前記選択した第1のアプリケーションの確認を含む、ことと、
前記追加の発話を受信することに応じて、前記第1のアプリケーションを実行することと
をするための命令をさらに含む、請求項8に記載のシステム。 - 命令を含むコンピュータ可読記憶媒体であって、前記命令は、実行されると、少なくとも1つのプロセッサに、
ユーザのコンピューティングデバイスにおいて提供される発話を受信することであって、前記発話は音声コマンドトリガフレーズを含む、ことと、
前記音声コマンドトリガフレーズに関連するインテントを判定するために前記発話を処理することと、
各々が前記インテントを満足することができる少なくとも第1のアプリケーションおよび第2のアプリケーションを識別することであって、少なくとも前記第1のアプリケーションおよび前記第2のアプリケーションを識別することは、少なくとも前記第1のアプリケーションおよび前記第2のアプリケーションが1つまたは複数のデータベース中の前記インテントに関連していると判定することに基づく、ことと、
前記第2のアプリケーションよりも前記第1のアプリケーションを選択することであって、前記第2のアプリケーションよりも前記第1のアプリケーションを選択することは、前記ユーザおよび他のユーザによる前記第1のアプリケーションの最近の使用状況に少なくとも部分的に基づく、ことと、
前記ユーザの前記コンピューティングデバイスに、前記発話に応じて、前記選択した第1のアプリケーションの指示を提供することと
をさせる、コンピュータ可読記憶媒体。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/693,330 US9472196B1 (en) | 2015-04-22 | 2015-04-22 | Developer voice actions system |
US14/693,330 | 2015-04-22 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2017550871A Division JP6538188B2 (ja) | 2015-04-22 | 2016-04-12 | 開発者音声アクションシステム |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2019144598A JP2019144598A (ja) | 2019-08-29 |
JP6873188B2 true JP6873188B2 (ja) | 2021-05-19 |
Family
ID=55953380
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2017550871A Active JP6538188B2 (ja) | 2015-04-22 | 2016-04-12 | 開発者音声アクションシステム |
JP2019101151A Active JP6873188B2 (ja) | 2015-04-22 | 2019-05-30 | 開発者音声アクションシステム |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2017550871A Active JP6538188B2 (ja) | 2015-04-22 | 2016-04-12 | 開発者音声アクションシステム |
Country Status (8)
Country | Link |
---|---|
US (4) | US9472196B1 (ja) |
EP (1) | EP3286633B1 (ja) |
JP (2) | JP6538188B2 (ja) |
KR (2) | KR102173100B1 (ja) |
CN (2) | CN107408385B (ja) |
DE (1) | DE112016001852T5 (ja) |
GB (1) | GB2553234B (ja) |
WO (1) | WO2016171956A1 (ja) |
Families Citing this family (158)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US20120309363A1 (en) | 2011-06-03 | 2012-12-06 | Apple Inc. | Triggering notifications associated with tasks items that represent tasks to perform |
US10417037B2 (en) | 2012-05-15 | 2019-09-17 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
EP3809407A1 (en) | 2013-02-07 | 2021-04-21 | Apple Inc. | Voice trigger for a digital assistant |
US10652394B2 (en) | 2013-03-14 | 2020-05-12 | Apple Inc. | System and method for processing voicemail |
US10748529B1 (en) | 2013-03-15 | 2020-08-18 | Apple Inc. | Voice activated device for use with a voice-based digital assistant |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
US9966065B2 (en) | 2014-05-30 | 2018-05-08 | Apple Inc. | Multi-command single utterance input method |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
JP6334815B2 (ja) * | 2015-03-20 | 2018-05-30 | 株式会社東芝 | 学習装置、方法、プログラムおよび音声対話システム |
US9472196B1 (en) | 2015-04-22 | 2016-10-18 | Google Inc. | Developer voice actions system |
US10200824B2 (en) | 2015-05-27 | 2019-02-05 | Apple Inc. | Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device |
US20160378747A1 (en) | 2015-06-29 | 2016-12-29 | Apple Inc. | Virtual assistant for media playback |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10740384B2 (en) | 2015-09-08 | 2020-08-11 | Apple Inc. | Intelligent automated assistant for media search and playback |
US10331312B2 (en) | 2015-09-08 | 2019-06-25 | Apple Inc. | Intelligent automated assistant in a media environment |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10956666B2 (en) | 2015-11-09 | 2021-03-23 | Apple Inc. | Unconventional virtual assistant interactions |
GB2544543B (en) * | 2015-11-20 | 2020-10-07 | Zuma Array Ltd | Lighting and sound system |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
JP6620934B2 (ja) * | 2016-01-29 | 2019-12-18 | パナソニックIpマネジメント株式会社 | 翻訳支援方法、翻訳支援装置、翻訳装置及び翻訳支援プログラム |
US9947316B2 (en) | 2016-02-22 | 2018-04-17 | Sonos, Inc. | Voice control of a media playback system |
US9772817B2 (en) | 2016-02-22 | 2017-09-26 | Sonos, Inc. | Room-corrected voice detection |
US9965247B2 (en) | 2016-02-22 | 2018-05-08 | Sonos, Inc. | Voice controlled media playback system based on user profile |
US10264030B2 (en) | 2016-02-22 | 2019-04-16 | Sonos, Inc. | Networked microphone device control |
US10095470B2 (en) | 2016-02-22 | 2018-10-09 | Sonos, Inc. | Audio response playback |
US10509626B2 (en) | 2016-02-22 | 2019-12-17 | Sonos, Inc | Handling of loss of pairing between networked devices |
US9922648B2 (en) * | 2016-03-01 | 2018-03-20 | Google Llc | Developer voice actions system |
US10049670B2 (en) * | 2016-06-06 | 2018-08-14 | Google Llc | Providing voice action discoverability example for trigger term |
US9978390B2 (en) | 2016-06-09 | 2018-05-22 | Sonos, Inc. | Dynamic player selection for audio signal processing |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
AU2017100670C4 (en) | 2016-06-12 | 2019-11-21 | Apple Inc. | User interfaces for retrieving contextually relevant media content |
US10152969B2 (en) | 2016-07-15 | 2018-12-11 | Sonos, Inc. | Voice detection by multiple devices |
US10134399B2 (en) | 2016-07-15 | 2018-11-20 | Sonos, Inc. | Contextualization of voice inputs |
US10403275B1 (en) * | 2016-07-28 | 2019-09-03 | Josh.ai LLC | Speech control for complex commands |
US10115400B2 (en) | 2016-08-05 | 2018-10-30 | Sonos, Inc. | Multiple voice services |
US9942678B1 (en) | 2016-09-27 | 2018-04-10 | Sonos, Inc. | Audio playback settings for voice interaction |
US9743204B1 (en) | 2016-09-30 | 2017-08-22 | Sonos, Inc. | Multi-orientation playback device microphones |
US10181323B2 (en) | 2016-10-19 | 2019-01-15 | Sonos, Inc. | Arbitration-based voice recognition |
WO2018085760A1 (en) | 2016-11-04 | 2018-05-11 | Semantic Machines, Inc. | Data collection for a new conversational dialogue system |
WO2018148441A1 (en) | 2017-02-08 | 2018-08-16 | Semantic Machines, Inc. | Natural language content generator |
US11069340B2 (en) | 2017-02-23 | 2021-07-20 | Microsoft Technology Licensing, Llc | Flexible and expandable dialogue system |
WO2018156978A1 (en) | 2017-02-23 | 2018-08-30 | Semantic Machines, Inc. | Expandable dialogue system |
CN116991971A (zh) * | 2017-02-23 | 2023-11-03 | 微软技术许可有限责任公司 | 可扩展对话系统 |
US10762892B2 (en) | 2017-02-23 | 2020-09-01 | Semantic Machines, Inc. | Rapid deployment of dialogue system |
US11183181B2 (en) | 2017-03-27 | 2021-11-23 | Sonos, Inc. | Systems and methods of multiple voice services |
US10726832B2 (en) | 2017-05-11 | 2020-07-28 | Apple Inc. | Maintaining privacy of personal information |
DK180048B1 (en) | 2017-05-11 | 2020-02-04 | Apple Inc. | MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770429A1 (en) | 2017-05-12 | 2018-12-14 | Apple Inc. | LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT |
US20180336892A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Detecting a trigger of a digital assistant |
US10303715B2 (en) | 2017-05-16 | 2019-05-28 | Apple Inc. | Intelligent automated assistant for media exploration |
CN109102802B (zh) * | 2017-06-21 | 2023-10-17 | 三星电子株式会社 | 用于处理用户话语的系统 |
KR102007478B1 (ko) * | 2017-06-28 | 2019-08-05 | 크리스토퍼 재현 윤 | 특정 조건에서 음성인식을 이용한 어플리케이션 제어 장치 및 방법 |
CN107316643B (zh) * | 2017-07-04 | 2021-08-17 | 科大讯飞股份有限公司 | 语音交互方法及装置 |
US20190027149A1 (en) * | 2017-07-20 | 2019-01-24 | Nuance Communications, Inc. | Documentation tag processing system |
US10475449B2 (en) | 2017-08-07 | 2019-11-12 | Sonos, Inc. | Wake-word detection suppression |
KR102411766B1 (ko) * | 2017-08-25 | 2022-06-22 | 삼성전자주식회사 | 음성 인식 서비스를 활성화하는 방법 및 이를 구현한 전자 장치 |
US11132499B2 (en) | 2017-08-28 | 2021-09-28 | Microsoft Technology Licensing, Llc | Robust expandable dialogue system |
US10311874B2 (en) * | 2017-09-01 | 2019-06-04 | 4Q Catalyst, LLC | Methods and systems for voice-based programming of a voice-controlled device |
US10048930B1 (en) | 2017-09-08 | 2018-08-14 | Sonos, Inc. | Dynamic computation of system response volume |
US10446165B2 (en) | 2017-09-27 | 2019-10-15 | Sonos, Inc. | Robust short-time fourier transform acoustic echo cancellation during audio playback |
US10621981B2 (en) | 2017-09-28 | 2020-04-14 | Sonos, Inc. | Tone interference cancellation |
US10482868B2 (en) | 2017-09-28 | 2019-11-19 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
US10466962B2 (en) | 2017-09-29 | 2019-11-05 | Sonos, Inc. | Media playback system with voice assistance |
CN107886948A (zh) | 2017-11-16 | 2018-04-06 | 百度在线网络技术(北京)有限公司 | 语音交互方法及装置,终端,服务器及可读存储介质 |
US10880650B2 (en) | 2017-12-10 | 2020-12-29 | Sonos, Inc. | Network microphone devices with automatic do not disturb actuation capabilities |
US10818290B2 (en) | 2017-12-11 | 2020-10-27 | Sonos, Inc. | Home graph |
US10878808B1 (en) * | 2018-01-09 | 2020-12-29 | Amazon Technologies, Inc. | Speech processing dialog management |
US11343614B2 (en) | 2018-01-31 | 2022-05-24 | Sonos, Inc. | Device designation of playback and network microphone device arrangements |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
US10936822B2 (en) * | 2018-05-04 | 2021-03-02 | Dell Products L.P. | Linguistic semantic analysis alert correlation system |
US10928918B2 (en) | 2018-05-07 | 2021-02-23 | Apple Inc. | Raise to speak |
US11145294B2 (en) * | 2018-05-07 | 2021-10-12 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US11175880B2 (en) | 2018-05-10 | 2021-11-16 | Sonos, Inc. | Systems and methods for voice-assisted media content selection |
US10847178B2 (en) | 2018-05-18 | 2020-11-24 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection |
KR20190133100A (ko) * | 2018-05-22 | 2019-12-02 | 삼성전자주식회사 | 어플리케이션을 이용하여 음성 입력에 대한 응답을 출력하는 전자 장치 및 그 동작 방법 |
US10959029B2 (en) | 2018-05-25 | 2021-03-23 | Sonos, Inc. | Determining and adapting to changes in microphone performance of playback devices |
DK179822B1 (da) | 2018-06-01 | 2019-07-12 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
US10811009B2 (en) * | 2018-06-27 | 2020-10-20 | International Business Machines Corporation | Automatic skill routing in conversational computing frameworks |
US10681460B2 (en) | 2018-06-28 | 2020-06-09 | Sonos, Inc. | Systems and methods for associating playback devices with voice assistant services |
US10461710B1 (en) | 2018-08-28 | 2019-10-29 | Sonos, Inc. | Media playback system with maximum volume setting |
US11076035B2 (en) | 2018-08-28 | 2021-07-27 | Sonos, Inc. | Do not disturb feature for audio notifications |
US10587430B1 (en) | 2018-09-14 | 2020-03-10 | Sonos, Inc. | Networked devices, systems, and methods for associating playback devices based on sound codes |
US10878811B2 (en) | 2018-09-14 | 2020-12-29 | Sonos, Inc. | Networked devices, systems, and methods for intelligently deactivating wake-word engines |
US11024331B2 (en) | 2018-09-21 | 2021-06-01 | Sonos, Inc. | Voice detection optimization using sound metadata |
US10811015B2 (en) | 2018-09-25 | 2020-10-20 | Sonos, Inc. | Voice detection optimization based on selected voice assistant service |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
US11100923B2 (en) | 2018-09-28 | 2021-08-24 | Sonos, Inc. | Systems and methods for selective wake word detection using neural network models |
US10692518B2 (en) | 2018-09-29 | 2020-06-23 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection via multiple network microphone devices |
KR102620705B1 (ko) | 2018-10-11 | 2024-01-04 | 삼성전자주식회사 | 전자 장치 및 그의 동작 방법 |
US11899519B2 (en) | 2018-10-23 | 2024-02-13 | Sonos, Inc. | Multiple stage network microphone device with reduced power consumption and processing load |
US11527265B2 (en) | 2018-11-02 | 2022-12-13 | BriefCam Ltd. | Method and system for automatic object-aware video or audio redaction |
KR20200055202A (ko) * | 2018-11-12 | 2020-05-21 | 삼성전자주식회사 | 제스처에 의해 트리거 되는 음성 인식 서비스를 제공하는 전자 장치 및 그 동작 방법 |
EP3654249A1 (en) | 2018-11-15 | 2020-05-20 | Snips | Dilated convolutions and gating for efficient keyword spotting |
US11183183B2 (en) | 2018-12-07 | 2021-11-23 | Sonos, Inc. | Systems and methods of operating media playback systems having multiple voice assistant services |
US11132989B2 (en) | 2018-12-13 | 2021-09-28 | Sonos, Inc. | Networked microphone devices, systems, and methods of localized arbitration |
US10602268B1 (en) | 2018-12-20 | 2020-03-24 | Sonos, Inc. | Optimization of network microphone devices using noise classification |
US11315556B2 (en) | 2019-02-08 | 2022-04-26 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing by transmitting sound data associated with a wake word to an appropriate device for identification |
US10867604B2 (en) | 2019-02-08 | 2020-12-15 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing |
US20220013119A1 (en) * | 2019-02-13 | 2022-01-13 | Sony Group Corporation | Information processing device and information processing method |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
US11120794B2 (en) | 2019-05-03 | 2021-09-14 | Sonos, Inc. | Voice assistant persistence across multiple network microphone devices |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
DK180129B1 (en) | 2019-05-31 | 2020-06-02 | Apple Inc. | USER ACTIVITY SHORTCUT SUGGESTIONS |
DK201970511A1 (en) | 2019-05-31 | 2021-02-15 | Apple Inc | Voice identification in digital assistant systems |
US11468890B2 (en) | 2019-06-01 | 2022-10-11 | Apple Inc. | Methods and user interfaces for voice-based control of electronic devices |
US11200894B2 (en) | 2019-06-12 | 2021-12-14 | Sonos, Inc. | Network microphone device with command keyword eventing |
US10586540B1 (en) | 2019-06-12 | 2020-03-10 | Sonos, Inc. | Network microphone device with command keyword conditioning |
US11361756B2 (en) | 2019-06-12 | 2022-06-14 | Sonos, Inc. | Conditional wake word eventing based on environment |
US10871943B1 (en) | 2019-07-31 | 2020-12-22 | Sonos, Inc. | Noise classification for event detection |
US11138975B2 (en) | 2019-07-31 | 2021-10-05 | Sonos, Inc. | Locally distributed keyword detection |
US11138969B2 (en) | 2019-07-31 | 2021-10-05 | Sonos, Inc. | Locally distributed keyword detection |
US11184298B2 (en) * | 2019-08-28 | 2021-11-23 | International Business Machines Corporation | Methods and systems for improving chatbot intent training by correlating user feedback provided subsequent to a failed response to an initial user intent |
CN110718221A (zh) * | 2019-10-08 | 2020-01-21 | 百度在线网络技术(北京)有限公司 | 语音技能控制方法、语音设备、客户端以及服务器 |
US11189286B2 (en) | 2019-10-22 | 2021-11-30 | Sonos, Inc. | VAS toggle based on device orientation |
CN110808051A (zh) * | 2019-10-30 | 2020-02-18 | 腾讯科技(深圳)有限公司 | 一种技能选取的方法以及相关装置 |
US20210158803A1 (en) * | 2019-11-21 | 2021-05-27 | Lenovo (Singapore) Pte. Ltd. | Determining wake word strength |
US11450325B1 (en) | 2019-12-12 | 2022-09-20 | Amazon Technologies, Inc. | Natural language processing |
US11482214B1 (en) * | 2019-12-12 | 2022-10-25 | Amazon Technologies, Inc. | Hypothesis generation and selection for inverse text normalization for search |
US11380308B1 (en) | 2019-12-13 | 2022-07-05 | Amazon Technologies, Inc. | Natural language processing |
US11551681B1 (en) * | 2019-12-13 | 2023-01-10 | Amazon Technologies, Inc. | Natural language processing routing |
US11200900B2 (en) | 2019-12-20 | 2021-12-14 | Sonos, Inc. | Offline voice control |
US11562740B2 (en) | 2020-01-07 | 2023-01-24 | Sonos, Inc. | Voice verification for media playback |
CN111240478B (zh) * | 2020-01-07 | 2023-10-13 | 百度在线网络技术(北京)有限公司 | 设备响应的评测方法、装置、设备及存储介质 |
US11556307B2 (en) | 2020-01-31 | 2023-01-17 | Sonos, Inc. | Local voice data processing |
US11308958B2 (en) | 2020-02-07 | 2022-04-19 | Sonos, Inc. | Localized wakeword verification |
US11089440B1 (en) | 2020-03-02 | 2021-08-10 | International Business Machines Corporation | Management of geographically and temporarily distributed services |
WO2021216164A1 (en) * | 2020-04-21 | 2021-10-28 | Google Llc | Hierarchical context specific actions from ambient speech |
US11038934B1 (en) | 2020-05-11 | 2021-06-15 | Apple Inc. | Digital assistant hardware abstraction |
US11061543B1 (en) | 2020-05-11 | 2021-07-13 | Apple Inc. | Providing relevant data items based on context |
US11755276B2 (en) | 2020-05-12 | 2023-09-12 | Apple Inc. | Reducing description length based on confidence |
US11482224B2 (en) | 2020-05-20 | 2022-10-25 | Sonos, Inc. | Command keywords with input detection windowing |
US11727919B2 (en) | 2020-05-20 | 2023-08-15 | Sonos, Inc. | Memory allocation for keyword spotting engines |
US11308962B2 (en) | 2020-05-20 | 2022-04-19 | Sonos, Inc. | Input detection windowing |
US11490204B2 (en) | 2020-07-20 | 2022-11-01 | Apple Inc. | Multi-device audio adjustment coordination |
US11438683B2 (en) | 2020-07-21 | 2022-09-06 | Apple Inc. | User identification using headphones |
US11501762B2 (en) * | 2020-07-29 | 2022-11-15 | Microsoft Technology Licensing, Llc | Compounding corrective actions and learning in mixed mode dictation |
US11698771B2 (en) | 2020-08-25 | 2023-07-11 | Sonos, Inc. | Vocal guidance engines for playback devices |
CN112786040A (zh) * | 2020-10-22 | 2021-05-11 | 青岛经济技术开发区海尔热水器有限公司 | 应用于智能家电设备的语音控制方法、装置及设备 |
US11984123B2 (en) | 2020-11-12 | 2024-05-14 | Sonos, Inc. | Network device interaction by range |
US11551700B2 (en) | 2021-01-25 | 2023-01-10 | Sonos, Inc. | Systems and methods for power-efficient keyword detection |
US11862175B2 (en) * | 2021-01-28 | 2024-01-02 | Verizon Patent And Licensing Inc. | User identification and authentication |
US11908452B1 (en) * | 2021-05-20 | 2024-02-20 | Amazon Technologies, Inc. | Alternative input representations for speech inputs |
US20220406301A1 (en) * | 2021-06-16 | 2022-12-22 | Google Llc | Passive disambiguation of assistant commands |
Family Cites Families (113)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2119397C (en) | 1993-03-19 | 2007-10-02 | Kim E.A. Silverman | Improved automated voice synthesis employing enhanced prosodic treatment of text, spelling of text and rate of annunciation |
US6493743B2 (en) * | 1997-01-28 | 2002-12-10 | Casio Computer Co., Ltd. | PDA workspace interface using application icons for downloading remote user file |
EP0980574B1 (en) | 1997-10-20 | 2004-03-10 | Koninklijke Philips Electronics N.V. | Pattern recognition enrolment in a distributed system |
US6604075B1 (en) * | 1999-05-20 | 2003-08-05 | Lucent Technologies Inc. | Web-based voice dialog interface |
US7069220B2 (en) * | 1999-08-13 | 2006-06-27 | International Business Machines Corporation | Method for determining and maintaining dialog focus in a conversational speech system |
US6748361B1 (en) | 1999-12-14 | 2004-06-08 | International Business Machines Corporation | Personal speech assistant supporting a dialog manager |
US20020072914A1 (en) * | 2000-12-08 | 2002-06-13 | Hiyan Alshawi | Method and apparatus for creation and user-customization of speech-enabled services |
JP4155383B2 (ja) | 2001-03-05 | 2008-09-24 | アルパイン株式会社 | 音声認識機器操作装置 |
US7398209B2 (en) | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7502737B2 (en) | 2002-06-24 | 2009-03-10 | Intel Corporation | Multi-pass recognition of spoken dialogue |
JP4107093B2 (ja) | 2003-01-30 | 2008-06-25 | 株式会社日立製作所 | 対話型端末装置及び対話アプリケーション提供方法 |
US7013282B2 (en) | 2003-04-18 | 2006-03-14 | At&T Corp. | System and method for text-to-speech processing in a portable device |
JP2005017974A (ja) * | 2003-06-30 | 2005-01-20 | Noritz Corp | 温水システム |
US7363228B2 (en) | 2003-09-18 | 2008-04-22 | Interactive Intelligence, Inc. | Speech recognition system and method |
JP4377718B2 (ja) * | 2004-02-27 | 2009-12-02 | 富士通株式会社 | 対話制御システム及び方法 |
US7624018B2 (en) * | 2004-03-12 | 2009-11-24 | Microsoft Corporation | Speech recognition using categories and speech prefixing |
CN100424630C (zh) * | 2004-03-26 | 2008-10-08 | 宏碁股份有限公司 | 网页语音接口的操作方法 |
US20060116880A1 (en) * | 2004-09-03 | 2006-06-01 | Thomas Gober | Voice-driven user interface |
JP4405370B2 (ja) | 2004-11-15 | 2010-01-27 | 本田技研工業株式会社 | 車両用機器制御装置 |
US7653546B2 (en) * | 2004-11-18 | 2010-01-26 | Nuance Communications, Inc. | Method and system for efficient voice-based programming |
JP3984988B2 (ja) | 2004-11-26 | 2007-10-03 | キヤノン株式会社 | ユーザインタフェース設計装置およびその制御方法 |
WO2006077942A1 (ja) * | 2005-01-19 | 2006-07-27 | Brother Kogyo Kabushiki Kaisha | 無線タグ情報管理システム及び読取装置、タグラベル作成装置、無線タグ回路素子カートリッジ、無線タグ |
JP4628803B2 (ja) | 2005-01-25 | 2011-02-09 | 本田技研工業株式会社 | 音声認識型機器制御装置 |
US7640160B2 (en) | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
US9703892B2 (en) * | 2005-09-14 | 2017-07-11 | Millennial Media Llc | Predictive text completion for a mobile communication facility |
JP4260788B2 (ja) | 2005-10-20 | 2009-04-30 | 本田技研工業株式会社 | 音声認識機器制御装置 |
JP4878471B2 (ja) | 2005-11-02 | 2012-02-15 | キヤノン株式会社 | 情報処理装置およびその制御方法 |
JP2008076811A (ja) | 2006-09-22 | 2008-04-03 | Honda Motor Co Ltd | 音声認識装置、音声認識方法及び音声認識プログラム |
US7840409B2 (en) | 2007-02-27 | 2010-11-23 | Nuance Communications, Inc. | Ordering recognition results produced by an automatic speech recognition engine for a multimodal application |
US7877258B1 (en) | 2007-03-29 | 2011-01-25 | Google Inc. | Representing n-gram language models for compact storage and fast retrieval |
US8285329B1 (en) * | 2007-04-02 | 2012-10-09 | Sprint Communications Company L.P. | Mobile device-based control of smart card operation |
US8396713B2 (en) | 2007-04-30 | 2013-03-12 | Nuance Communications, Inc. | Method and system for using a statistical language model and an action classifier in parallel with grammar for better handling of out-of-grammar utterances |
US8028042B2 (en) * | 2007-06-15 | 2011-09-27 | Amazon Technologies, Inc. | System and method of managing media content |
US8239239B1 (en) * | 2007-07-23 | 2012-08-07 | Adobe Systems Incorporated | Methods and systems for dynamic workflow access based on user action |
US8165886B1 (en) * | 2007-10-04 | 2012-04-24 | Great Northern Research LLC | Speech interface system and method for control and interaction with applications on a computing system |
WO2009055819A1 (en) * | 2007-10-26 | 2009-04-30 | Honda Motor Co., Ltd. | Improving free-speech command classification for car navigation system |
US9241063B2 (en) * | 2007-11-01 | 2016-01-19 | Google Inc. | Methods for responding to an email message by call from a mobile device |
US8219407B1 (en) * | 2007-12-27 | 2012-07-10 | Great Northern Research, LLC | Method for processing the output of a speech recognizer |
US8370160B2 (en) | 2007-12-31 | 2013-02-05 | Motorola Mobility Llc | Methods and apparatus for implementing distributed multi-modal applications |
US20090171663A1 (en) | 2008-01-02 | 2009-07-02 | International Business Machines Corporation | Reducing a size of a compiled speech recognition grammar |
US7917368B2 (en) | 2008-02-25 | 2011-03-29 | Mitsubishi Electric Research Laboratories, Inc. | Method for interacting with users of speech recognition systems |
US8418076B2 (en) * | 2008-05-15 | 2013-04-09 | Microsoft Corporation | Managing inputs from a plurality of user input device actuators |
KR101545582B1 (ko) * | 2008-10-29 | 2015-08-19 | 엘지전자 주식회사 | 단말기 및 그 제어 방법 |
US8479051B2 (en) * | 2009-01-23 | 2013-07-02 | Microsoft Corporation | System and method for customized error reporting |
US9755842B2 (en) * | 2009-01-28 | 2017-09-05 | Headwater Research Llc | Managing service user discovery and service launch object placement on a device |
TWI420433B (zh) * | 2009-02-27 | 2013-12-21 | Ind Tech Res Inst | 語音互動系統與方法 |
US9684741B2 (en) | 2009-06-05 | 2017-06-20 | Microsoft Technology Licensing, Llc | Presenting search results according to query domains |
US10540976B2 (en) * | 2009-06-05 | 2020-01-21 | Apple Inc. | Contextual voice commands |
US20130219333A1 (en) * | 2009-06-12 | 2013-08-22 | Adobe Systems Incorporated | Extensible Framework for Facilitating Interaction with Devices |
US9111538B2 (en) | 2009-09-30 | 2015-08-18 | T-Mobile Usa, Inc. | Genius button secondary commands |
US20110099507A1 (en) | 2009-10-28 | 2011-04-28 | Google Inc. | Displaying a collection of interactive elements that trigger actions directed to an item |
US8868427B2 (en) * | 2009-12-11 | 2014-10-21 | General Motors Llc | System and method for updating information in electronic calendars |
EP2531999A4 (en) * | 2010-02-05 | 2017-03-29 | Nuance Communications, Inc. | Language context sensitive command system and method |
US8515734B2 (en) | 2010-02-08 | 2013-08-20 | Adacel Systems, Inc. | Integrated language model, related systems and methods |
US8694313B2 (en) * | 2010-05-19 | 2014-04-08 | Google Inc. | Disambiguation of contact information using historical data |
KR101699720B1 (ko) * | 2010-08-03 | 2017-01-26 | 삼성전자주식회사 | 음성명령 인식 장치 및 음성명령 인식 방법 |
US8731939B1 (en) | 2010-08-06 | 2014-05-20 | Google Inc. | Routing queries based on carrier phrase registration |
US8682661B1 (en) | 2010-08-31 | 2014-03-25 | Google Inc. | Robust speech recognition |
US8719727B2 (en) * | 2010-12-15 | 2014-05-06 | Microsoft Corporation | Managing an immersive environment |
KR101828273B1 (ko) * | 2011-01-04 | 2018-02-14 | 삼성전자주식회사 | 결합기반의 음성명령 인식 장치 및 그 방법 |
US8929591B2 (en) * | 2011-03-08 | 2015-01-06 | Bank Of America Corporation | Providing information associated with an identified representation of an object |
US9104440B2 (en) * | 2011-05-27 | 2015-08-11 | Microsoft Technology Licensing, Llc | Multi-application environment |
US8818994B2 (en) * | 2011-06-27 | 2014-08-26 | Bmc Software, Inc. | Mobile service context |
US8707289B2 (en) * | 2011-07-20 | 2014-04-22 | Google Inc. | Multiple application versions |
US8997171B2 (en) * | 2011-08-19 | 2015-03-31 | Microsoft Technology Licensing, Llc | Policy based application suspension and termination |
US8806369B2 (en) * | 2011-08-26 | 2014-08-12 | Apple Inc. | Device, method, and graphical user interface for managing and interacting with concurrently open software applications |
US8762156B2 (en) * | 2011-09-28 | 2014-06-24 | Apple Inc. | Speech recognition repair using contextual information |
CN102520788B (zh) * | 2011-11-16 | 2015-01-21 | 歌尔声学股份有限公司 | 一种语音识别控制方法 |
CN103999152A (zh) | 2011-12-29 | 2014-08-20 | 英特尔公司 | 利用动态语法元素集的语音识别 |
US9418658B1 (en) * | 2012-02-08 | 2016-08-16 | Amazon Technologies, Inc. | Configuration of voice controlled assistant |
US8902182B2 (en) * | 2012-02-24 | 2014-12-02 | Blackberry Limited | Electronic device and method of controlling a display |
US20130238326A1 (en) * | 2012-03-08 | 2013-09-12 | Lg Electronics Inc. | Apparatus and method for multiple device voice control |
US9503683B2 (en) * | 2012-03-27 | 2016-11-22 | Google Inc. | Providing users access to applications during video communications |
US20140365884A1 (en) * | 2012-03-30 | 2014-12-11 | Google Inc. | Voice command recording and playback |
US8881269B2 (en) * | 2012-03-31 | 2014-11-04 | Apple Inc. | Device, method, and graphical user interface for integrating recognition of handwriting gestures with a screen reader |
JP6012237B2 (ja) * | 2012-04-18 | 2016-10-25 | キヤノン株式会社 | 情報処理装置、制御方法、及びプログラム |
KR101944414B1 (ko) * | 2012-06-04 | 2019-01-31 | 삼성전자주식회사 | 음성 인식 서비스를 제공하기 위한 방법 및 그 전자 장치 |
US9317709B2 (en) * | 2012-06-26 | 2016-04-19 | Google Inc. | System and method for detecting and integrating with native applications enabled for web-based storage |
US8532675B1 (en) * | 2012-06-27 | 2013-09-10 | Blackberry Limited | Mobile communication device user interface for manipulation of data items in a physical space |
US8965759B2 (en) * | 2012-09-01 | 2015-02-24 | Sarah Hershenhorn | Digital voice memo transfer and processing |
US20150088523A1 (en) * | 2012-09-10 | 2015-03-26 | Google Inc. | Systems and Methods for Designing Voice Applications |
CN103674012B (zh) * | 2012-09-21 | 2017-09-29 | 高德软件有限公司 | 语音定制方法及其装置、语音识别方法及其装置 |
KR101407192B1 (ko) * | 2012-09-28 | 2014-06-16 | 주식회사 팬택 | 사운드 출력을 제어하는 휴대 단말 및 사운드 출력 제어 방법 |
KR20140089861A (ko) * | 2013-01-07 | 2014-07-16 | 삼성전자주식회사 | 디스플레이 장치 및 그의 제어 방법 |
US10102845B1 (en) * | 2013-02-25 | 2018-10-16 | Amazon Technologies, Inc. | Interpreting nonstandard terms in language processing using text-based communications |
US9172747B2 (en) * | 2013-02-25 | 2015-10-27 | Artificial Solutions Iberia SL | System and methods for virtual assistant networks |
US9454957B1 (en) * | 2013-03-05 | 2016-09-27 | Amazon Technologies, Inc. | Named entity resolution in spoken language processing |
JP6236805B2 (ja) * | 2013-03-05 | 2017-11-29 | 日本電気株式会社 | 発話コマンド認識システム |
US9530160B2 (en) * | 2013-03-14 | 2016-12-27 | Nanigans, Inc. | System and method for an affinity capture, user feedback and affinity analysis |
WO2014157903A1 (en) * | 2013-03-27 | 2014-10-02 | Samsung Electronics Co., Ltd. | Method and device for displaying service page for executing application |
US9875494B2 (en) * | 2013-04-16 | 2018-01-23 | Sri International | Using intents to analyze and personalize a user's dialog experience with a virtual personal assistant |
US20140324856A1 (en) * | 2013-04-27 | 2014-10-30 | Microsoft Corporation | Application discoverability |
US9292254B2 (en) * | 2013-05-15 | 2016-03-22 | Maluuba Inc. | Interactive user interface for an intelligent assistant |
JP2015011170A (ja) | 2013-06-28 | 2015-01-19 | 株式会社ATR−Trek | ローカルな音声認識を行なう音声認識クライアント装置 |
US9443507B2 (en) * | 2013-07-15 | 2016-09-13 | GM Global Technology Operations LLC | System and method for controlling a speech recognition system |
US20150024721A1 (en) * | 2013-07-22 | 2015-01-22 | Nvidia Corporation | Automatically connecting/disconnecting an incoming phone call to a data processing device based on determining intent of a user thereof to respond to the incoming phone call |
US9343068B2 (en) | 2013-09-16 | 2016-05-17 | Qualcomm Incorporated | Method and apparatus for controlling access to applications having different security levels |
CN103794214A (zh) * | 2014-03-07 | 2014-05-14 | 联想(北京)有限公司 | 一种信息处理方法、装置和电子设备 |
US10552852B1 (en) * | 2014-03-11 | 2020-02-04 | Vmware, Inc. | Service monitor for monitoring and tracking the performance of applications running on different mobile devices |
US10249296B1 (en) * | 2014-05-27 | 2019-04-02 | Amazon Technologies, Inc. | Application discovery and selection in language-based systems |
US10592080B2 (en) * | 2014-07-31 | 2020-03-17 | Microsoft Technology Licensing, Llc | Assisted presentation of application windows |
US9548066B2 (en) * | 2014-08-11 | 2017-01-17 | Amazon Technologies, Inc. | Voice application architecture |
US20160103793A1 (en) * | 2014-10-14 | 2016-04-14 | Microsoft Technology Licensing, Llc | Heterogeneous Application Tabs |
US9116768B1 (en) * | 2014-11-20 | 2015-08-25 | Symantec Corporation | Systems and methods for deploying applications included in application containers |
US9812126B2 (en) * | 2014-11-28 | 2017-11-07 | Microsoft Technology Licensing, Llc | Device arbitration for listening devices |
US10248192B2 (en) * | 2014-12-03 | 2019-04-02 | Microsoft Technology Licensing, Llc | Gaze target application launcher |
US10460720B2 (en) * | 2015-01-03 | 2019-10-29 | Microsoft Technology Licensing, Llc. | Generation of language understanding systems and methods |
US9472196B1 (en) | 2015-04-22 | 2016-10-18 | Google Inc. | Developer voice actions system |
US20170075985A1 (en) * | 2015-09-16 | 2017-03-16 | Microsoft Technology Licensing, Llc | Query transformation for natural language queries |
WO2018080162A1 (ko) * | 2016-10-27 | 2018-05-03 | 삼성전자 주식회사 | 음성 명령에 기초하여 애플리케이션을 실행하는 방법 및 장치 |
US11468881B2 (en) * | 2019-03-29 | 2022-10-11 | Samsung Electronics Co., Ltd. | Method and system for semantic intelligent task learning and adaptive execution |
KR20210045241A (ko) * | 2019-10-16 | 2021-04-26 | 삼성전자주식회사 | 전자 장치 및 전자 장치의 음성 명령어 공유 방법 |
-
2015
- 2015-04-22 US US14/693,330 patent/US9472196B1/en active Active
-
2016
- 2016-04-12 EP EP16721548.2A patent/EP3286633B1/en active Active
- 2016-04-12 CN CN201680019717.8A patent/CN107408385B/zh active Active
- 2016-04-12 CN CN202111019888.XA patent/CN113851120A/zh active Pending
- 2016-04-12 KR KR1020197031169A patent/KR102173100B1/ko active IP Right Grant
- 2016-04-12 KR KR1020177028031A patent/KR102038074B1/ko active IP Right Grant
- 2016-04-12 DE DE112016001852.5T patent/DE112016001852T5/de active Pending
- 2016-04-12 JP JP2017550871A patent/JP6538188B2/ja active Active
- 2016-04-12 WO PCT/US2016/027113 patent/WO2016171956A1/en active Application Filing
- 2016-04-12 GB GB1715580.5A patent/GB2553234B/en active Active
- 2016-09-07 US US15/258,084 patent/US10008203B2/en active Active
-
2018
- 2018-05-23 US US15/987,509 patent/US10839799B2/en active Active
-
2019
- 2019-05-30 JP JP2019101151A patent/JP6873188B2/ja active Active
-
2020
- 2020-11-16 US US17/099,130 patent/US11657816B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
US20170186427A1 (en) | 2017-06-29 |
US11657816B2 (en) | 2023-05-23 |
EP3286633A1 (en) | 2018-02-28 |
WO2016171956A1 (en) | 2016-10-27 |
JP2018511831A (ja) | 2018-04-26 |
CN107408385B (zh) | 2021-09-21 |
KR20190122888A (ko) | 2019-10-30 |
GB201715580D0 (en) | 2017-11-08 |
GB2553234A (en) | 2018-02-28 |
KR20170124583A (ko) | 2017-11-10 |
US20210082430A1 (en) | 2021-03-18 |
US20180374480A1 (en) | 2018-12-27 |
KR102038074B1 (ko) | 2019-10-29 |
US10008203B2 (en) | 2018-06-26 |
DE112016001852T5 (de) | 2018-06-14 |
CN107408385A (zh) | 2017-11-28 |
US10839799B2 (en) | 2020-11-17 |
US20160314791A1 (en) | 2016-10-27 |
US9472196B1 (en) | 2016-10-18 |
CN113851120A (zh) | 2021-12-28 |
JP6538188B2 (ja) | 2019-07-03 |
JP2019144598A (ja) | 2019-08-29 |
GB2553234B (en) | 2022-08-10 |
KR102173100B1 (ko) | 2020-11-02 |
EP3286633B1 (en) | 2022-07-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6873188B2 (ja) | 開発者音声アクションシステム | |
JP6942841B2 (ja) | ダイアログ・システムにおけるパラメータ収集および自動ダイアログ生成 | |
JP6704450B2 (ja) | 開発者ボイスアクションシステム | |
US9026431B1 (en) | Semantic parsing with multiple parsers | |
US11626115B2 (en) | Voice to text conversion based on third-party agent content | |
CN110770736A (zh) | 将对话驱动式应用程序导出到数字通信平台 | |
JP2019503526A5 (ja) | ||
KR102438671B1 (ko) | 텍스트 독립 화자 인식 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20190605 |
|
A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20200629 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20200727 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20201014 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20210322 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20210420 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 6873188 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |