JP6251343B2 - 複数のデバイスにおけるホットワードの検出 - Google Patents
複数のデバイスにおけるホットワードの検出 Download PDFInfo
- Publication number
- JP6251343B2 JP6251343B2 JP2016174371A JP2016174371A JP6251343B2 JP 6251343 B2 JP6251343 B2 JP 6251343B2 JP 2016174371 A JP2016174371 A JP 2016174371A JP 2016174371 A JP2016174371 A JP 2016174371A JP 6251343 B2 JP6251343 B2 JP 6251343B2
- Authority
- JP
- Japan
- Prior art keywords
- computing device
- signal
- speech
- mobile computing
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001514 detection method Methods 0.000 title 1
- 238000000034 method Methods 0.000 claims description 62
- 230000008569 process Effects 0.000 claims description 29
- 230000009471 action Effects 0.000 claims description 22
- 230000004044 response Effects 0.000 claims description 8
- 238000002604 ultrasonography Methods 0.000 claims description 6
- 230000015654 memory Effects 0.000 description 31
- 238000004891 communication Methods 0.000 description 18
- 238000004364 calculation method Methods 0.000 description 12
- 238000004590 computer program Methods 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 230000005236 sound signal Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000003139 buffering effect Effects 0.000 description 1
- 238000010411 cooking Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 238000012706 support-vector machine Methods 0.000 description 1
- 239000010409 thin film Substances 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/87—Detection of discrete points within a voice signal
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Telephone Function (AREA)
- Telephonic Communication Services (AREA)
- User Interface Of Digital Computer (AREA)
- Circuit For Audible Band Transducer (AREA)
Description
102 ユーザ
104 発言
106 コンピューティングデバイス
108 コンピューティングデバイス
110 コンピューティングデバイス
114 マイク
116 マイク
118 マイク
120 ホットワーダ
122 ホットワーダ
124 ホットワーダ
126 ラウドネススコアラ
128 ラウドネススコアラ
130 ラウドネススコアラ
132 遅延計算モジュール
134 遅延計算モジュール
136 遅延計算モジュール
138 スピーカ
138 デバイス状況
140 スピーカ
140 デバイス状況
142 スピーカ
142 デバイス状況
200 プロセス
Claims (17)
- コンピュータにより実施される方法であって、
(i)予め定義されたホットワードが先行する音声コマンドを処理するように構成され、(ii)同一の予め定義されたホットワードが先行する音声コマンドを処理するように構成された別のモバイルコンピューティングデバイスの近くにあり、(iii)前記別のモバイルコンピューティングデバイスより話者から遠いモバイルコンピューティングデバイスが、前記予め定義されたホットワードが先行する音声コマンドの前記話者による発言を表す音声入力を受信するステップと、
前記予め定義されたホットワードが先行する音声コマンドの前記話者による発言を表す音声入力の受信に応答して、遅延時間の量を判定し、前記遅延時間が経過した後で、前記別のモバイルコンピューティングデバイスによる前記音声コマンドの処理を回避するために前記別のモバイルコンピューティングデバイスに信号を送信するステップと、を備える方法。 - 前記信号が、超音波信号または短距離無線信号を備える、請求項1に記載の方法。
- 前記遅延時間の量が、前記発言を表す音声入力のラウドネスに基づく、請求項1に記載の方法。
- 前記遅延時間の量が、閾値ラウドネスを満たす発言を表す音声入力のラウドネスに基づいてゼロである、請求項1に記載の方法。
- 前記予め定義されたホットワードが先行する音声コマンドの前記話者による発言を表す音声入力の受信に応答して前記信号を送信すると、前記モバイルコンピューティングデバイスのデバイス状態は活動中に設定される、請求項1に記載の方法。
- 前記別のモバイルコンピューティングデバイスから別の信号を受信するステップを備え、
前記音声コマンドの処理が、前記別の信号に基づいて回避され、前記モバイルコンピューティングデバイスのデバイス状態は活動停止に設定される、請求項1に記載の方法。 - 1つまたは複数のコンピュータと、前記1つまたは複数のコンピュータによって実行されるとき、前記1つまたは複数のコンピュータに動作を行わせるように動作可能な命令を記憶する1つまたは複数の記憶デバイスと、を備えるシステムであって、前記動作が、
(i)予め定義されたホットワードが先行する音声コマンドを処理するように構成され、(ii)同一の予め定義されたホットワードが先行する音声コマンドを処理するように構成された別のモバイルコンピューティングデバイスの近くにあり、(iii)前記別のモバイルコンピューティングデバイスより話者から遠いモバイルコンピューティングデバイスが、前記予め定義されたホットワードが先行する音声コマンドの前記話者による発言を表す音声入力を受信することと、
前記予め定義されたホットワードが先行する音声コマンドの前記話者による発言を表す音声入力の受信に応答して、遅延時間の量を判定し、前記遅延時間が経過した後で、前記別のモバイルコンピューティングデバイスによる前記音声コマンドの処理を回避するために前記別のモバイルコンピューティングデバイスに信号を送信することと、を備えるシステム。 - 前記信号が、超音波信号または短距離無線信号を備える、請求項7に記載のシステム。
- 前記遅延時間の量が、前記発言を表す音声入力のラウドネスに基づく、請求項7に記載のシステム。
- 前記遅延時間の量が、閾値ラウドネスを満たす発言を表す音声入力のラウドネスに基づいてゼロである、請求項7に記載のシステム。
- 前記予め定義されたホットワードが先行する音声コマンドの前記話者による発言を表す音声入力の受信に応答して前記信号を送信すると、前記モバイルコンピューティングデバイスのデバイス状態は活動中に設定される、請求項7に記載のシステム。
- 前記動作が、前記別のモバイルコンピューティングデバイスから別の信号を受信することをさらに備え、
前記音声コマンドの処理が、前記別の信号に基づいて回避され、前記モバイルコンピューティングデバイスのデバイス状態は活動停止に設定される、請求項7に記載のシステム。 - 1つまたは複数のコンピュータによって実行可能であり、実行されると前記1つまたは複数のコンピュータに動作を行わせる命令を備えるソフトウェアを記憶する非一時的なコンピュータ可読媒体であって、前記動作が、
(i)予め定義されたホットワードが先行する音声コマンドを処理するように構成され、(ii)同一の予め定義されたホットワードが先行する音声コマンドを処理するように構成された別のモバイルコンピューティングデバイスの近くにあり、(iii)前記別のモバイルコンピューティングデバイスより話者から遠いモバイルコンピューティングデバイスが、前記予め定義されたホットワードが先行する音声コマンドの前記話者による発言を表す音声入力を受信することと、
前記予め定義されたホットワードが先行する音声コマンドの前記話者による発言を表す音声入力の受信に応答して、遅延時間の量を判定し、前記遅延時間が経過した後で、前記別のモバイルコンピューティングデバイスによる前記音声コマンドの処理を回避するために前記別のモバイルコンピューティングデバイスに信号を送信することと、を備える非一時的なコンピュータ可読媒体。 - 前記信号が、超音波信号または短距離無線信号を備える、請求項13に記載の非一時的なコンピュータ可読媒体。
- 前記遅延時間の量が、前記発言を表す音声入力のラウドネスに基づく、請求項13に記載の非一時的なコンピュータ可読媒体。
- 前記予め定義されたホットワードが先行する音声コマンドの前記話者による発言を表す音声入力の受信に応答して前記信号を送信すると、前記モバイルコンピューティングデバイスのデバイス状態は活動中に設定される、請求項13に記載の非一時的なコンピュータ可読媒体。
- 前記動作が、前記別のモバイルコンピューティングデバイスから別の信号を受信することをさらに備え、
前記音声コマンドの処理が、前記別の信号に基づいて回避され、前記モバイルコンピューティングデバイスのデバイス状態は活動停止に設定される、請求項13に記載の非一時的なコンピュータ可読媒体。
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462061903P | 2014-10-09 | 2014-10-09 | |
US62/061,903 | 2014-10-09 | ||
US14/659,861 | 2015-03-17 | ||
US14/659,861 US9424841B2 (en) | 2014-10-09 | 2015-03-17 | Hotword detection on multiple devices |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2016549271A Division JP6261751B2 (ja) | 2014-10-09 | 2015-09-29 | 複数のデバイスにおけるホットワードの検出 |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2017126317A JP2017126317A (ja) | 2017-07-20 |
JP6251343B2 true JP6251343B2 (ja) | 2017-12-20 |
Family
ID=54347818
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2016549271A Active JP6261751B2 (ja) | 2014-10-09 | 2015-09-29 | 複数のデバイスにおけるホットワードの検出 |
JP2016174371A Active JP6251343B2 (ja) | 2014-10-09 | 2016-09-07 | 複数のデバイスにおけるホットワードの検出 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2016549271A Active JP6261751B2 (ja) | 2014-10-09 | 2015-09-29 | 複数のデバイスにおけるホットワードの検出 |
Country Status (6)
Country | Link |
---|---|
US (6) | US9424841B2 (ja) |
EP (3) | EP3100260B1 (ja) |
JP (2) | JP6261751B2 (ja) |
KR (2) | KR101819681B1 (ja) |
CN (2) | CN105960673B (ja) |
WO (1) | WO2016057269A1 (ja) |
Families Citing this family (127)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10032452B1 (en) | 2016-12-30 | 2018-07-24 | Google Llc | Multimodal transmission of packetized data |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US10013986B1 (en) | 2016-12-30 | 2018-07-03 | Google Llc | Data structure pooling of voice activated data packets |
US11017428B2 (en) | 2008-02-21 | 2021-05-25 | Google Llc | System and method of data transmission rate adjustment |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US10706373B2 (en) | 2011-06-03 | 2020-07-07 | Apple Inc. | Performing actions associated with task items that represent tasks to perform |
US10417037B2 (en) | 2012-05-15 | 2019-09-17 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US10152723B2 (en) | 2012-05-23 | 2018-12-11 | Google Llc | Methods and systems for identifying new computers and providing matching services |
US10776830B2 (en) | 2012-05-23 | 2020-09-15 | Google Llc | Methods and systems for identifying new computers and providing matching services |
US10735552B2 (en) | 2013-01-31 | 2020-08-04 | Google Llc | Secondary transmissions of packetized data |
US10650066B2 (en) | 2013-01-31 | 2020-05-12 | Google Llc | Enhancing sitelinks with creative content |
KR20150104615A (ko) | 2013-02-07 | 2015-09-15 | 애플 인크. | 디지털 어시스턴트를 위한 음성 트리거 |
US10652394B2 (en) | 2013-03-14 | 2020-05-12 | Apple Inc. | System and method for processing voicemail |
US10748529B1 (en) | 2013-03-15 | 2020-08-18 | Apple Inc. | Voice activated device for use with a voice-based digital assistant |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
CN105453026A (zh) | 2013-08-06 | 2016-03-30 | 苹果公司 | 基于来自远程设备的活动自动激活智能响应 |
TWI566107B (zh) | 2014-05-30 | 2017-01-11 | 蘋果公司 | 用於處理多部分語音命令之方法、非暫時性電腦可讀儲存媒體及電子裝置 |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
EP2958010A1 (en) | 2014-06-20 | 2015-12-23 | Thomson Licensing | Apparatus and method for controlling the apparatus by a user |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US11942095B2 (en) | 2014-07-18 | 2024-03-26 | Google Llc | Speaker verification using co-location information |
US9257120B1 (en) | 2014-07-18 | 2016-02-09 | Google Inc. | Speaker verification using co-location information |
US11676608B2 (en) | 2021-04-02 | 2023-06-13 | Google Llc | Speaker verification using co-location information |
US9318107B1 (en) | 2014-10-09 | 2016-04-19 | Google Inc. | Hotword detection on multiple devices |
US9424841B2 (en) | 2014-10-09 | 2016-08-23 | Google Inc. | Hotword detection on multiple devices |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
US10200824B2 (en) | 2015-05-27 | 2019-02-05 | Apple Inc. | Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device |
US20160378747A1 (en) | 2015-06-29 | 2016-12-29 | Apple Inc. | Virtual assistant for media playback |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10331312B2 (en) | 2015-09-08 | 2019-06-25 | Apple Inc. | Intelligent automated assistant in a media environment |
US10740384B2 (en) | 2015-09-08 | 2020-08-11 | Apple Inc. | Intelligent automated assistant for media search and playback |
KR20170034154A (ko) * | 2015-09-18 | 2017-03-28 | 삼성전자주식회사 | 콘텐츠 제공 방법 및 이를 수행하는 전자 장치 |
US9542941B1 (en) * | 2015-10-01 | 2017-01-10 | Lenovo (Singapore) Pte. Ltd. | Situationally suspending wakeup word to enable voice command input |
US9747926B2 (en) * | 2015-10-16 | 2017-08-29 | Google Inc. | Hotword recognition |
CN107016999B (zh) | 2015-10-16 | 2022-06-14 | 谷歌有限责任公司 | 热词识别 |
US9928840B2 (en) * | 2015-10-16 | 2018-03-27 | Google Llc | Hotword recognition |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10956666B2 (en) | 2015-11-09 | 2021-03-23 | Apple Inc. | Unconventional virtual assistant interactions |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10074364B1 (en) * | 2016-02-02 | 2018-09-11 | Amazon Technologies, Inc. | Sound profile generation based on speech recognition results exceeding a threshold |
US9779735B2 (en) | 2016-02-24 | 2017-10-03 | Google Inc. | Methods and systems for detecting and processing speech signals |
US20170294138A1 (en) * | 2016-04-08 | 2017-10-12 | Patricia Kavanagh | Speech Improvement System and Method of Its Use |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
US10091545B1 (en) * | 2016-06-27 | 2018-10-02 | Amazon Technologies, Inc. | Methods and systems for detecting audio output of associated device |
US10438583B2 (en) * | 2016-07-20 | 2019-10-08 | Lenovo (Singapore) Pte. Ltd. | Natural language voice assistant |
US10621992B2 (en) * | 2016-07-22 | 2020-04-14 | Lenovo (Singapore) Pte. Ltd. | Activating voice assistant based on at least one of user proximity and context |
US9972320B2 (en) * | 2016-08-24 | 2018-05-15 | Google Llc | Hotword detection on multiple devices |
KR102241970B1 (ko) | 2016-11-07 | 2021-04-20 | 구글 엘엘씨 | 기록된 미디어 핫워드 트리거 억제 |
US10276149B1 (en) * | 2016-12-21 | 2019-04-30 | Amazon Technologies, Inc. | Dynamic text-to-speech output |
US10559309B2 (en) | 2016-12-22 | 2020-02-11 | Google Llc | Collaborative voice controlled devices |
US10276161B2 (en) * | 2016-12-27 | 2019-04-30 | Google Llc | Contextual hotwords |
US10708313B2 (en) | 2016-12-30 | 2020-07-07 | Google Llc | Multimodal transmission of packetized data |
US10593329B2 (en) * | 2016-12-30 | 2020-03-17 | Google Llc | Multimodal transmission of packetized data |
KR20180083587A (ko) * | 2017-01-13 | 2018-07-23 | 삼성전자주식회사 | 전자 장치 및 그의 동작 방법 |
KR20180085931A (ko) * | 2017-01-20 | 2018-07-30 | 삼성전자주식회사 | 음성 입력 처리 방법 및 이를 지원하는 전자 장치 |
US9990926B1 (en) * | 2017-03-13 | 2018-06-05 | Intel Corporation | Passive enrollment method for speaker identification systems |
US10403276B2 (en) | 2017-03-17 | 2019-09-03 | Microsoft Technology Licensing, Llc | Voice enabled features based on proximity |
US10621980B2 (en) | 2017-03-21 | 2020-04-14 | Harman International Industries, Inc. | Execution of voice commands in a multi-device system |
CN117577099A (zh) | 2017-04-20 | 2024-02-20 | 谷歌有限责任公司 | 设备上的多用户认证的方法、系统和介质 |
DK180048B1 (en) | 2017-05-11 | 2020-02-04 | Apple Inc. | MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION |
US10726832B2 (en) | 2017-05-11 | 2020-07-28 | Apple Inc. | Maintaining privacy of personal information |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770429A1 (en) | 2017-05-12 | 2018-12-14 | Apple Inc. | LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT |
DK201770411A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | MULTI-MODAL INTERFACES |
US20180336275A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Intelligent automated assistant for media exploration |
EP3577646B1 (en) * | 2017-05-16 | 2021-07-21 | Google LLC | Handling calls on a shared speech-enabled device |
US20180336892A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Detecting a trigger of a digital assistant |
US10664533B2 (en) | 2017-05-24 | 2020-05-26 | Lenovo (Singapore) Pte. Ltd. | Systems and methods to determine response cue for digital assistant based on context |
US10395650B2 (en) * | 2017-06-05 | 2019-08-27 | Google Llc | Recorded media hotword trigger suppression |
US10069976B1 (en) * | 2017-06-13 | 2018-09-04 | Harman International Industries, Incorporated | Voice agent forwarding |
US10636428B2 (en) | 2017-06-29 | 2020-04-28 | Microsoft Technology Licensing, Llc | Determining a target device for voice command interaction |
US20190065608A1 (en) * | 2017-08-29 | 2019-02-28 | Lenovo (Singapore) Pte. Ltd. | Query input received at more than one device |
KR102489914B1 (ko) * | 2017-09-15 | 2023-01-20 | 삼성전자주식회사 | 전자 장치 및 이의 제어 방법 |
CN107919119A (zh) * | 2017-11-16 | 2018-04-17 | 百度在线网络技术(北京)有限公司 | 多设备交互协同的方法、装置、设备及计算机可读介质 |
US10276175B1 (en) * | 2017-11-28 | 2019-04-30 | Google Llc | Key phrase detection with audio watermarking |
CN110741338B (zh) | 2017-12-08 | 2023-06-16 | 谷歌有限责任公司 | 使设备与环境中的多个设备隔离以响应口头助理调用 |
US10885910B1 (en) | 2018-03-14 | 2021-01-05 | Amazon Technologies, Inc. | Voice-forward graphical user interface mode management |
US10877637B1 (en) * | 2018-03-14 | 2020-12-29 | Amazon Technologies, Inc. | Voice-based device operation mode management |
US11127405B1 (en) | 2018-03-14 | 2021-09-21 | Amazon Technologies, Inc. | Selective requests for authentication for voice-based launching of applications |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
US11145294B2 (en) | 2018-05-07 | 2021-10-12 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US10928918B2 (en) | 2018-05-07 | 2021-02-23 | Apple Inc. | Raise to speak |
US10692496B2 (en) | 2018-05-22 | 2020-06-23 | Google Llc | Hotword suppression |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
US10892996B2 (en) * | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
DK179822B1 (da) | 2018-06-01 | 2019-07-12 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
GB2574803B (en) * | 2018-06-11 | 2022-12-07 | Xmos Ltd | Communication between audio devices |
EP3807874A1 (en) | 2018-07-13 | 2021-04-21 | Google LLC | End-to-end streaming keyword spotting |
KR20230107386A (ko) * | 2018-08-09 | 2023-07-14 | 구글 엘엘씨 | 핫워드 인식 및 수동 어시스턴스 |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
CN109545207A (zh) * | 2018-11-16 | 2019-03-29 | 广东小天才科技有限公司 | 一种语音唤醒方法及装置 |
CN109243462A (zh) * | 2018-11-20 | 2019-01-18 | 广东小天才科技有限公司 | 一种语音唤醒方法及装置 |
CN109584876B (zh) * | 2018-12-26 | 2020-07-14 | 珠海格力电器股份有限公司 | 语音数据的处理方法、装置和语音空调 |
CN109584878A (zh) * | 2019-01-14 | 2019-04-05 | 广东小天才科技有限公司 | 一种语音唤醒方法及系统 |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
DK201970510A1 (en) | 2019-05-31 | 2021-02-11 | Apple Inc | Voice identification in digital assistant systems |
DK180129B1 (en) | 2019-05-31 | 2020-06-02 | Apple Inc. | USER ACTIVITY SHORTCUT SUGGESTIONS |
US11227599B2 (en) | 2019-06-01 | 2022-01-18 | Apple Inc. | Methods and user interfaces for voice-based control of electronic devices |
CN112712803B (zh) * | 2019-07-15 | 2022-02-25 | 华为技术有限公司 | 一种语音唤醒的方法和电子设备 |
CN110660390B (zh) * | 2019-09-17 | 2022-05-03 | 百度在线网络技术(北京)有限公司 | 智能设备唤醒方法、智能设备及计算机可读存储介质 |
KR102629796B1 (ko) | 2019-10-15 | 2024-01-26 | 삼성전자 주식회사 | 음성 인식의 향상을 지원하는 전자 장치 |
CN110890092B (zh) * | 2019-11-07 | 2022-08-05 | 北京小米移动软件有限公司 | 唤醒控制方法及装置、计算机存储介质 |
KR20210069977A (ko) * | 2019-12-04 | 2021-06-14 | 엘지전자 주식회사 | 기기 제어 방법 및 이를 이용한 제어 가능한 장치 |
CN111312239B (zh) * | 2020-01-20 | 2023-09-26 | 北京小米松果电子有限公司 | 响应方法、装置、电子设备及存储介质 |
US11282527B2 (en) * | 2020-02-28 | 2022-03-22 | Synaptics Incorporated | Subaudible tones to validate audio signals |
US11061543B1 (en) | 2020-05-11 | 2021-07-13 | Apple Inc. | Providing relevant data items based on context |
US11043220B1 (en) | 2020-05-11 | 2021-06-22 | Apple Inc. | Digital assistant hardware abstraction |
US11755276B2 (en) | 2020-05-12 | 2023-09-12 | Apple Inc. | Reducing description length based on confidence |
US11490204B2 (en) | 2020-07-20 | 2022-11-01 | Apple Inc. | Multi-device audio adjustment coordination |
US11438683B2 (en) | 2020-07-21 | 2022-09-06 | Apple Inc. | User identification using headphones |
CN112133302B (zh) * | 2020-08-26 | 2024-05-07 | 北京小米松果电子有限公司 | 预唤醒终端的方法、装置及存储介质 |
KR20220041413A (ko) * | 2020-09-25 | 2022-04-01 | 삼성전자주식회사 | 전자장치 및 그 제어방법 |
US11727925B2 (en) * | 2020-10-13 | 2023-08-15 | Google Llc | Cross-device data synchronization based on simultaneous hotword triggers |
US11557300B2 (en) | 2020-10-16 | 2023-01-17 | Google Llc | Detecting and handling failures in other assistants |
US20210225374A1 (en) * | 2020-12-23 | 2021-07-22 | Intel Corporation | Method and system of environment-sensitive wake-on-voice initiation using ultrasound |
CN114115788A (zh) * | 2021-10-09 | 2022-03-01 | 维沃移动通信有限公司 | 音频播放方法及装置 |
US20230178075A1 (en) * | 2021-12-02 | 2023-06-08 | Lenovo (Singapore) Pte. Ltd | Methods and devices for preventing a sound activated response |
Family Cites Families (56)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4363102A (en) | 1981-03-27 | 1982-12-07 | Bell Telephone Laboratories, Incorporated | Speaker identification system using word recognition templates |
JP3674990B2 (ja) * | 1995-08-21 | 2005-07-27 | セイコーエプソン株式会社 | 音声認識対話装置および音声認識対話処理方法 |
SE511418C2 (sv) | 1997-03-13 | 1999-09-27 | Telia Ab | Metod för talarverifiering/identifiering via modellering av typiska icke-typiska egenskaper. |
US6076055A (en) | 1997-05-27 | 2000-06-13 | Ameritech | Speaker verification method |
US5897616A (en) | 1997-06-11 | 1999-04-27 | International Business Machines Corporation | Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases |
US6141644A (en) | 1998-09-04 | 2000-10-31 | Matsushita Electric Industrial Co., Ltd. | Speaker verification and speaker identification based on eigenvoices |
JP3357629B2 (ja) * | 1999-04-26 | 2002-12-16 | 旭化成株式会社 | 設備制御システム |
DE19939102C1 (de) * | 1999-08-18 | 2000-10-26 | Siemens Ag | Verfahren und Anordnung zum Erkennen von Sprache |
US6567775B1 (en) | 2000-04-26 | 2003-05-20 | International Business Machines Corporation | Fusion of audio and video based speaker identification for multimedia information access |
US6826159B1 (en) | 2000-05-24 | 2004-11-30 | Cisco Technology, Inc. | System and method for providing speaker identification in a conference call |
EP1215658A3 (en) * | 2000-12-05 | 2002-08-14 | Hewlett-Packard Company | Visual activation of voice controlled apparatus |
US20030231746A1 (en) | 2002-06-14 | 2003-12-18 | Hunter Karla Rae | Teleconference speaker identification |
TW200409525A (en) | 2002-11-26 | 2004-06-01 | Lite On Technology Corp | Voice identification method for cellular phone and cellular phone with voiceprint password |
EP1429314A1 (en) * | 2002-12-13 | 2004-06-16 | Sony International (Europe) GmbH | Correction of energy as input feature for speech processing |
US7222072B2 (en) | 2003-02-13 | 2007-05-22 | Sbc Properties, L.P. | Bio-phonetic multi-phrase speaker identity verification |
US8290603B1 (en) | 2004-06-05 | 2012-10-16 | Sonos, Inc. | User interfaces for controlling and manipulating groupings in a multi-zone media system |
US7571014B1 (en) | 2004-04-01 | 2009-08-04 | Sonos, Inc. | Method and apparatus for controlling multimedia players in a multi-zone system |
US20070198262A1 (en) | 2003-08-20 | 2007-08-23 | Mindlin Bernardo G | Topological voiceprints for speaker identification |
US8517921B2 (en) | 2004-04-16 | 2013-08-27 | Gyrus Acmi, Inc. | Endoscopic instrument having reduced diameter flexible shaft |
US8214447B2 (en) | 2004-06-08 | 2012-07-03 | Bose Corporation | Managing an audio network |
US7720012B1 (en) | 2004-07-09 | 2010-05-18 | Arrowhead Center, Inc. | Speaker identification in the presence of packet losses |
US8412521B2 (en) | 2004-08-20 | 2013-04-02 | Multimodal Technologies, Llc | Discriminative training of document transcription system |
US8521529B2 (en) | 2004-10-18 | 2013-08-27 | Creative Technology Ltd | Method for segmenting audio signals |
US8709018B2 (en) | 2005-09-16 | 2014-04-29 | Applied Medical Technology, Inc. | Non-balloon low profile feed device with insertion/removal tool |
KR100711094B1 (ko) * | 2005-11-29 | 2007-04-27 | 삼성전자주식회사 | 분산 통신 환경에서의 이동체들 간의 자원 할당 방법 |
US7741962B2 (en) * | 2006-10-09 | 2010-06-22 | Toyota Motor Engineering & Manufacturing North America, Inc. | Auditory display of vehicular environment |
CN1996847B (zh) | 2006-12-27 | 2010-05-19 | 中国科学院上海技术物理研究所 | 基于协作网格的图像及多媒体数据通信与存储系统 |
US8099288B2 (en) | 2007-02-12 | 2012-01-17 | Microsoft Corp. | Text-dependent speaker verification |
US8838457B2 (en) | 2007-03-07 | 2014-09-16 | Vlingo Corporation | Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility |
US20080252595A1 (en) * | 2007-04-11 | 2008-10-16 | Marc Boillot | Method and Device for Virtual Navigation and Voice Processing |
US8385233B2 (en) | 2007-06-12 | 2013-02-26 | Microsoft Corporation | Active speaker identification |
US8504365B2 (en) | 2008-04-11 | 2013-08-06 | At&T Intellectual Property I, L.P. | System and method for detecting synthetic speaker verification |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US8326637B2 (en) | 2009-02-20 | 2012-12-04 | Voicebox Technologies, Inc. | System and method for processing multi-modal device interactions in a natural language voice services environment |
US8209174B2 (en) | 2009-04-17 | 2012-06-26 | Saudi Arabian Oil Company | Speaker verification system |
CN101923853B (zh) | 2009-06-12 | 2013-01-23 | 华为技术有限公司 | 说话人识别方法、设备和系统 |
US8311838B2 (en) | 2010-01-13 | 2012-11-13 | Apple Inc. | Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts |
US8626511B2 (en) | 2010-01-22 | 2014-01-07 | Google Inc. | Multi-dimensional disambiguation of voice commands |
JP5411789B2 (ja) * | 2010-04-19 | 2014-02-12 | 本田技研工業株式会社 | コミュニケーションロボット |
KR101672212B1 (ko) * | 2010-06-15 | 2016-11-04 | 엘지전자 주식회사 | 휴대 단말기 및 그 동작 방법 |
US8719018B2 (en) | 2010-10-25 | 2014-05-06 | Lockheed Martin Corporation | Biometric speaker identification |
US8340975B1 (en) * | 2011-10-04 | 2012-12-25 | Theodore Alfred Rosenberger | Interactive speech recognition device and system for hands-free building control |
US9031847B2 (en) | 2011-11-15 | 2015-05-12 | Microsoft Technology Licensing, Llc | Voice-controlled camera operations |
US9711160B2 (en) * | 2012-05-29 | 2017-07-18 | Apple Inc. | Smart dock for activating a voice recognition mode of a portable electronic device |
JP6131537B2 (ja) * | 2012-07-04 | 2017-05-24 | セイコーエプソン株式会社 | 音声認識システム、音声認識プログラム、記録媒体及び音声認識方法 |
US8983836B2 (en) | 2012-09-26 | 2015-03-17 | International Business Machines Corporation | Captioning using socially derived acoustic profiles |
US8996372B1 (en) | 2012-10-30 | 2015-03-31 | Amazon Technologies, Inc. | Using adaptation data with cloud-based speech recognition |
EP2941769B1 (en) | 2013-01-04 | 2019-05-08 | Kopin Corporation | Bifurcated speech recognition |
US8775191B1 (en) | 2013-11-13 | 2014-07-08 | Google Inc. | Efficient utterance-specific endpointer triggering for always-on hotwording |
CN103645876B (zh) * | 2013-12-06 | 2017-01-18 | 百度在线网络技术(北京)有限公司 | 语音输入方法和装置 |
CN103730116B (zh) * | 2014-01-07 | 2016-08-17 | 苏州思必驰信息科技有限公司 | 在智能手表上实现智能家居设备控制的系统及其方法 |
US8938394B1 (en) | 2014-01-09 | 2015-01-20 | Google Inc. | Audio triggers based on context |
US9424841B2 (en) | 2014-10-09 | 2016-08-23 | Google Inc. | Hotword detection on multiple devices |
US9812126B2 (en) | 2014-11-28 | 2017-11-07 | Microsoft Technology Licensing, Llc | Device arbitration for listening devices |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
US10679629B2 (en) | 2018-04-09 | 2020-06-09 | Amazon Technologies, Inc. | Device arbitration by multiple speech processing systems |
-
2015
- 2015-03-17 US US14/659,861 patent/US9424841B2/en active Active
- 2015-09-29 KR KR1020167020950A patent/KR101819681B1/ko active IP Right Grant
- 2015-09-29 KR KR1020167026606A patent/KR101819682B1/ko active IP Right Grant
- 2015-09-29 WO PCT/US2015/052870 patent/WO2016057269A1/en active Application Filing
- 2015-09-29 CN CN201580006769.7A patent/CN105960673B/zh active Active
- 2015-09-29 EP EP15784808.6A patent/EP3100260B1/en active Active
- 2015-09-29 CN CN201911273215.XA patent/CN111105784A/zh active Pending
- 2015-09-29 EP EP18213657.2A patent/EP3483877B1/en active Active
- 2015-09-29 EP EP16193577.0A patent/EP3136381B1/en active Active
- 2015-09-29 JP JP2016549271A patent/JP6261751B2/ja active Active
-
2016
- 2016-06-23 US US15/190,739 patent/US9990922B2/en active Active
- 2016-09-07 JP JP2016174371A patent/JP6251343B2/ja active Active
-
2018
- 2018-04-23 US US15/959,508 patent/US10347253B2/en active Active
-
2019
- 2019-06-27 US US16/454,451 patent/US10665239B2/en active Active
-
2020
- 2020-04-28 US US16/860,419 patent/US11024313B2/en active Active
-
2021
- 2021-04-28 US US17/242,738 patent/US11955121B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
US20200258522A1 (en) | 2020-08-13 |
KR101819681B1 (ko) | 2018-01-17 |
KR20160105847A (ko) | 2016-09-07 |
EP3483877A1 (en) | 2019-05-15 |
US20160300571A1 (en) | 2016-10-13 |
EP3100260A1 (en) | 2016-12-07 |
US10665239B2 (en) | 2020-05-26 |
EP3483877B1 (en) | 2021-12-22 |
EP3136381B1 (en) | 2019-11-06 |
WO2016057269A1 (en) | 2016-04-14 |
US20180315424A1 (en) | 2018-11-01 |
US9990922B2 (en) | 2018-06-05 |
CN105960673A (zh) | 2016-09-21 |
JP6261751B2 (ja) | 2018-01-17 |
JP2017513037A (ja) | 2017-05-25 |
US20160104483A1 (en) | 2016-04-14 |
US11955121B2 (en) | 2024-04-09 |
CN111105784A (zh) | 2020-05-05 |
JP2017126317A (ja) | 2017-07-20 |
US9424841B2 (en) | 2016-08-23 |
CN105960673B (zh) | 2019-12-31 |
KR20160121585A (ko) | 2016-10-19 |
US20190385604A1 (en) | 2019-12-19 |
US20210249016A1 (en) | 2021-08-12 |
KR101819682B1 (ko) | 2018-01-17 |
US11024313B2 (en) | 2021-06-01 |
EP3100260B1 (en) | 2018-12-26 |
US10347253B2 (en) | 2019-07-09 |
EP3136381A1 (en) | 2017-03-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6251343B2 (ja) | 複数のデバイスにおけるホットワードの検出 | |
JP6893951B2 (ja) | 複数のデバイス上でのホットワード検出 | |
US20240233727A1 (en) | Hotword detection on multiple devices |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20170807 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20171011 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20171030 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20171124 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 6251343 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |