KR20170080672A - 키 문구 사용자 인식의 증강 - Google Patents
키 문구 사용자 인식의 증강 Download PDFInfo
- Publication number
- KR20170080672A KR20170080672A KR1020177015250A KR20177015250A KR20170080672A KR 20170080672 A KR20170080672 A KR 20170080672A KR 1020177015250 A KR1020177015250 A KR 1020177015250A KR 20177015250 A KR20177015250 A KR 20177015250A KR 20170080672 A KR20170080672 A KR 20170080672A
- Authority
- KR
- South Korea
- Prior art keywords
- data
- user
- probability
- key phrase
- sensor
- Prior art date
Links
- 230000003416 augmentation Effects 0.000 title 1
- 238000000034 method Methods 0.000 claims abstract description 46
- 230000007613 environmental effect Effects 0.000 claims abstract description 38
- 238000012544 monitoring process Methods 0.000 claims abstract description 7
- 238000001514 detection method Methods 0.000 claims description 14
- 230000001965 increasing effect Effects 0.000 claims description 4
- 230000002708 enhancing effect Effects 0.000 abstract description 4
- 230000008447 perception Effects 0.000 abstract description 3
- 230000006399 behavior Effects 0.000 description 13
- 238000004891 communication Methods 0.000 description 9
- 230000009471 action Effects 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 230000000007 visual effect Effects 0.000 description 5
- 238000012545 processing Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000001755 vocal effect Effects 0.000 description 2
- 101000822695 Clostridium perfringens (strain 13 / Type A) Small, acid-soluble spore protein C1 Proteins 0.000 description 1
- 101000655262 Clostridium perfringens (strain 13 / Type A) Small, acid-soluble spore protein C2 Proteins 0.000 description 1
- 125000002066 L-histidyl group Chemical group [H]N1C([H])=NC(C([H])([H])[C@](C(=O)[*])([H])N([H])[H])=C1[H] 0.000 description 1
- 101000655256 Paraclostridium bifermentans Small, acid-soluble spore protein alpha Proteins 0.000 description 1
- 101000655264 Paraclostridium bifermentans Small, acid-soluble spore protein beta Proteins 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 230000007177 brain activity Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/10—Multimodal systems, i.e. based on the integration of multiple recognition engines or fusion of expert systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/22—Interactive procedures; Man-machine interfaces
- G10L17/24—Interactive procedures; Man-machine interfaces the user being prompted to utter a password or a predefined phrase
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/227—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of the speaker; Human-factor methodology
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- Game Theory and Decision Science (AREA)
- Business, Economics & Management (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
도 2는 키 문구 검출을 통한 사용자 인식을 증강시키는 예시적인 방법을 설명하는 흐름도를 나타낸다.
도 3은 예시적인 컴퓨팅 시스템의 블록도를 나타낸다.
Claims (15)
- 방법으로서, 컴퓨팅 디바이스 상에서,
음향 센서를 포함한 하나 이상의 센서로부터 데이터를 수신함으로써 사용 환경을 모니터링하는 단계;
상기 음향 센서로부터 선택된 데이터를 통해 키 문구(key phrase)의 발언(utterance)을 검출하는 단계;
상기 음향 센서로부터의 상기 선택된 데이터뿐만 아니라 상기 음향 센서로부터의 상기 선택된 데이터 외의 상이한 시간에서 수집된 다른 환경 센서 데이터에 기초하여, 상기 키 문구가 식별된 사용자에 의해 발성되었을 확률을 결정하는 단계; 및
상기 확률이 문턱 확률을 충족하거나 초과하면, 상기 컴퓨팅 디바이스 상에 동작을 수행하는 단계
를 포함하는 방법. - 제 1 항에 있어서,
상기 다른 환경 센서 데이터는 음향 센서 데이터를 포함하는 것인 방법. - 제 1 항에 있어서,
상기 다른 환경 센서 데이터는 이미지 데이터를 포함하는 것인 방법. - 제 3 항에 있어서,
상기 이미지 데이터에 기초하여 상기 사용 환경에서 한 명 이상의 사람들을 식별하는 단계를 더 포함하고, 상기 확률을 결정하는 단계는 상기 사용 환경에서 상기 한 명 이상의 사람들의 결정된 신원(identity)에 적어도 부분적으로 기초하여 상기 확률을 결정하는 단계를 포함하는 것인 방법. - 제 1 항에 있어서,
상기 다른 환경 센서는 위치 데이터를 포함하는 것인 방법. - 제 5 항에 있어서,
상기 위치 데이터는 근접 센서로부터의 근접 데이터를 포함하는 것인 방법. - 제 5 항에 있어서,
상기 위치 데이터는 상기 식별된 사용자에 대한 캘린더 정보(calendar information)를 포함하는 것인 방법. - 제 1 항에 있어서,
사용자 행동 패턴을 검출하는 단계를 더 포함하고, 상기 확률을 결정하는 단계는 상기 사용자 행동 패턴에 적어도 부분적으로 기초하여 상기 확률을 결정하는 단계를 포함하는 것인 방법. - 제 8 항에 있어서,
상기 사용자 행동 패턴은 상기 식별된 사용자가 발성하는 빈도에 관한 정보를 포함하는 것인 방법. - 컴퓨팅 시스템에 있어서,
적어도 음향 센서를 포함한 하나 이상의 센서;
로직 머신; 및
상기 로직 머신에 의해 실행가능한 명령어들을 보유하는 저장 머신
을 포함하고,
상기 명령어들은,
상기 음향 센서를 포함한 상기 하나 이상의 센서를 통해 사용 환경을 모니터링하고,
상기 음향 센서로부터 선택된 데이터를 통해 키 문구의 발언을 검출하고,
상기 음향 센서로부터의 상기 선택된 데이터뿐만 아니라 상기 음향 센서로부터의 상기 선택된 데이터 외의 상이한 시간에서 수집된 다른 환경 센서 데이터에 기초하여, 상기 키 문구가 식별된 유저에 의해 발성되었을 확률을 결정하고,
상기 확률이 문턱 확률을 충족하거나 초과하면, 상기 컴퓨팅 디바이스 상에 동작을 수행하도록 실행가능한 것인, 컴퓨팅 시스템. - 제 10 항에 있어서,
상기 다른 환경 센서 데이터는 위치 데이터를 포함하고, 상기 위치 데이터는 근접 센서로부터의 하나 이상의 근접 데이터 및 상기 식별된 사용자에 대한 캘린더 정보를 포함하는 것인 컴퓨팅 시스템. - 제 11 항에 있어서,
상기 명령어들은 또한, 상기 캘린더 정보에 기초하여 상기 키 문구의 발언이 검출된 시간 동안 상기 식별된 사용자가 상기 사용 환경에 있는 것으로 스케줄링되었는지를 결정하고, 상기 식별된 사용자가 상기 사용 환경에 있는 것으로 스케줄링되었다면 상기 식별된 사용자에 의해 상기 키 문구가 발성되었을 확률을 증가시키도록 실행가능한 것인 컴퓨팅 시스템. - 제 10 항에 있어서,
상기 명령어들은 또한, 환경 감지를 통해 검출된 이전의 사용자 행동에 기초하여 사용자 행동 패턴 - 상기 사용자 행동 패턴은 상기 식별된 사용자가 발성한 빈도에 관한 정보를 포함함 - 을 검출하고, 상기 식별된 사용자가 발성하는 평균 빈도에 기초하여 상기 확률을 결정하도록 실행가능한 것인 컴퓨팅 시스템. - 제 10 항에 있어서,
상기 음향 센서로부터의 상기 선택된 데이터 외의 상이한 시간에서 수집된 상기 다른 환경 센서 데이터는 상기 키 문구의 발언 이전 및/또는 이후에 수집된 추가의 음향 데이터를 포함하는 것인 컴퓨팅 시스템. - 제 14 항에 있어서,
상기 키 문구가 상기 식별된 사용자에 의해 발성되었을 확률을 결정하기 위해, 상기 명령어들은 또한,
상기 키 문구의 발언 이전 또는 이후에 상기 식별된 사용자가 또한 발성하였는지를 결정하도록 추가의 음향 데이터를 분석하고,
상기 키 문구의 발언 이전 또는 이후에 상기 식별된 사용자가 또한 발성하였다면 상기 키 문구가 상기 식별된 사용자에 의해 발성되었을 확률을 증가시키고,
상기 분석이 상기 키 문구의 발언 이전 또는 이후에 상기 식별된 사용자가 발성하지 않았다고 나타내면 상기 식별된 사용자에 의해 상기 키 문구가 발성되었을 확률을 감소시키도록 실행가능한 것인 컴퓨팅 시스템.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020227029524A KR102611751B1 (ko) | 2014-11-03 | 2015-11-02 | 키 문구 사용자 인식의 증강 |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462074562P | 2014-11-03 | 2014-11-03 | |
US62/074,562 | 2014-11-03 | ||
US14/827,154 US10262655B2 (en) | 2014-11-03 | 2015-08-14 | Augmentation of key phrase user recognition |
US14/827,154 | 2015-08-14 | ||
PCT/US2015/058538 WO2016073321A1 (en) | 2014-11-03 | 2015-11-02 | Augmentation of key phrase user recognition |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020227029524A Division KR102611751B1 (ko) | 2014-11-03 | 2015-11-02 | 키 문구 사용자 인식의 증강 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20170080672A true KR20170080672A (ko) | 2017-07-10 |
KR102541718B1 KR102541718B1 (ko) | 2023-06-08 |
Family
ID=55853362
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020227029524A KR102611751B1 (ko) | 2014-11-03 | 2015-11-02 | 키 문구 사용자 인식의 증강 |
KR1020177015250A KR102541718B1 (ko) | 2014-11-03 | 2015-11-02 | 키 문구 사용자 인식의 증강 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020227029524A KR102611751B1 (ko) | 2014-11-03 | 2015-11-02 | 키 문구 사용자 인식의 증강 |
Country Status (6)
Country | Link |
---|---|
US (2) | US10262655B2 (ko) |
EP (1) | EP3216024A1 (ko) |
JP (1) | JP2017536568A (ko) |
KR (2) | KR102611751B1 (ko) |
CN (1) | CN107077847B (ko) |
WO (1) | WO2016073321A1 (ko) |
Families Citing this family (47)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20200074428A1 (en) * | 2012-03-30 | 2020-03-05 | Michael Boukadakis | Digital Concierge and Method |
US10255914B2 (en) * | 2012-03-30 | 2019-04-09 | Michael Boukadakis | Digital concierge and method |
CN105049807B (zh) * | 2015-07-31 | 2018-05-18 | 小米科技有限责任公司 | 监控画面声音采集方法及装置 |
JP6806069B2 (ja) * | 2015-09-16 | 2021-01-06 | 日本電気株式会社 | 操作制御装置、操作制御方法及びプログラム |
US11533584B2 (en) * | 2015-09-16 | 2022-12-20 | Ivani, LLC | Blockchain systems and methods for confirming presence |
GB2583988B (en) * | 2016-06-06 | 2021-03-31 | Cirrus Logic Int Semiconductor Ltd | Voice user interface |
US10522134B1 (en) * | 2016-12-22 | 2019-12-31 | Amazon Technologies, Inc. | Speech based user recognition |
KR20180086032A (ko) | 2017-01-20 | 2018-07-30 | 삼성전자주식회사 | 전자장치, 전자장치의 제어방법 및 기록매체 |
JP6838435B2 (ja) | 2017-03-13 | 2021-03-03 | オムロン株式会社 | 環境センサ |
US10438584B2 (en) * | 2017-04-07 | 2019-10-08 | Google Llc | Multi-user virtual assistant for verbal device control |
KR101949497B1 (ko) * | 2017-05-02 | 2019-02-18 | 네이버 주식회사 | 사용자 발화의 표현법을 파악하여 기기의 동작이나 컨텐츠 제공 범위를 조정하여 제공하는 사용자 명령 처리 방법 및 시스템 |
US10628570B2 (en) * | 2017-05-15 | 2020-04-21 | Fmr Llc | Protection of data in a zero user interface environment |
WO2019002831A1 (en) | 2017-06-27 | 2019-01-03 | Cirrus Logic International Semiconductor Limited | REPRODUCTIVE ATTACK DETECTION |
GB2563953A (en) | 2017-06-28 | 2019-01-02 | Cirrus Logic Int Semiconductor Ltd | Detection of replay attack |
GB201713697D0 (en) | 2017-06-28 | 2017-10-11 | Cirrus Logic Int Semiconductor Ltd | Magnetic detection of replay attack |
US10449440B2 (en) * | 2017-06-30 | 2019-10-22 | Electronic Arts Inc. | Interactive voice-controlled companion application for a video game |
GB201801528D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Method, apparatus and systems for biometric processes |
GB201801526D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Methods, apparatus and systems for authentication |
GB201801527D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Method, apparatus and systems for biometric processes |
GB201801530D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Methods, apparatus and systems for authentication |
GB201801532D0 (en) | 2017-07-07 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Methods, apparatus and systems for audio playback |
WO2019021953A1 (ja) * | 2017-07-26 | 2019-01-31 | 日本電気株式会社 | 音声操作装置及びその制御方法 |
CN107507615A (zh) * | 2017-08-29 | 2017-12-22 | 百度在线网络技术(北京)有限公司 | 界面智能交互控制方法、装置、系统及存储介质 |
GB201801661D0 (en) | 2017-10-13 | 2018-03-21 | Cirrus Logic International Uk Ltd | Detection of liveness |
GB201804843D0 (en) | 2017-11-14 | 2018-05-09 | Cirrus Logic Int Semiconductor Ltd | Detection of replay attack |
GB201801663D0 (en) | 2017-10-13 | 2018-03-21 | Cirrus Logic Int Semiconductor Ltd | Detection of liveness |
GB2567503A (en) | 2017-10-13 | 2019-04-17 | Cirrus Logic Int Semiconductor Ltd | Analysing speech signals |
GB201801664D0 (en) | 2017-10-13 | 2018-03-21 | Cirrus Logic Int Semiconductor Ltd | Detection of liveness |
US10403288B2 (en) | 2017-10-17 | 2019-09-03 | Google Llc | Speaker diarization |
CN108305615B (zh) | 2017-10-23 | 2020-06-16 | 腾讯科技(深圳)有限公司 | 一种对象识别方法及其设备、存储介质、终端 |
GB201801659D0 (en) | 2017-11-14 | 2018-03-21 | Cirrus Logic Int Semiconductor Ltd | Detection of loudspeaker playback |
US10482878B2 (en) * | 2017-11-29 | 2019-11-19 | Nuance Communications, Inc. | System and method for speech enhancement in multisource environments |
US11735189B2 (en) | 2018-01-23 | 2023-08-22 | Cirrus Logic, Inc. | Speaker identification |
US11475899B2 (en) * | 2018-01-23 | 2022-10-18 | Cirrus Logic, Inc. | Speaker identification |
US11264037B2 (en) | 2018-01-23 | 2022-03-01 | Cirrus Logic, Inc. | Speaker identification |
US10861462B2 (en) | 2018-03-12 | 2020-12-08 | Cypress Semiconductor Corporation | Dual pipeline architecture for wakeup phrase detection with speech onset detection |
EP3550939A1 (en) * | 2018-04-02 | 2019-10-09 | Signify Holding B.V. | System and methods for augmenting voice commands using connected lighting systems |
US10861453B1 (en) * | 2018-05-01 | 2020-12-08 | Amazon Technologies, Inc. | Resource scheduling with voice controlled devices |
US10692490B2 (en) | 2018-07-31 | 2020-06-23 | Cirrus Logic, Inc. | Detection of replay attack |
US10915614B2 (en) | 2018-08-31 | 2021-02-09 | Cirrus Logic, Inc. | Biometric authentication |
US11037574B2 (en) | 2018-09-05 | 2021-06-15 | Cirrus Logic, Inc. | Speaker recognition and speaker change detection |
US10971160B2 (en) * | 2018-11-13 | 2021-04-06 | Comcast Cable Communications, Llc | Methods and systems for determining a wake word |
RU2744063C1 (ru) * | 2018-12-18 | 2021-03-02 | Общество С Ограниченной Ответственностью "Яндекс" | Способ и система определения говорящего пользователя управляемого голосом устройства |
US11417236B2 (en) * | 2018-12-28 | 2022-08-16 | Intel Corporation | Real-time language learning within a smart space |
US11437043B1 (en) * | 2019-12-12 | 2022-09-06 | Amazon Technologies, Inc. | Presence data determination and utilization |
US11651376B2 (en) * | 2021-07-22 | 2023-05-16 | Bank Of America Corporation | Smart glasses based detection of ATM fraud |
US20240038227A1 (en) * | 2022-07-29 | 2024-02-01 | The Travelers Indemnity Company | Collaborative voice-based design and development system |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110184735A1 (en) * | 2010-01-22 | 2011-07-28 | Microsoft Corporation | Speech recognition analysis via identification information |
US20140249817A1 (en) * | 2013-03-04 | 2014-09-04 | Rawles Llc | Identification using Audio Signatures and Additional Characteristics |
Family Cites Families (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4394538A (en) * | 1981-03-04 | 1983-07-19 | Threshold Technology, Inc. | Speech recognition system and method |
US6952155B2 (en) | 1999-07-23 | 2005-10-04 | Himmelstein Richard B | Voice-controlled security system with proximity detector |
US6347261B1 (en) * | 1999-08-04 | 2002-02-12 | Yamaha Hatsudoki Kabushiki Kaisha | User-machine interface system for enhanced interaction |
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
US7085716B1 (en) | 2000-10-26 | 2006-08-01 | Nuance Communications, Inc. | Speech recognition using word-in-phrase command |
US20060041926A1 (en) * | 2004-04-30 | 2006-02-23 | Vulcan Inc. | Voice control of multimedia content |
US8510215B2 (en) | 2005-04-21 | 2013-08-13 | Victrio, Inc. | Method and system for enrolling a voiceprint in a fraudster database |
US8699944B2 (en) * | 2005-06-10 | 2014-04-15 | The Invention Science Fund I, Llc | Device pairing using device generated sound |
US7822605B2 (en) | 2006-10-19 | 2010-10-26 | Nice Systems Ltd. | Method and apparatus for large population speaker identification in telephone interactions |
US8099288B2 (en) | 2007-02-12 | 2012-01-17 | Microsoft Corp. | Text-dependent speaker verification |
CN101556669A (zh) * | 2008-04-11 | 2009-10-14 | 上海赢思软件技术有限公司 | 利用人机交互技术与用户进行个性化营销的方法和设备 |
JP5349860B2 (ja) * | 2008-08-07 | 2013-11-20 | 株式会社バンダイナムコゲームス | プログラム、情報記憶媒体及びゲーム装置 |
US8229743B2 (en) | 2009-06-23 | 2012-07-24 | Autonomy Corporation Ltd. | Speech recognition system |
US8864581B2 (en) * | 2010-01-29 | 2014-10-21 | Microsoft Corporation | Visual based identitiy tracking |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US9800716B2 (en) * | 2010-09-21 | 2017-10-24 | Cellepathy Inc. | Restricting mobile device usage |
CN102332265B (zh) * | 2011-06-20 | 2014-04-16 | 浙江吉利汽车研究院有限公司 | 一种提高汽车声控系统语音识别率的方法 |
US9922256B2 (en) * | 2011-06-30 | 2018-03-20 | Yale University | Subject sensing in an environment |
US9159324B2 (en) | 2011-07-01 | 2015-10-13 | Qualcomm Incorporated | Identifying people that are proximate to a mobile device user via social graphs, speech models, and user context |
CN103186227A (zh) * | 2011-12-28 | 2013-07-03 | 北京德信互动网络技术有限公司 | 人机互动系统和方法 |
US9042867B2 (en) | 2012-02-24 | 2015-05-26 | Agnitio S.L. | System and method for speaker recognition on mobile devices |
US9256457B1 (en) * | 2012-03-28 | 2016-02-09 | Google Inc. | Interactive response system for hosted services |
US8863307B2 (en) * | 2012-06-05 | 2014-10-14 | Broadcom Corporation | Authenticating users based upon an identity footprint |
US9275637B1 (en) * | 2012-11-06 | 2016-03-01 | Amazon Technologies, Inc. | Wake word evaluation |
US9319221B1 (en) * | 2013-05-20 | 2016-04-19 | Amazon Technologies, Inc. | Controlling access based on recognition of a user |
US9558749B1 (en) | 2013-08-01 | 2017-01-31 | Amazon Technologies, Inc. | Automatic speaker identification using speech recognition features |
US8719039B1 (en) * | 2013-12-05 | 2014-05-06 | Google Inc. | Promoting voice actions to hotwords |
EP2911149B1 (en) * | 2014-02-19 | 2019-04-17 | Nokia Technologies OY | Determination of an operational directive based at least in part on a spatial audio property |
US9286892B2 (en) * | 2014-04-01 | 2016-03-15 | Google Inc. | Language modeling in speech recognition |
-
2015
- 2015-08-14 US US14/827,154 patent/US10262655B2/en active Active
- 2015-11-02 KR KR1020227029524A patent/KR102611751B1/ko active IP Right Grant
- 2015-11-02 WO PCT/US2015/058538 patent/WO2016073321A1/en active Application Filing
- 2015-11-02 EP EP15797507.9A patent/EP3216024A1/en not_active Withdrawn
- 2015-11-02 JP JP2017519693A patent/JP2017536568A/ja active Pending
- 2015-11-02 KR KR1020177015250A patent/KR102541718B1/ko active IP Right Grant
- 2015-11-02 CN CN201580059714.2A patent/CN107077847B/zh active Active
-
2019
- 2019-04-09 US US16/378,944 patent/US11270695B2/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110184735A1 (en) * | 2010-01-22 | 2011-07-28 | Microsoft Corporation | Speech recognition analysis via identification information |
US20140249817A1 (en) * | 2013-03-04 | 2014-09-04 | Rawles Llc | Identification using Audio Signatures and Additional Characteristics |
Also Published As
Publication number | Publication date |
---|---|
CN107077847A (zh) | 2017-08-18 |
KR20220123153A (ko) | 2022-09-05 |
CN107077847B (zh) | 2020-11-10 |
US20160125879A1 (en) | 2016-05-05 |
US10262655B2 (en) | 2019-04-16 |
EP3216024A1 (en) | 2017-09-13 |
WO2016073321A1 (en) | 2016-05-12 |
US20190237076A1 (en) | 2019-08-01 |
KR102611751B1 (ko) | 2023-12-07 |
US11270695B2 (en) | 2022-03-08 |
KR102541718B1 (ko) | 2023-06-08 |
JP2017536568A (ja) | 2017-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102541718B1 (ko) | 키 문구 사용자 인식의 증강 | |
US11238871B2 (en) | Electronic device and control method thereof | |
US10438595B2 (en) | Speaker identification and unsupervised speaker adaptation techniques | |
CN106415719B (zh) | 使用说话者识别的语音信号的稳健端点指示 | |
JP6803351B2 (ja) | マン・マシン・ダイアログにおけるエージェント係属の管理 | |
KR102513297B1 (ko) | 전자 장치 및 전자 장치의 기능 실행 방법 | |
US11699442B2 (en) | Methods and systems for speech detection | |
KR101726945B1 (ko) | 수동 시작/종료 포인팅 및 트리거 구문들에 대한 필요성의 저감 | |
JP6819672B2 (ja) | 情報処理装置、情報処理方法、及びプログラム | |
JP2021533397A (ja) | 話者埋め込みと訓練された生成モデルとを使用する話者ダイアライゼーション | |
TW201905675A (zh) | 資料更新方法、客戶端及電子設備 | |
EP4139816B1 (en) | Voice shortcut detection with speaker verification | |
KR20210008089A (ko) | 자동화된 어시스턴트를 호출하기 위한 다이내믹 및/또는 컨텍스트 특정 핫워드 | |
US11721338B2 (en) | Context-based dynamic tolerance of virtual assistant | |
CN109032345B (zh) | 设备控制方法、装置、设备、服务端和存储介质 | |
US20200349947A1 (en) | Method for responding to user utterance and electronic device for supporting same | |
US11817097B2 (en) | Electronic apparatus and assistant service providing method thereof | |
KR102563817B1 (ko) | 사용자 음성 입력 처리 방법 및 이를 지원하는 전자 장치 | |
WO2019026617A1 (ja) | 情報処理装置、及び情報処理方法 | |
KR20210006419A (ko) | 건강 관련 정보 생성 및 저장 | |
KR20230147157A (ko) | 어시스턴트 명령(들)의 컨텍스트적 억제 | |
JP2018171683A (ja) | ロボットの制御プログラム、ロボット装置、及びロボットの制御方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PA0105 | International application |
Patent event date: 20170602 Patent event code: PA01051R01D Comment text: International Patent Application |
|
PG1501 | Laying open of application | ||
A201 | Request for examination | ||
PA0201 | Request for examination |
Patent event code: PA02012R01D Patent event date: 20201005 Comment text: Request for Examination of Application |
|
E902 | Notification of reason for refusal | ||
PE0902 | Notice of grounds for rejection |
Comment text: Notification of reason for refusal Patent event date: 20211026 Patent event code: PE09021S01D |
|
E601 | Decision to refuse application | ||
PE0601 | Decision on rejection of patent |
Patent event date: 20220428 Comment text: Decision to Refuse Application Patent event code: PE06012S01D Patent event date: 20211026 Comment text: Notification of reason for refusal Patent event code: PE06011S01I |
|
J201 | Request for trial against refusal decision | ||
PA0104 | Divisional application for international application |
Comment text: Divisional Application for International Patent Patent event code: PA01041R01D Patent event date: 20220825 |
|
PJ0201 | Trial against decision of rejection |
Patent event date: 20220825 Comment text: Request for Trial against Decision on Refusal Patent event code: PJ02012R01D Patent event date: 20220428 Comment text: Decision to Refuse Application Patent event code: PJ02011S01I Appeal kind category: Appeal against decision to decline refusal Appeal identifier: 2022101001536 Request date: 20220825 |
|
J301 | Trial decision |
Free format text: TRIAL NUMBER: 2022101001536; TRIAL DECISION FOR APPEAL AGAINST DECISION TO DECLINE REFUSAL REQUESTED 20220825 Effective date: 20230223 |
|
PJ1301 | Trial decision |
Patent event code: PJ13011S01D Patent event date: 20230223 Comment text: Trial Decision on Objection to Decision on Refusal Appeal kind category: Appeal against decision to decline refusal Request date: 20220825 Decision date: 20230223 Appeal identifier: 2022101001536 |
|
PS0901 | Examination by remand of revocation | ||
GRNO | Decision to grant (after opposition) | ||
PS0701 | Decision of registration after remand of revocation |
Patent event date: 20230309 Patent event code: PS07012S01D Comment text: Decision to Grant Registration Patent event date: 20230223 Patent event code: PS07011S01I Comment text: Notice of Trial Decision (Remand of Revocation) |
|
GRNT | Written decision to grant | ||
PR0701 | Registration of establishment |
Comment text: Registration of Establishment Patent event date: 20230605 Patent event code: PR07011E01D |
|
PR1002 | Payment of registration fee |
Payment date: 20230605 End annual number: 3 Start annual number: 1 |
|
PG1601 | Publication of registration |