ES2754448T3 - Controlling electronic device based on direction of speech
- Publication number
- ES2754448T3
- Authority
- ES
- Spain
- Prior art keywords
- speech
- electronic device
- voice command
- user
- determining
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2250/00—Details of telephonic subscriber devices
- H04M2250/74—Details of telephonic subscriber devices with voice recognition means
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Navigation (AREA)
- Telephone Function (AREA)
- User Interface Of Digital Computer (AREA)
- Telephonic Communication Services (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/671,858 US9911416B2 (en) | 2015-03-27 | 2015-03-27 | Controlling electronic device based on direction of speech |
| PCT/US2016/016649 WO2016160123A1 (en) | 2015-03-27 | 2016-02-04 | Controlling electronic device based on direction of speech |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ES2754448T3 (es) | 2020-04-17 |
Family
ID=55404841
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| ES16705671T (Active), ES2754448T3 (es) | 2015-03-27 | 2016-02-04 | Controlling electronic device based on direction of speech |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US9911416B2 |
| EP (1) | EP3274988B1 |
| JP (1) | JP2018512619A |
| KR (1) | KR101981878B1 |
| CN (1) | CN107408386B |
| ES (1) | ES2754448T3 |
| HU (1) | HUE047117T2 |
| WO (1) | WO2016160123A1 |
Families Citing this family (51)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106125048B (zh) | 2016-07-11 | 2019-05-24 | 浙江大华技术股份有限公司 | Sound source localization method and device |
| US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
| US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
| EP2911149B1 (en) * | 2014-02-19 | 2019-04-17 | Nokia Technologies OY | Determination of an operational directive based at least in part on a spatial audio property |
| US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
| US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
| US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
| US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
| WO2016208789A1 (ko) * | 2015-06-26 | 2016-12-29 | 삼성전자 주식회사 | Method for discriminating sound and apparatus therefor |
| US10331312B2 (en) | 2015-09-08 | 2019-06-25 | Apple Inc. | Intelligent automated assistant in a media environment |
| US11587559B2 (en) * | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
| KR102427833B1 (ko) * | 2015-11-30 | 2022-08-02 | 삼성전자주식회사 | User terminal device and display method |
| EP3414759B1 (en) * | 2016-02-10 | 2020-07-01 | Cerence Operating Company | Techniques for spatially selective wake-up word recognition and related systems and methods |
| US9911417B2 (en) * | 2016-04-01 | 2018-03-06 | Tai-An Lu | Internet of things system with voice-controlled functions and method for processing information of the same |
| US12197817B2 (en) | 2016-06-11 | 2025-01-14 | Apple Inc. | Intelligent device arbitration and control |
| US10147423B2 (en) * | 2016-09-29 | 2018-12-04 | Intel IP Corporation | Context-aware query recognition for electronic devices |
| US9642225B1 (en) * | 2016-10-20 | 2017-05-02 | Kai-kong Ng | Voice-controlled lighting control system |
| KR101893768B1 (ko) * | 2017-02-27 | 2018-09-04 | 주식회사 브이터치 | Method, system, and non-transitory computer-readable recording medium for providing a voice recognition trigger |
| US12444433B2 (en) * | 2017-02-27 | 2025-10-14 | VTouch Co., Ltd. | Method and system for providing voice recognition trigger and non-transitory computer-readable recording medium |
| US10403276B2 (en) | 2017-03-17 | 2019-09-03 | Microsoft Technology Licensing, Llc | Voice enabled features based on proximity |
| KR102471493B1 (ko) * | 2017-10-17 | 2022-11-29 | 삼성전자주식회사 | Electronic device and voice recognition method |
| TWM562433U (zh) * | 2018-01-05 | 2018-06-21 | Thermaltake Technology Co Ltd | Voice-controlled input system |
| US11150869B2 (en) | 2018-02-14 | 2021-10-19 | International Business Machines Corporation | Voice command filtering |
| US11238856B2 (en) | 2018-05-01 | 2022-02-01 | International Business Machines Corporation | Ignoring trigger words in streamed media content |
| US11200890B2 (en) | 2018-05-01 | 2021-12-14 | International Business Machines Corporation | Distinguishing voice commands |
| AU2019279597B2 (en) * | 2018-06-01 | 2021-11-18 | Apple Inc. | Providing audio information with a digital assistant |
| DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
| CN112513983B (zh) | 2018-06-21 | 2024-12-17 | 奇跃公司 | Wearable system speech processing |
| CN108922528B (zh) * | 2018-06-29 | 2020-10-23 | 百度在线网络技术(北京)有限公司 | Method and apparatus for processing speech |
| US11062703B2 (en) | 2018-08-21 | 2021-07-13 | Intel Corporation | Automatic speech recognition with filler model processing |
| NO20181210A1 (en) | 2018-08-31 | 2020-03-02 | Elliptic Laboratories As | Voice assistant |
| CN109391528A (zh) | 2018-08-31 | 2019-02-26 | 百度在线网络技术(北京)有限公司 | Wake-up method, apparatus, device, and storage medium for a voice smart device |
| US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
| CN109831709B (zh) * | 2019-02-15 | 2020-10-09 | 杭州嘉楠耘智信息科技有限公司 | Sound source orientation method and apparatus, and computer-readable storage medium |
| WO2020180719A1 (en) | 2019-03-01 | 2020-09-10 | Magic Leap, Inc. | Determining input for speech processing engine |
| US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
| JP7560480B2 (ja) | 2019-04-19 | 2024-10-02 | マジック リープ, インコーポレイテッド | Identifying input for a speech recognition engine |
| KR102245953B1 (ko) | 2019-06-05 | 2021-04-28 | 엘지전자 주식회사 | Method for controlling a plurality of electronic devices |
| CN110459213A (zh) * | 2019-06-28 | 2019-11-15 | 浙江想能睡眠科技股份有限公司 | Voice-control-based smart mattress and control method thereof |
| US11328740B2 (en) | 2019-08-07 | 2022-05-10 | Magic Leap, Inc. | Voice onset detection |
| US11355108B2 (en) | 2019-08-20 | 2022-06-07 | International Business Machines Corporation | Distinguishing voice commands |
| US11205433B2 (en) * | 2019-08-21 | 2021-12-21 | Qualcomm Incorporated | Method and apparatus for activating speech recognition |
| KR102329353B1 (ko) * | 2020-03-17 | 2021-11-22 | 성균관대학교산학협력단 | Method and apparatus for inferring the direction of speech generation using a deep neural network |
| US11917384B2 (en) | 2020-03-27 | 2024-02-27 | Magic Leap, Inc. | Method of waking a device using spoken voice commands |
| US12301635B2 (en) | 2020-05-11 | 2025-05-13 | Apple Inc. | Digital assistant hardware abstraction |
| US12417766B2 (en) | 2020-09-30 | 2025-09-16 | Magic Leap, Inc. | Voice user interface using non-linguistic input |
| US11778370B2 (en) * | 2020-12-07 | 2023-10-03 | Gulfstream Aerospace Corporation | Microphone array onboard aircraft to determine crew/passenger location and to steer a transducer beam pattern to that location |
| US11955137B2 (en) | 2021-03-11 | 2024-04-09 | Apple Inc. | Continuous dialog with a digital assistant |
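| CN115083402B (zh) * | 2021-03-15 | 2025-08-22 | Oppo广东移动通信有限公司 | Method, apparatus, terminal, and storage medium for responding to control voice |
| CN115086096A (zh) * | 2021-03-15 | 2022-09-20 | Oppo广东移动通信有限公司 | Method, apparatus, device, and storage medium for responding to control voice |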
| US12266354B2 (en) * | 2021-07-15 | 2025-04-01 | Apple Inc. | Speech interpretation based on environmental context |
Family Cites Families (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE19956747C1 (de) | 1999-11-25 | 2001-01-11 | Siemens Ag | Method and device for speech recognition, and a telecommunication system |
| US6219645B1 (en) * | 1999-12-02 | 2001-04-17 | Lucent Technologies, Inc. | Enhanced automatic speech recognition using multiple directional microphones |
| DE10133126A1 (de) | 2001-07-07 | 2003-01-16 | Philips Corp Intellectual Pty | Direction-sensitive audio recording system with display of the recording area and/or interference source |
| WO2006059806A1 (ja) * | 2004-12-03 | 2006-06-08 | Honda Motor Co., Ltd. | Speech recognition device |
| JP4873913B2 (ja) | 2004-12-17 | 2012-02-08 | 学校法人早稲田大学 | Sound source separation system, sound source separation method, and acoustic signal acquisition device |
| EP1699261B1 (en) * | 2005-03-01 | 2011-05-25 | Oticon A/S | System and method for determining directionality of sound detected by a hearing aid |
| EP2237271B1 (en) | 2009-03-31 | 2021-01-20 | Cerence Operating Company | Method for determining a signal component for reducing noise in an input signal |
| US8588441B2 (en) * | 2010-01-29 | 2013-11-19 | Phonak Ag | Method for adaptively matching microphones of a hearing system as well as a hearing system |
| US9053697B2 (en) * | 2010-06-01 | 2015-06-09 | Qualcomm Incorporated | Systems, methods, devices, apparatus, and computer program products for audio equalization |
| JP5079934B2 (ja) * | 2011-01-18 | 2012-11-21 | パナソニック株式会社 | Vehicle direction identification device, vehicle direction identification method, and program therefor |
| US20120259638A1 (en) * | 2011-04-08 | 2012-10-11 | Sony Computer Entertainment Inc. | Apparatus and method for determining relevance of input speech |
| US20130204629A1 (en) | 2012-02-08 | 2013-08-08 | Panasonic Corporation | Voice input device and display device |
| US20130238326A1 (en) | 2012-03-08 | 2013-09-12 | Lg Electronics Inc. | Apparatus and method for multiple device voice control |
| KR101946364B1 (ko) * | 2012-05-01 | 2019-02-11 | 엘지전자 주식회사 | Mobile device having at least one microphone sensor and control method thereof |
| US9251787B1 (en) * | 2012-09-26 | 2016-02-02 | Amazon Technologies, Inc. | Altering audio to improve automatic speech recognition |
| WO2014087495A1 (ja) | 2012-12-05 | 2014-06-12 | 株式会社日立製作所 | Voice dialogue robot, voice dialogue robot system |
| US9525938B2 (en) * | 2013-02-06 | 2016-12-20 | Apple Inc. | User voice location estimation for adjusting portable device beamforming settings |
| US20140244267A1 (en) * | 2013-02-26 | 2014-08-28 | Avaya Inc. | Integration of user orientation into a voice command system |
| US9384751B2 (en) | 2013-05-06 | 2016-07-05 | Honeywell International Inc. | User authentication of voice controlled devices |
| EP2911149B1 (en) | 2014-02-19 | 2019-04-17 | Nokia Technologies OY | Determination of an operational directive based at least in part on a spatial audio property |
| EP2928210A1 (en) * | 2014-04-03 | 2015-10-07 | Oticon A/s | A binaural hearing assistance system comprising binaural noise reduction |
-
2015
- 2015-03-27 US: US14/671,858, patent US9911416B2 (en), Active
-
2016
- 2016-02-04 JP: JP2017549296A, patent JP2018512619A (ja), not active (Ceased)
- 2016-02-04 HU: HUE16705671A, patent HUE047117T2 (hu), status unknown
- 2016-02-04 WO: PCT/US2016/016649, patent WO2016160123A1 (en), not active (Ceased)
- 2016-02-04 ES: ES16705671T, patent ES2754448T3 (es), Active
- 2016-02-04 CN: CN201680014289.XA, patent CN107408386B (zh), Active
- 2016-02-04 EP: EP16705671.2A, patent EP3274988B1 (en), Active
- 2016-02-04 KR: KR1020177027318A, patent KR101981878B1 (ko), Active
Also Published As
| Publication number | Publication date |
|---|---|
| EP3274988A1 (en) | 2018-01-31 |
| HUE047117T2 (hu) | 2020-04-28 |
| CN107408386B (zh) | 2018-11-23 |
| JP2018512619A (ja) | 2018-05-17 |
| US20160284350A1 (en) | 2016-09-29 |
| EP3274988B1 (en) | 2019-08-07 |
| WO2016160123A1 (en) | 2016-10-06 |
| KR20170131465A (ko) | 2017-11-29 |
| US9911416B2 (en) | 2018-03-06 |
| KR101981878B1 (ko) | 2019-05-23 |
| CN107408386A (zh) | 2017-11-28 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| ES2754448T3 (es) | Controlling electronic device based on direction of speech | |
| US10657967B2 (en) | Method and apparatus for executing voice command in electronic device | |
| US20220093108A1 (en) | Speaker identification | |
| JP6630765B2 (ja) | Individualized hotword detection models | |
| CN106233376B (zh) | Method and apparatus for activating an application by speech input | |
| ES2842181T3 (es) | Generating notifications based on context data in response to a phrase spoken by a user | |
| ES2817841T3 (es) | Method and apparatus for adjusting a detection threshold for activating a voice assistant function | |
| US9268399B2 (en) | Adaptive sensor sampling for power efficient context aware inferences | |
| KR102018152B1 (ko) | Location-aware power management scheme for an always-on, always-listening voice recognition system | |
| US9620116B2 (en) | Performing automated voice operations based on sensor data reflecting sound vibration conditions and motion conditions | |
| US10733989B2 (en) | Proximity based voice activation | |
| EP3483876A1 (en) | Initiating actions based on partial hotwords | |
| US9867012B2 (en) | Whispered speech detection | |
| CN105960628A (zh) | Dynamic threshold for speaker verification | |
| CN104464737B (zh) | Voice verification system and voice verification method | |
| EP4632733A1 (en) | Voice interaction method and related device | |
| US9633655B1 (en) | Voice sensing and keyword analysis | |
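| JP2020512592A (ja) | Device and method | |
| KR102051011B1 (ko) | Server and control method for controlling a learning-based voice recognition terminal | |
| CN111344781A (zh) | Audio processing | |
| TW202240573A (zh) | Device finder using voice authentication | |
| WO2017219925A1 (zh) | Information sending method and apparatus, and computer storage medium |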