JP2018512619A - 発話の方向に基づく電子デバイスの制御 - Google Patents

発話の方向に基づく電子デバイスの制御 Download PDF

Info

Publication number
JP2018512619A
JP2018512619A JP2017549296A JP2017549296A JP2018512619A JP 2018512619 A JP2018512619 A JP 2018512619A JP 2017549296 A JP2017549296 A JP 2017549296A JP 2017549296 A JP2017549296 A JP 2017549296A JP 2018512619 A JP2018512619 A JP 2018512619A
Authority
JP
Japan
Prior art keywords
utterance
electronic device
frequency range
determining
characteristic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
JP2017549296A
Other languages
English (en)
Japanese (ja)
Other versions
JP2018512619A5 (enExample
Inventor
サンラック・ユン
テス・キム
ダク・フン・キム
キュウン・ファン
Original Assignee
クアルコム,インコーポレイテッド
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by クアルコム,インコーポレイテッド filed Critical クアルコム,インコーポレイテッド
Publication of JP2018512619A publication Critical patent/JP2018512619A/ja
Publication of JP2018512619A5 publication Critical patent/JP2018512619A5/ja
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2250/00Details of telephonic subscriber devices
    • H04M2250/74Details of telephonic subscriber devices with voice recognition means

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Navigation (AREA)
  • Telephone Function (AREA)
  • User Interface Of Digital Computer (AREA)
  • Telephonic Communication Services (AREA)
JP2017549296A 2015-03-27 2016-02-04 発話の方向に基づく電子デバイスの制御 Ceased JP2018512619A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US14/671,858 US9911416B2 (en) 2015-03-27 2015-03-27 Controlling electronic device based on direction of speech
US14/671,858 2015-03-27
PCT/US2016/016649 WO2016160123A1 (en) 2015-03-27 2016-02-04 Controlling electronic device based on direction of speech

Publications (2)

Publication Number Publication Date
JP2018512619A true JP2018512619A (ja) 2018-05-17
JP2018512619A5 JP2018512619A5 (enExample) 2018-06-28

Family

ID=55404841

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2017549296A Ceased JP2018512619A (ja) 2015-03-27 2016-02-04 発話の方向に基づく電子デバイスの制御

Country Status (8)

Country Link
US (1) US9911416B2 (enExample)
EP (1) EP3274988B1 (enExample)
JP (1) JP2018512619A (enExample)
KR (1) KR101981878B1 (enExample)
CN (1) CN107408386B (enExample)
ES (1) ES2754448T3 (enExample)
HU (1) HUE047117T2 (enExample)
WO (1) WO2016160123A1 (enExample)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019204103A (ja) * 2018-08-31 2019-11-28 百度在線網絡技術(北京)有限公司 音声スマート機器のウェイクアップ方法、音声スマート機器のウェイクアップ装置、スマート機器及びコンピュータ読み取り可能な記憶媒体
JP2020003774A (ja) * 2018-06-29 2020-01-09 バイドゥ オンライン ネットワーク テクノロジー (ベイジン) カンパニー リミテッド 音声を処理する方法及び装置
KR20210116066A (ko) * 2020-03-17 2021-09-27 성균관대학교산학협력단 심층 신경망을 이용한 음성 발생 방향 추론 방법 및 그 장치
JP2022522748A (ja) * 2019-03-01 2022-04-20 マジック リープ, インコーポレイテッド 発話処理エンジンのための入力の決定
US20220182756A1 (en) * 2020-12-07 2022-06-09 Gulfstream Aerospace Corporation Microphone array onboard aircraft to determine crew/passenger location and to steer a transducer beam pattern to that location

Families Citing this family (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106125048B (zh) 2016-07-11 2019-05-24 浙江大华技术股份有限公司 一种声源定位方法及装置
US8977255B2 (en) 2007-04-03 2015-03-10 Apple Inc. Method and system for operating a multi-function portable electronic device using voice-activation
US8676904B2 (en) 2008-10-02 2014-03-18 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
EP2911149B1 (en) * 2014-02-19 2019-04-17 Nokia Technologies OY Determination of an operational directive based at least in part on a spatial audio property
US9715875B2 (en) 2014-05-30 2017-07-25 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US9338493B2 (en) 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
US9886953B2 (en) 2015-03-08 2018-02-06 Apple Inc. Virtual assistant activation
US10460227B2 (en) 2015-05-15 2019-10-29 Apple Inc. Virtual assistant in a communication session
WO2016208789A1 (ko) * 2015-06-26 2016-12-29 삼성전자 주식회사 소리를 판별하는 방법 및 이를 위한 장치
US10331312B2 (en) 2015-09-08 2019-06-25 Apple Inc. Intelligent automated assistant in a media environment
US11587559B2 (en) * 2015-09-30 2023-02-21 Apple Inc. Intelligent device identification
KR102427833B1 (ko) * 2015-11-30 2022-08-02 삼성전자주식회사 사용자 단말장치 및 디스플레이 방법
EP3414759B1 (en) * 2016-02-10 2020-07-01 Cerence Operating Company Techniques for spatially selective wake-up word recognition and related systems and methods
US9911417B2 (en) * 2016-04-01 2018-03-06 Tai-An Lu Internet of things system with voice-controlled functions and method for processing information of the same
US12197817B2 (en) 2016-06-11 2025-01-14 Apple Inc. Intelligent device arbitration and control
US10147423B2 (en) * 2016-09-29 2018-12-04 Intel IP Corporation Context-aware query recognition for electronic devices
US9642225B1 (en) * 2016-10-20 2017-05-02 Kai-kong Ng Voice-controlled lighting control system
KR101893768B1 (ko) * 2017-02-27 2018-09-04 주식회사 브이터치 음성 인식 트리거를 제공하기 위한 방법, 시스템 및 비일시성의 컴퓨터 판독 가능한 기록 매체
US12444433B2 (en) * 2017-02-27 2025-10-14 VTouch Co., Ltd. Method and system for providing voice recognition trigger and non-transitory computer-readable recording medium
US10403276B2 (en) 2017-03-17 2019-09-03 Microsoft Technology Licensing, Llc Voice enabled features based on proximity
KR102471493B1 (ko) * 2017-10-17 2022-11-29 삼성전자주식회사 전자 장치 및 음성 인식 방법
TWM562433U (zh) * 2018-01-05 2018-06-21 Thermaltake Technology Co Ltd 聲控輸入系統
US11150869B2 (en) 2018-02-14 2021-10-19 International Business Machines Corporation Voice command filtering
US11238856B2 (en) 2018-05-01 2022-02-01 International Business Machines Corporation Ignoring trigger words in streamed media content
US11200890B2 (en) 2018-05-01 2021-12-14 International Business Machines Corporation Distinguishing voice commands
AU2019279597B2 (en) * 2018-06-01 2021-11-18 Apple Inc. Providing audio information with a digital assistant
DK180639B1 (en) 2018-06-01 2021-11-04 Apple Inc DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT
CN112513983B (zh) 2018-06-21 2024-12-17 奇跃公司 可穿戴系统语音处理
US11062703B2 (en) 2018-08-21 2021-07-13 Intel Corporation Automatic speech recognition with filler model processing
NO20181210A1 (en) 2018-08-31 2020-03-02 Elliptic Laboratories As Voice assistant
US11462215B2 (en) 2018-09-28 2022-10-04 Apple Inc. Multi-modal inputs for voice commands
CN109831709B (zh) * 2019-02-15 2020-10-09 杭州嘉楠耘智信息科技有限公司 音源定向方法及装置和计算机可读存储介质
US11348573B2 (en) 2019-03-18 2022-05-31 Apple Inc. Multimodality in digital assistant systems
JP7560480B2 (ja) 2019-04-19 2024-10-02 マジック リープ, インコーポレイテッド 発話認識エンジンのための入力の識別
KR102245953B1 (ko) 2019-06-05 2021-04-28 엘지전자 주식회사 복수의 전자기기의 제어방법
CN110459213A (zh) * 2019-06-28 2019-11-15 浙江想能睡眠科技股份有限公司 基于语音控制的智能床垫及其控制方法
US11328740B2 (en) 2019-08-07 2022-05-10 Magic Leap, Inc. Voice onset detection
US11355108B2 (en) 2019-08-20 2022-06-07 International Business Machines Corporation Distinguishing voice commands
US11205433B2 (en) * 2019-08-21 2021-12-21 Qualcomm Incorporated Method and apparatus for activating speech recognition
US11917384B2 (en) 2020-03-27 2024-02-27 Magic Leap, Inc. Method of waking a device using spoken voice commands
US12301635B2 (en) 2020-05-11 2025-05-13 Apple Inc. Digital assistant hardware abstraction
US12417766B2 (en) 2020-09-30 2025-09-16 Magic Leap, Inc. Voice user interface using non-linguistic input
US11955137B2 (en) 2021-03-11 2024-04-09 Apple Inc. Continuous dialog with a digital assistant
CN115083402B (zh) * 2021-03-15 2025-08-22 Oppo广东移动通信有限公司 响应控制语音的方法、装置、终端及存储介质
CN115086096A (zh) * 2021-03-15 2022-09-20 Oppo广东移动通信有限公司 响应控制语音的方法、装置、设备及存储介质
US12266354B2 (en) * 2021-07-15 2025-04-01 Apple Inc. Speech interpretation based on environmental context

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6219645B1 (en) * 1999-12-02 2001-04-17 Lucent Technologies, Inc. Enhanced automatic speech recognition using multiple directional microphones
US7167544B1 (en) * 1999-11-25 2007-01-23 Siemens Aktiengesellschaft Telecommunication system with error messages corresponding to speech recognition errors
JP2012220959A (ja) * 2011-04-08 2012-11-12 Sony Computer Entertainment Inc 入力された発話の関連性を判定するための装置および方法
US20140244267A1 (en) * 2013-02-26 2014-08-28 Avaya Inc. Integration of user orientation into a voice command system

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE10133126A1 (de) 2001-07-07 2003-01-16 Philips Corp Intellectual Pty Richtungssensitives Audioaufnahmesystem mit Anzeige von Aufnahmegebiet und/oder Störquelle
WO2006059806A1 (ja) * 2004-12-03 2006-06-08 Honda Motor Co., Ltd. 音声認識装置
JP4873913B2 (ja) 2004-12-17 2012-02-08 学校法人早稲田大学 音源分離システムおよび音源分離方法、並びに音響信号取得装置
EP1699261B1 (en) * 2005-03-01 2011-05-25 Oticon A/S System and method for determining directionality of sound detected by a hearing aid
EP2237271B1 (en) 2009-03-31 2021-01-20 Cerence Operating Company Method for determining a signal component for reducing noise in an input signal
US8588441B2 (en) * 2010-01-29 2013-11-19 Phonak Ag Method for adaptively matching microphones of a hearing system as well as a hearing system
US9053697B2 (en) * 2010-06-01 2015-06-09 Qualcomm Incorporated Systems, methods, devices, apparatus, and computer program products for audio equalization
JP5079934B2 (ja) * 2011-01-18 2012-11-21 パナソニック株式会社 車両方向特定装置、車両方向特定方法、及びそのプログラム
US20130204629A1 (en) 2012-02-08 2013-08-08 Panasonic Corporation Voice input device and display device
US20130238326A1 (en) 2012-03-08 2013-09-12 Lg Electronics Inc. Apparatus and method for multiple device voice control
KR101946364B1 (ko) * 2012-05-01 2019-02-11 엘지전자 주식회사 적어도 하나의 마이크 센서를 갖는 모바일 디바이스 및 그 제어방법
US9251787B1 (en) * 2012-09-26 2016-02-02 Amazon Technologies, Inc. Altering audio to improve automatic speech recognition
WO2014087495A1 (ja) 2012-12-05 2014-06-12 株式会社日立製作所 音声対話ロボット、音声対話ロボットシステム
US9525938B2 (en) * 2013-02-06 2016-12-20 Apple Inc. User voice location estimation for adjusting portable device beamforming settings
US9384751B2 (en) 2013-05-06 2016-07-05 Honeywell International Inc. User authentication of voice controlled devices
EP2911149B1 (en) 2014-02-19 2019-04-17 Nokia Technologies OY Determination of an operational directive based at least in part on a spatial audio property
EP2928210A1 (en) * 2014-04-03 2015-10-07 Oticon A/s A binaural hearing assistance system comprising binaural noise reduction

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7167544B1 (en) * 1999-11-25 2007-01-23 Siemens Aktiengesellschaft Telecommunication system with error messages corresponding to speech recognition errors
US6219645B1 (en) * 1999-12-02 2001-04-17 Lucent Technologies, Inc. Enhanced automatic speech recognition using multiple directional microphones
JP2012220959A (ja) * 2011-04-08 2012-11-12 Sony Computer Entertainment Inc 入力された発話の関連性を判定するための装置および方法
US20140244267A1 (en) * 2013-02-26 2014-08-28 Avaya Inc. Integration of user orientation into a voice command system

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2020003774A (ja) * 2018-06-29 2020-01-09 バイドゥ オンライン ネットワーク テクノロジー (ベイジン) カンパニー リミテッド 音声を処理する方法及び装置
US11244686B2 (en) 2018-06-29 2022-02-08 Baidu Online Network Technology (Beijing) Co., Ltd. Method and apparatus for processing speech
JP2019204103A (ja) * 2018-08-31 2019-11-28 百度在線網絡技術(北京)有限公司 音声スマート機器のウェイクアップ方法、音声スマート機器のウェイクアップ装置、スマート機器及びコンピュータ読み取り可能な記憶媒体
US11355107B2 (en) 2018-08-31 2022-06-07 Baidu Online Network Technology (Beijing) Co., Ltd. Voice smart device wake-up method, apparatus, device and storage medium
JP2022522748A (ja) * 2019-03-01 2022-04-20 マジック リープ, インコーポレイテッド 発話処理エンジンのための入力の決定
JP7580383B2 (ja) 2019-03-01 2024-11-11 マジック リープ, インコーポレイテッド 発話処理エンジンのための入力の決定
KR20210116066A (ko) * 2020-03-17 2021-09-27 성균관대학교산학협력단 심층 신경망을 이용한 음성 발생 방향 추론 방법 및 그 장치
KR102329353B1 (ko) 2020-03-17 2021-11-22 성균관대학교산학협력단 심층 신경망을 이용한 음성 발생 방향 추론 방법 및 그 장치
US20220182756A1 (en) * 2020-12-07 2022-06-09 Gulfstream Aerospace Corporation Microphone array onboard aircraft to determine crew/passenger location and to steer a transducer beam pattern to that location
US11778370B2 (en) * 2020-12-07 2023-10-03 Gulfstream Aerospace Corporation Microphone array onboard aircraft to determine crew/passenger location and to steer a transducer beam pattern to that location

Also Published As

Publication number Publication date
EP3274988A1 (en) 2018-01-31
HUE047117T2 (hu) 2020-04-28
CN107408386B (zh) 2018-11-23
US20160284350A1 (en) 2016-09-29
EP3274988B1 (en) 2019-08-07
WO2016160123A1 (en) 2016-10-06
KR20170131465A (ko) 2017-11-29
US9911416B2 (en) 2018-03-06
KR101981878B1 (ko) 2019-05-23
ES2754448T3 (es) 2020-04-17
CN107408386A (zh) 2017-11-28

Similar Documents

Publication Publication Date Title
KR101981878B1 (ko) 스피치의 방향에 기초한 전자 디바이스의 제어
EP3134896B1 (en) Method and apparatus for activating application by speech input
US11756563B1 (en) Multi-path calculations for device energy levels
US20150302856A1 (en) Method and apparatus for performing function by speech input
KR101752119B1 (ko) 다수의 디바이스에서의 핫워드 검출
US20220093108A1 (en) Speaker identification
CN109791763B (zh) 多设备上的热词检测
US9343068B2 (en) Method and apparatus for controlling access to applications having different security levels
US9892729B2 (en) Method and apparatus for controlling voice activation
US9837068B2 (en) Sound sample verification for generating sound detection model
US20140337030A1 (en) Adaptive audio frame processing for keyword detection
US9867012B2 (en) Whispered speech detection

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20170928

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20180418

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20180418

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20180418

A975 Report on accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A971005

Effective date: 20180703

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20180713

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20181005

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20181221

A045 Written measure of dismissal of application [lapsed due to lack of payment]

Free format text: JAPANESE INTERMEDIATE CODE: A045

Effective date: 20190422