JP6309615B2 - ターゲットキーワードを検出するための方法および装置 - Google Patents

ターゲットキーワードを検出するための方法および装置 Download PDF

Info

Publication number
JP6309615B2
JP6309615B2 JP2016512922A JP2016512922A JP6309615B2 JP 6309615 B2 JP6309615 B2 JP 6309615B2 JP 2016512922 A JP2016512922 A JP 2016512922A JP 2016512922 A JP2016512922 A JP 2016512922A JP 6309615 B2 JP6309615 B2 JP 6309615B2
Authority
JP
Japan
Prior art keywords
state
keyword
score
speech
target keyword
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2016512922A
Other languages
English (en)
Japanese (ja)
Other versions
JP2016526178A5 (enExample
JP2016526178A (ja
Inventor
キム、ソンウン
リ、ミンスブ
キム、テス
ジン、ミンホ
ホワン、キュ・ウォン
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of JP2016526178A publication Critical patent/JP2016526178A/ja
Publication of JP2016526178A5 publication Critical patent/JP2016526178A5/ja
Application granted granted Critical
Publication of JP6309615B2 publication Critical patent/JP6309615B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Probability & Statistics with Applications (AREA)
  • Telephone Function (AREA)
  • Input From Keyboards Or The Like (AREA)
  • Telephonic Communication Services (AREA)
JP2016512922A 2013-05-07 2014-04-24 ターゲットキーワードを検出するための方法および装置 Expired - Fee Related JP6309615B2 (ja)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US201361820498P 2013-05-07 2013-05-07
US61/820,498 2013-05-07
US201361859058P 2013-07-26 2013-07-26
US61/859,058 2013-07-26
US14/087,939 2013-11-22
US14/087,939 US20140337031A1 (en) 2013-05-07 2013-11-22 Method and apparatus for detecting a target keyword
PCT/US2014/035247 WO2014182460A2 (en) 2013-05-07 2014-04-24 Method and apparatus for detecting a target keyword

Publications (3)

Publication Number Publication Date
JP2016526178A JP2016526178A (ja) 2016-09-01
JP2016526178A5 JP2016526178A5 (enExample) 2017-07-20
JP6309615B2 true JP6309615B2 (ja) 2018-04-11

Family

ID=51865436

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2016512922A Expired - Fee Related JP6309615B2 (ja) 2013-05-07 2014-04-24 ターゲットキーワードを検出するための方法および装置

Country Status (6)

Country Link
US (1) US20140337031A1 (enExample)
EP (1) EP2994910B1 (enExample)
JP (1) JP6309615B2 (enExample)
KR (1) KR20160007527A (enExample)
CN (1) CN105190746B (enExample)
WO (1) WO2014182460A2 (enExample)

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9106192B2 (en) 2012-06-28 2015-08-11 Sonos, Inc. System and method for device playback calibration
US9704486B2 (en) * 2012-12-11 2017-07-11 Amazon Technologies, Inc. Speech recognition power management
US9892729B2 (en) * 2013-05-07 2018-02-13 Qualcomm Incorporated Method and apparatus for controlling voice activation
US10770075B2 (en) * 2014-04-21 2020-09-08 Qualcomm Incorporated Method and apparatus for activating application by speech input
CN106601238A (zh) * 2015-10-14 2017-04-26 阿里巴巴集团控股有限公司 一种应用操作的处理方法和装置
US9792907B2 (en) 2015-11-24 2017-10-17 Intel IP Corporation Low resource key phrase detection for wake on voice
US9972313B2 (en) * 2016-03-01 2018-05-15 Intel Corporation Intermediate scoring and rejection loopback for improved key phrase detection
US9763018B1 (en) 2016-04-12 2017-09-12 Sonos, Inc. Calibration of audio playback devices
US10043521B2 (en) 2016-07-01 2018-08-07 Intel IP Corporation User defined key phrase detection by user dependent sequence modeling
US10372406B2 (en) 2016-07-22 2019-08-06 Sonos, Inc. Calibration interface
CN106157950A (zh) * 2016-09-29 2016-11-23 合肥华凌股份有限公司 语音控制系统及其唤醒方法、唤醒装置和家电、协处理器
JP6585022B2 (ja) 2016-11-11 2019-10-02 株式会社東芝 音声認識装置、音声認識方法およびプログラム
WO2018097969A1 (en) * 2016-11-22 2018-05-31 Knowles Electronics, Llc Methods and systems for locating the end of the keyword in voice sensing
US10186265B1 (en) * 2016-12-06 2019-01-22 Amazon Technologies, Inc. Multi-layer keyword detection to avoid detection of keywords in output audio
US10083689B2 (en) * 2016-12-23 2018-09-25 Intel Corporation Linear scoring for low power wake on voice
US10276161B2 (en) * 2016-12-27 2019-04-30 Google Llc Contextual hotwords
JP6585112B2 (ja) * 2017-03-17 2019-10-02 株式会社東芝 音声キーワード検出装置および音声キーワード検出方法
US10593352B2 (en) * 2017-06-06 2020-03-17 Google Llc End of query detection
US10311874B2 (en) 2017-09-01 2019-06-04 4Q Catalyst, LLC Methods and systems for voice-based programming of a voice-controlled device
TWI682385B (zh) * 2018-03-16 2020-01-11 緯創資通股份有限公司 語音服務控制裝置及其方法
US10714122B2 (en) 2018-06-06 2020-07-14 Intel Corporation Speech classification of audio for wake on voice
US10461710B1 (en) 2018-08-28 2019-10-29 Sonos, Inc. Media playback system with maximum volume setting
US10650807B2 (en) 2018-09-18 2020-05-12 Intel Corporation Method and system of neural network keyphrase detection
US11100923B2 (en) * 2018-09-28 2021-08-24 Sonos, Inc. Systems and methods for selective wake word detection using neural network models
US11127394B2 (en) 2019-03-29 2021-09-21 Intel Corporation Method and system of high accuracy keyphrase detection for low resource devices
CN111091849B (zh) * 2020-03-03 2020-12-22 龙马智芯(珠海横琴)科技有限公司 鼾声识别的方法及装置、存储介质止鼾设备和处理器
CN111768783B (zh) * 2020-06-30 2024-04-02 北京百度网讯科技有限公司 语音交互控制方法、装置、电子设备、存储介质和系统
US11721338B2 (en) * 2020-08-26 2023-08-08 International Business Machines Corporation Context-based dynamic tolerance of virtual assistant
WO2024030707A1 (en) * 2022-08-01 2024-02-08 Qualcomm Incorporated Using retired pages history for instruction translation lookaside buffer (tlb) prefetching in processor-based devices
US20240037042A1 (en) * 2022-08-01 2024-02-01 Qualcomm Incorporated Using retired pages history for instruction translation lookaside buffer (tlb) prefetching in processor-based devices
US20250095643A1 (en) * 2023-09-18 2025-03-20 Qualcomm Incorporated Low Power Always-on listening Artificial Intelligence (AI) System

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0511798A (ja) * 1990-09-07 1993-01-22 Toshiba Corp 音声認識装置
US5199077A (en) * 1991-09-19 1993-03-30 Xerox Corporation Wordspotting for voice editing and indexing
JP3422541B2 (ja) * 1992-12-17 2003-06-30 ゼロックス・コーポレーション キーワードのモデル化方法及び非キーワードhmmの提供方法
US5621859A (en) * 1994-01-19 1997-04-15 Bbn Corporation Single tree method for grammar directed, very large vocabulary speech recognizer
US5878164A (en) * 1994-01-21 1999-03-02 Lucent Technologies Inc. Interleaved segmental method for handwriting recognition
JP3726448B2 (ja) * 1997-03-12 2005-12-14 セイコーエプソン株式会社 認識対象音声検出方法およびその装置
JP3911865B2 (ja) * 1998-09-09 2007-05-09 セイコーエプソン株式会社 音声認識装置
DE10030369A1 (de) * 2000-06-21 2002-01-03 Volkswagen Ag Spracherkennungssystem
JP3975400B2 (ja) * 2003-08-20 2007-09-12 ソニー株式会社 監視システム、情報処理装置および方法、記録媒体、並びにプログラム
US8214362B1 (en) * 2007-09-07 2012-07-03 Google Inc. Intelligent identification of form field elements
KR20090107364A (ko) * 2008-04-08 2009-10-13 엘지전자 주식회사 이동 단말기 및 그 메뉴 제어방법
CN101894549A (zh) * 2010-06-24 2010-11-24 中国科学院声学研究所 一种语音识别应用领域中的置信度快速计算方法
US8990259B2 (en) * 2011-06-24 2015-03-24 Cavium, Inc. Anchored patterns
CN102426836B (zh) * 2011-08-25 2013-03-20 哈尔滨工业大学 基于分位数自适应裁剪的快速关键词检出方法
US9992745B2 (en) * 2011-11-01 2018-06-05 Qualcomm Incorporated Extraction and analysis of buffered audio data using multiple codec rates each greater than a low-power processor rate
US9015048B2 (en) * 2012-11-30 2015-04-21 At&T Intellectual Property I, L.P. Incremental speech recognition for dialog systems
US9240182B2 (en) * 2013-09-17 2016-01-19 Qualcomm Incorporated Method and apparatus for adjusting detection threshold for activating voice assistant function

Also Published As

Publication number Publication date
CN105190746A (zh) 2015-12-23
JP2016526178A (ja) 2016-09-01
KR20160007527A (ko) 2016-01-20
CN105190746B (zh) 2019-03-15
EP2994910A2 (en) 2016-03-16
EP2994910B1 (en) 2017-06-14
US20140337031A1 (en) 2014-11-13
WO2014182460A3 (en) 2014-12-31
WO2014182460A2 (en) 2014-11-13

Similar Documents

Publication Publication Date Title
JP6309615B2 (ja) ターゲットキーワードを検出するための方法および装置
EP2994911B1 (en) Adaptive audio frame processing for keyword detection
JP6301451B2 (ja) 音声アクティブ化を制御するための方法および装置
US10770075B2 (en) Method and apparatus for activating application by speech input
KR101981878B1 (ko) 스피치의 방향에 기초한 전자 디바이스의 제어
US9837068B2 (en) Sound sample verification for generating sound detection model
CN116504238A (zh) 服务器侧热词
JP2014510309A (ja) 環境音を認識するためのシステムおよび方法
US20150193199A1 (en) Tracking music in audio stream
JP7753363B2 (ja) ユーザ発話プロファイル管理
JPWO2017154282A1 (ja) 音声処理装置および音声処理方法
CN108831477A (zh) 一种语音识别方法、装置、设备及存储介质
US11195545B2 (en) Method and apparatus for detecting an end of an utterance
CN116153291A (zh) 一种语音识别方法及设备

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20160126

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20160127

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20170329

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20170329

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20170608

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20170608

A975 Report on accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A971005

Effective date: 20170905

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20170912

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20171212

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20180213

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20180314

R150 Certificate of patent or registration of utility model

Ref document number: 6309615

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

LAPS Cancellation because of no payment of annual fees