JP6373985B2 - 音声動作式機能にキーワードモデルを割り当てるための方法および装置 - Google Patents

音声動作式機能にキーワードモデルを割り当てるための方法および装置 Download PDF

Info

Publication number
JP6373985B2
JP6373985B2 JP2016525380A JP2016525380A JP6373985B2 JP 6373985 B2 JP6373985 B2 JP 6373985B2 JP 2016525380 A JP2016525380 A JP 2016525380A JP 2016525380 A JP2016525380 A JP 2016525380A JP 6373985 B2 JP6373985 B2 JP 6373985B2
Authority
JP
Japan
Prior art keywords
keyword
model
electronic device
specific target
sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2016525380A
Other languages
English (en)
Japanese (ja)
Other versions
JP2016532146A5 (enExample
JP2016532146A (ja
Inventor
キム、テス
リ、ミンスブ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of JP2016532146A publication Critical patent/JP2016532146A/ja
Publication of JP2016532146A5 publication Critical patent/JP2016532146A5/ja
Application granted granted Critical
Publication of JP6373985B2 publication Critical patent/JP6373985B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Telephonic Communication Services (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Telephone Function (AREA)
  • Information Transfer Between Computers (AREA)
JP2016525380A 2013-07-08 2014-07-02 音声動作式機能にキーワードモデルを割り当てるための方法および装置 Active JP6373985B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201361843650P 2013-07-08 2013-07-08
US61/843,650 2013-07-08
US14/101,869 2013-12-10
US14/101,869 US9786296B2 (en) 2013-07-08 2013-12-10 Method and apparatus for assigning keyword model to voice operated function
PCT/US2014/045193 WO2015006116A1 (en) 2013-07-08 2014-07-02 Method and apparatus for assigning keyword model to voice operated function

Publications (3)

Publication Number Publication Date
JP2016532146A JP2016532146A (ja) 2016-10-13
JP2016532146A5 JP2016532146A5 (enExample) 2017-12-21
JP6373985B2 true JP6373985B2 (ja) 2018-08-15

Family

ID=52133403

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2016525380A Active JP6373985B2 (ja) 2013-07-08 2014-07-02 音声動作式機能にキーワードモデルを割り当てるための方法および装置

Country Status (6)

Country Link
US (1) US9786296B2 (enExample)
EP (1) EP3020040B1 (enExample)
JP (1) JP6373985B2 (enExample)
KR (1) KR101922782B1 (enExample)
CN (1) CN105340006B (enExample)
WO (1) WO2015006116A1 (enExample)

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10770075B2 (en) * 2014-04-21 2020-09-08 Qualcomm Incorporated Method and apparatus for activating application by speech input
CN105404625A (zh) * 2014-09-03 2016-03-16 富泰华工业(深圳)有限公司 应用程序的查找方法与系统
US9805714B2 (en) * 2016-03-22 2017-10-31 Asustek Computer Inc. Directional keyword verification method applicable to electronic device and electronic device using the same
CN105845125B (zh) * 2016-05-18 2019-05-03 百度在线网络技术(北京)有限公司 语音合成方法和语音合成装置
US10276161B2 (en) * 2016-12-27 2019-04-30 Google Llc Contextual hotwords
CN106898352B (zh) * 2017-02-27 2020-09-25 联想(北京)有限公司 语音控制方法及电子设备
CN107230475B (zh) * 2017-05-27 2022-04-05 腾讯科技(深圳)有限公司 一种语音关键词识别方法、装置、终端及服务器
CN109151155B (zh) * 2017-06-27 2021-03-23 北京搜狗科技发展有限公司 一种通信处理方法、装置及机器可读介质
CN107221332A (zh) * 2017-06-28 2017-09-29 上海与德通讯技术有限公司 机器人的交互方法及系统
CN107564517A (zh) 2017-07-05 2018-01-09 百度在线网络技术(北京)有限公司 语音唤醒方法、设备及系统、云端服务器与可读介质
JP6752870B2 (ja) * 2017-12-18 2020-09-09 ネイバー コーポレーションNAVER Corporation 複数のウェイクワードを利用して人工知能機器を制御する方法およびシステム
KR102079979B1 (ko) * 2017-12-28 2020-02-21 네이버 주식회사 인공지능 기기에서의 복수의 호출 용어를 이용한 서비스 제공 방법 및 그 시스템
KR102361458B1 (ko) * 2018-01-25 2022-02-10 삼성전자주식회사 사용자 발화 응답 방법 및 이를 지원하는 전자 장치
KR102715536B1 (ko) 2018-03-29 2024-10-11 삼성전자주식회사 전자 장치 및 그 제어 방법
CN108665900B (zh) 2018-04-23 2020-03-03 百度在线网络技术(北京)有限公司 云端唤醒方法及系统、终端以及计算机可读存储介质
US11238210B2 (en) 2018-08-22 2022-02-01 Microstrategy Incorporated Generating and presenting customized information cards
US11500655B2 (en) 2018-08-22 2022-11-15 Microstrategy Incorporated Inline and contextual delivery of database content
US11714955B2 (en) 2018-08-22 2023-08-01 Microstrategy Incorporated Dynamic document annotations
US11682390B2 (en) * 2019-02-06 2023-06-20 Microstrategy Incorporated Interactive interface for analytics
KR20200099380A (ko) * 2019-02-14 2020-08-24 삼성전자주식회사 음성 인식 서비스를 제공하는 방법 및 그 전자 장치
EP3785396B1 (en) * 2019-07-17 2022-09-21 Google LLC Systems and methods to verify trigger keywords in acoustic-based digital assistant applications
KR102433964B1 (ko) * 2019-09-30 2022-08-22 주식회사 오투오 관계 설정을 이용한 실감형 인공지능기반 음성 비서시스템
KR102865574B1 (ko) 2019-10-15 2025-09-29 삼성전자주식회사 웨이크업 모델 생성 방법 및 이를 위한 전자 장치
KR20210045241A (ko) 2019-10-16 2021-04-26 삼성전자주식회사 전자 장치 및 전자 장치의 음성 명령어 공유 방법
KR102862238B1 (ko) 2020-01-21 2025-09-19 삼성전자주식회사 디스플레이 장치 및 그 제어방법
US11082487B1 (en) 2020-09-22 2021-08-03 Vignet Incorporated Data sharing across decentralized clinical trials using customized data access policies
CN115334030B (zh) * 2022-08-08 2023-09-19 阿里健康科技(中国)有限公司 语音消息显示方法及装置
WO2024072036A1 (ko) * 2022-09-30 2024-04-04 삼성전자 주식회사 음성인식 장치 및 음성인식 장치의 동작방법
CN115910058B (zh) * 2022-10-31 2025-08-26 青岛海尔科技有限公司 操作意图的识别方法和装置、存储介质及电子装置
US12007870B1 (en) 2022-11-03 2024-06-11 Vignet Incorporated Monitoring and adjusting data collection from remote participants for health research
US11790107B1 (en) 2022-11-03 2023-10-17 Vignet Incorporated Data sharing platform for researchers conducting clinical trials

Family Cites Families (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5054082A (en) 1988-06-30 1991-10-01 Motorola, Inc. Method and apparatus for programming devices to recognize voice commands
JPH1078952A (ja) 1996-07-29 1998-03-24 Internatl Business Mach Corp <Ibm> 音声合成方法、音声合成装置、ハイパーテキストの制御方法及び制御装置
US6092192A (en) * 1998-01-16 2000-07-18 International Business Machines Corporation Apparatus and methods for providing repetitive enrollment in a plurality of biometric recognition systems based on an initial enrollment
US6128482A (en) * 1998-12-22 2000-10-03 General Motors Corporation Providing mobile application services with download of speaker independent voice model
US6442519B1 (en) 1999-11-10 2002-08-27 International Business Machines Corp. Speaker model adaptation via network of similar users
US7219058B1 (en) * 2000-10-13 2007-05-15 At&T Corp. System and method for processing speech recognition results
US6885735B2 (en) * 2001-03-29 2005-04-26 Intellisist, Llc System and method for transmitting voice input from a remote location over a wireless data channel
US20030005412A1 (en) * 2001-04-06 2003-01-02 Eanes James Thomas System for ontology-based creation of software agents from reusable components
US20030007609A1 (en) * 2001-07-03 2003-01-09 Yuen Michael S. Method and apparatus for development, deployment, and maintenance of a voice software application for distribution to one or more consumers
US6810378B2 (en) 2001-08-22 2004-10-26 Lucent Technologies Inc. Method and apparatus for controlling a speech synthesis system to provide multiple styles of speech
US7054817B2 (en) * 2002-01-25 2006-05-30 Canon Europa N.V. User interface for speech model generation and testing
EP1490864A4 (en) 2002-02-26 2006-03-15 Sap Ag Intelligent personal assistants
US7099825B1 (en) 2002-03-15 2006-08-29 Sprint Communications Company L.P. User mobility in a voice recognition environment
JP2004164466A (ja) * 2002-11-15 2004-06-10 Sony Corp 情報更新システム、情報処理装置および情報更新方法
US7603276B2 (en) 2002-11-21 2009-10-13 Panasonic Corporation Standard-model generation for speech recognition using a reference model
US7437294B1 (en) * 2003-11-21 2008-10-14 Sprint Spectrum L.P. Methods for selecting acoustic model for use in a voice command platform
CN101164102B (zh) 2005-02-03 2012-06-20 语音信号科技公司 自动扩展移动通信设备的话音词汇的方法和装置
US7706510B2 (en) 2005-03-16 2010-04-27 Research In Motion System and method for personalized text-to-voice synthesis
JP4843987B2 (ja) * 2005-04-05 2011-12-21 ソニー株式会社 情報処理装置、情報処理方法、およびプログラム
US7949529B2 (en) * 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
US7941316B2 (en) * 2005-10-28 2011-05-10 Microsoft Corporation Combined speech and alternate input modality to a mobile device
JP5208104B2 (ja) * 2006-05-12 2013-06-12 ニュアンス コミュニケーションズ オーストリア ゲーエムベーハー 第1の適応化データ処理バージョンから第2の適応化データ処理バージョンに切り替えるための方法
US8234120B2 (en) * 2006-07-26 2012-07-31 Nuance Communications, Inc. Performing a safety analysis for user-defined voice commands to ensure that the voice commands do not cause speech recognition ambiguities
US7689417B2 (en) * 2006-09-04 2010-03-30 Fortemedia, Inc. Method, system and apparatus for improved voice recognition
US7831431B2 (en) * 2006-10-31 2010-11-09 Honda Motor Co., Ltd. Voice recognition updates via remote broadcast signal
US8886537B2 (en) 2007-03-20 2014-11-11 Nuance Communications, Inc. Method and system for text-to-speech synthesis with personalized voice
US20090132920A1 (en) * 2007-11-20 2009-05-21 Microsoft Corporation Community-based software application help system
JP5266761B2 (ja) * 2008-01-10 2013-08-21 日産自動車株式会社 情報案内システムおよびその認識辞書データベース更新方法
US8255224B2 (en) * 2008-03-07 2012-08-28 Google Inc. Voice recognition grammar selection based on context
US8626511B2 (en) * 2010-01-22 2014-01-07 Google Inc. Multi-dimensional disambiguation of voice commands
US8468012B2 (en) * 2010-05-26 2013-06-18 Google Inc. Acoustic model adaptation using geographic information
US9484018B2 (en) * 2010-11-23 2016-11-01 At&T Intellectual Property I, L.P. System and method for building and evaluating automatic speech recognition via an application programmer interface
JP5494468B2 (ja) * 2010-12-27 2014-05-14 富士通株式会社 状態検出装置、状態検出方法および状態検出のためのプログラム
US20140100847A1 (en) * 2011-07-05 2014-04-10 Mitsubishi Electric Corporation Voice recognition device and navigation device
US20130085753A1 (en) * 2011-09-30 2013-04-04 Google Inc. Hybrid Client/Server Speech Recognition In A Mobile Device
US9329751B2 (en) * 2011-10-07 2016-05-03 Predictive Analystics Solutions Pvt. Ltd. Method and a system to generate a user interface for analytical models
JP2013254483A (ja) * 2012-05-11 2013-12-19 Ricoh Co Ltd 情報処理装置、情報処理装置の制御プログラム、画像形成装置
US20150088523A1 (en) * 2012-09-10 2015-03-26 Google Inc. Systems and Methods for Designing Voice Applications
US8935167B2 (en) * 2012-09-25 2015-01-13 Apple Inc. Exemplar-based latent perceptual modeling for automatic speech recognition
US8719229B1 (en) * 2012-10-12 2014-05-06 Autodesk, Inc. Cloud platform for managing design data
US10135968B2 (en) * 2013-04-15 2018-11-20 Nuance Communications, Inc. System and method for acoustic echo cancellation
JP5762660B2 (ja) * 2013-05-21 2015-08-12 三菱電機株式会社 音声認識装置、認識結果表示装置および表示方法

Also Published As

Publication number Publication date
US20150012279A1 (en) 2015-01-08
EP3020040B1 (en) 2018-12-19
US9786296B2 (en) 2017-10-10
JP2016532146A (ja) 2016-10-13
WO2015006116A9 (en) 2015-05-21
CN105340006A (zh) 2016-02-17
WO2015006116A1 (en) 2015-01-15
CN105340006B (zh) 2019-05-03
KR20160030199A (ko) 2016-03-16
KR101922782B1 (ko) 2018-11-27
EP3020040A1 (en) 2016-05-18

Similar Documents

Publication Publication Date Title
JP6373985B2 (ja) 音声動作式機能にキーワードモデルを割り当てるための方法および装置
US9959863B2 (en) Keyword detection using speaker-independent keyword models for user-designated keywords
CN107210033B (zh) 基于众包来更新用于数字个人助理的语言理解分类器模型
KR101649771B1 (ko) 발성 처리를 위한 인식기들의 마크업 언어 기반 선택 및 이용
US8682640B2 (en) Self-configuring language translation device
CN103442130A (zh) 语音操控方法、移动终端装置及语音操控系统
US20150193199A1 (en) Tracking music in audio stream
KR20190122457A (ko) 음성 인식을 수행하는 전자 장치 및 전자 장치의 동작 방법
US9224388B2 (en) Sound recognition method and system
US11948564B2 (en) Information processing device and information processing method
JP6944920B2 (ja) スマートインタラクティブの処理方法、装置、設備及びコンピュータ記憶媒体
CN101989285A (zh) 数据的查询和提供方法、查询系统及其可携式装置与服务器
HK1241551B (zh) 基於众包来更新用於数字个人助理的语言理解分类器模型
HK1241551A1 (en) Updating language understanding classifier models for a digital personal assistant based on crowd-sourcing

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20160325

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20170605

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20170605

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20171102

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20171102

A975 Report on accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A971005

Effective date: 20171211

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20180109

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20180406

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20180619

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20180718

R150 Certificate of patent or registration of utility model

Ref document number: 6373985

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250