JP2012047924A - 情報処理装置、および情報処理方法、並びにプログラム - Google Patents

情報処理装置、および情報処理方法、並びにプログラム Download PDF

Info

Publication number
JP2012047924A
JP2012047924A JP2010189123A JP2010189123A JP2012047924A JP 2012047924 A JP2012047924 A JP 2012047924A JP 2010189123 A JP2010189123 A JP 2010189123A JP 2010189123 A JP2010189123 A JP 2010189123A JP 2012047924 A JP2012047924 A JP 2012047924A
Authority
JP
Japan
Prior art keywords
score
information
intention
unit
context
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
JP2010189123A
Other languages
English (en)
Japanese (ja)
Other versions
JP2012047924A5 (enExample
Inventor
Katsuki Minamino
活樹 南野
Atsuo Hiroe
厚夫 廣江
Yukinori Maeda
幸徳 前田
Satoshi Asakawa
智 朝川
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Priority to JP2010189123A priority Critical patent/JP2012047924A/ja
Priority to US13/206,631 priority patent/US8566094B2/en
Priority to CN2011102428227A priority patent/CN102385860A/zh
Publication of JP2012047924A publication Critical patent/JP2012047924A/ja
Publication of JP2012047924A5 publication Critical patent/JP2012047924A5/ja
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)
  • Machine Translation (AREA)
JP2010189123A 2010-08-26 2010-08-26 情報処理装置、および情報処理方法、並びにプログラム Ceased JP2012047924A (ja)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2010189123A JP2012047924A (ja) 2010-08-26 2010-08-26 情報処理装置、および情報処理方法、並びにプログラム
US13/206,631 US8566094B2 (en) 2010-08-26 2011-08-10 Information processing apparatus, information processing method, and program
CN2011102428227A CN102385860A (zh) 2010-08-26 2011-08-19 信息处理设备、信息处理方法及程序

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2010189123A JP2012047924A (ja) 2010-08-26 2010-08-26 情報処理装置、および情報処理方法、並びにプログラム

Publications (2)

Publication Number Publication Date
JP2012047924A true JP2012047924A (ja) 2012-03-08
JP2012047924A5 JP2012047924A5 (enExample) 2013-08-15

Family

ID=45698351

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2010189123A Ceased JP2012047924A (ja) 2010-08-26 2010-08-26 情報処理装置、および情報処理方法、並びにプログラム

Country Status (3)

Country Link
US (1) US8566094B2 (enExample)
JP (1) JP2012047924A (enExample)
CN (1) CN102385860A (enExample)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103474069A (zh) * 2013-09-12 2013-12-25 中国科学院计算技术研究所 用于融合多个语音识别系统的识别结果的方法及系统
JP2014048523A (ja) * 2012-08-31 2014-03-17 Nippon Telegr & Teleph Corp <Ntt> 行動生成モデル作成装置及び行動推定装置
EP3037982A2 (en) 2014-12-25 2016-06-29 Clarion Co., Ltd. Intention estimation equipment and intention estimation system
JP2017032738A (ja) * 2015-07-31 2017-02-09 日本電信電話株式会社 発話意図モデル学習装置、発話意図抽出装置、発話意図モデル学習方法、発話意図抽出方法、プログラム
WO2018134916A1 (ja) * 2017-01-18 2018-07-26 三菱電機株式会社 音声認識装置
CN110162775A (zh) * 2019-03-11 2019-08-23 腾讯科技(深圳)有限公司 确定意图识别准确度的方法、装置及计算机设备
US10460034B2 (en) 2015-01-28 2019-10-29 Mitsubishi Electric Corporation Intention inference system and intention inference method
WO2020039726A1 (ja) * 2018-08-20 2020-02-27 ソニー株式会社 情報処理装置、情報処理システム、および情報処理方法、並びにプログラム
WO2020129695A1 (ja) * 2018-12-21 2020-06-25 ソニー株式会社 情報処理装置、制御方法、情報処理端末、情報処理方法
JP2021015180A (ja) * 2019-07-11 2021-02-12 東芝映像ソリューション株式会社 電子機器、プログラムおよび音声認識方法
WO2022124637A1 (ko) * 2020-12-10 2022-06-16 삼성전자(주) 전자장치 및 그의 제어방법

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014003748A1 (en) * 2012-06-28 2014-01-03 Nuance Communications, Inc. Meta-data inputs to front end processing for automatic speech recognition
US9424233B2 (en) 2012-07-20 2016-08-23 Veveo, Inc. Method of and system for inferring user intent in search input in a conversational interaction system
US9465833B2 (en) 2012-07-31 2016-10-11 Veveo, Inc. Disambiguating user intent in conversational interaction system for large corpus information retrieval
CN104756100B (zh) * 2012-11-30 2017-07-28 三菱电机株式会社 意图估计装置以及意图估计方法
US10354677B2 (en) * 2013-02-28 2019-07-16 Nuance Communications, Inc. System and method for identification of intent segment(s) in caller-agent conversations
DK3640938T3 (da) * 2013-05-07 2024-10-07 Adeia Guides Inc Trinvis taleinputgrænseflade med feedback i realtid
WO2014183035A1 (en) 2013-05-10 2014-11-13 Veveo, Inc. Method and system for capturing and exploiting user intent in a conversational interaction based information retrieval system
US20150206539A1 (en) * 2013-06-04 2015-07-23 Ims Solutions, Inc. Enhanced human machine interface through hybrid word recognition and dynamic speech synthesis tuning
DE112014006542B4 (de) * 2014-03-31 2024-02-08 Mitsubishi Electric Corporation Einrichtung und Verfahren zum Verständnis von einer Benutzerintention
US9852136B2 (en) 2014-12-23 2017-12-26 Rovi Guides, Inc. Systems and methods for determining whether a negation statement applies to a current or past query
US9854049B2 (en) 2015-01-30 2017-12-26 Rovi Guides, Inc. Systems and methods for resolving ambiguous terms in social chatter based on a user profile
EP3282447B1 (en) * 2015-03-31 2020-08-26 Sony Corporation PROGRESSIVE UTTERANCE ANALYSIS FOR SUCCESSIVELY DISPLAYING EARLY SUGGESTIONS BASED ON PARTIAL SEMANTIC PARSES FOR VOICE CONTROL. &#xA;REAL TIME PROGRESSIVE SEMANTIC UTTERANCE ANALYSIS FOR VISUALIZATION AND ACTIONS CONTROL.
US10249297B2 (en) * 2015-07-13 2019-04-02 Microsoft Technology Licensing, Llc Propagating conversational alternatives using delayed hypothesis binding
US11868354B2 (en) * 2015-09-23 2024-01-09 Motorola Solutions, Inc. Apparatus, system, and method for responding to a user-initiated query with a context-based response
US10032451B1 (en) * 2016-12-20 2018-07-24 Amazon Technologies, Inc. User recognition for speech processing systems
CN107404577B (zh) * 2017-07-20 2019-05-17 维沃移动通信有限公司 一种图像处理方法、移动终端及计算机可读存储介质
US10547939B1 (en) * 2018-09-14 2020-01-28 Lenovo (Singapore) Pte. Ltd. Pickup range control
CN115051903B (zh) 2019-02-14 2023-08-04 华为技术有限公司 一种意图处理方法、装置及系统
CN111737670B (zh) * 2019-03-25 2023-08-18 广州汽车集团股份有限公司 多模态数据协同人机交互的方法、系统及车载多媒体装置
US11749281B2 (en) * 2019-12-04 2023-09-05 Soundhound Ai Ip, Llc Neural speech-to-meaning
US20230127907A1 (en) * 2021-10-22 2023-04-27 International Business Machines Corporation Intention identification in dialogue system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002116791A (ja) * 2000-10-11 2002-04-19 Nissan Motor Co Ltd 音声入力装置
JP2006053203A (ja) * 2004-08-10 2006-02-23 Sony Corp 音声処理装置および方法、記録媒体、並びにプログラム
JP2006071791A (ja) * 2004-08-31 2006-03-16 Fuji Heavy Ind Ltd 車両の音声認識装置
JP2006154190A (ja) * 2004-11-29 2006-06-15 Toshiba Corp 音声移動制御装置および音声移動制御方法

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2321299A1 (en) * 1998-03-09 1999-09-16 Lernout & Hauspie Speech Products N.V. Apparatus and method for simultaneous multimode dictation
WO2001067228A1 (en) * 2000-03-09 2001-09-13 Koninklijke Philips Electronics N.V. Method of interacting with a consumer electronics system
US6964023B2 (en) * 2001-02-05 2005-11-08 International Business Machines Corporation System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input
US6611206B2 (en) * 2001-03-15 2003-08-26 Koninklijke Philips Electronics N.V. Automatic system for monitoring independent person requiring occasional assistance
US7283992B2 (en) * 2001-11-30 2007-10-16 Microsoft Corporation Media agent to suggest contextually related media content
US6990639B2 (en) * 2002-02-07 2006-01-24 Microsoft Corporation System and process for controlling electronic components in a ubiquitous computing environment using multimodal integration
US7228275B1 (en) * 2002-10-21 2007-06-05 Toyota Infotechnology Center Co., Ltd. Speech recognition system having multiple speech recognizers
JP4478939B2 (ja) * 2004-09-30 2010-06-09 株式会社国際電気通信基礎技術研究所 音声処理装置およびそのためのコンピュータプログラム
JP4188989B2 (ja) * 2006-09-15 2008-12-03 本田技研工業株式会社 音声認識装置、音声認識方法、及び音声認識プログラム
EP1908640B1 (en) * 2006-10-02 2009-03-04 Harman Becker Automotive Systems GmbH Voice control of vehicular elements from outside a vehicular cabin
US7818166B2 (en) * 2007-01-31 2010-10-19 Motorola, Inc. Method and apparatus for intention based communications for mobile communication devices
US8219406B2 (en) * 2007-03-15 2012-07-10 Microsoft Corporation Speech-centric multimodal user interface design in mobile technology
JP4412504B2 (ja) * 2007-04-17 2010-02-10 本田技研工業株式会社 音声認識装置、音声認識方法、及び音声認識用プログラム
US8423362B2 (en) * 2007-12-21 2013-04-16 General Motors Llc In-vehicle circumstantial speech recognition
US8417526B2 (en) * 2009-03-13 2013-04-09 Adacel, Inc. Speech recognition learning system and method
US8359020B2 (en) * 2010-08-06 2013-01-22 Google Inc. Automatically monitoring for voice input based on context

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002116791A (ja) * 2000-10-11 2002-04-19 Nissan Motor Co Ltd 音声入力装置
JP2006053203A (ja) * 2004-08-10 2006-02-23 Sony Corp 音声処理装置および方法、記録媒体、並びにプログラム
JP2006071791A (ja) * 2004-08-31 2006-03-16 Fuji Heavy Ind Ltd 車両の音声認識装置
JP2006154190A (ja) * 2004-11-29 2006-06-15 Toshiba Corp 音声移動制御装置および音声移動制御方法

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2014048523A (ja) * 2012-08-31 2014-03-17 Nippon Telegr & Teleph Corp <Ntt> 行動生成モデル作成装置及び行動推定装置
CN103474069A (zh) * 2013-09-12 2013-12-25 中国科学院计算技术研究所 用于融合多个语音识别系统的识别结果的方法及系统
CN103474069B (zh) * 2013-09-12 2016-03-30 中国科学院计算技术研究所 用于融合多个语音识别系统的识别结果的方法及系统
EP3037982A2 (en) 2014-12-25 2016-06-29 Clarion Co., Ltd. Intention estimation equipment and intention estimation system
JP2016122336A (ja) * 2014-12-25 2016-07-07 クラリオン株式会社 意図推定装置、および意図推定システム
US9569427B2 (en) 2014-12-25 2017-02-14 Clarion Co., Ltd. Intention estimation equipment and intention estimation system
US10460034B2 (en) 2015-01-28 2019-10-29 Mitsubishi Electric Corporation Intention inference system and intention inference method
JP2017032738A (ja) * 2015-07-31 2017-02-09 日本電信電話株式会社 発話意図モデル学習装置、発話意図抽出装置、発話意図モデル学習方法、発話意図抽出方法、プログラム
JPWO2018134916A1 (ja) * 2017-01-18 2019-04-11 三菱電機株式会社 音声認識装置
WO2018134916A1 (ja) * 2017-01-18 2018-07-26 三菱電機株式会社 音声認識装置
WO2020039726A1 (ja) * 2018-08-20 2020-02-27 ソニー株式会社 情報処理装置、情報処理システム、および情報処理方法、並びにプログラム
WO2020129695A1 (ja) * 2018-12-21 2020-06-25 ソニー株式会社 情報処理装置、制御方法、情報処理端末、情報処理方法
CN110162775A (zh) * 2019-03-11 2019-08-23 腾讯科技(深圳)有限公司 确定意图识别准确度的方法、装置及计算机设备
JP2021015180A (ja) * 2019-07-11 2021-02-12 東芝映像ソリューション株式会社 電子機器、プログラムおよび音声認識方法
JP7216621B2 (ja) 2019-07-11 2023-02-01 Tvs Regza株式会社 電子機器、プログラムおよび音声認識方法
WO2022124637A1 (ko) * 2020-12-10 2022-06-16 삼성전자(주) 전자장치 및 그의 제어방법

Also Published As

Publication number Publication date
US8566094B2 (en) 2013-10-22
US20120053942A1 (en) 2012-03-01
CN102385860A (zh) 2012-03-21

Similar Documents

Publication Publication Date Title
JP2012047924A (ja) 情報処理装置、および情報処理方法、並びにプログラム
US11875820B1 (en) Context driven device arbitration
US11887590B2 (en) Voice enablement and disablement of speech processing functionality
EP3114679B1 (en) Predicting pronunciation in speech recognition
US10643609B1 (en) Selecting speech inputs
US20200120396A1 (en) Speech recognition for localized content
US12455877B1 (en) Identifying user content
US20180182396A1 (en) Multi-speaker speech recognition correction system
CN113748462A (zh) 确定用于语音处理引擎的输入
JP2012037619A (ja) 話者適応化装置、話者適応化方法および話者適応化用プログラム
CN108346427A (zh) 一种语音识别方法、装置、设备及存储介质
WO2020125038A1 (zh) 语音控制方法及装置
JP7511374B2 (ja) 発話区間検知装置、音声認識装置、発話区間検知システム、発話区間検知方法及び発話区間検知プログラム
JP2004347761A (ja) 音声認識装置、音声認識方法、該音声認識方法をコンピュータに対して実行させるためのコンピュータ実行可能なプログラムおよび記憶媒体
JP2004198831A (ja) 音声認識装置および方法、プログラム、並びに記録媒体
US9460714B2 (en) Speech processing apparatus and method
JP2013050605A (ja) 言語モデル切替装置およびそのプログラム
JP5257680B2 (ja) 音声認識装置
JP4864783B2 (ja) パタンマッチング装置、パタンマッチングプログラム、およびパタンマッチング方法
JP2001188782A (ja) 情報処理装置および方法、並びに記録媒体
JP7347511B2 (ja) 音声処理装置、音声処理方法、およびプログラム
CN110875034A (zh) 用于语音识别的模板训练方法、语音识别方法及其系统
JP2002182685A (ja) 認識装置および認識方法、学習装置および学習方法、並びに記録媒体
JP5476760B2 (ja) コマンド認識装置
KR100622019B1 (ko) 음성 인터페이스 시스템 및 방법

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20130628

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20130628

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20140131

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20140218

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20140410

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20141028

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20141222

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20150120

A045 Written measure of dismissal of application [lapsed due to lack of payment]

Free format text: JAPANESE INTERMEDIATE CODE: A045

Effective date: 20150526