CN102385860A - 信息处理设备、信息处理方法及程序 - Google Patents

信息处理设备、信息处理方法及程序 Download PDF

Info

Publication number
CN102385860A
CN102385860A CN2011102428227A CN201110242822A CN102385860A CN 102385860 A CN102385860 A CN 102385860A CN 2011102428227 A CN2011102428227 A CN 2011102428227A CN 201110242822 A CN201110242822 A CN 201110242822A CN 102385860 A CN102385860 A CN 102385860A
Authority
CN
China
Prior art keywords
score
information
intention
context
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011102428227A
Other languages
English (en)
Chinese (zh)
Inventor
南野活树
广江厚夫
前田幸德
朝川智
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Publication of CN102385860A publication Critical patent/CN102385860A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)
  • Machine Translation (AREA)
CN2011102428227A 2010-08-26 2011-08-19 信息处理设备、信息处理方法及程序 Pending CN102385860A (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2010189123A JP2012047924A (ja) 2010-08-26 2010-08-26 情報処理装置、および情報処理方法、並びにプログラム
JP2010-189123 2010-08-26

Publications (1)

Publication Number Publication Date
CN102385860A true CN102385860A (zh) 2012-03-21

Family

ID=45698351

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011102428227A Pending CN102385860A (zh) 2010-08-26 2011-08-19 信息处理设备、信息处理方法及程序

Country Status (3)

Country Link
US (1) US8566094B2 (https=)
JP (1) JP2012047924A (https=)
CN (1) CN102385860A (https=)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106463114A (zh) * 2015-03-31 2017-02-22 索尼公司 信息处理设备、控制方法及程序
CN104756100B (zh) * 2012-11-30 2017-07-28 三菱电机株式会社 意图估计装置以及意图估计方法
CN107404577A (zh) * 2017-07-20 2017-11-28 维沃移动通信有限公司 一种图像处理方法、移动终端及计算机可读存储介质
CN107924679A (zh) * 2015-07-13 2018-04-17 微软技术许可有限责任公司 输入理解处理期间在响应选择中的延迟绑定
CN111565114A (zh) * 2019-02-14 2020-08-21 华为技术有限公司 一种意图处理方法、装置及系统
CN111737670A (zh) * 2019-03-25 2020-10-02 广州汽车集团股份有限公司 多模态数据协同人机交互的方法、系统及车载多媒体装置
CN113012686A (zh) * 2019-12-04 2021-06-22 声音猎手公司 神经语音到意思

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014003748A1 (en) * 2012-06-28 2014-01-03 Nuance Communications, Inc. Meta-data inputs to front end processing for automatic speech recognition
US8577671B1 (en) 2012-07-20 2013-11-05 Veveo, Inc. Method of and system for using conversation state information in a conversational interaction system
US9465833B2 (en) 2012-07-31 2016-10-11 Veveo, Inc. Disambiguating user intent in conversational interaction system for large corpus information retrieval
JP5781040B2 (ja) * 2012-08-31 2015-09-16 日本電信電話株式会社 行動推定装置およびそのプログラム
US10354677B2 (en) * 2013-02-28 2019-07-16 Nuance Communications, Inc. System and method for identification of intent segment(s) in caller-agent conversations
US10121493B2 (en) * 2013-05-07 2018-11-06 Veveo, Inc. Method of and system for real time feedback in an incremental speech input interface
WO2014183035A1 (en) 2013-05-10 2014-11-13 Veveo, Inc. Method and system for capturing and exploiting user intent in a conversational interaction based information retrieval system
WO2014197592A2 (en) * 2013-06-04 2014-12-11 Ims Solutions Inc. Enhanced human machine interface through hybrid word recognition and dynamic speech synthesis tuning
CN103474069B (zh) * 2013-09-12 2016-03-30 中国科学院计算技术研究所 用于融合多个语音识别系统的识别结果的方法及系统
JPWO2015151157A1 (ja) * 2014-03-31 2017-04-13 三菱電機株式会社 意図理解装置および方法
US9852136B2 (en) 2014-12-23 2017-12-26 Rovi Guides, Inc. Systems and methods for determining whether a negation statement applies to a current or past query
JP6514503B2 (ja) * 2014-12-25 2019-05-15 クラリオン株式会社 意図推定装置、および意図推定システム
US10460034B2 (en) 2015-01-28 2019-10-29 Mitsubishi Electric Corporation Intention inference system and intention inference method
US9854049B2 (en) 2015-01-30 2017-12-26 Rovi Guides, Inc. Systems and methods for resolving ambiguous terms in social chatter based on a user profile
JP6370749B2 (ja) * 2015-07-31 2018-08-08 日本電信電話株式会社 発話意図モデル学習装置、発話意図抽出装置、発話意図モデル学習方法、発話意図抽出方法、プログラム
US11868354B2 (en) * 2015-09-23 2024-01-09 Motorola Solutions, Inc. Apparatus, system, and method for responding to a user-initiated query with a context-based response
US10032451B1 (en) * 2016-12-20 2018-07-24 Amazon Technologies, Inc. User recognition for speech processing systems
JP6532619B2 (ja) * 2017-01-18 2019-06-19 三菱電機株式会社 音声認識装置
WO2020039726A1 (ja) * 2018-08-20 2020-02-27 ソニー株式会社 情報処理装置、情報処理システム、および情報処理方法、並びにプログラム
US10547939B1 (en) * 2018-09-14 2020-01-28 Lenovo (Singapore) Pte. Ltd. Pickup range control
JP2022028094A (ja) * 2018-12-21 2022-02-15 ソニーグループ株式会社 情報処理装置、制御方法、情報処理端末、情報処理方法
CN110162775B (zh) * 2019-03-11 2024-08-20 腾讯科技(深圳)有限公司 确定意图识别准确度的方法、装置及计算机设备
JP7216621B2 (ja) * 2019-07-11 2023-02-01 Tvs Regza株式会社 電子機器、プログラムおよび音声認識方法
KR20220082577A (ko) * 2020-12-10 2022-06-17 삼성전자주식회사 전자장치 및 그의 제어방법
US20230127907A1 (en) * 2021-10-22 2023-04-27 International Business Machines Corporation Intention identification in dialogue system
CN115527529A (zh) * 2022-09-19 2022-12-27 广东粤港澳大湾区国家纳米科技创新研究院 一种语音意图识别的方法、装置、电子设备及存储介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1372660A (zh) * 2000-03-09 2002-10-02 皇家菲利浦电子有限公司 与消费电子系统进行交互的方法
CN1527992A (zh) * 2001-03-15 2004-09-08 �ʼҷ����ֵ������޹�˾ 监视偶尔需要帮助的独居者的自动系统
US7228275B1 (en) * 2002-10-21 2007-06-05 Toyota Infotechnology Center Co., Ltd. Speech recognition system having multiple speech recognizers

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2321299A1 (en) * 1998-03-09 1999-09-16 Lernout & Hauspie Speech Products N.V. Apparatus and method for simultaneous multimode dictation
JP3760755B2 (ja) * 2000-10-11 2006-03-29 日産自動車株式会社 音声入力装置
US6964023B2 (en) * 2001-02-05 2005-11-08 International Business Machines Corporation System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input
US7283992B2 (en) * 2001-11-30 2007-10-16 Microsoft Corporation Media agent to suggest contextually related media content
US6990639B2 (en) * 2002-02-07 2006-01-24 Microsoft Corporation System and process for controlling electronic components in a ubiquitous computing environment using multimodal integration
JP4581549B2 (ja) * 2004-08-10 2010-11-17 ソニー株式会社 音声処理装置および方法、記録媒体、並びにプログラム
JP2006071791A (ja) * 2004-08-31 2006-03-16 Fuji Heavy Ind Ltd 車両の音声認識装置
JP4478939B2 (ja) * 2004-09-30 2010-06-09 株式会社国際電気通信基礎技術研究所 音声処理装置およびそのためのコンピュータプログラム
JP4282590B2 (ja) * 2004-11-29 2009-06-24 株式会社東芝 音声移動制御装置および音声移動制御方法
JP4188989B2 (ja) * 2006-09-15 2008-12-03 本田技研工業株式会社 音声認識装置、音声認識方法、及び音声認識プログラム
DE602006005493D1 (de) * 2006-10-02 2009-04-16 Harman Becker Automotive Sys Sprachsteuerung von Fahrzeugelementen von außerhalb einer Fahrzeugkabine
US7818166B2 (en) * 2007-01-31 2010-10-19 Motorola, Inc. Method and apparatus for intention based communications for mobile communication devices
US8219406B2 (en) * 2007-03-15 2012-07-10 Microsoft Corporation Speech-centric multimodal user interface design in mobile technology
JP4412504B2 (ja) * 2007-04-17 2010-02-10 本田技研工業株式会社 音声認識装置、音声認識方法、及び音声認識用プログラム
US8423362B2 (en) * 2007-12-21 2013-04-16 General Motors Llc In-vehicle circumstantial speech recognition
US8417526B2 (en) * 2009-03-13 2013-04-09 Adacel, Inc. Speech recognition learning system and method
US8359020B2 (en) * 2010-08-06 2013-01-22 Google Inc. Automatically monitoring for voice input based on context

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1372660A (zh) * 2000-03-09 2002-10-02 皇家菲利浦电子有限公司 与消费电子系统进行交互的方法
CN1527992A (zh) * 2001-03-15 2004-09-08 �ʼҷ����ֵ������޹�˾ 监视偶尔需要帮助的独居者的自动系统
US7228275B1 (en) * 2002-10-21 2007-06-05 Toyota Infotechnology Center Co., Ltd. Speech recognition system having multiple speech recognizers

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104756100B (zh) * 2012-11-30 2017-07-28 三菱电机株式会社 意图估计装置以及意图估计方法
CN106463114A (zh) * 2015-03-31 2017-02-22 索尼公司 信息处理设备、控制方法及程序
CN106463114B (zh) * 2015-03-31 2020-10-27 索尼公司 信息处理设备、控制方法及程序存储单元
CN107924679A (zh) * 2015-07-13 2018-04-17 微软技术许可有限责任公司 输入理解处理期间在响应选择中的延迟绑定
CN107924679B (zh) * 2015-07-13 2021-11-05 微软技术许可有限责任公司 计算机实施的方法、输入理解系统和计算机可读存储设备
CN107404577A (zh) * 2017-07-20 2017-11-28 维沃移动通信有限公司 一种图像处理方法、移动终端及计算机可读存储介质
CN111565114A (zh) * 2019-02-14 2020-08-21 华为技术有限公司 一种意图处理方法、装置及系统
CN111565114B (zh) * 2019-02-14 2022-05-13 华为技术有限公司 一种意图处理方法、装置及系统
US12309043B2 (en) 2019-02-14 2025-05-20 Huawei Technologies Co., Ltd. Intent processing method, apparatus, and system
CN111737670A (zh) * 2019-03-25 2020-10-02 广州汽车集团股份有限公司 多模态数据协同人机交互的方法、系统及车载多媒体装置
CN111737670B (zh) * 2019-03-25 2023-08-18 广州汽车集团股份有限公司 多模态数据协同人机交互的方法、系统及车载多媒体装置
CN113012686A (zh) * 2019-12-04 2021-06-22 声音猎手公司 神经语音到意思
CN113012686B (zh) * 2019-12-04 2025-08-26 声音猎手公司 神经语音到意思

Also Published As

Publication number Publication date
JP2012047924A (ja) 2012-03-08
US20120053942A1 (en) 2012-03-01
US8566094B2 (en) 2013-10-22

Similar Documents

Publication Publication Date Title
CN102385860A (zh) 信息处理设备、信息处理方法及程序
US11887590B2 (en) Voice enablement and disablement of speech processing functionality
CN109509470B (zh) 语音交互方法、装置、计算机可读存储介质及终端设备
US10037758B2 (en) Device and method for understanding user intent
US10283111B1 (en) Disambiguation in speech recognition
US10210862B1 (en) Lattice decoding and result confirmation using recurrent neural networks
CN111164676B (zh) 经由环境语境采集进行的语音模型个性化
US9484021B1 (en) Disambiguation in speech recognition
JP6550068B2 (ja) 音声認識における発音予測
US10056078B1 (en) Output of content based on speech-based searching and browsing requests
US8620658B2 (en) Voice chat system, information processing apparatus, speech recognition method, keyword data electrode detection method, and program for speech recognition
CN101989424B (zh) 语音处理设备和方法
JP5141695B2 (ja) 記号挿入装置および記号挿入方法
JP4987682B2 (ja) 音声チャットシステム、情報処理装置、音声認識方法およびプログラム
US10152298B1 (en) Confidence estimation based on frequency
US20210210073A1 (en) Artificial intelligence device for providing speech recognition function and method of operating artificial intelligence device
JP7230806B2 (ja) 情報処理装置、及び情報処理方法
US10504512B1 (en) Natural language speech processing application selection
JP2013050605A (ja) 言語モデル切替装置およびそのプログラム
JP5183120B2 (ja) 平方根ディスカウンティングを使用した統計的言語による音声認識
US20250104707A1 (en) Artificial intelligence device
CN110875034A (zh) 用于语音识别的模板训练方法、语音识别方法及其系统
US12322407B2 (en) Artificial intelligence device configured to generate a mask value
JP2026020119A (ja) サーバ及びこれを含むシステム
CN118525329A (zh) 话音合成装置和话音合成方法

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120321