JP2012047924A - 情報処理装置、および情報処理方法、並びにプログラム - Google Patents
情報処理装置、および情報処理方法、並びにプログラム Download PDFInfo
- Publication number
- JP2012047924A JP2012047924A JP2010189123A JP2010189123A JP2012047924A JP 2012047924 A JP2012047924 A JP 2012047924A JP 2010189123 A JP2010189123 A JP 2010189123A JP 2010189123 A JP2010189123 A JP 2010189123A JP 2012047924 A JP2012047924 A JP 2012047924A
- Authority
- JP
- Japan
- Prior art keywords
- score
- information
- intention
- unit
- context
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/32—Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- User Interface Of Digital Computer (AREA)
- Machine Translation (AREA)
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2010189123A JP2012047924A (ja) | 2010-08-26 | 2010-08-26 | 情報処理装置、および情報処理方法、並びにプログラム |
| US13/206,631 US8566094B2 (en) | 2010-08-26 | 2011-08-10 | Information processing apparatus, information processing method, and program |
| CN2011102428227A CN102385860A (zh) | 2010-08-26 | 2011-08-19 | 信息处理设备、信息处理方法及程序 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2010189123A JP2012047924A (ja) | 2010-08-26 | 2010-08-26 | 情報処理装置、および情報処理方法、並びにプログラム |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2012047924A true JP2012047924A (ja) | 2012-03-08 |
| JP2012047924A5 JP2012047924A5 (enExample) | 2013-08-15 |
Family
ID=45698351
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2010189123A Ceased JP2012047924A (ja) | 2010-08-26 | 2010-08-26 | 情報処理装置、および情報処理方法、並びにプログラム |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US8566094B2 (enExample) |
| JP (1) | JP2012047924A (enExample) |
| CN (1) | CN102385860A (enExample) |
Cited By (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103474069A (zh) * | 2013-09-12 | 2013-12-25 | 中国科学院计算技术研究所 | 用于融合多个语音识别系统的识别结果的方法及系统 |
| JP2014048523A (ja) * | 2012-08-31 | 2014-03-17 | Nippon Telegr & Teleph Corp <Ntt> | 行動生成モデル作成装置及び行動推定装置 |
| EP3037982A2 (en) | 2014-12-25 | 2016-06-29 | Clarion Co., Ltd. | Intention estimation equipment and intention estimation system |
| JP2017032738A (ja) * | 2015-07-31 | 2017-02-09 | 日本電信電話株式会社 | 発話意図モデル学習装置、発話意図抽出装置、発話意図モデル学習方法、発話意図抽出方法、プログラム |
| WO2018134916A1 (ja) * | 2017-01-18 | 2018-07-26 | 三菱電機株式会社 | 音声認識装置 |
| CN110162775A (zh) * | 2019-03-11 | 2019-08-23 | 腾讯科技(深圳)有限公司 | 确定意图识别准确度的方法、装置及计算机设备 |
| US10460034B2 (en) | 2015-01-28 | 2019-10-29 | Mitsubishi Electric Corporation | Intention inference system and intention inference method |
| WO2020039726A1 (ja) * | 2018-08-20 | 2020-02-27 | ソニー株式会社 | 情報処理装置、情報処理システム、および情報処理方法、並びにプログラム |
| WO2020129695A1 (ja) * | 2018-12-21 | 2020-06-25 | ソニー株式会社 | 情報処理装置、制御方法、情報処理端末、情報処理方法 |
| JP2021015180A (ja) * | 2019-07-11 | 2021-02-12 | 東芝映像ソリューション株式会社 | 電子機器、プログラムおよび音声認識方法 |
| WO2022124637A1 (ko) * | 2020-12-10 | 2022-06-16 | 삼성전자(주) | 전자장치 및 그의 제어방법 |
Families Citing this family (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2014003748A1 (en) * | 2012-06-28 | 2014-01-03 | Nuance Communications, Inc. | Meta-data inputs to front end processing for automatic speech recognition |
| US9424233B2 (en) | 2012-07-20 | 2016-08-23 | Veveo, Inc. | Method of and system for inferring user intent in search input in a conversational interaction system |
| US9465833B2 (en) | 2012-07-31 | 2016-10-11 | Veveo, Inc. | Disambiguating user intent in conversational interaction system for large corpus information retrieval |
| CN104756100B (zh) * | 2012-11-30 | 2017-07-28 | 三菱电机株式会社 | 意图估计装置以及意图估计方法 |
| US10354677B2 (en) * | 2013-02-28 | 2019-07-16 | Nuance Communications, Inc. | System and method for identification of intent segment(s) in caller-agent conversations |
| DK3640938T3 (da) * | 2013-05-07 | 2024-10-07 | Adeia Guides Inc | Trinvis taleinputgrænseflade med feedback i realtid |
| WO2014183035A1 (en) | 2013-05-10 | 2014-11-13 | Veveo, Inc. | Method and system for capturing and exploiting user intent in a conversational interaction based information retrieval system |
| US20150206539A1 (en) * | 2013-06-04 | 2015-07-23 | Ims Solutions, Inc. | Enhanced human machine interface through hybrid word recognition and dynamic speech synthesis tuning |
| DE112014006542B4 (de) * | 2014-03-31 | 2024-02-08 | Mitsubishi Electric Corporation | Einrichtung und Verfahren zum Verständnis von einer Benutzerintention |
| US9852136B2 (en) | 2014-12-23 | 2017-12-26 | Rovi Guides, Inc. | Systems and methods for determining whether a negation statement applies to a current or past query |
| US9854049B2 (en) | 2015-01-30 | 2017-12-26 | Rovi Guides, Inc. | Systems and methods for resolving ambiguous terms in social chatter based on a user profile |
| EP3282447B1 (en) * | 2015-03-31 | 2020-08-26 | Sony Corporation | PROGRESSIVE UTTERANCE ANALYSIS FOR SUCCESSIVELY DISPLAYING EARLY SUGGESTIONS BASED ON PARTIAL SEMANTIC PARSES FOR VOICE CONTROL. 
REAL TIME PROGRESSIVE SEMANTIC UTTERANCE ANALYSIS FOR VISUALIZATION AND ACTIONS CONTROL. |
| US10249297B2 (en) * | 2015-07-13 | 2019-04-02 | Microsoft Technology Licensing, Llc | Propagating conversational alternatives using delayed hypothesis binding |
| US11868354B2 (en) * | 2015-09-23 | 2024-01-09 | Motorola Solutions, Inc. | Apparatus, system, and method for responding to a user-initiated query with a context-based response |
| US10032451B1 (en) * | 2016-12-20 | 2018-07-24 | Amazon Technologies, Inc. | User recognition for speech processing systems |
| CN107404577B (zh) * | 2017-07-20 | 2019-05-17 | 维沃移动通信有限公司 | 一种图像处理方法、移动终端及计算机可读存储介质 |
| US10547939B1 (en) * | 2018-09-14 | 2020-01-28 | Lenovo (Singapore) Pte. Ltd. | Pickup range control |
| CN115051903B (zh) | 2019-02-14 | 2023-08-04 | 华为技术有限公司 | 一种意图处理方法、装置及系统 |
| CN111737670B (zh) * | 2019-03-25 | 2023-08-18 | 广州汽车集团股份有限公司 | 多模态数据协同人机交互的方法、系统及车载多媒体装置 |
| US11749281B2 (en) * | 2019-12-04 | 2023-09-05 | Soundhound Ai Ip, Llc | Neural speech-to-meaning |
| US20230127907A1 (en) * | 2021-10-22 | 2023-04-27 | International Business Machines Corporation | Intention identification in dialogue system |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002116791A (ja) * | 2000-10-11 | 2002-04-19 | Nissan Motor Co Ltd | 音声入力装置 |
| JP2006053203A (ja) * | 2004-08-10 | 2006-02-23 | Sony Corp | 音声処理装置および方法、記録媒体、並びにプログラム |
| JP2006071791A (ja) * | 2004-08-31 | 2006-03-16 | Fuji Heavy Ind Ltd | 車両の音声認識装置 |
| JP2006154190A (ja) * | 2004-11-29 | 2006-06-15 | Toshiba Corp | 音声移動制御装置および音声移動制御方法 |
Family Cites Families (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2321299A1 (en) * | 1998-03-09 | 1999-09-16 | Lernout & Hauspie Speech Products N.V. | Apparatus and method for simultaneous multimode dictation |
| WO2001067228A1 (en) * | 2000-03-09 | 2001-09-13 | Koninklijke Philips Electronics N.V. | Method of interacting with a consumer electronics system |
| US6964023B2 (en) * | 2001-02-05 | 2005-11-08 | International Business Machines Corporation | System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input |
| US6611206B2 (en) * | 2001-03-15 | 2003-08-26 | Koninklijke Philips Electronics N.V. | Automatic system for monitoring independent person requiring occasional assistance |
| US7283992B2 (en) * | 2001-11-30 | 2007-10-16 | Microsoft Corporation | Media agent to suggest contextually related media content |
| US6990639B2 (en) * | 2002-02-07 | 2006-01-24 | Microsoft Corporation | System and process for controlling electronic components in a ubiquitous computing environment using multimodal integration |
| US7228275B1 (en) * | 2002-10-21 | 2007-06-05 | Toyota Infotechnology Center Co., Ltd. | Speech recognition system having multiple speech recognizers |
| JP4478939B2 (ja) * | 2004-09-30 | 2010-06-09 | 株式会社国際電気通信基礎技術研究所 | 音声処理装置およびそのためのコンピュータプログラム |
| JP4188989B2 (ja) * | 2006-09-15 | 2008-12-03 | 本田技研工業株式会社 | 音声認識装置、音声認識方法、及び音声認識プログラム |
| EP1908640B1 (en) * | 2006-10-02 | 2009-03-04 | Harman Becker Automotive Systems GmbH | Voice control of vehicular elements from outside a vehicular cabin |
| US7818166B2 (en) * | 2007-01-31 | 2010-10-19 | Motorola, Inc. | Method and apparatus for intention based communications for mobile communication devices |
| US8219406B2 (en) * | 2007-03-15 | 2012-07-10 | Microsoft Corporation | Speech-centric multimodal user interface design in mobile technology |
| JP4412504B2 (ja) * | 2007-04-17 | 2010-02-10 | 本田技研工業株式会社 | 音声認識装置、音声認識方法、及び音声認識用プログラム |
| US8423362B2 (en) * | 2007-12-21 | 2013-04-16 | General Motors Llc | In-vehicle circumstantial speech recognition |
| US8417526B2 (en) * | 2009-03-13 | 2013-04-09 | Adacel, Inc. | Speech recognition learning system and method |
| US8359020B2 (en) * | 2010-08-06 | 2013-01-22 | Google Inc. | Automatically monitoring for voice input based on context |
-
2010
- 2010-08-26 JP JP2010189123A patent/JP2012047924A/ja not_active Ceased
-
2011
- 2011-08-10 US US13/206,631 patent/US8566094B2/en not_active Expired - Fee Related
- 2011-08-19 CN CN2011102428227A patent/CN102385860A/zh active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002116791A (ja) * | 2000-10-11 | 2002-04-19 | Nissan Motor Co Ltd | 音声入力装置 |
| JP2006053203A (ja) * | 2004-08-10 | 2006-02-23 | Sony Corp | 音声処理装置および方法、記録媒体、並びにプログラム |
| JP2006071791A (ja) * | 2004-08-31 | 2006-03-16 | Fuji Heavy Ind Ltd | 車両の音声認識装置 |
| JP2006154190A (ja) * | 2004-11-29 | 2006-06-15 | Toshiba Corp | 音声移動制御装置および音声移動制御方法 |
Cited By (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2014048523A (ja) * | 2012-08-31 | 2014-03-17 | Nippon Telegr & Teleph Corp <Ntt> | 行動生成モデル作成装置及び行動推定装置 |
| CN103474069A (zh) * | 2013-09-12 | 2013-12-25 | 中国科学院计算技术研究所 | 用于融合多个语音识别系统的识别结果的方法及系统 |
| CN103474069B (zh) * | 2013-09-12 | 2016-03-30 | 中国科学院计算技术研究所 | 用于融合多个语音识别系统的识别结果的方法及系统 |
| EP3037982A2 (en) | 2014-12-25 | 2016-06-29 | Clarion Co., Ltd. | Intention estimation equipment and intention estimation system |
| JP2016122336A (ja) * | 2014-12-25 | 2016-07-07 | クラリオン株式会社 | 意図推定装置、および意図推定システム |
| US9569427B2 (en) | 2014-12-25 | 2017-02-14 | Clarion Co., Ltd. | Intention estimation equipment and intention estimation system |
| US10460034B2 (en) | 2015-01-28 | 2019-10-29 | Mitsubishi Electric Corporation | Intention inference system and intention inference method |
| JP2017032738A (ja) * | 2015-07-31 | 2017-02-09 | 日本電信電話株式会社 | 発話意図モデル学習装置、発話意図抽出装置、発話意図モデル学習方法、発話意図抽出方法、プログラム |
| JPWO2018134916A1 (ja) * | 2017-01-18 | 2019-04-11 | 三菱電機株式会社 | 音声認識装置 |
| WO2018134916A1 (ja) * | 2017-01-18 | 2018-07-26 | 三菱電機株式会社 | 音声認識装置 |
| WO2020039726A1 (ja) * | 2018-08-20 | 2020-02-27 | ソニー株式会社 | 情報処理装置、情報処理システム、および情報処理方法、並びにプログラム |
| WO2020129695A1 (ja) * | 2018-12-21 | 2020-06-25 | ソニー株式会社 | 情報処理装置、制御方法、情報処理端末、情報処理方法 |
| CN110162775A (zh) * | 2019-03-11 | 2019-08-23 | 腾讯科技(深圳)有限公司 | 确定意图识别准确度的方法、装置及计算机设备 |
| JP2021015180A (ja) * | 2019-07-11 | 2021-02-12 | 東芝映像ソリューション株式会社 | 電子機器、プログラムおよび音声認識方法 |
| JP7216621B2 (ja) | 2019-07-11 | 2023-02-01 | Tvs Regza株式会社 | 電子機器、プログラムおよび音声認識方法 |
| WO2022124637A1 (ko) * | 2020-12-10 | 2022-06-16 | 삼성전자(주) | 전자장치 및 그의 제어방법 |
Also Published As
| Publication number | Publication date |
|---|---|
| US8566094B2 (en) | 2013-10-22 |
| US20120053942A1 (en) | 2012-03-01 |
| CN102385860A (zh) | 2012-03-21 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2012047924A (ja) | 情報処理装置、および情報処理方法、並びにプログラム | |
| US11875820B1 (en) | Context driven device arbitration | |
| US11887590B2 (en) | Voice enablement and disablement of speech processing functionality | |
| EP3114679B1 (en) | Predicting pronunciation in speech recognition | |
| US10643609B1 (en) | Selecting speech inputs | |
| US20200120396A1 (en) | Speech recognition for localized content | |
| US12455877B1 (en) | Identifying user content | |
| US20180182396A1 (en) | Multi-speaker speech recognition correction system | |
| CN113748462A (zh) | 确定用于语音处理引擎的输入 | |
| JP2012037619A (ja) | 話者適応化装置、話者適応化方法および話者適応化用プログラム | |
| CN108346427A (zh) | 一种语音识别方法、装置、设备及存储介质 | |
| WO2020125038A1 (zh) | 语音控制方法及装置 | |
| JP7511374B2 (ja) | 発話区間検知装置、音声認識装置、発話区間検知システム、発話区間検知方法及び発話区間検知プログラム | |
| JP2004347761A (ja) | 音声認識装置、音声認識方法、該音声認識方法をコンピュータに対して実行させるためのコンピュータ実行可能なプログラムおよび記憶媒体 | |
| JP2004198831A (ja) | 音声認識装置および方法、プログラム、並びに記録媒体 | |
| US9460714B2 (en) | Speech processing apparatus and method | |
| JP2013050605A (ja) | 言語モデル切替装置およびそのプログラム | |
| JP5257680B2 (ja) | 音声認識装置 | |
| JP4864783B2 (ja) | パタンマッチング装置、パタンマッチングプログラム、およびパタンマッチング方法 | |
| JP2001188782A (ja) | 情報処理装置および方法、並びに記録媒体 | |
| JP7347511B2 (ja) | 音声処理装置、音声処理方法、およびプログラム | |
| CN110875034A (zh) | 用于语音识别的模板训练方法、语音识别方法及其系统 | |
| JP2002182685A (ja) | 認識装置および認識方法、学習装置および学習方法、並びに記録媒体 | |
| JP5476760B2 (ja) | コマンド認識装置 | |
| KR100622019B1 (ko) | 음성 인터페이스 시스템 및 방법 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20130628 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20130628 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20140131 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20140218 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20140410 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20141028 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20141222 |
|
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20150120 |
|
| A045 | Written measure of dismissal of application [lapsed due to lack of payment] |
Free format text: JAPANESE INTERMEDIATE CODE: A045 Effective date: 20150526 |