JP2020067585A - Communication device and control program for communication device - Google Patents
Communication device and control program for communication device
- Publication number
- JP2020067585A (application JP2018200832A)
- Authority
- JP
- Japan
- Prior art keywords
- utterance
- response
- probability
- response generation
- generation module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J11/00—Manipulators not otherwise provided for
- B25J11/0005—Manipulators having means for high-level communication with users, e.g. speech generator, face recognition means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/004—Artificial life, i.e. computing arrangements simulating life
- G06N3/008—Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/04—Inference or reasoning models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/225—Feedback of the input speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Artificial Intelligence (AREA)
- Multimedia (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Robotics (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Mechanical Engineering (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Manipulator (AREA)
Abstract
Description
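P(question response | question) = 70% × 50% = 35%
P(question response | information provision) = 15% × 25% = 3.75%
P(question response | request) = 10% × 10% = 1%
P(question response | non-dialogue) = 5% × 15% = 0.75%
are obtained. Similarly, the selection probabilities of the association response generation module 205b are
P(association response | question) = 10% × 50% = 5%
P(association response | information provision) = 40% × 25% = 10%
P(association response | request) = 20% × 10% = 2%
P(association response | non-dialogue) = 30% × 15% = 4.5%
The selection probabilities of the example response generation module 205c, the sympathetic response generation module 205d, and the imitation response generation module 205e are calculated in the same way.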
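The arithmetic above can be reproduced with a short sketch. In each product the first factor varies by module and the second is the same for a given class, so the sketch treats the first as the module's association probability and the second as the class probability of the input utterance. The dictionary layout and the function name `selection_probabilities` are assumptions made for illustration; only the numeric values come from the example, and only the two modules whose values are given are included.

```python
# Class probabilities of the input utterance (second factor in each product above).
CLASS_PROB = {
    "question": 0.50,
    "information provision": 0.25,
    "request": 0.10,
    "non-dialogue": 0.15,
}

# Association probabilities per module and class (first factor in each product above).
ASSOC_PROB = {
    "question response": {
        "question": 0.70, "information provision": 0.15,
        "request": 0.10, "non-dialogue": 0.05,
    },
    "association response": {
        "question": 0.10, "information provision": 0.40,
        "request": 0.20, "non-dialogue": 0.30,
    },
}


def selection_probabilities(assoc_prob, class_prob):
    """Multiply each association probability by the matching class probability."""
    return {
        module: {cls: row[cls] * class_prob[cls] for cls in class_prob}
        for module, row in assoc_prob.items()
    }


table = selection_probabilities(ASSOC_PROB, CLASS_PROB)
for module, row in table.items():
    for cls, p in row.items():
        print(f"P({module} | {cls}) = {p:.2%}")
```

The remaining modules (205c to 205e) would be added as further rows of `ASSOC_PROB`; their association probabilities are not given in this excerpt, so they are left out rather than guessed.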
Claims (5)
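- A communication device comprising:
an input unit that inputs an input utterance, which is an utterance by a user;
a calculation unit that calculates class probabilities, each being the probability that the input utterance belongs to one of a plurality of classification classes predetermined as types of utterance content;
a plurality of response generation modules, provided one per response type, each of which generates a response utterance corresponding to its type;
a determination unit that selects one of the plurality of response generation modules based on association probabilities, which are set for each of the plurality of response generation modules and indicate the degree of association with each of the plurality of classification classes, and on the class probabilities calculated by the calculation unit, and that determines the response utterance generated by the selected response generation module to be the output utterance to be uttered to the user; and
an output unit that outputs the output utterance.
- The communication device according to claim 1, wherein the determination unit randomly selects one response generation module from among those of the plurality of response generation modules whose selection probability, obtained by multiplying the association probability by the class probability, is equal to or greater than a predetermined reference value.
- The communication device according to claim 1 or 2, wherein the determination unit selects one of the plurality of response generation modules after multiplying the association probability by a past coefficient that is set so as to lower the probability that a previously selected response generation module is selected.
- The communication device according to any one of claims 1 to 3, wherein, among the plurality of response generation modules, the selected response generation module generates the response utterance after being selected by the determination unit.
- A control program for a communication device, the program causing a computer to execute:
an input step of inputting an input utterance, which is an utterance by a user;
a calculation step of calculating class probabilities, each being the probability that the input utterance belongs to one of a plurality of classification classes predetermined as types of utterance content;
a determination step of selecting one of a plurality of response generation modules, which are provided one per response type and each generate a response utterance corresponding to its type, based on association probabilities, which are set for each of the plurality of response generation modules and indicate the degree of association with each of the plurality of classification classes, and on the class probabilities calculated in the calculation step, and of determining the response utterance generated by the selected response generation module to be the output utterance to be uttered to the user; and
an output step of outputting the output utterance.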
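As a rough illustration of how the determination unit in claims 2 to 4 might behave, the following sketch filters modules by a reference value, applies an optional past coefficient, picks one candidate at random, and only then runs the chosen generator. It is a minimal sketch under stated assumptions, not the patented implementation: the function names, the dictionary layout, the placeholder generators, and in particular the rule that a module is scored by its largest per-class selection probability are all assumptions, since the claims do not say how the per-class values are combined. The numeric values are borrowed from the worked example in the description.

```python
import random

# Illustrative values borrowed from the worked example in the description.
CLASS_PROB = {"question": 0.50, "information provision": 0.25,
              "request": 0.10, "non-dialogue": 0.15}
ASSOC_PROB = {
    "question response": {"question": 0.70, "information provision": 0.15,
                          "request": 0.10, "non-dialogue": 0.05},
    "association response": {"question": 0.10, "information provision": 0.40,
                             "request": 0.20, "non-dialogue": 0.30},
}


def choose_module(assoc_prob, class_prob, reference, past_coeff=None, rng=random):
    """Pick one module at random among those reaching the reference value.

    past_coeff maps a module name to a factor (hypothetically in (0, 1]) that
    lowers the chance of re-selecting a previously chosen module (claim 3).
    Returns None when no module qualifies.
    """
    past_coeff = past_coeff or {}
    candidates = []
    for module, row in assoc_prob.items():
        k = past_coeff.get(module, 1.0)
        # ASSUMPTION: score a module by its largest per-class selection probability.
        score = max(k * row[cls] * class_prob[cls] for cls in class_prob)
        if score >= reference:
            candidates.append(module)
    return rng.choice(candidates) if candidates else None


def respond(utterance, generators, reference, past_coeff=None, rng=random):
    """Select first, then generate: only the chosen generator is called (claim 4)."""
    name = choose_module(ASSOC_PROB, CLASS_PROB, reference, past_coeff, rng)
    if name is None:
        return None
    return generators[name](utterance)


# Hypothetical placeholder generators that record which one actually ran.
calls = []
GENERATORS = {
    "question response":
        lambda u: calls.append("question response") or f"[answer to] {u}",
    "association response":
        lambda u: calls.append("association response") or f"[association from] {u}",
}

reply = respond("What time is it?", GENERATORS, reference=0.2)
print(reply, calls)
```

Generating only after selection means the unselected modules do no work for that turn. With a lower reference value both modules would qualify and the random choice would decide between them.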
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2018200832A JP7063230B2 (ja) | 2018-10-25 | 2018-10-25 | Communication device and control program for communication device |
US16/661,495 US11222638B2 (en) | 2018-10-25 | 2019-10-23 | Communication device and control program for communication device |
CN201911016606.3A CN111192577B (zh) | 2018-10-25 | 2019-10-24 | Communication device and control program for communication device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2018200832A JP7063230B2 (ja) | 2018-10-25 | 2018-10-25 | Communication device and control program for communication device |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2020067585A (ja) | 2020-04-30 |
JP7063230B2 (ja) | 2022-05-09 |
Family
ID=70327099
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2018200832A (Active) JP7063230B2 (ja) | 2018-10-25 | 2018-10-25 | Communication device and control program for communication device |
Country Status (3)
Country | Link |
---|---|
US (1) | US11222638B2 (ja) |
JP (1) | JP7063230B2 (ja) |
CN (1) | CN111192577B (ja) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2022120500A (ja) * | 2021-02-05 | 2022-08-18 | Necパーソナルコンピュータ株式会社 | Learning support system, learning support method, and program |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11443731B2 (en) | 2020-04-09 | 2022-09-13 | Rovi Guides, Inc. | Systems and methods for generating synthesized speech responses to voice inputs by training a neural network model based on the voice input prosodic metrics and training voice inputs |
US11568859B2 (en) * | 2020-08-31 | 2023-01-31 | Uniphore Software Systems, Inc. | Method and apparatus for extracting key information from conversational voice data |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003255990A (ja) * | 2002-03-06 | 2003-09-10 | Sony Corp | Dialogue processing device and method, and robot device |
JP2017102247A (ja) * | 2015-12-01 | 2017-06-08 | 国立研究開発法人産業技術総合研究所 | Voice dialogue system, voice dialogue control method, and program |
JP2017527926A (ja) * | 2014-07-03 | 2017-09-21 | マイクロソフト テクノロジー ライセンシング,エルエルシー | Generation of computer responses to social conversation inputs |
JP2018041124A (ja) * | 2016-09-05 | 2018-03-15 | 株式会社Nextremer | Dialogue control device, dialogue engine, management terminal, dialogue device, dialogue control method, dialogue method, and program |
JP2018132704A (ja) * | 2017-02-16 | 2018-08-23 | トヨタ自動車株式会社 | Dialogue device |
Family Cites Families (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4980917A (en) * | 1987-11-18 | 1990-12-25 | Emerson & Stern Associates, Inc. | Method and apparatus for determining articulatory parameters from speech data |
US6224383B1 (en) * | 1999-03-25 | 2001-05-01 | Planetlingo, Inc. | Method and system for computer assisted natural language instruction with distracters |
US6883014B1 (en) * | 2000-10-19 | 2005-04-19 | Amacis Limited | Electronic message distribution |
US7398209B2 (en) * | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
JP2004086001A (ja) * | 2002-08-28 | 2004-03-18 | Sony Corp | Conversation processing device, conversation processing method, and computer program |
JP4623278B2 (ja) * | 2004-12-22 | 2011-02-02 | 日本電気株式会社 | Voice dialogue device, support device, generation device, voice dialogue method, support method, generation method, and program |
JP4826275B2 (ja) * | 2006-02-16 | 2011-11-30 | 株式会社豊田中央研究所 | Response generation device, method, and program |
CN101399590A (zh) * | 2007-09-27 | 2009-04-01 | 株式会社Ntt都科摩 | Feedback selection method and feedback selection device in a multi-user precoding system |
US20090209345A1 (en) * | 2008-02-14 | 2009-08-20 | Aruze Gaming America, Inc. | Multiplayer participation type gaming system limiting dialogue voices outputted from gaming machine |
JP5286062B2 (ja) | 2008-12-11 | 2013-09-11 | 日本電信電話株式会社 | Dialogue device, dialogue method, dialogue program, and recording medium |
US9576573B2 (en) * | 2011-08-29 | 2017-02-21 | Microsoft Technology Licensing, Llc | Using multiple modality input to feedback context for natural language understanding |
JP5611155B2 (ja) * | 2011-09-01 | 2014-10-22 | Kddi株式会社 | Program, server, and terminal for tagging content |
GB2513105A (en) * | 2013-03-15 | 2014-10-22 | Deepmind Technologies Ltd | Signal processing systems |
US20130326375A1 (en) * | 2013-08-07 | 2013-12-05 | Liveperson, Inc. | Method and System for Engaging Real-Time-Human Interaction into Media Presented Online |
US10262268B2 (en) * | 2013-10-04 | 2019-04-16 | Mattersight Corporation | Predictive analytic systems and methods |
JP6450138B2 (ja) * | 2014-10-07 | 2019-01-09 | 株式会社Nttドコモ | Information processing device and utterance content output method |
JP6604542B2 (ja) * | 2015-04-02 | 2019-11-13 | パナソニックIpマネジメント株式会社 | Dialogue method, dialogue program, and dialogue system |
US9953648B2 (en) * | 2015-05-11 | 2018-04-24 | Samsung Electronics Co., Ltd. | Electronic device and method for controlling the same |
CN106205611B (zh) * | 2016-06-29 | 2020-03-27 | 北京儒博科技有限公司 | Human-computer interaction method and system based on multimodal historical response results |
US20180060786A1 (en) * | 2016-08-30 | 2018-03-01 | Wipro Limited | System and Method for Allocating Tickets |
CN108153800B (zh) * | 2016-12-06 | 2023-05-23 | 松下知识产权经营株式会社 | Information processing method, information processing device, and recording medium |
EP4125029A1 (en) * | 2017-03-23 | 2023-02-01 | Samsung Electronics Co., Ltd. | Electronic apparatus, controlling method of thereof and non-transitory computer readable recording medium |
CN109417504A (zh) * | 2017-04-07 | 2019-03-01 | 微软技术许可有限责任公司 | Voice forwarding in automated chatting |
US10878198B2 (en) * | 2018-01-04 | 2020-12-29 | Facebook, Inc. | Intent arbitration for a virtual assistant |
Also Published As
Publication number | Publication date |
---|---|
US11222638B2 (en) | 2022-01-11 |
CN111192577B (zh) | 2023-10-13 |
US20200135197A1 (en) | 2020-04-30 |
CN111192577A (zh) | 2020-05-22 |
JP7063230B2 (ja) | 2022-05-09 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
2021-02-22 | A621 | Written request for application examination | JAPANESE INTERMEDIATE CODE: A621 |
2021-11-30 | A131 | Notification of reasons for refusal | JAPANESE INTERMEDIATE CODE: A131 |
2021-11-30 | A977 | Report on retrieval | JAPANESE INTERMEDIATE CODE: A971007 |
2021-12-09 | A521 | Request for written amendment filed | JAPANESE INTERMEDIATE CODE: A523 |
 | TRDD | Decision of grant or rejection written | |
2022-03-22 | A01 | Written decision to grant a patent or to grant a registration (utility model) | JAPANESE INTERMEDIATE CODE: A01 |
2022-04-04 | A61 | First payment of annual fees (during grant procedure) | JAPANESE INTERMEDIATE CODE: A61 |
 | R151 | Written notification of patent or utility model registration | Ref document number: 7063230; Country of ref document: JP; JAPANESE INTERMEDIATE CODE: R151 |