JP2021077268A

JP2021077268A - Information presentation device and information presentation method

Info

Publication number: JP2021077268A
Application number: JP2019205450A
Authority: JP
Inventors: 亮平波多野; Ryohei Hatano
Original assignee: Toppan Printing Co Ltd
Current assignee: Toppan Inc
Priority date: 2019-11-13
Filing date: 2019-11-13
Publication date: 2021-05-20
Anticipated expiration: 2039-11-13
Also published as: JP7395976B2

Abstract

To provide an information presentation system and an information presentation method capable of obtaining necessary information by utilizing past dialog history while using no direct keyword.SOLUTION: An information presentation system includes an action storage unit for storing a recall direction phrase included in a request utterance of a user obtained by performing interactive processing, implementation content indicating content in which the recall direction is changed and retrieved when a concept indicating a recall direction corresponding to the recall direction phrase is changed in order to obtain reply content to the request utterance, and feedback indicating whether or not the reply content is satisfactory as information desired by the user; and a recall direction management unit for calculating a degree of reliability indicating a degree of reliability for the recall direction that the recall direction phrase of the user to obtain information including the reply content desired by the user based on the recall direction phrase, implementation content stored in the action storage unit, and the feedback.SELECTED DRAWING: Figure 1

Description

本発明の実施形態は、情報提示装置、情報提示方法に関する。 An embodiment of the present invention relates to an information presenting device and an information presenting method.

近年、インターネット環境の充実により、ソーシャル・ネットワーク・サービス（以下、ＳＮＳと示す）が普及し、テキストや画像を用いて複数のユーザ間において簡易に意思疎通を行うことが可能となっている。例えば、ＳＮＳのアプリケーションとしては、ＬＩＮＥ（登録商標）、Ｆａｃｅｂｏｏｋ（登録商標）メッセンジャー、Ｓｌａｃｋ（登録商標）などが代表的である。これらのＳＮＳは、一対一のユーザ間の情報のやり取りだけでなく、所定のグループにおける多人数のユーザ間で送受信する情報（複数のユーザ間における対話）を、グループ内の全てのユーザで共有する機能も有している。 In recent years, with the enhancement of the Internet environment, social network services (hereinafter referred to as SNS) have become widespread, and it has become possible to easily communicate between a plurality of users using texts and images. For example, typical SNS applications include LINE (registered trademark), Facebook (registered trademark) messenger, and Slack (registered trademark). These SNSs not only exchange information between one-to-one users, but also share information transmitted and received between a large number of users in a predetermined group (dialogue between a plurality of users) among all users in the group. It also has a function.

また、マン・マシン対話型のＳＮＳとしては、ＧｏｏｇｌｅＡｓｓｉｓｔａｎｔ（登録商標）、ＡｍａｚｏｎＡｌｅｘａ（登録商標）、ＬｉｎｅＣｌｏｖａ（登録商標）などがある。
また、上述したアプリケーションの各々が、パーソナルコンピュータ及びスマートデバイスや、ＧｏｏｇｌｅＨｏｍｅ（登録商標）、ＡｍａｚｏｎＥｃｈｏ（登録商標）、ＣｌｏｖａＷａｖｅ（登録商標）などのスマートスピーカに搭載され、それぞれにおいて音声合成されて、音声を用いた情報提示を主体としたものも広く利用されている。 Further, examples of the man-machine interactive SNS include Google Assistant (registered trademark), Amazon Alexa (registered trademark), and Line Clova (registered trademark).
In addition, each of the above-mentioned applications is installed in personal computers and smart devices, and smart speakers such as Google Home (registered trademark), Amazon Echo (registered trademark), and Clova Wave (registered trademark), and voice synthesis is performed in each of them. , Information presentation using voice is also widely used.

マン・マシン対話型ＳＮＳについては、様々なサービスが展開されている。活用方法についてもユーザとシステムが一対一の状況だけでなく、不特定多数のユーザ対システムという状況で使用されることも多い（旅行計画時に代理店に行くことなく、対話サービスで予約、時刻表検索、保険プランの照会などが可能など）。 Various services are being developed for man-machine interactive SNS. Regarding the usage method, it is often used not only in the situation where the user and the system are one-on-one, but also in the situation where there is an unspecified number of users vs. the system (reservation and timetable by interactive service without going to the agency when planning a trip You can search, inquire about insurance plans, etc.).

マン・マシン対話型のＳＮＳにおいては、ユーザとシステムの対話を実現するために、ユーザからの要求発話に対してどのような返答発話をするかについて、予め対話の流れを想定して規則を構築するルールベース手法を用いられることが多い。 In the man-machine interactive SNS, in order to realize the dialogue between the user and the system, rules are constructed in advance assuming the flow of dialogue regarding what kind of response utterance is to be made in response to the request utterance from the user. Rule-based methods are often used.

対話規則の構築には様々な工夫がされており、手作業による規則構築のコスト低減を目的として、オントロジなど詳細な辞書の代替としてラベル付された対話セットを利用する方法が提案されている（特許文献１）。 Various ideas have been devised for constructing dialogue rules, and a method of using a labeled dialogue set as an alternative to a detailed dictionary such as an ontology has been proposed for the purpose of reducing the cost of manually constructing rules (ontology). Patent Document 1).

対話の質を向上させることを目的として対話規則の生成を補助する技術として、ユーザが対話中に発した言葉の属性（ポジティブやネガティブなどの評価情報）やユーザの属性を基に、返答発話の提示内容をカスタマイズするための手法が提案されている（特許文献２）。 As a technology to assist the generation of dialogue rules for the purpose of improving the quality of dialogue, response utterances are based on the attributes of words (evaluation information such as positive and negative) uttered by the user during the dialogue and the attributes of the user. A method for customizing the presented content has been proposed (Patent Document 2).

特許第４７５５４７８号公報Japanese Patent No. 4755478 特許第６３８１７７５号公報Japanese Patent No. 6381775

しかしながら、従来の対話型ＳＮＳにおいては、ユーザが過去の任意の情報を参照したい場合に、過去の全ての情報を閲覧したり、検索が可能な場合でも、キーワードを思い出せなかったり、グループ対話などで検索の契機となる共通認識が不明瞭で参照できないなど、直感的に内容を再生し辛いという問題がある。 However, in the conventional interactive SNS, when the user wants to refer to arbitrary information in the past, he / she can browse all the past information, and even if the search is possible, he / she cannot remember the keyword or in a group dialogue. There is a problem that it is difficult to intuitively reproduce the contents, such as the common recognition that triggers the search is unclear and cannot be referred to.

また、人同士の対話の場合、過去の対話の内容を参照する場合に、具体的な話の内容だけではなく、「いつ」「どこで」「だれと」「何について」などといった周囲の環境などの環境情報を用いて、会話を行うことも多く、これで対話が成立することも少なくない。 Also, in the case of dialogue between people, when referring to the content of past dialogue, not only the specific content of the story, but also the surrounding environment such as "when", "where", "who", "what", etc. Often, conversations are held using the environmental information of the above, and it is not uncommon for dialogues to be established.

一方、一般的な対話サービスの場合は、マン・マシン型では、不特定多数に向けた汎用的サービスを前提としているため、過去に得た情報を再度取り出そうとしても、所望の情報を得るには、所定の手順を踏まなければならず、前述のような環境情報などを用いた参照ができない。 On the other hand, in the case of a general dialogue service, the man-machine type is premised on a general-purpose service for an unspecified number of people, so even if an attempt is made to retrieve information obtained in the past, the desired information can be obtained. Must follow a predetermined procedure and cannot be referred to using the above-mentioned environmental information or the like.

マン・マシン対話の枠組みを利用し、新規対話規則を追加していくことで、効率的な情報把握が期待できるが、予め対話シナリオを想定しておくルールベース手法などがあるものの、手作業でのメンテナンスを行う部分が多く、作業量の面からかなり困難である。 Efficient information grasping can be expected by adding new dialogue rules using the framework of man-machine dialogue, but although there are rule-based methods that assume dialogue scenarios in advance, it is done manually. There are many parts to be maintained, which is quite difficult in terms of the amount of work.

メッセージアプリにしても、マン・マシン型のようなしくみで情報を取り出せることがインタフェースの面から見ても好ましい。 Even in a message application, it is preferable from the interface point of view that information can be retrieved by a mechanism like a man-machine type.

本発明は、このような事情に鑑みてなされたもので、その目的は、直接的なキーワードを利用しなくても、過去の対話履歴を活用し、必要な情報を得ることができる情報提示装置、情報提示方法を提供することにある。 The present invention has been made in view of such circumstances, and an object of the present invention is an information presenting device capable of obtaining necessary information by utilizing past dialogue history without using direct keywords. , To provide a method of presenting information.

上述した課題を解決するために、本発明の一態様は、利用者の対話履歴と環境情報とに基づく記憶想起の志向を表す想起志向を基に、対話規則構築および対話制御が可能な情報提示システムであり、利用者が利用する端末装置に対して対話処理を行う対話管理部と、対話の履歴である対話履歴を蓄積する対話履歴記憶部と、前記対話を行った環境と当該対話に含まれる想起志向フレーズと想起志向とを蓄積する環境記憶部と、前記対話処理を行うことで得られる利用者の要求発話に含まれる想起志向フレーズと、前記要求発話に対する返答内容を得るために前記想起志向フレーズに対応する想起志向が示す概念を変更した場合に当該想起志向を変更して探索した内容を表す実施内容と、前記返答内容が前記利用者の欲しい情報として満足できたか否かを示すフィードバックと、を記憶する行動記憶部と、前記行動記憶部に記憶された想起フレーズと、前記実施内容と、前記フィードバックとを基に、前記利用者の想起志向フレーズが示す想起志向が前記利用者の欲しい返答内容を含む情報を得るために信頼できる度合いを示す信頼度を算出する想起志向管理部と、を備える情報提示システムである。 In order to solve the above-mentioned problems, one aspect of the present invention presents information capable of constructing dialogue rules and controlling dialogue based on the recall orientation that expresses the intention of memory recall based on the dialogue history of the user and the environmental information. Included in the dialogue management unit that is a system and performs dialogue processing on the terminal device used by the user, the dialogue history storage unit that stores the dialogue history that is the dialogue history, the environment in which the dialogue was performed, and the dialogue. The environment storage unit that stores the recall-oriented phrases and the recall-oriented words, the recall-oriented phrases included in the user's request speech obtained by performing the dialogue processing, and the recall content in order to obtain the response contents to the request speech. When the concept indicated by the recall orientation corresponding to the orientation phrase is changed, the implementation content indicating the content searched by changing the recall orientation and the feedback indicating whether or not the response content is satisfied as the information desired by the user. Based on the behavioral memory unit that stores the information, the recall phrase stored in the behavioral memory unit, the implementation content, and the feedback, the recall orientation indicated by the user's recall-oriented phrase is that of the user. It is an information presentation system equipped with a recall-oriented management unit that calculates the reliability indicating the degree of reliability in order to obtain information including the desired response content.

また、本発明の一態様は、前記想起志向管理部は、前記対話履歴記憶部に記憶された対話に含まれる想起志向フレーズに基づいて、前記対話の環境に類する環境情報に対応する想起志向フレーズを取得する。 Further, in one aspect of the present invention, the recall-oriented management unit responds to environmental information similar to the environment of the dialogue based on the recall-oriented phrase included in the dialogue stored in the dialogue history storage unit. To get.

また、本発明の一態様は、文章のテンプレートに前記対話履歴記憶部に記憶された対話内容に含まれる単語と前記環境記憶部に記憶された単語とを当てはめることで、対話規則を生成する対話規則構築部を有する。 Further, one aspect of the present invention is a dialogue that generates a dialogue rule by applying a word included in the dialogue content stored in the dialogue history storage unit and a word stored in the environment storage unit to a sentence template. It has a rule building department.

また、本発明の一態様は、前記対話管理部が、前記想起志向管理部により算出された想起志向に応じた要求発話に対する返答発話の決定を行う。 Further, in one aspect of the present invention, the dialogue management unit determines a response utterance to a request utterance according to the recall orientation calculated by the recall orientation management unit.

また、本発明の一態様は、前記環境記憶部は、メンバ、天候、場所、日時を含む周囲の環境情報、を蓄積する。 Further, in one aspect of the present invention, the environmental storage unit stores surrounding environmental information including members, weather, place, date and time.

また、本発明の一態様は、前記対話履歴記憶部に記憶された対話内容について、形態素解析、キーワード抽出、ベクトル化うちいずれかの手法を用いることで対話内容に応じた数値データを求める解析部とを有する。 Further, one aspect of the present invention is an analysis unit that obtains numerical data according to the dialogue content by using any of morphological analysis, keyword extraction, and vectorization for the dialogue content stored in the dialogue history storage unit. And have.

また、本発明の一態様は、利用者の年齢、性別、居住地を含むデモグラフィックデータと、想起志向とを記憶する属性記憶部を有し、前記想起志向管理部は、前記属性記憶部に記憶された情報を用いて、利用者が求める回答が得られるように要求発話の想起志向フレーズを変更することで、前記想起志向の最適化処理をする。 Further, one aspect of the present invention has an attribute storage unit that stores demographic data including the age, gender, and place of residence of the user, and the recall orientation, and the recall orientation management unit is stored in the attribute storage unit. The recall-oriented optimization process is performed by changing the recall-oriented phrase of the request utterance so that the answer requested by the user can be obtained by using the stored information.

また、本発明の一態様は、前記想起志向管理部は、前記想起志向に対する信頼度を、属性記憶部に対してユーザ毎に書き込み記憶させる。 Further, in one aspect of the present invention, the recall-oriented management unit writes and stores the reliability of the recall-oriented in the attribute storage unit for each user.

また、本発明の一態様は、前記想起志向管理部は、前記行動記憶部に記憶された探索結果とフィードバックとを基に、前記想起志向に対する信頼度を更新する。 Further, in one aspect of the present invention, the recall-oriented management unit updates the reliability of the recall-oriented based on the search result and feedback stored in the behavior memory unit.

また、本発明の一態様は、利用者の特徴を基に組分けを行うグルーピング推定部と、前記グルーピング推定部によってグルーピングされた結果を蓄積するグルーピング記憶部と、を有し、前記モデル構築部は、前記グルーピング記憶部の情報に記憶されたグルーピングの結果を基に、変更対象の利用者の想起志向が共通するユーザの、想起志向を推定するモデルを用いる。 Further, one aspect of the present invention includes a grouping estimation unit that groups based on the characteristics of the user, and a grouping storage unit that stores the results grouped by the grouping estimation unit, and the model construction unit. Uses a model for estimating the recall orientation of users who share the same recall orientation of the users to be changed, based on the grouping result stored in the information of the grouping storage unit.

また、本発明の一態様は、前記グルーピング記憶部から、前記利用者に共通する想起志向により、対話規則構築および対話管理それぞれが変更されたモデルを用いる。 Further, one aspect of the present invention uses a model in which dialogue rule construction and dialogue management are changed from the grouping storage unit according to the recall orientation common to the users.

また、本発明の一態様は、前記想起志向管理部は、前記変更モデルが用意されていない、あるいは不十分なユーザに対し、当該ユーザと類似した傾向を持つモデルを用いて、対話規則構築と対話管理を最適化するためのモデルを生成する。 Further, in one aspect of the present invention, the recall-oriented management unit constructs a dialogue rule for a user whose modification model is not prepared or insufficient by using a model having a tendency similar to that of the user. Generate a model for optimizing dialogue management.

また、本発明の一態様は、応答内容を含む情報を加工することでコンテンツを生成する合成部を有する。 Further, one aspect of the present invention has a synthesis unit that generates content by processing information including response content.

また、本発明の一態様は、前記合成部は、送信先の端末装置に応じて、異なる加工をしたコンテンツを生成する構を有する。 Further, in one aspect of the present invention, the synthesis unit has a structure for generating contents that have been processed differently depending on the terminal device of the transmission destination.

また、本発明の一態様は、利用者の対話履歴と環境情報とに基づく記憶想起の志向を表す想起志向を基に、対話規則構築および対話制御が可能な情報提示システムにおける情報提示方法であり、前記情報提示システムは、対話の履歴である対話履歴を蓄積する対話履歴記憶部と、前記対話を行った環境と当該対話に含まれる想起志向フレーズと想起志向とを蓄積する環境記憶部と、対話処理を行うことで得られる利用者の要求発話に含まれる想起志向フレーズと、前記要求発話に対する返答内容を得るために前記想起志向フレーズに対応する想起志向が示す概念を変更した場合に当該想起志向を変更して探索した内容を表す実施内容と、前記返答内容が前記利用者の欲しい情報として満足できたか否かを示すフィードバックと、を記憶する行動記憶部と、を有し、対話管理部が、利用者が利用する端末装置に対して対話処理を行い、想起志向管理部が、前記行動記憶部に記憶された想起フレーズと、前記実施内容と、前記フィードバックとを基に、前記利用者の想起志向フレーズが示す想起志向が前記利用者の欲しい返答内容を含む情報を得るために信頼できる度合いを示す信頼度を算出する情報提示方法である。 Further, one aspect of the present invention is an information presentation method in an information presentation system capable of constructing dialogue rules and controlling dialogue based on a recall orientation that expresses a memory recall orientation based on a user's dialogue history and environmental information. The information presentation system includes a dialogue history storage unit that stores dialogue history, which is the history of dialogue, an environment storage unit that stores the environment in which the dialogue was conducted, and recall-oriented phrases and recall-oriented phrases included in the dialogue. When the recall-oriented phrase included in the user's request utterance obtained by performing the dialogue processing and the concept indicated by the recall-oriented phrase corresponding to the recall-oriented phrase are changed in order to obtain the response content to the request utterance, the recall is concerned. It has an action storage unit that stores an implementation content that indicates the content searched by changing the intention and a feedback indicating whether or not the response content is satisfied as the information desired by the user, and is a dialogue management unit. However, the user performs an interactive process on the terminal device used by the user, and the recall-oriented management unit uses the recall phrase stored in the action storage unit, the implementation content, and the feedback. This is an information presentation method for calculating the reliability indicating the degree to which the recall-oriented phrase indicated by the recall-oriented phrase can be trusted in order to obtain information including the response content desired by the user.

本発明では対話規則構築に着目し、記憶想起を促すあるいは曖昧検索に資する情報を対話規則構築に織り込むことで、過去の対話履歴からの情報参照の効率化を目指すものである。具体的には、対話・検索履歴やそれに含まれるグループ内での共通認識に基づくワード、記憶を想起させるための環境情報（天気、部屋など）、を用いて対話規則の構築を行うとともに、ユーザごとあるいはユーザ同士の傾向を比較・分析した結果などを基に、および記憶の想起に関する志向を算出し、前記、対話規則の構築および対話の制御を可能とするものである。 The present invention focuses on the construction of dialogue rules, and aims to improve the efficiency of information reference from the past dialogue history by incorporating information that promotes memory recall or contributes to ambiguous search into the construction of dialogue rules. Specifically, the dialogue rules are constructed using the dialogue / search history, words based on the common recognition within the group included in the dialogue / search history, and environmental information (weather, room, etc.) for reminding the memory, and the user. Based on the results of comparing and analyzing the tendencies of each user or between users, and by calculating the orientation regarding memory recall, it is possible to construct the dialogue rules and control the dialogue.

本発明の一態様としては、想起志向管理部において後述する各種記憶手段により記憶部に蓄積された情報のいずれかあるいは組み合わせで、ユーザの想起志向を推定するためのパラメータを決定することを特徴とする。 One aspect of the present invention is characterized in that, in the recall orientation management unit, a parameter for estimating the recall orientation of the user is determined by any or a combination of the information stored in the storage unit by various storage means described later. To do.

本発明の一態様としては、前記想起志向管理部は、新たな情報の追加に伴い自然言語処理や機械学習、ニューラルネットワーク、強化学習などの枠組みを用いて学習を行い、推定アルゴリズムを最適化する機能を備える（機械学習を行い定式化に必要なパラメータの最適化を行える）。 In one aspect of the present invention, the recall-oriented management unit performs learning using a framework such as natural language processing, machine learning, neural network, and reinforcement learning with the addition of new information, and optimizes the estimation algorithm. Equipped with functions (machine learning can be performed to optimize the parameters required for formulation).

本発明の一態様としては、対話規則構築部は想起志向管理部により算出された想起志向および記憶部の情報に応じて対話規則の構築を行うことを特徴とする。 One aspect of the present invention is characterized in that the dialogue rule building unit constructs a dialogue rule according to the information of the recall-oriented and storage unit calculated by the recall-oriented management unit.

本発明の一態様としては、対話管理部は想起志向管理部により算出された想起志向に応じて要求対話に対する返答発話の決定を行うことを特徴とする。 One aspect of the present invention is characterized in that the dialogue management unit determines the response utterance to the request dialogue according to the recall orientation calculated by the recall orientation management unit.

（環境記憶部にて取得する情報および特徴量に関する記述）
本発明の一態様としては、前記環境記憶部は、対話を参加したメンバや、日時、場所、天候、気温などの対話以外の情報（対話の内容ごとに変化するもの）を記憶することを特徴とする。 (Description of information and features acquired by the environmental storage unit)
One aspect of the present invention is characterized in that the environmental storage unit stores information other than the dialogue (which changes depending on the content of the dialogue) such as the members who participated in the dialogue and the date and time, place, weather, and temperature. And.

本発明の一態様としては、前記環境記憶部の内容は特徴量として数値や記号に変換し、単体、あるいは組み合わせなど異なる情報をかけ合わせた形式へ加工し保持することも可能とする。 As one aspect of the present invention, the contents of the environmental storage unit can be converted into numerical values or symbols as feature quantities, processed into a format in which different information such as a single substance or a combination is combined, and held.

（対話履歴記憶部にて取得する情報および特徴量に関する記述）
本発明の一態様としては、前記対話履歴記憶部は、ユーザのシステム上での対話内容などを記憶することを特徴とする。 (Description of information and features acquired in the dialogue history storage unit)
One aspect of the present invention is characterized in that the dialogue history storage unit stores dialogue contents and the like on the user's system.

本発明の一態様としては、前記対話履歴記憶部の対話内容は共起や類似度などを表す数値や記号に変換し用いることも可能とする（tf-idfやword2vecなどの公知の技術を用いる）。 As one aspect of the present invention, the dialogue content of the dialogue history storage unit can be converted into numerical values or symbols representing co-occurrence, similarity, etc. and used (using known techniques such as tf-idf and word2vec). ).

本発明の一態様としては、前記対話履歴記憶部では、当該システムを介して行った対話全てを扱い、想起志向を用いて構築した対話規則を利用した対話履歴については、対話内容に加え、事前に設定した過去の対話を参照する際に用いた（後述する環境情報に関連する）フレーズに合致あるいは類似するものを記録、または検知回数をトータルまたは、要求内容ごとに記憶することを特徴とする。 As one aspect of the present invention, the dialogue history storage unit handles all the dialogues performed through the system, and the dialogue history using the dialogue rules constructed by using the recall orientation is obtained in advance in addition to the dialogue contents. It is characterized in that it records a phrase that matches or is similar to the phrase (related to the environmental information described later) used when referring to the past dialogue set in, or stores the total number of detections or each request content. ..

（属性記憶部にて取得する情報および特徴量に関する記述）
本発明の一態様としては、前記属性記憶部は、ユーザの年齢、性別、所在地を始めとしたデモグラフィックデータといった対話以外の利用者属性情報（特に、対話によって変化しないもの）を保持することを特徴とする。 (Description of information and features acquired in the attribute storage unit)
As one aspect of the present invention, the attribute storage unit holds user attribute information other than dialogue (particularly, one that does not change due to dialogue) such as demographic data such as age, gender, and location of the user. It is a feature.

本発明の一態様としては、前記属性記憶部は、上記の情報に加え想起志向管理部にて算出したパラメータを属性記憶部にユーザごとに書き込み記憶させることを特徴とする。 As one aspect of the present invention, the attribute storage unit is characterized in that, in addition to the above information, parameters calculated by the recall-oriented management unit are written and stored in the attribute storage unit for each user.

（行動記憶部での取得する情報および特徴量に関する記述）
本発明の一態様としては、前記行動記憶部は、システムとユーザの双方の振る舞いを記録することを特徴とする。 (Description of information and features acquired by the behavioral memory unit)
One aspect of the present invention is characterized in that the behavior storage unit records the behavior of both the system and the user.

本発明の一態様としては、前記行動記憶部は後述する想起志向の推定結果を基に、構築した対話規則および、具体的な実施内容（要求発話と返答発話の紐付け、勘違いの程度を推定して提示したなど）を記録することを特徴とする。 As one aspect of the present invention, the behavioral memory unit estimates the dialogue rules constructed and the specific implementation contents (association of request utterance and response utterance, degree of misunderstanding) based on the estimation result of recall orientation described later. It is characterized by recording the presentation.

本発明の一態様としては、前記行動記憶部は想起志向の推定結果に応じた対話処理の結果に対するユーザの評価（システム上で「あっていましたか？、はい／いいえ」などを選択させる明確な提示）を記憶することを特徴とする。 As one aspect of the present invention, the behavioral memory unit clearly allows the user's evaluation (“was it ?, yes / no” or the like selected on the system) for the result of the dialogue processing according to the recall-oriented estimation result. It is characterized by memorizing the presentation).

（グルーピング記憶部にて取得する情報および特徴量に関する記述）
本発明の一態様としては、前記グルーピング記憶部は、記憶手段の情報を基に、類似した傾向を持つユーザに対して、組分けを実現し記録することを特徴とする。 (Description of information and features acquired in the grouping storage unit)
One aspect of the present invention is characterized in that the grouping storage unit realizes and records grouping for users having similar tendencies based on the information of the storage means.

本発明の一態様としては、前記グルーピング記憶部から、ユーザに共通する想起志向により、対話規則構築および対話管理それぞれの変更を行う変更モデルをさらに備えることを特徴とする。 One aspect of the present invention is characterized in that the grouping storage unit is further provided with a change model for changing each of dialogue rule construction and dialogue management according to a recall orientation common to users.

本発明の一態様としては、前記想起志向管理部は、変更モデルが用意されていない、あるいは不十分なユーザに対し、類似した傾向を持つ変更モデルを抽出し、対話規則構築や対話管理を最適化するための変更モデルを生成することを特徴とする。 As one aspect of the present invention, the recall-oriented management unit extracts a change model having a similar tendency for a user whose change model is not prepared or is insufficient, and optimizes dialogue rule construction and dialogue management. It is characterized by generating a modification model for optimization.

本発明の一態様としては、前記提示制御部は、前記想起志向管理部で導出した推定内容を基に対話管理部が決定した応答発話に応じた、提示方法を変更することを特徴とする。 One aspect of the present invention is characterized in that the presentation control unit changes the presentation method according to the response utterance determined by the dialogue management unit based on the estimated contents derived by the recall-oriented management unit.

本発明の一態様としては、前記提示制御部の情報を基にユーザへの提示形態を変更するものであり、提示制御部はシステムに合成部に加え、ユーザに視覚的な情報を提示できる、可視化部が存在する場合は、表示の切り替えを行うことを特徴とする。 One aspect of the present invention is to change the presentation form to the user based on the information of the presentation control unit, and the presentation control unit can present visual information to the user in addition to the synthesis unit to the system. If there is a visualization unit, the display is switched.

また、音声合成処理を介さずにユーザへの提示を行うことも可能とする。 It is also possible to present to the user without going through the voice synthesis process.

以上説明したように、この発明によれば、直接的なキーワードを利用しなくても、過去の対話履歴を活用し、必要な情報を得ることができる。 As described above, according to the present invention, it is possible to obtain necessary information by utilizing the past dialogue history without using a direct keyword.

また、本発明によれば、先の仕組みを用いることで、固有名詞などの直接的な表現を用いずに、代名詞や記憶を想起するワードを基に所望の情報を取り出し、対話を行うことができる。これにより、過去の対話履歴の効率的な参照が可能となる。 Further, according to the present invention, by using the above mechanism, it is possible to extract desired information based on a word that recalls a pronoun or a memory and perform a dialogue without using a direct expression such as a proper noun. it can. This enables efficient reference of past dialogue history.

また、実際の環境情報などを基にした想起による参照の履歴や、システムの提示に対するユーザからの評価を用いて逐次、ユーザが想起に用いる環境情報の傾向に加え、対話で頻出する勘違いや記憶違いの傾向を考慮した想起志向管理部の最適化が行えるとともに、ユーザ同士の想起志向を比較・分析することで、情報が少ないユーザに対しても想起志向を適用した処理を提供できる。 In addition, in addition to the history of references by recall based on actual environmental information and the tendency of environmental information that the user uses for recall, using the evaluation from the user for the presentation of the system, misunderstandings and memories that frequently occur in dialogue. The recall-oriented management unit can be optimized in consideration of the tendency of difference, and by comparing and analyzing the recall-oriented between users, it is possible to provide a process to which the recall-oriented is applied even to a user with little information.

本発明の実施形態に係る情報提示システム１の構成例を示すブロック図A block diagram showing a configuration example of the information presentation system 1 according to the embodiment of the present invention. 本発明の実施形態に係る端末装置１０の構成例を示すブロック図A block diagram showing a configuration example of the terminal device 10 according to the embodiment of the present invention. 情報提示装置２０へ送信されるデジタルデータの構成例を示す図The figure which shows the structural example of the digital data transmitted to the information presenting apparatus 20 端末装置１０における提示形態のイメージを示す図The figure which shows the image of the presentation form in a terminal apparatus 10. 端末装置１０における提示形態のイメージを示す図The figure which shows the image of the presentation form in a terminal apparatus 10. 情報提示装置２０の構成例を示す図The figure which shows the configuration example of the information presenting apparatus 20 対話管理部２１２の構成例を示す図Diagram showing a configuration example of the dialogue management unit 212 対話履歴記憶部のデータ構成例を示す図The figure which shows the data structure example of the dialogue history storage part 環境記憶部のデータ構成例を示す図The figure which shows the data structure example of the environment storage part 属性記憶部のデータ構成例を示す図The figure which shows the data structure example of the attribute storage part 行動記憶部のデータ構成例を示す図The figure which shows the data structure example of the action memory part グルーピング記憶部のデータ構成例を示す図The figure which shows the data structure example of a grouping storage part 言語知識記憶部２２６の構成例を説明する図The figure explaining the configuration example of the language knowledge memory part 226 想起志向管理部２４０の構成例を示すブロック図Block diagram showing a configuration example of the recall-oriented management unit 240 モデル構築部２４２の最適化の例を説明する図The figure explaining the example of the optimization of the model building part 242 対話規則構築部２５０の構成例を示すブロック図Block diagram showing a configuration example of the dialogue rule construction unit 250 対話規則記憶部のデータ構成例を示す図The figure which shows the data structure example of the dialogue rule storage part システム動作を示すフローチャートFlowchart showing system operation

（実施形態）
以下、本発明の実施形態について、図面を参照して説明する。 (Embodiment)
Hereinafter, embodiments of the present invention will be described with reference to the drawings.

（情報提示システムの概観）
図１は、本発明の実施形態に係る情報提示システム１の構成例を示すブロック図である。 (Overview of information presentation system)
FIG. 1 is a block diagram showing a configuration example of the information presentation system 1 according to the embodiment of the present invention.

情報提示システム１は、人間同士もしくはコンピュータと人間で音声や画像、テキストなどを相互に提示することで意思疎通を図る対話システムを想定している。
情報提示システム１は、端末装置１０（端末装置１０ａ、端末装置１０ｂ、端末装置１０ｃ、端末装置１０ｄ）と、情報提示装置２０（情報提示装置２０ａ、情報提示装置２０ｂ、情報提示装置２０ｃ、情報提示装置２０ｄ）と、通信ネットワーク３０を含んで構成される。 The information presentation system 1 envisions a dialogue system in which humans or computers and humans communicate with each other by presenting voices, images, texts, and the like.
The information presentation system 1 includes a terminal device 10 (terminal device 10a, terminal device 10b, terminal device 10c, terminal device 10d) and an information presentation device 20 (information presentation device 20a, information presentation device 20b, information presentation device 20c, information presentation). The device 20d) and the communication network 30 are included.

端末装置１０は、情報の送受信および提示を要求するユーザが使用する端末、例えば、汎用コンピュータ、パーソナルコンピュータ、タブレット型端末、スマートフォン、スマートスピーカ等である。情報提示システム１の端末装置１０は、ユーザからのメッセージを入力装置を介して取得し、当該メッセージを通信ネットワーク３０を介して情報提示装置２０へ送信する。具体的に、端末装置１０は、ユーザの発話による音声を、自らの端末装置１０が備える（マイクロフォンなどの）入力部によって取得する。端末装置１０は、取得した音声を音声データに変換する。端末装置１０は、自らの端末装置１０が備える送信部により、当該音声データを、通信ネットワーク３０を介して情報提示装置２０へ送信する。 The terminal device 10 is a terminal used by a user who requests transmission / reception and presentation of information, for example, a general-purpose computer, a personal computer, a tablet terminal, a smartphone, a smart speaker, or the like. The terminal device 10 of the information presentation system 1 acquires a message from the user via the input device, and transmits the message to the information presentation device 20 via the communication network 30. Specifically, the terminal device 10 acquires the voice uttered by the user by an input unit (such as a microphone) included in the terminal device 10 itself. The terminal device 10 converts the acquired voice into voice data. The terminal device 10 transmits the voice data to the information presenting device 20 via the communication network 30 by the transmission unit included in the terminal device 10.

図２は、本発明の実施形態に係る端末装置１０の構成例を示す図である。
端末装置１０は、入力部１１０、インタフェース部１２０、記憶部１３０、センサ部１４０、出力部１５０、通信部１６０、制御部１７０を含んで構成される。
入力部１１０は、ユーザからの要求を取得する。入力部１１０は、例えば、マイクロフォンを用いることができる。入力部１１０がマイクロフォンである場合、入力部１１０は、入力される音声に対応した音声信号を生成する。また、入力部１１０は、キーボードやマウス等の入力デバイスであってもよい。
インタフェース部１２０は、端末装置１０を操作する。
記憶部１３０は、インタフェース部１２０および外部制御端末などを用いて設定したパラメータを記憶する。
センサ部１４０は、ＧＮＳＳ（Global Navigation Satellite System）位置情報や室温などの情報を取得する。
出力部１５０は、情報提示装置２０から取得した情報を出力する。出力部１５０としては、例えば、液晶表示装置やスピーカ等であり、各種データやコンテンツなどを出力することができる。これにより、ユーザに対して、各種情報が提示される。
通信部１６０は、送信部１６１、受信部１６２を有し、外部の機器と通信を行う。例えば、通信部１６０は、入力部１１０から入力される情報に基づいて、問い合わせを行う要求情報を送信する。この要求情報は、質問や要求に対する応答を要求する情報である。例えば要求情報は、「この前Aさんと話した本の値段は？」のように質問を表すデータや、「この前と同じ冷房の温度にセットして」等、特定の機器に対する操作要求を表すデータである。
送信部１６１は、情報提示装置２０に各種情報を送信する。
受信部１６２は、情報提示装置２０から送信される各種情報を受信する。
制御部１７０は、端末装置１０内の各部を制御する。
制御部１７０は、入力部１１０によって生成された音声信号に対応する音声データを生成し、音声データを用いた要求情報を通信部１６０から送信させる。 FIG. 2 is a diagram showing a configuration example of the terminal device 10 according to the embodiment of the present invention.
The terminal device 10 includes an input unit 110, an interface unit 120, a storage unit 130, a sensor unit 140, an output unit 150, a communication unit 160, and a control unit 170.
The input unit 110 acquires a request from the user. For the input unit 110, for example, a microphone can be used. When the input unit 110 is a microphone, the input unit 110 generates an audio signal corresponding to the input voice. Further, the input unit 110 may be an input device such as a keyboard or a mouse.
The interface unit 120 operates the terminal device 10.
The storage unit 130 stores parameters set by using the interface unit 120, an external control terminal, or the like.
The sensor unit 140 acquires information such as GNSS (Global Navigation Satellite System) position information and room temperature.
The output unit 150 outputs the information acquired from the information presenting device 20. The output unit 150 is, for example, a liquid crystal display device, a speaker, or the like, and can output various data, contents, and the like. As a result, various information is presented to the user.
The communication unit 160 has a transmission unit 161 and a reception unit 162, and communicates with an external device. For example, the communication unit 160 transmits request information for making an inquiry based on the information input from the input unit 110. This request information is information that requests a response to a question or request. For example, the request information includes data that expresses a question, such as "What is the price of the book you talked to Mr. A last time?", Or an operation request for a specific device, such as "Set to the same cooling temperature as before." It is the data to represent.
The transmission unit 161 transmits various information to the information presenting device 20.
The receiving unit 162 receives various information transmitted from the information presenting device 20.
The control unit 170 controls each unit in the terminal device 10.
The control unit 170 generates voice data corresponding to the voice signal generated by the input unit 110, and causes the communication unit 160 to transmit request information using the voice data.

図３は、端末装置１０から情報提示装置２０へ送信されるデジタルデータ（収集データ）の構成例を示す図である。情報提示装置２０に送信されるデータには、入力部１１０を介してユーザから入力される、ユーザの要求を表すデータ、センサ部１４０によって得られたデータ、記憶部１３０に蓄積されたデータのうち、少なくともいずれか１つが含まれる。
また、情報提示装置２０に送信されるデータとしては、端末装置１０のＧＮＳＳ情報を基に、端末装置１０が存在する場所における天気予報のデータを、外部の天気情報サーバから天気予報ＡＰＩなどを用いて取得し、通信部１６０が情報提示装置２０に送信するようにしてもよい。上述の要求情報も、このデジタルデータの一部として用いられる。 FIG. 3 is a diagram showing a configuration example of digital data (collected data) transmitted from the terminal device 10 to the information presenting device 20. The data transmitted to the information presenting device 20 includes data representing a user's request input from the user via the input unit 110, data obtained by the sensor unit 140, and data stored in the storage unit 130. , At least one is included.
Further, as the data transmitted to the information presenting device 20, based on the GNSS information of the terminal device 10, the weather forecast data at the place where the terminal device 10 exists is used, and the weather forecast API or the like is used from an external weather information server. The communication unit 160 may acquire the data and transmit the data to the information presenting device 20. The above-mentioned requirement information is also used as a part of this digital data.

収集データは、時刻、ユーザＩＤ、発話データ、セッションＩＤ、位置情報、温度、設置場所、の情報を含む。
時刻は、発話された日時を示す。
ユーザＩＤは、ユーザを識別する情報であり、特に、端末装置１０を利用しているユーザを特定するユーザＩＤである。このユーザＩＤは、端末装置１０に割り当てられたＩＤを用いることができ、また、端末装置１０がスマートスピーカである場合には、このスマートスピーカを利用する際にユーザ登録された際のＩＤを用いることができる。
発話データは、ユーザから発話された内容に基づいて得られる発話内容を表す情報であり、メッセージとも称する。発話データは、例えば入力部１１０として用いられるマイクロフォンによってユーザの発話内容が入力された場合には、発話された内容を音声認識することで、音声データからテキストデータに変換することで、発話データが得られる。また、発話データは、音声データに基づくものだけでなく、入力部１１０が、キーボードやマウス等である場合には、入力部１１０から入力されるテキストデータや図（顔文字、絵文字、スタンプなど）を対話データとして用いることができる。
セッションＩＤは、１つの対話の開始から対話の終了まで１つの対話群とし、この対話群を識別する識別情報である。また、セッションが同じであれば、対話履歴記憶部２２１に記憶されるセッションＩＤと同じセッションＩＤが用いられる。
位置情報は、対話が行われた時点における、発話したユーザの位置を示す。ユーザの位置は、例えば、端末装置１０のＧＮＳＳ機能によって現在位置（例えば緯度及び経度）を特定し、この端末装置１０の位置を、ユーザの位置を示す情報として用いるようにしてもよい。例えば、端末装置１０がスマートフォンである場合、スマートフォンは一般に、ユーザによって携帯されるため、端末装置１０の位置をユーザの位置として設定するようにしてもよい。
温度は、端末装置１０の周囲の温度を示す。端末装置１０が室内において利用されている場合には室温を示し、屋外において利用されている場合には気温を示す。
設置場所は、端末装置１０が設置された場所あるいは主に利用される場所の名称である。例えば、端末装置１０がスマートスピーカである場合には、スマートスピーカが設置された場所を示す。 The collected data includes information such as time, user ID, utterance data, session ID, location information, temperature, and installation location.
The time indicates the date and time when the utterance was made.
The user ID is information that identifies the user, and in particular, is a user ID that identifies the user who is using the terminal device 10. As this user ID, the ID assigned to the terminal device 10 can be used, and when the terminal device 10 is a smart speaker, the ID at the time of user registration when using this smart speaker is used. be able to.
The utterance data is information representing the utterance content obtained based on the content uttered by the user, and is also referred to as a message. As for the utterance data, for example, when the user's utterance content is input by the microphone used as the input unit 110, the utterance data is converted from the voice data to the text data by recognizing the uttered content by voice. can get. The utterance data is not limited to voice data, and when the input unit 110 is a keyboard, mouse, or the like, text data or figures (emoticons, pictograms, stamps, etc.) input from the input unit 110 are used. Can be used as dialogue data.
The session ID is identification information that identifies this dialogue group as one dialogue group from the start of one dialogue to the end of the dialogue. If the sessions are the same, the same session ID as the session ID stored in the dialogue history storage unit 221 is used.
The position information indicates the position of the user who spoke at the time of the dialogue. For the user's position, for example, the current position (for example, latitude and longitude) may be specified by the GNSS function of the terminal device 10, and the position of the terminal device 10 may be used as information indicating the user's position. For example, when the terminal device 10 is a smartphone, since the smartphone is generally carried by the user, the position of the terminal device 10 may be set as the user's position.
The temperature indicates the ambient temperature of the terminal device 10. When the terminal device 10 is used indoors, it indicates a room temperature, and when it is used outdoors, it indicates an air temperature.
The installation location is the name of the location where the terminal device 10 is installed or the location where it is mainly used. For example, when the terminal device 10 is a smart speaker, it indicates a place where the smart speaker is installed.

図１に戻り、情報提示装置２０は、端末装置１０から取得した要求情報に対し返答情報を決定し、コンテンツを生成する。情報提示装置２０は、生成されたコンテンツを端末装置１０へ送信する。情報提示装置２０はサーバ装置であり、例えば、汎用コンピュータ、またはパーソナルコンピュータ等のいずれかを用いることができる。 Returning to FIG. 1, the information presenting device 20 determines the response information for the request information acquired from the terminal device 10 and generates the content. The information presenting device 20 transmits the generated content to the terminal device 10. The information presentation device 20 is a server device, and for example, either a general-purpose computer or a personal computer can be used.

情報提示装置２０は、受信した要求情報を解析し後述する対話管理部により返答を決定し、提示形態に応じてコンテンツを最適化しユーザへ提示する。情報提示装置２０は、端末装置１０から送信される要求情報が、音声データである場合には、その音声データを文字データに変換した上で対話管理部２１２により返答を決定する。 The information presenting device 20 analyzes the received request information, determines a response by the dialogue management unit described later, optimizes the content according to the presentation form, and presents the content to the user. When the request information transmitted from the terminal device 10 is voice data, the information presenting device 20 converts the voice data into character data and then determines a response by the dialogue management unit 212.

なお、本実施形態においては、情報提示装置２０が、端末装置１０から送信される音声データを文字データへ変換する場合について説明するが、これに限られない。例えば、端末装置１０が、音声データから文字データへ変換し、当該文字データを情報提示装置２０へ要求情報として送信するようにしてもよい。
または、情報提示装置２０が、端末装置１０から取得した音声データを、文字データに変換する変換機能を有する外部サーバへ送信し、当該外部のサーバから送信される文字データを受信することで文字データを取得するようにしてもよい。 In the present embodiment, the case where the information presenting device 20 converts the voice data transmitted from the terminal device 10 into character data will be described, but the present invention is not limited to this. For example, the terminal device 10 may convert the voice data into character data and transmit the character data to the information presenting device 20 as request information.
Alternatively, the information presenting device 20 transmits the voice data acquired from the terminal device 10 to an external server having a conversion function for converting the voice data into character data, and receives the character data transmitted from the external server to obtain the character data. May be obtained.

なお、前記実施形態において、端末装置１０が備えるマイクロフォンによってユーザからの情報提示要求が入力されるものとしたが、これに限られない。
例えば、人間の意図を伝えることができるデバイスまたは、センサであれば、マイクロフォン以外のデバイスまたはセンサを用いるようにしてもよい。例えば、音声の代わりに、文字情報がキーボードやタッチパネルによって入力されるような構成や、人間のジェスチャによって示される情報がカメラによって入力されるような構成を用いて、要求情報を入力するようにしてもよい。
上記以外にも、図（顔文字、絵文字、スタンプなど）や動画コンテンツをカメラによって入力部１１０から入力されるような構成であったり、センサを介さずとも事前に保存されている動画像コンテンツを用いるようにし、要求情報を入力してもよい。例えば、動画コンテンツを要求情報として入力する場合には、「この前Aさんと話した本の値段は？」等の発話シーンを含む動画コンテンツを入力することができる。これにより、動画コンテンツに含まれる発話内容を要求情報として抽出するようにしてもよい。また、例えば、絵文字を要求情報として入力する場合には、その絵文字を表示させるための単語を要求情報として用いるようにしてもよい。例えば、ある絵文字が「よろしく」という文字に対応する絵文字である場合には、その絵文字が要求情報として入力された場合には「よろしく」という要求情報が入力されたものとして情報を取得するようにすることができる。また、ある絵文字が「本」を表す絵文字である場合には、その絵文字が要求情報として入力された場合には、「本」という要求情報が入力されたものとして情報を取得することができる。ここでは、少なくとも１つの絵文字等の画像のみが要求情報として入力されてもよいし、画像及び音声またはテキストデータが組み合わされて１つの要求情報として入力されてもよい。 In the above embodiment, the information presentation request from the user is input by the microphone included in the terminal device 10, but the present invention is not limited to this.
For example, if it is a device or sensor capable of transmitting a human intention, a device or sensor other than a microphone may be used. For example, the request information is input by using a configuration in which character information is input by a keyboard or a touch panel instead of voice, or a configuration in which information indicated by a human gesture is input by a camera. May be good.
In addition to the above, there is a configuration in which figures (emoticons, pictograms, stamps, etc.) and video content are input from the input unit 110 by the camera, and moving image content saved in advance without using a sensor. You may use it and enter the request information. For example, when inputting video content as request information, it is possible to input video content including an utterance scene such as "What is the price of the book you talked to Mr. A last time?". As a result, the utterance content included in the moving image content may be extracted as request information. Further, for example, when a pictogram is input as request information, a word for displaying the pictogram may be used as request information. For example, if a certain pictogram is a pictogram corresponding to the character "regards", and if the pictogram is input as request information, the information is acquired assuming that the request information "regards" is input. can do. Further, when a certain pictogram is a pictogram representing a "book" and the pictogram is input as request information, the information can be acquired as if the request information "book" was input. Here, only an image such as at least one pictogram may be input as request information, or an image and voice or text data may be combined and input as one request information.

また、上述の実施形態において、通信ネットワーク３０を介して端末装置１０と情報提示装置２０の処理をわけているが、これに限られない。
例えば、端末装置１０の中に情報提示装置２０の一部あるいは全て（ネットワークを介さずにシステムを実現できるように）を含むような構成であってもよい。 Further, in the above-described embodiment, the processing of the terminal device 10 and the information presenting device 20 is separated via the communication network 30, but the processing is not limited to this.
For example, the terminal device 10 may be configured to include a part or all of the information presenting device 20 (so that the system can be realized without going through a network).

通信ネットワーク３０は、例えばインターネットやLAN (Local Area Network) などである。 The communication network 30 is, for example, the Internet or a LAN (Local Area Network).

なお、本実施形態においては、情報提示システム１は人間同士の会話や人間とコンピュータの会話などあらゆる対話型のシステム上で動作するものである。例えば、自動応答システム、質問応答システム、対話システムの一種であるコンピュータが応答内容を規則に基づき抽出または生成するチャットボットを含む会話としてもよい。またネットワークを介して人間同士が対話を実現するLINEやFacebook Messengerのような会話を本実施形態の対象としてもよい。 In the present embodiment, the information presentation system 1 operates on any interactive system such as a conversation between humans and a conversation between a human and a computer. For example, it may be a conversation including a chatbot in which a computer, which is a kind of an automatic answering system, a question answering system, and a dialogue system, extracts or generates a response content based on a rule. In addition, conversations such as LINE and Facebook Messenger, in which humans realize dialogues via a network, may be the target of this embodiment.

LINEやチャットボットなどを含むネットワークを介した対話システムの詳細については、既存技術を用いることでできるため、詳細な説明は割愛する。 Details of the dialogue system via the network including LINE and chatbots can be explained by using existing technology, so detailed explanations are omitted.

（端末装置１０による提示例）
次に、想起志向を用いて構築した対話規則を基にした提示の一例について説明する。
図４は、本発明の実施形態に係る端末装置１０における提示の一例を示す図である。尚、図４は音声でのやり取りを説明のために可視化した概念図である。向かって左が端末装置であり、右がユーザであり、そのユーザが、端末装置に対して発話している状況を表している。 (Presentation example by terminal device 10)
Next, an example of presentation based on the dialogue rules constructed using the recall orientation will be described.
FIG. 4 is a diagram showing an example of presentation in the terminal device 10 according to the embodiment of the present invention. Note that FIG. 4 is a conceptual diagram that visualizes the exchange by voice for the purpose of explanation. The left side is the terminal device, the right side is the user, and the user is speaking to the terminal device.

図４Ａは、固有名詞（例えば「書籍Ａ」）をユーザＵｎが発話することによって指定して情報を取り出す場合を示す。この図において、端末装置１０００は、ユーザＵｎから固有名詞を含む音声入力に基づいて、質問に対する回答を外部に接続されるサーバから得て出力する。
図４Ｂは、日時（いつ）、場所（どこで）、人（だれと）の３つの項目を環境情報として用いて、対話を実現する場合を示す。この図に示すように、本実施形態における端末装置１０ａは、ユーザＵａから発話された内容に、固有名詞（例えば「書籍Ａ」）が含まれていなかったとしても、返答内容を導き出し、出力することができる。 FIG. 4A shows a case where a user Un specifies a proper noun (for example, “book A”) by uttering it to extract information. In this figure, the terminal device 1000 obtains and outputs an answer to a question from a server connected to the outside based on a voice input including a proper noun from a user Un.
FIG. 4B shows a case where a dialogue is realized by using three items of date and time (when), place (where), and person (who) as environmental information. As shown in this figure, the terminal device 10a in the present embodiment derives and outputs the response content even if the content uttered by the user Ua does not include a proper noun (for example, "book A"). be able to.

なお、前記実施形態においては音声対話を前提とした実施例を説明したが、これに限らず、端末装置１０が音声以外の情報を提示（出力）できる場合、システム応答の把握および理解を促進するためコンテンツを提示するようにしてもよい。実施方法は多岐に渡り、単純な方法としては読み上げ内容と同一の内容のテキストを表示したり、過去の音声や、動画像など関連するものを提示（音声の出力または画像の表示）するといった方法をとってもよい。 In the above embodiment, an embodiment premised on voice dialogue has been described, but the present invention is not limited to this, and when the terminal device 10 can present (output) information other than voice, it promotes grasping and understanding of the system response. Therefore, the content may be presented. There are various implementation methods, and as a simple method, a text with the same content as the read content is displayed, or a related item such as past voice or moving image is presented (voice output or image display). You may take.

（情報提示装置２０の構成例）
図５は、本発明の実施形態に係る情報提示装置２０の構成例を示す図である。情報提示装置２０は、情報処理部２１０、情報記憶部２２０、グルーピング推定部２３１、想起志向管理部２４０、対話規則構築部２５０を含んで構成される。 (Configuration example of information presentation device 20)
FIG. 5 is a diagram showing a configuration example of the information presentation device 20 according to the embodiment of the present invention. The information presentation device 20 includes an information processing unit 210, an information storage unit 220, a grouping estimation unit 231, a recall-oriented management unit 240, and a dialogue rule construction unit 250.

情報処理部２１０は、取得部２１１、対話管理部２１２、提示制御部２１３、合成部２１４を含んで構成され、これらを通じて端末装置１０の提示形態を制御する。 The information processing unit 210 includes an acquisition unit 211, a dialogue management unit 212, a presentation control unit 213, and a synthesis unit 214, through which the presentation form of the terminal device 10 is controlled.

情報記憶部２２０は、情報処理部２１０から得たデータを記憶する、対話履歴記憶部２２１、主に端末装置１０から得たデータを記憶する環境記憶部２２２、属性記憶部２２３、その他に行動記憶部２２４、グルーピング記憶部２２５、言語知識記憶部２２６を含んで構成される。 The information storage unit 220 stores the dialogue history storage unit 221 that stores the data obtained from the information processing unit 210, the environment storage unit 222 that mainly stores the data obtained from the terminal device 10, the attribute storage unit 223, and other action storage. It is composed of a unit 224, a grouping storage unit 225, and a language knowledge storage unit 226.

想起志向管理部２４０は、対話履歴からの単語やフレーズの抽出や志向の解析を行う想起解析部２４１と、情報記憶部２２０の情報を用いて記憶の想起に関するパラメータの推定やモデルの構築を行うモデル構築部２４２とを含んで構成される。 The recall-oriented management unit 240 uses the information of the recall analysis unit 241 that extracts words and phrases from the dialogue history and analyzes the orientation, and the information storage unit 220 to estimate parameters related to memory recall and build a model. It is configured to include a model building unit 242.

対話規則構築部２５０は、情報記憶部２２０および想起志向管理部２４０の情報を用いて対話規則を構築する発話構築部２５１と関係構築部２５２とを含んで構成される。 The dialogue rule construction unit 250 includes an utterance construction unit 251 and a relationship construction unit 252 that construct dialogue rules using the information of the information storage unit 220 and the recall-oriented management unit 240.

（取得部２１１の構成例）
取得部２１１は、端末装置１０から送信される収集データ等の各種情報を取得する外部入力インタフェースである。取得部２１１は、端末装置１０から得られた要求情報等の各種情報を対話管理部２１２へ出力する。 (Configuration example of acquisition unit 211)
The acquisition unit 211 is an external input interface that acquires various information such as collected data transmitted from the terminal device 10. The acquisition unit 211 outputs various information such as request information obtained from the terminal device 10 to the dialogue management unit 212.

また、取得部２１１は、マイクロフォンやキーボード、各種センサなどから直接、要求情報等の各種情報を取得する構成であっても構わない。 Further, the acquisition unit 211 may be configured to acquire various information such as request information directly from a microphone, a keyboard, various sensors, or the like.

（対話管理部２１２の構成例）
図６は、本発明の実施形態に係る対話管理部２１２の構成例を説明する図である。対話管理部２１２は、ユーザに対するシステムからの応答を決定する機能を有しており、端末に対する対話処理を行う。対話管理部２１２は、取得部２１１から各種情報を受け取る解析部２１２１、状態の管理や応答の指針を決定する行動選択部２１２２、行動選択部２１２２を基に応答内容を生成する生成部２１２３を含んで構成される。例えば、対話管理部２１２は、ユーザから与えられた質問等に対応する回答を、情報記憶部２２０に記憶された情報を参照して生成する。 (Configuration example of dialogue management unit 212)
FIG. 6 is a diagram illustrating a configuration example of the dialogue management unit 212 according to the embodiment of the present invention. The dialogue management unit 212 has a function of determining a response from the system to the user, and performs dialogue processing with respect to the terminal. The dialogue management unit 212 includes an analysis unit 2121 that receives various information from the acquisition unit 211, an action selection unit 2122 that determines state management and response guidelines, and a generation unit 2123 that generates response contents based on the action selection unit 2122. Consists of. For example, the dialogue management unit 212 generates an answer corresponding to a question or the like given by the user by referring to the information stored in the information storage unit 220.

解析部２１２１は、取得部２１１から得られた情報を、以降の対話管理部２１２の行程で処理できるようなデータに変換する。なお、入力される情報の種類は、ユーザが利用するインタフェースによって異なる。例えば、入力インタフェースがマイクロフォンの場合、入力情報は音声情報（音声データを含む情報）となり、入力インタフェースがカメラであれば画像情報、入力インタフェースがキーボードであれば文字情報となる。 The analysis unit 2121 converts the information obtained from the acquisition unit 211 into data that can be processed in the subsequent steps of the dialogue management unit 212. The type of information to be input differs depending on the interface used by the user. For example, when the input interface is a microphone, the input information is voice information (information including voice data), image information if the input interface is a camera, and character information if the input interface is a keyboard.

解析部２１２１は、取得部２１１から入力された情報を以降の行程で処理できるように、形態素解析やキーワード抽出、あるいはベクトル化をはじめとした数値データへ変換する。この変換処理には、自然言語処理技術や機械学習技術を用いることができる。本発明の実施形態に係る一例としては、tf-idfによるキーワード抽出やword2vec、doc2vecを用いたベクトル化などの手法を用いても良い。解析部２１２１は、数値データに変換された情報を行動選択部２１２２へ出力する。 The analysis unit 2121 converts the information input from the acquisition unit 211 into numerical data such as morphological analysis, keyword extraction, and vectorization so that the information can be processed in the subsequent steps. Natural language processing technology and machine learning technology can be used for this conversion processing. As an example according to the embodiment of the present invention, a method such as keyword extraction using tf-idf or vectorization using word2vec or doc2vec may be used. The analysis unit 2121 outputs the information converted into the numerical data to the action selection unit 2122.

また、解析部２１２１は、取得部２１１から入力された情報が、想起志向を用いた対話か否かを判定し、判定結果を上記の情報（数値データ）と併せて行動選択部２１２２へ出力する。例えば、想起志向を用いた対話か否かの判定は、ユーザから端末装置１０等を介して得られるデジタルデータ（収集データ）のうち、発話データに含まれる発話内容に基づいて判定することができる。より具体的には、解析部２１２１は、発話内容を構文解析または形態素解析を行うことで、単語と品詞を抽出し、抽出された単語や品詞が、想起志向に対応しているか否かを判定する。想起志向に対応しているか否かは、例えば、「この前」などの指示代名詞があるか否か、「本」等の一般名称があるか否かに基づいて、これらの要素がある場合に、想起志向であると判定する。この場合、言語知識記憶部２２６が、想起志向フレーズ（単語）や品詞について、想起志向に対応しているか否かの情報とともに予め記憶しておき、解析部２１２１が、言語知識記憶部２２６を参照し、判定対象の単語や品詞が言語知識記憶部２２６に記憶されている場合に、想起志向であると判定するようにしてもよい。
解析部２１２１は、取得部２１１から入力された情報が、想起志向を用いた対話か否かを、判定し、判定結果を上記の情報（数値データ）とともに、対話履歴記憶部２２１に出力し、例えば、「昨日」、「本」等の想起志向フレーズとして記憶する。 Further, the analysis unit 2121 determines whether or not the information input from the acquisition unit 211 is a dialogue using the recall orientation, and outputs the determination result together with the above information (numerical data) to the action selection unit 2122. .. For example, the determination of whether or not the dialogue is based on the recall orientation can be determined based on the utterance content included in the utterance data among the digital data (collected data) obtained from the user via the terminal device 10 or the like. .. More specifically, the analysis unit 2121 extracts words and part of speech by performing parsing or morphological analysis of the utterance content, and determines whether or not the extracted words and part of speech correspond to the recall orientation. To do. Whether or not it corresponds to recall orientation is based on whether or not there is a demonstrative pronoun such as "before" and whether or not there is a general name such as "book" when these elements are present. , Judged as recall-oriented. In this case, the language knowledge storage unit 226 stores in advance the recall-oriented phrases (words) and part of speech together with information on whether or not they correspond to the recall-oriented, and the analysis unit 2121 refers to the language knowledge storage unit 226. However, when the word or part of speech to be determined is stored in the language knowledge storage unit 226, it may be determined to be recall-oriented.
The analysis unit 2121 determines whether or not the information input from the acquisition unit 211 is a dialogue using recall orientation, and outputs the determination result together with the above information (numerical data) to the dialogue history storage unit 221. For example, memorize it as a recall-oriented phrase such as "yesterday" or "book".

行動選択部２１２２は、情報記憶部２２０および対話規則構築部２５０を基にユーザの状態を定義し、応答の指針を決定するものである。状態とは、例えば、現在の話題等に相当するものであり、要求発話が「東京発の新大阪着の新幹線の始発は？」という内容である場合、状態（話題）は『新幹線』であると定義できる。ここでは、何を調べるのか、その対象を状態として定義することができる。調べる対象が何であるかについては、例えば、ユーザからの要求発話を構文解析することで、品詞や係り受け等の関係から、調べる対象を特定（定義）するようにしてもよい。
行動選択部２１２２は、応答の指針を立てるにあたり、要求発話が「東京発の新大阪着の新幹線の席空いてる？」という内容である場合、構文解析を行った結果に基づいて、状態（話題）が「新幹線」であると定義することができ、新幹線の空席を照会することが応答指針であると決定することができる。また、例えば、行動選択部２１２２は、要求発話が「○○映画館の映画Ｂのチケットある？」という内容である場合、構文解析を行った結果に基づいて、状態（話題）が「映画館」であると定義することができ、○○映画館の映画Ｂが上映される際の空席を照会することを応答指針として決定する。
行動選択部２１２２は、対話履歴記憶部２２１に記憶された対話履歴や、解析部２１２１から得られる数値データに基づいて、前後の関係にある複数のメッセージの関係から、ユーザの状態（話題）を定義する。 The action selection unit 2122 defines the user's state based on the information storage unit 220 and the dialogue rule construction unit 250, and determines the response guideline. The state corresponds to, for example, the current topic, and if the requested utterance is "What is the first Shinkansen from Tokyo to Shin-Osaka?", The state (topic) is "Shinkansen". Can be defined as. Here, what to look for can be defined as a state. Regarding what the object to be examined is, for example, by parsing the utterance requested by the user, the object to be examined may be specified (defined) from the relation of part of speech, dependency, and the like.
When the action selection unit 2122 establishes a response guideline, if the request utterance is "Are there seats on the Shinkansen from Tokyo to Shin-Osaka?", The state (topic) is based on the result of syntactic analysis. ) Can be defined as "Shinkansen", and it can be determined that inquiring about the vacant seats of the Shinkansen is the response guideline. Further, for example, when the request utterance is "Is there a ticket for movie B in a movie theater?", The action selection unit 2122 has a state (topic) of "movie theater" based on the result of syntactic analysis. It can be defined as, and it is decided as a response guideline to inquire about the vacant seats when the movie B of the XX movie theater is screened.
The action selection unit 2122 determines the user's state (topic) from the relationship between a plurality of messages in the context, based on the dialogue history stored in the dialogue history storage unit 221 and the numerical data obtained from the analysis unit 2121. Define.

行動選択部２１２２は、リクエスト（要求情報）とレスポンス（返答情報）のリストをデータベース化したものや、機械学習や強化学習などによって学習された学習済みモデルを用いて要求情報に対する返答情報を決定する。行動選択部２１２２が行う処理内容は、公知の技術である対話システムと同等の構成を用いることができるため、詳細な説明は割愛する。行動選択部２１２２は、生成された情報を生成部２１２３へ出力する。 The action selection unit 2122 determines the response information for the request information by using a database of a list of requests (request information) and responses (response information) and a learned model learned by machine learning or reinforcement learning. .. Since the processing content performed by the action selection unit 2122 can use the same configuration as the dialogue system, which is a known technique, detailed description thereof will be omitted. The action selection unit 2122 outputs the generated information to the generation unit 2123.

行動選択部２１２２は、想起志向を用いた対話であると解析部２１２１によって判定された場合、要求発話の内容を構文解析して得られる想起志向フレーズ（単語）に合致する内容を環境記憶部２２２から探索し、合致する内容がない場合は、想起志向管理部２４０にて構築された想起志向のモデルを用いて最適な応答内容を決定する。
ここでは、環境記憶部２２２には、過去の対話内容と、その対話内容に含まれる想起志向フレーズとが対応づけて記憶されている。行動選択部２１２２は、想起志向フレーズに対応する対話内容を特定し、特定された対話内容を基に、過去に回答した内容も加味して返答情報を決定する。例えば、過去の対話内容において、「書籍Ａの価格」として「１２００円」という対話内容があった場合には、環境記憶部２２２に記憶される。 When the analysis unit 2121 determines that the dialogue is a recall-oriented dialogue, the action selection unit 2122 parses the content of the requested utterance to obtain a content that matches the recall-oriented phrase (word). If there is no matching content, the optimum response content is determined using the recall-oriented model constructed by the recall-oriented management unit 240.
Here, the environmental storage unit 222 stores the past dialogue contents and the recall-oriented phrases included in the dialogue contents in association with each other. The action selection unit 2122 specifies the dialogue content corresponding to the recall-oriented phrase, and determines the response information based on the identified dialogue content, taking into account the content answered in the past. For example, if there is a dialogue content of "1200 yen" as the "price of the book A" in the past dialogue content, it is stored in the environmental storage unit 222.

生成部２１２３は、行動選択部２１２２の返答情報を基に応答の内容を決定する。例えば、生成部２１２３は、行動選択部２１２２において「この前Ａさんと話した本の値段は？」という要求情報に対して「１２００円」という返答情報が決定された場合には、この返答情報「１２００円」に対して「です」等の情報を付加することで、回答（文章）としての体裁を整える処理を行うことで、応答内容を生成する。また、生成部２１２３は、言語のみを用いて回答する場合の他に、言語と画像との２種類を用いて応答内容を生成することもできる。言語と画像の２種類を用いる場合、生成部２１２３は、画像によって応答内容を表現できている部分については、言語において重複した表現が含まれないように、言語のみを用いた応答内容の一部を省略することで、言語と画像の２種類を用いた応答内容を生成することもできる。具体的には、機械学習を用いて応答文を生成する手法などがある（公知の技術を用いる）。また、予め作成された雛形に外部API（Application Programming Interface）を介して必要な情報を当てはめて文章を完成させるという方法もある。本実施形態では、いずれかの手法を用いることとする。
また、生成部２１２３は、リクエストとレスポンスのデータベースを保持していた場合、生成部２１２３を介さずに応答内容を決定することもできる。生成部２１２３の情報を含む、対話管理部２１２の結果は、提示制御部２１３へと出力される。 The generation unit 2123 determines the content of the response based on the response information of the action selection unit 2122. For example, when the action selection unit 2122 determines the response information of "1200 yen" in response to the request information "What is the price of the book that you talked to Mr. A last time?", The generation unit 2123 provides this response information. By adding information such as "desu" to "1200 yen", the response content is generated by performing the process of adjusting the appearance as the answer (sentence). In addition to the case where the generation unit 2123 responds using only the language, the generation unit 2123 can also generate the response content using two types of language and image. When two types of language and image are used, the generation unit 2123 is a part of the response content using only the language so that the part where the response content can be expressed by the image is not included in the duplicate expression in the language. By omitting, it is also possible to generate a response content using two types of language and image. Specifically, there is a method of generating a response sentence using machine learning (using a known technique). There is also a method of completing a sentence by applying necessary information to a template created in advance via an external API (Application Programming Interface). In this embodiment, either method is used.
Further, when the generation unit 2123 holds the request and response database, the response content can be determined without going through the generation unit 2123. The result of the dialogue management unit 212 including the information of the generation unit 2123 is output to the presentation control unit 213.

（提示制御部２１３の構成例）
図５に戻り、提示制御部２１３は、対話管理部２１２から入力された情報（応答内容）に基づき提示方法を決定する。提示方法は、例えば、応答内容をスピーカから出力する場合には、音声によって応答内容を出力することができるため、提示制御部２１３は、提示方法として、音声を用いることを決定する。また、応答内容を表示パネルとスピーカとを用いて出力する場合には、音声と画像とを用いて応答内容を出力することができるため、音声及び画像を用いることを決定する。提示制御部２１３は、生成部２１２３によって生成された応答内容と、決定された提示方法とを含む提示情報を合成部２１４へ出力する。 (Configuration example of presentation control unit 213)
Returning to FIG. 5, the presentation control unit 213 determines the presentation method based on the information (response content) input from the dialogue management unit 212. As for the presentation method, for example, when the response content is output from the speaker, the response content can be output by voice, so the presentation control unit 213 decides to use voice as the presentation method. Further, when the response content is output using the display panel and the speaker, the response content can be output using the voice and the image, so it is decided to use the voice and the image. The presentation control unit 213 outputs the presentation information including the response content generated by the generation unit 2123 and the determined presentation method to the synthesis unit 214.

（合成部２１４の構成例）
合成部２１４は、提示制御部２１３から入力された提示情報に基づき、加工処理を施したコンテンツを生成する。例えば、合成部２１４は、生成部２１２３によって生成された応答内容を、提示方法に対応したデータを生成することで、コンテンツを生成する。合成部２１４は、「１２００円です」という応答内容を、提示方法が音声である場合には、「１２００円です」という応答内容（テキストデータ）を対象として音声合成処理を行い、音声ファイルを生成することでコンテンツを得る。合成部２１４は、生成されたコンテンツを、情報提示装置２０に設けられる通信機能によって、端末装置１０へ送信する。 (Structure example of synthesis unit 214)
The synthesis unit 214 generates the processed content based on the presentation information input from the presentation control unit 213. For example, the synthesis unit 214 generates the content by generating the data corresponding to the presentation method of the response content generated by the generation unit 2123. The synthesis unit 214 performs voice synthesis processing on the response content (text data) of "1200 yen" when the presentation method is voice, and generates a voice file. Get content by doing. The compositing unit 214 transmits the generated content to the terminal device 10 by a communication function provided in the information presenting device 20.

また、情報提示装置２０は、合成部２１４から出力され端末装置１０に送信された返答情報に対する各ユーザの評価を取得するために、明示的な項目（「あっていましたか？はい／いいえ」など）をユーザへ送信することも可能である。端末装置１０に入力されたユーザからの評価情報は、情報記憶部２２０の行動記憶部２２４へ格納される。 Further, the information presenting device 20 obtains an explicit item (such as "Did it match? Yes / No", etc., in order to acquire the evaluation of each user for the response information output from the synthesis unit 214 and transmitted to the terminal device 10. ) Can also be sent to the user. The evaluation information from the user input to the terminal device 10 is stored in the action storage unit 224 of the information storage unit 220.

情報提示装置２０による提示形態については、ユーザによる任意の変更も可能とする。これらユーザによる変更も行動情報として情報記憶部２２０内の行動記憶部２２４へ出力される。 The presentation form by the information presenting device 20 can be arbitrarily changed by the user. These changes made by the user are also output as action information to the action storage unit 224 in the information storage unit 220.

また、取得部２１１、対話管理部２１２、合成部２１４は、取得した入力、解析結果、最適化提示で用いた情報を対話履歴記憶部２２１へ出力する。例えば、取得部２１１は、ユーザからの入力に応じて、時刻、ユーザＩＤ、メッセージ、メッセージＩＤを対話履歴記憶部２２１へ出力する。対話管理部２１２は、対話毎に、想起志向フレーズ、セッションＩＤを対話履歴記憶部２２１へ出力する。合成部２１４は、端末装置１０に対して出力するコンテンツに基づいて、その時刻、ユーザＩＤ、メッセージ、メッセージＩＤを出力する。 Further, the acquisition unit 211, the dialogue management unit 212, and the synthesis unit 214 output the acquired input, the analysis result, and the information used in the optimization presentation to the dialogue history storage unit 221. For example, the acquisition unit 211 outputs the time, the user ID, the message, and the message ID to the dialogue history storage unit 221 in response to the input from the user. The dialogue management unit 212 outputs a recall-oriented phrase and a session ID to the dialogue history storage unit 221 for each dialogue. The synthesis unit 214 outputs the time, the user ID, the message, and the message ID based on the content to be output to the terminal device 10.

（対話履歴記憶部２２１の構成例）
図７は、本発明の実施形態に係る対話履歴記憶部２２１に記憶される対話履歴データのデータ構成の一例を示す図である。対話履歴記憶部２２１は、時刻、ユーザID、発話内容を示すメッセージ、想起志向フレーズ、セッションＩＤ、メッセージＩＤ（ｎ番目）、メッセージＩＤ（ｎ−１番目）の情報を含み、直前の発話などの対話履歴を構成要素の一つとして蓄積（記憶）する。
対話履歴記憶部２２１は、端末装置１０に入力される音声であれば、その対話の相手が情報提示装置２０であってもその発話内容がメッセージとして記憶され、また、端末装置１０の近傍で複数のユーザが対話している場合には、その対話についてもメッセージとして記憶される。例えば、端末装置１０の近傍でユーザＡとユーザＢが対話している場合には、ユーザの声紋等を基に、発話されたメッセージをユーザ毎に分類することができる。その上で、発話内容毎に、メッセージとして記憶される。そのため、情報提示装置２０は、「先週、会社でＡさんと話したときにでてきた本っていくら？」という要求発話があった場合であっても、この対話履歴記憶部２２１に記憶されたメッセージを元に、回答を探索することができる。 (Configuration example of dialogue history storage unit 221)
FIG. 7 is a diagram showing an example of a data structure of dialogue history data stored in the dialogue history storage unit 221 according to the embodiment of the present invention. The dialogue history storage unit 221 includes information on the time, the user ID, the message indicating the utterance content, the recall-oriented phrase, the session ID, the message ID (nth), and the message ID (n-1th), and includes information on the immediately preceding utterance, etc. Accumulate (memorize) the dialogue history as one of the components.
If the dialogue history storage unit 221 is a voice input to the terminal device 10, the utterance content is stored as a message even if the other party of the dialogue is the information presenting device 20, and a plurality of dialogue history storage units 221 are stored in the vicinity of the terminal device 10. If the user is interacting with, the dialogue is also stored as a message. For example, when the user A and the user B are interacting with each other in the vicinity of the terminal device 10, the spoken message can be classified for each user based on the user's voiceprint or the like. Then, each utterance content is stored as a message. Therefore, the information presenting device 20 is stored in the dialogue history storage unit 221 even when there is a request utterance "How much is the book that came out when I talked with Mr. A at the company last week?" You can search for answers based on the messages you received.

対話履歴記憶部２２１に記憶される対話履歴データは、端末装置１０に入力される音声について、全て記憶するようにしてもよいし、端末装置１０の入力部１１０のオン・オフを切り替えることで、入力部１１０がオンとなっている期間における対話内容を記憶するようにしてもよい。例えば、入力部１１０がマイクロフォンである場合には、マイクロフォンの機能をオンまたはオフに、切り替えられるようにしてもよい。この切り替えは、ユーザの操作によってオン・オフされるスイッチに基づいてもよいし、オンまたはオフにする操作コマンドを音声入力によって行うようにしてもよい。
また、入力部１１０が、キーボードやマウス等である場合には、入力部１１０から入力されるテキストデータや図（顔文字、絵文字、スタンプなど）を対話内容とし、対話履歴データとして記憶することもできる。 The dialogue history data stored in the dialogue history storage unit 221 may be stored for all the voices input to the terminal device 10, or by switching the input unit 110 of the terminal device 10 on / off. The dialogue content during the period when the input unit 110 is on may be stored. For example, when the input unit 110 is a microphone, the function of the microphone may be switched on or off. This switching may be based on a switch that is turned on and off by a user operation, or an operation command for turning on or off may be performed by voice input.
Further, when the input unit 110 is a keyboard, a mouse, or the like, text data or figures (emoticons, pictograms, stamps, etc.) input from the input unit 110 can be stored as dialogue history data as dialogue contents. it can.

時刻は、入力部１１０からユーザから発話された時刻を表す。
ユーザIDは、発話を行ったユーザを識別する識別情報である。ここでは、「Ｕ＿００１」が示すユーザは、いずれかの端末装置１０を利用するユーザを特定する識別情報であり、「Ｃ＿００１」は、端末装置１０から出力される返答情報を生成した情報提示装置２０を表す。このユーザＩＤは、ユーザによって発話された内容を入力した端末装置１０に割り当てられた識別情報に対応するユーザＩＤが用いられ、対話管理部２１２によって書き込まれる。情報提示装置２０からコンテンツを出力することで行われる対話については、自情報提示装置２０に割り当てられたユーザＩＤが合成部２１４によって書き込まれる。すなわち、あるユーザからの要求情報に対して、情報提示装置２０から返答を行った、という一連の流れを把握できるようになっている。 The time represents the time spoken by the user from the input unit 110.
The user ID is identification information that identifies the user who made the utterance. Here, the user indicated by "U_001" is identification information that identifies a user who uses any of the terminal devices 10, and "C_001" is an information presenting device 20 that generates response information output from the terminal device 10. Represents. As this user ID, the user ID corresponding to the identification information assigned to the terminal device 10 that has input the content uttered by the user is used, and is written by the dialogue management unit 212. For the dialogue performed by outputting the content from the information presenting device 20, the user ID assigned to the self-information presenting device 20 is written by the synthesis unit 214. That is, it is possible to grasp a series of flows in which the information presenting device 20 responds to the request information from a certain user.

メッセージは、ユーザから発話された内容に基づいて得られる発話内容を表す情報であり、取得部２１１によって書き込まれる。例えばメッセージは、テキストデータである。
想起志向フレーズは、解析部２１２１の解析結果を基に、発話内容に想起志向フレーズが含まれているか否かを表す情報が記憶される。例えば、解析部２１２１が、取得部２１１から入力された情報に、想起志向を用いた対話か否かを判定し、想起志向を用いた対話である場合には、その想起志向に対応する表現要素を想起志向フレーズとして対話履歴記憶部２２１に記憶する。ここでは、一例として、メッセージ「昨日話した本の値段は？」に対して、「昨日」と「本」が想起志向フレーズとして記憶される。
セッションＩＤは、１つの対話の開始から対話の終了まで１つの対話群とし、この対話群を識別する識別情報である。セッションＩＤは、対話管理部２１２によって書き込まれる。 The message is information representing the utterance content obtained based on the content uttered by the user, and is written by the acquisition unit 211. For example, the message is text data.
The recall-oriented phrase stores information indicating whether or not the recall-oriented phrase is included in the utterance content based on the analysis result of the analysis unit 2121. For example, the analysis unit 2121 determines whether or not the information input from the acquisition unit 211 is a dialogue using the recall orientation, and if it is a dialogue using the recall orientation, an expression element corresponding to the recall orientation. Is stored in the dialogue history storage unit 221 as a recall-oriented phrase. Here, as an example, "yesterday" and "book" are memorized as recall-oriented phrases for the message "What is the price of the book I talked about yesterday?".
The session ID is identification information that identifies this dialogue group as one dialogue group from the start of one dialogue to the end of the dialogue. The session ID is written by the dialogue management unit 212.

メッセージＩＤ（ｎ番目）は、メッセージを識別する情報であり、メッセージ毎に異なる情報が付与される。また、メッセージＩＤ（ｎ番目）は、メッセージの時系列における順序を示す情報も含んでおり、例えば、時系列に応じて昇順あるいは降順に並ぶように定められる番号である。
メッセージＩＤ（ｎ−１番目）は、メッセージＩＤ（ｎ番目）が示すメッセージに対して時系列の順において１つ前のメッセージに付与されたメッセージＩＤ（ｎ番目）を示す。
メッセージＩＤ（ｎ番目）とメッセージＩＤ（ｎ−１番目）とを用いることで、メッセージの時系列の順を特定することができるようになっている。
メッセージＩＤは、ユーザから発話されたメッセージについては取得部１１２、情報提示装置２０からコンテンツを出力することによるメッセージについては合成部２１４によって書き込まれる。 The message ID (nth) is information for identifying a message, and different information is given to each message. The message ID (nth) also includes information indicating the order of the messages in the time series, and is, for example, a number determined to be arranged in ascending or descending order according to the time series.
The message ID (n-1st) indicates a message ID (nth) assigned to the message immediately preceding the message indicated by the message ID (nth) in chronological order.
By using the message ID (nth) and the message ID (n-1st), the order of the time series of the messages can be specified.
The message ID is written by the acquisition unit 112 for the message uttered by the user and by the synthesis unit 214 for the message by outputting the content from the information presenting device 20.

（環境記憶部２２２の構成例）
図８は、本発明の実施形態に関わる環境記憶部２２２のデータ構成の一例を示す図である。環境記憶部２２２は、ユーザごとに対話を行った際のユーザ側における日時や場所、天気などを構成要素の一つとして記憶する。具体的に、環境記憶部２２２は、セッションＩＤ、ユーザＩＤ、日付、時間（開始）、時間（終了）、場所、天気、想起志向フレーズの情報を含む。
セッションＩＤは、１つの対話の開始から対話の終了まで１つの対話群とし、この対話群を識別する識別情報である。また、セッションが同じであれば、対話履歴記憶部２２１に記憶されるセッションＩＤと同じセッションＩＤが用いられる。
ユーザＩＤは、ユーザを識別する情報であり、特に、端末装置１０を利用しているユーザを特定するユーザＩＤである。このユーザＩＤは、端末装置１０に割り当てられたＩＤを用いることができ、また、端末装置１０がスマートスピーカである場合には、このスマートスピーカを利用する際にユーザ登録された際のＩＤを用いることができる。
日付は、対話が行われた日付を表す。
時間（開始）は、対話が開始された時刻を示す。
時間（終了）は、対話が終了した時刻を示す。
場所は、対話が行われた時点における、発話したユーザの位置を示す。例えば、ユーザが自宅において発話した場合には、場所としては「自宅」が記憶され、会議室Ａにおいて
発話した場合には「会議室Ａ」が記憶される。場所の特定は、例えば、発話する端末装置１０のＧＮＳＳ機能によって現在位置（例えば緯度及び経度）を特定し、その現在位置に対応する場所の名称を特定するようにしてもよい。
天気は、対話が行われた時点における、発話したユーザの位置を含む地域における天気を示す。この天気の情報は、天気情報ＡＰＩや、外部の気象サーバなどから取得してもよい。この天気については、対話が行われた場所における環境が把握できればよいため、天気ではなく、温度や湿度等であってもよい。 (Structure example of environmental storage unit 222)
FIG. 8 is a diagram showing an example of a data structure of the environmental storage unit 222 according to the embodiment of the present invention. The environment storage unit 222 stores the date and time, place, weather, etc. on the user side when the dialogue is performed for each user as one of the components. Specifically, the environmental storage unit 222 includes information on a session ID, a user ID, a date, a time (start), a time (end), a place, a weather, and a recall-oriented phrase.
The session ID is identification information that identifies this dialogue group as one dialogue group from the start of one dialogue to the end of the dialogue. If the sessions are the same, the same session ID as the session ID stored in the dialogue history storage unit 221 is used.
The user ID is information that identifies the user, and in particular, is a user ID that identifies the user who is using the terminal device 10. As this user ID, the ID assigned to the terminal device 10 can be used, and when the terminal device 10 is a smart speaker, the ID at the time of user registration when using this smart speaker is used. be able to.
The date represents the date on which the dialogue took place.
Time (start) indicates the time when the dialogue started.
Time (end) indicates the time when the dialogue ended.
The location indicates the position of the speaking user at the time of the dialogue. For example, when the user speaks at home, "home" is stored as the place, and when the user speaks in the conference room A, "meeting room A" is stored. The location may be specified, for example, by specifying the current position (for example, latitude and longitude) by the GNSS function of the terminal device 10 that speaks, and specifying the name of the place corresponding to the current position.
The weather indicates the weather in the area including the position of the user who spoke at the time of the dialogue. This weather information may be acquired from a weather information API, an external weather server, or the like. As for this weather, it may be temperature, humidity, etc. instead of the weather, as long as the environment at the place where the dialogue was held can be grasped.

（属性記憶部２２３の構成例）
図９は、本発明の実施形態に係る属性記憶部２２３のデータ構成の一例を示す図である。属性記憶部２２３は、年齢・性別を始め、対話以外のユーザに関する情報（例えば、ユーザの所在地）を構成要素の一つとして記憶する。具体的に、属性記憶部２２３は、ユーザＩＤ、年齢、性別、所在地、想起志向、信頼度等の情報を含む。
ユーザＩＤは、ユーザを識別する識別情報である。このユーザＩＤは、特に、要求発話（質問）をするユーザを識別する識別情報である。
年齢は、ユーザの年齢を示す。
性別は、ユーザの性別である。
所在地は、ユーザの所在地である。デモグラフィックデータは、これら年齢、性別、所在地を用いることができるが、職業等の他の情報を用いることもできる。 (Configuration example of attribute storage unit 223)
FIG. 9 is a diagram showing an example of the data structure of the attribute storage unit 223 according to the embodiment of the present invention. The attribute storage unit 223 stores information about the user other than the dialogue (for example, the location of the user) as one of the components, including age and gender. Specifically, the attribute storage unit 223 includes information such as a user ID, age, gender, location, recall orientation, and reliability.
The user ID is identification information that identifies the user. This user ID is, in particular, identification information that identifies a user who makes a request utterance (question).
Age indicates the age of the user.
Gender is the gender of the user.
The location is the location of the user. These ages, genders, and locations can be used for the demographic data, but other information such as occupations can also be used.

想起志向は、対話履歴記憶部２２１に記憶されたメッセージから想起解析部２４１によって抽出された想起志向フレーズを基にした情報である。例えば、想起志向として、想起解析部２４１によって抽出された想起志向フレーズそのものを用いるようにしてもよいが、ここでは、当該想起志向フレーズを基にした情報を用いる場合について説明する。想起志向フレーズを想起志向として記憶する場合には、具体的な単語を登録することになるため、そうすると、属性記憶部２２３に記憶するデータ量が増大してしまう。また、「３日前」、「４日前」という日時の具体的なデータや、「Ａさん」、「Ｂさん」という固有名詞の具体的なデータを、それぞれ区別して保存したとしても、データ量を増加させたことに対する効果としては必ずしも大きな効果が得られるとは限らない。そこで、「３日前」、「４日前」という日時の具体的なデータについては、想起志向として「日時」として記憶し、「Ａさん」、「Ｂさん」という固有名詞の具体的なデータについては、想起志向として「固有名詞（人物名）」として記憶するようにしてもよい。これにより、想起志向フレーズを「日時」や「固有名詞（人物名）」等の属性毎に、信頼度を把握することができるようになる。この図９においては、想起志向として「固有名詞（書籍）」と、「価格」について例示されている。「固有名詞（書籍）」は、書籍の名称が属する属性を示し、「価格」は、物品やサービスの価格を示す属性である。 The recall-oriented information is information based on the recall-oriented phrase extracted by the recall analysis unit 241 from the message stored in the dialogue history storage unit 221. For example, the recall-oriented phrase itself extracted by the recall analysis unit 241 may be used as the recall-oriented phrase, but here, a case where information based on the recall-oriented phrase is used will be described. When a recall-oriented phrase is stored as a recall-oriented phrase, a specific word is registered, so that the amount of data stored in the attribute storage unit 223 increases. In addition, even if the specific data of the date and time "3 days ago" and "4 days ago" and the specific data of the proper nouns "Mr. A" and "Mr. B" are stored separately, the amount of data can be saved. As an effect on the increase, it is not always possible to obtain a large effect. Therefore, the specific data of the date and time "3 days ago" and "4 days ago" are memorized as "date and time" as a recollection intention, and the specific data of the proper nouns "Mr. A" and "Mr. B" are stored. , You may memorize it as a "proper noun (personal name)" as a recollection orientation. As a result, it becomes possible to grasp the reliability of the recall-oriented phrase for each attribute such as "date and time" and "proper noun (personal name)". In FIG. 9, "proper nouns (books)" and "price" are illustrated as recall-oriented. The "proper noun (book)" indicates the attribute to which the name of the book belongs, and the "price" is an attribute indicating the price of goods and services.

信頼度は、想起志向に対するユーザの認識の正しさを表す度合いである。言い換えると、信頼度は、ユーザが利用する想起志向フレーズが示す想起志向が当該ユーザの欲しい情報を得るために信頼できる度合いを示す。この度合いが高いほど、想起志向が信頼できる、すなわち、要求発話に対する回答として用いる情報処理の過程において、利用できる価値が高いともいえる。
信頼度は、ここでは０から１００までのうちいずれかの数を用いて示しており、１００に近いほど、信頼度が高く、０に近いほど信頼度が低いことを示す。ここでは、想起志向「固有名詞（書籍）」についての信頼度は、９０であり、「価格」についての信頼度は、９５であり、この２つの信頼度を比べた場合には、「価格」の信頼度の方が高い。
ユーザＩＤと想起志向と信頼度とを対応付けて記憶することで、ユーザＩＤが示すユーザが発話した要求発話について、その要求発話に含まれる想起志向を信頼できる度合いを情報として活用することが可能となる。例えば、日時が曖昧になりやすいユーザについては、質問に含まれる日時そのものではなく、探索範囲を広げられるように日時の期間を広げたり、別の期間に変更したりすることができる。 The degree of reliability is a degree indicating the correctness of the user's perception of the recall orientation. In other words, the reliability indicates the degree to which the recall-oriented phrase indicated by the recall-oriented phrase used by the user can be trusted to obtain the information desired by the user. It can be said that the higher this degree is, the more reliable the recall orientation is, that is, the higher the value that can be used in the process of information processing used as an answer to a request utterance.
The reliability is shown here using any number from 0 to 100. The closer it is to 100, the higher the reliability, and the closer it is to 0, the lower the reliability. Here, the reliability of the recall-oriented "proper noun (book)" is 90, and the reliability of the "price" is 95. When comparing these two reliabilitys, the "price" The reliability of is higher.
By storing the user ID, the recall orientation, and the reliability in association with each other, it is possible to utilize as information the degree to which the recall orientation included in the request utterance can be trusted for the request utterance indicated by the user ID. It becomes. For example, for a user whose date and time tends to be ambiguous, the date and time period can be extended or changed to another period so that the search range can be expanded instead of the date and time itself included in the question.

属性記憶部２２３については、上述の他にもユーザの許諾さえあれば、ユーザの端末装置１０から取得できる情報を記憶するようにしてもよい。例えば、端末装置１０において利用されたアプリケーションの利用履歴を属性情報として記憶するようにしてもよい。このアプリケーションの利用履歴は、ユーザのグルーピングを行う際に、アプリケーションの利用の傾向が類似するユーザが同じグループになるようにグルーピングすることができる。 In addition to the above, the attribute storage unit 223 may store information that can be acquired from the user's terminal device 10 with the permission of the user. For example, the usage history of the application used in the terminal device 10 may be stored as attribute information. When grouping users, the usage history of this application can be grouped so that users with similar application usage tendencies are in the same group.

（行動記憶部２２４の構成例）
図１０は、本発明の実施形態に係る行動記憶部２２４のデータ構成の一例である。行動記憶部２２４は、情報処理部２１０に基づき提示された応答内容や、当該応答内容がユーザの欲しい情報として満足できた否かを示すユーザからのリアクション（フィードバック）を記憶する。例えば、行動記憶部２２４は、ユーザが使用した想起志向フレーズや、システムの提示した返答に関する評価などを記憶する。より具体的に、行動記憶部２２４は、時刻、ユーザＩＤ、アクションＩＤ、想起志向フレーズ、実施内容、メッセージＩＤ、探索結果、フィードバックの情報を含んで記憶する。
想起志向フレーズは、探索を行うために用いられた想起志向フレーズを示す。
実施内容は、探索を行う際に用いられた想起志向を示すデータである。実施内容は、探索を行う際に探索範囲を拡げた上で探索を行った場合には、どの想起志向フレーズについてどのように探索範囲を拡げたか（変更したか）を表す情報を含む。
アクションＩＤは、探索を行った処理ごとに付与される情報であり、探索処理を識別する情報である。
探索結果は、実施内容に基づく探索を行った結果、環境記憶部２２２から得られたか否かを示す情報であり、「False」は回答が得られなかったことを示し、「True」は回答が得られたことを示す。
フィードバックは、要求発話に対する回答をユーザの端末装置１０に送信し、その回答が要求発話に対する回答として満足したか否かについてユーザから得られた結果を示す情報であり、「True」は満足したことを示し、「False」が満足しなかったことを示す。 (Structure example of behavior memory unit 224)
FIG. 10 is an example of the data structure of the behavior storage unit 224 according to the embodiment of the present invention. The action storage unit 224 stores the response content presented by the information processing unit 210 and the reaction (feedback) from the user indicating whether or not the response content is satisfied as the information desired by the user. For example, the action storage unit 224 stores a recall-oriented phrase used by the user, an evaluation regarding the response presented by the system, and the like. More specifically, the action storage unit 224 stores information including time, user ID, action ID, recall-oriented phrase, execution content, message ID, search result, and feedback information.
The recall-oriented phrase indicates the recall-oriented phrase used to perform the search.
The content of the implementation is data showing the recall orientation used when conducting the search. The content of the implementation includes information indicating which recall-oriented phrase and how the search range was expanded (changed) when the search was performed after the search range was expanded.
The action ID is information given for each process in which the search is performed, and is information for identifying the search process.
The search result is information indicating whether or not the search was obtained from the environmental storage unit 222 as a result of the search based on the implementation content. "False" indicates that no answer was obtained, and "True" indicates that the answer was not obtained. Indicates that it was obtained.
The feedback is information indicating the result obtained from the user as to whether or not the answer to the request utterance is transmitted to the user's terminal device 10 and the answer is satisfied as the answer to the request utterance, and "True" is satisfied. Indicates that "False" was not satisfied.

行動記憶部２２４は、端末装置１０から受信した要求発話に想起志向フレーズが含まれており、かつ、当該想起志向フレーズに対する回答として情報提示装置２０がコンテンツを出力した場合に、そのコンテンツがユーザにとって有益な情報となっていたか（要求に対して満足する回答となっていたか）について、メッセージ毎に履歴として記憶する。
例えば、ここでは、ユーザIDが「U＿001」からの要求発話には想起志向フレーズとして「先週」、「Ａさん」が抽出されており、実施内容として想起志向「日時」と、想起志向「人物（メンバ）」を手がかりにして回答を探索したことを示す「人物（メンバ）から参照」とが記憶されており、この探索を行っても回答が見つけられなかったことを示す「False」が記憶される。また、ユーザID「U_002」からの要求発話については、要求発話に含まれていた想起フレーズの日時に関する想起志向フレーズについて「２週間前」に範囲を拡張するように変更して再度探索を行ったことを実施内容「日時の範囲を拡げ再試行」が記憶され、その探索を行った結果、回答が見つかったことを示す「True」が記憶され、さらに、その回答に基づくコンテンツをユーザに送信したところ、満足する回答が得られたことを示すフィードバックが得られたことを示す「True」が記憶されている。 When the request utterance received from the terminal device 10 includes a recall-oriented phrase and the information presenting device 20 outputs the content as a response to the recall-oriented phrase, the action storage unit 224 receives the content for the user. Whether the information was useful (whether the answer was satisfactory to the request) is stored as a history for each message.
For example, here, "last week" and "Mr. A" are extracted as recall-oriented phrases in the request utterance from the user ID "U_001". "Reference from a person (member)" indicating that the answer was searched using "member)" is memorized, and "False" indicating that the answer was not found even after performing this search is memorized. To. Regarding the request utterance from the user ID "U_002", the range of the recall-oriented phrase related to the date and time of the recall phrase included in the request utterance was changed to "two weeks ago" and the search was performed again. The content of the implementation "Expand the range of date and time and retry" is memorized, and as a result of the search, "True" indicating that the answer is found is memorized, and the content based on the answer is sent to the user. However, "True" indicating that feedback was obtained indicating that a satisfactory answer was obtained is stored.

（グルーピング記憶部２２５の構成例）
図１１は、本発明の実施形態に係るグルーピング記憶部２２５のデータ構成の一例を示す図である。グルーピング記憶部２２５は、ユーザごとのグルーピングを行った結果を記憶する。単純なものとしては、対話履歴を数値化したものや年齢・性別などを基に、個別または複合的な視点から共有項を見つけ、フラグをつけるという方法がある。
この図においては、グループＩＤ、ユーザＩＤ、年齢、性別等が対応付けされたデータである。
グループＩＤは、グループを識別する情報である。このグループＩＤが同じユーザについては、同じグループに所属することを表している。
ユーザＩＤは、ユーザを個別に識別する情報である。
年齢は、ユーザの年齢である。
性別は、ユーザの性別である。 (Structure example of grouping storage unit 225)
FIG. 11 is a diagram showing an example of a data structure of the grouping storage unit 225 according to the embodiment of the present invention. The grouping storage unit 225 stores the result of grouping for each user. As a simple method, there is a method of finding a shared item from an individual or multiple viewpoints and flagging it based on a numerical value of the dialogue history, age, gender, and the like.
In this figure, it is data in which a group ID, a user ID, an age, a gender, and the like are associated with each other.
The group ID is information that identifies the group. Users with the same group ID indicate that they belong to the same group.
The user ID is information that individually identifies the user.
Age is the age of the user.
Gender is the gender of the user.

（想起志向管理部２４０の構成例） (Structure example of recall-oriented management unit 240)

ここで、グループ化をするにあたり、図１１に示すグループ化とは別に、図１３に示すような記憶を想起する単語の傾向を基にグループ化を行うという方法も用いることができる。 Here, in grouping, apart from the grouping shown in FIG. 11, a method of grouping based on the tendency of the words recalling the memory as shown in FIG. 13 can also be used.

グルーピング推定部２３１は、情報記憶部２２０の情報を用いて、グルーピング処理を実行する。例えば、グルーピング推定部２３１は、情報記憶部２２０の属性記憶部２２３に記憶された情報（年齢、性別など）を用いて、年齢が「３０代」であり、かつ「男性」であるユーザを１つのグループとする、年代が「２０代」であり、かつ、「女性」であるユーザを１つのグループとするようにグルーピングを行う。前記の方法に加え機械学習を用いることも可能である。例えば、属性情報や行動情報を基に、教師なし学習の一種であるクラスタリング分類を行うことでグルーピングを行ってもよい。 The grouping estimation unit 231 executes the grouping process by using the information of the information storage unit 220. For example, the grouping estimation unit 231 uses the information (age, gender, etc.) stored in the attribute storage unit 223 of the information storage unit 220 to select one user whose age is "30's" and "male". Grouping is performed so that users who are in their "twenties" and who are "female" are grouped into one group. It is also possible to use machine learning in addition to the above method. For example, grouping may be performed by performing clustering classification, which is a kind of unsupervised learning, based on attribute information and behavior information.

（言語知識記憶部２２６の構成例）
図１２は、本発明の実施形態に係る言語知識記憶部２２６の構成例を説明する図である。
言語知識記憶部２２６には、言語知識データが記憶される。
言語知識データは、単語、品詞、上位概念、下位概念のデータ項目を含む。
例えば、単語「六法全書」の品詞は「固有名詞」であり、上位概念は「本」、下位概念は「六法全書○○年版」が記憶されている。 (Structure example of language knowledge storage unit 226)
FIG. 12 is a diagram illustrating a configuration example of the language knowledge storage unit 226 according to the embodiment of the present invention.
Language knowledge data is stored in the language knowledge storage unit 226.
Linguistic knowledge data includes data items of words, part of speech, superordinate concepts, and subordinate concepts.
For example, the part of speech of the word "Rokuho Zensho" is "proper noun", the superordinate concept is "book", and the subordinate concept is "Rokuho Zensho XX year edition".

図１３は、本発明の実施形態に係る想起志向管理部２４０の構成例を示す図である。想起志向管理部２４０は、ユーザの想起志向を推定し、対話規則構築および対話管理の指針を決定する。想起志向管理部２４０は、想起解析部２４１、モデル構築部２４２を含んで構成される。 FIG. 13 is a diagram showing a configuration example of the recall-oriented management unit 240 according to the embodiment of the present invention. The recall-oriented management unit 240 estimates the user's recall orientation and determines the guidelines for dialogue rule construction and dialogue management. The recall-oriented management unit 240 includes a recall analysis unit 241 and a model construction unit 242.

想起解析部２４１は、対話履歴記憶部２２１から記憶想起に関するフレーズ（単語）の収集や、収集された情報（単語）を基に解析することで、対話履歴から想起に関する発話と判断されたものの中から環境情報に類する単語やフレーズを推定する。また、想起解析部２４１は、それらを特徴量へと変換しユーザあるいはグループごとに保持してもよい。ここで、記憶想起とは、脳内に保存された記憶の中から特定の記憶を思い出すプロセスである。特定の事象を認識しているにもかかわらず、その名称等を思い出すことができない場合には、記憶はしているが想起できない状態であるといえる。
想起解析部２４１は、対話履歴記憶部２２１に記憶された対話履歴を参照し、対話履歴に含まれるメッセージから、想起志向フレーズを抽出する。ここでは、メッセージに含まれる単語を抽出し、想起志向フレーズとして用いるようにしてもよい。
また、想起解析部２４１は、メッセージに含まれる単語を基に解析することで、対話履歴から想起に関する発話であるか否かの判断を行い、想起に関する発話であると判断されたメッセージから、環境記憶部２２２に記憶された環境に類似する環境にある単語やフレーズを抽出する。環境に類似する単語やフレーズは、メッセージがどの場所で発話されたか、メッセージが発話されたときの天候は何であったか、等の項目を参照し、対話履歴記憶部２２１に含まれるメッセージと関連性の高いと判断しうる単語やフレーズを抽出する。 The recall analysis unit 241 collects phrases (words) related to memory recall from the dialogue history storage unit 221 and analyzes the collected information (words) based on the collected information (words). Estimate words and phrases similar to environmental information from. Further, the recall analysis unit 241 may convert them into feature quantities and hold them for each user or group. Here, memory recall is a process of recalling a specific memory from the memories stored in the brain. If you are aware of a specific event but cannot remember its name, etc., you can say that you are in a state where you can remember but cannot recall it.
The recall analysis unit 241 refers to the dialogue history stored in the dialogue history storage unit 221 and extracts a recall-oriented phrase from the message included in the dialogue history. Here, the words included in the message may be extracted and used as a recall-oriented phrase.
In addition, the recall analysis unit 241 analyzes based on the words contained in the message, determines whether or not the utterance is related to recall from the dialogue history, and from the message determined to be the utterance related to recall, the environment. A word or phrase in an environment similar to the environment stored in the storage unit 222 is extracted. Words and phrases similar to the environment refer to items such as where the message was uttered and what the weather was when the message was uttered, and are related to the message contained in the dialogue history storage unit 221. Extract words and phrases that can be judged to be high.

また、想起解析部２４１は、想起志向フレーズを用いて探索を行った結果、ユーザにとって求める回答が得られたか否かに応じて、その探索に用いられた想起志向フレーズに対する想起志向についての信頼度の算出、または更新を行う。例えば、想起解析部２４１は、探索の結果、ユーザが求める回答が見つかった場合には、信頼度を維持または信頼できる度合いが上がるように信頼度を更新し、ユーザが求める回答が見つからなかった場合には、信頼できる度合いが下がるように信頼度を更新する。ユーザが求める回答が見つかったか否かについては、行動記憶部２２４に記憶された、想起志向フレーズに対応付けられたフィードバックが示す値を用いるようにしてもよい。 In addition, the recall analysis unit 241 performs a search using the recall-oriented phrase, and depending on whether or not a desired answer is obtained for the user, the reliability of the recall-oriented phrase for the recall-oriented phrase used in the search is high. Is calculated or updated. For example, when the recall analysis unit 241 finds the answer requested by the user as a result of the search, the reliability is updated so as to maintain the reliability or increase the degree of reliability, and when the answer requested by the user is not found. Update the reliability so that the reliability is reduced. As for whether or not the answer requested by the user is found, the value indicated by the feedback associated with the recall-oriented phrase stored in the action storage unit 224 may be used.

また、想起解析部２４１は、言語知識記憶部２２６に記憶された単語と品詞との関係を参照し、メッセージに含まれる代名詞に結びついている単語を抽出する。例えば、言語知識記憶部２２６には、単語と品詞との関係が記憶されている。想起解析部２４１は、言語知識記憶部２２６を参照することで、メッセージに含まれる単語の品詞を特定する。そして想起解析部２４１は、発話内容に含まれる単語が代名詞である場合、この代名詞である単語について、形態素解析や係り受け解析を行うことで、当該代名詞に結びついている単語や想起志向フレーズを抽出する。例えば、対話の中では、代名詞などを省略する場合も多い。そのため、「前に話した本の値段は？」という発話のあと「どこに売ってたっけ？」という発話があった場合、２つ目の発話を想起に関する発話とし、前後の対話から想起志向フレーズとして「前に（前回の対話）」「本（固有名詞）」というフレーズを紐付けることで、「前に話した本はどこに売ってたっけ？」という発話があったものとして利用することができる。このようにして、対話の中で、代名詞などが省略されて発話されていたとしても、連続あるいは連続していると思われる発言について、その前後の対話から想起志向フレーズとして用いるフレーズを紐付けることで、省略された代名詞等を補充することができる。 Further, the recall analysis unit 241 refers to the relationship between the word stored in the language knowledge storage unit 226 and the part of speech, and extracts the word associated with the pronoun included in the message. For example, the linguistic knowledge storage unit 226 stores the relationship between words and part of speech. The recall analysis unit 241 identifies the part of speech of the word included in the message by referring to the language knowledge storage unit 226. Then, when the word included in the utterance content is a pronoun, the recall analysis unit 241 extracts the word associated with the pronoun and the recall-oriented phrase by performing morphological analysis and dependency analysis on the word that is the pronoun. To do. For example, pronouns are often omitted in dialogues. Therefore, if there is an utterance "Where did you sell it?" After the utterance "What is the price of the book you talked about before?" By associating the phrases "before (previous dialogue)" and "book (proper noun)", it can be used as if there was an utterance "Where did you sell the book you talked about before?" it can. In this way, even if pronouns are omitted in the dialogue, the phrase used as a recall-oriented phrase is linked from the dialogue before and after the remark that seems to be continuous or continuous. Then, the abbreviated pronouns and the like can be supplemented.

モデル構築部２４２は、情報記憶部２２０の情報を基に、ユーザおよびグループごとの想起志向の推定に必要な数式やルールをモデルとして構築し、更新する。また、モデル構築部２４２は、ユーザの記憶想起に関するパラメータを推定するためのモデル（記憶想起推定モデル）を構築する。 Based on the information of the information storage unit 220, the model construction unit 242 constructs and updates mathematical formulas and rules necessary for estimating the recall orientation for each user and group as a model. In addition, the model building unit 242 builds a model (memory recall estimation model) for estimating parameters related to the user's memory recall.

また、モデル構築部２４２は、記憶想起推定モデルを構築するにあたり、機械学習や強化学習などによって学習された学習済みモデルを用いることもできる。その場合、モデル構築部２４２は、情報記憶部２２０に格納された情報を基に、パラメータ推定用の基底関数を準備し、情報記憶部２２０に格納された情報を教師データとして記憶想起推定モデルの最適化を行う。
ここで、最適化とは、想起志向に基づく想起志向フレーズを用いた探索によって、ユーザが求める回答が得られるように、想起志向フレーズが示す概念よりもより広い概念、より狭い概念、類する概念または異なる対象範囲の想起志向フレーズに変更する処理である。想起志向フレーズの変更には、想起志向フレーズが示す概念が広くなるように変更する場合、想起志向フレーズが示す対象範囲を変更する場合、類する想起志向フレーズとなるように想起志向フレーズを選択し直す場合、がある。 In addition, the model construction unit 242 can also use a trained model learned by machine learning, reinforcement learning, or the like when constructing a memory recall estimation model. In that case, the model construction unit 242 prepares a basis function for parameter estimation based on the information stored in the information storage unit 220, and uses the information stored in the information storage unit 220 as teacher data for the memory recall estimation model. Perform optimization.
Here, optimization is a concept broader, narrower, or similar than the concept indicated by the recall-oriented phrase so that the user can obtain the answer desired by searching using the recall-oriented phrase based on the recall-oriented phrase. It is a process to change to a recall-oriented phrase with a different target range. To change the recall-oriented phrase, when changing the concept indicated by the recall-oriented phrase to be broader, when changing the target range indicated by the recall-oriented phrase, reselecting the recall-oriented phrase so that it becomes a similar recall-oriented phrase. If there is.

モデル構築部２４２は、ユーザごとに特化したモデルである固有モデルと個人を特定せず類似したユーザ群に対して用いることが可能なモデルである汎用モデルといった複数のモデルを持つことが可能である。当該システムの利用を開始したばかりのユーザなどは、蓄積されたデータが少ないため、汎用モデルの中から最も類似した傾向を持つ汎用モデルを選択し処理を行うことも可能である。例えば、属性記憶部２２３に記憶された情報を参照し、ユーザＡ、ユーザＢ、ユーザＣ、・・・等のユーザ群のなかから、年齢、性別、所在地等の属性の少なくとも一部の属性を用いてグルーピングを行い、類似した傾向を持つユーザの汎用モデルを選択するようにしてもよい。また、想起志向に対する信頼度（想起志向「日時」を間違えることが多い、想起志向「価格」を間違えることがやや多い等、を元に求められる、ユーザが想起志向に対する認識の正しさの度合いである信頼度）に応じてグルーピングを行い、類似した信頼度のユーザのモデルを選択するようにしてもよい。
ここで、属性記憶部２２３は、想起志向フレーズと、その想起志向フレーズに対する信頼度とを、ユーザＩＤに対応づけて記憶している。この属性記憶部２２３に記憶される想起志向フレーズと信頼度は、想起志向管理部２４０の想起解析部２４１によって求められる。想起解析部２４１は、想起志向フレーズを用いて探索を行った結果、回答が見つかった場合には、信頼度を維持または信頼できる度合いが上がるように信頼度を更新し、一方で回答が見つからなかった場合には、その想起志向フレーズを信頼できる度合いが下がるように信頼度を更新する。このように、信頼度を更新することで、想起志向フレーズが信頼できる度合いを実際のユーザの言動や志向に近づけることができるので、想起志向（想起志向フレーズ）を最適化することが可能となる。 The model construction unit 242 can have a plurality of models such as a unique model which is a model specialized for each user and a general-purpose model which is a model that can be used for a similar user group without specifying an individual. is there. Since the accumulated data is small for users who have just started using the system, it is possible to select a general-purpose model having the most similar tendency from the general-purpose models and perform processing. For example, referring to the information stored in the attribute storage unit 223, at least a part of the attributes such as age, gender, and location can be selected from the user group such as user A, user B, user C, and so on. It may be used to group and select a general-purpose model of users with similar tendencies. In addition, the degree of correctness of the user's recognition of the recall orientation, which is required based on the degree of trust in the recall orientation (often mistaken for the recall orientation "date and time", a little more often for the recall orientation "price", etc. Grouping may be performed according to a certain reliability), and a model of a user with a similar reliability may be selected.
Here, the attribute storage unit 223 stores the recall-oriented phrase and the reliability of the recall-oriented phrase in association with the user ID. The recall-oriented phrase and the reliability stored in the attribute storage unit 223 are obtained by the recall analysis unit 241 of the recall-oriented management unit 240. When the recall analysis unit 241 searches using the recall-oriented phrase and finds an answer, the recall analysis unit updates the reliability so as to maintain the reliability or increase the reliability, while the answer is not found. If so, update the reliability so that the degree of reliability of the recall-oriented phrase decreases. By updating the reliability in this way, the degree of reliability of the recall-oriented phrase can be brought closer to the actual user's behavior and orientation, so that the recall-oriented phrase can be optimized. ..

モデル構築部２４２のモデルを用いた最適化の例としては、図１４に示すように、ユーザから「先週、Aさんが行ったお店はどこだっけ」といった、「先週」「Aさん」２つの想起志向フレーズを用いた想起発話が提示されたとする（符号（ａ））。 As an example of optimization using the model of the model construction unit 242, as shown in FIG. 14, "Last week" and "Mr. A" 2 such as "Where was the store that Mr. A went to last week?" It is assumed that a recall utterance using two recall-oriented phrases is presented (reference numeral (a)).

当該想起発話（想起フレーズを用いた発話）に関して環境記憶部２２２に該当する情報が存在しない場合、例えば、ユーザの過去の対話から、想起志向「日時」と想起志向「人物（メンバ）」のうち、想起志向「日時」の方が曖昧な部分が多い（信頼度が低い）ようであれば、モデル構築部２４２が、日時の部分をユーザが誤認している可能性を考慮し、想起志向「日時」に対応する想起志向フレーズである「先週」を例えば「２週前」に変更することで、「２週間前」が探索する対象の期間がとなるように期間を変更する。この「日時」に対応する想起志向フレーズが変更された後の発話内容を用いて、対話管理部２１２が対話処理を行う（符号ｂ）。
また、モデル構築部２４２は、要求発話に対する回答に該当する想起志向フレーズが対話履歴記憶部２２１に記憶されていなかった場合に最適化を行う他に、要求発話に対する回答に該当する想起志向フレーズが対話履歴記憶部２２１に記憶されていたとしても、ユーザの求める回答ではなかった場合に、最適化を行うようにしてもよい。
例えば、図１４（符号ａ）において、想起志向フレーズ「先週」を用いて探索を行って得られた回答をユーザの端末装置１０に送信し、端末装置１０からユーザの求める回答が得られなかったことを示す評価結果（フィードバック）が得られる場合がある。このような場合には、図１４（符号ｂ）に示すように、探索する対象範囲が「先週」から「２週間前」のように対象範囲を変更することで、最適化を行うようにしてもよい。
この最適化を行うことで再探索して得られた回答について、ユーザに再度評価してもらい、ユーザが求める回答が得られたことを示す評価結果が得られた場合には、「日時」と「人物（メンバ）」のうち、「日時」に関してユーザから提示された情報に誤りがあったと推定することができる。このような場合、当該ユーザの想起志向「日時」についての信頼度は低下する。そのため、想起解析部２４１は、要求発話を行ったユーザの想起志向「日時」に対する信頼度を現在よりも低い値に更新する。これにより対話管理部２１２は、次回探索する場合は、想起志向「日時」と想起志向「人物（メンバ）」のうち、最初の探索を行う際には、「日時」と「人物（メンバ）」のそれぞれの信頼度のうち、「日時」の信頼度が「人物（メンバ）」よりも低い場合には、「日時」を利用せずに「人物（メンバ）」の想起志向を用いて探索したり、あるいは、「日時」に対応する想起志向フレーズが示す期間が広くなるような想起志向フレーズに変更してから探索をする等の処理を行うこともできる。 When there is no information corresponding to the environmental memory unit 222 regarding the recalled utterance (utterance using the recalled phrase), for example, from the past dialogue of the user, among the recalled "date and time" and the recalled "person (member)". If the recall-oriented "date and time" seems to have more ambiguous parts (low reliability), the model construction unit 242 considers the possibility that the user misidentifies the date and time part, and the recall-oriented "date and time" By changing "last week", which is a recall-oriented phrase corresponding to "date and time", to, for example, "two weeks ago", the period is changed so that "two weeks ago" is the target period to be searched. The dialogue management unit 212 performs dialogue processing using the utterance content after the recall-oriented phrase corresponding to this "date and time" is changed (reference numeral b).
In addition, the model construction unit 242 optimizes when the recall-oriented phrase corresponding to the answer to the request utterance is not stored in the dialogue history storage unit 221, and the recall-oriented phrase corresponding to the answer to the request utterance is Even if it is stored in the dialogue history storage unit 221, if it is not the answer requested by the user, optimization may be performed.
For example, in FIG. 14 (reference numeral a), the answer obtained by performing the search using the recall-oriented phrase “last week” was transmitted to the user's terminal device 10, and the answer requested by the user was not obtained from the terminal device 10. Evaluation results (feedback) indicating that may be obtained. In such a case, as shown in FIG. 14 (reference numeral b), the target range to be searched is changed from "last week" to "two weeks ago" to perform optimization. May be good.
The answer obtained by re-searching by performing this optimization is evaluated again by the user, and when the evaluation result indicating that the answer requested by the user is obtained is obtained, it is referred to as "date and time". It can be estimated that there was an error in the information presented by the user regarding the "date and time" among the "persons (members)". In such a case, the reliability of the user's recall-oriented "date and time" is lowered. Therefore, the recall analysis unit 241 updates the reliability of the recall-oriented "date and time" of the user who made the request utterance to a value lower than the current value. As a result, the dialogue management unit 212 will select the "date and time" and the "person (member)" when performing the first search among the recall-oriented "date and time" and the recall-oriented "person (member)" when searching next time. If the reliability of "date and time" is lower than that of "person (member)", the search is performed using the recall orientation of "person (member)" without using "date and time". Alternatively, it is also possible to perform processing such as searching after changing to a recall-oriented phrase that widens the period indicated by the recall-oriented phrase corresponding to the "date and time".

また、想起志向に基づく探索範囲の別の変更方法としては、例えば想起志向「日時」に関する想起志向フレーズは固定して、想起志向「人物（メンバ）」に関する想起志向フレーズを、同日に対話した他のメンバを示す想起志向フレーズに置き換えたり（例えば、「Ａさん」ではなく同じグループに属する「Ｂさん」に置き換える）、あるいは変更対象の想起志向を複数用いることで複合的（日時と人物（メンバ）の両方）に置き換えを実施するということもできる。これにより、探索する対象範囲を広げたり、対象範囲を変更することができる。
また、評価結果を用いて信頼度を更新し、その信頼度を基に、探索する範囲を予め変更してから探索する場合には、ユーザが求める回答を得るために最短の経路で探索することが可能になる。 In addition, as another method of changing the search range based on the recall-oriented, for example, the recall-oriented phrase related to the recall-oriented "date and time" is fixed, and the recall-oriented phrase related to the recall-oriented "person (member)" is spoken on the same day. By replacing it with a recall-oriented phrase that indicates a member of (for example, replacing it with "Mr. B" who belongs to the same group instead of "Mr. A"), or by using multiple recall-oriented phrases to be changed (date and time and person (member) It can also be said that the replacement is carried out with both). As a result, the target range to be searched can be expanded or the target range can be changed.
In addition, when the reliability is updated using the evaluation result and the search range is changed in advance based on the reliability, the search is performed by the shortest route in order to obtain the answer requested by the user. Becomes possible.

なお、情報記憶部２２０には、ユーザが対話を行う度に、新たな対話情報が対話履歴記憶部２２１に蓄積されるとともに、この新たな対話に基づいて、環境記憶部２２２、属性記憶部２２３、行動記憶部２２４、グルーピング記憶部２２５に、新たな情報が追加されあり、記憶された情報が更新される。このため、モデル構築部２４２は、新たな情報を基に想起するデータのモデル（記憶想起推定モデル）の更新を行うことができる。更新頻度などは管理者が任意で設定できる。 In the information storage unit 220, new dialogue information is accumulated in the dialogue history storage unit 221 each time the user engages in a dialogue, and based on this new dialogue, the environment storage unit 222 and the attribute storage unit 223. , New information is added to the action storage unit 224 and the grouping storage unit 225, and the stored information is updated. Therefore, the model construction unit 242 can update the data model (memory recall estimation model) recalled based on the new information. The update frequency etc. can be set arbitrarily by the administrator.

また、モデル構築部２４２で構築する記憶想起推定モデルは単一のモデルや複数のモデルを組合せたり別途構築できるものとする。また対話規則構築部２５０においてもモデル構築部２４２で構築された記憶想起推定モデルを単一もしくは複数を用いた処理が可能とする。 Further, the memory recall estimation model constructed by the model construction unit 242 can be a single model or a combination of a plurality of models or can be constructed separately. Further, the dialogue rule construction unit 250 also enables processing using a single or a plurality of memory recall estimation models constructed by the model construction unit 242.

（対話規則構築部２５０の構成例）
図１５は、本発明の実施形態に係る対話規則構築部２５０の構成例を示す図である。対話規則構築部２５０は、想起志向管理部２４０の情報を活用し、対話規則の追加および修正を行うことで、対話規則を構築する。具体的には、対話規則構築部２５０は、発話構築部２５１と関係構築部２５２とを含んで構成される。
発話構築部２５１は、情報記憶部２２０を用いてユーザからの要求発話を構築する。
発話構築部２５１は、文章のテンプレート（ひな形）を予め記憶している。テンプレートは、例えば、「ＡのＢはＣですか？」という文章の形式を表すデータである。
関係構築部２５２は、要求発話と返答発話の紐付けを行う。 (Structure example of dialogue rule construction unit 250)
FIG. 15 is a diagram showing a configuration example of the dialogue rule construction unit 250 according to the embodiment of the present invention. The dialogue rule construction unit 250 constructs the dialogue rule by utilizing the information of the recall-oriented management unit 240 and adding and modifying the dialogue rule. Specifically, the dialogue rule building unit 250 includes a speech building unit 251 and a relationship building unit 252.
The utterance construction unit 251 constructs the requested utterance from the user by using the information storage unit 220.
The utterance construction unit 251 stores a sentence template (template) in advance. The template is, for example, data representing the format of the sentence "Is B of A C?".
The relationship building unit 252 links the request utterance and the response utterance.

本発明の実施形態に係る発話構築部２５１の一例としては、最もナイーブな方法として情報記憶部２２０の情報を基に、ユーザごとに対話履歴記憶部２２１の情報と紐付けられた環境記憶部２２２の情報を適用するという方法がある。例えば、セッションIDとユーザIDとの２つをキーとして探索することで紐付けられた情報を得ることができる。 As an example of the utterance construction unit 251 according to the embodiment of the present invention, as the most naive method, the environment storage unit 222 is associated with the information of the dialogue history storage unit 221 for each user based on the information of the information storage unit 220. There is a method of applying the information of. For example, the linked information can be obtained by searching using both the session ID and the user ID as keys.

前記のモデルを用いる場合、登録されている対話規則に対して環境情報に登録されている単語やフレーズでの置き換えを行う。例えば、「六法全書の値段は？」という対話規則に対して、環境情報として「日時」、「メンバ」の想起志向が関連付けられている場合、「昨日話した本の値段は？」「この前Aさんと話した本の値段は？」といった具合に、置き換えて登録するという方法が考えられる。例えば、発話構築部２５１は、「［Ｄ］［Ｅ］［Ｆ］の［Ｇ］は？」というテンプレートを用いる。テンプレートは、発話構築部２５１が予め記憶しておいてもよい。ここで、テンプレートは、［Ｄ］は想起志向「日時」、［Ｅ］は品詞「動詞」、［Ｆ］は品詞「名詞」、［Ｇ］は「本の固有名詞」のデータが対応付けられている。そして、このようなテンプレートに基づいて、環境情報の内容を、［Ｄ］、［Ｅ］、［Ｆ］、［Ｇ］のそれぞれに当てはめることで、「昨日話した本の値段は？」や、「先週調べた電車の発車時刻は？」という文章を構成し、「六法全書の値段は？」という文章から、この構成された文章に置き換えることができる。なお、発話構築部２５１は、返答と紐付いている単語については置き換えを行わない。例えば、単語「１，２００円」は「価格」に関する返答に該当し、単語「９：３０」は、「発車時刻」に関する返答に該当しうるため、このような単語は置き換えを行わない。発話構築部２５１は、置き換えにより生成した文章を、対話規則記憶部２２７に追加で記憶する。 When the above model is used, the registered dialogue rules are replaced with the words and phrases registered in the environmental information. For example, if the dialogue rule "What is the price of the Rokuho Zensho?" Is associated with the recall orientation of "date and time" and "member" as environmental information, "what is the price of the book I talked about yesterday?" A possible method is to replace and register, such as "What is the price of the book I talked to Mr. A?" For example, the utterance construction unit 251 uses the template "What is [G] of [D] [E] [F]?". The template may be stored in advance by the utterance construction unit 251. Here, in the template, [D] is associated with the data of the recall-oriented "date and time", [E] is associated with the part of speech "verb", [F] is associated with the part of speech "noun", and [G] is associated with the data of "book proper noun". ing. Then, based on such a template, by applying the contents of the environmental information to each of [D], [E], [F], and [G], "What is the price of the book I talked about yesterday?" You can compose the sentence "What is the departure time of the train you checked last week?" And replace the sentence "What is the price of the Rokuho Zensho?" With this composed sentence. The utterance construction unit 251 does not replace the word associated with the reply. For example, the word "1,200 yen" may correspond to a reply related to "price", and the word "9:30" may correspond to a reply related to "departure time", so such a word is not replaced. The utterance construction unit 251 additionally stores the sentence generated by the replacement in the dialogue rule storage unit 227.

図１６は、対話規則記憶部２２７に記憶された対話規則データの一例を示す図である。
対話規則データは、対話規則と想起志向とを記憶する。例えば、対話規則データの一例として、対話規則「六法全書の値段は？」に対して、想起志向「日時」と「メンバ」が対応付けている。また、対話規則「1，200円です」に対して、想起志向「価格」が対応付けられ記憶されている。 FIG. 16 is a diagram showing an example of dialogue rule data stored in the dialogue rule storage unit 227.
The dialogue rule data stores the dialogue rule and the recall orientation. For example, as an example of dialogue rule data, a recall-oriented "date and time" and a "member" are associated with the dialogue rule "What is the price of the Six Codes?". In addition, the recall-oriented "price" is associated with the dialogue rule "1,200 yen" and is stored.

発話構築部２５１は、前記の方法で対話規則を構築する場合、情報処理部２１０の構成要素である言語知識記憶部２２６に登録されている単語の上位概念や下位概念を表すオントロジあるいはシソーラスなどのデータを用いて環境情報とは別に置き換えを行うことも可能とする。例えば、発話構築部２５１は、対話規則「六法全書の値段は？」に含まれる単語のうち、「六法全書」に対応する上位概念の単語を、言語知識記憶部２２６を参照することで、上位概念の単語「本」を得ることができる。発話構築部２５１は、この得られた単語を用い、対話規則「六法全書の値段は？」のうち、「六法全書」を「本」に置き換えることで、対話規則「本の値段は？」を生成することができる。発話構築部２５１は、生成した対話規則を対話規則記憶部２２７に追加で記憶する。 When the speech construction unit 251 constructs a dialogue rule by the above method, the utterance construction unit 251 may have an ontroge or a thesaurus that represents a superordinate concept or a subordinate concept of a word registered in the language knowledge storage unit 226, which is a component of the information processing unit 210. It is also possible to use the data to replace it separately from the environmental information. For example, the utterance construction unit 251 ranks the words of the higher concept corresponding to the "Rokuho Zensho" among the words included in the dialogue rule "Rokuho Zensho" by referring to the language knowledge storage unit 226. You can get the concept word "book". The utterance construction department 251 uses the obtained words to replace "Rokuho Zensho" with "book" in the dialogue rule "Rokuho Zensho?" To change the dialogue rule "Book price?" Can be generated. The utterance construction unit 251 additionally stores the generated dialogue rule in the dialogue rule storage unit 227.

また、前記のように対話規則を記憶することで、木探索などを用いたテンプレートマッチングを行うことが可能となるが、具体的な文章の形式にせずとも、過去の対話履歴を学習データとして識別器を作成することで、テンプレートマッチングと同等の処理を行うことも可能である。 In addition, by storing the dialogue rules as described above, it is possible to perform template matching using tree search or the like, but the past dialogue history can be identified as learning data without using a specific sentence format. By creating a vessel, it is possible to perform the same processing as template matching.

関係構築部２５２は、対話履歴記憶部２２１に記憶されたメッセージから、発話構築部２５１によって構築された要求発話の返答に該当する返答発話の紐付けを行う。
例えば、関係構築部２５２は、対話履歴記憶部２２１に記憶されたメッセージ「この前、Ａさんと話した本の値段は？」から、発話構築部２５１によって構築された要求発話の返答に該当する「この前」「Aさん」「本」という内容から環境記憶部２２２を探索（検索）すると、本がピックアップされる。順番にどの本かを確認し，利用者からのフィードバックを基に本を一意に決定する。すると要求発話は「〇〇（という本）の値段は？」に変換され登録されている要求と返答の対話ルールとして登録されていれば値段を引き出すことが可能となる。 The relationship building unit 252 associates the message stored in the dialogue history storage unit 221 with the response utterance corresponding to the response of the request utterance constructed by the utterance building unit 251.
For example, the relationship building unit 252 corresponds to the response of the request utterance constructed by the utterance building unit 251 from the message "What is the price of the book that was talked to Mr. A the other day?" Stored in the dialogue history storage unit 221. When the environmental storage unit 222 is searched (searched) from the contents of "before", "Mr. A", and "book", the book is picked up. Check which books are in order, and uniquely determine the books based on the feedback from users. Then, the request utterance is converted into "What is the price of XX (book)?", And if it is registered as a dialogue rule between the registered request and response, the price can be withdrawn.

（システムの動作）
次に、情報提示システムの動作の一例図面を参照しながら説明する。 (System operation)
Next, an example of the operation of the information presentation system will be described with reference to the drawings.

図１７は、本発明の実施形態に係る情報提示システムの情報提示処理の動作の一例を示すフローチャートである。 FIG. 17 is a flowchart showing an example of the operation of the information presentation process of the information presentation system according to the embodiment of the present invention.

対話開始時に、情報提示装置２０は、対話内容を収集する端末装置１０に割り当てられた識別情報を元に、ユーザＩＤを特定し（ステップＳ１０１）、そのユーザＩＤに基づいて対話履歴記憶部２２１を参照し、ユーザＩＤに対応付けられたメッセージがあるか否かを元に、過去に当該システムの利用履歴があるか否かの判定を行う（ステップＳ１０２）。また、情報提示装置２０は、利用履歴があったとしても、信頼度のスコアがまだ算出されていない、まだ想起フレーズを用いたことがない、システムからの評価要求に答えず信頼度を算出できない、のいずれかに該当する場合には、不十分であるとして、利用履歴がないと判断してもよい。情報提示装置２０は、過去に参加した履歴があれば想起志向に関する固有モデルと汎用モデルを読み込み（ステップＳ１０３）、利用履歴がない場合は、固有モデルが形成されていないため、汎用モデルのみを読み込む（ステップＳ１０４）。 At the start of the dialogue, the information presenting device 20 identifies the user ID based on the identification information assigned to the terminal device 10 that collects the dialogue contents (step S101), and the dialogue history storage unit 221 is stored based on the user ID. Based on whether or not there is a message associated with the user ID with reference to the user ID, it is determined whether or not there is a usage history of the system in the past (step S102). Further, even if the information presenting device 20 has a usage history, the reliability score has not been calculated yet, the recall phrase has not been used yet, and the reliability cannot be calculated without responding to the evaluation request from the system. If any of the above is true, it may be determined that there is no usage history because it is considered insufficient. The information presenting device 20 reads the unique model and the general-purpose model related to recall orientation if there is a history of participation in the past (step S103), and reads only the general-purpose model if there is no usage history because the unique model is not formed. (Step S104).

モデルデータ読み込み後、情報提示装置２０は、入力待機状態となりユーザからのアクション（発話）がされたか否かを判定する（ステップＳ１０５）。発話を取得すると（ステップＳ１０５−ＹＥＳ）、対話処理に移り（ステップＳ１０６）、発話内容を対話履歴記憶部２２１に記憶し、一方、発話を取得していない場合には（ステップＳ１０５−ＮＯ）、一定時間経過後に、ステップＳ１０５の判定を再度行う。対話処理が開始されると情報提示装置２０は、想起志向を用いた対話か否かを解析部２１２１によって判定する（ステップＳ１０７）。想起志向を用いた対話ではない場合（ステップＳ１０７−ＮＯ）、解析部２１２１は、発話内容を対話履歴記憶部２２１に記憶し（ステップＳ１４０）、ステップＳ１０９に移行する。
一方、想起志向を用いた対話である場合（ステップＳ１０７−ＹＥＳ）、行動選択部２１２２は、対話内容に含まれる要求情報に対応する返答情報を、環境記憶部２２２から探索し（ステップＳ１０８）、探索の結果、合致する内容があれば（ステップＳ１０９−ＹＥＳ）、合致する内容を返答情報として取得する。返答情報が取得されると、生成部２１２３が、返答情報を元に応答内容を決定する（ステップＳ１１０）。応答内容が決定されると、情報提示装置２０は、応答内容に応じた返答情報に基づくコンテンツを出力する（ステップＳ１１１）。
情報提示装置２０は、応答内容（コンテンツ）に対するユーザからの評価（ユーザの求める回答が得られたか否かを示す情報）を端末装置１０から取得し（ステップＳ１１２）、評価結果を行動情報として行動記憶部２２４に書き込むことで更新する（ステップＳ１１３）。なお、ここでは、ユーザが求める回答が得られたか否かの判定を行い、ユーザがもとめる回答が得られた場合に処理を終了し、ユーザが求める回答が得られなかった場合には、ステップS１５０に移行するようにしてもよい。 After reading the model data, the information presenting device 20 enters the input standby state and determines whether or not an action (utterance) has been performed by the user (step S105). When the utterance is acquired (step S105-YES), the dialogue processing is started (step S106), the utterance content is stored in the dialogue history storage unit 221, and on the other hand, when the utterance is not acquired (step S105-NO), After a certain period of time has elapsed, the determination in step S105 is performed again. When the dialogue processing is started, the information presenting device 20 determines whether or not the dialogue is a recall-oriented dialogue by the analysis unit 2121 (step S107). When the dialogue does not use the recall orientation (step S107-NO), the analysis unit 2121 stores the utterance content in the dialogue history storage unit 221 (step S140), and proceeds to step S109.
On the other hand, in the case of the dialogue using the recall orientation (step S107-YES), the action selection unit 2122 searches for the response information corresponding to the request information included in the dialogue content from the environmental storage unit 222 (step S108). As a result of the search, if there is a matching content (step S109-YES), the matching content is acquired as response information. When the response information is acquired, the generation unit 2123 determines the response content based on the response information (step S110). When the response content is determined, the information presenting device 20 outputs the content based on the response information according to the response content (step S111).
The information presenting device 20 acquires an evaluation from the user (information indicating whether or not the answer requested by the user has been obtained) for the response content (content) from the terminal device 10 (step S112), and acts using the evaluation result as action information. It is updated by writing to the storage unit 224 (step S113). Here, it is determined whether or not the answer requested by the user has been obtained, the process is terminated when the answer requested by the user is obtained, and when the answer requested by the user is not obtained, step S150. You may want to move to.

一方、探索の結果、合致する内容がない場合（ステップＳ１０９−ＮＯ）、行動選択部２１２２は、モデル構築部２４２によって最適化を行うことで、想起志向管理部２４０にて構築された想起志向のモデルを用いて最適な応答内容を決定する（ステップＳ１５０）。 On the other hand, if there is no matching content as a result of the search (step S109-NO), the action selection unit 2122 is optimized by the model construction unit 242 to be recollection-oriented and constructed by the recollection-oriented management unit 240. The optimum response content is determined using the model (step S150).

ここで、最適化処理では、必要であれば提示形態の決定および適用を実施し、応答内容を決定する。応答内容が決定されると、応答内容を元にしたコンテンツをユーザの端末装置１０へ送信する（ステップＳ１５１）。送信した結果を基に対話情報に応答内容を追加することで更新するとともに、想起志向フレーズ及び実施内容を行動記憶部２２４に書き込むことで更新し（ステップＳ１５３）、ステップＳ１０５の入力待機へと戻る。 Here, in the optimization process, if necessary, the presentation form is determined and applied, and the response content is determined. When the response content is determined, the content based on the response content is transmitted to the user's terminal device 10 (step S151). Based on the transmitted result, the response content is updated by adding the response content to the dialogue information, and the recall-oriented phrase and the implementation content are updated by writing in the action memory unit 224 (step S153), and the process returns to the input standby in step S105. ..

なお、ステップＳ１５０において、対話処理において、想起による参照と判定されたにもかかわらず、該当する想起単語およびフレーズと合致する対話規則が存在しない場合、固有モデルおよび汎用モデルを用いる。これらは想起の傾向をモデル化したものであり、場所や人物（メンバ）、日時などの記憶違いを考慮して「ひょっとしたらXXXですか」というようにシステム側から提示を行う。例えば、「先週、Ａさんが言ってたお店どこだっけ？」という要求発話に対する対話規則がなければ、固有モデルや汎用モデルを用いることで、単語の上位概念化や別の概念に置き換えることで「ひょっとしたら２週前ですか」というような応答情報を元にしたコンテンツを端末装置１０に送信する。 In step S150, when it is determined that the reference is by recall in the dialogue process, but there is no dialogue rule that matches the corresponding recall word and phrase, the unique model and the general-purpose model are used. These are models of the tendency of recall, and the system side presents such as "Maybe XXX?" In consideration of memory differences such as location, person (member), date and time. For example, if there is no dialogue rule for the request utterance "Where is the store that Mr. A said last week?", By using a unique model or a general-purpose model, you can replace it with a higher-level conceptualization of words or another concept. Content based on response information such as "maybe two weeks ago?" Is transmitted to the terminal device 10.

前記の処理を行った場合、システム側の行動として行動情報を更新する。なお、この行動を行った際は対話の区切りで「あっていましたか？、はい／いいえ」といった提示を行いユーザからのフィードバックを取得する。 When the above processing is performed, the action information is updated as an action on the system side. When this action is taken, the user's feedback is obtained by presenting "Did you meet ?, Yes / No" at the break of the dialogue.

行動情報および対話情報を更新後、想起志向のモデル更新を逐次処理で行うこともできる。また、マシンスペックによってはバッチ処理で行うことも可能とする。 After updating the behavior information and the dialogue information, the recall-oriented model can be updated sequentially. Also, depending on the machine specifications, it is possible to perform batch processing.

上述した実施形態における端末装置１０または情報提示装置２０をコンピュータで実現するようにしてもよい。その場合、この機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することによって実現してもよい。なお、ここでいう「コンピュータシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含んでもよい。また上記プログラムは、前述した機能の一部を実現するためのものであってもよく、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであってもよく、ＦＰＧＡ（Field Programmable Gate Array）等のプログラマブルロジックデバイスを用いて実現されるものであってもよい。 The terminal device 10 or the information presentation device 20 in the above-described embodiment may be realized by a computer. In that case, the program for realizing this function may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read by the computer system and executed. The term "computer system" as used herein includes hardware such as an OS and peripheral devices. Further, the "computer-readable recording medium" refers to a portable medium such as a flexible disk, a magneto-optical disk, a ROM, or a CD-ROM, or a storage device such as a hard disk built in a computer system. Further, a "computer-readable recording medium" is a communication line for transmitting a program via a network such as the Internet or a communication line such as a telephone line, and dynamically holds the program for a short period of time. It may also include a program that holds a program for a certain period of time, such as a volatile memory inside a computer system that serves as a server or a client in that case. Further, the above program may be for realizing a part of the above-mentioned functions, and may be further realized for realizing the above-mentioned functions in combination with a program already recorded in the computer system. It may be realized by using a programmable logic device such as FPGA (Field Programmable Gate Array).

以上、この発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 Although the embodiments of the present invention have been described in detail with reference to the drawings, the specific configuration is not limited to this embodiment, and includes designs and the like within a range that does not deviate from the gist of the present invention.

１…情報提示システム、１０，１０ａ，１０ｂ，１０ｃ，１０ｄ…端末装置、２０，２０ａ，２０ｂ，２０ｃ，２０ｄ…情報提示装置、３０…通信ネットワーク、１１０…入力部、１１２…取得部、１２０…インタフェース部、１３０…記憶部、１４０…センサ部、１５０…出力部、１６０…通信部、１６１…送信部、１６２…受信部、１７０…制御部、２１０…情報処理部、２１１…取得部、２１２…対話管理部、２１３…提示制御部、２１４…合成部、２２０…情報記憶部、２２１…対話履歴記憶部、２２２…環境記憶部、２２３…属性記憶部、２２４…行動記憶部、２２５…グルーピング記憶部、２２６…言語知識記憶部、２２７…対話規則記憶部、２３１…グルーピング推定部、２４０…想起志向管理部、２４１…想起解析部、２４２…モデル構築部、２５０…対話規則構築部、２５１…発話構築部、２５２…関係構築部、１０００…端末装置、２１２１…解析部、２１２２…行動選択部、２１２３…生成部 1 ... Information presentation system, 10, 10a, 10b, 10c, 10d ... Terminal device, 20, 20a, 20b, 20c, 20d ... Information presentation device, 30 ... Communication network, 110 ... Input unit, 112 ... Acquisition unit, 120 ... Interface unit, 130 ... storage unit, 140 ... sensor unit, 150 ... output unit, 160 ... communication unit, 161 ... transmission unit, 162 ... receiver unit, 170 ... control unit, 210 ... information processing unit, 211 ... acquisition unit, 212 ... Dialogue management unit, 213 ... Presentation control unit, 214 ... Synthesis unit, 220 ... Information storage unit, 221 ... Dialogue history storage unit, 222 ... Environmental storage unit, 223 ... Attribute storage unit, 224 ... Behavioral memory unit, 225 ... Grouping Memory unit, 226 ... Language knowledge memory unit, 227 ... Dialogue rule storage unit, 231 ... Grouping estimation unit, 240 ... Recollection-oriented management unit, 241 ... Recollection analysis unit, 242 ... Model construction unit, 250 ... Dialogue rule construction unit, 251 ... Speaking construction unit, 252 ... Relationship building unit, 1000 ... Terminal device, 2121 ... Analysis unit, 2122 ... Action selection unit, 2123 ... Generation unit

Claims

It is an information presentation system that enables dialogue rule construction and dialogue control based on the recall orientation that expresses the intention of memory recall based on the user's dialogue history and environmental information.
A dialogue management unit that performs dialogue processing on the terminal device used by the user,
A dialogue history storage unit that stores dialogue history, which is the history of dialogue,
An environmental memory unit that stores the environment in which the dialogue was conducted, and the recall-oriented phrases and recall-oriented phrases contained in the dialogue.
When the concept of the recall-oriented phrase included in the user's request utterance obtained by performing the dialogue process and the concept of the recall-oriented phrase corresponding to the recall-oriented phrase are changed in order to obtain the response content to the requested utterance. Implementation content that expresses the content searched by changing the recall orientation, and
Feedback indicating whether or not the response content was satisfied with the information desired by the user, and
Behavior memory unit that memorizes
In order to obtain information including the response content that the user wants, the recall-oriented phrase indicated by the user's recall-oriented phrase is based on the recall phrase stored in the action memory unit, the implementation content, and the feedback. The recall-oriented management department that calculates the reliability, which indicates the degree of reliability,
Information presentation system equipped with.

The recall-oriented management department
The information presentation system according to claim 1, wherein a recall-oriented phrase corresponding to environmental information similar to the environment of the dialogue is acquired based on a recall-oriented phrase included in the dialogue stored in the dialogue history storage unit.

Claim 1 or claim 1 having a dialogue rule construction unit that generates a dialogue rule by applying a word included in the dialogue content stored in the dialogue history storage unit to a sentence template and a word stored in the environment storage unit. The information presentation system according to claim 2.

The dialogue management department
The information presentation system according to any one of claims 1 to 3, which determines a response utterance to a request utterance according to the recall orientation calculated by the recall orientation management unit.

The information presentation system according to any one of claims 1 to 4, wherein the environmental storage unit stores ambient environmental information including members, weather, place, and date and time.

Claims 1 to 1 include an analysis unit that obtains numerical data according to the dialogue content by using any of morphological analysis, keyword extraction, and vectorization for the dialogue content stored in the dialogue history storage unit. The information presentation system according to any one of 5.

It has an attribute storage unit that stores demographic data including the user's age, gender, and place of residence, and recall orientation.
The recall-oriented management unit performs the recall-oriented optimization process by changing the recall-oriented phrase of the request utterance so that the answer requested by the user can be obtained by using the information stored in the attribute storage unit. The information presentation system according to any one of claims 1 to 6.

The information presentation system according to any one of claims 1 to 7, wherein the recall-oriented management unit writes and stores the reliability of the recall-oriented in the attribute storage unit for each user.

The information according to any one of claims 1 to 8, wherein the recall-oriented management unit updates the reliability of the recall-oriented based on the search result and feedback stored in the behavior memory unit. Presentation system.

Grouping estimation unit that groups based on user characteristics,
A grouping storage unit that stores the results grouped by the grouping estimation unit, and a grouping storage unit.
From claim 1, which includes a model construction unit that uses a model for estimating the recall orientation of users who have a common recall orientation of users to be changed based on the grouping result stored in the information of the grouping storage unit. The information presentation system according to any one of claims 9.

The information presentation system according to claim 10, wherein a model is used in which dialogue rule construction and dialogue management are each modified from the grouping storage unit according to the recall orientation common to the users.

The recall-oriented management department
Claims 1 to generate a model for optimizing dialogue rule construction and dialogue management for a user whose model is not prepared or insufficient, using a model having a tendency similar to that of the user. The information presentation system according to any one of 11.

The information presentation system according to any one of claims 1 to 12, which has a compositing unit that generates content by processing information including response content.

The synthesis part
The information presentation system according to claim 13, further comprising a structure for generating contents that have been processed differently according to a destination terminal device.

It is an information presentation method in an information presentation system that can construct dialogue rules and control dialogue based on the recall orientation that expresses the intention of memory recall based on the user's dialogue history and environmental information.
The information presentation system
A dialogue history storage unit that stores dialogue history, which is the history of dialogue,
An environmental memory unit that stores the environment in which the dialogue was conducted, and the recall-oriented phrases and recall-oriented phrases contained in the dialogue.
When the concept of the recall-oriented phrase included in the user's request utterance obtained by performing the dialogue process and the concept of the recall-oriented phrase corresponding to the recall-oriented phrase are changed in order to obtain the response content to the requested utterance, the recall is concerned. Implementation content that represents the content searched by changing the orientation, and
Feedback indicating whether or not the response content was satisfied with the information desired by the user, and
Behavior memory unit that memorizes
Have,
The dialogue management department performs dialogue processing on the terminal device used by the user,
Based on the recall phrase stored in the action memory unit, the implementation content, and the feedback, the recall-oriented management unit determines the response content that the user wants as the recall-oriented phrase indicated by the user's recall-oriented phrase. An information presentation method that calculates the degree of reliability that indicates the degree of reliability in order to obtain the information that is included.