JP2022025917A

JP2022025917A - Dialog device

Info

Publication number: JP2022025917A
Application number: JP2020129102A
Authority: JP
Inventors: 知香阿座上; Chika Azagami; 洋樹浅井; Hiroki Asai; 央倉沢; Hiroshi Kurasawa
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2020-07-30
Filing date: 2020-07-30
Publication date: 2022-02-10

Abstract

To provide a dialog device capable of making responses corresponding to the degree of growth of dialog with users.SOLUTION: A dialog device 100 includes: an utterance receiving unit 11 that accepts an utterance from a user; and a state determination unit 14 that calculates the degree of growth for dialog with the user based on a user's utterance history, and acquires a response sentence corresponding to the degree of growth for the utterance from the user. This configuration allows response sentences to be changed according to the degree of growth of dialogue with the user. Therefore, the more the dialog device 100 grows, the more the user can receive corresponding response sentences, and feeling that the dialogue is mechanical is reduced.SELECTED DRAWING: Figure 1

Description

本発明は、対話装置に関する。 The present invention relates to a dialogue device.

近年、対話エージェントが普及してきており、一般的に対話エージェントを実現するための要素技術としてＡＩＭＬ（Artificial Intelligence Markup Language）があげられる。ＡＩＭＬとは、ＸＭＬを応用した記述言語であり、ＡＩＭＬを用いる対話エージェントは、入力された発話に対して、事前に記述されたルールで応答を返信する。また、特許文献１に記載されているように、対象者の状態に応じた的確な応答を行う技術が知られている。 In recent years, dialogue agents have become widespread, and AIML (Artificial Intelligence Markup Language) is generally mentioned as an elemental technology for realizing dialogue agents. AIML is a descriptive language to which XML is applied, and a dialogue agent using AIML returns a response to an input utterance according to a pre-written rule. Further, as described in Patent Document 1, a technique for performing an accurate response according to the state of the subject is known.

特開２０１５-１４８３４号公報JP-A-2015-14834

ＡＩＭＬを用いる対話エージェントでは、事前にＡＩＭＬに記述されたルール通りに応答を返すため、誰に対しても同じ応答をしてしまう。また、パターンをいくつか用意していても、ユーザがしばらく対話エージェントと対話すると応答パターンがわかってしまう。また、特許文献１に記載の技術においても、その応答がパターン化され、同じような応答が繰り返されることが考えられる。したがって、ユーザは対話エージェントの応答が機械的と感じてしまい愛着が持てず利用継続しないことが考えられる。 Since the dialogue agent using AIML returns a response according to the rules described in AIML in advance, the same response is given to everyone. Also, even if some patterns are prepared, the response pattern will be known when the user interacts with the dialogue agent for a while. Further, also in the technique described in Patent Document 1, it is conceivable that the response is patterned and the same response is repeated. Therefore, it is conceivable that the user feels that the response of the dialogue agent is mechanical and cannot be attached to it and does not continue to use it.

そこで、上述の課題を解決するために、本発明は、ユーザとの対話に対する成長度合いに応じた応答を行うことができる対話装置を提供することを目的とする。 Therefore, in order to solve the above-mentioned problems, it is an object of the present invention to provide a dialogue device capable of responding to a dialogue with a user according to the degree of growth.

本発明の対話装置は、ユーザからの発話を受け付ける発話受付部と、前記ユーザの発話履歴に基づいて前記ユーザとの対話に対する成長度を算出する成長度算出部と、前記ユーザからの発話に対して、前記成長度に応じた応答文を取得する応答文取得部と、を備える。 The dialogue device of the present invention has an utterance receiving unit that accepts utterances from the user, a growth degree calculation unit that calculates the growth degree for the dialogue with the user based on the utterance history of the user, and a speech from the user. A response sentence acquisition unit for acquiring a response sentence according to the degree of growth is provided.

この構成によれば、成長度に応じた応答文を取得することができ、ユーザは、機械的な対話と感じることなく、愛着を持って対話装置の利用を継続することができる。 According to this configuration, it is possible to acquire a response sentence according to the degree of growth, and the user can continue to use the dialogue device with attachment without feeling that it is a mechanical dialogue.

本発明によると、ユーザは、機械的な対話と感じることなく、愛着を持って対話装置の利用を継続することができる。 According to the present invention, the user can continue to use the dialogue device with attachment without feeling that it is a mechanical dialogue.

本開示の実施形態に係る対話装置１０を含む対話システム１の機能ブロック図である。It is a functional block diagram of the dialogue system 1 including the dialogue apparatus 10 which concerns on embodiment of this disclosure. ユーザ端末２０を介してユーザに提示される受付画面Ｐの一例を示す図である。It is a figure which shows an example of the reception screen P presented to a user through a user terminal 20. 第１メモリ１０ａに記憶されている解析結果のデータベースを示す図である。It is a figure which shows the database of the analysis result stored in the 1st memory 10a. 発話を形態素解析して、極性反転をする処理を示す模式図である。It is a schematic diagram which shows the process which performs the polarity inversion by morphological analysis of an utterance. 第２メモリ１０ｂに記憶されているデータベースの具体例を示す図である。It is a figure which shows the specific example of the database stored in the 2nd memory 10b. ユーザ模倣部１３の動作を示す模式図である。It is a schematic diagram which shows the operation of the user imitation unit 13. 状態決定部１４の動作を示す模式図である。It is a schematic diagram which shows the operation of the state determination part 14. 第３メモリ１０ｃに記憶されているデータベースの具体例を示す図である。It is a figure which shows the specific example of the database stored in the 3rd memory 10c. 成長記録に対応する情報のデータベースを示す図である。It is a figure which shows the database of the information corresponding to the growth record. 対話装置１０の動作（前半）を示すフローチャートである。It is a flowchart which shows the operation (first half) of a dialogue device 10. 対話装置１０の動作（後半）を示すフローチャートである。It is a flowchart which shows the operation (the latter half) of the dialogue apparatus 10. 本開示の一実施の形態に係る対話装置１００のハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware composition of the dialogue apparatus 100 which concerns on one Embodiment of this disclosure.

添付図面を参照しながら本発明の実施形態を説明する。可能な場合には、同一の部分には同一の符号を付して、重複する説明を省略する。 An embodiment of the present invention will be described with reference to the accompanying drawings. When possible, the same parts are designated by the same reference numerals, and duplicate description is omitted.

図１は、本開示の実施形態に係る対話装置１０を含む対話システム１の機能ブロック図である。この対話装置１０は、例えば、ユーザ端末２０と通信可能に構成されたサーバ装置である。対話装置１０は、１台の装置によって構成されても良いし、複数台の装置によって構成されても良い。ユーザ端末２０は、例えば、ユーザにより操作されるコンピュータである。本開示では、一例として、ユーザとユーザ端末２０とは一対一で対応しており、ユーザ端末２０は、ユーザに所有されるスマートフォンである。ただし、ユーザ端末２０の形態は特定の形態に限定されない。ユーザ端末２０の他の例としては、タブレット端末、ウェアラブル端末、パーソナルコンピュータなどがあげられる。また、ユーザ端末２０は、複数のユーザによって共有されても良い。 FIG. 1 is a functional block diagram of a dialogue system 1 including a dialogue device 10 according to an embodiment of the present disclosure. The dialogue device 10 is, for example, a server device configured to be able to communicate with the user terminal 20. The dialogue device 10 may be configured by one device or may be configured by a plurality of devices. The user terminal 20 is, for example, a computer operated by the user. In the present disclosure, as an example, the user and the user terminal 20 have a one-to-one correspondence, and the user terminal 20 is a smartphone owned by the user. However, the form of the user terminal 20 is not limited to a specific form. Other examples of the user terminal 20 include a tablet terminal, a wearable terminal, a personal computer, and the like. Further, the user terminal 20 may be shared by a plurality of users.

対話装置１００は、ユーザ端末２０からの発話に応じて応答文を提供することで対話サービスを提供する機能を有している。対話サービスは、ユーザにより入力された発話を受け付け、当該ユーザに応じた応答をユーザに提示するサービスである。図２は、上述した対話サービスにおいて、ユーザ端末２０を介してユーザに提示される受付画面Ｐの一例を示す図である．受付画面Ｐは、例えばユーザ端末２０のディスプレイに表示される複数のアイコンのうち対話サービスに対応するアイコンがユーザのタッチ操作等によって選択されることにより起動する。 The dialogue device 100 has a function of providing a dialogue service by providing a response sentence in response to an utterance from the user terminal 20. The dialogue service is a service that accepts an utterance input by a user and presents a response corresponding to the user to the user. FIG. 2 is a diagram showing an example of a reception screen P presented to the user via the user terminal 20 in the above-mentioned dialogue service. The reception screen P is activated by, for example, selecting an icon corresponding to the dialogue service from a plurality of icons displayed on the display of the user terminal 20 by a user's touch operation or the like.

その後、ユーザは、発話をユーザ端末２０に入力する。例えば、ユーザは受付画面の下部に配置されたテキスト入力ウィンドウＴにテキストを入力することにより、テキストデータ形式の発話を入力しても良い。 After that, the user inputs the utterance to the user terminal 20. For example, the user may input an utterance in a text data format by inputting text in the text input window T arranged at the bottom of the reception screen.

或いは、ユーザはテキスト入力ウィンドウの右側に配置された音声入力アイコンＶを選択して音声受付機能を起動し、当該音声受付機能を用いて音声入力を行うことにより、音声データ形式の発話を入力しても良い。 Alternatively, the user selects the voice input icon V located on the right side of the text input window to activate the voice reception function, and inputs the voice using the voice reception function to input the utterance in the voice data format. May be.

図２では、対話サービスは、ユーザからメッセージＵＤ：「今日暑いね」という発話が入力された場合、当該発話と過去の発話の解析結果に基づいてメッセージＭ：「毎日暑くて嫌になっちゃう」などの応答文を返している。 In FIG. 2, when the user inputs a message UD: "It's hot today", the dialogue service has a message M: "It's hot every day and I don't like it" based on the analysis result of the utterance and the past utterances. Is returning a response statement such as.

なお、ユーザ端末２０は、ユーザに対話サービスを提供する機能を有してもよい。ユーザ端末２０は、対話装置１００からの対話サービスを受けるための公知の機能を有しているが、ユーザ端末２０がこれら対話サービスの機能を有してもよい。例えばユーザ端末２０は、ユーザの発話を受け付ける機能と、発話を形態素解析等により解析する機能と、発話の解析結果に応じて適切な応答文を選択する機能と、応答文をユーザに提示する機能を有してもよい。 The user terminal 20 may have a function of providing a dialogue service to the user. Although the user terminal 20 has a known function for receiving the dialogue service from the dialogue device 100, the user terminal 20 may have the function of these dialogue services. For example, the user terminal 20 has a function of accepting a user's utterance, a function of analyzing the utterance by morphological analysis, a function of selecting an appropriate response sentence according to the analysis result of the utterance, and a function of presenting the response sentence to the user. May have.

また、ユーザ端末２０は、上述した全ての機能を備えていなくても良い。例えば、一部の機能（例えばユーザの発話を解析する機能）は、ユーザ端末２０と通信可能に構成された当該ユーザ端末２０とは異なる処理装置によって実行されても良い。この場合、例えばユーザ端末２０は、ユーザの発話を処理装置に送信し、処理装置により解析された結果を処理装置から受信すれば良い。 Further, the user terminal 20 does not have to have all the above-mentioned functions. For example, some functions (for example, a function of analyzing a user's utterance) may be executed by a processing device different from the user terminal 20 configured to be able to communicate with the user terminal 20. In this case, for example, the user terminal 20 may transmit the user's utterance to the processing device and receive the result analyzed by the processing device from the processing device.

また、解析結果に応じて応答文を決定する機能は、必ずしもユーザ端末２０上で実行されなくてもよく、ユーザ端末２０と通信可能に構成されたユーザ端末２０とは異なる処理装置（上記処理装置と異なってもよい）上で実行されても良い。この場合、例えばユーザ端末２０は処理装置からユーザの発話に対応する応答文を取得し、当該応答文をユーザに提示すれば良い。なお、上記機能は、同一の処理装置で行われても良いし、互いに異なる装置であっても良い。また、対話装置１０が、上述した処理装置の一部または全部として機能しても良い。 Further, the function of determining the response statement according to the analysis result does not necessarily have to be executed on the user terminal 20, and is a processing device different from the user terminal 20 configured to be able to communicate with the user terminal 20 (the processing device described above). It may be executed on). In this case, for example, the user terminal 20 may acquire a response sentence corresponding to the user's utterance from the processing device and present the response sentence to the user. The above functions may be performed by the same processing device, or may be different devices from each other. Further, the dialogue device 10 may function as a part or all of the above-mentioned processing device.

本開示における対話システム１は、対話装置１０および一または複数のユーザ端末２０を含む。対話装置１０は、発話受信部１１、発話分解部１２、ユーザ模倣部１３、状態決定部１４、第１メモリ１０ａ、第２メモリ１０ｂ、第３メモリ１０ｃ、および第４メモリ１０ｄから構成される。以下、各構成について説明する。 The dialogue system 1 in the present disclosure includes a dialogue device 10 and one or more user terminals 20. The dialogue device 10 includes an utterance receiving unit 11, an utterance decomposition unit 12, a user imitation unit 13, a state determination unit 14, a first memory 10a, a second memory 10b, a third memory 10c, and a fourth memory 10d. Hereinafter, each configuration will be described.

発話受信部１１は、ユーザ端末２０からテキスト情報からなる発話を受信する部分である。 The utterance receiving unit 11 is a portion that receives an utterance composed of text information from the user terminal 20.

発話分解部１２は、発話受信部１１が受け取った発話を形態素解析する部分である。発話分解部１２は、形態素解析して得られた解析結果（形態素）が第１メモリ１０ａに記憶されているか判定を行う。発話分解部１２は、第１メモリ１０ａに解析結果が記憶されていなければ第１メモリ１０ａに当該解析結果を記憶する。 The utterance decomposition unit 12 is a part that morphologically analyzes the utterance received by the utterance receiving unit 11. The utterance decomposition unit 12 determines whether or not the analysis result (morpheme) obtained by the morphological analysis is stored in the first memory 10a. If the analysis result is not stored in the first memory 10a, the utterance decomposition unit 12 stores the analysis result in the first memory 10a.

また、発話分解部１２は、極性辞書を有しており、この極性辞書を参照して、受け取った発話を極性判定する部分である。発話分解部１２は、ユーザの発話と極性判定結果（当該発話における極性の割合）とを第２メモリ１０ｂに記憶する。発話分解部１２は、極性判定をする際、極性はポジティブ(以下、pos)、ネガティブ（以下、neg）、ニュートラル（以下、neu）の３つに分けて判定し、それぞれの極性として０から１まで間の値を算出する。 Further, the utterance decomposition unit 12 has a polarity dictionary, and is a portion for determining the polarity of the received utterance by referring to the polarity dictionary. The utterance decomposition unit 12 stores the user's utterance and the polarity determination result (ratio of polarity in the utterance) in the second memory 10b. When the utterance decomposition unit 12 determines the polarity, the polarity is divided into three, positive (hereinafter, pos), negative (hereinafter, neg), and neutral (hereinafter, neu), and each polarity is 0 to 1. Calculate the value between.

ここで、図３および図４を用いて発話分解部１２の処理について説明する。図３は、第１メモリ１０ａに記憶されている解析結果のデータベースを示す図である。図に示される通り、表の１レコード（１行）が１つの形態素に対応している。解析結果は、発話履歴として扱われる情報であって、図３の例では、「形態素」、「品詞」、「回数」および「time」を含んでいる。「形態素」欄は発話分解部１２で形態素解析され、抽出された形態素（表現要素）である。なお、抽出する形態素は限定しない。例えば、名詞だけでも良いし、動詞だけでも良いし、形容詞だけでも良いし、複数の品詞でも良い。「品詞」欄は、「語彙」の品詞情報を表す情報である。「回数」欄は、「形態素」が今までのユーザの発話の中で何回発現したかを表す情報である。「time」はその形態素が一番最近発話された時間を表す情報である。回数とTime、成長記録は、ユーザへの情報提示に使用する。たとえば、「あなたの発話で一番多いのは電車です。」「何時頃よく話します。」「成長度は42.5です。」などを表示する。また、成長度は、数値から成長曲線を出しユーザに提示するために記録する。 Here, the processing of the utterance decomposition unit 12 will be described with reference to FIGS. 3 and 4. FIG. 3 is a diagram showing a database of analysis results stored in the first memory 10a. As shown in the figure, one record (one row) in the table corresponds to one morpheme. The analysis result is information treated as an utterance history, and in the example of FIG. 3, includes "morpheme", "part of speech", "number of times", and "time". The "morpheme" column is a morpheme (expression element) extracted by morphological analysis by the utterance decomposition unit 12. The morpheme to be extracted is not limited. For example, it may be only a noun, a verb, an adjective, or a plurality of parts of speech. The "part of speech" column is information representing the part of speech information of the "vocabulary". The "number of times" column is information indicating how many times the "morpheme" has been expressed in the user's utterances so far. "Time" is information that represents the time when the morpheme was most recently spoken. The number of times, Time, and growth record are used to present information to the user. For example, "The most common utterance of you is the train." "When do you talk a lot?" "Growth is 42.5." In addition, the degree of growth is recorded in order to obtain a growth curve from the numerical value and present it to the user.

図４は、発話を形態素解析して、極性反転をする処理を示す模式図である。図に示される通り、ユーザ発話「今日の電車すごい混んでて辛かった」がユーザ端末２０に入力され、対話装置１０に送信される。 FIG. 4 is a schematic diagram showing a process of performing morphological analysis of utterances and performing polarity reversal. As shown in the figure, the user's utterance "Today's train was very crowded and painful" is input to the user terminal 20 and transmitted to the dialogue device 10.

発話分解部１２は、この発話を形態素解析する（Ｓ１１）。本開示においては、形態素解析エンジンとしてMecabを利用している。発話分解部１２は、「今日」「電車」「すごく」「混む」「辛い」を品詞として取得する。 The utterance decomposition unit 12 morphologically analyzes this utterance (S11). In this disclosure, Mecab is used as a morphological analysis engine. The utterance decomposition unit 12 acquires "today," "train," "very," "crowded," and "spicy" as part of speech.

発話分解部１２は、極性辞書を参照して極性判定する（Ｓ１２）。極性辞書は、形態素ごとの極性を示した辞書である。本開示においてはあらかじめ用意された辞書である。発話分解部１２は、形態素解析した品詞ごとに極性辞書を参照して、当該品詞の極性を判定し、一の発話における品詞ごとの極性の割合を求める。本開示においては、ポジティブ、ネガティブ、ニュートラルの３つの極性の割合を求める。 The utterance decomposition unit 12 determines the polarity with reference to the polarity dictionary (S12). The polarity dictionary is a dictionary showing the polarity of each morpheme. In this disclosure, it is a dictionary prepared in advance. The utterance decomposition unit 12 refers to the polarity dictionary for each part of speech analyzed by morphological analysis, determines the polarity of the part of speech, and obtains the ratio of the polarity of each part of speech in one utterance. In this disclosure, the ratios of the three polarities of positive, negative, and neutral are calculated.

ユーザ模倣部１３は、ユーザの発話に対してその感情を模倣した応答を返すための情報を生成する部分である。ユーザ模倣部１３は、第１メモリ１０ａを参照することにより、総形態素数（以下、voc）を算出する。また、ユーザ模倣部１３は、第２メモリ１０ｂから各発話における極性判定の割合を参照することにより、一のユーザの総発話の極性の割合を計算する。そして、ユーザ模倣部１３は、総形態素数、発話時間(time)が一番最近の形態素(以下rword)、および一のユーザの総発話における極性の割合を、状態決定部１４に送信する。 The user imitation unit 13 is a part that generates information for returning a response that imitates the emotion of the user's utterance. The user imitation unit 13 calculates the total morpheme number (hereinafter, voc) by referring to the first memory 10a. Further, the user imitation unit 13 calculates the ratio of the polarity of the total utterance of one user by referring to the ratio of the polarity determination in each utterance from the second memory 10b. Then, the user imitation unit 13 transmits the total morpheme number, the morpheme having the latest utterance time (time) (hereinafter, rword), and the ratio of the polarity in the total utterance of one user to the state determination unit 14.

図５および図６を用いて、ユーザ模倣部１３の動作について説明する。図５は、第２メモリ１０ｂに記憶されているデータベースの具体例を示す。図に示される通り、第２メモリ１０ｂは、ユーザの発話と極性判定結果とを記憶するデータベースを記憶する。図５に示される表の１行が１つの発話履歴に対応している。図５の例では、発話履歴は「発話文」、「pos」、「neg」、「neu」及び「time」を含んでいる。「発話文」はユーザにより入力されたユーザの発話の内容を示す情報である。「pos」は、当該発話を極性判定し、posとして算出された値を示す情報である。「neg」は、当該発話を極性判定し、negとして算出された値を示す情報である。「neu」は、当該発話を極性判定し、neuとして算出された値を示す情報である。「time」は「発話文」が入力された時間を示す情報である。 The operation of the user imitation unit 13 will be described with reference to FIGS. 5 and 6. FIG. 5 shows a specific example of the database stored in the second memory 10b. As shown in the figure, the second memory 10b stores a database that stores the user's utterance and the polarity determination result. One row of the table shown in FIG. 5 corresponds to one utterance history. In the example of FIG. 5, the utterance history includes "utterance sentence", "pos", "neg", "neu" and "time". The "utterance sentence" is information indicating the content of the user's utterance input by the user. “Pos” is information indicating a value calculated as pos by determining the polarity of the utterance. “Neg” is information indicating a value calculated as neg by determining the polarity of the utterance. “Neu” is information indicating a value calculated as neu by determining the polarity of the utterance. "Time" is information indicating the time when the "utterance sentence" is input.

図６は、ユーザ模倣部１３の動作を示す模式図である。ユーザ模倣部１３は、第１メモリ１０ａを参照して、vocを算出する。ここでは、vocは５０としている。また、ユーザ模倣部１３は、第２メモリ１０ｂを参照して、ユーザ発話の極性の割合を算出する。極性の割合は以下の数式により算出する。ここでは、ｘ_ｐｏｌは、極性（ネガティブ・ポジティブ・ニュートラル）の割合を示し、ｎは、第２メモリ１０ｂの発話数を示す。この数式は、ユーザ発話における極性ごとの平均値を示したものとなる。 FIG. 6 is a schematic diagram showing the operation of the user imitation unit 13. The user imitation unit 13 calculates voc with reference to the first memory 10a. Here, voc is set to 50. Further, the user imitation unit 13 calculates the ratio of the polarity of the user's utterance with reference to the second memory 10b. The polarity ratio is calculated by the following formula. Here, x _pol indicates the ratio of polarity (negative / positive / neutral), and n indicates the number of utterances in the second memory 10b. This formula shows the average value for each polarity in the user's utterance.

例えば、図６の例では、ユーザ発話におけるポジティブの平均値は、０．２８、ネガティブの平均値は０．４１、ニュートラルの平均値は０．３１となる。本件明細書において、以降、各極性における平均値であるXpolを、per_pos、per_nega、 per_neuと表記する。それぞれ、Positive、Negative、Neutralの極性の平均値を示す。また、図６における第２メモリ１０ｂの図は発話文欄とtime欄とを省略した図である。

For example, in the example of FIG. 6, the positive average value in the user utterance is 0.28, the negative average value is 0.41, and the neutral average value is 0.31. Hereinafter, in the present specification, Xpol, which is an average value for each polarity, is referred to as per_pos, per_nega, and per_neu. The average values of Positive, Negative, and Neutral polarities are shown, respectively. Further, the figure of the second memory 10b in FIG. 6 is a diagram in which the utterance sentence column and the time column are omitted.

状態決定部１４は、ユーザ模倣部１３から総形態素数、per_pos、per_neg、およびper_neuを受け取り、成長度を計算する部分である。また、状態決定部１４は、極性の割合を用いた重み付きランダムによって、発話に対する応答対象である応答文の極性を決定する部分である。そして、状態決定部１４は、ユーザ模倣部１３から受け取ったrword（応答対象となる発話のうち直近の形態素）を用いて第３メモリ１０ｃに記憶されている応答文郡を抽出する。この時、第３メモリ１０ｃのトリガーワードとrwordとが複数マッチングした時はマッチングしたレコードを全て取り出す。最後に決定した成長度によって応答文群の中から応答文を1つ選択し、対話送信部に送信する。また、総形態素数、per_pos、per_neg、per_neu、成長度を第４メモリ１０ｄに送信する。 The state determination unit 14 receives the total morpheme number, per_pos, per_neg, and per_neu from the user imitation unit 13, and calculates the degree of growth. Further, the state determination unit 14 is a portion that determines the polarity of the response sentence that is the response target to the utterance by weighted random using the ratio of polarity. Then, the state determination unit 14 extracts the response sentence group stored in the third memory 10c using the rword (the most recent morpheme of the utterances to be responded to) received from the user imitation unit 13. At this time, when a plurality of trigger words and rwords of the third memory 10c are matched, all the matched records are taken out. One response sentence is selected from the response sentence group according to the growth degree determined at the end, and is transmitted to the dialogue transmission unit. Further, the total morpheme number, per_pos, per_neg, per_neu, and the degree of growth are transmitted to the fourth memory 10d.

図７を用いて、状態決定部１４の動作について説明する。まず、図７（ａ）に示されるとおり、状態決定部１４は、以下の式に基づいて成長度ｇを計算する。 The operation of the state determination unit 14 will be described with reference to FIG. 7. First, as shown in FIG. 7A, the state determination unit 14 calculates the growth rate g based on the following equation.

ｇ＝（１－｜ａ_１－per_pos｜－｜ａ_２－per_nega｜－｜ａ_３－per_neu|）・ｍ
この式において、ａ_１、ａ_２、ａ_３は、それぞれポジティブ、ネガティブ、ニュートラルにおける理想の極性値を示す。また、ａ_１＋ａ_２＋ａ_３≦１とし、あらかじめ定められた値とする。ｍは、形態素数である。 g = (1- ｜ a ₁ －per_pos ｜－｜ a ₂ －per_nega ｜－｜ a ₃ －per_neu |) ・ m
In this equation, a ₁ , a ₂ , and a ₃ indicate ideal polarity values in positive, negative, and neutral, respectively. Further, a ₁ + a ₂ + a ₃ ≦ 1 is set, and a predetermined value is used. m is a morpheme number.

すなわち、成長度は、（1-（|それぞれの極性の理想の値-それぞれの極性の実際の値|））*形態素数で計算される。この数式は、理想状態に近づき、また対話数（形態素数）が多くなるにつれて、その数値が上昇するように設定された数式である。 That is, the degree of growth is calculated by (1-(| ideal value of each polarity-actual value of each polarity |)) * morpheme. This formula is a formula set so that the numerical value increases as the ideal state is approached and the number of dialogues (morpheme number) increases.

成長度は、ユーザと対話エージェントとの親密度を表す値であることから、上記の通り、話しかければ話しかけるほど成長度が高くなるように設計されている。ただし、同じ言葉ばかり話しかけても成長度は上がらないように形態素数を用いて調整されている。また、ユーザ端末２０からの発話の感情極性のバランスを算出し、極性のバランスが悪いと成長度が低くなるようにする。よって、成長度はユーザの発話から得た形態素の総数と極性とのバランスで決定される。 Since the growth rate is a value indicating the intimacy between the user and the dialogue agent, as described above, the growth rate is designed to increase as the user speaks. However, it is adjusted using morpheme so that the growth rate does not increase even if only the same words are spoken. Further, the balance of the emotional polarities of the utterances from the user terminal 20 is calculated, and if the polar balance is poor, the degree of growth is lowered. Therefore, the degree of growth is determined by the balance between the total number of morphemes obtained from the user's utterance and the polarity.

一般的に、極性が偏っている状態（ネガティブすぎる、ポジティブすぎる）では正常に成長しているとは言えない。そして、理想の状態に近づいているときこそ、成長していると判断することが妥当である。 In general, it cannot be said that the plants are growing normally when the polarity is biased (too negative or too positive). And it is appropriate to judge that it is growing when it is approaching the ideal state.

本開示においては、成長度は、形態素数（語彙数）だけでは決まらず、極性のバランスを用いて決定している。形態素数が増えること自体は、成長しているといえるが、形態素数に対して、理想の極性からどの程度離れているかの値を乗算することにより、どれぐらい成長したかを示すことができる。理想の極性から離れていなければ、形態素数×1の成長度を得ることができる。 In the present disclosure, the degree of growth is not determined only by the morpheme number (the number of vocabularies), but is determined by using the balance of polarity. It can be said that the increase in the morpheme number itself is growing, but it is possible to show how much the morpheme number has grown by multiplying the morpheme number by the value of how far it is from the ideal polarity. If the polarity is not deviated from the ideal polarity, the growth rate of morpheme number × 1 can be obtained.

上記式では、ポジティブに関する数値（理想値との一致程度）から、ネガティブに関する数値およびニュートラルに関する数値を、減算している。ネガティブに関する感情は、ポジティブに関する感情より強力な感情であるため、その成長度に大きく影響するように、ポジティブに関する数値からネガティブに関する数値を減算している。なお、成長度は、この数式に限定するものではなく、上記定義にしたがった数式であればよい。 In the above formula, the numerical value related to negative and the numerical value related to neutral are subtracted from the numerical value related to positive (the degree of agreement with the ideal value). Negative emotions are stronger than positive emotions, so negative emotions are subtracted from positive emotions to significantly affect their growth. The degree of growth is not limited to this formula, and may be any formula according to the above definition.

ユーザ模倣部１３は、形態素数ｍ＝５０、Positive：０．２８、Negative：０．４１、Neutral：０．３１を算出している。これら数値を上記式に入れることで、状態決定部１４は、成長度ｇ＝４２．５を算出する。 The user imitation unit 13 calculates the morpheme number m = 50, Positive: 0.28, Negative: 0.41, and Neutral: 0.31. By putting these numerical values into the above formula, the state determination unit 14 calculates the growth degree g = 42.5.

例えば、理想の値が０．３３だった場合で、per_posの値が０．２８で、per_negの値が０．４１で、per_neuの値が０．３１で、形態素数が５０だったとき、
g=（1-|(0.33-0.28)|-|(0.33-0.41)|-|(0.33-0.31)|)*50=42.5
となり、成長度は４２．５になる。 For example, when the ideal value is 0.33, the per_pos value is 0.28, the per_neg value is 0.41, the per_neu value is 0.31, and the morphological prime number is 50.
g = (1- | (0.33-0.28) |-| (0.33-0.41) |-| (0.33-0.31) |) * 50 = 42.5
And the growth rate is 42.5.

また、状態決定部１４は、極性の割合を用いた重み付きランダム関数によって、どの極性に属する応答文を決定する。例えば、図７（ｂ）に示されるとおり、予め指定された重み付けをすることで、ユーザ模倣部１３が算出したPositive：０．２８、Negative：０．４１、Neutral：０．３１から、Positive：０．３、Negative：０．４１、Neutral：０．２９を算出する。そして、状態決定部１４は、それぞれの確率に基づいて応答文が属する極性を決定する。すなわち、Positiveとなる確率：３０％、Negativeとなる確率：４１％、Neutralとなる確率：２９％であり、状態決定部１４は、これら確率に従って、応答文が属する極性を決定する。図７（ｂ）では、状態決定部１４は、応答文はネガティブに決定している。 Further, the state determination unit 14 determines which polarity the response statement belongs to by a weighted random function using the ratio of polarities. For example, as shown in FIG. 7B, from Positive: 0.28, Negative: 0.41, Neutral: 0.31 calculated by the user imitation unit 13 by weighting specified in advance, Positive: 0.3, Negative: 0.41, Neutral: 0.29 are calculated. Then, the state determination unit 14 determines the polarity to which the response statement belongs based on each probability. That is, the probability of becoming Positive: 30%, the probability of becoming Negative: 41%, the probability of becoming Neutral: 29%, and the state determination unit 14 determines the polarity to which the response sentence belongs according to these probabilities. In FIG. 7B, the state determination unit 14 negatively determines the response sentence.

図７（ｃ）に示される通り、状態決定部１４は、成長度と応答文の極性とrwordとに基づいて、一または複数の応答文を決定する。具体的には、状態決定部１４は、rwordに基づいて、第３メモリ１０ｃから、トリガーワード欄において、rwordに一致する応答文群を抽出し、さらに、そのうちから極性に一致する応答文群を抽出する。 As shown in FIG. 7 (c), the state determination unit 14 determines one or more response statements based on the degree of growth, the polarity of the response statement, and the rword. Specifically, the state determination unit 14 extracts a response sentence group matching the rword from the third memory 10c based on the rword in the trigger word column, and further selects a response sentence group matching the polarity from the response sentence group. Extract.

図８に示される通り、第３メモリ１０ｃは、定型応答文のデータベースを記憶する。表の１レコード（一行）が、1つの応答文に対応するデータベースである。図８の例では、このデータベースは、「トリガーワード」、「応答文」、「成長度閾値」、「ＰＮ（極性）」を対応付けている。「トリガーワード」は、応答文を選択する際の基準となる語である。「トリガーワード」で示されている語が発話の最後に含まれている場合に、その発話に対する応答文が選択される。「成長度閾値」は、応答文を選択するための基準となるパラメータであって、ユーザの成長度と成長度閾値とを比較することにより、応答文を選択する。「ＰＮ」は、応答文の極性を示す。 As shown in FIG. 8, the third memory 10c stores a database of fixed response statements. One record (one row) in the table is the database corresponding to one response statement. In the example of FIG. 8, this database associates a "trigger word", a "response statement", a "growth threshold", and a "PN (polarity)". The "trigger word" is a reference word when selecting a response sentence. If the word indicated by the "trigger word" is included at the end of the utterance, the response to that utterance is selected. The "growth threshold" is a parameter that serves as a reference for selecting a response statement, and the response statement is selected by comparing the growth degree of the user with the growth threshold. "PN" indicates the polarity of the response statement.

図７（ｃ）では、rwordは「辛い」、極性はNegativeに一致する３つの応答文群を抽出している。さらに、状態決定部１４は、成長度（42.5）と、第３メモリ１０ｃに記憶されている成長度閾値とに基づいて、成長度以下に成長度閾値が設定されている一の応答文を決定する。例えば、成長度４２．５を用いて応答文を決める場合、1つ目の文の成長度閾値が８７０で、２つ目の文の成長度閾値が２８０で、３つ目の成長度閾値が０の場合、応答文は３つ目の応答文が選択される。ここでは、「かなしい」が選択される。 In FIG. 7 (c), three response sentence groups whose rword is "spicy" and whose polarity is Negative are extracted. Further, the state determination unit 14 determines one response sentence in which the growth degree threshold is set to be equal to or lower than the growth degree based on the growth degree (42.5) and the growth degree threshold stored in the third memory 10c. do. For example, when the response sentence is determined using the growth degree 42.5, the growth degree threshold value of the first sentence is 870, the growth degree threshold value of the second sentence is 280, and the growth degree threshold value of the third sentence is 280. If 0, the third response statement is selected as the response statement. Here, "Kana" is selected.

応答文送信部１５は、状態決定部１４により決定された応答文をユーザ端末２０に送信する。また、応答文送信部１５は、ユーザ端末２０に、ユーザの成長記録およびユーザの使用した発話（形態素単位）の頻度を送信してもよい。成長記録として、後述する第４メモリ１０ｄに記憶されている情報がある。また発話の頻度として、第１メモリ１０ａに記憶されている情報がある。その送信は、ユーザ端末２０からの要求によってもでもよいし、定期的になされてもよい。 The response sentence transmission unit 15 transmits the response sentence determined by the state determination unit 14 to the user terminal 20. Further, the response sentence transmission unit 15 may transmit the growth record of the user and the frequency of the utterance (morpheme unit) used by the user to the user terminal 20. As the growth record, there is information stored in the fourth memory 10d, which will be described later. Further, as the frequency of utterance, there is information stored in the first memory 10a. The transmission may be made by a request from the user terminal 20 or may be made periodically.

応答文送信部１５は、発話の頻度を送信する場合には、たとえば，「あなたの発話で一番多いのは〇〇です。」というメッセージを生成する。また、発話の履歴を利用して「何時頃よく話します。」というメッセージを生成してもよい。その場合、対話装置１００は、発話ごとの履歴テーブル（図示せず）を備え、応答文送信部１５は、その情報を利用する。また、応答送信部１５は、「成長度は〇〇です。」のメッセージを生成してもよい。成長度は第４メモリ１０ｄに記憶されている情報である。また，成長度は、過去の成長度の数値（成長度履歴、横軸は時間または総形態素数）から成長曲線を算出し、ユーザ端末２０に送信してもよい。また、数値のみを送り、ユーザ端末２０において成長曲線を形成するように処理してもよい。 When transmitting the frequency of utterances, the response sentence transmitting unit 15 generates, for example, the message "The most utterances of yours are XX." You may also use the utterance history to generate the message "When do you often speak?". In that case, the dialogue device 100 includes a history table (not shown) for each utterance, and the response sentence transmission unit 15 uses the information. Further, the response transmission unit 15 may generate a message "Growth degree is XX.". The degree of growth is information stored in the fourth memory 10d. Further, for the growth degree, a growth curve may be calculated from a numerical value of the past growth degree (growth degree history, the horizontal axis is time or total morpheme) and transmitted to the user terminal 20. Further, only numerical values may be sent and processed so as to form a growth curve in the user terminal 20.

第１メモリ１０ａは、対話分解部から送信された形態素を記憶するデータベースを記憶する部分である。その具体例は上述した通り、図３に示される通りであり、形態素ごとに、品詞、抽出回数、直近に分析した時刻を記憶している。 The first memory 10a is a portion that stores a database that stores morphemes transmitted from the dialogue decomposition unit. As described above, the specific example is as shown in FIG. 3, and the part of speech, the number of extractions, and the most recently analyzed time are stored for each morpheme.

第２メモリ１０ｂは、ユーザの発話と極性判定結果とを記憶するデータベースを記憶する部分である。 The second memory 10b is a portion that stores a database that stores the user's utterance and the polarity determination result.

第３メモリ１０ｃは、定型応答文のデータベースを記憶する部分である。 The third memory 10c is a portion for storing a database of standard response statements.

第４メモリ１０ｄは、ユーザの成長記録を記憶するデータベースである。図９に示される表の１レコードがユーザの１発話の成長記録に対応する情報である。図９の例では、「総形態素数」、「pos」、「neg」、「neu」、「成長度」、「time」を持つ。「time」は、発話をした日時を示す。この第４メモリ１０ｄには、状態決定部１４が応答文を取得するごとに、その成長記録を記憶させる。総形態素数は、応答文作成ごとに抽出された形態素数の累積である。一つの発話には、５～１０個の形態素が含まれる場合が多いが、それ以上の形態素数が含まれる場合もある。 The fourth memory 10d is a database that stores a user's growth record. One record in the table shown in FIG. 9 is information corresponding to the growth record of one utterance of the user. In the example of FIG. 9, it has "total morpheme number", "pos", "neg", "neu", "growth degree", and "time". “Time” indicates the date and time when the utterance was made. In the fourth memory 10d, each time the state determination unit 14 acquires a response sentence, the growth record is stored. The total morpheme is the accumulation of the morphemes extracted for each response sentence creation. One utterance often contains 5 to 10 morphemes, but may include more morphemes.

このように構成された対話装置１０の動作について説明する。図１０および図１１は、対話装置１０の動作を示すフローチャートである。以下、フローチャートの各ステップについて説明する。このフローチャートは１ユーザに対しての処理を記載したものである。 The operation of the dialogue device 10 configured in this way will be described. 10 and 11 are flowcharts showing the operation of the dialogue device 10. Hereinafter, each step of the flowchart will be described. This flowchart describes the process for one user.

発話受信部１１は、ユーザが音声もしくはテキストで入力した発話文を入力１として受信する（Ｓ１０１）。発話分解部１２は、発話受信部１１が受け取った入力１に対しMecab等の形態素解析器を用いて形態素解析を行い、発話文を形態素（言語で意味を持つ最小単位）に分解して品詞を判別する（Ｓ１０２）。発話分解部１２は、判別された形態素が第１メモリ１０ａに記憶されているか判定を行う（Ｓ１０３）。発話分解部１２は、第１メモリ１０ａに記憶されていなければ（Ｓ１０３：ＮＯ）、第１メモリ１０ａに形態素、品詞を記憶する（Ｓ１０４）。発話分解部１２は、第１メモリ１０ａに記憶されていた場合（Ｓ１０３：ＹＥＳ）、出現した回数を１増やすよう第１メモリ１０ａの「回数」の値を更新する。また、発話分解部１２は、入力１を受け取った時間(以下time)を第１メモリ１０ａに記録する。このとき、すでに形態素が登録されていた場合は、その形態素が出現した最新の時間に値を更新する。 The utterance receiving unit 11 receives the utterance sentence input by the user by voice or text as input 1 (S101). The utterance decomposition unit 12 performs morphological analysis on the input 1 received by the utterance receiving unit 11 using a morphological analyzer such as Mecab, decomposes the utterance sentence into morphological elements (the smallest unit having meaning in the language), and decomposes the part of speech. Determine (S102). The utterance decomposition unit 12 determines whether or not the determined morpheme is stored in the first memory 10a (S103). If the utterance decomposition unit 12 is not stored in the first memory 10a (S103: NO), the utterance decomposition unit 12 stores the morpheme and the part of speech in the first memory 10a (S104). When the utterance decomposition unit 12 is stored in the first memory 10a (S103: YES), the utterance decomposition unit 12 updates the value of the “number of times” of the first memory 10a so as to increase the number of appearances by one. Further, the utterance decomposition unit 12 records the time (hereinafter, time) when the input 1 is received in the first memory 10a. At this time, if the morpheme has already been registered, the value is updated at the latest time when the morpheme appears.

発話分解部１２は、日本語評価極性辞書などの極性辞書を用いて入力１の極性判定を行う（Ｓ１０５）。ユーザ模倣部１３は、極性判定を行う時、それぞれの極性のスコアを算出する。例えば、発話分解部１２は、「今日の電車すごい混んでて辛かった…」を極性判定して、pos、neg、neuのスコアをそれぞれ算出する。この時使用する極性辞書はすでに持っているものとする。発話分解部１２は、入力１と、算出したpos、neg、neuのスコアと、timeとを第２メモリ１０ｂに記録する。 The utterance decomposition unit 12 determines the polarity of the input 1 using a polarity dictionary such as a Japanese evaluation polarity dictionary (S105). When the polarity determination is performed, the user imitation unit 13 calculates the score of each polarity. For example, the utterance decomposition unit 12 determines the polarity of "Today's train was so crowded and painful ..." and calculates the scores of pos, neg, and neu, respectively. It is assumed that the polarity dictionary used at this time already exists. The utterance decomposition unit 12 records the input 1, the calculated scores of pos, neg, and neu, and the time in the second memory 10b.

ユーザ模倣部１３は、第１メモリ１０ａに記憶されている形態素の総数（以下、総形態素数）をカウントする（Ｓ１０７）。ユーザ模倣部１３は、第２メモリ１０ｂから全発話の極性判定の結果を受け取る。受け取った極性判定の結果からユーザ発話全体の極性の割合（per_pos、per_neg、per_neu）を計算する（Ｓ１０８）。 The user imitation unit 13 counts the total number of morphemes stored in the first memory 10a (hereinafter referred to as the total number of morphemes) (S107). The user imitation unit 13 receives the result of the polarity determination of all utterances from the second memory 10b. From the result of the received polarity determination, the ratio of the polarity (per_pos, per_neg, per_neu) of the entire user utterance is calculated (S108).

状態決定部１４は、ユーザ模倣部１３から極性の割合と語彙数を受け取り、成長度を計算する（Ｓ１０９）。 The state determination unit 14 receives the polarity ratio and the number of vocabularies from the user imitation unit 13, and calculates the degree of growth (S109).

状態決定部１４は、第２メモリ１０ｂから得た極性の割合によってあらかじめ指定された重みに従って確率的に応答文の極性を決定する（Ｓ１１０）。 The state determination unit 14 probabilistically determines the polarity of the response sentence according to a weight designated in advance by the ratio of the polarity obtained from the second memory 10b (S110).

状態決定部１４は、決定した応答文の極性に対応する応答文群を第３メモリ１０ｃから抽出する。この時、応答文群は事前に第３メモリ１０ｃに記憶されているものとする。抽出した応答文群には応答文と成長度閾値と極性とが記録されている。状態決定部１４は、計算された成長度を成長度閾値と比較し、成長度閾値を超えた場合、対応する応答文を応答文１に設定する（Ｓ１１１）。 The state determination unit 14 extracts the response sentence group corresponding to the determined polarity of the response sentence from the third memory 10c. At this time, it is assumed that the response sentence group is stored in the third memory 10c in advance. In the extracted response sentence group, the response sentence, the growth threshold and the polarity are recorded. The state determination unit 14 compares the calculated growth degree with the growth degree threshold value, and when the growth degree threshold value is exceeded, sets the corresponding response sentence in the response sentence 1 (S111).

状態決定部１４は、第４メモリ１０ｄに総形態素数、per_pos、per_neg、per_neu、成長度、timeを記録する（Ｓ１１２）。 The state determination unit 14 records the total morpheme number, per_pos, per_neg, per_neu, growth rate, and time in the fourth memory 10d (S112).

応答文送信部１５は、選択した応答文をユーザ端末２０に送信する（Ｓ１１３）。 The response text transmission unit 15 transmits the selected response text to the user terminal 20 (S113).

つぎに、本開示における対話装置１００の作用効果について説明する。本開示における対話装置１００は、ユーザからの発話を受け付ける発話受信部１１と、ユーザの発話履歴に基づいてユーザとの対話に対する成長度を算出し、ユーザからの発話に対して、成長度に応じた応答文を取得する状態決定部１４と、を備える。この状態決定部１４は、成長度算出部および応答文取得部として機能する。 Next, the operation and effect of the dialogue device 100 in the present disclosure will be described. The dialogue device 100 in the present disclosure calculates the growth degree of the dialogue with the user based on the utterance receiving unit 11 that receives the utterance from the user and the utterance history of the user, and responds to the utterance from the user according to the growth degree. It is provided with a state determination unit 14 for acquiring a response statement. The state determination unit 14 functions as a growth degree calculation unit and a response sentence acquisition unit.

この構成により、ユーザとの対話の成長度に応じて応答文をかえることができる。従って、ユーザは、対話装置１００が成長すればするほど、それに応じた応答文を受けることができ、その対話が機械的であると感じることが低減する。 With this configuration, the response statement can be changed according to the degree of growth of the dialogue with the user. Therefore, as the dialogue device 100 grows, the user can receive the corresponding response sentence, and the user does not feel that the dialogue is mechanical.

さらに、成長度算出部として機能する状態決定部１４は、発話履歴における、記ユーザの発話に用いられた形態素数（表現要素数）およびユーザの発話における極性種別（ポジティブ、ネガティブ、ニュートラル）の割合に基づいて成長度を算出する。ここでの形態素数は、第１メモリ１０ａにおいて記憶されている形態素の個数であり、語彙数を示す。また、極性種別の割合は、第２メモリ１０ｂにおいて記憶されている全極性種別それぞれのポジティブ、ネガティブ、ニュートラルの割合の平均値または、その平均値に所定の重み付け処理をした値である。 Further, the state determination unit 14 functioning as a growth degree calculation unit is a ratio of the morpheme number (number of expression elements) used for the utterance of the user and the polarity type (positive, negative, neutral) in the utterance of the user in the utterance history. The degree of growth is calculated based on. The morpheme number here is the number of morphemes stored in the first memory 10a, and indicates the number of vocabularies. The polarity type ratio is an average value of the positive, negative, and neutral ratios of all the polarity types stored in the second memory 10b, or a value obtained by subjecting the average value to a predetermined weighting process.

すなわち、成長度は、今までユーザが対話に用いた形態素数と、その発話におけるユーザの極性種別の割合に基づく。極性種別の割合は、ユーザの発話の傾向とみることができ、ユーザの発話は、ポジティブ傾向であるか、ネガティブ傾向であるか、そのどちらでもないニュートラル傾向であるか、を示す。 That is, the degree of growth is based on the morpheme number that the user has used for dialogue and the ratio of the user's polarity type in the utterance. The proportion of the polarity type can be seen as the tendency of the user's utterance, and indicates whether the user's utterance has a positive tendency, a negative tendency, or a neutral tendency that is neither of them.

そして、応答文取得部として機能する状態決定部１４は、成長の度合いと、ユーザの発話の傾向に応じた応答文を選択する。 Then, the state determination unit 14 that functions as a response sentence acquisition unit selects a response sentence according to the degree of growth and the tendency of the user's utterance.

本開示の対話装置１００の成長度は、親密度を表す値であり、ユーザが対話装置１００に話しかければ話しかけるほど成長度が高くなるように設計されている。ただし、同じ言葉ばかり話しかけても成長度は上がらないように設計されている。また、ユーザ端末２０からの発話の感情を示した極性種別のバランスを算出し、極性のバランスが悪いと成長度が低くなるようにする。よって、成長度はユーザの発話から得た語彙の総数と極性とのバランスで決定される。一般的なユーザ（人間）にとって、ネガティブな感情とポジティブな感情との両方を受けることができることが理想的と言われている。 The degree of growth of the dialogue device 100 of the present disclosure is a value representing intimacy, and is designed so that the more the user speaks to the dialogue device 100, the higher the degree of growth. However, it is designed so that the growth rate does not increase even if you speak only the same words. Further, the balance of the polarity types indicating the emotions of the utterance from the user terminal 20 is calculated, and if the polarity balance is poor, the degree of growth is lowered. Therefore, the degree of growth is determined by the balance between the total number of vocabularies obtained from the user's utterances and the polarity. It is said that it is ideal for a general user (human being) to be able to receive both negative emotions and positive emotions.

これによって、極性種別のバランスを考慮した成長度を算出することで、理想的な応答を可能にする。これは、結果的にユーザは対話装置１００に対して親密度を持つことができ、その対話を継続することに貢献することになる。 This enables an ideal response by calculating the degree of growth considering the balance of the polarity types. As a result, the user can have an intimacy with the dialogue device 100 and contributes to the continuation of the dialogue.

より具体的には、対話装置１００は、ユーザからの発話を品詞ごとに分解して形態素（表現要素）を取得する発話分解部１２と、形態素を発話履歴として記憶する品詞履歴記憶部として機能する第１メモリ１０ａとを備える。そして、状態決定部１４は、第１メモリ１０ａに記憶されている形態素数に基づいて成長度を算出する。本開示においては、形態素解析を行い、これを発話履歴として記憶することで、語彙数の記憶を行う。 More specifically, the dialogue device 100 functions as an utterance decomposition unit 12 that decomposes utterances from the user for each part of speech and acquires morphemes (expression elements), and a part of speech history storage unit that stores the morphemes as utterance history. A first memory 10a is provided. Then, the state determination unit 14 calculates the degree of growth based on the morpheme number stored in the first memory 10a. In the present disclosure, the number of vocabularies is stored by performing morphological analysis and storing this as an utterance history.

また、対話装置１００は、発話ごとに、極性種別の割合を記憶する極性種別記憶部として機能する第２メモリ１０ｂをさらに備える。発話分解部１２は、発話における形態素ごとの極性種別の割合を取得して第２メモリ１０ｂに記憶する。そして、状態決定部１４は、形態素数に加えて、第２メモリ１０ｂにおける極性種別の割合に基づいて成長度を算出する。 Further, the dialogue device 100 further includes a second memory 10b that functions as a polarity type storage unit that stores the ratio of the polarity types for each utterance. The utterance decomposition unit 12 acquires the ratio of the polarity type for each morpheme in the utterance and stores it in the second memory 10b. Then, the state determination unit 14 calculates the degree of growth based on the ratio of the polarity type in the second memory 10b in addition to the morpheme number.

本開示の対話装置１００において、応答文取得部として機能する状態決定部１４は、成長度に加えて、ユーザの発話における極性種別に応じた応答文を取得する。その際、成長度算出部として機能する状態決定部１４は、ユーザの発話における極性種別の割合に基づいて、応答文の種別を決定する。ユーザの発話における極性種別に応じた応答文は、ユーザの発話の感情傾向を示し、その傾向に応じた応答文を取得することができる。 In the dialogue device 100 of the present disclosure, the state determination unit 14 that functions as the response sentence acquisition unit acquires the response sentence according to the polarity type in the user's utterance in addition to the growth degree. At that time, the state determination unit 14 that functions as the growth degree calculation unit determines the type of the response sentence based on the ratio of the polarity type in the user's utterance. The response sentence according to the polarity type in the user's utterance indicates the emotional tendency of the user's utterance, and the response sentence according to the tendency can be acquired.

本開示の対話装置１００は、応答文および成長閾値を対応付けた応答文記憶部としての第３メモリ１０ｃをさらに備える。そして、応答文取得部として機能する状態決定部１４は、成長度に応じた応答文を第３メモリ１０ｃから取得する。その際、成長度以下となる成長閾値が設定されている応答文を取得する。この処理によって、成長度に応じた応答文を選択することができる。 The dialogue device 100 of the present disclosure further includes a third memory 10c as a response sentence storage unit associated with a response sentence and a growth threshold value. Then, the state determination unit 14 that functions as the response sentence acquisition unit acquires the response sentence according to the degree of growth from the third memory 10c. At that time, a response statement in which a growth threshold value equal to or lower than the growth degree is set is acquired. By this process, the response sentence according to the degree of growth can be selected.

ここで、第３メモリ１０ｃは、極性種別をさらに対応付けて記憶しておく。そして、状態決定部１４は、ユーザの発話の極性種別の傾向に応じた一または複数の応答文を選択することで、応答文の絞り込みを行い、さらに、成長度に応じた応答文を選択する。 Here, the third memory 10c stores the polarity types in association with each other. Then, the state determination unit 14 narrows down the response sentences by selecting one or a plurality of response sentences according to the tendency of the polarity type of the user's utterance, and further selects the response sentences according to the degree of growth. ..

また、本開示の対話装置１００において、第３メモリ１０ｃは、さらに、トリガーワードを対応付けて記憶する。状態決定部１４は、応答対象であるユーザの発話に含まれている形態素が、第３メモリ１０ｃのトリガーワードに一致している一または複数の応答文を取得する。これにより、ユーザの発話にたいした応答文を適切に選択することができる。 Further, in the dialogue device 100 of the present disclosure, the third memory 10c further stores the trigger word in association with it. The state determination unit 14 acquires one or a plurality of response statements in which the morpheme included in the utterance of the user to be responded matches the trigger word of the third memory 10c. This makes it possible to appropriately select a response sentence for the user's utterance.

また、本開示の対話装置１００は、ユーザの発話に含まれた形態素、その極性種別、および成長度を記憶する。 Further, the dialogue device 100 of the present disclosure stores the morpheme included in the user's utterance, its polarity type, and the degree of growth.

上記実施形態の説明に用いたブロック図は、機能単位のブロックを示している。これらの機能ブロック（構成部）は、ハードウェア及びソフトウェアの少なくとも一方の任意の組み合わせによって実現される。また、各機能ブロックの実現方法は特に限定されない。すなわち、各機能ブロックは、物理的又は論理的に結合した１つの装置を用いて実現されてもよいし、物理的又は論理的に分離した２つ以上の装置を直接的又は間接的に（例えば、有線、無線などを用いて）接続し、これら複数の装置を用いて実現されてもよい。機能ブロックは、上記１つの装置又は上記複数の装置にソフトウェアを組み合わせて実現されてもよい。 The block diagram used in the description of the above embodiment shows a block of functional units. These functional blocks (components) are realized by any combination of at least one of hardware and software. Further, the method of realizing each functional block is not particularly limited. That is, each functional block may be realized using one physically or logically coupled device, or two or more physically or logically separated devices can be directly or indirectly (eg, for example). , Wired, wireless, etc.) and may be realized using these plurality of devices. The functional block may be realized by combining the software with the one device or the plurality of devices.

機能には、判断、決定、判定、計算、算出、処理、導出、調査、探索、確認、受信、送信、出力、アクセス、解決、選択、選定、確立、比較、想定、期待、見做し、報知（broadcasting）、通知（notifying）、通信（communicating）、転送（forwarding）、構成（configuring）、再構成（reconfiguring）、割り当て（allocating、mapping）、割り振り（assigning）などがあるが、これらに限られない。たとえば、送信を機能させる機能ブロック（構成部）は、送信部（transmitting unit）や送信機（transmitter）と呼称される。いずれも、上述したとおり、実現方法は特に限定されない。 Functions include judgment, decision, judgment, calculation, calculation, processing, derivation, investigation, search, confirmation, reception, transmission, output, access, solution, selection, selection, establishment, comparison, assumption, expectation, and assumption. There are broadcasting, notifying, communicating, forwarding, configuring, reconfiguring, allocating, mapping, assigning, etc., but only these. I can't. For example, a functional block (component) that makes transmission function is called a transmitting unit or a transmitter. In each case, as described above, the realization method is not particularly limited.

例えば、本開示の一実施の形態における対話装置１００は、本開示の対話方法の処理を行うコンピュータとして機能してもよい。図１２は、本開示の一実施の形態に係る対話装置１００のハードウェア構成の一例を示す図である。上述の対話装置１００は、物理的には、プロセッサ１００１、メモリ１００２、ストレージ１００３、通信装置１００４、入力装置１００５、出力装置１００６、バス１００７などを含むコンピュータ装置として構成されてもよい。 For example, the dialogue device 100 in one embodiment of the present disclosure may function as a computer that processes the dialogue method of the present disclosure. FIG. 12 is a diagram showing an example of the hardware configuration of the dialogue device 100 according to the embodiment of the present disclosure. The above-mentioned dialogue device 100 may be physically configured as a computer device including a processor 1001, a memory 1002, a storage 1003, a communication device 1004, an input device 1005, an output device 1006, a bus 1007, and the like.

なお、以下の説明では、「装置」という文言は、回路、デバイス、ユニットなどに読み替えることができる。対話装置１００のハードウェア構成は、図に示した各装置を１つ又は複数含むように構成されてもよいし、一部の装置を含まずに構成されてもよい。 In the following description, the word "device" can be read as a circuit, a device, a unit, or the like. The hardware configuration of the dialogue device 100 may be configured to include one or more of the devices shown in the figure, or may be configured not to include some of the devices.

対話装置１００における各機能は、プロセッサ１００１、メモリ１００２などのハードウェア上に所定のソフトウェア（プログラム）を読み込ませることによって、プロセッサ１００１が演算を行い、通信装置１００４による通信を制御したり、メモリ１００２及びストレージ１００３におけるデータの読み出し及び書き込みの少なくとも一方を制御したりすることによって実現される。 For each function in the dialogue device 100, by loading predetermined software (program) on hardware such as the processor 1001 and the memory 1002, the processor 1001 performs an calculation, controls communication by the communication device 1004, and the memory 1002. And by controlling at least one of reading and writing of data in the storage 1003.

プロセッサ１００１は、例えば、オペレーティングシステムを動作させてコンピュータ全体を制御する。プロセッサ１００１は、周辺装置とのインターフェース、制御装置、演算装置、レジスタなどを含む中央処理装置（ＣＰＵ：Central Processing Unit）によって構成されてもよい。例えば、上述の発話分解部１２等は、プロセッサ１００１によって実現されてもよい。 Processor 1001 operates, for example, an operating system to control the entire computer. The processor 1001 may be configured by a central processing unit (CPU) including an interface with a peripheral device, a control device, an arithmetic unit, a register, and the like. For example, the above-mentioned utterance decomposition unit 12 and the like may be realized by the processor 1001.

また、プロセッサ１００１は、プログラム（プログラムコード）、ソフトウェアモジュール、データなどを、ストレージ１００３及び通信装置１００４の少なくとも一方からメモリ１００２に読み出し、これらに従って各種の処理を実行する。プログラムとしては、上述の実施の形態において説明した動作の少なくとも一部をコンピュータに実行させるプログラムが用いられる。例えば、発話分解部１２、ユーザ模倣部１３、状態決定部１４は、メモリ１００２に格納され、プロセッサ１００１において動作する制御プログラムによって実現されてもよく、他の機能ブロックについても同様に実現されてもよい。上述の各種処理は、１つのプロセッサ１００１によって実行される旨を説明してきたが、２以上のプロセッサ１００１により同時又は逐次に実行されてもよい。プロセッサ１００１は、１以上のチップによって実装されてもよい。なお、プログラムは、電気通信回線を介してネットワークから送信されても良い。 Further, the processor 1001 reads a program (program code), a software module, data, and the like from at least one of the storage 1003 and the communication device 1004 into the memory 1002, and executes various processes according to these. As the program, a program that causes a computer to execute at least a part of the operations described in the above-described embodiment is used. For example, the utterance decomposition unit 12, the user imitation unit 13, and the state determination unit 14 may be realized by a control program stored in the memory 1002 and operating in the processor 1001, and may be realized in the same manner for other functional blocks. good. Although it has been described that the various processes described above are executed by one processor 1001, they may be executed simultaneously or sequentially by two or more processors 1001. Processor 1001 may be mounted by one or more chips. The program may be transmitted from the network via a telecommunication line.

メモリ１００２は、コンピュータ読み取り可能な記録媒体であり、例えば、ＲＯＭ（Read Only Memory）、ＥＰＲＯＭ（Erasable Programmable ＲＯＭ）、ＥＥＰＲＯＭ（Electrically Erasable Programmable ＲＯＭ）、ＲＡＭ（Random Access Memory）などの少なくとも１つによって構成されてもよい。メモリ１００２は、レジスタ、キャッシュ、メインメモリ（主記憶装置）などと呼ばれてもよい。メモリ１００２は、本開示の一実施の形態に係る対話方法を実施するために実行可能なプログラム（プログラムコード）、ソフトウェアモジュールなどを保存することができる。 The memory 1002 is a computer-readable recording medium, and is composed of at least one such as a ROM (Read Only Memory), an EPROM (Erasable Programmable ROM), an EEPROM (Electrically Erasable Programmable ROM), and a RAM (Random Access Memory). May be done. The memory 1002 may be referred to as a register, a cache, a main memory (main storage device), or the like. The memory 1002 can store a program (program code), a software module, or the like that can be executed to implement the dialogue method according to the embodiment of the present disclosure.

ストレージ１００３は、コンピュータ読み取り可能な記録媒体であり、例えば、ＣＤ－ＲＯＭ（Compact Disc ＲＯＭ）などの光ディスク、ハードディスクドライブ、フレキシブルディスク、光磁気ディスク(例えば、コンパクトディスク、デジタル多用途ディスク、Ｂｌｕ－ｒａｙ（登録商標）ディスク)、スマートカード、フラッシュメモリ(例えば、カード、スティック、キードライブ)、フロッピー（登録商標）ディスク、磁気ストリップなどの少なくとも１つによって構成されてもよい。ストレージ１００３は、補助記憶装置と呼ばれてもよい。上述の記憶媒体は、例えば、メモリ１００２及びストレージ１００３の少なくとも一方を含むデータベース、サーバその他の適切な媒体であってもよい。 The storage 1003 is a computer-readable recording medium, for example, an optical disk such as a CD-ROM (Compact Disc ROM), a hard disk drive, a flexible disk, an optical magnetic disk (for example, a compact disk, a digital versatile disk, a Blu-ray). It may consist of at least one (registered trademark) disk), smart card, flash memory (eg, card, stick, key drive), floppy (registered trademark) disk, magnetic strip, and the like. The storage 1003 may be referred to as an auxiliary storage device. The storage medium described above may be, for example, a database, server or other suitable medium containing at least one of the memory 1002 and the storage 1003.

通信装置１００４は、有線ネットワーク及び無線ネットワークの少なくとも一方を介してコンピュータ間の通信を行うためのハードウェア（送受信デバイス）であり、例えばネットワークデバイス、ネットワークコントローラ、ネットワークカード、通信モジュールなどともいう。通信装置１００４は、例えば周波数分割複信（ＦＤＤ：Frequency Division Duplex）及び時分割複信（ＴＤＤ：Time Division Duplex）の少なくとも一方を実現するために、高周波スイッチ、デュプレクサ、フィルタ、周波数シンセサイザなどを含んで構成されてもよい。例えば、上述の発話受信部１１および応答文送信部１５などは、通信装置１００４によって実現されてもよい。発話受信部１１および応答文送信部１５は、物理的に、または論理的に分離された実装がなされてもよいし、同一の実装がなされてもよい。 The communication device 1004 is hardware (transmission / reception device) for communicating between computers via at least one of a wired network and a wireless network, and is also referred to as, for example, a network device, a network controller, a network card, a communication module, or the like. The communication device 1004 includes, for example, a high frequency switch, a duplexer, a filter, a frequency synthesizer, and the like in order to realize at least one of FDD (Frequency Division Duplex) and Time Division Duplex (TDD). It may be composed of. For example, the above-mentioned utterance receiving unit 11 and response text transmitting unit 15 may be realized by the communication device 1004. The utterance receiving unit 11 and the response sentence transmitting unit 15 may be physically or logically separated from each other, or may have the same implementation.

入力装置１００５は、外部からの入力を受け付ける入力デバイス（例えば、キーボード、マウス、マイクロフォン、スイッチ、ボタン、センサなど）である。出力装置１００６は、外部への出力を実施する出力デバイス（例えば、ディスプレイ、スピーカー、LEDランプなど）である。なお、入力装置１００５及び出力装置１００６は、一体となった構成（例えば、タッチパネル）であってもよい。 The input device 1005 is an input device (for example, a keyboard, a mouse, a microphone, a switch, a button, a sensor, etc.) that receives an input from the outside. The output device 1006 is an output device (for example, a display, a speaker, an LED lamp, etc.) that outputs to the outside. The input device 1005 and the output device 1006 may have an integrated configuration (for example, a touch panel).

また、プロセッサ１００１、メモリ１００２などの各装置は、情報を通信するためのバス１００７によって接続される。バス１００７は、単一のバスを用いて構成されてもよいし、装置間ごとに異なるバスを用いて構成されてもよい。 Further, each device such as the processor 1001 and the memory 1002 is connected by a bus 1007 for communicating information. The bus 1007 may be configured by using a single bus, or may be configured by using a different bus for each device.

また、対話装置１００は、マイクロプロセッサ、デジタル信号プロセッサ（ＤＳＰ：Digital Signal Processor）、ＡＳＩＣ（Application Specific Integrated Circuit）、ＰＬＤ（Programmable Logic Device）、ＦＰＧＡ（Field Programmable Gate Array）などのハードウェアを含んで構成されてもよく、当該ハードウェアにより、各機能ブロックの一部又は全てが実現されてもよい。例えば、プロセッサ１００１は、これらのハードウェアの少なくとも１つを用いて実装されてもよい。 Further, the dialogue device 100 includes hardware such as a microprocessor, a digital signal processor (DSP: Digital Signal Processor), an ASIC (Application Specific Integrated Circuit), a PLD (Programmable Logic Device), and an FPGA (Field Programmable Gate Array). It may be configured, and some or all of each functional block may be realized by the hardware. For example, processor 1001 may be implemented using at least one of these hardware.

本開示において説明した各態様／実施形態は、ＬＴＥ（Long Term Evolution）、ＬＴＥ－Ａ（LTE-Advanced）、ＳＵＰＥＲ３Ｇ、ＩＭＴ－Ａｄｖａｎｃｅｄ、４Ｇ（4th generation mobile communication system）、５Ｇ（5th generation mobile communication system）、ＦＲＡ（Future Radio Access）、ＮＲ（new Radio）、Ｗ－ＣＤＭＡ（登録商標）、ＧＳＭ（登録商標）、ＣＤＭＡ２０００、ＵＭＢ（Ultra Mobile Broadband）、ＩＥＥＥ８０２．１１（Ｗｉ－Ｆｉ（登録商標））、ＩＥＥＥ８０２．１６（ＷｉＭＡＸ（登録商標））、ＩＥＥＥ８０２．２０、ＵＷＢ（Ultra-WideBand）、Ｂｌｕｅｔｏｏｔｈ（登録商標）、その他の適切なシステムを利用するシステム及びこれらに基づいて拡張された次世代システムの少なくとも一つに適用されてもよい。また、複数のシステムが組み合わされて（例えば、ＬＴＥ及びＬＴＥ－Ａの少なくとも一方と５Ｇとの組み合わせ等）適用されてもよい。 Each aspect / embodiment described in the present disclosure includes LTE (Long Term Evolution), LTE-A (LTE-Advanced), SUPER 3G, IMT-Advanced, 4G (4th generation mobile communication system), and 5G (5th generation mobile communication). system), FRA (Future Radio Access), NR (new Radio), W-CDMA (registered trademark), GSM (registered trademark), CDMA2000, UMB (Ultra Mobile Broadband), IEEE 802.11 (Wi-Fi (registered trademark)) )), LTE 802.16 (WiMAX®), IEEE 802.20, UWB (Ultra-WideBand), Bluetooth®, and other systems that utilize and extend based on these. It may be applied to at least one of the next generation systems. Further, a plurality of systems may be applied in combination (for example, a combination of at least one of LTE and LTE-A and 5G).

本開示において説明した各態様／実施形態の処理手順、シーケンス、フローチャートなどは、矛盾の無い限り、順序を入れ替えてもよい。例えば、本開示において説明した方法については、例示的な順序を用いて様々なステップの要素を提示しており、提示した特定の順序に限定されない。 The order of the processing procedures, sequences, flowcharts, etc. of each aspect / embodiment described in the present disclosure may be changed as long as there is no contradiction. For example, the methods described in the present disclosure present elements of various steps using exemplary order, and are not limited to the particular order presented.

情報等は、上位レイヤ（又は下位レイヤ）から下位レイヤ（又は上位レイヤ）へ出力され得る。複数のネットワークノードを介して入出力されてもよい。 Information and the like can be output from the upper layer (or lower layer) to the lower layer (or upper layer). Input / output may be performed via a plurality of network nodes.

入出力された情報等は特定の場所（例えば、メモリ）に保存されてもよいし、管理テーブルを用いて管理してもよい。入出力される情報等は、上書き、更新、又は追記され得る。出力された情報等は削除されてもよい。入力された情報等は他の装置へ送信されてもよい。 The input / output information and the like may be stored in a specific place (for example, a memory), or may be managed using a management table. Information to be input / output may be overwritten, updated, or added. The output information and the like may be deleted. The input information or the like may be transmitted to another device.

判定は、１ビットで表される値（０か１か）によって行われてもよいし、真偽値（Boolean：true又はfalse）によって行われてもよいし、数値の比較（例えば、所定の値との比較）によって行われてもよい。 The determination may be made by a value represented by 1 bit (0 or 1), by a boolean value (Boolean: true or false), or by comparing numerical values (for example, a predetermined value). It may be done by comparison with the value).

本開示において説明した各態様／実施形態は単独で用いてもよいし、組み合わせて用いてもよいし、実行に伴って切り替えて用いてもよい。また、所定の情報の通知（例えば、「Ｘであること」の通知）は、明示的に行うものに限られず、暗黙的（例えば、当該所定の情報の通知を行わない）ことによって行われてもよい。 Each aspect / embodiment described in the present disclosure may be used alone, in combination, or may be switched and used according to the execution. Further, the notification of predetermined information (for example, the notification of "being X") is not limited to the explicit one, but is performed implicitly (for example, the notification of the predetermined information is not performed). May be good.

以上、本開示について詳細に説明したが、当業者にとっては、本開示が本開示中に説明した実施形態に限定されるものではないということは明らかである。本開示は、請求の範囲の記載により定まる本開示の趣旨及び範囲を逸脱することなく修正及び変更態様として実施することができる。したがって、本開示の記載は、例示説明を目的とするものであり、本開示に対して何ら制限的な意味を有するものではない。 Although the present disclosure has been described in detail above, it is clear to those skilled in the art that the present disclosure is not limited to the embodiments described in the present disclosure. The present disclosure may be implemented as amendments and modifications without departing from the spirit and scope of the present disclosure as determined by the description of the scope of claims. Therefore, the description of this disclosure is for purposes of illustration and does not have any limiting meaning to this disclosure.

ソフトウェアは、ソフトウェア、ファームウェア、ミドルウェア、マイクロコード、ハードウェア記述言語と呼ばれるか、他の名称で呼ばれるかを問わず、命令、命令セット、コード、コードセグメント、プログラムコード、プログラム、サブプログラム、ソフトウェアモジュール、アプリケーション、ソフトウェアアプリケーション、ソフトウェアパッケージ、ルーチン、サブルーチン、オブジェクト、実行可能ファイル、実行スレッド、手順、機能などを意味するよう広く解釈されるべきである。 Software, whether called software, firmware, middleware, microcode, hardware description language, or other names, instructions, instruction sets, codes, code segments, program codes, programs, subprograms, software modules. , Applications, software applications, software packages, routines, subroutines, objects, executable files, execution threads, procedures, features, etc. should be broadly interpreted.

また、ソフトウェア、命令、情報などは、伝送媒体を介して送受信されてもよい。例えば、ソフトウェアが、有線技術（同軸ケーブル、光ファイバケーブル、ツイストペア、デジタル加入者回線（ＤＳＬ：Digital Subscriber Line）など）及び無線技術（赤外線、マイクロ波など）の少なくとも一方を使用してウェブサイト、サーバ、又は他のリモートソースから送信される場合、これらの有線技術及び無線技術の少なくとも一方は、伝送媒体の定義内に含まれる。 Further, software, instructions, information and the like may be transmitted and received via a transmission medium. For example, a website that uses at least one of wired technology (coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), etc.) and wireless technology (infrared, microwave, etc.). When transmitted from a server or other remote source, at least one of these wired and wireless technologies is included within the definition of transmission medium.

本開示において説明した情報、信号などは、様々な異なる技術のいずれかを使用して表されてもよい。例えば、上記の説明全体に渡って言及され得るデータ、命令、コマンド、情報、信号、ビット、シンボル、チップなどは、電圧、電流、電磁波、磁界若しくは磁性粒子、光場若しくは光子、又はこれらの任意の組み合わせによって表されてもよい。 The information, signals, etc. described in the present disclosure may be represented using any of a variety of different techniques. For example, data, instructions, commands, information, signals, bits, symbols, chips, etc. that may be referred to throughout the above description are voltages, currents, electromagnetic waves, magnetic fields or magnetic particles, light fields or photons, or any of these. It may be represented by a combination of.

なお、本開示において説明した用語及び本開示の理解に必要な用語については、同一の又は類似する意味を有する用語と置き換えてもよい。例えば、チャネル及びシンボルの少なくとも一方は信号（シグナリング）であってもよい。また、信号はメッセージであってもよい。また、コンポーネントキャリア（ＣＣ：Component Carrier）は、キャリア周波数、セル、周波数キャリアなどと呼ばれてもよい。 The terms described in the present disclosure and the terms necessary for understanding the present disclosure may be replaced with terms having the same or similar meanings. For example, at least one of a channel and a symbol may be a signal (signaling). Also, the signal may be a message. Further, the component carrier (CC: Component Carrier) may be referred to as a carrier frequency, a cell, a frequency carrier, or the like.

本開示において使用する「システム」及び「ネットワーク」という用語は、互換的に使用される。 The terms "system" and "network" used in this disclosure are used interchangeably.

また、本開示において説明した情報、パラメータなどは、絶対値を用いて表されてもよいし、所定の値からの相対値を用いて表されてもよいし、対応する別の情報を用いて表されてもよい。 Further, the information, parameters, etc. described in the present disclosure may be expressed using absolute values, relative values from predetermined values, or using other corresponding information. It may be represented.

本開示で使用する「判断(determining)」、「決定(determining)」という用語は、多種多様な動作を包含する場合がある。「判断」、「決定」は、例えば、判定(judging)、計算(calculating)、算出(computing)、処理(processing)、導出(deriving)、調査(investigating)、探索(looking up、search、inquiry)（例えば、テーブル、データベース又は別のデータ構造での探索）、確認(ascertaining)した事を「判断」「決定」したとみなす事などを含み得る。また、「判断」、「決定」は、受信(receiving)（例えば、情報を受信すること）、送信(transmitting)(例えば、情報を送信すること)、入力(input)、出力(output)、アクセス(accessing)（例えば、メモリ中のデータにアクセスすること）した事を「判断」「決定」したとみなす事などを含み得る。また、「判断」、「決定」は、解決(resolving)、選択(selecting)、選定(choosing)、確立(establishing)、比較(comparing)などした事を「判断」「決定」したとみなす事を含み得る。つまり、「判断」「決定」は、何らかの動作を「判断」「決定」したとみなす事を含み得る。また、「判断（決定）」は、「想定する（assuming）」、「期待する（expecting）」、「みなす（considering）」などで読み替えられてもよい。 The terms "determining" and "determining" used in this disclosure may include a wide variety of actions. "Judgment" and "decision" are, for example, judgment, calculation, computing, processing, deriving, investigating, looking up, search, inquiry. It may include (eg, searching in a table, database or another data structure), ascertaining as "judgment" or "decision". Also, "judgment" and "decision" are receiving (for example, receiving information), transmitting (for example, transmitting information), input (input), output (output), and access. It may include (for example, accessing data in memory) to be regarded as "judgment" or "decision". In addition, "judgment" and "decision" are considered to be "judgment" and "decision" when the things such as solving, selecting, choosing, establishing, and comparing are regarded as "judgment" and "decision". Can include. That is, "judgment" and "decision" may include considering some action as "judgment" and "decision". Further, "judgment (decision)" may be read as "assuming", "expecting", "considering" and the like.

「接続された(connected)」、「結合された(coupled)」という用語、又はこれらのあらゆる変形は、２又はそれ以上の要素間の直接的又は間接的なあらゆる接続又は結合を意味し、互いに「接続」又は「結合」された２つの要素間に１又はそれ以上の中間要素が存在することを含むことができる。要素間の結合又は接続は、物理的なものであっても、論理的なものであっても、或いはこれらの組み合わせであってもよい。例えば、「接続」は「アクセス」で読み替えられてもよい。本開示で使用する場合、２つの要素は、１又はそれ以上の電線、ケーブル及びプリント電気接続の少なくとも一つを用いて、並びにいくつかの非限定的かつ非包括的な例として、無線周波数領域、マイクロ波領域及び光（可視及び不可視の両方）領域の波長を有する電磁エネルギーなどを用いて、互いに「接続」又は「結合」されると考えることができる。 The terms "connected", "coupled", or any variation thereof, mean any direct or indirect connection or connection between two or more elements and each other. It can include the presence of one or more intermediate elements between two "connected" or "combined" elements. The connection or connection between the elements may be physical, logical, or a combination thereof. For example, "connection" may be read as "access". As used in the present disclosure, the two elements use at least one of one or more wires, cables and printed electrical connections, and as some non-limiting and non-comprehensive examples, the radio frequency domain. Can be considered to be "connected" or "coupled" to each other using electromagnetic energy having wavelengths in the microwave and light (both visible and invisible) regions.

本開示において使用する「に基づいて」という記載は、別段に明記されていない限り、「のみに基づいて」を意味しない。言い換えれば、「に基づいて」という記載は、「のみに基づいて」と「に少なくとも基づいて」の両方を意味する。 The phrase "based on" as used in this disclosure does not mean "based on" unless otherwise stated. In other words, the statement "based on" means both "based only" and "at least based on".

本開示において使用する「第１の」、「第２の」などの呼称を使用した要素へのいかなる参照も、それらの要素の量又は順序を全般的に限定しない。これらの呼称は、２つ以上の要素間を区別する便利な方法として本開示において使用され得る。したがって、第１及び第２の要素への参照は、２つの要素のみが採用され得ること、又は何らかの形で第１の要素が第２の要素に先行しなければならないことを意味しない。 Any reference to elements using designations such as "first" and "second" as used in this disclosure does not generally limit the quantity or order of those elements. These designations can be used in the present disclosure as a convenient way to distinguish between two or more elements. Therefore, references to the first and second elements do not mean that only two elements can be adopted, or that the first element must somehow precede the second element.

上記の各装置の構成における「手段」を、「部」、「回路」、「デバイス」等に置き換えてもよい。 The "means" in the configuration of each of the above devices may be replaced with a "part", a "circuit", a "device" and the like.

本開示において、「含む（include）」、「含んでいる（including）」及びそれらの変形が使用されている場合、これらの用語は、用語「備える（comprising）」と同様に、包括的であることが意図される。さらに、本開示において使用されている用語「又は（or）」は、排他的論理和ではないことが意図される。 When "include", "including" and variations thereof are used in the present disclosure, these terms are as inclusive as the term "comprising". Is intended. Moreover, the term "or" used in the present disclosure is intended not to be an exclusive OR.

本開示において、例えば、英語でのa、 an及びtheのように、翻訳により冠詞が追加された場合、本開示は、これらの冠詞の後に続く名詞が複数形であることを含んでもよい。 In the present disclosure, if articles are added by translation, for example a, an and the in English, the disclosure may include the plural nouns following these articles.

本開示において、「ＡとＢが異なる」という用語は、「ＡとＢが互いに異なる」ことを意味してもよい。なお、当該用語は、「ＡとＢがそれぞれＣと異なる」ことを意味してもよい。「離れる」、「結合される」などの用語も、「異なる」と同様に解釈されてもよい。 In the present disclosure, the term "A and B are different" may mean "A and B are different from each other". The term may mean that "A and B are different from C". Terms such as "separate" and "combined" may be interpreted in the same way as "different".

１０ａ…第１メモリ、１０ｂ…第２メモリ、１０ｃ…第３メモリ、１０ｄ…第４メモリ、１１…発話受信部、１２…発話分解部、１３…ユーザ模倣部、１４…状態決定部、１５…応答文送信部。
10a ... 1st memory, 10b ... 2nd memory, 10c ... 3rd memory, 10d ... 4th memory, 11 ... utterance receiving unit, 12 ... utterance decomposition unit, 13 ... user imitation unit, 14 ... state determination unit, 15 ... Response text transmitter.

Claims

The utterance reception section that accepts utterances from users,
A growth degree calculation unit that calculates the growth degree for a dialogue with the user based on the utterance history of the user, and a growth degree calculation unit.
A response sentence acquisition unit that acquires a response sentence according to the degree of growth in response to the utterance from the user.
A dialogue device equipped with.

The growth degree calculation unit is
The degree of growth is calculated based on the number of expression elements used in the utterance of the user and the ratio of the polarity type in the utterance of the user in the utterance history.
The dialogue device according to claim 1.

An utterance decomposition unit that decomposes utterances from the user for each part of speech and acquires expression elements,
A part-speech history storage unit that stores the expression element as an utterance history,
Equipped with
The growth degree calculation unit is
The degree of growth is calculated based on the number of expression elements stored in the part of speech history storage unit.
The dialogue device according to claim 1 or 2.

It also has a polarity type storage unit that stores the ratio of polarity types for each utterance.
The utterance decomposition part is
The ratio of the polarity type for each expression element in the utterance is acquired and stored in the polarity type storage unit.
The growth degree calculation unit calculates the growth degree based on the ratio of the polarity type in the polarity type storage unit in addition to the number of expression elements.
The dialogue device according to claim 3.

In addition to the degree of growth, the response sentence acquisition unit acquires a response sentence according to the polarity type in the utterance of the user.
The dialogue device according to any one of claims 1 to 4.

The growth degree calculation unit is
The type of response sentence is determined based on the ratio of the polarity type in the user's utterance.
The dialogue device according to claim 5.

A response sentence storage unit associated with a response sentence and a growth threshold is further provided.
The response sentence acquisition unit
The response sentence corresponding to the growth degree is acquired from the response sentence storage unit.
The dialogue device according to any one of claims 1 to 6.

The response sentence storage unit further associates and stores the polarity types, and stores the polar types.
The response sentence acquisition unit selects one or more response sentences according to the tendency of the polarity type of the user's utterance.
In addition, select a response statement according to the degree of growth,
The dialogue device according to claim 7.

The response sentence storage unit further stores the trigger words in association with each other.
The response sentence acquisition unit acquires one or more response sentences in which the expression element included in the utterance of the user who is the response target matches the trigger word of the response sentence storage unit.
The dialogue device according to claim 7 or 8.

The dialogue device according to any one of claims 1 to 9, further comprising a growth degree storage unit for storing the number of expression elements included in the user's utterance, the polarity type thereof, and the growth degree.