JP2023123306A

JP2023123306A - Information processor and program

Info

Publication number: JP2023123306A
Application number: JP2022027324A
Authority: JP
Inventors: 敏郎大櫃; Toshiro Obitsu
Original assignee: Fujitsu Client Computing Ltd
Current assignee: Fujitsu Client Computing Ltd
Priority date: 2022-02-24
Filing date: 2022-02-24
Publication date: 2023-09-05
Anticipated expiration: 2042-02-24
Also published as: JP7299538B1

Abstract

To provide an information processor and a program, which make a response to utterance of a user so that the user and a chatbot can smoothly continue conversation.SOLUTION: An information processor includes: a first estimation part for estimating intention of user input on the basis of the user input; a first specification part for specifying an utterance promotion word for promoting continuous utterance by a user on the basis of the intention and a history of the user input; and a generation part for generating a response sentence on the basis of the intention and the utterance promotion word.SELECTED DRAWING: Figure 1

Description

本発明の実施形態は、情報処理装置及びプログラムに関する。 TECHNICAL FIELD Embodiments of the present invention relate to an information processing apparatus and a program.

近年、ユーザによる音声や文字等の入力に対して自動応答を行うシステム（以下、チャットボットともいう）が知られている。従来、ユーザとチャットボットとの会話が連続的で円滑に進むように、事前にシステム側が準備した複数の選択肢の中からユーザが選択した台詞の履歴に基づいて、チャットボットが、「えー」「あー」等の会話の合間に挟みこまれる感情等を表現する言葉（以下、フィラーともいう）を挿入する技術が提案されている。 2. Description of the Related Art In recent years, there has been known a system (hereinafter also referred to as a chatbot) that automatically responds to input of voice, characters, etc. by a user. Conventionally, in order for the conversation between the user and the chatbot to proceed continuously and smoothly, based on the history of the lines selected by the user from among multiple options prepared by the system in advance, the chatbot will generate "uh" Techniques have been proposed for inserting words (hereinafter also referred to as fillers) that express emotions and the like that are interposed between conversations, such as "ah".

特開２０２０－２４５２２号公報JP 2020-24522 A

上記のような、事前にシステム側が準備した台詞の選択履歴に基づいてフィラーを挿入する方法では、通常会話からのフィラーによる繋ぎ単語を入れた会話を想定した場合、想定した台詞がユーザから発せられないと利用できない場合がある。また、選択履歴に基づいて挿入したフィラーが現在の会話に応じた感情を表すものでない場合もある。このため、フィラーを挿入する方法では、自信がない応答や複雑な文章からなる応答となってしまう場合がある。 In the method described above, in which filler is inserted based on the selection history of lines prepared in advance by the system, assuming a conversation in which filler words are inserted from a normal conversation, the assumed line is uttered by the user. may not be available without Also, the filler inserted based on the selection history may not be emotionally relevant to the current conversation. For this reason, the method of inserting fillers may result in an unconfident response or a response consisting of complicated sentences.

従って、本発明が解決する課題の一例は、ユーザとチャットボットとが会話を円滑に継続できるように、ユーザの発話に対する応答を行うことができる情報処理装置及びプログラムを提供することにある。 Accordingly, one example of the problem to be solved by the present invention is to provide an information processing device and program capable of responding to user's utterances so that the user and the chatbot can smoothly continue the conversation.

本発明の第１態様は、ユーザ入力に基づいて、当該ユーザ入力の意図を推定する第１推定部と、前記意図と、前記ユーザ入力の履歴とに基づいて、前記ユーザ入力に対する応答文章の生成に用いる、ユーザによる継続的な発話を促進させる発話促進単語を特定する第１特定部と、前記意図と前記発話促進単語とに基づいて、前記応答文章を生成する生成部とを備える情報処理装置である。 A first aspect of the present invention includes: a first estimating unit that estimates an intention of the user input based on the user input; and a response sentence to the user input based on the intention and the history of the user input. an information processing apparatus comprising: a first identifying unit that identifies a speech-promoting word that encourages continuous utterance by a user; and a generating unit that generates the response sentence based on the intention and the speech-promoting word is.

また、前記発話促進単語には、前記ユーザ入力を行ったユーザの興味を引く興味単語が含まれ、前記生成部は、前記興味単語を含む前記応答文章を生成してもよい。 Further, the speech prompting word may include an interest word that attracts the interest of the user who has performed the user input, and the generation unit may generate the response sentence including the interest word.

また、情報処理装置は、前記ユーザ入力に基づいて、当該ユーザ入力の話題を推定する第２推定部と、前記意図と、前記話題と、前記興味単語とに基づいて、前記発話促進単語に含まれる前記興味単語と連携して応答文章を形成する連携単語を特定する第２特定部とを更に備え、前記生成部は、前記共感単語及び前記連携単語を含む前記応答文章を生成してもよい。 The information processing device further includes: a second estimation unit that estimates a topic of the user input based on the user input; a second specifying unit that specifies linked words that form a response sentence in collaboration with the interesting word that is used, and the generator may create the response sentence that includes the sympathy word and the linked word. .

前記ユーザ入力の内容に含まれる複数の単語から前記話題における重要単語を抽出する抽出部と、前記話題と前記重要単語とに基づいて、当該ユーザ入力に対する応答文章の生成に用いる、前記発話促進単語に含まれる前記ユーザ入力に対する共感を表す共感単語を特定する第３特定部とを更に備え、前記生成部は、前記共感単語を含む前記応答文章を生成してもよい。 an extraction unit for extracting an important word in the topic from a plurality of words included in the content of the user input; and the utterance-promoting word used for generating a response sentence to the user input based on the topic and the important word. and a third specifying unit that specifies a sympathy word that expresses sympathy for the user input included in the above, and the generating unit generates the response sentence that includes the sympathy word.

また、情報処理装置は、前記ユーザ入力に基づいて、当該ユーザ入力の話題を推定する第２推定部と、前記ユーザ入力と、前記話題とに基づいて、当該ユーザ入力の内容に含まれる複数の単語のうち、当該ユーザ入力に対する応答文章の生成に用いる、前記発話促進単語に含まれる繰り返し単語を特定する第４特定部とを更に備えていてもよい。 Further, the information processing apparatus includes: a second estimating unit that estimates a topic of the user input based on the user input; A fourth specifying unit may further comprise a fourth specifying unit for specifying, among words, repeated words included in the speech-prompting words to be used for generating a response sentence to the user input.

また、前記第４特定部が、複数の前記繰り返し単語を特定するようにしてもよい。 Also, the fourth identifying unit may identify a plurality of the repeated words.

また、本発明の第２態様は、ユーザ入力に基づいて、当該ユーザ入力の意図を推定する処理と、前記意図と、前記ユーザ入力の履歴とに基づいて、前記ユーザ入力に対する応答文章の生成に用いる、ユーザによる継続的な発話を促進させる発話促進単語を特定する処理と、前記意図と、前記発話促進単語とに基づいて、前記応答文章を生成する処理と、を情報処理装置のコンピュータに実行させるプログラムである。 A second aspect of the present invention includes processing for estimating an intention of the user input based on the user input, and generating a response sentence to the user input based on the intention and the history of the user input. The computer of the information processing device executes a process of identifying a speech-promoting word that promotes continuous utterance by the user, and a process of generating the response sentence based on the intention and the speech-promoting word. It is a program that allows

本発明によれば、ユーザとチャットボットとが会話を円滑に継続できるように、ユーザの発話に対する応答を行うことが可能となる。 ADVANTAGE OF THE INVENTION According to this invention, it becomes possible to respond to a user's utterance so that a user and a chatbot can continue conversation smoothly.

図１は、実施形態に係る情報処理システムの構成の一例を模式的に示す図である。FIG. 1 is a diagram schematically showing an example of the configuration of an information processing system according to an embodiment. 図２は、実施形態に係る単語カテゴリデータベース（ＤＢ）の構造の一例を模式的に示す図である。FIG. 2 is a diagram schematically showing an example of the structure of a word category database (DB) according to the embodiment; 図３は、実施形態に係る意図推定データベース（ＤＢ）の構造の一例を模式的に示す図である。FIG. 3 is a diagram schematically showing an example of the structure of an intention estimation database (DB) according to the embodiment; 図４は、実施形態に係る重要単語データベース（ＤＢ）の構造の一例を模式的に示す図である。FIG. 4 is a diagram schematically showing an example of the structure of an important word database (DB) according to the embodiment. 図５は、実施形態に係る会話推定データベース（ＤＢ）の構造の一例を模式的に示す図である。FIG. 5 is a diagram schematically showing an example of the structure of a conversation estimation database (DB) according to the embodiment; 図６は、実施形態に係る興味単語データベース（ＤＢ）の構造の一例を模式的に示す図である。FIG. 6 is a diagram schematically showing an example of the structure of an interesting word database (DB) according to the embodiment. 図７は、実施形態に係る連携単語データベース（ＤＢ）の構造の一例を模式的に示す図である。FIG. 7 is a diagram schematically showing an example of the structure of a linked word database (DB) according to the embodiment. 図８は、実施形態に係る定型文データベース（ＤＢ）の構造の一例を模式的に示す図である。FIG. 8 is a diagram schematically showing an example of the structure of a fixed phrase database (DB) according to the embodiment; 図９は、実施形態に係る情報処理システムにおいて実行される、情報処理の一例を示すフローチャートである。FIG. 9 is a flowchart illustrating an example of information processing executed in the information processing system according to the embodiment; 図１０は、実施形態に係る情報処理により実現される会話の一例を模式的に示す図である。FIG. 10 is a diagram schematically illustrating an example of conversation realized by information processing according to the embodiment; 図１１は、実施形態に係る情報処理システムを実現するコンピュータのハードウェア構成の一例を模式的に示す図である。FIG. 11 is a diagram schematically illustrating an example of a hardware configuration of a computer that implements the information processing system according to the embodiment;

以下に添付図面を参照して、実施形態に係る情報処理装置及びプログラムの実施形態を詳細に説明する。 Embodiments of an information processing apparatus and a program according to embodiments will be described in detail below with reference to the accompanying drawings.

図１は、実施形態に係る情報処理システム１の構成の一例を模式的に示す図である。実施形態に係る情報処理システム１は、ユーザによる入力を受け付け、入力内容を解析する。また、情報処理システム１は、入力内容に対する応答を解析結果に応じて自動生成し、ユーザへの応答を出力する。本実施形態では、ユーザによる入力として、ユーザの発話内容を示す音声データを例示するが、これに限らない。ユーザによる入力としては、テキストデータであっても構わない。 FIG. 1 is a diagram schematically showing an example of the configuration of an information processing system 1 according to an embodiment. The information processing system 1 according to the embodiment receives input from a user and analyzes the input content. Further, the information processing system 1 automatically generates a response to the input content according to the analysis result, and outputs the response to the user. In the present embodiment, voice data indicating the contents of the user's utterance is exemplified as the input by the user, but the input is not limited to this. The input by the user may be text data.

図１に示すように、情報処理システム１は、端末装置１００及びサーバ装置２００を有する。端末装置１００及びサーバ装置２００は、インターネット等のネットワーク（不図示）を介して互いに通信可能に接続されている。 As shown in FIG. 1, the information processing system 1 has a terminal device 100 and a server device 200 . The terminal device 100 and the server device 200 are communicably connected to each other via a network (not shown) such as the Internet.

一例として、情報処理システム１は、端末装置１００を使用するユーザと、サーバ装置２００により実現されるチャットボットとの間における会話を実現するチャットボットシステムである。以下、サーバ装置２００により実現される機能をチャットボットと記載する場合もある。 As an example, the information processing system 1 is a chatbot system that implements a conversation between a user using the terminal device 100 and a chatbot implemented by the server device 200 . Hereinafter, the function realized by the server device 200 may also be described as a chatbot.

端末装置１００は、図１に示すように、通信部１１０、入力部１２０及び出力部１３０を有する。 The terminal device 100 has a communication section 110, an input section 120 and an output section 130, as shown in FIG.

通信部１１０は、端末装置１００と他の装置（図１に示される例ではサーバ装置２００）との間で通信を実行する。 The communication unit 110 performs communication between the terminal device 100 and another device (the server device 200 in the example shown in FIG. 1).

入力部１２０は、端末装置１００のユーザの発話に応じた音声データの入力を受け付ける。入力部１２０は、受け付けた音声データを通信部１１０によりサーバ装置２００へ送信する。 The input unit 120 receives an input of voice data according to the user's speech of the terminal device 100 . Input unit 120 transmits the received voice data to server device 200 through communication unit 110 .

出力部１３０は、通信部１１０によりサーバ装置２００からの応答を受信したとき、ユーザに当該応答を音声や画像で通知する。 When the communication unit 110 receives a response from the server device 200, the output unit 130 notifies the user of the response by voice or image.

サーバ装置２００は、通信部２１０、文章解析部２２０、単語カテゴリデータベース（ＤＢ）２２１、単語解析部２３０、意図推定データベース２３１、重要単語データベース２３２、会話推定データベース２３３、興味単語データベース２３４、応答生成部２４０、連携単語データベース２４１、定型文データベース２４２及び管理部２５０を有する。 The server device 200 includes a communication unit 210, a text analysis unit 220, a word category database (DB) 221, a word analysis unit 230, an intention estimation database 231, an important word database 232, a conversation estimation database 233, an interesting word database 234, and a response generation unit. 240 , linked word database 241 , fixed phrase database 242 and management unit 250 .

通信部２１０は、サーバ装置２００と他の装置（図１に示される例では端末装置１００）との間で通信を実行する。 The communication unit 210 performs communication between the server device 200 and another device (the terminal device 100 in the example shown in FIG. 1).

文章解析部２２０は、端末装置１００を用いてユーザにより入力されたユーザ発話を示す音声データに対して音声認識による解析を行い、ユーザ発話を示すテキストデータを生成する。また、文章解析部２２０は、公知の形態素解析により、ユーザ発話を示すテキストデータに含まれる形態素を取得する。 The text analysis unit 220 analyzes the voice data representing the user's utterance input by the user using the terminal device 100 through voice recognition, and generates text data representing the user's utterance. Also, the text analysis unit 220 acquires morphemes contained in text data representing user utterances by a known morphological analysis.

換言すれば、文章解析部２２０は、形態素解析により、テキスト化されたユーザ発話を単語分割する。文章解析部２２０は、単語カテゴリデータベース２２１を用いて、単語分割により得られた単語から抽出単語を選定する。また、文章解析部２２０は、抽出単語に対応する話題を推定する。文章解析部２２０は、単語分割により得られた単語及び推定された話題を、単語解析部２３０に供給する。 In other words, the sentence analysis unit 220 divides the user's utterance converted into text into words by morphological analysis. The text analysis unit 220 selects extracted words from the words obtained by word segmentation using the word category database 221 . Also, the text analysis unit 220 estimates a topic corresponding to the extracted word. The text analysis unit 220 supplies the words obtained by word segmentation and the estimated topics to the word analysis unit 230 .

単語カテゴリデータベース２２１は、抽出単語と、各抽出単語に対応する話題とが登録されたデータベースである。 The word category database 221 is a database in which extracted words and topics corresponding to each extracted word are registered.

なお、単語カテゴリデータベース２２１に登録される情報は、過去のユーザからの応答に基づいてパラメータが学習された機械学習モデル等により生成されたものであってもよいし、人為的に生成されたものであってもよい。 The information registered in the word category database 221 may be generated by a machine learning model or the like in which parameters are learned based on responses from past users, or may be generated artificially. may be

図２は、実施形態に係る単語カテゴリデータベース２２１の構造の一例を模式的に示す図である。 FIG. 2 is a diagram schematically showing an example of the structure of the word category database 221 according to the embodiment.

図２に示す例では、話題「ファッション」に対応する抽出単語として、「[服アイテム]」、「似合う」、「おしゃれ」、「トレンド」、「買う」が登録されている。なお、「[服アイテム]」は、具体的な服の名称（例えば、ワンピース等）を表している。 In the example shown in FIG. 2, "[clothing item]", "suitable", "fashionable", "trend", and "buy" are registered as extracted words corresponding to the topic "fashion". Note that "[clothing item]" represents a specific clothing name (for example, a dress).

また、話題「季節」に対応する抽出単語として、「春」、「初夏」、「夏」、「秋」、「晩秋」、「冬」、「梅雨」及び「季節」が登録されている。また、話題「旅行」に対応する抽出単語として、「旅行」、「ハワイ」、「グアム」、「東京」、「京都」、「北海道」、「行く」及び「今度」が登録されている。また、話題「ファッション提案」のカテゴリに対応する抽出単語として、「似合う」、「コーデ」、「洋服」、「ディナー」、「おしゃれ」、「予算」及び「彼氏」が登録されている。 Also, "spring", "early summer", "summer", "autumn", "late autumn", "winter", "rainy season" and "season" are registered as extracted words corresponding to the topic "season". As extracted words corresponding to the topic "travel", "travel", "Hawaii", "Guam", "Tokyo", "Kyoto", "Hokkaido", "go" and "next time" are registered. Also, as extracted words corresponding to the category of the topic "fashion proposal", "look good", "coordination", "clothes", "dinner", "fashionable", "budget" and "boyfriend" are registered.

なお、抽出単語が複数の話題に存在する場合には、例えば抽出単語が多く含まれる話題が推定されればよい。あるいは、抽出単語が複数の話題に存在する場合に、Ｎｏ順に上位の話題が推定される構成とすることもできる。 If the extracted word exists in a plurality of topics, the topic that contains many extracted words may be estimated. Alternatively, when an extracted word exists in a plurality of topics, it is also possible to adopt a configuration in which the top topics are estimated in No. order.

単語解析部２３０は、意図推定データベース２３１を用いて、文章解析部２２０から入力される単語からユーザ発話の意図、すなわち発話意図を推定する。換言すれば、単語解析部２３０は、意図推定データベース２３１と、単語分割により得られた単語とに基づいて、ユーザ発話の発話意図を表すカテゴリを推定する。 The word analysis unit 230 uses the intention estimation database 231 to estimate the intention of the user's utterance, that is, the utterance intention from the words input from the text analysis unit 220 . In other words, the word analysis unit 230 estimates the category representing the utterance intention of the user's utterance based on the intention estimation database 231 and the words obtained by word division.

また、単語解析部２３０は、重要単語データベース２３２を用いて、文章解析部２２０から入力される単語から重要度が高い単語である重要単語を特定する。さらに、単語解析部２３０は、特定した重要単語に基づいて、繰り返し単語を特定する。例えば、単語解析部２３０は、重要単語に予め定められた優先順位に従い、繰り返し単語を特定する。 The word analysis unit 230 also uses the important word database 232 to identify important words, which are highly important words, from the words input from the text analysis unit 220 . Further, word analysis section 230 identifies repeated words based on the identified important words. For example, the word analysis unit 230 identifies repetitive words according to a predetermined priority order of important words.

なお、単語解析部２３０は、複数の繰り返し単語を特定してもよい。また、単語解析部２３０は、重要単語が特定できない場合、繰り返し単語を特定する処理を省略してもよい。 Note that the word analysis unit 230 may identify multiple repeated words. Further, when the word analysis unit 230 cannot identify the important word, the word analysis unit 230 may omit the process of identifying repeated words.

そして、単語解析部２３０は、特定された繰り返し単語を応答文章の生成に用いる繰り返し単語として追加する。 Then, the word analysis unit 230 adds the identified repetitive word as a repetitive word to be used for generating a response sentence.

また、単語解析部２３０は、会話推定データベース２３３を用いて、推定された話題及び特定された繰り返し単語に基づいて、ユーザの発話に対する共感を表す共感単語を特定する。なお、単語解析部２３０は、繰り返し単語から共感単語を特定できない場合、共感単語を特定する処理を省略してもよい。そして、単語解析部２３０は、特定された共感単語を応答文章の生成に用いる共感単語として追加する。 In addition, the word analysis unit 230 uses the conversation estimation database 233 to identify sympathy words expressing sympathy for the user's utterance based on the estimated topic and the identified repetitive words. Note that the word analysis unit 230 may omit the process of identifying the sympathy word when the sympathy word cannot be identified from the repeated words. Then, the word analysis unit 230 adds the specified sympathy words as sympathy words used for generating the response sentence.

また、単語解析部２３０は、興味単語データベース２３４を用いて、推定されたカテゴリ及びユーザ発話よりも前に行われたユーザの行為（例えば、対象のユーザ発話よりも前に行われたユーザとチャットボットとの会話を表す前会話情報）からユーザの興味を引く単語である興味単語を特定する。そして、単語解析部２３０は、特定された興味単語を応答文章の生成に用いる興味単語として追加する。 In addition, the word analysis unit 230 uses the interesting word database 234 to determine the estimated category and the behavior of the user that occurred prior to the user's utterance (for example, the chat with the user that occurred prior to the target user's utterance). An interesting word, which is a word that attracts the user's interest, is specified from the previous conversation information representing the conversation with the bot. Then, word analysis section 230 adds the specified interesting word as an interesting word to be used for generating a response sentence.

また、単語解析部２３０は、追加された繰り返し単語、共感単語、及び興味単語を応答生成部２４０に供給する。 Also, the word analysis unit 230 supplies the added repetitive words, sympathetic words, and interesting words to the response generation unit 240 .

意図推定データベース２３１は、各発話意図を表すカテゴリに対応する抽出単語が登録されたデータベースである。換言すれば、意図推定データベース２３１は、単語（抽出単語）と当該単語を含むカテゴリを示す発話意図との対応が登録されたデータベースである。 The intention estimation database 231 is a database in which extracted words corresponding to categories representing each utterance intention are registered. In other words, the intention estimation database 231 is a database in which correspondences between words (extracted words) and utterance intentions indicating categories including the words are registered.

なお、意図推定データベース２３１に登録される情報は、過去のユーザからの応答に基づいてパラメータが学習された機械学習モデル等により生成されたものであってもよいし、人為的に生成されたものであってもよい。 The information registered in the intention estimation database 231 may be generated by a machine learning model or the like in which parameters are learned based on responses from past users, or may be generated artificially. may be

図３は、実施形態に係る意図推定データベース２３１の構造の一例を模式的に示す図である。 FIG. 3 is a diagram schematically showing an example of the structure of the intention estimation database 231 according to the embodiment.

図３に示す例では、カテゴリ「検索」に対応する抽出単語として、「調べて」、「検索して」、及び「おしえて」が登録されている。また、カテゴリ「質問」に対応する抽出単語として、「しますか」、「だね」、「どうなの」、「ありますか」及び「どちら」が登録されている。 In the example shown in FIG. 3, "search", "search", and "tell me" are registered as extracted words corresponding to the category "search". Also, as extracted words corresponding to the category "question", "Do you want to do it?", "Yes", "What do you think?"

また、カテゴリ「挨拶」に対応する抽出単語として、「いかがですか」、「こんにちは」、「おはよう」、「ですね」、及び「きましたね」が登録されている。カテゴリ「雑談」に対応する抽出単語として、「今日」、「今度」、「する」、「行く」、「着る」、「見た」、「暑い」、「寒い」、「どう」、「思う」、「知ってる」、「調子」、「旅行」、「買う」、及び「決まる」が登録されている。 Also, as extracted words corresponding to the category "greetings", "How are you?", "Hello", "Good morning", "Nice", and "Welcome" are registered. Extracted words corresponding to the category "chat" are "today", "next time", "do", "go", "wear", "saw", "hot", "cold", "how", "think" , "know", "condition", "trip", "buy", and "determine" are registered.

また、カテゴリ「相槌」に対応する抽出単語として、「ああ」、「もう」、「その通り」、「そうですか」、「本当」、「すごいです」及び「同感」が登録されている。 In addition, as extracted words corresponding to the category "backhand", "aah", "yes", "that's right", "is that true", "true", "great", and "agree" are registered.

なお、抽出単語が複数のカテゴリに存在する場合には、例えば抽出単語が多く含まれるカテゴリが推定されればよい。あるいは、抽出単語が複数のカテゴリに存在する場合に、Ｎｏ順に上位のカテゴリが推定される構成とすることもできる。 Note that if the extracted words exist in a plurality of categories, for example, the category containing many extracted words may be estimated. Alternatively, when an extracted word exists in a plurality of categories, it is also possible to adopt a configuration in which a higher category is estimated in order of No.

重要単語データベース２３２は、話題毎に重要単語が登録されたデータベースである。換言すれば、重要単語データベース２３２は、話題と当該話題における重要単語との対応が登録されたデータベースである。なお、重要単語データベース２３２に登録される情報は、過去のユーザからの応答に基づいてパラメータが学習された機械学習モデル等により生成されたものであってもよいし、人為的に生成されたものであってもよい。 The important word database 232 is a database in which important words are registered for each topic. In other words, the important word database 232 is a database in which correspondence between topics and important words in the topic is registered. The information registered in the important word database 232 may be generated by a machine learning model or the like in which parameters are learned based on responses from past users, or may be generated artificially. may be

図４は、実施形態に係る重要単語データベース２３２の構造の一例を模式的に示す図である。 FIG. 4 is a diagram schematically showing an example of the structure of the important word database 232 according to the embodiment.

図４に示す例では、話題「ファッション」に対応する重要単語として、「身体」、「素材」、「着る」、「心地良い」、「[服アイテム]」及び「似合う」が登録されている。また、話題「季節」に対応する重要単語として、「初詣」、「節分」、「バレンタインデー」、「春一番」及び「ひな祭り」が登録されている。 In the example shown in FIG. 4, "body", "material", "wear", "comfortable", "[clothing item]", and "suit" are registered as important words corresponding to the topic "fashion". . Also, as important words corresponding to the topic "season", "first visit of the year", "setsubun", "Valentine's Day", "first spring", and "doll's festival" are registered.

また、話題「旅行」に対応する重要単語として、「旅行」、「行く」、「写真」、「動画」、「撮影」、「予約」及び「[場所]」が登録されている。なお、「[場所]」は具体的な場所の名前（例えば、東京等）を表している。また、話題「ファッション提案」に対応する重要単語として、「彼氏」、「誕生会」及び「謝恩会」が登録されている。また、話題「天気」に対応する重要単語として、「天気」、「季節」、「暑い」、「寒い」、「雨」、「晴れ」及び「曇り」が登録されている。 Also, "travel", "go", "picture", "video", "shoot", "reservation", and "[place]" are registered as important words corresponding to the topic "travel". Note that "[location]" represents the name of a specific location (for example, Tokyo). Also, "boyfriend", "birthday party" and "thank you party" are registered as important words corresponding to the topic "fashion proposal". Also, "weather", "season", "hot", "cold", "rainy", "sunny" and "cloudy" are registered as important words corresponding to the topic "weather".

また、話題「趣味」に対応する重要単語として、「渋谷」、「カフェ」及び「遊園地」が登録されている。また、話題「娯楽」に対応する重要単語として、「音楽」、「読書」、「釣り」及び「ゴルフ」が登録されている。また、話題「健康」に対応する重要単語として、「食」及び「プレゼント」が登録されている。 Also, "Shibuya", "cafe" and "amusement park" are registered as important words corresponding to the topic "hobby". Also, "music", "reading", "fishing" and "golf" are registered as important words corresponding to the topic "entertainment". Also, "food" and "present" are registered as important words corresponding to the topic "health".

会話推定データベース２３３は、話題及び重要単語の組毎に共感単語が登録されたデータベースである。換言すれば、会話推定データベース２３３は、話題及び重要単語の組と、当該組における共感単語との対応が登録されたデータベースである。 The conversation estimation database 233 is a database in which sympathetic words are registered for each set of topics and important words. In other words, the conversation estimation database 233 is a database in which correspondence between pairs of topics and important words and sympathetic words in the pairs is registered.

なお、会話推定データベース２３３に登録される情報は、過去のユーザからの応答に基づいてパラメータが学習された機械学習モデル等により生成されたものであってもよいし、人為的に生成されたものであってもよい。 The information registered in the conversation estimation database 233 may be generated by a machine learning model or the like in which parameters are learned based on responses from past users, or may be generated artificially. may be

図５は、実施形態に係る会話推定データベース２３３の構造の一例を模式的に示す図である。 FIG. 5 is a diagram schematically showing an example of the structure of the conversation estimation database 233 according to the embodiment.

図５に示す例では、話題「ファッション」及び重要単語「「[服アイテム]」、「買う」」との組に対応する共感単語として、「ピッタリだね」が登録されている。また、話題「ファッション」及び重要単語「「[服アイテム]」、「何か」、「流行」」の組に対応する共感単語として、「探してみようか」が登録されている。また、話題「ファッション」及び重要単語「「[服アイテム]」、「センス」」の組に対応する共感単語として、「素敵です」が登録されている。 In the example shown in FIG. 5, "perfect fit" is registered as a sympathy word corresponding to the set of the topic "fashion" and the important words "[clothing item]", "buy". Also, as a sympathy word corresponding to the set of the topic "fashion" and the important words "[clothing item]", "something", and "trend", "let's look for it" is registered. Also, "nice" is registered as a sympathy word corresponding to the set of the topic "fashion" and the important words "[clothing item]" and "sense".

また、話題「季節」及び重要単語「「[季節]」、「寒い」、「暑い」」の組に対応する共感単語として、「まだ季節は変わってないかな」が登録されている。また、話題「季節」及び重要単語「「[季節]」、「晴れ」、「雨」」の組に対応する共感単語として、「こんな天候が続くかな」が登録されている。 Also, "I wonder if the season has changed yet" is registered as a sympathy word corresponding to the set of the topic "season" and the important words "[season]", "cold", and "hot". Also, as a sympathy word corresponding to the set of the topic "season" and the important words "[season]", "sunny", and "rain", "I wonder if this kind of weather will continue" is registered.

また、話題「旅行」及び重要単語「「[場所]」、「旅行」、「行く」」の組に対応する共感単語として、「いいなぁ」が登録されている。また、話題「旅行」及び重要単語「「[アイテム]」、「旅行」、「買う」」の組に対応する共感単語として、「それはいい買い物だね」が登録されている。また、話題「旅行」及び重要単語「「[日時]」、「旅行」、「計画」」の組に対応する共感単語として、「[日時]はいい季節だね」が登録されている。 Also, "nice" is registered as a sympathetic word corresponding to the set of the topic "travel" and the important words "[place]", "travel", and "go". Also, "That's a good shopping" is registered as a sympathy word corresponding to the set of the topic "travel" and the important words "[item]", "travel", and "buy". Also, "[Date and time] is a nice season" is registered as a sympathy word corresponding to the set of the topic "Travel" and the important words "[Date and time]", "Travel", and "Plan"".

興味単語データベース２３４は、カテゴリ及び抽出単語の組に対応する興味単語が登録されたデータベースである。換言すれば、会話推定データベース２３３は、カテゴリ及び抽出単語の組と、当該組における興味単語との対応が登録されたデータベースである。 The interesting word database 234 is a database in which interesting words corresponding to sets of categories and extracted words are registered. In other words, the conversation estimation database 233 is a database in which correspondence between pairs of categories and extracted words and interesting words in the pairs is registered.

なお、興味単語データベース２３４に登録される情報は、過去のユーザからの応答に基づいてパラメータが学習された機械学習モデル等により生成されたものであってもよいし、人為的に生成されたものであってもよい。 The information registered in the interesting word database 234 may be generated by a machine learning model or the like in which parameters are learned based on responses from past users, or may be generated artificially. may be

図６は、実施形態に係る興味単語データベース２３４の構造の一例を模式的に示す図である。 FIG. 6 is a diagram schematically showing an example of the structure of the interesting word database 234 according to the embodiment.

図６に示す例では、カテゴリ「検索」及び抽出単語「「[予約]」、「チケット」」との組に対応する興味単語として、「「[日時]」、「[場所]」、「席」」が登録されている。なお、［予約］は、具体的な予約の内容（例えば、コンサートのチケットの予約等）を表している。 In the example shown in FIG. 6, "[date and time]", "[place]", "seat ” is registered. Note that [reservation] represents specific reservation contents (for example, concert ticket reservation, etc.).

また、カテゴリ「検索」及び抽出単語「「[予約]」、「[店名種類]」」との組に対応する興味単語として、「「[店舗名]」、「[場所]」」が登録されている。なお、「[店名種類]」は、具体的な店名及び店のジャンル（例えば、○○屋・イタリアン等）を表している。また、カテゴリ「検索」及び抽出単語「「[販売]」、「[物品名]」」との組に対応する興味単語として、「「[場所]」、「[いつ]」」が登録されている。なお、「[いつ]」は、具体的な日時（例えば、２月２日等）を表している。 Also, "[store name]" and "[place]" are registered as words of interest corresponding to the combination of the category "search" and the extracted words "[reservation]" and "[store name type]". ing. It should be noted that "[store name type]" represents a specific store name and store genre (for example, ○○ya, Italian, etc.). In addition, "[place]" and "[when]" are registered as words of interest corresponding to the combination of the category "search" and the extracted words "[sale]" and "[item name]". there is Note that "[when]" represents a specific date and time (for example, February 2).

また、カテゴリ「雑談」及び抽出単語「「[部位]」、「調子」との組に対応する興味単語として、「「[部位]」、「[季節]」、「[時間]」」が登録されている。なお、「[部位]」は、具体的な身体の部位（例えば、お腹等）を表している。 In addition, "[part]", "[season]", and "[time]" are registered as words of interest corresponding to the set of the category "chat" and the extracted words "[part]" and "tone". It is Note that "[part]" represents a specific part of the body (for example, abdomen).

また、カテゴリ「雑談」及び抽出単語「「[服アイテム]」、「似合う」との組に対応する興味単語として、「「[服アイテム]」、「思う」、「センス」、「おしゃれ」」が登録されている。また、カテゴリ「雑談」及び抽出単語「「[天候]」、「おでかけ」との組に対応する興味単語として、「「[天候]」、「[季節]」」が登録されている。 Also, as interesting words corresponding to the set of the category "chat" and the extracted words "[clothing item]" and "suit", "[clothing item]", "think", "sense", and "stylish". is registered. Also, "[weather]" and "[season]" are registered as words of interest corresponding to the set of the category "chat" and the extracted words "[weather]" and "outing".

応答生成部２４０は、連携単語データベース２４１を用いて、単語解析部２３０から入力される興味単語から、繰り返し単語、共感単語、及び興味単語と連携して応答文章を形成する連携単語を特定する。なお、応答生成部２４０は、興味単語から連携単語を特定できない場合、連携単語を特定する処理を省略してもよい。 The response generation unit 240 uses the linked word database 241 to identify, from the interesting words input from the word analysis unit 230, repeated words, sympathetic words, and linked words that form a response sentence in cooperation with the interesting words. Note that the response generation unit 240 may omit the process of identifying the linked word when the linked word cannot be specified from the interesting word.

また、応答生成部２４０は、定型文データベース２４２を用いて、繰り返し単語、共感単語、興味単語、及び連携単語により、応答文章を生成する。応答生成部２４０は、通信部２１０により、応答文章を端末装置１００へ送信する。 In addition, the response generation unit 240 uses the standard sentence database 242 to generate a response sentence from repeated words, sympathetic words, interesting words, and linked words. The response generation unit 240 transmits the response text to the terminal device 100 through the communication unit 210 .

なお、応答生成部２４０は、繰り返し単語、共感単語、及び連携単語のない応答文章を生成してもよい。また、応答生成部２４０は、予め定めた基準に基づいて、生成された応答文章が意味の通る文章であるか否かを判定する処理を行ってもよい。 Note that the response generation unit 240 may generate a response sentence without repeated words, sympathetic words, and linked words. Further, the response generation unit 240 may perform processing for determining whether or not the generated response text is meaningful based on a predetermined criterion.

連携単語データベース２４１は、会話のカテゴリ、会話の話題及び興味単語の組に対応する連携単語が登録されたデータベースである。換言すれば、連携単語データベース２４１は、会話のカテゴリ、会話の話題及び興味単語の組と、当該組における連携単語との対応が登録されたデータベースである。 The linked word database 241 is a database in which linked words corresponding to sets of conversation categories, conversation topics, and interesting words are registered. In other words, the linked word database 241 is a database in which correspondences between pairs of conversation categories, conversation topics and interesting words, and linked words in the pairs are registered.

なお、連携単語データベース２４１に登録される情報は、過去のユーザからの応答に基づいてパラメータが学習された機械学習モデル等により生成されたものであってもよいし、人為的に生成されたものであってもよい。 The information registered in the linked word database 241 may be generated by a machine learning model or the like in which parameters are learned based on responses from past users, or may be generated artificially. may be

図７は、実施形態に係る連携単語データベース２４１の構造の一例を模式的に示す図である。 FIG. 7 is a diagram schematically showing an example of the structure of the linked word database 241 according to the embodiment.

図７に示す例では、カテゴリ「雑談」、話題「ファッション」、興味単語「「[服アイテム]」との組に対応する連携単語として、「着る」が登録されている。また、カテゴリ「雑談」、話題「ファッション」、興味単語「流行」との組に対応する連携単語として、「[色]が最先端」が登録されている。なお、「[色]」は具体的な色（例えば、レッド等）を表している。また、カテゴリ「雑談」、話題「ファッション」、興味単語「[物品名]」との組に対応する連携単語として、「[場所]があなたに合う」が登録されている。 In the example shown in FIG. 7, "wear" is registered as a linked word corresponding to the set of the category "chat", the topic "fashion", and the word of interest "[clothing item]". Also, "[color] is cutting-edge" is registered as a linked word corresponding to the set of the category "chat", the topic "fashion", and the word of interest "trend". Note that "[color]" represents a specific color (for example, red). In addition, "[location] suits you" is registered as a linked word corresponding to the combination of the category "chat", the topic "fashion", and the word of interest "[item name]".

また、カテゴリ「雑談」、話題「旅行」、興味単語「[場所]」との組に対応する連携単語として、「行ったこと」が登録されている。また、カテゴリ「雑談」、話題「旅行」、興味単語「[服アイテム]」との組に対応する連携単語として、「「[服アイテム]」、「センスある」」が登録されている。また、カテゴリ「雑談」、話題「旅行」、興味単語「「[天候]」、「お出かけ」」との組に対応する連携単語として、「「[天候]」でいいところ探そうか」が登録されている。 In addition, as a linked word corresponding to a set of the category "chatter", the topic "travel", and the word of interest "[place]", "what you have done" is registered. Also, "[clothing item]", "sense"" is registered as a linking word corresponding to the combination of the category "chat", the topic "travel", and the word of interest "[clothing item]". In addition, as a linked word corresponding to the combination of the category "chat", the topic "travel", and the word of interest "[weather]", "outing", "Let's look for a good place in "[weather]"" is registered. ing.

定型文データベース２４２は、カテゴリ、話題及び興味単語の組に対応する定型文が登録されているデータベースである。換言すれば、定型文データベース２４２は、カテゴリ、話題及び興味単語の組と当該組に対応する定型文とが登録されたデータベースである。 The standard sentence database 242 is a database in which standard sentences corresponding to sets of categories, topics, and interesting words are registered. In other words, the standard sentence database 242 is a database in which pairs of categories, topics, and interesting words and standard sentences corresponding to the pairs are registered.

なお、定型文データベース２４２に登録される定型文は、過去のユーザからの応答に基づいてパラメータが学習された機械学習モデル等により生成されたものであってもよいし、人為的に生成されたものであってもよい。 The fixed phrases registered in the fixed phrase database 242 may be generated by a machine learning model or the like in which parameters are learned based on responses from past users, or may be artificially generated. can be anything.

図８は、実施形態に係る定型文データベース２４２の構造の一例を模式的に示す図である。 FIG. 8 is a diagram schematically showing an example of the structure of the fixed phrase database 242 according to the embodiment.

図８に示す例では、カテゴリ「雑談」、話題「ファッション」、及び興味単語「［服アイテム］」の組に対応する定型文として、「[繰り返し単語]に[繰り返し単語]んだ、[共感単語]、[興味単語]を[連携単語]の？」が登録されている。また、カテゴリ「雑談」、話題「ファッション」、及び興味単語「流行」の組に対応する定型文として、「[繰り返し単語]の[繰り返し単語]ですか、[共感単語]、[興味単語]は[連携単語]だね」が登録されている。 In the example shown in FIG. 8, as a fixed sentence corresponding to a set of the category "chat", the topic "fashion", and the word of interest "[clothing item]", "[repeated word] is [repeated word], [empathy word], [interesting word] and [linked word]?" are registered. In addition, as a set of fixed sentences corresponding to the set of the category "chat", the topic "fashion", and the interesting word "fashion", "Is the [repeated word] of [repeated word], [sympathy word], [interested word] are [Collaboration word] desu" is registered.

また、カテゴリ「雑談」、話題「ファッション」、及び興味単語「［物品名］」の組に対応する定型文として、「[繰り返し単語]から、[繰り返し単語]なのですね。[共感単語]、[興味単語]は[連携単語]かな」が登録されている。 In addition, as a fixed phrase corresponding to the set of the category "chat", the topic "fashion", and the word of interest "[item name]", "From [repeated word] to [repeated word], isn't it? [Empathy word], [ [interesting word] is registered as [linked word] kana".

また、カテゴリ「雑談」、話題「旅行」、及び興味単語「［場所］」の組に対応する定型文として、「[繰り返し単語]の[繰り返し単語]を計画してるんだね、[共感単語]、[興味単語]には[連携単語]がありますか」が登録されている。 Also, as a set of fixed sentences corresponding to the set of the category "chat", the topic "travel", and the interesting word "[place]", "I'm planning [repeated word] of [repeated word], [sympathy word ], and [interesting word] has [linked word]" are registered.

また、カテゴリ「雑談」、話題「旅行」、及び興味単語「［観光アイテム］」の組に対応する定型文として、「[繰り返し単語]は[繰り返し単語]だよね。[繰り返し単語]に行くなら[興味単語]はいかがでしょうかねぇ」が登録されている。なお、「［観光アイテム］」は、土産品等の具体的な名称（例えば、○○焼等）を表している。 Also, as a set of fixed sentences corresponding to the set of the category "chat", the topic "travel", and the word of interest "[sightseeing item]", "[repeated word] is [repeated word]. If you go to [repeated word] How about [interesting words]" is registered. Note that "[sightseeing item]" represents a specific name of a souvenir or the like (for example, XX ware, etc.).

また、カテゴリ「雑談」、話題「旅行」、及び興味単語「［食事］」の組に対応する定型文として、「[繰り返し単語]で[繰り返し単語]を食べるんだ。今なら[興味単語]を[連携単語]で、おいしそうだね」が登録されている。なお、「［食事］」は、食材等の具体的な名称（例えば、鯖等）を表している。 In addition, as a fixed sentence corresponding to a set of the category "chat", the topic "travel", and the interesting word "[meal]", "[repeated word] eat [repeated word]. Now [interested word] is [linked word], looks delicious" is registered. Note that "[meal]" represents a specific name of a food material (for example, mackerel, etc.).

管理部２５０は、端末装置１００やユーザを識別するユーザ情報を管理する。また、管理部２５０は、通信部２１０により、ユーザの音声入力を要求する要求情報を端末装置１００に送信する。 The management unit 250 manages user information that identifies the terminal device 100 and the user. In addition, the management unit 250 transmits request information requesting the user's voice input to the terminal device 100 through the communication unit 210 .

なお、図２乃至図８に例示する単語カテゴリデータベース２２１、意図推定データベース２３１、重要単語データベース２３２、会話推定データベース２３３、興味単語データベース２３４、連携単語データベース２４１及び定型文データベース２４２の各データベースの構成は一例であり、任意に設定可能である。 The word category database 221, the intention estimation database 231, the important word database 232, the conversation estimation database 233, the interesting word database 234, the linked word database 241, and the phrase database 242 shown in FIGS. This is an example and can be set arbitrarily.

図９は、実施形態に係る情報処理システム１において実行される、情報処理の一例を示すフローチャートである。ここでは、図９の流れがサーバ装置２００で実行される場合を例示する。また、前提として、端末装置１００によりユーザ発話が入力されているものとする。 FIG. 9 is a flowchart showing an example of information processing executed in the information processing system 1 according to the embodiment. Here, a case where the flow of FIG. 9 is executed by the server device 200 is illustrated. Also, as a premise, it is assumed that the terminal device 100 has input a user's speech.

まず、管理部２５０は、通信部２１０と協働し、端末装置１００からユーザ発話を示す音声データを取得する（Ｓ１０１）。 First, the management unit 250 cooperates with the communication unit 210 to acquire voice data representing user utterances from the terminal device 100 (S101).

文章解析部２２０は、取得されたユーザ発話を示す音声データに対する音声認識処理及び文章単語分割処理を行う（Ｓ１０２）。 The text analysis unit 220 performs voice recognition processing and sentence word segmentation processing on the acquired voice data representing the user's utterance (S102).

その後、文章解析部２２０は、単語カテゴリデータベース２２１により話題を推定するための単語を抽出する（Ｓ１０３）。具体的には、文章解析部２２０は、ユーザ発話を文章単語分割処理することにより分割された複数の単語の中から、単語カテゴリデータベース２２１に抽出単語として登録されている単語を抽出する。 After that, the text analysis unit 220 extracts words for estimating the topic from the word category database 221 (S103). Specifically, the sentence analysis unit 220 extracts words registered as extracted words in the word category database 221 from among a plurality of words obtained by dividing the user's utterance into sentence words.

その後、文章解析部２２０は、抽出単語から話題を推定する（Ｓ１０４）。具体的には、文章解析部２２０は、単語カテゴリデータベース２２１を参照し、Ｓ１０３で抽出された抽出単語に対応する話題を、当該会話の話題として推定する。 After that, the text analysis unit 220 estimates a topic from the extracted words (S104). Specifically, the text analysis unit 220 refers to the word category database 221 and estimates the topic corresponding to the extracted word extracted in S103 as the topic of the conversation.

その後、単語解析部２３０は、カテゴリ判別可能であるか否かを確認する（Ｓ１０５）。具体的には、単語解析部２３０は、まず、意図推定データベース２３１によりカテゴリを推定するための単語を抽出する。 After that, the word analysis unit 230 confirms whether category discrimination is possible (S105). Specifically, the word analysis unit 230 first extracts words for estimating the category from the intention estimation database 231 .

次いで、単語解析部２３０は、カテゴリに対応する抽出単語が意図推定データベース２３１にあるか否かを確認する。抽出単語が意図推定データベース２３１にない場合、単語解析部２３０は、カテゴリ判別不可能であると判定する。一方、抽出単語が意図推定データベース２３１にある場合、単語解析部２３０は、カテゴリ判別可能であると判定する。カテゴリ判別不可能な場合（Ｓ１０５：Ｎｏ）、単語解析部２３０は、処理を中止する（Ｓ１１８）。その後、図９の流れは終了する。 Next, the word analysis unit 230 checks whether or not there is an extracted word corresponding to the category in the intention estimation database 231 . If the extracted word is not in the intention estimation database 231, the word analysis unit 230 determines that the category cannot be determined. On the other hand, when the extracted word exists in the intention estimation database 231, the word analysis unit 230 determines that the category can be determined. If the category cannot be determined (S105: No), the word analysis unit 230 stops processing (S118). The flow of FIG. 9 then ends.

一方、カテゴリ判別可能な場合（Ｓ１０５：Ｙｅｓ）、単語解析部２３０は、会話のカテゴリを推定する（Ｓ１０６）。具体的には、単語解析部２３０は、意図推定データベース２３１を参照し、ユーザ発話を文章単語分割処理することにより分割された複数の単語の中に含まれる、複数の抽出単語の組に対応するカテゴリを当該会話のカテゴリとして推定する。 On the other hand, if the category can be determined (S105: Yes), the word analysis unit 230 estimates the category of the conversation (S106). Specifically, the word analysis unit 230 refers to the intention estimation database 231, and corresponds to a set of a plurality of extracted words included in a plurality of words divided by dividing the user's utterance into sentence words. Infer the category as the category of the conversation.

その後、単語解析部２３０は、文章単語分割処理により分割された複数の単語の中に推定された話題における重要単語があるか否かを判定する（Ｓ１０７）。具体的には、単語解析部２３０は、重要単語データベース２３２を参照し、ユーザ発話を文章単語分割処理することにより分割された複数の単語の中に、Ｓ１０４で推定した話題に対応する重要単語があるか否かを判定する。推定された話題における重要単語がない場合（Ｓ１０７：Ｎｏ）、図９の流れはＳ１１８の処理へ進む。 After that, the word analysis unit 230 determines whether or not there is an important word in the estimated topic among the plurality of words divided by the sentence word division processing (S107). Specifically, the word analysis unit 230 refers to the important word database 232, and among the plurality of words obtained by dividing the user's utterance into sentence words, there is an important word corresponding to the topic estimated in S104. Determine whether or not there is If there is no important word in the estimated topic (S107: No), the flow of FIG. 9 advances to the process of S118.

一方、推定された話題における重要単語がある場合（Ｓ１０７：Ｙｅｓ）、単語解析部２３０は、ユーザ発話に含まれる重要単語を特定する（Ｓ１０８）。具体的には、単語解析部２３０は、重要単語データベース２３２を参照し、推定された話題に対応する重要単語であって、かつ、文章単語分割処理により分割された複数の単語の中に含まれる単語をユーザ発話に含まれる重要単語として特定する。 On the other hand, if there is an important word in the estimated topic (S107: Yes), the word analysis unit 230 identifies the important word included in the user's utterance (S108). Specifically, the word analysis unit 230 refers to the important word database 232 and identifies important words corresponding to the estimated topic and included in the plurality of words divided by the sentence word division processing. Identifies words as key words in user utterances.

その後、単語解析部２３０は、特定された重要単語から繰り返し単語を特定する（Ｓ１０９）。具体的には、１つの重要単語が特定された場合、単語解析部２３０は、当該重要単語を繰り返し単語として特定する。一方、複数の重要単語が特定された場合、単語解析部２３０は、話題毎に予め定めた重要単語の優先順位に従い、繰り返し単語を特定する。 After that, the word analysis unit 230 identifies repeated words from the identified important words (S109). Specifically, when one important word is identified, word analysis section 230 identifies the important word as a repeated word. On the other hand, when a plurality of important words are identified, the word analysis section 230 identifies repeated words according to the priority of the important words predetermined for each topic.

なお、重要単語の優先順位は、例えば、これまでに行われた会話において当該単語が登場した回数等に基づいて定められる。また、単語解析部２３０は、複数の繰り返し単語を特定してもよい。 The order of priority of important words is determined, for example, based on the number of times the word has appeared in conversations that have occurred so far. Also, the word analysis unit 230 may identify a plurality of repeated words.

その後、単語解析部２３０は、推定された話題及び繰り返し単語に基づき、会話推定データベース２３３から共感単語を特定する（Ｓ１１０）。具体的には、単語解析部２３０は、会話推定データベース２３３を参照し、推定された話題及び繰り返し単語の組に対応する共感単語を当該会話における共感単語として特定する。 After that, the word analysis unit 230 identifies sympathetic words from the conversation estimation database 233 based on the estimated topic and repeated words (S110). Specifically, the word analysis unit 230 refers to the conversation estimation database 233, and identifies sympathy words corresponding to the estimated topic and set of repeated words as sympathy words in the conversation.

その後、管理部２５０は、ユーザ発話を入力したユーザの前会話情報を取得する（Ｓ１１１）。具体的には、管理部２５０は、後述するストレージ７３０等に記憶されたユーザの前会話情報を取得する。なお、ここでは、前会話情報には、チャットボットと対象ユーザとが以前に行った会話に関する情報だけでなく、対象ユーザのＷｅｂページの閲覧履歴等の情報も含まれるものとする。 After that, the management unit 250 acquires the previous conversation information of the user who has input the user's utterance (S111). Specifically, the management unit 250 acquires the user's previous conversation information stored in the storage 730 or the like, which will be described later. Here, the previous conversation information includes not only information related to the previous conversation between the chatbot and the target user, but also information such as the browsing history of the target user's Web pages.

その後、文章解析部２２０は、取得された前会話情報に対する、文章抽出処理、文章認識処理及び文章単語分割処理を行う（Ｓ１１２）。 After that, the text analysis unit 220 performs text extraction processing, text recognition processing, and text word segmentation processing on the acquired previous conversation information (S112).

その後、文章解析部２２０は、Ｓ１０３と同様の処理で、単語カテゴリデータベース２２１により話題を推定するための単語を抽出する（Ｓ１１３）。 After that, the text analysis unit 220 extracts words for estimating the topic from the word category database 221 in the same process as S103 (S113).

その後、文章解析部２２０は、Ｓ１０４と同様の処理で、抽出単語から話題を推定する（Ｓ１１４）。 After that, the text analysis unit 220 estimates a topic from the extracted words in the same process as S104 (S114).

その後、単語解析部２３０は、Ｓ１０５と同様の処理で、カテゴリ判別可能であるか否かを確認する（Ｓ１１５）。カテゴリ判別不可能な場合（Ｓ１１５：Ｎｏ）、図９の流れはＳ１１１の処理へ戻り、管理部２５０は、別の前会話情報を取得する。なお、所定回数Ｓ１１１乃至Ｓ１１５の処理を繰り返しても、カテゴリ判別不可能な場合、Ｓ１１８の処理へ進み、図９の流れを終了させてもよい。 After that, the word analysis unit 230 confirms whether or not the category can be determined by the same process as in S105 (S115). If the category cannot be determined (S115: No), the flow of FIG. 9 returns to the process of S111, and the management unit 250 acquires other previous conversation information. If the category cannot be determined even after repeating the processes of S111 to S115 a predetermined number of times, the process of S118 may be performed and the flow of FIG. 9 may be ended.

一方、カテゴリ判別可能な場合（Ｓ１１５：Ｙｅｓ）、単語解析部２３０は、Ｓ１０６と同様の処理で、前会話情報のカテゴリを推定する（Ｓ１１６）。 On the other hand, if the category can be determined (S115: Yes), the word analysis unit 230 estimates the category of the previous conversation information by the same process as in S106 (S116).

その後、単語解析部２３０は、今回のユーザ発話のカテゴリと前会話情報のカテゴリとが同一のカテゴリであるか否かを判定する（Ｓ１１７）。カテゴリが同一でない場合（Ｓ１１７：Ｎｏ）、図９の流れはＳ１１１の処理へ戻り、管理部２５０は、別の前会話情報を取得する。 After that, the word analysis unit 230 determines whether or not the category of the current user utterance and the category of the previous conversation information are the same category (S117). If the categories are not the same (S117: No), the flow in FIG. 9 returns to the process of S111, and the management unit 250 acquires another previous conversation information.

一方、カテゴリが同一である場合（Ｓ１１７：Ｙｅｓ）、単語解析部２３０は、抽出単語の組から、推定された、今回のユーザ発話と前会話情報との共通のカテゴリ（以下、共通カテゴリともいう）に対応する興味単語を特定する（Ｓ１１９）。 On the other hand, if the categories are the same (S117: Yes), the word analysis unit 230 extracts the common category between the current user utterance and the previous conversation information (hereinafter also referred to as the common category) estimated from the set of extracted words. ) is specified (S119).

具体的には、まず、単語解析部２３０は、興味単語データベース２３４を参照し、前会話情報を文単語分割処理することにより分割された複数の単語の中から、Ｓ１１７で推定された、共通カテゴリに対応する抽出単語を抽出する。また、単語解析部２３０は、興味単語データベース２３４を参照し、共通カテゴリと、前会話情報の抽出単語との組に対応する興味単語を特定する。 Specifically, first, the word analysis unit 230 refers to the interesting word database 234, and selects the common category estimated in S117 from among the plurality of words divided by the sentence word division processing of the previous conversation information. Extract the extracted word corresponding to . Word analysis unit 230 also refers to interesting word database 234 to identify an interesting word corresponding to a set of a common category and an extracted word from previous conversation information.

その後、単語解析部２３０は、前会話情報に、Ｓ１１９で特定された興味単語が含まれるか否かを確認する（Ｓ１２０）。具体的には、単語解析部２３０は、前会話情報を文単語分割処理することにより分割された複数の単語の中に、Ｓ１１９で特定された興味単語が存在するか否かを確認する。 After that, the word analysis unit 230 checks whether or not the previous conversation information includes the interesting word specified in S119 (S120). Specifically, word analysis unit 230 confirms whether or not the interesting word specified in S119 exists in the plurality of words divided by the sentence word division processing of the previous conversation information.

前会話情報に興味単語が含まれない場合（Ｓ１２０：Ｎｏ）、単語解析部２３０は、興味単語データベース２３４に基づいて、応答文章の生成に用いる興味単語を特定する（Ｓ１２１）。 If no interesting word is included in the previous conversation information (S120: No), the word analysis unit 230 identifies an interesting word to be used for generating a response sentence based on the interesting word database 234 (S121).

具体的には、Ｓ１１９で１つの興味単語が特定された場合、単語解析部２３０は、当該興味単語を応答文章の生成に用いる興味単語として特定する。一方、Ｓ１１９で複数の興味単語が特定された場合、単語解析部２３０は、カテゴリと抽出単語との組毎に、予め定められた興味単語の優先順位に従い、Ｓ１１９で特定された興味単語の中から、応答文章の生成に用いる興味単語を特定する。 Specifically, when one interesting word is specified in S119, the word analysis unit 230 specifies the interesting word as an interesting word to be used for generating a response sentence. On the other hand, if a plurality of interesting words are specified in S119, the word analysis unit 230 selects the interesting words among the interesting words specified in S119 according to the predetermined priority of the interesting words for each set of the category and the extracted word. , the interesting word used to generate the response sentence is specified.

なお、興味単語の優先順位は、例えば、これまでに行われた、複数のユーザの会話の履歴において当該単語が登場した回数等に基づいて定められる。また、単語解析部２３０は、複数の単語を応答文章の生成に用いる興味単語として特定してもよい。 Note that the order of priority of interesting words is determined, for example, based on the number of times the word has appeared in the conversation history of a plurality of users. Moreover, the word analysis unit 230 may specify a plurality of words as interesting words to be used for generating a response sentence.

一方、前会話情報に興味単語が含まれる場合（Ｓ１２０：Ｙｅｓ）、単語解析部２３０は、前会話情報に基づいて、応答文章の生成に用いる１の興味単語を特定する（Ｓ１２２）。 On the other hand, if the previous conversation information includes an interesting word (S120: Yes), the word analysis unit 230 identifies one interesting word to be used for generating a response sentence based on the previous conversation information (S122).

具体的には、Ｓ１１９で１つの興味単語が特定され、かつ、当該興味単語が前会話情報に含まれる場合、単語解析部２３０は、当該興味単語を応答文章の生成に用いる興味単語として特定する。一方、Ｓ１１９で複数の興味単語が特定され、かつ、当該複数の興味単語のうちの１つが前会話情報に含まれる場合、単語解析部２３０は、前会話情報に含まれる１つの興味単語を応答文章の生成に用いる興味単語として特定する。 Specifically, when one interesting word is specified in S119 and the interesting word is included in the previous conversation information, the word analysis unit 230 specifies the interesting word as an interesting word to be used for generating a response sentence. . On the other hand, when a plurality of interesting words are specified in S119 and one of the plurality of interesting words is included in the previous conversation information, word analysis section 230 responds with one interesting word included in the previous conversation information. It is specified as an interesting word used for sentence generation.

また、Ｓ１１９で複数の興味単語が特定され、かつ、当該複数の興味単語のうちの複数が前会話情報に含まれる場合、単語解析部２３０は、例えば、当該複数の興味単語のうち、対象ユーザの会話中に登場する回数が最も多い興味単語を応答文章の生成に用いる興味単語として特定する。 In addition, when a plurality of interesting words are specified in S119 and more than one of the plurality of interesting words is included in the previous conversation information, the word analysis unit 230 selects, for example, the target user The interesting word that appears most frequently in the conversation is specified as the interesting word to be used for generating the response sentence.

その後、応答生成部２４０は、推定された話題、カテゴリ及び特定された興味単語に基づき連携単語データベース２４１から連携単語を特定する（Ｓ１２３）。具体的には、応答生成部２４０は、連携単語データベース２４１を参照し、推定された話題と、推定されたカテゴリと、特定された興味単語との組に対応する連携単語を特定する。 After that, the response generator 240 identifies linked words from the linked word database 241 based on the estimated topic, category, and specified interesting words (S123). Specifically, the response generation unit 240 refers to the linked word database 241 and identifies linked words corresponding to a set of the estimated topic, the estimated category, and the specified interesting word.

その後、応答生成部２４０は、推定された話題、カテゴリ及び特定された興味単語に基づき定型文データベース２４２から定型文を特定する（Ｓ１２４）。具体的には、応答生成部２４０は、定型文データベース２４２を参照し、推定された話題と、推定されたカテゴリと、特定された興味単語との組に対応する定型文を特定する。 After that, the response generation unit 240 identifies fixed phrases from the fixed phrase database 242 based on the estimated topic, category, and specified interesting words (S124). Specifically, response generation unit 240 refers to fixed phrase database 242 to identify fixed phrases corresponding to a set of the estimated topic, the estimated category, and the specified interesting word.

その後、応答生成部２４０は、特定された繰り返し単語、共感単語、興味単語、連携単語、及び定型文に基づき応答文章を生成する（Ｓ１２５）。具体的には、応答生成部２４０は、Ｓ１２４で特定された定型文に、Ｓ１０９で特定された繰り返し単語、Ｓ１１０で特定された共感単語、Ｓ１２１又はＳ１２２で特定された興味単語、Ｓ１２３で特定された連携単語をあてはめ、応答文章を生成する。 After that, the response generation unit 240 generates a response sentence based on the identified repetitive words, sympathetic words, interesting words, linked words, and standard sentences (S125). Specifically, the response generating unit 240 adds the repetitive word identified in S109, the sympathetic word identified in S110, the interesting word identified in S121 or S122, and the Then, a response sentence is generated by applying the linked words.

その後、応答生成部２４０は、通信部２１０により、端末装置１００に応答文章を出力する（Ｓ１２６）。ここで、端末装置１００の出力部１３０は、通信部１１０によりサーバ装置２００からの応答文章を受信したとき、ユーザに当該応答文章を音声や画像で通知する。 After that, the response generation unit 240 outputs the response text to the terminal device 100 through the communication unit 210 (S126). Here, when the communication unit 110 receives the response text from the server device 200, the output unit 130 of the terminal device 100 notifies the user of the response text by voice or image.

その後、管理部２５０は、所定時間内に次のユーザ発話が入力されたか否かを確認する（Ｓ１２７）。所定時間内に次のユーザ発話が入力されなかった場合（Ｓ１２７：Ｎｏ）、応答生成部２４０は、ユーザの発話を促す文章を端末装置１００に出力する再発話処理を行う（Ｓ１３５）。例えば、応答生成部２４０は、「よくわからないから、もう少し詳しくおしえて」等の文章を出力する。その後、図９の流れはＳ１２７の処理へ戻る。 After that, the management unit 250 confirms whether or not the next user's utterance is input within a predetermined time (S127). If the next user's utterance is not input within the predetermined time (S127: No), the response generation unit 240 performs re-speech processing of outputting a sentence prompting the user to utter to the terminal device 100 (S135). For example, the response generator 240 outputs sentences such as "I don't understand, please tell me more about it." After that, the flow of FIG. 9 returns to the processing of S127.

一方、所定時間内に次のユーザ発話が入力された場合（Ｓ１２７：Ｙｅｓ）、管理部２５０は、Ｓ１０１と同様の処理で、ユーザ発話を示す音声データを取得する（Ｓ１２８）。 On the other hand, if the next user's utterance is input within the predetermined time (S127: Yes), the management unit 250 acquires voice data representing the user's utterance in the same process as S101 (S128).

その後、文章解析部２２０は、Ｓ１０２と同様の処理で、音声認識処理及び文章単語分割処理を行う（Ｓ１２９）。 After that, the text analysis unit 220 performs voice recognition processing and text word segmentation processing in the same manner as in S102 (S129).

その後、文章解析部２２０は、Ｓ１０３と同様の処理で、単語カテゴリデータベース２２１により話題を推定するための単語を抽出する（Ｓ１３０）。 After that, the text analysis unit 220 extracts words for estimating the topic from the word category database 221 in the same process as S103 (S130).

その後、文章解析部２２０は、Ｓ１０４と同様の処理で、抽出単語から話題を推定する（Ｓ１３１）。 After that, the text analysis unit 220 estimates a topic from the extracted words in the same process as S104 (S131).

その後、単語解析部２３０は、Ｓ１０５と同様の処理で、カテゴリ判別可能であるか否かを確認する（Ｓ１３２）。カテゴリ判別不可能な場合（Ｓ１３２：Ｎｏ）、図９の流れはＳ１３５の処理へ進む。 After that, the word analysis unit 230 confirms whether or not the category can be determined by the same process as in S105 (S132). If the category cannot be determined (S132: No), the flow of FIG. 9 advances to the process of S135.

一方、カテゴリ判別可能な場合（Ｓ１３２：Ｙｅｓ）、単語解析部２３０は、Ｓ１０６と同様の処理で、Ｓ１２８で取得されたユーザ発話のカテゴリを推定する（Ｓ１３３）。 On the other hand, if the category can be determined (S132: Yes), the word analysis unit 230 estimates the category of the user's utterance acquired in S128 (S133) in the same process as in S106.

その後、単語解析部２３０は、今回のユーザ発話のカテゴリと前回のユーザ発話のカテゴリとが同一のカテゴリであるか否かを判定する（Ｓ１３４）。カテゴリが同一でない場合（Ｓ１３４：Ｎｏ）、図９の流れはＳ１３５の処理へ進む。なお、所定回数Ｓ１２７乃至Ｓ１３４の処理を繰り返しても、カテゴリ判別不可能な場合、Ｓ１１８の処理へ進み、図９の流れを終了させてもよい。 After that, the word analysis unit 230 determines whether or not the category of the current user utterance and the category of the previous user utterance are the same category (S134). If the categories are not the same (S134: No), the flow of FIG. 9 proceeds to the process of S135. If the category cannot be determined even after repeating the processes of S127 to S134 a predetermined number of times, the process of S118 may be performed and the flow of FIG. 9 may be ended.

一方、カテゴリが同一である場合（Ｓ１３４：Ｎｏ）、図９の流れはＳ１０７の処理へ戻り、Ｓ１０７～Ｓ１３５の処理を繰り返す。 On the other hand, if the categories are the same (S134: No), the flow of FIG. 9 returns to the process of S107, and the processes of S107 to S135 are repeated.

図１０は、実施形態に係る情報処理により実現される会話の一例を模式的に示す図である。図１０の会話は、例えば端末装置１００の出力部１３０によりユーザに提示されるものとする。 FIG. 10 is a diagram schematically illustrating an example of conversation realized by information processing according to the embodiment; It is assumed that the conversation in FIG. 10 is presented to the user by the output unit 130 of the terminal device 100, for example.

以下、図１０に示すように、Ｓ１０１の処理において、管理部２５０が「今度、ハワイへ旅行に行くよ。」とのユーザ発話が取得された場合を例として、実施形態に係る情報処理をより詳細に説明する。 Hereinafter, as shown in FIG. 10, the information processing according to the embodiment will be described in more detail by taking as an example a case where the management unit 250 acquires a user utterance "I'm going on a trip to Hawaii next time" in the process of S101. I will explain in detail.

Ｓ１０２の処理において、文章解析部２２０は、ユーザ発話を示す音声データに対する音声認識処理により、「今度、ハワイへ旅行に行くよ。」とのユーザ発話の内容を示すテキストデータを取得する。また、文章解析部２２０は、文章単語分割処理により、テキストデータの示す文章を単語単位、すなわち「今度」、「ハワイ」、「へ」、「旅行」、「に」、「行く」及び「よ」に分割する。 In the process of S102, the text analysis unit 220 acquires text data indicating the content of the user's utterance, "I'm going on a trip to Hawaii next time," by speech recognition processing for voice data indicating the user's utterance. In addition, the sentence analysis unit 220 performs sentence word segmentation processing to divide sentences indicated by the text data into word units, that is, “next time”, “Hawaii”, “to”, “trip”, “to”, “go” and “yo”. ”.

Ｓ１０３の処理において、文章解析部２２０は、分割された複数の単語のうち、例えば、図２に示す単語カテゴリデータベース２２１に登録されている「今度」、「ハワイ」、「旅行」及び「行く」を、話題を推定するための抽出単語として抽出する。 In the process of S103, the text analysis unit 220 extracts, among the plurality of divided words, for example, "next time", "Hawaii", "travel" and "go" registered in the word category database 221 shown in FIG. is extracted as an extraction word for estimating the topic.

Ｓ１０４の処理において、文章解析部２２０は、図２に示す単語カテゴリデータベース２２１を参照し、「今度」、「ハワイ」、「旅行」及び「行く」に対応するＮｏ．０００３の「旅行」をユーザ発話の話題として推定する。 In the processing of S104, the sentence analysis unit 220 refers to the word category database 221 shown in FIG. 0003 "travel" is estimated as the topic of the user's utterance.

なお、文章解析部２２０が話題を推定する際、話題に対応する抽出単語として単語カテゴリデータベース２２１に登録されている抽出単語の全てがユーザ発話に含まれている必要はない。上記の例では、文章解析部２２０は、抽出単語が最も多く含まれる、Ｎｏ．０００３の「旅行」をユーザ発話の話題として推定している。 When the text analysis unit 220 estimates a topic, it is not necessary that all of the extracted words registered in the word category database 221 as extracted words corresponding to the topic are included in the user's utterance. In the above example, the text analysis unit 220 selects No. 1, which contains the most extracted words. 0003 "travel" is presumed to be the topic of the user's utterance.

Ｓ１０５の処理において、単語解析部２３０は、分割された複数の単語のうち、例えば、図３に示す意図推定データベース２３１に登録されている「今度」、「旅行」及び「行く」を、カテゴリを推定するための抽出単語として抽出する。単語解析部２３０は、カテゴリを推定するための抽出単語を抽出できたことから、カテゴリ判別可能であると判定する。 In the process of S105, the word analysis unit 230 classifies, for example, "this time", "travel", and "go" registered in the intention estimation database 231 shown in FIG. Extract as extracted words for inference. The word analysis unit 230 determines that the category can be determined because the extracted word for estimating the category has been successfully extracted.

Ｓ１０６の処理において、単語解析部２３０は、図３に示す意図推定データベース２３１を参照し、「今度」、「旅行」及び「行く」に対応するＮｏ．０００４の「雑談」をユーザ発話のカテゴリとして推定する。 In the process of S106, the word analysis unit 230 refers to the intention estimation database 231 shown in FIG. 0004 "chat" is estimated as the category of the user's utterance.

なお、単語解析部２３０が話題を推定する際、カテゴリに対応する抽出単語として単語カテゴリデータベース２２１に登録されている抽出単語の全てがユーザ発話に含まれている必要はない。上記の例では、単語解析部２３０は、抽出単語が最も多く含まれる、Ｎｏ．０００４の「雑談」をユーザ発話のカテゴリとして推定している。 Note that when the word analysis unit 230 estimates a topic, it is not necessary that all of the extracted words registered in the word category database 221 as extracted words corresponding to the category are included in the user's utterance. In the above example, the word analysis unit 230 selects No. 1, which contains the most extracted words. 0004 "chat" is presumed to be the category of the user's utterance.

Ｓ１０７の処理において、単語解析部２３０は、分割された複数の単語のうち、例えば、図４に示す重要単語データベース２３２に登録されている、Ｎｏ．０００３の「旅行」に対応する「旅行」及び「行く」を重要単語として抽出する。単語解析部２３０は、Ｎｏ．０００３の「旅行」に対応する重要単語を抽出できたことから、推定された話題における重要単語があると判定する。 In the processing of S107, the word analysis unit 230 extracts, for example, No. 1 registered in the important word database 232 shown in FIG. "Tour" and "Go" corresponding to "Tour" of 0003 are extracted as important words. The word analysis unit 230 uses No. Since the important word corresponding to 0003 "travel" has been extracted, it is determined that there is an important word in the estimated topic.

Ｓ１０８の処理において、単語解析部２３０は、「旅行」及び「行く」をユーザ発話における重要単語として特定する。重要単語として特定された「旅行」及び「行く」は、応答文章の生成に用いる繰り返し単語の候補となる。 In the process of S108, the word analysis unit 230 identifies "travel" and "go" as important words in the user's utterance. "Travel" and "go" specified as important words are candidates for repetitive words used to generate response sentences.

Ｓ１０９の処理において、単語解析部２３０は、「旅行」及び「行く」を応答文章の生成に用いる繰り返し単語として特定する。なお、ここでは、単語解析部２３０は、２つの単語を繰り返し単語として特定するものとするが、これに限定されるものではない。例えば、単語解析部２３０は、１つの単語を繰り返し単語として特定してもよいし、３つの単語を繰り返し単語として特定してもよい。 In the process of S109, the word analysis unit 230 identifies "trip" and "go" as repeated words used to generate a response sentence. Here, it is assumed that the word analysis unit 230 identifies two words as repeated words, but is not limited to this. For example, the word analysis unit 230 may identify one word as the repeated word, or may identify three words as the repeated word.

なお、繰り返し単語の候補となる重要単語が、予め定めた繰り返し単語の数を超える場合、単語解析部２３０は、予め定めた優先順位に従い、繰り返し単語を特定する。例えば、「旅行」及び「行く」がユーザ発話における重要単語として特定され、１つの単語を繰り返し単語として特定する場合、単語解析部２３０は、「旅行」及び「行く」のうち、優先順位が高い単語を繰り返し単語として特定する。 Note that when the number of important words that are candidates for repeated words exceeds a predetermined number of repeated words, word analysis section 230 identifies the repeated words according to a predetermined priority order. For example, when "travel" and "go" are specified as important words in the user's utterance and one word is specified as a repeated word, the word analysis unit 230 gives higher priority to "travel" and "go". Identifies a word as a repeated word.

なお、優先順位は、重要単語データベース２３２における重要単語の表示順等で定められていてもよい。例えば、図３に示す重要単語データベース２３２では、Ｎｏ．０００３の「旅行」に対応する重要単語の優先順位は、「旅行」、「行く」、「写真」、「動画」、「撮影」、「予約」及び「[場所]」の順としてもよい。したがって、上記の例で、１つの単語を繰り返し単語として特定する場合、単語解析部２３０は、「旅行」を、応答文章の生成に用いる繰り返し単語として特定する。 The order of priority may be determined by the display order of important words in the important word database 232, or the like. For example, in the important word database 232 shown in FIG. The priority order of the important words corresponding to "trip" in 0003 may be "trip", "go", "picture", "video", "shoot", "reservation" and "[location]". Therefore, in the above example, when identifying one word as a repeated word, the word analysis unit 230 identifies "travel" as a repeated word used to generate the response sentence.

Ｓ１１０の処理において、単語解析部２３０は、図５に示す会話推定データベース２３３を参照し、話題「旅行」と繰り返し単語「「旅行」及び「行く」」との組に対応するＮｏ．０００６の「いいなぁ」を共感単語として特定する。 In the processing of S110, the word analysis unit 230 refers to the conversation estimation database 233 shown in FIG. 0006 "nice" is specified as a sympathy word.

ここで、Ｓ１１１の処理において、管理部２５０は、前会話情報として、ユーザがチャットボットに話しかけた「このワンピースは似合うかな、買うかどうかは決まってないけど」という過去のユーザ発話を含む情報を取得したものとする。 Here, in the processing of S111, the management unit 250 collects, as previous conversation information, information including past user utterances such as "Does this dress look good on you? I haven't decided whether to buy it or not," spoken to the chatbot by the user. assumed to have been obtained.

この場合、Ｓ１１２の処理において、文章解析部２２０は、文章抽出処理により、「このワンピースは似合うかな、買うかどうかは決まってないけど」との過去のユーザ発話の内容を示すテキストデータを取得する。また、文章解析部２２０は、文章単語分割処理により、「この」、「ワンピース」、「は」、「似合う」、「かな」、「買う」、「か」、「どう」、「か」、「は」、「決まっ」、「て」、「ない」及び「けど」に分割する。 In this case, in the process of S112, the text analysis unit 220 obtains text data indicating the content of the past user utterance, "Does this dress look good on you? I haven't decided whether to buy it or not." . In addition, the text analysis unit 220 performs sentence word segmentation processing to divide words such as “kono”, “one piece”, “ha”, “suit”, “kana”, “buy”, “ka”, “do”, “ka”, It is divided into "wa", "determined", "te", "not" and "but".

Ｓ１１３の処理において、文章解析部２２０は、分割された複数の単語のうち、例えば、図２に示す単語カテゴリデータベース２２１に登録されている「［服アイテム］(ワンピース)」、「似合う」及び「買う」を、話題を推定するための抽出単語として抽出する。 In the processing of S113, the sentence analysis unit 220 selects, among the plurality of divided words, for example, "[Clothing item] (dress)", "suitable" and " "Buy" is extracted as an extraction word for estimating the topic.

Ｓ１１４の処理において、文章解析部２２０は、図２に示す単語カテゴリデータベース２２１を参照し、「［服アイテム］(ワンピース)」、「似合う」及び「買う」に対応するＮｏ．０００１の「ファッション」をユーザ発話の話題として推定する。 In the processing of S114, the text analysis unit 220 refers to the word category database 221 shown in FIG. 0001 "fashion" is estimated as the topic of the user's utterance.

Ｓ１１５の処理において、単語解析部２３０は、分割された複数の単語のうち、例えば、図３に示す意図推定データベース２３１に登録されている「買う」及び「決まる(決まっ)」を、カテゴリを推定するための抽出単語として抽出する。単語解析部２３０は、カテゴリを推定するための抽出単語を抽出できたことから、カテゴリ判別可能であると判定する。 In the process of S115, the word analysis unit 230 estimates categories of, for example, "buy" and "determined (determined)" registered in the intention estimation database 231 shown in FIG. Extract as an extraction word for The word analysis unit 230 determines that the category can be determined because the extracted word for estimating the category has been successfully extracted.

Ｓ１１６の処理において、単語解析部２３０は、図３に示す意図推定データベース２３１を参照し、「買う」及び「決まる(決まっ)」に対応するＮｏ．０００４の「雑談」をユーザ発話のカテゴリとして推定する。 In the processing of S116, the word analysis unit 230 refers to the intention estimation database 231 shown in FIG. 0004 "chat" is estimated as the category of the user's utterance.

Ｓ１１７の処理において、単語解析部２３０は、ユーザ発話のカテゴリと、前会話情報のカテゴリとが、両方ともＮｏ．０００４の「雑談」であるため、ユーザ発話のカテゴリと前会話情報のカテゴリとが同一であると判定する。 In the processing of S117, word analysis section 230 determines that both the category of user utterance and the category of previous conversation information are No. 0004 "chat", it is determined that the category of the user's utterance and the category of the previous conversation information are the same.

Ｓ１１９の処理において、単語解析部２３０は、興味単語データベース２３４を参照し、ユーザ発話と前会話情報との共通カテゴリである「雑談」と、前会話情報の抽出単語である「［服アイテム］」及び「似合う」との組に対応する「［服アイテム］」、「思う」、「センス」及び「おしゃれ」を共通カテゴリに対応する興味単語として特定する。 In the process of S119, the word analysis unit 230 refers to the interesting word database 234 to determine whether the common category of the user utterance and the previous conversation information is "chat" and the extracted word of the previous conversation information is "[Clothing item]". and "looks good on you" are specified as interesting words corresponding to the common category.

Ｓ１２０の処理において、単語解析部２３０は、前会話情報の中に、「ワンピース（［服アイテム］）」が存在することから、前会話情報にＳ１１９で特定された興味単語があると判定する。 In the process of S120, the word analysis unit 230 determines that the previous conversation information includes the interesting word specified in S119 because "one piece ([clothing item])" is present in the previous conversation information.

Ｓ１２２において、単語解析部２３０は、前会話情報の中に存在する「ワンピース」を応答文章の生成に用いる興味単語として特定する。なお、前会話情報の中に、複数の興味単語が含まれる場合、単語解析部２３０は、例えば、過去のユーザ発話において登場回数が最も多い興味単語を、応答文章の生成に用いる興味単語として特定する。 In S122, word analysis unit 230 identifies "one piece" in the previous conversation information as an interesting word to be used for generating a response sentence. Note that if the previous conversation information includes a plurality of interesting words, the word analysis unit 230 identifies, for example, the interesting word that has appeared most frequently in past user utterances as the interesting word to be used for generating the response sentence. do.

また、前会話情報の中に、共通カテゴリに対応する興味単語が存在しない場合、単語解析部２３０は、予め定めた優先順位に従い、Ｓ１１９で特定された共通カテゴリに対応する興味単語の中から、応答文章の生成に用いる興味単語を特定する。 Further, when there is no interesting word corresponding to the common category in the previous conversation information, the word analysis unit 230 follows the predetermined priority order and selects the following from the interesting words corresponding to the common category identified in S119: Identify interesting words used to generate response sentences.

例えば、上記の例で、仮に前会話情報に、「［服アイテム］」、「思う」、「センス」及び「おしゃれ」の何れも存在せず、優先順位が「［服アイテム］」、「思う」、「センス」及び「おしゃれ」の順と定められていた場合、単語解析部２３０は、最も優先順位が高い「［服アイテム］」を、応答文章の生成に用いる興味単語として特定する。 For example, in the above example, if none of "[clothing item]", "thinking", "sense" and "fashionable" exist in the previous conversation information, and the priority is "[clothing item]", "thinking , “sense” and “fashionable”, the word analysis unit 230 identifies “[clothing item]”, which has the highest priority, as an interesting word to be used for generating a response sentence.

Ｓ１２３において、応答生成部２４０は、図７に示す連携単語データベース２４１を参照し、カテゴリ「雑談」、話題「ファッション」及び興味単語「［服アイテム］」の組に対応するＮｏ．０００１の「着る」を応答文章の生成に用いる連携単語として特定する。 In S123, the response generation unit 240 refers to the linked word database 241 shown in FIG. 7, and selects the No. corresponding to the set of the category "chat", the topic "fashion", and the interesting word "[Clothing item]". "wear" in 0001 is specified as a linked word to be used for generating a response sentence.

Ｓ１２４の処理において、応答生成部２４０は、図８に示す定型文データベース２４２を参照し、カテゴリ「雑談」、話題「ファッション」及び興味単語「［服アイテム］」の組に対応するＮｏ．０００１の「[繰り返し単語]に[繰り返し単語]んだ、[共感単語]、[興味単語]を[連携単語]のですか」を応答文章の生成に用いる定型文として特定する。 In the process of S124, the response generation unit 240 refers to the standard sentence database 242 shown in FIG. 0001 "Are [repeated words] and [repeated words], [sympathetic words] and [interesting words] [linked words]?"

Ｓ１２５の処理において、応答生成部２４０は、特定された繰り返し単語「旅行」及び「行く」、共感単語「いいなぁ」、興味単語「ワンピース」、連携単語「着る」を、Ｎｏ．０００１の定型文に当て嵌め、「旅行に行くんだ、いいなぁ、ワンピース着るの？」という応答文章を生成する。 In the process of S125, the response generation unit 240 divides the specified repetitive words "trip" and "go", the sympathetic word "nice", the interesting word "one piece", and the linked word "wear" into No. 0001 is applied to generate a response sentence "I'm going on a trip, that's nice, are you going to wear a dress?"

Ｓ１２６の処理において、応答生成部２４０は、通信部２１０により、端末装置１００に応答文章「旅行に行くの、いいなぁ、ワンピース着るの？」を出力する。 In the process of S<b>126 , the response generation unit 240 outputs the response sentence “I am going on a trip.

Ｓ１２７及びＳ１２８の処理において、管理部２５０は、ユーザからの入力を確認し、「そうだよ。ワンピースを着るよ。寒いかな。」という応答文章に対するユーザ発話を取得する。 In the processes of S127 and S128, the management unit 250 confirms the input from the user, and acquires the user's utterance in response to the response text "Yes. I'm wearing a dress. Isn't it cold?"

Ｓ１２９の処理において、文章解析部２２０は、ユーザ発話を示す音声データに対する音声認識処理により、「そうだよ。ワンピースを着るよ。寒いかな。」とのユーザ発話の内容を示すテキストデータを取得する。また、文章解析部２２０は、文章単語分割処理により、テキストデータの示す文章を、「そうだよ」、「ワンピース」、「を」、「着る」、「よ」、「寒い」及び「かな」に分割する。 In the processing of S129, the text analysis unit 220 acquires text data indicating the content of the user's utterance, "Yes, I'm wearing a dress. It's cold." In addition, the text analysis unit 220 divides the text indicated by the text data into “that’s right”, “one piece”, “wo”, “wear”, “yo”, “cold”, and “kana” through text word segmentation processing. To divide.

Ｓ１３０の処理において、文章解析部２２０は、分割された複数の単語のうち、例えば、図２に示す単語カテゴリデータベース２２１に登録されている「ワンピース」を、話題を推定するための抽出単語として抽出する。 In the processing of S130, the text analysis unit 220 extracts, for example, "one piece" registered in the word category database 221 shown in FIG. do.

Ｓ１３１の処理において、文章解析部２２０は、図２に示す単語カテゴリデータベース２２１を参照し、「ワンピース」に対応するＮｏ．０００１の「ファッション」を、応答文章に対するユーザ発話の話題として推定する。 In the processing of S131, the text analysis unit 220 refers to the word category database 221 shown in FIG. 0001 "fashion" is estimated as the topic of the user's utterance for the response sentence.

Ｓ１３２の処理において、単語解析部２３０は、分割された複数の単語のうち、例えば、図３に示す意図推定データベース２３１に登録されている「着る」及び「寒い」を、カテゴリを推定するための抽出単語として抽出する。単語解析部２３０は、カテゴリを推定するための抽出単語を抽出できたことから、カテゴリ判別可能であると判定する。 In the processing of S132, the word analysis unit 230 selects, for example, “wear” and “cold” registered in the intention estimation database 231 shown in FIG. Extract as an extraction word. The word analysis unit 230 determines that the category can be determined because the extracted word for estimating the category has been successfully extracted.

Ｓ１３３の処理において、単語解析部２３０は、図３に示す意図推定データベース２３１を参照し、「着る」及び「寒い」に対応するＮｏ．０００４の「雑談」をユーザ発話のカテゴリとして推定する。 In the processing of S133, the word analysis unit 230 refers to the intention estimation database 231 shown in FIG. 0004 "chat" is estimated as the category of the user's utterance.

Ｓ１３４の処理において、単語解析部２３０は、前回のユーザ発話のカテゴリである「雑談」が今回のユーザ発話のカテゴリにおいても継続しているため、会話を継続すると判断する。 In the process of S134, the word analysis unit 230 determines to continue the conversation because the category of the previous user's utterance, "chat", continues in the category of the current user's utterance.

図１１は、実施形態に係る情報処理システム１を実現するコンピュータ７００のハードウェア構成の一例を模式的に示す図である。実施形態に係るサーバ装置２００は、図１１に示すコンピュータ７００として構成される。 FIG. 11 is a diagram schematically showing an example of the hardware configuration of a computer 700 that implements the information processing system 1 according to the embodiment. The server device 200 according to the embodiment is configured as a computer 700 shown in FIG. 11 .

図１１に示すように、コンピュータ７００は、プロセッサ７１０、メモリ７２０、ストレージ７３０、入出力インターフェース（Ｉ／Ｆ）７４０及び通信インターフェース（Ｉ／Ｆ）７５０を有する。これらのハードウェアは、バス７６０に接続されている。 As shown in FIG. 11, computer 700 has processor 710 , memory 720 , storage 730 , input/output interface (I/F) 740 and communication interface (I/F) 750 . These hardware are connected to bus 760 .

プロセッサ７１０は、例えばＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）として構成され、コンピュータ７００の各部の動作を統括的に制御する。 The processor 710 is configured as, for example, a CPU (Central Processing Unit) and controls the operation of each part of the computer 700 in an integrated manner.

メモリ７２０は、たとえばＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）及びＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）を含み、プロセッサ７１０により実行されるプログラムなどの各種のデータの揮発的又は不揮発的な記憶、及びプロセッサ７１０がプログラムを実行するための作業領域の提供などを実現する。 The memory 720 includes, for example, ROM (Read Only Memory) and RAM (Random Access Memory), volatile or non-volatile storage of various data such as programs executed by the processor 710, and the processor 710 executing programs. We will realize the provision of a work area for

ストレージ７３０は、例えばＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）又はＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）を含み、各種のデータを不揮発的に記憶する。 The storage 730 includes, for example, a HDD (Hard Disk Drive) or an SSD (Solid State Drive), and stores various data in a non-volatile manner.

入出力インターフェース７４０は、例えばキーボード及びマウスなどのような入力装置（不図示）からコンピュータ７００へのデータの入力と、例えばコンピュータ７００からディスプレイおよびスピーカなどのような出力装置（不図示）へのデータの出力とを制御する。 The input/output interface 740 inputs data from input devices (not shown) such as a keyboard and mouse to the computer 700 and outputs data from the computer 700 to output devices (not shown) such as a display and speakers. to control the output of

通信インターフェース７５０は、コンピュータ７００が他の装置と通信を実行することを可能にする。 Communications interface 750 allows computer 700 to communicate with other devices.

実施形態において、サーバ装置２００が有する図１に示される各構成は、プロセッサ７１０がメモリ７２０又はストレージ７３０などに記憶された情報処理プログラムを実行した結果として、ハードウェアとソフトウェアとの協働による機能モジュールとして実現される。ただし、実施形態では、図１に示される機能モジュール群のうち少なくとも一部が、専用のハードウェアのみによって実現されてもよい。 In the embodiment, each configuration shown in FIG. 1 of the server device 200 functions as a result of the processor 710 executing an information processing program stored in the memory 720 or the storage 730 through cooperation of hardware and software. Implemented as a module. However, in the embodiment, at least part of the functional module group shown in FIG. 1 may be realized only by dedicated hardware.

なお、上述した実施形態に係る情報処理プログラムは、必ずしもメモリ７２０又はストレージ７３０に予め記憶されている必要はない。たとえば、上述した実施形態に係る情報処理プログラムは、フレキシブルディスク（ＦＤ）のような各種の磁気ディスク、又はＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）のような各種の光ディスクなどといった、コンピュータで読み取り可能な記録媒体にインストール可能な形式または実行可能な形式で記録されたコンピュータプログラムプロダクトとして提供されてもよい。 Note that the information processing program according to the above-described embodiments does not necessarily have to be pre-stored in the memory 720 or the storage 730 . For example, the information processing program according to the above-described embodiments can be stored in a computer-readable recording medium such as various magnetic disks such as flexible disks (FD), or various optical disks such as DVDs (Digital Versatile Disks). It may be provided as a computer program product recorded in installable or executable form.

また、上述した実施形態に係る情報処理プログラムは、インターネットなどのネットワーク経由で提供又は配布されてもよい。すなわち、上述した実施形態に係る情報処理プログラムは、インターネットなどのネットワークに接続されたコンピュータ上に格納された状態で、ネットワーク経由でのダウンロードを受け付ける、といった形で提供されてもよい。 Also, the information processing program according to the above-described embodiments may be provided or distributed via a network such as the Internet. That is, the information processing program according to the above-described embodiments may be stored in a computer connected to a network such as the Internet, and may be provided in the form of receiving a download via the network.

なお、実施形態では、端末装置１００も、サーバ装置２００と同様に、図８に示すハードウェア構成を有してコンピュータ７００として構成され得る。したがって、実施形態の変形例として、サーバ装置２００の機能モジュールの少なくとも一部が端末装置１００に実装された構成も考えられる。 Note that, in the embodiment, the terminal device 100 can also be configured as a computer 700 having the hardware configuration shown in FIG. Therefore, as a modified example of the embodiment, a configuration in which at least some of the functional modules of the server device 200 are implemented in the terminal device 100 is also conceivable.

したがって、実施形態の変形例として、端末装置１００が本開示の「情報処理装置」に該当する例も考えられる。さらに、端末装置１００とサーバ装置２００との組み合わせとしての情報処理システム１が本開示の「情報処理装置」に該当する例も考えられる。 Therefore, as a modified example of the embodiment, an example in which the terminal device 100 corresponds to the "information processing device" of the present disclosure is also conceivable. Furthermore, an example in which the information processing system 1 as a combination of the terminal device 100 and the server device 200 corresponds to the "information processing device" of the present disclosure is also conceivable.

以上説明したように、実施形態に係る情報処理装置は、ユーザと継続的な会話を行うための自動応答処理を行う。 As described above, the information processing apparatus according to the embodiment performs automatic response processing for continuous conversation with the user.

一例として、情報処理装置において、単語解析部２３０は、ユーザ発話に基づいて、当該ユーザ発話のカテゴリを推定し、推定されたカテゴリと、対象ユーザの前会話情報とに基づいて、応答文章の生成に用いる、ユーザの継続的な発話を促進させる発話促進単語を特定する。また、応答生成部２４０は、推定された話題、推定されたカテゴリ、特定された発話促進単語に基づいて、応答文章を生成する。この構成によれば、ユーザと、情報処理装置により実現されるチャットボットとの会話において、チャットボットは、ユーザ発話のカテゴリを推定し、推定したカテゴリから発話促進単語を特定し、当該発話促進単語を用いた応答文章を生成することができる。当該発話促進単語を用いた応答文章を生成することにより、チャットボットは、ユーザの継続的な発話を促すことができる。したがって、実施形態に係る情報処理装置によれば、ユーザとチャットボットとが会話を円滑に継続できるように、ユーザの発話に対する応答を行うことができる。 As an example, in the information processing device, the word analysis unit 230 estimates the category of the user's utterance based on the user's utterance, and generates a response sentence based on the estimated category and the previous conversation information of the target user. Identify speech facilitators that encourage the user to continue speaking. Also, the response generation unit 240 generates a response sentence based on the estimated topic, the estimated category, and the specified utterance-promoting words. According to this configuration, in a conversation between the user and the chatbot realized by the information processing device, the chatbot estimates the category of the user's utterance, identifies the utterance-promoting word from the estimated category, and specifies the utterance-promoting word. can generate a response sentence using By generating a response sentence using the speech-promoting word, the chatbot can prompt the user to continue speaking. Therefore, according to the information processing apparatus according to the embodiment, it is possible to respond to the user's utterance so that the user and the chatbot can smoothly continue the conversation.

また、発話促進単語には興味単語が含まれ、応答生成部２４０は、興味単語を含む応答文章を生成する。ここで、興味単語は、ユーザの興味を引く単語であり、応答文章に興味単語が含まれていると、ユーザはチャットボットとの会話を継続しようと考える可能性が高くなる。つまり、チャットボットは、興味単語が含まれる応答文章を生成することにより、ユーザに継続的な発話を促すことができる。 In addition, the utterance-promoting words include interesting words, and response generation section 240 generates a response sentence including the interesting words. Here, the interesting word is a word that attracts the user's interest, and if the response text includes the interesting word, the user is more likely to think of continuing the conversation with the chatbot. That is, the chatbot can prompt the user to continuously speak by generating a response sentence containing the interesting word.

また、情報処理装置において、文章解析部２２０は、ユーザ発話に基づいて、当該ユーザ発話の話題を推定する。また、単語解析部２３０は、推定されたカテゴリと、推定された話題と、特定された興味単語とに基づいて、興味単語と連携して応答文章を形成する連携単語を特定する。また、応答生成部２４０は、興味単語及び連携単語を含む応答文章を生成する。この構成によれば、チャットボットは、興味単語と連携して応答文章を形成する連携単語を含む応答文章を生成することにより、自然な文章でユーザの興味を引くことができる。ここで、例えば、チャットボットが興味単語を含む応答文章を生成しても、生成した文章が不自然であれば、ユーザは発話しようという気持ちを失ってしまう可能性がある。したがって、実施形態に係る情報処理装置によれば、ユーザとチャットボットとが会話をより円滑に継続できるように、ユーザの発話に対する応答を行うことができる。 Also, in the information processing apparatus, the text analysis unit 220 estimates the topic of the user's utterance based on the user's utterance. In addition, word analysis section 230 identifies linked words that form a response sentence in cooperation with the interesting words based on the estimated category, the estimated topic, and the specified interesting words. Also, the response generation unit 240 generates a response sentence including the interesting word and the linked word. According to this configuration, the chatbot can attract the user's interest with a natural sentence by generating a response sentence containing a linked word that forms a response sentence in cooperation with the interesting word. Here, for example, even if a chatbot generates a response sentence containing an interesting word, if the generated sentence is unnatural, the user may lose the desire to speak. Therefore, according to the information processing apparatus according to the embodiment, it is possible to respond to the user's utterance so that the user and the chatbot can continue the conversation more smoothly.

また、単語解析部２３０は、ユーザ発話の内容に含まれる複数の単語から、推定された話題における重要単語を抽出する。また、単語解析部２３０は、話題と重要単語とに基づいて、当該ユーザ発話に対する応答文章の生成に用いる、ユーザ入力に対する共感を表す共感単語を特定する。また、応答生成部２４０は、共感単語を含む応答文章を生成する。この構成によれば、チャットボットは、ユーザ発話に対して共感を示すことができる。ここで、チャットボットに共感を示されることにより、ユーザは、チャットボットとの会話を継続しようと考える可能性が高くなる。つまり、チャットボットは、共感単語が含まれる応答文章を生成することにより、ユーザに継続的な発話を促すことができる。 Also, the word analysis unit 230 extracts important words in the estimated topic from a plurality of words included in the content of the user's utterance. In addition, the word analysis unit 230 identifies sympathy words expressing sympathy for the user's input, which are used to generate response sentences to the user's utterance based on the topic and the important words. Also, the response generation unit 240 generates a response sentence including sympathy words. According to this configuration, the chatbot can empathize with the user's speech. Here, when the chatbot shows sympathy, the user is more likely to think about continuing the conversation with the chatbot. In other words, the chatbot can prompt the user to continuously speak by generating response sentences containing sympathetic words.

また、単語解析部２３０は、ユーザ発話と、推定された話題とに基づいて、当該ユーザ発話の内容に含まれる複数の単語のうち、当該ユーザ発話に対する応答文章の生成に用いる繰り返し単語を特定する。また、応答生成部２４０は、繰り返し単語を含む応答文章を生成する。この構成によれば、チャットボットは、ユーザ発話中の重要な単語を繰り返すことにより、ユーザに対して、ユーザの話における重要なポイントを理解していることをアピールできる。これにより、ユーザは、チャットボットがきちんと話を聞いてくれていると感じやすくなり、チャットボットとの会話を継続しようと考える可能性が高くなる。つまり、チャットボットは、繰り返し単語が含まれる応答文章を生成することにより、ユーザに継続的な発話を促すことができる。 Further, based on the user utterance and the estimated topic, the word analysis unit 230 identifies repeated words used for generating a response sentence to the user utterance among a plurality of words included in the content of the user utterance. . Also, the response generation unit 240 generates a response sentence including repeated words. According to this configuration, the chatbot can appeal to the user that it understands the important points in the user's speech by repeating the important words in the user's speech. This makes it easier for users to feel that the chatbot is listening to them, and increases the likelihood that they will continue the conversation with the chatbot. In other words, the chatbot can prompt the user to continuously speak by generating response sentences containing repeated words.

また、単語解析部２３０は、複数個の単語を、応答文章の生成に用いる繰り返し単語として特定してもよい。これにより、チャットボットは、ユーザに対して、ユーザの話における重要なポイントを理解していることをより強くアピールできる。 Moreover, the word analysis unit 230 may identify a plurality of words as repeated words used for generating the response sentence. This allows the chatbot to more strongly appeal to the user that it understands the important points in the user's speech.

本発明の実施形態及び変形例を説明したが、この実施形態及び変形例は、例として提示したものであり、発明の範囲を限定することは意図していない。この新規な実施形態及び変形例は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。この実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。 While embodiments and variations of the present invention have been described, the embodiments and variations are provided by way of example and are not intended to limit the scope of the invention. The novel embodiments and modifications can be implemented in various other forms, and various omissions, replacements, and modifications can be made without departing from the scope of the invention. This embodiment and its modifications are included in the scope and gist of the invention, and are included in the scope of the invention described in the claims and its equivalents.

１…情報処理システム、１００…端末装置、１１０…通信部、１２０…入力部、１３０出力部、２００…サーバ装置（情報処理装置）、２１０…通信部、２２０…文章解析部、２２１…単語カテゴリデータベース、２３０…単語解析部、２３１…意図推定データベース、２３２…重要単語データベース、２３３…会話推定データベース、２３４…興味単語データベース、２４０…応答生成部、２４１…連携単語データベース、２４２…定型文データベース、２５０…管理部 Reference Signs List 1 information processing system 100 terminal device 110 communication unit 120 input unit 130 output unit 200 server device (information processing device) 210 communication unit 220 text analysis unit 221 word category Database, 230... Word analysis unit, 231... Intention estimation database, 232... Important word database, 233... Conversation estimation database, 234... Interesting word database, 240... Response generation unit, 241... Linked word database, 242... Fixed phrase database, 250 Management Department

本発明の第１態様は、ユーザ入力に基づいて、当該ユーザ入力の意図を推定する第１推定部と、前記意図と、前記ユーザ入力の履歴とに基づいて、前記ユーザ入力に対する応答文章の生成に用いる、ユーザによる継続的な発話を促進させる発話促進単語を特定する第１特定部と、前記意図と前記発話促進単語とに基づいて、前記応答文章を生成する生成部とを備え、前記生成部は、推定された前記ユーザ入力の意図が当該ユーザ入力よりも前に入力された同一のユーザによるユーザ入力の意図と異なる場合、当該ユーザに再発話を促す文章を生成する情報処理装置である。 A first aspect of the present invention includes: a first estimating unit that estimates an intention of the user input based on the user input; and a response sentence to the user input based on the intention and the history of the user input. and a generator for generating the response sentence based on the intention and the utterance-promoting word, wherein the generation The unit is an information processing device that, when the estimated intention of the user input is different from the intention of the user input by the same user that was input before the user input, generates a sentence prompting the user to re-speak. be.

また、本発明の第２態様は、ユーザ入力に基づいて、当該ユーザ入力の意図を推定する処理と、前記意図と、前記ユーザ入力の履歴とに基づいて、前記ユーザ入力に対する応答文章の生成に用いる、ユーザによる継続的な発話を促進させる発話促進単語を特定する処理と、前記意図と、前記発話促進単語とに基づいて、前記応答文章を生成する処理と、推定された前記ユーザ入力の意図が当該ユーザ入力よりも前に入力された同一のユーザによるユーザ入力の意図と異なる場合、当該ユーザに再発話を促す文章を生成する処理と、を情報処理装置のコンピュータに実行させるプログラムである。
A second aspect of the present invention includes processing for estimating an intention of the user input based on the user input, and generating a response sentence to the user input based on the intention and the history of the user input. a process of identifying a speech-promoting word that encourages continuous utterance by a user; a process of generating the response sentence based on the intention and the speech-promoting word; and an estimated intention of the user input. is different from the intention of a user input by the same user that was input before the user input, a process of generating a sentence prompting the user to re-speak.

Claims

a first estimation unit that estimates the intention of the user input based on the user input;
a first identification unit that identifies, based on the intention and the history of the user input, a speech-promoting word that is used to generate a response sentence to the user input and that promotes continuous utterance by the user;
a generation unit that generates the response sentence based on the intention and the speech prompt word;
Information processing device.

the utterance-promoting word includes an interest word that arouses the interest of the user who has performed the user input;
The generation unit generates the response sentence including the interesting word.
The information processing device according to claim 1 .

a second estimation unit that estimates a topic of the user input based on the user input;
a second identification unit that identifies a linked word that forms a response sentence in cooperation with the interesting word included in the speech prompt word based on the intention, the topic, and the interesting word,
The generation unit generates the response sentence including the interesting word and the linked word.
The information processing apparatus according to claim 2.

an extraction unit that extracts important words in the topic from a plurality of words included in the content of the user input;
a third specifying unit that specifies, based on the topic and the important words, an empathy word that is included in the speech-prompting word and expresses empathy for the user input, and which is used to generate a response sentence to the user input;
The generation unit generates the response sentence including the sympathy word.
The information processing apparatus according to claim 3.

identifying, based on the user input and the topic, a repeated word included in the utterance-promoting word used for generating a response sentence to the user input, among a plurality of words included in the content of the user input; 4 further comprising a specific part,
The generation unit generates the response sentence including the repeated word.
The information processing apparatus according to claim 3 or 4.

The fourth identification unit identifies a plurality of the repeated words,
The information processing device according to claim 5 .

In the computer of the information processing equipment,
a process of estimating the intention of the user input based on the user input;
a process of identifying, based on the intention and the history of the user input, speech-promoting words that are used to generate a response sentence to the user input and that promote continuous utterance by the user;
a process of generating the response sentence based on the intention and the speech prompt word;
program to run.