JP5718405B2

JP5718405B2 - Utterance selection apparatus, method and program, dialogue apparatus and method

Info

Publication number: JP5718405B2
Application number: JP2013099367A
Authority: JP
Inventors: 東中　竜一郎; 竜一郎東中; 松尾　義博; 義博松尾; 牧野　俊朗; 俊朗牧野; のぞみ小林; 平野　徹; 徹平野; 豊美目黒; 千明宮崎
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2013-05-09
Filing date: 2013-05-09
Publication date: 2015-05-13
Anticipated expiration: 2033-05-09
Also published as: JP2014219872A

Description

本発明は、発話選択装置、方法、及びプログラム、対話装置及び方法に係り、特に、対話システムで用いられる発話を選択するための発話選択装置、方法、及びプログラム、並びに選択された発話を用いてユーザとの対話を行うための対話装置及び方法に関する。 The present invention relates to an utterance selection device, method, and program, and an interaction device and method, and more particularly, to an utterance selection device, method, and program for selecting an utterance used in an interaction system, and the selected utterance. The present invention relates to an interactive apparatus and method for interacting with a user.

雑多な話題についてユーザと対話を行う雑談対話システムは、現在の対話における話題を把握し、その話題について応答することが重要である。しかし、全ての話題について予めシステムが行うべき発話のリストを準備しておくことはコスト面で難しい。そこで、現在の対話における話題を示すユーザの発話に含まれる単語について、インターネットからウェブ検索エンジンを用いて関連する文を抽出し、抽出した文の中から一つ以上の文を発話として用いて応答を行う対話システムが提案されている（例えば、非特許文献１参照）。 It is important for a chat dialogue system that performs dialogues with users on various topics to grasp the topics in the current dialogue and respond to the topics. However, it is difficult in terms of cost to prepare a list of utterances that the system should perform in advance for all topics. Therefore, for words included in the user's utterance indicating the topic in the current dialogue, a related sentence is extracted from the Internet using a web search engine, and one or more sentences are extracted as utterances from the extracted sentence. An interactive system that performs the above has been proposed (see Non-Patent Document 1, for example).

また、一般的なウェブ文書ではなく、マイクロブログサービスであるツイッターのデータを検索対象とし、ユーザの発話から得られた検索キーワードを元に関連するツイートを取得して、システム発話として用いる対話システムも提案されている（例えば、非特許文献２参照）。 There is also an interactive system that uses Twitter data, which is a microblog service, as a search target instead of a general web document, acquires related tweets based on search keywords obtained from user utterances, and uses them as system utterances. It has been proposed (see, for example, Non-Patent Document 2).

Shibata, M., Nishiguchi, T., and Tomiura, Y. (2009). "Dialog systemfor open-ended conversation using web documents." Infomatica, 33 (3), pp. 277-284.Shibata, M., Nishiguchi, T., and Tomiura, Y. (2009). "Dialog system for open-ended conversation using web documents." Infomatica, 33 (3), pp. 277-284. F. Bessho, T. Harada, and Y. Kuniyoshi, "Dialog system using real-time crowdsourcing and Twitter large-scale corpus," in Proc. SIGDIAL, 2012, pp.227-231.F. Bessho, T. Harada, and Y. Kuniyoshi, "Dialog system using real-time crowdsourcing and Twitter large-scale corpus," in Proc. SIGDIAL, 2012, pp.227-231.

非特許文献１及び非特許文献２に記載の技術では、話題となる単語について得られた検索結果を発話として利用するが、ウェブページやツイッターから得られた検索結果には様々な話題が含まれており、必ずしも所与の話題について適切な検索結果を取得できるとは限らない。そのため、質の低い発話を行ってしまう可能性がある。 In the techniques described in Non-Patent Document 1 and Non-Patent Document 2, search results obtained for a topic word are used as utterances, but search results obtained from web pages and Twitter include various topics. Therefore, it is not always possible to obtain appropriate search results for a given topic. Therefore, there is a possibility that a low quality utterance will be performed.

本発明は、対話システムで用いる発話として、質の高い発話を低コストで選択することができる発話選択装置、方法、及びプログラム、並びに選択された発話を用いて、質の高い応答を行うことができる対話装置及び方法を提供することを目的とする。 The present invention can perform a high-quality response using an utterance selection device, method, and program capable of selecting a high-quality utterance at low cost as an utterance used in a dialogue system, and the selected utterance. An object of the present invention is to provide an interactive apparatus and method.

上記目的を達成するために、第１の発明に係る発話選択装置は、単語に対して、該単語と共に予め定めた、単語と品詞の列からなる言語パターンに基づいて該単語に関連する関連語を収集して対応付けた辞書から、ユーザの発話から抽出された少なくとも１つの抽出単語に関連する関連語を取得する関連語取得部と、複数の発話を含む発話データベースから、前記抽出単語及び前記関連語取得部で取得された前記抽出単語に関連する関連語を含む検索クエリに対応した発話を検索する発話検索部と、前記発話検索部により検索された発話から、少なくとも１つの発話を選択する発話選択部と、を含んで構成されている。 In order to achieve the above object, an utterance selection device according to a first aspect of the present invention provides a related word related to a word on the basis of a language pattern composed of a word and part-of-speech sequence for the word. A related word acquisition unit that acquires a related word related to at least one extracted word extracted from a user's utterance from a dictionary that is collected and correlated, and from the utterance database including a plurality of utterances, the extracted word and the An utterance search unit that searches for an utterance corresponding to a search query including a related word related to the extracted word acquired by the related word acquisition unit, and at least one utterance is selected from the utterances searched by the utterance search unit And an utterance selection unit.

本発明に係る発話選択装置によれば、関連語取得部が、単語に対して、該単語と共に予め定めた、単語と品詞の列からなる言語パターンに基づいて該単語に関連する関連語を収集して対応付けた辞書から、ユーザの発話から抽出された少なくとも１つの抽出単語に関連する関連語を取得する。そして、発話検索部が、複数の発話を含む発話データベースから、抽出単語及び関連語取得部で取得された抽出単語に関連する関連語を含む検索クエリに対応した発話を検索する。さらに、発話選択部が、発話検索部により検索された発話から、少なくとも１つの発話を選択する。 According to the utterance selection device according to the present invention, the related word acquisition unit collects related words related to the word based on a language pattern including a word and a part-of-speech sequence that is predetermined together with the word. Then, a related word related to at least one extracted word extracted from the user's utterance is acquired from the associated dictionary. Then, the utterance search unit searches the utterance database corresponding to the search query including the extracted word and the related word related to the extracted word acquired by the related word acquiring unit from the utterance database including a plurality of utterances. Further, the utterance selection unit selects at least one utterance from the utterances searched by the utterance search unit.

このように、抽出単語に関連する関連語を検索クエリに含めて検索された発話から発話を選択するため、対話システムで用いる発話として、質の高い発話を低コストで選択することができる。 As described above, since the utterance is selected from the utterance searched by including the related word related to the extracted word in the search query, a high-quality utterance can be selected at low cost as the utterance used in the dialogue system.

また、前記関連語取得部は、前記辞書として、前記関連語に前記単語との関連度が対応付けられた辞書を用いて、前記抽出単語に関連する関連語を取得し、前記発話検索部は、前記抽出単語に関連する関連語のうち、前記関連度が所定値以上の関連語を前記検索クエリに含めることができる。また、前記関連度を、前記単語と前記関連語との共起の度合いとしたり、前記単語を含む発話に後続する発話に前記関連語が出現する度合いとしたりすることができる。これにより、抽出単語が表す話題に、より関連の高い発話を選択することができる。 In addition, the related word acquisition unit acquires a related word related to the extracted word using a dictionary in which the related word is associated with a degree of association with the word as the dictionary, and the utterance search unit Among the related words related to the extracted word, related words having the relevance level equal to or higher than a predetermined value can be included in the search query. Further, the degree of association may be the degree of co-occurrence of the word and the related word, or the degree of appearance of the related word in the utterance subsequent to the utterance including the word. As a result, it is possible to select an utterance that is more relevant to the topic represented by the extracted word.

また、前記発話選択部は、前記発話検索部により検索された発話から、前記抽出単語、前記関連語、及び発話内容に影響を与えない単語として予め定めたストップワード以外の単語を含む発話を除外することができる。これにより、質の高い発話を選択することができる。 Further, the utterance selection unit excludes utterances including words other than stop words that are predetermined as words that do not affect the extracted words, the related words, and utterance contents, from utterances searched by the utterance search unit. can do. Thereby, high quality speech can be selected.

また、前記発話選択部は、発話に含まれる前記関連語または前記ストップワードに基づいて、前記発話検索部により検索された発話をクラスタリングして、各クラスタの代表的な発話を選択することができる。これにより、選択する発話に多様性を持たせることができる。 Further, the utterance selection unit can cluster the utterances searched by the utterance search unit based on the related word or the stop word included in the utterance, and select a representative utterance of each cluster. . Thereby, diversity can be given to the selected utterance.

また、前記発話選択部は、発話に含まれる前記関連語または前記ストップワードに基づいて、前記発話検索部により検索された発話をクラスタリングして、各クラスタに含まれる発話数が所定数以上のクラスタの代表的な発話を選択することができる。これにより、抽出単語について話題になり易い発話を選択することができる。 Further, the utterance selection unit clusters the utterances searched by the utterance search unit based on the related word or the stop word included in the utterance, and the number of utterances included in each cluster is a predetermined number or more. The representative utterance can be selected. As a result, it is possible to select an utterance that tends to become a topic for the extracted word.

また、前記発話選択部は、少なくともユーザの直前の発話を取得し、前記発話検索部により検索された発話のうち、前記直前の発話に含まれる単語を含む発話を選択することができる。これにより、潤滑な対話を実現するための発話を選択することができる。 Further, the utterance selection unit can acquire at least the utterance immediately before the user, and can select an utterance including a word included in the immediately preceding utterance out of the utterances searched by the utterance search unit. Thereby, it is possible to select an utterance for realizing a lubricious dialogue.

第２の発明に係る対話装置は、ユーザの発話を受け付ける発話受付部と、上記の発話選択装置により選択された発話のいずれかを出力する発話出力部と、を含んで構成されている。これにより、質の高い応答を行うことができる。 The dialogue apparatus according to the second aspect of the invention includes an utterance accepting unit that accepts a user's utterance and an utterance output unit that outputs one of the utterances selected by the utterance selection device. Thereby, a high-quality response can be performed.

また、第３の発明に係る発話選択方法は、関連語取得部が、単語に対して、該単語と共に予め定めた、単語と品詞の列からなる言語パターンに基づいて該単語に関連する関連語を収集して対応付けた辞書から、ユーザの発話から抽出された少なくとも１つの抽出単語に関連する関連語を取得するステップと、発話検索部が、複数の発話を含む発話データベースから、前記抽出単語及び前記関連語取得部で取得された前記抽出単語に関連する関連語を含む検索クエリに対応した発話を検索するステップと、発話選択部が、前記発話検索部により検索された発話から、少なくとも１つの発話を選択するステップとを含む方法である。 The utterance selection method according to the third aspect of the invention relates to a related word related to a word based on a language pattern consisting of a word and a part of speech sequence that is previously determined by the related word acquisition unit together with the word. from speech databases from the collected association dictionary, comprising the steps of: obtaining related terms associated with at least one extracted words are extracted from the utterance of the user, the speech retrieval unit, a plurality of utterances, said extraction word And a step of searching for an utterance corresponding to a search query including a related word related to the extracted word acquired by the related word acquisition unit, and an utterance selection unit at least 1 from the utterance searched by the utterance search unit Selecting one utterance.

また、第４の発明に係る対話方法は、発話受付部が、ユーザの発話を受け付けるステップと、発話出力部が、上記の発話選択方法により選択された発話のいずれかを出力するステップとを含む方法である。 The dialogue method according to the fourth invention includes a step in which the utterance receiving unit receives the user's utterance and a step in which the utterance output unit outputs one of the utterances selected by the utterance selection method. Is the method.

また、第５の発明に係る発話選択プログラムは、コンピュータを、上記の発話選択装置を構成する各部として機能させるためのプログラムである。 An utterance selection program according to a fifth aspect of the invention is a program for causing a computer to function as each unit constituting the utterance selection device.

以上説明したように、本発明の発話選択装置、方法、及びプログラムによれば、抽出単語に関連する関連語を検索クエリに含めて検索された発話から発話を選択するため、対話システムで用いる発話として、質の高い発話を低コストで選択することができる、という効果が得られる。また、本発明の対話装置及び方法によれば、上記のように選択された発話を用いて、質の高い応答を行うことができる、という効果が得られる。 As described above, according to the utterance selection device, method, and program of the present invention, an utterance used in a dialog system to select an utterance from utterances that are searched by including a related word related to an extracted word in a search query. As a result, it is possible to select a high quality speech at a low cost. In addition, according to the interactive apparatus and method of the present invention, an effect that a high-quality response can be performed using the utterance selected as described above can be obtained.

第１の実施の形態に係る発話選択装置の機能的な構成例を示すブロック図である。It is a block diagram which shows the functional structural example of the speech selection apparatus which concerns on 1st Embodiment. 関連度を計算するためのテーブルの一例を示す図である。It is a figure which shows an example of the table for calculating an association degree. 「ラーメン」についての関連語の一例を示す図である。It is a figure which shows an example of the related word about "ramen." 対話データを用いた関連度を計算するためのテーブルの一例を示す図である。It is a figure which shows an example of the table for calculating the relevance using dialog data. 関連度を再計算した「ラーメン」についての関連語の一例を示す図である。It is a figure which shows an example of the related word about "ramen" which recalculated the relevance degree. 検索クエリの一例を示す図である。It is a figure which shows an example of a search query. 検索結果の一例を示す図である。It is a figure which shows an example of a search result. クラスタリング結果の一例を示す図である。It is a figure which shows an example of a clustering result. 第１の実施の形態における発話選択処理を示すフローチャートである。It is a flowchart which shows the speech selection process in 1st Embodiment. 第２の実施の形態に係る対話装置の機能的な構成例を示すブロック図である。It is a block diagram which shows the functional structural example of the dialogue apparatus which concerns on 2nd Embodiment.

以下、図面を参照して、本発明の実施の形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

＜第１の実施の形態＞
第１の実施の形態に係る発話選択装置１０は、ＣＰＵと、ＲＡＭと、後述する発話選択処理を実行するためのプログラムを記憶したＲＯＭとを備えたコンピュータで構成される。このコンピュータは、機能的には、図１に示すように、単語入力部１１、関連語取得部１２、発話検索部１３、フィルタリング部１４、クラスタリング部１５、及び発話候補選択部１６を含んだ構成で表すことができる。なお、フィルタリング部１４、クラスタリング部１５、及び発話候補選択部１６は、本発明の発話選択部の一例である。 <First Embodiment>
The utterance selection device 10 according to the first embodiment is configured by a computer including a CPU, a RAM, and a ROM that stores a program for executing an utterance selection process described later. Functionally, the computer includes a word input unit 11, a related word acquisition unit 12, an utterance search unit 13, a filtering unit 14, a clustering unit 15, and an utterance candidate selection unit 16, as shown in FIG. Can be expressed as The filtering unit 14, the clustering unit 15, and the utterance candidate selection unit 16 are examples of the utterance selection unit of the present invention.

単語入力部１１は、ユーザの発話から抽出された現在の話題に対応する一つ以上の形態素からなる単語を入力として受け取る。このユーザの発話から抽出された単語を「抽出単語」という。例えば、ユーザの発話に含まれる品詞が名詞の単語を抽出単語として抽出することができる。また、単語入力部１１は、抽出単語に加えて、少なくともユーザの直前の発話を含む対話履歴を受け取る。単語入力部１１は、受け取った抽出単語を関連語取得部１２へ受け渡すと共に、対話履歴を対話履歴記憶部２１に記憶する。 The word input unit 11 receives, as an input, a word composed of one or more morphemes corresponding to the current topic extracted from the user's utterance. The word extracted from the user's utterance is referred to as “extracted word”. For example, a word whose part of speech included in the user's utterance is a noun can be extracted as an extracted word. In addition to the extracted word, the word input unit 11 receives a dialogue history including at least the utterance immediately before the user. The word input unit 11 passes the received extracted word to the related word acquisition unit 12 and stores the dialogue history in the dialogue history storage unit 21.

関連語取得部１２は、単語入力部１１から受け渡された抽出単語に関連する関連語を、複数の単語の各々に関連する関連語が収録された関連語辞書２２を参照して取得する。関連語辞書２２には、見出し単語に関連した品詞が名詞、動詞、及び形容詞の各々の単語が見出し単語に関連する関連語として、見出し単語に対応付けられて収録されている。関連語は、例えば、大規模なブログデータから予め定めた言語パターンを用いて収集することができる。より具体的には、名詞は「Ａ（名詞）のＢ（名詞）」というパターンを、動詞は「Ａ（名詞）をＢ（動詞）」というパターンを、形容詞は「Ａ（名詞）はＢ（形容詞）」または「Ａ（名詞）がＢ（形容詞）」というパターンを用いて、形態素解析されたブログデータに正規表現によるパターンマッチを行い、見出し単語Ａについて関連する品詞が名詞、動詞、及び形容詞の単語Ｂを列挙し、関連語として収集することができる。 The related word acquisition unit 12 acquires related words related to the extracted word delivered from the word input unit 11 with reference to the related word dictionary 22 in which related words related to each of the plurality of words are recorded. In the related word dictionary 22, the part of speech related to the heading word is recorded in association with the heading word as the related word related to the heading word, each of the noun, verb and adjective words. For example, the related terms can be collected from large-scale blog data using a predetermined language pattern. More specifically, the noun has the pattern “B (noun) of A (noun)”, the verb has the pattern “A (noun) B (verb)”, and the adjective “A (noun) B ( Pattern matching by regex on the blog data subjected to morphological analysis using the pattern “adjective” or “A (noun) is B (adjective)”, and the related part of speech for the heading word A is the noun, verb, and adjective Words B can be listed and collected as related words.

なお、関連語辞書２２の見出し単語となる単語に対して、特に関連度の高い関連語のみを関連語辞書２２に含めるために、対数尤度比を用いた関連語のフィルタリングを行ってもよい。例えば、ある見出し単語ｗについて関連語ｒがあるとき、この見出し単語ｗに関連語ｒがどの程度関連しているかを示す関連度を、図２に示すテーブルに従って計算する。図２に示すテーブルにおいて、Ｎは見出し単語と関連語とのペアの総数（出現回数の総数）である。ｃ１２は見出し単語がｗで関連語がｒのペアの総数である。ｃ１は見出し単語がｗのペアの数、ｃ２は関連語がｒのペアの数である。ここから、ｃ１２がどれほど偏った値かを対数尤度比を用いて計算する。対数尤度比は、例えば下記（１）式により計算することができる。 In addition, with respect to a word that is a headword in the related word dictionary 22, in order to include only related words having a high degree of relevance in the related word dictionary 22, related words may be filtered using a log likelihood ratio. . For example, when there is a related word r for a certain heading word w, the degree of association indicating how much the related word r is related to the heading word w is calculated according to the table shown in FIG. In the table shown in FIG. 2, N is the total number of headword-related word pairs (total number of appearances). c12 is the total number of pairs in which the headword is w and the related word is r. c1 is the number of pairs whose headword is w, and c2 is the number of pairs whose related word is r. From here, how much the value of c12 is biased is calculated using the log likelihood ratio. The log likelihood ratio can be calculated by, for example, the following equation (1).

ここで、Ｏ_ｉは図２に示すテーブル中の各セルの値、すなわちｃ１２、ｃ１−ｃ１２、ｃ２−ｃ１２、Ｎ−ｃ２−ｃ１＋ｃ１２の各々である。Ｅ_ｉはＯ_ｉの期待値である。例えば、ｃ１２の期待値はｃ１＊（ｃ２／Ｎ）とすることができる。このようにして求めた対数尤度比を関連度として用いる。対数尤度比はカイ二乗分布に従うため、対数尤度比が１０．８３以上であれば、見出し単語と関連語とが、統計的に有意（ｐ＜０．００１）に関連していると考えられる。そこで、各見出し単語について、対数尤度比が１０．８３を超える関連語のみを収録することができる。なお、関連度は、単語と関連語との共起を表す尺度であればよく、対数尤度比以外にも、カイ二乗値、Ｔスコア、相互情報量などを用いてもよい。図３に、見出し単語「ラーメン」についての関連語の一例を示す。図３では、関連度である対数尤度比順で上位２０件を表示している。 Here, O _i is the value of each cell in the table shown in FIG. 2, that is, c12, c1-c12, c2-c12, and N-c2-c1 + c12. E _i is the expected value of O _i . For example, the expected value of c12 can be c1 * (c2 / N). The log likelihood ratio obtained in this way is used as the relevance. Since the log-likelihood ratio follows a chi-square distribution, if the log-likelihood ratio is 10.83 or more, it is considered that the headword and the related word are statistically significant (p <0.001). It is done. Therefore, for each headword, only related words having a log likelihood ratio exceeding 10.83 can be recorded. The relevance may be a scale representing the co-occurrence of a word and a related word, and in addition to the log likelihood ratio, a chi-square value, a T score, a mutual information amount, and the like may be used. FIG. 3 shows an example of a related word for the heading word “ramen”. In FIG. 3, the top 20 items are displayed in the log likelihood ratio order as the degree of association.

また、対話システムに適した関連度として、対話データにおける単語間の共起の度合いを用いてもよい。例えば、ある発話に単語ｗ１が出現し、後続する発話に単語ｗ２が出現し易い傾向にあれば、単語ｗ１と単語ｗ２とは関連が強いと言える。そこで、上記のように、見出し単語及び関連語の出現頻度を用いて計算された対数尤度比を関連度としてフィルタリングされた関連語について、図４に示すテーブルを用いて関連度を再計算する。図４に示すテーブルにおいて、Ｍは対話データにおける発話の隣接ペアの総数である。ｃ１２は単語ｗ１が発話ペアの前発話に現れ、単語ｗ２が発話ペアの後続発話に出現するペアの総数である。ｃ１は単語ｗ２が発話ペアの後続発話に現れるペアの総数、ｃ２は単語ｗ１が発話ペアの前発話に現れるペアの総数である。図４に示すテーブルの各セルの値を用いて、（１）式により計算される対数尤度比を関連度として再計算する。 Further, the degree of co-occurrence between words in the dialog data may be used as the degree of association suitable for the dialog system. For example, if the word w1 appears in a certain utterance and the word w2 tends to appear in a subsequent utterance, it can be said that the word w1 and the word w2 are strongly related. Therefore, as described above, for the related words filtered using the log likelihood ratio calculated using the appearance frequency of the headword and related words, the relevance is recalculated using the table shown in FIG. . In the table shown in FIG. 4, M is the total number of adjacent pairs of utterances in the dialog data. c12 is the total number of pairs in which the word w1 appears in the previous utterance of the utterance pair and the word w2 appears in the subsequent utterance of the utterance pair. c1 is the total number of pairs in which the word w2 appears in the subsequent utterance of the utterance pair, and c2 is the total number of pairs in which the word w1 appears in the previous utterance of the utterance pair. Using the value of each cell in the table shown in FIG. 4, the log likelihood ratio calculated by equation (1) is recalculated as the relevance.

第１の実施の形態における発話選択装置１０は、対話システムで用いる発話を選択するためのものであるため、上記のように対話データから計算した関連度を用いる場合について説明する。対話データのコーパスとして、マイクロブログサービスであるツイッターのデータを用い、返信関係にあるツイートをペアとして収集してデータベース化し、このデータベースから共起の度合いを示す対数尤度比を関連度として算出することができる。図５に、見出し単語「ラーメン」について、関連度を再計算した関連語の一例を示す。図５では、対話データから算出した関連度である対数尤度比順で上位２０件を表示している。 Since the utterance selection device 10 in the first embodiment is for selecting an utterance to be used in the dialogue system, the case where the degree of association calculated from the dialogue data as described above is used will be described. Use Twitter data, which is a microblog service, as a corpus of dialogue data, collect tweets in a reply relationship as a pair, create a database, and calculate the log-likelihood ratio indicating the degree of co-occurrence as the relevance from this database be able to. FIG. 5 shows an example of a related word obtained by recalculating the relevance level for the headword “ramen”. In FIG. 5, the top 20 items are displayed in the log likelihood ratio order, which is the degree of association calculated from the conversation data.

第１の実施の形態では、ブログデータから抽出した約２００万の単語を見出し単語に持ち、見出し単語についての各関連語に、関連度として対数尤度比が付与された関連語辞書２２を用いる。関連語取得部１２は、関連語辞書２２を参照して取得した抽出単語の関連語を、単語入力部１１から受け渡された抽出単語と共に、発話検索部１３へ受け渡す。 In the first embodiment, a related word dictionary 22 having about 2 million words extracted from blog data as headline words and having a log likelihood ratio as a relevance level for each related word for the headline word is used. . The related word acquisition unit 12 transfers the related word of the extracted word acquired with reference to the related word dictionary 22 together with the extracted word transferred from the word input unit 11 to the utterance search unit 13.

発話検索部１３は、関連語取得部１２から受け渡された抽出単語及び関連語を用いて、発話データベース（ＤＢ）２３を検索する。例えば、抽出単語を必ず含む発話の中で、関連語が含まれている場合には検索結果のスコアにその関連語の対数尤度比の対数を加算したとき、スコアが大きくなるものから順にＬ件の発話を取得する。例えば、Ｌは１００とすることができる。対数尤度比の対数を用いるのは、対数尤度比が非常に大きくなる場合があるための措置である。このように検索することで、関連度の高い関連語をなるべく多く含む発話を発話ＤＢ２３から取得することができる。 The utterance search unit 13 searches the utterance database (DB) 23 using the extracted word and the related word delivered from the related word acquisition unit 12. For example, in a utterance that always includes an extracted word, if a related word is included, the logarithm of the log likelihood ratio of the related word is added to the score of the search result. Get utterances. For example, L can be 100. The use of the logarithm of the log likelihood ratio is a measure for the case where the log likelihood ratio may become very large. By searching in this way, an utterance including as many related words as possible can be acquired from the utterance DB 23.

発話ＤＢ２３としては、例えば、ツイッターのデータを収集し、収集したデータを句点及び記号によって文に分割した上で、一文を一つの発話を示す一レコードとして登録したものを用いることができる。発話ＤＢ２３は、全文検索が可能なように、転置インデックスを保持することができる。このような発話ＤＢ２３の構築には、一般的な全文検索エンジンのデータベースを構築する手順を用いることができ、例えば、フリーソフトのＬｕｃｅｎｅやＮａｍａｚｕなどを用いて構築することができる。 As the utterance DB 23, for example, it is possible to use data obtained by collecting Twitter data, dividing the collected data into sentences by punctuation marks and symbols, and registering one sentence as one record indicating one utterance. The utterance DB 23 can hold a transposed index so that a full text search is possible. The utterance DB 23 can be constructed by using a general procedure for constructing a full-text search engine database. For example, it can be constructed by using free software Lucene or Namazu.

図６に、抽出単語「ラーメン」及びその関連語を用いた検索クエリの一例を示す。図６の例は、Ｌｕｃｅｎｅによる検索クエリの書式を用いている。空白区切りのそれぞれが検索条件であり、「ｃｏｎｔｅｎｔ：麺＾９．１７０９５９」は発話の内容（ｃｏｎｔｅｎｔ）に「麺」という単語が入っている場合には、９．１７０９５９のスコア（「麺」の対数尤度比の対数）が付与されることを表している。「＋ｃｏｎｔｅｎｔ：“ラーメン”」のようにプラスから始まる検索条件は必須条件を表し、これは「ラーメン」という抽出単語を必ず含むという条件を表す。 FIG. 6 shows an example of a search query using the extracted word “ramen” and its related words. The example of FIG. 6 uses a search query format by Lucene. Each blank delimiter is a search condition, and “content: noodle ^ 9.170959” contains the word “noodle” in the content of the utterance (content), the score of 9.170959 (“noodle” (Logarithm of log likelihood ratio) is given. A search condition that starts with a plus, such as “+ content:“ ramen ”” represents an indispensable condition, which represents a condition that the extracted word “ramen” is necessarily included.

図６に示す検索クエリを用いて発話ＤＢ２３を検索した場合の検索結果の上位１０件の一例を図７に示す。抽出単語「ラーメン」と特に関連する発話のみが検索されていることが分かる。図７中の「検索結果のスコア」は、取得した発話に含まれる関連語の対数尤度比の対数の総和である。発話検索部１３は、取得した発話集合をフィルタリング部１４に受け渡す。 FIG. 7 shows an example of the top 10 search results when the utterance DB 23 is searched using the search query shown in FIG. It can be seen that only utterances particularly related to the extracted word “ramen” are retrieved. The “search result score” in FIG. 7 is the sum of the logarithm of the log likelihood ratio of the related words included in the acquired utterance. The utterance search unit 13 passes the acquired utterance set to the filtering unit 14.

フィルタリング部１４は、発話検索部１３から受け渡された発話集合に対してフィルタリングを行う。発話検索部１３によって取得された発話集合に含まれる発話の各々は、抽出単語との関連が比較的高い発話であるが、さらに抽出単語に関連した発話だけを残すための処理である。 The filtering unit 14 filters the utterance set delivered from the utterance search unit 13. Each of the utterances included in the utterance set acquired by the utterance search unit 13 is an utterance having a relatively high relationship with the extracted word, but is a process for leaving only the utterance related to the extracted word.

具体的には、フィルタリング部１４は、発話集合に含まれる各発話に対して形態素解析を行う。形態素解析には、形態素解析器として、ＮＴＴのＪＴＡＧ、フリーソフトのＣｈａＳｅｎ、Ｍｅｃａｂなどを用いることができる。フィルタリング部１４は、形態素解析結果から、内容語（名詞、動詞、形容詞、及び未知語）を抽出し、抽出した内容語を、抽出単語、関連語、ストップワード、及びそれ以外の単語に分類する。そして、フィルタリング部１４は、それ以外の単語に分類した単語が含まれる発話を発話集合から除外する。 Specifically, the filtering unit 14 performs morphological analysis on each utterance included in the utterance set. For morphological analysis, NTT's JTAG, free software ChaSen, Mecab, or the like can be used as a morphological analyzer. The filtering unit 14 extracts content words (nouns, verbs, adjectives, and unknown words) from the morphological analysis result, and classifies the extracted content words into extracted words, related words, stop words, and other words. . Then, the filtering unit 14 excludes utterances including words classified as other words from the utterance set.

ここでは、ストップワードに対応するものとして、動詞及び形容詞である用言を用いる。用言は体言（名詞）と異なり、発話内容に大きく影響を与えない単語である。なお、用言以外にも、発話内容に影響を与えないような補助名詞（「こと」や「もの」など）などをストップワードに含めてもよい。フィルタリング部１４は、フィルタリングした発話集合をクラスタリング部１５へ受け渡す。 Here, verbs and adjectives are used as words corresponding to stop words. Unlike prescriptive words (nouns), predicates are words that do not significantly affect the utterance content. In addition to the precautions, auxiliary nouns (such as “thing” and “thing”) that do not affect the utterance content may be included in the stop word. The filtering unit 14 passes the filtered utterance set to the clustering unit 15.

クラスタリング部１５は、フィルタリング部１４から受け渡された発話集合に含まれる各発話を、各発話に含まれる品詞が名詞の関連語、または発話に品詞が名詞の関連語が含まれていない場合はストップワードによってクラスタリングする。図８に、抽出単語「ラーメン」について取得及びフィルタリングされた発話をクラスタリングした結果の一例を示す。なお、図８のクラスタリング結果には、「食べ」が関連語に含まれているが、これは形態素解析誤りによって「食べ」の品詞が「名詞」とされたためである。サ変名詞（動作性名詞）は動詞として扱う。 When the clustering unit 15 includes each utterance included in the utterance set passed from the filtering unit 14, the part of speech included in each utterance is a noun related word, or the utterance does not include a noun related word Cluster by stop word. FIG. 8 shows an example of the result of clustering utterances acquired and filtered for the extracted word “ramen”. In the clustering result of FIG. 8, “eating” is included in the related word because the part of speech of “eating” is set to “noun” due to a morphological analysis error. Sa variant nouns (behavioral nouns) are treated as verbs.

発話に品詞が名詞の関連語が含まれていない場合は、発話に含まれるストップワードによってクラスタリングを行う。例えば、「ラーメンうまい」、「ラーメン美味しい」といった、抽出単語に対して特に意味内容が変更されないような発話からなるクラスタが得られる。これらはそれぞれ「うまい」及び「美味しい」という用言を持つクラスタに分類される。クラスタリング部１５は、クラスタリング結果を発話候補選択部１６へ受け渡す。 If an utterance does not contain a related word whose part of speech is a noun, clustering is performed using stop words included in the utterance. For example, a cluster composed of utterances whose semantic contents are not changed with respect to the extracted words, such as “good ramen” and “good ramen” can be obtained. These are categorized into clusters with the phrases “delicious” and “delicious” respectively. The clustering unit 15 passes the clustering result to the utterance candidate selection unit 16.

発話候補選択部１６は、クラスタリング部１５から受け渡されたクラスタリング結果に基づいて、３つの発話集合を作成する。一つ目は、関連語が含まれるクラスタにおいて、そのクラスタの代表的な発話が、対話履歴記憶部２１に保存されたユーザの直前の発話に含まれる内容語を含んでいる場合に、その発話を抽出することで得られる発話集合である。この発話集合を以下では「集合１」という。なお、ユーザの直前の発話に含まれる内容語は、対話履歴記憶部２１に保存されたユーザの直前の発話を読み出して、ユーザの直前の発話に対して形態素解析を行い、形態素解析結果から抽出することができる。二つ目は、二発話以上の複数の発話を持つクラスタから代表的な発話を一つずつ抽出することで得られる発話集合である。この発話集合を以下では「集合２」という。三つ目は、全てのクラスタの各々から代表的な発話を一つずつ抽出することで得られる発話集合である。この発話集合を以下では「集合３」という。 The utterance candidate selection unit 16 creates three utterance sets based on the clustering result transferred from the clustering unit 15. First, in a cluster including related words, when a typical utterance of the cluster includes a content word included in the utterance immediately before the user stored in the dialogue history storage unit 21, the utterance Is an utterance set obtained by extracting. Hereinafter, this utterance set is referred to as “set 1”. The content words included in the utterance immediately before the user are extracted from the morpheme analysis result by reading the utterance immediately before the user stored in the dialogue history storage unit 21 and performing morphological analysis on the utterance immediately before the user. can do. The second is an utterance set obtained by extracting representative utterances one by one from a cluster having a plurality of utterances of two or more utterances. Hereinafter, this utterance set is referred to as “set 2”. The third is an utterance set obtained by extracting one representative utterance from each of all clusters. Hereinafter, this utterance set is referred to as “set 3”.

各クラスタの代表的な発話の抽出法としてはどのような方法を用いてもよい。例えば、クラスタ内で最も短い発話をそのクラスタの代表的な発話とすることができる。最も短い発話は最も簡潔な文であり、そのクラスタの内容を密に含んでいると考えられるからである。最も短い発話を用いる以外にも、重要文選択で一般的に用いられるＴＦＩＤＦ（ＴＦ：Term Frequency、ＩＤＦ：Inverse Document Frequency）などの基準に基づいて、各クラスタの代表的な発話を抽出してもよい。 Any method may be used as a representative utterance extraction method for each cluster. For example, the shortest utterance in the cluster can be set as the representative utterance of the cluster. This is because the shortest utterance is the simplest sentence and is considered to contain the contents of the cluster densely. In addition to using the shortest utterance, a representative utterance of each cluster may be extracted based on a standard such as TFIDF (TF: Term Frequency, IDF: Inverse Document Frequency) generally used for important sentence selection. Good.

発話候補選択部１６は、３つの発話集合を作成した上で、集合１が空集合でない場合は、集合１を発話候補集合として選択する。集合１が空集合の場合は集合２を発話候補集合として選択する。集合１も集合２も空集合の場合は、集合３を発話候補集合として選択する。この順で発話候補集合を選択することにより、直前のユーザ発話に応じた応答を返すことが可能であり、また、集合１が空の場合でも、集合２のように複数の発話が含まれているクラスタからの発話を優先的に選択することによって、話題（抽出単語）についてよく話されている内容の発話を選択することができる。集合１及び集合２が空の場合は、集合３を用いることで、抽出単語に関連した何らかの発話を選択することができる。 The utterance candidate selection unit 16 creates three utterance sets and, if the set 1 is not an empty set, selects the set 1 as an utterance candidate set. If set 1 is an empty set, set 2 is selected as the utterance candidate set. If neither set 1 nor set 2 is an empty set, set 3 is selected as the utterance candidate set. By selecting the utterance candidate set in this order, it is possible to return a response according to the immediately preceding user utterance, and even when the set 1 is empty, a plurality of utterances are included as in the set 2 By preferentially selecting an utterance from a certain cluster, it is possible to select an utterance having a content often spoken about the topic (extracted word). When set 1 and set 2 are empty, by using set 3, any utterance related to the extracted word can be selected.

発話候補選択部１６は、選択した発話候補集合を出力する。なお、集合１〜３の全てが空集合の場合には、発話候補選択部１６は、選択結果として空集合を出力する。 The utterance candidate selection unit 16 outputs the selected utterance candidate set. When all of the sets 1 to 3 are empty sets, the utterance candidate selection unit 16 outputs the empty set as a selection result.

次に、第１の実施の形態に係る発話選択装置１０の作用について説明する。上述した関連語辞書２２及び発話ＤＢ２３が用意された状態で、発話選択装置１０に抽出単語及び対話履歴が入力されると、発話選択装置１０において、図９に示す発話選択処理が実行される。 Next, the operation of the utterance selection device 10 according to the first embodiment will be described. When the extracted word and the conversation history are input to the utterance selection device 10 in a state where the related word dictionary 22 and the utterance DB 23 are prepared, the utterance selection processing shown in FIG. 9 is executed in the utterance selection device 10.

発話選択処理のステップ１００で、単語入力部１１が、入力された抽出単語及び対話履歴を受け取り、受け取った抽出単語を関連語取得部１２へ受け渡すと共に、対話履歴を対話履歴記憶部２１に記憶する。 In step 100 of the utterance selection process, the word input unit 11 receives the input extracted word and dialogue history, passes the received extracted word to the related word acquisition unit 12, and stores the dialogue history in the dialogue history storage unit 21. To do.

次に、ステップ１０２で、関連語取得部１２が、単語入力部１１から受け渡された抽出単語の関連語を、複数の単語の各々に対する関連語が収録された関連語辞書２２を参照して取得する。関連語取得部１２は、取得した関連語を、単語入力部１１から受け渡された抽出単語と共に、発話検索部１３へ受け渡す。 Next, in step 102, the related word acquisition unit 12 refers to the related word dictionary 22 in which the related words for each of the plurality of words are recorded for the related words of the extracted words passed from the word input unit 11. get. The related word acquisition unit 12 transfers the acquired related word to the utterance search unit 13 together with the extracted word transferred from the word input unit 11.

次に、ステップ１０４で、発話検索部１３が、関連語取得部１２から受け渡された抽出単語及び関連語による検索クエリを用いて、発話ＤＢ２３を検索し、抽出単語を含む発話であって、発話に含まれる関連語の関連度に応じたスコア順にＬ件の発話を取得する。発話検索部１３は、取得した発話集合をフィルタリング部１４に受け渡す。 Next, in step 104, the utterance search unit 13 searches the utterance DB 23 using the extracted word and the search query based on the related word delivered from the related word acquisition unit 12, and includes the extracted word. L utterances are acquired in the order of scores according to the degree of association of related words included in the utterance. The utterance search unit 13 passes the acquired utterance set to the filtering unit 14.

次に、ステップ１０６で、フィルタリング部１４が、発話検索部１３から受け渡された発話集合に含まれる各発話に対して形態素解析を行い、形態素解析結果から、内容語を抽出し、抽出した内容語を、抽出単語、関連語、ストップワード、及びそれ以外の単語に分類する。そして、フィルタリング部１４は、それ以外の単語に分類した単語が含まれる発話を発話集合から除外する。フィルタリング部１４は、フィルタリングした発話集合をクラスタリング部１５へ受け渡す。 Next, in step 106, the filtering unit 14 performs morphological analysis on each utterance included in the utterance set passed from the utterance search unit 13, extracts content words from the morphological analysis results, and extracts the extracted content. The words are classified into extracted words, related words, stop words, and other words. Then, the filtering unit 14 excludes utterances including words classified as other words from the utterance set. The filtering unit 14 passes the filtered utterance set to the clustering unit 15.

次に、ステップ１０８で、クラスタリング部１５が、フィルタリング部１４から受け渡された発話集合に含まれる各発話を、各発話に含まれる品詞が名詞の関連語、または発話に品詞が名詞の関連語が含まれていない場合はストップワードによってクラスタリングする。クラスタリング部１５は、クラスタリング結果を発話候補選択部１６へ受け渡す。 Next, in step 108, the clustering unit 15 selects each utterance included in the utterance set passed from the filtering unit 14, the part of speech included in each utterance is a noun related word, or the part of speech contains a noun related word. If is not included, clustering is performed using stop words. The clustering unit 15 passes the clustering result to the utterance candidate selection unit 16.

次に、ステップ１１０で、発話候補選択部１６が、クラスタリング部１５から受け渡されたクラスタリング結果に基づいて、関連語が含まれるクラスタの代表的な発話のうち、対話履歴記憶部２１に保存されたユーザの直前の発話に含まれる内容語を含む発話を抽出した発話集合（集合１）を作成する。また、発話候補選択部１６が、二発話以上の複数の発話を持つクラスタから代表的な発話を一つずつ抽出した発話集合（集合２）を作成する。また、発話候補選択部１６が、全てのクラスタの各々から代表的な発話を一つずつ抽出した発話集合（集合３）を作成する。 Next, in step 110, the utterance candidate selection unit 16 is stored in the dialogue history storage unit 21 among representative utterances of clusters including related words based on the clustering result passed from the clustering unit 15. An utterance set (set 1) is created by extracting utterances including content words included in the utterance immediately before the user. Further, the utterance candidate selection unit 16 creates an utterance set (set 2) in which representative utterances are extracted one by one from a cluster having a plurality of utterances of two or more utterances. Further, the utterance candidate selection unit 16 creates an utterance set (set 3) in which representative utterances are extracted one by one from all the clusters.

次に、ステップ１１２で、発話候補選択部１６が、集合１が空集合か否かを判定する。集合１が空集合でない場合には、ステップ１１４へ移行し、発話候補選択部１６が、集合１を発話候補集合として選択して出力する。集合１が空集合の場合には、ステップ１１６へ移行し、発話候補選択部１６が、集合２が空集合か否かを判定する。集合２が空集合でない場合には、ステップ１１８へ移行して、発話候補選択部１６が、集合２を発話候補として選択して出力する。集合２が空集合の場合には、ステップ１２０へ移行し、発話候補選択部１６が、集合３が空集合か否かを判定する。集合３が空集合でない場合には、ステップ１２２へ移行して、発話候補選択部１６が、集合３を発話候補として選択して出力する。集合３が空集合の場合には、ステップ１２４へ移行し、発話候補選択部１６が、空集合を出力し、発話選択処理を終了する。 Next, in step 112, the utterance candidate selection unit 16 determines whether or not the set 1 is an empty set. If the set 1 is not an empty set, the process proceeds to step 114, and the utterance candidate selection unit 16 selects and outputs the set 1 as the utterance candidate set. When the set 1 is an empty set, the process proceeds to step 116, and the utterance candidate selection unit 16 determines whether or not the set 2 is an empty set. If the set 2 is not an empty set, the process proceeds to step 118, where the utterance candidate selection unit 16 selects the set 2 as an utterance candidate and outputs it. When the set 2 is an empty set, the process proceeds to step 120, and the utterance candidate selection unit 16 determines whether or not the set 3 is an empty set. If the set 3 is not an empty set, the process proceeds to step 122, where the utterance candidate selection unit 16 selects the set 3 as an utterance candidate and outputs it. When the set 3 is an empty set, the process proceeds to step 124, where the utterance candidate selection unit 16 outputs the empty set and ends the utterance selection process.

以上説明したように、第１の実施の形態に係る発話選択装置によれば、抽出単語と抽出単語の関連語とを検索クエリとして検索された発話集合から発話を選択するため、対話システムで用いる発話として、質の高い発話を低コストで選択することができる。 As described above, according to the utterance selection device according to the first embodiment, an utterance is selected from an utterance set searched by using an extracted word and a related word of the extracted word as a search query. As an utterance, a high quality utterance can be selected at low cost.

＜第２の実施の形態＞
次に、第２の実施の形態について説明する。第２の実施の形態に係る対話装置２０は、ＣＰＵと、ＲＡＭと、対話処理を実行するためのプログラムを記憶したＲＯＭとを備えたコンピュータで構成される。このコンピュータは、機能的には、図１０に示すように、発話受付部２５、発話候補集合選択部２６、及び発話出力部２７を含んだ構成で表すことができる。発話候補集合選択部２６には、第１の実施の形態に係る発話選択装置１０を構成する各部が含まれる。すなわち、第２の実施の形態に係る対話装置２０は、第１の実施の形態に係る発話選択装置１０を組み込んだ対話装置である。 <Second Embodiment>
Next, a second embodiment will be described. The dialogue apparatus 20 according to the second embodiment is configured by a computer including a CPU, a RAM, and a ROM that stores a program for executing dialogue processing. Functionally, this computer can be represented by a configuration including an utterance receiving unit 25, an utterance candidate set selecting unit 26, and an utterance output unit 27, as shown in FIG. The utterance candidate set selection unit 26 includes each unit constituting the utterance selection device 10 according to the first embodiment. That is, the dialogue device 20 according to the second embodiment is a dialogue device in which the utterance selection device 10 according to the first embodiment is incorporated.

発話受付部２５は、ユーザの発話を入力として受け取る。ユーザの発話は音声データとして入力されてもよい。ユーザの発話が音声データとして入力される場合には、発話受付部２５は、受け取った音声データであるユーザの発話を、音声認識によりテキスト化する。発話受付部２５は、ユーザの発話を示すテキストデータに対して形態素解析を行い、発話中に含まれる品詞が名詞の単語を抽出する。品詞が名詞の単語が複数存在する場合には、その中から一つを選択する。一つを選択する方法は、例えば、受け付けたユーザの発話の一番最後に出現する品詞が名詞の単語を選択するなど、予め定めたルールに従って選択することができる。そして、抽出した単語及び直前のユーザの発話を発話候補集合選択部２６へ受け渡す。 The utterance reception unit 25 receives a user's utterance as an input. The user's utterance may be input as voice data. When the user's utterance is input as voice data, the utterance reception unit 25 converts the user's utterance, which is the received voice data, into text by voice recognition. The utterance reception unit 25 performs morphological analysis on the text data indicating the user's utterance, and extracts words whose part of speech included in the utterance is a noun. If there are a plurality of words whose part of speech is a noun, one of them is selected. The method of selecting one can be selected according to a predetermined rule, for example, selecting a word whose part of speech that appears at the end of the received user's utterance is a noun. Then, the extracted word and the utterance of the previous user are transferred to the utterance candidate set selection unit 26.

発話候補集合選択部２６は、第１の実施の形態に係る発話選択装置１０と同様に、発話候補集合を選択して出力する。 The utterance candidate set selection unit 26 selects and outputs an utterance candidate set in the same manner as the utterance selection apparatus 10 according to the first embodiment.

発話出力部２７は、発話候補集合選択部２６から出力された発話候補集合から一つの発話をランダムに選択し、ユーザにシステム発話として出力する。なお、発話候補集合選択部２６から空集合が返された場合には、別途、発話候補が得られない場合の発話のリストを用意しておき、その中から一つ以上の発話を選択して出力する。 The utterance output unit 27 randomly selects one utterance from the utterance candidate set output from the utterance candidate set selection unit 26, and outputs it to the user as a system utterance. When an empty set is returned from the utterance candidate set selection unit 26, a list of utterances when no utterance candidates are obtained is prepared separately, and one or more utterances are selected from the list. Output.

例えば、対話装置２０において、以下の様な対話がユーザとシステムとの間で実現できる。最初の「こんにちは」はシステムのプロンプト（初期発話）である。「ラーメン」が抽出単語として発話候補集合選択部２６に入力され、その結果得られる発話候補集合から発話が選択され、雑談が実現されている。「幸楽苑のラーメンは美味しいよ」はユーザの直前発話に「美味しい」があるため、優先的に発話候補とされたものである。 For example, in the dialogue apparatus 20, the following dialogue can be realized between the user and the system. The first "Hello" is a system prompt (initial utterance). “Ramen” is input to the utterance candidate set selection unit 26 as an extracted word, and an utterance is selected from the utterance candidate set obtained as a result, thereby realizing a chat. “Korakuen's ramen is delicious” is preferentially considered as an utterance candidate because there is “delicious” in the utterance immediately before the user.

対話システム（ｓｙｓｔｅｍ）とユーザ（ｙｏｕ）の対話例：
［ｓｙｓｔｅｍ］：こんにちは
［ｙｏｕ］：ラーメンが好きなんです。
［ｓｙｓｔｅｍ］：ラーメンは味噌ラーメンですよね
［ｙｏｕ］：そうだね。
［ｓｙｓｔｅｍ］：ラーメン食べたいな
［ｙｏｕ］：美味しいよね
［ｓｙｓｔｅｍ］：幸楽苑のラーメンは美味しいよ
［ｙｏｕ］：本当ですか？ Dialogue system (system) and user (you) dialogue example:
[System]: Hello [you]: I ramen is love.
[System]: Ramen is miso ramen [you]: That's right.
[System]: I want to eat ramen [you]: It's delicious [system]: Koraku ramen is delicious [you]: Is it true?

このように、第１の実施の形態に係る発話選択装置を対話装置に組み込むことで、種々の話題について、低コストで質の高い発話を行う雑談対話システムが実現できる。雑談は人間同士では会話の潤滑油として用いられる。質の高い雑談を実現可能にすることによって、人間とシステムとのやりとりがより円滑になり、コンピュータとユーザとの共同作業の効率が高まる。 As described above, by incorporating the utterance selection device according to the first embodiment into a dialogue device, it is possible to realize a chat dialogue system that performs high-quality utterances on various topics at low cost. Chat is used as a lubricant for conversation between humans. By enabling high-quality chat, human interaction with the system becomes smoother and the efficiency of collaboration between computers and users increases.

なお、第２の実施の形態では、対話装置に発話選択装置が組み込まれた構成について説明したが、発話受付部及び発話出力部を含む対話装置と、対話装置の外部装置として構成された発話選択装置との間で通信を行って、ユーザから受け付けた発話に対する応答を出力する対話システムとして構成してもよい。 In the second embodiment, the configuration in which the utterance selection device is incorporated in the dialog device has been described. However, the dialog device including the utterance reception unit and the utterance output unit, and the utterance selection configured as an external device of the dialog device. You may comprise as an interactive system which communicates between apparatuses and outputs the response with respect to the speech received from the user.

また、本発明は、上述した実施形態に限定されるものではなく、この発明の要旨を逸脱しない範囲内で様々な変形や応用が可能である。 The present invention is not limited to the above-described embodiments, and various modifications and applications can be made without departing from the gist of the present invention.

また、本願明細書中において、プログラムが予めインストールされている実施形態として説明したが、当該プログラムを、コンピュータ読み取り可能な記録媒体に格納して提供することも可能である。 In the present specification, the embodiment has been described in which the program is installed in advance. However, the program can be provided by being stored in a computer-readable recording medium.

１０発話選択装置
１１単語入力部
１２関連語取得部
１３発話検索部
１４フィルタリング部
１５クラスタリング部
１６発話候補選択部
２０対話装置
２１対話履歴記憶部
２２関連語辞書
２３発話データベース
２５発話受付部
２６発話候補集合選択部
２７発話出力部 DESCRIPTION OF SYMBOLS 10 utterance selection apparatus 11 word input part 12 related word acquisition part 13 utterance search part 14 filtering part 15 clustering part 16 utterance candidate selection part 20 dialogue apparatus 21 dialogue history storage part 22 related word dictionary 23 utterance database 25 utterance reception part 26 utterance candidate Set selection unit 27 Utterance output unit

Claims

At least one extracted from a user's utterance from a dictionary in which related words related to the word are collected and associated with the word based on a language pattern composed of a word and part-of-speech sequence, together with the word. A related word acquisition unit that acquires related words related to one extracted word;
An utterance search unit for searching for an utterance corresponding to a search query including a related word related to the extracted word and the extracted word acquired from the extracted word and the related word acquiring unit from an utterance database including a plurality of utterances;
An utterance selection unit that selects at least one utterance from the utterances searched by the utterance search unit;
Utterance selection device including

The related word acquisition unit acquires, as the dictionary, a related word related to the extracted word using a dictionary in which the related word is associated with a degree of association with the word,
2. The utterance selection device according to claim 1, wherein the utterance search unit includes, in the search query, a related word having a relevance level equal to or higher than a predetermined value among related words related to the extracted word.

The utterance selection apparatus according to claim 2, wherein the degree of association is a degree of co-occurrence between the word and the related word.

The utterance selection device according to claim 2, wherein the degree of relevance is a degree at which the related word appears in an utterance subsequent to an utterance including the word.

The utterance selection unit excludes utterances including words other than stop words predetermined as words that do not affect the extracted words, the related words, and utterance contents from utterances searched by the utterance search unit. The utterance selection device according to any one of claims 1 to 4.

The utterance selection unit clusters the utterances searched by the utterance search unit based on the related word or the stop word included in an utterance, and selects a representative utterance of each cluster. Utterance selection device.

The utterance selection unit clusters the utterances searched by the utterance search unit based on the related word or the stop word included in the utterance, and represents a cluster whose number of utterances included in each cluster is a predetermined number or more. The utterance selection device according to claim 5 which selects a typical utterance.

The utterance selection unit acquires at least the utterance immediately before the user, and selects an utterance including a word included in the immediately previous utterance from utterances searched by the utterance search unit. The utterance selection device according to claim 1.

An utterance reception unit for receiving user utterances;
An utterance output unit that outputs any of the utterances selected by the utterance selection device according to any one of claims 1 to 8;
Interactive device including

The related word acquisition unit collects related words related to the word based on a language pattern including a word and a part of speech that is predetermined together with the word, and the user's utterance Obtaining a related word related to at least one extracted word extracted from;
An utterance search unit searches an utterance corresponding to a search query including a related word related to the extracted word and the extracted word acquired by the related word acquiring unit from an utterance database including a plurality of utterances;
An utterance selection unit selecting at least one utterance from the utterances searched by the utterance search unit;
Utterance selection method including

An utterance accepting unit accepting a user's utterance;
An utterance output unit outputting any of the utterances selected by the utterance selection method according to claim 10;
Interactive method including

The utterance selection program for functioning a computer as each part which comprises the utterance selection apparatus of any one of Claims 1-8.