JP2023149601A

JP2023149601A - Inference apparatus, inquiry reply apparatus, interactive apparatus, and inference method

Info

Publication number: JP2023149601A
Application number: JP2022058254A
Authority: JP
Inventors: 健太郎鳥澤; Kentaro Torisawa; ジュリアンクロエツェー; Kloetzer Julien; 淳太水野; Junta MIZUNO; 龍飯田; Ryu Iida; 清敬大竹; Kiyotaka Otake; 鍾勲呉; Jong Hoon Oh
Original assignee: National Institute of Information and Communications Technology
Current assignee: National Institute of Information and Communications Technology
Priority date: 2022-03-31
Filing date: 2022-03-31
Publication date: 2023-10-13
Also published as: WO2023188827A1

Abstract

To provide an inference apparatus that can operate at a high speed and with a sufficient accuracy using few computational resources.SOLUTION: An inference apparatus 50 includes a first neural net 80 that outputs a vector representation of a first input, and a second neural net that outputs a vector representation of a second input, and when there is a predetermined relation between the vector representation of the first and second inputs using learning data of the first and second inputs which has a predetermined relation, causes the first and second neural nets to be learned so as to be located close to each other in a vector space, and clusters the vector representation which is an output of the learned second neural net. The inference apparatus 50 further includes a database 84 constructed in advance so as to enable retrieval extraction of clusters on the basis of the vector representation of the first input, and infers an output on the basis of information 88 of the clusters retrieved and extracted from the database 84 on the basis of the vector representation of the input by the first neural net 80 with respect to an input 60.SELECTED DRAWING: Figure 1

Description

この発明は自然言語による推論装置、質問回答装置、対話装置、及び推論方法に関する。 The present invention relates to a natural language reasoning device, question answering device, dialogue device, and reasoning method.

従来知られている質問回答システムに、本件出願人が開発しウェブ上において提供しているシステム（ＷＩＳＤＯＭＸ）がある。このシステムにおいては、主として入力された質問からキーワード群となる複数の内容語を抽出する。このキーワード群に基づき、インターネット等から収集したパッセージ（連続する７つ程度の文からなる文のまとまり）をいくつか選択する。得られたパッセージ群を質問とともにニューラルネットワークに入力することにより、質問に対する回答を含むか否かという観点からパッセージが分類される。回答が含まれると判断されたパッセージに関しては、そこから質問に対する回答となるフレーズ及び単語などを抽出し整形して回答を出力する。 A conventionally known question answering system is a system (WISDOM X) developed by the applicant of the present invention and provided on the web. This system mainly extracts a plurality of content words, which form a keyword group, from an input question. Based on this keyword group, several passages (groups of sentences consisting of about seven consecutive sentences) collected from the Internet and the like are selected. By inputting the obtained passage group together with the questions into the neural network, the passages are classified from the viewpoint of whether or not they contain answers to the questions. For passages that are determined to contain answers, phrases and words that serve as answers to questions are extracted from them, formatted, and output as answers.

質問回答システムの性格上、質問に対する回答を早期に行う必要がある。そのために、上記したシステムにおいては、あらかじめウェブをクロールしてウェブ上のデータをローカルの記憶装置に蓄積しておく。上記したキーワードベースの検索エンジンの検索範囲は、この記憶装置に記憶された情報であり、その結果、大量の回答候補が得られる。そのため、各回答候補が質問に対する回答を含むか否かの判定処理には大量のデータ処理が伴う。さらに、この処理の対象となるパッセージの各々がある程度のデータ量を持つ上、自然言語処理が必要となる関係上、大規模ニューラルネットワークが処理に使用される。その結果、最終的に行うべきデータ処理の総量は非常に大きなものとなる。したがって、従来の質問回答システムを稼働させるためには大きな計算資源が必要だったという問題がある。 Due to the nature of the question-and-answer system, it is necessary to answer questions quickly. To this end, in the above system, the web is crawled in advance and data on the web is stored in a local storage device. The search range of the keyword-based search engine described above is the information stored in this storage device, resulting in a large number of answer candidates. Therefore, the process of determining whether each answer candidate includes an answer to the question involves processing a large amount of data. Furthermore, since each passage to be processed has a certain amount of data and requires natural language processing, a large-scale neural network is used for the processing. As a result, the total amount of data processing that must be ultimately performed becomes extremely large. Therefore, there is a problem in that large computational resources are required to operate the conventional question answering system.

こうした事情は、質問回答システムに限らない。本件出願人は、質問に限らず一般的な入力に対して応答を行う対話システムも開発している。この対話システムにおいては、応答候補を作成するために、入力から複数の質問を生成し、その質問を上記した質問回答システムに投入してそれぞれ複数の回答を得る。その後、それらの回答から入力発話に対する応答を生成し、更には生成された複数の応答から最適な応答を一つ選ぶ処理を行っている。そのために必要なデータ処理の量は、質問回答システムのためのデータ処理の量を上回る。その結果、対話システムを稼働させるためには非常に大きな計算資源が必要だったという問題がある。 These circumstances are not limited to question-and-answer systems. The applicant has also developed a dialogue system that responds not only to questions but also to general input. In this dialog system, in order to create response candidates, a plurality of questions are generated from input, and the questions are input into the above-described question answering system to obtain a plurality of answers. Thereafter, a response to the input utterance is generated from those answers, and further processing is performed to select one optimal response from the plurality of generated responses. The amount of data processing required for this exceeds the amount of data processing for a question answering system. As a result, there is a problem in that very large computational resources are required to operate the dialogue system.

それ故にこの発明は、従来のものより少ない計算資源により高速に、かつ十分な精度をもって動作可能な推論装置、質問回答装置、対話装置、及び推論方法を提供することである。 Therefore, it is an object of the present invention to provide an inference device, a question answering device, a dialogue device, and an inference method that can operate at high speed and with sufficient accuracy using fewer computational resources than conventional ones.

この発明の第１の局面に係る推論装置は、第１の入力が供給され、この第１の入力のベクトル表現を出力する第１のニューラルネットワークと、第２の入力が供給され、この第２の入力のベクトル表現を出力する第２のニューラルネットワークとを含み、少なくとも、所定の関係にある第１及び第２の入力の学習データを用いて、第１の入力のベクトル表現と第２の入力のベクトル表現が、所定の関係にある場合に、ベクトル空間において近接して位置するように第１及び第２のニューラルネットワークを学習させ、学習済みの第２のニューラルネットワークの出力であるベクトル表現を、ベクトル空間上の位置に基づきクラスタ化し、第１の入力のベクトル表現に基づき、クラスタの検索抽出が可能なようにあらかじめ構築されたデータベースをさらに含み、装置への入力に対して第１のニューラルネットワークによる入力のベクトル表現に基づいてデータベースから検索抽出されたクラスタの情報に基づき、所定の関係にある出力を推論する。 An inference device according to a first aspect of the invention includes a first neural network supplied with a first input and outputting a vector representation of the first input, and a first neural network supplied with a second input and outputting a vector representation of the first input. a second neural network that outputs a vector representation of the input of the first input and a second neural network that outputs the vector representation of the first input and the second input using at least learning data of the first and second inputs having a predetermined relationship. The first and second neural networks are trained to be located close to each other in the vector space when the vector representations of , further includes a database constructed in advance to enable clustering based on the position on the vector space and search and extraction of clusters based on the vector representation of the first input; Outputs having a predetermined relationship are inferred based on cluster information searched and extracted from a database based on the vector representation of input by the network.

好ましくは、データベースは、それぞれのクラスタに含まれる出力のベクトル表現のセントロイドを用いて検索抽出される。 Preferably, the database is searched using a centroid of vector representations of the outputs included in each cluster.

より好ましくは、第１のニューラルネットワークは、第１の入力とこの第１の入力と所定の関係にある第２の入力の属するクラスタに関連したベクトル表現とに基づき、追加の学習がなされる。 More preferably, the first neural network is additionally trained based on a first input and a vector representation associated with a cluster to which a second input having a predetermined relationship with the first input belongs.

この発明の第２の局面に係る質問回答装置は、上記したいずれかの推論装置を含み、所定の関係は、質問とその質問に対する回答を含むものである。 A question answering device according to a second aspect of the invention includes any of the above-mentioned inference devices, and the predetermined relationship includes a question and an answer to the question.

この発明の第３の局面に係る対話装置は、上記したいずれかの推論装置を含み、所定の関係は、発話とその発話に対する応答を含むものである。 A dialogue device according to a third aspect of the invention includes any of the above-mentioned inference devices, and the predetermined relationship includes an utterance and a response to the utterance.

この発明の第４の局面に係る推論方法は、第１の入力が供給され、この第１の入力のベ
クトル表現を出力する第１のニューラルネットワークと、第２の入力が供給され、この第２の入力のベクトル表現を出力する第２のニューラルネットワークとを準備するステップと、少なくとも、所定の関係にある第１及び第２の入力の学習データを用いて、第１の入力のベクトル表現と第２の入力のベクトル表現が、所定の関係にある場合に、ベクトル空間において近接して位置するように第１及び第２のニューラルネットワークを学習させるステップと、学習済みの第２のニューラルネットワークの出力であるベクトル表現を、ベクトル空間上の位置に基づきクラスタ化し、第１の入力のベクトル表現に基づき、クラスタの検索抽出が可能なようにあらかじめデータベースを構築するステップと、入力に対して第１のニューラルネットワークによる入力のベクトル表現に基づいてデータベースから検索抽出されたクラスタの情報に基づき、所定の関係にある出力を推論するステップとを含む。 An inference method according to a fourth aspect of the invention includes a first neural network that is supplied with a first input and outputs a vector representation of the first input; a second neural network that outputs a vector representation of an input; and a second neural network that outputs a vector representation of an input of a step of training the first and second neural networks so that the vector representations of the two inputs are located close to each other in the vector space when they have a predetermined relationship; and an output of the trained second neural network. Clustering the vector representations based on the positions on the vector space, and constructing a database in advance to enable search and extraction of clusters based on the vector representation of the first input; and inferring an output having a predetermined relationship based on cluster information searched and extracted from a database based on a vector representation of an input by the neural network.

この発明の第５の局面に係る質問回答装置は、複数レコードを含むデータベースを含み、複数レコードの各々は、質問に対する回答候補のリンク先と、当該回答候補のベクトル表現が属するクラスタの識別子とを含み、質問文が入力されたことに応答して、質問文を、当該質問文のベクトル表現である質問ベクトルに変換するためのニューラルネットワークと、回答候補の意味的表現ベクトルのクラスタの中で、当該クラスタの代表ベクトルが質問ベクトルと最も近い所定個数のクラスタを選択するクラスタ選択手段と、データベースにおいて、クラスタ選択手段により選択された所定個数のクラスタのいずれかの識別子を持つレコードに含まれるリンク先から、それぞれ回答候補を収集するための回答候補収集手段と、回答候補収集部により収集された回答候補の中から、所定の手順で質問文に対する回答を選択するための回答選択手段とを含む。 A question answering device according to a fifth aspect of the invention includes a database including a plurality of records, each of which includes a link destination of an answer candidate for a question and an identifier of a cluster to which a vector representation of the answer candidate belongs. a neural network for converting a question into a question vector that is a vector representation of the question in response to input of a question; and a cluster of semantic expression vectors of answer candidates. Cluster selection means for selecting a predetermined number of clusters whose representative vector of the cluster is closest to the question vector; and a link destination included in a record having an identifier of one of the predetermined number of clusters selected by the cluster selection means in the database. , and an answer selection means for selecting an answer to the question text in a predetermined procedure from among the answer candidates collected by the answer candidate collecting section.

好ましくは、質問回答装置はさらに、質問文を発した質問者の発話履歴を記憶するための発話履歴記憶手段と、発話履歴記憶手段と、質問文が入力されたことに応答して、当該質問を発した質問者の過去の発話履歴の１又は複数のトピック候補を発話履歴記憶手段に記憶された発話履歴に基づいて特定するためのトピック特定手段と、質問文がニューラルネットワークに入力されるに先立って、トピック特定手段により特定された１又は複数のトピックを表す情報を質問文に付加するためのトピック付加手段とをさらに含む。 Preferably, the question answering device further includes an utterance history storage means for storing the utterance history of the questioner who has uttered the question, and an utterance history storage means for storing the utterance history of the questioner who has uttered the question. topic identification means for identifying one or more topic candidates in the past utterance history of the questioner who uttered the question based on the utterance history stored in the utterance history storage means; The method further includes topic adding means for adding information representing one or more topics specified by the topic specifying means to the question text.

より好ましくは、回答候補の各々は、いずれも複数の連続する文を含む。 More preferably, each of the answer candidates includes a plurality of consecutive sentences.

この発明の第６の局面に係る対話装置は、上記した質問回答装置と、入力される発話のトピックを推定するトピック推定手段と、入力される発話にトピック推定手段により推定されたトピックを示す情報を付加し、質問として質問回答装置に入力するトピック付加手段と、発話に対する質問回答装置の出力を対話にふさわしく整形することにより、入力される発話に対する応答を生成するための応答生成手段とを含む。 A dialogue device according to a sixth aspect of the invention includes the above-described question answering device, topic estimation means for estimating the topic of an input utterance, and information indicating the topic estimated by the topic estimation means for the input utterance. and a topic adding means for adding a question to the question answering device and inputting it as a question to the question answering device, and a response generating means for generating a response to the input utterance by formatting the output of the question answering device in response to the utterance to be suitable for dialogue. .

この発明の第７の局面に係る質問回答用モデルの訓練方法は、複数の質問文と、当該複数の質問文の各々に対する回答候補群とから、質問文と当該質問文に対応する回答候補との組み合わせからなる正例と、質問文と当該質問文に対応しない回答候補との組み合わせからなる負例とを生成することにより、学習データを準備するステップと、質問文を、当該質問文のベクトル表現である質問ベクトルに変換するための質問変換用ニューラルネットワークと、回答候補を、当該回答候補のベクトル表現である回答候補ベクトルに変換するための回答変換用ニューラルネットワークとを、Siameseネットワークにより訓練するステップと、訓練するステップにより訓練された回答変換用ニューラルネットワークを用いて、回答候補群に含まれる回答候補を、当該回答候補のベクトル表現である回答候補ベクトルに変換するステップと、変換するステップにより生成された回答候補ベクトルを所定個数のクラスタにクラスタリングし、回答候補の各々に、当該回答候補が属するクラスタの識別子を付与するステップと、複数の質問文の各々について、当該質問に対応する回答候補を含むクラスタの識別子を対応付けるステップと、複数の質問文の各々について質問変換用ニューラルネットワークの出力する質問ベクトルと、当該質問文に対応する回答候補が属するクラスタとの距離を表す所定の指標が小さくなるように質問変換用ニューラルネットワークの追加学習をするステップとを含む。 A method for training a question-answering model according to a seventh aspect of the present invention is based on a plurality of question sentences and a group of answer candidates for each of the plurality of question sentences. A step of preparing learning data by generating a positive example consisting of a combination of a question sentence and a negative example consisting of a combination of a question sentence and an answer candidate that does not correspond to the question sentence; A question conversion neural network for converting into a question vector, which is a representation, and an answer conversion neural network, which converts an answer candidate into an answer candidate vector, which is a vector representation of the answer candidate, are trained using a Siamese network. and converting an answer candidate included in the answer candidate group into an answer candidate vector that is a vector representation of the answer candidate using the answer conversion neural network trained in the step of training. Clustering the generated answer candidate vectors into a predetermined number of clusters, assigning to each answer candidate an identifier of the cluster to which the answer candidate belongs, and determining an answer candidate corresponding to the question for each of the plurality of question sentences. , and a predetermined index representing the distance between the question vector output by the question conversion neural network for each of the plurality of question sentences and the cluster to which the answer candidate corresponding to the question belongs is small. and a step of additionally learning the question conversion neural network so that the question conversion neural network becomes

好ましくは、質問回答用モデルの訓練方法はさらに、学習データを準備するステップに先立って、インターネットから複数の質問文を収集するステップを含む。 Preferably, the method for training a question answering model further includes the step of collecting a plurality of question sentences from the Internet, prior to the step of preparing training data.

より好ましくは、収集するステップは、学習データを準備するステップに先立って、インターネットから複数の質問文を、その前又は後の文とともに収集するステップと、複数の質問文の各々について、当該質問文について収集された前又は後の文に基づいて、質問文の関連するトピックを推定するステップとを含み、学習データを準備するステップは、複数の質問文の各々に、当該質問文の関連するトピックを付与するステップと、トピックが付与された複数の質問文と、回答候補群とから、質問文と当該質問文に対応する回答候補との組み合わせからなる正例と、質問文と当該質問文に対応しない回答候補との組み合わせからなる負例とを生成することにより、学習データを準備するステップとを含む。 More preferably, the collecting step includes, prior to the step of preparing the learning data, collecting a plurality of question sentences from the Internet together with sentences before or after the question sentences, and for each of the plurality of question sentences, the question sentence estimating a topic related to the question based on previous or subsequent sentences collected for the question, and the step of preparing learning data includes estimating a topic related to the question for each of the plurality of question sentences. A positive example consisting of a combination of a question sentence and an answer candidate corresponding to the question sentence, a positive example consisting of a combination of a question sentence and an answer candidate corresponding to the question sentence, from a plurality of question sentences to which topics have been assigned, and an answer candidate group, and preparing learning data by generating negative examples consisting of combinations with non-corresponding answer candidates.

さらに好ましくは、質問回答用モデルの訓練方法はさらに、学習データを準備するステップに先立って、インターネットから複数の質問文の各々に対する回答候補群を収集するステップと、収集された回答候補群に含まれる回答候補の各々に関連付けて、当該回答候補のインターネット上のＵＲＬを記憶するステップとを含み、方法はさらに、回答候補の各々について、当該回答候補が属するクラスタの識別子と、当該回答候補のインターネット上のＵＲＬとを含む新たなレコードをデータベースに追加するステップを含む。 More preferably, the method for training a question answering model further includes, prior to the step of preparing learning data, collecting a group of answer candidates for each of the plurality of question sentences from the Internet; storing, for each candidate answer, an identifier of a cluster to which the candidate answer belongs, and a URL on the Internet of the candidate answer, in association with each of the candidate answers. and adding a new record to the database containing the above URL.

この発明の上記及び他の目的、特徴、局面及び利点は、添付の図面と関連して理解されるこの発明に関する次の詳細な説明から明らかとなるであろう。 The above and other objects, features, aspects and advantages of the present invention will become apparent from the following detailed description of the invention, understood in conjunction with the accompanying drawings.

図１は、この発明の第１実施形態に係る質問回答装置の機能的ブロック図である。FIG. 1 is a functional block diagram of a question answering device according to a first embodiment of the present invention. 図２は、第１実施形態に係る質問回答装置において、質問ＢＥＲＴの学習を行う学習装置の機能的ブロック図である。FIG. 2 is a functional block diagram of a learning device that learns the question BERT in the question answering device according to the first embodiment. 図３は、図２に示すＢＥＲＴ学習部の機能的ブロック図である。FIG. 3 is a functional block diagram of the BERT learning section shown in FIG. 2. 図４は、追加学習前の質問ＢＥＲＴにより生ずることのある問題を示す模式図である。FIG. 4 is a schematic diagram illustrating problems that may occur due to question BERT before additional learning. 図５は、追加学習した後の質問ＢＥＲＴによる問題の解消を示す模式図である。FIG. 5 is a schematic diagram illustrating problem solving by question BERT after additional learning. 図６は、図２に示す追加学習部の機能的ブロック図である。FIG. 6 is a functional block diagram of the additional learning section shown in FIG. 2. 図７は、この発明の第２実施形態に係る対話装置の機能的ブロック図である。FIG. 7 is a functional block diagram of an interaction device according to a second embodiment of the invention. 図８は、図２実施形態に係る対話装置において、対話履歴管理部の学習を行う学習装置の機能的ブロック図である。FIG. 8 is a functional block diagram of a learning device that performs learning of the dialog history management section in the dialog device according to the embodiment in FIG. 図９は、図８に示す追加学習部の機能的ブロック図である。FIG. 9 is a functional block diagram of the additional learning section shown in FIG. 8. 図１０は、この発明の第１実施形態に係る質問回答装置５０及び第２実施形態に係る対話装置３５０並びにそれらにおいて使用されるニューラルネットワークの学習を行う学習装置を実現するコンピュータシステムの外観を示す図である。FIG. 10 shows the external appearance of a computer system that realizes a question answering device 50 according to a first embodiment of the present invention, an interaction device 350 according to a second embodiment, and a learning device for learning a neural network used therein. It is a diagram. 図１１は、図１０に示すコンピュータシステムのハードウェア構成を示すブロック図である。FIG. 11 is a block diagram showing the hardware configuration of the computer system shown in FIG. 10.

以下の説明及び図面においては、同一の部品には同一の参照番号を付してある。したがって、それらについての詳細な説明は繰返さない。 In the following description and drawings, identical parts are provided with the same reference numerals. Therefore, detailed description thereof will not be repeated.

第１第１実施形態
１．構成
Ａ．質問回答装置
図１に、この発明の第１実施形態に係る質問回答装置５０の機能的ブロック図を示す。図１を参照して、質問回答装置５０は、質問６０を受け取り、インターネット６２から回答候補となるパッセージ群を検索するための回答候補検索部６４と、回答候補検索部６４により検索されたパッセージ群を記憶するためのパッセージＤＢ（Ｄａｔａｂａｓｅ）６６とを含む。パッセージＤＢ６６に記憶されるパッセージ群は、従来のものと比較して限定された数である。質問回答装置５０はさらに、パッセージＤＢ６６に記憶されたパッセージ群に基づき、従来の手法と同様の手法を用いて質問６０に対する回答７０を生成し出力するための回答生成部６８を含む。回答生成部６８が回答７０を生成する手法は、この実施形態においては従来技術と全く同様である。ただし、回答生成部６８が処理する対象となるパッセージの数は、従来技術と比較してはるかに少ない。 1st Embodiment 1. Configuration A. Question Answering Device FIG. 1 shows a functional block diagram of a question answering device 50 according to a first embodiment of the present invention. Referring to FIG. 1, a question answering device 50 receives a question 60, and includes an answer candidate search section 64 for searching a passage group that becomes an answer candidate from the Internet 62, and a passage group searched by the answer candidate search section 64. and a passage DB (Database) 66 for storing. The number of passage groups stored in the passage DB 66 is limited compared to the conventional one. The question answering device 50 further includes an answer generation unit 68 for generating and outputting an answer 70 to the question 60 based on the passage group stored in the passage DB 66 using a method similar to a conventional method. The method by which the answer generation unit 68 generates the answer 70 in this embodiment is exactly the same as in the prior art. However, the number of passages to be processed by the answer generation unit 68 is much smaller than in the prior art.

この実施形態の特徴は、質問６０を受けた時点においてインターネット６２から検索するパッセージの数が限定されている点である。したがってパッセージＤＢ６６に記憶されるパッセージの数は少なく、回答生成部６８が必要とする計算資源も従来と比較してはるかに少なく済む。 A feature of this embodiment is that the number of passages to be searched from the Internet 62 at the time the question 60 is received is limited. Therefore, the number of passages stored in the passage DB 66 is small, and the calculation resources required by the answer generation section 68 are also far fewer than in the past.

回答候補検索部６４は、質問６０を変換し、質問６０を表現するベクトルである質問ベクトル８２を出力するための質問ＢＥＲＴ８０を含む。質問ＢＥＲＴ８０は、あらかじめ学習済みだが、その学習については図２以降を参照して後述する。ここで、質問ＢＥＲＴ等は、ニューラルネットワークのＢｉｄｉｒｅｃｔｉｏｎａｌＥｎｃｏｄｅｒＲｅｐｒｅｓｅｎｔａｔｉｏｎｆｒｏｍＴｒａｎｓｆｏｒｍｅｒｓを含むものであり、ＢＥＲＴ、ＲｏＢＥＲＴａ等、ＴｒａｎｓｆｏｒｍｅｒＥｎｃｏｄｅｒを含むニューラルネットワークであり、一般的に、言語資源による事前学習を経て、タスクに応じたファインチューニングの後に利用される。この実施形態におけるファインチューニング等については、後述する。 The answer candidate search unit 64 includes a question BERT 80 for converting the question 60 and outputting a question vector 82 that is a vector expressing the question 60. The question BERT80 has been trained in advance, and the learning will be described later with reference to FIG. 2 and subsequent figures. Here, the question BERT etc. includes a neural network Bidirectional Encoder Representation from Transformers, and is a neural network including Transformer encoders such as BERT, RoBERTa, etc., and generally uses pre-learning using language resources. to the task Used after appropriate fine tuning. Fine tuning and the like in this embodiment will be described later.

回答候補検索部６４はさらに、セントロイドＤＢ８４を含む。セントロイドＤＢ８４は質問ＢＥＲＴ８０などの訓練過程において同時に生成される。セントロイドＤＢ８４を生成する方法については学習の説明の際にあわせて後述する。簡単に言えば、セントロイドＤＢ８４は、様々な質問に対する回答を含む多くのパッセージをベクトル化し、クラスタリングして、そのセントロイドをデータベース化したものである。セントロイドＤＢ８４の各レコードは、そのセントロイドが代表するクラスタのクラスタ識別子と、そのセントロイドのベクトル空間上の位置を表すベクトルとを含む。パッセージをベクトル化したものをパッセージベクトルと呼ぶ。パッセージベクトルと質問ベクトル８２とは同じ次元数のベクトルである。 The answer candidate search unit 64 further includes a centroid DB 84. The centroid DB 84 is generated simultaneously during the training process such as the query BERT 80. The method for generating the centroid DB 84 will be described later along with the explanation of learning. Simply put, the centroid DB 84 vectorizes and clusters many passages containing answers to various questions, and creates a database of centroids. Each record in the centroid DB 84 includes the cluster identifier of the cluster represented by the centroid and a vector representing the position of the centroid in the vector space. A vectorized passage is called a passage vector. The passage vector and the question vector 82 are vectors with the same number of dimensions.

回答候補検索部６４はさらに、質問ベクトル８２を受けて、ベクトル空間内において質問ベクトル８２に最も近い所定個数のセントロイドのレコードをセントロイドＤＢ８４において検索し、検索された所定個数のセントロイドのクラスタ識別子８８をそれぞれ出力するための回答候補クラスタ特定部８６を含む。これらクラスタ識別子８８により表される各クラスタには、質問６０に対する回答を含む可能性が高いパッセージに対応するパッセージベクトルが含まれることが想定されている。なぜそのようになるかについては、質問ＢＥＲＴ８０の訓練に関する説明において明らかにする。 The answer candidate search unit 64 further receives the question vector 82, searches the centroid DB 84 for records of a predetermined number of centroids closest to the question vector 82 in the vector space, and searches for clusters of the predetermined number of searched centroids. It includes an answer candidate cluster identification unit 86 for outputting identifiers 88, respectively. It is assumed that each cluster represented by these cluster identifiers 88 includes a passage vector corresponding to a passage that is likely to include an answer to the question 60. Why this is so will be made clear in the explanation regarding the training of question BERT80.

回答候補検索部６４はさらに、あらかじめ様々な質問について、その回答候補となるパッセージを表す多数のパッセージレコードを記憶するための回答候補ＤＢ９０を含む。回答候補ＤＢ９０の各レコードは、あらかじめ作成された質問に対して、その回答が含まれていると考えられるパッセージをウェブからダウンロードした情報から作成される。より具体的には、各レコードは、パッセージが存在していたロケーションを示すＵＲＬと、そのパッセージベクトルが属するクラスタのクラスタ識別子とを含む。 The answer candidate search unit 64 further includes an answer candidate DB 90 for storing in advance a large number of passage records representing passages that are answer candidates for various questions. Each record in the answer candidate DB 90 is created from information obtained by downloading a passage that is considered to contain an answer to a previously created question from the web. More specifically, each record includes a URL indicating the location where the passage was located and a cluster identifier of the cluster to which the passage vector belongs.

回答候補検索部６４はさらに、回答候補クラスタ特定部８６が出力するクラスタ識別子８８を受け、クラスタ識別子８８と一致するクラスタ識別子を持つレコードを回答候補ＤＢ９０において検索し、検索されたレコードのＵＲＬを回答候補ＵＲＬ群９４として出力するための回答候補検索部９２と、回答候補ＵＲＬ群９４に含まれるＵＲＬの各々にアクセスし、回答候補を含むと思われるパッセージを含むテキストをダウンロードして、パッセージを抽出し、パッセージＤＢ６６に蓄積するためのパッセージ検索部９６とを含む。 The answer candidate search unit 64 further receives the cluster identifier 88 output by the answer candidate cluster specifying unit 86, searches the answer candidate DB 90 for a record having a cluster identifier that matches the cluster identifier 88, and returns the URL of the searched record. Access the answer candidate search unit 92 for output as a candidate URL group 94 and each of the URLs included in the answer candidate URL group 94, download text containing passages that are considered to include answer candidates, and extract the passages. and a passage search unit 96 for storing in the passage DB 66.

前述したとおり、パッセージ検索部９６がアクセスするＵＲＬは、回答候補クラスタ特定部８６により特定されたセントロイドに対応するクラスタに属するものだけである。したがって、従来技術のようにウェブ全体からダウンロードした大量のデータにアクセスする必要はない。選択された個数のクラスタに属するＵＲＬからダウンロードしたものだけがアクセスの対象となる。そのため、回答生成部６８として従来のものと同じものを採用したとしても、必要な記憶容量も、計算資源もはるかに小さくて済む。 As described above, the passage search section 96 accesses only URLs that belong to the cluster corresponding to the centroid specified by the answer candidate cluster specifying section 86. Therefore, there is no need to access large amounts of data downloaded from across the web as in the prior art. Only those downloaded from URLs belonging to the selected number of clusters are to be accessed. Therefore, even if the answer generation section 68 is the same as the conventional one, the required storage capacity and calculation resources will be much smaller.

Ｂ．学習装置
図２に、図１に示す質問ＢＥＲＴ８０の学習を行い、同時にセントロイドＤＢ８４及び回答候補ＤＢ９０の生成を行うための学習装置１５０の機能的構成を示す。図２を参照して、学習装置１５０は、インターネット６２から多数の質問と、各質問に対する回答候補を含む多数のパッセージとを収集するための質問・パッセージ収集部１６０と、質問・パッセージ収集部１６０が収集した質問を記憶するための質問ＤＢ１６２と、質問・パッセージ収集部１６０が収集したパッセージを記憶するためのパッセージＤＢ１６４とを含む。図２ではパッセージＤＢ１６４に蓄積されるパッセージを用いて、ＢＥＲＴの学習とパッセージのクラスタリングを行うようにしているが、これは別のＤＢ、例えばパッセージＤＢ１６４に蓄積されているパッセージの一部からなるＤＢを用いることもできる。 B. Learning Device FIG. 2 shows a functional configuration of a learning device 150 for learning the question BERT 80 shown in FIG. 1 and simultaneously generating the centroid DB 84 and answer candidate DB 90. Referring to FIG. 2, the learning device 150 includes a question/passage collection unit 160 for collecting a large number of questions and a large number of passages including answer candidates for each question from the Internet 62; includes a question DB 162 for storing questions collected by the question/passage collection unit 160, and a passage DB 164 for storing passages collected by the question/passage collection unit 160. In FIG. 2, BERT learning and passage clustering are performed using the passages stored in the passage DB 164, but this is done in another DB, for example, a DB consisting of part of the passages stored in the passage DB 164. You can also use

質問ＤＢ１６２の各レコードは各質問に対応する。各レコードは、例えば、質問識別子と、その質問に対応するパッセージが主として属するクラスタのクラスタ識別子と、質問のテキストと、その質問が存在するインターネット上のＵＲＬとを含む。 Each record in the question DB 162 corresponds to each question. Each record includes, for example, a question identifier, a cluster identifier of the cluster to which the passage corresponding to the question mainly belongs, the text of the question, and the URL on the Internet where the question exists.

質問・パッセージ収集部１６０の各レコードは各パッセージに対応する。各レコードは、パッセージＩＤと、対応する質問ＩＤと、そのパッセージが存在するインターネット上のＵＲＬとを含む。 Each record of the question/passage collection unit 160 corresponds to each passage. Each record includes a passage ID, a corresponding question ID, and a URL on the Internet where the passage exists.

学習装置１５０はさらに、図１に示す質問ＢＥＲＴ８０と同じ構成の、訓練対象の質問ＢＥＲＴ１６８と、質問ＢＥＲＴ１６８と同じ構成の回答候補ＢＥＲＴ１７０と、質問ＤＢ１６２及び質問・パッセージ収集部１６０に記憶されたデータを用いて、質問ＢＥＲＴ１６８及び回答候補ＢＥＲＴ１７０の学習をＳｉａｍｅｓｅＢＥＲＴネットワーク（Siamese BERT Networks）により同時に行うためのＢＥＲＴ学習部１６６とを含む。ＳｉａｍｅｓｅＢＥＲＴネットワークについては、以下の参考文献に記載がある。この実施形態におけるＳｉａｍｅｓｅＢＥＲＴネットワークの概略については図３を参照して後述する。なお、本発明は、ＳｉａｍｅｓｅＢＥＲＴネットワークによる構成に限定されるものではなく、質問ＢＥＲＴ８０と回答候補ＢＥＲＴ１７０の出力であるベクトル表現の近い、遠いが学習データと整合していれば、問題ない。 The learning device 150 further includes a training target question BERT 168 having the same configuration as the question BERT 80 shown in FIG. and a BERT learning unit 166 for simultaneously learning a question BERT 168 and an answer candidate BERT 170 using Siamese BERT Networks. The Siamese BERT network is described in the following references: An outline of the Siamese BERT network in this embodiment will be described later with reference to FIG. 3. Note that the present invention is not limited to the configuration using the Siamese BERT network, and there is no problem as long as the vector representations that are the outputs of the question BERT 80 and the answer candidate BERT 170 are close or distant but consistent with the learning data.

［参考文献］
Nils Reimers and Iryna Gurevych，”Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks，”[online]，令和元年8月27日，Arxiv.org，［令和４年３月１０日検索］，インターネット，＜https://arxiv.org/pdf/1908.10084＞
学習装置１５０はさらに、ＢＥＲＴ学習部１６６により学習が行われた回答候補ＢＥＲＴ１７０を用いて質問・パッセージ収集部１６０に含まれる各パッセージをパッセージベクトル化し、さらにクラスタリングして回答候補ＤＢ９０及びセントロイドＤＢ８４を生成するためのパッセージクラスタリング部１７２を含む。パッセージクラスタリング部１７２による回答候補ＤＢ９０の生成は、以下の手順に従って行われる。まず質問・パッセージ収集部１６０に含まれる各パッセージを回答候補ＢＥＲＴ１７０に入力することにより、回答候補ＢＥＲＴ１７０の出力に得られるベクトルに変換する。これらのベクトルは、各パッセージを表すパッセージベクトルである。パッセージクラスタリング部１７２はさらに、これらパッセージベクトルを所定個数のクラスタにクラスタリングする。このクラスタの個数は、あらかじめ定めておいてもよいし、クラスタ内のベクトルの分散の合計を最小にするなど、所定の基準により支障の出ない一定範囲内において決めるようにしてもよい。 [References]
Nils Reimers and Iryna Gurevych, “Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks,” [online], August 27, 2019, Arxiv.org, [searched March 10, 2020], Internet ,＜https://arxiv.org/pdf/1908.10084＞
The learning device 150 further converts each passage included in the question/passage collection unit 160 into a passage vector using the answer candidate BERT 170 trained by the BERT learning unit 166, and further clusters the passages to create an answer candidate DB 90 and a centroid DB 84. It includes a passage clustering unit 172 for generating. Generation of the answer candidate DB 90 by the passage clustering unit 172 is performed according to the following procedure. First, by inputting each passage included in the question/passage collection unit 160 to the answer candidate BERT 170, it is converted into a vector obtained as an output of the answer candidate BERT 170. These vectors are passage vectors representing each passage. The passage clustering unit 172 further clusters these passage vectors into a predetermined number of clusters. The number of clusters may be determined in advance, or may be determined within a certain range without causing any problems based on predetermined criteria, such as minimizing the total variance of vectors within the cluster.

パッセージクラスタリング部１７２は、このようにしてパッセージベクトルをクラスタリングした後、各クラスタのセントロイドを決定する。さらに各セントロイドに、そのセントロイドが代表するクラスタの識別子を付与する。パッセージクラスタリング部１７２は、このようにして得られた各セントロイドに関する情報をレコード化しセントロイドＤＢ８４を生成する。具体的には、セントロイドＤＢ８４の各レコードは、クラスタ識別子と、そのセントロイドのベクトルとを含む。当然のことだがセントロイドのベクトルはパッセージベクトル及び質問ベクトルと同次元である。 After clustering the passage vectors in this manner, the passage clustering unit 172 determines the centroid of each cluster. Furthermore, each centroid is given an identifier of the cluster represented by that centroid. The passage clustering unit 172 records the information regarding each centroid obtained in this manner and generates a centroid DB 84. Specifically, each record in the centroid DB 84 includes a cluster identifier and a vector of its centroid. Naturally, the centroid vector has the same dimension as the passage vector and the question vector.

なお、パッセージクラスタリング部１７２は、回答候補ＤＢ９０の各レコードを以下のように生成する。パッセージクラスタリング部１７２は、質問・パッセージ収集部１６０の各レコードについて、そのレコードのパッセージから得られたパッセージベクトルの属するクラスタのクラスタ識別子と、そのレコードのＵＲＬとを一組にして回答候補ＤＢ９０にレコードを登録する。パッセージクラスタリング部１７２はまた、質問ＤＢ１６２の各レコードについて、その質問ＤＢ１６２に対して得られたパッセージが主として属するクラスタのセントロイドＩＤを付与する処理も行う。 Note that the passage clustering unit 172 generates each record in the answer candidate DB 90 as follows. The passage clustering unit 172 sets, for each record in the question/passage collection unit 160, the cluster identifier of the cluster to which the passage vector obtained from the passage of that record belongs and the URL of that record as a set and records it in the answer candidate DB 90. Register. The passage clustering unit 172 also performs, for each record in the question DB 162, a process of assigning the centroid ID of the cluster to which the passage obtained for that question DB 162 mainly belongs.

学習装置１５０はさらに、セントロイドＤＢ８４及び質問ＤＢ１６２を用いて、質問ＢＥＲＴ１６８の追加学習を行うための追加学習部１７４を含む。ＢＥＲＴ学習部１６６による学習がされた後の質問ＢＥＲＴ１６８は、追加学習部１７４による追加学習を受けて図１に示す質問ＢＥＲＴ８０となる。この追加学習の意味については図４及び図５を参照して後述する。 The learning device 150 further includes an additional learning unit 174 for performing additional learning of the question BERT 168 using the centroid DB 84 and the question DB 162. The question BERT168 that has been trained by the BERT learning unit 166 becomes the question BERT80 shown in FIG. 1 after undergoing additional learning by the additional learning unit 174. The meaning of this additional learning will be described later with reference to FIGS. 4 and 5.

図３に、図２に示すＢＥＲＴ学習部１６６の構成についてブロック図形式により示す。この学習における教師データは、質問及びパッセージと、そのパッセージがその質問に対する回答を含むか否かを示すラベルとを１組とする。ラベルは、例えばパッセージが質問に対する回答を含むときは１であり、さもなければ０とする。 FIG. 3 shows the configuration of the BERT learning section 166 shown in FIG. 2 in block diagram form. The teacher data for this learning includes a question, a passage, and a label indicating whether or not the passage includes an answer to the question. The label is, for example, 1 when the passage contains an answer to a question, and 0 otherwise.

図３を参照して、ＢＥＲＴ学習部１６６は、学習データの質問と回答候補パッセージとを質問ＢＥＲＴ１６８及び回答候補ＢＥＲＴ１７０にそれぞれ入力し、質問を表現するベクトル２００（Ｕ）と回答候補パッセージを表現するベクトル２０２（Ｖ）とに変換した後、ベクトルＵ及びベクトルＶの間のコサイン類似度を算出するためのコサイン類似度算出部２０４と、コサイン類似度算出部２０４の出力（［－１，１］の範囲）を［０，１］の範囲に正規化するための正規化処理部２０６と、正規化処理部２０６の出力と学習データのラベル２１０（０又は１）とに基づき、両者が一致する方向に質問ＢＥＲＴ１６８及び回答候補ＢＥＲＴ１７０の各々のパラメータを誤差逆伝播法により更新するためのパラメータ更新部２０８とを含む。正規化処理部２０６による正規化は、例えば｛cos（Ｕ、Ｖ）＋１｝／２により行うことができる。なお、この実施形態においては、パラメータ更新部２０８における更新のための損失関数として平均二乗誤差を用いている。 Referring to FIG. 3, the BERT learning unit 166 inputs the question and answer candidate passage of the learning data into the question BERT 168 and the answer candidate BERT 170, respectively, and expresses the vector 200 (U) expressing the question and the answer candidate passage. After converting into a vector 202 (V), a cosine similarity calculation unit 204 for calculating the cosine similarity between the vector U and the vector V and the output of the cosine similarity calculation unit 204 ([-1, 1] range) to the range [0, 1], and based on the output of the normalization processing unit 206 and the label 210 (0 or 1) of the learning data, the two match. and a parameter updating unit 208 for updating each parameter of the question BERT 168 and the answer candidate BERT 170 in the direction using the error backpropagation method. The normalization by the normalization processing unit 206 can be performed using, for example, {cos (U, V)+1}/2. Note that in this embodiment, the mean square error is used as a loss function for updating in the parameter updating unit 208.

質問ＢＥＲＴ１６８は、ＢＥＲＴ２２０と、ＢＥＲＴ２２０の最終層の各要素について平均プーリングを行うことによりベクトル２００を出力するためのＰｏｏｌｉｎｇ層２２２とを含む。 Query BERT 168 includes a BERT 220 and a pooling layer 222 for outputting a vector 200 by performing average pooling on each element of the final layer of BERT 220.

回答候補ＢＥＲＴ１７０は質問ＢＥＲＴ１６８と全く同じ構成である。すなわち、回答候補ＢＥＲＴ１７０は、ＢＥＲＴ２２０と同じ構成のＢＥＲＴ２３０と、ＢＥＲＴ２３０の最終層の各要素について平均プーリングを行うことによりベクトル２０２を出力するためのＰｏｏｌｉｎｇ層２３２とを含む。この段階においては、ＢＥＲＴ２２０とＢＥＲＴ２３０とのパラメータ構成は全て共通であり、それらの値の更新も互いに反映される。 The answer candidate BERT 170 has exactly the same configuration as the question BERT 168. That is, the answer candidate BERT 170 includes a BERT 230 having the same configuration as the BERT 220, and a pooling layer 232 for outputting the vector 202 by performing average pooling on each element of the final layer of the BERT 230. At this stage, the parameter configurations of BERT 220 and BERT 230 are all common, and updates of their values are also reflected in each other.

前述したとおり、質問ＢＥＲＴ１６８及び回答候補ＢＥＲＴ１７０は同じ構成であり、かつパラメータの値も共通するようパラメータ更新部２０８による更新が行われる。ＢＥＲＴ学習部１６６は、こうした学習を全ての学習データに対して所定の終了条件が成立するまで繰り返す。その結果、回答候補ＢＥＲＴ１７０に与えられる回答候補パッセージが質問ＢＥＲＴ１６８に与えられる質問に対する回答を含む場合にはベクトル２００とベクトル２０２とが類似したベクトルとなり、そうでない場合には互いに異なるベクトルとなる。 As described above, the question BERT 168 and the answer candidate BERT 170 have the same configuration, and are updated by the parameter updating unit 208 so that the parameter values are also common. The BERT learning unit 166 repeats such learning for all learning data until a predetermined end condition is satisfied. As a result, if the answer candidate passage given to answer candidate BERT 170 includes an answer to the question given to question BERT 168, vector 200 and vector 202 are similar vectors, otherwise they are different vectors.

ただし、上記した学習を行った場合には以下のような問題が生じ得る。図４を参照して、例えばある質問に対する質問ＢＥＲＴ１６８による変換２５６の結果、ベクトル２５４が得られたものとする。一方、この質問に対する正しい回答を含むパッセージを回答候補ＢＥＲＴ１７０により変換した結果がベクトル２６０だとする。またベクトル２６０が属するのがクラスタ２５０であり、そのセントロイドがベクトル２７０だとする。 However, when the above learning is performed, the following problems may occur. Referring to FIG. 4, assume that a vector 254 is obtained as a result of transformation 256 by question BERT 168 for a certain question. On the other hand, it is assumed that the vector 260 is the result of converting the passage containing the correct answer to this question using the answer candidate BERT 170. It is also assumed that the vector 260 belongs to the cluster 250 and its centroid is the vector 270.

一方、クラスタ２５０と異なるクラスタ２５２が存在し、そのセントロイドがベクトル２７２だとする。なお、図４はベクトルを２次元として考えているが、実際ははるかに次元数が高いことに注意する必要がある。 On the other hand, it is assumed that a cluster 252 different from the cluster 250 exists, and its centroid is the vector 272. Although FIG. 4 assumes that the vector is two-dimensional, it should be noted that the number of dimensions is actually much higher.

この状況においては、ベクトル２５４とベクトル２６０との間のコサイン類似度は、ベクトル２５４とクラスタ２５２に含まれるどのベクトルとの間のコサイン類似度よりも大きい。しかし、ベクトル２５４と各セントロイドのベクトル２７０及び２７２とのコサイン類似度を考えると、ベクトル２５４とベクトル２７０との間のコサイン類似度２６４（ｃｏｓ_１）と、ベクトル２５４とベクトル２７２との間のコサイン類似度２６２（ｃｏｓ２）との間にはｃｏｓ_１＜ｃｏｓ_２という関係が成立してしまうことになる。このような状況になると、図１に示す回答候補検索部９２が質問ベクトル８２と各クラスタのセントロイドとのコサイン類似度に基づいてクラスタを選択する以上、正しい処理が行えなくなる可能性がある。そこで、図２に示す追加学習部１７４により以下のような追加学習を行う。 In this situation, the cosine similarity between vector 254 and vector 260 is greater than the cosine similarity between vector 254 and any vector included in cluster 252. However, considering the cosine similarity between vector 254 and vectors 270 and 272 of each centroid, the cosine similarity between vector 254 and vector 270 is 264 (cos ₁ ), and the cosine similarity between vector 254 and vector 272 is The relationship cos ₁ <cos ₂ holds true with the cosine similarity 262 (cos2). In such a situation, since the answer candidate search unit 92 shown in FIG. 1 selects clusters based on the cosine similarity between the question vector 82 and the centroid of each cluster, it may not be possible to perform correct processing. Therefore, the additional learning section 174 shown in FIG. 2 performs the following additional learning.

図５を参照して、追加学習においては、例えば質問ＢＥＲＴ１６８による変換により生成されるベクトルとベクトル２７０とのコサイン類似度が、ベクトル２７２とのコサイン類似度よりも大きくなるようにすればよい。そこで、質問ＢＥＲＴ１６８の追加学習により、質問ＢＥＲＴ１６８の変換２５６ではなく、得られるベクトル２８０がよりベクトル２７０に近づく変換２５８が実現されるように質問ＢＥＲＴ１６８のパラメータを更新する。すなわち、ベクトル２５４をベクトル２８２に相当する分だけ移動してベクトル２８０の位置に来るようにすればよい。ベクトル２８０とベクトル２７０のコサイン類似度２８６をｃｏｓ′_１により表し、ベクトル２８０とベクトル２７２とのコサイン類似度２８４をｃｏｓ′_２により表すとすれば、ｃｏｓ′_１＞ｃｏｓ′_２となるようにすればよい。 Referring to FIG. 5, in the additional learning, for example, the cosine similarity between the vector generated by the transformation by the question BERT 168 and the vector 270 may be made larger than the cosine similarity between the vector 272 and the vector 272. Therefore, by additional learning of the question BERT 168, the parameters of the question BERT 168 are updated so that a transformation 258, in which the obtained vector 280 becomes closer to the vector 270, is realized instead of the transformation 256 of the question BERT 168. That is, the vector 254 may be moved by an amount corresponding to the vector 282 so that it comes to the position of the vector 280. If the cosine similarity 286 between vectors 280 and 270 is expressed by cos' ₁ , and the cosine similarity 284 between vectors 280 and 272 is expressed by cos' ₂ , then cos' ₁ >cos' ₂ . Bye.

そのため、この実施形態においては、追加学習部１７４は以下のような構成を持つ。図６を参照して、追加学習部１７４は、ある質問６０を質問ＢＥＲＴ１６８に入力してベクトル３１０を出力させ、一方、質問６０に対応するパッセージが主として属するクラスタのセントロイドのベクトルをベクトルＷとすると、ベクトル３１０とベクトルＷからなるベクトル３１２とのコサイン類似度を算出するためのコサイン類似度算出部３１４と、コサイン類似度算出部３１４の出力（［－１，１］の範囲）を［０，１］の範囲に正規化するための正規化処理部３１６と、正規化処理部３１６の出力がラベル「１」に近づくようにＢＥＲＴ２２０のパラメータを誤差逆伝播法により更新するためのパラメータ更新部３１８とを含む。コサイン類似度算出部３１４は図３に示すコサイン類似度算出部２０４と同じものである。正規化処理部３１６は図３に示す正規化処理部２０６と同じものである。またパラメータ更新部３１８は基本的には図３に示すパラメータ更新部２０８と同じものだが、質問ＢＥＲＴ１６８のＢＥＲＴ２２０のみのパラメータを更新する点においてパラメータ更新部２０８と異なる。追加学習部１７４は、この更新を、所定の終了条件が成立するまで繰り返し実行する。 Therefore, in this embodiment, the additional learning section 174 has the following configuration. Referring to FIG. 6, the additional learning unit 174 inputs a certain question 60 to the question BERT 168 to output a vector 310, and on the other hand, the centroid vector of the cluster to which the passage corresponding to the question 60 mainly belongs is defined as a vector W. Then, the cosine similarity calculation unit 314 for calculating the cosine similarity between the vector 310 and the vector 312 consisting of the vector W, and the output of the cosine similarity calculation unit 314 (range [-1, 1]) are set to [0 . 318. The cosine similarity calculation unit 314 is the same as the cosine similarity calculation unit 204 shown in FIG. The normalization processing unit 316 is the same as the normalization processing unit 206 shown in FIG. The parameter update unit 318 is basically the same as the parameter update unit 208 shown in FIG. 3, but differs from the parameter update unit 208 in that it updates the parameters of only the BERT 220 of the question BERT 168. The additional learning unit 174 repeatedly executes this update until a predetermined termination condition is satisfied.

ラベル「１」は正解を示す。このような更新をすることにより、各質問のベクトルとその正しい回答を含むパッセージのベクトルとの間のコサイン類似度が大きくなるように、ＢＥＲＴ２２０のパラメータを更新できる。なお、この追加学習においては、正解パッセージのベクトルが属するクラスタのセントロイドに対して質問のベクトルが近く（コサイン類似度が大きくなるように）配置されるようになればよい。そのため、不正解のパッセージは使用せず、正解のパッセージに関する学習データのみを利用して学習を行えばよい。 Label "1" indicates the correct answer. Such an update allows the parameters of BERT 220 to be updated such that the cosine similarity between each question's vector and the passage's vector containing its correct answer increases. In addition, in this additional learning, it is sufficient that the vector of the question is arranged close to the centroid of the cluster to which the vector of the correct passage belongs (so that the cosine similarity becomes large). Therefore, learning can be performed using only learning data related to correct passages, without using incorrect passages.

２．動作
上記した構成を持つ質問回答装置５０及び学習装置１５０は以下のように動作する。まず、質問ＢＥＲＴ８０の学習時の学習装置１５０について説明し、次に質問ＢＥＲＴ８０を用いた質問回答装置５０の動作について説明する。 2. Operation The question answering device 50 and the learning device 150 having the configurations described above operate as follows. First, the learning device 150 during learning of the question BERT 80 will be explained, and then the operation of the question answering device 50 using the question BERT 80 will be explained.

２－１．質問ＢＥＲＴ８０の学習
図２を参照して、質問ＢＥＲＴ８０の学習時には学習装置１５０は以下のように動作する。まず質問・パッセージ収集部１６０が、インターネット６２をクロールし、質問文を収集し質問ＤＢ１６２に記憶する。質問・パッセージ収集部１６０はさらに、質問ＤＢ１６２に記憶された質問文の各々について、その質問に対する回答を含むと考えられるパッセージをさらにインターネット６２から収集する。質問・パッセージ収集部１６０は、収集したパッセージを対応する質問と関連付けてパッセージＤＢ１６４に格納する。質問・パッセージ収集部１６０によるこれら処理は従来技術により実現できる。 2-1. Learning of Question BERT 80 Referring to FIG. 2, learning device 150 operates as follows when learning Question BERT 80. First, the question/passage collection unit 160 crawls the Internet 62, collects question sentences, and stores them in the question DB 162. The question/passage collecting unit 160 further collects passages from the Internet 62 that are considered to include an answer to each question stored in the question DB 162. The question/passage collection unit 160 stores the collected passages in the passage DB 164 in association with the corresponding questions. These processes by the question/passage collection unit 160 can be realized by conventional techniques.

次にＢＥＲＴ学習部１６６が、質問ＤＢ１６２に記憶された質問の各々と、これら質問に対してパッセージＤＢ１６４に記憶されたパッセージとから、正例と負例とからなる学習データを生成する。ＢＥＲＴ学習部１６６はこれら学習データを用いて質問ＢＥＲＴ１６８と回答候補ＢＥＲＴ１７０との学習をＳｉａｍｅｓｅＢＥＲＴネットワークにより同時に行う。 Next, the BERT learning unit 166 generates learning data consisting of positive examples and negative examples from each of the questions stored in the question DB 162 and the passages stored in the passage DB 164 for these questions. The BERT learning unit 166 uses these learning data to simultaneously learn the question BERT 168 and the answer candidate BERT 170 using the Siamese BERT network.

より具体的には、図３を参照して、この学習における教師データは、質問及びパッセージと、そのパッセージがその質問に対する回答を含むか否かを示すラベルとを１組とする。ラベルは、例えばパッセージが質問に対する回答を含むときは１であり、さもなければ０である。 More specifically, referring to FIG. 3, the teacher data for this learning includes a question, a passage, and a label indicating whether or not the passage includes an answer to the question. The label is, for example, 1 when the passage contains an answer to a question, and 0 otherwise.

ＢＥＲＴ学習部１６６は、学習データの質問と回答候補パッセージとを質問ＢＥＲＴ１６８及び回答候補ＢＥＲＴ１７０にそれぞれ入力する。質問ＢＥＲＴ１６８のＢＥＲＴ２２０及びＰｏｏｌｉｎｇ層２２２は、質問をベクトル２００（Ｕ）に変換する。回答候補ＢＥＲＴ１７０のＢＥＲＴ２３０及びＰｏｏｌｉｎｇ層２３２は、回答候補パッセージをベクトル２０２（Ｖ）に変換する。コサイン類似度算出部２０４が、ベクトルＵ及びベクトルＶの間のコサイン類似度を算出し正規化処理部２０６に入力する。この値は［－１，１］の範囲である。正規化処理部２０６は、コサイン類似度算出部２０４の値を［０，１］の範囲に正規化してパラメータ更新部２０８に入力する。パラメータ更新部２０８は、正規化処理部２０６の出力と学習データのラベル２１０（０又は１）とに基づき、両者が一致する方向に質問ＢＥＲＴ１６８及び回答候補ＢＥＲＴ１７０の各々のパラメータを誤差逆伝播法により更新する。質問ＢＥＲＴ１６８及び回答候補ＢＥＲＴ１７０は同じ構成であり、かつパラメータの値も共通するようパラメータ更新部２０８による更新が行われる。 The BERT learning unit 166 inputs the question and answer candidate passage of the learning data into the question BERT 168 and the answer candidate BERT 170, respectively. BERT 220 and Pooling layer 222 of query BERT 168 transform the query into a vector 200(U). The BERT 230 and pooling layer 232 of the answer candidate BERT 170 convert the answer candidate passage into a vector 202(V). The cosine similarity calculation unit 204 calculates the cosine similarity between the vector U and the vector V, and inputs the calculated cosine similarity to the normalization processing unit 206. This value is in the range [-1,1]. The normalization processing unit 206 normalizes the value of the cosine similarity calculation unit 204 to the range [0, 1] and inputs the normalized value to the parameter updating unit 208. Based on the output of the normalization processing unit 206 and the label 210 (0 or 1) of the learning data, the parameter updating unit 208 uses the error backpropagation method to change the parameters of the question BERT 168 and the answer candidate BERT 170 in the direction that the two match. Update. The question BERT 168 and the answer candidate BERT 170 have the same configuration, and are updated by the parameter updating unit 208 so that the parameter values are also common.

ＢＥＲＴ学習部１６６は、こうした学習を全ての学習データに対して所定の終了条件が成立するまで繰り返す。その結果、回答候補ＢＥＲＴ１７０に与えられる回答候補パッセージが質問ＢＥＲＴ１６８に与えられる質問に対する回答を含む場合にはベクトル２００とベクトル２０２とが類似したベクトルとなり、そうでない場合には互いに異なるベクトルとなる。 The BERT learning unit 166 repeats such learning for all learning data until a predetermined end condition is satisfied. As a result, if the answer candidate passage given to answer candidate BERT 170 includes an answer to the question given to question BERT 168, vector 200 and vector 202 are similar vectors, otherwise they are different vectors.

パッセージクラスタリング部１７２は、このように学習が終わった回答候補ＢＥＲＴ１７０を用いて、パッセージＤＢ１６４に記憶されたパッセージを全てパッセーベクトルに変換する。パッセージクラスタリング部１７２はさらに、それらパッセージベクトルをｋ平均法により所定個数のクラスタに分類する。これらクラスタにはそれぞれ識別子が割り当てられる。パッセージクラスタリング部１７２はさらに、各クラスタのセントロイドのベクトルを算出し、クラスタの識別子をセントロイドの識別子に割り当てる。パッセージクラスタリング部１７２は、こうして得られた各セントロイドについて、クラスタ識別子とそのセントロイドのベクトルとを組にしてセントロイドＤＢ８４に登録する。 The passage clustering unit 172 converts all the passages stored in the passage DB 164 into passage vectors using the answer candidate BERT 170 that has been trained in this way. The passage clustering unit 172 further classifies the passage vectors into a predetermined number of clusters using the k-means method. Each of these clusters is assigned an identifier. The passage clustering unit 172 further calculates the centroid vector of each cluster and assigns the cluster identifier to the centroid identifier. For each centroid thus obtained, the passage clustering unit 172 registers the cluster identifier and the vector of the centroid as a pair in the centroid DB 84.

一方、パッセージクラスタリング部１７２は、各クラスタに属するパッセージベクトルに対応するパッセージのＵＲＬと、その属するクラスタの識別子とを組にしてレコードを回答候補ＤＢ９０に登録する。またパッセージクラスタリング部１７２は、質問ＤＢ１６２に記憶されている各質問のレコードに、その質問に対する回答を最も多く含むクラスタのクラスタ識別子を追加する。 On the other hand, the passage clustering unit 172 registers a record in the answer candidate DB 90 by combining the URL of the passage corresponding to the passage vector belonging to each cluster and the identifier of the cluster to which the passage belongs. The passage clustering unit 172 also adds, to each question record stored in the question DB 162, the cluster identifier of the cluster containing the most answers to that question.

セントロイドＤＢ８４へのセントロイドの登録が完了すると、追加学習部１７４が、質問ＤＢ１６２に記憶されている各質問に対し、その質問に対応するセントロイドのベクトルをセントロイドＤＢ８４から読み出す。追加学習部１７４は各質問に対して、セントロイドＤＢ８４から読み出したセントロイドベクトルを正解データとして質問ＢＥＲＴ８０の追加学習を行う。所定の終了条件が成立した時点で質問ＢＥＲＴ８０の学習が終了する。 When the registration of the centroid in the centroid DB 84 is completed, the additional learning unit 174 reads out the centroid vector corresponding to each question stored in the question DB 162 from the centroid DB 84. The additional learning unit 174 performs additional learning of the question BERT 80 for each question using the centroid vector read from the centroid DB 84 as correct answer data. Learning of the question BERT 80 ends when a predetermined end condition is met.

より具体的には、図６を参照して、学習データは、質問と、質問に対する正しい回答を含むパッセージのパッセージベクトルが属するクラスタのセントロイドのベクトルＷと、ラベルである。この場合、ラベルの値は常に「１」である。 More specifically, referring to FIG. 6, the learning data is a question, a centroid vector W of a cluster to which a passage vector of a passage including a correct answer to the question belongs, and a label. In this case, the value of the label is always "1".

質問は質問ＢＥＲＴ１６８に与えられる。質問ＢＥＲＴ１６８のＢＥＲＴ２２０及びＰｏｏｌｉｎｇ層２２２が質問を処理し、ベクトル３１０を出力する。ベクトル３１０はコサイン類似度算出部３１４の第１の入力に与えられる。一方、コサイン類似度算出部３１４の第２の入力には、ベクトルＷが与えられる。コサイン類似度算出部３１４は、第１の入力のベクトルと第２の入力のベクトルとの類似度を算出し正規化処理部３１６に与える。正規化処理部３１６はコサイン類似度算出部３１４の出力を［０，１］の範囲に正規化しパラメータ更新部３１８に与える。パラメータ更新部３１８は、この値がラベル（１）に近づくようにＢＥＲＴ２２０のパラメータを誤差逆伝播法により更新する。 The question is given to question BERT168. BERT 220 and Pooling layer 222 of query BERT 168 process the query and output vector 310. Vector 310 is given to the first input of cosine similarity calculating section 314. On the other hand, the vector W is given to the second input of the cosine similarity calculation unit 314. The cosine similarity calculation unit 314 calculates the similarity between the first input vector and the second input vector and provides it to the normalization processing unit 316. The normalization processing unit 316 normalizes the output of the cosine similarity calculation unit 314 to the range [0, 1] and provides it to the parameter updating unit 318. The parameter update unit 318 updates the parameters of the BERT 220 using the error backpropagation method so that this value approaches label (1).

追加学習部１７４は、こうした追加学習を、所定の終了条件が終了するまで繰り返し実行する。 The additional learning unit 174 repeatedly performs such additional learning until a predetermined termination condition is met.

この追加学習により、図４のような状態が発生する可能性が小さくなり、図５に示すように質問を質問ＢＥＲＴ８０によりベクトル化した質問ベクトルとのコサイン類似度が最も大きなセントロイドが、正しいクラスタ２５０のセントロイドのベクトル２７０となる可能性を高くできる。 This additional learning reduces the possibility that the situation shown in Figure 4 will occur, and as shown in Figure 5, the centroid with the highest cosine similarity to the question vector obtained by vectorizing the question using the Question BERT80 will be assigned to the correct cluster. The probability that the vector 270 of the centroid 250 will be obtained can be increased.

２－２．回答の生成
図１を参照して、追加学習後の質問ＢＥＲＴ８０を持つ回答候補検索部６４、及び回答候補検索部６４を含む質問回答装置５０は以下のように動作する。なお学習により、回答候補ＤＢ９０及びセントロイドＤＢ８４も既に得られている。 2-2. Generation of Answers Referring to FIG. 1, the answer candidate search unit 64 having the question BERT 80 after additional learning and the question answering device 50 including the answer candidate search unit 64 operate as follows. Note that the answer candidate DB 90 and centroid DB 84 have already been obtained through learning.

対話相手から何らかの質問６０が入力されたものとする。質問６０は質問ＢＥＲＴ８０に与えられる。質問６０は同時に回答生成部６８にも与えられる。 Assume that some question 60 has been input by the conversation partner. Question 60 is given to question BERT80. The question 60 is also given to the answer generation section 68 at the same time.

質問ＢＥＲＴ８０は質問６０が入力されたことに応答して、質問６０を表す質問ベクトル８２を出力する。回答候補クラスタ特定部８６は、質問ベクトル８２に最も近いセントロイドのベクトルをセントロイドＤＢ８４において検索しそのクラスタ識別子８８を出力する。 Question BERT 80 outputs a question vector 82 representing question 60 in response to question 60 being input. The answer candidate cluster specifying unit 86 searches the centroid DB 84 for the centroid vector closest to the question vector 82 and outputs its cluster identifier 88.

回答候補検索部９２は回答候補ＤＢ９０を検索し、このクラスタ識別子８８を持つパッセージ（回答候補）を全て取り出し、回答候補ＵＲＬ群９４を出力する。パッセージ検索部９６は、これら回答候補ＵＲＬ群９４を受けて、インターネット６２の各ＵＲＬからパッセージをダウンロードしパッセージＤＢ６６に格納する。 The answer candidate search unit 92 searches the answer candidate DB 90, extracts all passages (answer candidates) having this cluster identifier 88, and outputs a group of answer candidate URLs 94. The passage search unit 96 receives the answer candidate URL group 94, downloads a passage from each URL on the Internet 62, and stores it in the passage DB 66.

回答生成部６８は、パッセージＤＢ６６に格納されたパッセージ群の中から、質問６０に対する回答として最も適切なものを選択し、回答７０として出力する。回答生成部６８による回答の選択は従来の手法と全く同様である。 The answer generation unit 68 selects the most appropriate answer to the question 60 from the passage group stored in the passage DB 66 and outputs it as an answer 70. The selection of answers by the answer generation unit 68 is exactly the same as the conventional method.

３．効果
この第１実施形態によれば、オフラインの状態であらかじめ作成した回答候補ＤＢ９０、セントロイドＤＢ８４を用い、同様にオフラインの状態であらかじめ学習した質問ＢＥＲＴ８０を使用して、質問６０に対する回答候補をパッセージＤＢ６６に収集する。回答候補の数は、図１に示す回答候補クラスタ特定部８６により選択されたクラスタに属する回答候補に限定される。質問に対する回答時に、大量の回答回答候補に対してニューラルネットワークを適用して回答を選択する必要がない。そのため、必要な記憶容量も計算資源もはるかに小さく済むという効果がある。特に図１に示す例のように、回答候補ＤＢ９０がパッセージそのものではなくそのパッセージが存在するＵＲＬを記憶しているため、パッセージそのものを記憶する場合と比較してさらに記憶容量が小さく済み、処理が軽くできるという効果がある。また、パッセージ検索部９６によるインターネットからの各パッセージのダウンロードは並列処理が可能であり、質問６０に対して回答７０を生成するために要する時間を短くできるという効果もある。 3. Effects According to the first embodiment, by using the answer candidate DB 90 and the centroid DB 84 created in advance in an offline state, and by using the question BERT 80 learned in advance in an offline state, answer candidates for the question 60 are created in a passage. Collect to DB66. The number of answer candidates is limited to answer candidates belonging to the cluster selected by the answer candidate cluster specifying unit 86 shown in FIG. When answering a question, there is no need to select an answer by applying a neural network to a large number of answer candidates. Therefore, the required storage capacity and computational resources are much smaller. In particular, as in the example shown in FIG. 1, the answer candidate DB 90 stores not the passage itself but the URL where the passage exists, so the storage capacity is smaller compared to the case where the passage itself is stored, and the processing is faster. It has the effect of being lightweight. Furthermore, downloading of each passage from the Internet by the passage search unit 96 can be processed in parallel, which has the effect of shortening the time required to generate the answer 70 to the question 60.

なお、上記の構成に加えて、質問６０を行う主体の質問履歴（過去の複数の質問）を、参考情報として、回答動作を制御する（例えば、初心者向けの回答や、専門家向けの回答のように、回答のレベルを変更する）ように、構成することも可能である。 In addition to the above configuration, the question history (past questions) of the person asking the question 60 is used as reference information to control the answering operation (for example, answering for beginners and answering for experts). It is also possible to configure the answer level (to change the answer level).

第２第２実施形態
１．構成
第１実施形態は、質問に対する回答を与える質問回答装置に関している。しかしこの発明はそのような実施形態だけではなく、いわゆる対話装置に適用することもできる。対話装置においてシステムが受ける入力は質問とは限らない。しかし、入力に対する応答を検索するための手法としては、質問回答と同様の手法を利用できる。 Second Second Embodiment 1. Configuration The first embodiment relates to a question answering device that provides answers to questions. However, the present invention can be applied not only to such embodiments but also to so-called dialogue devices. The input that the system receives in the dialog device is not necessarily a question. However, as a method for searching for a response to an input, a method similar to that for answering questions can be used.

対話装置において質問回答装置と異なるのは、相手の発話に対する応答を生成するときに、それまでの対話の履歴と関連する応答をすることが望ましいということである。この第２実施形態は、そのような対話システムに関する。 A dialog device differs from a question-answer device in that when generating a response to the other party's utterance, it is desirable to make a response that is related to the history of previous dialogs. This second embodiment relates to such a dialogue system.

Ａ．対話装置
図７に、この発明の第２実施形態に係る対話装置３５０の構成を示す。この対話装置３５０においても、第１実施形態の質問ＢＥＲＴ８０と同様の構成を持つ質問ＢＥＲＴ３８０を使用する。質問ＢＥＲＴ３８０の学習については図８以下を参照して後述する。 A. Dialogue Device FIG. 7 shows the configuration of a dialogue device 350 according to a second embodiment of the present invention. This dialog device 350 also uses a question BERT 380 having the same configuration as the question BERT 80 of the first embodiment. Learning of the question BERT 380 will be described later with reference to FIG. 8 and subsequent figures.

図７を参照して、対話装置３５０は、相手の発話３６２に応答して、発話３６２に対する応答として適切な応答候補をインターネット６２から検索する応答候補検索装置３６０と、応答候補検索装置３６０が検索した応答候補を記憶するためのパッセージＤＢ３６３と、パッセージＤＢ３６３に記憶された応答候補の中から発話３６２に対する応答として適切なものを選択し、応答３６６を生成し出力するための応答生成部３６４と、応答生成部３６４の出力を、対話に相応しい形に整形し出力するための応答整形部３６８とを含む。パッセージＤＢ３６３は、第１実施形態において使用されたものと同様の構成である。ただし、応答候補検索装置３６０が選択するパッセージが第１実施形態と異なってくるため、パッセージＤＢ３６３の記憶内容も第１実施形態とは異なる。対話には対話に相応しい発話スタイルがある。そのためこの実施形態においては応答整形部３６８により応答生成部３６４の出力を整形して出力する。 Referring to FIG. 7, the dialogue device 350 includes a response candidate search device 360 that searches the Internet 62 for appropriate response candidates as a response to the utterance 362 in response to the other party's utterance 362; a passage DB 363 for storing response candidates stored in the passage DB 363; a response generation unit 364 for selecting an appropriate response to the utterance 362 from among the response candidates stored in the passage DB 363, and generating and outputting a response 366; It includes a response shaping section 368 that formats the output of the response generation section 364 into a form suitable for dialogue and outputs it. The passage DB 363 has the same configuration as that used in the first embodiment. However, since the passage selected by the response candidate search device 360 is different from the first embodiment, the storage contents of the passage DB 363 are also different from the first embodiment. Dialogue has a speaking style that is appropriate for dialogue. Therefore, in this embodiment, the response shaping section 368 formats and outputs the output of the response generation section 364.

応答候補検索装置３６０は、対話装置３５０と相手との対話の履歴を管理するための対話履歴管理部３７０と、対話履歴管理部３７０の管理する対話履歴を記憶する対話履歴ＤＢ３７２と、複数の内容語が入力されたことに応答して、その内容語を含む対話のトピックを示す情報を出力するためのトピックモデル３７４とを含む。トピックモデル３７４は例えば統計的モデルであり、あらかじめ学習済だとする。トピックモデルの学習には、例えば特開２０１５－０４５９１５号公報に記載の方法が利用できる。また、ニューラルネットワークによりトピックモデルを構築することもできる。トピックモデル３７４の出力は、例えばトピックを表す１又は複数の単語である。 The response candidate search device 360 includes a dialogue history management unit 370 for managing the history of dialogue between the dialogue device 350 and the other party, a dialogue history DB 372 that stores the dialogue history managed by the dialogue history management unit 370, and a dialogue history DB 372 that stores a plurality of contents. and a topic model 374 for outputting information indicating a conversation topic including the content word in response to input of the content word. It is assumed that the topic model 374 is, for example, a statistical model and has been trained in advance. For example, the method described in Japanese Patent Application Publication No. 2015-045915 can be used to learn the topic model. It is also possible to construct topic models using neural networks. The output of topic model 374 is, for example, one or more words representing the topic.

応答候補検索装置３６０はさらに、発話３６２に応答し、対話履歴ＤＢ３７２に記憶された情報とトピックモデル３７４とを使用して、発話３６２に相手との対話のトピックを示す情報を付与するためのトピック付与部３７６を含む。具体的には、トピック付与部３７６は、対話履歴ＤＢ３７２から相手との対話の履歴を読み出し、内容語を抽出する。トピック付与部３７６はこれらの内容語をトピックモデル３７４に与えて、トピックモデル３７４の出力する、トピックを表す１又は複数の単語を受け取る。トピック付与部３７６はさらに、発話３６２の後ろに、トピック付与部３７６から受け取った１又は複数の単語を付加して出力する。 The response candidate search device 360 further responds to the utterance 362 and uses the information stored in the dialogue history DB 372 and the topic model 374 to search for a topic for giving the utterance 362 information indicating the topic of the dialogue with the other party. It includes an imparting section 376. Specifically, the topic assigning unit 376 reads the history of the conversation with the other party from the conversation history DB 372 and extracts the content words. The topic providing unit 376 provides these content words to the topic model 374 and receives one or more words representing the topic output from the topic model 374. The topic adding unit 376 further adds one or more words received from the topic adding unit 376 to the end of the utterance 362 and outputs the added word.

応答候補検索装置３６０はさらに、トピック付与部３７６の出力を受けて、第１実施形態における質問ベクトルと同様の質問ベクトル３８２を出力するための、第１実施形態における質問ＢＥＲＴ８０と同様の構成を持つ質問ＢＥＲＴ３８０と、第１実施形態にお
けるセントロイドＤＢ８４と同様にして得られたセントロイドＤＢ３７８と、質問ベクトル３８２に応答して、質問ベクトル３８２とのコサイン類似度が最も大きな所定個数のセントロイドベクトルをセントロイドＤＢ３７８において検索し、それらのクラスタ識別子３８８を出力するための応答候補クラスタ特定部３８６とを含む。セントロイドＤＢ３７８の構成自体は第１実施形態のセントロイドＤＢ８４と同じである。ただしこの第２実施形態においてはセントロイドＤＢ３７８の学習を第１実施形態とはやや異なる方法により行っている。そのため、ここではセントロイドＤＢ３７８をセントロイドＤＢ８４とは別のものとして記載している。 The response candidate search device 360 further has a configuration similar to the question BERT 80 in the first embodiment for outputting a question vector 382 similar to the question vector in the first embodiment upon receiving the output of the topic assigning unit 376. In response to the question BERT 380, the centroid DB 378 obtained in the same manner as the centroid DB 84 in the first embodiment, and the question vector 382, a predetermined number of centroid vectors having the largest cosine similarity with the question vector 382 are generated. It also includes a response candidate cluster identification unit 386 for searching in the centroid DB 378 and outputting cluster identifiers 388 thereof. The configuration of the centroid DB 378 itself is the same as the centroid DB 84 of the first embodiment. However, in this second embodiment, learning of the centroid DB 378 is performed by a method slightly different from that in the first embodiment. Therefore, the centroid DB378 is described here as being different from the centroid DB84.

応答候補検索装置３６０はさらに、第１実施形態における回答候補ＤＢ９０と同様の構成を持つ応答候補ＤＢ３９０と、応答候補クラスタ特定部３８６からのクラスタ識別子３８８に応答して応答候補ＤＢ３９０を検索し、クラスタ識別子３８８と一致するクラスタ識別子を持つ応答候補のレコードを全て読み出して応答候補ＵＲＬ群３９４として出力するための、第１実施形態のものと同じ構成の応答候補検索部３９２とを含む。 The response candidate search device 360 further searches the response candidate DB 390 in response to the response candidate DB 390 having the same configuration as the response candidate DB 90 in the first embodiment and the cluster identifier 388 from the response candidate cluster specifying unit 386, and searches the response candidate DB 390 to determine the cluster. It includes a response candidate search unit 392 having the same configuration as that of the first embodiment, for reading out all records of response candidates having cluster identifiers that match the identifier 388 and outputting them as a response candidate URL group 394.

応答候補検索装置３６０はさらに、応答候補ＵＲＬ群３９４を受けて、インターネット６２を検索し、各応答候補のレコードに格納されているＵＲＬから応答候補のパッセージをダウンロードしてパッセージＤＢ３６３に格納するための、これも第１実施形態のパッセージ検索部９６と同様のパッセージ検索部３９６を含む。 The response candidate search device 360 further receives the response candidate URL group 394, searches the Internet 62, downloads response candidate passages from the URLs stored in the records of each response candidate, and stores them in the passage DB 363. , this also includes a passage search section 396 similar to the passage search section 96 of the first embodiment.

Ｂ．学習装置
図８を参照して、応答候補検索装置３６０の質問ＢＥＲＴ３８０の学習を行うための学習装置４００は、インターネット６２をクロールして、質問をその前後のいくつかの文（以下、このように質問とその前後の文の集まりを質問パッセージという）とともにダウンロードし、さらに各質問に対する応答として適切なパッセージをダウンロードするための質問・パッセージ収集部４１０と、質問・パッセージ収集部４１０がダウンロードした質問パッセージを記憶するための質問パッセージＤＢ４１２と、質問・パッセージ収集部４１０がダウンロードしたパッセージを、質問と関係付けて記憶するための、図２に示すものと同様の構成を持つパッセージＤＢ１６４とを含む。 B. Learning Device Referring to FIG. 8, a learning device 400 for learning the question BERT 380 of the response candidate search device 360 crawls the Internet 62 and analyzes the question by searching several sentences before and after it (hereinafter, like this). A question/passage collection unit 410 downloads a question and a collection of sentences before and after the question (called a question passage), and further downloads an appropriate passage as a response to each question, and the question passage downloaded by the question/passage collection unit 410. and a passage DB 164 having a configuration similar to that shown in FIG. 2, for storing passages downloaded by the question/passage collection unit 410 in association with questions.

学習装置４００はさらに、トピックモデル３７４と、質問パッセージＤＢ４１２に記憶された各質問パッセージ、パッセージＤＢ１６４に記憶されたパッセージ、及びトピックモデル３７４から得られるトピックに関する単語とを使用して作成した学習データにより、質問ＢＥＲＴ３８０及び応答候補ＢＥＲＴ４１８の学習をＳｉａｍｅｓｅＢＥＲＴネットワークにより実行するためのＢＥＲＴ学習部４１４を含む。第１実施形態における質問ＢＥＲＴ１６８及び回答候補ＢＥＲＴ１７０と同様、質問ＢＥＲＴ３８０及び応答候補ＢＥＲＴ４１８も互いに同じ構成であり、学習時の一方のパラメータの更新は他方のパラメータの更新に反映される。実際には、質問ＢＥＲＴ３８０と応答候補ＢＥＲＴ４１８の構成は、第１実施形態の質問ＢＥＲＴ１６８及び回答候補ＢＥＲＴ１７０と同一であり、学習の結果、その内部のパラメータが異なってくる。 The learning device 400 further uses learning data created using the topic model 374, each question passage stored in the question passage DB 412, the passage stored in the passage DB 164, and the words related to the topic obtained from the topic model 374. , a BERT learning unit 414 for performing learning of the question BERT 380 and response candidate BERT 418 using the Siamese BERT network. Similar to the question BERT 168 and answer candidate BERT 170 in the first embodiment, the question BERT 380 and response candidate BERT 418 have the same configuration, and updating of one parameter during learning is reflected in updating of the other parameter. In reality, the configurations of the question BERT 380 and the response candidate BERT 418 are the same as the question BERT 168 and the answer candidate BERT 170 of the first embodiment, and their internal parameters differ as a result of learning.

学習装置４００はさらに、ＢＥＲＴ学習部４１４による訓練が終了した応答候補ＢＥＲＴ４１８を使用して、パッセージＤＢ１６４に記憶されているパッセージを所定個数のクラスタにクラスタリングし各クラスタにクラスタ識別子を付与するためのパッセージクラスタリング部１７２と、パッセージクラスタリング部１７２によりクラスタリングされた各応答候補に、その属するクラスタのクラスタ識別子を付して記憶する応答候補ＤＢ３９０と、パッセージクラスタリング部１７２によるクラスタリングの結果として得られた各クラスタのセントロイドベクトルと、その代表するクラスタのクラスタ識別子とを含むレコードを各クラスタについて記憶するためのセントロイドＤＢ３７８とを含む。なおパッセージクラスタリング部１７２は、質問パッセージＤＢ４１２に記憶されている各質問について、その質問に関するパッセージが最も多く属するクラスタのクラスタ識別子を質問パッセージＤＢ４１２の各質問のレコードに付与する。 The learning device 400 further uses the response candidate BERT 418 that has been trained by the BERT learning unit 414 to cluster the passages stored in the passage DB 164 into a predetermined number of clusters, and assigns a cluster identifier to each cluster. Clustering unit 172; a response candidate DB 390 that stores each response candidate clustered by the passage clustering unit 172 with a cluster identifier of the cluster to which it belongs; and a response candidate DB 390 that stores each response candidate clustered by the passage clustering unit 172; It includes a centroid DB 378 for storing, for each cluster, a record including a centroid vector and a cluster identifier of the cluster it represents. Note that for each question stored in the question passage DB 412, the passage clustering unit 172 assigns to each question record in the question passage DB 412 the cluster identifier of the cluster to which the largest number of passages related to the question belong.

学習装置４００はさらに、質問パッセージＤＢ４１２に記憶された質問と、セントロイドＤＢ３７８に記憶された各クラスタのセントロイドに関する情報とを使用して質問ＢＥＲＴ３８０の追加学習を行うための追加学習部４２２を含む。質問ＢＥＲＴ３８０に対し追加学習部４２２による追加学習を行うことにより、図７に示す質問ＢＥＲＴ３８０が得られる。 The learning device 400 further includes an additional learning unit 422 for performing additional learning of the question BERT 380 using the questions stored in the question passage DB 412 and information regarding the centroid of each cluster stored in the centroid DB 378. . By performing additional learning on the question BERT 380 by the additional learning unit 422, the question BERT 380 shown in FIG. 7 is obtained.

図９に、ＢＥＲＴ学習部４１４の機能的構成をブロック図形式により示す。図９を参照して、ＢＥＲＴ学習部４１４は、図８に示す質問パッセージＤＢ４１２に記憶された各質問とその質問を含む質問パッセージとを受け、トピックモデル３７４を使用してその質問を含む文脈のトピックを示す１又は複数の単語を質問に付与するためのトピック付与部４５０と、パッセージＤＢ１６４に記憶されている各応答候補パッセージに対し、トピックモデル３７４を使用してその応答候補パッセージのトピックを示す１又は複数の単語を応答候補パッセージに付与するためのトピック付与部４５２とを含む。トピック付与部４５０の出力、トピック付与部４５２の出力、及び応答候補パッセージが質問に対する応答を与えるパッセージであれば１であり、さもなければ０であるラベルとにより、質問ＢＥＲＴ３８０の学習データが生成される。 FIG. 9 shows the functional configuration of the BERT learning section 414 in block diagram form. Referring to FIG. 9, the BERT learning unit 414 receives each question and the question passage including the question stored in the question passage DB 412 shown in FIG. A topic adding unit 450 for adding one or more words indicating a topic to a question and a topic model 374 are used for each response candidate passage stored in the passage DB 164 to indicate the topic of the response candidate passage. and a topic adding unit 452 for adding one or more words to the response candidate passage. Learning data for the question BERT 380 is generated by the output of the topic assignment unit 450, the output of the topic assignment unit 452, and a label that is 1 if the response candidate passage provides an answer to the question, and 0 otherwise. Ru.

質問ＢＥＲＴ４１６は、ＢＥＲＴ４８０と、ＢＥＲＴ４８０の最終層の各要素に対して平均プーリングを行ってベクトル４５４（Ｕ）を出力するためのＰｏｏｌｉｎｇ層４８２とを含む。同様に、応答候補ＢＥＲＴ４１８は、ＢＥＲＴ４９０と、ＢＥＲＴ４８４の最終層の各要素に対して平均プーリングを行ってベクトル４５６（Ｖ）を出力するためのＰｏｏｌｉｎｇ層４９２とを含む。 The query BERT 416 includes a BERT 480 and a pooling layer 482 for performing average pooling on each element of the final layer of the BERT 480 and outputting a vector 454(U). Similarly, the response candidate BERT 418 includes a BERT 490 and a pooling layer 492 for performing average pooling on each element of the final layer of the BERT 484 and outputting a vector 456(V).

ＢＥＲＴ学習部４１４はさらに、トピック付与部４５０の出力を質問ＢＥＲＴ４１６に、トピック付与部４５２の出力を応答候補ＢＥＲＴ４１８にそれぞれ与え、質問ＢＥＲＴ４１６の出力するベクトル４５４と応答候補ＢＥＲＴ４１８の出力するベクトル４５６とのコサイン類似度を算出するためのコサイン類似度算出部２０４と、コサイン類似度算出部２０４の出力する値（［－１，１］の範囲）を［０，１］の範囲に正規化するための正規化処理部２０６と、正規化処理部２０６により得られた正規化後のコサイン類似度がラベル２１０と等しくなる方向に、ＢＥＲＴ４８０及び４９０の各パラメータを更新するためのパラメータ更新部２０８とを含む。 The BERT learning unit 414 further provides the output of the topic assignment unit 450 to the question BERT 416 and the output of the topic assignment unit 452 to the response candidate BERT 418, and calculates the difference between the vector 454 output from the question BERT 416 and the vector 456 output from the response candidate BERT 418. A cosine similarity calculation unit 204 for calculating cosine similarity, and a cosine similarity calculation unit 204 for normalizing the value output from the cosine similarity calculation unit 204 (range of [-1, 1]) to the range of [0, 1]. It includes a normalization processing unit 206 and a parameter updating unit 208 for updating each parameter of the BERT 480 and 490 in a direction in which the cosine similarity after normalization obtained by the normalization processing unit 206 becomes equal to the label 210. .

ＢＥＲＴ学習部４１４は、上記した学習データを用い、こうした更新処理を所定の終了条件が成立するまで繰り返し実行する機能を持つ。 The BERT learning unit 414 has a function of repeatedly executing such update processing using the above-mentioned learning data until a predetermined termination condition is satisfied.

追加学習部４２２の構成は、第１の実施形態において図６に示した追加学習部１７４と実質的に同一である。ただしこの例においては、質問に対してその質問パッセージのトピックが付されている点が追加学習部１７４と異なる。 The configuration of the additional learning section 422 is substantially the same as the additional learning section 174 shown in FIG. 6 in the first embodiment. However, this example differs from the additional learning section 174 in that the topic of the question passage is attached to the question.

２．動作
この第２実施形態においても、最初に図８に示す学習装置４００による質問ＢＥＲＴ４１６の学習、及びセントロイドＤＢ３７８と応答候補ＤＢ３９０の生成が行われる。さらに追加学習部４２２により質問ＢＥＲＴ４１６に対する追加学習が行われる。この結果、質問ＢＥＲＴ３８０が得られ、図７に示す対話装置３５０の処理が可能になる。 2. Operation Also in this second embodiment, first, the learning device 400 shown in FIG. 8 learns the question BERT 416 and generates the centroid DB 378 and response candidate DB 390. Further, the additional learning unit 422 performs additional learning on the question BERT 416. As a result, the question BERT 380 is obtained, and the processing of the dialog device 350 shown in FIG. 7 becomes possible.

２－１．質問ＢＥＲＴ３８０の学習
質問ＢＥＲＴ３８０の学習の流れは、概略、第１実施形態における質問ＢＥＲＴ８０の学習の流れと同様である。 2-1. Learning of Question BERT380 The learning flow of Question BERT380 is roughly the same as the learning flow of Question BERT80 in the first embodiment.

図８を参照して、質問ＢＥＲＴ３８０の学習時には学習装置１５０は以下のように動作する。まず質問・パッセージ収集部４１０が、インターネット６２をクロールし、質問文及びその質問文の前後を含む質問パッセージを収集し質問パッセージＤＢ４１２に記憶する。質問・パッセージ収集部４１０はさらに、質問パッセージＤＢ４１２に記憶された質問文の各々について、その質問に対する応答を含むと考えられる複数のパッセージをさらにインターネット６２から収集する。質問・パッセージ収集部４１０は、収集したパッセージを質問パッセージＤＢ４１２に記憶された質問と関連付けてパッセージＤＢ１６４に格納する。質問・パッセージ収集部４１０によるこれら処理は従来技術により実現できる。 Referring to FIG. 8, learning device 150 operates as follows when learning question BERT380. First, the question/passage collection unit 410 crawls the Internet 62, collects question sentences and question passages including the parts before and after the question sentences, and stores them in the question passage DB 412. The question/passage collection unit 410 further collects from the Internet 62, for each question text stored in the question passage DB 412, a plurality of passages that are considered to include a response to that question. The question/passage collection unit 410 stores the collected passages in the passage DB 164 in association with the questions stored in the question passage DB 412. These processes by the question/passage collection unit 410 can be realized using conventional techniques.

次にＢＥＲＴ学習部４１４が、質問パッセージＤＢ４１２に記憶された質問の各々及びその質問パッセージと、これら質問に対してパッセージＤＢ１６４に記憶された応答候補であるパッセージとから、トピックモデル３７４を用いて正例と負例とからなる学習データを生成する。ＢＥＲＴ学習部４１４はこれら学習データを用いて質問ＢＥＲＴ４１６と応答候補ＢＥＲＴ４１８との学習をＳｉａｍｅｓｅＢＥＲＴネットワークにより同時に行う。 Next, the BERT learning unit 414 uses the topic model 374 to correct Generate learning data consisting of examples and negative examples. The BERT learning unit 414 uses these learning data to simultaneously learn the question BERT 416 and the response candidate BERT 418 using the Siamese BERT network.

パッセージクラスタリング部１７２は、このように学習が終わった応答候補ＢＥＲＴ４１８を用いて、パッセージＤＢ１６４に記憶されたパッセージを全てパッセージベクトルに変換する。パッセージクラスタリング部１７２はさらに、それらパッセージベクトルをｋ平均法により所定個数のクラスタに分類する。これらクラスタにはそれぞれ識別子が割り当てられる。パッセージクラスタリング部１７２はさらに、各クラスタのセントロイドのベクトルを算出し、クラスタの識別子をセントロイドの識別子に割り当てる。パッセージクラスタリング部１７２は、こうして得られた各セントロイドについて、クラスタ識別子とそのセントロイドのベクトルとを組にしてセントロイドＤＢ３７８に登録する。 The passage clustering unit 172 converts all the passages stored in the passage DB 164 into passage vectors using the response candidate BERT 418 that has been trained in this way. The passage clustering unit 172 further classifies the passage vectors into a predetermined number of clusters using the k-means method. Each of these clusters is assigned an identifier. The passage clustering unit 172 further calculates the centroid vector of each cluster and assigns the cluster identifier to the centroid identifier. For each centroid thus obtained, the passage clustering unit 172 registers the cluster identifier and the vector of the centroid as a pair in the centroid DB 378.

一方、パッセージクラスタリング部１７２は、各クラスタに属するパッセージベクトルに対応するパッセージのＵＲＬと、その属するクラスタの識別子とを組にしたレコードを応答候補ＤＢ３９０に登録する。またパッセージクラスタリング部１７２は、質問パッセージＤＢ４１２に記憶されている各質問のレコードに、その質問に対する応答を最も多く含むクラスタのクラスタ識別子を追加する。 On the other hand, the passage clustering unit 172 registers in the response candidate DB 390 a record that is a set of the URL of the passage corresponding to the passage vector belonging to each cluster and the identifier of the cluster to which it belongs. The passage clustering unit 172 also adds, to each question record stored in the question passage DB 412, the cluster identifier of the cluster containing the most responses to the question.

セントロイドＤＢ３７８へのセントロイドの登録が完了すると、追加学習部４２２が、質問パッセージＤＢ４１２に記憶されている各質問に対し、その質問に対応するセントロイドのベクトルをセントロイドＤＢ３７８から読み出す。追加学習部４２２は、各質問に対して、セントロイドＤＢ３７８から読み出したセントロイドベクトルを正解データとして質問ＢＥＲＴ３８０の追加学習を行う。所定の終了条件が成立した時点で質問ＢＥＲＴ３８０の学習が終了する。 When the registration of the centroid in the centroid DB 378 is completed, the additional learning unit 422 reads out the centroid vector corresponding to each question stored in the question passage DB 412 from the centroid DB 378. The additional learning unit 422 performs additional learning of the question BERT 380 using the centroid vector read from the centroid DB 378 as correct answer data for each question. Learning of the question BERT 380 ends when a predetermined end condition is met.

２－２．応答の生成
図７を参照して、追加学習後の質問ＢＥＲＴ３８０を持つ応答候補検索装置３６０、及び応答候補検索装置３６０を含む対話装置３５０は以下のように動作する。なお上記した学習により、応答候補ＤＢ３９０及びセントロイドＤＢ３７８も既に得られている。また、トピックモデル３７４としては学習に用いられたものと同じものを用いる。 2-2. Generation of Responses Referring to FIG. 7, response candidate search device 360 having question BERT 380 after additional learning and interaction device 350 including response candidate search device 360 operate as follows. Note that the response candidate DB 390 and centroid DB 378 have already been obtained through the above-described learning. Further, as the topic model 374, the same one used for learning is used.

対話相手から何らかの発話３６２が入力されたものとする。発話３６２はトピック付与部３７６、対話履歴管理部３７０及び応答生成部３６４に与えられる。対話履歴管理部３７０はこのようにして受けた過去の一定期間の対話の履歴を、各対話相手について対話履歴ＤＢ３７２に保存する。一方、トピック付与部３７６は、対話履歴ＤＢ３７２において対話相手の過去の発話履歴を検索し、それら発話から内容語を抽出する。トピック付与部３７６はさらに、これら内容語をトピックモデル３７４に入力し、それぞれトピックを表す１又は複数の単語をトピックモデル３７４の出力として受け取る。トピック付与部３７６は、これら１又は複数の単語を発話３６２の後ろに付加して質問ＢＥＲＴ３８０に入力する。 Assume that some kind of utterance 362 has been input from the conversation partner. The utterance 362 is provided to a topic adding section 376, a dialogue history management section 370, and a response generation section 364. The dialogue history management unit 370 stores the history of past dialogues received over a certain period of time in the dialogue history DB 372 for each dialogue partner. On the other hand, the topic assigning unit 376 searches the dialogue partner's past utterance history in the dialogue history DB 372 and extracts content words from those utterances. The topic assigning unit 376 further inputs these content words into the topic model 374 and receives one or more words representing each topic as an output of the topic model 374. The topic adding unit 376 adds these one or more words to the end of the utterance 362 and inputs the result to the question BERT 380.

質問ＢＥＲＴ３８０は、トピックが付加された発話３６２に応答して質問ベクトルを出力する。この質問ベクトルは応答候補クラスタ特定部３８６に与えられる。 Question BERT 380 outputs a question vector in response to topic-added utterance 362. This question vector is given to the response candidate cluster identifying section 386.

応答候補クラスタ特定部３８６は、質問ベクトル３８２に応答して、セントロイドＤＢ３７８に記憶されているセントロイドの中で、そのベクトルと質問ベクトル３８２との間のコサイン類似度が最も大きいものを、所定個数だけ選択し、それらのクラスタ識別子をクラスタ識別子８８として応答候補検索部３９２に与える。 In response to the question vector 382, the response candidate cluster specifying unit 386 selects a centroid having the largest cosine similarity between the vector and the question vector 382 among the centroids stored in the centroid DB 378. The selected number of cluster identifiers are given to the response candidate search unit 392 as cluster identifiers 88.

応答候補検索部３９２は応答候補ＤＢ３９０を検索し、このクラスタ識別子３８８のいずれかと等しいクラスタ識別子を持つパッセージ（応答候補）のレコードを全て取り出し、それらのＵＲＬをまとめて応答候補ＵＲＬ群３９４として出力する。パッセージ検索部３９６は、この応答候補ＵＲＬ群３９４を受けて、応答候補ＵＲＬ群３９４に含まれる各ＵＲＬを使用してインターネット６２にアクセスすることにより、応答候補のパッセージをダウンロードしパッセージＤＢ３６３に格納する。 The response candidate search unit 392 searches the response candidate DB 390, extracts all records of passages (response candidates) having a cluster identifier equal to one of the cluster identifiers 388, and outputs the URLs together as a response candidate URL group 394. . The passage search unit 396 receives the response candidate URL group 394, accesses the Internet 62 using each URL included in the response candidate URL group 394, downloads the response candidate passage, and stores it in the passage DB 363. .

応答生成部３６４は、パッセージＤ３Ｂ６６に格納されたパッセージ群の中から、発話３６２に対する応答として最も適切なものを選択し、応答整形部３６８に与える。応答整形部３６８は、入力を対話に相応しい形式に整形して応答３６６として出力する。応答生成部３６４による応答の選択及び生成方法は従来の手法をそのまま利用できる。 The response generation unit 364 selects the most appropriate response to the utterance 362 from among the passage group stored in the passage D3B 66, and provides it to the response shaping unit 368. The response formatter 368 formats the input into a format suitable for dialogue and outputs it as a response 366. The response selection and generation method by the response generation unit 364 can be the same as the conventional method.

３．効果
この第２実施形態によれば、入力された発話に対する応答を生成するために処理するパッセージは、図７に示す応答候補クラスタ特定部３８６により選択された１又は複数の線路ロイドに対応するクラスタに属するベクトルに対応するものに限定される。そのため、最適な応答は、従来のように質問回答システムから得られた複数の回答をもとに生成された複数の応答候補から選択する必要がないので、応答生成部３６４が応答を生成する際に行う処理は、従来と比較して大幅に削減される。また、処理に必要な計算資源も大幅に削減される。また、実施形態１と異なり、応答候補パッセージを検索する際に、発話のトピックを情報として用いる。そのため、得られる応答は対話のトピックに相応しいものとなる。その結果、従来よりも少ない資源を用いて高速に、かつ十分な精度をもって動作可能な対話装置を提供できる。 3. Effects According to the second embodiment, the passage to be processed to generate a response to an input utterance is a cluster corresponding to one or more railroad tracks selected by the response candidate cluster identification unit 386 shown in FIG. is limited to those corresponding to vectors belonging to . Therefore, the optimal response does not need to be selected from a plurality of response candidates generated based on a plurality of answers obtained from a question answering system as in the past, so when the response generation unit 364 generates a response, The amount of processing required is significantly reduced compared to conventional methods. Additionally, the computational resources required for processing are also significantly reduced. Also, unlike the first embodiment, the topic of the utterance is used as information when searching for response candidate passages. Therefore, the responses obtained will be appropriate to the topic of the conversation. As a result, it is possible to provide an interactive device that can operate at high speed and with sufficient accuracy using fewer resources than before.

第３コンピュータによる実現
図１０は、例えば図１に示す質問回答装置５０として動作するコンピュータシステムの外観図である。図１１は、図１０に示すコンピュータシステムのハードウェアブロック図である。図２に示す学習装置１５０、図７に示す対話装置３５０及び図８に示す学習装置４００についてもそれぞれ質問回答装置５０とほぼ同様の構成のコンピュータシステムにより実現できる。ここでは質問回答装置５０として動作するコンピュータシステムの構成についてのみ述べることとし、他の装置を実現するコンピュータシステムの構成の詳細については述べない。 Third Realization by Computer FIG. 10 is an external view of a computer system that operates as, for example, the question answering device 50 shown in FIG. 1. FIG. 11 is a hardware block diagram of the computer system shown in FIG. 10. The learning device 150 shown in FIG. 2, the dialogue device 350 shown in FIG. 7, and the learning device 400 shown in FIG. 8 can each be realized by a computer system having substantially the same configuration as the question answering device 50. Here, only the configuration of the computer system that operates as the question answering device 50 will be described, and details of the configuration of the computer system that implements other devices will not be described.

図１０を参照して、このコンピュータシステム９５０は、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）ドライブ１００２を有するコンピュータ９７０と、いずれもコンピュータ９７０に接続された、ユーザと対話するためのキーボード９７４、マウス９７６、及びモニタ９７２とを含む。もちろんこれらはユーザ対話が必要となったときのための構成の一例であって、ユーザ対話に利用できる一般のハードウェア及びソフトウェア（例えばタッチパネル、音声入力、ポインティングデバイス一般）であればどのようなものも利用できる。 Referring to FIG. 10, this computer system 950 includes a computer 970 having a DVD (Digital Versatile Disc) drive 1002, a keyboard 974, a mouse 976, and a monitor for interacting with the user, all connected to the computer 970. 972. Of course, these are examples of configurations for when user interaction is required, and any general hardware and software (e.g. touch panel, voice input, general pointing device) that can be used for user interaction can be used. Also available.

図１１を参照して、コンピュータ９７０は、ＤＶＤドライブ１００２に加えて、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）９９０と、ＧＰＵ（ＧｒａｐｈｉｃｓＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）９９２と、ＣＰＵ９９０、ＧＰＵ９９２、ＤＶＤドライブ１００２に接続されたバス１０１０とを含む。コンピュータ９７０はさらに、バス１０１０に接続され、コンピュータ９７０のブートアッププログラムなどを記憶するＲＯＭ（Ｒｅａｄ－ＯｎｌｙＭｅｍｏｒｙ）９９６と、バス１０１０に接続され、プログラムを構成する命令、システムプログラム、及び作業データなどを記憶するＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）９９８と、バス１０１０に接続された不揮発性メモリであるＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）１０００とを含む。ＳＳＤ１０００は、ＣＰＵ９９０及びＧＰＵ９９２が実行するプログラム、並びにＣＰＵ９９０及びＧＰＵ９９２が実行するプログラムが使用するデータなどを記憶するためのものである。コンピュータ９７０はさらに、他端末との通信を可能とするネットワーク９８６（図７に示す応答生成部３６４）への接続を提供するネットワークＩ／Ｆ（Ｉｎｔｅｒｆａｃｅ）１００８と、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）メモリ９８４が着脱可能であり、ＵＳＢメモリ９８４とコンピュータ９７０内の各部との通信を提供するＵＳＢポート１００６とを含む。 Referring to FIG. 11, in addition to a DVD drive 1002, a computer 970 includes a CPU (Central Processing Unit) 990, a GPU (Graphics Processing Unit) 992, and a bus 101 connected to the CPU 990, GPU 992, and DVD drive 1002. 0 and including. The computer 970 further includes a ROM (Read-Only Memory) 996 that is connected to the bus 1010 and stores boot-up programs for the computer 970, etc. A RAM (Random Access Memory) 998 for storing , and an SSD (Solid State Drive) 1000 which is a nonvolatile memory connected to a bus 1010 are included. The SSD 1000 is for storing programs executed by the CPU 990 and GPU 992, data used by the programs executed by the CPU 990 and GPU 992, and the like. The computer 970 further includes a network I/F (Interface) 1008 that provides connection to a network 986 (response generation unit 364 shown in FIG. 7) that enables communication with other terminals, and a USB (Universal Serial Bus) memory 984. is removable and includes a USB memory 984 and a USB port 1006 that provides communication with various parts within the computer 970.

コンピュータ９７０はさらに、マイクロフォン９８２及びスピーカ９８０とバス１０１０とに接続され、ＣＰＵ９９０により生成されＲＡＭ９９８又はＳＳＤ１０００に保存された音声信号、映像信号及びテキストデータをＣＰＵ９９０の指示に従って読み出し、アナログ変換及び増幅処理をしてスピーカ９８０を駆動したり、マイクロフォン９８２からのアナログの音声信号をデジタル化し、ＲＡＭ９９８又はＳＳＤ１０００の、ＣＰＵ９９０により指定される任意のアドレスに保存したりする機能を持つ音声Ｉ／Ｆ１００４を含む。 The computer 970 is further connected to a microphone 982, a speaker 980, and a bus 1010, reads audio signals, video signals, and text data generated by the CPU 990 and stored in the RAM 998 or the SSD 1000 according to instructions from the CPU 990, and performs analog conversion and amplification processing. It includes an audio I/F 1004 having a function of driving a speaker 980, digitizing an analog audio signal from a microphone 982, and storing it in an arbitrary address specified by the CPU 990 in the RAM 998 or the SSD 1000.

上記実施形態においては、図１に示す質問回答装置５０、図２に示す学習装置１５０、図７に示す対話装置３５０及び図８に示す学習装置４００などの各機能を実現するプログラムなどは、いずれも例えば図１１に示すＳＳＤ１０００、ＲＡＭ９９８、ＤＶＤ９７８又はＵＳＢメモリ９８４、若しくはネットワークＩ／Ｆ１００８及びネットワーク９８６を介して接続された図示しない外部装置の記憶媒体などに格納される。典型的には、これらのデータ及びパラメータなどは、例えば外部からＳＳＤ１０００に書込まれコンピュータ９７０の実行時にはＲＡＭ９９８にロードされる。 In the above embodiment, the programs for realizing each function of the question answering device 50 shown in FIG. 1, the learning device 150 shown in FIG. 2, the dialog device 350 shown in FIG. 7, and the learning device 400 shown in FIG. It is also stored in, for example, the SSD 1000, RAM 998, DVD 978, or USB memory 984 shown in FIG. 11, or a storage medium of an external device (not shown) connected via the network I/F 1008 and network 986. Typically, these data and parameters are written into the SSD 1000 from the outside, for example, and loaded into the RAM 998 when the computer 970 is executed.

このコンピュータシステムを、図１に示す質問回答装置５０、図２に示す学習装置１５０、図７に示す対話装置３５０及び図８に示す学習装置４００、並びにその各構成要素の機能を実現するよう動作させるためのコンピュータプログラムは、ＤＶＤドライブ１００２に装着されるＤＶＤ９７８に記憶され、ＤＶＤドライブ１００２からＳＳＤ１０００に転送される。又は、これらのプログラムはＵＳＢメモリ９８４に記憶され、ＵＳＢメモリ９８４をＵＳＢポート１００６に装着し、プログラムをＳＳＤ１０００に転送する。又は、このプログラムはネットワーク９８６を通じてコンピュータ９７０に送信されＳＳＤ１０００に記憶されてもよい。 This computer system is operated to realize the functions of the question answering device 50 shown in FIG. 1, the learning device 150 shown in FIG. 2, the dialogue device 350 shown in FIG. 7, and the learning device 400 shown in FIG. 8, and their respective components. A computer program for doing this is stored on the DVD 978 installed in the DVD drive 1002 and transferred from the DVD drive 1002 to the SSD 1000. Alternatively, these programs are stored in the USB memory 984, the USB memory 984 is attached to the USB port 1006, and the programs are transferred to the SSD 1000. Alternatively, this program may be transmitted to computer 970 via network 986 and stored on SSD 1000.

プログラムは実行のときにＲＡＭ９９８にロードされる。もちろん、キーボード９７４、モニタ９７２及びマウス９７６を用いてソースプログラムを入力し、コンパイルした後のオブジェクトプログラムをＳＳＤ１０００に格納してもよい。プログラムがスクリプト言語で記述されている場合には、キーボード９７４などを用いて入力したスクリプトをＳＳＤ１０００に格納してもよい。仮想マシン上において動作するプログラムの場合には、仮想マシンとして機能するプログラムを予めコンピュータ９７０にインストールしておく必要がある。音声認識及び音声合成などにはニューラルネットワークが使用される。質問回答装置５０及び対話装置３５０においては、学習済のニューラルネットワークを使用してもよいし、質問回答装置５０及び対話装置３５０をそれぞれ学習装置１５０及び４００として使用してニューラルネットワークの学習を行ってもよい。 The program is loaded into RAM 998 during execution. Of course, a source program may be input using the keyboard 974, monitor 972, and mouse 976, and the compiled object program may be stored in the SSD 1000. If the program is written in a script language, the script input using the keyboard 974 or the like may be stored in the SSD 1000. In the case of a program that operates on a virtual machine, it is necessary to install the program that functions as a virtual machine on the computer 970 in advance. Neural networks are used for speech recognition, speech synthesis, etc. In the question answering device 50 and the dialogue device 350, a trained neural network may be used, or the question answering device 50 and the dialogue device 350 may be used as the learning devices 150 and 400, respectively, to perform neural network learning. Good too.

ＣＰＵ９９０は、その内部のプログラムカウンタと呼ばれるレジスタ（図示せず）により示されるアドレスに従ってＲＡＭ９９８からプログラムを読み出して命令を解釈し、命令の実行に必要なデータを命令により指定されるアドレスに従ってＲＡＭ９９８、ＳＳＤ１０００又はそれ以外の機器から読み出して命令により指定される処理を実行する。ＣＰＵ９９０は、実行結果のデータを、ＲＡＭ９９８、ＳＳＤ１０００、ＣＰＵ９９０内のレジスタなど、プログラムにより指定されるアドレスに格納する。アドレスによってはコンピュータから外部出力される。このとき、プログラムカウンタの値もプログラムによって更新される。コンピュータプログラムは、ＤＶＤ９７８から、ＵＳＢメモリ９８４から、又はネットワーク９８６を介して、ＲＡＭ９９８に直接にロードしてもよい。なお、ＣＰＵ９９０が実行するプログラムの中で、一部のタスク（主として数値計算）については、プログラムに含まれる命令により、又はＣＰＵ９９０による命令実行時の解析結果に従って、ＧＰＵ９９２にディスパッチされる。 The CPU 990 reads the program from the RAM 998 according to the address indicated by an internal register called a program counter (not shown), interprets the instruction, and stores the data necessary for executing the instruction in the RAM 998 and the SSD 1000 according to the address specified by the instruction. Or read it from other devices and execute the process specified by the command. The CPU 990 stores the data of the execution result at an address specified by the program, such as the RAM 998, the SSD 1000, or a register within the CPU 990. Some addresses are output externally from the computer. At this time, the value of the program counter is also updated by the program. Computer programs may be loaded directly into RAM 998 from DVD 978, from USB memory 984, or via network 986. Note that in the program executed by the CPU 990, some tasks (mainly numerical calculations) are dispatched to the GPU 992 according to instructions included in the program or according to an analysis result when the CPU 990 executes the instructions.

コンピュータ９７０により上記した実施形態に係る各部の機能を実現するプログラムは、それら機能を実現するようコンピュータ９７０を動作させるように記述され配列された複数の命令を含む。この命令を実行するのに必要な基本的機能のいくつかはコンピュータ９７０上において動作するオペレーティングシステム（ＯＳ）若しくはサードパーティのプログラム、コンピュータ９７０にインストールされる各種ツールキットのモジュール又はプログラムの実行環境により提供される場合もある。したがって、このプログラムはこの実施形態のシステム及び方法を実現するのに必要な機能全てを必ずしも含まなくてよい。このプログラムは、命令の中で、所望の結果が得られるように制御されたやり方によって適切な機能又はモジュールなどをコンパイル時に静的にリンクすることにより、又は実行時に動的に呼出すことにより、上記した各装置及びその構成要素としての動作を実行する命令のみを含んでいればよい。そのためのコンピュータ９７０の動作方法は周知である。したがって、ここではコンピュータ９７０の動作方法の説明は繰り返さない。 A program for realizing the functions of each unit according to the above-described embodiments by the computer 970 includes a plurality of instructions written and arranged to cause the computer 970 to operate to realize those functions. Some of the basic functions required to execute this instruction are provided by the operating system (OS) running on the computer 970, third party programs, modules of various toolkits installed on the computer 970, or the program execution environment. In some cases, it may be provided. Therefore, this program does not necessarily include all the functions necessary to implement the system and method of this embodiment. This program is constructed by statically linking appropriate functions or modules in a controlled manner at compile time or by calling them dynamically at run time in a controlled manner to achieve the desired results. It is sufficient to include only the instructions for executing the operations of each device and its constituent elements. The manner in which computer 970 operates for this purpose is well known. Therefore, the description of how computer 970 operates will not be repeated here.

なお、ＧＰＵ９９２は並列処理を行うことが可能であり、機械学習に伴う多量の計算を同時並列的又はパイプライン的に実行できる。例えばプログラムのコンパイル時にプログラム中に発見された並列的計算要素、又はプログラムの実行時に発見された並列的計算要素は、随時、ＣＰＵ９９０からＧＰＵ９９２にディスパッチされ、実行され、その結果が直接に、又はＲＡＭ９９８の所定アドレスを介してＣＰＵ９９０に返され、プログラム中の所定の変数に代入される。 Note that the GPU 992 can perform parallel processing, and can execute a large amount of calculations associated with machine learning simultaneously in parallel or in a pipeline manner. For example, parallel computing elements found in a program when the program is compiled, or parallel computing elements discovered when the program is executed are dispatched from the CPU 990 to the GPU 992 and executed, and the results are sent directly or to the RAM 998. is returned to the CPU 990 via a predetermined address, and is substituted into a predetermined variable in the program.

第４変形例
上記実施形態においては、ＢＥＲＴの更新の終了条件として、所定の終了条件とだけ述べている。この場合の終了条件とは、例えば全ての学習データについて所定回数だけ使用してＢＥＲＴのパラメータ更新が終了したという条件である。別の条件としては、例えばパラメータの更新において、各パラメータの勾配がほぼゼロに等しくなったときという条件もあり得る。その他、終了条件としては様々なものが考えられる。 Fourth Modified Example In the above embodiment, only a predetermined termination condition is mentioned as the termination condition for updating BERT. The termination condition in this case is, for example, that the BERT parameter update has been completed after all learning data have been used a predetermined number of times. Another condition may be, for example, when the gradient of each parameter becomes approximately equal to zero during parameter updating. In addition, various other termination conditions can be considered.

上記実施形態においては、ＢＥＲＴとしてＢＥＲＴラージを使用することを前提としている。しかしこの発明はＢＥＲＴラージに限定されるわけではない。ＢＥＲＴベースを使用してもよい。またＢＥＲＴと同様、トランスフォーマのエンコーダ又はそれに類似した要素を並べることにより、センテンスベースの変換が行えるようなモデルを使用してもよい。 In the above embodiment, it is assumed that BERT large is used as BERT. However, the invention is not limited to BERT large. A BERT base may also be used. Also, similar to BERT, a model that can perform sentence-based conversion by arranging transformer encoders or similar elements may be used.

さらに、上記実施形態においては、図１に示す質問ＢＥＲＴ８０などにおいて、ＢＥＲＴの出力をベクトル化するために、平均プーリングを行っている。しかし、この発明はそのような実施形態には限定されない。平均プーリングに代えて、最大値プーリングを用いてもよい。 Furthermore, in the above embodiment, average pooling is performed in order to vectorize the output of BERT in the query BERT 80 shown in FIG. 1 and the like. However, the invention is not limited to such embodiments. Maximum pooling may be used instead of average pooling.

上記実施形態においては、ＢＥＲＴの学習のための損失関数として平均二乗誤差を用いている。しかしこの発明はそのような実施形態には限定されない。尺度としてコサイン類似度ではなくソフトマックスを採用し、損失関数としてはクロスエントロピー誤差を用いてもよい。 In the above embodiment, the mean square error is used as the loss function for BERT learning. However, the invention is not limited to such embodiments. Instead of cosine similarity, softmax may be used as the measure, and cross-entropy error may be used as the loss function.

また、上記実施形態においては学習時のパッセージベクトルのクラスタリングにｋ平均法を用いている。しかしこの発明はそのような実施形態に限定されない。Related Minimum Variance基準によるクラスタリング、散布図基準によるクラスタリングなどを使用してもよい。 Further, in the above embodiment, the k-means method is used for clustering passage vectors during learning. However, the invention is not limited to such embodiments. Clustering based on Related Minimum Variance criteria, clustering based on scatter plot criteria, etc. may also be used.

また、上記実施形態においては、応答候補クラスタ特定部３８６には、トピックが付加された発話の質問ベクトルが与えられ、セントロイドＤＢ３７８のセントロイドから適切なセントロイドのクラスタ識別子を得ている。しかしそれに限らず、応答候補クラスタ特定部３８６に、発話に基づく質問ベクトルとベクトル表現されたこれまでの対話履歴を与えて、類似度（あるいは距離）からセントロイドを決定しそのクラスタ識別子を得るようにしてもよい。 Further, in the above embodiment, the response candidate cluster identifying unit 386 is given the question vector of the utterance to which the topic has been added, and obtains an appropriate centroid cluster identifier from the centroids in the centroid DB 378. However, the present invention is not limited to this, and the response candidate cluster identification unit 386 may be provided with a question vector based on utterances and a past dialogue history expressed as a vector, determine a centroid from similarity (or distance), and obtain its cluster identifier. You can also do this.

さらに、上記実施形態では、図１に示す回答候補ＤＢ９０及び図７に示す応答候補ＤＢ３９０には応答候補のＵＲＬが格納されている。しかしこの発明はそのような実施形態には限定されない。回答候補ＤＢ９０及び応答候補ＤＢ３９０に、応答候補のＵＲＬではなく、パッセージそのものを格納しておいてもよい。その場合には、回答候補検索部９２及び応答候補検索部３９２が出力するのは応答候補のＵＲＬ群ではなく応答候補のパッセージそのものとなる。したがって、図１に示すパッセージ検索部９６及び図７に示すパッセージ検索部３９６が必要なくなる。質問に対する回答時又は応答時にインターネット６２からパッセージをダウンロードする必要がなくなるので、処理量はより少なくなり、応答は上記第１実施形態及び第２実施形態よりも速くなる可能性が高い。 Furthermore, in the embodiment described above, the URL of the response candidate is stored in the response candidate DB 90 shown in FIG. 1 and the response candidate DB 390 shown in FIG. However, the invention is not limited to such embodiments. The response candidate DB 90 and the response candidate DB 390 may store the passage itself instead of the URL of the response candidate. In that case, what the answer candidate search unit 92 and the response candidate search unit 392 output is not the URL group of the response candidates but the response candidate passage itself. Therefore, the passage search section 96 shown in FIG. 1 and the passage search section 396 shown in FIG. 7 are no longer necessary. Since there is no need to download passages from the Internet 62 when answering or responding to a question, the amount of processing is less and the response is likely to be faster than in the first and second embodiments.

さらに、図２及び図８に示すパッセージＤＢ１６４は、上記実施形態ではＢＥＲＴの学習が終了すると不要であり廃棄できる。しかし、パッセージＤＢ１６４に大量のパッセージを記憶しておいてもよいだけの余裕が記憶容量にあるならば、パッセージＤＢ１６４を残しておいてもよい。この場合、回答候補ＤＢ９０及び応答候補ＤＢ３９０に、応答候補のパッセージではなく、パッセージＤＢ１６４から該当パッセージを検索するのに必要な情報を格納しておけばよい。検索するのに必要な情報とは、例えばそのパッセージを含むレコードの識別子、又はそのパッセージが存在していたＵＲＬなどである。特にＵＲＬを格納しておくようにすると、パッセージＤＢ１６４にそのパッセージがあればパッセージＤＢ１６４からそのパッセージを取り出すことができ、もしもパッセージＤＢ１６４にそのパッセージがなければ、そのＵＲＬにアクセスしてそのパッセージをダウンロードできる。この場合には回答候補検索部９２及び応答候補検索部３９２が出力するのは応答候補のＵＲＬ群ではあるが、通常はインターネットにアクセスすることなく、ローカルに存在するパッセージＤＢ１６４からそのパッセージを読み出せる。したがって、応答は上記第１実施形態及び第２実施形態よりも速くなる可能性が高い。また、一部のパッセージをパッセージＤＢ１６４から削除したとしても、ＵＲＬが分かっているので、インターネットからそのパッセージをダウンロードできる。 Further, in the above embodiment, the passage DB 164 shown in FIGS. 2 and 8 is unnecessary and can be discarded after BERT learning is completed. However, if the passage DB 164 has enough storage capacity to store a large number of passages, the passage DB 164 may be left alone. In this case, the answer candidate DB 90 and the response candidate DB 390 may store information necessary to search for the relevant passage from the passage DB 164 instead of the response candidate passage. The information necessary for the search is, for example, the identifier of the record containing the passage, or the URL where the passage existed. In particular, if you store the URL, if the passage exists in the passage DB 164, you can retrieve the passage from the passage DB 164, and if the passage does not exist in the passage DB 164, access the URL and download the passage. can. In this case, the answer candidate search unit 92 and the response candidate search unit 392 output a group of response candidate URLs, but normally the passage can be read from the locally existing passage DB 164 without accessing the Internet. . Therefore, the response is likely to be faster than in the first and second embodiments. Further, even if some passages are deleted from the passage DB 164, since the URL is known, the passages can be downloaded from the Internet.

上記した第２実施形態では、対話装置３５０への入力に対する応答候補を含むと思われるパッセージをインターネット６２から検索し、それらパッセージを用いて応答生成部３６４において応答を生成している。しかし、この発明はそのような実施形態には限定されない。例えば発話３６２に対する応答として適切な応答候補があらかじめ準備できるなら、パッセージに代えて、そのような応答候補を応答候補ＤＢ３９０に記憶させておくことができる。発話３６２として様々なものを準備し、それらに対して適切な応答候補が定まれば、それを応答候補ＤＢ３９０に蓄積しておく。それら応答候補が大量に得られれば、それらを用いて第２実施形態で説明した手法により対話装置３５０が構築できる。この場合には、応答候補を含むと思われるパッセージから応答を生成するのではなく、あらかじめ発話に対する適切な応答であるものが集められ、それらから応答が生成される。したがって、応答は第２実施形態で説明した場合よりさらに適切なものとなる。 In the second embodiment described above, the Internet 62 is searched for passages that are thought to include response candidates to the input to the dialogue device 350, and the response generation unit 364 generates a response using these passages. However, the invention is not limited to such embodiments. For example, if an appropriate response candidate can be prepared in advance as a response to the utterance 362, such a response candidate can be stored in the response candidate DB 390 instead of a passage. Various utterances 362 are prepared, and once appropriate response candidates are determined, they are stored in the response candidate DB 390. If a large number of these response candidates are obtained, the dialog device 350 can be constructed using them using the method described in the second embodiment. In this case, rather than generating a response from a passage that is thought to include response candidates, appropriate responses to the utterance are collected in advance and a response is generated from them. Therefore, the response becomes more appropriate than that described in the second embodiment.

そのように発話に対する応答として適切なものをあらかじめ集積する手法として以下のようなものが考えられる。まず、発話を構成する可能性のある名詞を大量に（例えば１００万語）選ぶ。これらの名詞の各々から既存の手法を用いて複数の質問を生成する。それら質問を第１実施形態の質問回答装置５０に入力する。質問回答装置５０の回答に基づいて、元の名詞を含む質問に相応しい応答を作成し、その質問と応答を図８の質問パッセージＤＢ４１２及びパッセージＤＢ１６４に蓄積する。後は第２実施形態と同様の手法で図７及び図８に示す質問ＢＥＲＴ３８０の訓練を行えばよい。発話に対する応答候補を含むと考えられるパッセージではなく、発話に対する応答として相応しい応答候補を使用して対話装置３５０が構築できる。このような変形例では、発話に対して最も適切と考えられる応答候補を事前に生成・選択することから、対話装置における計算処理の重い作業を事前に完了することができる。したがって、推論処理時の高速化及び計算資源の軽量化が図れる。 The following methods can be considered as methods for collecting appropriate responses to utterances in advance. First, a large number of nouns (for example, 1 million words) that may constitute an utterance are selected. Multiple questions are generated from each of these nouns using existing techniques. These questions are input into the question answering device 50 of the first embodiment. Based on the answers from the question answering device 50, a response suitable for the question including the original noun is created, and the question and response are stored in the question passage DB 412 and the passage DB 164 in FIG. After that, the question BERT 380 shown in FIGS. 7 and 8 may be trained using the same method as in the second embodiment. Dialogue device 350 can be constructed using response candidates suitable as responses to utterances, rather than passages that are considered to include response candidates to utterances. In such a modified example, response candidates considered to be most appropriate for the utterance are generated and selected in advance, so that heavy computational work in the dialogue device can be completed in advance. Therefore, it is possible to speed up the inference processing and reduce the weight of computational resources.

なお、図７以下に示す第２実施形態では、訓練及び推論の双方においてトピックモデル３７４を用いている。しかし対話におけるトピック付与では、トピックモデル３７４を用いることが必須というわけではない。例えば対話履歴から何らかの条件を充足する単語など、又は対話履歴そのものを抽出し、トピック付与部３７６により発話３６２に付与してもよい。対話履歴を使用してトピックを特定するのではなく、あらかじめ行われる設定によりトピックを決定してもよい。例えば発話者が明示的にトピックを指定してもよい。また、トピック付与部３７６自体を省略してもよい。 Note that in the second embodiment shown in FIG. 7 and below, the topic model 374 is used in both training and inference. However, it is not essential to use the topic model 374 when assigning topics in dialogue. For example, a word that satisfies some condition or the dialogue history itself may be extracted from the dialogue history and added to the utterance 362 by the topic adding unit 376. Rather than specifying a topic using the conversation history, the topic may be determined based on settings made in advance. For example, the speaker may explicitly specify the topic. Further, the topic adding section 376 itself may be omitted.

今回開示された実施の形態は単に例示であって、本発明が上記した実施の形態のみに制限されるわけではない。本発明の範囲は、発明の詳細な説明の記載を参酌した上で、特許請求の範囲の各請求項によって示され、そこに記載された文言と均等の意味及び範囲内での全ての変更を含む。 The embodiment disclosed this time is merely an example, and the present invention is not limited to the above-described embodiment. The scope of the present invention is indicated by each claim, with reference to the description of the detailed description of the invention, and all changes within the scope and meaning equivalent to the words described therein are defined. include.

５０質問回答装置
６４、９２回答候補検索部
６６、１６４、３６３パッセージＤＢ
６８回答生成部
８０、１６８、３８０、４１６質問ＢＥＲＴ
８２、３８２質問ベクトル
８４、３７８セントロイドＤＢ
８６回答候補クラスタ特定部
８８、３８８クラスタ識別子
９０回答候補ＤＢ
９４回答候補ＵＲＬ群
９６、３９６パッセージ検索部
１５０、４００学習装置
１６０、４１０質問・パッセージ収集部
１６２質問ＤＢ
１６６、４１４ＢＥＲＴ学習部
１７０回答候補ＢＥＲＴ
１７２パッセージクラスタリング部
１７４、４２２追加学習部
２０４、３１４コサイン類似度算出部
２０６、３１６正規化処理部
２０８、３１８パラメータ更新部
２１０ラベル
２２０、２３０、４８０、４８４、４９０ＢＥＲＴ
２２２、２３２、４８２、４９２Ｐｏｏｌｉｎｇ層
２５０、２５２クラスタ
３５０対話装置
３６０応答候補検索装置
３６２発話
３６４応答生成部
３６８応答整形部
３７０対話履歴管理部
３７２対話履歴ＤＢ
３７４トピックモデル
３７６、４５０、４５２トピック付与部
３８６応答候補クラスタ特定部
３９０応答候補ＤＢ
３９２応答候補検索部
３９４応答候補ＵＲＬ群
４１２質問パッセージＤＢ
４１８応答候補ＢＥＲＴ
50 Question answering device 64, 92 Answer candidate search unit 66, 164, 363 Passage DB
68 Answer generation unit 80, 168, 380, 416 Question BERT
82, 382 Question vector 84, 378 Centroid DB
86 Answer candidate cluster identification unit 88, 388 Cluster identifier 90 Answer candidate DB
94 Answer candidate URL group 96, 396 Passage search unit 150, 400 Learning device 160, 410 Question/passage collection unit 162 Question DB
166, 414 BERT learning section 170 Answer candidate BERT
172 Passage clustering unit 174, 422 Additional learning unit 204, 314 Cosine similarity calculation unit 206, 316 Normalization processing unit 208, 318 Parameter update unit 210 Label 220, 230, 480, 484, 490 BERT
222, 232, 482, 492 Pooling layer 250, 252 Cluster 350 Dialogue device 360 Response candidate search device 362 Utterance 364 Response generation section 368 Response shaping section 370 Dialogue history management section 372 Dialogue history DB
374 Topic models 376, 450, 452 Topic assignment unit 386 Response candidate cluster identification unit 390 Response candidate DB
392 Response candidate search unit 394 Response candidate URL group 412 Question passage DB
418 Response candidate BERT

Claims

a first neural network provided with a first input and outputting a vector representation of the first input;
a second neural network provided with a second input and outputting a vector representation of the second input;
At least, when the vector representation of the first input and the vector representation of the second input have the predetermined relationship using learning data of the first and second inputs that have a predetermined relationship, learning the first and second neural networks to be located close to each other in vector space;
The vector representation that is the output of the second neural network that has been trained is clustered based on the position on the vector space, and based on the vector representation of the first input, it is possible to search and extract the cluster in advance. further including the constructed database,
An inference device that infers an output having the predetermined relationship based on information on the clusters searched and extracted from the database based on a vector representation of the input by the first neural network with respect to an input to the device.

2. The inference device according to claim 1, wherein the database is searched and extracted using a centroid of a vector representation of an output included in each cluster.

The first neural network has undergone additional learning based on the first input and a vector representation related to a cluster to which the second input, which has the predetermined relationship with the first input, belongs. The inference device according to claim 1 or claim 2, characterized in that:

A question answering device including an inference device according to any one of claims 1 to 3, wherein the predetermined relationship includes a question and an answer to the question.

4. An interaction device including an inference device according to claim 1, wherein the predetermined relationship includes an utterance and a response to the utterance.

a first neural network provided with a first input and outputting a vector representation of the first input; and a second neural network provided with a second input and outputting a vector representation of the second input. and a step of preparing the
At least, when the vector representation of the first input and the vector representation of the second input have the predetermined relationship using learning data of the first and second inputs that have a predetermined relationship, training the first and second neural networks to be located close to each other in vector space;
The vector representation that is the output of the second neural network that has been trained is clustered based on the position on the vector space, and based on the vector representation of the first input, it is possible to search and extract the cluster in advance. a step of building a database;
and inferring an output having the predetermined relationship based on the cluster information searched and extracted from the database based on the vector representation of the input by the first neural network. .