JP2023167720A

JP2023167720A - Dialogue device, dialogue system, and dialogue method

Info

Publication number: JP2023167720A
Application number: JP2022079113A
Authority: JP
Inventors: 尚和内田; Hisakazu Uchida; 健本間; Takeshi Honma; 真岩山; Makoto Iwayama
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2022-05-12
Filing date: 2022-05-12
Publication date: 2023-11-24

Abstract

To provide a dialogue system capable of returning an appropriate response suitable for each category relative to a harmful input of various categories.SOLUTION: A dialogue system includes an information processing device 205 and a server 201 connected to the information processing device via a network 202. The server includes: an input unit 401 for receiving input data; a plurality of dialogue models 408 that is trained so as to output a safe response sentence to harmful input data for each attack target by the input data to each generate a response sentence to the input data; and a response selection unit 405 for selecting and outputting the response sentence from a plurality of response sentences based on a predetermined reference.SELECTED DRAWING: Figure 4A

Description

本発明は、対話装置、対話システム、及び、対話方法に関する。 The present invention relates to a dialogue device, a dialogue system, and a dialogue method.

Ｗｅｂ上からＳＮＳやチャットの対話ログを収集して作成した対話システムでは、誹謗中傷などの不適切な応答を防止するため、例えば、特開２００６－１１５９２（特許文献１）に記載の技術がある。この公報では、禁止用語を含む文を出力することを防止することを目的とする。そのため、分割部８１は、入力された文を単語に分割する。単語リスト記憶部８３は、文中での使用を許可する単語を記憶する。文削除部８２は、分割部８１から供給される文のすべての単語が、単語リスト記憶部８３に記憶される文中での使用を許可する単語である場合、その文を出力する。特許文献１には、例えば、対話を行うロボット装置の対話の応答文の作成に利用される入出力ペア（の出力例）の取捨選択に適用する、技術が記載されている。 In a dialogue system created by collecting SNS and chat dialogue logs from the Web, there is a technique described in, for example, Japanese Patent Laid-Open No. 2006-11592 (Patent Document 1) to prevent inappropriate responses such as slander. . The purpose of this publication is to prevent sentences containing prohibited words from being output. Therefore, the dividing unit 81 divides the input sentence into words. The word list storage unit 83 stores words that are permitted to be used in a sentence. If all the words in the sentence supplied from the dividing unit 81 are words that are allowed to be used in the sentence stored in the word list storage unit 83, the sentence deletion unit 82 outputs the sentence. Patent Document 1 describes, for example, a technique applied to selecting (output examples of) input/output pairs used for creating a response sentence for a dialogue of a robot device that performs a dialogue.

また、非特許文献１には、事前学習済言語モデルを用いた対話システムにおいて、誹謗中傷を含む発話とそれに対して適切な応答を返している対話ログをＳＮＳから収集し、この対話ログで言語モデルを追加学習することで誹謗中傷の発話に対して適切な応答を出力する技術が記載されている。 Furthermore, in Non-Patent Document 1, in a dialogue system using a pre-trained language model, dialogue logs of utterances containing slander and appropriate responses are collected from SNS, and this dialogue log is used to evaluate the language A technology is described that outputs an appropriate response to slanderous utterances by additionally learning a model.

特開２００６－１１５９２号公報Japanese Patent Application Publication No. 2006-11592

Ashutosh Baheti他2名“Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts”、Georgia Institute of Technology, Atlanta, GA, USA [online]＜URL：https://arxiv.org/abs/2108.11830＞Ashutosh Baheti and 2 others “Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts”, Georgia Institute of Technology, Atlanta, GA, USA [online]＜URL: https://arxiv.org/abs/2108.11830＞

特許文献１では応答に含まれる単語のみで出力可否、つまり、不適切か否かを判定しており、有害な単語は含まれていなくとも意味としては不適切となる応答を抑制することができない。例えば、誹謗中傷に対して「私もそう思う。」と応答した場合、不適切な応答となるが、応答文に有害な単語は含まれておらず、単語のみでは不適切と判定することができない。 In Patent Document 1, it is determined whether the response can be outputted, that is, whether it is inappropriate or not, based only on the words included in the response, and it is not possible to suppress responses that are inappropriate in meaning even if they do not include harmful words. . For example, if you respond to slander with "I think so too," it will be an inappropriate response, but the response does not contain any harmful words, and the words alone cannot be judged as inappropriate. Can not.

非特許文献１はこのような課題を解決するものだが、有害発言には、人種・民族差別やジェンダー差別などさまざまなカテゴリーがあるのに対し、これらを一つの有害性として扱っていることから、有害発言によってカテゴリーに応じた適切な応答を出力できない場合がある。 Non-Patent Document 1 solves this problem, but since there are various categories of harmful speech, such as racial/ethnic discrimination and gender discrimination, it treats these as one type of harmful speech. , it may not be possible to output an appropriate response according to the category due to harmful comments.

そこで、本発明は、さまざまなカテゴリーの有害入力に対して、それぞれのカテゴリーに合った適切な応答を返すことができる対話システムを提供する。 Accordingly, the present invention provides an interaction system that can respond to harmful inputs of various categories and return appropriate responses suitable for each category.

上記目的を達成するために、例えば特許請求の範囲に記載の構成を採用する。 In order to achieve the above object, for example, the configurations described in the claims are adopted.

本願は上記課題を解決する手段を複数含んでいるが、その一例を挙げるならば、情報処理装置と、情報処理装置にネットワークを介して接続されるサーバを有する対話システムである。サーバは、情報処理装置から、投稿コメントやその投稿コメントに対する返信コメント等の入力文を含む入力データを受け付ける入力部と、入力データによる攻撃対象ごとに、有害な入力データに対し安全な応答文を出力するよう学習され、入力データに対し、それぞれが応答文を生成する複数の対話モデルと、複数の対話モデルから生成される複数の応答文から、所定の基準に基づいて、一つの応答文を選択して出力する応答選択部と、を備える対話システムである。 The present application includes a plurality of means for solving the above problems, and one example thereof is an interaction system having an information processing device and a server connected to the information processing device via a network. The server has an input section that receives input data including input text such as posted comments and reply comments to the posted comments from the information processing device, and a safe response text for each attack target by the input data against harmful input data. A single response sentence is generated based on a predetermined standard from multiple dialogue models that are trained to output and each generates a response sentence in response to input data, and multiple response sentences generated from the multiple dialogue models. This is a dialogue system including a response selection unit that selects and outputs a response.

本発明によれば、さまざまな種類の有害発言に対して、カテゴリーに応じた適切な応答を返すことができる対話システムを提供することができる。 According to the present invention, it is possible to provide a dialogue system that can respond to various types of harmful comments with appropriate responses according to their categories.

上記した以外の課題、構成及び効果は、以下の実施形態の説明により明らかにされる。 Problems, configurations, and effects other than those described above will be made clear by the following description of the embodiments.

第１の実施形態に係る対話システムの対話処理の説明図である。FIG. 2 is an explanatory diagram of dialogue processing of the dialogue system according to the first embodiment. 対話システムのシステム構成例の一例を示す説明図である。FIG. 2 is an explanatory diagram showing an example of a system configuration of a dialogue system. 対話システムのハードウェア構成例の一例を示すブロック図である。1 is a block diagram illustrating an example of a hardware configuration of a dialogue system; FIG. 第１の実施形態に係る対話システムの機能的構成例の一例を示す図である。FIG. 1 is a diagram illustrating an example of a functional configuration of a dialogue system according to a first embodiment. 第１の実施形態に係る対話システムの記憶デバイスに格納されるデータの一利絵を説明する図である。FIG. 2 is a diagram illustrating a picture of data stored in a storage device of the dialogue system according to the first embodiment. 掲示板型ＳＮＳの対話ログの整形方法を示す説明図である。FIG. 2 is an explanatory diagram showing a method for formatting a dialogue log of a bulletin board type SNS. コメントに対するアノテーション例を示す図である。FIG. 3 is a diagram showing an example of annotation for a comment. 第１の実施形態に係る対話システムの有害性評価モデル作成処理のフローチャートを示す図である。FIG. 3 is a diagram illustrating a flowchart of a harmfulness evaluation model creation process of the dialogue system according to the first embodiment. 第１の実施形態に係る対話システムの有害性評価モデル作成処理におけるデータフローの説明図である。FIG. 3 is an explanatory diagram of a data flow in a hazard evaluation model creation process of the dialogue system according to the first embodiment. 第１の実施形態に係る対話システムの有害カテゴリー対応対話モデル作成処理のフローチャートを示す図である。FIG. 3 is a diagram illustrating a flowchart of harmful category compatible dialog model creation processing of the dialog system according to the first embodiment. 第１の実施形態に係る対話システムの有害カテゴリー対応対話モデル作成処理におけるデータフローの説明図である。FIG. 2 is an explanatory diagram of data flow in harmful category compatible dialogue model creation processing of the dialogue system according to the first embodiment. 第１の実施形態に係る対話システムの対話処理のフローチャートを示す図である。FIG. 3 is a diagram showing a flowchart of dialogue processing of the dialogue system according to the first embodiment. 第２の実施形態に係る対話システムの機能的構成例の一例を示すブロック図である。FIG. 2 is a block diagram illustrating an example of a functional configuration of a dialogue system according to a second embodiment. 第２の実施形態に係る対話システムの対話処理のフローチャートを示す図である。FIG. 7 is a diagram showing a flowchart of interaction processing of the interaction system according to the second embodiment. 第２の実施形態に係る対話システムの有害性スコアの例を示す図である。FIG. 7 is a diagram showing an example of a harmfulness score of the dialogue system according to the second embodiment. 第２の実施形態に係る対話システムのカテゴリー別有害性スコアの閾値と総合有害性スコアを算出するための重み係数の例を示す図である。FIG. 7 is a diagram illustrating an example of a weighting coefficient for calculating a threshold of a harmfulness score by category and a comprehensive harmfulness score of the dialogue system according to the second embodiment. 第３の実施形態にかかる対話システムの機能的構成例の一例を示すブロック図である。FIG. 7 is a block diagram illustrating an example of a functional configuration of a dialogue system according to a third embodiment. 第３の実施形態にかかる対話システムの有害性評価モデル作成処理のフローチャートを示す図である。FIG. 7 is a diagram showing a flowchart of a harmfulness evaluation model creation process of the dialogue system according to the third embodiment. 第３の実施形態にかかる共起カテゴリーの登録例を示す図である。FIG. 7 is a diagram showing an example of registration of co-occurrence categories according to the third embodiment.

以下、図面を参照して本発明の実施形態を説明する。以下の記載および図面は、本発明を説明するための例示であって、説明の明確化のため、適宜、省略および簡略化がなされている。本発明は、他の種々の形態でも実施する事が可能である。特に限定しない限り、各構成要素は単数でも複数でも構わない。 Embodiments of the present invention will be described below with reference to the drawings. The following description and drawings are examples for explaining the present invention, and are omitted and simplified as appropriate for clarity of explanation. The present invention can also be implemented in various other forms. Unless specifically limited, each component may be singular or plural.

図面において示す各構成要素の位置、大きさ、形状、範囲などは、発明の理解を容易にするため、実際の位置、大きさ、形状、範囲などを表していない場合がある。このため、本発明は、必ずしも、図面に開示された位置、大きさ、形状、範囲などに限定されない。 The position, size, shape, range, etc. of each component shown in the drawings may not represent the actual position, size, shape, range, etc. in order to facilitate understanding of the invention. Therefore, the present invention is not necessarily limited to the position, size, shape, range, etc. disclosed in the drawings.

また、以下の説明では、プログラムを実行して行う処理を説明する場合があるが、プログラムは、プロセッサ（例えばＣＰＵ（Central Processing Unit）、ＧＰＵ（Graphics Processing Unit））によって実行されることで、定められた処理を、適宜に記憶資源（例えばメモリ）および／またはインターフェースデバイス（例えば通信ポート）等を用いながら行うため、処理の主体がプロセッサとされてもよい。同様に、プログラムを実行して行う処理の主体が、プロセッサを有するコントローラ、装置、システム、計算機、ノードであってもよい。プログラムを実行して行う処理の主体は、演算部であれば良く、特定の処理を行う専用回路（例えばＦＰＧＡ（Field-Programmable Gate Array）やＡＳＩＣ（Application Specific Integrated Circuit））を含んでいてもよい。 In addition, in the following explanation, processing performed by executing a program may be explained, but the program is executed by a processor (for example, a CPU (Central Processing Unit) or a GPU (Graphics Processing Unit)). The processor may be the main body of the processing in order to perform the processing using appropriate storage resources (for example, memory) and/or interface devices (for example, communication ports). Similarly, the subject of processing performed by executing a program may be a controller, device, system, computer, or node having a processor. The main body of the processing performed by executing the program may be an arithmetic unit, and may include a dedicated circuit (for example, FPGA (Field-Programmable Gate Array) or ASIC (Application Specific Integrated Circuit)) that performs specific processing. .

本実施の形態による対話装置、対話システム、対話方法は、例えば、パーソナルコンピューターの操作に対するＱＡ、保険加入に関するＱＡの他、受付ロボット、コールセンタや高齢者向けロボットに適応される。特に、受付ロボット、コールセンタや高齢者向けロボットでは、多岐にわたる入力文を含む入力データに対して安全な応答文を出力することができ、受付時やコールセンタへの問い合わせに対する適切なおもてなし、高齢者の孤独を和らげるといった価値を提供することできる。 The dialogue device, dialogue system, and dialogue method according to the present embodiment are applied to, for example, QA for personal computer operations, QA for insurance enrollment, reception robots, call centers, and robots for the elderly. In particular, reception robots, call centers, and robots for the elderly can output safe response sentences to input data that includes a wide variety of input sentences. It can provide value such as alleviating loneliness.

本発明の第１の実施形態に係る対話処理の説明図を図１に示す。対話システム１００は、ユーザから入力文を含む入力データ１１０を受け取り、応答文１１１を出力する。入力データ１１０は入力文含み、入力文は有害な発言の一例であり、投稿コメント、投稿コメントと投稿コメントに対する返信コメント等を含む。以下、入力データ１１０を入力文１１０として説明を続ける。対話システム１００は、有害カテゴリー１に分類される有害な入力文に適切な応答ができるよう調整した有害カテゴリー１対応対話モデル１０１、有害カテゴリー２に分類される有害な入力文に適切な応答ができるよう調整した有害カテゴリー２対応対話モデル１０２、有害カテゴリーＮに分類される有害な入力文に適切な応答ができるよう調整した有害カテゴリーＮ対応対話モデル１０３を備えており、入力文をそれぞれの対話モデルに入力して応答文を生成する。 FIG. 1 shows an explanatory diagram of interaction processing according to the first embodiment of the present invention. The dialogue system 100 receives input data 110 including an input sentence from a user, and outputs a response sentence 111. The input data 110 includes an input sentence, and the input sentence is an example of a harmful remark, and includes a posted comment, a posted comment, and a reply comment to the posted comment. Hereinafter, the explanation will be continued assuming that the input data 110 is the input sentence 110. The dialogue system 100 includes a dialogue model 101 compatible with harmful category 1, which is adjusted to be able to appropriately respond to harmful input sentences classified as harmful category 1, and a dialogue model 101 that is adjusted to be able to appropriately respond to harmful input sentences classified as harmful category 2. It is equipped with a dialogue model 102 compatible with harmful category 2, which has been adjusted as shown in FIG. to generate a response sentence.

ここで「有害カテゴリー」とは、投稿コメントを含む入力データ、投稿コメントと投稿コメントに対する返信コメントを含む入力データに含まれる入力文よって攻撃される対象「攻撃対象」であって、例えば、人種・民族差別や犯罪記事等である。そのため、有害カテゴリー１対応対話モデル１０１、有害カテゴリー２対応対話モデル１０２、有害カテゴリーＮ対応対話モデル１０３を、攻撃対象対応対話モデルと称することもできる。 Here, the term "harmful category" refers to an "attack target" that is attacked by input data including posted comments, and input sentences included in input data including posted comments and reply comments to posted comments, and includes, for example, racial - Articles about ethnic discrimination or crimes. Therefore, the dialogue model 101 compatible with harmful category 1, the dialogue model 102 compatible with harmful category 2, and the dialogue model 103 compatible with harmful category N can also be referred to as the dialogue model compatible with attack targets.

そして、対話システム１００は、対話モデルが生成した各応答文に対して、有害か否かを評価して有害性スコアを算出し、有害性スコアがもっとも低い応答文を選択して出力する。図１では、入力文１１０に対して、有害カテゴリー１対応対話モデル１０１が出力した応答文１０４の有害性が最も低いと判定され、応答文１１１として出力されている様子を表している。 Then, the dialogue system 100 evaluates each response sentence generated by the dialogue model to determine whether it is harmful or not, calculates a harm score, and selects and outputs the response sentence with the lowest harm score. In FIG. 1, a response sentence 104 output by the harmful category 1 compatible dialogue model 101 is determined to be the least harmful to an input sentence 110, and is output as a response sentence 111.

＜対話システム＞
図２は、実施例１にかかる対話システムのシステム構成例を示す説明図である。対話システム１００は、たとえば、クライアントサーバシステムであり、サーバ２０１と、ＰＣ２０３、スマートフォン２０４などの情報処理装置２０５と、を有する。サーバ２０１と情報処理装置２０５とは、インターネット、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）、ＷＡＮ（ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ）などのネットワーク２０２を介して通信可能である。 <Dialogue system>
FIG. 2 is an explanatory diagram showing an example of the system configuration of the dialogue system according to the first embodiment. The dialogue system 100 is, for example, a client server system, and includes a server 201 and an information processing device 205 such as a PC 203 or a smartphone 204. The server 201 and the information processing device 205 can communicate via a network 202 such as the Internet, a LAN (Local Area Network), or a WAN (Wide Area Network).

クライアントサーバシステムの場合、対話プログラムは、サーバ２０１にインストールされる。したがって、サーバ２０１は、対話システムとして、応答生成、モデル管理、応答選択、対話履歴管理処理を実行する。この場合、情報処理装置２０５は、処理対象のテキストのサーバ２０１への送信、サーバ２０１からの処理結果の受信、処理対象のテキストを入力するインタフェースとなる。 In the case of a client-server system, the interaction program is installed on the server 201. Therefore, the server 201 executes response generation, model management, response selection, and dialog history management processing as a dialog system. In this case, the information processing device 205 serves as an interface for transmitting text to be processed to the server 201, receiving processing results from the server 201, and inputting text to be processed.

一方、スタンドアロン型の場合、対話プログラムは、情報処理装置２０５にインストールされ、サーバ２０１は不要である。したがって、情報処理装置２０５は、対話装置として、テキストの入力、入力したテキストの応答生成、結果の出力を実行する。 On the other hand, in the case of a stand-alone type, the dialog program is installed on the information processing device 205, and the server 201 is not required. Therefore, the information processing device 205 functions as an interaction device, inputting text, generating a response to the input text, and outputting the results.

＜対話システムのハードウェア構成例＞
図３は、対話装置３００のハードウェア構成例を示すブロック図である。 <Example of hardware configuration of dialogue system>
FIG. 3 is a block diagram showing an example of the hardware configuration of the dialog device 300.

対話装置３００は、対話システムの場合にはサーバ２０１に相当し、スタンドアロン型の場合には情報処理装置２０５に相当する。対話装置３００は、プロセッサ３０１と、記憶デバイス３０２と、入力デバイス３０３と、出力デバイス３０４と、通信インタフェース（通信ＩＦ）３０５と、を有する。プロセッサ３０１、記憶デバイス３０２、入力デバイス３０３、出力デバイス３０４、および通信ＩＦ３０５は、バス３０６により接続される。プロセッサ３０１は、対話装置３００を制御する。記憶デバイス３０２は、プロセッサ３０１の作業エリアとなる。また、記憶デバイス３０２は、各種プログラムやデータを記憶する非一時的なまたは一時的な記録媒体である。記憶デバイス３０２としては、たとえば、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）、フラッシュメモリがある。入力デバイス３０３は、データを入力する。入力デバイス３０３としては、たとえば、キーボード、マウス、タッチパネル、テンキー、スキャナ、マイク、生体センサがある。出力デバイス３０４は、データを出力する。出力デバイス３０４としては、たとえば、ディスプレイ、プリンタ、スピーカがある。通信ＩＦ３０５は、ネットワーク２０２と接続し、データを送受信する。 The dialogue device 300 corresponds to the server 201 in the case of a dialogue system, and corresponds to the information processing device 205 in the case of a stand-alone type. The dialog device 300 includes a processor 301 , a storage device 302 , an input device 303 , an output device 304 , and a communication interface (communication IF) 305 . The processor 301, storage device 302, input device 303, output device 304, and communication IF 305 are connected by a bus 306. Processor 301 controls interaction device 300 . The storage device 302 becomes a work area for the processor 301. Furthermore, the storage device 302 is a non-temporary or temporary recording medium that stores various programs and data. Examples of the storage device 302 include ROM (Read Only Memory), RAM (Random Access Memory), HDD (Hard Disk Drive), and flash memory. Input device 303 inputs data. Examples of the input device 303 include a keyboard, a mouse, a touch panel, a numeric keypad, a scanner, a microphone, and a biosensor. Output device 304 outputs data. Examples of the output device 304 include a display, a printer, and a speaker. Communication IF 305 connects to network 202 and transmits and receives data.

＜対話システム１００の機能的構成例＞
図４Ａは、実施例１にかかる対話システム１００の機能的構成例を示すブロック図である。対話システム１００は、対話処理プログラムがインストールされたコンピュータであり、情報処置装置２０５、あるいはサーバ２０１である。対話システム１００は、対話処理プログラムにより、入力部４０１、応答生成部４０４、モデル管理部４０２、対話モデル群４０８、有害性評価モデル４０３、応答選択部４０５、対話履歴管理部４０７、出力部４０６を実現する。 <Functional configuration example of dialogue system 100>
FIG. 4A is a block diagram showing an example of the functional configuration of the dialogue system 100 according to the first embodiment. The dialogue system 100 is a computer in which a dialogue processing program is installed, and is an information processing device 205 or a server 201. The dialog system 100 uses an input section 401, a response generation section 404, a model management section 402, a dialog model group 408, a toxicity evaluation model 403, a response selection section 405, a dialog history management section 407, and an output section 406 using a dialog processing program. Realize.

図４Ｂに示すように、入力部４０１、モデル管理部４０２、応答生成部４０４、応答選択部４０５、出力部４０６、対話履歴管理部４０７は、プロセッサ３０１が記憶デバイス３０２に格納された対話処理プログラム４１０を実行することで各種機能を実現する。 As shown in FIG. 4B, an input unit 401, a model management unit 402, a response generation unit 404, a response selection unit 405, an output unit 406, and a dialog history management unit 407 are configured by a dialog processing program stored in a storage device 302 by a processor 301. By executing 410, various functions are realized.

また、対話モデル群４０８、有害性評価モデル４０３、ベース対話モデル１００２も、記憶デバイス３０２に格納される。 Furthermore, the interaction model group 408, the toxicity evaluation model 403, and the base interaction model 1002 are also stored in the storage device 302.

また、後述する有害単語リスト８０２、アノテーションデータ６００等の各種データは、記憶デバイス３０２の各種データ４１１内に格納される。 Further, various data such as a harmful word list 802 and annotation data 600, which will be described later, are stored in various data 411 of the storage device 302.

入力部４０１は、図３の入力デバイス３０３に対応し、入力文であるテキストデータを読み込むモジュールで、テキストデータの加工と応答生成部４０４への入力を行う。また、有害性評価モデル４０３と対話モデル群４０８作成時のテキストデータの登録は、すべて入力部４０１を介して行われる。 The input unit 401 corresponds to the input device 303 in FIG. 3, and is a module that reads text data, which is an input sentence, and processes the text data and inputs it to the response generation unit 404. In addition, all text data are registered through the input unit 401 when creating the toxicity evaluation model 403 and the interaction model group 408.

モデル管理部４０２は、応答生成部４０４が使用する対話モデル群４０８と、応答選択部４０５が使用する有害性評価モデル４０３を管理する。モデル管理部４０２は、応答生成部４０４の指示に従って所定のモデルにテキスト（入力文）を入力し、モデルの出力結果(応答文)を返す。また、モデル管理部４０２は、応答選択部４０５の指示に従って所定のモデルに応答文を入力し、所定のモデルにより応答文の有害性スコアを算出し、応答文の有害性を把握する。さらに、モデル管理部４０２は、各種モデルの作成と登録も行う。 The model management unit 402 manages the interaction model group 408 used by the response generation unit 404 and the toxicity evaluation model 403 used by the response selection unit 405. The model management unit 402 inputs text (input sentence) to a predetermined model according to instructions from the response generation unit 404, and returns the output result (response sentence) of the model. Furthermore, the model management unit 402 inputs the response sentence into a predetermined model according to instructions from the response selection unit 405, calculates a harmfulness score of the response sentence using the predetermined model, and grasps the harmfulness of the response sentence. Furthermore, the model management unit 402 also creates and registers various models.

応答生成部４０４は、入力部４０１から受け取ったテキスト（入力文）を、モデル管理部４０２を介して対話モデル群４０８に入力し、それぞれの有害カテゴリー対応対話モデル（以下、単に対話モデルと称する）から応答文を取得する。対話モデルは、テキストを入力するとそのテキストに対する応答文を生成する。対話モデルは、ＳＮＳなどから取得した複数往復の対話文で学習したニューラル言語モデルである。 The response generation unit 404 inputs the text (input sentence) received from the input unit 401 to the dialog model group 408 via the model management unit 402, and generates a dialog model corresponding to each harmful category (hereinafter simply referred to as a dialog model). Get the response text from. When text is input, a dialogue model generates a response sentence for that text. The dialogue model is a neural language model learned from multiple round-trip dialogues obtained from SNS and the like.

図５は、入力部４０１から入力される掲示板型ＳＮＳから取得した投稿コメントと返信コメントのテキストで、ニューラル言語モデルの学習を行って対話モデルを作成するまでの説明図である。 FIG. 5 is an explanatory diagram of the process of training a neural language model and creating a dialogue model using the texts of posted comments and reply comments acquired from the bulletin board type SNS input from the input unit 401.

５００は、入力部４０１から入力される掲示板型ＳＮＳのスレッド（投稿と一連の返信のツリー構造）を示す一例である。投稿コメント５０１に対する返信コメントが５０２と５０５である。また、返信コメント５０２に対する返信として返信コメント５０３が、さらにその返信として返信コメント５０４がある。 500 is an example of a bulletin board type SNS thread (tree structure of posts and a series of replies) input from the input unit 401. Comments 502 and 505 are replies to the posted comment 501. Further, there is a reply comment 503 as a reply to the reply comment 502, and a reply comment 504 as a reply to the reply comment 503.

５１０は、入力部４０１において、５００の投稿コメントと返信コメントをニューラル言語モデルの学習用に整形したデータである。５００は５０２～５０４と５０５～５０６の２つの枝を持ち、５１０では、それぞれの枝が５１１と５１２として整形されている。５１１は、５０１～５０４のコメントを連結したテキストで、「｜」はコメントとコメントの境界を示す特殊なトークン（単語）であり、ニューラル言語モデルはこのトークンをもとに複数のコメントで構成されるテキストからそれぞれのコメントの範囲を認識する。 510 is data in which the input unit 401 formats 500 posted comments and reply comments for neural language model learning. 500 has two branches, 502 to 504 and 505 to 506, and in 510, the branches are formatted as 511 and 512, respectively. 511 is a text that connects comments 501 to 504, and "|" is a special token (word) that indicates the boundary between comments, and the neural language model is composed of multiple comments based on this token. Recognize the range of each comment from the text.

入力文として、投稿コメントと返信コメントを含む５１０の形式で大量の対話データを用意してニューラル言語モデルを学習することで、対話モデル５２２が作成され、例えば、対話モデル５２２に入力文５２１「午後から雪が降るそうです。」を与えると、この入力に対して尤もらしい応答として出力５２３「初雪ですね。」が得られる。対話モデル群４０８の対話モデルは、後述の方法で有害発言に対して適切な応答を出力できるよう学習された対話モデルである。 A dialogue model 522 is created by preparing a large amount of dialogue data in the format 510 including posted comments and reply comments as input sentences and learning the neural language model. It looks like it's going to snow.'', the output 523 ``It's the first snow.'' is obtained as a plausible response to this input. The dialogue models in the dialogue model group 408 are dialogue models that have been trained to output appropriate responses to harmful comments using the method described below.

ニューラル言語モデルを用いた対話モデルには、例えば、ＧＰＴ-２（ＧｅｎｅｒａｔｉｖｅＰｒｅ－ｔｒａｉｎｅｄＴｒａｎｓｆｏｒｍｅｒ２）を用いて作成できる。例えば、ＳＮＳから取得したデータをＧＰＴ－２アーキテクチャで学習した対話モデルに、ＤｉａｌｏＧＰＴがある。なお、複数往復の対話履歴を含むテキストを入力としてその応答のテキストを出力することができればよく、これ以外の技術を用いて実現してもよい。例えば、別のアーキテクチャとして、ＢＡＲＴ（ＢｉｄｉｒｅｃｔｉｏｎａｌＡｕｔｏ－ＲｅｇｒｅｓｓｉｖｅＴｒａｎｓｆｏｒｍｅｒ）が挙げられる。 A dialogue model using a neural language model can be created using, for example, GPT-2 (Generative Pre-trained Transformer 2). For example, DialoGPT is an interaction model that is trained using the GPT-2 architecture using data obtained from SNS. It should be noted that it is only necessary to input a text including a history of multiple round-trip conversations and output a response text, and it may be realized using other techniques. For example, another architecture is BART (Bidirectional Auto-Regressive Transformer).

有害性評価モデル４０３は、入力部４０１から入力されたテキストが有害発言等の有害な内容か否かを判別する文分類モデルで、入力されたテキスト(入力文)に対して有害か否かの二値の分類を行う。また、分類モデルの出力をソフトマックス関数に通して算出した分類結果の確率を、有害性スコアとして出力する。 The toxicity evaluation model 403 is a sentence classification model that determines whether the text input from the input unit 401 is harmful content such as harmful remarks, and is a sentence classification model that determines whether the input text (input sentence) is harmful or not. Perform binary classification. Furthermore, the probability of the classification result calculated by passing the output of the classification model through a softmax function is output as a harmfulness score.

有害性評価モデル４０３も、ＧＰＴ-２（ＧｅｎｅｒａｔｉｖｅＰｒｅ－ｔｒａｉｎｅｄＴｒａｎｓｆｏｒｍｅｒ２）によって実現される。有害性評価モデル４０３は、後述の方法で有害な内容か安全な内容かの二値のラベルをコメント単位で付与したデータを用意し、文を入力した際にその文のラベルを予測するよう追加学習を行う。 The hazard evaluation model 403 is also realized by GPT-2 (Generative Pre-trained Transformer 2). The hazard evaluation model 403 prepares data in which a binary label of harmful content or safe content is assigned to each comment using the method described later, and adds a function that predicts the label of a sentence when the sentence is input. Learn.

応答選択部４０５は、応答生成部４０４が対話モデルによって出力した各応答文に対してモデル管理部４０２を介して有害性評価モデル４０３による評価を行う。応答選択部４０５は、有害性スコアが所定の閾値未満でかつ、もっとも低い応答文を出力部４０６と対話履歴管理部４０７に出力する。出力部４０６は、応答選択部４０５が選択した応答文を出力デバイス３０４に出力する。 The response selection unit 405 uses the toxicity evaluation model 403 to evaluate each response sentence output by the response generation unit 404 using the dialogue model via the model management unit 402. The response selection unit 405 outputs the response sentence with the lowest harmfulness score below a predetermined threshold to the output unit 406 and the dialogue history management unit 407. The output unit 406 outputs the response sentence selected by the response selection unit 405 to the output device 304.

対話履歴管理部４０７は、応答選択部４０５が選択した応答文を対話履歴として保持し、入力部４０１が入力を受け取ったときに、保持していた対話履歴を応答生成部４０４に出力する。例えば、対話履歴管理部４０７は、図５の入力される投稿コメント５２１に対し、応答選択部４０５により選択された応答文５２３との対応を管理する。 The dialogue history management unit 407 holds the response sentence selected by the response selection unit 405 as a dialogue history, and outputs the held dialogue history to the response generation unit 404 when the input unit 401 receives an input. For example, the dialogue history management unit 407 manages the correspondence between the input posted comment 521 in FIG. 5 and the response sentence 523 selected by the response selection unit 405.

＜有害性評価モデル作成処理＞
有害性評価モデル作成処理は、実施例１にかかる対話システム１００において、モデル管理部４０２が、対話処理に必要な有害性評価モデルの作成、および、有害カテゴリー対応対話モデルの作成処理に使用するカテゴリー別有害性評価モデルの作成を行う処理である。モデル管理部４０２は、掲示板型ＳＮＳから取得した投稿コメントとそれに対する返信コメントに対し、ユーザを介して有害性有無のアノテーションを行い、これを使用して有害性評価モデルの学習を行う。有害性評価モデルを作成する処理のフローチャートを図７に、また、この処理におけるデータフローを図８に示す。アノテーションを行うデータの例を図６に示す。 <Hazard assessment model creation process>
In the hazard evaluation model creation process, in the dialogue system 100 according to the first embodiment, the model management unit 402 creates a hazard evaluation model necessary for dialogue processing, and creates a hazard category corresponding dialogue model. This is the process of creating another hazard assessment model. The model management unit 402 annotates posted comments acquired from the bulletin board type SNS and reply comments thereto as to whether or not they are harmful via the user, and uses this annotation to learn a harmfulness evaluation model. FIG. 7 shows a flowchart of the process of creating a hazard evaluation model, and FIG. 8 shows the data flow in this process. FIG. 6 shows an example of data to be annotated.

図８に示すように、モデル管理部４０２は、まず、ＳＮＳのダンプデータ８０１から有害発言を含むスレッドを抽出し、有害性評価モデルの学習用にデータの整形を行う（ステップＳ７０１）。ダンプデータ８０１は、一定期間のＳＮＳのログをまとめたデータである。ダンプデータが取得できない場合は、ＳＮＳのクローリングによってデータを、入力部４０１から取得する。 As shown in FIG. 8, the model management unit 402 first extracts threads containing harmful comments from the SNS dump data 801, and formats the data for learning the harmfulness evaluation model (step S701). Dump data 801 is data that summarizes SNS logs for a certain period of time. If the dump data cannot be acquired, the data is acquired from the input unit 401 by crawling the SNS.

有害発言を含むスレッドの抽出は、有害発言関連用語リスト８０２の単語を含むスレッドをスクリーニングする。有害単語リスト８０２は誹謗中傷等の有害発言で頻出する単語のリストであり、記憶デバイス３０２の各種データ４１１として格納されている。単語リストは、Ｗｅｂ上で公開されている禁止用語のリストなどを利用すればよく、また、ユーザが独自に作成してもよい。ステップＳ７０１のデータ整形は、図５で例示した形式５００のデータを形式５１０のデータに整形する入力部４０１の処理である。 To extract threads containing harmful remarks, threads containing words in the harmful speech-related term list 802 are screened. The harmful word list 802 is a list of words that frequently appear in harmful comments such as slander, and is stored as various data 411 in the storage device 302. The word list may be created using a list of prohibited words published on the Web, or may be created independently by the user. The data formatting in step S701 is a process performed by the input unit 401 to format data in format 500 illustrated in FIG. 5 into data in format 510.

次にモデル管理部４０２は、ステップＳ７０１で整形されたデータ８０３に対して有害性有無に関するデータを付与するアノテーションを行う。アノテーションは人手で行うため、モデル管理部４０２がアノテーション対象のデータを出力デバイス３０４に出力し、入力デバイス３０３を通してユーザの入力を受け付ける。 Next, the model management unit 402 performs annotation on the data 803 formatted in step S701 to add data regarding the presence or absence of harmfulness. Since the annotation is performed manually, the model management unit 402 outputs the data to be annotated to the output device 304 and receives user input through the input device 303.

モデル管理部４０２は、整形済データ８０３から一つのデータを取り出す（ステップＳ７０２）。ここで、データとは、図５の５１１、５１２で例示した投稿コメントとその返信コメントを結合したデータである。このように投稿コメントとその返信コメントのアノテーションを同時に実施することで、コメント単体ではなく前のコメントも踏まえた有害性をアノテーションする。例えば、有害なコメントに同意するコメントも有害とする。 The model management unit 402 extracts one piece of data from the formatted data 803 (step S702). Here, the data is data that is a combination of posted comments and their reply comments exemplified by 511 and 512 in FIG. In this way, by simultaneously annotating posted comments and their reply comments, the harmfulness is annotated not only based on the comment alone but also on previous comments. For example, comments that agree with harmful comments are also considered harmful.

有害性は、予め定義されたカテゴリーから選択する。「有害性のカテゴリー」は、投稿コメントを含む入力データ、投稿コメントと投稿コメントに対する返信コメント等の入力文を含む入力データによって攻撃される対象「攻撃対象」であって、たとえば、人種・民族、ジェンダー、宗教、容姿、健康（障がい）、政治・社会経済などである。また、たとえば、特定の人種の特定の性別のように複数のカテゴリーにまたがった有害発言もあるため、有害性のラベルは複数カテゴリーを指定できるようにしておく。このようにして、一つのデータに含まれるすべてのコメントに対してアノテーションを行う（ステップＳ７０３）。 Hazards are selected from predefined categories. "Harmful categories" are "attack targets" that are attacked by input data including posted comments, posted comments and input data such as reply comments to posted comments, and include, for example, race/ethnicity. , gender, religion, appearance, health (disability), politics/socioeconomics, etc. Furthermore, since there are some harmful comments that fall into multiple categories, such as those related to a specific race or gender, the harmfulness label should be able to specify multiple categories. In this way, all comments included in one data are annotated (step S703).

アノテーションデータの例を図６に示す。図６は、アノテーションデータ６００を示し、投稿コメント６０１とその返信コメント６０２～６０４に対し、有害性のカテゴリーとしてラベルが付与されている。前述の通り、モデル管理部４０２は一連の返信コメントをまとめて出力デバイス３０４に出力し、ユーザに対してアノテーションを実行させることで、文脈を考慮したラベルを付与できるようにする。６０５～６０８はアノテーションによって付与されたラベルである。アノテーションデータ６００は、コメントに対する攻撃対象を特定するカテゴリーをラベルとして付与されたデータである。 An example of annotation data is shown in FIG. FIG. 6 shows annotation data 600, in which a posted comment 601 and its reply comments 602 to 604 are labeled as harmful categories. As described above, the model management unit 402 collectively outputs a series of reply comments to the output device 304 and allows the user to perform annotation, thereby making it possible to assign a label in consideration of the context. 605 to 608 are labels given by annotation. The annotation data 600 is data that is labeled with a category that specifies an attack target for a comment.

コメント６０１「あの国は犯罪者が多いですね。」は、人種・民族差別的なコメントであるため、「人種・民族」のラベルを付与する。コメント６０２「そんなことはない。」は、コメント６０１に反対するコメントであり有害性はないため、「有害性なし」のラベルが付与する。コメント６０３「いや、その通りだと思う。」は、コメント単体では有害性はないが、コメント６０１に同意する人種・民族差別的なコメントと判断できるため「人種・民族」のラベルを付与する。コメント６０４「あの国の男は怠け者の犯罪者だ。」は、人種・民族差別とジェンダー差別の二つの有害性を含むため、６０８の通り、「人種・民族」と「ジェンダー」の二つのラベルを付与する。 Comment 601 "There are a lot of criminals in that country." is a racially/ethnically discriminatory comment, so it is assigned a "racial/ethnic" label. Comment 602 "That's not true." is a comment that opposes comment 601 and is not harmful, so it is given a label of "not harmful." Comment 603 "No, I think that's right." is not harmful on its own, but it can be judged as a racial/ethnic discriminatory comment that agrees with comment 601, so it is given the label "racial/ethnic". do. Comment 604, ``Men from that country are lazy criminals.'' contains the two harmful effects of racial/ethnic discrimination and gender discrimination. Assign one label.

ユーザが一つのデータのアノテーションを完了し、ユーザから入力デバイス３０３を通して登録指示を受け付けると、モデル管理部４０２は、当該データを登録するとともに、このデータを含めて、これまでアノテーションされたデータについてカテゴリーごとのデータ数を集計する。 When a user completes the annotation of one piece of data and receives a registration instruction from the user through the input device 303, the model management unit 402 registers the data and assigns a category to the data that has been annotated so far, including this data. Aggregate the number of data for each.

モデル管理部４０２は、アノテーションされたデータ数が、あらかじめ設定されたデータ数に満たないカテゴリーがある場合、ステップＳ７０２に戻り、次のデータを取り出してアノテーションを続ける。このように、カテゴリーごとに十分な数のデータが登録できるまでアノテーションを行うことで、有害性評価モデルにカテゴリー間の精度差が出ることを防ぐ。 If there is a category in which the number of annotated data is less than the preset number of data, the model management unit 402 returns to step S702, extracts the next data, and continues annotation. In this way, annotation is performed until a sufficient number of data have been registered for each category, thereby preventing accuracy differences between categories in the hazard assessment model.

すべてのカテゴリーについて所定の数のデータが登録されると（ステップＳ７０４：ＹＥＳ）、モデル管理部４０２は、アノテーション済データ８０４を使用して有害性評価モデル４０３の学習を行う（ステップＳ７０５）。各データは、カテゴリーを含めた有害性がアノテーションされているが、このステップでは、モデル管理部４０２は、カテゴリーを分けずに有害か否かの二値分類を行う有害性評価モデルを作成する。したがって、モデル管理部４０２は、アノテーションされたデータに対して、正解ラベルを有害か否かの二値に変換して有害性評価モデル４０３の学習を行う。 When a predetermined number of data are registered for all categories (step S704: YES), the model management unit 402 uses the annotated data 804 to train the hazard evaluation model 403 (step S705). Each piece of data is annotated with hazards including categories, but in this step, the model management unit 402 creates a hazard evaluation model that performs binary classification of whether or not it is harmful without dividing the data into categories. Therefore, the model management unit 402 trains the toxicity evaluation model 403 by converting the correct label into a binary value indicating whether or not it is harmful for the annotated data.

次に、モデル管理部４０２は、カテゴリーごとの有害性評価モデル４０３を作成するため、アノテーション済データ８０４をカテゴリーごとに分類する（ステップＳ７０６）。データの分類後、モデル管理部４０２は、そのうちの一つのカテゴリーに着目し（ステップＳ７０７）、当該カテゴリーのデータ８０５を使用して有害性評価モデルの学習を行う（ステップＳ７０８）。そして、モデル管理部４０２は、未処理のカテゴリーがあれば（ステップＳ７０９：ＹＥＳ）、ステップＳ７０７に戻り、当該カテゴリーの有害性評価モデルの学習を行う。このようにしてすべてのカテゴリーの有害性評価モデルの学習が終わると（ステップＳ７０９）、有害性評価モデル作成処理が完了する。 Next, the model management unit 402 classifies the annotated data 804 for each category in order to create a hazard evaluation model 403 for each category (step S706). After classifying the data, the model management unit 402 focuses on one of the categories (step S707), and uses the data 805 of the category to train a hazard evaluation model (step S708). If there is an unprocessed category (step S709: YES), the model management unit 402 returns to step S707 and learns the toxicity evaluation model for the category. When learning of the hazard evaluation models for all categories is completed in this manner (step S709), the hazard evaluation model creation process is completed.

＜有害カテゴリー対応対話モデル作成処理＞
有害カテゴリー対応対話モデル（対話モデル）作成処理は、有害発言に対して適切な応答ができる対話モデルを、攻撃対象、即ち有害カテゴリーごとに作成する処理である。モデル管理部４０２が、有害性評価モデルを用いて、入力部４０１から入力されたＳＮＳのダンプデータから有害発言に対して適切な応答を行っているスレッドを抽出し、このようなスレッドを使用してベース対話モデルの追加学習を行う。 <Harmful category compatible dialogue model creation process>
The harmful category compatible dialogue model (dialogue model) creation process is a process of creating a dialogue model that can appropriately respond to harmful comments for each target of attack, that is, for each harmful category. The model management unit 402 uses the toxicity evaluation model to extract threads that are responding appropriately to harmful comments from the SNS dump data input from the input unit 401, and uses such threads. Additional learning of the base interaction model is performed.

実施例１にかかる対話システム１００において各種有害発言に対応した対話モデルの作成処理のフローチャートを図９に、また、この処理におけるデータフローを図１０に示す。 FIG. 9 shows a flowchart of a process for creating a dialogue model corresponding to various harmful comments in the dialogue system 100 according to the first embodiment, and FIG. 10 shows a data flow in this process.

モデル管理部４０２は、まず、有害カテゴリー対応対話モデルのもとになるベース対話モデル１００２を、ネットワーク２０２を介して外部装置から読み込み、記憶デバイス３０２に格納する（ステップＳ９０１）。ベース対話モデル１００２は、ＳＮＳ等から取得した大量の対話ログを用いて学習されたニューラル言語モデルで、第三者が公開しているニューラル対話モデルを流用する。 The model management unit 402 first reads the base interaction model 1002, which is the basis of the harmful category corresponding interaction model, from an external device via the network 202 and stores it in the storage device 302 (step S901). The base dialogue model 1002 is a neural language model learned using a large amount of dialogue logs obtained from SNS etc., and utilizes a neural dialogue model published by a third party.

なお、使用する対話モデルは、モデル管理部４０２が扱うアーキテクチャで、対話システム１００の対話処理で扱う言語と同じ言語の学習データ、または当該言語を一定量含む多言語データで学習されたモデルとする。学習済みの対話モデルがない場合、対話システム１００によって、ＳＮＳのダンプデータの取得からデータ整形、言語モデルの学習を行って作成する。 Note that the dialogue model to be used is a model that is trained using training data of the same language as the language handled by the dialogue processing of the dialogue system 100, or multilingual data containing a certain amount of the language, in an architecture handled by the model management unit 402. . If there is no trained dialogue model, the dialogue system 100 creates one by acquiring SNS dump data, shaping the data, and learning a language model.

次に、モデル管理部４０２は、有害性評価モデル作成処理の過程で入力部４０１が作成した整形済データ８０３を読み込む（ステップＳ９０２）。そして、モデル管理部４０２は、有害性評価モデル作成処理で作成した有害性評価モデル４０３を使用して、整形したデータに対して有害性の自動アノテーションを行う（ステップＳ９０３）。有害性評価モデル４０３による自動アノテーションは、有害性評価モデル作成処理のアノテーションと同様にコメント単位で行われる。なお、ここでの自動アノテーションは、有害か否かの二値である。 Next, the model management unit 402 reads the formatted data 803 created by the input unit 401 in the process of creating a hazard evaluation model (step S902). Then, the model management unit 402 performs automatic annotation of toxicity on the formatted data using the toxicity evaluation model 403 created in the toxicity evaluation model creation process (step S903). Automatic annotation by the hazard evaluation model 403 is performed on a comment-by-comment basis, similar to the annotation in the hazard evaluation model creation process. Note that the automatic annotation here has a binary value of whether it is harmful or not.

次にモデル管理部４０２は、有害なコメントを含み最後の返信コメントが有害ではないデータ１００１を、有害性評価モデル４０３を用いて抽出する（ステップＳ９０４）。このスクリーニングによって、有害発言に対して適切な応答を行っている対話データを収集する。 Next, the model management unit 402 extracts data 1001 that includes a harmful comment and whose last reply comment is not harmful, using the harmfulness evaluation model 403 (step S904). Through this screening, dialogue data showing appropriate responses to harmful comments is collected.

次にモデル管理部４０２は、有害性評価モデル作成処理で作成したカテゴリー別の有害性評価モデルを使用して、ステップＳ９０４で抽出したデータ１００１を有害性のカテゴリー別に分類する（ステップＳ９０５）。そして、次のステップにて、モデル管理部４０２は、有害カテゴリー対応対話モデルの追加学習を行っていく。 Next, the model management unit 402 classifies the data 1001 extracted in step S904 into categories of toxicity using the category-specific toxicity evaluation model created in the hazard evaluation model creation process (step S905). Then, in the next step, the model management unit 402 performs additional learning of the interaction model corresponding to the harmful category.

まず、モデル管理部４０２は、有害カテゴリーの一つに着目する（ステップＳ９０６）。そして、モデル管理部４０２は、ステップＳ９０１で読み込んだベース対話モデル１００２に対して、ステップＳ９０４で抽出したデータを使用した追加学習を行い、汎用有害性対応対話モデル１００３を作成する（ステップＳ９０７）。作成された汎用有害性対応対話モデル１００３は記憶デバイス３０２に格納される。 First, the model management unit 402 focuses on one of the harmful categories (step S906). Then, the model management unit 402 performs additional learning on the base dialogue model 1002 read in step S901 using the data extracted in step S904, and creates a general-purpose hazard response dialogue model 1003 (step S907). The created general-purpose toxicity response interaction model 1003 is stored in the storage device 302.

そして、モデル管理部４０２は、ステップＳ９０５で分類したカテゴリー別のデータから着目しているカテゴリーのデータを選択する。モデル管理部４０２は、さらに、ステップＳ９０７で追加学習を行った汎用有害性対応対話モデル１００３に対して選択したデータを使用した追加学習を行い、当該カテゴリーのカテゴリー対応対話モデルを作成する。 Then, the model management unit 402 selects data of the category of interest from the data classified by category in step S905. The model management unit 402 further performs additional learning using the selected data on the general-purpose toxicity response interaction model 1003 that underwent additional learning in step S907, and creates a category response interaction model for the category.

未処理のカテゴリーがある場合（ステップＳ９０９：ＹＥＳ）、ステップＳ９０６に戻り、未処理のカテゴリーの対話モデルを作成する。このようにしてすべてのカテゴリーの対話モデルを作成し（ステップＳ９０９：ＮＯ）、処理を終了する。このように、カテゴリー別データによる追加学習の前に、全カテゴリーのデータで追加学習を行うことで、有害発話全般に対する汎用性と特定の有害カテゴリーに対する専用性の両方の性質を獲得する効果がある。 If there is an unprocessed category (step S909: YES), the process returns to step S906 and an interaction model for the unprocessed category is created. In this way, interaction models for all categories are created (step S909: NO), and the process ends. In this way, performing additional learning using data from all categories before performing additional learning using categorical data has the effect of acquiring both generality for harmful utterances in general and specialization for specific harmful categories. .

＜対話処理＞
実施例１にかかる対話システム１００における対話処理のフローチャートを図１１に示す。対話処理では、有害性評価モデル作成処理で作成した有害性評価モデル４０３と、有害カテゴリー対応対話モデル作成処理で作成した、有害カテゴリー別の複数の対話モデルを使用して、有害入力に対して適切な応答を出力する処理である。 <Interaction processing>
FIG. 11 shows a flowchart of dialogue processing in the dialogue system 100 according to the first embodiment. Dialogue processing uses the hazard evaluation model 403 created in the hazard evaluation model creation process and the multiple dialogue models for each harmful category created in the harm category compatible dialogue model creation process to determine the appropriate response to the harmful input. This is a process that outputs a response.

対話処理は、入力デバイス３０３を通してユーザからテキストが入力されることで開始される。まず、入力部４０１が入力データを読み込み、入力データに含まれる入力文を、応答生成部４０４に出力する（ステップＳ１１０１）。ここで応答生成部４０４は、対話履歴管理部４０７が対話履歴を保持していた場合、対話履歴と入力文を結合する。次に、応答生成部４０４が、モデル管理部４０２を介して、入力文を各対話モデルに入力する（ステップＳ１１０２）。尚、対話履歴管理部４０７に対話履歴を保持していない場合には、入力文のみを各対話モデルに入力する。モデル管理部４０２は、すべての対話モデルから入力文に対する応答文を取得し、応答生成部４０４へ出力する。 The interaction process is started when a user inputs text through the input device 303. First, the input unit 401 reads input data and outputs an input sentence included in the input data to the response generation unit 404 (step S1101). Here, the response generation unit 404 combines the dialogue history and the input sentence if the dialogue history management unit 407 holds the dialogue history. Next, the response generation unit 404 inputs the input sentence to each interaction model via the model management unit 402 (step S1102). Note that if the dialogue history management unit 407 does not hold the dialogue history, only the input sentence is input to each dialogue model. The model management unit 402 acquires response sentences for input sentences from all interaction models and outputs them to the response generation unit 404.

応答生成部４０４は、モデル管理部４０２から受け取った応答文を応答選択部４０５に入力し、応答選択部４０５がこれらの応答文の有害性を評価する（ステップＳ１１０３）。応答選択部４０５は、モデル管理部４０２を介して、有害性評価モデル４０３に各応答文を入力し、応答文ごとに有害性スコアを得る。有害性スコアは、有害性評価モデル４０３が出力をソフトマックス関数に通して算出した分類結果の確率で０～１の値を取る。有害性スコアが１に近いほど、有害である確率が高いことを示す。 The response generation unit 404 inputs the response sentences received from the model management unit 402 to the response selection unit 405, and the response selection unit 405 evaluates the harmfulness of these response sentences (step S1103). The response selection unit 405 inputs each response sentence into the toxicity evaluation model 403 via the model management unit 402, and obtains a toxicity score for each response sentence. The harm score is the probability of the classification result calculated by the harm evaluation model 403 by passing the output through a softmax function, and takes a value between 0 and 1. The closer the harm score is to 1, the higher the probability that it is harmful.

次に応答選択部４０５は、算出した有害性スコアをもとに応答選択部４０５が出力した複数の応答文から最も適切な一つの応答文を選択する。応答選択部４０５は、まず、有害性スコアが閾値未満、つまり、安全な応答文があるかを判定する（ステップＳ１１０４）。有害性スコアが閾値未満の応答文があった場合（ステップＳ１１０４：ＹＥＳ）、応答選択部４０５は、その中から有害性スコアがもっとも低い応答文を選択する（ステップＳ１１０５）。なお、閾値未満の応答文が一つであった場合、この処理は省略される。 Next, the response selection unit 405 selects the most appropriate one response sentence from the plurality of response sentences output by the response selection unit 405 based on the calculated harmfulness score. The response selection unit 405 first determines whether there is a response sentence whose toxicity score is less than a threshold value, that is, whether there is a safe response sentence (step S1104). If there is a response sentence with a toxicity score below the threshold (step S1104: YES), the response selection unit 405 selects the response sentence with the lowest toxicity score from among them (step S1105). Note that if there is only one response sentence that is less than the threshold, this process is omitted.

有害性スコアが閾値未満の応答文がなかった場合（ステップＳ１１０４：ＮＯ）、つまり、すべての応答文を有害であると判定された場合、応答選択部４０５は、例外処理として、予め設定しておいた定型の応答文を出力する（ステップＳ１１０６）。定型の応答文とは、たとえば、「その発言には同意できません。」、あるいは「わかりません。」など、その有害入力文に対して同意しないことを示す応答である。このように処理することで、いずれの対話モデルからも適切な応答が得られなかった場合でも、有害な応答を出力することを防ぐ。 If there is no response sentence with a harmfulness score below the threshold (step S1104: NO), that is, if all the response sentences are determined to be harmful, the response selection unit 405 performs a preset response as an exception process. The fixed format response sentence is output (step S1106). The standard response sentence is, for example, a response indicating that the user does not agree with the harmful input sentence, such as "I don't agree with that statement." or "I don't understand." By processing in this manner, even if no appropriate response is obtained from any interaction model, harmful responses are prevented from being output.

そして、応答選択部４０５は、選択した応答文を出力部４０６に出力するとともに、入力文と選択した応答文を対話履歴として対話履歴管理部４０７に出力する（ステップＳ１１０７）。対話履歴管理部４０７は、応答選択部４０５が出力した入力文と応答文を対話履歴に追加する。対話履歴管理部４０７は、次の対話処理が開始された際に、対話履歴を応答生成部４０４に出力する。 Then, the response selection unit 405 outputs the selected response sentence to the output unit 406, and also outputs the input sentence and the selected response sentence to the conversation history management unit 407 as a conversation history (step S1107). The dialogue history management unit 407 adds the input sentence and response sentence output by the response selection unit 405 to the dialogue history. The dialogue history management unit 407 outputs the dialogue history to the response generation unit 404 when the next dialogue process is started.

以上説明した実施例１によれば、さまざまな種類の有害発言に対して適切な応答を返すことができる対話システムを提供することができる。 According to the first embodiment described above, it is possible to provide a dialogue system that can return appropriate responses to various types of harmful comments.

実施例１によれば、さまざまなカテゴリーに分類される有害発言に対して、カテゴリーごとに生成された対話モデルが適切な応答を生成し、応答選択部が各対話モデルにより生成した応答から最も適切な応答を選択することができる。換言すると、さまざまなカテゴリーの有害入力文に対して、適切な応答を返すことができる。 According to the first embodiment, the dialogue models generated for each category generate appropriate responses to harmful comments classified into various categories, and the response selection unit selects the most appropriate response from among the responses generated by each dialogue model. You can choose the appropriate response. In other words, it is possible to return appropriate responses to various categories of harmful input sentences.

本実施例では、有害性評価に有害カテゴリー別の有害性評価モデルを使用し、有害カテゴリーごとに有害性スコアの閾値を設定することで、有害カテゴリーごとの重要度を指定できるよう構成した対話システムについて説明する。 In this example, a hazard evaluation model for each hazard category is used for hazard evaluation, and a hazard score threshold is set for each hazard category, thereby making it possible to specify the importance of each hazard category. I will explain about it.

図１２は、本実施例にかかる対話システムの機能的構成例を示すブロック図である。有害性評価モデルが有害性評価モデル群１２０１として複数モデル構成となっている以外は、実施例１にかかる対話システムと同じ構成である。また、有害性評価モデル群を構成するカテゴリー別の有害性評価モデルは、図７で説明した有害性評価モデル作成処理のフローチャートにおけるステップＳ７０８で作成されたカテゴリー別の有害性評価モデルである。 FIG. 12 is a block diagram showing an example of the functional configuration of the dialogue system according to this embodiment. The configuration is the same as that of the dialog system according to the first embodiment, except that the hazard evaluation model is a multiple model configuration as the hazard evaluation model group 1201. Moreover, the hazard evaluation model by category that constitutes the group of hazard evaluation models is the hazard evaluation model by category created in step S708 in the flowchart of the hazard evaluation model creation process described in FIG.

＜対話処理＞
実施例２にかかる対話処理のフローチャートを図１３に示す。ステップＳ１３０１～ステップＳ１３０２までは、図１１のステップＳ１１０１～ステップＳ１１０２までの処理と同様である。ステップＳ１３０３にて、応答生成部４０４は、モデル管理部４０２から受け取った応答文を応答選択部４０５に入力し、応答選択部４０５がこれらの応答文の有害性を評価する。応答選択部４０５は、モデル管理部４０２を介して、有害性評価モデル群１２０１に各応答文を入力し、応答文ごとにカテゴリー別有害性評価モデルから有害性スコアを得る。 <Interaction processing>
FIG. 13 shows a flowchart of the interaction process according to the second embodiment. Steps S1301 to S1302 are the same as steps S1101 to S1102 in FIG. 11. In step S1303, the response generation unit 404 inputs the response sentences received from the model management unit 402 to the response selection unit 405, and the response selection unit 405 evaluates the harmfulness of these response sentences. The response selection unit 405 inputs each response sentence to the toxicity evaluation model group 1201 via the model management unit 402, and obtains a toxicity score from the category-based toxicity evaluation model for each response sentence.

図１４は、対話システムにおいて、応答選択部４０５が参照する有害性スコアテーブル１４００であり、記憶デバイス３０２の各種データ４１１内に格納されている。有害性スコアテーブル１４００は、複数の応答文の有害性スコアを管理する。図１４では、３つの応答文のカテゴリーごとの有害性スコアの例を示す。応答文１４０１～１４０３について、カテゴリーセット１４０５それぞれのスコアが記載されている。１４０６は、後述の方法で算出された総合有害性スコアである。 FIG. 14 shows a harmfulness score table 1400 that is referred to by the response selection unit 405 in the dialog system, and is stored in various data 411 of the storage device 302. The harmfulness score table 1400 manages harmfulness scores of a plurality of response sentences. FIG. 14 shows examples of harmfulness scores for each category of three response sentences. For response sentences 1401 to 1403, scores for each category set 1405 are listed. 1406 is a total toxicity score calculated by the method described below.

応答選択部４０５は、次に、有害性スコアが閾値未満の応答文があるかを判定する（ステップＳ１３０４）。応答選択部４０５は、カテゴリー別有害性評価モデルごとに得られたすべての有害性スコアが閾値未満かを判定する。つまり、応答選択部４０５は、攻撃対象ごとに有害性スコアテーブル１４００の有害性スコアと閾値、重み係数管理テーブル１５００の閾値１５０１を比較することで、応答文の有害性を判断する。 The response selection unit 405 next determines whether there is a response sentence whose toxicity score is less than the threshold (step S1304). The response selection unit 405 determines whether all the toxicity scores obtained for each categorical hazard evaluation model are less than the threshold. That is, the response selection unit 405 determines the harmfulness of the response sentence by comparing the harmfulness score and threshold value of the harmfulness score table 1400 and the threshold value 1501 of the weighting coefficient management table 1500 for each attack target.

図１５は、閾値、重み係数管理テーブル１５００であり、記憶デバイス３０２の各種データ４１１内に格納される。閾値、重み係数管理テーブル１５００は、カテゴリーごとの閾値１５０１と後述の総合有害性スコアの算出に用いる重み係数１５０２の例を示す。図１５では、ジェンダーのみ閾値が０．５でありそのほかのカテゴリーの０．６に対して低く設定されているが、これはジェンダーの有害性を重視し、確率が低くても有害と判定する設定としていることを意味する。 FIG. 15 shows a threshold value and weighting coefficient management table 1500, which is stored in various data 411 of the storage device 302. A threshold and weighting coefficient management table 1500 shows examples of thresholds 1501 for each category and weighting coefficients 1502 used for calculating a comprehensive toxicity score, which will be described later. In Figure 15, the threshold value for gender is 0.5, which is set lower than 0.6 for other categories, but this is a setting that emphasizes the harmfulness of gender and judges it as harmful even if the probability is low. It means that.

図１４に示す応答文の例では、応答文１４０１が、人種・民族とジェンダーの有害性スコアがともに０．５５である。図１５において、人種・民族の閾値は０．６であるため閾値未満となるが、ジェンダーの閾値は０．５となっているため閾値以上となる。したがって、応答選択部４０５は応答文１４０１を有害と判定する。一方、応答文１４０２と応答文１４０３はいずれの有害性スコアも閾値未満であり、応答選択部４０５はこの二つの応答文を安全と判定する。 In the example of the response sentence shown in FIG. 14, the response sentence 1401 has a harmfulness score of 0.55 for both race/ethnicity and gender. In FIG. 15, the threshold value for race/ethnicity is 0.6, which is less than the threshold value, but the threshold value for gender is 0.5, so it is greater than the threshold value. Therefore, response selection unit 405 determines response sentence 1401 to be harmful. On the other hand, both response sentences 1402 and 1403 have toxicity scores below the threshold, and the response selection unit 405 determines these two response sentences as safe.

有害性スコアが閾値未満の応答文１４０２と１４０３があるため（ステップＳ１３０４：ＹＥＳ）、応答選択部４０５は、それぞれの応答文に対して、重み係数１５０２を用いて総合有害性スコア１４０６を算出する（ステップＳ１３０５）。総合有害性スコアは、カテゴリー別有害性スコアにそれぞれ指定された重み係数を乗算したときの最大値である。応答文１４０２のカテゴリーごとの有害性スコアそれぞれに重み係数を乗算すると、ジェンダーは有害性スコア０．４に重み係数１．２を乗算した０．４８が最大となり、この値が応答文１４０２の総合有害性スコアとなる。同様の方法で計算すると、応答文１４０３の総合有害性スコアは０．４となる。 Since there are response sentences 1402 and 1403 whose toxicity score is less than the threshold (step S1304: YES), the response selection unit 405 calculates the overall toxicity score 1406 using the weighting coefficient 1502 for each response sentence. (Step S1305). The overall toxicity score is the maximum value obtained by multiplying the categorical toxicity scores by the respective specified weighting coefficients. When each harmfulness score for each category of the response sentence 1402 is multiplied by a weighting coefficient, the maximum gender for gender is 0.48, which is the product of the harmfulness score of 0.4 and the weighting coefficient of 1.2, and this value is the overall value of the response sentence 1402. This is the toxicity score. When calculated in a similar manner, the overall toxicity score of response sentence 1403 is 0.4.

そして、応答選択部４０５は、総合有害性スコアが最も低い応答を選択する（ステップS１３０６）。図１４の例では、応答文１４０３の総合有害性スコア０．４が最低となるため、応答文１４０３が最終的な出力として選択される。そして、応答選択部４０５は、選択した応答文を出力部４０６と対話履歴管理部４０７に出力する（ステップS１３０８）。 Then, the response selection unit 405 selects the response with the lowest overall toxicity score (step S1306). In the example of FIG. 14, the response sentence 1403 has the lowest overall toxicity score of 0.4, so the response sentence 1403 is selected as the final output. Then, the response selection unit 405 outputs the selected response sentence to the output unit 406 and the dialogue history management unit 407 (step S1308).

以上説明した実施例２によれば、すべての有害カテゴリーを同等に扱うか、または、特定のカテゴリーについて有害性の判断における重要度を変えるかを任意に設定できる対話システムを提供することができる。 According to the second embodiment described above, it is possible to provide an interaction system in which it is possible to arbitrarily set whether to treat all harmful categories equally or to change the importance of a specific category in determining the harmfulness.

本実施例では、対話モデル群の構成を任意に変更することで、対応する有害カテゴリーの変更や、使用する対話モデルの数を削減することで限られたリソースでの動作を可能にする対話システムについて説明する。 In this example, by arbitrarily changing the configuration of a group of dialogue models, the corresponding harmful category can be changed, and the number of dialogue models to be used can be reduced, thereby making it possible to operate with limited resources. I will explain about it.

図１６は、本実施例にかかる対話システムの機能的構成例を示すブロック図である。実施例３では、対話モデル群１６００が、標準モデル群１６０１、混合モデル群１６０２、詳細モデル群１６０３で構成される。 FIG. 16 is a block diagram showing an example of the functional configuration of the dialogue system according to this embodiment. In the third embodiment, the interaction model group 1600 is composed of a standard model group 1601, a mixed model group 1602, and a detailed model group 1603.

標準モデル群１６０１は、実施例１にかかる対話システムの対話モデル群４０８と同様の構成で、実施例１と実施例２で説明した有害カテゴリー別対応対話モデルで構成される。 The standard model group 1601 has the same configuration as the dialog model group 408 of the dialog system according to the first embodiment, and is composed of the corresponding dialog models for each harmful category described in the first and second embodiments.

混合モデル群１６０２は、複数の有害カテゴリーを混合した対話モデルで、例えば、一つの対話モデルで人種・民族とジェンダーに対応するよう構成した対話モデルである。 The mixed model group 1602 is a dialogue model that is a mixture of a plurality of harmful categories, and is, for example, a dialogue model configured to correspond to race/ethnicity and gender in one dialogue model.

詳細モデル群１６０３は、標準モデル群１６０１の有害カテゴリーを細分化したカテゴリー分類で作成した対話モデルである。例えば、人種・民族は、黒人、白人、アメリカ先住民、ジェンダーは、男性、女性、ＬＧＰＴなどに細分化できる。対話モデル群１６００以外は、実施例１にかかる対話システムと同じ構成である。 The detailed model group 1603 is an interaction model created by subdividing the harmful categories of the standard model group 1601 into categories. For example, race/ethnicity can be subdivided into black, white, Native American, and gender can be subdivided into male, female, LGPT, etc. The configuration other than the dialog model group 1600 is the same as that of the dialog system according to the first embodiment.

本実施例では、標準モデル群１６０１、混合モデル群１６０２、詳細モデル群１６０３を使い分けることにより、特に重視したい有害カテゴリーへの対応やそのためのハードウェアリソースの節約を行う。 In this embodiment, by properly using the standard model group 1601, mixed model group 1602, and detailed model group 1603, it is possible to deal with harmful categories that are particularly important and to save hardware resources for this purpose.

＜有害性評価モデル作成処理＞
図１７に実施例３にかかる対話システムのモデル管理部４０２による有害性評価モデル作成処理のフローチャートを示す。なお、図１７では図７で説明した有害性評価モデル作成処理に対する変更部分のみ図示している。図１７のフローチャートは図７におけるステップＳ７０２～７０４にあたり、これ以降の動作は図７と同様である。 <Hazard assessment model creation process>
FIG. 17 shows a flowchart of the hazard evaluation model creation process by the model management unit 402 of the dialog system according to the third embodiment. Note that FIG. 17 shows only the changes to the hazard evaluation model creation process described in FIG. 7. The flowchart in FIG. 17 corresponds to steps S702 to S704 in FIG. 7, and the operations thereafter are the same as those in FIG.

モデル管理部４０２は、有害性評価モデルの学習用に整形したデータから一つのデータに着目する（ステップＳ１７０１）。 The model management unit 402 focuses on one piece of data from the data formatted for learning the hazard evaluation model (step S1701).

そして、モデル管理部４０２がアノテーション対象のデータを出力デバイス３０４に出力し、入力デバイス３０３を通してユーザの入力を受け付ける（ステップＳ１７０２）。このとき、メインカテゴリーとサブカテゴリーの両方を登録する。メインカテゴリーは実施例１で用いたカテゴリーで、サブカテゴリーは前述した人種・民族に対する黒人、白人、アメリカ先住民などのカテゴリーである。メインカテゴリーとサブカテゴリーはあらかじめ定義しておき、アノテーション時にユーザが選択できるようにしておく。 Then, the model management unit 402 outputs the data to be annotated to the output device 304, and receives user input through the input device 303 (step S1702). At this time, both the main category and subcategory are registered. The main category is the category used in Example 1, and the subcategories are the aforementioned racial/ethnic categories such as black, white, and Native American. The main category and subcategories are defined in advance so that the user can select them during annotation.

登録されたデータに複数のメインカテゴリーが含まれていた場合（ステップＳ１７０３：ＹＥＳ）、モデル管理部４０２は、カテゴリーの組み合わせを共起カテゴリーとして登録する（ステップＳ１７０４）。図６に示したデータ６０８の例では、人種・民族とジェンダーの二つのカテゴリーが含まれており、この条件に該当する。なお、図６の例では、一つのコメント（コメント６０４）に対して二つのカテゴリーが付与されている例だが、ステップＳ１７０３では、一連のコメント列であるデータ単位で複数のカテゴリーが含まれるか否かを条件とする。 If the registered data includes a plurality of main categories (step S1703: YES), the model management unit 402 registers the combination of categories as a co-occurring category (step S1704). The example of data 608 shown in FIG. 6 includes two categories, race/ethnicity and gender, and falls under this condition. Note that in the example of FIG. 6, two categories are assigned to one comment (comment 604), but in step S1703, it is determined whether or not a data unit, which is a series of comment strings, includes multiple categories. The condition is that

図１８に共起カテゴリーテーブル１８００の登録例を示す。共起カテゴリーテーブル１８００は、記憶デバイス３０２の各種データ４１１内に格納される。共起カテゴリーテーブル１８００は、共起カテゴリーのパターン１８０１とその頻度１８０２との対応関係を管理する。図６の例である人種・民族とジェンダーは１８０３のパターンに一致するため、モデル管理部４０２により、このパターンの頻度が加算される。なお、当該パターンが登録されていなかった場合は、パターンの登録を行う。ステップＳ１７０５の処理に移り、全カテゴリーのデータ数が充足されている場合（ステップＳ１７０５：ＹＥＳ）、ステップＳ１７０６の処理に移り、充足されていない場合（ステップＳ１７０５：ＮＯ）はステップＳ１７０１に戻り、次のデータのアノテーションを行う。 FIG. 18 shows a registration example of the co-occurrence category table 1800. The co-occurrence category table 1800 is stored in the various data 411 of the storage device 302. The co-occurrence category table 1800 manages the correspondence between co-occurrence category patterns 1801 and their frequencies 1802. Since race/ethnicity and gender in the example of FIG. 6 match the pattern 1803, the model management unit 402 adds the frequency of this pattern. Note that if the pattern has not been registered, the pattern is registered. The process moves to step S1705, and if the number of data for all categories is satisfied (step S1705: YES), the process moves to step S1706, and if it is not satisfied (step S1705: NO), the process returns to step S1701 and the next Annotate the data.

ステップＳ１７０６にて、モデル管理部４０２は、任意の閾値を超える高頻度の共起カテゴリーを混合カテゴリーとして登録する。図１８の例では、モデル管理部４０２は、例えば、共起カテゴリー１８０３、１８０４、１８０５を混合カテゴリーセットとして登録する。次にモデル管理部４０２は、ステップＳ１７０６で登録した混合カテゴリーセットに含まれないカテゴリーがあるかを確認する（ステップＳ１７０７）。 In step S1706, the model management unit 402 registers co-occurrence categories with a high frequency exceeding an arbitrary threshold as mixed categories. In the example of FIG. 18, the model management unit 402 registers co-occurrence categories 1803, 1804, and 1805 as a mixed category set, for example. Next, the model management unit 402 checks whether there are any categories that are not included in the mixed category set registered in step S1706 (step S1707).

全カテゴリーを、人種・民族、ジェンダー、宗教、容姿、健康（障がい）、政治・社会経済として、登録した混合カテゴリーを「人種・民族、ジェンダー」、「容姿、ジェンダー」、「人種・民族、政治・社会経済」とすると、「健康（障がい）」が混合カテゴリーセットに含まれていない。混合カテゴリーセットに含まれないカテゴリーがある場合（ステップＳ１７０７：ＹＥＳ）、モデル管理部４０２は、作成した混合カテゴリーセットに当該カテゴリーを追加する（ステップＳ１７０８）。これにより、「人種・民族、ジェンダー」、「容姿、ジェンダー」、「人種・民族、政治・社会経済」、「健康（障がい）」が混合カテゴリーセットとなる。 All categories are classified as race/ethnicity, gender, religion, appearance, health (disability), politics/socioeconomics, and registered mixed categories are classified as "race/ethnicity, gender," "appearance, gender," and "race/ethnicity." ``Ethnicity, Politics/Socioeconomics'', ``Health (Disability)'' is not included in the mixed category set. If there is a category that is not included in the mixed category set (step S1707: YES), the model management unit 402 adds the category to the created mixed category set (step S1708). As a result, "race/ethnicity, gender," "appearance, gender," "race/ethnicity, politics/socioeconomics," and "health (disability)" become a mixed category set.

これ以降の処理は、図７のステップＳ７０５以降と同様で、モデル管理部４０２は、有害性評価モデルの作成とカテゴリー別有害性評価モデルの作成を行って処理を終了する。なお、各有害性評価モデルは、メインカテゴリー、サブカテゴリー、混合カテゴリーそれぞれで作成する。 The subsequent processing is similar to step S705 and subsequent steps in FIG. 7, and the model management unit 402 creates a hazard evaluation model and a category-specific hazard evaluation model, and then ends the process. Each hazard assessment model is created for each main category, subcategory, and mixed category.

＜有害カテゴリー対応対話モデル作成処理＞
実施例３にかかる対話システムの有害カテゴリー対応対話モデル作成処理は、作成する対話モデルがメインカテゴリー、サブカテゴリー、混合カテゴリーとなるのみで作成処理自体は同一である。メインカテゴリーの対話モデルが標準モデル群１６０１、サブカテゴリーの対話モデルが詳細モデル群１６０３、混合カテゴリーの対話モデルが、混合モデル群１６０２である。 <Harmful category compatible dialogue model creation process>
The dialogue model creation process corresponding to harmful categories in the dialogue system according to the third embodiment is the same as the creation process itself except that the dialogue models to be created are main category, subcategory, and mixed category. Main category interaction models are standard model group 1601, subcategory interaction models are detailed model group 1603, and mixed category interaction models are mixed model group 1602.

＜対話処理＞
実施例３にかかる対話システムの対話処理では、起動時に使用する対話モデルを、入力部４０１から指定ユーザによって行われる。対話モデルが指定されるとモデル管理部４０２は、当該指定された対話モデルが標準モデル群１６０１の場合、実施例１にかかる対話システムの対話処理と同等を実行する。例えばユーザが対話モデルとして、混合モデル群１６０２を指定した場合、標準モデル群１６０１を使用した場合と比較して応答の質は低下するが、使用する対話モデルの数が削減されることで動作に必要なハードウェアリソースを低減することができる。 <Interaction processing>
In the dialog processing of the dialog system according to the third embodiment, a dialog model to be used at startup is input by a designated user from the input unit 401. When a dialogue model is designated, the model management unit 402 executes the same dialogue processing as the dialogue system according to the first embodiment, if the designated dialogue model is the standard model group 1601. For example, if the user specifies the mixed model group 1602 as the interaction model, the quality of the response will be lower than when the standard model group 1601 is used, but the reduction in the number of interaction models used will improve the operation. Necessary hardware resources can be reduced.

具体的には、ユーザから入力を受け付けてから応答が出力されるまでの時間が削減される、また、クライアントサーバ方式で運用する場合、一度に対応できるクライアント数を増やすことができる、などの効果が期待できる。一方、詳細モデル群１６０３を使用する場合は、処理に必要なハードウェアリソースは増えるものの、サブカテゴリーに応じたより適切な応答を生成することができる。 Specifically, the time from receiving input from the user to outputting the response is reduced, and when operating in a client-server format, the number of clients that can be handled at once can be increased. can be expected. On the other hand, when using the detailed model group 1603, although the hardware resources required for processing increase, it is possible to generate a more appropriate response according to the subcategory.

また、必ずしも、標準モデル群１６０１、混合モデル群１６０２、詳細モデル群１６０３の括りに限定せず、異なるモデル群の対話モデルを組み合わせた運用も可能である。例えば、ハードウェアリソースの問題で混合モデル群１６０２による少数モデル構成とする必要がある一方、「女性差別」についてはより適切な応答を出力したいというニーズがあった場合、混合モデル群１６０２と詳細モデル群１６０３に含まれる「女性」カテゴリー対応モデルを併用すればよい。 Furthermore, the present invention is not necessarily limited to the standard model group 1601, the mixed model group 1602, and the detailed model group 1603, and it is also possible to combine interaction models from different model groups. For example, if it is necessary to configure a small number of models using the mixed model group 1602 due to hardware resource issues, but there is a need to output a more appropriate response for "discrimination against women," then the mixed model group 1602 and detailed model A model corresponding to the "female" category included in group 1603 may be used together.

以上説明した実施例３によれば、対話システムを動作させるハードウェアリソースの都合や特に重視したい有害カテゴリーがある場合に対応して、対話モデルの構成を任意に変更できる対話システムを提供することができる。 According to the third embodiment described above, it is possible to provide a dialogue system in which the configuration of the dialogue model can be arbitrarily changed depending on the availability of hardware resources for operating the dialogue system or when there is a harmful category that should be particularly emphasized. can.

なお、本発明は上記した実施例に限定されるものではなく、様々な変形例が含まれる。例えば、上記した実施例は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、ある実施例の構成の一部を他の実施例の構成に置き換えることが可能であり、また、ある実施例の構成に他の実施例の構成を加えることも可能である。また、各実施例の構成の一部について、他の構成の追加・削除・置換をすることが可能である。 Note that the present invention is not limited to the embodiments described above, and includes various modifications. For example, the embodiments described above are described in detail to explain the present invention in an easy-to-understand manner, and the present invention is not necessarily limited to having all the configurations described. Furthermore, it is possible to replace a part of the configuration of one embodiment with the configuration of another embodiment, and it is also possible to add the configuration of another embodiment to the configuration of one embodiment. Furthermore, it is possible to add, delete, or replace some of the configurations of each embodiment with other configurations.

また、上記の各構成、機能、処理部、処理ステップ等は、それらの一部又は全部を、例えば集積回路で設計する等によりハードウェアで実現してもよい。 Moreover, each of the above-mentioned configurations, functions, processing units, processing steps, etc. may be realized in part or in whole by hardware, for example, by designing an integrated circuit.

また、制御線や情報線は説明上必要と考えられるものを示しており、製品上必ずしも全ての制御線や情報線を示しているとは限らない。実際には殆ど全ての構成が相互に接続されていると考えてもよい。 Further, the control lines and information lines are shown to be necessary for explanation purposes, and not all control lines and information lines are necessarily shown in the product. In reality, almost all components may be considered to be interconnected.

１００対話システム
１０１有害カテゴリー１対応対話モデル
１１０入力文
１１１応答文
４０１入力部
４０２モデル管理部
４０３有害性評価モデル
４０４応答生成部
４０５応答選択部
４０６出力部
４０７対話履歴管理部
４０８対話モデル群 100 Dialogue system 101 Harmful category 1 compatible dialogue model 110 Input sentence 111 Response sentence 401 Input section 402 Model management section 403 Hazard evaluation model 404 Response generation section 405 Response selection section 406 Output section 407 Dialogue history management section 408 Dialogue model group

Claims

An interaction system comprising an information processing device and a server connected to the information processing device via a network,
The server is
an input unit that receives input data from the information processing device;
For each attack target using input data, multiple interaction models are trained to output safe response sentences to harmful input data, and each generates a response sentence in response to input data,
A dialogue system comprising: a response selection unit that selects and outputs the most appropriate response sentence from a plurality of response sentences generated by the plurality of dialogue models based on a predetermined criterion.

The dialogue system according to claim 1,
The response selection section includes:
Calculating the toxicity score of each of the plurality of response sentences output by the plurality of dialogue models using a toxicity evaluation model that determines whether the response sentence is harmful;
A dialogue system that selects a response sentence whose toxicity score is less than a predetermined threshold and the lowest.

The dialogue system according to claim 1,
Each of the plurality of interaction models is
A dialogue system that learns from harmful input data and safe response sentences for each input data attack target.

The dialogue system according to claim 3,
The server includes a model management unit that creates the plurality of interaction models,
The model management department includes:
Executing annotation on the input data input from the input unit using a hazard evaluation model,
Extracting input data that includes a safe response sentence to harmful input data from the input data input from the input section,
The extracted input data is classified according to the attack target, and
Using each classified input data, the base interaction model is additionally trained as a general-purpose hazard response interaction model,
A dialogue system that generates an attack target compatible dialogue model for each attack target by additionally learning the general-purpose harmfulness compatible dialogue model using input data classified into the attack target.

The dialogue system according to claim 1,
The response selection section includes:
Calculating the harm score of the response sentences output by the plurality of dialogue models using a plurality of harm evaluation models that determine whether each attack target based on input data is harmful;
A dialogue system that selects a response sentence whose calculated harm score is less than a predetermined threshold and the lowest.

The dialogue system according to claim 5,
The server is
a toxicity score table for managing toxicity scores for each attack target of the plurality of response sentences;
It has a storage device that stores thresholds and weighting coefficient management tables for managing harmfulness score thresholds and weighting coefficients for each attack target,
The response selection section includes:
Determine the harmfulness of the response sentence by comparing the harmfulness score of the harmfulness score table with the threshold value and the threshold value of the weighting coefficient management table for each attack target,
A dialogue system that selects the optimal response sentence from among safe response sentences based on the weighting coefficient for each attack target.

The dialogue system according to claim 4,
The plurality of interaction models are:
A dialogue system that classifies attack targets based on input data into categories, and learns from harmful input data classified into a mixed category that is a mixture of a plurality of the categories and safe responses to the input data.

The dialogue system according to claim 7,
The input section is
Specifying an attack target response interaction model to be used from among the interaction models included in the plurality of interaction models,
The model management unit is a dialogue system that generates a response sentence to input data using a designated attack target correspondence dialogue model.

An interaction device having an input device that receives input data and an output device that outputs a response sentence,
A plurality of interaction models each of which is trained to output a safe response sentence to harmful input data for each attack target using the input data input by the input device, and each generates a response sentence in response to the input data. An interaction device comprising: a response selection unit that selects a response sentence from a plurality of response sentences based on a predetermined criterion and outputs the selected response sentence to the output device.

A dialogue method using a dialogue system comprising an input part and an output part,
The plurality of dialogue models of the dialogue system are:
It has been trained to output safe responses to harmful input data,
Each of the plurality of interaction models generates a response sentence to the input data for each attack target based on the input data, and
A dialogue method that selects and outputs a response sentence based on a predetermined criterion from response sentences of each of the plurality of dialogue models.